BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|254781108|ref|YP_003065521.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] (398 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done >gi|254781108|ref|YP_003065521.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040785|gb|ACT57581.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 398 Score = 261 bits (666), Expect = 2e-67, Method: Composition-based stats. Identities = 398/398 (100%), Positives = 398/398 (100%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT Sbjct: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL Sbjct: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL Sbjct: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT Sbjct: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG Sbjct: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC Sbjct: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 TDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR Sbjct: 361 TDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 >gi|190893432|ref|YP_001979974.1| hypothetical protein RHECIAT_CH0003859 [Rhizobium etli CIAT 652] gi|190698711|gb|ACE92796.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 410 Score = 187 bits (474), Expect = 2e-45, Method: Composition-based stats. Identities = 94/404 (23%), Positives = 157/404 (38%), Gaps = 27/404 (6%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTAI+ V AI + ++ + Q+Q A + + + + Sbjct: 24 MTAILAPVLLGAAGMAIQVGDMLISKQQLQEAA--DSAALATATALANGTIQTSQAEAFA 81 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 Q+ +L+ G I+ G Q T N Y Y++ L Sbjct: 82 RNFVAGQMANYLQSGVDIKSATGVTVQ------TNTSGNSTSYQVTVSPSYDLTVNPL-- 133 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + +LS T I S +IS+ + LD S SM + N+ + S Y Sbjct: 134 MQAVGFTTQHLSTSGTTIGGHSQTQGSISMYLALDKSGSMGED-TATVNEEDPTESYTYD 192 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 K T + + A KI+ L +AGNL + A + VR G Sbjct: 193 CNGHYNKKGKWIYDTCTG---SRANYYTKIEALKMAAGNLFGQLSSA--DPNAQYVRTGA 247 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL-------YNEKE 293 ++Y IV + L+ + V + +N L TN+ AM AY L + ++ Sbjct: 248 VSY--DIVQYTPSALAWGTSGVSTYVNALQAGGGTNSSGAMSTAYSSLTAKNAAGNDAED 305 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT--LQICEYMRNAGMKIYSVAVSAPP 351 ++H KK+++F+TDG+N+ S+ + +T C+ ++ G++IY++A AP Sbjct: 306 AAHKLKTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPE 365 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 GQ LL C +F +LL +F I K Q R+ Sbjct: 366 GGQALLHYCASDDSHYFQAEKMEDLLAAFKAIGAKASSQLTRLT 409 >gi|150397936|ref|YP_001328403.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150029451|gb|ABR61568.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 419 Score = 185 bits (470), Expect = 7e-45, Method: Composition-based stats. Identities = 96/415 (23%), Positives = 166/415 (40%), Gaps = 39/415 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTA++ + ++D+A+++ +NQ+Q A DAA L+ +++VSD D KD Sbjct: 25 MTALVAPLLLAVGGVSVDVANMLMTKNQLQDATDAAALAAASALVSDA-RPDIEEAKDLA 83 Query: 61 STIFKKQIKKHLK------------------QGSYIRENAGDIAQKAQINITKDKNNPLQ 102 K Q + +I+IT N Sbjct: 84 RKFLKTQAAAATASDLPDEGPSIGARGGGNADDEVPATPRWEDVNATEIDITATPNGAKG 143 Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 + + + + L+ + RST S+N A+S+ +VLD S SM Sbjct: 144 KSFQVTVANKHLLQFNAMTRLLGPESIEIETRSTAESATESKN-ALSMYLVLDRSGSMAW 202 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + P ++ WSK P KID L + G+L+ Sbjct: 203 KTNTINTGKAK--------CPNYTEANWSKYPDLKATGPC---YVTKIDALKTAVGDLLA 251 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMH 282 + A + ++ VR G I+Y + + LS ++ L T + A Sbjct: 252 QLVTA--DPESAYVRTGAISY--NSAQDAASSLSWGTRGAAGYVDALVAIGGTASGNAFK 307 Query: 283 HAYRELYNEKESS-HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 A++++ N E S H K+++F+TDGEN+ A+ + T Q C+ + + ++ Sbjct: 308 TAFQKVTNAAEDSEHGAKNGQVPTKYIVFMTDGENNHAN---DDTVTRQWCDTAKASKVQ 364 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 IYSVA AP GQ LL+ C SS +F ++ +L+ +F I ++ R+ Sbjct: 365 IYSVAFMAPDRGQKLLKSCASSSSHYFEAEEASDLVAAFKAIGERAAASVSRLTK 419 >gi|83955719|ref|ZP_00964299.1| hypothetical protein NAS141_07930 [Sulfitobacter sp. NAS-14.1] gi|83840013|gb|EAP79189.1| hypothetical protein NAS141_07930 [Sulfitobacter sp. NAS-14.1] Length = 480 Score = 184 bits (467), Expect = 2e-44, Method: Composition-based stats. Identities = 61/471 (12%), Positives = 116/471 (24%), Gaps = 111/471 (23%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + +I + +D R+++Q+ D AVL+ + K Sbjct: 42 LALFMIMMMIAVGGIQLDFMRHEMERSRLQAVSDRAVLAAA-------DLDQMRDPKTVV 94 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 F K + A + T+ + Sbjct: 95 EDYFAKSGMTEFLSN------------------VVVDDGLNFRTVTVDASKNMDTQFI-- 134 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 G + S + +++ VLD+S SM N + Sbjct: 135 -GRFGFPTLEVPAHSQAEERVAKVEISL----VLDISGSMATNNRLGEVQNAADIFLDTV 189 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV-----LIESAGNLVNSIQKAIQEKKNLS 235 L + S+ A +++V Sbjct: 190 LKDENQDLISVSLVPYSEQVNAGPLIMDRMNVNRKHDYSHCIDFDNGDFDSIAMNSSTRY 249 Query: 236 VRIGTIAYNIG-------------IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMH 282 ++ +N + TP S N +K++++ L P T+ + M Sbjct: 250 NQMQHFQWNYDGRNNYRDDTVCPRYDYERITPFSQNKRTLKNQIDDLVPRAGTSIFLGMK 309 Query: 283 HAYRELYNEKE-----------------SSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 A L + + + K VI +TDG N + ++ Sbjct: 310 WAAAMLDPAFRDINNSLVNAGYVDREFYNRPASYTDSETLKTVILMTDGANDNSYRIRSN 369 Query: 326 LNTLQ-------------------------------------------ICEYMRNAGMKI 342 IC+ + + I Sbjct: 370 YYDSDSEYVHWNKYNLWWYLRREVDSRYWGYFYYHKYNKTLGNTLLSNICDAAKAKRIVI 429 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +S+ E ++ C S FF V EL E+F I +I + + Sbjct: 430 WSIGFEVDDEDVPAMQDCASSPSHFFRVEGV-ELSEAFRAIARQINQLRLT 479 >gi|83941160|ref|ZP_00953622.1| hypothetical protein EE36_02988 [Sulfitobacter sp. EE-36] gi|83846980|gb|EAP84855.1| hypothetical protein EE36_02988 [Sulfitobacter sp. EE-36] Length = 480 Score = 183 bits (465), Expect = 3e-44, Method: Composition-based stats. Identities = 62/472 (13%), Positives = 118/472 (25%), Gaps = 112/472 (23%) Query: 1 MTAIIISVCF-LFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQ 59 + A+ + + +D R+++Q+ D AVL+ + K Sbjct: 41 ILALFMIMMMIAVGGIQLDFMRHEMERSRLQAVSDRAVLAAA-------DLDQMRDPKTV 93 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 F K + A ++ T+ + Sbjct: 94 VEDYFAKSGMTEFLSN------------------VVVDDGLNFRTVTVDASKDMDTQFI- 134 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 G + S + +++ VLD+S SM + Sbjct: 135 --GRFGFPTLEVPAHSQAEERVAKVEISL----VLDISGSMATNNRLGEVQDAADIFLDT 188 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV-----LIESAGNLVNSIQKAIQEKKNL 234 +L + S+ A +++V Sbjct: 189 VLKDENEDLISVSLVPYSEQVNAGPLIMDRMNVNRKHDYSHCIDFDNGDFDSIAMNSSTR 248 Query: 235 SVRIGTIAYNIG-------------IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 ++ +N + TP S N +K++++ L P T+ + M Sbjct: 249 YNQMQHFQWNYDGRNNYRDDTVCPRYDYERITPFSQNKRTLKNQIDDLVPRAGTSIFLGM 308 Query: 282 HHAYRELYNEKE-----------------SSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 A L + + + K VI +TDG N + N Sbjct: 309 KWAAAMLDPAFRDINNSLVNAGHVDREFYNRPASYTDSETLKTVILMTDGANDNSFRISN 368 Query: 325 TLNTLQ-------------------------------------------ICEYMRNAGMK 341 T IC+ + + Sbjct: 369 TYYNEDSEYVHWNRYNLWWYLRREVNSRYWGYFYYQKYNKSLGNTLLSNICDAAKAKRIV 428 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 I+S+ E ++ C S FF V EL E+F I +I + + Sbjct: 429 IWSIGFEVDDEDVPAMQDCASSPSHFFRVEGV-ELSEAFRAIARQINQLRLT 479 >gi|15966595|ref|NP_386948.1| hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|307300370|ref|ZP_07580150.1| TadE family protein [Sinorhizobium meliloti BL225C] gi|307319653|ref|ZP_07599079.1| TadE family protein [Sinorhizobium meliloti AK83] gi|15075867|emb|CAC47421.1| Hypothetical protein SMc04059 [Sinorhizobium meliloti 1021] gi|306894775|gb|EFN25535.1| TadE family protein [Sinorhizobium meliloti AK83] gi|306904536|gb|EFN35120.1| TadE family protein [Sinorhizobium meliloti BL225C] Length = 410 Score = 182 bits (462), Expect = 6e-44, Method: Composition-based stats. Identities = 87/420 (20%), Positives = 161/420 (38%), Gaps = 47/420 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTA+I + ++D+A+++ +NQ+Q A A + + D ++ Sbjct: 14 MTALIAPLLLAVGGVSVDVANMLMTKNQLQDAT-----DAAALAAASALVSDARPDIEEA 68 Query: 61 STIFKKQIKKHLKQGSYI---------------RENAGDIAQKAQINITKDKNNPLQYIA 105 I +K +K + S + D +++ I + N Sbjct: 69 KAIARKFLKTQMAATSSADVPGEAVGTMAAAGSTAPSWDDVNTSEVVIVETPNGTKGKSF 128 Query: 106 ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL 165 + + + + L+ L RST S+N AIS+ +VLD S SM + Sbjct: 129 QVSVANKHLLQFNAMTRLLGKESIELETRSTADSATESKN-AISMYLVLDRSGSMA--WK 185 Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP-ANRKIDVLIESAGNLVNSI 224 D + P+ W+ + A +P KI L + L + Sbjct: 186 TDTVDTSR-----------PRCINWTASNWGESNVRATSPCYVDKITTLKSAVDKLFTPL 234 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 K + N +R G +YN ++ + L+ + + L+ T++ A A Sbjct: 235 AKM--DPGNEYLRAGAASYNDR--QDRASKLTWGTKNASAHVQGLDATGGTDSSSAFAAA 290 Query: 285 YRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN-------TLQICEYMR 336 EL + + +H +K+++F+TDGEN+ + + + T C + Sbjct: 291 VEELLLDGENEAHLAKNGQTPEKYIVFMTDGENTSYNGKTSPRDLEKADSVTKAACTTAK 350 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 N G+ I++VA AP G+DLL+ C S + +D+ L+ F+KI K R+ Sbjct: 351 NNGIAIFTVAFMAPQRGKDLLKACATSPDHYKEADDAAALVSEFEKIGQKAAAMIARLTK 410 >gi|241206334|ref|YP_002977430.1| hypothetical protein Rleg_3648 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240860224|gb|ACS57891.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 400 Score = 180 bits (457), Expect = 3e-43, Method: Composition-based stats. Identities = 92/396 (23%), Positives = 152/396 (38%), Gaps = 22/396 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTAI++ V F AI + ++ + Q+Q A + + + + Sbjct: 25 MTAIVLPVLFGAAGMAIQVGDLLLSKQQLQEAA--DSAALATATALANGTIQTSQAEAFA 82 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 Q+ +L+ G I+ G + T Y Y I L Sbjct: 83 RDFVAGQMANYLQSGIDIKSTTGVDVR------TTTSGKSTSYQVTVSPDYNIAVNPL-- 134 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 I N+S ST S ++S+ +VLD S SM + N ++ Y Sbjct: 135 MQTIGFTTQNISTSSTTTSGNSQTQGSVSMFLVLDRSGSMGED-TATVNASDPTEEYNY- 192 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 +K + T K KI+ L + G L + + + VR G Sbjct: 193 --DCSEKDRYGNVTKKKTCTDTRPHYYTKIEALKLAVGTLTGELDAV--DPEKEYVRTGA 248 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK-ESSHNTI 299 ++Y I + L V +NKL + T++ A AY +L + + +H Sbjct: 249 VSY--NIEMQKAKALDWGTAHVTKYVNKLTATDGTDSGEAFKTAYNKLADAAEDKAHVDK 306 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 K+++F+TDG+N+ SA T C+ R+A M++Y++A AP GQ LL Sbjct: 307 TGQVPTKYIVFMTDGDNNYTSADT---ETKTWCDKARDAKMQVYTIAFMAPARGQALLSY 363 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 C + G +F D LL++F +I K Q R+ Sbjct: 364 CATAPGNYFPAGDMTALLKAFKEIGMKASNQVTRLT 399 >gi|307945905|ref|ZP_07661241.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] gi|307771778|gb|EFO31003.1| putative von Willebrand factor type A [Roseibium sp. TrichSKD4] Length = 432 Score = 180 bits (455), Expect = 4e-43, Method: Composition-based stats. Identities = 82/397 (20%), Positives = 147/397 (37%), Gaps = 60/397 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALD-AAVLSGCASIVSDRTIKDPTTKKDQ 59 + I+I + +T ID++ R ++Q+A D AAV +G A + + TI Sbjct: 85 LFGILIMLLLAVVTIGIDMSQTFGERTRLQTAADMAAVQTGRALLAEEITIAQANAYAKD 144 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT-KDKNNPLQYIAESKAQYEIPTEN- 117 + GS G + K + IT N Y+ + +IP Sbjct: 145 AFNRIASGLSAS-GDGSSGTSIFGTMTVKPAVQITETVDGNTTNYVVKVNGTAKIPASPL 203 Query: 118 --LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 +F G +L S ++ ++S+ +VLD S SM Sbjct: 204 SFMFFDGETGKNTISLGFESET-TAKAEAGASLSMALVLDRSGSMGWER----------- 251 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 ++ L ++ +L+ +Q + + Sbjct: 252 -------------------------------PSRMSELKKAVRSLIKELQTV--DPDDQF 278 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE-S 294 R+G AY+ G + L+ N N V+S +N L T PA+ A +L E + Sbjct: 279 TRLGAYAYHWYYAGKKE--LTWNKNSVRSWVNSLPASGGTRAAPAIQKAKNDLLTNSELN 336 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 +H F++++TDG + + + C +NAG+ IY+VA AP G+ Sbjct: 337 AHINKNEQEPDLFILYMTDGIDGDPNW------AKRECTSAKNAGITIYTVAFKAPASGR 390 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 +LL+ C S ++ ++ EL + F I + + Sbjct: 391 NLLKACATSDAHYYDAKNANELNKVFKDIARETTKSI 427 >gi|209550922|ref|YP_002282839.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536678|gb|ACI56613.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 411 Score = 179 bits (454), Expect = 6e-43, Method: Composition-based stats. Identities = 90/404 (22%), Positives = 155/404 (38%), Gaps = 26/404 (6%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTAI+ V AI + +M + Q+Q A A++ + + + T + Q Sbjct: 24 MTAIMAPVLLGVAGVAIQVGDMMLSKQQLQEAA------DSAALATATALANGTIQTSQA 77 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + +Y++ + + T Y Y++ L Sbjct: 78 EAFAQNFVAGQMA--NYVQSGVDFKSGTSVNVQTSTSGKSTSYQVTVSPSYDLTVNPL-- 133 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + +LS T + S +IS+ + LD S SM + N ++ S Y Sbjct: 134 MQAVGFKTQHLSTSGTTVGGHSQTQGSISMFLALDKSGSMGE-ATATVNADDPTESYTYD 192 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 K + KI+ L +AGNL + A + VR G Sbjct: 193 CNLHYNSKNNKWV--YDKCTGSRTNYYTKIEALKIAAGNLFGQLNSA--DPNAEYVRTGA 248 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL-------YNEKE 293 ++Y I + L+ V S +N L TN+ AM AY L + ++ Sbjct: 249 VSY--DINQYTPSNLAWGTAGVTSYVNALQANGGTNSSGAMSTAYSSLTAKNAAGNDAED 306 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT--LQICEYMRNAGMKIYSVAVSAPP 351 S+H KK+++F+TDG+N+ S+ + +T C+ ++ G++IY++A AP Sbjct: 307 SAHKLKTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPA 366 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 GQ LL C +F +LL +F I K Q R+ Sbjct: 367 GGQTLLHYCASDDSHYFQAEKMEDLLAAFKAIGAKASAQMTRLT 410 >gi|163760496|ref|ZP_02167578.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] gi|162282447|gb|EDQ32736.1| hypothetical protein HPDFL43_04296 [Hoeflea phototrophica DFL-43] Length = 363 Score = 179 bits (454), Expect = 6e-43, Method: Composition-based stats. Identities = 80/397 (20%), Positives = 159/397 (40%), Gaps = 58/397 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A + V F+ + A+D + M ++ ++Q+A+D+A L+ A + + + + Sbjct: 24 IAAAAVPVLFMAGSLAVDTTNAMSMKVRLQNAVDSAALATAARLSEEENLTAAQAQA-FA 82 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 Q+K+ + +NI + + E + Sbjct: 83 LKFVNGQVKEDFGA-------FNGFSVTPTVNIDPVETGGRTVW-KVAVSMEGSQSLTPM 134 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ +S+ A S+ +VLD S SM+ + Sbjct: 135 ARIMGKDKLTVSVVGKSESA-GEAQGAFSMALVLDRSGSMDWNLNGQK------------ 181 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 KI+VL + G L+ ++A + + VR+G Sbjct: 182 ----------------------------KINVLKTAVGGLIEQFEEA--DPERKYVRLGA 211 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS-HNTI 299 +YN + G T L N + K ++ L T++ A AY + +++E++ H+ Sbjct: 212 SSYNSKLTG--STKLRWNPGKTKEFVDALPASGGTDSTDAFDWAYTAVTHKRENNTHDAK 269 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 KKF++F+TDG+N+ +SA + T +C+ ++ G+++Y+VA +AP G+ LL Sbjct: 270 SGQVPKKFIVFMTDGDNNYSSADSS---TKHLCDDAKDDGIEVYTVAFAAPNRGKQLLSY 326 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 C + FF +S +L+E+F I + R+ Sbjct: 327 CASTEEHFFDAQNSAQLIEAFKNIGYAASKVVSRLTE 363 >gi|327189644|gb|EGE56794.1| hypothetical protein RHECNPAF_570041 [Rhizobium etli CNPAF512] Length = 415 Score = 179 bits (453), Expect = 7e-43, Method: Composition-based stats. Identities = 89/407 (21%), Positives = 159/407 (39%), Gaps = 28/407 (6%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTAI+ V AI + ++ + Q+Q A + + + + Sbjct: 24 MTAILAPVLLGAAGLAIQVGDMLLSKQQLQEAA--DSAALATATALGNGTIQTSQAEAFA 81 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 Q+ +L+ G I+ Q + Y Y++ L Sbjct: 82 RNFVAGQMANYLQNGVDIKNATAVNVQTSN------SGKSASYQVTVTPSYDLTVNPLMQ 135 Query: 121 KGLIPS---ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 + + ++ ++ S+ ++S+ + LD S SM D + D T Sbjct: 136 AVGFSTQHLSTSSTTVSGPSQTPGSNSQGSVSMFLALDKSGSMGDPTETVNKDQPTETFT 195 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 P KK W +T + KI+ L +AGNL + A + VR Sbjct: 196 YDCNPHLNKKGKWVYDT----CTGSRTNYYTKIEALKMAAGNLFGQLTSA--DPDAQYVR 249 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL-------YN 290 G ++Y+I + L+ + V S +N L TN+ AM AY L + Sbjct: 250 TGAVSYDIDQY--TPSTLAWGTSGVSSYVNALQAGGGTNSSGAMGTAYSSLTAKNAAGND 307 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT--LQICEYMRNAGMKIYSVAVS 348 ++++H KK+++F+TDG+N+ S+ + +T C+ ++ G++IY++A Sbjct: 308 AEDAAHKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFM 367 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 APP GQ LL+ C + +F +LL +F I K Q R+ Sbjct: 368 APPGGQALLQYCASDAAHYFQAEQMEDLLAAFKAIGAKASAQLTRLT 414 >gi|86359182|ref|YP_471074.1| hypothetical protein RHE_CH03592 [Rhizobium etli CFN 42] gi|86283284|gb|ABC92347.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 411 Score = 178 bits (450), Expect = 2e-42, Method: Composition-based stats. Identities = 91/404 (22%), Positives = 152/404 (37%), Gaps = 26/404 (6%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTAI+ V AI + ++ + Q+Q A + + T + Sbjct: 24 MTAILAPVLLGAAGMAIQVGDMLLSKQQLQEAA--DSAALATATALANGTIQTTEAEAFA 81 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 Q+ +L+ G+ I+ Q T Y Y + L Sbjct: 82 RNFVAGQMANYLQSGTDIKSTTSVNVQ------TTTSGKSTSYQVTVSPAYVLTVNPL-- 133 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + +LS T I S +IS+ + LD S SM + N+ + S Y Sbjct: 134 MQAVGFTTQHLSTSGTTIGGHSQTQGSISMFLALDKSGSMGED-TATVNEESPTESYTYD 192 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 K + KI+ L +AGNL + + A + VR G Sbjct: 193 CNLHYNTKNNKWV--YDKCTGSRTNYYTKIEALKMAAGNLFSQLNSA--DPNAQYVRTGA 248 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL-------YNEKE 293 ++Y I + L+ + V S +N L TN+ AM+ AY L + + Sbjct: 249 VSY--DINQYAPSSLAWGITGVSSYVNALQANGGTNSSGAMNTAYTSLTAKNAAGNDVEN 306 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT--LQICEYMRNAGMKIYSVAVSAPP 351 S+H KK+++F+TDG+N+ + + +T + C+ ++ G++IY++A AP Sbjct: 307 SAHQQKTGQVPKKYIVFMTDGDNNNDPSGGRSYDTATKKTCDDAKSKGIEIYTIAFMAPA 366 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 GQ LL C +F +LL +F I K Q R+ Sbjct: 367 GGQALLHYCASDDSHYFQAEKMEDLLAAFQAIGAKASAQLTRLT 410 >gi|218662625|ref|ZP_03518555.1| hypothetical protein RetlI_26027 [Rhizobium etli IE4771] Length = 389 Score = 173 bits (437), Expect = 6e-41, Method: Composition-based stats. Identities = 95/406 (23%), Positives = 166/406 (40%), Gaps = 30/406 (7%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 TAI+ V A+ + ++ + Q+Q A D+A L+ ++ + + + + Sbjct: 1 TAILAPVLLGAAGMAVHVGDMLLSKQQLQEAADSAALATATALANGKI--QTSEAEAYAR 58 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 Q+ +L+ G I+ G Q T N Y Y++ L Sbjct: 59 NFVAGQMANYLQSGVDIKSATGVSVQ------TNTSGNSTSYQVTVSPSYDLTVNPL--M 110 Query: 122 GLIPSALTNLSLRSTGIIERSSE---NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 + +LS T I S+ +IS+ + LD S SM + N+ + S Sbjct: 111 QAVGFTTQHLSTSGTTIGGGHSQTQGQGSISMYLALDKSGSMGED-TATVNEEDPTESYT 169 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 Y P + T + + A KI+ L +AGNL + + + VR Sbjct: 170 YPCNPHYNRKGKEVWDTCTG---SRANYYTKIEALKMAAGNLFAQL--SGADPNAQYVRT 224 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL-------YNE 291 G ++Y IV + L+ V S +N L TN+ AM AY L + Sbjct: 225 GAVSY--DIVQYAPSSLAWGAIGVSSYVNALQAGGGTNSSGAMSTAYLSLTAKNAAGNDA 282 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT--LQICEYMRNAGMKIYSVAVSA 349 ++S+H +K+++F+TDG+N+ S+ + +T C+ ++ G++IY++A A Sbjct: 283 EDSAHKLKSGQIPQKYIVFMTDGDNNNDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMA 342 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 PP GQ LL+ C + +F +L +F I K Q R+ Sbjct: 343 PPGGQALLQYCASDASHYFQAEKMEDLFAAFKAIGAKASTQVTRLT 388 >gi|255261929|ref|ZP_05341271.1| conserved hypothetical protein [Thalassiobium sp. R2A62] gi|255104264|gb|EET46938.1| conserved hypothetical protein [Thalassiobium sp. R2A62] Length = 478 Score = 170 bits (430), Expect = 4e-40, Method: Composition-based stats. Identities = 59/469 (12%), Positives = 114/469 (24%), Gaps = 113/469 (24%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 + + L A+DL R ++Q LD AVL+ + + + Sbjct: 41 FMFVLMLLTAGMALDLMRYETHRARLQGTLDRAVLAAA-------DLDQTLSPAAVVTDY 93 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 F K + T I ++ T L Sbjct: 94 FAKAGLSSF------------------LTSTTVDQGLNYRIISAQGNM---TMPTTFMRL 132 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 + +T S+ +++ V+D+S SM + T ++ P Sbjct: 133 SGQTELAIRGDATAEERVSNVEISL----VVDISGSMGRNNKLSTLRTASHTFIDTVIRP 188 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVL-----IESAGNLVNSIQKAIQEK----KNL 234 + + A ++ V A + Sbjct: 189 ETEDLISLNIIPYTAQVNAGPDIFDQLTVDQKHNFSHCIDFEPADFNTAALDVPPVSTRT 248 Query: 235 SVRIGTIAYNIGI-----------VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 ++ Y + P S + +KS + L NT + M Sbjct: 249 YKQMQHFQYGWSSSYVNNPGCPMQSYERIVPFSQDATSLKSTVTSLRARANTAIHLGMKW 308 Query: 284 -----------------AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 A ++ E K ++ +TDG+N + Sbjct: 309 GVSMLDPTFRPIVTAMIANNKVDPEFAGRPVAYNDPETLKTIVLMTDGQNVDTYRISDEF 368 Query: 327 NTL-------------------------------------------QICEYMRNAGMKIY 343 + IC+ + G+ ++ Sbjct: 369 YSTPSQIAHWDRYQLFFFTNNYIDRDIDQNYYYKKFTATQADTMLQSICDAAKAEGILVW 428 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 ++ + C S FF V EL E+F I +I + + Sbjct: 429 TIGFEVSNHAAGEMLDCASSPSHFFRVEGV-ELSEAFASIARQINQLRL 476 >gi|84502751|ref|ZP_01000870.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] gi|84389146|gb|EAQ01944.1| hypothetical protein OB2597_00965 [Oceanicola batsensis HTCC2597] Length = 470 Score = 169 bits (427), Expect = 8e-40, Method: Composition-based stats. Identities = 52/466 (11%), Positives = 118/466 (25%), Gaps = 103/466 (22%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + ++ +D+ R ++Q +DA+ L+ + K Sbjct: 36 LVMFMLLTMMTVAGIGVDVMRTEMERTRIQQVIDASTLAAAH-------KDNALDPKQVV 88 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 F K I E ++ T + Sbjct: 89 LDYFDKAALASYISADDI----------------LVGGGETSTAVEVNLTAQVKTPFI-- 130 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + + N+ R + +++ VLD+S SM+D + Sbjct: 131 -RHLGNESFNVPARGRAEQAYGNSEVSL----VLDISGSMDDNRRMSRLHRAANEFVDTV 185 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV-----LIESAGNLVNSIQKAIQEKKNLS 235 L P + +++V + + ++ Sbjct: 186 LTPDSVDRVSVSLIPYTGDVNVGWDIFSRMNVRQLHDYSYCVQFTPDDFSTTAIDPEDAY 245 Query: 236 VRIGTIA-------YNIGIV--GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 ++ + Y TP S N +++++N+L E T+ + + Sbjct: 246 IQGQHFSHVDARFNYISCPTQSYETVTPFSQNNAALEAQINRLTGRERTSIHIGIKWGAA 305 Query: 287 ELYNEKES----------------SHNTIGSTRLKKFVIFITDGENS------------- 317 L ++ K ++ +TDG N+ Sbjct: 306 MLDEAFRPLVNDLVDNSIVDEAFRDRPAPFTSNTLKVIVVMTDGMNTETKRIKEFAYDTP 365 Query: 318 ----------GASAYQN-------------------TLNTLQICEYMRNAGMKIYSVAVS 348 + IC + G+ IYS+ Sbjct: 366 DMRAHWARHAMDDWDNDVDGSVEDHLFDTYYDTAIGNALLQNICNAAKANGIIIYSIGFE 425 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + + C S F+ V ++ E+F I ++++ + + Sbjct: 426 INNDAAQEMEDCASSPSHFYRVEGV-QISEAFSSIAQQLKQLRLTL 470 >gi|254780833|ref|YP_003065246.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] gi|254040510|gb|ACT57306.1| hypothetical protein CLIBASIA_03630 [Candidatus Liberibacter asiaticus str. psy62] Length = 371 Score = 168 bits (425), Expect = 1e-39, Method: Composition-based stats. Identities = 84/398 (21%), Positives = 173/398 (43%), Gaps = 56/398 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQ- 59 +TAI++ V F+ + I+ +H +++ ++ LD ++L I++ + +K+ Sbjct: 20 LTAILLPVIFIVMGLVIETSHKFFVKAKLHYILDHSLLYTATKILNQENGNNGKKQKNDF 79 Query: 60 ----TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT 115 I++ + L++ + ++ +I + ++I D + Y + ++YE+P Sbjct: 80 SYRIIKNIWQTDFRNELRENGF-AQDINNIERSTSLSIIIDDQHK-DYNLSAVSRYEMPF 137 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 S+ L + S + S ++ + + MVLDVS SM D + Sbjct: 138 IFCTFPWCANSSHAPLLITS-SVKISSKSDIGLDMMMVLDVSLSMNDHF----------- 185 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 P K+ V S +++ I+ N Sbjct: 186 ----------------------------GPGMDKLGVATRSIREMLDIIKSIPDV--NNV 215 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES- 294 VR G + ++ IV PL+ + ++ ++N+L T + P + +AY ++++ KE Sbjct: 216 VRSGLVTFSSKIVQTF--PLAWGVQHIQEKINRLIFGSTTKSTPGLEYAYNKIFDAKEKL 273 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 H G KK++IF+TDGENS + + +L C + G +Y++ V A Q Sbjct: 274 EHIAKGHDDYKKYIIFLTDGENSSPNI--DNKESLFYCNEAKRRGAIVYAIGVQAEAADQ 331 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 L+ C +F++V +SR+L ++F +I ++ +Q + Sbjct: 332 -FLKNCAS-PDRFYSVQNSRKLHDAFLRIGKEMVKQRI 367 >gi|114764812|ref|ZP_01443994.1| hypothetical protein 1100011001322_R2601_10469 [Pelagibaca bermudensis HTCC2601] gi|114542698|gb|EAU45721.1| hypothetical protein R2601_10469 [Roseovarius sp. HTCC2601] Length = 477 Score = 168 bits (424), Expect = 2e-39, Method: Composition-based stats. Identities = 63/470 (13%), Positives = 125/470 (26%), Gaps = 110/470 (23%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M + + +F ID+ + R ++Q+ LD AVL+ Sbjct: 40 MAVALSLLMMIFGGIGIDMMYAELQRTKIQNTLDRAVLAAA------------------- 80 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + L + + ++ + + T Sbjct: 81 ------DLDNELDAQGVVEDYMSKMSLADALVSVNVDEGLNYRTVTADG---YRTMPSNF 131 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 LI S + + + MVLD+S SM+D L Sbjct: 132 MQLIGIENMQAGGHSQAMERI----NKVEVSMVLDISGSMDDGDKMAELQTAASDFVDTL 187 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV-----LIESAGNLVNSIQKAIQEKKNLS 235 L + S++ A ++V ++ A + Sbjct: 188 LDDGSEDLVSISLVPYSEHVNAGPEILSYLNVNYMHDDSYCLEMPNSAFNSAALDLSLTY 247 Query: 236 VRIGTIAYNI------------GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 ++ +N Q P S + +K+++++L P T+ + M Sbjct: 248 DQMQHFQWNYSGSNSLTDTVCPRYAYEQIRPWSQDAGALKTQISQLQPRAGTSIFMGMKW 307 Query: 284 AYRELYNE-----------------KESSHNTIGSTRLKKFVIFITDGE----------- 315 A L E T + K ++ +TDG+ Sbjct: 308 ASALLDPSTRPIASGMIADGTVDAVFEGRPVAYSDTDVLKTIVLMTDGQHDRSFRIQNWA 367 Query: 316 -------------------------NSGASAYQNTLNTLQ-------ICEYMRNAGMKIY 343 + +S Y + +C + G+ I+ Sbjct: 368 YNDENEVEHWSQYNLWHYLNYYVNSWNRSSFYYQKYDAATGDTLLSSVCTAAKRQGILIW 427 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S+ G +++ C S FF V E+ E+F I + + + Sbjct: 428 SIGFEVSDHGANVMESCASSPAHFFRVEGV-EISEAFSTIAQTLNQLRLT 476 >gi|260425757|ref|ZP_05779737.1| conserved hypothetical protein [Citreicella sp. SE45] gi|260423697|gb|EEX16947.1| conserved hypothetical protein [Citreicella sp. SE45] Length = 479 Score = 167 bits (422), Expect = 3e-39, Method: Composition-based stats. Identities = 62/471 (13%), Positives = 126/471 (26%), Gaps = 111/471 (23%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M ++ + +F ID+ + R ++Q+ LD AVL+ Sbjct: 41 MAVVLSMMMMIFGGLGIDMIYAELQRTKVQNTLDRAVLAAA------------------- 81 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + L+ + + +A + ++ T Sbjct: 82 ------DLDNELEAQGVVEDYMDKMALADALISVDVDEGLNYRTVVAEG---YKTMPSNF 132 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ + + +++ VLD+S SM+D + + T L Sbjct: 133 MQILGVDNLQAYGLAEATERINKVEVSL----VLDISGSMDDNDKLANMQDAAGTFIDTL 188 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPA-----NRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 L + S+ A + A + Sbjct: 189 LAEGNEDLVSISLVPYSEQVNAGPEILSYLSANWKHGYSHCIEMPNSVFGSAALDFSRTY 248 Query: 236 VRIGTIAYNIGIVGNQ-------------CTPLSNNLNEVKSRLNKLNPYENTNTYPAMH 282 ++ +N N S++ + +K+++N+L P T+ + M Sbjct: 249 EQMQHYQWNYDGYNNTLSDTVCPRYGYERIQAWSHDASALKAQVNQLQPRAGTSIFMGMK 308 Query: 283 HAYRELYNE-----------------KESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 L E T + K V+ +TDG++ + Q+ Sbjct: 309 WGTALLDPSTRPIASGMIARGSVDQVFEGRPVAYDDTDVLKTVVLMTDGQHDRSYRIQDW 368 Query: 326 LNTLQ-------------------------------------------ICEYMRNAGMKI 342 + IC + G+ I Sbjct: 369 AYNSESEYAHWNRYNLWYYLSRYVSSYERSSFYYQKYNADLGDALLGSICAAAKAQGIII 428 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +SV G D++ C S FF V E+ E+F I + + + Sbjct: 429 WSVGFEVGDHGADVMESCASSPAHFFRVEGV-EITEAFSTIAHTLNQLRLT 478 >gi|15891094|ref|NP_356766.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] gi|15159433|gb|AAK89551.1| hypothetical protein Atu3868 [Agrobacterium tumefaciens str. C58] Length = 412 Score = 167 bits (421), Expect = 3e-39, Method: Composition-based stats. Identities = 87/408 (21%), Positives = 158/408 (38%), Gaps = 33/408 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTAI++ V ++LA++M ++ MQ+ S + ++ +++ +Q Sbjct: 24 MTAILLPVLLGVAGAGMELANVMQVKADMQNT----ADSAALAAATEARLREGKLSDEQI 79 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I K I +++ E T + Y E+ +++I + Sbjct: 80 KEIAKNFIAAQMEKNLTAEEKIELEKNSPTRVTTTENARGKTYAVETTIKHQIQLNPM-- 137 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 G I + +LS+ T + + IS+ + LD S SM + Sbjct: 138 LGFIGAKTLDLSVTGTAKSTIN-KGAPISMYLALDRSGSMSFKTDTVDTTKTS------- 189 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE----KKNLSV 236 WSK +K +P K L + G LV ++ KA + V Sbjct: 190 -CQNYTSDNWSKYPNLAKTSPC---YVNKAASLKTAVGFLVATLNKADPTYTVNGGSELV 245 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKE 293 R G Y Q + V S ++K P T+ +++ AY L Sbjct: 246 RTGASVYTHETYVAQSIG--WGTSGVTSYVDKQIPEFPSGGTDARSSLNAAYNALKKANP 303 Query: 294 SS---HNTIGSTRLKKFVIFITDGENSGASAYQNTL---NTLQICEYMRNAGMKIYSVAV 347 H GS +++++ +TDGE +G SA N+ + CE + G+KI+SVA Sbjct: 304 DEARYHKEKGSESFERYIVLMTDGEMTGNSAAWNSSIDQSVRTTCETAKKDGIKIFSVAF 363 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 AP +G+ LL+ C S+ ++A + +++ +F +I K + Sbjct: 364 MAPDKGKSLLQYCASSADNYYAPENMEQIVTAFGEIARKAAGSIATLT 411 >gi|254460794|ref|ZP_05074210.1| conserved hypothetical protein [Rhodobacterales bacterium HTCC2083] gi|206677383|gb|EDZ41870.1| conserved hypothetical protein [Rhodobacteraceae bacterium HTCC2083] Length = 480 Score = 165 bits (417), Expect = 1e-38, Method: Composition-based stats. Identities = 61/471 (12%), Positives = 125/471 (26%), Gaps = 111/471 (23%) Query: 1 MTA-IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQ 59 + A ++ + +DL R +Q LD A+LS + T + Sbjct: 42 IFAVFMVLMILTIGGIGVDLMRSERDRTVLQHTLDRAILSAA-------DLDQTQTPQAV 94 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 F+ + + + A+ Sbjct: 95 VDDYFETAGLESFLSNVTVDQGINYKTVGAEAQSITT---------------------TA 133 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 + N + ++ ++ MVLD+S SM + + Sbjct: 134 FMKMAGVDTLNATAAGVAEERIANVEIS----MVLDISGSMGIGSKMTQLRSAATSFVNT 189 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV-----LIESAGNLVNSIQKAIQEKKNL 234 +L P + S++ A +++ ++ + + Sbjct: 190 VLSPENEDLVSVSLVPYSQHVNAGPKIYNELNTNHRHNYSHCVEMADSAYSETELDLSVT 249 Query: 235 SVRIGTIAYNI------------GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMH 282 ++ +N + T S + + + +++ +L P T + M Sbjct: 250 YDQMQHFQWNYSGANQLTDTICPRYSYERITAFSQDASALNAQIAQLQPRAGTQIFMGMK 309 Query: 283 HAYRELYNEK-----------------ESSHNTIGSTRLKKFVIFITDGEN--------- 316 A L ++ T K V+ +TDG+N Sbjct: 310 WAAAMLDPAFNPVVNALVTSNDIDSVFDNRPAAFDDTETLKTVVLMTDGKNSSSMRIKSW 369 Query: 317 ------SGASAYQNTLNTL----------------------------QICEYMRNAGMKI 342 + L IC ++AG+ I Sbjct: 370 AYDSSSDYYHWSRYNLWYYLRRNVNRHYHSRYYWFTHDAAQGDALLDDICNASKDAGIVI 429 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +S+ G D++ C S FF V E+ E+FD I +I + + Sbjct: 430 WSIGFEVDDHGADVMANCASSPSHFFRVEGI-EISEAFDAIARQINQLRLT 479 >gi|227823417|ref|YP_002827390.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] gi|227342419|gb|ACP26637.1| hypothetical protein NGR_c28930 [Sinorhizobium fredii NGR234] Length = 413 Score = 164 bits (414), Expect = 2e-38, Method: Composition-based stats. Identities = 94/397 (23%), Positives = 163/397 (41%), Gaps = 33/397 (8%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHL 72 +ID+A+++ +NQ+Q A DAA L+ +++VSD D K+ K Q Sbjct: 37 GGVSIDMANMLMTKNQLQDATDAAALAAASALVSDE-QPDIAAAKEIARKFLKTQAGGTT 95 Query: 73 ------------KQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 G+ D ++NIT+ N I + + TE + Sbjct: 96 TPDAPADSGEGASSGAASSTPDWDDVNTLEVNITETPNGTKGKIFQVTVINKRVTEFNAM 155 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L+ + L ST S+N A+S+ +VLD S SM + + + Sbjct: 156 TRLLGTDSIELEASSTAESATESKN-ALSMYLVLDRSGSMA--WKTNTINAAKKS----- 207 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 P +S WS+ +P KID L + +L+ + + + + VR Sbjct: 208 -CPNYTESNWSRYPNLWASSPC---YVTKIDALKTAVTDLLAQL--LVADPDQIYVRTAA 261 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES-SHNTI 299 I+Y V + L+ + + +N L T + A AY+++ E+ +H Sbjct: 262 ISY--NSVQDTAGTLAWGTSGAAAYVNALVATGGTASAGAFKTAYQKVIAATENTAHAAK 319 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 K+++F+TDGEN+ + T Q C+ + ++IYSVA AP GQ LL+ Sbjct: 320 NGQVPSKYMVFMTDGENN---YANDDTVTKQWCDTAKANKVEIYSVAFMAPERGQALLKY 376 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 C SS +F + +L+ +F I ++ R+ Sbjct: 377 CASSSSHYFEAEEVTDLVAAFKAIGERAAAVVSRLTK 413 >gi|315122473|ref|YP_004062962.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495875|gb|ADR52474.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 403 Score = 164 bits (414), Expect = 2e-38, Method: Composition-based stats. Identities = 129/401 (32%), Positives = 217/401 (54%), Gaps = 29/401 (7%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M+A II VC +F+++ ID+ H+++++N +QS+LD A++SGC+ +VSD I D ++++ Sbjct: 27 MSASIIFVCLIFVSFVIDITHLLHMKNHIQSSLDNAIISGCSIVVSDPKINDLNPQEERI 86 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + KK ++ Q + E+A I + A I+ +KD N +Y +A++++ +N L Sbjct: 87 RDVIKKNAYVNMVQN-FPAEHAAYIIENANISFSKDLTNKYEYKITMEAKHQLSGKNFIL 145 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L+P+ +T++S STGII++ S+ A S+ MVLD S SM D Sbjct: 146 GFLMPNVITHISSISTGIIQKPSDKKAFSVEMVLDCSGSMLDSM---------------- 189 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 S Y+ KI L ++ + VN IQ+ +Q +S RIG Sbjct: 190 ---QESCDLSSGRGGYYFYSKNNNKPKSKIYALKTASSDFVNLIQETVQTFPQISARIGL 246 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY--NEKESSHNT 298 I +N I+ + + LSNN N +K ++++ P T+T+ M+ AY L + +HN Sbjct: 247 ITFNHYIMQD--SKLSNNFNVIKKTISRMKPKGGTDTFLPMNAAYEYLNNIPNETKAHNI 304 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP--EGQDL 356 + LK+++I +TDGEN+ S T+N +C+ R G+ IYS+ ++ +G +L Sbjct: 305 SDNVPLKRYIILMTDGENNHPSYDLKTIN---VCDNARKNGIIIYSIFLNYYEYTDGYEL 361 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 RKC S FF N+++ LL+SF I IQ+++VRIA N Sbjct: 362 ARKCASSEKHFFYANNTKALLDSFKSIAHAIQDKAVRIASN 402 >gi|163747459|ref|ZP_02154811.1| hypothetical protein OIHEL45_00415 [Oceanibulbus indolifex HEL-45] gi|161379312|gb|EDQ03729.1| hypothetical protein OIHEL45_00415 [Oceanibulbus indolifex HEL-45] Length = 476 Score = 163 bits (412), Expect = 4e-38, Method: Composition-based stats. Identities = 57/464 (12%), Positives = 114/464 (24%), Gaps = 116/464 (25%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHL 72 +DL R ++Q+ D AVL+ + + + F K Sbjct: 45 GGVGVDLMRHERERARVQAVADRAVLAAA-------DLDQTLSPEAVARDYFDKSGLADY 97 Query: 73 KQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS 132 I+ + A ++ T + ++ Sbjct: 98 ------------------ISSVTVEEGLNYRRVTVDASRDLKTMFI---DKFGQEKLHVP 136 Query: 133 LRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSK 192 ++T + + I MVLD+S SM + + + + ++ + Sbjct: 137 AKATA----EEKVAKVEISMVLDISGSMRENDKMNNLHDASNVFIDTVIQTDTEDLISIS 192 Query: 193 NTTKSKYAPAPAPANR-----KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI---AYN 244 + ++ + + + Y Sbjct: 193 VVPYTAQVNVGKDIMDELNVTQLHSYSHCVDFEDSDFNLTTISQTRSYEHMQHFEAGYYW 252 Query: 245 IGI-------------------VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 G + S N +KSR+ P NT + + Sbjct: 253 NGNDRDRTGHYDNISNPGCPKQSYEEIETFSQNAAALKSRIANFQPRANTAIHLGLKWGV 312 Query: 286 RELYNE-------------KESSHNTIGSTRLKKFVIFITDGEN---------------- 316 L K VI +TDG N Sbjct: 313 ALLDPSFRAINEAIGGDAVFRGRPAEYNDIDTLKTVILMTDGVNVTTRRIAPEAYSNRDH 372 Query: 317 ---------------------------SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + +A Q IC+ + G+ I+S+ Sbjct: 373 YRHWSDYPFYWWLGRNVRSSEHYRWYRTKYTAGQADNLLDNICDAAKAKGIVIWSIGFEV 432 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G +++ C S FF V E++++F+ I +I + + Sbjct: 433 TDHGAAVMKNCASSDSHFFRVEGV-EIVDAFEAIARQINQLRLT 475 >gi|114798549|ref|YP_759188.1| hypothetical protein HNE_0458 [Hyphomonas neptunium ATCC 15444] gi|114738723|gb|ABI76848.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 460 Score = 161 bits (406), Expect = 2e-37, Method: Composition-based stats. Identities = 64/441 (14%), Positives = 125/441 (28%), Gaps = 50/441 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ I +AID + ++Q A+D+AVL+ S+ + + + Sbjct: 19 IAALTIIPIVGIAGFAIDFQVTTTQKARVQQAVDSAVLAATKSMQDGKDRAYSLKEANDY 78 Query: 61 STIFKKQIKKH--------LKQGSYIRENAGDIAQKAQINITKDKN-NPLQYIAESKAQY 111 Q L E G + ++K L + S A Y Sbjct: 79 FKGILNQSNNSGLNCTNIDLVYIDETEELEGHVECSQNTTLSKVAGIRHLDFNVSSAATY 138 Query: 112 -----EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 EI + + + + DV +M Sbjct: 139 GIGKLEIAFVFDVSGSMANDNRMGNLKVAAREAVNTLLPVEGYAGDPEDVRLAMVSYDTM 198 Query: 167 ----------KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE- 215 + D + + N T ++ + + Sbjct: 199 VNAGPYFKAVTNQDPERTEPFYGYIRERTTCRRYRNNGTCREWNYEWRGPYHRSYTIKST 258 Query: 216 SAGNLVNSIQKAIQEKKNLS----VRIGTIAYNIG----------IVGNQCTPLSNNLNE 261 + + + V +YN N PL+ N N+ Sbjct: 259 CVWEREGAERYTDASPGHNRWLPPVSATFDSYNDSWSTDHQTDPWCNDNTPIPLTYNRNK 318 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT------IGSTRLKKFVIFITDGE 315 + ++ + P NT + + + E S K VI ++DG+ Sbjct: 319 LHDFIDDMTPRRNTAGHIGQAWGWYLVSPEWNSVWPAGSKALPYDEPDATKVVIMMSDGQ 378 Query: 316 NS----GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 + + + IC+ M+ + IY+V A GQD+L C + + Sbjct: 379 YNETRHNNAYPSSVTQAEAICDKMKEKEVVIYTVGFDAG-YGQDVLNYCASNPAFAYKPT 437 Query: 372 DSRELLESFDKITDKIQEQSV 392 + +EL E++ I I + + Sbjct: 438 NGQELTEAYKSIARSISDLRI 458 >gi|332716587|ref|YP_004444053.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] gi|325063272|gb|ADY66962.1| hypothetical protein AGROH133_12352 [Agrobacterium sp. H13-3] Length = 412 Score = 160 bits (405), Expect = 3e-37, Method: Composition-based stats. Identities = 81/408 (19%), Positives = 157/408 (38%), Gaps = 33/408 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTAI++ V F ++LA++M ++ +Q+ S + ++ +K+ +Q Sbjct: 24 MTAILLPVLLGFAGAGMELANVMQVKADLQNTA----DSAALAAATEARLKEGALTDEQI 79 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I K I +++ E T D Y ++ Y++ L Sbjct: 80 KEIAKAFIASQMEKTLTEEEKKALEKNSPVNIGTTDDARGKTYTIQTTINYQMQLNPLL- 138 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 G + +L+ T + ++ IS+ +VLD S SM + + + Sbjct: 139 -GFFGAKTLDLAATGTA-VSTVNKGAPISMYLVLDRSGSM--SFKTDTLNTKKTS----- 189 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE----KKNLSV 236 W +P K L + G LV ++ KA + V Sbjct: 190 -CQNYTVDNWGSYPNLKNTSPC---YVNKATSLKTAVGYLVATLNKADPTYTANGGSELV 245 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKE 293 R G Y Q P++ + V + ++K P T+ +++ AY L Sbjct: 246 RTGASVYTHETYAAQ--PITWGTSSVATYVDKQIPEFPSGGTDARSSLNAAYNALKKANT 303 Query: 294 ---SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI---CEYMRNAGMKIYSVAV 347 H S +++++ +TDGE +G S+ ++ + C+ + G+KI+SVA Sbjct: 304 VEAKEHKDKKSESFERYIVLMTDGEMTGNSSSWSSSIDQTVRNTCDTAKKDGIKIFSVAF 363 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 AP +G+ LL+ C S ++A + +++ +F +I K + Sbjct: 364 MAPDKGKSLLQHCASSLDNYYAPENMEQIVTAFGEIARKAAGSLATLT 411 >gi|218887819|ref|YP_002437140.1| von Willebrand factor A [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218758773|gb|ACL09672.1| von Willebrand factor type A [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 406 Score = 159 bits (402), Expect = 6e-37, Method: Composition-based stats. Identities = 55/435 (12%), Positives = 130/435 (29%), Gaps = 74/435 (17%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+++ V ID + N++Q A+DAA L+G + D + Sbjct: 4 LMAVLLPVVLGLAGLGIDSGMLYLAHNRLQGAVDAAALAGSLELPYDPQLDKG------- 56 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + K + +++ G + ++T + Sbjct: 57 --LVKGAVNQYMAANYPAAVLKGVTPGTEERSVTVKAEATVD---------------TIF 99 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 G + + + ++T + + V+D + SM+ +Q+ N + + Sbjct: 100 MGALGIGSSTVRAQATAGYNN------LEVVFVIDNTGSMKGTAIQQANAAATQLAELIM 153 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + K PA + D + G L S ++ G+ Sbjct: 154 PDGMETSVKVGLVPFRGK-VHIPAGVDGLADGCRNADGTLAPSWILEEYKQTKYRYPTGS 212 Query: 241 ---IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE---NTNTYPAMHHAYRELYNEKES 294 + + L++N + S + K + T + L E Sbjct: 213 SLNVPKGTCDSIPRVQALTSNRTTIVSAIAKQDALGDASGTVISEGIKWGRHVLTPEAPF 272 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQN------------------------------ 324 + + ++K +I +TDG+ N Sbjct: 273 TQGSSN-KDMRKVMIVLTDGDTEDGKCGGNYALNYTPNAYWTNAYYGMFDMNTHCENGGK 331 Query: 325 -TLNTLQICEYMRNAGMKIYSVAVS-APPEGQDLLRKCTD----SSGQFFAVNDSRELLE 378 L + ++ G++I+++ + L++ + ++ + +L E Sbjct: 332 LNAAMLSEAQIAKDKGIEIFAIRYGDSDSTDISLMKAIASSKAGTDDHYYNAPSAYDLEE 391 Query: 379 SFDKITDKIQEQSVR 393 F KI ++ + +R Sbjct: 392 IFKKIGRQLGWRLLR 406 >gi|323700353|ref|ZP_08112265.1| von Willebrand factor type A [Desulfovibrio sp. ND132] gi|323460285|gb|EGB16150.1| von Willebrand factor type A [Desulfovibrio desulfuricans ND132] Length = 400 Score = 159 bits (401), Expect = 9e-37, Method: Composition-based stats. Identities = 51/432 (11%), Positives = 134/432 (31%), Gaps = 73/432 (16%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+++ V A+D+ ++ ++Q+A+DA L+G + D + K + Sbjct: 1 MALLLPVLLGVAGIAVDMGNMYMTHTRLQAAVDAGALAGSLELPYDPDL-----SKGIVT 55 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 ++ ++++ +T+ + AQ E+ + L Sbjct: 56 QAVNDMVETNMEEAV----------------VTEISAGTEIRSVKVTAQAEVR---MLLM 96 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 ++ A + + + + + V+D S SM+ + + L+ Sbjct: 97 EVLGMADKTVEASAMAGFNK------LEVVFVIDNSGSMKGTPI-DLVKQASEELTDLLI 149 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN--LSVRIG 239 P + + + + G+L I + ++ N Sbjct: 150 PDGTTPDTKVGLVPFRGKIRLGEAVDGYAEGCVNADGSLNTGINEEFMDEYNALPYYYKR 209 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE---NTNTYPAMHHAYRELYNEKESSH 296 I + PLS N + + + + T + L + + Sbjct: 210 YITLDTCSDIPTVLPLSKNKSTIIAAIGSQTATGAASGTVISEGIKWGRNILTPDAPFT- 268 Query: 297 NTIGSTRLKKFVIFITDGE----------------NSGASAYQN---------------T 325 +K +I +TDG+ N+ + Sbjct: 269 QAGSKEDFRKIMIVLTDGDTEDGECGGTYRATYRPNNYWTNAYYGMGVDTAHCNDGGVLN 328 Query: 326 LNTLQICEYMRNAGMKIYSVAVS-APPEGQDLLRKCTD----SSGQFFAVNDSRELLESF 380 + L + ++AG++I+S+ + +L+++ + +F ++ + F Sbjct: 329 ADMLSEAQLAKDAGIEIFSIRFGSSDTTDINLMKEIASSKAGTDDHYFDAPSVYDIPDIF 388 Query: 381 DKITDKIQEQSV 392 +I ++ + + Sbjct: 389 KQIGKQLGWRLL 400 >gi|254780388|ref|YP_003064801.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] gi|254040065|gb|ACT56861.1| hypothetical protein CLIBASIA_01365 [Candidatus Liberibacter asiaticus str. psy62] Length = 458 Score = 158 bits (398), Expect = 2e-36, Method: Composition-based stats. Identities = 82/446 (18%), Positives = 175/446 (39%), Gaps = 62/446 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +TA+++ V +D+ Y + ++ A A+++ ++ +++++ +++ + Sbjct: 25 ITALLMPVMLGVGGMLVDVVRWSYYEHALKQAAQTAIITASVPLI--QSLEEVSSRAKNS 82 Query: 61 STIFKKQIKKHLKQG--SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 T K++I+++L + + +++N D + + T + NP + + Sbjct: 83 FTFPKQKIEEYLIRNFENNLKKNFTDREVRDIVRDTAVEMNPRKSAYQVVLSSRYDLLLN 142 Query: 119 FLKGL-----IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH----- 168 L I S L + + + +SI V+D SRSM D Sbjct: 143 PLSLFLRSMGIKSWLIQTKAEAETVSRSYHKEHGVSIQWVIDFSRSMLDYQRDSEGQPLN 202 Query: 169 ---------NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP-------------- 205 + + + K + + Y P P Sbjct: 203 CFGQPADRTVKSYSSQNGKVGIRDEKLSPYMVSCNKSLYYMLYPGPLDPSLSEEHFVDSS 262 Query: 206 ----ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 +K ++ ++ +++ SI+K + N +VR+G +N ++ + S +++ Sbjct: 263 SLRHVIKKKHLVRDALASVIRSIKKI--DNVNDTVRMGATFFNDRVISD--PSFSWGVHK 318 Query: 262 -----VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS-HNTIGSTRLKKFVIFITDGE 315 VK+ N +T AM AY + + E H + KK+++ +TDGE Sbjct: 319 LIRTIVKTFAIDENEMGSTAINDAMQTAYDTIISSNEDEVHRMKNNLEAKKYIVLLTDGE 378 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD----LLRKCTDSSGQFFAVN 371 N+ Q+ + IC ++ G++I ++A S Q+ L C FF N Sbjct: 379 NT-----QDNEEGIAICNKAKSQGIRIMTIAFSVNKTQQEKARYFLSNCAS-PNSFFEAN 432 Query: 372 DSRELLESF-DKITDKIQEQSVRIAP 396 + EL + F D+I ++I E+ +RI Sbjct: 433 STHELNKIFRDRIGNEIFERVIRITK 458 >gi|222149754|ref|YP_002550711.1| hypothetical protein Avi_3756 [Agrobacterium vitis S4] gi|221736736|gb|ACM37699.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 437 Score = 157 bits (397), Expect = 2e-36, Method: Composition-based stats. Identities = 72/400 (18%), Positives = 150/400 (37%), Gaps = 23/400 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTA+++ V A+D ++ R+ +QS++ + A+ + Sbjct: 53 MTAVLLPVSIGVAGLAMDATEMVQSRSALQSSV---DAAALAAASAMSNGMSEADAIALA 109 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + Q+ + + + I++ + N + + L Sbjct: 110 KSFLSSQLANTMARDENTSSVDQITQAEPDISVKTTQVNSSSTSYDVELTGSYTITMNPL 169 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ L ++ +S+ +VLD S SM N+ + Sbjct: 170 SRVLGWETVTLKAYGKAQAATTASESPLSMYLVLDRSGSM----------NDETATTYTG 219 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 S + N + + + KI+ L + +L ++KA + + VR G Sbjct: 220 TCTKTTTSGYGWNKKTTTTSYSCTKNYTKIESLKLAVADLAAQLKKA--DPNSEYVRTGA 277 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 +YN + +S V + +N L+ T+ A+ AY L ++ Sbjct: 278 DSYNASA--DTAQAMSWGTANVVTYVNALSATGGTDARGALSAAYSALQTSNKTEITAHN 335 Query: 301 STRLKK---FVIFITDGENSGASAYQNTLNTLQI---CEYMRNAGMKIYSVAVSAPPEGQ 354 + + K +++F+TDGE +G S+ ++ + C ++ G++IY+VA AP G+ Sbjct: 336 VSSVSKIGRYIVFMTDGEMTGNSSSWSSSIDSAVRSQCTSIKADGIQIYTVAFMAPANGK 395 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 LL C + ++ D+ L+ +F +I K S R+ Sbjct: 396 SLLSACASDASHYYEATDAASLVAAFGEIGKKATSTSTRL 435 >gi|118591415|ref|ZP_01548813.1| hypothetical protein SIAM614_27248 [Stappia aggregata IAM 12614] gi|118436087|gb|EAV42730.1| hypothetical protein SIAM614_27248 [Stappia aggregata IAM 12614] Length = 474 Score = 156 bits (394), Expect = 5e-36, Method: Composition-based stats. Identities = 61/463 (13%), Positives = 143/463 (30%), Gaps = 90/463 (19%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +++ + + ID++ + R ++ A+DAA LS A + + +Q Sbjct: 28 FGLMVVLIVVIAGITIDVSRTVNAREKLSFAIDAAALSVAADLSTSV------MSDEQIK 81 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 K +L ++ E +++ + +N ++ + + + Sbjct: 82 AALADSFKANLADVEFLDEAIKNLSF-----VVDAENGTIKVSSFATLDNYFIDMGGYGM 136 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + N S + + + +V+DV+ SM + + + L+ Sbjct: 137 QALGPETFNFGTSSQ----VTYSRFDVELALVVDVTGSMRNDM--DTLRDASKGLVNILI 190 Query: 182 PPP----PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-V 236 P K S A K V + ++ + ++ + +++ V Sbjct: 191 PETTEEADSKVRISLVPYSQGVNLGTYAAKVKGGVYGYADSSVCVTERQDYDDGEDIYKV 250 Query: 237 RIGTIAYNI-------------------GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 R + YN ++ PL+ + + + + L+ T Sbjct: 251 RYTDMPYNYYVKTDPPPKDVFYGGGSNRCSGTSKMIPLTADRDTLLDAIADLDDNGGTAG 310 Query: 278 YPAMHHAYRELYNEKESSHN------TIGSTRLKKFVIFITDGENSG------------- 318 + + + + + KF I +TDG+N+ Sbjct: 311 QTGVVWGWNSISPNYSDVWPLASKPEPYDNDDVLKFAIIMTDGDNNRFYEFVKEREECDW 370 Query: 319 ---------------------------ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + ++ +C+ M++ G+ I+ V Sbjct: 371 VYSRRYGWQWTCEMVSVNQWQERSESESYNNNSSKAQRALCQAMKDEGISIFGVYFGTND 430 Query: 352 E--GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 G ++ C + G ++ S EL+ +F I KIQ+ V Sbjct: 431 SSAGSKNMQSCAST-GNYYKATSSDELINAFANIAKKIQQIYV 472 >gi|254486311|ref|ZP_05099516.1| conserved hypothetical protein [Roseobacter sp. GAI101] gi|214043180|gb|EEB83818.1| conserved hypothetical protein [Roseobacter sp. GAI101] Length = 476 Score = 155 bits (392), Expect = 1e-35, Method: Composition-based stats. Identities = 59/469 (12%), Positives = 115/469 (24%), Gaps = 116/469 (24%) Query: 8 VCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQ 67 + + +D R ++Q + + Sbjct: 40 MMVMVGGIQLDFMRHEMERTKLQ-------------------------AAADRAVLAAAD 74 Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA 127 + + L + E ++ +N KA E+ T+ L G Sbjct: 75 LDQTLAPADVVDEYFAKSGMSDYLSSVTIENGLNFRTVTVKANNEMKTQFL---GRFGFP 131 Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 ++ S +++ VLDVS SM++ + T +L P K Sbjct: 132 TLDVPALSKAEERVEKVEISL----VLDVSGSMKNNSKLTTMKDAAKTFIDTVLRPETKN 187 Query: 188 SFWSKNTTKSKYAPA-----PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 + S+ A + + Sbjct: 188 NVSLSLIPYSEQVNVGPDIFNALWVDTRHDFSYCIDVPDGHFVQTQMTPGFPWDQTQHFQ 247 Query: 243 YNI------------------GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 +N V + P+S + +K++++ P T Y M Sbjct: 248 WNTYSIESGYQQNTLHDTVCPRAVYERVRPISQDGPSLKAQIDLFQPRAGTAIYMGMKWG 307 Query: 285 YRELYNEKE-----------------SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L K ++ +TDG+NS + Sbjct: 308 TALLDPSFRETTASLVSDSVVESTFADRPADYSDRETLKTIVLMTDGQNSNSQRISTAYY 367 Query: 328 TLQ-------------------------------------------ICEYMRNAGMKIYS 344 IC ++ G+ I++ Sbjct: 368 NSSSEVVHWSKWNFNYYLSQYIKEKDWHRYYYTRYTAEKGNTLMDNICSAAKDEGIVIWT 427 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + G D+++KC S FF V EL ++F I +I + + Sbjct: 428 IGFEVNDTGADVMKKCASSPSHFFRVEGV-ELTDAFSAIASQINQLRLT 475 >gi|114705525|ref|ZP_01438428.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] gi|114538371|gb|EAU41492.1| Flp pilus assembly protein TadG [Fulvimarina pelagi HTCC2506] Length = 461 Score = 154 bits (388), Expect = 3e-35, Method: Composition-based stats. Identities = 55/463 (11%), Positives = 121/463 (26%), Gaps = 76/463 (16%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCAS-----------------I 44 A+ I L + +D+ + +Q A+D A L + Sbjct: 1 MALAILPMLLAVGGTVDVGRQSSLATDLQEAIDIAALHIAKAPSDAIPGEEDVLQLIKSN 60 Query: 45 VSDRTIKDPTTKKDQTSTIFKKQIKKHL----------------KQGSYIRENAGDIAQK 88 ++ + + K D T + + RE G+I Sbjct: 61 ITTKDSRIALKKLDVTEKDVSLHATAEITPFFLGLAGIKNLTAQRATKTAREARGEIEVA 120 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 ++ T + + L T +++ Sbjct: 121 LVLDTTWSMSEKDSSGKSRLDSLKGAAAKLVDTIFTEDGKTRVAVVPYADYVNVGTQ--H 178 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF----------------WSK 192 LDV S ++ + + P + + S+ Sbjct: 179 RNQSWLDVPPSYSTTPSERRCETRTTRTQCTSYAPTYQCTRTVDGVSESTTCGGGCTSSE 238 Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 + Y + + + ++ +R Sbjct: 239 TVQVAPYEYCTGGGSSRTYDWYGCVASR--TVGDYRLTDARPDIRYPGFLGTSRECPGPL 296 Query: 253 TPLSNNLNEVKSRLNKLNPYEN-----TNTYPAMHHAYRELYNEKESSHNTI--GSTRLK 305 LS +VK+ ++ L+ T + L + + Sbjct: 297 LSLSTREADVKTSISNLSYGGGGYRPSTFIPAGLIWGLNVLSPPAPFEEQAYDPNNKLPR 356 Query: 306 KFVIFITDGENSGASAYQN---------------TLNTLQICEYMRNAGMKIYSVAVSAP 350 K ++ +TDG N+ + +T+ IC ++ +G++I++V Sbjct: 357 KALVLMTDGANTMVFNSSDGRHRNARSGTEVAQSDRDTISICNNIKRSGIEIFTVGFMVN 416 Query: 351 PEGQ-DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 DLL++C +F EL +F +I D + + + Sbjct: 417 SSSALDLLKECATDGEHYFDATSPEELHSAFGRIADGLTQIRL 459 >gi|218515283|ref|ZP_03512123.1| hypothetical protein Retl8_17130 [Rhizobium etli 8C-3] Length = 329 Score = 153 bits (387), Expect = 4e-35, Method: Composition-based stats. Identities = 84/344 (24%), Positives = 137/344 (39%), Gaps = 25/344 (7%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 Q+ +L+ G I+ G Q T N Y Y++ L Sbjct: 1 RNFVAGQMANYLQSGVDIKSATGVTVQ------TNTSGNSTSYQVTVSPSYDLTVNPL-- 52 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + +LS T I S +IS+ + LD S SM + N+ + S Y Sbjct: 53 MQAVGFTTQHLSTSGTTIGGHSQTQGSISMYLALDKSGSMGED-TATVNEEDPTESYTYD 111 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 K T + + A KI+ L +AGNL + A + VR G Sbjct: 112 CNGHYNKKGKWIYDTCTG---SRANYYTKIEALKMAAGNLFGQLSSA--DPNAQYVRTGA 166 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL-------YNEKE 293 ++Y IV + L+ + V + +N L TN+ AM AY L + ++ Sbjct: 167 VSY--DIVQYTPSALAWGTSGVSTYVNALQAGGGTNSSGAMSTAYSSLTAKNAAGNDAED 224 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT--LQICEYMRNAGMKIYSVAVSAPP 351 ++H KK+++F+TDG+N+ S+ + +T C+ ++ G++IY++A AP Sbjct: 225 AAHKLKTGQTPKKYIVFMTDGDNNDDSSGGRSYDTLTKATCDTAKSKGIEIYTIAFMAPE 284 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 GQ LL C +F +LL +F I K Q R+ Sbjct: 285 GGQALLHYCASDDSHYFQAEKMEDLLAAFKAIGAKASSQLTRLT 328 >gi|269105138|ref|ZP_06157832.1| protein TadG associated with Flp pilus assembly [Photobacterium damselae subsp. damselae CIP 102761] gi|268160588|gb|EEZ39087.1| protein TadG associated with Flp pilus assembly [Photobacterium damselae subsp. damselae CIP 102761] Length = 436 Score = 153 bits (386), Expect = 4e-35, Method: Composition-based stats. Identities = 62/439 (14%), Positives = 122/439 (27%), Gaps = 66/439 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI+I V F T A D A + + +++ A +AA L+ A + + Sbjct: 14 LFAIMIPVLFGIFTLASDGARAIQTKARIEDATEAASLAIAAHNDPNVNSDGLGSGSKVN 73 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I +K ++ I I N + +YE+ Sbjct: 74 RRIATDYLKAYITDIDSISSLKIYRRNCEDIPECSSGLNKGKSRF---FEYEVEALTTQN 130 Query: 121 KGLIPSALTNLSLRSTGIIERS----SENLAISICMVLDVSRSMEDLYLQKH-------- 168 + + + + S ++ A+ + D S+SME+ + Sbjct: 131 SWFPGNNVISGFGDTFSTRGHSLARKYQSEAVDVVFAADFSKSMEEPWTGGRQKYKDLVR 190 Query: 169 ---------------NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 N + N + P ++ + S + D Sbjct: 191 VINDVTSELEKFNNINIADKKNQNTIGISPYNSNTYSKFDNYNSCFMKQDYFEKNSRDHR 250 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + ++ ++ EK N S + L+N+ + + K P Sbjct: 251 KKKYVDIKRTLNNIFIEKGNDS---CGFKSDDPDAVFHDIYLTNDFDTFNKEIRKFRPGN 307 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T + + + + L T ++ +I I+DG + +I Sbjct: 308 GTASCQGIIRSAQML----------RKGTNSRRLLIIISDGNDWYYPYSGYKETDKEIAN 357 Query: 334 YMRNAGM---------------------KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 + NAGM +I + L C F Sbjct: 358 KLVNAGMCNKIRETLNLDKTPSGQEIKTRIAVIGFDYDANKNKALLNCA-GEDNVFKAQY 416 Query: 373 SRE-LLESFDKITDKIQEQ 390 E L + IT++I Sbjct: 417 RDELLDQILSLITEEIGHL 435 >gi|254501086|ref|ZP_05113237.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] gi|222437157|gb|EEE43836.1| hypothetical protein SADFL11_1122 [Labrenzia alexandrii DFL-11] Length = 465 Score = 153 bits (385), Expect = 5e-35, Method: Composition-based stats. Identities = 60/461 (13%), Positives = 128/461 (27%), Gaps = 86/461 (18%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A ++ V + AID++ + R ++ A+DAA LS + + + Q Sbjct: 20 IFAGMVLVLVVIGGAAIDISRAVNAREKLAYAIDAAALSVATDLSTTVLRDN------QI 73 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 T + + +L ++ + ++ + + + L + Sbjct: 74 KTRIENSFRANLSDAEFLDQAIDNLDF---------DVDSNAGTVTVSSSAGLNNYFLNI 124 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 G L E + + + +V+DV+ SM + L Sbjct: 125 PGFGKDGLGPDVFNFGTSAEVNYSRFDVELALVVDVTGSMAGDM--GALRDAAEEVVDIL 182 Query: 181 LPPPPKKS----FWSKNTTKSKYA--PAPAPANRKIDVLIESAGNLVNSIQKAI----QE 230 + S S + + + Sbjct: 183 IEDDASNSASKVRISLVPYSQGVNLGSYASTVTNGSTSWRNCVNEREGQQKYTDAVYNYD 242 Query: 231 KKNLSVRIGTIAYN--------------IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 N G +Y + PL+++ N + S + L+ T Sbjct: 243 GTNSEYFHGLQSYFIWDYGSSENWSSARDDCPSSSLQPLTSDKNTLISDIRNLSSGGGTG 302 Query: 277 TYPAMHHAYRELYNEK----------ESSHNTIGSTRLKKFVIFITDGENS--------- 317 + + L E N +KKF + +TDG+ + Sbjct: 303 GQTGVAWGWYTLSPNWTSLWPTDSDPEPYGNGTPDDDVKKFALIMTDGDFNAQYGKEERT 362 Query: 318 ------------------------GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 +C+ M+ ++I++V Sbjct: 363 TCTGRGRNRVCTTNEYWVERYHRYSDYNDPPATRARTLCDAMKAENIEIFTVFFDTGGSA 422 Query: 353 -GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 G DL+ C S ++ ++ EL+ +F I +IQ+ + Sbjct: 423 FGDDLMSYCASGSDYYYEADNKDELITAFSNIAKRIQQIYL 463 >gi|288956977|ref|YP_003447318.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] gi|288909285|dbj|BAI70774.1| hypothetical protein AZL_001360 [Azospirillum sp. B510] Length = 456 Score = 152 bits (383), Expect = 9e-35, Method: Composition-based stats. Identities = 57/439 (12%), Positives = 122/439 (27%), Gaps = 63/439 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M A+ V + AID A ++ +++ A DAA L+ + DQ Sbjct: 32 MVALSFLVLLGMLGVAIDFARAQFVSSRIYYAADAATLAVSRE-------NFQVSTNDQL 84 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + G + +++ P +P L Sbjct: 85 KALAQSYFDANFPP--------GTMGATTSLSVATSGTPPTVQGFTVTVTATLPLVFAPL 136 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLA----ISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + S + + + + +VLD S SM+ + Sbjct: 137 VETLGGPTIGSVGISKASGAVFTTQTSNQGGMELVIVLDNSASMKGSQEDLRGGVKALLD 196 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN--- 233 Y K + + K D++ G + N + K N Sbjct: 197 MLYGNADTRKNLYVGIVHYSGAVNVLQSALKNKADIVAPVVGGMANCPMATVNGKLNGSR 256 Query: 234 ------------LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 + I Y + LS N + + +T + Sbjct: 257 LSNAPPKTFKFDSTTDGVEIQYCGASTLGTSSALSPNRGDADKAIKSYVAGGDTLIGEGL 316 Query: 282 HHAYRELYNEKESSHNTIGS-----------TRLKKFVIFITDGEN-----SGASAYQNT 325 +R L NT +KK ++ +TDG N + + Y + Sbjct: 317 VWGWRMLTPSWRGLWNTKDQPGASLPLDYDLPYMKKVLVLMTDGVNHIAGRNYTAYYSDP 376 Query: 326 LNTLQ-----------ICEYM-RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 T+ IC ++ + +Y++ + + + C + + Sbjct: 377 YQTVADASKADADLMTICNAAKKDHNVVLYTITYGSD-TDEQQMSDCASDPSKHYHAALP 435 Query: 374 RELLESFDKITDKIQEQSV 392 ++L ++F ++ + + Sbjct: 436 QDLAKAFTQVGTDLTTMKL 454 >gi|312621090|ref|YP_003993818.1| protein tadg, associated with flp pilus assembly [Photobacterium damselae subsp. damselae] gi|311872811|emb|CBX86902.1| Protein TadG, associated with Flp pilus assembly [Photobacterium damselae subsp. damselae] Length = 436 Score = 152 bits (383), Expect = 1e-34, Method: Composition-based stats. Identities = 63/439 (14%), Positives = 123/439 (28%), Gaps = 66/439 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI+I V F T A D A + + +++ A +AA L+ A + + Sbjct: 14 LFAIMIPVLFGIFTLASDGARAIQTKARIEDATEAASLAIAAHNDPNVNSDGLGSGSKVN 73 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I +K ++ I I N + +YE+ Sbjct: 74 RRIATDYLKAYITDIDSISSLKIYRRNCEDIPECSSGLNKGKSRF---FEYEVEALTTQN 130 Query: 121 KGLIPSALTNLSLRSTGIIERS----SENLAISICMVLDVSRSMEDLYLQKH-------- 168 + + + + S ++ A+ + D S+SME+ + Sbjct: 131 SWFPGNNVISGFGDTFSTRGHSLARKYQSEAVDVVFAADFSKSMEEPWTGGRQKYKDLVR 190 Query: 169 ---------------NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 N + N + P ++ + S + D Sbjct: 191 VINDVTSELEKFNNINIADKKNQNTIGISPYNSNTYSKFDNYNSCFMKQDYFEKNSRDHR 250 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + ++ ++ EK N S + L+N+ + + K P Sbjct: 251 KKKYVDIKRTLNNIFIEKGNDS---CGFKSDDPDAVFHDIYLTNDFDTFNKEIMKFRPGN 307 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T +Y + + + L T ++ +I I+DG + +I Sbjct: 308 GTASYQGIIRSAQML----------RKGTNSRRLLIIISDGNDWYYPYSGYKETDKEIAN 357 Query: 334 YMRNAGM---------------------KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 + NAGM +I + L C F Sbjct: 358 KLVNAGMCNKIRETLNLDKTPSGQEIKTRIAVIGFDYDANKNKALLNCA-GEDNVFKAQY 416 Query: 373 SRE-LLESFDKITDKIQEQ 390 E L + IT++I Sbjct: 417 RDELLDQILSLITEEIGHL 435 >gi|110679843|ref|YP_682850.1| hypothetical protein RD1_2614 [Roseobacter denitrificans OCh 114] gi|109455959|gb|ABG32164.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 488 Score = 151 bits (382), Expect = 1e-34, Method: Composition-based stats. Identities = 57/483 (11%), Positives = 124/483 (25%), Gaps = 118/483 (24%) Query: 1 MTA-IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQ 59 + A +++ + L A+DL +R ++Q+ LD A+L+ + P + Sbjct: 33 IFATMMVLMMLLVCGIAVDLMQNEMMRTRVQNTLDRAILAAS-------DLDQPLPADEV 85 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 F K I + Sbjct: 86 VDDYFAKAGMTEFLDDVQITP----------------GAHLPTTNFRVVQAEARTRTPSI 129 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 + + + T + +++ VLD+S SM + + Sbjct: 130 YMAMTGVRSLPVYVAGTAEETIENTEISL----VLDISGSMRNNGKIGNLRTAAKDFIGA 185 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKI----------------------------- 210 +L + + ++ Sbjct: 186 VLEGNAANTTSLNIVPYAGQTNPGPIVFQRAGGRPFATFIEDSDGNEILYGQTFVDDEGN 245 Query: 211 ------DVLIESAGNLVNSIQKAI-------QEKKNLSVRIGTIAYNIGIVGNQCTPL-- 255 + + Q ++ I + G + + Sbjct: 246 SIDVPYNTMSSCLDLTNGDFDNIDLPSGGYDQTPYFMNWPIDAPTMDWGWCPQNKSSIRY 305 Query: 256 -SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES-----------------SHN 297 NN +++ ++ + ++ T T M + L Sbjct: 306 AQNNAGQLQDFIDDMRLHDGTGTQYGMKYGVALLNPSSRDTFVALNAAGLVPDGFKDRPA 365 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTL---------------------------Q 330 G+T +KF++ +TDG+ + ++ + Sbjct: 366 DFGTTDTRKFIVLMTDGQITDQFRPEDKNDPKNDEIALNQRIGDRDTYATQSTNVANFYS 425 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 IC + AG+ +Y++A AP +R C S F+ V E+ +F I +I E Sbjct: 426 ICNKAKAAGITVYTIAFEAPANAITQMRTCATSPAFFYKVEGV-EIKTAFKSIARQINEL 484 Query: 391 SVR 393 + Sbjct: 485 RLT 487 >gi|86749514|ref|YP_486010.1| hypothetical protein RPB_2394 [Rhodopseudomonas palustris HaA2] gi|86572542|gb|ABD07099.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 456 Score = 150 bits (378), Expect = 4e-34, Method: Composition-based stats. Identities = 65/432 (15%), Positives = 132/432 (30%), Gaps = 45/432 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIV--------------- 45 + AI + FI AID + R MQ+ALD+A L + Sbjct: 28 IFAIALLPMIGFIGAAIDYSRANKARTSMQAALDSAALMVSKDLASGVITAGQVSAKAQS 87 Query: 46 ------SDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 ++ + T T+ LK I ++ + I Sbjct: 88 YFASLYNNTEAPNITVTATYTAKDSTGSSTVLLKGTGDISTEFMNMFGFPTLGIGSAATA 147 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 A T ++ G +P+ + + + + I ++ Sbjct: 148 TWGGTRLRVAIALDVTGSMASAGKMPAMQSAAKTLVDNLRANAQTADDLYISII----PF 203 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + + + K N N + Y ++W T S + ++ + Sbjct: 204 AQMVNVGKSNKNASWIKWDYWEDTTGSCNWWWLTTKSSCESAGRTWSSTNQSQWGGCVTD 263 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP---LSNNLNEVKSRLNKLNPYENTN 276 + R Y+ S+N +K +++ L+P TN Sbjct: 264 RDQPADTTKDAPTTAATRFPAANYSACPEQILPMTSAYSSSNATTIKDKIDALSPNGGTN 323 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTR-LKKFVIFITDGENSGASAYQNTL--------- 326 MH A+ L + + + +I ++DG N+ Y N Sbjct: 324 QPIGMHWAWMSLQDGAPLNTPAKDADYKYTDAIILLSDGMNTIDRWYGNGSSWSKDVDAR 383 Query: 327 -----NTLQICEYMRNAGMKIYSVAVSAPPEGQ-DLLRKCTDSSGQFFAVNDSRELLESF 380 + ++ IY++ V+ + + ++L+ C DS G FFA + + +F Sbjct: 384 QKLLCDNIRAASAASTTKTVIYTIQVNTDGDPESEVLKYCADS-GNFFATTTASGISTAF 442 Query: 381 DKITDKIQEQSV 392 +I + + + Sbjct: 443 AQIGASLSKLRI 454 >gi|154250683|ref|YP_001411507.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] gi|154154633|gb|ABS61850.1| von Willebrand factor type A [Parvibaculum lavamentivorans DS-1] Length = 436 Score = 150 bits (378), Expect = 4e-34, Method: Composition-based stats. Identities = 59/408 (14%), Positives = 120/408 (29%), Gaps = 25/408 (6%) Query: 8 VCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQ 67 +D++ + +++++ALDA+ L+ + + + + + Sbjct: 29 PVVAAAGATVDISRAYIVESRLKAALDASALAVGGATGMTTSQMQAMAQSFFNANYPASK 88 Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDK-NNPLQYIAESKAQYE---------IPTEN 117 + N ++ AQ+ T + +Q + +N Sbjct: 89 LGVPGTLSVSQSGNVVSLSVHAQLPTTLMGVVGINTLNVSATSQVTRMGKKLEVALVLDN 148 Query: 118 LFLKGLIPSALT-------NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 ++ S +AI V + + H D Sbjct: 149 TGSMASGGRMTVLKTAAKNLITTVSAAATNPGDVKVAIVPFNVDVNIGTTNENVSWLHWD 208 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + + + + + N+ Sbjct: 209 EFTPSGGGGNGNGNCNIIQILLGLCNNNNNSNSHAGWEGCVMDRDQNYDAQNTFPPPNPG 268 Query: 231 KKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 N + + N PLS N + + S ++ + NTNT + + L Sbjct: 269 GSNATRYPASNSDSDNSNCNLQTIMPLSTNWSALNSHIDAMASAGNTNTTIGLAWGWNML 328 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN----TLNTLQICEYMRNAGMKIYS 344 S + L K ++F+TDG+N+ N T IC ++ AG+K+YS Sbjct: 329 TQGGPLSSAAAPAANLDKVIVFLTDGDNTRNRWSNNSNTINARTTLICNNIKAAGIKVYS 388 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 V V L+R C G +++V + EL F I + + Sbjct: 389 VRVIEG--NATLIRNCATEPGMYYSVTTASELTSVFASIAQSLSNLRI 434 >gi|254506100|ref|ZP_05118244.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus 16] gi|219550918|gb|EED27899.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus 16] Length = 415 Score = 150 bits (377), Expect = 5e-34, Method: Composition-based stats. Identities = 65/416 (15%), Positives = 129/416 (31%), Gaps = 41/416 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+II F T A D A + + +++ A + AVL+ A ++ + + Sbjct: 14 LFAMIIPGLFGIFTLATDGARALQTKARIEDASEIAVLAIAAHNDDNQDSQGAGSGSRVN 73 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I + +L+ + + QI L QYEI ++ Sbjct: 74 RQIATDYLNAYLRDSTQLTGLKVKKYNCDQIAEC---RAGLARGEPRFFQYEIEVSSVQD 130 Query: 121 KGLIPSALTNLSLRST----GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + + + R ++ A+ I V D S SM + N Sbjct: 131 TWFPGNDSIEGFGDTFSAKGAAVARKYQSEAVDIIFVSDYSGSMAWNWSGGRNRKYIDLR 190 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPA-PANRKIDVLIESAGNLVNSIQKAIQEKK-NL 234 N + F N T + A K S + + + Sbjct: 191 NIIQEVTDELQKFNDLNNTDNNTVGLTAFNYYTKTVPSNRSNHCFMTQLVNPNGRFSASQ 250 Query: 235 SVRIGTIAYNIGIVGN-------QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 +VR + N N Q PL++N + + + P T ++ + + Sbjct: 251 TVRNIFVEKNNRYCVNHGDSSRFQDLPLTDNYSSFNNSVRSFYPNHGTASFQGIIRGAQM 310 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA--------- 338 L ++ +I ++DGE+ S + +N +C + N Sbjct: 311 L----------RKGRNPRRLLIVLSDGEDGDPSRHMQLVNA-GMCSTIVNTLSGDLTPDG 359 Query: 339 ---GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE-LLESFDKITDKIQEQ 390 ++ V L+KC + + + + L + + IT++I Sbjct: 360 HKVKARLAVVGFDYDVNKNRALQKCVGAE-NVYKAQNRDDILNKILELITEEIGHL 414 >gi|83859217|ref|ZP_00952738.1| hypothetical protein OA2633_12470 [Oceanicaulis alexandrii HTCC2633] gi|83852664|gb|EAP90517.1| hypothetical protein OA2633_12470 [Oceanicaulis alexandrii HTCC2633] Length = 436 Score = 150 bits (377), Expect = 5e-34, Method: Composition-based stats. Identities = 67/442 (15%), Positives = 134/442 (30%), Gaps = 77/442 (17%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ V + A+D + + +++QSALD+ L+ + T+ Sbjct: 22 IMALCSGVLVTAVGGALDYSRSTTVSSELQSALDSGALAAASL-----------TQDRNP 70 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + ++ L + + + + + A +PT L Sbjct: 71 EDVVRAYVEAALADHPQLLASLQLDVVA--------DISLNSRVVNATASVAMPTTML-- 120 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 GL+ L S I + +++ VLDVS SM + D + + + Sbjct: 121 -GLVGINTLTLEHASEAIEQVRDVEISL----VLDVSGSMGGSKINALQD-AAIEFVEIV 174 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 L + + P N+ I + + +++ Sbjct: 175 LAADAAERTSISVIPYNGGVRTPREVNQDIVSGNNNHRRQSGCVDMGTDYPVEMTLPYRE 234 Query: 241 IAYNIGIVGNQCTP---------------LSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 + + Q LS N ++ +N L NT A Sbjct: 235 MEFTEYYGSEQTGNSSSAFCPRSNMESEFLSQNEGRMRGLINSLRAEGNTGLDVATMWGA 294 Query: 286 RELYNEKES--------SHNTIGSTRLKKFVIFITDGENSG------------------- 318 R L + K ++ +TDGE + Sbjct: 295 RALDPAWRGNLGGSFSDRPASYDDRDTIKILVVMTDGEATAQIRSEEYTYYDWWGRERTG 354 Query: 319 ------ASAYQNTLNTLQICEYMRNAGMKIYSVAVS-APPEGQDLLRKCTDSSGQFFAVN 371 SA Q N + C+ G++IY++A + +DL+R C + ++ V Sbjct: 355 TRSYELYSARQARENMAEACDIAEGNGVQIYTIAFQLSGQTNRDLMRNCANKPQNYYQVE 414 Query: 372 DSRELLESFDKITDKIQEQSVR 393 + ++ E+F I I + Sbjct: 415 NL-DIAEAFSSIAADINRLRLT 435 >gi|294139879|ref|YP_003555857.1| hypothetical protein SVI_1108 [Shewanella violacea DSS12] gi|293326348|dbj|BAJ01079.1| hypothetical protein [Shewanella violacea DSS12] Length = 405 Score = 149 bits (376), Expect = 6e-34, Method: Composition-based stats. Identities = 66/401 (16%), Positives = 126/401 (31%), Gaps = 49/401 (12%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M I + I +I LA + + A DAA L+ S +D+ Sbjct: 17 MFVICLPFILTMIAVSILLAMYLLTVTRAGQASDAASLACGYSQRADQD----------- 65 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 L G G + + ++ D N + + Sbjct: 66 -----------LLVGILDYYRPGFVVHDGEALVSIDGKN------RCSIEATYRFNPTMM 108 Query: 121 KGLIPSALTNLSLRST----GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 L SA T++SL S + +S L + + +VLD+S SM Q N Sbjct: 109 ALLPESARTHVSLSSDTGATSHLVINSTPLPMDLALVLDISSSMSAQLPQLKLIINGALE 168 Query: 177 NKYLLPPPP-KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK--- 232 P +S ++ AP K + L + Sbjct: 169 EIRQQDPNEVGGVRFSLVPFETGVGVLNAPWMPKSAAKVTCVDGLSYGQHSVDYARTVDD 228 Query: 233 --------NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 N+ + + + PL+ +LN VK R++ L T++Y + Sbjct: 229 LAEPAANLNIKSVFASQWLDACSMDATILPLTQDLNLVKQRVDALVTSGTTSSYQGLIWG 288 Query: 285 YRELYNEKESSHNTIGSTRLK--KFVIFITDGENSGASAYQNTLNTLQICEYMRNA-GMK 341 R L + + + ++ TDG + G + + L +C +++ ++ Sbjct: 289 VRTLLPQWQEEWQIPPVESPALIQRLVLFTDGADQG--FHLDDLIEQGLCRVIQDKHHIE 346 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + R+C G+ + +++EL F + Sbjct: 347 MSFIGFGVSDRRLQQFRECAGDKGKVYDAQNTQELEAFFRE 387 >gi|307943467|ref|ZP_07658811.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] gi|307773097|gb|EFO32314.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] Length = 466 Score = 149 bits (375), Expect = 9e-34, Method: Composition-based stats. Identities = 61/463 (13%), Positives = 137/463 (29%), Gaps = 93/463 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A + + + A+D + R+++ +A+DAA L+ + + ++ Q Sbjct: 24 LVAGVCLILLVVAGSAVDYGRALGYRHKIANAVDAAALTVAKQLSTTVLTEN------QI 77 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 T K + +L + ++ K P + + + +I T + L Sbjct: 78 RTGLKNAFRANLNAAGINSQGIDNLDFKVD---------PGEGTLDVWSSVDIQTNFIKL 128 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 G+ P L + + + + +VLDV+ SM + + L Sbjct: 129 GGIGPE-----KLEVGAASQVNYSRFDVELALVLDVTGSMRPDMNA--LKEASKSIVNIL 181 Query: 181 LPPPPK----KSFWSKNTTKSKYA--PAPAPANRKIDVLIESAGNLVNSIQKAIQ----- 229 LP K S + Sbjct: 182 LPDDSNSRESKVRISLVPYSQGVNLGSYATRVTNGGSTWRNCVNERSGPQKFTDAPYNYA 241 Query: 230 ----EKKNLSVRIGTIAYNIGIVGNQCT---------PLSNNLNEVKSRLNKLNPYENTN 276 + + + Y PL+ + ++ ++ L T Sbjct: 242 GSRSDFFHGKPKQFVWDYGWTEQWQTRPEACPKTAVEPLTADRTKLLRAISGLKDGGGTG 301 Query: 277 TYPAMHHAYRELYNEKESSHNTIGS----------TRLKKFVIFITDGENSGASAYQNT- 325 + + L + ++ + KKF + +TDG+ + A + Sbjct: 302 GQTGIAWGWYTLSPKWKNLWPRDSAPATYGTGSHTDDTKKFALIMTDGDFNAAYGWDCGC 361 Query: 326 ----------------------------------LNTLQICEYMRNAGMKIYSVAVSAPP 351 ++C+ M++ ++I++V Sbjct: 362 RKIRDKPLYCRKKSNKKSWIERYFSPSKISHAPAQRAKKLCDEMKSKNIEIFTVYFDTGG 421 Query: 352 E--GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 G DL+ C S ++ ++S EL+++F I ++IQ + Sbjct: 422 ATFGDDLMSYCASGSRNYYRADNSNELIQAFSNIANEIQSIYI 464 >gi|39936212|ref|NP_948488.1| hypothetical protein RPA3149 [Rhodopseudomonas palustris CGA009] gi|39650067|emb|CAE28590.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 455 Score = 148 bits (374), Expect = 1e-33, Method: Composition-based stats. Identities = 63/429 (14%), Positives = 126/429 (29%), Gaps = 40/429 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + FI A+D + R +Q+ALD+A L + D + K Sbjct: 28 IFALALVPLLGFIGVAVDYSRANNARTSLQNALDSAALMLSRDLGVGTITPDQVSSK--A 85 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA--------------E 106 T F + I +Q Sbjct: 86 QTYFNSLYTNKETGAVTVTATYTAKDGSGSSTIAMSGQGAVQTQFMKILGFQTMAIGSST 145 Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD-----VSRSME 161 + + S + + + ++ + + +D V + Sbjct: 146 TTTWGGTRLRVAMALDVTGSMASAGKMSAMKTAAKNLVDSLRASAQTVDDVYISVVPFAQ 205 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 + + N N + S W +T S + + Sbjct: 206 MVNVGSSNRNASWVRWDLWDESNGSCSSWWYSTKSSCEYAGRTWTATSHNQWAGCVTDRD 265 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN---NLNEVKSRLNKLNPYENTNTY 278 + + R + Y+ + N +K++++ L+P TN Sbjct: 266 QPADTTKDVPTSYATRFPAVDYDACPQQLLGMTSAYSLSNATTIKNKIDALSPNGGTNQA 325 Query: 279 PAMHHAYRELYNEKESSHNTIGSTR-LKKFVIFITDGENSGASAYQNTLN--------TL 329 MH A+ L + S +I ++DG N+ Y N + Sbjct: 326 IGMHWAWMSLRTGDPLNTPAKDSNYKYTDAIILLSDGLNTVDRWYGNGRDWSPQVDARQR 385 Query: 330 QICEYMRN-----AGMKIYSVAVSAPPEGQ-DLLRKCTDSSGQFFAVNDSRELLESFDKI 383 +C+ +R + IY++ V+ + + +L+ C DS G FFA S + +F +I Sbjct: 386 ILCDNIRASATNTNPVVIYTIQVNTDGDPESAVLKYCADS-GNFFATTTSSGIGTAFAQI 444 Query: 384 TDKIQEQSV 392 + + V Sbjct: 445 GSSLSKLRV 453 >gi|315122199|ref|YP_004062688.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495601|gb|ADR52200.1| hypothetical protein CKC_02245 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 463 Score = 148 bits (372), Expect = 2e-33, Method: Composition-based stats. Identities = 78/444 (17%), Positives = 156/444 (35%), Gaps = 57/444 (12%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 ++A+++ V F+ I IDL Y N + A++ A LS +++ K Sbjct: 25 ISALLLPVIFMVIGLLIDLVRWGYYHNSLVQAVNTAALSASVQLLNSVEDKSKEKALSSV 84 Query: 61 --STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN- 117 K+ + +LK Y D + Q N +I + Y +P Sbjct: 85 LGENNIKQYLLNNLKISLYNNFGEMDSQRIIQHTKVNIYNRKGTHIINVYSHYNLPLNPF 144 Query: 118 -LFLKGLIPSALTNLSLRSTGIIE---RSSENLAISICMVLDVSRSMED-----LYLQKH 168 LF LI ++ + + +S+ ++D S SM + K Sbjct: 145 SLFFMNLINIKSWPITTVGEAEVTSKKNYHKEEGVSVQWLIDDSGSMGSIIDRACFGSKQ 204 Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA-------------------------- 202 + +K + + Sbjct: 205 LKSQYNVGSKIGIVRNENADTSDSFYPIVGELVSCDRSLYYVLNDKKILEDDDLEEKNLD 264 Query: 203 --PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 RK ++ ++ + ++K K L + Y + P++ + Sbjct: 265 NHSQYYIRKRYLVRDALATFIKRVRKIDNLKDKLR---MSFMYFNE-RIDHYFPMTWGIK 320 Query: 261 EVKSRLNK----LNPYENTNTYPAMHHAYRELYNEK-ESSHNTIGSTRLKKFVIFITDGE 315 E K ++ + T+ +P + AY +L+++ + H S +KKF++ +TDG Sbjct: 321 EFKQEVSSHYKRKHENTATDIHPILQEAYNKLHSKNEDDEHKKKNSVEVKKFIVLLTDGA 380 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----GQDLLRKCTDSSGQFFAVN 371 + +++ L+IC+ + G+KI++++ S D L +C +FF Sbjct: 381 QNEGVHSVDSV--LKICDAAKEEGIKIFTISYSVDSSERKKANDFLSRCAS-PDKFFEAY 437 Query: 372 DSRELLESFDK-ITDKIQEQSVRI 394 D+ +L F + I D I E+ V+I Sbjct: 438 DADKLNMIFKEHIGDAIFERLVKI 461 >gi|192291928|ref|YP_001992533.1| hypothetical protein Rpal_3558 [Rhodopseudomonas palustris TIE-1] gi|192285677|gb|ACF02058.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 455 Score = 148 bits (372), Expect = 2e-33, Method: Composition-based stats. Identities = 62/430 (14%), Positives = 128/430 (29%), Gaps = 42/430 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + FI A+D + R +Q+ALD+A L + D + K Sbjct: 28 IFALALVPLLGFIGVAVDYSRANNARTSLQNALDSAALMLSRDLGVGTITPDQVSSK--A 85 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA--------------- 105 T F + I +Q Sbjct: 86 QTYFNSLYTNKETGAVTVTATYTAKDGSGSSTIAMSGQGAVQTQFMKILGFQTMAIGSST 145 Query: 106 -----ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM 160 ++ + + + + + +++ + + + + V Sbjct: 146 TTTWGGTRLRVAMALDVTGSMASAGKMS-AMKTAAKNLVDSLRASAQTADDVYISVVPFA 204 Query: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL 220 + + + N N N S W +T S + + Sbjct: 205 QMVNVGSSNRNANWVRWDLWDESNGSCSSWWYSTKSSCEYAGRTWTATSHNQWAGCVTDR 264 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN---NLNEVKSRLNKLNPYENTNT 277 + + R + Y+ + N +K++++ L+P TN Sbjct: 265 DQPADTTKDVPTSYATRFPAVDYDACPQQLLGMTSAYSLSNATTIKNKIDALSPNGGTNQ 324 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTR-LKKFVIFITDGENSGASAYQNTLN--------T 328 MH A+ L + S +I ++DG N+ Y N + Sbjct: 325 AIGMHWAWMSLRTGDPLNTPAKDSNYKYTDAIILLSDGLNTVDRWYGNGRDWSPQVDARQ 384 Query: 329 LQICEYMRN-----AGMKIYSVAVSAPPEGQ-DLLRKCTDSSGQFFAVNDSRELLESFDK 382 +C+ +R + IY++ V+ + + +L+ C DS G FFA S + +F + Sbjct: 385 RILCDNIRASATNTNPVVIYTIQVNTDGDPESTVLKYCADS-GNFFATTTSSGIGTAFAQ 443 Query: 383 ITDKIQEQSV 392 I + + V Sbjct: 444 IGSSLSKLRV 453 >gi|209809179|ref|YP_002264717.1| membrane associated secretion system protein [Aliivibrio salmonicida LFI1238] gi|208010741|emb|CAQ81132.1| membrane associated secretion system protein [Aliivibrio salmonicida LFI1238] Length = 422 Score = 147 bits (371), Expect = 2e-33, Method: Composition-based stats. Identities = 59/424 (13%), Positives = 130/424 (30%), Gaps = 50/424 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++I F T A D A + + +++ A + A L+ A ++ + Sbjct: 14 LFAMMIPALFGIFTLASDGARAIQTKARIEDAAEVATLAVSAHNDPNQDYGGGGSPSSAN 73 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I I ++ I E +I K L ++E+ Sbjct: 74 QQIVTDYINAYISDVDSINEIKVYKRNCEEIPECK---AGLAVGEPRYFEHEVGVTTSQK 130 Query: 121 KGLIPSALTNLSLRSTGIIERS----SENLAISICMVLDVSRSMEDLYLQ---------K 167 + S S ++ A+ + D S SM D + Sbjct: 131 SWFPGNDAIVGMGDSFSTSGHSLARKYQSEAVDVMFAADFSGSMGDRWTGGNKKYEDLID 190 Query: 168 HNDNNNMTSNKYLLPPPPKKSF------WSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 D+ + K+ +++ T + + + Sbjct: 191 IIDSISKELQKFNDLEHNDNDNTMGITAYNEYTYSQYSGSSGGWWGDDCYLSQAESDGFW 250 Query: 222 N--SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 SI K I N + N PL++N + V +++ P T++Y Sbjct: 251 GGVSISKTIDGLWNEKSKDHCNNSYNSGRFNDI-PLTSNFDVVNQDVSRFWPEGGTSSYQ 309 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC------- 332 A+ + L T ++ +I ++DG ++ + + +N +C Sbjct: 310 ALIRGAQLLTYG----------TNSRRLLIVLSDGMDTDNNLTSSLVNA-GMCRDIQQGL 358 Query: 333 ---EYMRNAGMK--IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE-LLESFDKITDK 386 + + N ++ + + P L+ C + + +S + L + I+++ Sbjct: 359 ESDKTLDNRPIRAQMAVIGFDYEPSENQALKDCVGAE-NVYKAENSDDILNTILELISEE 417 Query: 387 IQEQ 390 I Sbjct: 418 IGHL 421 >gi|91977525|ref|YP_570184.1| hypothetical protein RPD_3057 [Rhodopseudomonas palustris BisB5] gi|91683981|gb|ABE40283.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 464 Score = 146 bits (369), Expect = 4e-33, Method: Composition-based stats. Identities = 50/438 (11%), Positives = 124/438 (28%), Gaps = 49/438 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + FI AID + R MQ+ALD+ L + +D+ + ++ Sbjct: 28 IFALTLLPILGFIGAAIDYSRASRARTAMQAALDSTALMVSKDLGADKIKT--SEVSEKA 85 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY----------------- 103 T F +G + N + + + + Sbjct: 86 QTYFNSLYTGTEARGVTLTTNYTAKDDSGSSTVVVNGDGAVSTHFMKMFGFPSLAIGSAA 145 Query: 104 ---IAESKAQYEIPTENLFLKGLIPSALT-NLSLRSTGIIERSSENLAISICMVLDVSRS 159 ++ + + + L S + ++ +++ + + + V Sbjct: 146 TATWGGTRLRVAMALDVTGSMVLNGSTKLAEMKKAASALVDTLRASAQSKDDLYISVVPF 205 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + + + N + + S T + Sbjct: 206 AQMVNVGSSNIDASWIKWDVWDETEGSCSKSKFKTKTDCEDNGRTWTVTDRSKWKGCVTD 265 Query: 220 LVNSIQKAIQEKKNLSVRIGT--IAYNIGIVGNQCTPLSN-----NLNEVKSRLNKLNPY 272 + R Q P+++ + ++K ++ L Sbjct: 266 RDQPADTTKDAPTSDDTRFPALRTLLGTTSCPAQIFPMTSAYAATDAQKIKDVIDDLVAD 325 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIG-STRLKKFVIFITDGENSGASAYQN------- 324 TN M A+ L + + + +I ++DG N+ Sbjct: 326 GGTNQPIGMAWAWMSLQQGNPLNTPAKDPNYKYTDAIILLSDGLNTMDRWPDYGDGQRQF 385 Query: 325 ----TLNTLQICEYMR---NAGMK--IYSVAVSA-PPEGQDLLRKCTDSSGQFFAVNDSR 374 +C+ ++ + G + +Y++ V+ +L+ C G FFA + Sbjct: 386 DGKIDARQKLLCDNIKLPDSNGKRPVVYTIQVNTTGDPESTILKYCA-DGGNFFATTTAS 444 Query: 375 ELLESFDKITDKIQEQSV 392 + +F +I + + + Sbjct: 445 GIGTAFAQIGSSLSKLRI 462 >gi|329850249|ref|ZP_08265094.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328840564|gb|EGF90135.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 412 Score = 145 bits (366), Expect = 9e-33, Method: Composition-based stats. Identities = 64/407 (15%), Positives = 125/407 (30%), Gaps = 43/407 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + V F F+ AID + + Y R ++Q A D+AVL A + + K Sbjct: 32 IFALSVFVIFGFVGAAIDFSRVDYARRRLQDAADSAVLRAMALKSATDESRGVAADKAFA 91 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 G Y A I Y + + Sbjct: 92 ENFG--------HPGVYDLNGALKREVNENII-------SQTYTVHATVSS-------YF 129 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK-Y 179 +++ S I VLD + SM + + ++ ++ Sbjct: 130 GAFFGKDSYPVTVVSQAKTSL----DVFEIAFVLDTTGSMAEANKMPNLKSSVDSAMAGL 185 Query: 180 LLPPPPKKSFWSKNTTKSKYAP---APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 L + A + + + + + Sbjct: 186 LQNGKNLSGSKIAVVPFNTQVRLSDATVTTMSSQGLSSGWGNCVHDRDLATSHDVSASAA 245 Query: 237 RI------GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + + LS+N++ ++ + L P TN + L Sbjct: 246 QKGKAQTLYPLETCDEASLKPVQGLSDNISSARNFIKTLQPGGYTNVTMGVQWGMEVLSP 305 Query: 291 EKESS-HNTIGSTRLKKFVIFITDGENSGASAYQN----TLNTLQICEYMRNAGMKIYSV 345 + S GST+ +KF+I +TDG+N+ + + T CE + G+ +Y+V Sbjct: 306 NQPFSDATEFGSTKARKFMIVVTDGDNTKSFTSWSASVIDKRTALACENAKAKGITVYTV 365 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + +LRKC + F+ + + +L + I I + + Sbjct: 366 KIIQGNSN--MLRKCASAPEYFYDLTSANQLNAAMSGIFKSINKTRL 410 >gi|317154611|ref|YP_004122659.1| von Willebrand factor type A [Desulfovibrio aespoeensis Aspo-2] gi|316944862|gb|ADU63913.1| von Willebrand factor type A [Desulfovibrio aespoeensis Aspo-2] Length = 395 Score = 145 bits (366), Expect = 1e-32, Method: Composition-based stats. Identities = 47/426 (11%), Positives = 121/426 (28%), Gaps = 73/426 (17%) Query: 8 VCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQ 67 + A+D+ ++ ++Q+A+DA L+G + D + K Sbjct: 1 MLLAVAGLAVDMGNMYVTHTRLQAAVDAGALAGSLELPYDPDL-----SKGIVQQAVSDM 55 Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA 127 I ++ + A+ ++ NL + G + A Sbjct: 56 IHTNMPDAV----------------VESVSPGTEVRSVVVTAKAKV---NLLVMGFLNLA 96 Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 + + + + I V+D S SM+ + + + ++ + Sbjct: 97 DQWVEAGAAAGFNK------LEIVFVIDNSGSMKGTPINLVKEASIGLTDLLIPDGQQPD 150 Query: 188 SFWSKNTTKSKY--APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + + K S ++ ++ + R + Sbjct: 151 TKVGLVAFRGKVRLGGDVDGLEAGCRNADGSVNTGIHEDFMSMYWALSSYYRNQI-DLDT 209 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYE---NTNTYPAMHHAYRELYNEKESSHNTIGST 302 + PLS + ++ +N T + A L E Sbjct: 210 CSSIPESRPLSQDKGDIVEGINSQTALGSASGTVISEGIKWARHMLTPEAP-YTQAGDKK 268 Query: 303 RLKKFVIFITDGE----------------NSGASAYQN---------------TLNTLQI 331 +K +I +TDG+ N+ + + L Sbjct: 269 DFRKIMIVLTDGDTEDGECGGSYRASFRPNNYWTNAYYGMGVDTAHCQDGGVLNQDMLAE 328 Query: 332 CEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTD----SSGQFFAVNDSRELLESFDKITDK 386 + ++ G++I+++ L+++ ++ +F ++ + F KI + Sbjct: 329 AQLAKDEGIEIFAIRFGVSDNTDISLMKQIASSKAGTNDHYFDAPSVYDIPDVFKKIGKQ 388 Query: 387 IQEQSV 392 + + + Sbjct: 389 LGWRLL 394 >gi|323495646|ref|ZP_08100717.1| membrane associated secretion system protein [Vibrio sinaloensis DSM 21326] gi|323319281|gb|EGA72221.1| membrane associated secretion system protein [Vibrio sinaloensis DSM 21326] Length = 419 Score = 145 bits (365), Expect = 1e-32, Method: Composition-based stats. Identities = 61/420 (14%), Positives = 128/420 (30%), Gaps = 45/420 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+II F T A D A + + +++ A + AVL+ A ++ + + Sbjct: 14 LFAMIIPGLFGLFTLASDGARAIQTKARIEDASEIAVLAIAAHNDDNKNSQGSGSGSAVN 73 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I ++ +L + QI L QYE+ + + Sbjct: 74 RKIATDYLEAYLHDVDSVNNLKIHKYNCDQIPECV---AGLARGEPRFFQYEVEATSRHV 130 Query: 121 KGLIPSALTNLSLRST----GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + ++ R ++ A+ I V D S SM + N Sbjct: 131 SWFPGDSSIPGFGKTFDAKGAATARKYQSEAVDILFVADYSGSMAGGWNGGSNRKYIDLR 190 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPA-PANRKIDVLIESAGNLVNSI----------Q 225 N + + F N T + K S + + Sbjct: 191 NIIKVVTDELQKFNDLNNTDNNTVGMTGFNYYTKTKPTNRSNSCFMTQLVYNNNYNINYT 250 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 K + N ++ + + L++N + + +N P T +Y + Sbjct: 251 KTVNNIFNEKNNKYCVS-HSDSSRFRDIDLTDNYSSFNTTVNGFYPNHGTASYQGIMRGA 309 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGEN--SGASAYQNTLNTLQICEYMRNA---GM 340 + L T ++ +I ++DG++ + L +C ++ G+ Sbjct: 310 QML----------KKGTNPRRLLIVLSDGDDSGTSQKNIHKQLVNAGMCTKIKQELSTGI 359 Query: 341 ---------KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE-LLESFDKITDKIQEQ 390 ++ V LR C + F ++ + L + + IT++I Sbjct: 360 SSSGQSIKARLAVVGFDYNVNNNTALRDCAGAE-NVFKAQNTDDILNKILELITEEIGHL 418 >gi|163731887|ref|ZP_02139334.1| hypothetical protein RLO149_21324 [Roseobacter litoralis Och 149] gi|161395341|gb|EDQ19663.1| hypothetical protein RLO149_21324 [Roseobacter litoralis Och 149] Length = 468 Score = 145 bits (365), Expect = 1e-32, Method: Composition-based stats. Identities = 62/483 (12%), Positives = 133/483 (27%), Gaps = 118/483 (24%) Query: 1 MTA-IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQ 59 + A I++ + L A+DL +R ++Q+ LD A+L+ + P + Sbjct: 13 IFATIMVLMMLLVCGIAVDLMQNEMMRTRVQNTLDRAILAAS-------DLDQPLPADEV 65 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 F K +I D I +++A+ Sbjct: 66 VDDYFAKAGMTEFLNDV-------------RITPGSDLPTTNFRIVQAEAR---TRTPSI 109 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 + + + T I I +VLD+S SM + + Sbjct: 110 YMAMTGVRTLPVYVSGTAEETIE----KIEISLVLDISGSMRNNGKIGNLRTAAKDFIGA 165 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKI----------------------------- 210 +L K+ + + Sbjct: 166 VLEGNAAKTTSLNIVPYAGQTNPGRIVFERAGGLPFATFIEDSNGDEILYGQTIVDDEGN 225 Query: 211 ------DVLIESAGNLVNSIQKAI-------QEKKNLSVRIGTIAYNIGIVGNQCTPL-- 255 + + + Q ++ I + G + + Sbjct: 226 SIDVPYNTMSSCLDLTNSDFDNIDLPSGGYDQTPYFMNWPIDAPTMDWGWCPQNNSSIRY 285 Query: 256 -SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES-----------------SHN 297 N+ ++ ++ + ++ T T M + L + Sbjct: 286 AQNDAGRLQDFIDDMRLHDGTGTQYGMKYGVALLNPSSRNTFLALNAAGLVPDGFKNRPA 345 Query: 298 TIGSTRLKKFVIFITDGE---------------------------NSGASAYQNTLNTLQ 330 G+T +KF++ +TDG+ ++ ++ N N Sbjct: 346 DFGTTDTRKFIVLMTDGQITDQFRPEDKNDPKNDEIALNQRTGDRDTYSTQSTNVTNFYS 405 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +C + G+ +Y++A AP + +R C S F+ V ++ +F I +I E Sbjct: 406 VCNKAKAEGITVYTIAFEAPADAVTQMRTCATSPAFFYKVEGV-QIKTAFKSIARQINEL 464 Query: 391 SVR 393 + Sbjct: 465 RLT 467 >gi|116253849|ref|YP_769687.1| hypothetical protein RL4112 [Rhizobium leguminosarum bv. viciae 3841] gi|115258497|emb|CAK09601.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 398 Score = 145 bits (364), Expect = 2e-32, Method: Composition-based stats. Identities = 72/411 (17%), Positives = 140/411 (34%), Gaps = 53/411 (12%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTAI++ V AID +++ + ++Q A A + + Sbjct: 24 MTAIMMPVLLGAAGLAIDYSNMALSKRELQEAT-----DSAALAAATALASGAASTTADA 78 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I K + + + A + Y Y I Sbjct: 79 EAIAKDFVSGQMANYVDTDAISSIKAGTSVDIDVSATATSKSYKVTVATSYGIAATP--F 136 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ N+ ++ S A+S+ +VLD S SM + Sbjct: 137 MSVLGYKTLNIGASTSTSSGTSDTKTALSMELVLDQSGSMGEKTTTCA------------ 184 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + KID L ++A L +++ A + VR G Sbjct: 185 ----------------TYNGKNCKTYVTKIDALKKAADALFDALDTADPDHS--LVRTGA 226 Query: 241 IAYNIGIVGN-------QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 +YN G++ N + ++ + ++ + T+ M A + + Sbjct: 227 YSYNNGLIYNSQKTQIKSMSGMAWGTATTATYVSGITASGGTDATEPMRQATLSIAKASD 286 Query: 294 S------SHNTIGSTRLKKFVIFITDGENSGASAYQNTL---NTLQICEYMRNAGMKIYS 344 +H G+T + +++I +TDGE +G + + N C+ + AG+KI++ Sbjct: 287 GSDVETQAHAVKGNTIVSRYIILMTDGEMTGNTGVWQSSFDQNVRNQCDATKTAGIKIFT 346 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 VA AP +G+ LL+ C G ++ +L+ SF I + + + Sbjct: 347 VAFMAPDKGKQLLQYCASPGGNYYEAETMEKLVASFTSIAKEATKAVTLLT 397 >gi|149909171|ref|ZP_01897828.1| hypothetical protein PE36_09171 [Moritella sp. PE36] gi|149807695|gb|EDM67641.1| hypothetical protein PE36_09171 [Moritella sp. PE36] Length = 402 Score = 145 bits (364), Expect = 2e-32, Method: Composition-based stats. Identities = 53/408 (12%), Positives = 126/408 (30%), Gaps = 39/408 (9%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 ++ + + A + + A D+AVL+ + + Sbjct: 15 FTFMLPAIVSLLAITVFFAMYSQVVIRAGQAADSAVLACAYQQNDTGVVTEG-------- 66 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + + + QI+ + + + Sbjct: 67 --ILDYYRPNFVLPELNKSVKLNSNNGCQISAQYRFEPAMVNALPVAIDSDTEVVSNS-- 122 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 S + + + + +VLD+S SM + +T + Sbjct: 123 --------QSSAKLVQNVNVNGIQNPVDFSLVLDISGSMTWHLPELKK---IITDVISDI 171 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK--------- 232 P + +S ++ + AP + + LV +K Sbjct: 172 VPSSNQVRFSIVPFQTGVGVSGAPWLLSSEASPKCVDGLVYRNGNLDADKTVQSLNYSSD 231 Query: 233 --NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELY 289 + + + + PL+NNLN V + L+ T +Y R L Sbjct: 232 RLDFNEVTPGRWLDRCSETSFILPLTNNLNRVIRYVESLDTSGGSTASYQGFIWGVRTLT 291 Query: 290 NEKESSHNTIG--STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA-GMKIYSVA 346 ++ + S+ L + +I TDG+++ + + ++ +C+ ++ +++ + Sbjct: 292 DQWQKEWQVTPVQSSSLTQRLILFTDGDDNRRDYFNDLMSA-GLCDVIQQDLNIQVSFIG 350 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + ++C +G F N++ EL + F+ + E VRI Sbjct: 351 FGVSADRIKQFKQCAGRNGSVFDANNTAELADYFEDAININIETKVRI 398 >gi|300023811|ref|YP_003756422.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299525632|gb|ADJ24101.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 466 Score = 144 bits (362), Expect = 3e-32, Method: Composition-based stats. Identities = 69/437 (15%), Positives = 136/437 (31%), Gaps = 60/437 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + ++ V F I A+D + R+Q +A DAAVL+G ++ ++ D Sbjct: 43 LFGLMALVLFAMIGLAVDYGRFVNARSQTIAATDAAVLAGARALQTNG--GDQAAALRVA 100 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + K L + +N + A P L Sbjct: 101 QSYYAQATKNRLSLSNDTINF-------------AIADNATAMVTTGNAVITTPFMGLAG 147 Query: 121 KGLIPSALTNLSLRSTGIIER-SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 G +P + S S ++ + L + I M+LD++ SM L + N Sbjct: 148 TGSLPILRKDGSDYSKAVLAVGGNAELNLEIAMMLDITGSMRGQKLTDMKAAASDLLNIV 207 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE----SAGNLVNSIQKAIQEKKNLS 235 + K + + PA A +K + + Sbjct: 208 VWTDQSKFTSKVAIVPFAYDVRLPAAAFKKATGTTSTNYPCVVERTGTEKYTDAAPATGK 267 Query: 236 VRIGTIA------------YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + PL+++ + + +++N L+ +T + Sbjct: 268 YVMVHNTSSTKKNKTTYSPTCDVASSAEVLPLTSDKSTLLAKVNGLSTAGSTAGHIGTAW 327 Query: 284 AYRELYNEK-------ESSHNTIGSTRLKKFVIFITDGE--------------------N 316 A+ L S+ + L+K + +TDGE Sbjct: 328 AWYMLAPNWSSLWTSASSTPAAYNADNLRKIAVLMTDGEYNTQYTTNGVPDDSSSLTRCP 387 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-EGQDLLRKCTDSSGQFFAVNDSRE 375 + A+ ++ + C M+ G+++Y+V D L +C S F+ Sbjct: 388 NAANGVCSSAQAVSQCTAMKAKGIEVYTVGFQLDNQTAIDTLSQCATDSSHFYNSTTGDA 447 Query: 376 LLESFDKITDKIQEQSV 392 L +F I KI + Sbjct: 448 LKAAFRDIALKISTLYL 464 >gi|92117939|ref|YP_577668.1| hypothetical protein Nham_2418 [Nitrobacter hamburgensis X14] gi|91800833|gb|ABE63208.1| conserved hypothetical protein [Nitrobacter hamburgensis X14] Length = 483 Score = 143 bits (360), Expect = 5e-32, Method: Composition-based stats. Identities = 55/456 (12%), Positives = 124/456 (27%), Gaps = 66/456 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI + F+ A+D R+ MQ+A+D+A L + Sbjct: 28 IFAIALLPMLGFVGAAVDYTRANAARSSMQAAMDSAALMVAKDANAASPQMTADQVTAAA 87 Query: 61 STIFKKQIKKH-----------------------LKQGSYIRENAGDIAQKAQINITKDK 97 F L ++ + + QI+ + Sbjct: 88 QKYFNALYHNTDAQGASVSAVYTPYNNGTPATVVLSGSGNVQTDFMKVVGFPQISFKTNS 147 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAIS----ICMV 153 A T ++ G + + ++ + V Sbjct: 148 TATWGNTKLRVAMALDVTGSMSSAGKLVQMKIAAKKLIDTLKASATAEGDVYISIIPFNV 207 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 + + + ++ + ++ NT S A + I Sbjct: 208 MVNVGANNNTASWLEWEDGSYDNSSSNYGSCSGSGKSKPNTKSSCIAAGKTWTPKNISSW 267 Query: 214 IESAGNL-------VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN--------- 257 + E + +A N + P+++ Sbjct: 268 KGCVTDRGPVSKPGSGDYDTTKDEPVASTPYTLYLARNYSTCPSSILPMTSAYDSKESDS 327 Query: 258 --NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR-LKKFVIFITDG 314 + + +K ++N L TN AM A+ L +I ++DG Sbjct: 328 STDDSTLKGKINNLVANGATNQAIAMQMAWMMLQPTAPFPAPAKDEKYKYTDAIILLSDG 387 Query: 315 ENSGASAYQNTLNTLQ--------ICEYMRNAGM---------KIYSVAVSAPPEGQ-DL 356 N+ Y N + +C ++N + +IY++ V+ + + + Sbjct: 388 LNTQDRWYGNGSDWSSQVDTRQALLCNNIKNDPISKTDPTRRTRIYTIQVNTDGDPESTV 447 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 L+ C FF + + + +F +I + + + Sbjct: 448 LKNCATDG--FFPTSTASGIASAFAQIGASLSQLRI 481 >gi|260775644|ref|ZP_05884540.1| protein TadG associated with Flp pilus assembly [Vibrio coralliilyticus ATCC BAA-450] gi|260608060|gb|EEX34229.1| protein TadG associated with Flp pilus assembly [Vibrio coralliilyticus ATCC BAA-450] Length = 407 Score = 143 bits (359), Expect = 6e-32, Method: Composition-based stats. Identities = 56/418 (13%), Positives = 127/418 (30%), Gaps = 43/418 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++I + F D A + + +++ A A+ ++ D D+ Sbjct: 12 LFAMLIPLLFGVFALGSDGARAIQSKARIEDA-------SEAAALALSARDDEHAMSDEN 64 Query: 61 STIFKKQIKKHLKQGSYIRENAGD--IAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 TI + I+++L G + + + + Sbjct: 65 KTIVQAYIEEYLPVEDSDVTILGIERLECDDMPECRQGSGRGEARYTQYSVRVSADQTPW 124 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 F G + + G R ++ A+ I D S SM + + Sbjct: 125 FGGGSPEVEVPEVWRSQGGAKARKYQSNAVDIVFAADFSGSMASPWTGGSQPKYRDLIDI 184 Query: 179 ------YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 L P ++ + S + V L+ + + + Sbjct: 185 LEKVTVELAPYNFDSQRYNSSVGVSGFNALTYRNEL-CAVNNLEKQGLLGVVDYSRTVAR 243 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + G PL+++ + +++ T +Y A+ R L + Sbjct: 244 MWETKSCRPPSISNSAGFHDVPLTDDYSTFNRTVDRFTARGGTASYQAVMSGARLLDHGS 303 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM------------RNAGM 340 + + +I I+DG+++ + + N L +C + R+ Sbjct: 304 NNR----------QILIVISDGQDNNLN-HTNGLVNAGMCRDIISRLEGRPSANGRDVSA 352 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 ++ + P + +C F ++ EL F++I I+E+ +A R Sbjct: 353 RLAFIGFDFEPSMNPAMVRCV-GEDNVFKAENTDEL---FEQIMFLIREEVGHLATRR 406 >gi|83859216|ref|ZP_00952737.1| hypothetical protein OA2633_12465 [Oceanicaulis alexandrii HTCC2633] gi|83852663|gb|EAP90516.1| hypothetical protein OA2633_12465 [Oceanicaulis alexandrii HTCC2633] Length = 441 Score = 143 bits (359), Expect = 7e-32, Method: Composition-based stats. Identities = 55/430 (12%), Positives = 124/430 (28%), Gaps = 66/430 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M A+++ + + A+D + I ++QSA+DA L+ + ++ + Sbjct: 38 MFAMLLGPLVVSVGGALDYSRTFTIGAEIQSAMDAGTLAAASL-----------SQGEDP 86 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 TI + I L + + + E N + A++ + Sbjct: 87 ETIVRNYITAALSEHNGVLERLNVQVSSD------LAINSREVTADAVISV-----PTLM 135 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 G+I L+ S + I +VLD+S SM + + + Sbjct: 136 LGIIGYDALTLNRVSEANERVR----NLEISLVLDISGSMSGSKITA-LRDAAEEFVGVM 190 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + P + + P + + + + Sbjct: 191 MDPDLEGLTSLSVIPYNGGVRLPQTVTNDLVPGTPNDSGCLELGVSDPVTMDLAANGYDW 250 Query: 241 IAYNIGIV------------GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 + + L + + + + L+ NT A R L Sbjct: 251 LDWQDRDQRGWRSSAFCPEENEATVFLEQTPSVLVNLIRDLDAGGNTGLDVATAWGARAL 310 Query: 289 YNEKES--------SHNTIGSTRLKKFVIFITDGENS-----GASAYQNTLNTLQI-CEY 334 K ++ +TDG + + Y + + Sbjct: 311 DPAWRGRLGGDFASRPAAYDDPSTMKVLVVMTDGAATAQIRRAQNWYGDWYSYEIYSASQ 370 Query: 335 MRNA-----------GMKIYSVAVS-APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 R+ G+ IY++A + ++L+R C ++AV + ++ +F+ Sbjct: 371 ARDNMADACDAAEAEGVHIYTIAFQVSGSTNRNLMRDCASRPENYYAVENL-DISAAFNS 429 Query: 383 ITDKIQEQSV 392 I + + Sbjct: 430 IAADLNNLRL 439 >gi|254780934|ref|YP_003065347.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] gi|254040611|gb|ACT57407.1| hypothetical protein CLIBASIA_04165 [Candidatus Liberibacter asiaticus str. psy62] Length = 374 Score = 142 bits (358), Expect = 7e-32, Method: Composition-based stats. Identities = 72/400 (18%), Positives = 159/400 (39%), Gaps = 59/400 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKD-- 58 +TAI + + FL + I+++HI +++ + S +D +++ I+++ + K Sbjct: 22 LTAIFLPIIFLVLGMIIEVSHIFFMKTVLHSMIDRSLVHAATQIMNEGNGNNRKKLKGGD 81 Query: 59 ---QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT 115 + + + L+ ++ + DI + ++I N Y + ++Y+IP Sbjct: 82 ILCRIKNTWNMSFRNELRDNGFVNDI-DDIVRSTSLDIVVVPQNE-GYSISAISRYKIPL 139 Query: 116 EN-LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 + F+ S + + S + S + + + +VLDVSRSME + Sbjct: 140 KFCTFIPWYTNSRHIVMPITS-SVKVNSQTDARLDMMIVLDVSRSMESFFDSS------- 191 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 KID+ I+S ++ ++ N Sbjct: 192 --------------------------------ITKIDMAIKSINAMLEEVKLIPDV--NN 217 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYN-EK 292 V+ G + ++ I L ++ ++ ++ L+ + +TN+ P + +AY ++++ + Sbjct: 218 VVQSGLVTFSNKIEEFFL--LEWGVSHLQRKIKYLSKFGVSTNSTPGLKYAYNQIFDMQG 275 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 H KK ++F+TDGEN Q +L C + G +Y++ + Sbjct: 276 MRQHCNTEDANYKKIIVFMTDGENLSTKEDQQSLY---YCNEAKKRGAIVYAIGIRV-IR 331 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + LR C F+ V + + ++F I I + + Sbjct: 332 SHEFLRACAS-PNSFYLVENPHSMYDAFSHIGKDIVTKRI 370 >gi|261251589|ref|ZP_05944163.1| hypothetical protein VIA_001610 [Vibrio orientalis CIP 102891] gi|260938462|gb|EEX94450.1| hypothetical protein VIA_001610 [Vibrio orientalis CIP 102891] Length = 396 Score = 141 bits (356), Expect = 1e-31, Method: Composition-based stats. Identities = 48/398 (12%), Positives = 122/398 (30%), Gaps = 51/398 (12%) Query: 5 IISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIF 64 ++ + I + + + + N+ A DAA L+ + T+ Sbjct: 21 MLIPMIIAAASTIVIGYQVQLSNRGMQATDAASLAC-------------EFSGEYDGTMA 67 Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + + + + + G + Y + T L Sbjct: 68 QGYLDYYRPKIDKVSGQIGTHSGCN-----------------VSLSYSLSTIFTSLTLSD 110 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY--------------LQKHND 170 S + + + + + + + +VLD+S SM ++ N+ Sbjct: 111 ASFVVSSTANEKAYVTEDVASEPLELILVLDISGSMASDLDDLKAILKRGLASLKEQQNN 170 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + K + P + ++ + +A + N Q Sbjct: 171 ALSKDHIKVSIVPFSDGVSVNNAPWLNETGTFCVEGITESGGKFSAAHTVANLDITHDQT 230 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + + PL+ +LN+V + ++ L T +Y + R+L Sbjct: 231 PVKTFQPDKWLM--DCSAMSVTLPLTADLNQVTNAVDSLRTEGGTASYQGLIWGLRQLTP 288 Query: 291 EKESSHNTIGSTRLKKF---VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + + K ++ +TDG ++ + L +C+ ++ G+ + V Sbjct: 289 NWQKAWEVGPNRNFDKVERKLVLMTDGA--DYGSHFDELINAGLCDRAKDYGVALNFVGF 346 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + +C + F+ ++++EL F ++ Sbjct: 347 GVYGARLEQFTRCAGDANGVFSASNTQELDSYFSQLLS 384 >gi|316933619|ref|YP_004108601.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] gi|315601333|gb|ADU43868.1| hypothetical protein Rpdx1_2276 [Rhodopseudomonas palustris DX-1] Length = 483 Score = 140 bits (353), Expect = 3e-31, Method: Composition-based stats. Identities = 62/455 (13%), Positives = 130/455 (28%), Gaps = 64/455 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVS-------------- 46 + I + F+ A+D + R MQSALD+ L + S Sbjct: 28 IFGIALLPLLGFVGAAVDYSRASRARTAMQSALDSTALMVAKDLTSGKITAENVQSAANT 87 Query: 47 -------DRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 + T + K + I + +Q+++ Sbjct: 88 YFTSLYKNTDAPSIDVTATYTPKTSSENAKLTVGGTGSINTEFMKVMNISQMSLGASSTT 147 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSEN----LAISICMVLD 155 A T ++ G + + T + S+ ++I V+ Sbjct: 148 TWGGTRLRVALALDVTGSMDSAGKLSAMKTAAKQLIDTLKATSTTKEDVYISIVPFNVMV 207 Query: 156 VSRSMEDLYLQKHNDNN------NMTSNKYLLPPPPKKSFWSKNTTKSK------YAPAP 203 D + T+ ++WS K A Sbjct: 208 NVGPGNKNATWLDWDTSYGSCKSKYTTKNACQAGGDSWNYWSNTCQSQKTLKSACQAGGH 267 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL-------- 255 ++ + + E + + +A N P+ Sbjct: 268 TWTASNVNSWKGCVTDRTQNYDTTKTEPTSATPDTLFLAQNYSDCMASLLPMKSAYEATE 327 Query: 256 ---SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR-LKKFVIFI 311 S + +K R+N L+ TN M A+ L S ++ + Sbjct: 328 SDSSTDATTLKGRINTLDAQGGTNQGIGMFWAWMTLQATAPLYTPAKDSEYKYTDAIVLL 387 Query: 312 TDGENSGASAYQNTLN--------TLQICEYM--RNAGM---KIYSVAVSAPPEGQ-DLL 357 +DG N+ Y N N +C+ + + G+ IY++ V+ + + +L Sbjct: 388 SDGMNTKNRWYGNGSNWSPQVDDRQKILCDNITTKVNGVPETTIYTIQVNTSGDPESSVL 447 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + C + G FF+ + + +F ++ + + + Sbjct: 448 KYCGSTGG-FFSTTTASGIQSAFQEVGASLTKLRI 481 >gi|59713412|ref|YP_206187.1| TadG-like protein [Vibrio fischeri ES114] gi|59481660|gb|AAW87299.1| TadG-like protein [Vibrio fischeri ES114] Length = 423 Score = 140 bits (352), Expect = 4e-31, Method: Composition-based stats. Identities = 52/426 (12%), Positives = 120/426 (28%), Gaps = 50/426 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++I F A D A + + +++ A + A L+ A D+ T + Sbjct: 15 LFAMMIPALFGIFALASDGARAIQTKARIEDASEVAALAISAHNDPDQPDNGSYTPSTRN 74 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I + ++ + + + I+ ++EI Sbjct: 75 RQIVVDYVNAYISDIDAVTDIKVAKRRCELISGCVAGLYKGD---ARYLEHEIDVTTRQN 131 Query: 121 KGLIPSALTNLSLRSTGIIERS----SENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + + +S ++ A+ + D S SM D + N Sbjct: 132 SWFPGNEAIEGMGETFSTRGKSLARKYQSEAVDVMFAADFSGSMLDTWSGSSNPKYIDLI 191 Query: 177 NKYLLPPPPKKSFWSK--NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 + F N KS + ++ + + N Sbjct: 192 EIIRNISVELQKFNDLPENRDKSTMGISAFSTFTNSFTSDTGIQCSLSQGVNSKNKPGNW 251 Query: 235 S--------------VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 + + G L++N N + ++ T +Y A Sbjct: 252 FRPVKPANTVANIWNEKTEDYCKSGAYAGFHDVNLTSNFNSLNGQVGSFYAGGGTASYQA 311 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN--- 337 + + L ++ +I ++DG ++ + N L + +C ++ Sbjct: 312 LIRGAQLLD----------RGRNSRRLLIVLSDGMDNDRNLA-NGLVSNGMCREIQAGLE 360 Query: 338 -------AGM--KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + K+ + P L+ C + D+ E+ + I + I Sbjct: 361 SDRTPDGRPIAAKMAVIGFDYDPFANKALKDCV-GEKNVYKAEDADEVEDI---ILELIN 416 Query: 389 EQSVRI 394 E+ + Sbjct: 417 EEVGHL 422 >gi|90406741|ref|ZP_01214934.1| hypothetical protein PCNPT3_01875 [Psychromonas sp. CNPT3] gi|90312194|gb|EAS40286.1| hypothetical protein PCNPT3_01875 [Psychromonas sp. CNPT3] Length = 404 Score = 140 bits (351), Expect = 5e-31, Method: Composition-based stats. Identities = 61/398 (15%), Positives = 133/398 (33%), Gaps = 42/398 (10%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 ++ + +I A + + A D + ++ S ++ ++ Sbjct: 16 FIFLLPAMLAMLALSILTAMYLLSVTRASQASDVSSIACAYSQRANVSLTQG-------- 67 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 F + K + + ++ +I I L+ + + +Q ++ Sbjct: 68 --FAQYYKPNFISHVNAQSTFLSGQKQCKIQIGYAFTPLLKDLLPASSQNKVHA------ 119 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 ++ ++ST + SE + + +VLD+S SM N N Sbjct: 120 --------SVQIQSTSTLTVHSEIKPMDLSLVLDISGSMSGRIGLLKRIINQAIQNIEQQ 171 Query: 182 PP-PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ-----------KAIQ 229 + +S S + + AP K + Sbjct: 172 NTKNNTQIRFSIVPFSSGVSISNAPWLAKSKGKALCVDAMSYPGNVLNTAQTVADIDTHP 231 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 K N+ + N V + PL+NNL++V+ ++ L+ +T +Y R L Sbjct: 232 SKLNIRAKEPLSLINDCNVYSLLLPLTNNLSKVRKHVDSLSILGSTASYQGFIWGVRTLL 291 Query: 290 NEKESSHNTIGSTRL--KKFVIFITDGENSGASAYQNTLNTLQICEYMRNA-GMKIYSVA 346 + + N T + +I TDGE + + L +C+ +++ + I + Sbjct: 292 PNWQKAWNLQPETSSLLSQRLILFTDGE-DDSRDQFDKLVRSGMCQRIQDDFNIDISFIG 350 Query: 347 VSAPPEGQDLLRKCTDSSGQ--FFAVNDSRELLESFDK 382 P D +KC S+G+ + + +L + F + Sbjct: 351 FGLSPRRLDQFKKCIGSNGKGVVYDAKNGSDLEKFFAE 388 >gi|90418244|ref|ZP_01226156.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90337916|gb|EAS51567.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 489 Score = 137 bits (344), Expect = 4e-30, Method: Composition-based stats. Identities = 52/491 (10%), Positives = 120/491 (24%), Gaps = 134/491 (27%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTA+++ + AIDL +R+ +Q LD VL+ + + ++ Sbjct: 26 MTALMLVPMIVISGGAIDLIAHERLRSVLQDGLDRGVLAAAS-------LTQTRPPRETI 78 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + K + + + K E+ A + Sbjct: 79 ESFLKAAVT-----------------KGSYALDVKADELSNAKRVEASATA---VTDTAF 118 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH-----------N 169 LI + + + I I ++LD+S SM + Sbjct: 119 LRLIGIDKLTVEAHAEA----EEKRKNIEISLLLDMSGSMRFDKSGSYPGPSGAMRINYL 174 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR------KIDVLIESAGNLVNS 223 + +L + + + N Sbjct: 175 RPAAKSFMDMVLADGAEDYTTVSIVPYAGQVSIGPVLFDALARNRRQHDRSSCFQFGRND 234 Query: 224 IQKAIQEKKN-----------------------LSVRIGTIAYNIGIVGNQCTP------ 254 + + N ++ Sbjct: 235 FTLGVPDFANLPQTQHFTQANHHDALKKAGEAQITEPWWCPDDPHDPRPGTTPDFVAGEG 294 Query: 255 ----------LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE----------- 293 LSN+ +K +++ Y+ T T A+ L + Sbjct: 295 KDTDRTSVSFLSNDREYLKRQIDNYKLYDGTGTPIALKWGLLLLDPAIQPMLREAARYRA 354 Query: 294 ------------SSHNTIGSTRLKKFVIFITDG----------------ENSGASAYQNT 325 + + KF++ +TDG + S + Sbjct: 355 LSEELDIDARFSNRPASFTDPDTMKFLVLMTDGAISSQRIPKDASKPVQYYNNGSLNTDL 414 Query: 326 LN-------TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + +C + + ++++ + C + +F+ V ++ ++ + Sbjct: 415 YSVGDAERFAAALCTAAKQKNVIVFTIGFDVNDTAAKQMSNCASGAERFYRV-NALDIQD 473 Query: 379 SFDKITDKIQE 389 +F I IQ+ Sbjct: 474 AFKSIATAIQK 484 >gi|78357411|ref|YP_388860.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219816|gb|ABB39165.1| von Willebrand factor, type A [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 402 Score = 136 bits (341), Expect = 8e-30, Method: Composition-based stats. Identities = 49/434 (11%), Positives = 123/434 (28%), Gaps = 74/434 (17%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+++ V + +D + +++Q+A+DAA L+G + D + Sbjct: 1 MAVLLPVILGIMGLGLDSGMLYLSHSRLQAAVDAAALAGSLQLPYDPAMDKG-----LVR 55 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + + Q G A+ T Sbjct: 56 AAVDEYMHANFPQAVVQSVLPGAEE----------------RSVTVNAEA---TVGTIFM 96 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 G + + + +++ + + V+D S SM+ + + N + + Sbjct: 97 GALGIGSSTVRAQASAGYNN------LEVVFVIDNSGSMKGSPINETNAAATRLVDLIMP 150 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG-- 239 + K PA + + G+L +K Sbjct: 151 EGMATSVKIGLVPFRGK-VRIPADVDGLPSGCRNADGSLNEDGLLDEYKKPEYRYPYNDR 209 Query: 240 -TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE---NTNTYPAMHHAYRELYNEKESS 295 + L+ + + + + + T + A L E + Sbjct: 210 LRVTPYSCSSIPLTQGLTADRATITQAIGRQDARGDSSGTVISEGLKWARHVLTPEAPFT 269 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQN------------------------------- 324 + + ++K +I +TDG+ + N Sbjct: 270 EGS-SAKDMRKVIILLTDGDTEDGNCGGNYSVYYRPNNYWTNAYYGMMDMDSHCEDGGVL 328 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSA-PPEGQDLLRKCTD----SSGQFFAVNDSRELLES 379 L ++AG++I+++ + ++L+R + +F ++ + Sbjct: 329 NNAMLSEAALAKDAGIEIFAIRYGSSDAVDRNLMRAVASSKEGTDDHYFDAPSPYDIDDV 388 Query: 380 FDKITDKIQEQSVR 393 F I ++ + +R Sbjct: 389 FKLIGRQLGWRLLR 402 >gi|86147193|ref|ZP_01065509.1| TadG-like protein [Vibrio sp. MED222] gi|85835077|gb|EAQ53219.1| TadG-like protein [Vibrio sp. MED222] Length = 435 Score = 135 bits (339), Expect = 1e-29, Method: Composition-based stats. Identities = 52/449 (11%), Positives = 123/449 (27%), Gaps = 83/449 (18%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI+I F D A + + +++ A +AAVL+ K +Q Sbjct: 15 LFAIMIPALFGVFMLGSDGARALQTKARLEEASEAAVLAVS-------------AKDEQD 61 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + ++ I+ +L I + +I + +Y + + L Sbjct: 62 HQLAERYIQHYLYDMDSILDIEVKKLGCDEIPEC---IAATERGEARYFEYRVAGQTLHK 118 Query: 121 KGLIPSALTNLSLRST----GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + + + S R ++ I I ++D S SM D + + N Sbjct: 119 SWFPGNDVISGFGDSFNVTGSSKARRYQSQPIDITFIVDFSESMNDSWSGGRHSKLNDLK 178 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA-----GNLVNSIQKAIQEK 231 + ++ R I+ +V+ + ++ Sbjct: 179 DIIEDVADELGAYNDLYPEHPHRVALTGFNRRTINKDKNDNLVVRDQRVVSREGEYDKDD 238 Query: 232 KNLSVRIGTIAY-----------NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 + + + + + + ++ K T + Sbjct: 239 TVNFNKTIAQQFIVKGEASRVPNSDDDARFYDLYFTTDFSSFTKKVKKFKAGGGTASLQG 298 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG-ENSGASAYQNTLNTLQICEYMRNA- 338 + A + + + S K+ +I ++DG + + + N L + +C + N Sbjct: 299 IIRAGQIVTS---------MSKNQKQLIIILSDGEDWNHYAGQTNKLVSKGMCSNILNMV 349 Query: 339 --------------------------------GMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 ++ + LR C Sbjct: 350 NGGKVSADNTHDDIEVIGGVSQGMMTPDGERMNARMAVIGFDYELNKNVGLRNCV-GRDN 408 Query: 367 FFAVNDSRELLESFDKITDKIQEQSVRIA 395 + + + +KI I E+ +A Sbjct: 409 VYKAENKE---DILNKILGLITEEVGHLA 434 >gi|218708116|ref|YP_002415737.1| hypothetical protein VS_0028 [Vibrio splendidus LGP32] gi|218321135|emb|CAV17085.1| Conserved hypothetical protein, putative exported, TadG [Vibrio splendidus LGP32] Length = 435 Score = 134 bits (336), Expect = 3e-29, Method: Composition-based stats. Identities = 51/449 (11%), Positives = 122/449 (27%), Gaps = 83/449 (18%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI+I F D A + + +++ A +AAVL+ K +Q Sbjct: 15 LFAIMIPALFGVFMLGSDGARALQTKARLEEASEAAVLAVS-------------AKDEQD 61 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + ++ I+ +L I + ++ + +Y + + L Sbjct: 62 HQLAERYIQHYLYDMDSILDIEVKKLGCDEMPEC---IAATERGEARYFEYRVAGQTLHK 118 Query: 121 KGLIPSALTNLSLRST----GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + + + S R ++ I I ++D S SM D + + N Sbjct: 119 SWFPGNDVISGFGDSFNVTGSSKARRYQSQPIDITFIVDFSESMNDSWSGGRHSKLNDLK 178 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA-----GNLVNSIQKAIQEK 231 + ++ R I+ +V+ + ++ Sbjct: 179 DIIEDVADELGAYNDLYPEHPHRVALTGFNRRTINKDKNDNLVVRDQRVVSREGEYDKDD 238 Query: 232 KNLSVRIGTIAYN-----------IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 + + + + + ++ K T + Sbjct: 239 TVNFNKTIAQQFIVKGEASRVPNGDDDARFYDLYFTTDFSSFTKKVKKFKAGGGTASLQG 298 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG-ENSGASAYQNTLNTLQICEYMRNA- 338 + A + + + S K+ +I ++DG + + + N L + +C + N Sbjct: 299 IIRAGQIVTS---------MSKNQKQLIIILSDGEDWNHYAGQTNKLVSKGMCSNILNMV 349 Query: 339 --------------------------------GMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 ++ + LR C Sbjct: 350 NGGKVSADNTHDDVEVIGGVSQGMMTPDGERMNARMAVIGFDYELNKNVGLRNCV-GRDN 408 Query: 367 FFAVNDSRELLESFDKITDKIQEQSVRIA 395 + + + +KI I E+ +A Sbjct: 409 VYKAENKE---DILNKILGLITEEVGHLA 434 >gi|114799275|ref|YP_759187.1| hypothetical protein HNE_0457 [Hyphomonas neptunium ATCC 15444] gi|114739449|gb|ABI77574.1| conserved domain protein [Hyphomonas neptunium ATCC 15444] Length = 512 Score = 134 bits (336), Expect = 3e-29, Method: Composition-based stats. Identities = 55/490 (11%), Positives = 125/490 (25%), Gaps = 100/490 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +TA +I AIDL + + ++++Q+ALD+AVL+G + + Sbjct: 25 ITAFVIPCILALTGIAIDLQNTVRQKSKVQAALDSAVLAGA---LGRQAGNTAAETTLDV 81 Query: 61 STIFKKQIK-----------KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 T + + ++ + ++ L++ S + Sbjct: 82 QTYALALFTDQGGGLDCDPVAVTFDETNLDILGTVRCRQPTYLSSLIGHDELEFNVASTS 141 Query: 110 QY-----EIPTENLFLKGLIPSALT-NLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 Y ++ + L + ++ + V S Sbjct: 142 TYGVGKLDVAFIFDVSGSMNSYNRLAQLKTAAVAAVDELLPDSRERDGTVRLAIASYNHS 201 Query: 164 YLQKHNDNNNMTSNKYLLPPPP------------------------------KKSFWSKN 193 + S W+ Sbjct: 202 LNAGAYIGAVTETVTLSADGSNSTALSRYNSHNTKRMIDQDSGKRFFYYQSGTCSSWNCG 261 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI------------------------- 228 S + + Sbjct: 262 KYSSWSWDTKRRFFDDTGLADACVYERTGTQAATDAAPGSGAWIGAGNPRWSFYAGSSSK 321 Query: 229 QEKKNLSVRIGTIAY--------NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 + Y + + + PL+ + +K +N L T + Sbjct: 322 YDGWQNVENQNATGYGVGAYEGRHGTCMPSGPVPLTEDKTVLKDHVNALVAEGGTAGHLG 381 Query: 281 MHHAYRELYNEKESSHNTIGSTRL------KKFVIFITDGENSGAS---AYQNTLNTLQI 331 + + + E + K VI +TDG+ + + + ++ + Sbjct: 382 IAWGWYLVSPEWAAIWPEASEPLPYRQPQTSKAVILMTDGDFNIEHPTASRDSFRQSMDL 441 Query: 332 CEYMRN--AGMKIYSVAVSAPP------EGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 C+ M+ ++IY+V P +G+ +L C S F+ + EL+E + I Sbjct: 442 CDGMKASSRRIQIYTVGFQVPSSVQRTGDGRTILEYCATSPSHAFSADSGEELIEVYRSI 501 Query: 384 TDKIQEQSVR 393 I + ++ Sbjct: 502 ARSISDLRLK 511 >gi|218506715|ref|ZP_03504593.1| hypothetical protein RetlB5_03444 [Rhizobium etli Brasil 5] Length = 269 Score = 133 bits (335), Expect = 3e-29, Method: Composition-based stats. Identities = 68/273 (24%), Positives = 113/273 (41%), Gaps = 16/273 (5%) Query: 132 SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS 191 S + S +IS+ + LD S SM D N ++ Y P K Sbjct: 3 STSGRTVSGHSQSQGSISMFLALDKSGSMGDP-TATVNADDPTEPFTYDCNPHLNKK--G 59 Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 + A KI+ L +AGNL + + A + VR G ++Y +V Sbjct: 60 TKIIYDTCTGSRAHYYTKIEALKIAAGNLFSQLNSA--DPNAEYVRTGAVSY--DLVEYT 115 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL-------YNEKESSHNTIGSTRL 304 + L+ + V S +N L TN+ A++ AY L + ++++H Sbjct: 116 PSKLAWGITAVTSYVNALESGGGTNSSGAVNTAYTSLTAKNAAGNDAEDAAHKLKTGQLP 175 Query: 305 KKFVIFITDGENSGASAYQNTLNT--LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 KK+++F+TDG+N+ S + +T C+ + G++ Y++A AP GQ LL C Sbjct: 176 KKYIVFMTDGDNNDDSRGGRSYDTLTKATCDTAKAKGIETYTIAFMAPEGGQALLHYCAS 235 Query: 363 SSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 +F +LL +F I K Q R+ Sbjct: 236 DDAHYFQAEKMEDLLAAFKAIGAKASAQVTRLT 268 >gi|197337036|ref|YP_002157821.1| hypothetical protein VFMJ11_A0264 [Vibrio fischeri MJ11] gi|197314288|gb|ACH63737.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 423 Score = 133 bits (335), Expect = 4e-29, Method: Composition-based stats. Identities = 51/423 (12%), Positives = 119/423 (28%), Gaps = 48/423 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++I F A D A + + +++ A + A L+ A D+ T + Sbjct: 15 LFAMMIPALFGIFALASDGARAIQTKARIEDASEVAALAISAHNDPDQPDNGSYTPSTRN 74 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I + ++ + + + I ++EI Sbjct: 75 RQIVVDYVNAYISDVDAVTDIKVAKRRCELIPECVAGLYDGDMRY---LEHEIDVTTRQN 131 Query: 121 KGLIPSALTNLSLRSTGIIERS----SENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + + +S ++ A+ D S SM D + N Sbjct: 132 SWFPGNEAIEGMGETFSTRGKSLARKYQSEAVDAMFAADFSGSMLDTWSGSSNPKYVDLI 191 Query: 177 NKYLLPPPPKKSFWSK--NTTKSKYAPAPAPANRKIDVLIESAGNLVNS----------- 223 + F N KS + ++ Sbjct: 192 EIIRNISAELQKFNDLPENRNKSTMGISAFSTFTNSFTSDTGIQCSLSQGVNGRNGPATW 251 Query: 224 ---IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 ++ A + + + G L++N N + ++ T +Y A Sbjct: 252 FRPVKAANTVANIWNPKTEDYCKSGAYAGFHDVNLTSNFNYLNGQVGSFYAGGGTASYQA 311 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN--- 337 + + L ++ +I ++DG ++ + + L + +C ++N Sbjct: 312 LIRGAQLL----------RKGNNSRRLLIVLSDGMDN-DTQLADGLVSAGMCRDIQNGLE 360 Query: 338 -------AGM--KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE-SFDKITDKI 387 + K+ + P L+ C + D+ E+ + + I ++I Sbjct: 361 SDRTPDRRPIAAKMAVIGFDYNPFANKALKDCV-GEKNVYKAEDADEVEDIILELINEEI 419 Query: 388 QEQ 390 Sbjct: 420 GHL 422 >gi|144898053|emb|CAM74917.1| conserved hypothetical protein, secreted [Magnetospirillum gryphiswaldense MSR-1] Length = 460 Score = 133 bits (334), Expect = 4e-29, Method: Composition-based stats. Identities = 60/472 (12%), Positives = 118/472 (25%), Gaps = 110/472 (23%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + L + A+D A +++++ ALDAA L+ +S + + Sbjct: 17 IFALALIPLSLSVGLAVDTARAYAVKSKLSQALDAAALAVGSSTGT----------AAEL 66 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I +K + K + + + + L Sbjct: 67 QQIGQKFFDANFKDSGLDAAGSF----------------SVSVTGDVVSANGSAQVQTTL 110 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L+ +S + + + +VLD + SM + L Sbjct: 111 MQLVGIDTIAVSESAQ----VIRSIKGLELALVLDNTGSMTTSDNIGALRDAAQELVDIL 166 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPAN------------RKIDVLIESAGNLVNSIQKAI 228 + P P + V + Sbjct: 167 FGGRADHPTLRVAVVPYSASVNPGPIAPTLISGNDAYAPTNLLGWKGCVIERVGRAMEDS 226 Query: 229 QEKKNLSVRIGTIAYNIGI-------------------------VGNQCTPLSNNLNEVK 263 +R + TPL+ V Sbjct: 227 PASTAPWLRYQWLPAIDNYYDATKASTVRADPSQGNGGTGPNLGCPTPITPLTGVKATVD 286 Query: 264 SRLNKLNP--YENTNTYPAMHHAYRELYNEKE-SSHNTIGSTRLKKFVIFITDGENSGAS 320 S + L T M R L E + + + K VI +TDG+N Sbjct: 287 SAIQALRAWSRGGTMGDIGMAWGLRVLSPEPPFTEGLAWNTPKWAKAVILMTDGDNQFYK 346 Query: 321 AYQN--------------------------------------TLNTLQICEYMRNAGMKI 342 Q+C+ M++ G+ + Sbjct: 347 LTSTTGPNKVNSAVNSDYSGYGRLDQYGALGTTSTTTAKSVINTRLTQVCQAMKDKGITV 406 Query: 343 YSVAV--SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 Y++ +D+ + C S+ ++F +L SF I ++ + V Sbjct: 407 YTITFTSGINQATKDIYKACASSTAKWFDSPSQADLRASFRAIATELSQLRV 458 >gi|312883763|ref|ZP_07743482.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] gi|309368512|gb|EFP96045.1| hypothetical protein VIBC2010_14219 [Vibrio caribbenthicus ATCC BAA-2122] Length = 396 Score = 133 bits (333), Expect = 6e-29, Method: Composition-based stats. Identities = 43/389 (11%), Positives = 121/389 (31%), Gaps = 33/389 (8%) Query: 5 IISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDP-----TTKKDQ 59 ++ + I + + + + N+ A DAA ++ D+ + K D+ Sbjct: 21 MLIPMVIAAASTIVIGYQVQLSNRAMQAADAASIACEFKGEYDQALTQSYLDYYQPKIDK 80 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 + ++ G + + + N Y+ E + Sbjct: 81 VRGQIRTNSGCNMSLGYSLSTIFTSLTLSDTSFVVSSTANEKAYVTEDVVSDPLELV--- 137 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 L + ++ + + +V ++ N+ + K Sbjct: 138 ---------IVLDISTSMYGAINDLKAILKRGIV---------SLKEQQNNAQSEDHIKV 179 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 + P + + + + +A + N ++ ++ + Sbjct: 180 SIIPFSTGVSVNNAPWLNDARTFCVDGTTESEDKFYAARTVANL--DITHDQISVKLSQP 237 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + PL+ +L++V + ++ L T +Y + R+L + + Sbjct: 238 NKWRESCSAASFTLPLTADLDQVTNTVDSLRTEGGTASYQGLIWGLRQLTPNWQKAWEVG 297 Query: 300 GSTRLKKF---VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + + K ++ +TDG + Y + L +C+ ++ G+ + V + Sbjct: 298 PNRNVDKVERKLVLMTDG--NDYGRYFDDLINAGLCDRAKDYGIALNFVGFGVNGSRLEQ 355 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITD 385 +C F+ +D+++L F ++ Sbjct: 356 FTRCAVDPKGVFSASDTQDLDHYFSQLLS 384 >gi|148258759|ref|YP_001243344.1| hypothetical protein BBta_7591 [Bradyrhizobium sp. BTAi1] gi|146410932|gb|ABQ39438.1| hypothetical protein BBta_7591 [Bradyrhizobium sp. BTAi1] Length = 449 Score = 132 bits (332), Expect = 7e-29, Method: Composition-based stats. Identities = 56/433 (12%), Positives = 120/433 (27%), Gaps = 59/433 (13%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI+ + +D + +R ++QSA+DAA + G S S I D Sbjct: 23 FAIVCVPLITAVGCGVDYSRANQLRAKLQSAVDAASV-GAVSRTSPAFIAAGAMTADGII 81 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 T + + ++ + + + + Sbjct: 82 TAGNDDARNIFNGNMNGTTGYTLNSVTPEV-------KKTGSVLTATVSFSASV-PMMFM 133 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 ++ L S +S I ++LD S SM ++ Sbjct: 134 NIVGIKTMTLQGMSKA---TASMPKYIDFYLLLDNSPSMGVAATPDDVTKMVNATSDAKY 190 Query: 182 PPPPKKSFWSKNTTKSKYAPA---PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 +F + S +IDVL + L+++ + + I Sbjct: 191 GSNRYCAFACHDYNDSNNFYNLAKSIGVTTRIDVLRSATQQLMDTATQTQTYPNQFRMAI 250 Query: 239 GTIAYNIGI-VGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKE 293 LS NL+ KS ++ N + ++ Sbjct: 251 YDFGAASKTIGLRALFALSANLSSAKSAAGNIDLMGVYGNNDAYTADKDTPFTAVFPAVN 310 Query: 294 SS---HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL----------QICEYMRNAGM 340 + + K++ F++DG ++A +C ++N G+ Sbjct: 311 NEISTPGDGTTGSPLKYLFFVSDGVADESNAACLKPKASGNRCQSPINPALCTTLKNRGI 370 Query: 341 KIYSVA-----------------------VSAPPEGQ--DLLRKCTDSSGQFFAVNDSRE 375 KI + P + ++ C G +F V+ ++ Sbjct: 371 KIAVLYTTYLQLPTNSWYMSWIDPFNKGPFGPSPNSEIAQNMQACAS-PGFYFEVSPTQG 429 Query: 376 LLESFDKITDKIQ 388 + ++ + + K Sbjct: 430 IADAMNALFKKAV 442 >gi|149909538|ref|ZP_01898192.1| TadG-like protein [Moritella sp. PE36] gi|149807443|gb|EDM67394.1| TadG-like protein [Moritella sp. PE36] Length = 405 Score = 132 bits (332), Expect = 9e-29, Method: Composition-based stats. Identities = 57/406 (14%), Positives = 126/406 (31%), Gaps = 31/406 (7%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++I F T A D A + + +++ DAA + A + ++ Sbjct: 14 LFAMMIPAFFGIFTLASDGARALQSKARLE---DAAEAAVLAIAAHNADNSGSSSGSAIN 70 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK-NNPLQYIAESKAQYEIPTENLF 119 I I ++++ I + I K+ N + + + + F Sbjct: 71 KKIASDWIGQYMQDMQAISDIKITKLNCNDIAECKEGLENGESRYFQYEILAKTNHLSWF 130 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 + R ++ ++ + V D S SM + + N K Sbjct: 131 PGNNSTAGFGESFDVVGSATARKFQSESVDVMFVSDFSGSMNNKWSGGSNSRRYKDLIKI 190 Query: 180 LLPPPPKKSFWSK--NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 + + ++ TT ++ + + + A + + Sbjct: 191 IGDVIKELDKFNNAHTTTTNRVGFTGFNTYTRKTADNSCYQDQYDR--SAGRTVNKIFEV 248 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 G + + G +++N NE K+ + P T +Y + + + E Sbjct: 249 KGCKSRSSGGAKFHDIAMTDNYNEFKNTIKYFKPGGGTASYQGIIRGAQMMDAAPEPRPR 308 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM------------RNAGMKIYSV 345 + +I ++DG +S S N L +C + + K+ V Sbjct: 309 --------RIMIILSDGIDSKRSRA-NKLVEEGMCSKILLKLGNANTSDGKAIKTKMAVV 359 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRE-LLESFDKITDKIQEQ 390 P L KC + N+ + L + + I+++I Sbjct: 360 GFDYNPASNPSLAKCV-GEHNVYGANNPEDVLNKILELISEEIGHL 404 >gi|84515372|ref|ZP_01002734.1| hypothetical protein SKA53_01901 [Loktanella vestfoldensis SKA53] gi|84510655|gb|EAQ07110.1| hypothetical protein SKA53_01901 [Loktanella vestfoldensis SKA53] Length = 485 Score = 131 bits (329), Expect = 2e-28, Method: Composition-based stats. Identities = 61/485 (12%), Positives = 118/485 (24%), Gaps = 126/485 (25%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MT +++ + A+D R +QS D AVL+ + + + Sbjct: 36 MTILLLVTMLIMGGMAVDFMRYEARRATLQSVSDRAVLAAASLNQTLDS----------- 84 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + ++ + + + G I D N S N F Sbjct: 85 ----RDVVEDYFAKAGFPNALVG-------APIVVDNGNSRTVTVRSALDV-----NTFY 128 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L RS+ + I +VLD+S SM + + + + Sbjct: 129 LRLAGMDRLTAPARSSATEGV----GKVEISLVLDISGSMRFSNRFVNMQAAAIAFAEEV 184 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI--------------ESAGNLVNSIQK 226 L P + + + + + Sbjct: 185 LDPANGGTVSLTIIPYAGATNPGPEMFAFMGGVRYPDTLLAGDDGILGTEDDYFFPQVSS 244 Query: 227 AIQEKKNL------------------------SVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 ++ + SV + Sbjct: 245 CVEMVGSDWSSAGLPGAGRAQVPHFQVWDIARSVMDWGWCPQDRSSIQYAMA---TPAQA 301 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKE-------------------SSHNTIGSTR 303 +S +N L ++ T T+ AM +A L + + Sbjct: 302 RSFINGLRMHDGTGTHYAMKYALATLDPSSQPAFMHLSHPGRGLVPPQFANRPAAWDDPE 361 Query: 304 LKKFVIFITDGENSGASAY------------------------------QNTLNTLQICE 333 KK ++ +TDG+ + N IC Sbjct: 362 TKKIIVLMTDGDITQQERPRIAQQERDIDYIISRSINGRDNRGQFVDAATNVGRFEAICT 421 Query: 334 YMRN--AGMKIYSVAVSA--PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + +Y+VA +R C FF EL++ F I ++I + Sbjct: 422 LANQPARSVDVYTVAFEVQPNSAADLQMRNCASDPSMFFR-TSGAELIDVFSGIAERITD 480 Query: 390 QSVRI 394 + + Sbjct: 481 LRLNL 485 >gi|56696619|ref|YP_166980.1| hypothetical protein SPO1742 [Ruegeria pomeroyi DSS-3] gi|56678356|gb|AAV95022.1| conserved hypothetical protein [Ruegeria pomeroyi DSS-3] Length = 558 Score = 130 bits (326), Expect = 4e-28, Method: Composition-based stats. Identities = 43/352 (12%), Positives = 87/352 (24%), Gaps = 66/352 (18%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M + +DL R +Q +D AVL+ + + + Sbjct: 38 MALFLFLALVGAAGIGVDLMRYEQKRAALQYTMDRAVLAAA-------DLDQQVSPETVV 90 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + +K ++ + + A E+P Sbjct: 91 RSYLEKA------------------GLLEYLSSVTVQEGLGYRKVSATATAELP---THF 129 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L + ST + I +VLDVS SM + N ++ Sbjct: 130 MKLSGYDSLTIPAASTAEESI----GNVEISLVLDVSGSMNSNSRLYNLKNAAKEFVDHM 185 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV-----LIESAGNLVNSIQKAIQEKKNLS 235 L + + A A +V + + + + Sbjct: 186 LSATEPGTVSISIVPYATQVNAGADILSYYNVSTEHNYSHCVNFIDDEFSQPGLSRVTPL 245 Query: 236 VRIGTIAYNIGI------------VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 R + P SN+ + + ++ L NT+ Sbjct: 246 ERTMHFDPFSYTKDPISTPVCPVRASTEILPFSNDQTVLNNYIDGLTGRGNTSIDIGTKW 305 Query: 284 AY-----------------RELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ + + S + K +I ++DGEN+ Sbjct: 306 GVVMLDPGTQSVISGLISDNKVPASFQGRPSAYDSGDVLKVLIVMSDGENTN 357 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 42/329 (12%), Positives = 100/329 (30%), Gaps = 15/329 (4%) Query: 73 KQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA-----QYEIPTENLFLKGLIPSA 127 + G + TKD + + + N ++ GL Sbjct: 236 QPGLSRVTPLERTMHFDPFSYTKDPISTPVCPVRASTEILPFSNDQTVLNNYIDGLTGRG 295 Query: 128 LTNLSLRST-GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 T++ + + G++ ++ ++ D D+ ++ ++ Sbjct: 296 NTSIDIGTKWGVVMLDPGTQSVISGLISDNKVPASFQGRPSAYDSGDVLKVLIVMSDGEN 355 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 + + N + + + N N+ N + Sbjct: 356 TNQYMLNPSLRDGDSPVWYNAAEDVISGSPDNNTTNAFSIYHDNGNNSYYWPDQNRWADH 415 Query: 247 IVGNQCTPL-SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS-TRL 304 GN + N + S + P A+ Y EL+ + ++N + Sbjct: 416 PYGNGQSEACGYNSSGYYSCAMRDEPG------EAVRLTYAELFAKVSLAYNAYYNFEFN 469 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 + A T +C+ ++ G+ +Y+V AP G+ +L++C S Sbjct: 470 SNAWAEWYTAAMTHKEASAKDQRTDHVCDAAKDEGIIVYTVGFEAPYSGRRVLKRCASSD 529 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ D E+ ++F I I++ + Sbjct: 530 SHYYDA-DGLEISDAFTSIASSIRKLRLT 557 >gi|254466920|ref|ZP_05080331.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] gi|206687828|gb|EDZ48310.1| conserved hypothetical protein [Rhodobacterales bacterium Y4I] Length = 550 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 47/376 (12%), Positives = 92/376 (24%), Gaps = 68/376 (18%) Query: 6 ISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFK 65 +DL + R ++Q LD AVL+ + Sbjct: 42 FLAMLAVGGIGVDLMRMERDRTELQYTLDRAVLAAA-------DLDQSLDADAVVLDYLT 94 Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIP 125 K + ++ G + +A I+ + +L Sbjct: 95 KAGLEQYYSDPDDQKGLGYKSVEATIDTDFEA---------------------YLLKFAG 133 Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP 185 +L S ++ I MVLD+S SM + + + Sbjct: 134 GDNMSLYANSRAEEII----GSVEISMVLDISGSMNSGNRLVNLQAAAKSFVTQITSNTD 189 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKI------DVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 + + A K + + K + +R Sbjct: 190 VSNLSISIIPYATQVNAGEKLLSKYTKVSQEHDYSYCVNFIKDQFSKHTLNQNEDLIRTA 249 Query: 240 TIAYNIGI------------VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 G+ P +N+ ++ + ++ L NT+ M Sbjct: 250 HFDTFTYSMNMIDRPVCPTRPGSAILPFTNDAAKLHAYIDSLTASGNTSIDIGMKWGSAL 309 Query: 288 LYNEKE-----------------SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 L + GS K +I ++DG+ + N Sbjct: 310 LDPTAQPVVNALVDDKVISENFRGRPKAYGSGDTLKIIILMSDGQ-NTNQYMVNEHRRDG 368 Query: 331 ICEYMRNAGMKIYSVA 346 I + N ++SV Sbjct: 369 ISDVWYNEEADVFSVY 384 >gi|325106974|ref|YP_004268042.1| von Willebrand factor A [Planctomyces brasiliensis DSM 5305] gi|324967242|gb|ADY58020.1| von Willebrand factor type A [Planctomyces brasiliensis DSM 5305] Length = 396 Score = 128 bits (322), Expect = 1e-27, Method: Composition-based stats. Identities = 56/407 (13%), Positives = 124/407 (30%), Gaps = 59/407 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A ++SV + + + D+A++ +R Q+ + DAA +G ++ + Sbjct: 23 LIAALLSVMLILVVFTTDVAYMQLVRTQLHVSTDAAAKAGMEALARTESRGQARVVAKDI 82 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ-----YEIPT 115 + ++ I D ++ + Sbjct: 83 FSKNLIGGRELKLHNKDIEFGRTDANPDGTWEFLPNERPFQAIRISVNLDDNRQKGRNGS 142 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 L ++ + + S NL I + LD S SM Sbjct: 143 VPLLFGKVLGQSSFATNHSSVA------ANLVHEIVLCLDRSHSMCFDETG--------- 187 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK--------A 227 Y PP + Y P P + L + V+++ Sbjct: 188 -VDYAYPPGTPS-------YPAGYITPPNPVGSRWAKLQGAIQVFVDTLDDLQIVPDVGV 239 Query: 228 IQEKKNLSVRIGTIAYNIGIVGN--QCTPLSNNLN----EVKSRLNKLNPYENTNTYPAM 281 + ++++ + PL NLN + ++L + TN + Sbjct: 240 VTWGSDITLSWSWYPFQGRSFPAVMVDVPLGQNLNLVSPAIAAKLGDIM-MGGTNMSSGI 298 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 + L + +K +I ++DG+ + N L + + Sbjct: 299 DRSVSLLT-------ANGTHSLAQKTIILMSDGQWNAGR------NPLDAANDAADKNIT 345 Query: 342 IYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 I+++A Q ++R+ + + G+FF D L ++F ++ + Sbjct: 346 IHTIAFLNG--DQSVMRQIAERTGGKFFNAPDGESLEDTFKELAKML 390 >gi|13473479|ref|NP_105046.1| hypothetical protein mll4092 [Mesorhizobium loti MAFF303099] gi|14024228|dbj|BAB50832.1| mll4092 [Mesorhizobium loti MAFF303099] Length = 477 Score = 128 bits (321), Expect = 1e-27, Method: Composition-based stats. Identities = 58/473 (12%), Positives = 124/473 (26%), Gaps = 103/473 (21%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + SV L +++D++ + ++ +Q +DAAV S + + + +K Q Sbjct: 25 LFGFAASVLALAAGFSVDISQLYNAKSGLQGVVDAAVTSTARDLTTGVIKEADASKAVQN 84 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + D Q + + Sbjct: 85 FLVANSMAGILQPDQIVLDRLVVDRTAN-------------------TVQADAHVDVALF 125 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND---------N 171 + T R T + I + M+LDV+ SM + K + Sbjct: 126 FPVFGMGNTQ---RVTASTTSLYSDKTIEVAMMLDVTGSMAANWWAKTDKIGDLQAAAST 182 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYA-----------------------------PA 202 + P + + P Sbjct: 183 AVENLLDNNIDPNNPRVRVAIVPYAEAVNTGGLADSVFVEQAGGSNLPPPVPSAGAPIPV 242 Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN---------IGIVGNQCT 253 + + D + + L Y + Sbjct: 243 GSSVTLRPDKCATERKDKDGYADYSSDGPSELRRNNQNQEYLAKVNRDDRMGTCPKPELI 302 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES---------SHNTIGSTRL 304 PL+ + ++ + T A+ Y L S + ++ Sbjct: 303 PLTADKQKLLDTIADFKAAGVTAGGIAVQWGYYMLSPSWRSTIVNARLGSGPANFDNRKV 362 Query: 305 KKFVIFITDGENSGASAYQ------------NTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 K I +TDG+ + A A + N IC+ M+ G++I+++ Sbjct: 363 GKVAILMTDGQFNTAFAAGRGAPRSQNAGQMSRSNAESICDNMKRDGIEIFTIGFDLDDP 422 Query: 353 G---------QDLLRKCTDSS----GQFFAVNDSRELLESFDKITDKIQEQSV 392 + +L+ C+ + ++ EL E+F+ I I+ ++ Sbjct: 423 SMTSTERDQAKSVLQDCSTADTSTLKHYYEAATGPELDEAFNAIVQNIERLTI 475 >gi|116252440|ref|YP_768278.1| hypothetical protein RL2693 [Rhizobium leguminosarum bv. viciae 3841] gi|115257088|emb|CAK08182.1| conserved hypothetical exported protein [Rhizobium leguminosarum bv. viciae 3841] Length = 427 Score = 128 bits (321), Expect = 2e-27, Method: Composition-based stats. Identities = 59/437 (13%), Positives = 134/437 (30%), Gaps = 73/437 (16%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQM----------QSALDAAVLSGCASIVSDRTI 50 MTA+++ F A+D AH + +R Q+ A + ++ ++ + TI Sbjct: 19 MTALLVVPLFGAAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVAAAMTMSGNGTI 78 Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ 110 KD IF Q+ L + IN+TK N + Sbjct: 79 SLG---KDDARNIFMSQMSGELTD----------VHIDLGINVTKTANKLNSQV------ 119 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 T ++ +S +T + ++LD + SM Sbjct: 120 SFSATVPTTFMRILGRDSITISGAATA---EYQTAAFMDFYILLDNTPSMGVGATANDVS 176 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + +S + +IDV+ ++ L ++ + Sbjct: 177 KLQAKTGCAFACHQMDQSTNNYTI------AKGLGVAMRIDVVRQATQALTDTAKTERVS 230 Query: 231 KKNLSVRIGTIAYNIGIVG-NQCTPLSNNLNEVKSRLNKLNPY------ENTNTYPAMHH 283 + + T + +++L +VK+ + ++ N + + Sbjct: 231 SDQFRMGVYTFGTKAEDAKLTTISSPTSDLTKVKNYTDTVDLMTIPYQNYNQDQLTSFDS 290 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ----------ICE 333 A ++ + + + + +K + F++DG T T C+ Sbjct: 291 ALTQMNTIIDPAGDGTSNISPEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCK 350 Query: 334 YMRNAGMKI---YSVAVSAPPEG-------------QDLLRKCTDSSGQFFAVNDSRELL 377 +++ G+KI Y+ + P ++ C G +F V + + Sbjct: 351 PLKDRGVKIAVLYTTYLPLPSNDWYNKWISPFQSEIPTKMQACAS-PGFYFEVTPTEGIT 409 Query: 378 ESFDKIT-DKIQEQSVR 393 ++ + I+ + Sbjct: 410 DAMKALFLKVIRSPRIT 426 >gi|85859126|ref|YP_461328.1| von Willebrand factor type A domain-containing protein [Syntrophus aciditrophicus SB] gi|85722217|gb|ABC77160.1| von Willebrand factor type A domain protein [Syntrophus aciditrophicus SB] Length = 447 Score = 128 bits (320), Expect = 2e-27, Method: Composition-based stats. Identities = 58/456 (12%), Positives = 138/456 (30%), Gaps = 86/456 (18%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+++ V F A+D+ R+++ ++DA ++G +I + +D + Sbjct: 15 IFALLLIVLLGFTALAVDVGRWYTTRSELSKSVDAGAIAGAKNISNPYLGEDGHLR---- 70 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + ++ +++ G + ++G + A D+++ ++ + L Sbjct: 71 --LAEEVARENFSAGYLMTPDSG--ERSATFTAYADEDHRIRVEGTVSS-------PGNL 119 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM------ 174 GL S + + I +VLD S SM+ + Sbjct: 120 AGLFGVDWVATSAMGVA------KKNEVEIMLVLDRSGSMDGTPMNDLKKAARSFVSFFE 173 Query: 175 -----------TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS 223 + + P + + + T A A D L ++ Sbjct: 174 ETQDQDKMGLVSFATSVKVDVPLGNNYVSSMTSKINAMDAVGATNAEDSLSQAGNPAKGG 233 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL----------------- 266 + N V+ I ++ G N + + Sbjct: 234 LTDQSGVPGNKRVQQFVIFFSDGNPTAFRGKFKYNGTDNIDAVVCGTGNDCGTVYTKLGK 293 Query: 267 -----------------NKLNPYENTN------TYPAMHHAYRELYNEKESSHNTIGSTR 303 P T+ ++ + ++ + G+T Sbjct: 294 PEREEWLSYNPRFTGDGKPKPPGTGTSKCTTRYGGSYVNTTKWYVLDDPDYRLTYRGTTY 353 Query: 304 LKKFVIFITDGENSG----ASAYQNTLNTLQI--CEYMRNAGMKIYSVAVSAPPEGQDLL 357 + T G ++ ++ T + + + +++ +KIY++ + +D L Sbjct: 354 NSESCFIPTVGSSNTTAPLSTYICTTARGMAVEHAQELKDNNVKIYTIGLG--NIDRDFL 411 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + F S EL F+KI I+ + V+ Sbjct: 412 SQIASGPSFEFYAPTSGELQAIFNKIAKDIKLRLVQ 447 >gi|254292617|ref|YP_003058640.1| hypothetical protein Hbal_0241 [Hirschia baltica ATCC 49814] gi|254041148|gb|ACT57943.1| hypothetical protein Hbal_0241 [Hirschia baltica ATCC 49814] Length = 514 Score = 128 bits (320), Expect = 2e-27, Method: Composition-based stats. Identities = 76/509 (14%), Positives = 153/509 (30%), Gaps = 137/509 (26%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M A+ ++V I + ID + + +Q+A D+AVL+ + ++ T +++ + Sbjct: 22 MFALFLTVILFIIGFTIDFRRMDSAKMHLQAATDSAVLAAARAYLTSSVQVKETKRQEDS 81 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I + +L S EN QI + ++ + A +K + L Sbjct: 82 QKIASDYLTANLLSSSNNFENN-------QIQLVFKEDGEIVGNASTKIK-------LIF 127 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 GL + L + + S + + I +VLD S SM K ++ + Sbjct: 128 GGLFGKSDVVLPALAAATVGDSRK---LEIVLVLDTSGSMSSQNRMKQLRTASINFVNSV 184 Query: 181 LP-------------PPPKKSF--------WSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 P W + + + ++ + N Sbjct: 185 FDNAVYERTVQVGVVPWNATVNINMDRPGTWDASPGPAIHNSNYGNGTNQVTSFQDFTEN 244 Query: 220 L-------------------------------------------VNSIQKAIQEKKNLSV 236 L V + K Sbjct: 245 LYPPGFSDFGSYSDSDIDDDFGSSGWLGCITATKDERKISSSGNVTPLTDVPPSKMKWPA 304 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN------ 290 R + +S + ++ +LN+LNP NT+ + YR Sbjct: 305 RKVAGWDPNSDCPSPMLAMSQSRPQIIKKLNQLNPSGNTHADIGLMWGYRMFSQQANWNN 364 Query: 291 ----EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN------------------- 327 ++ ++ ST+ +K +I +TDGEN+ ++ + Sbjct: 365 FFGYNSDTKPDSFHSTKSRKIMIMLTDGENTATNSEGYSYYGWCTYTNHYNKWGRYTGST 424 Query: 328 ----------------------TLQICEYMRNAGMKIYSVAVS----APPEGQDLLRKCT 361 L CE +R+ ++++++A+ LLR+C Sbjct: 425 KDCEVPKGINKDEISNNDLNSLMLDACEVIRSKDVELFTIALDLHSYYDSTAIALLRECA 484 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQ 390 S + + EL E+F ++ K Sbjct: 485 GSDSHAYNIK-GNELDETFQELASKALRL 512 >gi|90424817|ref|YP_533187.1| hypothetical protein RPC_3326 [Rhodopseudomonas palustris BisB18] gi|90106831|gb|ABD88868.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 479 Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats. Identities = 60/451 (13%), Positives = 136/451 (30%), Gaps = 60/451 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCA------------------ 42 + I + F+ A+D + R+ MQ A D+A L Sbjct: 28 LFGIAVIPLISFVGVAVDYSRATAARSAMQGAADSATLMVSKDYAAGVIRASDIQATAEK 87 Query: 43 ---SIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 ++ + I + T T+ + + + +A + T + Sbjct: 88 YFKALYTSPGINNVTVTATYTARSANGSSTVVMNTSGSMPTSFLKVAGFTALPFTASSTS 147 Query: 100 PLQYI-AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENL---AISICMVLD 155 ++ + L + L +T S++ +I V+ Sbjct: 148 TWGATRLRVAMALDVTGSMDWDDKLTAMKTAAIKLVNTLKATASTDADVYISIIPFNVMV 207 Query: 156 VSRSMEDLYLQKHNDNN------NMTSNKYLLPPPPKKSFWSKNTT--KSKYAPAPAPAN 207 + D + N T+ S+W+ + T ++ + A Sbjct: 208 NVGTANKDAEWLDWDTDYGSCKSNRTTQNSCQAAGETWSWWANSCTSRYTRKSTCVAGGE 267 Query: 208 RKIDVLIESAGNLVNSIQKAIQE---KKNLSVRIGTIAYNIGIVGNQCTPL--------- 255 I + + V + K + + L Sbjct: 268 TWIPSGVSNWKGCVTDRTTSNDYDVIKTPPTTATPATLFLAKSYSACPLSLLPMKAAYSS 327 Query: 256 -----SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR-LKKFVI 309 S + +K ++NKL+ NTN + A+ L + + +I Sbjct: 328 NESDTSTAESTLKGKINKLDAEGNTNQPIGLFWAWMSLQTGVPLNTPAKDTEYKYTDAII 387 Query: 310 FITDGEN----SGASAYQNTLNTLQICEYMRN--AG-MKIYSVAVSAPPEGQ-DLLRKCT 361 ++DG+N + S ++C+ +++ G I+++ V+ + + +L+ C Sbjct: 388 LLSDGDNTQSGNSNSVSAIDARQKKLCDNIKDPLNGTTTIFTIQVNTDGDDESAVLKYCA 447 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 GQFF + ++ +F I + + + Sbjct: 448 S-DGQFFQSTTADQIEIAFQSIGSSLTKLRL 477 >gi|163738634|ref|ZP_02146048.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] gi|161387962|gb|EDQ12317.1| hypothetical protein RGBS107_11437 [Phaeobacter gallaeciensis BS107] Length = 558 Score = 127 bits (319), Expect = 3e-27, Method: Composition-based stats. Identities = 41/362 (11%), Positives = 77/362 (21%), Gaps = 74/362 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M ++S +DL + R +Q LD AVL+ + P Sbjct: 40 MVGFLLS-MLAVGGIGVDLMRMERDRTILQYTLDRAVLAAA-------DLDQPLPPAAVV 91 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 K G ++ I+ T F Sbjct: 92 QDYLSKAGLNKYYTPPVAETGLGFKKVQSTIDTT------------------------FE 127 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ + + + I +VLDVS SM + + Sbjct: 128 THMLKFSSGQ-DMPLYATSRAEESIDGLEISLVLDVSGSMGSNSRLANLKVAAKDFVDTM 186 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKI-----DVLIESAGNLVNSIQKAIQEKKNLS 235 + + P + + + Q Sbjct: 187 IANTIDNKMSISIIPYATQVSLPTELMDQYNTTDEHAYSNCVNFVGSHFQTTALSTTQEL 246 Query: 236 VRIGTIAYNI-----------------GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 R + + P + N +K ++ L NT+ Sbjct: 247 DRTMHFSVWSGSDYRASANPLDSPTCEDSANREILPFQKDANTLKGFIDGLQAEGNTSID 306 Query: 279 PAMHHAYRELYNE-------------------KESSHNTIGSTRLKKFVIFITDGENSGA 319 M L + K ++ +TDG+N+ Sbjct: 307 VGMKWGTALLDPSARPAISALASGGGAMVPATFNNRPAAFNDHETVKVIVLMTDGKNTNQ 366 Query: 320 SA 321 Sbjct: 367 YY 368 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 21/115 (18%), Positives = 42/115 (36%), Gaps = 10/115 (8%) Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 ++ + Y+ LY + + G + T +C +N Sbjct: 453 TSLKYLYKYLYGDWMGQSSARSVWYY---------GVYDYWNTSTKDARTRAVCNAAKNQ 503 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G+ +Y++ AP G +L+ C S F V E+ ++F I I++ + Sbjct: 504 GIVVYTIGFEAPSSGTAVLKDCASSDAHHFDVRGL-EIRDAFASIATSIRQLRLT 557 >gi|110634434|ref|YP_674642.1| hypothetical protein Meso_2084 [Mesorhizobium sp. BNC1] gi|110285418|gb|ABG63477.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 549 Score = 126 bits (317), Expect = 5e-27, Method: Composition-based stats. Identities = 60/523 (11%), Positives = 132/523 (25%), Gaps = 131/523 (25%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQS---------------ALDAAVLS-GCASI 44 + A+ F A+D ++ R+++Q+ DA S + + Sbjct: 25 ILALSALPVFGAAGLAVDYTNMSRTRSELQNALDAAVLAVAQRGDKISDAEARSIAASFL 84 Query: 45 VSDRTIKDPTTKKDQTSTIFKKQIKKHLK---------------QGSYIRENAGDIAQKA 89 + + ++ T K + + S Sbjct: 85 TGNLSSAYKNMAVERNGTSVKLSAEATMPLSFGGLIGRKEATVGASSTADMAFAYYEIAL 144 Query: 90 QINITKDKNNPL---------------QYIAESKAQYEIPTENLFLKGLIP-------SA 127 ++ T K + + + Sbjct: 145 VLDTTGSMRGGKLQAMKEAVNGLIDDLSSRVTDKERLKFALVPFASFVNVGPQFGPEFDR 204 Query: 128 LTNLSLRSTGIIERSSENLAISICMVLD------VSRSMEDLYLQKHNDNNNMTSNKYLL 181 + + IS +L V+ + + +S+ Y + Sbjct: 205 NGRIVPGTGADWLDLQGISPISQLDLLPGLSRFEVAHHLGQDWKGCVETRMPTSSSAYDV 264 Query: 182 P---------------------PPPKKSFWSKNTTKSKYAPAPAPAN-----RKIDVLIE 215 P + + + + A + + + Sbjct: 265 DDAPVVATDRYSLFVPTFAIDEPDGGRLYANNYIASNTSAFGNSAVAIARRLLRYGLDDA 324 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG------IVGNQCTPLSNNLNEVKSRLNKL 269 + L + + + R Y+ G + TPLSN+ +K +++ Sbjct: 325 AQAALTGATNLIGLDIRPERWRKVEHEYSDGRGPAYGCLSRPITPLSNDYAALKREVSRF 384 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT---- 325 NTN + R L + + ++ ++K +I +TDG N+ + Sbjct: 385 TADGNTNIMEGVAWGMRVLSPREPFTEGKEPASDVEKIMIVLTDGANNMGLSNNRNHALG 444 Query: 326 ----------------------------LNTLQICEYMR-------NAGMKIYSVAVSAP 350 TL CE + + IY++ + P Sbjct: 445 SSYSSFGYLVEDRLTRERSQRRVTEEMNRRTLAACENAKREYTPSKEDDVTIYTIRLEEP 504 Query: 351 PEGQ-DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 LL++C G +F +L F +I D I + + Sbjct: 505 DVATGTLLQECATGPGYYFDSPSRTQLNAIFKEIRDGITKLRL 547 >gi|163742980|ref|ZP_02150363.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] gi|161383663|gb|EDQ08049.1| hypothetical protein RG210_01902 [Phaeobacter gallaeciensis 2.10] Length = 560 Score = 126 bits (317), Expect = 5e-27, Method: Composition-based stats. Identities = 42/362 (11%), Positives = 79/362 (21%), Gaps = 74/362 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M ++S +DL + R +Q LD AVL+ + P Sbjct: 42 MVGFLLS-MLAVGGIGVDLMRMERDRTILQYTLDRAVLAAA-------DLDQPLPPAAVV 93 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 K G ++ I+ T F Sbjct: 94 QDYLSKAGLNKYYTPPVAETGLGFKKVQSTIDTT------------------------FE 129 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ + + + I +VLDVS SM + + Sbjct: 130 THMLKFSSGQ-DMPLYATSRAEESIDGLEISLVLDVSGSMGSNSRLANLKVAAKDFVDTM 188 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKI-----DVLIESAGNLVNSIQKAIQEKKNLS 235 + + P + + + Q Sbjct: 189 IANTIDNKMSISIIPYATQVSLPTELMDQYNTTDEHAYSNCVNFVGSHFQTTALSTTEEL 248 Query: 236 VRIGTIAYNIGIVG-----------------NQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 R + G + P + N +K ++ L+ NT+ Sbjct: 249 DRTMHFSVWSGSDYRASANPLGSPTCEDRADREILPFQKDANTLKGFIDGLSAKGNTSID 308 Query: 279 PAMHHAYRELYNE-------------------KESSHNTIGSTRLKKFVIFITDGENSGA 319 M L + K ++ +TDG+N+ Sbjct: 309 VGMKWGTALLDPSARPAISALASGGGAMVPATFNNRPAAFNDHETVKVIVLMTDGKNTNQ 368 Query: 320 SA 321 Sbjct: 369 YY 370 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 21/115 (18%), Positives = 42/115 (36%), Gaps = 10/115 (8%) Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 ++ + Y+ LY + + G + T +C +N Sbjct: 455 TSLKYLYKYLYGDWMGQSSARSVWYY---------GVYDYWNTSTKDARTRAVCNAAKNQ 505 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G+ +Y++ AP G +L+ C S F V E+ ++F I I++ + Sbjct: 506 GIVVYTIGFEAPSSGTAVLKDCASSDAHHFDVRGL-EIRDAFASIATSIRQLRLT 559 >gi|190892054|ref|YP_001978596.1| hypothetical protein RHECIAT_CH0002466 [Rhizobium etli CIAT 652] gi|190697333|gb|ACE91418.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 427 Score = 126 bits (315), Expect = 8e-27, Method: Composition-based stats. Identities = 62/437 (14%), Positives = 136/437 (31%), Gaps = 73/437 (16%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQM----------QSALDAAVLSGCASIVSDRTI 50 MTA+++ A+D AH + +R Q+ A + ++ ++ + TI Sbjct: 19 MTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVAAAMAMNGNGTI 78 Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ 110 K IF Q+ L + + I++TK N + Sbjct: 79 SLG---KTDARNIFMSQVSGELAE----------VHVDLGIDVTKTANKLNSQV------ 119 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 T + +S +T + ++LD + SM Sbjct: 120 SFTATVPTTFMQIFGRDSITISGTATA---EYQTAAFMDFYILLDNTPSMGVGATPSDVS 176 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 KS + KS +IDV+ ++ L ++ + Sbjct: 177 KLEAKVGCAFACHQMDKSTNNYTIAKS------LGVAMRIDVVRQATQALTDTAKTERVS 230 Query: 231 KKNLSVRIGTIAYNIGIVG-NQCTPLSNNLNEVKSRLNKLNPY------ENTNTYPAMHH 283 + + T + L+++L +VKS + ++ N++ Sbjct: 231 SDQFRMGVYTFGTKAEDAKLTTISGLTSDLTKVKSYTDAVDLMTIPYQNYNSDQITNFDS 290 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ----------ICE 333 A ++ + + + +T +K + F++DG T T C+ Sbjct: 291 AMTQMNTIIDPAGDGTSNTSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCK 350 Query: 334 YMRNAGMKI---YSVAVSAPPEG-------------QDLLRKCTDSSGQFFAVNDSRELL 377 +++ G+KI Y+ + P ++ C G +F V+ + + Sbjct: 351 PLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQSEIPTKMQACAS-PGFYFEVSPTDGIT 409 Query: 378 ESFDKIT-DKIQEQSVR 393 ++ + I+ + Sbjct: 410 DAMKALFLKVIRAPRIT 426 >gi|218528586|ref|YP_002419402.1| hypothetical protein Mchl_0543 [Methylobacterium chloromethanicum CM4] gi|218520889|gb|ACK81474.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 518 Score = 126 bits (315), Expect = 8e-27, Method: Composition-based stats. Identities = 70/491 (14%), Positives = 134/491 (27%), Gaps = 99/491 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + ID A +R Q Q A+DA L+ + + Q Sbjct: 26 IFALALVPMAFLAGMTIDYAQNTNLRQQAQVAVDATALALAKLPLDTTDKDLAAKAEAQV 85 Query: 61 STIFK----KQIKKHLKQGSYIRENAGDIAQKAQIN----ITKDKNNPLQYIAESKAQYE 112 T K + ++ + E A A + + S E Sbjct: 86 LTALKGLPIDALTVTMRHNGDLIEVAAKGATPTSLTRLAGFMSMPLSVSAISNRSMTNLE 145 Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLA-----------ISICMVLDVSRSME 161 I + + LTNL + ++ + + M ++V Sbjct: 146 IALVLDNTGSMKGTKLTNLKAAARDLVTSLFQQADPAKPNALKIGVVPFSMTVNVGSGFA 205 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA------------------- 202 N + + + P F +A Sbjct: 206 GSDWLDINAKSPIHQQIFNAQGVPANRFSLFADMGKPWAGCVESRPAPYDVQDTAPSQAT 265 Query: 203 ------PAPANRKIDVLIESAGNLVNSIQ----------------KAIQEKKNLSVRI-- 238 P A + D + + + + A +K V Sbjct: 266 PSTLFVPFFAPDESDNDSRAVNDYMADLPSGGSAGGASNRQLQGMTAKYDKNAFKVSTTA 325 Query: 239 ------GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 N G T L+ + ++ + + + +TN + + L Sbjct: 326 RQDGTNYLFGPNAGCEIQPLTRLTTSQTQLTNAIAAMTVIGDTNIPIGLAWGWHLLSPNG 385 Query: 293 E-SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ--------------------- 330 G + KKF++ +TDG+N A + + + Sbjct: 386 PFKDGVAYGEIKTKKFIVLMTDGQNQSAVSSSDNRSYYSGLGFIWQNRIGTTSNDNAVRT 445 Query: 331 ---------ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 +C+ +R A +++++V V +L+ C S FF V +S L F Sbjct: 446 KAIDTRLTLLCDNIRKARIQVFAVRVEVNDGDSAVLKACATSPNMFFDVKNSSGLPAVFR 505 Query: 382 KITDKIQEQSV 392 I D+I E + Sbjct: 506 AIADQISELRI 516 >gi|146337718|ref|YP_001202766.1| hypothetical protein BRADO0587 [Bradyrhizobium sp. ORS278] gi|146190524|emb|CAL74523.1| conserved hypothetical protein; putative vWFA domain [Bradyrhizobium sp. ORS278] Length = 442 Score = 125 bits (313), Expect = 1e-26, Method: Composition-based stats. Identities = 57/430 (13%), Positives = 121/430 (28%), Gaps = 60/430 (13%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI+ + +D + +R ++Q+A+DAA + G S S I D Sbjct: 23 FAIVCVPVITAVGCGVDYSRTNQMRAKLQAAVDAASV-GAVSRTSPAFIAAGAMTTDGVI 81 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 +K + ++ + + + T Sbjct: 82 AAGNDDARKIFNGNMSGTTGYTLDSLTPEV-------KKTGSVLTATVSFS-ATVPTLFM 133 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 ++ +L ST I ++LD S SM ++ Sbjct: 134 SIVGYKTMSLQGSSTAKAS---MPKYIDFYLLLDNSPSMGVAATPADVTKMVSATSDKCA 190 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 + + +K +IDVL + L+++ Q+ + I Sbjct: 191 FACHDYNDANNYYNLAK----TLGVTTRIDVLRSATQQLMDTAQQTQTYSNQFRMAIYDF 246 Query: 242 AYNIGI-VGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESS- 295 + LS++L KS ++ N + Y + Sbjct: 247 GASSKTIGLRALFALSSSLTSAKSAAGNIDLMGVYGNNDSFTADKDTPYTTALPAINNEI 306 Query: 296 --HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL----------QICEYMRNAGMKIY 343 S K++ F++DG ++A +C ++N G+KI Sbjct: 307 ATPGDGTSGSPLKYLFFVSDGVADESNAACLKPKASGNRCQSPINPALCTALKNRGIKIA 366 Query: 344 SVA-----------------------VSAPPEGQ--DLLRKCTDSSGQFFAVNDSRELLE 378 + P + ++ C G +F V+ ++ + + Sbjct: 367 VLYTTYLQLPTNSWYMSWIDPFNKGPFGPSPNSEIAQNMQACAS-DGFYFEVSPTQGIAD 425 Query: 379 SFDKITDKIQ 388 + + + K Sbjct: 426 AMNALFKKAV 435 >gi|327190622|gb|EGE57710.1| hypothetical protein RHECNPAF_409007 [Rhizobium etli CNPAF512] Length = 427 Score = 125 bits (313), Expect = 1e-26, Method: Composition-based stats. Identities = 61/437 (13%), Positives = 136/437 (31%), Gaps = 73/437 (16%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQM----------QSALDAAVLSGCASIVSDRTI 50 MTA+++ A+D AH + +R Q+ A + ++ ++ + TI Sbjct: 19 MTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVAAAMAMNGNGTI 78 Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ 110 K IF Q+ L + + I++TK N + Sbjct: 79 SLG---KTDARDIFMSQVSGELAE----------VHVDLGIDVTKTANKLNSQV------ 119 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 T + +S +T + ++LD + SM Sbjct: 120 SFTATVPTTFMRIFGRDSITISGTATA---EYQTAAFMDFYILLDNTPSMGVGATPSDVS 176 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 KS + KS +IDV+ ++ L ++ + Sbjct: 177 KLEAKVGCAFACHQMDKSTNNYTIAKS------LGVAMRIDVVRQATQALTDTAKTERVS 230 Query: 231 KKNLSVRIGTIAYNIGIVG-NQCTPLSNNLNEVKSRLNKLNPY------ENTNTYPAMHH 283 + + T + L+++L +VK+ + ++ N++ Sbjct: 231 SDQFRMGVYTFGTKAEDAKLTTISGLTSDLTKVKNYTDAVDLMTIPYQNYNSDQITNFDS 290 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ----------ICE 333 A ++ + + + +T +K + F++DG T T C+ Sbjct: 291 AMTQMNTIIDLAGDGTSNTSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCK 350 Query: 334 YMRNAGMKI---YSVAVSAPPEG-------------QDLLRKCTDSSGQFFAVNDSRELL 377 +++ G+KI Y+ + P ++ C G +F V+ + + Sbjct: 351 PLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQSEIPTKMQACAS-PGFYFEVSPTDGIT 409 Query: 378 ESFDKIT-DKIQEQSVR 393 ++ + I+ + Sbjct: 410 DAMKALFLKVIRAPRIT 426 >gi|32477945|ref|NP_870939.1| hypothetical protein RB13237 [Rhodopirellula baltica SH 1] gi|32448502|emb|CAD78017.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 388 Score = 125 bits (312), Expect = 2e-26, Method: Composition-based stats. Identities = 53/396 (13%), Positives = 129/396 (32%), Gaps = 49/396 (12%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M I++ V Y I++ ++ R ++Q + D A + + + ++ Sbjct: 39 MLVILLPVMLAVAAYCINVVYMEMARTELQISTDLATRAAGRVLAVTGDKAEAIEAAER- 97 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + +L + I + + + + S + +++ + Sbjct: 98 ----LLEANPYLDRTLSIGDADIIFGKSNRTEENRRYEFTPDKKVNSVSLRAFGADDVPM 153 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + + I + + + + I +VLD S SM Sbjct: 154 L----FPTMGVPIEFRPIKQAVATQVELDIAIVLDRSGSMAFS----------------- 192 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 K S K A R +D + + ++ ++ + + R+ Sbjct: 193 HDEVAKNGSPSSAPPGWKMGHAVPENARWLDTV-AAVNGFLDIMEDSSHD-----ERVSL 246 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKESSHN 297 Y+ L+ + E+++ +N + TN + L ++K + Sbjct: 247 STYSDK--SKADVKLTGDYTEIRAAMNAHSTKFKGGATNIGSGILEGGATLGDKKLAR-- 302 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 + + +I ++DG ++ + + + + N + I++V S Q++ Sbjct: 303 ----SWASRVLIVMSDGIHNTG------IEPIPAAQQVANEKIMIFTVTFSDEANVQEME 352 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + GQ F DS++L E+F KI + Sbjct: 353 KVAVSGGGQHFHAKDSQQLTEAFRKIAKSLPTLITF 388 >gi|37680183|ref|NP_934792.1| hypothetical protein VV1999 [Vibrio vulnificus YJ016] gi|37198930|dbj|BAC94763.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 481 Score = 124 bits (311), Expect = 2e-26, Method: Composition-based stats. Identities = 51/456 (11%), Positives = 129/456 (28%), Gaps = 69/456 (15%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +++ +F + +D+ I + NQM +A DAA L+ + + TS Sbjct: 28 MLVLLMSMLVFAAWVMDVMRIYSVHNQMANATDAA-LASAIISEVPESTAVELLHANLTS 86 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT-------KDKNNPLQYIAESKAQYEIP 114 +++ + + + S + EI Sbjct: 87 GAASPYVEEVRLTHLRDEQEESLQVVLDFVPNSLNIAAQESVPIRTNAKAGISSNKAEIV 146 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 + + ++ + V+ + + ++ + Sbjct: 147 FMLDVSNSMSGEPMNKTKEALLAFADKLYARGNRNQNYVVSIVPASGNVNTGPMEEIYLG 206 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE------------------- 215 + +Y +++ WS ++ P ++ + + Sbjct: 207 SFRRYDHAQVKRENRWSDMFDRASGRTPAVPGRQRNAMCRDLDFEGNNPATLGLRYFRNL 266 Query: 216 ---------------------SAGNL-----VNSIQKAIQEKKNLSVRIG---TIAYNIG 246 + + ++ N I +I Sbjct: 267 EKAPQFASNNSKRIIRPIHKPAVLHFDDGTPLDPPVYPSTNPSNNYRPFHEDKAIFDDIE 326 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT-------- 298 N P +S + +L P NTN M A R L + + Sbjct: 327 CHVNPIVPFITERRHFESTVQRLVPGMNTNNAEGMVWAMRLLSPYWQGIWDKTRPELPRR 386 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDL 356 K+++ +DG + A+++ IC ++ G+K+ +V + L Sbjct: 387 YSDETSNKYLVMFSDGNHLIDPAFRDKK-MKLICTQLKQPGRGVKVMTVNFG-GAASERL 444 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 ++ C +++ V + + F++I +++ S+ Sbjct: 445 MQSCASGP-EYYHVASLFSVEKVFEQIAEQVISSSL 479 >gi|323493494|ref|ZP_08098616.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] gi|323312317|gb|EGA65459.1| hypothetical protein VIBR0546_14275 [Vibrio brasiliensis LMG 20546] Length = 393 Score = 124 bits (311), Expect = 3e-26, Method: Composition-based stats. Identities = 53/393 (13%), Positives = 138/393 (35%), Gaps = 44/393 (11%) Query: 5 IISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIF 64 ++ + I + + + + N+ A+D A L+ + + +I Sbjct: 21 MLIPMIIAAASTIVIGYQVLLSNRAMQAVDTASLAC-------------EFRGEYDRSIA 67 Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + + + + + G + + + Y + L Sbjct: 68 QGYLDYYKPKIDKVTATLGASSGC-----------------KVELGYSYSSIFTSLTFSD 110 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH---NDNNNMTSNKYLL 181 S + ++ + +++ I + +VLD+S SM + N ++ Sbjct: 111 ASYVAGVTASQKVYVTEVTDSDPIELVLVLDISGSMMGALDELKSILNRGLTTLRSQQAN 170 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPA-----NRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 S + + AP +D + S G+ + A + + Sbjct: 171 VAGQDHIKVSIVPFSNGVSVTDAPWLKSGGTLCVDATVNSGGSFSPANTVANLDVTHDQA 230 Query: 237 RIGTIAYNIGIVGNQ-CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 + T + + PL++NLN+V +N+L +T +Y + R+L +S+ Sbjct: 231 PVTTSSSSSDCSLTSVILPLTSNLNDVVDAVNRLQTIGSTASYQGLLWGLRQLTPNWQSA 290 Query: 296 HN---TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 +++ ++ +TDG +++ + L +C ++ G+++ + Sbjct: 291 WRVGPNRNQDNVQRKLVLMTDGM--DDNSHLDELINAGLCTRAKDLGIELNFIGFGVQSW 348 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + +C S+G F+ N++++L + F ++ Sbjct: 349 RLEQFTRCAGSAGAVFSANNTQDLDDYFSQLLS 381 >gi|87311197|ref|ZP_01093320.1| hypothetical protein DSM3645_16250 [Blastopirellula marina DSM 3645] gi|87286105|gb|EAQ78016.1| hypothetical protein DSM3645_16250 [Blastopirellula marina DSM 3645] Length = 373 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 52/400 (13%), Positives = 117/400 (29%), Gaps = 60/400 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+++ V + +D+A++ R +++ A D+A +G ++ ++ Sbjct: 25 LIAVLLPVILWMAAFCVDVAYMQLTRTELRIATDSAARAGARTLSLEQDASLAHKSA--I 82 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKA-----QINITKDKNNPLQYIAESKAQYEIPT 115 K + + + G + + N + Q Sbjct: 83 EYAAKNNVAGNTLTLADSDVQIGLSVRTDDVGRFTFSSGGKLLNSVNVTGRRTQQAPDGA 142 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 L+L + +T + I +V+D S SM Sbjct: 143 VRLYLTPIFGHEFFQPVADATA------SQIDRDIALVVDRSGSMTFRIN---------- 186 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + P P+ + L++S + + Sbjct: 187 ---------------RNSYESGWRNNDPVPSRARWWALVDSVDGFLTELGSTP-----QL 226 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEK 292 + YN ++ L++ + ++ L+ + P +TN M L N+K Sbjct: 227 ELVSLSTYNSSAKIDEQ--LTDKYSRIEDALDDYSRRYPDGSTNITAGMDRGISTLQNKK 284 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + K ++ +TDG ++ S+ N + + ++++ S Sbjct: 285 YARP------YASKTMVVMTDGNHNYGSSPTN------AAYDAASDDIVVHTITYSDGAN 332 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + GQ + D EL E F +I Sbjct: 333 QSLMREVARIGGGQHWHAPDGDELEEIFREIARNAPTLLT 372 >gi|126738776|ref|ZP_01754472.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] gi|126719957|gb|EBA16664.1| hypothetical protein RSK20926_02629 [Roseobacter sp. SK209-2-6] Length = 530 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 43/366 (11%), Positives = 77/366 (21%), Gaps = 71/366 (19%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 T +DL + R +Q LD AVL+ + Sbjct: 38 TIAFFLAMLAVGGVGVDLMRLERDRTVLQYTLDRAVLAAA-------DLDQTQEPAVVVQ 90 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 K + + G KA I+ T D L Sbjct: 91 DYLNKAGLGEYYEAPEVETGLGYKKVKATIDATFDA---------------------HLL 129 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + + ST + I +VLDVS SM + ++ Sbjct: 130 QFAGGSDLPVYASSTAEESI----DGLEISLVLDVSGSMNSNSRLSNLKVAARDFIDTMV 185 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI-----ESAGNLVNSIQKAIQEKKNLSV 236 + + + + Sbjct: 186 ENTTDGRMSISIVPYATQVSVSDELFDEYTTSGTNNFANCINFETSDYSTTALSTTSERE 245 Query: 237 RIGTIAYN-----------------IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 R + + PL + +KS + L + NT+ Sbjct: 246 RTMHFSPWYTSNTRASGSPIDYEICDDRSSREILPLQKDATTLKSFITNLTAWGNTSIDI 305 Query: 280 AMHHAYRELYNEKESS-----------------HNTIGSTRLKKFVIFITDGENSGASAY 322 M L + K ++ +TDG+N+ Sbjct: 306 GMKWGVALLDPSARPAISSLASGASVPSEFSVRPVDYSDPDTLKIIVLMTDGQNTSQYYV 365 Query: 323 QNTLNT 328 ++ Sbjct: 366 EDDHRA 371 Score = 88.8 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 35/403 (8%), Positives = 99/403 (24%), Gaps = 49/403 (12%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHL 72 I+ +D++ M +++ + ++ I + + Sbjct: 154 ISLVLDVSGSMNSNSRLSNL----KVAARDFIDTMVENTTDGRMSISIVPYATQVSVSDE 209 Query: 73 KQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA----- 127 Y + A + L +E + S Sbjct: 210 LFDEYTTSGTNNFANCINFETSDYSTTALSTTSERERTMHFSPWYTSNTRASGSPIDYEI 269 Query: 128 -----------------LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 + + +S ++ + + L + + Sbjct: 270 CDDRSSREILPLQKDATTLKSFITNLTAWGNTSIDIGMKWGVALLDPSARPAISSLASGA 329 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + + + + + + ++ + + Sbjct: 330 SVPSEFSVRPVDYSDPDTLKIIVLMTDGQNTSQYYVEDDHRAGDSNVWYDFSANRYSTYN 389 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 N +Y + ++ + Y ++ + Y+ +Y Sbjct: 390 PDNGYYWRDGYSYWYSSPYGGNNAQQLSYPDLFA-------------YTSLKYLYKYIYA 436 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + S++ G T +C + G+ +Y++ AP Sbjct: 437 DWMGSYSARSEWYY---------GVYDYHGNSTKNTRTSNVCSAAKAQGIIVYTIGFEAP 487 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G +L+ C S +F V D E+ ++F+ I I++ + Sbjct: 488 SNGVAVLQDCASSDSHYFDV-DGLEIRDAFESIATSIRKLRLT 529 >gi|327538644|gb|EGF25299.1| protein containing von Willebrand factor, type A domains [Rhodopirellula baltica WH47] Length = 388 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 52/396 (13%), Positives = 127/396 (32%), Gaps = 49/396 (12%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M I++ V Y I++ ++ R ++Q + D A + + + ++ Sbjct: 39 MLVILLPVMLAVAAYCINVVYMEMARTELQISTDLATRAAGRVLAVTGDKAEAIEAAER- 97 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + +L + I + + + + S +++ + Sbjct: 98 ----LLEANPYLDRTLSIGDADIIFGKSNRTEENRRYEFTPDKKVNSVGLRAFGADDVPM 153 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + + I + + + + I +VLD S SM Sbjct: 154 L----FPTMGVPIEFRPIKQAVATQVELDIAIVLDRSGSMAFS----------------- 192 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 K S K A R +D + + ++ ++ + + R+ Sbjct: 193 HDEVAKNGSPSSAPPGWKMGHAVPKNARWLDTV-AAVNGFLDIMEDSSHD-----ERVSL 246 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKESSHN 297 Y+ L+ + E+++ +N + TN + L ++ + Sbjct: 247 STYSDK--SKADVKLTGDYTEIRAAMNAHSTNFKGGATNIGSGILEGGATLGDKNLAR-- 302 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 + + +I ++DG ++ + + + + N + I++V S Q++ Sbjct: 303 ----SWASRVLIVMSDGIHNTG------IEPIPAAQQVANEKIMIFTVTFSNEANVQEME 352 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + GQ F DS++L E+F KI + Sbjct: 353 KVAVSGGGQHFHAKDSQQLAEAFRKIAKSLPTLITF 388 >gi|262275460|ref|ZP_06053270.1| protein TadG associated with Flp pilus assembly [Grimontia hollisae CIP 101886] gi|262220705|gb|EEY72020.1| protein TadG associated with Flp pilus assembly [Grimontia hollisae CIP 101886] Length = 453 Score = 124 bits (310), Expect = 3e-26, Method: Composition-based stats. Identities = 55/445 (12%), Positives = 118/445 (26%), Gaps = 70/445 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + I + F A++ + ++ ++ A L+ A+I SD T ++ T K+ Sbjct: 17 IFVIAYPLLFGVFVLAVESTRYLQTHARIGDGVEVASLAVAANISSDIT-ENKTLAKNYV 75 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + D+ + + E + Sbjct: 76 DGFVPDGTISLADINIERKSCDEIYGSQCGVAGVYDEEGLVFTQYKVTLSSEFESWYPED 135 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL--------------- 165 + + R + I + V D S SM+ + Sbjct: 136 DFAPGFE--EIVELGGTAVARKYQGFTIDVAFVADFSGSMQQTWNREIKYKGVVNVISDI 193 Query: 166 ---------------QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 N + Y P +F+S K+ Y+ I Sbjct: 194 TRKLETFNDHTEQELNGKKVANKVAFIGYNFYPHNGSTFYSNVDYKANYSRLSYKWQENI 253 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 + + TI L++N + ++ ++ Sbjct: 254 PEIN--YRRTARDPINNKRTPIIGRYVNNTIPLYSDDSYFYTLDLTDNFTQFRNTISTFY 311 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS-GASAYQNTLNTL 329 P T +Y + A + + ++K +I ++DGE+S + + Sbjct: 312 PDYGTASYEGIIEAAKIVN----------NGENIRKLIIVLSDGEDSINENNPYDNRYPG 361 Query: 330 QIC------------------EYMRNAGM--KIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 I + + + KI+ + E L+ C Sbjct: 362 FIAPLIYQSGLCQNIINDLESKEINGRNVEAKIFVIGFGYDLEKNPGLKICAGEEN---- 417 Query: 370 VNDSRELLESFDKITDKIQEQSVRI 394 V + E FD + I E+ + Sbjct: 418 VQSADSYQEIFDTVLQLISEEVGHL 442 >gi|209549601|ref|YP_002281518.1| hypothetical protein Rleg2_2008 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209535357|gb|ACI55292.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 429 Score = 123 bits (309), Expect = 4e-26, Method: Composition-based stats. Identities = 55/437 (12%), Positives = 138/437 (31%), Gaps = 71/437 (16%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQM----------QSALDAAVLSGCASIVSDRTI 50 MTA+++ A+D AH M +R Q+ A + ++ ++ + TI Sbjct: 19 MTALLMVPLLGTAGMAVDFAHAMSLRTQLFAAADAAAVGSIAEKSGAVAAAMTMTGNGTI 78 Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ 110 K +IF Q+ L + I++TK N + Sbjct: 79 SLG---KTDARSIFLSQVSGELADVN----------VDLGIDVTKTANKLNSQV------ 119 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 ++ +S +T + ++LD + SM K Sbjct: 120 SFTAVVPTTFMRVLGKDSITISGTATA---EYLTASFMDFYILLDNTPSMGVGATAKDVA 176 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 ++ + + +K + +IDV+ ++ L + + Sbjct: 177 TMEKNTSDSCAFACHETENKNNYYNLAK----TLGVSMRIDVVRQATKELTLTAKSTRVS 232 Query: 231 KKNLSVRIGTI-AYNIGIVGNQCTPLSNNLNEVKSRLNKLNP------YENTNTYPAMHH 283 + + T + +++L++V++ + ++ N + + + Sbjct: 233 TNQFRMGVYTFGTKAEDANLTTISDPTDDLDKVRTYTDAVDLMTIPKQGYNNDQQTSFDN 292 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ----------ICE 333 A ++ + + + +T +K + F++DG T C+ Sbjct: 293 ALTQMKDIITTPGDGSTATTPQKILFFVSDGVGDSEKPKGCTKKLTGNRCQEPIDTSFCK 352 Query: 334 YMRNAGMKI---YSVAVSAPPEG-------------QDLLRKCTDSSGQFFAVNDSRELL 377 +++ G++I Y+ + P +++C G +F V + + Sbjct: 353 PLKDKGIRIAVLYTTYLPLPKNSWYNTWISPFQSQIPTKMQECAS-PGLYFEVTPTEGIA 411 Query: 378 ESFDKIT-DKIQEQSVR 393 ++ + I+ + Sbjct: 412 DAMKALFLKAIRAPRIT 428 >gi|86357991|ref|YP_469883.1| hypothetical protein RHE_CH02376 [Rhizobium etli CFN 42] gi|86282093|gb|ABC91156.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 427 Score = 123 bits (308), Expect = 6e-26, Method: Composition-based stats. Identities = 60/437 (13%), Positives = 134/437 (30%), Gaps = 73/437 (16%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQM----------QSALDAAVLSGCASIVSDRTI 50 MTA+++ A+D+AH + +R Q+ A + ++ ++ + T+ Sbjct: 19 MTALLMVPLMGAAGMAVDVAHALSLRTQLYAAADAAAVGSIAEKSGAVAAAMTMNGNGTV 78 Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ 110 K IF Q L I I++TK N + Sbjct: 79 SLG---KTDARNIFMSQTSGELTD----------IHIDLGIDVTKTANKLNSQV------ 119 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 T + +S +T + ++LD + SM Sbjct: 120 SFTATVPTTFMRIFGRDSIIISGTATA---EYQTAAFMDFYILLDNTPSMGVGATASDVS 176 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + +S + KS +IDV+ ++ L ++ + Sbjct: 177 KLQAKTGCAFACHQMDQSTNNYTIAKS------LGVTMRIDVVRQATQALTDTAKAERVS 230 Query: 231 KKNLSVRIGTIAYNIGIVG-NQCTPLSNNLNEVKSRLNKLNPY------ENTNTYPAMHH 283 + + T + L+++L +VK+ N ++ N++ + Sbjct: 231 SDQFRMGVYTFGTKAEDAKLTTISGLTSDLTKVKNYTNAVDLMTIPYQNYNSDQLTSFDS 290 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT----------LQICE 333 A ++ + + + + +K + F+ DG T T C+ Sbjct: 291 AMTQINTIIDPAGDGTSNISPEKILFFVADGVGDSYKPSTCTKKTTGGRCQEPIDTTFCK 350 Query: 334 YMRNAGMKI---YSVAVSAPPEG-------------QDLLRKCTDSSGQFFAVNDSRELL 377 +++ G+KI Y+ + P ++ C G +F V + + Sbjct: 351 PLKDRGVKIAVLYTTYLPLPSNSWYNTWIKPFQNEIPTKMQACAS-PGLYFEVTPTDGIA 409 Query: 378 ESFDKIT-DKIQEQSVR 393 ++ + I+ + Sbjct: 410 DAMKALFLKVIRAPRIT 426 >gi|27365660|ref|NP_761188.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] gi|27361808|gb|AAO10715.1| hypothetical protein VV1_2340 [Vibrio vulnificus CMCP6] Length = 465 Score = 122 bits (306), Expect = 7e-26, Method: Composition-based stats. Identities = 51/456 (11%), Positives = 128/456 (28%), Gaps = 69/456 (15%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +++ +F + D+ I + NQM +A DAA L+ + + TS Sbjct: 12 MLVLLMSMLVFAAWVTDVMRIYSVHNQMANATDAA-LASAIISEVPESTAVELLHANLTS 70 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT-------KDKNNPLQYIAESKAQYEIP 114 +++ + + + S + EI Sbjct: 71 GAASPYVEEVRLTHLRDEQEESLQVALDFVPNSLNIAAQESVPIRTNAKAGISSNKAEIV 130 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 + + ++ + V+ + + ++ + Sbjct: 131 FMLDVSNSMSGEPMNKTKEALLAFADKLYARGNRNQNYVVSIVPASGNVNTGPMEEIYLG 190 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE------------------- 215 + +Y +++ WS ++ P ++ + + Sbjct: 191 SFRRYDHAQVKRENRWSDMFDRASGRTPAVPGRQRNAMCRDLDFEGNNPATLGLRYFRNL 250 Query: 216 ---------------------SAGNL-----VNSIQKAIQEKKNLSVRIG---TIAYNIG 246 + + ++ N I +I Sbjct: 251 EKAPQFASNNSKRIIRPIHKPAVLHFDDGTPLDPPVYPSTNPSNNYRPFHEDKAIFDDIE 310 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT-------- 298 N P +S + +L P NTN M A R L + + Sbjct: 311 CHVNPIVPFITERRHFESTVQRLVPGMNTNNAEGMVWAMRLLSPYWQGIWDKTRPELPRR 370 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDL 356 K+++ +DG + A+++ IC ++ G+K+ +V + L Sbjct: 371 YSDETSNKYLVMFSDGNHLIDPAFRDKK-MKLICTQLKQPGRGVKVMTVNFG-GAASERL 428 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 ++ C +++ V + + F++I +++ S+ Sbjct: 429 MQSCASGP-EYYHVASLFSVEKVFEQIAEQVISSSL 463 >gi|86137906|ref|ZP_01056482.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] gi|85825498|gb|EAQ45697.1| hypothetical protein MED193_08588 [Roseobacter sp. MED193] Length = 543 Score = 122 bits (306), Expect = 8e-26, Method: Composition-based stats. Identities = 42/370 (11%), Positives = 76/370 (20%), Gaps = 78/370 (21%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 T +DL + R +Q LD AVL+ + Sbjct: 38 TVAFFLAMLAVGGIGVDLMRMERDRTVLQYTLDRAVLAAA-------DLDQTQPPAVVVQ 90 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 K Q + G +A I+ T + L Sbjct: 91 DYLNKAGLGEYYQEPIVESGLGYKRVQATIDATFEA---------------------HLL 129 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + S + I +VLDVS SM + ++ Sbjct: 130 RFSNGNDLPVFATSKAEESI----DGLEISLVLDVSGSMNSNSRLSNLKVAAKDFIDTMV 185 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL-----IESAGNLVNSIQKAIQEKKNLSV 236 + P + + A + Sbjct: 186 ANTTDGKMSISVVPYATQVSLPDDLIDQYTTVGENPYSNCINFEAAEYNSASLSTLDTLE 245 Query: 237 RIGTIAYN----------------------IGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 R + PL + +K+ + L+ N Sbjct: 246 RSMHFTPWGYSNRDMRTYYSSPRLVRSPVCDERASREVLPLQKDATTLKNFIQNLSAGGN 305 Query: 275 TNTYPAMHHAYRELYNEKESS-------------------HNTIGSTRLKKFVIFITDGE 315 T+ M L + + K ++ +TDG+ Sbjct: 306 TSIDVGMKWGTALLDPSARPAISAISTGIGASVPGDFSDRPAEYSDSDTIKIIVLMTDGQ 365 Query: 316 NSGASAYQNT 325 N+ + Sbjct: 366 NTSQYYVDDD 375 Score = 86.5 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 58/396 (14%), Positives = 102/396 (25%), Gaps = 49/396 (12%) Query: 37 VLSGCASIVSDRTIKDPT-TKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 VL S+ S+ + + KD T+ + + T Sbjct: 157 VLDVSGSMNSNSRLSNLKVAAKDFIDTMVANTTDGKMSISVVPYATQVSLPDDLIDQYTT 216 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 NP +A L L S S + + + V D Sbjct: 217 VGENPYSNCINFEAAEYNSASLSTLDTLERSMHFTPWGYSNRDMRTYYSSPRLVRSPVCD 276 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF---------------WSKNTTKSKYA 200 S E L LQK + A Sbjct: 277 ERASREVLPLQKDATTLKNFIQNLSAGGNTSIDVGMKWGTALLDPSARPAISAISTGIGA 336 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT------IAYNIGIVGNQCTP 254 P + + +S + + Q V + YN Sbjct: 337 SVPGDFSDRPAEYSDSDTIKIIVLMTDGQNTSQYYVDDDHRDGPSGVWYNSHYKSYSTYD 396 Query: 255 LSNNLNEVKSRLNKL--NPYENTNTYPA---------------MHHAYRELYNEKESSHN 297 N PY N + + + YR ++ E + ++ Sbjct: 397 SRYGGRYFFHYNNNWYNEPYGNGSGQSGTAVELNYAELFARTSLKYIYRYIFYEWMNFYD 456 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 I+ + G ++ T +CE + G+ +Y++ AP G +L Sbjct: 457 ARDDWYYG---IYSSHGNST------KNARTRSVCEAAKAKGIVVYTIGFEAPSNGVAVL 507 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 R C S +F V D E+ ++F I I++ + Sbjct: 508 RDCASSDAHYFDV-DGLEIKDAFASIATSIRQLRLT 542 >gi|241204947|ref|YP_002976043.1| hypothetical protein Rleg_2227 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240858837|gb|ACS56504.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 429 Score = 122 bits (305), Expect = 1e-25, Method: Composition-based stats. Identities = 57/437 (13%), Positives = 136/437 (31%), Gaps = 71/437 (16%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQM----------QSALDAAVLSGCASIVSDRTI 50 MTA+++ F A+D AH + +R Q+ A + ++ ++ + TI Sbjct: 19 MTALLVVPLFGAAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVAAAMTMSGNGTI 78 Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ 110 KD +IF QI L + I++TK N + Sbjct: 79 SLG---KDDARSIFMSQISGELTD----------VQVDLGIDVTKTANKLNSQV------ 119 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 T ++ +S +T + ++LD + SM Sbjct: 120 SFSATVPTTFMRVLGRDSITISGTATA---EYQTASFMDFYILLDNTPSMGVGATATDVS 176 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 ++ + + +K + +IDV+ ++ L + + Sbjct: 177 TMEKNTSDTCAFACHETQNNNNYYNLAK----KLGVSMRIDVVRQATKELTVTAKSTRVS 232 Query: 231 KKNLSVRIGTIAYNIGIVG-NQCTPLSNNLNEVKSRLNKLNP------YENTNTYPAMHH 283 + + T + +++L++V+S + ++ N + + Sbjct: 233 SNQFRMGVYTFGTKAEDAKLTTISDPTDDLDKVRSYTDAVDLMTIPFQGYNNDQQTSFDS 292 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ----------ICE 333 A ++ + + +T +K + F++DG T C+ Sbjct: 293 ALTQMKTIITTPGDGSTATTPQKILFFVSDGVGDSEKPKGCTKKLTGNRCQEPIDTSFCQ 352 Query: 334 YMRNAGMKI---YSVAVSAPPEG-------------QDLLRKCTDSSGQFFAVNDSRELL 377 +++ ++I Y+ + P ++ C G +F V + + Sbjct: 353 PLKDKSIRIAVLYTTYLPLPKNSWYNTWIKPFQGEIPTKMQACAS-PGLYFEVTPTEGIA 411 Query: 378 ESFDKIT-DKIQEQSVR 393 ++ + I+ + Sbjct: 412 DAMKALFLKVIRAPRIT 428 >gi|123443829|ref|YP_001007800.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. enterocolitica 8081] gi|122090790|emb|CAL13672.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. enterocolitica 8081] Length = 459 Score = 121 bits (303), Expect = 2e-25, Method: Composition-based stats. Identities = 58/450 (12%), Positives = 127/450 (28%), Gaps = 72/450 (16%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 II+ I +++H + + ++ A++ A L+ + + Sbjct: 29 FMIILPFFIALIFITFEISHYLQRKAKLSDAIEQATLALTIENNAIPDEPQQIK----NN 84 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + +L + + I +N + P + L Sbjct: 85 ALVLSYANAYLPSKEF------------SVPIININDNTYYLEYNAAVTMAYPAKFLTQT 132 Query: 122 GLIP-SALTNLSLRSTGIIERSSENLAI-SICMVLDVSRSMEDLY--------------- 164 L N++ I ++ E + + V D S SM + Sbjct: 133 SLTNAITDINITDNGVAIKNKAIEASDLTDVIFVADYSGSMLYNFDVNEPNDHERINALR 192 Query: 165 ----LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL 220 NN N P + + + P + KI L Sbjct: 193 SAFRKLHDIIMNNSNINAIGYIPFSWGTKRIVFENQQQKIYCHFPFSSKIYKPKG--NYL 250 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYN------IGIVGNQCTPLSNNLNE------------- 261 + I+K+ L I Y+ G P+S+ + Sbjct: 251 SDEIKKSSNALLLLDYIGDIIDYDKTIESITGNAQPIDIPMSDVRTKNVCLQASNAYSLE 310 Query: 262 VKSRLNKLN------PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + +N ++ PY T + A E + H + + + Sbjct: 311 QEQYINNIDNIIKMEPYGWTLISSGILSANNLFKKEANNRHRKLMIILSDGVDTYQDNFL 370 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ------DLLRKCTDSSGQFFA 369 + TL +CE + ++G+++ +A++ P+ R+C ++ Sbjct: 371 PNKGLFISKTLVEKGMCERVISSGIQMAFIAIAYSPDDDVNEPEYINWRQCVGKD-NYYE 429 Query: 370 VNDSREL-LESFDKITDKIQEQSVRIAPNR 398 +++ EL + I+ + R P + Sbjct: 430 AHNADELMRDIQQAISKSATSEVGRNTPKK 459 >gi|320156062|ref|YP_004188441.1| hypothetical protein VVM_02402 [Vibrio vulnificus MO6-24/O] gi|319931374|gb|ADV86238.1| hypothetical protein VVMO6_01216 [Vibrio vulnificus MO6-24/O] Length = 465 Score = 121 bits (303), Expect = 2e-25, Method: Composition-based stats. Identities = 51/456 (11%), Positives = 128/456 (28%), Gaps = 69/456 (15%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +++ +F + D+ I + NQ+ +A DAA L+ + + TS Sbjct: 12 MLVLLMSMLVFAAWVTDVMRIYSVHNQIANATDAA-LASAIISEVPESTAVELLHANLTS 70 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT-------KDKNNPLQYIAESKAQYEIP 114 +++ + + + S + EI Sbjct: 71 GAASPYVEEVRLTHLRDEQEESLQVALDFVPNSLNIAAQESVPIRTNAKAGISSNKAEIV 130 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 + + ++ + V+ + + ++ + Sbjct: 131 FMLDVSNSMSGEPMNKTKEALLAFADKLYARGNRNQNYVVSIVPASGNVNTGPMEEIYLG 190 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE------------------- 215 + +Y +++ WS K+ P ++ + + Sbjct: 191 SFRRYDHAQVKRENRWSDMFDKASGRTPAVPGRQRNAMCRDLDFEGNNPATLGLRYFRNL 250 Query: 216 ---------------------SAGNL-----VNSIQKAIQEKKNLSVRIG---TIAYNIG 246 + + ++ N I +I Sbjct: 251 EKAPQFASNNSKRIIRPIHKPAVLHFDDGTPLDPPVYPSTNPSNNYRPFHEDKAIFDDIE 310 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT-------- 298 N P +S + +L P NTN M A R L + + Sbjct: 311 CHVNPIVPFITERRHFESTVQRLVPGMNTNNAEGMVWAMRLLSPYWQGIWDKTRPELPRR 370 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDL 356 K+++ +DG + A+++ IC ++ G+K+ +V + L Sbjct: 371 YSDETSNKYLVMFSDGNHLIDPAFRDKK-MKLICTQLKQPGRGVKVMTVNFG-GAASERL 428 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 ++ C +++ V + + F++I +++ S+ Sbjct: 429 MQSCASGP-EYYHVASLFSVEKVFEQIAEQVISSSL 463 >gi|254781110|ref|YP_003065523.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] gi|254040787|gb|ACT57583.1| von Willebrand factor type A [Candidatus Liberibacter asiaticus str. psy62] Length = 420 Score = 121 bits (303), Expect = 2e-25, Method: Composition-based stats. Identities = 86/420 (20%), Positives = 159/420 (37%), Gaps = 62/420 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIK--------- 51 + A+ + L I + I + Y +N M+SA +AA+L+G + +VS+ + Sbjct: 25 IFALSVMSFLLLIGFLIYVLDWHYKKNSMESANNAAILAGASKMVSNLSRLGDRFESISN 84 Query: 52 -DPTTKKDQTSTIFKKQIKKHLK--QGSYIRENAGDIAQKAQINIT-------KDKNNPL 101 D K IK+ L + +I ++I++T NN + Sbjct: 85 HAKRALIDDAKRFIKNHIKESLSGYSAVFYNTEIQNIVNSSRISMTHMANNRLDSSNNTI 144 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 Y + Y+ + + +S + E I +V+D+S SM Sbjct: 145 FYNMDVMTSYDYRLQFIEHLLNQRYNQKIVSFIPALLRIEMGERPIFLIELVVDLSGSMH 204 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 D N+ K+ L + + Sbjct: 205 CAMNSDPEDVNSAPI-------------------------CQDKKRTKMAALKNALLLFL 239 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTY 278 +SI V +G I Y + N S +V+ + + + T++ Sbjct: 240 DSIDL--LSHVKEDVYMGLIGYTTRVEKNIEP--SWGTEKVRQYVTRDMDSLILKPTDST 295 Query: 279 PAMHHAYRELYNEKESS--------HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 PAM AY+ L ++K+ S I S +KF+IF+TDGEN+ + +NT++ Sbjct: 296 PAMKQAYQILTSDKKRSFFTNFFRQGVKIPSLPFQKFIIFLTDGENNNFK---SNVNTIK 352 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 IC+ + +KI +++++A P GQ LL+ C S + V ++ L+ F I+ + + Sbjct: 353 ICDKAKENFIKIVTISINASPNGQRLLKTCVSSPEYHYNVVNADSLIHVFQNISQLMVHR 412 >gi|159044810|ref|YP_001533604.1| hypothetical protein Dshi_2267 [Dinoroseobacter shibae DFL 12] gi|157912570|gb|ABV94003.1| hypothetical protein Dshi_2267 [Dinoroseobacter shibae DFL 12] Length = 553 Score = 121 bits (303), Expect = 2e-25, Method: Composition-based stats. Identities = 48/408 (11%), Positives = 97/408 (23%), Gaps = 90/408 (22%) Query: 1 MTAIII---SVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK 57 +T + + + IDL +R ++Q+ D AVL+ + T K Sbjct: 32 LTGFSLYIFILMMMIAGLTIDLMRYEAVRTRLQATSDRAVLAAA-------DLDQTTNAK 84 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 F K G + + +AQ++ T Sbjct: 85 AVVEDYFAKAGMSQYLDGVQVSKGLNFKEVEAQVSAT---------------------IP 123 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 + + + RS I + +VLD+S SM + Sbjct: 124 TWFMNMSGIETLDAFARSKAEERIQ----NIEVSLVLDISGSMGWDGKLANMRTAADQFV 179 Query: 178 KYLLPPPP------KKSFWSKNTTKSKYAPAPAPANRKIDV-----LIESAGNLVNSIQK 226 + ++ P + V + Q Sbjct: 180 RTMMAGNDNVAADGTGLTSVSIIPYHAVVNVPDELLDEYAVSTQQTVSNCVRFTATDFQS 239 Query: 227 AIQEKKNLSVRIGTIA------------------YNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ R+ + P S ++ ++ +++ + Sbjct: 240 ISIDRTKTLDRLAHFDRNNSNLHTFNGDRLIGRPWCQVGTYGAILPWSTSVTDLTNKVAE 299 Query: 269 LNPYENTNTYPAMHHAYRELYNEKES-----------------SHNTIGSTRLKKFVIFI 311 L NT T M A L ++ K V+ + Sbjct: 300 LGASGNTATDIGMKWAAALLDPGTQNIVDDMIDGGHLEADLAGRPVLYSDPETIKVVVLM 359 Query: 312 TDGENSGASAYQNTLNTLQIC----EYMRNAGMKIYSVAVSAPPEGQD 355 TDGE + + + + ++ Y V Sbjct: 360 TDGE-NTSQYDLKNEFKGTMSPVWWDEASDS----YFVYFQNRANNDK 402 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 1/77 (1%) Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 T QIC+ ++ + I+++ AP GQDL+R C SSG +F V E+ Sbjct: 478 YTGNSTADGYTEQICDQLKAQDVVIFTIGFEAPQRGQDLMRYCASSSGHYFDVEGV-EIS 536 Query: 378 ESFDKITDKIQEQSVRI 394 E+F I + IQ+ + + Sbjct: 537 EAFSSIANTIQQLRLSL 553 >gi|260434111|ref|ZP_05788082.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] gi|260417939|gb|EEX11198.1| conserved hypothetical protein [Silicibacter lacuscaerulensis ITI-1157] Length = 600 Score = 121 bits (302), Expect = 2e-25, Method: Composition-based stats. Identities = 53/382 (13%), Positives = 94/382 (24%), Gaps = 78/382 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +T +I + F+ +A+D+ R ++Q ALD AVL+ + KD Sbjct: 36 LTLFLIMIVFVASGFAVDVMRYDRERAKLQYALDRAVLAAA-------DLDQELCPKDVV 88 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +LK+ + GD + + + E+ A +I + Sbjct: 89 ID--------YLKKEGLDKYLTGDPKVEPDVCGSTAAVLKGYRRVEANADMDIE---MHF 137 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + S + I +VLDVS SM ++ + Sbjct: 138 MKWRGIETIASAATSVAEESI----GNVEISLVLDVSGSMRGS-KLENLKKAANLFIDDM 192 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV-----LIESAGNLVNSIQKAIQE----- 230 S+ P K++ + Sbjct: 193 FAKTEDGKVSISIVPYSEQVSIPDYLMNKLNTQGTNSIANCVDFASADFATTRFTAFDVT 252 Query: 231 ----------------------------KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 N V N + T L + + Sbjct: 253 DPVTGIVTPGTTLARTIHHDIGDGSDRRPYNGFVSSTICRPNTSTNHREITILQKDPVAL 312 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKES-----------------SHNTIGSTRLK 305 K +N LN T+ L + + + Sbjct: 313 KKEINLLNASGWTSIDVGAKWGVTLLDDSFQPLTKKLVTESKVPSIFKDRPDQNKGYDTM 372 Query: 306 KFVIFITDGENSGASAYQNTLN 327 K +I +TDGEN+ N Sbjct: 373 KVMILMTDGENTKQHKVNPPYN 394 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 17/83 (20%), Positives = 35/83 (42%), Gaps = 1/83 (1%) Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 + + + Q IC+ ++ + I+S+A AP + LL+ C G ++ Sbjct: 517 YNTSTTVLNQVQKDPRLTSICQKAKDEKIIIFSIAFDAPDGVKPLLKGCVSDDGAYYEAK 576 Query: 372 DSR-ELLESFDKITDKIQEQSVR 393 D+ +++ F I IQ + Sbjct: 577 DNDKDIISVFSSIGSTIQNLRLT 599 >gi|222087111|ref|YP_002545646.1| hypothetical protein Arad_3867 [Agrobacterium radiobacter K84] gi|221724559|gb|ACM27715.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 401 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 82/402 (20%), Positives = 157/402 (39%), Gaps = 31/402 (7%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +TAI I V A+D+ ++ +Q+Q A A++ + + + Sbjct: 25 LTAIAIPVVAATAGVAVDVTNMTVSNSQLQQAT------DAAALATATALANGNATTSNA 78 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP---LQYIAESKAQYEIPTEN 117 + + + + N D + N Y A Y++ Sbjct: 79 QQLATQFVTGQMSNYLSGDTNTADALKAGTTANVTSATNSSGGTSYTVAVNASYDMSVNG 138 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 + I + + + ST +++ A+S+ + LD S SM ++ Sbjct: 139 MSQLLGIKTMHVSAASTSTSGSAAAAKQAALSMEIALDKSGSMLLN------------TD 186 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAP-ANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 ++++ +Y A +P +KI L + G L++ + A + K+ V Sbjct: 187 VIDTSQKSCTQYYTEGNYLYQYPKAKSPCYIKKIAALKTAVGTLLDQLDSA--DPKSQYV 244 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSR-LNKLNPYENTNTYPAMHHAYRELYNEKE-S 294 R I + + L+ +S ++ LN T + M AY+ + E + Sbjct: 245 RTAAI--AWSSEVDSSSALAWGTTTTRSNVISGLNANGGTESSAPMALAYKNVSASSEAT 302 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + G+T +K ++ +TDGEN+ S+ TL T C+ ++AG+ IYSVA AP GQ Sbjct: 303 AQAAKGNTTFQKIIVLMTDGENNATSSDTKTLAT---CKAAKDAGVLIYSVAFMAPDRGQ 359 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 LL+ C S +F +L+ +F I ++ +Q + Sbjct: 360 TLLKNCASSPSNYFDAQQMSDLIAAFKTIGNQASKQITLLTK 401 >gi|218461471|ref|ZP_03501562.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 459 Score = 121 bits (302), Expect = 3e-25, Method: Composition-based stats. Identities = 54/446 (12%), Positives = 122/446 (27%), Gaps = 53/446 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQ---------------SALDAAVLSGCASIV 45 + ++ L + +ID R ++Q S + A ++ + Sbjct: 15 IVILVAVPMLLAVGASIDYIRAYNGRTELQAAADSAVLAAAAKYKSGMPEATIAKTINAF 74 Query: 46 SDRTIKDPTTKKDQTSTI---------FKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 + T + + + + I+ I A + K Sbjct: 75 LSANGEFETAVAGKPQVASDESELCLDVADAVPTTFMKLANIQSVPISIRSCAALPGVKQ 134 Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 L S E + + + + S + + L Sbjct: 135 LEIALVLDVSSSMIEENRFTPMQTAVAGFLQAFSSNTSLVDKTKISIVPFSSRVNFGLAN 194 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN----RKIDV 212 + ++ K S+W T + DV Sbjct: 195 TAWLKSYNGTAAVPKRWTDPESVYTSSGYKLSYWIDGVTPVMSTSKNYYWMGCIEPRADV 254 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + G + + + A TPL+ + +KS + L Sbjct: 255 EVRDTGAIGDGMGDAPPSTSAFVAMDANPKSGTSFCPPPVTPLTGDFAYLKSVVKNLTSE 314 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRL-----KKFVIFITDGENSGASAYQNTLN 327 +T + + L + + S K ++F+TDGE + + + Sbjct: 315 GSTRLDAGVVAGWYTLSPKWQGVWGDQSSPAPVSDSVHKVMVFMTDGEMNTKYDPNDKFD 374 Query: 328 TL------------------QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 + C M+ +G++IY+++ SA + ++ R C ++ FF Sbjct: 375 WICSQTQSSACNAFATAARQTACTAMKKSGIEIYTLSYSADADVVNI-RNCATNTAHFFT 433 Query: 370 VNDSRELLESFDKITDKIQEQSVRIA 395 + ++ I I+ ++R+ Sbjct: 434 A-SPATIKTVYETIAAAIRGDTLRLT 458 >gi|254477542|ref|ZP_05090928.1| conserved hypothetical protein [Ruegeria sp. R11] gi|214031785|gb|EEB72620.1| conserved hypothetical protein [Ruegeria sp. R11] Length = 523 Score = 120 bits (300), Expect = 4e-25, Method: Composition-based stats. Identities = 47/361 (13%), Positives = 77/361 (21%), Gaps = 73/361 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M A ++S IDL + R +Q LD AVL+ + P Sbjct: 6 MIAFLLS-MVAVGGIGIDLMRMERDRTILQYTLDRAVLAAA-------DLDQPLPPDVVV 57 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 K Q G ++ I+ T + Sbjct: 58 QDYLNKANLSEYYQPPIAETGIGYKRVESTIDTTFETQW--------------------- 96 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L S + I +VLDVS SM + N + Sbjct: 97 LDFSGGQDMPLYANSRAEESI----DGLEISLVLDVSGSMNSNSRLYNLKNAARDFIDTM 152 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV-----LIESAGNLVNSIQKAIQEKKNLS 235 + + P + +V Sbjct: 153 VANTADNKMSVSIVPYATQVSLPKDMLDQYNVTDEHEYSNCVNFTGTHFTSTGLSTTASL 212 Query: 236 VRIGTIAYNIGI-----------------VGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 R + P + N +K + L + NT+ Sbjct: 213 NRTMHFTPWWSGDARPSNGLIQYPVCDERAHREVMPFQKDANRLKDFIQNLQAWGNTSID 272 Query: 279 PAMHHAYRELYNEKES------------------SHNTIGSTRLKKFVIFITDGENSGAS 320 M L + T K ++ +TDG+N+ Sbjct: 273 VGMKWGTVLLDPSAQPVISALTSSSVNVPGVFADRPAAYNDTETVKVIVLMTDGQNTSQY 332 Query: 321 A 321 Sbjct: 333 Y 333 Score = 96.9 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 42/410 (10%), Positives = 98/410 (23%), Gaps = 59/410 (14%) Query: 20 AHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIR 79 + + ++N + +D D + + K + + Sbjct: 136 SRLYNLKNAARDFID----------TMVANTADNKMSVSIVPYATQVSLPKDMLDQYNVT 185 Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + +N T + + + PS Sbjct: 186 DEHEY---SNCVNFTGTHFTSTGLSTTASLNRTMHFTPWWSGDARPSNGLIQYPVCDERA 242 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS-FWSKNTTKSK 198 R + D ++++ + + P S S + Sbjct: 243 HREVMPFQKDANRLKDFIQNLQAWGNTSIDVGMKWGTVLLDPSAQPVISALTSSSVNVPG 302 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 A + + + + + YN ++ + N Sbjct: 303 VFADRPAAYNDTETVKVIVLMTDGQNTSQYYVESDHRGGNAPVFYNSAASSDKARYSTYN 362 Query: 259 LNEVKSRLNKLN-----PYE------------------NTNTY------------PAMHH 283 + +K+N Y T ++ + Sbjct: 363 PGNQRYYWDKMNRWEDHAYGQGEYRECGYYNCWWQDEQGTPATVTQLTHAELFARTSLRY 422 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 Y+ L+ + + S G T IC+ + G+ +Y Sbjct: 423 VYQRLFADWMGNSAAKNSWYY---------GVYDSWGTSTKNARTKAICDAAKARGIVVY 473 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ AP G +L+ C S +F V E+ ++F I I++ + Sbjct: 474 TIGFEAPSGGVSVLKDCASSDAHYFDVQGL-EISDAFASIATSIRQLRLT 522 >gi|260466792|ref|ZP_05812977.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259029404|gb|EEW30695.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 492 Score = 120 bits (299), Expect = 6e-25, Method: Composition-based stats. Identities = 46/463 (9%), Positives = 117/463 (25%), Gaps = 74/463 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + SV L ++++++ + ++ +Q +DAAV S + + + + Sbjct: 25 LFGFAASVLALAAGFSVNISQLYNAKSSLQGVVDAAVTSTARDLTTGVIKEADADNSVKA 84 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP-------------------- 100 + + + D K Sbjct: 85 FLVANSAAGILQPDQVVLDKLIVDKTAKTVQANVHVDVALYFPLFGIGDMQRVAASTTAL 144 Query: 101 -LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 E +I + + R+ +++ V V + Sbjct: 145 YSDKTVEVAMMLDITGSMAKRGNVDKIGDLRAAARNAVQTMLQNQDPKRPRIRVAIVPYA 204 Query: 160 MEDLY---------LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 + + + LL K+ + A P Sbjct: 205 SGVNAGKLAENVYAETQGSSELPPVAGSSLLVAKTGKALLPSFSDYISIVGAAMPHPDNC 264 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG-----------IVGNQCTPLSNNL 259 ++ + + A + PL+ + Sbjct: 265 TTERKNKNGDADLSADGPDTVRTDRNGKKYYALVNRDDHLDGGGMNRCPDAEVIPLTADS 324 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT---------IGSTRLKKFVIF 310 + + ++ T A+ Y L + ++ + ++ K I Sbjct: 325 DALLDSIDDFRAAGYTAGAIAIQWTYYMLSPQWRAAIKNVGLGNGASDANAKKIAKVAIL 384 Query: 311 ITDGENS----------GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG------- 353 +TDG+ + N +C M+N G++I+++ + Sbjct: 385 MTDGQFNTAFAGAGGSYNGQGDLARGNAEALCGNMKNDGIEIFTIGFDLNDKDMSATERD 444 Query: 354 --QDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQE 389 + +L+ C+ +F + EL +F +I ++ Sbjct: 445 QAKAVLKGCSSKDASAAERHYFEASTGAELDAAFQEIIRNTEK 487 >gi|315122347|ref|YP_004062836.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495749|gb|ADR52348.1| hypothetical protein CKC_02995 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 362 Score = 119 bits (298), Expect = 6e-25, Method: Composition-based stats. Identities = 77/387 (19%), Positives = 163/387 (42%), Gaps = 54/387 (13%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 AII + + + ++++I + ++Q+ +D A+L +++ + I+D +T Sbjct: 22 AIIFPLIIILMAIVFEMSNIYLEKERLQAVIDRALL-DTVTMIKLKNIEDVVKNVGPVNT 80 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 I+ K +K L+ + + I + + +N + +QY++P + + Sbjct: 81 IWTKNLKYELEHSDFSSDVQNVIDDTSMK--LESDSNFKTLSITAISQYKMPFKICNIHL 138 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 L P + + +E I + +VLDVS SM+D + + P Sbjct: 139 LCPKNKYVTVPVLSSMKIGRNEGSDIDLMIVLDVSSSMDDNF---------------MKP 183 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 S +++V +S ++ +K R G++ Sbjct: 184 EEAPCS--------------------RLEVAKKSIRKMLEDFRKVPNYANVF--RTGSVG 221 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 +N + PL L + + + K + +TN+Y M +A+ +LY + + Sbjct: 222 FND--MVQFPMPLKRGLKRIYNDIKKYRAFGSTNSYVGMKYAWEQLYGNPQ------DTK 273 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 KK VIF+TDGEN T T+++C M+ IYS+A++ +++L+ C+ Sbjct: 274 DRKKIVIFLTDGEN---MIINATRKTIELCNDMKKKKAVIYSIALAVD--NKEVLQGCSS 328 Query: 363 SSGQFFAVNDSRELLESFDKITDKIQE 389 S G +A +D++ L++++ I + + Sbjct: 329 S-GNVYAADDAQSLVQAYSLIGKDVMK 354 >gi|254472518|ref|ZP_05085918.1| conserved hypothetical protein [Pseudovibrio sp. JE062] gi|211958801|gb|EEA94001.1| conserved hypothetical protein [Pseudovibrio sp. JE062] Length = 479 Score = 119 bits (298), Expect = 7e-25, Method: Composition-based stats. Identities = 55/465 (11%), Positives = 130/465 (27%), Gaps = 90/465 (19%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A ++ + +F AID R + A + + + + + + Sbjct: 30 LVAFLMVLLIVFAGMAIDFGLGFNTRRAVNQ----------ALDAAVLAVANKLSTTELS 79 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S I ++ ++ + K + + + +P L Sbjct: 80 SNTVDSLIDQYFEENLKNSVGGDVVHTKPVVTYDPKGDTVAATATATVKTSFLPVLKLLN 139 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ ST ++ +A+ + + +S S+ + L Sbjct: 140 SESGDFGELTVTSSSTARFPKTKVEVAVVVDVTGSMSGSIGS------LKTASRDMLDTL 193 Query: 181 LPPPPKKSFW---SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN---- 233 LP + + RK G + ++ +N Sbjct: 194 LPDDNTRLQSRVRISYVPYNVGVKLDKTLARKATFEKSQYGCVHARVRDLAYSGENHDYE 253 Query: 234 -----LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 V Y+ Q PL+N+ +++S +N L T + + L Sbjct: 254 DEDDDERVDYIGTNYSW-CPNAQMVPLTNDRTKIESSINALRASSATAGQIGIAWGWYTL 312 Query: 289 ---------------------------------YNEKESSHNTIGSTRLKKFVIFITDGE 315 +N ++ + KK + +D + Sbjct: 313 SPEWRGFWPTESKPDFYDNNGVRKYAVLMTDGSFNAYYAADYSKADAEHKKLIKNKSDVQ 372 Query: 316 NSGASAYQNTLNT----------------------------LQICEYMRNAGMKIYSVAV 347 NS L+ +C+ M+ + IY+V Sbjct: 373 NSQDPMDSGKLDADDHKKIASKVKWEYDYSSSLSGVPFKTASNLCKNMKKEDIVIYTVFF 432 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + +G+ ++ +C +S F+ + L+++F I + I+ + Sbjct: 433 GSDYKGKKIMEECASNSETFYHATNQSALIQAFSSIANDIKSIYL 477 >gi|311234271|gb|ADP87125.1| Protein of unknown function DUF2134, membrane [Desulfovibrio vulgaris RCH1] Length = 440 Score = 119 bits (298), Expect = 8e-25, Method: Composition-based stats. Identities = 50/419 (11%), Positives = 123/419 (29%), Gaps = 74/419 (17%) Query: 17 IDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGS 76 ID + +++Q+A+DAA L+G + D + + + + + S Sbjct: 54 IDSGMLYLSHSRLQAAVDAAALAGSLQLPYDPQLDKG-----LVRGAVTQYMDANYPEAS 108 Query: 77 YIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST 136 G + T + + + ++T Sbjct: 109 LNGVTPGTEERSVT-------------------VTATATVPTIFMNALGIGSSEVHAKAT 149 Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK 196 + + + V+D S SM+ +Q+ N + + + Sbjct: 150 AGYNK------LEVVFVIDNSGSMKGTPIQQTNSAASQLVELIMPEGMMTSVKVGLVPFR 203 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT---IAYNIGIVGNQCT 253 K PA + D + G L S K + G+ + N + Sbjct: 204 GK-VHLPAGVDGLPDGCRNADGTLNPSWLHEEYFKTSYRYPSGSSLNVPKNTCTSIPRVQ 262 Query: 254 PLSNNLNEVKSRL---NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 L+ + + + + N L T + L E + + + ++K +I Sbjct: 263 GLTEDRETILTAISKQNGLGDASGTVISEGLKWGRHVLTPEAPFTEGS-SAKDIRKVIIV 321 Query: 311 ITDGENSGASAYQN-------------------------------TLNTLQICEYMRNAG 339 +TDG+ + L+ + AG Sbjct: 322 LTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSHCENGGKLNAAMLEEARKAKEAG 381 Query: 340 MKIYSVAVS-APPEGQDLLRKCTD----SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++++++ + L++ ++ ++ + ++ + F KI ++ + +R Sbjct: 382 IEVFAIRFGDSDSVDVSLMKSIASSKAGTNDHYYDAPSAYDIDDVFKKIGRQLGWRLLR 440 >gi|86747937|ref|YP_484433.1| hypothetical protein RPB_0811 [Rhodopseudomonas palustris HaA2] gi|86570965|gb|ABD05522.1| conserved hypothetical protein [Rhodopseudomonas palustris HaA2] Length = 435 Score = 119 bits (297), Expect = 9e-25, Method: Composition-based stats. Identities = 67/433 (15%), Positives = 132/433 (30%), Gaps = 73/433 (16%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI + FI AID A IR ++QSA DAAVL ++ +RT +Q Sbjct: 28 IFAIALLPILGFIGAAIDYATANRIRTKLQSAQDAAVLLAVSNSEINRTTAQAKADAEQF 87 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + EN G + A T Sbjct: 88 FNATIGAYGLTATIKIEVTENDGKRSATADFTSTVT---------------------TNF 126 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 LI + RST + R + ++LD S SM ++ Sbjct: 127 LNLIGYPTLAIGNRSTSTVSR---PIYQDFYLLLDNSPSMGVAATTADIATMVGNTSDKC 183 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR-IG 239 S + +K +IDV+ ++ L ++ + Sbjct: 184 AFACHDLSDSNNYYNLAK----KLGVKMRIDVVRQAVQQLTSTATLMTAVNNQFRMAVYT 239 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY------ENTNTYPAMHHAYRELYNEKE 293 + LS+ ++ V++ ++ N + + A + Sbjct: 240 LGGSCASLGLTTIASLSSAMSSVQTAAGAIDLMSIPKQNYNNDQCTDFNSALAAMNTTIP 299 Query: 294 SSHNTIGSTRLKKFVIFITDG-----------ENSGASAYQNTLNTLQICEYMRNAGMKI 342 SS + +K++ F++DG + + + T+ C+ M++ G++I Sbjct: 300 SSGTGTAAQ-PQKWLFFVSDGVADFNNPSGCTQPTVSGGRCQEPLTVTQCKAMKDRGIQI 358 Query: 343 ---YSVAVSAPPEG----------------------QDLLRKCTDSSGQFFAVNDSRELL 377 Y+ ++ P ++ C +F V+ ++ + Sbjct: 359 AVLYTTYLALPTNQWYNDHIAPFNAGPYGPSVNSQIAAKMKSCAS-PDFYFEVSPTQGIS 417 Query: 378 ESFDKITDKIQEQ 390 E+ D + K + Sbjct: 418 EAMDALFKKAVAK 430 >gi|315498202|ref|YP_004087006.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416214|gb|ADU12855.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 489 Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats. Identities = 59/486 (12%), Positives = 123/486 (25%), Gaps = 117/486 (24%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M + S+ + + A+D ++++ R++ Q ALDAA L+ +Q Sbjct: 17 MFGLFFSILIVSMAGAVDYSNVISRRSKAQDALDAATLAVAVLRP---------ATVEQA 67 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 K ++ K L D Q N Y +K Y+ + Sbjct: 68 QAAVKLRLDKELGDNP-------DKVVIGQFNY---DTKTRTYYVTAKGTYKPFLLGVVN 117 Query: 121 KGLIPSALTNLSLRSTG---------IIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 IP + + ++++ S + LDV ++ + + Sbjct: 118 IKEIPYEVISETIQAANGTLELALVLDNTDSMGQILNGSSTRLDVLKTAATNLVNTVMTS 177 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAP----------------------APAPANRK 209 N K + P + Sbjct: 178 ANKDYVKVAVVPYADYVNVGLANRSQSWVSVGADYTVPAAAKTCTTISTKQVCTGGVYGT 237 Query: 210 IDVLIESA------GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV--------------- 248 D + + + Y Sbjct: 238 CDSIKDGVPIKVGCWKTPQTCTTVNITPYQSCNNPQPTYYKWYGCVRHQVDSKTKMLVLP 297 Query: 249 ----------------GNQCTPLSNNLNEVKSRLNKLNPYEN-----TNTYPAMHHAYRE 287 PLSN+ V + + L T +H Sbjct: 298 DPLTAYTGVLETAQKCPTAIQPLSNDKTVVTNSIKGLVNSIGSYKPDTFIPGGLHWGVNT 357 Query: 288 LYNEKE---SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ-------------- 330 L + KK ++ +TDG N+ + + + Sbjct: 358 LSPPAPFKEGMAYDSKNKEPKKVIVLMTDGANTLYTNSSGQIVSAATGSPPTISSSLVAP 417 Query: 331 -------ICEYMRNAGMKIYSVAVS-APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 C+Y + ++++ + + P L+ C + +F ++ +L+E+F+ Sbjct: 418 TYTAQDNACKYAKGKNIEVFVIGLGVTDPTALSALKSCATDAQHYFDAQNANDLIEAFEI 477 Query: 383 ITDKIQ 388 I K+ Sbjct: 478 IGGKLS 483 >gi|260778153|ref|ZP_05887046.1| hypothetical protein VIC_003555 [Vibrio coralliilyticus ATCC BAA-450] gi|260606166|gb|EEX32451.1| hypothetical protein VIC_003555 [Vibrio coralliilyticus ATCC BAA-450] Length = 397 Score = 118 bits (296), Expect = 1e-24, Method: Composition-based stats. Identities = 54/392 (13%), Positives = 125/392 (31%), Gaps = 27/392 (6%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 ++ + I + + + ++ A+DAA L+ + SD ++ Sbjct: 13 FLALLIPLVVLSAATIMIGFQVQLSSRAMQAVDAASLACAFADYSDPSVNQA-------- 64 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 ++ + + ++ A ++N+ + ++A Y + ++ Sbjct: 65 ------YLEYYQPNVKLVKSEIYSASGCELNMGYQLTGLFSSLKFAQASYSAQSGSVEQA 118 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + SA + + + SS +I + Q + + + + Sbjct: 119 HVNQSASVTPTEMTLVLDISSSMAGSIDTLKSILTRAIER--IEQDNVQIDGRRAISISI 176 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P K V ++ + EK +S R Sbjct: 177 VPFSDGVSARNADWLDDKGVFCIDGLTKESGGSVLVNETVQNLDRIHSEK-AVSHRAPDE 235 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT-IG 300 PL++N++EVK+ +N L T +Y + R+L Sbjct: 236 FLADCSASATLVPLTDNMSEVKTAINALTTTGGTRSYQGVIWGARQLIPRWRQEWGYNPY 295 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEY-MRNAGMKIYSVAVSAPPEGQDLLRK 359 S K+ +I +TDG + + L +C+ +++ + + + Sbjct: 296 SLAPKQKLILMTDGV--DSGYVLDDLIDAGLCDRLANEFAIELNFIGFNVQDSRLAQFQS 353 Query: 360 C---TDSS---GQFFAVNDSRELLESFDKITD 385 C ++ GQ F+ ++ +L E F KI + Sbjct: 354 CINAANTDGIKGQVFSATNTEKLDEYFSKILE 385 >gi|91975399|ref|YP_568058.1| hypothetical protein RPD_0919 [Rhodopseudomonas palustris BisB5] gi|91681855|gb|ABE38157.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB5] Length = 435 Score = 118 bits (295), Expect = 1e-24, Method: Composition-based stats. Identities = 63/433 (14%), Positives = 136/433 (31%), Gaps = 73/433 (16%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI + FI A+D + +R +++SA DAAVL ++ ++T+ D Q Sbjct: 28 IFAIALLPILGFIGAAVDYTNASRVRAKLESAQDAAVLLAVSNSAINKTVADAQADAVQF 87 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + EN G S + Sbjct: 88 FNATLDGYGLSATIDLSVSENDGK---------------------RSAVSSFSSSVKTHF 126 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 +I + RST + + + ++LD S SM ++ Sbjct: 127 LDMIGYPTLAIGNRSTSTVSL---PVYVDFYLLLDNSPSMGVAATTSDIATMVANTSDQC 183 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 S + +K +IDV+ ++ L + + + T Sbjct: 184 AFACHDLSTSNNYYNLAK----KLGVTMRIDVVRQAVQRLTTTATAMSAVTNQFRMGVYT 239 Query: 241 I-AYNIGIVGNQCTPLSNNLNEVKSRLNKLNP------YENTNTYPAMHHAYRELYNEKE 293 + I LS++++ V++ + ++ N + + + + Sbjct: 240 FGSSCTAIGLTTVANLSSSMSSVQTSVGTIDLMTIPYQGYNNDQCTDFDGSLTAINSAIP 299 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN-----------TLQICEYMRNAGMKI 342 S + ST+ +K++ F++DG T T+ C +++ G++I Sbjct: 300 -SPGSGISTQPQKWLFFVSDGVADANYPSTCTKPTVSGGRCQEPLTVAQCTAIKSRGIQI 358 Query: 343 ---YSVAVSAPPEG----------------------QDLLRKCTDSSGQFFAVNDSRELL 377 Y+ ++ P ++ C G +F V+ ++ + Sbjct: 359 AVLYTTYLALPTNSWYNTYIAPFNPGPYGPSTNSQIAANMQSCAS-PGFYFEVSPTQGIA 417 Query: 378 ESFDKITDKIQEQ 390 E+ D + K + Sbjct: 418 EAMDALFKKAVAK 430 >gi|59711129|ref|YP_203905.1| TadG-like protein [Vibrio fischeri ES114] gi|59479230|gb|AAW85017.1| TadG-like protein [Vibrio fischeri ES114] Length = 465 Score = 118 bits (295), Expect = 1e-24, Method: Composition-based stats. Identities = 50/469 (10%), Positives = 131/469 (27%), Gaps = 94/469 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + I V F T A D A + + +++ A +AAVL+ A + + + Sbjct: 16 LFVMCIPVLFGVFTLASDGARALQSKARLEDAAEAAVLAVSA----------YGEEDEVS 65 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN---PLQYIAESKAQYEIPTEN 117 + K + ++ S + + + + +++ +N ++Y + +++ Sbjct: 66 TQTGKDYVAHYMHDMSNLVDIEVEKLECSELPECTADDNDRPFVEYQVSGRTKHKSWFPG 125 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM--- 174 + + + + + + I +LD S SM + Sbjct: 126 NDVTVGFGES---FDVTGMSKARKFQSSQPMDITFILDFSGSMNYDWEGHAPSYMEEEVP 182 Query: 175 -------------TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 + + ++ +TT K+ A NR+ + ++ Sbjct: 183 KVPGRYSPPSRLSDLKDVVQMVTDELQVYNNSTTGPKHRVAMTGYNRRTVNESSNGKFVI 242 Query: 222 NS-----IQKAIQEKKNLSVRIG-------------TIAYNIGIVGNQCTPLSNNLNEVK 263 + + + +++ Sbjct: 243 RDQRITKYNSDGYDAGDKFYPKKTINKQFMVKGAAARVPNGDEKAEFTDIMYTSDFASFN 302 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG-ENSGASAY 322 ++ + T + + A + + + K+ +I ++DG + + Sbjct: 303 HKIKSFEAFGGTASLQGIIRASQIVSYHITNDGEEAN---PKQLIIILSDGEDFNHYLGQ 359 Query: 323 QNTLNTLQICEYMRNA----------------------------------GMKIYSVAV- 347 TL +C+ +RNA ++I + Sbjct: 360 TETLVDYGMCDNLRNAIEGGPVSSEDNKADKIVFSSGSSSGLPTNTGEDPSVRIAMIGFG 419 Query: 348 -SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 L C F+ N+ E+ + I + E+ +A Sbjct: 420 DGYDIHANTGLLNCV-GEENAFSANNKDEI---LNLIMSLVSEEVGHLA 464 >gi|46580532|ref|YP_011340.1| von Willebrand factor type A domain-containing protein [Desulfovibrio vulgaris str. Hildenborough] gi|46449951|gb|AAS96600.1| von Willebrand factor type A domain protein [Desulfovibrio vulgaris str. Hildenborough] Length = 420 Score = 118 bits (295), Expect = 2e-24, Method: Composition-based stats. Identities = 50/419 (11%), Positives = 123/419 (29%), Gaps = 74/419 (17%) Query: 17 IDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGS 76 ID + +++Q+A+DAA L+G + D + + + + + S Sbjct: 34 IDSGMLYLSHSRLQAAVDAAALAGSLQLPYDPQLDKG-----LVRGAVTQYMDANYPEAS 88 Query: 77 YIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST 136 G + T + + + ++T Sbjct: 89 LNGVTPGTEERSVT-------------------VTATATVPTIFMNALGIGSSEVHAKAT 129 Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK 196 + + + V+D S SM+ +Q+ N + + + Sbjct: 130 AGYNK------LEVVFVIDNSGSMKGTPIQQTNSAASQLVELIMPEGMMTSVKVGLVPFR 183 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT---IAYNIGIVGNQCT 253 K PA + D + G L S K + G+ + N + Sbjct: 184 GK-VHLPAGVDGLPDGCRNADGTLNPSWLHEEYFKTSYRYPSGSSLNVPKNTCTSIPRVQ 242 Query: 254 PLSNNLNEVKSRL---NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 L+ + + + + N L T + L E + + + ++K +I Sbjct: 243 GLTEDRETILTAISKQNGLGDASGTVISEGLKWGRHVLTPEAPFTEGS-SAKDIRKVIIV 301 Query: 311 ITDGENSGASAYQN-------------------------------TLNTLQICEYMRNAG 339 +TDG+ + L+ + AG Sbjct: 302 LTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSHCENGGKLNAAMLEEARKAKEAG 361 Query: 340 MKIYSVAVS-APPEGQDLLRKCTD----SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++++++ + L++ ++ ++ + ++ + F KI ++ + +R Sbjct: 362 IEVFAIRFGDSDSVDVSLMKSIASSKAGTNDHYYDAPSAYDIDDVFKKIGRQLGWRLLR 420 >gi|328541712|ref|YP_004301821.1| hypothetical protein SL003B_0088 [polymorphum gilvum SL003B-26A1] gi|326411464|gb|ADZ68527.1| hypothetical protein SL003B_0088 [Polymorphum gilvum SL003B-26A1] Length = 454 Score = 117 bits (293), Expect = 3e-24, Method: Composition-based stats. Identities = 56/448 (12%), Positives = 129/448 (28%), Gaps = 78/448 (17%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M +++++ + +D + +R + AL A + R + + Sbjct: 27 MVGVLVALMVVIGGAGLDYGRAIMLRASISHAL------DAAVLAVARQLSVSIMTDSEL 80 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 K ++ GD+ + + A +PT + + Sbjct: 81 DKAIKDAFAANMASAGLSGATLGDLTYVLDPDAG---------TISATATALVPTYFIHV 131 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 GL P ++ + + + + MV+DV+ SM + + L Sbjct: 132 GGLGPE-----NVAIAASADATYSRFDVELAMVVDVTGSMRNSM--ASLRTAAQSVVDIL 184 Query: 181 LPPPPKKS----FWSKNTTKSKYA--PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 +P KKS + + + + + Sbjct: 185 IPDGTKKSASKVRIALVPYSQGVNLGEYAPKVSNGDAGTQNCVTERMGNEKYTDATYNYN 244 Query: 235 SVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + PL++ N + S ++KL T + + L + Sbjct: 245 GTSSEFFGGGSNSCASTPQMEPLTSKRNTLTSAISKLKDNGRTAGQTGIAWGWYALSPKW 304 Query: 293 ES------SHNTIGSTRLKKFVIFITDGENS----------------------------- 317 + + + + KF + +TDG+ + Sbjct: 305 SNLWPNDSVPGSYTDSDILKFALIMTDGDFNEYYDKATAQSNCKWQFNWSTFKWEQVCDS 364 Query: 318 ----------GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--GQDLLRKCT-DSS 364 + ++ +C ++ G+++YS+ + G +++ C + Sbjct: 365 SYVWTAYSEAAGYSNVSSTRAKTLCAAIKQTGIQVYSIYFGSNANSAGAKVMKDCASSTK 424 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQSV 392 FF EL+ +F KI +KIQ + Sbjct: 425 ETFFMATSDSELIAAFAKIANKIQNIYL 452 >gi|99081991|ref|YP_614145.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] gi|99038271|gb|ABF64883.1| hypothetical protein TM1040_2151 [Ruegeria sp. TM1040] Length = 582 Score = 117 bits (292), Expect = 4e-24, Method: Composition-based stats. Identities = 46/370 (12%), Positives = 90/370 (24%), Gaps = 64/370 (17%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 +I+ + F+ +D+ + R ++Q LD AVL+ D+ + D S Sbjct: 41 MIVVLMFMIGGLGMDMVRLERDRTKLQYTLDRAVLAAADL---DQPLDPEAVVLDYMSKS 97 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 + + + N + Y++ L L Sbjct: 98 GLGDYTTVVVPEVSPTAKRVKASVDTNFTASWMNNVFYDDYIRNPDTYQLEPITLPLL-- 155 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 + I +VLDVS SM + + + Sbjct: 156 -------------ASSTAVESIGNVEISLVLDVSGSMRSNDRLVNLKRAAKEFVQTMDDN 202 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDV-----LIESAGNLVNSIQKAIQEKKNLSVRI 238 S P +++V + A R Sbjct: 203 TEDGKMSISIVPYSTQVSMPEAFLDELNVSSEHDYSHCINFSGSDFNNAGISTTQAYERT 262 Query: 239 GTIAYNI--------------------GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 LS+N+ ++++ ++ P ENT+ Sbjct: 263 MHFTVWNSGDYRSRTRLVRQPTCAAHSDNPERTALLLSDNVTQLQNYIDAFVPSENTSID 322 Query: 279 PAMHHAYRELYNE---------------------KESSHNTIGSTRLKKFVIFITDGENS 317 M L + T K ++ +TDG+N+ Sbjct: 323 LGMKWGSALLDPSVQPVIASLADDANPNQSIASRFANRPVPYTDTETLKVIVMMTDGQNT 382 Query: 318 GASAYQNTLN 327 +N+ Sbjct: 383 SQYYLRNSYR 392 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 74/227 (32%), Gaps = 14/227 (6%) Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 D + + ++ +N+ + +P A + + + + + Sbjct: 366 DTETLKVIVMMTDGQNTSQYYLRNSYREGDSPVWYNAQER-------VYSTYDPNRGSKP 418 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + R Y G + A+ +Y +L+ Sbjct: 419 YYWDNLQRWADHPYGNGTYEETYCTGTLYYGNCYY--GSWQTRTVDEPGTAVQLSYPDLF 476 Query: 290 NEKESSH---NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + + G + + +S S TL ICE + G+ ++++ Sbjct: 477 ADTSLRYLRDRLFGDWMSNANYYWFSGLFSSVGST-TKDARTLDICEAAKAKGVVVFTIG 535 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 AP GQ++L+ C S+ ++ V D E+ ++F I I++ + Sbjct: 536 FEAPSRGQEVLQACASSASHYYDV-DGLEISDAFASIASAIRQLRLT 581 >gi|259416688|ref|ZP_05740608.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] gi|259348127|gb|EEW59904.1| conserved hypothetical protein [Silicibacter sp. TrichCH4B] Length = 583 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 45/370 (12%), Positives = 88/370 (23%), Gaps = 64/370 (17%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 +I+ + F +DL + R +Q LD AVL+ D+ + D S Sbjct: 41 MILVLMFALGGLGMDLVRMERDRTNLQYTLDRAVLAAADL---DQPLDPEAVVIDYMSKS 97 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 + + + + + + YE+ L L Sbjct: 98 GLSDYTTVVVPEVSPTAKRVKASVDTEFTAGWMNSIFYEDYMRNPDTYELEPITLPLL-- 155 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 + I +VLDVS SM + + + Sbjct: 156 -------------ASSTAVESIGNVEISLVLDVSGSMRSNNRLVNLKRAAKEFVQTMDDN 202 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKI-----DVLIESAGNLVNSIQKAIQEKKNLSVRI 238 S PA ++ + R Sbjct: 203 TEDGKMSISIVPYSTQVSMPAAFLDEMRVSDEHSYSNCINFDGSDFNTTGLNLSREYERT 262 Query: 239 GTIAYNIGI--------------------VGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 + +S+N+ +++S ++ ENT+ Sbjct: 263 MHFSVWNYYDYRDDDEHVRQPTCASDADNPERTALLMSDNVAQLQSYIDAFEHSENTSID 322 Query: 279 PAMHHAYRELYNEKE---------------------SSHNTIGSTRLKKFVIFITDGENS 317 M L + + + T K ++ +TDG+N+ Sbjct: 323 LGMKWGTALLDPSVQPVIATLANDANPNQSIEARYANRPVSYQDTETLKVIVMMTDGQNT 382 Query: 318 GASAYQNTLN 327 +N Sbjct: 383 AQYYIKNDYR 392 Score = 86.9 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 29/236 (12%), Positives = 72/236 (30%), Gaps = 25/236 (10%) Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKN--------TTKSKYAPAPAPANRKIDVLIESAG 218 + D + + ++ KN + + + + Sbjct: 363 SYQDTETLKVIVMMTDGQNTAQYYIKNDYREGLTPVWYNSEENVYSTYDPNRYGSDKYYW 422 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 + + G + S V L Sbjct: 423 HQTGQWEDHPYGNGTYQETYCDGREYYGRCYDG----SWKTRTVDEPGEALQLSYADLFA 478 Query: 279 -PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 ++ + YR+L+ + S+ + + ++ D TL +C+ + Sbjct: 479 ETSLRYLYRDLFGDWMSNASWY---WYNRLYSYVGD--------STKDSRTLAVCDAAKE 527 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G+ ++++ AP GQ +L++C S+ ++ V D E+ ++F I I++ + Sbjct: 528 KGIVVFTIGFEAPWRGQQVLQQCASSASHYYDV-DGLEISDAFASIASAIRQLRLT 582 >gi|329848522|ref|ZP_08263550.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] gi|328843585|gb|EGF93154.1| flp pilus assembly protein TadG [Asticcacaulis biprosthecum C19] Length = 486 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 54/460 (11%), Positives = 123/460 (26%), Gaps = 67/460 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + I + AID A + + Q ALD+AVL+ S+ Sbjct: 26 IFGLAIFAIMAALGTAIDFAVLQRAKRSTQDALDSAVLAAAIVNNSNEGDLKKLAADVFK 85 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKA--------------QINITKDKNNPLQYIAE 106 + + + Y + A + + Sbjct: 86 ENLGAADLDAKVTAFKYDAKARTVKATAQGSYDPVIMQLFGFKNLPYAVTSDAIKAADGT 145 Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 + + + + ++ + + + + + V + + + Sbjct: 146 LEVALVLDNTWSMSATVNGTPKIDILKTAAQGLVSTILTKDNKDYVKIAVVPYADYVNVG 205 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNT---------------------TKSKYAPAPAP 205 N N S K+ + +T T S + A Sbjct: 206 MANRNMPWVSVAADYSTTSTKTCKTVSTATQCTGGTKGTCTGNQDGVPYTYSCWIVAQTC 265 Query: 206 ANRKIDVLIES----------AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + G + N + + + + G + N PL Sbjct: 266 KTVNVTPYQSCSGGGTTNYKWYGCVKNQVASSKVVMPDPTTPYGGLVQTSQTCLNPILPL 325 Query: 256 SNNLNEVKSRLNKLNPYEN-----TNTYPAMHHAYRELYNEKE---SSHNTIGSTRLKKF 307 SN+ V + + L T M L + +K Sbjct: 326 SNDATVVTNTIKGLVVNIGGYKPETYIPGGMIWGVNALTPPAPFTEGKPYDANNKEPRKT 385 Query: 308 VIFITDGENSGASAYQNTLNTLQI-------------CEYMRNAGMKIYSVAVS-APPEG 353 ++ +TDG N+ + + C+Y ++ ++IY++ + Sbjct: 386 IVLMTDGANTLYANTSGGIAVANATQVAVTYSDQIRVCDYAKSKKIEIYTIGFDVTDSKA 445 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L+ C + +F S +L+++F+ I K+ + + Sbjct: 446 LSTLKACATDAQHYFDAKSSADLIKAFETIGGKLSKVRLT 485 >gi|283782262|ref|YP_003373017.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283440715|gb|ADB19157.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 395 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 50/404 (12%), Positives = 119/404 (29%), Gaps = 60/404 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQS----ALDAAVLSGCASIVSDRTIKDPTTK 56 + A ++ V +AID++++ +R+++++ A A L+ + D Sbjct: 23 LIAFLLVVVVCMAAFAIDVSYMQLVRSELRAATDAAAKAGTLALAKT-DGDAASARTAAI 81 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 + + + +A + ++ ++ + Sbjct: 82 QAAARNKVAGRALVLTTDQVQVGRSAAQANGTWSFTANQTPYTSVKILSSMSDSTAAGSV 141 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN---NN 173 LFL + + +T + IC+V+D S SM Sbjct: 142 PLFLGTFMGRGSFQPAQSATA------SQMEQEICLVIDRSHSMCFNMSGVEWSYPPGTK 195 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 T + PP S W L S +++I + + Sbjct: 196 TTPHTICYPPHATLSRW--------------------AALQSSVNLFMDTILETNNTPRV 235 Query: 234 LSVRI----------GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPA 280 + + + LS + VKS++ TN Sbjct: 236 ALITWGSTIGTNTAEYSYTKKTEVAVANELGLSTDYAAVKSKIAARTTKVMLGGTNMSAG 295 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + L + +I +TDG+ + + + E + G+ Sbjct: 296 IDAGRTLLNGNTVRALAKKT-------MILMTDGQWNQGR------DPIDAAEDAADEGI 342 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 +I+++ + + + + G+++ ++ EL E+F + Sbjct: 343 QIHTITFLSGSAQNTMRQVAEITGGKYYVSSNQAELEEAFRDLA 386 >gi|170724979|ref|YP_001759005.1| hypothetical protein Swoo_0614 [Shewanella woodyi ATCC 51908] gi|169810326|gb|ACA84910.1| conserved hypothetical protein [Shewanella woodyi ATCC 51908] Length = 503 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 51/476 (10%), Positives = 127/476 (26%), Gaps = 99/476 (20%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 ++ F + A++ + +N++ A +AA L+ + D + Sbjct: 21 VFLLIPLFGMVFLALEGTRYIQKKNRLGDATEAASLAVSMA-----NRDDKGYETQLAKD 75 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 ++ + E DI + + + QY +K ++ + + Sbjct: 76 YISSYMRNIKEISQVKVERKEDIDHYPMADGSFEDREYTQYRVTAKTEHTSWLHSDLIPS 135 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN----------- 171 + + + + I V D S SM+ + D Sbjct: 136 FKETETLANRALARA-YPEYLGDRDVDIVFVSDFSGSMKGSRINSLKDAITEISNEILVP 194 Query: 172 -----------------NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + + + + K+ + + Sbjct: 195 RDGETEIRNRIALVPYNMRVVEGDSGRSVCMTQLKYRNPSGKTGSNYTNYESINWREWAN 254 Query: 215 ESAGNLVNSIQKA--------------------IQEKKNLSVRIGTIAYNIGIVGNQCTP 254 +S + + + + + + N Sbjct: 255 KSYNQVSSCVSNSRKCNGLPGPRADARTIKSVVNDAPSSYRSSKWPDSSNWIDYSRSVDQ 314 Query: 255 LSN-NLNEVKSR------------------------------LNKLNPYENTNTYPAMHH 283 L N N N V+ +N+++P T+ Y + Sbjct: 315 LFNENSNNVQHHPSYQRLYSGSMCNGKFWTVPLTNQKSEIMKVNQMSPDGGTSVYQGLLR 374 Query: 284 AYRELYNEKESSHNTIGSTRLK---KFVIFITDG-ENSGASAYQNTLNTLQICEYMRNA- 338 + L + + N K ++ ++DG E+ S + +N +C +R Sbjct: 375 GAQILDKGRPVNPNEEELEEYNKRLKMILILSDGMESPYESTFSKLVNNYGMCNKIRAQF 434 Query: 339 -----GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + + + + GQ+ + C + + D +L + +I D I++ Sbjct: 435 NDGELPLHMGVIGIKFSASGQNAFKNCVGADN----IIDVNDLDDLIQEILDLIKK 486 >gi|295691296|ref|YP_003594989.1| TadE family protein [Caulobacter segnis ATCC 21756] gi|295433199|gb|ADG12371.1| TadE family protein [Caulobacter segnis ATCC 21756] Length = 531 Score = 116 bits (291), Expect = 5e-24, Method: Composition-based stats. Identities = 56/507 (11%), Positives = 124/507 (24%), Gaps = 116/507 (22%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVL-SGCASIVSDRTIKDPTTKKDQT 60 A++ + + IDL I R+QMQ ALDAA L + ++ V+D ++ Sbjct: 23 FALLAIPMSILVFALIDLGRISLQRHQMQDALDAATLMAARSTAVTDAELESVGDPAFLA 82 Query: 61 STIFKKQI-----------------------------------KKHLKQGSYIRENAGDI 85 +L S + ++ ++ Sbjct: 83 EIAGLNLGLSASNASFKAGAGNHIIGTATATVKPIIANLWTTDDFNLTATSDVVRSSKNL 142 Query: 86 AQKAQINITKDKNNPLQYIAESKAQ---------------YEIPTENLFLKGLIPSALTN 130 ++IT + ++ A ++ + + + Sbjct: 143 EVAVVLDITGSMSGSRITDLKTGASDLVDIVVKDQQAPFYSKVAIVPYSVGVNVGTYADA 202 Query: 131 LSLRSTGIIERSSENLAI-------------SICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 + + VS + + S Sbjct: 203 VRGAVIARTITGVSKTNAAVVASAAHGFIVGDKVTISGVSGPTMLNGNTYNITAASADSF 262 Query: 178 KYLLPPPPKKSFWS---------KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 + S N + A ++ L + Sbjct: 263 TINANTSNAPKYVSGGVATCDTSTNPGCLNFTFTSASNTKETRTLSTCVTERTGTYAYTD 322 Query: 229 QEK-------KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 + ++ TPLS++ +K ++N L+ +T Sbjct: 323 IAPSIAPVGRNYPTTKLENSMQPNPCPTATITPLSSDRVTLKGQINALSIGGSTAGQIGF 382 Query: 282 HHAYRELYNEK-------ESSHNTIGSTRLKKFVIFITDGENSGA--------------- 319 + + S L K V+ +TDG + Sbjct: 383 AWGWYMVSPNFGYLWPNATQRPAPYNSKDLVKVVVLMTDGAFNTPYCKGVIAKDAGSGSG 442 Query: 320 ----------SAYQNTLNTLQICEYMRNA--GMKIYSVAV--SAPPEGQDLLRKCTDSSG 365 + T ++C+ M++ + I++V ++L+ C + Sbjct: 443 AVDDHINCVATNGDAFTQTRKLCDAMKDPSLKLTIFTVGFDVGGDANAVNMLKYCATDAQ 502 Query: 366 QFFAVNDSRELLESFDKITDKIQEQSV 392 + EL +F I +I + Sbjct: 503 HVYFPATGSELKTAFKSIAQEISSLRI 529 >gi|120602151|ref|YP_966551.1| von Willebrand factor type A [Desulfovibrio vulgaris DP4] gi|120562380|gb|ABM28124.1| von Willebrand factor, type A [Desulfovibrio vulgaris DP4] Length = 420 Score = 116 bits (290), Expect = 6e-24, Method: Composition-based stats. Identities = 50/419 (11%), Positives = 124/419 (29%), Gaps = 74/419 (17%) Query: 17 IDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGS 76 ID + +++Q+A+DAA L+G + D + + + + + S Sbjct: 34 IDSGMLYLSHSRLQAAVDAAALAGSLQLPYDPQLDKG-----LVRGAVTQYMDANYPEAS 88 Query: 77 YIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST 136 G + T + + + ++T Sbjct: 89 LNGVTPGTEERSVT-------------------VTATATVPTIFMNALGIGSSEVHAKAT 129 Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK 196 + + + V+D S SM+ +Q+ N + + + Sbjct: 130 AGYNK------LEVVFVIDNSGSMKGTPIQQTNSAASQLVELIMPEGMMTSVKVGLVPFR 183 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT---IAYNIGIVGNQCT 253 K PA + D + G L S K + G+ + N + Sbjct: 184 GK-VHLPAGVDGLPDGCRNADGTLNPSWLHEEYFKTSYRYPSGSSLNVPKNTCTSIPRVQ 242 Query: 254 PLSNNLNEVKSRL---NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 L+ + + + + N L T + L E + + + ++K +I Sbjct: 243 GLTEDRETILTAISKQNGLGDASGTVISEGLKWGRHVLTPEAPFTEGS-SAKDIRKVIIV 301 Query: 311 ITDGENSGASAYQN-------------------------------TLNTLQICEYMRNAG 339 +TDG+ + L+ ++ AG Sbjct: 302 LTDGDTEDGKCGGSYAINYTPNAYWTNAFYGMLDMTSHCENGGKLNAAMLEEARKVKEAG 361 Query: 340 MKIYSVAVS-APPEGQDLLRKCTD----SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++++++ + L++ ++ ++ + ++ + F KI ++ + +R Sbjct: 362 IEVFAIRFGDSDSVDVSLMKSIASSKAGTNDHYYDAPSAYDIDDVFKKIGRQLGWRLLR 420 >gi|32471725|ref|NP_864718.1| hypothetical protein RB2055 [Rhodopirellula baltica SH 1] gi|32397096|emb|CAD72400.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 402 Score = 116 bits (289), Expect = 7e-24, Method: Composition-based stats. Identities = 47/399 (11%), Positives = 126/399 (31%), Gaps = 45/399 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + I++ V + Y I++A++ + Q DAAV + + Sbjct: 43 LLVIMLPVLLILAAYVINVAYVEAVTADSQVVTDAAVCAAGRVYIQTGDKNAALAAARDA 102 Query: 61 ST---IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + + K + ++ + + + D + + + + Sbjct: 103 AERNPVAGKVVPINMSDLEFGISLRESLDEGYSFQPLSDDDEFGNAVRLTTLSLSNSPQP 162 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 +F +L S + + +V+D S SM + + N + Sbjct: 163 VFSPLF---PTMGTNLEIRPQRVAVSTQSTMDVALVIDRSGSMAYANDEAPDPYVNPAAA 219 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 Y P +R +D++ S + + + Sbjct: 220 ----------------PPGWTYGDPVPPNSRWLDLV-ASVNAFNGFLADSP-----QYEK 257 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKES 294 + Y+ ++ L++ E+ ++L+ ++ T+ + H L + + Sbjct: 258 LCLATYSDNA--SRDCDLTHTYAEISNQLDAISYQFNGGGTSVGYGLEHGLAVLTDATHA 315 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + ++ +TDG ++ + ++ ++N G+ ++++ S + Sbjct: 316 R------KFAVRVMVLMTDGHHNTGKSPESMTYH------LQNHGVTLFTITFSDDADQS 363 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + G+ F D+ +L +F KI K+ + Sbjct: 364 RMSNLANACGGENFHATDASQLQNAFQKIAKKLPSLMTQ 402 >gi|89055932|ref|YP_511383.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] gi|88865481|gb|ABD56358.1| hypothetical protein Jann_3441 [Jannaschia sp. CCS1] Length = 612 Score = 116 bits (289), Expect = 7e-24, Method: Composition-based stats. Identities = 45/392 (11%), Positives = 89/392 (22%), Gaps = 67/392 (17%) Query: 1 MTA---IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK 57 +TA ++ + AID+ R+Q+Q LD AVL+ ++ P + Sbjct: 52 ITAFATMLFILMVGASGIAIDVMRYETQRSQLQYTLDRAVLAAA-------SLTQPYDPE 104 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 F I G + + + A+ E+ Sbjct: 105 GVVRDYFA------------IAGIDGYRLD------VRVEEGLNFRRVHAYAELEVR--- 143 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 + I + MVLD+S SM + + Sbjct: 144 SIFMQMFGVRAMTSPAIGAAEERVRR----IEVSMVLDISGSMGENNRMTNMRPAAREFV 199 Query: 178 KYLLPPPPKKSF----WSKNTTKSKYAPAP-----APANRKIDVLIESAGNLVNSIQKAI 228 +L + + + Sbjct: 200 TEVLSANENVNNELLVSVSIVPYNGRVNGGDLIESVFTYDDLHSESNCTRFAEADFTSTA 259 Query: 229 QEKKNLSVRIGTIAYNI-------------GIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + RI P + + + ++ LN T Sbjct: 260 IDPAVPLQRIAHWDRGNEEEDESFQWAHCQTDQYGAILPWQHTEAALHAHIDSLNTGGWT 319 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 M+ A L + + + + S + Sbjct: 320 AIDLGMNWAVGLLDPAAAPALTG----------LIASGHVHPEFSDRPAPYRDGDRATTI 369 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 + +K+ + Q LR DS+G++ Sbjct: 370 DDETIKVVVLMTDGDNTRQYDLRDIYDSAGRY 401 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 39/300 (13%), Positives = 75/300 (25%), Gaps = 20/300 (6%) Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 + T L + L + D D D+ Sbjct: 312 SLNTGGWTAIDLGMNWAVGLLDPAAAPALTGLIASGHVHPEFSDRPAPYRDGDRATTIDD 371 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAP-----APAPANRKIDVLIESAGNLVNSIQK 226 + + + + ++ S AP R S Sbjct: 372 ETIKVVVLMTDGDNTRQYDLRDIYDSAGRYVGFREGYAPIFRNTSTGQYSIWWEDQGAFW 431 Query: 227 AIQEKKNLSVRIGTIAYN--IGIVGNQCTPLSNNLNEVKSRLNK------LNPY---ENT 275 + G + + L ++T Sbjct: 432 IPTGNTRDPGGSWQAQPDGGWSRYGMTALEFTEDRANAFDPAEAGNGEVLLWADLFSDHT 491 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI-FITDGENSGASAYQNTLNT--LQIC 332 Y A + ++ + I + +DG Q+ +T + IC Sbjct: 492 AGYIAAEWFHAPADESEQWDFYNQLAENPSHNYIGWDSDGVRPDGVVGQSQADTNLMAIC 551 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + AG+ +Y++ AP GQ ++ C +F V RE+ E+F I I + + Sbjct: 552 DVANAAGIIVYAIGFEAPDRGQRVMEHCASVDANYFDVE-GREISEAFASIARSINQLRL 610 >gi|197335948|ref|YP_002155278.1| hypothetical protein VFMJ11_0524 [Vibrio fischeri MJ11] gi|197317438|gb|ACH66885.1| conserved hypothetical protein [Vibrio fischeri MJ11] Length = 463 Score = 116 bits (289), Expect = 7e-24, Method: Composition-based stats. Identities = 52/469 (11%), Positives = 125/469 (26%), Gaps = 94/469 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + I V F T A D A + + +++ A +AAVL+ A + + + Sbjct: 14 LFVMCIPVLFGVFTLASDGARALQSKARLEDAAEAAVLAVSA----------YGEEDEVS 63 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN---PLQYIAESKAQYEIPTEN 117 + K + +L S + + + + +++ +N ++Y + ++ Sbjct: 64 TQTGKDYVAHYLHDMSSLVDIKVEKLECSELPECTADDNDRPFVEYQVSGRTKHISWFPG 123 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 + + + + + + + I +LD S SM + Sbjct: 124 NDVTVGFGES---FDVTGSSKARKFQSSQPMDITFILDFSGSMNYDWEGHAPSYMEEEIP 180 Query: 178 KYLLPPPPKKSFWS---------------KNTTKSKYAPAPAPANRKIDVLIESAGNL-- 220 K P N+T + V S G Sbjct: 181 KVPGRYSPPSRLSDLKYVVQMVTDELQVYNNSTAGPKHRVAMTGYNRRTVNESSNGKFVI 240 Query: 221 ----VNSIQKAIQEKKNLSVRIG-------------TIAYNIGIVGNQCTPLSNNLNEVK 263 + + + + +++ Sbjct: 241 RDQRITKYNSDGYDAGDTFYPKKTINKQFMVKGAAARVPNGDEKAEFTDIMYTSDFASFN 300 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG-ENSGASAY 322 ++ + T + + A + + + K+ +I ++DG + + Sbjct: 301 HKIKSFEAFGGTASLQGIIRASQIVSYHITNDGEEAN---PKQLIIILSDGEDFNHYLGQ 357 Query: 323 QNTLNTLQICEYMRNA----------------------------------GMKIYSVAV- 347 TL +C+ +RNA ++I + Sbjct: 358 TETLVDYGMCDNLRNAIEGGPVSSEDNKEDKIVFSSGSSSGLPTNTGEDPSVRIAMIGFG 417 Query: 348 -SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 L C F+ N+ E+ + I + E+ +A Sbjct: 418 DGYDIHANTGLLNCV-GEENAFSANNKDEI---LNLIMSLVSEEVGHLA 462 >gi|319780897|ref|YP_004140373.1| hypothetical protein Mesci_1159 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317166785|gb|ADV10323.1| hypothetical protein Mesci_1159 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 492 Score = 116 bits (289), Expect = 8e-24, Method: Composition-based stats. Identities = 52/463 (11%), Positives = 124/463 (26%), Gaps = 74/463 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + SV L + ++++++ + R+ +Q +DAAV S + + + K Q Sbjct: 25 LFGLSASVLALAVGFSVNVSQLYNARSSLQGVVDAAVTSTARDLTTGAIKEADANKSVQA 84 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP-------------------- 100 Q + + K Sbjct: 85 FLDANSQAGILQADQIVLDRLIVNRTAKTVQADAHVDVGLYFPIFGTGDMKRVAASTTAL 144 Query: 101 -LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 E +I + + ++ ++ V V + Sbjct: 145 YSDKTVEVAMMLDITGSMAKRGKVDKIGDLKTAAKNAVQTMLQKQDPQNPRIRVAIVPYA 204 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPK------KSFWSKNTTKSKYAPAPAPANRKIDVL 213 + + LPP K+ + + S Y A + D Sbjct: 205 SGVNAGKLAENVYAEKQASTELPPVAGSPLLVAKTGKNLLPSFSDYISIVGAAMPRPDNC 264 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYN--------------IGIVGNQCTPLSNNL 259 + + + + + + PL+ + Sbjct: 265 ATERKDKNGNADMSADGPDTVRTDGNGKKFYALVNRDDHLGDGDMNRCPDAKVIPLTADS 324 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT---------IGSTRLKKFVIF 310 + + + T A+ Y L + ++ ++ K I Sbjct: 325 DALLESIEDFRANGFTAGAIAIQWTYYMLSPQWRTAIRNAGLGKGASDADPKKIAKVAIL 384 Query: 311 ITDGENSG---ASAYQNTL-------NTLQICEYMRNAGMKIYSVAVSAPPEG------- 353 +TDG+ + + N +C+ M+N G++I+++ + Sbjct: 385 MTDGQFNTAFAGAGDSYNRQGTLARGNAETLCDNMKNDGIEIFTIGFDLDDKDMSTTERD 444 Query: 354 --QDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQE 389 + +L+ C+ FF V+ EL ++F +I ++ Sbjct: 445 QAKAVLKDCSSKDTSGAKRHFFDVSTGAELDDAFQEIIRNTEK 487 >gi|222147837|ref|YP_002548794.1| hypothetical protein Avi_1104 [Agrobacterium vitis S4] gi|221734825|gb|ACM35788.1| conserved hypothetical protein [Agrobacterium vitis S4] Length = 483 Score = 115 bits (287), Expect = 1e-23, Method: Composition-based stats. Identities = 59/466 (12%), Positives = 144/466 (30%), Gaps = 75/466 (16%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSA-----------------LDAAVLSGCAS 43 M+AI++ L + A+D + RN +Q +D+ L+ Sbjct: 21 MSAILLMPLLLAVGAAVDYSSARDHRNDIQVTADSAILAAASSYSSSSGVDS--LAAGID 78 Query: 44 IVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGS---------YIRENAGDIAQKAQINIT 94 D + D + T+ + K+ L + + +A ++++ Sbjct: 79 SYLDSKLTDQGSNDVDTAAVPKRLSGPTLSADGKEICIVVGEGVPTSFMQLAGVKTVDVS 138 Query: 95 KDKNN--PLQYIAESKAQYEIPTENLFLKGLIPSAL----TNLSLRSTGIIERSSENLAI 148 P E ++ + + +P S + + + S+ Sbjct: 139 AKSCAALPGNIDLEVSLVLDVSSSMIEEGRFVPMQTAVKSFLTSFANDATVAKRSKIAIA 198 Query: 149 SICMVLDVSRSMEDLYL----QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 ++ + +D + Y S W N T Y + Sbjct: 199 PFSSRFNIGLTHKDWLKAYGGNDAVPSRWTDPKSYYKDSKYSFSQWIDNVTTLAYTSSNY 258 Query: 205 PA----NRKIDVL-IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + DV ++ + A + + + PL+++ Sbjct: 259 YWIGCVEPRADVEMKDNGAIGTYGLSDAPPSTEAFVAQDYNTGSSTSFCPPPIVPLTSSF 318 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG-----STRLKKFVIFITDG 314 + ++S + + +T M + L + S+ S ++KK ++F+TDG Sbjct: 319 STLQSAIADMTSEGSTRLDAGMLAGWYTLSPKWRSAWGGGTAPADYSEKVKKVIVFMTDG 378 Query: 315 ENSGASAYQNTLNT-------------------------LQICEYMRNAGMKIYSVAVSA 349 E + + + L C+ +++ ++IY+++ S+ Sbjct: 379 EMNVKFGSTDPAKSSTEKLDWICDKNRTKSCNDTATNALLTTCDSIKSNNIEIYAISYSS 438 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 L+ C S +++ + + + + I+ I +VR+ Sbjct: 439 EA-DVQNLQTC-SSGTKYYFSASTTNIKDVYTAISKNIIGSTVRLT 482 >gi|316931543|ref|YP_004106525.1| hypothetical protein Rpdx1_0148 [Rhodopseudomonas palustris DX-1] gi|315599257|gb|ADU41792.1| Protein of unknown function DUF2134, membrane [Rhodopseudomonas palustris DX-1] Length = 443 Score = 115 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 52/429 (12%), Positives = 127/429 (29%), Gaps = 54/429 (12%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+++ I A+D + +R ++Q+A DAA + + + Sbjct: 21 IFALVLIPLISAIGCAVDYSRANALRTKLQAAADAASVGAVSRTSPAYIAAGSMSTDGAI 80 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 ++ ++ + G ++K ++ +I + Sbjct: 81 TSGADDALR---IFNGNLANLTGYTLDSVAATVSKSGE---AVTSKVTFSAQI---STMF 131 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + + + ST +S I ++LD S SM + Sbjct: 132 MKAVAVSSMTVGGVSTA---TASMPKYIDFYLLLDNSPSMGVGATPTDVAAMVAATANKS 188 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPA-PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 + N + Y A A +IDVL + L+++ + I Sbjct: 189 SDDHCAFACHDVNNKNNYYNLAKALGITTRIDVLRSATQQLMDTAAATATYTNQFRMAIY 248 Query: 240 TIAYNIGIV-GNQCTPLSNNLNEVKSRLNKLNPYE--NTNTYPAMHHAYRELYNEKES-- 294 + LS +L+ K+ ++ N Y + + Sbjct: 249 DFGASAQTAGLRNLFALSASLSSAKTAAGAIDLMTVKGQNDNNDQDTQYTAILPAIDKLI 308 Query: 295 -SHNTIGSTRLKKFVIFITDGENSGASAY---------QNTLNTLQICEYMRNAGMKI-- 342 + T + K++ F++DG + + +C+ +++ G+++ Sbjct: 309 AAPGTGAAGSPLKYLFFVSDGVADEYNPACLKPKTGNRCQSPINPALCKTLKDRGVRVAV 368 Query: 343 -YSVAVSAPPEG----------------------QDLLRKCTDSSGQFFAVNDSRELLES 379 Y+ + P + C G +F V+ ++ + E+ Sbjct: 369 LYTTYLDLPSNDWYKKWIAPFNEGPYGPSPNSEIAKNMEACAS-PGFYFEVSPTQGIAEA 427 Query: 380 FDKITDKIQ 388 + + + Sbjct: 428 MNALFKRAV 436 >gi|192288907|ref|YP_001989512.1| hypothetical protein Rpal_0477 [Rhodopseudomonas palustris TIE-1] gi|192282656|gb|ACE99036.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 443 Score = 115 bits (286), Expect = 2e-23, Method: Composition-based stats. Identities = 55/431 (12%), Positives = 132/431 (30%), Gaps = 58/431 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+++ + A+D + +R+++Q+A DAA + + + Sbjct: 21 IFALVLVPLISAVGCAVDYSRANALRSKLQAAADAASVGAVSRTSPAYVAAGAMSGDGVI 80 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S+ ++ + G + +TK + ++ +I + Sbjct: 81 SSGADDALR---IFNGNLNGLTGYTLASSSATVTKASD---VVTSQVTFSAQI---STMF 131 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ + + ST +S I ++LD S SM + Sbjct: 132 MKVVGMSAMTVGGTSTA---TASMPKYIDFYLLLDNSPSMGVGATPTDVSAMIAATANKS 188 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPA-PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 + N + Y A A +IDVL + L+++ R+ Sbjct: 189 SDDHCAFACHDVNNKNNYYNLAKALGITTRIDVLRSATQQLMDTASATATYSNQF--RMA 246 Query: 240 TIAYNIGIVGNQCTPL-----SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + L S + + + L + N +Y ++ + Sbjct: 247 IYDFGASAQTAGLRNLFSLSASLSSAKTAASAIDLMTVKGQNDNNDQDTSYTAIFPAINN 306 Query: 295 SH---NTIGSTRLKKFVIFITDGENSGASAY---------QNTLNTLQICEYMRNAGMKI 342 + S +K++ F++DG + + +C+ +++ G+KI Sbjct: 307 EISSPGSGVSGSPQKYLFFVSDGVADEYNPSCLKPKTGNRCQSPINPALCKTLKDRGIKI 366 Query: 343 ---YSVAVSAPPEG----------------------QDLLRKCTDSSGQFFAVNDSRELL 377 Y+ ++ P + C G +F V+ ++ + Sbjct: 367 AVLYTTYLNLPSNDWYKKWIAPFNAGPYGPSPNSEIAQNMEACAS-PGFYFEVSPTQGIA 425 Query: 378 ESFDKITDKIQ 388 E+ + + K Sbjct: 426 EAMNALFKKAV 436 >gi|86356688|ref|YP_468580.1| hypothetical protein RHE_CH01044 [Rhizobium etli CFN 42] gi|86280790|gb|ABC89853.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 445 Score = 114 bits (285), Expect = 2e-23, Method: Composition-based stats. Identities = 59/437 (13%), Positives = 125/437 (28%), Gaps = 61/437 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTA+++ A+D A + +R ++ + A + +I + Q Sbjct: 16 MTALLVVPLLGAAGTAVDFASALSLRTELYA---------AADAAAVGSITPTSEAAAQA 66 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +T+ K + + ++ +T D + + S T Sbjct: 67 NTMSGDGSLTLGKSEAQKIFFSQMSKKQGDAPVTVDISVQKKGDTLSSTVSFNATMPTTF 126 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ ++ +T + M+LD + SM + Sbjct: 127 MQVMGFDEIAVTGAATAQ---YQTPSYMDFFMLLDNTPSMGVAATTDDITAMKKATANGH 183 Query: 181 LPPPPKKSFWSKNT---------TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 K ++ + +IDV+ + L+ + Sbjct: 184 DGGKDKNCAFACHIVSEKGVEDKNSYYNVARNNGVTIRIDVVASAVKALMAKAKDTQSMP 243 Query: 232 KNLSVRIGTIAYNIGIVGN----QCTPLSNNLNEVKSRLN--KLNPYENTNTYPAMHHAY 285 V T + + L+ +L V + N KL N Y ++ Sbjct: 244 SQFRVAAYTSGKTAQDAKAAKLFKVSDLNYDLGAVAAAANMIKLMSIPYQNYYSDQQTSF 303 Query: 286 RELYNEKESS-----HNTIGSTRLKKFVIFITDGENSGASAYQNTL------------NT 328 E E + +K V F+ DG T Sbjct: 304 DEALKGIEGEIKGNIGTGTSNADRQKIVFFVADGVGDSYKPTGCTSPKGANGGRCIEPID 363 Query: 329 LQICEYMRNAGMKI---YSVAVSAPPEG-------------QDLLRKCTDSSGQFFAVND 372 C+ +++ G+K+ Y+ + P G + +C + G +FAV+ Sbjct: 364 TTYCKKLKDRGIKVAVLYTTYLPLPDNGFYKDWVKPFETRIAAKMEECA-TPGFYFAVSP 422 Query: 373 SRELLESFDKITDKIQE 389 + + E+ + + KI Sbjct: 423 TEGIEEAMEALFRKIVS 439 >gi|303248312|ref|ZP_07334574.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] gi|302490337|gb|EFL50249.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] Length = 452 Score = 114 bits (285), Expect = 2e-23, Method: Composition-based stats. Identities = 54/470 (11%), Positives = 131/470 (27%), Gaps = 111/470 (23%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A + + A+DL + N++Q+A+DAA L+G + D + + + T+ Sbjct: 3 VAATLVGLMAAVGVAVDLGRVYVAHNKLQNAVDAAALAGSLQLPDDPDVDNGKVSQAVTT 62 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + T + A+ ++ + L Sbjct: 63 NLAAND---------------------PEAKATDISSGGATRSVCVTAEADVD---MTLS 98 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 ++ T ++ + I + MVLD + SM + + + + Sbjct: 99 KVVGLDATTVTAEACAGYN------DIELVMVLDATGSMRGTPIANVKEAAANLVDLIMP 152 Query: 182 PPPPKKSFWSKNTTKSKYAP--APAPANRKID------VLIESAGNLVNSIQKAIQEKKN 233 P + D + G L + K Sbjct: 153 DSGANTRSKIGLVPFQGKVRIDGNDPVTAERDPDGVGAGCRNADGTLNDGKLKTEYSDTR 212 Query: 234 L-------SVRIGTIAYNIGIVG--NQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAM 281 G Y + LS++ + + +N T + Sbjct: 213 SRNSIFYGYTISGVSTYYDRTCSGMSPIRALSSDKEAILDNIGAINAGAVTSGTLISEGI 272 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE-------------------------- 315 ++ L + + ++K +I +TDG+ Sbjct: 273 KWGHKVLSPKAPYTEGNTDKK-VRKIMIVLTDGDTEDGRCGGRYASASRTVNAYWTNAYF 331 Query: 316 -----NSGASAYQNTLNTLQI---------------------CEYMR---NAGMKIYSVA 346 + AS+ +TL+T + + + ++I+++ Sbjct: 332 GQGLRPNSASSPYDTLSTASATLAQIPDCTDGGKLNQYVLDEADDAKNDADYPVEIFAIR 391 Query: 347 VS-APPEGQDLLRKCTD----SSGQFFAVNDSRELLESFDKITDKIQEQS 391 + L+++ + ++ DS ++ + F KI ++ ++ Sbjct: 392 FGDSDATDISLMKRIASSKSGTDDHYYDAPDSSDIKDMFKKIGQQLGQRL 441 >gi|39933553|ref|NP_945829.1| hypothetical protein RPA0476 [Rhodopseudomonas palustris CGA009] gi|39647399|emb|CAE25920.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] Length = 443 Score = 114 bits (285), Expect = 3e-23, Method: Composition-based stats. Identities = 55/431 (12%), Positives = 132/431 (30%), Gaps = 58/431 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+++ + A+D + +R+++Q+A DAA + + + Sbjct: 21 IFALVLVPLISAVGCAVDYSRANALRSKLQAAADAASVGAVSRTSPAYVAAGAMSGDGVI 80 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S+ ++ + G + +TK + ++ +I + Sbjct: 81 SSGADDALR---IFNGNLNGLTGYTLASSSATVTKASD---VVTSQVTFSAQI---STMF 131 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ + + ST +S I ++LD S SM + Sbjct: 132 MKVVGMSAMAVGGTSTA---TASMPKYIDFYLLLDNSPSMGVGATPTDVSAMIAATANKS 188 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPA-PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 + N + Y A A +IDVL + L+++ R+ Sbjct: 189 SDDHCAFACHDVNNKNNYYNLAKALGITTRIDVLRSATQQLMDTATATATYSNQF--RMA 246 Query: 240 TIAYNIGIVGNQCTPL-----SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + L S + + + L + N +Y ++ + Sbjct: 247 IYDFGASAQTAGLRNLFSLSASLSSAKTAASAIDLMTVKGQNDNNDQDTSYTAIFPAINN 306 Query: 295 SH---NTIGSTRLKKFVIFITDGENSGASAY---------QNTLNTLQICEYMRNAGMKI 342 + S +K++ F++DG + + +C+ +++ G+KI Sbjct: 307 EISSPGSGVSGSPQKYLFFVSDGVADEYNPSCLKPKTGNRCQSPINPALCKTLKDRGIKI 366 Query: 343 ---YSVAVSAPPEG----------------------QDLLRKCTDSSGQFFAVNDSRELL 377 Y+ ++ P + C G +F V+ ++ + Sbjct: 367 AVLYTTYLNLPSNDWYKKWIAPFNAGPYGPSPNSEIAQNMEACAS-PGFYFEVSPTQGIA 425 Query: 378 ESFDKITDKIQ 388 E+ + + K Sbjct: 426 EAMNALFKKAV 436 >gi|254440702|ref|ZP_05054195.1| hypothetical protein OA307_117 [Octadecabacter antarcticus 307] gi|198250780|gb|EDY75095.1| hypothetical protein OA307_117 [Octadecabacter antarcticus 307] Length = 590 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 51/361 (14%), Positives = 89/361 (24%), Gaps = 75/361 (20%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 +++ + F A+DL R ++Q +LD A L+ + D Sbjct: 36 MMMVMILWFGGMAVDLMRYETTRAKLQGSLDRATLAAA-------DLDQVMAPADVVRDY 88 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 K H QG I I + A +P L + Sbjct: 89 MDKAGMLHFLQGDPI-----------------VDQGINYRIVTANASAPMPLFFYDLPKV 131 Query: 124 I------PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 + +S ST S +++ VLDVS SM + Sbjct: 132 FSSPFTPGMSSLTVSGSSTAEERVSDVEISL----VLDVSSSMNSNNRMTNLRPAAREFV 187 Query: 178 KYLLPPPPKK----SFWSKNTTKSKYAP----APAPANRKIDVLIESAGNLVNSIQKAI- 228 +L S + P AP + Sbjct: 188 TTVLANNTNAPQGLITISMIPYSAVVNPGTDIAPHLNINRTHEYSTCPMFDDTEFTTTAL 247 Query: 229 -------------QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 N + N P + N ++ + +N L+ Y NT Sbjct: 248 NLGASYDHVSHFSYGGSNDMPINPNYTWCFAGDLNAIKPHTTNEADLHTAINNLHAYGNT 307 Query: 276 NTYPAMHHAYRELYNEKE-------------------SSHNTIGSTRLKKFVIFITDGEN 316 + L + + + K ++ +TDG+N Sbjct: 308 AIDMGVKWGVALLDSSTQSLISSLAGASGTGVPAIANGRPELHTQADVLKVLVLMTDGQN 367 Query: 317 S 317 + Sbjct: 368 T 368 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 41/214 (19%), Positives = 80/214 (37%), Gaps = 20/214 (9%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI-----VGNQCT 253 Y +P +D + + ++ + + + + R Y G + Sbjct: 377 YKSGMSPVWFDLDDVNQPLWDVDFNKTSVQYQGEATNSRWDDWFYWNGYSGTLRYRDYPN 436 Query: 254 PLSNNLNEVKSR-LNKLNPYENT----NTYPAMHHAYRELYNEKE----SSHNTIGSTRL 304 ++ L V + ++ P E T N H ++++L+ ++ ++H + + Sbjct: 437 GFNSRLAYVNASPVDASGPGEGTRYVDNGDELYHASWQQLFADRSYFLINNHYFLDAYYA 496 Query: 305 KKFV---IFITDGENSGASAYQNTLNTL--QICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + + TD + NT IC R G+ IY+VA AP GQ L+ Sbjct: 497 GAWSWNEYWGTDNSIDHLIVNGSEANTRLSNICAAARAQGIVIYTVAFEAPSGGQTALQD 556 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 C SS +F V D ++ +F I I+ + Sbjct: 557 CASSSSHYFDV-DGTDISGAFSAIASDIRNLKLT 589 >gi|254504856|ref|ZP_05117007.1| hypothetical protein SADFL11_4895 [Labrenzia alexandrii DFL-11] gi|222440927|gb|EEE47606.1| hypothetical protein SADFL11_4895 [Labrenzia alexandrii DFL-11] Length = 455 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 59/471 (12%), Positives = 124/471 (26%), Gaps = 108/471 (22%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +TA+ L ++D+ + + ++QS LD+A L+ + + Sbjct: 12 LTALAFVPLMLITIGSLDVVRMTTAQAKLQSTLDSATLAAASLSNT-----------ADI 60 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + I+ +L + + D N + E + + Sbjct: 61 EDTVDEYIQANLPDTAPWTTLKLTMGD------VTDSLNAKSVEITATVDIE-----MTI 109 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L T++ S I + +VLD+S SM + + + L Sbjct: 110 LKLAGIDKTSVLASSVAQQAAQ----NIEVSVVLDISSSMGGSKITSLREAAKGFIDTML 165 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL------ 234 K+ + + S +++S A Sbjct: 166 KEDEDKEYTSLSIIPFGGTVNIGDFYDT--YAVNSSTPGVIDSPSSANYYVNKNVPYGKF 223 Query: 235 ---SVRIGTIAYNIGIVGNQCTPL-----------------------------SNNLNEV 262 + R G I Y P SNN ++ Sbjct: 224 MFSTEREGCIEYTDDDFDMAAIPANSRPQVPDFTKWVATNPWCPSEDSAMVLNSNNTTDL 283 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKES--------SHNTIGSTRLKKFVIFITDG 314 K+ ++ ++ + T + L K + +TDG Sbjct: 284 KALIDDMDLSDGTGMDIGALWGAKVLSGSMRGQLGGDFSDRPADFNDEDTLKVAVIMTDG 343 Query: 315 ----------ENSGASAYQNTLNT---------------------LQICEYMRNAGMKIY 343 + T T ++CEY+ + +++Y Sbjct: 344 AITAQFRPRDYTTTGKIKNKTQQTIVSKGNINTASTKADDAVAYFKRVCEYLNDNNVQVY 403 Query: 344 SVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 ++ LL+ C S ++ V + ++F+ I + V Sbjct: 404 TIGFQINSGSLPDQLLKYCASSLSNYYFVEGLN-IEDAFNAIASAVNNLRV 453 >gi|332558842|ref|ZP_08413164.1| hypothetical protein RSWS8N_07295 [Rhodobacter sphaeroides WS8N] gi|332276554|gb|EGJ21869.1| hypothetical protein RSWS8N_07295 [Rhodobacter sphaeroides WS8N] Length = 566 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 43/363 (11%), Positives = 90/363 (24%), Gaps = 81/363 (22%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 ++ + + A+D+ + R ++Q LD AVL+ ++ + + Sbjct: 30 FMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAA-------SLTQSRSPAEVVRDY 82 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 K + + N ++ + A Y +PT L Sbjct: 83 VAKAGLEDYLDEPVVNANTLNV-----------------RSVTATAAYSMPTV---FMKL 122 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ-----KHNDNNNMTSNK 178 + ST S+ +++ VLD+S SM + Sbjct: 123 LDIDRLEAPAVSTAEERVSNVEISL----VLDMSNSMVTDGTNPRDRLDNLKVAARDFID 178 Query: 179 YLLPPPPKK-----SFWSKNTTKSKYAPAP------APANRKIDVLIESAGNLVNSIQKA 227 ++ + A P + Sbjct: 179 IVMAGANSGLDGAPVISVSIVPYTGQVNAGADLLATYPNVSHRQPYSSCVEFAASDFTTT 238 Query: 228 IQEKK------------NLSVRIGTIAYNIGIVGN-----QCTPLSNNLNEVKSRLNKLN 270 + S Y TP S++ +K+ +++L+ Sbjct: 239 ALANGAPLTGSGNSELFSSSSSTQAPTYYWCPEETAAGNPTVTPFSHDPEALKAAIDRLS 298 Query: 271 PYENTNTYPAMHHAYRELYNEKE-----------------SSHNTIGSTRLKKFVIFITD 313 +T M L + S + K V+ +TD Sbjct: 299 GEGSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVNGAFAGRPLAYQSGNVMKVVVLMTD 358 Query: 314 GEN 316 G++ Sbjct: 359 GQH 361 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 20/70 (28%), Positives = 32/70 (45%), Gaps = 1/70 (1%) Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 T QIC+ R G+ +YSVA A GQ LL+ C ++G ++ ++ F I Sbjct: 497 KNERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYY-ATVGPQIRTVFHSI 555 Query: 384 TDKIQEQSVR 393 I + + Sbjct: 556 ASHITQLRLT 565 >gi|126462813|ref|YP_001043927.1| hypothetical protein Rsph17029_2052 [Rhodobacter sphaeroides ATCC 17029] gi|126104477|gb|ABN77155.1| conserved hypothetical protein [Rhodobacter sphaeroides ATCC 17029] Length = 566 Score = 114 bits (284), Expect = 3e-23, Method: Composition-based stats. Identities = 41/363 (11%), Positives = 88/363 (24%), Gaps = 81/363 (22%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 ++ + + A+D+ + R ++Q LD AVL+ ++ + + Sbjct: 30 FMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAA-------SLTQSRSPAEVVRDY 82 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 K + + N ++ + A Y +PT L Sbjct: 83 VAKAGLEDYLDEPVVNANTLNV-----------------RSVTATAAYSMPTV---FMKL 122 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ-----KHNDNNNMTSNK 178 + ST S+ +++ VLD+S SM + Sbjct: 123 LDIDRLEAPAVSTAEERVSNVEISL----VLDMSNSMVTDGTNPRDRLDNLKVAARDFID 178 Query: 179 YLLPPPPKK-----SFWSKNTTKSKYAPAP------APANRKIDVLIESAGNLVNSIQKA 227 ++ + A P + Sbjct: 179 IVMAGANSGLDGAPVISVSIVPYTGQVNAGADLLATYPNVSHRQPYSSCVEFAASDFTTT 238 Query: 228 IQEKK-----------------NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 + TP S++ +K+ +++L+ Sbjct: 239 ALANGATLTGSGNSELFSSSSSTQTPTYYWCPEETAAGNPTVTPFSHDPEALKAAIDRLS 298 Query: 271 PYENTNTYPAMHHAYRELYNEKE-----------------SSHNTIGSTRLKKFVIFITD 313 +T M L + S + K V+ +TD Sbjct: 299 GEGSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVNGAFAGRPLAYQSGNVMKVVVLMTD 358 Query: 314 GEN 316 G++ Sbjct: 359 GQH 361 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 20/70 (28%), Positives = 32/70 (45%), Gaps = 1/70 (1%) Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 T QIC+ R G+ +YSVA A GQ LL+ C ++G ++ ++ F I Sbjct: 497 KNERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYY-ATVGPQIRTVFHSI 555 Query: 384 TDKIQEQSVR 393 I + + Sbjct: 556 ASHITQLRLT 565 >gi|77463970|ref|YP_353474.1| hypothetical protein RSP_0399 [Rhodobacter sphaeroides 2.4.1] gi|77388388|gb|ABA79573.1| conserved hypothetical protein [Rhodobacter sphaeroides 2.4.1] Length = 566 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 43/363 (11%), Positives = 89/363 (24%), Gaps = 81/363 (22%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 ++ + + A+D+ + R ++Q LD AVL+ ++ + + Sbjct: 30 FMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAA-------SLTQSRSPAEVVRDY 82 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 K + N ++ + A Y +PT L Sbjct: 83 VTKAGLADYLDEPVVNANTLNV-----------------RSVTATAAYSMPTV---FMKL 122 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ-----KHNDNNNMTSNK 178 + ST S+ +++ VLD+S SM + Sbjct: 123 LDIDRLEAPAVSTAEERVSNVEISL----VLDMSNSMVTDGTNPRDRLDNLKVAARDFID 178 Query: 179 YLLPPPPKK-----SFWSKNTTKSKYAPAPAPANRKIDVLIE------SAGNLVNSIQKA 227 ++ + A A + + Sbjct: 179 IVMAGANSGLDGAPVISVSIVPYTGQVNAGADLLSTYPNVSHRQPYSSCVEFAASDFTTT 238 Query: 228 IQEKK------------NLSVRIGTIAYNIGIVGN-----QCTPLSNNLNEVKSRLNKLN 270 + S Y TP S++ +K +++L+ Sbjct: 239 ALANGAPLTGSGNSELFSSSSSTQAPTYYWCPEETAAGNPTVTPFSHDPEALKLAIDRLS 298 Query: 271 PYENTNTYPAMHHAYRELYNEKE-----------------SSHNTIGSTRLKKFVIFITD 313 +T M L + S + K V+ +TD Sbjct: 299 GEGSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVNGAFAGRPLAYQSGNVMKVVVLMTD 358 Query: 314 GEN 316 G++ Sbjct: 359 GQH 361 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 20/70 (28%), Positives = 32/70 (45%), Gaps = 1/70 (1%) Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 T QIC+ R G+ +YSVA A GQ LL+ C ++G ++ ++ F I Sbjct: 497 KNERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYY-ATVGPQIRTVFHSI 555 Query: 384 TDKIQEQSVR 393 I + + Sbjct: 556 ASHITQLRLT 565 >gi|114571146|ref|YP_757826.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] gi|114341608|gb|ABI66888.1| Flp pilus assembly protein TadG [Maricaulis maris MCS10] Length = 500 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 50/476 (10%), Positives = 131/476 (27%), Gaps = 84/476 (17%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + ++++ + A+D+ + R ++ +ALDAA L+ T + Sbjct: 23 LFSLMLIPITVLSGGAVDINQALNARARLSAALDAAALAVGVHTSVSETEAAGIASEFIA 82 Query: 61 STIFKKQIK--KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA--------ESKAQ 110 + +++ ++ ++ + ++++ ++YI +++ Sbjct: 83 ANYPDRELGLVGNIVVQLDPDQDRVTVGAESRVETIVLGLIGIEYITVHWESEVQRARSS 142 Query: 111 YEIPTENLFLKGLIPSALTNLSLRS--------TGIIERSSENLAISICMVLDVSRSMED 162 E+ + S +++L G + + ++V E Sbjct: 143 LELVMVLDNTGSMGGSKISSLRSAGLLLTDILFDGADPNRLKIGLVPFSATVNVGTWHER 202 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS------------------------- 197 + N + + + + + S Sbjct: 203 AWWLDANAQSPLHAENFDPAANRWDLYDSLQNRAWEGCVEARAIPHDIEDTAPDTGYPET 262 Query: 198 ----KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA-----YNIGIV 248 +AP + + + + + + I G Sbjct: 263 LFLPYFAPDESNYANNAGYANSYLNDGMGGSNERARMRNTPKYTNAWINSSSRGPEWGCT 322 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE-SSHNTIGSTRLKKF 307 TPL+N N + + + TN + R L + + K Sbjct: 323 ARPITPLTNQRNVIDDAIEDMIASGTTNIPIGISWGVRVLSPGMPFTEGVSYDEEGTIKA 382 Query: 308 VIFITDG--------ENSGASAYQN----------------------TLNTLQICEYMRN 337 ++ +TDG + + T CEY ++ Sbjct: 383 MVVLTDGENYLDGRNNPNYSHYSGYGYMRDGRLGIQTSSDSTIRNALNDRTEAACEYAKS 442 Query: 338 AGMKIYSVAVSAPPEG-QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 G+++Y++ +D++R C +F L +F+ I + + Sbjct: 443 LGIRVYTITFQVNSSSTRDMMRDCATHPTLYFDSPSDDALRSAFEMIAGDLTNLRL 498 >gi|221639828|ref|YP_002526090.1| hypothetical protein RSKD131_1729 [Rhodobacter sphaeroides KD131] gi|221160609|gb|ACM01589.1| Hypothetical Protein RSKD131_1729 [Rhodobacter sphaeroides KD131] Length = 566 Score = 113 bits (283), Expect = 4e-23, Method: Composition-based stats. Identities = 41/363 (11%), Positives = 88/363 (24%), Gaps = 81/363 (22%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 ++ + + A+D+ + R ++Q LD AVL+ ++ + + Sbjct: 30 FMLILMLMIGGLAVDVMRFEFQRARLQGTLDRAVLAAA-------SLTQSRSPAEVVEDY 82 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 K + + N ++ + A Y +PT L Sbjct: 83 VTKAGLEDYLDEPVVNANTLNV-----------------RSVTATAAYSMPTV---FMKL 122 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ-----KHNDNNNMTSNK 178 + ST S+ +++ VLD+S SM + Sbjct: 123 LDIDRLEAPAVSTAEERVSNVEISL----VLDMSNSMVTDGTNPRDRLDNLKVAARDFID 178 Query: 179 YLLPPPPKK-----SFWSKNTTKSKYAPAP------APANRKIDVLIESAGNLVNSIQKA 227 ++ + A P + Sbjct: 179 IVMAGANSGLDGAPVISISIVPYTGQVNAGADLLATYPNVSHRQPYSSCVEFAASDFTTT 238 Query: 228 IQEKK-----------------NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 + TP S++ +K+ +++L+ Sbjct: 239 ALANGATLTGSGNSELFSSSSSTQTPTYYWCPEETAAGNPTVTPFSHDPEALKAAIDRLS 298 Query: 271 PYENTNTYPAMHHAYRELYNEKE-----------------SSHNTIGSTRLKKFVIFITD 313 +T M L + S + K V+ +TD Sbjct: 299 GEGSTAIDTGMKWGVTLLDPSTQPSVAALIEDGKVNGAFAGRPLAYQSGNVMKVVVLMTD 358 Query: 314 GEN 316 G++ Sbjct: 359 GQH 361 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 20/70 (28%), Positives = 32/70 (45%), Gaps = 1/70 (1%) Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 T QIC+ R G+ +YSVA A GQ LL+ C ++G ++ ++ F I Sbjct: 497 KNERTRQICDAARAQGITVYSVAFEAEAGGQALLQYCASTTGHYY-ATVGPQIRTVFHSI 555 Query: 384 TDKIQEQSVR 393 I + + Sbjct: 556 ASHITQLRLT 565 >gi|332716075|ref|YP_004443541.1| hypothetical protein AGROH133_11102 [Agrobacterium sp. H13-3] gi|325062760|gb|ADY66450.1| hypothetical protein AGROH133_11102 [Agrobacterium sp. H13-3] Length = 429 Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats. Identities = 51/429 (11%), Positives = 127/429 (29%), Gaps = 51/429 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +TA+++ A+D+ M ++ +Q A D A K + Sbjct: 15 LTALLMVPLCGAAGVALDITRGMSVKADLQQAAD-----SAALAAVADMSASVQAAKKMS 69 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + ++++ K+ + + + L Sbjct: 70 GDGVIPVGNEEARAFFDGNQRGDADYTITSVDVSVIKHG----NVVESSVSFKASVSTTL 125 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 GL+ +++ +T + ++LD + SM + Sbjct: 126 SGLLGKDFVSVAGTATA---KYETETFSDFYLLLDNTPSMGVGATPTDVATLVANTGDKC 182 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + + +IDV+ ++ +L+++ + K + R+ Sbjct: 183 AFACHIVKDGVADPNSYYFKAKKLGVTTRIDVVAKATASLMDTAKSTR--KSSNQYRMAV 240 Query: 241 IAYNIGIVGNQCT---PLSNNLNEVKSRLNKLNP------YENTNTYPAMHHAYRELYNE 291 + + L+++L+ K + ++N N + A ++ ++ Sbjct: 241 YTFGERAEDTKLLEVVSLTSDLDAAKKKAGEINLMSIPYQGYNNDQQTDFDRALIQIGDK 300 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI----------CEYMRNAGMK 341 SS S K + F++DG T C ++ G + Sbjct: 301 VGSSGTGASSANPDKVIFFVSDGVGDSYKPSSCTKKLTGGRCQEPIDIKDCTKLKEKGFR 360 Query: 342 I---YSVAVSAPPEGQ-------------DLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 I Y+ + P ++ C G +F V+ S+ + ++ + Sbjct: 361 IAVLYTTYLPLPTNDWYNSWIKPFQAEIGSRMQSCAS-PGLYFEVSPSQGISDAMTVLFK 419 Query: 386 K-IQEQSVR 393 K I + Sbjct: 420 KAITSPRLT 428 >gi|170741048|ref|YP_001769703.1| hypothetical protein M446_2844 [Methylobacterium sp. 4-46] gi|168195322|gb|ACA17269.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 432 Score = 112 bits (279), Expect = 1e-22, Method: Composition-based stats. Identities = 50/422 (11%), Positives = 125/422 (29%), Gaps = 59/422 (13%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +++ + + ++D A + A+ + ++ + Sbjct: 32 FGLLLLPMMVAMGVSVDYARVSAA----------RSDLAAAADAAVLSVTNKAAMSLDML 81 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + LK + + +G A I++ + L Y A + Sbjct: 82 SAQARVRDAFLKNIQTMPDISGVSADAVVIDLLGVRAATLSYTASYR---------TAFS 132 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 G++ ++S + + + ++LD S SM + Sbjct: 133 GILGMRTLSVSGNAASKSAV---PIYMDFYLLLDNSPSMGVGATSADISTMVSRTPDKCA 189 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR-IGT 240 S + + +IDV+ ++ L+++ Sbjct: 190 FACHDL---SAGNSDYYHLAKSLGVTMRIDVVRQATQRLMDTAANTALVPGQFRTALYTM 246 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLN------PYENTNTYPAMHHAYRELYNEKES 294 A + +PLS++L K+ ++ P N + ++ L + + Sbjct: 247 GADCASVGLTTVSPLSSDLAAAKTNAQAIDLMTIQKPGYNNDQCTDFDGVFQSLNGKIDV 306 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL----------QICEYMRNAGMKI-- 342 + + + +K V ++DG T T C ++N G+KI Sbjct: 307 AGDGSTALTPQKVVFLVSDGVADAYYPSTCTRKTTGGRCQEPLTLANCTTLKNRGIKIAV 366 Query: 343 -YSVAVSAPPEG-------------QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 Y+ + P ++ C G +F V+ ++ + ++ + + Sbjct: 367 LYTTYLPLPTNDWYNTWIAPFQATLPSAMQGCAS-PGLYFEVSPTQGIADAMTTLFQRTV 425 Query: 389 EQ 390 Q Sbjct: 426 SQ 427 >gi|296124353|ref|YP_003632131.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296016693|gb|ADG69932.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 390 Score = 111 bits (278), Expect = 1e-22, Method: Composition-based stats. Identities = 50/391 (12%), Positives = 112/391 (28%), Gaps = 51/391 (13%) Query: 12 FITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIK--DPTTKKDQTSTIFKKQIK 69 + +D+A++ +R ++++A DA+ +G ++ + + + + Sbjct: 30 MTMFTVDVAYMQLVRTELRAATDASAKAGMEALRRTQDTEAAIDAAIATAAANKVGGRSL 89 Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 ++ N + ++ + LF + + Sbjct: 90 TLTADQIEFGLAFRNVDNSVSFNAGQLPYTAVRVNSAMTESSAAGAVPLFFGSIFGTGQF 149 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 + + + + IC +D S SM T P Sbjct: 150 EPTRSAVS------ASTEVEICFAIDRSHSMCFDLTGVDWSYPPGTPRNPDPVAFP---- 199 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT--------- 240 P P + L + V+ + + V + Sbjct: 200 -------------PHPTLSRWASLSRAMQTFVSITASQEPKPRVAMVTWASKITQSNYEG 246 Query: 241 -IAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSH 296 + PL+ NL ++ + + TN + A + L K + Sbjct: 247 KLTKTNSPEVFVDVPLTTNLADLNQAIKGRSEKVMLGATNMAAGIDEARKILNATKSTRP 306 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + +I +TDG + N L + N G+ I+SV++ P G Sbjct: 307 ------YAHRIIILMTDGLWNQGR------NPLLAAQDAANEGIVIHSVSL-LPRSGDIT 353 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + + + G + +S L +F I + Sbjct: 354 PQVSSTTGGVNYPATNSAALEAAFADIARTL 384 >gi|330862285|emb|CBX72446.1| hypothetical protein YEW_HH31780 [Yersinia enterocolitica W22703] Length = 457 Score = 111 bits (278), Expect = 2e-22, Method: Composition-based stats. Identities = 45/448 (10%), Positives = 121/448 (27%), Gaps = 72/448 (16%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 II I +++H + + ++ A++ A L+ + Sbjct: 29 FMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATLALAIENNEIPDEPQQIK----NN 84 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + +L ++ + I +N + P + L Sbjct: 85 ALVLSYVNAYLPSKKFL------------VPIININDNTHYLEYNAAVTMAYPAKFLSQS 132 Query: 122 GLIP-SALTNLSLRSTGIIERS-SENLAISICMVLDVSRSMEDLYLQK------------ 167 + N++ I ++ + + V D S SM + + Sbjct: 133 PFTNTISDMNITDNGVAIKNKAIEASEPTDVIFVADYSGSMLYNFNENKPRDHERIDALR 192 Query: 168 -------HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA--- 217 +N N P + + + P + KI + Sbjct: 193 SAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQKTYCHFPFSPKIHKPKGNYLSD 252 Query: 218 ------------------GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + +I + + + + + + + S Sbjct: 253 EIKRSSNTLLLLDYIGDIIDYDKTIDSITGNAQTIDIPMSDVRFGD-VCLQGSNAYSLEQ 311 Query: 260 NEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + + ++ + P+ T + A N+ ++ H + D + Sbjct: 312 EQYINNIDNIIEMEPHGWTLISSGILSANNIFKNKAKNGHKKLMIILSDGVDT--DDFPS 369 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD------LLRKCTDSSGQFFAV 370 S L +CE ++ +++ +A++ P+ +KC ++ Sbjct: 370 SKGIIISKMLVEKGMCEEIKENDIQMAFIAIAYSPDNNKNEPYHINWKKCV-GEDNYYEA 428 Query: 371 NDSRELL-ESFDKITDKIQEQSVRIAPN 397 +++ EL + ++ + R P Sbjct: 429 HNAHELEHKLQQAVSGSTTREVGRNIPK 456 >gi|315122479|ref|YP_004062968.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495881|gb|ADR52480.1| von Willebrand factor type A [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 427 Score = 111 bits (277), Expect = 2e-22, Method: Composition-based stats. Identities = 74/408 (18%), Positives = 160/408 (39%), Gaps = 35/408 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASI-------VSDRTIKDP 53 + ++I+ LFI I + + +N M++A +A+LSG + I + + Sbjct: 25 LFSVILISILLFIGILIYVLDYYHKKNAMENANTSAILSGASKIISRISYFGDNMSSHTH 84 Query: 54 TTKKDQTSTIFKKQIKKHLK--QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQY 111 D + K IK+ L + I+Q ++++IT++ + + + +++ Sbjct: 85 RAIVDDVTRFIKSYIKESLLMDSSVFDISEKNIISQNSKVSITREPHPNVFHEFNNQSIL 144 Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 + + + + +S+ ++ +V + ++Q D Sbjct: 145 QNKKTFYHISVETFYDYHIKFFDNLLNKKINSKIISFVPALVKIDTGEHPFFFVQLVVDL 204 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + S P F + N K+D L ++ ++S+ + + Sbjct: 205 SASMSCLMNSDPEHATEFS---------VCGKSKKNSKMDALKKAVLLFLDSVDRGSKT- 254 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL---NKLNPYENTNTYPAMHHAYREL 288 + S +V+ + +N T++ PAM AY+ L Sbjct: 255 --QKDTHYIGLTGYTTRVEKNIEPSWGTGKVRKYIVEEIDVNMLGQTDSTPAMKKAYQIL 312 Query: 289 YNEKESS--------HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 ++K+ + I +KF+IF+TDGEN+ + + T++ICE + + Sbjct: 313 TSDKKRNFIRNILHKRIKIPPLPFQKFLIFLTDGENNDPKSD---VKTIKICEKAKKNSI 369 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 KI +++++A G+ LL+KC + ++ V D+ LL F I+ I Sbjct: 370 KILTISINASANGKRLLKKCVSAPEYYYNVVDTGSLLRVFQDISTLIT 417 >gi|84386788|ref|ZP_00989813.1| hypothetical protein V12B01_19181 [Vibrio splendidus 12B01] gi|84378316|gb|EAP95174.1| hypothetical protein V12B01_19181 [Vibrio splendidus 12B01] Length = 404 Score = 111 bits (276), Expect = 2e-22, Method: Composition-based stats. Identities = 51/411 (12%), Positives = 107/411 (26%), Gaps = 49/411 (11%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 A++ F+ + L A ++ Sbjct: 24 ALLFPALFIAVG----------------------TLMVSAQVMVSNRAAQAADSAALACA 61 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN-PLQYIAESKAQYEIPTENLFLK 121 ++ + + I + L Y P F Sbjct: 62 FADTATLPMMRAYQEYYKPTLKGVSGLEPEIIGSECRISLGYSLS-------PLLPNFQY 114 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH--NDNNNMTSNKY 179 + +T ++E ++ + +VLDVS SM N T Sbjct: 115 ESYATKVTATGGGYKSVVESKQSSIPTELVLVLDVSGSMGSNIQSLKSILSNALNTIQSQ 174 Query: 180 LLPPPPKKSFWSKNTTK-SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 S S A P K I L ++ Sbjct: 175 SNNANDLDSVSISIVPFDSGVAAQRPPWLSKEAAGIYCIDGLNYRNGNFSAALTVDNLAT 234 Query: 239 GTI-----------AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + + PL++ + V++ +N L T ++ + R+ Sbjct: 235 LHSQQPVKFAKPNGWLSDCNQSSPMLPLTSVFSRVRNSINSLTANGGTRSFHGLLWGVRQ 294 Query: 288 LYNEKESSHNTIGSTRLK--KFVIFITDGENSGASAYQNTLNTLQICEYM-RNAGMKIYS 344 L + + ST + + ++ TDG + G + Q L C G+++ Sbjct: 295 LIPSWQQAWGINVSTVPETRRKLVLFTDGADEGDTFDQ--LVNAGFCTTAINQYGIEMNF 352 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + +C + + F+ ++ +L E F I ++++ Sbjct: 353 IGYGVSSSRIAQFERCAGNPSRVFSATNTTQLNEYFSDILAVEYSATIKLT 403 >gi|318604213|emb|CBY25711.1| protein TadG, associated with Flp pilus assembly [Yersinia enterocolitica subsp. palearctica Y11] Length = 457 Score = 110 bits (275), Expect = 3e-22, Method: Composition-based stats. Identities = 45/448 (10%), Positives = 121/448 (27%), Gaps = 72/448 (16%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 II I +++H + + ++ A++ A L+ + Sbjct: 29 FMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATLALTIENNEIPDEPQQIK----NN 84 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + +L ++ + I +N + P + L Sbjct: 85 ALVLSYVNAYLPSKKFL------------VPIININDNTHYLEYNAAVTMAYPAKFLSQS 132 Query: 122 GLIP-SALTNLSLRSTGIIERS-SENLAISICMVLDVSRSMEDLYLQK------------ 167 + N++ I ++ + + V D S SM + + Sbjct: 133 PFTNTISDMNITDNGVAIKNKAIEASEPTDVIFVADYSGSMLYNFNENKPRDHERIDALR 192 Query: 168 -------HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA--- 217 +N N P + + + P + KI + Sbjct: 193 SAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQKTYCHFPFSPKIHKPKGNYLSD 252 Query: 218 ------------------GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + +I + + + + + + + S Sbjct: 253 EIKRSSNTLLLLDYIGDIIDYDKTIDSITGNAQTIDIPMSDVRFGD-VCLQGSNAYSLEQ 311 Query: 260 NEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + + ++ + P+ T + A N+ ++ H + D + Sbjct: 312 EQYINNIDNIIEMEPHGWTLISSGILSANNLFKNKAKNGHKKLMIILSDGVDT--DDFPS 369 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD------LLRKCTDSSGQFFAV 370 S L +CE ++ +++ +A++ P+ +KC ++ Sbjct: 370 SKGIIISKMLVEKGMCEEIKENDIQMAFIAIAYSPDNNKNEPYHINWKKCV-GEDNYYEA 428 Query: 371 NDSRELL-ESFDKITDKIQEQSVRIAPN 397 +++ EL + ++ + R P Sbjct: 429 HNAHELEHKLQQAVSGSTTREVGRNIPK 456 >gi|332162963|ref|YP_004299540.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] gi|325667193|gb|ADZ43837.1| putative tight adherance operon protein [Yersinia enterocolitica subsp. palearctica 105.5R(r)] Length = 457 Score = 110 bits (275), Expect = 3e-22, Method: Composition-based stats. Identities = 45/448 (10%), Positives = 121/448 (27%), Gaps = 72/448 (16%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 II I +++H + + ++ A++ A L+ + Sbjct: 29 FMIIFPFFIALIFITFEISHYLQRKAKLSDAIEQATLALTIENNEIPDEPQQIK----NN 84 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + +L ++ + I +N + P + L Sbjct: 85 ALVLSYVNAYLPSKKFL------------VPIININDNTHYLEYNAAVTMAYPAKFLSQS 132 Query: 122 GLIP-SALTNLSLRSTGIIERS-SENLAISICMVLDVSRSMEDLYLQK------------ 167 + N++ I ++ + + V D S SM + + Sbjct: 133 PFTNTISDMNITDNGVAIKNKAIEASEPTDVIFVADYSGSMLYNFNENKPRDHERIDALR 192 Query: 168 -------HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA--- 217 +N N P + + + P + KI + Sbjct: 193 SAFRKLHDIIMDNSNINAIGYIPFSWGTKRIVFENQQQKTYCHFPFSPKIHKPKGNYLSD 252 Query: 218 ------------------GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + +I + + + + + + + S Sbjct: 253 EIKRSSNTLLLLDYIGDIIDYDKTIDSITGNAQTIDIPMSDVRFGD-VCLQGSNAYSLEQ 311 Query: 260 NEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + + ++ + P+ T + A N+ ++ H + D + Sbjct: 312 EQYINNIDNIIEMEPHGWTLISSGILSANNIFKNKAKNGHKKLMIILSDGVDT--DDFPS 369 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD------LLRKCTDSSGQFFAV 370 S L +CE ++ +++ +A++ P+ +KC ++ Sbjct: 370 SKGIIISKMLVEKGMCEEIKENDIQMAFIAIAYSPDNNKNEPYHINWKKCV-GEDNYYEA 428 Query: 371 NDSRELL-ESFDKITDKIQEQSVRIAPN 397 +++ EL + ++ + R P Sbjct: 429 HNAHELEHKLQQAVSGSTTREVGRNIPK 456 >gi|148253748|ref|YP_001238333.1| hypothetical protein BBta_2249 [Bradyrhizobium sp. BTAi1] gi|146405921|gb|ABQ34427.1| putative exported protein of unknown function [Bradyrhizobium sp. BTAi1] Length = 432 Score = 110 bits (274), Expect = 4e-22, Method: Composition-based stats. Identities = 50/430 (11%), Positives = 121/430 (28%), Gaps = 67/430 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + F+ AID + + + ++ ++LDAA+L+ T D T Sbjct: 25 IFALALLPILTFVGSAIDYSMAVRAKAKLSASLDAAMLAATGYTAMRGTAADAKTSAT-- 82 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 Y + + +NIT + A N Sbjct: 83 --------------NMYNGQMSSHKLTSNSLNITVTDSVT-ARTVTGTASV---VVNTAF 124 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + ++ S+ + +++D S S ++ Sbjct: 125 MYMFGFPTMTVTASSSASASF---PTYMDFYVLVDNSPSQGLGATTADMTTLQNATSDKC 181 Query: 181 LPPPPKKSFWSKNTTKSKYAPA----PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 S T + +IDV+ + +L ++ + Sbjct: 182 AFACHDTYTSSTKKTLQTNSYYQIAKNKGVTMRIDVVRSATQSLTDTATSSQVVSNQ--Y 239 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP---------YENTNTYPAMHHAYRE 287 R+ + T +++ + + S + + N + A Sbjct: 240 RMAVYSLGSDCGSLGLTTVASLSSSMSSVKSSVGALDLMTIPYSGYNNDMCTDFDGAMSG 299 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN-----------TLQICEYMR 336 + N + ST +K++ F++DG + + C+ ++ Sbjct: 300 M-NGVIPAQGDGSSTSPQKWLFFVSDGVADYSYPTTCSKTVLSGGRCQEPLNTTTCDTLK 358 Query: 337 NAGMKI---YSVAVSAPPEG-------------QDLLRKCTDSSGQFFAVNDSRELLESF 380 G+KI Y+ ++ +++ C G ++ V+ S + + Sbjct: 359 ARGIKIAVLYTTYLAITNNSWYTTYIAPWRDSISGIMKSCAS-PGYYYEVDSSGSIGAAL 417 Query: 381 DKITDKIQEQ 390 + + Sbjct: 418 TALFQQAIAS 427 >gi|327541799|gb|EGF28311.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 363 Score = 110 bits (274), Expect = 4e-22, Method: Composition-based stats. Identities = 48/386 (12%), Positives = 115/386 (29%), Gaps = 73/386 (18%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI++ + + + ++ID+A + R +++S+ DAA + ++ + Q Sbjct: 40 LIAIMMFLFLIVVAFSIDIAQMHLARTELRSSTDAAANAAATTLADTLDRNLAIQRGQQI 99 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQ--INITKDKNNPLQYIAESKAQYEIPTENL 118 + + L + D + N + N ++ + A L Sbjct: 100 AQANLVNGQPLLLADGDFQFGRSDRQVNGKYAFNAGEAPFNGVRVNGQRTAGSLSGPVPL 159 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 F + +++ +T I +V+D S SM Sbjct: 160 FFGNVTGTSIFEPEAFATATYVE------RDITLVVDRSGSMAG---------------- 197 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 + + L + + + + +I Sbjct: 198 -----------------------------SRFNDLQAAIRIFTDLLATTPVD-----EQI 223 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 G +YN + L+ N EV + +++L T+ M + Sbjct: 224 GLASYNDRASED--VQLTENFAEVNNAMDRLRTGGFTSISRGMQAGQEIALRGRPPEFVE 281 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 +I +TDG ++ + + G+ I+++ A + + Sbjct: 282 RT-------MIVMTDGRHNRG------PEPRVVATDLAADGVTIHTITFGAGADFGRMQD 328 Query: 359 KCTDSSGQFFAVNDSRELLESFDKIT 384 G+ F + +L + + +I Sbjct: 329 VARIGGGRHFHATNGDQLRDIYREIA 354 >gi|167644155|ref|YP_001681818.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] gi|167346585|gb|ABZ69320.1| Flp pilus assembly protein TadG [Caulobacter sp. K31] Length = 562 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 63/533 (11%), Positives = 132/533 (24%), Gaps = 141/533 (26%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSD-------RTIKDPT 54 A+++ + ID++ + Q+Q ALDAA L S + T Sbjct: 29 FALLLIPIAVLTFGLIDISRASVQKRQLQDALDAATLMAARSTATTNADLDTIGDAALAT 88 Query: 55 TKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIP 114 T L + + ++ K I+ N + + Sbjct: 89 EMAGLGVTFGPGNSSFVLGDNNTVVGTIQNVVIKPIISNLWSSTNTPVSATATVMRSINH 148 Query: 115 TENLFLKGLIPSA---------------------------------------------LT 129 E + S Sbjct: 149 LEVALVLDNTGSMASSLGSGGSKITALITASKSLVDVLSAAAARATEADAVKISVVPFSM 208 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN---------DNNNMTSNKYL 180 +++ ST + S +D + ++ + N ++ + Sbjct: 209 TVNIGSTYQTQTSWLTGTQPAAYGVDNFATSQNRFTLLSNLGLTWGGCVESRPAPFDVTD 268 Query: 181 LPPPPKKSFWSKNTTK-------SKYAPAPAPANRKIDVLIESAGN------LVNSIQKA 227 P P + + + + + D S L +++ Sbjct: 269 DAPNPAIGASMFVPFFAPDEPDDNTVNISASSSTKYRDARRYSTSYPITNTYLTDTVTPT 328 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLS--------------------------NNLNE 261 S R +A L+ + + Sbjct: 329 GTATNAWSTRSTVVAKYATSNKATLLSLAKTGTAYGPNAGCGMTSLMRLTNVKAKADRDT 388 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS----HNTIGSTRLKKFVIFITDGENS 317 VK +L+++ NTN + + L + T R K ++ +TDG+N+ Sbjct: 389 VKGKLDQMIASGNTNVAMGLIWGWHTLSKNAPFADGVDPATTVGKRTTKVIVLLTDGDNT 448 Query: 318 GASAYQNTLNTLQ-------------------------------------ICEYMRNAGM 340 + + C + AG+ Sbjct: 449 NDTYNNPNASIYTGYGYITQGRLLNASNSPLGATSTATNRRDAIDSREARACTNAKAAGV 508 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +IY++ V + +L+ C ++ V D+ +L F+ I IQ + Sbjct: 509 QIYAIGVGVSSHSRGILQDCASKPEMYYDVTDAAQLASVFNTIAGSIQNLRIT 561 >gi|312878233|ref|ZP_07738157.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311794982|gb|EFR11387.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 1221 Score = 109 bits (271), Expect = 1e-21, Method: Composition-based stats. Identities = 41/343 (11%), Positives = 102/343 (29%), Gaps = 40/343 (11%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 +QT T + + + I + I + + + Sbjct: 394 DEQTGTKVEVVAEGDISPFVEINSLKDEEVFSEIYGIVSTP---VDIEVYAPFKEATVFI 450 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL-------YLQKHN 169 + + N+ + + L ++ + + Sbjct: 451 PIDTSKIPNQDFQNVKMFYLDEDLMTFVPLDEQGVDPINKVVWAKTDHFTTFVLFYIPTW 510 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 ++ + + N +A + V+++ + Sbjct: 511 KAIWEVPINKGEREVNQQIKYIDLV-FVLDSSGSMSWNDPNGYRKIAAKSFVDALIQGD- 568 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 R + ++ PL+ + VK+ +++++ + TN + A +L Sbjct: 569 -------RAAVVDFDDYGYL--LQPLTTDFQTVKNAIDRIDSWGGTNIAEGIRIANHQL- 618 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + S K +I +TDGE + +N G+ IY++ + Sbjct: 619 -------ISQSSDDRIKVIILLTDGEGYYDNNLT---------TEAKNNGITIYTIGLGT 662 Query: 350 PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQS 391 ++LLR + G +F V+ + +L + F +IT+ + E Sbjct: 663 -SVDENLLRNIATQTGGMYFPVSSASQLPQVFKRITEIVTEPI 704 >gi|222529355|ref|YP_002573237.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222456202|gb|ACM60464.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 1188 Score = 108 bits (270), Expect = 1e-21, Method: Composition-based stats. Identities = 43/346 (12%), Positives = 99/346 (28%), Gaps = 46/346 (13%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 DQT T + + + I + I + + + Sbjct: 360 DDQTGTKVEIVAEGDISSFVEINNLKDEEVFSEIYGIVSTP---VDIEVYAPFKEATVFI 416 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL-------YLQKHN 169 + + N+ + + L ++ + + Sbjct: 417 PIDTSKIPNQDFQNVKMFYLDEDLMTFVPLDEQGVDPVNKVVWAKTNHFTTFVLFYIPTW 476 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTT---KSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 ++ + S P + L+ + Sbjct: 477 KAIWEVPINKGEREINQQVNYIDLVFVLDSSGSMSWNDPNGYRKIAAKSFVDALIQGDRA 536 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 A+ + N PL+ + VK+ +++++ + TN + A + Sbjct: 537 AVVDFDNFGY--------------LLQPLTTDFQAVKNAIDRIDSWGGTNIAEGIRIANQ 582 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 +L + S K +I +TDGE + +N G+ IY++ Sbjct: 583 QL--------ISRSSEDRIKVIILLTDGEGYYDNNLT---------TEAKNNGITIYTIG 625 Query: 347 VSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQS 391 + ++LLR + G +F V+ + +L + F +IT+ + E Sbjct: 626 LGT-SVDENLLRDIATQTGGMYFPVSSASQLPQVFKRITEIVTEPI 670 >gi|327541056|gb|EGF27607.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 497 Score = 108 bits (269), Expect = 1e-21, Method: Composition-based stats. Identities = 52/417 (12%), Positives = 132/417 (31%), Gaps = 44/417 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A ++ + L + I+LA + ++ ++ A DAA +G + ++T++ T Sbjct: 89 LMAFVLPMLALLAAFCINLAQMQLVKTELAIATDAAARAGGRAFSEEQTVEAAKAAARLT 148 Query: 61 STIFKKQIKKHLKQGSYIRENAG------DIAQKAQINITKDKNNPLQYIAESKAQYEIP 114 + + + + + + TK + + + + I Sbjct: 149 AAMNEVAGEPYQLNTDDSANEFEFGVSAQTDGNTGRFYFTKVPTSDVAANLVAVSSVRIN 208 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSE----NLAISICMVLDVSRSMED--LYLQKH 168 + L P + S G + + I +VLD S SM+ Sbjct: 209 GKRTDDSLLGPVPFIFPNTFSIGDFSPVASATAMQVDRDISLVLDRSGSMDWKTYDWPDD 268 Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP--APAPANRKIDVLIESAGNLVNSIQK 226 D S W + +Y + + D+ + + Sbjct: 269 ADPWGEDSLISAEDAGIVDLEWKYRNGQPQYIRRVSYNRGYDEYDLYDHAWEEVFGLGPA 328 Query: 227 AIQ----------------EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 ++ + ++ +YN + L ++ + V++ + +L Sbjct: 329 PNTPWEDLVLAVDAFLRVLDQTPQNEQVSIASYNSHGTLDCW--LLDDFDSVRAAVAQLG 386 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P +T M+ +E K ++ +TDG ++ + Sbjct: 387 PNGSTGIGNGMNSGKTAFTHENAR-------PYASKTMVVMTDGNHNYGTQPNTVAQ--- 436 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + M ++ + I +V + + + G+ + + EL+ +F++I + + Sbjct: 437 --QLMSSSNLNIQTVTFGGGADQETMQEVAVTGLGRHYHADSGDELVSAFEEIANNL 491 >gi|283852082|ref|ZP_06369356.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] gi|283572472|gb|EFC20458.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] Length = 442 Score = 108 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 57/464 (12%), Positives = 126/464 (27%), Gaps = 111/464 (23%) Query: 10 FLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIK 69 A+DL+ + NQ+Q+A+DAA L+G + D + + K + Sbjct: 1 MAAAGVAVDLSRVYVAHNQLQNAVDAAALAGSLQLPDDPDVTNGK---------VKAAVT 51 Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 +L + ++KA ++ +I T Sbjct: 52 ANLALNDPDATDIQ----------VTSGGATRSVCVDAKANVDMTL-----TKVIGIGDT 96 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + I + +VLD + SM+ + D N + Sbjct: 97 TVTAEACAGYN------DIELVLVLDSTGSMKGSPIDSAKDAARDLVNLIMPASTSSTRS 150 Query: 190 WSKNTTKSKYA----PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN- 244 P A R D + N ++ + + T + Sbjct: 151 KIGLVPFQGKVRIDGSDPVTAERNPDGVGPGCRNADGTLNTGKLKVEYSRTATSTNIFYG 210 Query: 245 ------------IGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELY 289 + LS++ N + + + +N T + + L Sbjct: 211 YTLSGVSTFTDKTCSGMSPIRALSSDKNTILNNIEAINAGAVTSGTLISEGIKWGRKVLS 270 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT--------------------- 328 E + ++K +I +TDG+ N + Sbjct: 271 PEAPYVEGSTDKK-VRKIMIVLTDGDTEDGRCGGNFASASKTVNTYWTNAYFGQGLKPDT 329 Query: 329 -------------------------------LQICEYMR---NAGMKIYSVAVS-APPEG 353 L + + N ++I+SV + Sbjct: 330 ATSPYATLSTATATLAQIPDCKDGGKLNQFVLDEADAAKNDLNYPVEIFSVRFGASDATD 389 Query: 354 QDLLRKCTD----SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + L++K ++ ++ S + + F KI ++ ++ + Sbjct: 390 KSLMQKIASSKPGTTDHYYDAPSSTGIQDMFKKIGQQLGQRLMT 433 >gi|90422080|ref|YP_530450.1| hypothetical protein RPC_0556 [Rhodopseudomonas palustris BisB18] gi|90104094|gb|ABD86131.1| conserved hypothetical protein [Rhodopseudomonas palustris BisB18] Length = 453 Score = 108 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 65/443 (14%), Positives = 129/443 (29%), Gaps = 74/443 (16%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A + + I AID A IR++MQSA DAA + G S S + + D Sbjct: 23 LFAFSLIPLLVAIGCAIDYARATQIRSKMQSAADAASV-GSVSKASPAFLAAGSMTTDGP 81 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + +G K +TK L A + Sbjct: 82 IAVGSTDATNIF--NGNMASQSGYTLSKLDAAVTKS-GATLTSTVTFSASVATTFLTIIG 138 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 K + T++S S + I ++LD S SM ++ Sbjct: 139 KTALAIGGTSVSTSSM--------PVYIDFYLLLDNSPSMGVGATPTDVATMVDNTSDKC 190 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + +K +IDVL ++ L+++ R+ Sbjct: 191 AFACHDVNDEHNYYELAK----TLGVKTRIDVLRDATQQLMDTAAATATYPNQF--RMAI 244 Query: 241 IAYNIGIVGNQCTPL-----SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE-- 293 + L S + + + L + N +Y +L + Sbjct: 245 YDFGASAQSAALRRLFALSSSLSSAKTAAGAIDLMTVKGQNDNDDRDTSYSKLLPAIDKQ 304 Query: 294 -SSHNTIGSTRLKKFVIFITDGENSGASAYQNTL----------------NTLQICEYMR 336 ++ S +K+++F++DG +A +C+ M Sbjct: 305 ITAAGAGTSDAPQKYLLFVSDGVADETNAGCAKTMKNAFWGNKSPRCQSPIDPALCKAMT 364 Query: 337 NAGMKIYSVA-----------------------------VSAPPEGQ--DLLRKCTDSSG 365 + G+K+ + P + + ++ C G Sbjct: 365 DRGVKVAVLYTTYLALPLKQANGDPSWYASWIAPFNVGPYGPSPNSEIANNMKACAS-PG 423 Query: 366 QFFAVNDSRELLESFDKITDKIQ 388 +F V+ + + ++ + I K Sbjct: 424 FYFEVSPTDGIADAMNAIFRKAV 446 >gi|146338996|ref|YP_001204044.1| hypothetical protein BRADO1945 [Bradyrhizobium sp. ORS278] gi|146191802|emb|CAL75807.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 432 Score = 108 bits (268), Expect = 2e-21, Method: Composition-based stats. Identities = 50/430 (11%), Positives = 121/430 (28%), Gaps = 67/430 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI + FI AID + + + ++ +++DAA+L+ + D T Sbjct: 25 IFAIALLPILAFIGSAIDYSMAVRAKAKLSASIDAALLAATGYTAMRGSSSDAKTAAT-- 82 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +F Q+ H + + + D + + Sbjct: 83 -NMFNGQMSAHKLTSNSLSIDITDSVSARTVTGSAT-----------------VVVKTSF 124 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + +S S+ + +++D S S + Sbjct: 125 MYMFGYPTMTVSASSSASASF---PTYMDFYVLVDNSPSQGLGATTADMTTLQNATTDTC 181 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPA----PANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 S T + +IDV+ + +L ++ + Sbjct: 182 AFACHDTYTSSSKKTLQTNSYYDKAKKLGVTMRIDVVRSATQSLTDTATSSQIVSNQ--Y 239 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP---------YENTNTYPAMHHAYRE 287 R+ + T +++ + + S + + N + Sbjct: 240 RMAVYSMGADCGSLGLTTVASLSSSMSSVKSSVGALDLMTIPYSGYNNDMCTDFDGTMSA 299 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN-----------TLQICEYMR 336 + + + ST +K++ F++DG A + T C ++ Sbjct: 300 M-SGVIPTQGDGSSTNPQKWLFFVSDGVADYAYPTTCSKTTQSGGRCVEPLTTTTCTALK 358 Query: 337 NAGMKI---YSVAVSAPPEGQ-------------DLLRKCTDSSGQFFAVNDSRELLESF 380 G+KI Y+ ++ G +++ C G ++ V+ S + + Sbjct: 359 ARGIKIAVLYTTYLAITSNGYYNTWVKPWRDSIGTIMKSCAS-PGYYYEVDSSGSIGSAL 417 Query: 381 DKITDKIQEQ 390 + + Sbjct: 418 TALFQQAIAS 427 >gi|83951473|ref|ZP_00960205.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] gi|83836479|gb|EAP75776.1| hypothetical protein ISM_12960 [Roseovarius nubinhibens ISM] Length = 550 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 47/372 (12%), Positives = 91/372 (24%), Gaps = 76/372 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M + + A+D+ R Q+QS LD+AVL+ + Sbjct: 1 MALVFFLIMIAAGGIAVDMMRYEMKRAQIQSTLDSAVLASAGAPYGS-----------DH 49 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I + + E G+I + A T + +L Sbjct: 50 RAIIEDYFRVANMTDYLAAEKEGEIVVTVN-----------SASVTANADM---TMDTYL 95 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L + ST + + + +VLDVS SM + L Sbjct: 96 MKLSGIKELRTTGGSTA----VRKVPKLEVVLVLDVSGSMGSNSKLVNLKKAAKEFVTSL 151 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPAN-----RKIDVLIESA------------------ 217 L + S + Sbjct: 152 LNGSEPGNTVISIVPFSWSVSPSVATFEALAVDRKHEFSTCIRFKANDHSHASLATGNSG 211 Query: 218 -------GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 ++ + ++ + S + P S + E+ ++++ L Sbjct: 212 FSSGQPLDQMIYTALYGNFDEFSGSESSSDYRSCYANDYMEILPFSVSETELHAKIDSLQ 271 Query: 271 PYENTNTYPAMHHAYRELYNEKES-----------------SHNTIGSTRLKKFVIFITD 313 NT+ M L + G+ K + + D Sbjct: 272 ASGNTSGNQGMIWGAALLDPSFRQITDDLIAAGEVASSQAAIPSNYGTAETLKVAVVMGD 331 Query: 314 GENSGASAYQNT 325 G+N+ + + N Sbjct: 332 GQNTTSYFFSNG 343 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 40/375 (10%), Positives = 94/375 (25%), Gaps = 45/375 (12%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A L+ S S D + Y A D + ++++ Sbjct: 203 ASLATGNSGFSSGQPLDQMIYTALYGNF--DEFSGSESSSDYRSCYANDYMEILPFSVSE 260 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + + ++ ++ L+ + ++ E +S AI Sbjct: 261 TELHAKIDSLQASGNTSGNQGMIWGAALLDPSFRQITDDLIAAGEVASSQAAIPSNYGTA 320 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + + + N + F+S + V Sbjct: 321 ETLKVAVVMGDGQNTTS---------------YFFSNGGQWRGQNSDLYEVKSQKRVFKY 365 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + N K ++ S Y L +N + N Y NT Sbjct: 366 AYRK--NKKDKISYDQSKCSNNSWECVYESSGEIESAFYLHDNYGD--------NRYYNT 415 Query: 276 NTYPAMHHA-YRELYNEKESSHNTIGST-----RLKKFVIFITDGENSGA---------S 320 + + + +L + E + + + G+ + Sbjct: 416 EEGEYLSSSDWDDLQDSDEFVSMRRLDWEEAWGYMSPYYYYQVTGDPNAYYDYYYYDRLD 475 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAP--PEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + C +N G+ ++S+ + +L+ C S +F + + Sbjct: 476 GSEKDTRMKASCTATKNEGVVVFSIGFEIDQGGTAEQVLKNCASSENHYFRAEGIN-IND 534 Query: 379 SFDKITDKIQEQSVR 393 +F I + + Sbjct: 535 AFSAIASNVVNLRLT 549 >gi|32474888|ref|NP_867882.1| hypothetical protein RB7557 [Rhodopirellula baltica SH 1] gi|32445428|emb|CAD75429.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 327 Score = 107 bits (267), Expect = 3e-21, Method: Composition-based stats. Identities = 47/386 (12%), Positives = 114/386 (29%), Gaps = 73/386 (18%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI++ + + + ++ID+A + R +++S+ DAA + ++ + Q Sbjct: 4 LIAIMMFLFLIVVAFSIDIAQMHLARTELRSSTDAAANAAATTLADTLDRNLAIQRGQQI 63 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQ--INITKDKNNPLQYIAESKAQYEIPTENL 118 + + L + D + N + N ++ + L Sbjct: 64 AQANLVNGQPLLLADGDFQFGRSDRQVNGKYAFNAGEAPFNGVRVNGQRTTGSLSGPVPL 123 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 F + +++ +T I +V+D S SM Sbjct: 124 FFGNVTGTSIFEPEAFATATYVE------RDITLVVDRSGSMAG---------------- 161 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 + + L + + + + +I Sbjct: 162 -----------------------------SRFNDLQAAIRIFTDLLATTPVD-----EQI 187 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 G +YN + L+ N EV + +++L T+ M + Sbjct: 188 GLASYNDRASED--VQLTENFAEVNNAMDRLRTGGFTSISRGMQAGQEIALRGRPPEFVE 245 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 +I +TDG ++ + + G+ I+++ A + + Sbjct: 246 RT-------MIVMTDGRHNRG------PEPRVVATDLAADGVTIHTITFGAGADFGRMQD 292 Query: 359 KCTDSSGQFFAVNDSRELLESFDKIT 384 G+ F + +L + + +I Sbjct: 293 VARIGGGRHFHATNGDQLRDIYREIA 318 >gi|114571147|ref|YP_757827.1| hypothetical protein Mmar10_2603 [Maricaulis maris MCS10] gi|114341609|gb|ABI66889.1| conserved hypothetical protein [Maricaulis maris MCS10] Length = 520 Score = 107 bits (266), Expect = 3e-21, Method: Composition-based stats. Identities = 49/495 (9%), Positives = 124/495 (25%), Gaps = 103/495 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + L A+DL+ M R+++ ALDAA L+ + + Sbjct: 24 IFALTLVPVALLSGGAVDLSQSMNARSRLAQALDAAALAVGVNTNLSSSEATGIANDFIA 83 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA----------ESKAQ 110 + +++ YI + + + + + ++ + Sbjct: 84 ANYPGRELGVVQNVNVYIDDETDTVTVSGEARVRTTMLGMIGLDYITVHWESEVQRARQR 143 Query: 111 YEIPTENLFLKGLIPSALTNLSLRST---------GIIERSSENLAISICMVLDVSRSME 161 E+ + S + NL + + + ++V + Sbjct: 144 LELVMVLDNTGSMGGSKIRNLRESAELLTGILFDAADDPSDVKIGLVPFAATVNVGTNHA 203 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS--------KYAPAPAPANRKIDVL 213 + + + + + P ++ + + + Sbjct: 204 RAWWMDPDALSPVHAEWAGGNPVEIETCSGRGRGRRRRCQTEEIWVNHWDLFDQLRNTGW 263 Query: 214 IESAGNL-----VNSIQKAIQEKKNLSVRIGTIAYNI----------------------- 245 ++ +I L V Sbjct: 264 EGCVEARPIPMDIDDTPPSIGNPSTLFVPYFAPDEPDNGSYSNSYLSDGVSGGVSERLQA 323 Query: 246 ----------------GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 TPL++ V + + + TN + R + Sbjct: 324 LDKYDNGRPNREGPNRSCTTTPVTPLTSTERTVLNAIGDMGASGTTNIPNGVGWGIRLIS 383 Query: 290 NEKE-SSHNTIGSTRLKKFVIFITDGENSGASAYQN------------------------ 324 + + K ++ +TDG+N + Sbjct: 384 PGAPFTEGSAWDDDEYIKAMVILTDGDNVMRGRNTDQMSDYEAYGFVADGRLGRRSSSSN 443 Query: 325 ------TLNTLQICEYMRNAGMKIYSVAVSAPPEG-QDLLRKCTDSSGQFFAVNDSRELL 377 T C Y R+ G+++Y++ + L++ C + +F S L Sbjct: 444 VLSNELDDRTEAACAYARSLGIRVYTITFQVNSSSTRSLMQNCASNPSLYFDSPSSEALE 503 Query: 378 ESFDKITDKIQEQSV 392 ++F+ I + + Sbjct: 504 DAFEMIAGDLTNLRL 518 >gi|170746808|ref|YP_001753068.1| hypothetical protein Mrad2831_0362 [Methylobacterium radiotolerans JCM 2831] gi|170653330|gb|ACB22385.1| conserved hypothetical protein; putative vWFA domain protein [Methylobacterium radiotolerans JCM 2831] Length = 437 Score = 107 bits (266), Expect = 3e-21, Method: Composition-based stats. Identities = 59/422 (13%), Positives = 122/422 (28%), Gaps = 63/422 (14%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A++ A+D + Q+ +ALD AVL+ + + + Q Sbjct: 31 FALVTLPVMFATAAAVDYGRRNAAKTQLDAALDGAVLAVMSQKTNTIPTTTLQNMETQFR 90 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 T K G + ++ + + +IP Sbjct: 91 TEAAKV------PGVTVTSFTPGAPVNTSKTLSLTASYTATVKTSLASMMQIPAMP---- 140 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN-KYL 180 S + + I+ ++LD S SM N + +N Sbjct: 141 ------------VSGTSSATRNTSQYINYYLLLDNSPSMGLAATDADVQNMKIATNGCAF 188 Query: 181 LPPPPKKSFWSKNT----TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 T + + +I VL E+ LV+ ++ + Sbjct: 189 ACHQHTFDKKGNITGDDQNDNYHIALRNNIKLRIQVLREAVSALVDQANVSMLLPQQF-- 246 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTY-PAMHHAYRELYNEK 292 ++ +N + + ++ LN +K+ ++ Y N + A + Sbjct: 247 QMEMWTFNDSVTQTKLQAMTPTLNNIKNAAPNIDIAYAYYNQSDNQTDFERAIARMNTTI 306 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI-------------CEYMRNAG 339 +S + + + +F+ +TDG + N QI C ++N Sbjct: 307 PASGDGLTPDKPIRFLFLVTDGVEDTGGSVTNQSAGFQIQSNRFIGPLSPSTCSALKNKN 366 Query: 340 MKIYSVAVSA-PPEGQDL---------------LRKCTDSSGQFFAVNDSRELLESFDKI 383 +KI + P D L+ C G +F V + ++ + K+ Sbjct: 367 VKIGIIYTQYLPIYDNDFYNRYVRPYESQIGPSLQACAS-DGMYFPVTTNGDITAAMLKL 425 Query: 384 TD 385 Sbjct: 426 FS 427 >gi|163761157|ref|ZP_02168234.1| hypothetical protein HPDFL43_13595 [Hoeflea phototrophica DFL-43] gi|162281708|gb|EDQ32002.1| hypothetical protein HPDFL43_13595 [Hoeflea phototrophica DFL-43] Length = 444 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 54/434 (12%), Positives = 130/434 (29%), Gaps = 53/434 (12%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + +++ A+D ++ + ++ Q +DA VL I+ + + Sbjct: 20 LAGLVMVALVWVAGLAVDFSNALRVKTTAQDIVDATVLRATRDIIEEG------KTLAEA 73 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK---DKNNPLQYIAESKAQYEIPTEN 117 +K L S + ++ + ++ + EIP Sbjct: 74 ELSARKYFDAELAFSSGVGLEVSTFTLTQGVDGIVKLGVSGKTSTSLLKAVGREEIPVSV 133 Query: 118 LFLKGLIPSA-------LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 + + S+ ++ +A ++ + S SM ++ + Sbjct: 134 DAAAHVGGGSVEIAIAFDVTNSMGFGTTWGEATSVIASALNALKANSGSMALTFIPFTDR 193 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI-- 228 N LL P + K + K E+ + +S + Sbjct: 194 VNVGMGRANLLNPGDQ-----TAVKKGGWGGCVDVRATKKKNKGETEYFMPDSAPEKGDR 248 Query: 229 -QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + N + Y + ++N+++V S+L KL + Sbjct: 249 FTKFDNGTPAAHKSGYKLACNPQSIIGPTSNVSDVTSQLGKLTKGGTGRFDLGFAWLWYA 308 Query: 288 LYNEKESSHNTI--------------GSTRLKKFVIFITDGENSGASAYQ---------- 323 L + + ST +K + TDG + Sbjct: 309 LSPNWKGFWSGGAPADNGVNLADYPTASTNTRKIAVLATDGLTNAYVYEYGKTNLAGWNT 368 Query: 324 ----NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + N + IC+ M ++++ + V+ + + R+C + G ++ V + L++ Sbjct: 369 GSKDHFENVVAICKSMAAQKIEVHVMHVNGNDKAEPYFRECASATGGGYYKVASKQTLVD 428 Query: 379 SFDKITDKIQEQSV 392 + IT+ + Sbjct: 429 ALTGITNGGGNLRL 442 >gi|85705211|ref|ZP_01036310.1| hypothetical protein ROS217_17122 [Roseovarius sp. 217] gi|85670084|gb|EAQ24946.1| hypothetical protein ROS217_17122 [Roseovarius sp. 217] Length = 580 Score = 106 bits (265), Expect = 5e-21, Method: Composition-based stats. Identities = 50/362 (13%), Positives = 94/362 (25%), Gaps = 71/362 (19%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 I + + +D R +Q+ LD AVL+G + TI Sbjct: 33 FIFVMFLMMGGIGLDTMRQEMARASLQATLDRAVLAGATASTEAG-----------ARTI 81 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 + K + + + GDI+ + A E+ + +L L Sbjct: 82 VEDYFAKSGQSDYLLAQKDGDISTTLNAA-------------KVTAGAELSLD-TYLMKL 127 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 + S +T + + +VLDVS SM ++ ++ Sbjct: 128 AGVPTLSASGTATAEVRI----PKLEAILVLDVSGSMASNSKIQNLQTAAKDFVTTVMNS 183 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANR-----KIDVLIESAGNLVNSIQKAIQE-------- 230 S P + N Q A Sbjct: 184 SKPGDTVMSIVPFSFSVTPPQSVFDALAVEETHNYSTCLEFKENDYQHATLSSGSSSLSS 243 Query: 231 --KKNLSVRIGT----------IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 N V + P S ++ ++ ++++ L P NT+ Sbjct: 244 GIPVNQMVYTSVYGDFDNLDSGWRSCYTDEYIRILPYSTSITDLHAKIDALQPAGNTSGN 303 Query: 279 PAMHHAYRELYNEKE-----------------SSHNTIGSTRLKKFVIFITDGENSGASA 321 M+ L + + K +IF+ DG N+ + Sbjct: 304 EGMNWGAALLDPTFREVTASMIAAGHLSETLANVPSDYDEPETLKAIIFMGDGANTTSYF 363 Query: 322 YQ 323 + Sbjct: 364 FD 365 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 23/207 (11%), Positives = 57/207 (27%), Gaps = 18/207 (8%) Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 ++ N + + ++ V + + E +R + Sbjct: 389 KYAYNIYNVDWKKYGDDGKSRC---SQNRWECVYDVAENSPEYSVYYLRNPDTGKFWSVA 445 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 + +N N +S ++ T M +++ N Sbjct: 446 EEKWIE-ANTFNNFESTMDGFISR--TQLDWEMAWGLMSPEYYGQTTGNWGPWNDYIGSE 502 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 S +C+ + G+ +YS+ P G ++ L C S Sbjct: 503 FV---------SGSMKNGLMQNVCKATKTEGVVVYSIGFEVPVNGTAENQLSACASSPAH 553 Query: 367 FFAVNDSRELLESFDKITDKIQEQSVR 393 +F ++ +F I +++ + Sbjct: 554 YFRA-SGTDIKSAFSAIAANVKQLRLT 579 >gi|312622403|ref|YP_004024016.1| von willebrand factor type a [Caldicellulosiruptor kronotskyensis 2002] gi|312202870|gb|ADQ46197.1| von Willebrand factor type A [Caldicellulosiruptor kronotskyensis 2002] Length = 1166 Score = 106 bits (264), Expect = 6e-21, Method: Composition-based stats. Identities = 42/346 (12%), Positives = 100/346 (28%), Gaps = 46/346 (13%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 DQT T + + + I + I + + + Sbjct: 360 DDQTGTKVEIVAEGDISSFVEINNLKDEEVFSEIYGIVSTP---VDIEVYAPFKEATVFI 416 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL-------YLQKHN 169 + + N+ + + L ++ + + Sbjct: 417 PIDTSKIPNQDFQNVKMFYLDEDLMTFVPLDEQGVDPVNKVVWAKTNHFTTFVLFYIPTW 476 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTT---KSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 ++ + S P + L+ + Sbjct: 477 KAIWEVPINKGEREINQQINYIDLVFVLDSSGSMSWNDPNGYRKIAAKSFVDALIQGDRA 536 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 A+ + + PL+ + VK+ +++++ + TN + A + Sbjct: 537 AVVDFDDFGY--------------LLQPLTTDFQAVKNAIDRIDSWGGTNIAEGIRIANQ 582 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 +L ++ S K +I +TDGE + +N G+ IY++ Sbjct: 583 QL--------ISLSSEDRIKVIILLTDGEGYYDNNLT---------TEAKNNGITIYTIG 625 Query: 347 VSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQS 391 + ++LLR + G +F V+ + +L + F +IT+ + E Sbjct: 626 LGT-SVDENLLRDIATQTGGMYFPVSSASQLPQVFKRITEIVTEPI 670 >gi|238762416|ref|ZP_04623387.1| tight adherance operon protein [Yersinia kristensenii ATCC 33638] gi|238699401|gb|EEP92147.1| tight adherance operon protein [Yersinia kristensenii ATCC 33638] Length = 459 Score = 106 bits (264), Expect = 6e-21, Method: Composition-based stats. Identities = 46/439 (10%), Positives = 110/439 (25%), Gaps = 69/439 (15%) Query: 11 LFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKK 70 ++++ + + ++ A++ A L+ T + Sbjct: 37 ALTFIILEVSIFLQKKAKLSDAIEQATLALTVENDGIPNAAQQTK----NRELVLSYANA 92 Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIP-SALT 129 +L + I +N + P E L L + Sbjct: 93 YLPSEGFSD------------PIINIDDNTNYLGYNAAVTMTYPVEFLGRSPLTNSISNI 140 Query: 130 NLSLRSTGIIERS-SENLAISICMVLDVSRSMEDLYLQK--------------------H 168 + I ++ + + V D S SM + + Sbjct: 141 QTTDNGEAIKNKTIEVSEPTDVVFVADYSGSMLLSFSDDVSIKNGERINALRSAFRILHN 200 Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID------------VLIES 216 NN N P + + P + KI + Sbjct: 201 TIKNNSNVNTIGFIPFGSGTKRKVSENGENKEYCHLPFSPKIYKPNGDYLSENAEATKNA 260 Query: 217 AGNLVNSIQKAIQEKK----NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR------- 265 L +K +V+ I C +N+ + + + Sbjct: 261 WTFLDVIGDHIDYKKTIMSITENVQPIDIPMRDIKHKEICLSGTNSYSLEREQFDYSIEN 320 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + ++ P T + A ++ H + + + + T Sbjct: 321 IIEMAPLGGTLISSGILSANNIFKETADNGHKKLMIILSDGMDSYNSTMLPNKGFFISKT 380 Query: 326 LNTLQICEYMRNAGMKIYSVAV------SAPPEGQDLLRKCTDSSGQFFAVNDSRELL-E 378 L +CE + G+++ +A+ + ++C ++ +++ EL E Sbjct: 381 LIDEGMCEMIIKNGIQMAFIAIAYSPENNVNAPEYINWKQCV-GEDNYYEAHNAHELELE 439 Query: 379 SFDKITDKIQEQSVRIAPN 397 ++ + R P Sbjct: 440 LQQAVSVSATSEVGRNTPK 458 >gi|86144309|ref|ZP_01062641.1| Flp pilus assembly protein TadG [Vibrio sp. MED222] gi|218676258|ref|YP_002395077.1| putative Flp pilus assembly protein TadG [Vibrio splendidus LGP32] gi|85837208|gb|EAQ55320.1| Flp pilus assembly protein TadG [Vibrio sp. MED222] gi|218324526|emb|CAV26007.1| putative Flp pilus assembly protein TadG [Vibrio splendidus LGP32] Length = 438 Score = 105 bits (262), Expect = 1e-20, Method: Composition-based stats. Identities = 47/447 (10%), Positives = 125/447 (27%), Gaps = 77/447 (17%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + ++ +F+ +++ ++ M +++ A + A L+ AS D Sbjct: 14 LFVGLLPAMVIFMAFSMQMSQQMLAHSRLLEAAEVASLALIASPKEDEDKN-----VKYA 68 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + I + + + + + + ++ + A+Y Sbjct: 69 RYLVDRYILDNSEDVDVAVFTRKCEYKDGCVQASGELAPFSDFVVSATAKYTSWI----- 123 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND------NNNM 174 L S + R ++ + + D S SM + + Sbjct: 124 -SYEDVDLEPEFTVSGRAVTRKYLPQSVDVYFIGDFSGSMGNSWKNGKMKLDVVKETIKR 182 Query: 175 TSNKYLLPPPPKKSFWSKNTTK-------SKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 + +KS + K A + +S Sbjct: 183 VVDDIEKFNTEEKSRVALLGYNPFHVKQTDKTVRVNAYGYYGSWRKKYAYNYARSSPGTT 242 Query: 228 IQ-------------EKKNLSVRIGTIAYNI------GIVGNQCTPLSNNLNEVKSRLNK 268 ++ E K R + PL+ + +E +S+L Sbjct: 243 VRRMFDKPKLYNEILEPKRGMSRYEVERLHTHNVNFAKYYKFYDIPLTEDYDEFRSQLMN 302 Query: 269 --LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 L T+++ + A +E + ++ I ++DG++ + Q + Sbjct: 303 TKLQAGGGTSSWNGIIAAAQEANKA--------TNLNPEQVFIVLSDGQDGDKNYLQKLV 354 Query: 327 NTLQICEYMR-------------------NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 + +C+ +R + + + ++ D C Sbjct: 355 D-QGLCKKLRSTISAKRNRFQSNSPTEAEKTKVTMGVIGINYKVNESDGFGDC-FGKKNI 412 Query: 368 FAVNDSRELLESFDKITDKIQEQSVRI 394 + D + + I + I E++ ++ Sbjct: 413 YHAKDGE---DVYKYILNLINEETGKL 436 >gi|116753518|ref|YP_842636.1| von Willebrand factor, type A [Methanosaeta thermophila PT] gi|116664969|gb|ABK13996.1| von Willebrand factor, type A [Methanosaeta thermophila PT] Length = 795 Score = 105 bits (261), Expect = 1e-20, Method: Composition-based stats. Identities = 30/227 (13%), Positives = 76/227 (33%), Gaps = 21/227 (9%) Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 + +++ L + S + + D+ +A V + Sbjct: 39 DVISPSEISTVTITLRGGEIPCASPVDVVLSIDSSGSMTTSDPGDLRKSAAKEFVTGLDL 98 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 ++ +S I++ PL+NN +++S ++ NT + A Sbjct: 99 SMDRVGVVSWNTSAISW----------PLTNNTKDIESAIDSTGADGNTCLDTGLKSAID 148 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 L K ++ +TDG ++ Y + R+ G+ ++++ Sbjct: 149 LLSE-----------CSGSKVIVLLTDGISTDGGHYTPPGVPGSPVDEARSKGILVFTIG 197 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + ++L + G+F++ D+ L + +I I + Sbjct: 198 LGPDADARNLTEIAHSTGGEFYSAPDANALAGIYKRIRSSITGIVAK 244 >gi|254420933|ref|ZP_05034657.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] gi|196187110|gb|EDX82086.1| hypothetical protein BBAL3_3243 [Brevundimonas sp. BAL3] Length = 646 Score = 105 bits (261), Expect = 1e-20, Method: Composition-based stats. Identities = 48/428 (11%), Positives = 102/428 (23%), Gaps = 50/428 (11%) Query: 15 YAIDLAHIMYIRNQ------MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQI 68 +++ R + QS A L+G ++ + I Sbjct: 217 MGVNVGDTYASRARGSLDSNTQSITAATWLTGSVKTITSISRAYTAVVTASKHGFKTGDI 276 Query: 69 KKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSAL 128 + N + + T + + + I L Sbjct: 277 VTIWSAETMAPLNGVALTVGSVTTNTFSLVGEDSRYYSAFSGQAYVAKCARTDCNIVITL 336 Query: 129 TNLSLRSTGIIERSSENLAISIC----MVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 L S G +S + + N T+ K Sbjct: 337 ARHGLSSEGDAAVLGNMGGLSQLNNIGFRVASVTPTTATLALDASQANLATTAKGGAAYT 396 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI-AY 243 S A S S + + Sbjct: 397 SGGQLICGVDGCSNRDFVNAIGAWTRFPGTPCVSERAGSQAYTDAAPSASSWVGRSYASG 456 Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE-----SSHNT 298 +Q PL+N + ++ + +T + + + + + Sbjct: 457 GNACPASQIVPLTNVKKTLTDAVDGMTAVGSTAGHIGLAWGWYLVSPNFGLWSGLGAPAA 516 Query: 299 IGSTRLKKFVIFITDGENSGASA-------------------------YQNTLNTLQICE 333 S++ K V+ +TDGE + + ++CE Sbjct: 517 YDSSKTLKAVVLMTDGEFNTPYFRGVIASDAGNGSGGADTHINQPATNGSSFEQAYRLCE 576 Query: 334 YMRNAGMKIYSVAVSAPPEGQ---------DLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 M+ A + +Y+V +L+ +C + + F + S +L ++F I Sbjct: 577 NMKAADVIVYTVGFDIGAARNMTGPIDSAGELMARCATNPDRAFQASSSTDLSDAFRDIG 636 Query: 385 DKIQEQSV 392 I + Sbjct: 637 RDITRLRI 644 >gi|86361153|ref|YP_473040.1| hypothetical protein RHE_PF00423 [Rhizobium etli CFN 42] gi|86285255|gb|ABC94313.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 545 Score = 105 bits (261), Expect = 1e-20, Method: Composition-based stats. Identities = 45/327 (13%), Positives = 100/327 (30%), Gaps = 32/327 (9%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+ + + + + D +R +MQS LDAA+++ I + K++ S Sbjct: 122 VALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAVKQINNTEDTD---ALKEKVS 178 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 F Q+ G + + + + A +PT + Sbjct: 179 DWFHAQVDNSYTLGD-------------------IDIDTVNHNITATANGTVPTTFM--- 216 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 A S + +++ +V+D S SM Sbjct: 217 ---KIANIETVPVSVASAVKGPATSYLNVYVVIDTSPSMLLAATTSGQSTMYSGIGCQFA 273 Query: 182 PPPPKKSFWSKNTTKSKYAPAPA-PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 K + YA + A + DV ++ +++ I ++ ++ + +++G Sbjct: 274 CHTGDAHTVGKTKYANNYAYSTAKKIKLRADVAGDAVREVLDMIDES--DENHERIKVGL 331 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN-TYPAMHHAYRELYNEKESSHNTI 299 + + LS + + T+ + L + + + Sbjct: 332 YSLGDTLSEVLAPTLSTDTARTRLADASYGLTSATSKAATYFDVSLATLKQKVGAGGDGT 391 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTL 326 S K V+ +TDG S + + Sbjct: 392 SSGSPLKLVLLLTDGVQSKREWVTDGV 418 >gi|323138519|ref|ZP_08073587.1| hypothetical protein Met49242DRAFT_2975 [Methylocystis sp. ATCC 49242] gi|322396153|gb|EFX98686.1| hypothetical protein Met49242DRAFT_2975 [Methylocystis sp. ATCC 49242] Length = 458 Score = 105 bits (261), Expect = 2e-20, Method: Composition-based stats. Identities = 57/433 (13%), Positives = 136/433 (31%), Gaps = 55/433 (12%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + F+ + A+D + +R+++ D A L+ + + + Sbjct: 33 IFGLALIPMFMMMGAAVDYTQAVTVRSRLNHLADRAALAAVKAAAQKESDCVANPAGNNV 92 Query: 61 STI----FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 S K IK + G + + + ++ + A +IPT Sbjct: 93 SNFQGCGQKDIIKAGVAAGVQYMNGDPLMRGADRKPTIELSSSEGSWSATVNYSADIPTN 152 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + L+ ++ + T I ++ ++ ++LD S SM + Sbjct: 153 ---IARLMGVQTIPVNGKVTSNIALG-THMYLNFHLLLDRSMSMGIGATSDDISRLQALT 208 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 + P +ID L ++ G LV + + Sbjct: 209 GCAFACHSEG------YEAQYYDQPKAQGIRFRIDDLRDATGALVAQAKMVASANAREHI 262 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 ++G A+N + ++++L V + + L+ + + A + N+ + + Sbjct: 263 QMGVYAFNH--HVSPLVEMTSDLTNVANAVKNLDLPTHDDGTQAADAVTWLVANKIKGNG 320 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLN----------------------TLQICEY 334 + S + V +TDG G N + + C+ Sbjct: 321 TGLTSAAPLEIVFLVTDGVEDGIYTGWNKMVGPTGLPLPWWPSWMTKAPTSAFPVTACDA 380 Query: 335 MRNAG---MKIYSVAVSAPPEGQ-------------DLLRKCTDSSGQFFAVNDSRELLE 378 +++ G +Y+ V P Q L+ C G FF ++ ++ Sbjct: 381 LKSKGAIVAVVYTTYVPFPGTVQYDRLIGPFAPNISPNLQGCASQ-GYFFTASEPGDITR 439 Query: 379 SFDKITDKIQEQS 391 + ++ ++ Sbjct: 440 GMQSLFNRALQEL 452 >gi|302382135|ref|YP_003817958.1| von Willebrand factor A [Brevundimonas subvibrioides ATCC 15264] gi|302192763|gb|ADL00335.1| von Willebrand factor type A [Brevundimonas subvibrioides ATCC 15264] Length = 560 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 68/530 (12%), Positives = 133/530 (25%), Gaps = 138/530 (26%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQ---------------------SALDAAVLS 39 M A+ + L +D+A + +R +Q +A+ A L Sbjct: 29 MFALALPPMMLMTLGGVDIARVSTVRMNVQDALDAATLAAARSQYTDNPRINAVGLAALQ 88 Query: 40 GCASIVSDRTIKDPTT--------------KKDQTSTIFKK--------QIKKHLKQGSY 77 + D T+ T K + L ++ Sbjct: 89 ANLAPYGDVTLDTTQTNFRLNTATGAVEADAKVNVRALVANIFLPPYGQFFDDQLPANAH 148 Query: 78 IRENAGD--IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 + I I+ T + ++ A I I +SL Sbjct: 149 SEVLRSNNRIEVALVIDNTGSMDGAKLTNTKTAAIDLINRLEAADGRSIEQDAIKISLVP 208 Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQ----------------------------- 166 + R ++ + + + + Sbjct: 209 FSMTVRVAQGGTNTPPSFMSNADTHTGGGAWNSGSNPYSAFDTAVGRFTLFGRLNTTWGG 268 Query: 167 ---------KHNDNN------NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 D Y P P ++ + ++T + Sbjct: 269 CVESRPQPYDIRDTAPSSGTQATMFVPYFAPDEPDRADYPNHSTWQNWQYEGNDYLDDGR 328 Query: 212 VLIESAGNLVNSIQKAI-----------QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + N+ + + N G L++N Sbjct: 329 PGSNANSPFANTAARTTEWFARVRSVSRYSTTPRNTLNTGFGPNRGCDLQPIIRLTDNYT 388 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE-SSHNTIGSTRLKKFVIFITDGEN--- 316 +++ +N + NTN + L G+ RLKK +I +TDG N Sbjct: 389 ALRTAVNNMIASGNTNVPLGTMWGWHTLSPNAPFGDGRPYGTERLKKIIIIMTDGANVMS 448 Query: 317 ----------------------------------SGASAYQNTLNTLQICEYMRNAGMKI 342 T T +C M++ +++ Sbjct: 449 DTTSPNDSTYNGLGYIWQNRLGIVSGNDTTRRTRMDNRFDHATAATEDMCGNMKDKDIEV 508 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 Y+VAV Q LLR+C + +F V+ + + +FD+I I+ + Sbjct: 509 YTVAVQVDSTAQTLLRRCATDTDHYFPVDSAAGIGAAFDRIAGAIENLRI 558 >gi|83312851|ref|YP_423115.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] gi|82947692|dbj|BAE52556.1| Flp pilus assembly protein TadG [Magnetospirillum magneticum AMB-1] Length = 464 Score = 104 bits (260), Expect = 2e-20, Method: Composition-based stats. Identities = 58/473 (12%), Positives = 120/473 (25%), Gaps = 111/473 (23%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI + I +D+A ++++M A + + + Q Sbjct: 20 ILAIGLLPIITTIGLGVDVARAYAVKSRMS----------AALDAAALAVGSSSGTDAQL 69 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S + +K + G + + ++ + + + + Sbjct: 70 SAVAQKFFDANYPTG----------------ALGAHPSVAVKVTGDVISASAVAEVDTVF 113 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN------- 173 ++ + + + + + MVLD + SM + + Sbjct: 114 MKVVGLNDVPV----HADSTVNRQIAGLELAMVLDNTGSMTTNNNIQAVRDAANQLTDIL 169 Query: 174 -----------MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + Y S TT YAP + V A + + Sbjct: 170 FGTATVHPYLKIALVPYSAAVNVGSVAPSLITTGDTYAPNDLLGWKGCVVERAGANGVGD 229 Query: 223 SIQKAIQEKKNLSVRIGTIAY--------------------NIGIVGNQCTPLSNNLNEV 262 + + + Y TPL+N + Sbjct: 230 TSAATAPWTRYKWLPAVDNNYDATKSSTVLANPSNGNASTGPNLGCPTAITPLTNVKATL 289 Query: 263 KSRLNKLNP--YENTNTYPAMHHAYRELYNEKE-SSHNTIGSTRLKKFVIFITDGENSGA 319 +N + T + M R L E + G+ + K VI +TDG+N Sbjct: 290 TPAINAMEAWSRGGTLSDVGMAWGLRVLSPEPPFTEGLPWGTPKWSKAVILMTDGDNQFY 349 Query: 320 SAYQNTLNTL--------------------------------------QICEYMRNAGMK 341 T +C M+ + Sbjct: 350 KLTSTTGGNKVNSAVNSDYGAYGRLDELGRIGTTNATTAKTTINTRLTSVCNAMKAKNII 409 Query: 342 IYSVAV--SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 +Y+V +D+ + C + ++F EL +F I + V Sbjct: 410 VYTVTFTSGINQATKDIYKACATDASKYFDSPSQDELKSAFRAIATSLSNLRV 462 >gi|329848392|ref|ZP_08263420.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328843455|gb|EGF93024.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 434 Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats. Identities = 68/431 (15%), Positives = 146/431 (33%), Gaps = 59/431 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVL-SGCASIVSDRTIKDPTTKKDQ 59 + + + V FL I A+D + +M ++ ++Q A D A + S + + + + Sbjct: 19 IIGLALPVVFLAIGGAVDFSRVMQLKKELQDAADVASVGSVAVNSYAYKANTKGHSSFKT 78 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 ++K+ + + K Q + A Y + Sbjct: 79 GENQALAIFNSNVKKHNDLNNIKVKAKIKKQSTNLVSEIG-------VTADYRP-----Y 126 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 L GL+ ++++ST S+ I ++LD S SM K D ++ Sbjct: 127 LLGLMGMNTMPITIKSTS---SSTFPPYIDFYLLLDNSPSMGVGATTKDIDTMVANTSDK 183 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 + + +IDV+ ++ NL+ + + + I Sbjct: 184 CAFACHQMDKAGNDYYALAK---KLKVTTRIDVVRQATQNLMTTAKNTQTLTDQYRMAIY 240 Query: 240 TIAYNIG------IVGNQCTPLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYREL- 288 + + L+ NL+ S K++ PY+N N+ + L Sbjct: 241 HFGMAADQIDSKNPAPYEVSALTTNLSTSASNAAKIDLMTIPYQNYNSDRQTNFPSYLLG 300 Query: 289 -YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ--------ICEYMRNAG 339 SS + S++ ++ + F++DG N G + + + C+ M+ G Sbjct: 301 MNKVIPSSGDGSSSSKPQQVLFFVSDGANDGYDCAYSNGASCRRISPLDTPQCKAMKARG 360 Query: 340 MKI---YSVAVSAPPEG----------------QDLLRKCTDSSGQFFAVNDSRELLESF 380 +KI Y+ + P +++C + G +F V + + E+ Sbjct: 361 VKIAVLYTTYLPLPTNAFYNSHLAKYVSPTSQLAAKMQECA-TEGLYFEVGPNEGISEAM 419 Query: 381 DKITDKIQEQS 391 + + K+ Sbjct: 420 NALFAKVISTV 430 >gi|32472883|ref|NP_865877.1| signal peptide [Rhodopirellula baltica SH 1] gi|32444120|emb|CAD73562.1| hypothetical protein-signal peptide and transmembrane prediction [Rhodopirellula baltica SH 1] Length = 434 Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats. Identities = 52/417 (12%), Positives = 132/417 (31%), Gaps = 44/417 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A ++ + L + I+LA + ++ ++ A DAA +G + ++T++ T Sbjct: 26 LMAFVLPMLALLAAFCINLAQMQLVKTELAIATDAAARAGGRAFSEEQTVEAAKAAARLT 85 Query: 61 STIFKKQIKKHLKQGSYIRENAG------DIAQKAQINITKDKNNPLQYIAESKAQYEIP 114 + + + + + + TK + + + + I Sbjct: 86 AAMNEVAGEPYQLNTDDSANEFEFGVSAQTDGNTGRFYFTKVPTSDVAANLVAVSSVRIN 145 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSE----NLAISICMVLDVSRSMED--LYLQKH 168 + L P + S G + + I +VLD S SM+ Sbjct: 146 GKRTDDSLLGPVPFIFPNTFSIGDFSPVASATAMQVDRDISLVLDRSGSMDWKTYDWPDD 205 Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP--APAPANRKIDVLIESAGNLVNSIQK 226 D S W + +Y + + D+ + + Sbjct: 206 ADPWGEDSLISAEDAGIVDLEWKYRNGQPQYIRRVSYNRGYDEYDLYDHAWEEVFGLGPA 265 Query: 227 AIQ----------------EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 ++ + ++ +YN + L ++ + V++ + +L Sbjct: 266 PNTPWEDLVLAVDAFLRVLDQTPQNEQVSIASYNSHGTLDCW--LLDDFDSVRAAVAQLA 323 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P +T M+ +E K ++ +TDG ++ + Sbjct: 324 PNGSTGIGNGMNSGKTAFTHENAR-------PYASKTMVVMTDGNHNYGTQPNTVAQ--- 373 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + M ++ + I +V + + + G+ + + EL+ +F++I + + Sbjct: 374 --QLMSSSNLNIQTVTFGGGADQETMQEVAVTGLGRHYHADSGDELVSAFEEIANNL 428 >gi|312793553|ref|YP_004026476.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312180693|gb|ADQ40863.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 726 Score = 104 bits (259), Expect = 2e-20, Method: Composition-based stats. Identities = 31/147 (21%), Positives = 62/147 (42%), Gaps = 19/147 (12%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 G PL+ + VK+ +++++ + TN + A +L + S Sbjct: 81 DDYGYLLQPLTTDFQTVKNAIDRIDSWGGTNIAEGIRIANHQL--------ISQSSDDRI 132 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SS 364 K +I +TDGE + +N G+ IY++ + ++LLR + Sbjct: 133 KVIILLTDGEGYYDNNLT---------TEAKNNGITIYTIGLGT-SVDENLLRNIATQTG 182 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQS 391 G +F V+ + +L + F +IT+ + E Sbjct: 183 GMYFPVSSASQLPQVFKRITEIVTEPI 209 >gi|163745746|ref|ZP_02153106.1| hypothetical protein OIHEL45_09145 [Oceanibulbus indolifex HEL-45] gi|161382564|gb|EDQ06973.1| hypothetical protein OIHEL45_09145 [Oceanibulbus indolifex HEL-45] Length = 554 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 51/362 (14%), Positives = 98/362 (27%), Gaps = 74/362 (20%) Query: 1 MTAIIISVCFL---FITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK 57 M +++ V F F A+DLA+ R Q+ LD AVL+ + ++ Sbjct: 18 MLVLMLIVFFGITIFGGLAVDLANHERTRTTFQTHLDNAVLAAASL-----------SQD 66 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT-E 116 + + + +++ I + + + +P Sbjct: 67 LDAEEVVRSYLTSAGLD-------------PSEVEIETREEKIGGILVGRTVEASLPAGL 113 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED------LYLQKHND 170 N + +++ S I I +VLDVS SM D Sbjct: 114 NTYFFRFFDIDTLGMTISSEATERVE----DIEISLVLDVSGSMGDITSDRSGIKMDLLK 169 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA-----PANRKIDVLIESAGNLVNSIQ 225 + +L + S + + + Sbjct: 170 RAAGDFVETILSDAEEGRVSISIVPYSTKVNPGSALLGQYTVSQEHSYSHCVDFDADDFT 229 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGN--------------QCTPLSNNLNEVKSRLNKLNP 271 + R G N TPLS+++ E+K+++ L P Sbjct: 230 HLRIDTATELQRTGHFLIGSESTSNRTAGQWVCRFDSGFAVTPLSSSVAELKAQIAALTP 289 Query: 272 YENTNTYPAMHHAYRELYNE-----------------KESSHNTIGSTRLKKFVIFITDG 314 +T+ L + + G+ K ++ +TDG Sbjct: 290 LGSTSIDMGAKWGLALLDPSAQTPIAAMIASGQVNRAFQGRPHVYGADNSMKVLVLMTDG 349 Query: 315 EN 316 EN Sbjct: 350 EN 351 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 52/181 (28%), Gaps = 7/181 (3%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 G S+ + +N S + I + +N + + +T Sbjct: 375 GYTYYSVASSESNYENDSDWTYPESNFYAIHPFGTQRMWSNYTLANNS-DFRQARMSTEV 433 Query: 278 YPAMHHAYRELYNE----KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 + E+ S + D + + QIC Sbjct: 434 RLDWPEVWAEMSPYYYGYNMYGRRYNSSWYWYQRATDFRDYIQWTVDSVEKDRRLRQICG 493 Query: 334 YMRNAGMKIYSVAVSAPPEGQ-DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 AG+ IYS+ + +LL+ C S +F V E+ +FD I I + Sbjct: 494 VANAAGVVIYSIGMDVDNTNSLNLLKDCASSESHYFDVEGL-EIQTAFDMIAASISMLRL 552 Query: 393 R 393 Sbjct: 553 T 553 >gi|327193254|gb|EGE60160.1| hypothetical protein RHECNPAF_1700073 [Rhizobium etli CNPAF512] Length = 457 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 49/334 (14%), Positives = 103/334 (30%), Gaps = 38/334 (11%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+ + + + + D +R +MQS LDAA+++ I + K + Sbjct: 44 VALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAVKQINNSEDTD---ALKQKVY 100 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 F Q++ G + + + + A T Sbjct: 101 DWFHAQVENSYALGE-------------------IEIDTTNHNITATAS---GTVPTTFM 138 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + +S+ S + +++ +V+D S SM Sbjct: 139 KIANIDTVPVSVGS---AVKGPATSYLNVYIVIDRSPSMLLAATTSGQSTMYSGIGCQFA 195 Query: 182 PPPPKKSFWSKNTTKSKYAPAP-APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 K T + Y + + DV ++ +++ I ++ + RI Sbjct: 196 CHTGDAHTVGKKTYANNYDYSTEKNIKLRADVAGDAVREVLDMIDESD----SNHERIKV 251 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT----YPAMHHAYRELYNEKESSH 296 Y++G + + + + + RL+ + Y T+ Y A L + Sbjct: 252 GLYSLGDTTKEVLAPTLDTSNARKRLSD-DSYGLTSATSMNYTYFDVALAALQKIVGTGG 310 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 + S K V+ +TDG S + L+ Sbjct: 311 DGTSSANPLKLVLLLTDGVQSQRGWVVKNSSNLK 344 >gi|85716351|ref|ZP_01047324.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] gi|85696867|gb|EAQ34752.1| hypothetical protein NB311A_19225 [Nitrobacter sp. Nb-311A] Length = 542 Score = 103 bits (257), Expect = 4e-20, Method: Composition-based stats. Identities = 28/226 (12%), Positives = 77/226 (34%), Gaps = 18/226 (7%) Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 P + + + A A + + S + + Sbjct: 316 HPWRDTDNPGAAAAAPAAPQPSAAANDGGWTGCINDRDREYDISNTAPSTGSDGTPSTKF 375 Query: 244 ----NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES--SHN 297 + T +S+ + +K++++ + P NTN + ++ L + Sbjct: 376 YAEQWKDCLPATITAMSSQWSTLKNQIDAMTPSGNTNQSIGLAWGWQSLSTTNGPIAAPG 435 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNT---------LNTLQICEYMRNAGMKIYSVAVS 348 + +++ ++DG N+ Y +C+ ++++G+ I+++ V+ Sbjct: 436 KESGYVYQDYIVLLSDGLNTQNRWYSCPPSGPCPTIDARQALLCQKVKDSGVTIFTIQVN 495 Query: 349 APPEG--QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + +L+ C G F + + E ++F I +I + + Sbjct: 496 VGSKDPLSQVLQNCAS-DGNFQMITSATETADAFQNILTQISQLRL 540 Score = 76.5 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 33/278 (11%), Positives = 62/278 (22%), Gaps = 27/278 (9%) Query: 10 FLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIK 69 F+ A+D R+ MQ+ALD+AVL ++ T+ Sbjct: 1 MGFVGAAVDYTRANAARSSMQAALDSAVLMVSKDAAANPTMTSQQITNAVQR-------- 52 Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 + + G + + + Q + Sbjct: 53 -YFTSLYNDKSAFGVTVSATYTPSSSSAAAKILASGQGAIQ-------TDFMKIAGFPQL 104 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL--LPPPPKK 187 + ST + N + + +VLD + SM L Sbjct: 105 SFGTSSTS----TWGNSRMRVALVLDNTGSMSSNGKMSALQRAAKDMIDSLSAFAKKTGD 160 Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 + S + N E G + S + + Sbjct: 161 VYISIIPFSKDVNVDTSNYNAAWINWAEWLGEPPVLDPASSYGGSKPSNWDDIVEDSNCP 220 Query: 248 VGNQCTPLS-----NNLNEVKSRLNKLNPYENTNTYPA 280 + L+ KS ++ Y Sbjct: 221 FKKNSHGFTCADRPATLSGAKSDTKRIPSSGKYAGYIC 258 >gi|89069885|ref|ZP_01157219.1| hypothetical protein OG2516_06272 [Oceanicola granulosus HTCC2516] gi|89044561|gb|EAR50680.1| hypothetical protein OG2516_06272 [Oceanicola granulosus HTCC2516] Length = 536 Score = 103 bits (256), Expect = 6e-20, Method: Composition-based stats. Identities = 44/340 (12%), Positives = 85/340 (25%), Gaps = 69/340 (20%) Query: 15 YAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQ 74 A+D R ++Q+ LD AVL+ + + K Sbjct: 36 MAVDFMRTETARGRLQATLDGAVLAAA-------DLDQDKDPVEVVRDYVAKAGLDPFLI 88 Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + E AG Q I + A+ ++ + ++ R Sbjct: 89 DVDVTEIAG------------------QRIVTASAKSDVT---MHFMKMVGIDFLPAPAR 127 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 ST S + + +VLD+S SME + + Sbjct: 128 STASEAVS----NLDVSLVLDMSGSME-GDKLDQLQAAAKNFVGIVYDTMGAEKILLNVV 182 Query: 195 TKSKYAPAPAPANR------KIDVLIESAGNLVNSIQKAIQ---------EKKNLSVRIG 239 + APA + + + G Sbjct: 183 PYATQVAAPAGLLDMLGAFLREHSYSNCVSFSAADFTETSILEAAALPQGGHFDPFYTWG 242 Query: 240 TIAY------NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN--- 290 + Y + L++ E++ ++ L NT+ M + Sbjct: 243 PLRYDDVTFVCNPDPSTEVLTLASTQREIEDYIDGLVAEGNTSIDVGMKWGAALIDPDLG 302 Query: 291 ------------EKESSHNTIGSTRLKKFVIFITDGENSG 318 + G K ++ +TDG+N+ Sbjct: 303 STLNEFANGPSAAGINPVALWGDRSTDKVIVLMTDGKNTT 342 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 46/132 (34%), Gaps = 5/132 (3%) Query: 267 NKLNPYENTNTYPAMHHAYRELYN-----EKESSHNTIGSTRLKKFVIFITDGENSGASA 321 N+ Y EL++ H+ + + + + D + S Sbjct: 404 NRFPHGGGRIEDNVRRLTYEELFSRVSVYYNAYYHHYLQNFDRTELDTWYWDFLDMSLST 463 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 IC +N G+++++V + ++ C S FF V+ +L +F+ Sbjct: 464 SAKNARLEAICTAAKNQGVQVFTVGFEVEDDEAIIMEDCASSRAHFFRVSGGGDLTTAFE 523 Query: 382 KITDKIQEQSVR 393 I +I E + Sbjct: 524 SIARQITELRLT 535 >gi|297581617|ref|ZP_06943539.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] gi|297534024|gb|EFH72863.1| flp pilus assembly protein TadG [Vibrio cholerae RC385] Length = 467 Score = 102 bits (254), Expect = 9e-20, Method: Composition-based stats. Identities = 47/448 (10%), Positives = 121/448 (27%), Gaps = 66/448 (14%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+++ F ++ + + + +++ DA + S S + + + Sbjct: 21 AALMLLGMLTFFSFVLLVIVLSTTDSRLSMLADAVLYSTTNSYNAKADAQQMSEANTPQP 80 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA-------QYEIP 114 + ++ + D + +A EI Sbjct: 81 NLGLSSLQVDTGNNENAAQVQVSGRVDRGSLALTDTLGTSDVLVTHQAQSKIHQTTLEIV 140 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ-------- 166 + +T + V+ + + + + Sbjct: 141 VMLDVSNSMKGEPMTQSIKGLRDFADILYAEERRDFSKVVSIVPATGLVNIGHRPEFFSA 200 Query: 167 ------KHNDNNNMTSNKYLLPPPPKKSFWSKN---------TTKSKYAPAPAPANRKID 211 + + L P W K + + R+++ Sbjct: 201 SAFAIPRDWRSLAKERGWKDLLHPEVPGRWRKAMCTALPEEQDELTSVSALTPNWIRRLE 260 Query: 212 VL--IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI--------------------GIVG 249 + ++ + + K E + + T Y+ Sbjct: 261 LSPPDQNLRLHMEWMSKPAIEHYENDMPLFTYYYSGNPKEKYSPNKHEQRGLFDSPDCGV 320 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL----- 304 +Q PL + L+ L P NTN + A+R L + S Sbjct: 321 SQIQPLLSTRRAFIKALDTLYPEFNTNNAEGVMWAWRLLSPHWRGYWDKGKSELPRDYQH 380 Query: 305 ---KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 +K ++ TDG + A ++ +C M+ G++I S+ +++ C Sbjct: 381 PNNRKVMLLFTDGNHLVDVAKRDRKQV-ALCREMKKQGIEIISIDF---NNRSQVMKSCA 436 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQE 389 +GQ++ ++ + ++ + + Sbjct: 437 S-AGQYYIADNRT-IRSVLKQVATTLSK 462 >gi|188580137|ref|YP_001923582.1| hypothetical protein Mpop_0869 [Methylobacterium populi BJ001] gi|179343635|gb|ACB79047.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 477 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 57/459 (12%), Positives = 122/459 (26%), Gaps = 90/459 (19%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M A+ + + ID + + ++ +A DAAVL+G + Sbjct: 29 MFALALLPTLGLVGLGIDYGMAITSKTRLDNAADAAVLAGVVTAKEYIASNAKQGDATAA 88 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + Q + A +++++ + Y +N Sbjct: 89 GLTAGRN------QATKAFAINTGKVPFATVSVSRLDVTRSGQTLTATVIYTATIQNT-F 141 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ + T + T + + + +++DVS SM + Sbjct: 142 GKILGLSSTTFTNTITASADLA---SYLDFYLMVDVSGSMG----LPTAAADAEKLASIT 194 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 F + + A + D + + L+ RIG Sbjct: 195 KEDQGNCQFACHFPGRKGWNNAAGKIQLRSDAVNNAVCELLKRAATP---VVPNQYRIGF 251 Query: 241 IAYNIG-IVGNQCTPLSNNLNEVKSRLN----------KLNPYENTNTYPA--------- 280 + + + + ++ +++ L +T + Sbjct: 252 YPFINRLATLSPLSDTTTSMTALRTAAQCDKTWPLAFTNLLDTGSTQLFTGNNPTTGTGS 311 Query: 281 ----MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL------- 329 A ++ + + +T K FV ITDG + S N Sbjct: 312 GGTHFEKALPQMKATIQPYGDGSSTTNSKPFVFLITDGMQNSQSYSTNNDARTFPGSPSL 371 Query: 330 -------------------QICEYMRNAGMKIYSVAV----------------------S 348 C+ +++AG I + + Sbjct: 372 FKGYGNAGWDGSQPAQIDPSKCKELKDAGAIISILYIPYNQVKNYTNDSYIVWENNRVNG 431 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 P D LRKC G F+ N + ++ S + D+ Sbjct: 432 FSPTLADPLRKCASQ-GFFYTANSADDITASLGAMFDQA 469 >gi|170740935|ref|YP_001769590.1| hypothetical protein M446_2717 [Methylobacterium sp. 4-46] gi|168195209|gb|ACA17156.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 432 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 55/415 (13%), Positives = 118/415 (28%), Gaps = 59/415 (14%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLK 73 A+D R ++ + D AVL+ ++ + D + T + Sbjct: 43 GAAVDFTSYQKARTELDAVADQAVLAAVSAAGMKMSQADAEAAMAKLFTDAAAALPNVSA 102 Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSL 133 D + A + + + L + Sbjct: 103 SPRAATAPTTDGVRTASLTYSAT-------------------IRTGIMRLAGFSTVAFGG 143 Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 +T S + ++LD S SM ++ S Sbjct: 144 TATA---ASPNPIFTDFYLLLDNSPSMGVAATTADIATMVANTSDQCAFACHD---MSAG 197 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI-AYNIGIVGNQC 252 +IDV+ ++ L+++ + I + GI NQ Sbjct: 198 GNDYYAKAKNLGVKMRIDVVRDATQQLMDTASAKAIAAGQYRMAIYSFGTSCSGIGLNQV 257 Query: 253 TPLSNNLNEVKSRLNKLNPY------ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 + L+ NL+ K+ L+ N + + L + + + + +K Sbjct: 258 SALTANLSTSKTDAGALDLMTVPYQNYNNDQCTDFDGIFARLNSAVPNPGSGASAASPQK 317 Query: 307 FVIFITDGENSGASAYQNTL----------NTLQICEYMRNAGMKI---YSVAVSAPPEG 353 V F++DG T TL C+ +++ G+++ Y+ + P G Sbjct: 318 VVFFVSDGVADANYPSTCTKPTTNGRCQEPITLANCQALKDRGIRVAVLYTTYLPLPTNG 377 Query: 354 -------------QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + C ++ V+ S + ++ + K+ + RI Sbjct: 378 WYNTWIAPFSSQIATNMAACAS-PDLYWPVSPSEGIADAMKGLFKKVVDSQRRIT 431 >gi|51597679|ref|YP_071870.1| membrane protein. [Yersinia pseudotuberculosis IP 32953] gi|51590961|emb|CAH22619.1| Putative membrane protein [Yersinia pseudotuberculosis IP 32953] Length = 518 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 56/472 (11%), Positives = 124/472 (26%), Gaps = 98/472 (20%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +I V I + + +H + R ++ AL+ A L+ + Sbjct: 29 FMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRNDRASNNRNNYLVT 88 Query: 62 TIFKKQI-----------------------KKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + L+ + + Q ++N Sbjct: 89 SYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQMNYQLALLNSYLKQTPSPTWDVNEN 148 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE------NLAISICM 152 + S A+ G + ++ + + N I Sbjct: 149 GAARKYLSSIAEPIDVVFVTDFSGSMDLPFGDIERNNRITKLDELKAIFVKLNNRIFSND 208 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 ++ + + K N S+ Y P K +Y + +D Sbjct: 209 GINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDRNGHYLQRYTASNLKNIPGLDN 268 Query: 213 LIE----SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV------ 262 L + G L AI + R I + +N++ Sbjct: 269 LSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKAYKVNQISTITKI 328 Query: 263 -------KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG--------------- 300 K +N ++ T M + KE++ ++ Sbjct: 329 VEEHIDYKETINSIDRNGET-IDIPMDDILDPFFCLKETNAKSLNFDPNSKGDINEILNM 387 Query: 301 ---------------------STRLKKFVIFITDGENSGASAY------------QNTLN 327 S K +I ++DG+++ L Sbjct: 388 KAEGGTLASSGILVGNKMLTESQNNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKLI 447 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQ--DLLRKCTDSSGQFFAVNDSRELL 377 T +C+ +++ G+K+ + + P+ D + C + G F+ ++ EL Sbjct: 448 TEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVGT-GNFYLAKNAHELE 498 >gi|238759128|ref|ZP_04620297.1| tight adherance operon protein [Yersinia aldovae ATCC 35236] gi|238702676|gb|EEP95224.1| tight adherance operon protein [Yersinia aldovae ATCC 35236] Length = 448 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 45/436 (10%), Positives = 117/436 (26%), Gaps = 57/436 (13%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 I + + + + +++H + ++ A++ A L+ ++ T+ ++ Sbjct: 29 FIIFLPLFIGLLYLSFEISHYLQKAAKLSDAIEQATLALTIENNTN---NPDETQTEKNI 85 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 ++ + +L S+ I+ P + + L + Sbjct: 86 SLVNAYARAYLPSESFSAPVIDIISH------------PNYIEYRAATTLNYTPKFLTKE 133 Query: 122 GLIPSA-LTNLSLRSTGIIERSSENLAI-SICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 + +S I + + I + V+D S SM+ + + + Sbjct: 134 LITNIDRRIIVSDNGVAIKNKFTSPGEITDVVFVVDYSVSMDGNFGDEKKTTKIQELRRI 193 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 + T + + + K + Sbjct: 194 FEDLNNTILKNNNTHTIGFVPFSWGTKKIIGKGIHRKIYCHFPFVPKTPMPPSYYLGDLK 253 Query: 240 TIAY---------NIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----------------- 273 + N ++ N N + ++ + P Sbjct: 254 SYNPAKELTDAVKNNIDYDETIKSITANYNFINIPIDDIKPSSFCLKGSDAYTLRSDDIT 313 Query: 274 NTNTYPAMHHAYRELYNEKE-----SSHNTIGSTRLKKFVIFITDGENSGASAY------ 322 N N + H L + S K +I I+DG + S+ Sbjct: 314 NDNIQENIEHEVNGLTLISSGILVANDIFRKDSKNKDKLMIIISDGNDQEISSDLTQEKI 373 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL-ESFD 381 TL +CE ++ +++ + ++ + C ++ ++ EL + Sbjct: 374 TKTLIEKGMCERIKENNIRMVFIGIAYTVKEIK-WEDCVGKR-NYYEAQNAHELEADLRQ 431 Query: 382 KITDKIQEQSVRIAPN 397 + + R P Sbjct: 432 ALGTIEASEVGRNIPK 447 >gi|239908012|ref|YP_002954753.1| hypothetical protein DMR_33760 [Desulfovibrio magneticus RS-1] gi|239797878|dbj|BAH76867.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 451 Score = 102 bits (253), Expect = 1e-19, Method: Composition-based stats. Identities = 51/457 (11%), Positives = 119/457 (26%), Gaps = 112/457 (24%) Query: 18 DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY 77 DL + ++++Q+A+D+A L+G + D + T + Sbjct: 17 DLGRVSVEQSRLQNAVDSAALAGSLQLPDDPDVSTGAVTAAATQNLLANDADA------- 69 Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 ++ A+ ++ + L +I ++ + Sbjct: 70 --------------TGILVESGGATRSVCVSAEAKVE---MTLSQVIGIGDQTVTAEACA 112 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN-------------KYLLPPP 184 I + MVLD + SM+ + + N K L P Sbjct: 113 GYN------DIELVMVLDATGSMKGTPIANVKEAATNLVNLIMPSSSSTSTRSKIGLVPF 166 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT---- 240 K N + A + + + K N+ Sbjct: 167 QGKVRIDGNDPVTAEANPDGVGPGCRNADGTLNNGKLRTEYSKTTTKTNIFYGYTLSGVS 226 Query: 241 -IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSH 296 + + LS++ + + S + LN T + L Sbjct: 227 TTSDKTCSGMSPIRALSSDKSAILSNITALNAGQVTSGTIISEGIKWGRHVLTPTAPYVE 286 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQN-------------------------------- 324 + + ++K +I +TDG+ + Sbjct: 287 GSTDTK-VRKIMIVLTDGDTEDGRCGGSYASASKTINTYWTNAYFGQGLKPDSSASPYST 345 Query: 325 --------------------TLNTLQICEYMR---NAGMKIYSVAVS-APPEGQDLLRKC 360 + + N ++I+S+ + ++L+++ Sbjct: 346 LSTAALTLAQIPDCKDGGLLNTYVVNEATLAKTDANYPIEIFSIRFGDSDSTDKNLMKQI 405 Query: 361 TD----SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + +F D + E F KI ++ ++ + Sbjct: 406 ASSKPGTEDHYFDAPDEAGIKEMFKKIGQQLGQRLMT 442 >gi|186896818|ref|YP_001873930.1| hypothetical protein YPTS_3520 [Yersinia pseudotuberculosis PB1/+] gi|186699844|gb|ACC90473.1| conserved hypothetical protein [Yersinia pseudotuberculosis PB1/+] Length = 518 Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats. Identities = 57/472 (12%), Positives = 125/472 (26%), Gaps = 98/472 (20%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +I V I + + +H + R ++ AL+ A L+ + Sbjct: 29 FMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRNDRASNNRNNYLVT 88 Query: 62 TIFKKQI-----------------------KKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + L+ + + Q ++N Sbjct: 89 SYAQSYLPSERFSQPRVVNTYNEILGYTEYNASLQMNYQLALLNSYLKQTPSPTWDVNEN 148 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE------NLAISICM 152 + S A+ G + ++ L + + N I Sbjct: 149 GAARKYLSSIAEPIDVVFVTDFSGSMNLPFGDIELNNRITKLDELKAIFVKLNNRIFSND 208 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 ++ + + K N S+ Y P K +Y + +D Sbjct: 209 GINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYTASNLKNIPGLDN 268 Query: 213 LIE----SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV------ 262 L + G L AI + R I + +N++ Sbjct: 269 LSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKAYKVNQISTITKI 328 Query: 263 -------KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG--------------- 300 K +N ++ T M + KE++ ++ Sbjct: 329 VEEHIDYKETINSIDRNGET-IDIPMDDILDPFFCLKETNAKSLNFDPNSKGDINEILNM 387 Query: 301 ---------------------STRLKKFVIFITDGENSGASAY------------QNTLN 327 S K +I ++DG+++ L Sbjct: 388 KAEGGTLASSGILVGNKMLTESQNNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKLI 447 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQ--DLLRKCTDSSGQFFAVNDSRELL 377 T +C+ +++ G+K+ + + P+ D + C + G F+ ++ EL Sbjct: 448 TEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVGT-GNFYLAKNAHELE 498 >gi|323493925|ref|ZP_08099042.1| putative Flp pilus assembly protein TadG [Vibrio brasiliensis LMG 20546] gi|323311866|gb|EGA65013.1| putative Flp pilus assembly protein TadG [Vibrio brasiliensis LMG 20546] Length = 427 Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats. Identities = 42/445 (9%), Positives = 125/445 (28%), Gaps = 75/445 (16%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + ++ + + + +++ ++ M ++ A + A L+ I P +D+ Sbjct: 5 LFIGLLPIMVILMAFSMQMSQQMLAHARVLEAAEVASLAL---------IASPKESEDEN 55 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 ++ + +++ + + + ++ + ++ + ++ Sbjct: 56 VKYARQLVDRYVVDNINDVDVEVYTRKCEYKDGCVQESGEVAPFSDFVVSAKAEHKSWIA 115 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND------NNNM 174 L + + R + + + D S SM + Sbjct: 116 YE--KVDLKPEFEVAGKSVTRKYLPQPVDVYFIGDFSGSMNGHWKGGKTKLDVVKQTIER 173 Query: 175 TSNKYLLPPPPKKSFWSKNTTK------SKYAPAPAPANRKIDVLIESAGNLVNSIQKA- 227 +KS + + + R+ + + + Sbjct: 174 VVEDIENFNTEEKSRVALLGYNPLHVKQTGTVYLNSYGYRRSWPKKVAYDYARGTTAQTV 233 Query: 228 --IQEKKNLSVRIGTI----------------AYNIGIVGNQCTPLSNNLNEVKSRLNK- 268 + + ++ R+ + PL+ + KSRL Sbjct: 234 AKMFDPPSVYSRVQEYVRGMSRHDVKNLVVNNDRFVDYYKFYDIPLTEDYTNFKSRLANA 293 Query: 269 -LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L T+++ + A +E + ++ I ++DG ++ + Q +N Sbjct: 294 SLGAEGGTSSWNGIIAAAQEANRA--------TNINPEQVFIVLSDGADNDTNYLQRLVN 345 Query: 328 TLQICEYMR------------------NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 +C +R + + + V + D C + Sbjct: 346 -QGLCTKLRSTISAKRNRFQSKTGSAGKTKVTMGVIGVDYRVKESDGFGDC-FGRKNIYH 403 Query: 370 VNDSRELLESFDKITDKIQEQSVRI 394 D + + I + I E++ R+ Sbjct: 404 AKDGD---DVYKYILNLINEETGRL 425 >gi|261251272|ref|ZP_05943846.1| protein TadG associated with Flp pilus assembly [Vibrio orientalis CIP 102891] gi|260938145|gb|EEX94133.1| protein TadG associated with Flp pilus assembly [Vibrio orientalis CIP 102891] Length = 436 Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats. Identities = 44/445 (9%), Positives = 125/445 (28%), Gaps = 75/445 (16%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + ++ + +F+ +++ ++ M ++ A + A L+ I P +D Sbjct: 14 LFIGLLPIMVIFMAFSMQMSQQMLAHARVLEAAEVASLAL---------IASPKESEDDN 64 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 ++ + +++ + + + + + ++ + ++ Sbjct: 65 VKYARQLVDRYVVDNINDVDVEVYTRKCEYKDGCVQASGEVAPFSDFVVSAKAEHKSWIA 124 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND------NNNM 174 + L + + R + + + D S SM + Sbjct: 125 YE--EAELKPEFEVAGKSVTRKYLPQPVDVYFIGDFSGSMTGHWKGGKTKLDVVKQTIER 182 Query: 175 TSNKYLLPPPPKKSFWSKNTTK------SKYAPAPAPANRKIDVLIESAGNLVNSIQKA- 227 +KS + S A R+ + + + Sbjct: 183 VVEDIADFNTEEKSRVALLGYNPLHVKQSGTVYLNAYGYRRSWPKKIAYDYARGTSAQTV 242 Query: 228 --IQEKKNLSVRIGTI----------------AYNIGIVGNQCTPLSNNLNEVKSRLNK- 268 + + ++ R+ + PL+ + K+RL Sbjct: 243 AKMFDPPSVYSRVQEYVRGMSRLDVENLVVNNDRFVDYYKFYDIPLTEDYTHFKARLASA 302 Query: 269 -LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L T+++ + A +E + ++ I ++DG ++ + Q +N Sbjct: 303 WLGAEGGTSSWNGIIAAAQEANRA--------TNINPEQVFIVLSDGADNDTNYLQRLVN 354 Query: 328 TLQICEYMR------------------NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 +C +R + + + V + D C + Sbjct: 355 -QGLCTKLRSTISAKRNRFQSKTGSAGKTKVTMGVIGVDYRVKESDGFGDC-FGRKNIYH 412 Query: 370 VNDSRELLESFDKITDKIQEQSVRI 394 D + + I + I E++ R+ Sbjct: 413 AKDGD---DVYKYILNLINEETGRL 434 >gi|22127367|ref|NP_670790.1| hypothetical protein y3493 [Yersinia pestis KIM 10] gi|45442761|ref|NP_994300.1| hypothetical protein YP_2999 [Yersinia pestis biovar Microtus str. 91001] gi|108809099|ref|YP_653015.1| hypothetical protein YPA_3108 [Yersinia pestis Antiqua] gi|108810706|ref|YP_646473.1| hypothetical protein YPN_0541 [Yersinia pestis Nepal516] gi|150260286|ref|ZP_01917014.1| putative fimbrial anchor [Yersinia pestis CA88-4125] gi|162419964|ref|YP_001604884.1| hypothetical protein YpAngola_A0266 [Yersinia pestis Angola] gi|165939877|ref|ZP_02228416.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|166009017|ref|ZP_02229915.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166211928|ref|ZP_02237963.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167466384|ref|ZP_02331088.1| hypothetical protein YpesF_00480 [Yersinia pestis FV-1] gi|218927875|ref|YP_002345750.1| hypothetical protein YPO0684 [Yersinia pestis CO92] gi|229837366|ref|ZP_04457529.1| putative fimbrial anchor [Yersinia pestis Pestoides A] gi|229840578|ref|ZP_04460737.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. PEXU2] gi|229842872|ref|ZP_04463024.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. India 195] gi|229900904|ref|ZP_04516028.1| putative fimbrial anchor [Yersinia pestis Nepal516] gi|294502750|ref|YP_003566812.1| hypothetical protein YPZ3_0640 [Yersinia pestis Z176003] gi|21960452|gb|AAM87041.1|AE013952_8 hypothetical [Yersinia pestis KIM 10] gi|45437627|gb|AAS63177.1| putative membrane protein [Yersinia pestis biovar Microtus str. 91001] gi|108774354|gb|ABG16873.1| membrane protein [Yersinia pestis Nepal516] gi|108781012|gb|ABG15070.1| putative membrane protein [Yersinia pestis Antiqua] gi|115346486|emb|CAL19360.1| putative membrane protein [Yersinia pestis CO92] gi|149289694|gb|EDM39771.1| putative fimbrial anchor [Yersinia pestis CA88-4125] gi|162352779|gb|ABX86727.1| conserved hypothetical protein [Yersinia pestis Angola] gi|165912188|gb|EDR30826.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. IP275] gi|165992356|gb|EDR44657.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206674|gb|EDR51154.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. B42003004] gi|229682243|gb|EEO78335.1| putative fimbrial anchor [Yersinia pestis Nepal516] gi|229690139|gb|EEO82196.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. India 195] gi|229696944|gb|EEO86991.1| putative fimbrial anchor [Yersinia pestis biovar Orientalis str. PEXU2] gi|229705489|gb|EEO91499.1| putative fimbrial anchor [Yersinia pestis Pestoides A] gi|262364727|gb|ACY61284.1| hypothetical protein YPD8_0594 [Yersinia pestis D182038] gi|294353209|gb|ADE63550.1| hypothetical protein YPZ3_0640 [Yersinia pestis Z176003] gi|320016753|gb|ADW00325.1| putative fimbrial anchor [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 518 Score = 101 bits (252), Expect = 1e-19, Method: Composition-based stats. Identities = 56/472 (11%), Positives = 124/472 (26%), Gaps = 98/472 (20%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +I V I + + +H + R ++ AL+ A L+ + Sbjct: 29 FMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRNDRASNNRNNYLVT 88 Query: 62 TIFKKQI-----------------------KKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + L+ + + Q ++N Sbjct: 89 SYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQMNYQLALLNSYLKQTPSPTWDVNEN 148 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE------NLAISICM 152 + S A+ G + ++ + + N I Sbjct: 149 GAARKYLSSIAEPIDVVFVTDFSGSMDLPFGDIERNNRITKLDELKAIFVKLNNRIFSND 208 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 ++ + + K N S+ Y P K +Y + +D Sbjct: 209 GINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYTASNLKNIPGLDN 268 Query: 213 LIE----SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV------ 262 L + G L AI + R I + +N++ Sbjct: 269 LSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKAYKVNQISTITKI 328 Query: 263 -------KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG--------------- 300 K +N ++ T M + KE++ ++ Sbjct: 329 VEEHIDYKETINSIDRNGET-IDIPMDDILDPFFCLKETNAKSLNFDPNSKGDINEILNM 387 Query: 301 ---------------------STRLKKFVIFITDGENSGASAY------------QNTLN 327 S K +I ++DG+++ L Sbjct: 388 KAEGGTLASSGILVGNKMLTESQNNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKLI 447 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQ--DLLRKCTDSSGQFFAVNDSRELL 377 T +C+ +++ G+K+ + + P+ D + C + G F+ ++ EL Sbjct: 448 TEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVGT-GNFYLAKNAHELE 498 >gi|306821351|ref|ZP_07454960.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] gi|304550638|gb|EFM38620.1| von Willebrand factor [Eubacterium yurii subsp. margaretiae ATCC 43715] Length = 467 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 39/209 (18%), Positives = 78/209 (37%), Gaps = 26/209 (12%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 L K + N + N + E ++ + N + Sbjct: 14 TFLIEDEKDKYDGINIAFVIDSSGSMFYNDPNGLRREVTHKFIDRLTD------NDMAAV 67 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 Y ++ ++N ++ ++K+ TN A+ AY L+N +++ Sbjct: 68 IGFDYKATVLEQ----FTSNKEKLHDAVDKIRSDGGTNIGRAVSIAYD-LFNNLDNNRKE 122 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 KF+I +TDG+ + Y + AG+KIY++ + + LL+ Sbjct: 123 K----YPKFLILLTDGDGDYSEEYTI---------LAKKAGIKIYTIGLG-NGVSEKLLK 168 Query: 359 KCT-DSSGQFFAVNDSRELLESFDKITDK 386 + G++F D+ +L + F+KI DK Sbjct: 169 DIAKGTDGEYFHAKDASKLNKIFEKIADK 197 >gi|145597778|ref|YP_001161854.1| hypothetical protein YPDSF_0468 [Yersinia pestis Pestoides F] gi|145209474|gb|ABP38881.1| membrane protein [Yersinia pestis Pestoides F] Length = 513 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 56/472 (11%), Positives = 124/472 (26%), Gaps = 98/472 (20%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +I V I + + +H + R ++ AL+ A L+ + Sbjct: 24 FMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRNDRASNNRNNYLVT 83 Query: 62 TIFKKQI-----------------------KKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + L+ + + Q ++N Sbjct: 84 SYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQMNYQLALLNSYLKQTPSPTWDVNEN 143 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE------NLAISICM 152 + S A+ G + ++ + + N I Sbjct: 144 GAARKYLSSIAEPIDVVFVTDFSGSMDLPFGDIERNNRITKLDELKAIFVKLNNRIFSND 203 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 ++ + + K N S+ Y P K +Y + +D Sbjct: 204 GINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYTASNLKNIPGLDN 263 Query: 213 LIE----SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV------ 262 L + G L AI + R I + +N++ Sbjct: 264 LSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKAYKVNQISTITKI 323 Query: 263 -------KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG--------------- 300 K +N ++ T M + KE++ ++ Sbjct: 324 VEEHIDYKETINSIDRNGET-IDIPMDDILDPFFCLKETNAKSLNFDPNSKGDINEILNM 382 Query: 301 ---------------------STRLKKFVIFITDGENSGASAY------------QNTLN 327 S K +I ++DG+++ L Sbjct: 383 KAEGGTLASSGILVGNKMLTESQNNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKLI 442 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQ--DLLRKCTDSSGQFFAVNDSRELL 377 T +C+ +++ G+K+ + + P+ D + C + G F+ ++ EL Sbjct: 443 TEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVGT-GNFYLAKNAHELE 493 >gi|190894968|ref|YP_001985261.1| hypothetical protein RHECIAT_PC0000634 [Rhizobium etli CIAT 652] gi|190700629|gb|ACE94711.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 444 Score = 101 bits (252), Expect = 2e-19, Method: Composition-based stats. Identities = 50/334 (14%), Positives = 104/334 (31%), Gaps = 38/334 (11%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+ + + + + D +R +MQS LDAA+++ I + K + S Sbjct: 31 VALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAVKQINNSEDTD---ALKQKVS 87 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 F Q++ G + + + + A T Sbjct: 88 DWFHAQVENSYALGE-------------------IEIDTTNHNITATAS---GTVPTTFM 125 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + +S+ S + +++ +V+D S SM Sbjct: 126 KIANIDTVPVSVGS---AVKGPATSYLNVYIVIDRSPSMLLAATTSGQSTMYSGIGCQFA 182 Query: 182 PPPPKKSFWSKNTTKSKYAPAP-APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 K T + Y + + DV ++ +++ I ++ + RI Sbjct: 183 CHTGDAHTVGKKTYANNYDYSTEKNIKLRADVAGDAVREVLDMIDESD----SNHERIKV 238 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT----YPAMHHAYRELYNEKESSH 296 Y++G + + + + + RL+ + Y T+ Y A L + Sbjct: 239 GLYSLGDTTKEVLAPTLDTSNARKRLSD-DSYGLTSATSMNYTYFDVALAALQKIVGTGG 297 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 + S K V+ +TDG S + L+ Sbjct: 298 DGTSSANPLKLVLLLTDGVQSQRGWVVKNSSNLK 331 >gi|260576512|ref|ZP_05844501.1| conserved hypothetical protein [Rhodobacter sp. SW2] gi|259021235|gb|EEW24542.1| conserved hypothetical protein [Rhodobacter sp. SW2] Length = 529 Score = 101 bits (251), Expect = 2e-19, Method: Composition-based stats. Identities = 44/363 (12%), Positives = 90/363 (24%), Gaps = 66/363 (18%) Query: 16 AIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQG 75 A+DL R +Q LD + L+ + +++ L Sbjct: 56 ALDLMRHEQKRTTLQQTLDRSTLAAAS-------------------------LQQSLDPE 90 Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 S +R+ ++ + A + F ++ + S Sbjct: 91 SVVRDYFAKANMTQYLSGVTVDEGMNYREVNALAAADTNP---FFMQMVGIDSFDAKAAS 147 Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 T S + + MVLD+S SM ++ Sbjct: 148 TAEQRIS----NVEVSMVLDISGSMASNSRLTRLRPAAKEFIDTVINGSDPGRVSISVVP 203 Query: 196 KSKYAPAPAPANRKIDV-----LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + A + +V + + V G + Sbjct: 204 YNAQVNLGAGMMSQFNVNALHSTSYCVELPNSVFGSTGLSQATSFVHNGHFDPWGTGNSS 263 Query: 251 Q----------CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES------ 294 TP+S + +K R++ L T+ + L + Sbjct: 264 NYNCPPTANVAVTPMSGDAAYLKGRVDLLASMGYTSIDVGVKWGTLLLDPSAQPLINGLV 323 Query: 295 -----------SHNTIGSTRLKKFVIFITDGENS--GASAYQNTLNTLQICEYMRNAGMK 341 + K ++ ++DGEN+ I + N+ + Sbjct: 324 GLGQVDEDFTDRPLDPDEANVLKVLVVMSDGENTEEYKLTAPYRSGPSAIYKKTSNSKLT 383 Query: 342 IYS 344 +YS Sbjct: 384 LYS 386 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 56/188 (29%), Gaps = 15/188 (7%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 D L + + Y+ ++ ++ + ++ Sbjct: 352 SDGENTEEYKLTAPYRSGPSAIYKKTSNSKLTLYSDRASTTSDYYWFSD-SKWHTTID-- 408 Query: 270 NPYENTNTYPAMHH----AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 T M A + + + + TD S Q Sbjct: 409 ---GGTTGSVQMTWPEVWAKWSVRYVAKDIYTKALGGSENSWFETFTDEI----SYGQKD 461 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + QIC+ +++G+ I+S+ AP G++ LR C +F ++ +F I Sbjct: 462 VRLQQICDAAKDSGIVIFSIGFEAPENGRNQLRDCASQPSNYFNATGV-QITTAFRAIAT 520 Query: 386 KIQEQSVR 393 ++ + Sbjct: 521 QLSHLRLT 528 >gi|13476511|ref|NP_108081.1| hypothetical protein mlr7847 [Mesorhizobium loti MAFF303099] gi|14027272|dbj|BAB54226.1| mlr7847 [Mesorhizobium loti MAFF303099] Length = 548 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 60/531 (11%), Positives = 136/531 (25%), Gaps = 138/531 (25%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDP------- 53 + + + + +A+D++ +M ++ +Q+ALDAA L+ D + D Sbjct: 17 LMGLGLPAILSAVAFAVDVSTVMRAKSNLQNALDAANLASSHLGDLDISRTDAFDRYFQA 76 Query: 54 -----------------------TTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQ 90 K S +I +A + Q Sbjct: 77 NIAGHGELANAQATLTVDRGVNFIKTKAVASADVNLNFGFLFGHNRHIAVDASAVESDNQ 136 Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKG-LIPSALTNLSLRSTGIIERSSENLAIS 149 + + +N T++L S + + + N Sbjct: 137 LEVVLVLDNTGSMAGARMTALRTATKSLLDTLEATKSPTRQIRASLVPFVTAVNVNGDEF 196 Query: 150 ICMVLDV------------------------------SRSMEDLYLQKHND--------- 170 +D+ + E + Sbjct: 197 DPSWIDMDGKSSTNGVNFPVIDGKRPNHMALFKQLKDTGWTEAGWNGTGWKGCVEARPGA 256 Query: 171 -----------NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK-IDVLIES-- 216 + Y P P+ + ++ + D + Sbjct: 257 YNISDTPPDPDKPDTLFVPYFAPDDPEDAQKPSSSYGNAAKYYNNSYLDDVSDKTKTAKL 316 Query: 217 --------AGNLVNSIQKAIQEKKNLSVRIGTIAY-------------NIGIVGNQCTPL 255 +L + + A ++ K + L Sbjct: 317 KGNRLGIDLSSLADPVPPADKDAKEKVAKYVAPTKALITETGSPITVGPNRACPTPVVSL 376 Query: 256 SNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKESSHNTI-GSTRLKKFVIFIT 312 +++ ++++ +++ TN + R L + + + K V+ +T Sbjct: 377 TDDFDKLRKAASEMTEWNGSGTNVSEGLSWGMRVLSPAAPYTDGAPWKTPGISKIVLLLT 436 Query: 313 DGENSGASAYQNTLN-----------------------------TLQICEYMRNAGMKIY 343 DGEN A + T +C ++N G++IY Sbjct: 437 DGENVVYGASEQPTKSDYTSYGYLAGGRFGSDDQTAAARNVDGWTKSVCTQLKNQGVQIY 496 Query: 344 SVAVSAP-PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ + + + L C ++AVND +L + F I +K + Sbjct: 497 TMVLQSDTAANRALYSACASDPSGYYAVNDPAKLPDVFQHIANKFSRLQLT 547 >gi|149202124|ref|ZP_01879097.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] gi|149144222|gb|EDM32253.1| hypothetical protein RTM1035_12393 [Roseovarius sp. TM1035] Length = 584 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 41/321 (12%), Positives = 76/321 (23%), Gaps = 59/321 (18%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + + ID+ R +Q+ LD AVL+G ++ + Sbjct: 27 FAIF-VMFLVMGGIGIDMMRQEMARASLQATLDRAVLAGATAVNN-----------ATAR 74 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + K + + AGDI + N + A + + +L Sbjct: 75 AVIEDYFAKSGQSDYLAAQEAGDID---------IRLNSSKVTARATQTLD-----TYLM 120 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 L + ST + + I M LDVS SM + +L Sbjct: 121 RLAGVDTLTSAGNSTAEVTI----PKLEIAMALDVSGSM-IGARIDALKPAAIEFVDSIL 175 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPAN-----RKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 S + + + Sbjct: 176 DSTEPNDAVISVVPFSWGVTPSKEIYEALTVNETHKYSSCLELNDSHFTDTTIDPNTAYN 235 Query: 237 R--------------------IGTIAYNIGIVGN---QCTPLSNNLNEVKSRLNKLNPYE 273 + YN P + + ++N L Sbjct: 236 QLIYTSREGVTFGDLTTTPLGDFLDTYNQTCYTQDYFNILPYATTKTALHDKINGLQAGG 295 Query: 274 NTNTYPAMHHAYRELYNEKES 294 +T+ + A L + Sbjct: 296 STSNDEGVKWAAALLDPAFQP 316 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 16/121 (13%), Positives = 38/121 (31%), Gaps = 10/121 (8%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 S + + + P ++ + A+ + + + Sbjct: 453 SVTWSTFNNYNSNTLPGFISSERLSWETAWGLMSPRFYGNTTGNWGPW--------NNFL 504 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--SAPPEGQDLLRKCTDSSGQFFAVNDS 373 N+ + + IC ++ G+ IY++A + P G D ++KC S + Sbjct: 505 NNPINRSKKDERLDDICREAKSEGIVIYTIAFEMGSQPTGADKIKKCASSVNHHYNATTV 564 Query: 374 R 374 Sbjct: 565 N 565 >gi|153946957|ref|YP_001399586.1| hypothetical protein YpsIP31758_0593 [Yersinia pseudotuberculosis IP 31758] gi|152958452|gb|ABS45913.1| conserved hypothetical protein [Yersinia pseudotuberculosis IP 31758] Length = 518 Score = 101 bits (250), Expect = 3e-19, Method: Composition-based stats. Identities = 56/472 (11%), Positives = 125/472 (26%), Gaps = 98/472 (20%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +I V I + + +H + R ++ A++ A L+ + Sbjct: 29 FMALIPVFIGLIFLSFEFSHFIQKRAKLSDAIEQASLALSTENNYRNDRASNNRNNYLVT 88 Query: 62 TIFKKQI-----------------------KKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + L+ + + Q ++N Sbjct: 89 SYAQSYLPSERFSQPRVVNTYNEILGYTEYNASLQMNYQLALLNSYLKQTPSPTWDVNEN 148 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE------NLAISICM 152 + S A+ G + ++ L + + N I Sbjct: 149 GAARKYLSSIAEPIDVVFVTDFSGSMNLPFGDIELNNRITKLDELKAIFVKLNNRIFSND 208 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 ++ + + K N S+ Y P K +Y + +D Sbjct: 209 GINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYTASNLKNIPGLDN 268 Query: 213 LIE----SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV------ 262 L + G L AI + R I + +N++ Sbjct: 269 LSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKAYKVNQISTITKI 328 Query: 263 -------KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG--------------- 300 K +N ++ T M + KE++ ++ Sbjct: 329 VEEHIDYKETINSIDRNGET-IDIPMDDILDPFFCLKETNAKSLNFDPNSKGDINEILNM 387 Query: 301 ---------------------STRLKKFVIFITDGENSGASAY------------QNTLN 327 S K +I ++DG+++ L Sbjct: 388 KAEGGTLASSGILVGNKMLTESQNNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKLI 447 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQ--DLLRKCTDSSGQFFAVNDSRELL 377 T +C+ +++ G+K+ + + P+ D + C + G F+ ++ EL Sbjct: 448 TEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVGT-GNFYLAKNAHELE 498 >gi|103487755|ref|YP_617316.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] gi|98977832|gb|ABF53983.1| hypothetical protein Sala_2274 [Sphingopyxis alaskensis RB2256] Length = 666 Score = 100 bits (249), Expect = 3e-19, Method: Composition-based stats. Identities = 47/400 (11%), Positives = 106/400 (26%), Gaps = 43/400 (10%) Query: 32 ALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQI 91 A ++A + + + +QT + + Sbjct: 269 ASNSAACAALTPPANTTPSPSGSPDYNQTGQYVDGDTRVTTYDTVQTYTFRTYRYVWSSN 328 Query: 92 NITKDKNNPLQYIAESKAQYEIPTE-----NLFLKGLIPSALTNLSLRSTGIIERSSENL 146 + N + E T+ + + + S ++ Sbjct: 329 RCRRQYRNGNFTRTYTTTVTETKTDVFDNKYTYEDRVFDVSGVKSGSAIVTDTGDSGVSI 388 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 + + L + R+ ++ + + L+P + W + + A P Sbjct: 389 SHNWGGCL-IERATTPFDADDTAPSDALDMDIDLVPDADADTQWRILIPEIAFPRARHPQ 447 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP--------LSNN 258 + + + I + + R + G C +++ Sbjct: 448 YAPSSSDPMTVNVFNSDGTRNITSDTSSNGRWQRYSKYWGSGWGVCPAAAMKLTTMTASD 507 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE---KESSHNTIGSTRLKKFVIFITDG- 314 + + L P T M R L + + + + ++F+TDG Sbjct: 508 RATFNTYVQSLQPLGGTYHDAGMVWGARLLSPTGLFADENATAPNDRPISRHIVFMTDGA 567 Query: 315 ----------------------ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + ++ Q+C R G+ I+ V+ Sbjct: 568 MAPNMGNLTFQGYEFLMHRVGGTSDSDLRDRHNNRFTQLCRAARQRGITIWVVSFGVGSN 627 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 L C S GQ F +++ EL E F I +I + + Sbjct: 628 DS--LNNCASS-GQAFEADNAAELNEQFQAIARQISKLRL 664 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 45/327 (13%), Positives = 87/327 (26%), Gaps = 31/327 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +TA I F+ A+D+ + ++Q A DA VL+G ++ + + Sbjct: 27 LTAAAIIPVIGFVGSAVDIGRAYMTQLRLQQACDAGVLAGRRAMG-------GASYDEAA 79 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 K + + I + N ++ A L Sbjct: 80 QAEANKMFNFNFPEA---------KYGATGILFSSRALNASDVEGQASA-----VLPTEL 125 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH-NDNNNMTSNKY 179 + LS T +E S + + +VLDV+ SM N + Sbjct: 126 MFMFGKEEFRLSADCTAKLEIS----NVDVMLVLDVTGSMAQTNAGDSVNRITALKDATM 181 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 + + P + AN +L ++ L +++ + V Sbjct: 182 DFFDTLTNADVGDGRLRFGVVPYSSTANVGQILLAKNPAWLADTVTLPSRTPIFREVYTE 241 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 T + T S+N + P N+ A+ + S Sbjct: 242 TGTETSDDYTDSPTTYSSNWTN-----DGTVPASNSAACAALTPPANTTPSPSGSPDYNQ 296 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTL 326 + T + Sbjct: 297 TGQYVDGDTRVTTYDTVQTYTFRTYRY 323 >gi|218509981|ref|ZP_03507859.1| hypothetical protein RetlB5_22275 [Rhizobium etli Brasil 5] Length = 448 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 45/328 (13%), Positives = 101/328 (30%), Gaps = 32/328 (9%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+ + + + + D +R +MQS LDAA+++ I + K + S Sbjct: 28 VALSLVPMLVAVGASFDYIRSYNVRQRMQSDLDAALIAAVKQINNTEDTD---ALKQKVS 84 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 F Q++ G + + + + A +PT + Sbjct: 85 DWFHAQVENSYALGE-------------------IEIDTTNHNITATASGTVPTTFM--- 122 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 A + S + +++ +V+D S SM Sbjct: 123 ---KIANIDTVPVSVASAVKGPATSYLNVYIVVDTSPSMLLAATTAGQSTMYSGIKCQFA 179 Query: 182 PPPPKKSFWSKNTTKSKYAPAP-APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 K T + Y + + DV ++ +++ I ++ + + +++G Sbjct: 180 CHTGDTHTIGKKTYANNYDYSTEKGIKLRADVAGDAVREVLDMIDES--DSNHERIKVGL 237 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN-TYPAMHHAYRELYNEKESSHNTI 299 + LS ++ + + T+ + L + + + Sbjct: 238 YGLGDTLTEVLAPTLSTDIARTRLADSSYGLTSATSKAATYFDVSLATLKQKVGAGGDGT 297 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLN 327 S K V+ +TDG S + +N Sbjct: 298 TSGTPLKLVLLLTDGVQSQREWVTDKVN 325 >gi|165924896|ref|ZP_02220728.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. F1991016] gi|167418907|ref|ZP_02310660.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167425152|ref|ZP_02316905.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|270487722|ref|ZP_06204796.1| conserved hypothetical protein [Yersinia pestis KIM D27] gi|165923096|gb|EDR40247.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. F1991016] gi|166962901|gb|EDR58922.1| conserved hypothetical protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167055915|gb|EDR65696.1| conserved hypothetical protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|262360782|gb|ACY57503.1| membrane protein [Yersinia pestis D106004] gi|270336226|gb|EFA47003.1| conserved hypothetical protein [Yersinia pestis KIM D27] Length = 492 Score = 100 bits (249), Expect = 4e-19, Method: Composition-based stats. Identities = 56/472 (11%), Positives = 124/472 (26%), Gaps = 98/472 (20%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +I V I + + +H + R ++ AL+ A L+ + Sbjct: 3 FMALIPVFIGLIFLSFEFSHFIQKRAKLSDALEQASLALSTENNYRNDRASNNRNNYLVT 62 Query: 62 TIFKKQI-----------------------KKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + L+ + + Q ++N Sbjct: 63 SYAQSYLPSERFSQPRVVNTYNESLGYTEYNASLQMNYQLALLNSYLKQTPSPTWDVNEN 122 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE------NLAISICM 152 + S A+ G + ++ + + N I Sbjct: 123 GAARKYLSSIAEPIDVVFVTDFSGSMDLPFGDIERNNRITKLDELKAIFVKLNNRIFSND 182 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 ++ + + K N S+ Y P K +Y + +D Sbjct: 183 GINTIGFVPFSWGTKRISANGQVSSTYCHFPYSPKKIDGNGHYLQRYTASNLKNIPGLDN 242 Query: 213 LIE----SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV------ 262 L + G L AI + R I + +N++ Sbjct: 243 LSGIDNLAYGQLDEDKHHAILSEIEKKHRDNEIPTKTRDQAKNFLDKAYKVNQISTITKI 302 Query: 263 -------KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG--------------- 300 K +N ++ T M + KE++ ++ Sbjct: 303 VEEHIDYKETINSIDRNGET-IDIPMDDILDPFFCLKETNAKSLNFDPNSKGDINEILNM 361 Query: 301 ---------------------STRLKKFVIFITDGENSGASAY------------QNTLN 327 S K +I ++DG+++ L Sbjct: 362 KAEGGTLASSGILVGNKMLTESQNNNKLMIILSDGDDNTQKMSSPHDQKAGIINITQKLI 421 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQ--DLLRKCTDSSGQFFAVNDSRELL 377 T +C+ +++ G+K+ + + P+ D + C + G F+ ++ EL Sbjct: 422 TEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVGT-GNFYLAKNAHELE 472 >gi|209884898|ref|YP_002288755.1| hypothetical protein OCAR_5764 [Oligotropha carboxidovorans OM5] gi|209873094|gb|ACI92890.1| conserved hypothetical protein [Oligotropha carboxidovorans OM5] Length = 600 Score = 100 bits (248), Expect = 4e-19, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 74/195 (37%), Gaps = 16/195 (8%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + + S RI + + TP+SN + S++N +NP Sbjct: 406 TWTGCVNDRDQDADTTN-AVMSGSGRIYPEQWKD-CLSATITPMSNQWATLNSKVNAMNP 463 Query: 272 YENTNTYPAMHHAYRELYNEKES--SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT- 328 NTN + ++ L + + + + + +++ ++DG N+ Y Sbjct: 464 SGNTNQAIGLFWGWQTLNTANDPFKAPSKDPNWVYQDYIVILSDGLNTQNRWYTCPNAGP 523 Query: 329 --------LQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFA-VNDSRELL 377 +C+ ++ + I+++ V+ + +L+ C S +F + + + Sbjct: 524 CPTIDGREKTLCDNIKADKITIFTIQVNINSKDPESQVLKDCASSGSGYFQLITSANDTA 583 Query: 378 ESFDKITDKIQEQSV 392 +FD + +KI + + Sbjct: 584 TAFDNVLNKIAKLRI 598 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 23/211 (10%), Positives = 53/211 (25%), Gaps = 27/211 (12%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + ++ + A+D + R MQSALD+A L + + T + Sbjct: 27 IFTLVAIPLVALVGAAVDYTRVSSARTAMQSALDSAALMISKDAATMSDSEITTRARQYV 86 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +++ + NN A + Sbjct: 87 NSL-----------------YTNTETPIQTFSAVYTPNNGSGATILLNAGGN---MPTYF 126 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT----- 175 ++ + + L + + + + + +VLD + SM+ Sbjct: 127 MKIVGTNFSTLPINTAS--TTKWGSSRMRVALVLDNTGSMDQNGKMTALKKAAANATTGL 184 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 K + + +K Sbjct: 185 IKKLSAFNTNEGDVYISVVPFAKDVNVGTSN 215 >gi|114704798|ref|ZP_01437706.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] gi|114539583|gb|EAU42703.1| hypothetical protein FP2506_07676 [Fulvimarina pelagi HTCC2506] Length = 545 Score = 99 bits (247), Expect = 5e-19, Method: Composition-based stats. Identities = 50/358 (13%), Positives = 113/358 (31%), Gaps = 27/358 (7%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 + + + +K S + AG+++ + ++ + Sbjct: 191 SNDLTQVALVPFDTQVKATSSLFGAAGNVSVANPLATGSCATISDPLDRDACYASQNAAP 250 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + + + + + + + + D + Sbjct: 251 PVVDCSKLTDLIDAVLCGVNNLGFKVGTTAITDLRYISDRRYDAFIDGNMFRITRKIGEA 310 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ----------- 225 + + K +S T A APA K + +L+ Sbjct: 311 DCSSVCT--WKKTYSTTTIFETAAGGGAPATSKPNDAETPNNDLIAQYPGPWPRCFVDRS 368 Query: 226 ---KAIQEKKNLSVRIGTIAYNIGIV--GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 A N+S + T L+ +L V++ +NKL P NTN Sbjct: 369 QPYDANATAMNISQKDTIYPEAHCATGSLEPITGLTFDLQSVETAVNKLTPSGNTNVTIG 428 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN------TLNTLQICEY 334 + L + GS ++K +I +TDG N+ + + TL C Sbjct: 429 VQWGMEALTAAAPLTGVRTGSE-VRKVMIVLTDGLNTQNRWWGSRDRNKIDARTLAACNN 487 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G+++Y+V + +DLL+ C ++ ++ V + +L +F + +++ + Sbjct: 488 AKAMGIELYTVRLVEG--NEDLLKTCAETEDKYHYVTSASQLKTTFADLARQVKGVRL 543 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 37/299 (12%), Positives = 86/299 (28%), Gaps = 38/299 (12%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +T + + A+DL + ++N +Q+A+D + L+ + ++T ++ Sbjct: 42 ITCLALVPLIAAAGGAVDLWNARRVQNAVQNAVDTSALAAVSYSGEEQTEREKRA----- 96 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + A I I + KA+Y+I T L Sbjct: 97 --------------------DTLFLNNTAGIAIEDTDLSEEDGAWVYKAEYKIKTNFL-- 134 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ + + + N + + +VLD S SM + + Sbjct: 135 -RVVGIDEFEMESQGAA----ALANSPMDVVLVLDSSGSMAQDNRMVELKASVKLFLEEF 189 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + A + + + L I + + Sbjct: 190 KSNDLTQV---ALVPFDTQVKATSSLFGAAGNVSVA-NPLATGSCATISDPLDRD--ACY 243 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + N C+ L++ ++ V +N L T + + Y+ + Sbjct: 244 ASQNAAPPVVDCSKLTDLIDAVLCGVNNLGFKVGTTAITDLRYISDRRYDAFIDGNMFR 302 >gi|315266493|gb|ADT93346.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 627 Score = 99 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 43/377 (11%), Positives = 105/377 (27%), Gaps = 36/377 (9%) Query: 22 IMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIREN 81 + +M+ DAA + + + + + + L Q + ++ Sbjct: 67 ASQRQAEMR---DAAKVEMARVAAPMQMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQV 123 Query: 82 AGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIER 141 I +I ++ + Y L L + Sbjct: 124 QNGIMVAGEIPVS-----TFSIDVD-TGSYATLRRMLREGHLPEKGTVRVEEMLNYFAYD 177 Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 + S + E ++D + P + S + Sbjct: 178 Y----PLPAKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSG 233 Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + A K+ +L + L + ++ + Y N+ Sbjct: 234 SMASV-DKLPLLQTALKLLTAQLSAQD--------KVSIVVYAGAAGVVLDGASGNDTQT 284 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L +L+ +TN + AY+ + VI TDG+ + Sbjct: 285 LNYALEQLSAGGSTNGGQGITQAYQLAKKHFIPNGINR--------VILATDGDFNVGVT 336 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESF 380 + + + + E ++ G+ + ++ L+ + D G + ++ L E+ Sbjct: 337 DFD--DLIALIEKEKDHGIGLTTLGFGLGNYNDQLMEQLADKGNGNYAYIDT---LNEAR 391 Query: 381 DKITDKIQEQSVRIAPN 397 + D++ IA + Sbjct: 392 KVLVDELSSTLFTIAKD 408 >gi|304412560|ref|ZP_07394165.1| von Willebrand factor type A [Shewanella baltica OS183] gi|307303576|ref|ZP_07583329.1| von Willebrand factor type A [Shewanella baltica BA175] gi|304349036|gb|EFM13449.1| von Willebrand factor type A [Shewanella baltica OS183] gi|306912474|gb|EFN42897.1| von Willebrand factor type A [Shewanella baltica BA175] Length = 627 Score = 99 bits (247), Expect = 6e-19, Method: Composition-based stats. Identities = 47/410 (11%), Positives = 111/410 (27%), Gaps = 45/410 (10%) Query: 1 MTAIIISVCFLFI----TYAIDLAHIMY-IRNQMQSAL-------DAAVLSGCASIVSDR 48 M A+++ L ++ + Q A DAA + + Sbjct: 31 MAALLLVAVSLTACGGKGAEVEHRQAEQQAEQRHQEASQRQAEMRDAAKVEMARVAAPMQ 90 Query: 49 TIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK 108 + + + + L Q + ++ I +I ++ + Sbjct: 91 MSSNGAVMGMSIAPMPRDYAVIPLAQNKFEQQVQNGIMVAGEIPVS-----TFSIDVD-T 144 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH 168 Y L L + + + V+ + Sbjct: 145 GSYATLRRMLREGRLPEKGIVRVEEMLNYFAYDYPLPAKNAAPFS--VTTELAPSPYNDD 202 Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 + Y L PK + N + K+ +L + L + Sbjct: 203 MMLLRIGLKGYDL---PKSQLGASNLVFLLDVSGSMASADKLPLLQTALKLLTAQLSAQD 259 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 ++ + Y N+ + L +L+ +TN + AY+ Sbjct: 260 --------KVSIVVYAGAAGVVLDGASGNDTQTLTYALEQLSAGGSTNGGQGITQAYQLA 311 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + VI TDG+ + + + + + E ++ G+ + ++ Sbjct: 312 KKHFIPNGINR--------VILATDGDFNVGVTDFD--DLIALIEKEKDHGIGLTTLGFG 361 Query: 349 APPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L+ + D G + ++ L E+ + D++ IA + Sbjct: 362 LGNYNDQLMEQLADKGNGNYAYIDT---LNEARKVLVDELSSTLFTIAKD 408 >gi|299143633|ref|ZP_07036713.1| von Willebrand factor type A domain protein [Peptoniphilus sp. oral taxon 386 str. F0131] gi|298518118|gb|EFI41857.1| von Willebrand factor type A domain protein [Peptoniphilus sp. oral taxon 386 str. F0131] Length = 1217 Score = 99.6 bits (246), Expect = 8e-19, Method: Composition-based stats. Identities = 39/277 (14%), Positives = 79/277 (28%), Gaps = 8/277 (2%) Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 ++ I +V+D S SM+D K N +L Sbjct: 180 PNIANKWTVKMLVAARDSVKTSKIVLVIDTSGSMKDFGRMKGAKNAANAFVDNVLDGS-- 237 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 +S + + L + L Q + + + Sbjct: 238 QSTQIGIVRFASNVSIVSDFTSNKAKLHSAINALSAEGGTFTQAGVKQARTMLAGSGADK 297 Query: 247 IV----GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 + + ++ + L P +T+ + ES+ + S Sbjct: 298 KYMVVLSDGVPTFNYKISHPNNYLIDGGPGSHTHEKQTGKQLPENEFLYNESATGSGNSM 357 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEY--MRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + I G + I E + A M++++V + G +LR Sbjct: 358 WKQYERVKIGGGIFPTYEYHYYNSGNCAIAESGFAKAANMRVFTVGLQTDATGSGVLRDI 417 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + G F V D +L F++I +I + + N Sbjct: 418 ASAPGDFTEVTDVSQLTPVFEQIAAQILKTVNQATVN 454 >gi|328470527|gb|EGF41438.1| hypothetical protein VP10329_07002 [Vibrio parahaemolyticus 10329] Length = 461 Score = 99.2 bits (245), Expect = 9e-19, Method: Composition-based stats. Identities = 56/467 (11%), Positives = 133/467 (28%), Gaps = 94/467 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + + ++++ + ++++ A + A L+ I P Sbjct: 14 IFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLAL---------IASPGKDNKDD 64 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 ++ + ++ E + + + +NN L A+ ++ Sbjct: 65 QDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVVATAEHDSWIS 124 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY- 179 I S I R + I +LD S+SM + + + N Sbjct: 125 HNEIGVE--PKFKVSGDSITRKYLPQPVDIYFILDTSQSMSNPWYGERNKTQMQVVKDTI 182 Query: 180 ---------LLPPPPKKSFWSKNTTKSKYA-PAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 P KKS + T + A K+ + S + + Sbjct: 183 TRVVKELENFKTGPDKKSRVALLTYNAYNAKFDKGAGRVKLYDYASEFSHTEASFESIVD 242 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK---LNP-YENTNTYPAMHHAY 285 + + SV Y +Q PL++ E LN + T ++ + A Sbjct: 243 KMFDESVVEQKPHYASDYNKSQDIPLTDKYQEFIDILNSNKVMPARGGGTQSWLGLIAA- 301 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGEN---------SGASAYQNTLNTLQ------ 330 KE+ ++ I ++DG + + +Y++ + + Sbjct: 302 -----AKEADKVKKEDRNPEQVFIILSDGADTDVQFPMGLNRNRSYRDKYDVVTKYYVDQ 356 Query: 331 -----------------------ICEYMRNA--------------------GMKIYSVAV 347 +CE ++N + + + V Sbjct: 357 YDGRTYYYQVYDKFLKSLVGEHGLCESLKNRISSKENKFQSEHVKLEGEKTKVTMGVIGV 416 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + + D +C + + + + I + I E++ R+ Sbjct: 417 NYNVQKDDGFGECV-GEKNIYHAKNG---KDVYKYILNLINEETGRL 459 >gi|160874259|ref|YP_001553575.1| von Willebrand factor type A [Shewanella baltica OS195] gi|160859781|gb|ABX48315.1| von Willebrand factor type A [Shewanella baltica OS195] Length = 642 Score = 99.2 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 43/377 (11%), Positives = 105/377 (27%), Gaps = 36/377 (9%) Query: 22 IMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIREN 81 + +M+ DAA + + + + + + L Q + ++ Sbjct: 82 ASQRQAEMR---DAAKVEMARVAAPMQMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQV 138 Query: 82 AGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIER 141 I +I ++ + Y L L + Sbjct: 139 QNGIMVAGEIPVS-----TFSIDVD-TGSYATLRRMLREGHLPEKGTVRVEEMLNYFAYD 192 Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 + S + E ++D + P + S + Sbjct: 193 Y----PLPAKNAAPFSVTTELAPSPYNDDMMLLRIGLKGYDLPKSQLGASNLVFLLDVSG 248 Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + A K+ +L + L + ++ + Y N+ Sbjct: 249 SMASV-DKLPLLQTALKLLTAQLSAQD--------KVSIVVYAGAAGVVLDGASGNDTQT 299 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L +L+ +TN + AY+ + VI TDG+ + Sbjct: 300 LNYALEQLSAGGSTNGGQGITQAYQLAKKHFIPNGINR--------VILATDGDFNVGVT 351 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESF 380 + + + + E ++ G+ + ++ L+ + D G + ++ L E+ Sbjct: 352 DFD--DLIALIEKEKDHGIGLTTLGFGLGNYNDQLMEQLADKGNGNYAYIDT---LNEAR 406 Query: 381 DKITDKIQEQSVRIAPN 397 + D++ IA + Sbjct: 407 KVLVDELSSTLFTIAKD 423 >gi|238782874|ref|ZP_04626903.1| tight adherance operon protein [Yersinia bercovieri ATCC 43970] gi|238716297|gb|EEQ08280.1| tight adherance operon protein [Yersinia bercovieri ATCC 43970] Length = 530 Score = 99.2 bits (245), Expect = 1e-18, Method: Composition-based stats. Identities = 47/480 (9%), Positives = 118/480 (24%), Gaps = 90/480 (18%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 II+ + + +++ ++ + ++ A++ A L+ D S Sbjct: 45 FIIILPFFIALLFLSFEISQLLQKKAKLSDAIEQATLALTVENDDLPDELQMRKNVDLVS 104 Query: 62 TIFKKQI---------------------KKHLKQGSYIRENAGDIAQKAQINITKDKNNP 100 + + + + A I + N Sbjct: 105 NFSSAYLPLEHFSVPEIDIKNNCGQLTYNAKITMSYFANFLSKTAMTNAITTIGTEDNGA 164 Query: 101 LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM 160 + + + + + I + +L S Sbjct: 165 AIKQVSTIQDKATDVIFVADYSGSMNEGFHGKVPRGEKINALRDVFNRLNGSILKNSNIN 224 Query: 161 EDLYLQKHNDNNNMTSNK-----YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI----D 211 ++ + Y P K + + N +Y + + D Sbjct: 225 LIGFVPFSWGTKRIVIENSQEKKYCHFPFVPKQYRADNNYFRQYTVSGLKKFPGLEGLTD 284 Query: 212 VLIESAGNLV----NSIQKAIQEKKNLSVRIGTIAY-------NIGIVGNQCTPLSNNLN 260 + + G L N++ I+ R + + + + + + Sbjct: 285 IDKINYGELTLGEYNTLTNVIKNMAKQEYRNKALEFLRITLNIPTYMQQMIFITTTIDYD 344 Query: 261 EVKSRLNK------------------------------------------LNPYENTNTY 278 +N ++P T Sbjct: 345 ATIKSINSDAQYIDIPLDDIINESICLNNSNAYSLDSHNSHDDLIDKMIAMSPLGQTLVS 404 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 + +A E +S+N + +FI D + TL +CE ++ Sbjct: 405 SGILYANTLFKKESNNSNNKLMVIISDGIDVFINDTTIQQSIYISKTLIDKGMCERIKEN 464 Query: 339 GMKIYSVAVSAPPEGQDL------LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 +K+ +A+ + +KC ++ V+D+ EL + + + V Sbjct: 465 NIKMVFIAIKDGSNETNEPANYIDWKKCV-GEDNYYYVSDAHELEAALRQSLTTTSSEVV 523 >gi|296120496|ref|YP_003628274.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] gi|296012836|gb|ADG66075.1| von Willebrand factor type A [Planctomyces limnophilus DSM 3776] Length = 396 Score = 98.8 bits (244), Expect = 1e-18, Method: Composition-based stats. Identities = 54/397 (13%), Positives = 119/397 (29%), Gaps = 41/397 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A ++ + + L+++ R ++++A DAA S +V ++ Sbjct: 25 LAAFVMVALLALAGFFLSLSYVELTRAELRAATDAAARSAVIRLVETQSTTSGRAAARDI 84 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQ--KAQINITKDKNNPLQYIAESKAQYEIPTENL 118 ++ F+ K + I+ I N + L Sbjct: 85 ASRFEVGGKALSLNDNDIQFGRSTRQSNGSYSFAINGTPTNAARVFGRKTKTSAAGPVEL 144 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 G + + + L + + I +VLD S SM Sbjct: 145 PFGGFVGAPEYSTELNAVAMRLDY------DIVIVLDRSGSMGWDLSG------------ 186 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK---AIQEKKNLS 235 + Y P P + +L S + + + + A + Sbjct: 187 ---VEFEYPEAVRQRPLVENYFSPPDPTGSRWAILSASVNDFLTILNQRQVAARVGLVTY 243 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVK---SRLNKLNPYENTNTYPAMHHAYRELYNEK 292 T + + L++ + + + + ++ T+ + A L Sbjct: 244 AGDYTFGKYSSVKLTVESDLTSTFSTITSKLTAIGQVPLIGGTDIGAGITAAQTMLTTSS 303 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 ++ T + +I +DG + + + + I+SV A + Sbjct: 304 QARLKTG-----QPIIIVFSDGMFNQGTEPVSL-----AASAYSQSSTIIHSVTFGATAQ 353 Query: 353 GQDLLRKCTDSSGQ--FFAVNDSRELLESFDKITDKI 387 G+ + T ++G+ N + EL ESF I + I Sbjct: 354 GRATMNSVTATAGKGLSLHANTAAELAESFRSIANAI 390 >gi|217974408|ref|YP_002359159.1| von Willebrand factor type A [Shewanella baltica OS223] gi|217499543|gb|ACK47736.1| von Willebrand factor type A [Shewanella baltica OS223] Length = 627 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 47/410 (11%), Positives = 110/410 (26%), Gaps = 45/410 (10%) Query: 1 MTAIIISVCFLFI----TYAIDLAHI-MYIRNQMQSAL-------DAAVLSGCASIVSDR 48 M A+++ L + + Q A DAA + + Sbjct: 31 MAALLLVAVSLTACGGKGAEVQHRQAKQQAEQRHQEASQRQAEMRDAAKVEMARVAAPMQ 90 Query: 49 TIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK 108 + + + + L Q + ++ I +I ++ + Sbjct: 91 MSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNGIMVAGEIPVS-----TFSIDVD-T 144 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH 168 Y L L + + + V+ + Sbjct: 145 GSYATLRRMLREGRLPEKGIVRVEEMLNYFAYDYPLPAKNAAPFS--VTTELAPSPYNDD 202 Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 + Y L PK + N + K+ +L + L + Sbjct: 203 MMLLRIGLKGYDL---PKSQLGASNLVFLLDVSGSMASTDKLPLLQTALKLLTAQLSAQD 259 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 ++ + Y N+ + L +L+ +TN + AY+ Sbjct: 260 --------KVSIVVYAGAAGVVLDGASGNDTQTLTYALEQLSAGGSTNGGQGITQAYQLA 311 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + VI TDG+ + + + + + E ++ G+ + ++ Sbjct: 312 KKHFIPNGINR--------VILATDGDFNVGVTDFD--DLIALIEKEKDHGIGLTTLGFG 361 Query: 349 APPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L+ + D G + ++ L E+ + D++ IA + Sbjct: 362 LGNYNDQLMEQLADKGNGNYAYIDT---LNEARKVLVDELSSTLFTIAKD 408 >gi|153836414|ref|ZP_01989081.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ3810] gi|260365465|ref|ZP_05778002.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus K5030] gi|260877530|ref|ZP_05889885.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AN-5034] gi|260897529|ref|ZP_05906025.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus Peru-466] gi|260901731|ref|ZP_05910126.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ4037] gi|149750316|gb|EDM61061.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ3810] gi|308087122|gb|EFO36817.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus Peru-466] gi|308090607|gb|EFO40302.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AN-5034] gi|308108829|gb|EFO46369.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus AQ4037] gi|308114384|gb|EFO51924.1| Flp pilus assembly protein TadG [Vibrio parahaemolyticus K5030] Length = 461 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 55/467 (11%), Positives = 132/467 (28%), Gaps = 94/467 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + + ++++ + ++++ A + A L+ I P Sbjct: 14 IFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLAL---------IASPGKDNKDD 64 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 ++ + ++ E + + + +NN L A+ ++ Sbjct: 65 QDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVVATAEHDSWIS 124 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY- 179 I S I R + I +LD S+SM + + + N Sbjct: 125 HNEIGVE--PKFKVSGDSITRKYLPQPVDIYFILDTSQSMSNPWYGERNKTQMQVVKDTI 182 Query: 180 ---------LLPPPPKKSFWSKNTTKSKYA-PAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 P KKS + T + A K+ + S + + Sbjct: 183 TRVVKELENFKTGPDKKSRVALLTYNAYNAKFDKGAGRVKLYDYASEFSHTEASFESIVD 242 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK---LNP-YENTNTYPAMHHAY 285 + + SV Y +Q PL++ E LN + T ++ + A Sbjct: 243 KMFDKSVVEQKPHYASDYNKSQDIPLTDKYQEFIDILNSNKVMPARGGGTQSWLGLIAA- 301 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGEN---------SGASAYQNTLNTLQ------ 330 KE+ ++ I ++DG + + +Y++ + + Sbjct: 302 -----AKEADKVKKEDRNPEQVFIILSDGADTDVQFPMGLNRNRSYRDKYDVVTKYYVDQ 356 Query: 331 -----------------------ICEYMRNA--------------------GMKIYSVAV 347 +CE ++ + + + V Sbjct: 357 YDGRTYYYQVYDKFLKSLVGEHGLCESLKKRISSKENKFQSEHAKLEGEKTKVTMGVIGV 416 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + + D +C + + + + I + I E++ R+ Sbjct: 417 NYNVQKDDGFGECV-GEKNIYHAKNG---KDVYKYILNLINEETGRL 459 >gi|28900585|ref|NP_800240.1| hypothetical protein VPA0730 [Vibrio parahaemolyticus RIMD 2210633] gi|28808965|dbj|BAC62073.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] Length = 466 Score = 98.4 bits (243), Expect = 2e-18, Method: Composition-based stats. Identities = 55/467 (11%), Positives = 132/467 (28%), Gaps = 94/467 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + + ++++ + ++++ A + A L+ I P Sbjct: 19 IFVSFLPILIITFSFSVGYTQRLLAHSKIEEAAEVASLAL---------IASPGKDNKDD 69 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 ++ + ++ E + + + +NN L A+ ++ Sbjct: 70 QDYAQRIVDLYITDNISDIEISVSTKKCEYKDGCVQRNNELSPFADFTVVATAEHDSWIS 129 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY- 179 I S I R + I +LD S+SM + + + N Sbjct: 130 HNEIGVE--PKFKVSGDSITRKYLPQPVDIYFILDTSQSMSNPWYGERNKTQMQVVKDTI 187 Query: 180 ---------LLPPPPKKSFWSKNTTKSKYA-PAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 P KKS + T + A K+ + S + + Sbjct: 188 TRVVKELENFKTGPDKKSRVALLTYNAYNAKFDKGAGRVKLYDYASEFSHTEASFESIVD 247 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK---LNP-YENTNTYPAMHHAY 285 + + SV Y +Q PL++ E LN + T ++ + A Sbjct: 248 KMFDKSVVEQKPHYASDYNKSQDIPLTDKYQEFIDILNSNKVMPARGGGTQSWLGLIAA- 306 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGEN---------SGASAYQNTLNTLQ------ 330 KE+ ++ I ++DG + + +Y++ + + Sbjct: 307 -----AKEADKVKKEDRNPEQVFIILSDGADTDVQFPMGLNRNRSYRDKYDVVTKYYVDQ 361 Query: 331 -----------------------ICEYMRNA--------------------GMKIYSVAV 347 +CE ++ + + + V Sbjct: 362 YDGRTYYYQVYDKFLKSLVGEHGLCESLKKRISSKENKFQSEHAKLEGEKTKVTMGVIGV 421 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + + D +C + + + + I + I E++ R+ Sbjct: 422 NYNVQKDDGFGECV-GEKNIYHAKNG---KDVYKYILNLINEETGRL 464 >gi|148261962|ref|YP_001236089.1| hypothetical protein Acry_2980 [Acidiphilium cryptum JF-5] gi|326405471|ref|YP_004285553.1| hypothetical protein ACMV_33240 [Acidiphilium multivorum AIU301] gi|146403643|gb|ABQ32170.1| hypothetical protein Acry_2980 [Acidiphilium cryptum JF-5] gi|325052333|dbj|BAJ82671.1| hypothetical protein ACMV_33240 [Acidiphilium multivorum AIU301] Length = 431 Score = 98.1 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 52/429 (12%), Positives = 119/429 (27%), Gaps = 63/429 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +TA++ + ID + ++QM+S Sbjct: 22 ITALVSLTLIFILGMGIDYGLAIDRKSQMES----------------YADAAALAAVTPA 65 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + Q + + N + QY+ ++ + L Sbjct: 66 MVAAGQSSAITTAQNVFNAQALTMTGVTYNANDVTVSIATSGDKRTATVQYQAQSQAM-L 124 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS---- 176 ++ + ++T + I ++LD S SM Q + + Sbjct: 125 PDVMGFGSIKIGGQATATTTIA---PNIDFYLLLDDSPSMAIAATQSGINTMVANTTAQG 181 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPA---PAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 S + A +ID+L ++ +L+ + Q +K Sbjct: 182 GCAFGCHEENPSADKLGNPYGEDNYALARSLGVTLRIDMLRQATQDLMTTAQTTETQKGT 241 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP------------YENTNTYPAM 281 Y I N L+++L++ ++ + N + Sbjct: 242 TY---RMAIYTFDIGLNTIGNLTSDLSQAQTEAGNIQLLEVYSNNWLTQNDYNDDEDTNY 298 Query: 282 HHAYRELYN--EKESSHNTIGSTRLKKFVIFITDG--ENSGASAYQNTLNTLQICEYMRN 337 A + + ++ + F+TDG + Q +L +C ++N Sbjct: 299 DTALNGINAIMPNPGNGTGAAGDTPQEVLFFVTDGVEDEDVNGNRQQSLLNTDLCTAIKN 358 Query: 338 AGMKI---YSVAVSAPPEG-------------QDLLRKCTDSSGQFFAVNDSRELLESFD 381 G++I Y+ + P L++C G +F V ++ + Sbjct: 359 RGIRIAVLYTEYLPLPTNSWYNTYIAPFQNSIAPTLQQCAS-PGLYFEVKSGGDISAAMS 417 Query: 382 KITDKIQEQ 390 + + Sbjct: 418 ALFQTAVQS 426 >gi|75675889|ref|YP_318310.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] gi|74420759|gb|ABA04958.1| hypothetical protein Nwi_1697 [Nitrobacter winogradskyi Nb-255] Length = 605 Score = 98.1 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 36/256 (14%), Positives = 96/256 (37%), Gaps = 20/256 (7%) Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 V+ + S N + + K +W + T + A APA+ Sbjct: 352 VVVSTGSGASCPSTTPNCSCTGSGRNRKCTQAKYKHYWRAHPTDTNQAKDAAPAHS---T 408 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN----IGIVGNQCTPLSNNLNEVKSRLNK 268 + + + + + S + + G + TP+S+ + +K++++ Sbjct: 409 WTGCINDRDQAYDISNADPSSGSSGTPSTKFYAEQLNGCLPATITPVSSQSSTLKNQIDS 468 Query: 269 LNPYENTNTYPAMHHAYRELYNEKE--SSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 ++P +TN + ++ L + + + +++ ++DG N+ N Sbjct: 469 MSPSGSTNQAIGLAWGWQTLSTTNGPFPAPAKDKAYVYQDYLVLLSDGLNTRNRWSGNGS 528 Query: 327 NTL--------QICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSREL 376 + +C+ ++++G I++V V+ +L+ C +G F + + + Sbjct: 529 DHSPEVDVRQALLCQKVKDSGTVIFTVQVNVGNRDPLSQVLQDCAS-NGNFQMITSANQT 587 Query: 377 LESFDKITDKIQEQSV 392 ++F I +I + + Sbjct: 588 ADAFQNILTQISQLRI 603 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 28/209 (13%), Positives = 51/209 (24%), Gaps = 21/209 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI + F+ A+D R+ MQ+A+D+AVL ++ + D Sbjct: 28 IFAIALLPVLGFVGAAVDYTRANAARSSMQAAMDSAVLMVSRDAAANPAMTSQQIT-DAV 86 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 F + A I ++ Sbjct: 87 QRYFNSLYNDKSAFNVSVSAAYTPSTSSAAAKILASGQGAIE---------------TDF 131 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + + ST + N + + +VLD + SM D L Sbjct: 132 MKIAGFPQLSFGTSSTS----TWGNSRMRVALVLDNTGSMRDNGKMAALQRAAKDMIDSL 187 Query: 181 -LPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + +K Sbjct: 188 SAFAKTADDVYISIIPFAKDVNVDKSNYN 216 >gi|320158392|ref|YP_004190770.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] gi|319933704|gb|ADV88567.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus MO6-24/O] Length = 442 Score = 98.1 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 44/435 (10%), Positives = 126/435 (28%), Gaps = 64/435 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + ++ V + + +++ + ++ A + A L+ I P ++ Sbjct: 29 IFMGMLPVLVIIMVFSMQMTQRHMAHAKITEAAEVASLAL---------IASPKEGDEKN 79 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +K + ++ + + ++ L + + ++ Sbjct: 80 QEYAQKIVDHYIPDNKGEVVARVFNRRCEYKDGCVQRSGELAPFTDFVVSAKTKHDSWIS 139 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 LT R + I ++D+S SM + + ++ ++ Sbjct: 140 YNDGEMGLTKDFEVMGTSTSRKFLPQPLDIYFIIDMSGSMVNPWGGSGKTKYDVVADTIN 199 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + F + ++ A +K+ + +S + N+ Sbjct: 200 RIVDDLREFKTDRKSRVAVIGFHHTAVKKVGRQRTAFDY--SSYRTPSATVNNMFTAPKI 257 Query: 241 IAYNIGIV--GNQCTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKES 294 + N + PL+ + + ++ N N Y T ++ + A + + Sbjct: 258 HSRNDSSNIKTFEDIPLTEDYDAFLTKFNSSNYYASRYGLTESWQGIIGAAQMAEQATD- 316 Query: 295 SHNTIGSTRLKKFVIFITDGENSG---------------ASAYQNTLNTLQICEYMRNA- 338 ++ I ++DG + + Y N L +CE ++ Sbjct: 317 -------LNPEQVFILLSDGRDGDFVRYYLEGRQWREVRYNKYLNRLVKAGLCEKLKTRI 369 Query: 339 -------------------GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + + V+ + D + C + + + Sbjct: 370 SQKRNVFQSENPSDKASKTKVTMGVIGVNYVVDKSDGIGDC-FGHDNIYHAKEG---NDV 425 Query: 380 FDKITDKIQEQSVRI 394 + I + I E++ R+ Sbjct: 426 YKYILNLINEETGRL 440 >gi|255066322|ref|ZP_05318177.1| von Willebrand factor type A domain protein [Neisseria sicca ATCC 29256] gi|255049532|gb|EET44996.1| von Willebrand factor type A domain protein [Neisseria sicca ATCC 29256] Length = 538 Score = 98.1 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 36/363 (9%), Positives = 99/363 (27%), Gaps = 29/363 (7%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A L+ C+ + + + + + + + + E D + ++ + Sbjct: 23 AALAACSGPLEHSSSSPEGLQSPPNAALSTAAVAEENLPLAENTERYQDQPDQPVKSVAQ 82 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + + + Y L P + V Sbjct: 83 EPVSTFSIDVD-TGSYANVRRFLTNGEQPPKDAVRIEEIVNYFPYNYPLPTDNRPFAV-- 139 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + + KK N K+ ++ + Sbjct: 140 ----HTETIDSPWQPEAKLIKIGIQAQDTAKKDLPPANLVFLVDVSGSMDEENKLPLVQK 195 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + L ++ ++ I Y G + + S ++KL T Sbjct: 196 TLRILTQQLRPQD--------KVTLITYASGEDLVLPPTSGADKETILSAIDKLRAGGAT 247 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A+ AY + + ++ TDG+ + + TL + Sbjct: 248 DGESALQMAYEQAQKAFVPNGINR--------ILLATDGDFNVGVSDTETL--KSMVAEK 297 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRI 394 R +G+ + ++ +D++ + D+ G + +++ +E + ++ + Sbjct: 298 RKSGVSLSTLGFGMGNYNEDMMEQIADAGDGNYSYIDNEKEAKKVLQ---QQLTSTLATV 354 Query: 395 APN 397 A + Sbjct: 355 AQD 357 >gi|152999639|ref|YP_001365320.1| von Willebrand factor type A [Shewanella baltica OS185] gi|151364257|gb|ABS07257.1| von Willebrand factor type A [Shewanella baltica OS185] Length = 642 Score = 98.1 bits (242), Expect = 2e-18, Method: Composition-based stats. Identities = 44/410 (10%), Positives = 107/410 (26%), Gaps = 45/410 (10%) Query: 1 MTAIIISVCFLFI----TYAIDLAHIMY-IRNQMQSAL-------DAAVLSGCASIVSDR 48 + A+++ L ++ + Q A DAA + + Sbjct: 46 IAALLLVAVSLTACGGKGAEVEHRQAEQQAEQRHQEASQRQAEMRDAAKVEMARVAAPMQ 105 Query: 49 TIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK 108 + + + + L Q + ++ I +I ++ + Sbjct: 106 MSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQVQNGIMVAGEIPVS-----TFSIDVD-T 159 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH 168 Y L L + + S + E + Sbjct: 160 GSYATLRRMLREGRLPEKGTVRVEEMLNYFAYDY----PLPAKNAAPFSVTTELAPSPYN 215 Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 +D + + S + K+ +L + L + Sbjct: 216 DDMMLLRIGLKGYDLLKSQLGASNLVFL-LDVSGSMASTDKLPLLQTALKLLTAQLSAQD 274 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 ++ + Y N+ + L +L+ +TN + AY+ Sbjct: 275 --------KVSIVVYAGAAGVVLDGASGNDTQTLTYALEQLSAGGSTNGGQGITQAYQLA 326 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + VI TDG+ + + + + E + G+ + ++ Sbjct: 327 KKHFIPNGINR--------VILATDGDFNVGVTDFD--DLTALIEKEKAHGIGLTTLGFG 376 Query: 349 APPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L+ + D G + ++ L E+ + D++ IA + Sbjct: 377 LGNYNDQLMEQLADKGNGNYAYIDT---LNEARKVLVDELSSTLFTIAKD 423 >gi|170681089|ref|YP_001744470.1| von Willebrand factor type A domain-containing protein [Escherichia coli SMS-3-5] gi|218700745|ref|YP_002408374.1| hypothetical protein ECIAI39_2418 [Escherichia coli IAI39] gi|170518807|gb|ACB16985.1| von Willebrand factor type A domain protein [Escherichia coli SMS-3-5] gi|218370731|emb|CAR18544.1| conserved hypothetical protein [Escherichia coli IAI39] Length = 588 Score = 98.1 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 40/381 (10%), Positives = 111/381 (29%), Gaps = 45/381 (11%) Query: 20 AHIMYIRNQMQSALDAAVL--SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY 77 A + +Q L AA + + T + Q KQ+ ++ Sbjct: 75 AQQYSDKQALQGRLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPVKQVAQNPLATFS 134 Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 + + G A + P E Y P++ + S Sbjct: 135 LDVDTGSYANVRRFLNHGQLPPPDAVRVEEMVNY-FPSDWVINDKSNNKEPVPASKPIPF 193 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 + + L + + +N+ Sbjct: 194 AMRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI------------------- 234 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 ++ ++ ++ S LV +++ I + Y + + Sbjct: 235 -DTSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGS 285 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 286 HKAEINAAIDSLDADGSTNGGAGLELAYQQAAKGFIKGGINR--------ILLATDGDFN 337 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSREL 376 + + + + R +G+ + + V + ++ + D +G + ++ L Sbjct: 338 VG--IDDPKSIESMVKKQRESGVSLSTFGVGDSNYNEAMMVRIADVGNGNYSYIDT---L 392 Query: 377 LESFDKITDKIQEQSVRIAPN 397 E+ + ++++ + +A + Sbjct: 393 AEAQKVLNSEMRQTLISVAKD 413 >gi|163759224|ref|ZP_02166310.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] gi|162283628|gb|EDQ33913.1| hypothetical protein HPDFL43_05650 [Hoeflea phototrophica DFL-43] Length = 541 Score = 98.1 bits (242), Expect = 3e-18, Method: Composition-based stats. Identities = 35/202 (17%), Positives = 70/202 (34%), Gaps = 34/202 (16%) Query: 226 KAIQEKKNLSV-RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 K N + G PL+ + +++++ + L +TN + Sbjct: 339 KVDNSPSNFYSNKKDPKGPGFGCEMEPLVPLTTDFSKIRTTVKALEANGSTNMLEGVMWG 398 Query: 285 YRELYNEKESSHNTIGST-RLKKFVIFITDGENSGASAYQN------------------- 324 +R L + + + S ++K +IF+TDG+NS + + Sbjct: 399 WRVLSDREPFAQGAPKSDASVEKIMIFLTDGQNSFGNLNNDLGSAYTSMGYLVDGRLDGM 458 Query: 325 ------------TLNTLQICEYMRNAGMKIYSVAVSAPPEGQ-DLLRKCTDSSGQFFAVN 371 T CE + G+ IY++ + G +L +C SS +F Sbjct: 459 TAANIGQTNNALDKKTKAACENAKEDGVTIYTIRLEEADVGTGKMLEECATSSAHYFDAP 518 Query: 372 DSRELLESFDKITDKIQEQSVR 393 ++L FD I + + + Sbjct: 519 SRQQLTPIFDAIKKGVVKLRLT 540 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 37/339 (10%), Positives = 94/339 (27%), Gaps = 35/339 (10%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 I+ + A+D + ++++Q+A+D+A L + T K T+ Sbjct: 19 FGILAVPVMVAGGLAVDYVGLSVEKSKLQNAVDSAALLIARAGDMSETQAMKLAKTTITT 78 Query: 62 TIFKKQIK---------KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYE 112 K +K G + ++ + Y A Sbjct: 79 NYGINVAKVAVSMVDGDATVKASMDQALVFGGFMGRKNAAVSAEATATYAYTKYEIALVL 138 Query: 113 IPTENLFLKGLIPSALTNL----SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ-- 166 T ++ L + + + G+ + + + ++V Sbjct: 139 DTTGSMLGGKLTSLQNAVIGLVDGMEALGLNKEQLKFAVVPYAGFVNVGPEYGPTINGAG 198 Query: 167 -------KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP---APANRKIDVLIES 216 D + P F N K ++ PA++ + + ++ Sbjct: 199 KVKKPAAAWIDQDAKAPIPQSDLPSDFSRFAMFNHLKVEWPGCVETRVPADKILHDVKDT 258 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE--- 273 ++ + + + Y + + P+ N + ++L Y Sbjct: 259 VPDITDP-KSLFTPFFAIDEPDNKWGYPNSYLPDGGKPVKGNKATEAEKQDQLARYGKTG 317 Query: 274 ------NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 NT+ A+ ++++ + S+ K Sbjct: 318 EYKKPKNTDDAIALTGKWKKVKVDNSPSNFYSNKKDPKG 356 >gi|284922261|emb|CBG35346.1| putative lipoprotein [Escherichia coli 042] Length = 588 Score = 97.7 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 40/381 (10%), Positives = 111/381 (29%), Gaps = 45/381 (11%) Query: 20 AHIMYIRNQMQSALDAAVL--SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY 77 A + +Q L AA + + T + Q KQ+ ++ Sbjct: 75 AQQYSDKQALQGRLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPVKQVAQNPLATFS 134 Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 + + G A + P E Y P++ + S Sbjct: 135 LDVDTGSYANVRRFLNHGQLPPPDAVRVEEMVNY-FPSDWVINDKSNNKEPVPASKPIPF 193 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 + + L + + +N+ Sbjct: 194 AMRCELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI------------------- 234 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 ++ ++ ++ S LV +++ I + Y + + Sbjct: 235 -DTSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGS 285 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 286 HKAEINAAIDSLDAEGSTNGGAGLELAYQQAAKGFIKGGINR--------ILLATDGDFN 337 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSREL 376 + + + + R +G+ + + V + ++ + D +G + ++ L Sbjct: 338 VG--IDDPKSIESMVKKQRESGVSLSTFGVGDSNYNEAMMVRIADVGNGNYSYIDT---L 392 Query: 377 LESFDKITDKIQEQSVRIAPN 397 E+ + ++++ + +A + Sbjct: 393 AEAQKVLNSEMRQTLISVAKD 413 >gi|319638170|ref|ZP_07992933.1| von Willebrand factor type A domain-containing protein [Neisseria mucosa C102] gi|317400443|gb|EFV81101.1| von Willebrand factor type A domain-containing protein [Neisseria mucosa C102] Length = 530 Score = 97.7 bits (241), Expect = 3e-18, Method: Composition-based stats. Identities = 37/363 (10%), Positives = 98/363 (26%), Gaps = 29/363 (7%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A L+ C+ + S + + + + E D + ++ + Sbjct: 15 AALTACSGPLDRSDSSTENLHGAPDSALPATAVAEENLSLTENTERYQDQPDQPVKSVAQ 74 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + + + Y L P + V Sbjct: 75 EPVSTFSIDVD-TGSYANVRRFLNSGKQPPKDAVRIEEIINYFPYNYPLPTDGRPFAV-- 131 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + + KK N K+ ++ + Sbjct: 132 ----HTETIDSPWQPEAKLIKIGIQAQDTAKKDLPPANLVFLVDVSGSMDEENKLPLVQK 187 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + L ++ ++ I Y+ G ++ + ++KL +T Sbjct: 188 TLRILTQQLRPQD--------KVTLITYSSGEELVLPPTSGSDKETILKAIDKLKAEGST 239 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A+ AY E + ++ TDG+ + + TL + Sbjct: 240 SGESALRMAYEEAQKAFVPNGINR--------ILLATDGDFNVGVSDTETL--KSMVAEK 289 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRI 394 R G+ + ++ +D++ + D+ G + +++ +E + ++ + Sbjct: 290 RKTGISLSTLGFGTDNYNEDMMEQIADAGDGNYSYIDNEKEAKKVLQ---QQLTSTLATV 346 Query: 395 APN 397 A + Sbjct: 347 AQD 349 >gi|146337717|ref|YP_001202765.1| hypothetical protein BRADO0586 [Bradyrhizobium sp. ORS278] gi|146190523|emb|CAL74522.1| hypothetical protein BRADO0586 [Bradyrhizobium sp. ORS278] Length = 418 Score = 96.9 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 61/425 (14%), Positives = 120/425 (28%), Gaps = 61/425 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI F+ ID + +R ++Q A+D AVL+G A+ D Sbjct: 22 LFAIACVPVLAFVGAGIDYSMANKLRTKLQMAIDEAVLAGVAA---------GKAALDSG 72 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +T SY N I IN T L + + N Sbjct: 73 ATQAAAIAMAQAASSSYFTGNTAKIDATPTINFT-TMGRTLSGTGSATS-----VMNTSF 126 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L+ L+ S +++ +++D+S SM Q + L Sbjct: 127 MRLVGFPTMTLNASSASSAT---MQPYLNVYLLVDISSSMLLPATQAGITQMRNGTGCAL 183 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 S + + V+ + NL+ + + K V++G Sbjct: 184 ACHETTNGTDSYSYALKNNV------LLRYQVVNQGVQNLLTYLNSSAVYKN--YVKVGL 235 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 +++ + S + L + + ++ + Sbjct: 236 WSFDNQLTQLSSLTSSFSSVAANFPAPGLAYNDA-AAATPFDSLIGSFVSSVGTAGDGST 294 Query: 301 STRLKKFVIFITDGENSGASAYQNTLN--------TLQICEYMRNAGMKIYSV------- 345 S +K VI TDG N A+ + + C ++ G+ + + Sbjct: 295 SATPQKLVIIATDGVNDPTRAWTSQTSLRSQVRVFNTAFCNTFKSNGVTVAIINTPYYPM 354 Query: 346 ----AVSAP-------------PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 +A + L+ C F +D + +F + +K Sbjct: 355 TWDWGYNATLGQPGSLGGATRVDDIPIALKSCA--GSNFIIASDVATIQNAFTTLFNKAS 412 Query: 389 EQSVR 393 + Sbjct: 413 PVRLT 417 >gi|37676262|ref|NP_936658.1| hypothetical protein VVA0602 [Vibrio vulnificus YJ016] gi|37200803|dbj|BAC96628.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 442 Score = 96.9 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 42/435 (9%), Positives = 126/435 (28%), Gaps = 64/435 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + ++ + + + +++ + ++ A + A L+ I P ++ Sbjct: 29 IFMGMLPILVIIMVFSMQMTQRHMAHAKITEAAEVASLAL---------IASPKEGDEKN 79 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +K + ++ + + ++ L + + ++ Sbjct: 80 QEYAQKIVDHYIPDNKGEVVARVFHRRCEYKDGCVQRSGELAPFTDFVVSAKTKHDSWIS 139 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 LT R + I ++D+S SM + + ++ ++ Sbjct: 140 YNEGEMGLTKDFEVMGTSTSRKFLPQPLDIYFIIDMSGSMVNPWGGSGKTKYDVVADTIN 199 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + F + ++ A +++ + +S + N+ Sbjct: 200 RIVDDLREFKTDRKSRVAVIGFHHTAVKQVGRQRTAFDY--SSYRTPSATVNNMFTAPKI 257 Query: 241 IAYNIGIV--GNQCTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKES 294 + N + PL+ + + ++ N N Y T ++ + A + + Sbjct: 258 HSRNDSGNIKTFEDIPLTEDYDAFLTKFNSSNYYASRYGLTESWQGIIGAAQMAEQATD- 316 Query: 295 SHNTIGSTRLKKFVIFITDGENSG---------------ASAYQNTLNTLQICEYMRNA- 338 ++ I ++DG + + Y N L +CE ++ Sbjct: 317 -------LNPEQVFILLSDGRDGDFVRYYLEGRQWREVRYNKYLNRLVKAGLCEKLKTRI 369 Query: 339 -------------------GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + + V+ + D + C + + + Sbjct: 370 SQKRNVFQSENPSDKASKTKVTMGVIGVNYVVDKSDGIGDC-FGHDNIYHAKEG---NDV 425 Query: 380 FDKITDKIQEQSVRI 394 + I + I E++ R+ Sbjct: 426 YKYILNLINEETGRL 440 >gi|254512360|ref|ZP_05124427.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] gi|221536071|gb|EEE39059.1| conserved hypothetical protein [Rhodobacteraceae bacterium KLH11] Length = 668 Score = 96.9 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 53/415 (12%), Positives = 101/415 (24%), Gaps = 100/415 (24%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGC------------------- 41 +T +I + F +A+DL R ++Q ALD AVL+ Sbjct: 37 LTLFLIMIVFTVAGFAVDLMRYDRERVRLQYALDRAVLAAADLDQELCPRVVVNDYISKE 96 Query: 42 -----------ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQ 90 + + D T + + +G + Sbjct: 97 GFDPGIIDEIKVDPETCLNTDSSDSDGDGTDSSDASGSDSDPSDTASSGTESGSDGTSSG 156 Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 + + K + E + + + + ++ ST + + I Sbjct: 157 GDTAGTSTTTNAVELQGKRKVEASAQLNIETHFMKWSGVD-TINSTAVSAAEESIGNVEI 215 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 +VLDVS SME + K +L S S+ P KI Sbjct: 216 SLVLDVSGSME-GAKLTNLQKAAKDFVKEMLEKSADDSLSISIIPYSEQVGVPDYMMDKI 274 Query: 211 DV-----LIESAGNLVNSIQKAIQE----------------------------------- 230 + + Sbjct: 275 NTTGGNKVANCIEFQPADFTAIPFTAFSIGAPSEATNPPPSVPQSLHFTNRSNDFRRGGN 334 Query: 231 -----KKNLSVRIGTIAYNIGIVGNQCTP------LSNNLNEVKSRLNKLNPYENTNTYP 279 ++ R N + T + N+L+ + ++N L +T+ Sbjct: 335 RDHRSTNDVVSRFSPWDANFPCREDTPTDRREMVVIQNDLDTLNKQINNLVAAGSTSINI 394 Query: 280 AMHHAYRELYNEKES-----------------SHNTIGSTRLKKFVIFITDGENS 317 + L + +T K V+ +TDG+N Sbjct: 395 GLKWGLALLDESIQPLIKTVANDTNVPKIFEDRPRPTNTTDTLKVVVLMTDGKND 449 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 13/74 (17%), Positives = 29/74 (39%), Gaps = 1/74 (1%) Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + +C + I+S+A AP + +L+ C +++ ++ Sbjct: 595 DISKKNEQVVSLCGKAEEKEVLIFSIAFEAPSSVKQMLKDCAVKPARYYEAT-GTQIERV 653 Query: 380 FDKITDKIQEQSVR 393 FD I+ IQ + Sbjct: 654 FDSISTSIQNLRLT 667 >gi|84688081|ref|ZP_01015939.1| hypothetical protein 1099457000215_RB2654_05415 [Maritimibacter alkaliphilus HTCC2654] gi|84663909|gb|EAQ10415.1| hypothetical protein RB2654_05415 [Rhodobacterales bacterium HTCC2654] Length = 595 Score = 96.9 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 42/398 (10%), Positives = 97/398 (24%), Gaps = 85/398 (21%) Query: 1 MTAIIISVCF-LFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQ 59 + I + + L A+D R +Q LD A+L+ + D+ + D Sbjct: 31 IFGIAVFMLMCLAGGIAVDTMRYETHRVHVQGTLDRAILAAASL---DQDLDPEEVVLDY 87 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 + + + EN + + +T E+ +PT L Sbjct: 88 FTKAGLGHVISQ--DDIDVFENQTNGEVADDVAVTT-------RRVEASVSALMPTTFLR 138 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 L + + + S +L + + + S S + + N Sbjct: 139 LAHMYDLG-LYTEGGAEEALSLSEISLVLDVSGSMGNSSSSGYSKIYELRRAAKRFVNVM 197 Query: 180 LLPPPP----------KKSFWSKNTTKSKYAPAPAPANRKI-----DVLIESAGNLVNSI 224 L P + ++ P+ ++ + Sbjct: 198 LCNPADADETEDCTLTEGDISINIVPYAEQVLLPSNLLQRFNHTSEHTESRCITFYEDEF 257 Query: 225 QK---------------------------------------AIQEKKNLSVRIGTIAYNI 245 + + S + + Sbjct: 258 DTVAVPTFSLDTFVTNGRPLPALYGDPIQLTGYFDPSGGTNSTPNPGSNSPCYNDYSGST 317 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH-----------------AYREL 288 + P+ + ++ ++ L NT+ M A E+ Sbjct: 318 NDYWREIYPMGFSAEALRDEIDDLGASGNTSIDLGMKWGAALLDPAAQPAISDLVAANEV 377 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + ++K ++ +TDGEN+ + Sbjct: 378 NEAFDGRPFEYTQRGIEKVIVLMTDGENTSQDYLRRGY 415 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 16/102 (15%), Positives = 38/102 (37%), Gaps = 3/102 (2%) Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + ++ + + + N +IC +NAGM ++++ Sbjct: 495 TFPQLWLQKTANWYDQWNFLA--DAHDYFNYSEKNDNLDEICTAAKNAGMVVFTIGFEVS 552 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D++R C + +F V D ++ +F I +I + + Sbjct: 553 GSQHDIMRSCASAPAYYFDV-DGLDISAAFAAIAREISKLRL 593 >gi|209546922|ref|YP_002278840.1| hypothetical protein Rleg2_4864 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209538166|gb|ACI58100.1| hypothetical protein Rleg2_4864 [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 462 Score = 96.9 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 43/327 (13%), Positives = 99/327 (30%), Gaps = 32/327 (9%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+ + + + + D +R +MQS LDAA+++ I + K + S Sbjct: 39 VALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAVKQINNTADAD---ALKAKVS 95 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 F Q+ + + + + + A T Sbjct: 96 DWFHAQVDNSY-------------------TLGEIDIDTANHNITATAS---GTVPTTFM 133 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + +S+ S + +++ +V+D S SM Sbjct: 134 KIANIDTVPVSVGS---AVKGPATSYLNVYIVIDTSPSMLLAATTSGQSTMYSGIGCQFA 190 Query: 182 PPPPKKSFWSKNTTKSKYAPAPA-PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 K T + Y + A + DV ++ ++++ I + + + +++G Sbjct: 191 CHTGDAHTVGKKTYANNYEYSTAKNIKLRADVAGDAVKDVLSLIDTS--DSNHERIKVGL 248 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN-TYPAMHHAYRELYNEKESSHNTI 299 + + LS + + T+ + L + + + Sbjct: 249 YSLGDTLTEVLAPTLSTDTARTRLSTASYGLTSATSKAATYFDVSLATLKQKVGAGGDGT 308 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTL 326 S K V+ +TDG S +++ Sbjct: 309 TSGTPLKLVLLLTDGVQSKREWVTDSV 335 >gi|307941972|ref|ZP_07657325.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307945282|ref|ZP_07660618.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307771155|gb|EFO30380.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307774878|gb|EFO34086.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 412 Score = 96.9 bits (239), Expect = 5e-18, Method: Composition-based stats. Identities = 57/405 (14%), Positives = 127/405 (31%), Gaps = 61/405 (15%) Query: 15 YAIDLAHIMYIRNQMQSALDAAVL-SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLK 73 ID++ R+Q Q D L + + + T +K + K Sbjct: 37 VGIDMSFAYNKRDQSQLVADEVSLFAVTTFRKYVADGMSKNQARKRAETDARKFLTARTK 96 Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSL 133 E INI + ++ ++E ++ + + + Sbjct: 97 SLDGTTEKFSIK-----INIVDREAKVVKANVNISGKHES-----YMTHAMGFDNIDYTA 146 Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 S S ++DVS SM + P S S+ Sbjct: 147 DSES--TISFGQGKYEFIFLVDVSPSMGIGASNRDRQIMQRAIGCQFACHEPWYSSVSR- 203 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 A A +IDV+ ++ +LV +++A + +R G +++ + T Sbjct: 204 -------AKSAGARLRIDVVKDALKSLVTQLEEA----TEVDLRTGLYSFSN--YLHIQT 250 Query: 254 PLSNNLNEVKSRLNKLN------PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 L+ +++ K NK+ TN + + + +K+ Sbjct: 251 GLNKGISKFKREANKIAIHREYLRGGGTNFHG-------VFSDFNGVLRSLKPKADVKQH 303 Query: 308 VIFITDGEN--------------SGASAYQNTLN-TLQICEYMRNAGM-KIYSVAVSAPP 351 +I I+DG N + + + C+ + + ++++ V Sbjct: 304 IIIISDGVNHLNLRSGTNRHLWNQTPNWRPYNYSFNPRWCDEFKKGEVRTVHTMLVEPDR 363 Query: 352 E-----GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 +R C S+ F++ N + E+ ++F + + + + Sbjct: 364 AHYVRASTSSMRACATSADFFYSANSAAEIDKAFKDLFEALLKSV 408 >gi|54309668|ref|YP_130688.1| hypothetical protein PBPRA2504 [Photobacterium profundum SS9] gi|46914106|emb|CAG20886.1| hypothetical protein PBPRA2504 [Photobacterium profundum SS9] Length = 494 Score = 96.5 bits (238), Expect = 6e-18, Method: Composition-based stats. Identities = 52/474 (10%), Positives = 134/474 (28%), Gaps = 97/474 (20%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 TA+ + F I +A++ + +N++ A +AA L+ + D+T ++ + Sbjct: 15 TALALIPLFGMIFWALEGTRYIQKKNRLADATEAATLAVTTANQDDKTYEN-----QLAT 69 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + I+ ++ + I+ + ++ QY +K + + + Sbjct: 70 NYVQTYIR-NIAIINDIKVERSEGIDYYPTPDGNEEREYFQYRVTAKTDHTSWLSSDIIP 128 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND----------- 170 P+ + + I I V D S SM+ ++ D Sbjct: 129 SFSPTETVANRALAR-NYPIYLGDKDIDIVFVSDFSGSMKGNKIRALKDAIQAIANEILV 187 Query: 171 --------NNNMTSNKYLLPPPPKKSF-------WSKNTTKSKYAPAPAP---ANRKIDV 212 N + Y + K+S + + + Sbjct: 188 PRDGEVEVTNRIAFVPYNMRVQEKRSNTRWCLTQLDYRPNFNGGNYSSYEDIDWSTWSTW 247 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK--------- 263 + N +K+ + + I + + + Sbjct: 248 TRNQVRDCSNGYYSCTGKKRRDARTVYAILNTSKSETGSGWYFPDPYSYINFPDSVAKTF 307 Query: 264 -SRLNKLN--------PYEN------------------------------TNTYPAMHHA 284 ++ N L T+ Y + Sbjct: 308 TAKANNLQFQSTNQKLYSGGMCSGNFWTIPLTSEKTTLSPIQNNMSPDGGTSVYQGLIRG 367 Query: 285 YRELYNEKESSHNTIGSTRLK---KFVIFITDGENSGASAYQNTLNTLQICEYMR----- 336 + L + +S + S K ++ ++DG+ + N L +C ++ Sbjct: 368 AQILEQGRPTSPSPETSAAYNSRIKMILMLSDGQEMPYVSTFNQLVNQGLCNTIKAQFND 427 Query: 337 -NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + + + + + +GQ + C + + + ++ + +I + I+E Sbjct: 428 SDQQLYMGVLGIEFDAQGQQGFKNCVGQNN----ITNVDDVDDLIKEILEMIKE 477 >gi|70733679|ref|YP_257319.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68347978|gb|AAY95584.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 582 Score = 96.5 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 46/406 (11%), Positives = 111/406 (27%), Gaps = 45/406 (11%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 + + + Q AL AA + D T K Sbjct: 14 GFAAGLLLAVAGCGVSSKPESAAGSSTQGALQAA-----PQAQYEVQHADATMAKRAVH- 67 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES----------KAQYE 112 + S + + K +NP+ +AE+ Y Sbjct: 68 -PMRLSAPMPAPISSRDSLVAGYRDEPREQYQKLPDNPIHSVAEAPVSTFSADVDTGAYA 126 Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 L L P L + S V + Sbjct: 127 NVRRLLNQGSLPPEGAVRLEELVNYFPYDYALPTDGSPFGVTTELAPSPWNPHTRLLRIG 186 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 S++ + P + + + S P ++ + LV+ ++ Sbjct: 187 IKASDRAVAELAPANLVFLVDVSGSMDRREGLP------LVKSTLKLLVDQLRDQD---- 236 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 R+ + Y + ++++ +++L +T + AY+ Sbjct: 237 ----RVSLVVYAGESRVVLEPTSGRDKAKIRTAIDQLTAGGSTAGASGIQLAYQMAQQGF 292 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 ++ TDG+ + + ++L + R +G+ + ++ Sbjct: 293 IDQGINR--------ILLATDGDFNVGVSDFDSL--KAMAAEKRKSGVSLTTLGFGVDNY 342 Query: 353 GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + L+ + D+ G + +++ L E+ + D++ +A + Sbjct: 343 NEHLMEQLADAGDGNYAYIDN---LREARKVLVDQLSSTLAVVAKD 385 >gi|315499132|ref|YP_004087936.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315417144|gb|ADU13785.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 519 Score = 96.5 bits (238), Expect = 7e-18, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 72/216 (33%), Gaps = 8/216 (3%) Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ-KAIQEKKNLSVRIGTI 241 + + + + + ++ + S ++ Sbjct: 304 TTSTGTSVAYSPSGYTAFSSSYFSSLATSSWGGCLTDRNQSFDVSVAPYATDVVDSNYIA 363 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE-SSHNTIG 300 A + L+++ V + L+ L+P NTN + L + + G Sbjct: 364 ASCSTTALAKVLDLTSDFTSVNTYLSSLSPGGNTNITLGVQFGMEMLSPAEPYTKATAFG 423 Query: 301 STRLKKFVIFITDGENSGASAYQN----TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 T +KK++I +TDG N+ + T C + G+ ++ V V L Sbjct: 424 DTDVKKYMIIVTDGANTQNRWSTSNSAINARTALACTAAKAQGITLFVVRV--EDGDSSL 481 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 L C S ++ ++ + +L ++ I I + + Sbjct: 482 LEACASQSSYYYDLSQASDLTKTMQDIFATINKLRL 517 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 25/219 (11%), Positives = 53/219 (24%), Gaps = 27/219 (12%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + A+D+ +++Q A DAAVL I Sbjct: 22 IFGLCAVILVGAAGGAVDMMRYFDTSSRLQDATDAAVLKATQKIEVSEAAAKTAAAMAFE 81 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + I + IT+ + Sbjct: 82 MNLSDHPELQTASHTFAIETSDNAKVVHYTSEITQRP---------------------YF 120 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L+ + + S+ E + + VLD + SM + + + L Sbjct: 121 LQLLGLGEQTIRVASSAQSE----SDPFELLFVLDTTGSMASNNKMTYLKTSVSSVLSSL 176 Query: 181 LPPPPKKSFWSK--NTTKSKYAPAPAPANRKIDVLIESA 217 + + K + PA + + Sbjct: 177 ISTYGDGNEDVKVGVVAFNTQVRLPASTSYSFVDYTQCY 215 >gi|312958282|ref|ZP_07772803.1| von Willebrand factor type A domain [Pseudomonas fluorescens WH6] gi|311287346|gb|EFQ65906.1| von Willebrand factor type A domain [Pseudomonas fluorescens WH6] Length = 546 Score = 96.1 bits (237), Expect = 9e-18, Method: Composition-based stats. Identities = 43/366 (11%), Positives = 106/366 (28%), Gaps = 29/366 (7%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN 92 +D LS P + KQ L S RE ++ Sbjct: 38 VDTGPLSVLEPYPVPVARSKPMPMPAPMTARLAKQESATLDYRSEPREQYANLPDNPVHR 97 Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + + + + Y L L P L + S Sbjct: 98 VAETPVSTFSVDVD-TGSYANVRRFLNQGSLPPEGAVRLKEMVNYFPYHYALPTDGSPFG 156 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 V + + S++ + P + + + S P + Sbjct: 157 VTTEVAATPWNPRTQLLRIGIKASDRAVAELAPANLVFLVDVSGSMDRREGLP------L 210 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + + LV+ +++ R+ + Y + ++++ +++L Sbjct: 211 VQSTLKLLVDQLREQD--------RVSLVVYAGESRVVLPPTSGRDKAKIRTAIDQLTAG 262 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 +T + AY+ ++ TDG+ + + ++L Q+ Sbjct: 263 GSTAGASGIQLAYQMAREGFIDKGINR--------ILLATDGDFNVGISDFDSL--KQMA 312 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQS 391 R +G+ + ++ + L+ + D+ G + ++ L E+ + D++ Sbjct: 313 AEQRKSGVSLTTLGFGVDNYNEHLMEQLADAGDGNYAYIDT---LREARKVLVDQLSSTL 369 Query: 392 VRIAPN 397 +A + Sbjct: 370 TVVARD 375 >gi|327262385|ref|XP_003216005.1| PREDICTED: vitrin-like [Anolis carolinensis] Length = 748 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 45/352 (12%), Positives = 91/352 (25%), Gaps = 31/352 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + + + +K+ K Q + ++ ++ G+ + I P Sbjct: 418 KTYANSKDLKNAIEKILQKGGLSNVGKALSFVNKNFFEDSNGNRGAAPNVAIVMVDGWPT 477 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + E+ N+F + + ++V Sbjct: 478 DKVEEASRLARESGINIFFVTIEGPDENEKQNVVEANF-VDKAVCRTNGYYSINVPSWFS 536 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 K S + + VL A Sbjct: 537 LHK---VVQPLVKRICDSDHLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVAN--- 590 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPA 280 E + RIG + Y + +V + + ++N T+T A Sbjct: 591 ---ISKEFEISDTDTRIGAVQYTYEQRLEFGFEKQSTKQDVLNAIKRINYWSGGTSTGAA 647 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 +++A+ +L+ +K +I ITDG + + G+ Sbjct: 648 INYAFEQLF--------IKSKPNKRKIMILITDGRSYD--------DVQGPATAAHQNGV 691 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQ 390 YSV + QD L F V++ L +I I + Sbjct: 692 ITYSVGI--AWAAQDELEAIATDPDKEHSFFVDEFDSLYRFVPRIIQNICTE 741 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 31/321 (9%), Positives = 82/321 (25%), Gaps = 41/321 (12%) Query: 92 NITKDKNNPLQYIAESKAQYEIPTE--NLFLKGLIPSALTNLSLRSTGIIERSSENLAIS 149 +I +P + A ++ + L ++ + I+ Sbjct: 250 DIGSSNVHPAYSSVAAVASRQVQAVQGRTQNQALRGASSYASNRNIPRPNTGLQRQEPIA 309 Query: 150 ICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP----KKSFWSKNTTKSKYAPA--- 202 + + + + S K Sbjct: 310 AFR-KPANNPANLAMERDLWKPASTLLDTGFSSKEDLVPKTLEPASHGNPNCKVDVCFLI 368 Query: 203 ---PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + R+ + ++ S+ + +G I Y N Sbjct: 369 DGSWSIGKRRFQIQKHFLKDVAQSLDVGVAGPL-----MGIIQYGDDPSTEFNLKTYANS 423 Query: 260 NEVKSRLNK-LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++K+ + K L +N A+ + + + + + I + DG + Sbjct: 424 KDLKNAIEKILQKGGLSNVGKALSFVNKNFFEDSNGNRGAAPN-----VAIVMVDGWPTD 478 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD-------LLRKCTDSSGQFFAVN 371 + R +G+ I+ V + P E + + K + ++++N Sbjct: 479 --------KVEEASRLARESGINIFFVTIEGPDENEKQNVVEANFVDKAVCRTNGYYSIN 530 Query: 372 DSR--ELLESFDKITDKIQEQ 390 L + + +I + Sbjct: 531 VPSWFSLHKVVQPLVKRICDS 551 >gi|323138635|ref|ZP_08073702.1| hypothetical protein Met49242DRAFT_3090 [Methylocystis sp. ATCC 49242] gi|322396123|gb|EFX98657.1| hypothetical protein Met49242DRAFT_3090 [Methylocystis sp. ATCC 49242] Length = 547 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 48/413 (11%), Positives = 119/413 (28%), Gaps = 66/413 (15%) Query: 46 SDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA 105 S + D +K + + + + ++ + + + + A Sbjct: 137 SMNSSSDGQSKISILKSAANSFVDTMFSKSNNVK--FSVVPFSSGVAAVDPSEPSSRNAA 194 Query: 106 ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL 165 + G + + R + + N A+ + ++ Sbjct: 195 WVDKNGANSQHWIAFGGKTAANAAGFTSRFDIFDKLKARNSALDWRGCFEPQVYPLNVND 254 Query: 166 QKHNDNNN-MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 N ++ +L P P S W N + Y A + + Sbjct: 255 TTPNPSDAETLFVPFLAPDEPDNSGWGGNPYWNNYFGDNPSACSSSASGAWARLSRACKY 314 Query: 225 QKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMH 282 + S G+ ++ + L+ ++V++++N+L TN + Sbjct: 315 NATGSLGGSFGPSDFKGSSSFCPDPGTQRILQLTQKKSDVQNKINQLVANGATNLHEGFM 374 Query: 283 HAYRELYNEKE-SSHNTIGSTRLKKFVIFITDG---------------ENSGASAYQNTL 326 +R L S + + +K ++F+TDG ++ N Sbjct: 375 WGWRTLSPNAPFSGGRAYQAPKNRKIMVFMTDGFNSWNSRVNTATGSTYDTLGYYSYNGA 434 Query: 327 N----------------------------------------TLQICEYMRNAGMKIYSVA 346 T Q C + AG++++++ Sbjct: 435 ENERFPDGSQGNGVNYRSLLAAAANNSSSYQTISRAMQDELTRQACTNAKTAGIEVFTIG 494 Query: 347 VSA-----PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 S +G L+++C + +F D+ +L +F +I + + + + Sbjct: 495 FSVSGDPIDAQGLALMKECATNEDHYFKAEDASQLNAAFSQIGIGLGKLRLSL 547 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 26/205 (12%), Positives = 53/205 (25%), Gaps = 28/205 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + L + D R +Q A D+AVL+ + + T + Sbjct: 2 IFGLSLMPVMLMLGATADYTRFTTTRAALQQAADSAVLTVASKMTESTTNAQAKDQAQVV 61 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + +T + + + A+ I + + Sbjct: 62 LNA---------------------QPRMTTAIVTGATVSEDKRTVCATAKVTIQNSFMQM 100 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH----NDNNNMTS 176 L + LT + + I +VLD S SM + + + Sbjct: 101 AQL--ATLTPTVKSCANLAGGADPGTTYEIALVLDNSGSMNSSSDGQSKISILKSAANSF 158 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAP 201 + +S S A Sbjct: 159 VDTMFS-KSNNVKFSVVPFSSGVAA 182 >gi|229587743|ref|YP_002869862.1| hypothetical protein PFLU0165 [Pseudomonas fluorescens SBW25] gi|229359609|emb|CAY46451.1| conserved hypothetical protein [Pseudomonas fluorescens SBW25] Length = 551 Score = 95.7 bits (236), Expect = 1e-17, Method: Composition-based stats. Identities = 38/401 (9%), Positives = 116/401 (28%), Gaps = 51/401 (12%) Query: 3 AIIISVCFLFITYAIDLAHIMYIR-----NQMQSALDAAVLSGCASIVSDRTIKDPTTKK 57 +++ + + + ++Q+A L + ++ + Sbjct: 25 GFAVTMVVALAGCGLSSSP-ELAKPAEPVAELQTASPQGAL--VKRMAMPAPMRMQESAA 81 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + ++Q +L + + + + Y Sbjct: 82 MDYRSEPREQY-ANLPDN-------------PVHRVAETPVSTFSVDVD-TGSYANVRRF 126 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 L L P L + S V + + S+ Sbjct: 127 LNQGSLPPEGAVRLEEMVNYFPYHYALPTDGSPFGVTTEVAATPWNPRTQLLRIGIKASD 186 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 + + P + + + S P ++ + LV+ +++ R Sbjct: 187 RPVAELAPANLVFLVDVSGSMDRREGLP------LVKSTLKLLVDQLREQD--------R 232 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 + + Y + ++++ +++L +T + AY+ + Sbjct: 233 VSLVVYAGESRVVLKPTSGRDKVKIRNAIDQLTAGGSTAGASGIELAYQMAREGFIDNGI 292 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 ++ TDG+ + + ++L Q+ R +G+ + ++ + L+ Sbjct: 293 NR--------ILLATDGDFNVGISDFDSL--KQMAVEQRKSGVSLTTLGFGVDNYNEHLM 342 Query: 358 RKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + D+ G + +++ L E+ + D++ +A + Sbjct: 343 EQLADAGDGNYAYIDN---LREAHKVLVDQLSSTLAVVARD 380 >gi|329888464|ref|ZP_08267062.1| hypothetical protein BDIM_03870 [Brevundimonas diminuta ATCC 11568] gi|328847020|gb|EGF96582.1| hypothetical protein BDIM_03870 [Brevundimonas diminuta ATCC 11568] Length = 650 Score = 95.4 bits (235), Expect = 1e-17, Method: Composition-based stats. Identities = 40/283 (14%), Positives = 83/283 (29%), Gaps = 43/283 (15%) Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 V VS S L + + K + + A + Sbjct: 368 YRVASVSGSQVVLDTTRSLGLAGAS--KGGVTYSSGGKLMNGRDGSEWRVFPTADGYVNV 425 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 V + V ++ + + + LS + + +KS++++++ Sbjct: 426 HASSTCVSERVGVERYTDARPSTAYVGRSYLSSSNSCPSAELSALSTSASSLKSKIDQMS 485 Query: 271 PYENTNTYPAMHHAYRELYN------EKESSHNTIGSTRLKKFVIFITDGENS------- 317 +T + A+ L E + K I +TDGE + Sbjct: 486 AGGSTAGQIGIAWAWYALSPDFASLFSGEGQPGAYAPSDTLKVAILMTDGEFNTPFRDGV 545 Query: 318 ------------------GASAYQNTLNTLQICEYMRNAGMKIYSVAV---------SAP 350 +S ++ +C+ M+ G+ +Y+V Sbjct: 546 IALDAGTGSGGLDSHIDLNSSNGDPFAQSVALCQAMQAKGVVVYTVGFDLGSATGREGVV 605 Query: 351 PEGQDLLRKCTDSSG-QFFAVNDSRELLESFDKITDKIQEQSV 392 D++R+C + FF +D +L E+F I I + Sbjct: 606 DTALDVMRECATNEQTHFFQADDGTDLKEAFRAIGRDITRLRI 648 Score = 36.8 bits (83), Expect = 5.6, Method: Composition-based stats. Identities = 18/212 (8%), Positives = 47/212 (22%), Gaps = 18/212 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M A+ + V + AID+ ++ L + + T Sbjct: 31 MFAMALPVLLMITLGAIDIHQASKVKA---------QLQDALDAAALAAARSTFTDDVNI 81 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + +K ++ +T + K + Sbjct: 82 NKVGLAALKANMPSYFGEASGDTASFVLLNNRVTGEA------TVNVKVLVANVVLPPYG 135 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV-SRSMEDLYLQKHNDNNNMTSNKY 179 K L + +A+ I +D SR+ + Sbjct: 136 KLLDDFLPVSSRSEVLRASRNVEVAMALDITGSMDNCSRNCPPTSKLEDLQAAAKELVDI 195 Query: 180 LLPPPPKKSFWS--KNTTKSKYAPAPAPANRK 209 ++ + + + + + Sbjct: 196 VVQDQQTPFYSKVALIPYAAGVNVGSSAISAR 227 >gi|145224243|ref|YP_001134921.1| hypothetical protein Mflv_3659 [Mycobacterium gilvum PYR-GCK] gi|189040172|sp|A4T9I4|Y3659_MYCGI RecName: Full=UPF0353 protein Mflv_3659 gi|145216729|gb|ABP46133.1| von Willebrand factor, type A [Mycobacterium gilvum PYR-GCK] Length = 335 Score = 95.4 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 79/222 (35%), Gaps = 26/222 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A A ++ E+A + + I +G IAY Sbjct: 95 NRAVVMLVIDVSQSMRATDVAPNRLTAAQEAAKQFADQLTPGIN--------LGLIAYAG 146 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + N K+ ++KL + T T + A + + + G Sbjct: 147 TATVLVSP--TTNRESTKTAIDKLQLADRTATGEGIFTALQAIATV--GAVIGGGDEPPP 202 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PE 352 V+ ++DG+ + S N ++ G+ I +V+ P P Sbjct: 203 ARVVLMSDGKETVPSNPDNPKGAYTAARTAKDQGVPISTVSFGTPYGYVEINEQRQPVPV 262 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++L+K D S G+ F + +L + F + ++I ++++ Sbjct: 263 DDEMLKKIADLSGGEAFTASSLEQLKQVFTNLQEQIGYETIK 304 >gi|90412167|ref|ZP_01220173.1| hypothetical protein P3TCK_27759 [Photobacterium profundum 3TCK] gi|90326891|gb|EAS43276.1| hypothetical protein P3TCK_27759 [Photobacterium profundum 3TCK] Length = 504 Score = 95.4 bits (235), Expect = 2e-17, Method: Composition-based stats. Identities = 56/477 (11%), Positives = 139/477 (29%), Gaps = 103/477 (21%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 TA+ + F I +A++ + +N++ A +AA L+ + D+T ++ Sbjct: 25 TALALIPLFGMIFWALEGTRYIQKKNRLADATEAATLAITTANQDDKTYEN--------- 75 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL---QYIAESKAQYEIPTENL 118 + I+ +++ + I + ++ T D N QY +K + + Sbjct: 76 QLATGYIQAYIRNITSINNIKIERSEGIDNYPTPDGNEEREYFQYRVTAKTNHISWLSSD 135 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND-------- 170 + P+ + + I I V D S SM+ ++ D Sbjct: 136 IIPSFAPTETVANRALAR-NYPIYLGDKDIDIVFVSDFSGSMKGNKIRALKDAIQAIANE 194 Query: 171 -----------NNNMTSNKYLLPPPPKKSF-------WSKNTTKSKYAPAPAPANRKI-- 210 N + Y + K+S + + Sbjct: 195 ILVPRDGEVEVTNRIAFVPYNMRVQEKRSNTRWCITQLDYRPNFNGGNYSSYEDIDWSTW 254 Query: 211 -----DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN------NL 259 + + + + + K ++ + + + G P S Sbjct: 255 STWTRNQVRDCSNGYYSCTGKKRRDARTVYAILNASKSETGSGWYFPDPYSYINFPGSVA 314 Query: 260 NEVKSRLNKLN--------PYEN------------------------------TNTYPAM 281 ++ N L T+ Y + Sbjct: 315 KTFTAKANNLQFQSTNQKLYSGGMCSGNFWTIPLTSEKTALSPIQNNMSPDGGTSVYQGL 374 Query: 282 HHAYRELYNEKESSHNTIGSTRLK---KFVIFITDGENSGASAYQNTLNTLQICEYMR-- 336 + L + +S +T S K ++ ++DG+ + N L +C ++ Sbjct: 375 IRGAQILEQGRPTSPSTETSAAYNSRIKMILMLSDGQEMPYVSTFNQLVNQGLCNTIKAQ 434 Query: 337 ----NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + + + + + +GQ + C + + + ++ + +I + I++ Sbjct: 435 FNDSDQPLYMGVLGIEFDAQGQQGFKNCVGQNN----ITNVDDVDDLIKEILEMIKK 487 >gi|146307722|ref|YP_001188187.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] gi|145575923|gb|ABP85455.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] Length = 566 Score = 95.0 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 40/366 (10%), Positives = 109/366 (29%), Gaps = 32/366 (8%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN 92 D V + A + R S++ + + RE + Sbjct: 48 ADELVAAAPAELTRKRAAPLAGQLAPMPSSVADAALPGY---RDVPREQYQNYPDNPVFA 104 Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 +++ + + Y L L P L + Sbjct: 105 VSETPVSTFSIDVD-TGSYANVRRFLNGGQLPPKDAVRLEELVNYFPYAYPLPQGDAPFG 163 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 V + S++ + PP + + + S + P + Sbjct: 164 VSTELAVTPWNPQTRLLRIAIKASDRSVEELPPANLVFLVDVSGSMHRREGLP------M 217 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + + LV+ ++ R+ + Y + ++ ++++ +++L Sbjct: 218 VQGTLKLLVDQLRPQD--------RVSLVTYAGATQVVLDSTPGSDKAKIRAAIDQLTAG 269 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 +T + AY++ ++ TDG+ + + ++L Q+ Sbjct: 270 GSTAGESGIQLAYQQASKHLIEGGINR--------ILLATDGDFNVGISDFDSL--KQLA 319 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQS 391 R G+ + ++ + L+ + D+ G + +++ L E+ + D++ Sbjct: 320 ADKRKTGVSLTTLGFGVDNYNERLMEQLADAGNGNYAYIDN---LREARKVLVDQLASTL 376 Query: 392 VRIAPN 397 +A + Sbjct: 377 ATVASD 382 >gi|145299821|ref|YP_001142662.1| flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|88866595|gb|ABD57363.1| FlpL [Aeromonas salmonicida subsp. salmonicida A449] gi|142852593|gb|ABO90914.1| putative flp pilus assembly protein FlpL [Aeromonas salmonicida subsp. salmonicida A449] Length = 460 Score = 95.0 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 52/452 (11%), Positives = 116/452 (25%), Gaps = 67/452 (14%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A++++ I+L ++ + +A D AVL A + + + Sbjct: 15 FALMLTGILALTGVVIELVRGYSGQSLLSAAAD-AVLYSAADSDTAAEDAVALVQANLAG 73 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP------LQYIAESKAQYEIPT 115 + L Q A ++++ + ++ + EI Sbjct: 74 R-PLQVGPPSLSQSEQGARVILQGHVPALMDLSVIGEGGDMPVAAAARASSARTRIEIAL 132 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK-------- 167 + + + + T E + V+ + + + + Sbjct: 133 VLDVSNSMSGAPMKAIKQGLTEFGEVLFGRERRNQDRVVSIIPATGLVNIGDHPELFHPE 192 Query: 168 ---------------HNDNN---------NMTSNKYLLPPPPKKSFWSKNTTKS------ 197 N L ++ T Sbjct: 193 SLAFPFGLQTLAHERGWSNLLTRDVPGRQRKAFCARLPEHVDGIDRLAELTPGWIRKLEQ 252 Query: 198 --------KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + P K + +N ++G V Sbjct: 253 APVGETQPRLHYSTKPPAIKQYEDGTPLRAFAPRENPLERYLENRRDKLGIFDDADCGVS 312 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT--------IGS 301 LS E + L+ L NTNT + +R L + + G Sbjct: 313 PIQAHLST-RAEYRQALDTLYAAFNTNTAEGVMWGWRLLSPQWQGRWGQGAAELPRPYGQ 371 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 +K ++ +DGE+ G A L +C M+ G+++Y+VA + +C Sbjct: 372 ADNRKIMVLFSDGEHMGPEAALRDRKQLLLCREMKRKGIQVYTVAFEGDAR---FVAQCA 428 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + ++ I + + Sbjct: 429 SDRSHAYKATS-GNIRTVLTRLASAINDVVLT 459 >gi|315444579|ref|YP_004077458.1| Mg-chelatase subunit ChlD [Mycobacterium sp. Spyr1] gi|315262882|gb|ADT99623.1| Mg-chelatase subunit ChlD [Mycobacterium sp. Spyr1] Length = 335 Score = 95.0 bits (234), Expect = 2e-17, Method: Composition-based stats. Identities = 37/222 (16%), Positives = 79/222 (35%), Gaps = 26/222 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A A ++ E+A + + I +G IAY Sbjct: 95 NRAVVMLVIDVSQSMRATDVAPNRLTAAQEAAKQFADQLTPGIN--------LGLIAYAG 146 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + N K+ ++KL + T T + A + + + G Sbjct: 147 TATVLVSP--TTNRESTKTAIDKLQLADRTATGEGIFTALQAIATV--GAVIGGGDEPPP 202 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PE 352 V+ ++DG+ + S N ++ G+ I +V+ P P Sbjct: 203 ARVVLMSDGKETVPSNPDNPKGAYTAARTAKDQGVPISTVSFGTPYGYVEINEQRQPVPV 262 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++L+K D S G+ F + +L + F + ++I ++++ Sbjct: 263 DDEMLKKIADLSGGEAFTASSLEQLKQVFTNLQEQIGYETIK 304 >gi|330830423|ref|YP_004393375.1| FlpL [Aeromonas veronii B565] gi|328805559|gb|AEB50758.1| FlpL [Aeromonas veronii B565] Length = 460 Score = 94.6 bits (233), Expect = 2e-17, Method: Composition-based stats. Identities = 49/452 (10%), Positives = 124/452 (27%), Gaps = 67/452 (14%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+++ AI+L ++ + +A DA + S S + + + Sbjct: 15 FALMLVGVLALTGVAIELVRGYNGQSLLSAAADAVIYSAADSDTALQDAQALMQANLSGR 74 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY-------IAESKAQYEIP 114 + + L QG + A ++++ + ++++ E+ Sbjct: 75 NL--QVATPRLSQGEQGAQVILQGKVPALMDLSAIDEGEEGMPIAAAARASSARSRIEVA 132 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK------- 167 + + + + E + V+ + + + + Sbjct: 133 LVLDISDSMSGAPMKAIKQGLVEFGEVLFGRERRNQERVVSIIPATGLVNIGDHPELFHP 192 Query: 168 -------HNDNNN------------------MTSNKYLLPPPPKKSFWSKNTTKSKYAPA 202 L ++ T Sbjct: 193 ESIEIPFGLRTLAEERGWLNLLSREVPGRQRKAFCARLPEHVDGIDRVAEVTPGWIRKLE 252 Query: 203 PAP---------ANRK---IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI-GIVG 249 AP + K I + L + ++ ++ + R ++ Sbjct: 253 QAPRDELRPHLFYSTKPPPIARYGDGTPLLAFAPKENPLDRYLENRRDKLGIFDDPDCGV 312 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT--------IGS 301 + + + + L+ L NTNT + +R L E + Sbjct: 313 SPIQAHLSTRADYRQALDTLYAAFNTNTAEGVMWGWRLLSPEWQGRWRQGAAALPRPYEL 372 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 +K ++ +DGE+ +A ++ L +C M+ G++IY+VA + +C Sbjct: 373 QDNRKIMVLFSDGEHMTEAALRDRKQLL-LCREMKRKGIQIYTVAF---EGDTRFVAQCA 428 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 F + ++ I + + Sbjct: 429 SDRSLAFKAT-KSNIRTVLTRLASSINDVVLT 459 >gi|315498201|ref|YP_004087005.1| von willebrand factor type a [Asticcacaulis excentricus CB 48] gi|315416213|gb|ADU12854.1| von Willebrand factor type A [Asticcacaulis excentricus CB 48] Length = 570 Score = 94.6 bits (233), Expect = 2e-17, Method: Composition-based stats. Identities = 41/305 (13%), Positives = 83/305 (27%), Gaps = 8/305 (2%) Query: 95 KDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 + + + + + R+ + + Sbjct: 267 YTTSGGTYRVFVTSNYVQNNGYCWPNYYTYTYCGYTVVSRTDYYSTTTWTRTNNASNSTP 326 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 S S + N + + S D Sbjct: 327 WPSASYYGTPSYSYAQYNGTITATPTSAGGYGSGSTTTIKDNSTITANSDLLGVGTDSWN 386 Query: 215 ESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + + Q + + + A L+ ++ V++ KL P Sbjct: 387 GCVIDRKQPYDVSGQSPIASNTDTLYPAAKCATNNLLPVMGLTTDIAAVRAHAQKLTPAG 446 Query: 274 NTNTYPAMHHAYRELYNEKE-SSHNTIGSTRLKKFVIFITDGENSGASAYQN----TLNT 328 NTN + L E ++ K++I ITDGEN+ + T Sbjct: 447 NTNITIGVQWGMELLSPELPFNTAKPYSDKTNYKYMIVITDGENTQNRWSTSASTINART 506 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 L C+ ++ G+ +Y++ V +L+ C F+ V S +L + K+ IQ Sbjct: 507 LLACQAAKDLGITVYTIRVMEGNSD--MLKSCASRPEYFYDVTASSQLTSTLAKVFYSIQ 564 Query: 389 EQSVR 393 + Sbjct: 565 STRLT 569 >gi|27366553|ref|NP_762080.1| protein TadG, associated with Flp pilus assembly [Vibrio vulnificus CMCP6] gi|27358119|gb|AAO07070.1| Protein TadG, associated with Flp pilus assembly [Vibrio vulnificus CMCP6] Length = 426 Score = 94.6 bits (233), Expect = 2e-17, Method: Composition-based stats. Identities = 42/435 (9%), Positives = 126/435 (28%), Gaps = 64/435 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + ++ + + + +++ + ++ A + A L+ I P ++ Sbjct: 13 IFMGMLPILVIIMVFSMQMTQRHMAHAKITEAAEVASLAL---------IASPKEGDEKN 63 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +K + ++ + + ++ L + + ++ Sbjct: 64 QEYAQKIVDHYIPDNKGEVVARVFHRRCEYKDGCVQRSGELAPFTDFVVSAKTKHDSWIS 123 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 LT R + I ++D+S SM + + ++ ++ Sbjct: 124 YNDGEMGLTKDFEVMGTSTSRKFLPQPLDIYFIIDMSGSMVNPWGGSGKTKYDVVADTIN 183 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + F + ++ A +++ + +S + N+ Sbjct: 184 RIVDDLREFKTDRKSRVAVIGFHHTAVKQVGRQRTAFDY--SSYRTPSATVNNMFTAPKI 241 Query: 241 IAYNIGIV--GNQCTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKES 294 + N + PL+ + + ++ N N Y T ++ + A + + Sbjct: 242 HSRNDSSNIKTFEDIPLTEDYDAFLTKFNSSNYYASRYGLTESWQGIIGAAQMAEQATD- 300 Query: 295 SHNTIGSTRLKKFVIFITDGENSG---------------ASAYQNTLNTLQICEYMRNA- 338 ++ I ++DG + + Y N L +CE ++ Sbjct: 301 -------LNPEQVFILLSDGRDGDFVRYYLEGRQWREVRYNKYLNRLVKAGLCEKLKTRI 353 Query: 339 -------------------GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + + V+ + D + C + + + Sbjct: 354 SQKRNVFQSENPSDKASKTKVTMGVIGVNYVVDKSDGIGDC-FGHDNIYHAKEG---NDV 409 Query: 380 FDKITDKIQEQSVRI 394 + I + I E++ R+ Sbjct: 410 YKYILNLINEETGRL 424 >gi|317491692|ref|ZP_07950127.1| hypothetical protein HMPREF0864_00890 [Enterobacteriaceae bacterium 9_2_54FAA] gi|316920126|gb|EFV41450.1| hypothetical protein HMPREF0864_00890 [Enterobacteriaceae bacterium 9_2_54FAA] Length = 416 Score = 94.6 bits (233), Expect = 3e-17, Method: Composition-based stats. Identities = 51/423 (12%), Positives = 126/423 (29%), Gaps = 65/423 (15%) Query: 2 TAIIISVC----FLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK 57 AI + + ++ + + R ++ A++ A L+ A D Sbjct: 22 FAISFVMMSGFLLSMAAFGLEGSRYITERARLSDAMEQAALALTAEDNGDG--------A 73 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + T+ + +++ + + + NN E + + ++ Sbjct: 74 QRNYTLSSDYFRAYMRHDVDVFKPTVI------VKSGISPNNQNLSYVEYRVSGQTLQDS 127 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 F PS + + G + N + + V D S SM + + Sbjct: 128 WFSSTFFPSFDKQVVIGDNGAARKFRSN--MDVIFVTDFSGSMNEGFGGSTKLAELKRIV 185 Query: 178 --------------KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS 223 K P + + P P++ ++ ++ Sbjct: 186 LKLSDELFSYNIDNKVGFVPFGWGGKEGIDCDFPFVSHGPVPSDILAGGNYKALEKYIDI 245 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIV-----GNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 + ++ G + PL+++L E+ +++N ++ T Sbjct: 246 SGSVAAIPNPVHDIQIPLSNVNGSTCLRNSHSWKVPLTSSLAEI-NQINGMSAEGGTLVS 304 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG-ENSGASAYQNTLNTLQICEYMR- 336 + L + T +K ++ ++DG ++ L +C+ +R Sbjct: 305 SGVLLGVPYLASG----------TASRKVMVIVSDGTDDPKNVMITPNLINAGMCDKIRQ 354 Query: 337 ----NAGM-KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE-----SFDKITDK 386 + + KI + ++ P + C F+ EL E F+++ Sbjct: 355 VLSTDESVGKISFIGIAYYPTVD--WKSCV-GDKNFYLPQTIDELEEDLRRAVFEEVGHN 411 Query: 387 IQE 389 I + Sbjct: 412 ILK 414 >gi|330502932|ref|YP_004379801.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] gi|328917218|gb|AEB58049.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] Length = 566 Score = 94.2 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 38/362 (10%), Positives = 100/362 (27%), Gaps = 32/362 (8%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A+ I L A + Sbjct: 51 LAAPAAAEMHVEASQKRMA---PMAYAPAPIADILPPSYRDESREQYQAYADNPVFAVAE 107 Query: 98 NNPLQYIAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + + Y L L P L V Sbjct: 108 TPVSTFSIDVDTGSYANVRRFLNDGQLPPKDAVRLEELVNYFPYAYPLPQGDVPFGVSTE 167 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 + S++ + PP + + + S + P ++ + Sbjct: 168 LAVTPWNPQTRLLRIAIKASDRSVEELPPANLVFLVDVSGSMHRREGLP------MVQGT 221 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 LV+ ++ R+ + Y + ++ ++++ +++L +T Sbjct: 222 LKLLVDQLRPQD--------RVSLVTYAGDSQVLLDSAPGSDKAKIRAAIDQLTAGGSTA 273 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + AY++ ++ TDG+ + + ++L Q+ R Sbjct: 274 GESGIQLAYQQASKHLIDGGINR--------ILLATDGDFNVGISDFDSL--KQLAADKR 323 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 +G+ + ++ + L+ + D+ G + +++ L E+ + D++ +A Sbjct: 324 KSGVSLTTLGFGVDNYNERLMEQLADAGNGNYAYIDN---LREARKVLVDQLSSTLATVA 380 Query: 396 PN 397 + Sbjct: 381 SD 382 >gi|225377140|ref|ZP_03754361.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] gi|225211045|gb|EEG93399.1| hypothetical protein ROSEINA2194_02786 [Roseburia inulinivorans DSM 16841] Length = 1406 Score = 94.2 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 55/315 (17%), Positives = 93/315 (29%), Gaps = 24/315 (7%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 QI++ Q E + + L+ G R + I Sbjct: 694 YQIDLDASSLATSQSTVEKIQTVDAMMVFDLSGSMNEIMSGQNQLKDIGEFSRVKNQMDI 753 Query: 149 SICMVLDVSRSMEDLYLQKHN-----DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 + + + P W K S + + Sbjct: 754 NKVYYWNKYEKSGWWPWTYDKSVGMGTAAVSGNVYAKYPVKYIDGQWKKYVDGSYQSISD 813 Query: 204 ----APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP--LSN 257 A KI L ++A V I + I LS Sbjct: 814 SDVMAVWTSKISALKDAASGFVTGISDTSPDSLVGIATFYGIGNGWNSSTEGKLNHGLSK 873 Query: 258 -NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N NE+ +N L T+ + HAY EL ++ + KK+VI +DGE Sbjct: 874 VNKNEMLKSVNALFADGGTSPQKGLEHAYSELQKAEDGN---------KKYVILFSDGEP 924 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 S ++ + + T ++ AG + +V + E L + S+G F + + EL Sbjct: 925 SDSN---DKMETEASAVKLKEAGYTVITVGLGLNNETATWLGEKVASAGCAFTADTAEEL 981 Query: 377 LESFDKITDKIQEQS 391 + F I I + Sbjct: 982 NKIFQNIQSTITQSR 996 >gi|227822378|ref|YP_002826350.1| hypothetical protein NGR_c18330 [Sinorhizobium fredii NGR234] gi|227341379|gb|ACP25597.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 602 Score = 94.2 bits (232), Expect = 3e-17, Method: Composition-based stats. Identities = 35/367 (9%), Positives = 95/367 (25%), Gaps = 28/367 (7%) Query: 31 SALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQ 90 ++D A + + + + TI L + RE G+ Sbjct: 82 QSVDQAADAAAPMSTMGGAV--GLAARSRMETIPAPAPADMLPPPAENRERFGNADANPV 139 Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 ++ + + + A Y +L + + + Sbjct: 140 KSVAAEPVSTFSVDVD-TASYSFVRRSLMAGEMPNPDAVRVEEMVNYFPYDWPRPTTAAE 198 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 V+ + + + + + P+ + K+ Sbjct: 199 PFKATVTVTPTPWNAGTRLMHVAIKGYEVVQKEAPRANLV-----FLIDVSGSMDEPDKL 253 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 +L + LV+ ++ + + Y + ++ S ++ L Sbjct: 254 PLLKNAFRLLVDRLRPDDT--------VSIVTYAGNAGTVLEPTAVKDKTKILSAIDTLQ 305 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P +T + AY+ ++ TDG+ + + + Sbjct: 306 PGGSTAGAAGIDAAYQLAEKAFVRDGVNR--------ILLATDGDFNVG--PSSDEELKR 355 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + E R +G+ + + L++ + L E+ + ++ Sbjct: 356 MVETKRRSGIFLSVLGFGRGNYNDALMQTIAQNGNGV--AAYIDTLAEAQKTLVEEAGSS 413 Query: 391 SVRIAPN 397 IA + Sbjct: 414 LFPIAKD 420 >gi|120403735|ref|YP_953564.1| hypothetical protein Mvan_2751 [Mycobacterium vanbaalenii PYR-1] gi|166988604|sp|A1T8Q8|Y2751_MYCVP RecName: Full=UPF0353 protein Mvan_2751 gi|119956553|gb|ABM13558.1| von Willebrand factor, type A [Mycobacterium vanbaalenii PYR-1] Length = 335 Score = 93.8 bits (231), Expect = 4e-17, Method: Composition-based stats. Identities = 36/222 (16%), Positives = 78/222 (35%), Gaps = 26/222 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A A ++ E+A + + I +G IAY Sbjct: 95 NRAVVMLVIDVSQSMRATDVAPNRLVAAQEAAKQFADQLTPGIN--------LGLIAYAG 146 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + N K+ ++KL + T T + A + + + G Sbjct: 147 TATVLVSP--TTNREATKAAIDKLQLADRTATGEGIFTALQAVATV--GAVIGGGDEPPP 202 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PE 352 ++ ++DG+ + S N ++ G+ I +V+ P P Sbjct: 203 ARIVLMSDGKETVPSNPDNPKGAYTAARTAKDQGVPISTVSFGTPYGYVEINDQRQPVPV 262 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++L+K D S G F + +L + F + ++I ++++ Sbjct: 263 DDEMLKKIADLSGGDAFTASSLEQLKQVFTNLQEQIGYETIK 304 >gi|170751925|ref|YP_001758185.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] gi|170658447|gb|ACB27502.1| hypothetical protein Mrad2831_5557 [Methylobacterium radiotolerans JCM 2831] Length = 568 Score = 93.8 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 58/421 (13%), Positives = 108/421 (25%), Gaps = 69/421 (16%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 ++ K + ++ + D Sbjct: 151 IALVLDNTGSMAASSGGQSKLRAVQTAATDFVNYVYTSPAFSSATKVAIVPFAAAVAVDP 210 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 + + + + ++ S R + + N L+ Sbjct: 211 SAYRY----ASWIDQNGLSSYHWTNIVQSNPAPFKNRFEVFAKLQAANRNWGWAGCLESL 266 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 ++ ++ P + + N ID Sbjct: 267 PYPFNVQDGAPTTKDSFYVPLLAPDEPGDGTSGGASFPVGNQRSTAYSYNSYIDDETGPD 326 Query: 218 GNLVNSIQKA---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 N+ A ++ + S N G L+NN N +K+ +N Sbjct: 327 CRNSNTYNTALGQACKYVNPKDPASGSPLGIPNGPNFGCTTQPLQRLTNNTNALKTLINN 386 Query: 269 LNPYENTNTYPAMHHAYRELYNEK--------ESSHNTIGSTRLKKFVIFITDG------ 314 + P +TN + +R L SS N+ +T + K +I +TDG Sbjct: 387 MAPSGSTNIHEGFMWGWRTLSPNSVFADGQPYASSANSSNATNINKIIILMTDGTNSWGT 446 Query: 315 -------------------------------ENSGASAYQNTLN------TLQICEYMRN 337 + A NT T + C + Sbjct: 447 NSSAPTGSLYFAAGYFRNANGTTPNPRLTTAYQNTNIADGNTARKALDALTAEACANTKA 506 Query: 338 AGMKIYSVAVSA-----PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + IY++ S GQ LLR C S QF+ N S +L+++F I I + Sbjct: 507 VNISIYTIGFSVPTDPIDSAGQTLLRNCASSPDQFYLANSSDDLIKAFKSIQASIGALRL 566 Query: 393 R 393 Sbjct: 567 T 567 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 26/217 (11%), Positives = 61/217 (28%), Gaps = 5/217 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDA-AVLSGCASIVSDRTIKDPTTKKDQ 59 + A + + AID + ++Q+A DA A+L + + + + Sbjct: 30 LFAFLSVPMVMIGGAAIDYGFATRLETKLQTATDATALLLCQTPLTTSEAELNTLAQTTM 89 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 T + + + A K + + +Q P F Sbjct: 90 TGAMGAANLVVDRLAITSSPRKITLTAHKQSTTFFGGLTGTQRINPGAVSQCATPLPKTF 149 Query: 120 LKGLI----PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 L+ S + +S +++ ++ S + + + Sbjct: 150 EIALVLDNTGSMAASSGGQSKLRAVQTAATDFVNYVYTSPAFSSATKVAIVPFAAAVAVD 209 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 + Y ++ S + PAP + +V Sbjct: 210 PSAYRYASWIDQNGLSSYHWTNIVQSNPAPFKNRFEV 246 >gi|260856317|ref|YP_003230208.1| hypothetical protein ECO26_3261 [Escherichia coli O26:H11 str. 11368] gi|260868996|ref|YP_003235398.1| hypothetical protein ECO111_3021 [Escherichia coli O111:H- str. 11128] gi|300903656|ref|ZP_07121573.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|301303269|ref|ZP_07209394.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|257754966|dbj|BAI26468.1| conserved predicted protein [Escherichia coli O26:H11 str. 11368] gi|257765352|dbj|BAI36847.1| conserved predicted protein [Escherichia coli O111:H- str. 11128] gi|300404332|gb|EFJ87870.1| von Willebrand factor type A domain protein [Escherichia coli MS 84-1] gi|300841443|gb|EFK69203.1| von Willebrand factor type A domain protein [Escherichia coli MS 124-1] gi|315255206|gb|EFU35174.1| von Willebrand factor type A domain protein [Escherichia coli MS 85-1] gi|323156423|gb|EFZ42578.1| von Willebrand factor type A domain protein [Escherichia coli EPECa14] gi|323176802|gb|EFZ62392.1| von Willebrand factor type A domain protein [Escherichia coli 1180] Length = 584 Score = 93.8 bits (231), Expect = 5e-17, Method: Composition-based stats. Identities = 38/381 (9%), Positives = 111/381 (29%), Gaps = 49/381 (12%) Query: 20 AHIMYIRNQMQSALDAAVL--SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY 77 A + +Q L AA + + T + Q KQ+ ++ Sbjct: 75 AQQYSDKQALQGRLQAAPKYQHAAREKAASQIANPGTARYKQFDDNPVKQVAQNPLATFS 134 Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 + + G A + P E Y + + + Sbjct: 135 LDVDTGSYANVRRFLNHGLLPPPDAVRVEEIVNY-----FPYDWDIKDKQSIPATKPIPF 189 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 + + + L + + + +N+ Sbjct: 190 AMRYELAPAPWNEQLTLLKIDILAKDHKSEELPASNLVFLI------------------- 230 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 ++ ++ ++ S LV +++ I + Y + + Sbjct: 231 -DTSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGS 281 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 282 HKAEINAAIDSLDAEGSTNGGAGLELAYQQAAKGFIKGGINR--------ILLATDGDFN 333 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSREL 376 + + + + R +G+ + + V + ++ + D +G + ++ L Sbjct: 334 VG--IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT---L 388 Query: 377 LESFDKITDKIQEQSVRIAPN 397 E+ + ++++ + +A + Sbjct: 389 SEAQKVLNSEMRQTLITVAKD 409 >gi|187934443|ref|YP_001887479.1| von Willebrand factor type A domain protein [Clostridium botulinum B str. Eklund 17B] gi|187722596|gb|ACD23817.1| von Willebrand factor type A domain protein [Clostridium botulinum B str. Eklund 17B] Length = 1596 Score = 93.4 bits (230), Expect = 5e-17, Method: Composition-based stats. Identities = 36/232 (15%), Positives = 71/232 (30%), Gaps = 37/232 (15%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI---GTIAYNIGIV---- 248 A KI L ++A N ++S+ + + +V+ G ++YN Sbjct: 181 YCNNHKAYESYTTKIHELKKAAKNFIDSLTSTKTDGQTPNVKNLKIGIVSYNNSGYINEG 240 Query: 249 ----GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK----------ES 294 + + N+NE+K + L TNT + A L E + Sbjct: 241 LVQVTDSDRKNNGNINELKDTIENLRADGGTNTGDGLRKAAYLLNEENEANKTVIFMGDG 300 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLN------TLQICEYMRNAGMKIYSVAVS 348 S R + D + + I E ++ ++SV Sbjct: 301 EPTYYSSDRWGNDYTNLDDTNQYVGGTGYSDADGKCLSYAKTIGEIIKGEQYNVFSVGYG 360 Query: 349 APPEG---QDLLRK-------CTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 E + +++ + F +D + + F +I D I + Sbjct: 361 LGDENSASNNKMKQIHESMGGISSGENSTFFASDEGAIDKVFQQIADTIIKT 412 >gi|148976298|ref|ZP_01813022.1| hypothetical protein VSWAT3_18848 [Vibrionales bacterium SWAT-3] gi|145964392|gb|EDK29647.1| hypothetical protein VSWAT3_18848 [Vibrionales bacterium SWAT-3] Length = 401 Score = 93.4 bits (230), Expect = 5e-17, Method: Composition-based stats. Identities = 43/309 (13%), Positives = 81/309 (26%), Gaps = 54/309 (17%) Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 N+T + + + Sbjct: 117 NVTATGGGFKGV-----VESKHSAIPT------------------------------ELV 141 Query: 152 MVLDVSRSMEDLYLQKH--NDNNNMTSNKYLLPPPPKKSFWSKNTTK-SKYAPAPAPANR 208 +VLDVS SM N T S S A P Sbjct: 142 LVLDVSGSMSPNIQSLKSILSNALNTIQSQSNNANDLDSVSISIVPFDSGVATHRPPWLS 201 Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI-----------AYNIGIVGNQCTPLSN 257 + I L ++ + + PL+N Sbjct: 202 EETAGIYCIDGLSYRNGDFSASLTVDNLATLHSERPVKFTPPSKWLSDCNQESPMLPLTN 261 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK--KFVIFITDGE 315 + V++ +N L T +Y + R+L + + S+ + + ++ TDG Sbjct: 262 VFSRVQNSINSLTANGGTRSYQGLVWGVRQLIPSWQQAWGMKVSSVPETRRKLVLFTDGA 321 Query: 316 NSGASAYQNTLNTLQICEYM-RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 + G + N L C + G+++ + P C + + F+ ++ Sbjct: 322 DEGDAF--NQLVNAGFCTTAIKQYGIEMNFIGYGVSPSRITQFENCAGNPLRVFSATNTT 379 Query: 375 ELLESFDKI 383 +L E F I Sbjct: 380 QLNEYFSDI 388 >gi|304393172|ref|ZP_07375100.1| Flp pilus assembly protein TadG [Ahrensia sp. R2A130] gi|303294179|gb|EFL88551.1| Flp pilus assembly protein TadG [Ahrensia sp. R2A130] Length = 692 Score = 93.4 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 44/309 (14%), Positives = 88/309 (28%), Gaps = 53/309 (17%) Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + ++ L + + ++ + N YL + Sbjct: 384 TNPSYRPHWDDRIEYNRGNLIRPTNTQQVWEDYQRIDEKDFQNNYLDDAHNFDDKNGADH 443 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 KS+ A N+ E N + N + Sbjct: 444 VKSEANTGYASGNQ--YKRQEWINKYFTDDGGNRPSVGNSNPLGMGAGPNSMCSSVSVSD 501 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE-SSHNTIGSTRLKKFVIFITD 313 L++N N +++L + TN + +R L + + + KK +I +TD Sbjct: 502 LTDNKNTTQAKLTSMQASGATNVQMGVAWGWRTLSPGEPFTEGRPYDAEDNKKIMIIMTD 561 Query: 314 GENSGASAYQNTLNT--------------------------------------------L 329 G N+ Sbjct: 562 GNNTYYPTNIYGNQYAQDNKSFYGGHGHSVKGRIFDGYDGEANPGHNSQTFTKAMDEHLT 621 Query: 330 QICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDS----SGQFFAVNDSRELLESFDKI 383 + C +NAG+ IYS+A P + L C S +F N++ L+++F+KI Sbjct: 622 ETCTNAKNAGITIYSIAFDVPNGSSVKATLEDCASSDVGGGKLYFDANNNAALIDTFEKI 681 Query: 384 TDKIQEQSV 392 +++ + + Sbjct: 682 AERLADLRI 690 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 34/350 (9%), Positives = 88/350 (25%), Gaps = 24/350 (6%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 TA+ + V + I D A + R QS ++ + Sbjct: 38 TALSLPVMLMAIGAGADYAELYRARVNFQS-----------AVDAGAIAAAKNLAATGQV 86 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYE--IPTENLF 119 K ++ + + ++ I + ++ + Sbjct: 87 QTSKDIGEEVFRSNLSHLGEKAVREGQINFDMGDGDCAVQGVITTATLPHDRFFSLSFVD 146 Query: 120 LKGLIPSALTNL-----SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 + + N I I +VLD S SM + Sbjct: 147 QSQQKGFGANKIVKGQEEFILSASSTVECGNDTIEIALVLDNSGSMRWNGKIGTLRQASN 206 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 + + L + + + A N + + +++ G + Sbjct: 207 SLVETLHTTMGSANKAIQFSVVPFAATVNVGTNNRNEPWMDTQGRSSTHWEMIDPSTSTD 266 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNL--NEVKSRLNKLNPYENT-NTYPAMHHAYRELYNE 291 G L ++L + + + +T + P++ + + Sbjct: 267 FSFSGGRYLQSNGQALSRFTLYDSLPNTSWQGCVEQRPHPYHTQDDTPSISNPDTLIVPS 326 Query: 292 K-ESSHNTIGSTRLKKF--VIFITDGENSGASAYQNTLNTLQICEYMRNA 338 + + + K+ ++ D + +N T++ C + Sbjct: 327 FAPDTPDNWDNDYNKRLSNIVVGADPHCTRFQGSKNGRRTIRYCNRWSDN 376 >gi|301609920|ref|XP_002934508.1| PREDICTED: vitrin-like [Xenopus (Silurana) tropicalis] Length = 779 Score = 93.4 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 53/384 (13%), Positives = 110/384 (28%), Gaps = 47/384 (12%) Query: 27 NQMQSALD---AAVLSGCASIVSDRTIK-------DPTTKKDQTSTIFKKQIKKHLKQG- 75 N+M ALD + L G D + + + K+ I +K ++ + Sbjct: 418 NEMAEALDVGLSGPLMGVVQYGDDPSTEFGLGAHFNVGDLKNAIEKIPQKGGHSNVGKAL 477 Query: 76 -----SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 +Y +++G+ + + P + E+ N+F + + Sbjct: 478 SYVNKNYFSDSSGNRGSAPNVAVVLVDGWPTDKVEEASRLARESGINIFFVTIEGADDNE 537 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 S + ++V + K Sbjct: 538 RSSVVEPNF-VDKAVCRTNGHFSINVPSWFGLHK---VVNPLVNRMCDIDKLVCSKTCLN 593 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + + + VL + N+ N + + + RIG + Y Sbjct: 594 AADIGFVIDGSSSVGTGNFRTVL-QFIANITNEFEISDTD-----TRIGAVQYTYEQRLE 647 Query: 251 QCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + +V + + ++ T+T A+ +A +L++ +K +I Sbjct: 648 FGFDKYSTKQDVMNAIMRIGYWSGGTSTGAAITYASEQLFS--------KSKPNKRKILI 699 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QF 367 ITDG + + + G+ Y+V + QD L Sbjct: 700 VITDGRSYD--------DVRAPAAAVHRNGVIAYAVGI--AWAAQDELESIATDPDKDHS 749 Query: 368 FAVNDSRELLESFDKITDKIQEQS 391 F V D L + KI I + Sbjct: 750 FFVEDFDSLYKFVGKIFQNICTEY 773 >gi|260781661|ref|XP_002585922.1| hypothetical protein BRAFLDRAFT_90332 [Branchiostoma floridae] gi|229270990|gb|EEN41933.1| hypothetical protein BRAFLDRAFT_90332 [Branchiostoma floridae] Length = 4065 Score = 93.4 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 43/404 (10%), Positives = 102/404 (25%), Gaps = 46/404 (11%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAA----VLSGCASIVSDRTIKDPTTKKDQTSTIF---- 64 A+D + ++SA D +L S + P T+ Sbjct: 1382 TGAALDFVRQTMM---VESAGDRTMSPNILVVATDGESSDDQRTPAEVLRNAGTLVYAVG 1438 Query: 65 --KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 L + A + + + A P Sbjct: 1439 IGAGVSSTTLLDIAGYNSRVLQATDFASLEVIGRELQEFICNAAYCGDPGTPEFGSRSGD 1498 Query: 123 LIPSALTNLS---------LRSTGIIERSSENLAISICMVLD--VSRSMEDLYLQKHNDN 171 ++ +T + + + IC+ D + + + Sbjct: 1499 FFEGSVVTFQCDPTYTLLGSATTTCFGNGTWSDPVPICVSPDPCNNSPCLNGGTCQRVGL 1558 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK---AI 228 + + + + + +L+ + Sbjct: 1559 TTEFTCLCPEGYHGPICQFYAACSNRTLNLDVVFLLDGSGSVGSANFDLLKTFTTRIATN 1618 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRE 287 + R+G + Y+ + EV + + ++ T+T A+ + + Sbjct: 1619 FDVSTNLTRVGVVQYSDQTNSEFVLNTFSTEAEVLAAIAAISYQNGGTSTGAALDYVRQN 1678 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ +I +TDG +S ++ RNAG+ IYSV + Sbjct: 1679 VFISASG-----DRPDAANILIVLTDGVSSDDVSFP--------AMAARNAGITIYSVGI 1725 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + L++ + L + I +++E Sbjct: 1726 G-DGVDYNTLQQIAGDPNKVLQATGFSSLDD----IGGQLEELV 1764 Score = 81.9 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 42/338 (12%), Positives = 93/338 (27%), Gaps = 23/338 (6%) Query: 54 TTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEI 113 S + + +L + + + A+ T E Sbjct: 2274 EAIAQDQSRVLQANTFTNLSNTAQALQESLGDARYCGTPGTPQDGYTYGLFFEGSTVTYS 2333 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 L G + N + ++ + + C + + Sbjct: 2334 CEVGFTLDGASSAQCINGTWDNSPPTCIPPDPCLANPCY--NGGTCVRTGLTTDFTCTCP 2391 Query: 174 MT-SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 + + + + +++ + A LV++ + + + Sbjct: 2392 SDYTGDTCQSYTACQGRSLDFDLALLLDGSGSVGADNFNLVKQFAKRLVDNFEISQTD-- 2449 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNE 291 ++G + Y+ + V +N + TNT A+ + +E++ Sbjct: 2450 ---TKVGVVQYSSSSNVEFYLNAFSTKQAVLDAINAVTYQQGGTNTGAAITYTMQEIFAS 2506 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + +I +TDGE+S A RNAG IY+V V Sbjct: 2507 ANGARA-----NYPDVLIVVTDGESSDDVAVP--------ALSARNAGTLIYAVGVGNGV 2553 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 LL+ ++GQ D L + + + Sbjct: 2554 NQATLLQ-IAGNAGQVLQAADFAGLTTVVQSLQQNLCD 2590 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 43/383 (11%), Positives = 93/383 (24%), Gaps = 55/383 (14%) Query: 32 ALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQI 91 + +A VL+ A+I D + I D + Sbjct: 1647 STEAEVLAAIAAISYQNGGTSTGAALDYVRQNVFISASGDRPDAANILIVLTDGVSSDDV 1706 Query: 92 NITKDKNNPLQYI---------AESKAQYEIPTENLFLKGLIPSALT-----NLSLRSTG 137 + + +I + + + L Sbjct: 1707 SFPAMAARNAGITIYSVGIGDGVDYNTLQQIAGDPNKVLQATGFSSLDDIGGQLEELVCD 1766 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 I+ + + N L WS+ + Sbjct: 1767 ATYCGDPGAPINGFRNGNFFEGGTVTWGCF----NGFNLVGALTAVCLGNGSWSEPVPEC 1822 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS--------------------VR 237 P P D L +LV + + N + Sbjct: 1823 MAPTTPPP--PGCDELSFGGWDLVFLLDGSGSVGSNNFLNVKNFTKLITDLFPVGDNATK 1880 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSH 296 +G + ++ I + E+ S ++ ++ T T A+ + + +N + Sbjct: 1881 VGLVQFSDTIQKEFDLRDYDTKAEILSAIDNISYLGGGTYTGNAIDYVRQVSFNTINGNR 1940 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + +I +TDGE+ + + R+ G+ I+++ V Sbjct: 1941 GSHPD-----MLIVLTDGESFDPVTF--------ASQSARDQGITIFAIGVGTGV-DYAT 1986 Query: 357 LRKCTDSSGQFFAVNDSRELLES 379 L + + V D +L Sbjct: 1987 LEEIAGDPQKVQQVTDFADLTSV 2009 Score = 73.4 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 37/352 (10%), Positives = 92/352 (26%), Gaps = 29/352 (8%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 G + + + + L+Q GD N Sbjct: 2552 GVNQATLLQIAGNAGQVLQAADFAGLTTVVQSLQQNLCDAAYCGDPGAPVNGNRVGSYFV 2611 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 A Y I + + + + + Sbjct: 2612 GNTVTFSCSAGYIIQGSATATCQSSGQWTSAVPTCV--------ASDPCPANPCQNGATC 2663 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKK-SFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + ++ + + + + ++L Sbjct: 2664 TQVGSTTQYTCTCPQGYSGNNCEVFSACTDRALDVDLVVLLDGSGSVGSDNFNLLKAFTQ 2723 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNT 277 N+V + + + R+G + Y+ EV + + ++ T T Sbjct: 2724 NIVGNF-----DIAVNNTRVGVVQYSDFNNIEFNLNAYATEAEVLAAIGAISYQRGGTFT 2778 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ ++++ + ++ +TDGE+S + A + N Sbjct: 2779 GAAIDFVRQDVFTTAGGNRADKPD-----ILLVLTDGESSDSVAGP--------AQNTLN 2825 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 AG+ IY+V + + L++ G+ V D + L +++ + + Sbjct: 2826 AGITIYAVGIGSGVNADT-LQEIAGDPGRVLQVADFQGLAAITNQLQEALCS 2876 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 35/360 (9%), Positives = 97/360 (26%), Gaps = 31/360 (8%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 AA G + + D + + G A D Q++ + Sbjct: 598 AATNVGAVQYSDTVRSEFFLSSFDTDFEVVRALDGISYLAGGTFTGFALDFVQQSAFSPV 657 Query: 95 KDKNNP---LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 + + + + A+ + + S + I Sbjct: 658 AGARDGYPDILVVVTDGVSQDDVVAPAESARKEGIAVFAVGIGSAVDYATLLQIAGIDGR 717 Query: 152 MVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 ++ + + + T Y+ + + + + D Sbjct: 718 IL--------QINNFVDLASASQTLPDYICNYASCQGRDVPLDIVFLLDGSGSVGSANFD 769 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 ++ + L + + +IG + Y+ + +N +V + ++ ++ Sbjct: 770 LVKDFTRTLARNF-----DIAANMTQIGVVQYSDTVNREFGLGDFHNRQDVLNAISAVSY 824 Query: 272 Y-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T T A+ + + + + +I +TDG + + Sbjct: 825 QQGGTLTGAAIDFVRQTSFTTGDG-----DRPDVPNMLIVVTDGVSGDSVQGP------- 872 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + R G+ + V + + LL S + +D L ++ + + + Sbjct: 873 -ADAARREGITTFGVGIGNGIDFGTLLE-IAGDSARVLQADDFGALATVAQRLQEVVCDL 930 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 29/163 (17%), Positives = 55/163 (33%), Gaps = 15/163 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYR 286 + + RIG Y+ + +EV + +N ++ T T A+ + Sbjct: 1332 NFDVSPNATRIGVAQYSDTNSLEFNLNRYSTKDEVLNAVNGISYQGGGTYTGAALDFVRQ 1391 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + E T ++ TDGE ++ + E +RNAG +Y+V Sbjct: 1392 TMMVESAG-----DRTMSPNILVVATDGE--------SSDDQRTPAEVLRNAGTLVYAVG 1438 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + A LL + + D L ++ + I Sbjct: 1439 IGAGVSSTTLL-DIAGYNSRVLQATDFASLEVIGRELQEFICN 1480 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 29/267 (10%), Positives = 75/267 (28%), Gaps = 22/267 (8%) Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 S + C+ +D L + S + + + Sbjct: 2057 AGSSSITCQPDGQWTDVVPTCLFIDNCDPNPCLNGAQCFQTA--DSYRCTCAEGYEGTNC 2114 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ---KAIQEKKNLSVRIGTIAYNI-- 245 T + + S+ +L+ S + S R+G + Y+ Sbjct: 2115 EIYTALNAQTFDLVFLLDGSGSVGASSFDLMKSFTNRITTNFDVSPTSTRVGVVQYSSQG 2174 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 + +N ++V + +N + NT T A+++ + + + Sbjct: 2175 SVATEFRLDSYSNKDDVIAAVNGIVYQNGNTYTGEALNYVRQNSFAVANGGRA-----DV 2229 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 ++ ITDG++ + + G+ +Y++ + L Sbjct: 2230 ANILVVITDGQSVDDVTGP--------AQDLLREGVTVYALGIG-DGIQYSTLEAIAQDQ 2280 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQS 391 + N L + + + + + Sbjct: 2281 SRVLQANTFTNLSNTAQALQESLGDAR 2307 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 67/196 (34%), Gaps = 22/196 (11%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG-------TIAYNIGIVGNQCTP 254 + + + +++ + A + + N+S R + Y+ + Sbjct: 559 SASITSPNFELVKDFAERVARHFTISSSRNDNMSYRSFTAATNVGAVQYSDTVRSEFFLS 618 Query: 255 LSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 + EV L+ ++ T T A+ + ++ + ++ +TD Sbjct: 619 SFDTDFEVVRALDGISYLAGGTFTGFALDFVQQSAFSPVAGAR-----DGYPDILVVVTD 673 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 G + + + E R G+ +++V + + + LL+ G+ +N+ Sbjct: 674 GVSQD--------DVVAPAESARKEGIAVFAVGIGSAVDYATLLQ-IAGIDGRILQINNF 724 Query: 374 RELLESFDKITDKIQE 389 +L + + D I Sbjct: 725 VDLASASQTLPDYICN 740 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 21/191 (10%), Positives = 68/191 (35%), Gaps = 20/191 (10%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + +++ + +V + + +G + Y+ + + Sbjct: 1029 SGSVGTTNFELVKDFTSEVVLNFNISADTTN-----VGVVQYSDTVRNEFFLSSYDTKLP 1083 Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + +N+++ T T A+ + + ++ + NT ++ +TDG++ Sbjct: 1084 LIDAINQISYLTGGTLTGFAIDYVRQSSFSRPAGARNTFPD-----VLVVLTDGQSQDDV 1138 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 R+ G+ I++V + + + LL+ + + + D L+ Sbjct: 1139 VSS--------AAAARSQGITIFAVGIGSEVDFTTLLQ-ISGYPSRILQIQDFATLVTEG 1189 Query: 381 DKITDKIQEQS 391 ++ + I + + Sbjct: 1190 RRLPEIICQST 1200 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 52/172 (30%), Gaps = 16/172 (9%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + + RIG + Y+ + EV ++ + +T ++ Sbjct: 381 TMKFDVSSDITRIGVVQYSTDVNTEFELKTYATEAEVIHAISNITRQRGSTFIGAGINFV 440 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + + ++ ITDG + + + R+ G+ YS Sbjct: 441 RTNSFTVAAGDRPLAPN-----ILVTITDGI--------SADDVAGPAQAARDQGILTYS 487 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + + + LL + + F V EL +T + E + P Sbjct: 488 IGIGEEIQWPTLL-SIAGARHRVFNVTSFSELPGIEASLTALLCE-VLPTTP 537 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 51/166 (30%), Gaps = 15/166 (9%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 A E + R+G + Y+ GI EV + + L+ +T T + Sbjct: 163 TAGFEVSSSQTRVGVVQYSTGINTEFDLNSFATEAEVINAIRGLSHQRGSTFTGAGITFT 222 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 E + +I ITDG ++ + E R + YS Sbjct: 223 RLESFTGASG-----DRPDAPNVLIVITDGISADSVDAP--------AEAARADNITTYS 269 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + LL + V +L + + + + E+ Sbjct: 270 IGIGDEINYLTLL-SIAGMRERVLNVTTFGDLNDLDEVLLQILCER 314 >gi|257062895|ref|YP_003142567.1| hypothetical protein Shel_01450 [Slackia heliotrinireducens DSM 20476] gi|256790548|gb|ACV21218.1| uncharacterized protein [Slackia heliotrinireducens DSM 20476] Length = 744 Score = 93.4 bits (230), Expect = 6e-17, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 68/193 (35%), Gaps = 30/193 (15%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ++ + ++I K+ + ++Y+ ++N +K+ + L Sbjct: 397 LNETKTATREFASTIFKSDADV-------CLVSYDSSAR--NVIDSTDNEYALKAAVRDL 447 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + TN A+ +Y L K+ ++ ++DGE + + + Sbjct: 448 SAGGGTNIEDALRVSYERLEGSGSD----------KRIIVLMSDGEANEGLVGDDL---I 494 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDL----LRKCTDSSGQFFAVNDSRELLESFDKITD 385 +++ G+ IY++ + + G + V+D+ +L F I D Sbjct: 495 AYANEIKDDGVTIYTLGFFQSVSDKAECQRVMEGIAS-PGCHYEVDDASQLRYFFGDIGD 553 Query: 386 KIQEQS---VRIA 395 I VRIA Sbjct: 554 DINGTRFIYVRIA 566 >gi|152994336|ref|YP_001339171.1| hypothetical protein Mmwyl1_0295 [Marinomonas sp. MWYL1] gi|150835260|gb|ABR69236.1| conserved hypothetical protein [Marinomonas sp. MWYL1] Length = 528 Score = 93.0 bits (229), Expect = 7e-17, Method: Composition-based stats. Identities = 61/488 (12%), Positives = 131/488 (26%), Gaps = 106/488 (21%) Query: 2 TAIIISVC-FLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 A++I V + +++A+D ++ Q++ A D A L+ +++ D Sbjct: 37 FAVLIIVGGVIAVSFAVDTTRMVNSSAQLKRATDVAALAIGNIQLTNGNDDDV-----DL 91 Query: 61 STIFKKQIKKHLK-QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA---------- 109 I + +L I + + + + + +++ Sbjct: 92 QKIASGYVLSNLGMDSGLINQIEAEQVTVTKGEVDGSPTYKVSVSLIAQSDLLKAGGQEQ 151 Query: 110 ----QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSS--------------ENLAISIC 151 E+ + + ++P++ T + + S Sbjct: 152 VIFSTVEVVSRPTEVALILPNSGTEDRGELAALRKVSKEFARNLLGDESAGVSPTQKVWL 211 Query: 152 MVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP------PKKSFWSKNTTKSKYAPAPAP 205 ++ S+S+ + N + L PP K+ + Sbjct: 212 SLVPFSQSVNVYDESDPDRINRWAAFGELNPPELRSLFRTGKARSLADPRFPDRKANLLC 271 Query: 206 ANRKIDVLIESAGNL------------------------VNSIQKAIQEKKNLSVRIGTI 241 +R + + ++ + + I Sbjct: 272 VHRGLSAGQNFFWDQPPSGQFEIYYRHDLPENGSPGAPPISWVGPNPDFPDTQAEDTRWI 331 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 + G PL++NL E+ +RL+K++ N N AM A L + Sbjct: 332 VADKGCPDAALLPLTDNLAEIDARLDKMSTRFNVNYAIAMSWAGAALSPNMRGGAGWGNN 391 Query: 302 TRL------------------------KKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 DGE + S +C + Sbjct: 392 ELPLDFSLDGNNVKVIVMLVNTIGDWFDTDAYNFNDGETTNDSMAFARQRFSDLCRDFNS 451 Query: 338 AGMKIYSVAV---SAPPEGQDL--------LRKCTDSSGQFFAV------NDSRELLESF 380 +K + + V G+ L LR+C G F ++ S Sbjct: 452 KNIKFFFIGVRPGDPEDWGRTLFADVAGPGLRECASGDGNFHFADASNFSEGQSQISSSL 511 Query: 381 DKITDKIQ 388 +KI D I+ Sbjct: 512 EKIADDIR 519 >gi|319784437|ref|YP_004143913.1| hypothetical protein Mesci_4754 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170325|gb|ADV13863.1| hypothetical protein Mesci_4754 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 553 Score = 93.0 bits (229), Expect = 7e-17, Method: Composition-based stats. Identities = 61/534 (11%), Positives = 135/534 (25%), Gaps = 143/534 (26%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDP--------- 53 + + V + +A D++ +M ++ +Q+ALD+A L+ D T D Sbjct: 19 GLGLPVILTAVAFATDVSTLMRAKSNLQNALDSANLASSHLGDLDITRNDAFNRYFQANI 78 Query: 54 ---------------------TTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN 92 K S +I +A + Q+ Sbjct: 79 VGHGELDNAQATLTVDKGVNFVKTKAVASADVHLNFAFLFGDSKHIVVDASAVESNNQLE 138 Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKG-LIPSALTNLSLRSTGIIERSSENLAISIC 151 + +N T++L S + + + N Sbjct: 139 VVLVLDNTGSMAGARMTALRTATKSLLDTLEAAKSPTRKIRASPVPFVTAVNVNGDGFDP 198 Query: 152 MVLDV------------------------------SRSMEDLYLQKHN------------ 169 +D+ + + + Sbjct: 199 SWIDMDGKSSTNGVNFPVIDGKRPNHMALFKQLKKTGWADAGWNGTGWKGCVEARPGAYN 258 Query: 170 ------DNNN--MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 D Y P P+ + + + + A + + Sbjct: 259 ISDTPPDPAKPDTLFVPYFAPDDPEDAQKPSGSYGNSDKYYNNSYLNDVSDKTRLAQSGI 318 Query: 222 NSIQKAIQE----------KKNLSVRIGTIAY------------------NIGIVGNQCT 253 N + + + R Y Sbjct: 319 NILGIDLSNLLGNLIELLSPDDRDAREKIAKYVAPAKELITEIGSPVTVGPNRACPTPVV 378 Query: 254 PLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKESSHNTI-GSTRLKKFVIF 310 PL+++ ++++ +++ TN + R L + + + K V+ Sbjct: 379 PLTDDFDKLRKAASQMTEWNGSGTNVSEGLSWGMRVLSPAAPYTDGAPWKTPGISKIVLL 438 Query: 311 ITDGEN---------------------SGASAYQNTLNTLQ---------ICEYMRNAGM 340 +TDGEN +G + T +C ++N G+ Sbjct: 439 LTDGENVVYGASEQEPTKSDYTSYGYLAGGRFGSDNQTTAARNVDGWTKNVCTQLKNEGV 498 Query: 341 KIYSVAVSAP-PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +IY++ + + + L C ++AVND +L F +I + + + Sbjct: 499 QIYTMVLQSDTAANRALYSACASDPSNYYAVNDPTKLPNVFLQIANNFTKLQLT 552 >gi|126303712|ref|XP_001380869.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 628 Score = 93.0 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 47/386 (12%), Positives = 98/386 (25%), Gaps = 47/386 (12%) Query: 8 VCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQ 67 + + D + ++ + R +K K Q + Sbjct: 280 PLLGIVQFGDDPSMEFNLKTH----------------ANSRDLKAAIEKIPQKGGLSNVG 323 Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA 127 ++ G+ + I P + E+ N+F + + Sbjct: 324 RALSFVTKNFFSNANGNRGGAPNVAIVMVDGWPTDKVEEASRLARESGINIFFITIEGAV 383 Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 + L+V + K Sbjct: 384 ENEKQNVIEPNF-VDKAVCRRNGFYSLNVLSWFSLHKI---VQPLVKRVCDTNRLACSKT 439 Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 S + + +L + NL + + + RIG + Y Sbjct: 440 CLNSADIGFVIDGSSSVGTGNFRTLL-QFVANLSKEFEISDTD-----TRIGAVQYTYEQ 493 Query: 248 VGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 + ++ + + ++N T+T A+++A L+ +K Sbjct: 494 RLEFGFDKYSTKQDILNAIKRVNYWSGGTSTGAAINYALEHLF--------KKSKPNKRK 545 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-- 364 +I ITDG + G+ YS+ + QD L Sbjct: 546 LMILITDGRSYDDVRIP--------AMAAHQNGVITYSIGI--AWAAQDELEVIATHPTK 595 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQ 390 F V++ +L +S KI I + Sbjct: 596 DHSFFVDEFDDLYKSVPKIIQNICTE 621 >gi|260426558|ref|ZP_05780537.1| thrombospondin type 3 repeat family protein [Citreicella sp. SE45] gi|260421050|gb|EEX14301.1| thrombospondin type 3 repeat family protein [Citreicella sp. SE45] Length = 1088 Score = 93.0 bits (229), Expect = 8e-17, Method: Composition-based stats. Identities = 43/402 (10%), Positives = 106/402 (26%), Gaps = 51/402 (12%) Query: 10 FLFITYAIDLAHIMYIRNQMQSALDAAVLSGCA-----------SIVSDRTIKDPTTKKD 58 ++ Q+ +A+D A + G +D ++ + Sbjct: 277 LPAAVISLFGTRTDMAEWQITAAVDGAAIGGSQGVISVVESWTLQFNADGSLASVSEASP 336 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQ-----------YIAES 107 + L G + IT + Sbjct: 337 SMNVTGLTTGAGDLTVALDFGTIGGTDGLTTGVGITSVSDTHDGTSQCNAADCAQRWNAG 396 Query: 108 KAQYEIPTENLFLKGLIPSALT----NLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 ++E + L GL + E ++ V DV S + + Sbjct: 397 TGRFETTQQTLVNGGLSGWDTLVRNYPFTTAPALPAEAPPATCFRAVSFVDDVVGSDQIM 456 Query: 164 YLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS 223 + + + + + + A+ +I+ + + V+ Sbjct: 457 MIVDRS----GSMSWSSNSGQAEVCLNGLDDDNDGTVDEGDCADSRIEFVRAAGRAFVDL 512 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMH 282 + + L+ N + K ++ L+P +T A Sbjct: 513 QTSQGIDLG------LLEFNEGNTLLRPIDTLNAGNAQDYKDAIDALSPGGDTAIGDAFD 566 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 + E E + +TDG N+ + + E + + G++I Sbjct: 567 ASTGEFTRVAEVGRV--------RTAYLLTDGFNTAGG------DPVAAAERLDDIGVRI 612 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 +++ + ++L + + GQ + + L F ++ Sbjct: 613 HAIPAGNDVDREELTDIASGTGGQVYEARNVNALTGIFAELA 654 >gi|12850399|dbj|BAB28702.1| unnamed protein product [Mus musculus] Length = 650 Score = 92.7 bits (228), Expect = 9e-17, Method: Composition-based stats. Identities = 45/352 (12%), Positives = 99/352 (28%), Gaps = 31/352 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + ++ + +K K Q + ++ + G+ + + P Sbjct: 320 KTHMNSQDLKTAIEKITQRGGLSNVGRAISFVTKTFFSKANGNRGGAPNVAVVMVDGWPT 379 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + E N+F + +A ++ S + +V + Sbjct: 380 DKVEEVSRVARESGINVFFITVEGAAERDIQHVVEPGFA-SKAVCRTNGFYSFNVQSWLS 438 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 K S + + + VL + NL Sbjct: 439 LHKT---VQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSMGTSNFRTVL-QFVANLS 494 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPA 280 + + + R+G + Y N+ ++ S + ++ T+T A Sbjct: 495 KEFEISDTD-----TRVGAVQYTYEQRLEFGFDKYNSKADILSAIRRVGYWSGGTSTGAA 549 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + +A +L+ +K +I ITDG + + C+ G+ Sbjct: 550 IQYALEQLF--------KKSKPNKRKVMIIITDGRSYDDVRIP----AMAACQ----KGV 593 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 Y++ + QD L F V+D L + +I I + Sbjct: 594 ITYAIGI--AWAAQDELEVMATHPAKDHSFFVDDFDNLYKIAPRIIQNICTE 643 >gi|197250621|ref|YP_002147271.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Agona str. SL483] gi|197214324|gb|ACH51721.1| von Willebrand factor type A domain protein [Salmonella enterica subsp. enterica serovar Agona str. SL483] Length = 598 Score = 92.7 bits (228), Expect = 9e-17, Method: Composition-based stats. Identities = 37/369 (10%), Positives = 99/369 (26%), Gaps = 31/369 (8%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 AA L+ + + +T + K + ++ Sbjct: 79 AASLA--DPKAGSLATAEAPQHEMRTRAVASKAFAAQ-GGNVMGTARYEHYDENPIKQVS 135 Query: 95 KDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM-- 152 + + Y L L P + Sbjct: 136 QAPLATFSLDVD-TGSYANVRRFLNQGQLPPPEAVRVEEMLNYFPAPQPVADKQDNTKPI 194 Query: 153 --VLDVSRSMEDLYLQKHNDNNNMTS-NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK 209 + + +++ + + N + Sbjct: 195 AACIPMPFAVKYELAPSPWNAQRTLLKVDVQARDMQTRDLPPANLVFLIDTSGSMQPAER 254 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + ++ + LVN ++ I + Y G + NN +K+ ++ L Sbjct: 255 LPLIRSALKLLVNDLRAQDN--------ITIVTYAGGTHVALASTAGNNTTAIKAAIDNL 306 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + Y +T + AY + ++ TDG+ + + + Sbjct: 307 DAYGSTGGEAGLRLAYEQAEKGFIKGGVNR--------ILLTTDGDFNLG--ITDPKDIE 356 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ 388 + + R G+ + ++ V + ++ + D +G + ++ L E+ + D++ Sbjct: 357 ALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS---LSEAQKVLKDEMH 413 Query: 389 EQSVRIAPN 397 + V +A + Sbjct: 414 QTLVTVAKD 422 >gi|126173282|ref|YP_001049431.1| von Willebrand factor type A [Shewanella baltica OS155] gi|125996487|gb|ABN60562.1| von Willebrand factor, type A [Shewanella baltica OS155] Length = 642 Score = 92.3 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 42/377 (11%), Positives = 102/377 (27%), Gaps = 36/377 (9%) Query: 22 IMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIREN 81 + +M+ DAA + + + + + + L Q + ++ Sbjct: 82 ASQRQAEMR---DAAKVEMARVAAPMQMSSNGAVMGMSIAPMPRDYAAIPLAQNKFEQQV 138 Query: 82 AGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIER 141 I +I ++ + Y L L + Sbjct: 139 QNGIMVAGEIPVS-----TFFIDVD-TGSYATLRRMLREGRLPEKGTVRVEEMLNYFAYD 192 Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 + V+ + + Y L PK + N Sbjct: 193 YPLPAKNAAPFS--VTTELAPSPYNDDMMLLRIGLKGYDL---PKSQLGASNLVFLLDVS 247 Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + K+ +L + L + ++ + Y N+ Sbjct: 248 GSMASADKLPLLQTALKLLTAQLSAQD--------KVSIVVYAGAAGVVLDGVSGNDTQT 299 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L +L+ + N + AY+ + VI TDG+ + Sbjct: 300 LTYALEQLSAGGSINGGQGITQAYQLAKKHFIPNGINR--------VILATDGDFNVGVT 351 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESF 380 + + + + E ++ G+ + ++ L+ + D G + ++ L E+ Sbjct: 352 DFD--DLIALIEKEKDHGIGLTTLGFGLGNYNDQLMEQLADKGNGNYAYIDT---LNEAR 406 Query: 381 DKITDKIQEQSVRIAPN 397 + D++ IA + Sbjct: 407 KVLVDELSSTLFTIAKD 423 >gi|18042139|gb|AAL57848.1|AF454755_1 vitrin [Mus musculus] Length = 650 Score = 92.3 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 44/352 (12%), Positives = 95/352 (26%), Gaps = 31/352 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + ++ + +K K Q + ++ + G+ + + P Sbjct: 320 KTHMNSQDLKTAIEKITQRGGLSNVGRAISFVTKTFFSKANGNRGGAPNVAVVMVDGWPT 379 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + E N+F + +A + S + +V + Sbjct: 380 DKVEEVSRVARESGINVFFITVEGAAEREKQHVVEPVFA-SKAVCRTNGFYSFNVQSWLS 438 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 K S + + + VL + NL Sbjct: 439 LHKT---VQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTSNFRTVL-QFVANLS 494 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPA 280 + + + R+G + Y N+ ++ S + ++ T+T A Sbjct: 495 KEFEISDTD-----TRVGAVQYTYEQRLQFGFDKYNSKADILSAIRRVGYWSGGTSTGAA 549 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + +A +L+ +K +I ITDG + G+ Sbjct: 550 IQYALEQLF--------KKSKPNKRKVMIIITDGRSYDDVRIP--------AMAAYQKGV 593 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 Y++ + QD L F V+D L + +I I + Sbjct: 594 ITYAIGI--AWAAQDELEVMATHPAKDHSFFVDDFDNLYKIAPRIIQNICTE 643 >gi|77456411|ref|YP_345916.1| von Willebrand factor, type A [Pseudomonas fluorescens Pf0-1] gi|77380414|gb|ABA71927.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 563 Score = 92.3 bits (227), Expect = 1e-16, Method: Composition-based stats. Identities = 41/402 (10%), Positives = 116/402 (28%), Gaps = 42/402 (10%) Query: 4 IIISVCFLFITYAIDLAHIMY-----IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKD 58 + SV + + + + +++S++ + +++ + P Sbjct: 11 LAASVLLVVAGCGVSSSPETTVAPPPAQTELKSSVQPEAVMADSAMAKRSALTAPIAS-- 68 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQI--NITKDKNNPLQYIAESKAQYEIPTE 116 + QG + ++T+ + + Y Sbjct: 69 ----FAPMPAGESYPQGYRDEQREQYAKLADNPIHSVTETPVSTFSADVD-TGAYANVRR 123 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 L L P L + S V + + S Sbjct: 124 LLNQGRLPPEGAVRLEEMVNYFPYDYALPSDGSPFGVTTELAASPWNPHTRLLRIGIKAS 183 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 ++ + P + + + S P ++ + LV+ +++ Sbjct: 184 DRAVAELAPANLVFLVDVSGSMDRREGLP------LVKSTLKLLVDQLREQD-------- 229 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 R+ + Y ++++ + +L +T + AY+ Sbjct: 230 RVSLVVYAGESSVVLEPTSGREKAKIRTAIERLTAGGSTAGASGIELAYQMAQQAFIPKG 289 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 ++ TDG+ + ++ ++L Q+ R G+ + ++ + L Sbjct: 290 INR--------ILLATDGDFNVGTSDFDSL--KQMAVDKRKTGISLTTLGFGVDNYNEHL 339 Query: 357 LRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + D+ G + +++ L E+ + D++ +A N Sbjct: 340 MEQLADAGDGNYAYIDN---LREARKVLVDQLGSTLAVVAKN 378 >gi|194433366|ref|ZP_03065646.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1012] gi|194418460|gb|EDX34549.1| von Willebrand factor type A domain protein [Shigella dysenteriae 1012] gi|320178755|gb|EFW53718.1| hypothetical protein SGB_04028 [Shigella boydii ATCC 9905] gi|332090753|gb|EGI95846.1| von Willebrand factor type A domain protein [Shigella dysenteriae 155-74] Length = 575 Score = 91.9 bits (226), Expect = 1e-16, Method: Composition-based stats. Identities = 38/379 (10%), Positives = 107/379 (28%), Gaps = 48/379 (12%) Query: 21 HIMYIRNQMQSALDAA-VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIR 79 + +Q L A + A + T + Q KQ+ ++ + Sbjct: 68 QQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLD 127 Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + G A + P E Y + S + Sbjct: 128 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW-----DIKDKQSIPASKPIPFAM 182 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + L + + +N+ Sbjct: 183 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI--------------------D 222 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 223 TSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSHK 274 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQAAKGFIKGGINR--------ILLATDGDFNVG 326 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + + + + R +G+ + + V + ++ + D +G + ++ L E Sbjct: 327 --IDDPKSIESMVKKQRESGVTLSTFGVGDDNYNEAMMVRIADVGNGNYSYIDT---LSE 381 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + ++++ + +A + Sbjct: 382 AQKVLNSEMRQTLITVAKD 400 >gi|148706512|gb|EDL38459.1| vitrin, isoform CRA_a [Mus musculus] Length = 650 Score = 91.9 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 44/352 (12%), Positives = 94/352 (26%), Gaps = 31/352 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + ++ + +K K Q + ++ + G+ + + P Sbjct: 320 KTHMNSQDLKTAIEKITQRGGLSNVGRAISFVTKTFFSKANGNRGGAPNVAVVMVDGWPT 379 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + E N+F + +A S + +V + Sbjct: 380 DKVEEVSRVARESGINVFFITVEGAAEREKQHVVEPGFA-SKAVCRTNGFYSFNVQSWLS 438 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 K S + + + VL + NL Sbjct: 439 LHKT---VQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTSNFRTVL-QFVANLS 494 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPA 280 + + + R+G + Y N+ ++ S + ++ T+T A Sbjct: 495 KEFEISDTD-----TRVGAVQYTYEQRLEFGFDKYNSKADILSAIRRVGYWSGGTSTGAA 549 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + +A +L+ +K +I ITDG + G+ Sbjct: 550 IQYALEQLF--------KKSKPNKRKVMIIITDGRSYDDVRIP--------AMAAYQKGV 593 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 Y++ + QD L F V+D L + +I I + Sbjct: 594 ITYAIGI--AWAAQDELEVMATHPAKDHSFFVDDFDNLYKIAPRIIQNICTE 643 >gi|58037355|ref|NP_083089.1| vitrin isoform 1 precursor [Mus musculus] gi|114154829|sp|Q8VHI5|VITRN_MOUSE RecName: Full=Vitrin; Flags: Precursor gi|21707639|gb|AAH34120.1| Vitrin [Mus musculus] Length = 650 Score = 91.9 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 44/352 (12%), Positives = 96/352 (27%), Gaps = 31/352 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + ++ + +K K Q + ++ + G+ + + P Sbjct: 320 KTHMNSQDLKTAIEKITQRGGLSNVGRAISFVTKTFFSKANGNRGGAPNVAVVMVDGWPT 379 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + E N+F + +A ++ S + +V + Sbjct: 380 DKVEEVSRVARESGINVFFITVEGAAERDIQHVVEPGFA-SKAVCRTNGFYSFNVQSWLS 438 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 K S + + + VL + NL Sbjct: 439 LHKT---VQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSMGTSNFRTVL-QFVANLS 494 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPA 280 + + + R+G + Y N+ ++ S + ++ T+T A Sbjct: 495 KEFEISDTD-----TRVGAVQYTYEQRLEFGFDKYNSKADILSAIRRVGYWSGGTSTGAA 549 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + +A +L+ +K +I ITDG + G+ Sbjct: 550 IQYALEQLF--------KKSKPNKRKVMIIITDGRSYDDVRIP--------AMAAYQKGV 593 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 Y++ + QD L F V+D L + +I I + Sbjct: 594 ITYAIGI--AWAAQDELEVMATHPAKDHSFFVDDFDNLYKIAPRIIQNICTE 643 >gi|74196449|dbj|BAE34363.1| unnamed protein product [Mus musculus] Length = 650 Score = 91.9 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 41/323 (12%), Positives = 85/323 (26%), Gaps = 31/323 (9%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 ++ + G+ + + P + E N+F + +A Sbjct: 349 SFVTKTFFSKANGNRGGAPNVAVVMVDGWPTDKVEEVSRVARESGINVFFITVEGAAERE 408 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 S + +V + K Sbjct: 409 KQHVVEPGFA-SKAVCRTNGFYSFNVQSWLSLHKT---VQPLVKRVCDTDRLACSKTCLN 464 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S + + + VL + NL + + + R+G + Y Sbjct: 465 SADIGFVIDGSSSVGTSNFRTVL-QFVANLSKEFEISDTD-----TRVGAVQYTYEQRLE 518 Query: 251 QCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 N+ ++ S + ++ T+T A+ +A +L+ +K +I Sbjct: 519 FGFDKYNSKADILSAIRRVGYWSGGTSTGAAIQYALEQLF--------KKSKPNKRKVMI 570 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQF 367 ITDG + G+ Y++ + QD L Sbjct: 571 IITDGRSYDDVRIP--------AMAAYQKGVITYAIGI--AWAAQDELEVMATHPAKDHS 620 Query: 368 FAVNDSRELLESFDKITDKIQEQ 390 F V+D L + +I I + Sbjct: 621 FFVDDFDNLYKIAPRIIQNICTE 643 >gi|126730251|ref|ZP_01746062.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] gi|126708984|gb|EBA08039.1| hypothetical protein SSE37_10864 [Sagittula stellata E-37] Length = 614 Score = 91.9 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 45/346 (13%), Positives = 91/346 (26%), Gaps = 67/346 (19%) Query: 7 SVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKK 66 V +F ID+ H R+Q+Q+ LD AVL+ + T+ + Sbjct: 43 LVMMVFGGIGIDMMHAELKRSQVQNTLDRAVLAAANLSNT-----------RDPQTVVED 91 Query: 67 QIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPS 126 + + + + + ++ ++ I N GLI Sbjct: 92 YFRA--------------MKLEDTLGDVQTGDSLGAKRVRAEGNGSI---NSHFLGLIGV 134 Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 ++ +T + I +VLDVS SM+ ++ + + L Sbjct: 135 DQLDVYGAATAENAT----APLEISLVLDVSGSMQGQKIRDLKEAAKAFVDAVLGEGGDN 190 Query: 187 KSFWSKNTTKSKYAPAPA-----PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 + + A + + ++ Sbjct: 191 SRVTVSLIPYNATVNLGDDLSERFNLDRWQNYSSCAIFESSDYNSLSIDPNAGLEQLAHF 250 Query: 242 AYNIGIVGNQ--------------CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + P S++ + + ++ NT M Sbjct: 251 DPYDYSGNSPDLTAPWCAEGNNLAIVPHSSDADYLSDVIDSFEAQGNTAIDLGMKWGLAL 310 Query: 288 LYNEKES----------------SHNTIGSTRLKKFVIFITDGENS 317 L + T+ KFV+ +TDGEN+ Sbjct: 311 LDPAARPVIGDMQADGLVPSSARYRPSDYGTQTMKFVVVMTDGENT 356 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 39/311 (12%), Positives = 83/311 (26%), Gaps = 47/311 (15%) Query: 105 AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY 164 S A+Y + + + + A+S V D + Sbjct: 328 VPSSARYRPSDYGTQTMKFVVVMTDGENTQEYNLKPWMLNPNALSDVWVDDHGTPGKGDD 387 Query: 165 LQK-HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS 223 +N SN P ++ + V +A + Sbjct: 388 RYSIRVKDNYGDSNDVFYWPHASRNNYRNGPYSW--------------VTRTAAQMVNGV 433 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 A+ + + + + +Y + + ++ + Sbjct: 434 ---AVVDGDSETTKAKCSSYKGAGHNA-------GQETLIENVLGMDYGTLDLDGDGIAG 483 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFIT--------DGENSGASA-------------Y 322 A + N K + DG S Sbjct: 484 ANDDCSNYPPVRLTWQELFGNVKTTYYANAWYWQAYMDGRASYNDYYNAYYSWETTVDAS 543 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 Q N IC + + I+++ V AP G + +R C S+ ++ V+ +L+++F Sbjct: 544 QANTNLATICAKAKQQDVTIFTIGVEAPQAGLNAMRNCASSASHYYNVSS-NQLVDTFRS 602 Query: 383 ITDKIQEQSVR 393 I+D + E + Sbjct: 603 ISDVVVELRLT 613 >gi|167946540|ref|ZP_02533614.1| BatB protein, putative [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 345 Score = 91.9 bits (226), Expect = 2e-16, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 66/177 (37%), Gaps = 43/177 (24%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G +PL+ + N + L+ + P T + ++L E S Sbjct: 148 GDTSYVLSPLTFDRNAIHQLLDGIVPTLAGGGTAIGDGIGLGIKKLRERPEGS------- 200 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------- 352 + +I +TDG+N + L+ + + G++IY++ V + Sbjct: 201 ---RVLILVTDGKNETGTIP-----PLKAAQLAKQEGIRIYTIGVGSTKNRVRLLSPDLR 252 Query: 353 ----------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD----KIQEQSVRI 394 ++ L++ + G +F ND+ L + + +I + + + +++ I Sbjct: 253 TYEIATGLAIDEETLQQIAETTGGAYFRANDTAGLEKVYQRIDELEKSEAESRTIFI 309 >gi|293415564|ref|ZP_06658207.1| yfbK protein [Escherichia coli B185] gi|291433212|gb|EFF06191.1| yfbK protein [Escherichia coli B185] Length = 575 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 38/379 (10%), Positives = 107/379 (28%), Gaps = 48/379 (12%) Query: 21 HIMYIRNQMQSALDAA-VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIR 79 + +Q L A + A + T + Q KQ+ ++ + Sbjct: 68 QQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLD 127 Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + G A + P E Y + S + Sbjct: 128 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW-----DIKDKQSIPASKPIPFAM 182 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + L + + +N+ Sbjct: 183 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI--------------------D 222 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 223 TSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSHK 274 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQAAKGFIKGGINR--------ILLATDGDFNVG 326 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + + + + R +G+ + + V + ++ + D +G + ++ L E Sbjct: 327 --IDDPKSIESMIKKQRESGVTLSTFGVGDDNYNEAMMVRIADVGNGNYSYIDT---LSE 381 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + ++++ + +A + Sbjct: 382 AQKVLNSEMRQTLITVAKD 400 >gi|108799422|ref|YP_639619.1| hypothetical protein Mmcs_2455 [Mycobacterium sp. MCS] gi|119868535|ref|YP_938487.1| hypothetical protein Mkms_2500 [Mycobacterium sp. KMS] gi|126435076|ref|YP_001070767.1| hypothetical protein Mjls_2492 [Mycobacterium sp. JLS] gi|122976988|sp|Q1B971|Y2455_MYCSS RecName: Full=UPF0353 protein Mmcs_2455 gi|166987492|sp|A3PZE9|Y2492_MYCSJ RecName: Full=UPF0353 protein Mjls_2492 gi|166987495|sp|A1UFT9|Y2500_MYCSK RecName: Full=UPF0353 protein Mkms_2500 gi|108769841|gb|ABG08563.1| von Willebrand factor, type A [Mycobacterium sp. MCS] gi|119694624|gb|ABL91697.1| von Willebrand factor, type A [Mycobacterium sp. KMS] gi|126234876|gb|ABN98276.1| von Willebrand factor, type A [Mycobacterium sp. JLS] Length = 335 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 57/152 (37%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N K+ ++KL + T T + A + + + G ++ +DG+ Sbjct: 155 TTNREATKTAIDKLQLADRTATGEGIFTALQAIATV--GAVIGGGDEPPPARIVLFSDGK 212 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + S N ++ G+ I +++ P P +L+K D Sbjct: 213 ETVPSNPDNPKGAFTAARTAKDQGVPISTISFGTPYGYVEINEQRQPVPVDDQMLKKIAD 272 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G+ F + +L E + + +I ++++ Sbjct: 273 LSEGEAFTASSLEQLREVYANLQQQIGYETIK 304 >gi|227820127|ref|YP_002824098.1| transmembrane protein [Sinorhizobium fredii NGR234] gi|227339126|gb|ACP23345.1| putative transmembrane protein [Sinorhizobium fredii NGR234] Length = 451 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 58/443 (13%), Positives = 130/443 (29%), Gaps = 89/443 (20%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI I L + +D ++++MQS LDAA+++ I + KD Sbjct: 31 VAICIIPMILAVGAGLDYTRAYNVQSRMQSDLDAALVAAIKEIDEYDEDEIAEKIKDWFD 90 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 +KQ + ++T+ + + + A +PT + L Sbjct: 91 AQSEKQSATY--------------------DLTEITVDKSGHTITASASGTVPTTLMTLA 130 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + + +V+D S SM + +N Sbjct: 131 ------DIKTVPVGVISAIEGPATSYLEVYIVIDKSPSMLLAATSEDQAMLRADANITCE 184 Query: 182 PPPPKK----SFWSKNTTKSKYAPA-PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 + Y + DV +++ +++ + A ++ Sbjct: 185 FACHDTKDPVKKNGTVIASTYYNYIKSLGVKLRTDVALDAVEEVLDMVDAADEDH----A 240 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN----TYPAMHHAYRELYNEK 292 RI Y++G ++ + + + + +L+ + T+ + A + L + Sbjct: 241 RIKVGLYSLGETISEVLEPTYSTSTARKKLSD-DSSGLTSATSMSATYFQTALKALKKKV 299 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGA---------SAYQNTLNTLQICEYMRNAGMKIY 343 ++ + + K V+ +TDG S + T C+Y+++ + Sbjct: 300 GTAGDGTSAASPLKLVLLLTDGVQSNRDWVIKWSGKYWGRVTPLNPDWCDYLKDNDATMA 359 Query: 344 SV------------------------------------AVSAPPEGQDL----LRKCTDS 363 + VS+ D L+ C S Sbjct: 360 VLYTEYLAIPADWGYNATLAKSMGNSDWTSTWGGTLHSGVSSSTTRHDYIPIALQDCASS 419 Query: 364 SGQFFAVNDSRELLESFDKITDK 386 S F + E+ + ++ Sbjct: 420 SDLFISAASEDEITAGLSTLFNQ 442 >gi|328545070|ref|YP_004305179.1| von Willebrand factor type A [polymorphum gilvum SL003B-26A1] gi|326414812|gb|ADZ71875.1| von Willebrand factor type A [Polymorphum gilvum SL003B-26A1] Length = 552 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 48/417 (11%), Positives = 108/417 (25%), Gaps = 58/417 (13%) Query: 19 LAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYI 78 + + ++ Q+A D L +++ + +D I ++ Sbjct: 149 GSRMTTLK---QAASD---LIRTLMSINEISTEDDRVMVGLVPFTAFVNIGADKATQPWM 202 Query: 79 RENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGI 138 D + S P+ + L + Sbjct: 203 DREGRSPVHWTNFQTGSDGTPVPSLFSSSALVNGRPSRFSLYQQLGGTDWLGCVEARPMP 262 Query: 139 IERSS--ENLAISICMVLDVSRSMEDLYLQKHND----NNNMTSNKYLLPPPPKKSFWSK 192 + + + + + + E + D +NN ++ K Sbjct: 263 YDVTDDAADPDVPASLYVPAFAPDEPDSSPDNRDGYRYSNNWLADNAGACSLTAKQAAQV 322 Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 N +P + + G L + + S + Sbjct: 323 NIYDQGDSPIHGSLATREV----AQGRLCKYRNQPKSYGTSSSQGPNFL-----CKTQPI 373 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE----SSHNTIGSTRLKKFV 308 T L+N+ + + + TN + + +R L ++ S + ++ + Sbjct: 374 TDLTNDKQALLDAVAAMRADGYTNIHQGVVWGWRVLTPQEPFSRGRSPDQKREKDHRRIM 433 Query: 309 IFITDGENSGASAYQN-------------------------------TLNTLQICEYMRN 337 I +TDG N+ + T C Sbjct: 434 IVMTDGANTYQDKSSSHNRTEYNAYGYGTEQRLGSGIDTAGEIAAKMDERTALACRNAAT 493 Query: 338 AGMK-IYSVAVSAPPEG-QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 +Y++A + LLR C S F + EL+ +F++I +I + Sbjct: 494 YEATQVYTIAFQVGDYATRKLLRDCASSPEMAFDAGSNSELVTAFERIGKEISRLRL 550 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 33/354 (9%), Positives = 81/354 (22%), Gaps = 58/354 (16%) Query: 3 AIIISVCFLFI--TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 A L ID++ ++ ++++QS + K T +Q Sbjct: 17 AFGSFAFLLTAGSGVGIDMSRVVTEKSRLQS--------AADATALAANYKSGTYTAEQI 68 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + I++ T F Sbjct: 69 RQHAEAYFDGLYTAPERGSVSRNVTVGDGTISVEAG-----------------VTMPTFF 111 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L+ + ++ + + + + +VLD S SM + + + Sbjct: 112 APLLGVEEISFAVMAESKVGT----ASFDVVLVLDNSGSMAGSRMTTLKQAASDLIRTLM 167 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + K + + Sbjct: 168 SINEISTEDDRVMVGLVPFTAFVNIGADKA---------------TQPWMDREGRSPVHW 212 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMH---HAYRELYNEKE- 293 + G G L ++ V R ++ + Y T+ + Y + + Sbjct: 213 TNFQTGSDGTPVPSLFSSSALVNGRPSRFSLYQQLGGTDWLGCVEARPMPYDVTDDAADP 272 Query: 294 --SSHNTIGSTRLKKFVIFIT--DGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + + + + DG + + + + A + IY Sbjct: 273 DVPASLYVPAFAPDEPDSSPDNRDGYRYSNNWLADNAGACSL-TAKQAAQVNIY 325 >gi|90420284|ref|ZP_01228192.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90335618|gb|EAS49368.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 593 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 40/336 (11%), Positives = 90/336 (26%), Gaps = 68/336 (20%) Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 L S E SS+ + + E ++ P Sbjct: 256 GVKLAERSPTGDYWQEISSKTPLTRFFVYENAHHKNELGPWLGCVESRPNGLAITDAEPN 315 Query: 185 PKK---SFWSKNTTKSKYAPAPAPANRKIDVLIE--SAGNLVNSIQKAIQEKKNLSVRIG 239 F + + + + + A + R Sbjct: 316 YANPDTLFVPSFGPDEYDGSKGDNDYLEDEGRRSMPAETAMSVQAKVAKYFDGSDLQRGN 375 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 N G + TPL++N + + +N ++ TN + +R L + + Sbjct: 376 HPGPNRGCLSTPVTPLTDNQATINAAINAMDADGETNIPEGIAWGWRLLSAREPFTQGRA 435 Query: 300 GS-TRLKKFVIFITDGENSGASAYQN---------------------------------- 324 K ++ +TDG+N+ S + Sbjct: 436 NDAKDNLKVLVLMTDGDNNYGSDENDYNESGYGTFGYASTYDAYGNHSWGRIFDDTSTTS 495 Query: 325 ------------TLNTLQICEYMRN--------AGMKIYSVAVSAPPEG--QDLLRKCTD 362 IC+ +++ G+ I+++A + L+ +C Sbjct: 496 KRANRSSFVSAMNEKVAAICQNIKDDGRKATGEDGIVIFTIAFDLNDGSSVKKLMEQCAS 555 Query: 363 ------SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + ++ S +L+ +FD IT+++ + Sbjct: 556 YGITDPTKKLYYDAKSSSDLMAAFDSITEQVSSLRI 591 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 51/161 (31%), Gaps = 17/161 (10%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 + + + + A+D++ I + +Q ++D A L+ +++ + + Sbjct: 27 FGLTLPILACCMGAAVDISGIYASKRNLQHSVDIAALAAGREYSNNQQDSHLSKVAE--- 83 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 G + D + N + + A PT L Sbjct: 84 -------------GYFFENAGADARANTDFSYDGIFNEDGSTVLQVSAARRHPTIFGDLL 130 Query: 122 GLIPSALTNLSLRS-TGIIERSSENLAISICMVLDVSRSME 161 + + + + +N +I + MVLD S SM Sbjct: 131 SFVTAGELDWRAFPLAARSQIVVQNQSIELVMVLDNSGSMT 171 >gi|326335930|ref|ZP_08202107.1| aerotolerance protein BatA [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691894|gb|EGD33856.1| aerotolerance protein BatA [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 332 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 33/211 (15%), Positives = 73/211 (34%), Gaps = 53/211 (25%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +I+ L A + + RIG + Y+ P + + + V L Sbjct: 111 NRIEALKRVASQFIEERKS---------DRIGIVVYSGESYTK--VPATTDKSIVLQSLK 159 Query: 268 KLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + T + A L S K +I +TDG N+ Sbjct: 160 DIKQGEIEDGTAIGMGLGTAINRL----------KDSKTKSKVIILMTDGVNNTGV---- 205 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPE----------------------GQDLLRKCT- 361 ++ L E + G+++Y++ + + + LL + Sbjct: 206 -IDPLSAAELAKEYGIRVYTIGIGTNGKALSPVAYNPDGSLQYDMVPVEIDEKLLGEIAQ 264 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G++F D+++L + + +I DK+++ + Sbjct: 265 STGGKYFRATDNKKLAQIYTEI-DKLEKSKI 294 >gi|308081588|ref|NP_001183957.1| vitrin isoform 2 [Mus musculus] gi|26342052|dbj|BAC34688.1| unnamed protein product [Mus musculus] Length = 628 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 44/352 (12%), Positives = 96/352 (27%), Gaps = 31/352 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + ++ + +K K Q + ++ + G+ + + P Sbjct: 298 KTHMNSQDLKTAIEKITQRGGLSNVGRAISFVTKTFFSKANGNRGGAPNVAVVMVDGWPT 357 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + E N+F + +A ++ S + +V + Sbjct: 358 DKVEEVSRVARESGINVFFITVEGAAERDIQHVVEPGFA-SKAVCRTNGFYSFNVQSWLS 416 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 K S + + + VL + NL Sbjct: 417 LHKT---VQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSMGTSNFRTVL-QFVANLS 472 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPA 280 + + + R+G + Y N+ ++ S + ++ T+T A Sbjct: 473 KEFEISDTD-----TRVGAVQYTYEQRLEFGFDKYNSKADILSAIRRVGYWSGGTSTGAA 527 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + +A +L+ +K +I ITDG + G+ Sbjct: 528 IQYALEQLF--------KKSKPNKRKVMIIITDGRSYDDVRIP--------AMAAYQKGV 571 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 Y++ + QD L F V+D L + +I I + Sbjct: 572 ITYAIGI--AWAAQDELEVMATHPAKDHSFFVDDFDNLYKIAPRIIQNICTE 621 >gi|323135950|ref|ZP_08071033.1| hypothetical protein Met49242DRAFT_0420 [Methylocystis sp. ATCC 49242] gi|322399041|gb|EFY01560.1| hypothetical protein Met49242DRAFT_0420 [Methylocystis sp. ATCC 49242] Length = 432 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 54/442 (12%), Positives = 117/442 (26%), Gaps = 77/442 (17%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 + + L A+D + I ++ + A DA VL+ + K Sbjct: 17 FGLALMPLALMAGGAVDFSQISRQKSALNQAADAGVLTALKEA-----REQLKQGKPDWQ 71 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 +I +KQ K + ++++ + Y Sbjct: 72 SIAEKQGGKAFTNNASKIGGVSGTGATINLSLS-------GGVLSGSLNYAANA-PTHFL 123 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + NL ++ S I V+DVS SM + + + Sbjct: 124 RIAGLNTINLKGSASA---TMSAAQYRDIHFVIDVSASMGIGATKADQQAMQNSVGCAVA 180 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 + + T + A A +IDV+ ++ + + I + S R+ Sbjct: 181 CHHAEAA---DPATDNLAAVRAIGATLRIDVVRKAVMDALAKI------PNDGSTRVAIH 231 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-----ENTNTYPAMHHAYRELYNEKESSH 296 +++ PLS N+ S ++ TN + +++ L + Sbjct: 232 SFSNS--LKTVFPLSTNIAGAISATQSIDLTNENGQGGTNFHYSLNQLNNLLASAGNGLT 289 Query: 297 NTIGSTRL--------KKFVIFITDGENS-------------------GASAYQNTLNTL 329 + + + F DG + Sbjct: 290 ASQPRGFVLLATDAVEDSSLFFYADGVAPPFARQWVEPNFVVGNPSYFAWGLHYVQAPDA 349 Query: 330 QICEYMRNAGMKIYSV--------AVSAPPEGQ----------DLLRKCTDSSGQFFAVN 371 C ++ G + ++ V P + C + +F Sbjct: 350 ANCSAIKAKGYTMMTLETEYLIPDGVYNPTFDAVRGDMGPAMTKSMTDCASAPDYYFHAE 409 Query: 372 DSRELLESFDKITDKIQEQSVR 393 +E+ + + K S+ Sbjct: 410 SPQEIDRAVQTMVSKTVNLSLT 431 >gi|149050644|gb|EDM02817.1| similar to vitrin (predicted) [Rattus norvegicus] Length = 427 Score = 91.5 bits (225), Expect = 2e-16, Method: Composition-based stats. Identities = 46/352 (13%), Positives = 94/352 (26%), Gaps = 31/352 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + ++ + +K K Q + ++ + G+ + + P Sbjct: 97 KTHMNSQDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGNRGGAPNVAVVLVDGWPT 156 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 I E N+F + +A S + +V + Sbjct: 157 DKIEEVSRVARESGINVFFVTVEGAAEREKQHVVEPNFA-SKAVCRTNGFYSFNVQSWLS 215 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 K S + + + VL + NL Sbjct: 216 LHKT---VQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTSNFRTVL-QFVANLS 271 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPA 280 + + + RIG + Y N+ +V S + ++ T+T A Sbjct: 272 KEFEISDTD-----TRIGAVQYTYEQRLEFGFDKYNSKADVLSAIRRVGYWSGGTSTGAA 326 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + +A +L+ +K +I ITDG + G+ Sbjct: 327 IQYALEQLF--------KKSKPNKRKVMILITDGRSYDDVRIP--------AMAAYQKGV 370 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 Y++ + QD L F V++ L + +I I + Sbjct: 371 ITYAIGI--AWAAQDELEVIATHPARDHSFFVDEFDNLYKFVPRIIRNICTE 420 >gi|332291974|ref|YP_004430583.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332170060|gb|AEE19315.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 334 Score = 91.1 bits (224), Expect = 2e-16, Method: Composition-based stats. Identities = 32/194 (16%), Positives = 64/194 (32%), Gaps = 42/194 (21%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY----EN 274 N + +++K N TP++++ + V S L + Sbjct: 112 NRLEALKKVAASFINGRPNDRIGLIEYAGESFTKTPITSDKSIVLSALKSIQYNNIIEGG 171 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T + L S L K +I +TDGEN+ ++ E Sbjct: 172 TAIGMGLATGVNRL----------KDSKALSKVIILMTDGENNAGQ-----IDPRIAAEL 216 Query: 335 MRNAGMKIYSVAVSAPP----------------------EGQDLLRKCT-DSSGQFFAVN 371 + G+K+Y++ + ++LL + + GQ+F Sbjct: 217 AQEFGIKVYTIGMGTNGMALSPYARNANGTFVYENIQVTIDEELLEEIAATTGGQYFRAT 276 Query: 372 DSRELLESFDKITD 385 ++ +L E +D+I Sbjct: 277 NNEKLQEIYDEIDK 290 >gi|149412375|ref|XP_001507696.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 691 Score = 91.1 bits (224), Expect = 2e-16, Method: Composition-based stats. Identities = 46/352 (13%), Positives = 90/352 (25%), Gaps = 31/352 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + + R +K K Q + ++ + G+ A + + P Sbjct: 361 KTHTNSRDLKAAIEKITQKGGLSNVGRALSFVNKNFFSDANGNRGGAANVAVVMVDGWPT 420 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + ES N+F + +A + + L+V Sbjct: 421 DRVEESSRLARESGINIFFITIEGAAESEKQNVVEPNF-VDKAVCRRNGFYSLNVPSWFG 479 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 + K S + + VL A Sbjct: 480 LQKVA---RPLAKRVCDTHRLACSKTCLNSADVGFVIDGSSSVGTGNFRTVLQFVAN--- 533 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPA 280 E + R+G + Y +++ S + ++N T+T A Sbjct: 534 ---ISKEFEVSDTDTRVGAVQYTYEQRLEFGFDQHRTKSDLLSAIKRVNYWSGGTSTGAA 590 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + +A L+ +K +I ITDG + G+ Sbjct: 591 IRYALERLFE--------KSKPNKRKLMIVITDGRSYDDVRIP--------ALAAHRKGV 634 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 Y++ + QD L F V++ L +I I + Sbjct: 635 ITYAIGI--TWAAQDELEVMASDPDKDHAFFVDEFDNLYTFVPQIIQNICTE 684 Score = 43.4 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 13/118 (11%), Positives = 37/118 (31%), Gaps = 14/118 (11%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + Y N ++K+ + K+ +N A+ + +++ + Sbjct: 347 IVQYGDDPTTEFNLKTHTNSRDLKAAIEKITQKGGLSNVGRALSFVNKNFFSDANGNRGG 406 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + + + DG + + L R +G+ I+ + + E + Sbjct: 407 AAN-----VAVVMVDGWPTDRVEESSRL--------ARESGINIFFITIEGAAESEKQ 451 >gi|327400025|ref|YP_004340864.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327315533|gb|AEA46149.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 790 Score = 91.1 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 38/327 (11%), Positives = 95/327 (29%), Gaps = 5/327 (1%) Query: 69 KKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSAL 128 ++ + + I + D + + T L Sbjct: 361 AQYAIPNGEMYYVTNPVEGNWSIAVVADYPTGYDTVHVDIYKKSGGTWYLVDSHNFTLYA 420 Query: 129 TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 + + + A + ++ +ED L ++N + + + Sbjct: 421 APQTFTINVPSVENLKIEATPVNGTKELHLWVEDGGLYGPYSSSNGEAYETTNAGGTYTA 480 Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 + + + KID +A ++ + Q Y Sbjct: 481 YVVADFPYGEQEFYLNVYIAKIDAAKIAAKTFNGFLKSSDQVGVAYFGGDVPGGY--TPR 538 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 + L+N+ + ++ L Y T + A +EL + + Sbjct: 539 YDVSQTLTNDTLSANNSIDDLWAYGGTPMGGGIKVARQELVANTAPGNIPVMIVLSDGNP 598 Query: 309 IFITDGENSGASAYQNTLNTLQICE--YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 +DG S A Q + + + + + IY++ + LL++ S Sbjct: 599 TLTSDGTASETLAIQEAIEEAETTKQTTIGGEQILIYTIGFGNDAN-ETLLKQIATSPDY 657 Query: 367 FFAVNDSRELLESFDKITDKIQEQSVR 393 ++ S EL + +I +++E++ + Sbjct: 658 YYFAATSEELSSIYRQIAKELKEKAAK 684 >gi|331658353|ref|ZP_08359315.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli TA206] gi|331056601|gb|EGI28610.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli TA206] Length = 574 Score = 91.1 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 38/379 (10%), Positives = 109/379 (28%), Gaps = 48/379 (12%) Query: 21 HIMYIRNQMQSAL-DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIR 79 + +Q L +A + A + T + Q KQ+ ++ + Sbjct: 67 QQYSDKQTLQGRLKEAPTFARAAKANATHIANPGTARYQQFDDNPVKQVAQNPLVTFSLD 126 Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + G A + P E Y + S + Sbjct: 127 VDTGSYANVRRFLNQGLLPPPDAVRVEEVVNYFPSDW-----DIKDKQSIPASKPIPFAM 181 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + L + + +N+ Sbjct: 182 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI--------------------D 221 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 222 TSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSHK 273 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 274 AEINAAIDSLDAEGSTNGGAGLEMAYQQAAKGFIKGGINR--------ILLATDGDFNVG 325 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + + + + R +G+ + ++ V + ++ + D +G + ++ L E Sbjct: 326 --IDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDT---LSE 380 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + ++++ + +A + Sbjct: 381 AQKVLNSEMRQTLITVAKD 399 >gi|152990340|ref|YP_001356062.1| von Willebrand factor type A domain-containing protein [Nitratiruptor sp. SB155-2] gi|151422201|dbj|BAF69705.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 289 Score = 91.1 bits (224), Expect = 3e-16, Method: Composition-based stats. Identities = 30/136 (22%), Positives = 52/136 (38%), Gaps = 21/136 (15%) Query: 254 PLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + +N L P NT + + L T +K +I Sbjct: 133 PLTYDTKALDFLINYLEPSIAGNNTAIGEGLWQGIKALQA----------DTAKQKVLIL 182 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFA 369 ITDG ++ S Q E + G+KIY++ + + LL + +S G+FF Sbjct: 183 ITDGHHNSGSIS-----PRQAVEKAKKLGIKIYTIGLG--DADKHLLEQIAKESGGKFFY 235 Query: 370 VNDSRELLESFDKITD 385 +L F ++ Sbjct: 236 AKSEEDLQSIFSELNK 251 >gi|332088403|gb|EGI93521.1| von Willebrand factor type A domain protein [Shigella boydii 5216-82] Length = 575 Score = 90.7 bits (223), Expect = 3e-16, Method: Composition-based stats. Identities = 38/379 (10%), Positives = 107/379 (28%), Gaps = 48/379 (12%) Query: 21 HIMYIRNQMQSALDAA-VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIR 79 + +Q L A + A + T + Q KQ+ ++ + Sbjct: 68 QQYSDKQALQGRLQEAPTFARAAKAKATHIANLGTARYQQFDDNPVKQVAQNPLATFSLD 127 Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + G A + P E Y + S + Sbjct: 128 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW-----DIKDKQSIPASKPIPFAM 182 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + L + + +N+ Sbjct: 183 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI--------------------D 222 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 223 TSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSHK 274 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQAAKGFIKGGINR--------ILLATDGDFNVG 326 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + + + + R +G+ + + V + ++ + D +G + ++ L E Sbjct: 327 --IDDPKSIESMVKKQRESGVTLSTFGVGDDNYNEAMMVRIADVGNGNYSYIDT---LSE 381 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + ++++ + +A + Sbjct: 382 AQKVLNSEMRQTLITVAKD 400 >gi|306814616|ref|ZP_07448778.1| hypothetical protein ECNC101_21282 [Escherichia coli NC101] gi|305852010|gb|EFM52462.1| hypothetical protein ECNC101_21282 [Escherichia coli NC101] Length = 581 Score = 90.7 bits (223), Expect = 3e-16, Method: Composition-based stats. Identities = 38/379 (10%), Positives = 109/379 (28%), Gaps = 48/379 (12%) Query: 21 HIMYIRNQMQSAL-DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIR 79 + +Q L +A + A + T + Q KQ+ ++ + Sbjct: 74 QQYSDKQTLQGRLKEAPTFARAAKANATHIANPGTARYQQFDDNPVKQVAQNPLVTFSLD 133 Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + G A + P E Y + S + Sbjct: 134 VDTGSYANVRRFLNQGLLPPPDAVRVEEVVNYFPSDW-----DIKDKQSIPASKPIPFAM 188 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + L + + +N+ Sbjct: 189 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI--------------------D 228 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 229 TSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSHK 280 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 281 AEINAAIDSLDAEGSTNGGAGLEMAYQQAAKGFIKGGINR--------ILLATDGDFNVG 332 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + + + + R +G+ + ++ V + ++ + D +G + ++ L E Sbjct: 333 --IDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDT---LSE 387 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + ++++ + +A + Sbjct: 388 AQKVLNSEMRQTLITVAKD 406 >gi|254820233|ref|ZP_05225234.1| hypothetical protein MintA_09911 [Mycobacterium intracellulare ATCC 13950] Length = 339 Score = 90.7 bits (223), Expect = 3e-16, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 65/222 (29%), Gaps = 26/222 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ + ++ E+ + + AI Sbjct: 99 NRAVVMLVIDVSESMASNDVPPNRLAAAKEAGKQFADQLTPAINLGLVEF---------- 148 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + N VKS ++ L P T T + A + + S G Sbjct: 149 AANATLLVPPTTNRGAVKSGIDSLQPAPKTATGEGIFTALQAIATV--GSVMGGGEGPPP 206 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--------------P 351 ++ +DG + + G++I +++ P Sbjct: 207 ARIVLESDGAENVPLDPNAPQGAFTAARAAKGQGVQISTISFGTPYGTVDYEGATIPVPV 266 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + Q L + C + G+ F + L + + +I ++V+ Sbjct: 267 DDQTLQKICEITDGEAFHADSLDSLKNVYTTLQRQIGYETVK 308 >gi|116249091|ref|YP_764932.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115253741|emb|CAK12134.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 465 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 44/327 (13%), Positives = 99/327 (30%), Gaps = 33/327 (10%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+ + + + + D +R +MQS LD A+++ I D K++ + Sbjct: 43 VALTLVPMIVAVGASFDYIRTYNVRQRMQSDLDTALIAAVKEI----DTDDTDALKEKVA 98 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 F Q++ G + + + A +PT Sbjct: 99 DWFHAQVENSYTLGD-------------------IDIDTSNHKITATASGTVPTT----- 134 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 L+ A + S + +++ +V+D S SM Sbjct: 135 -LMKIANIDTVDVSVASAVKGPATSYLNVYIVIDTSPSMLLAATTAGQSAMYSGIGCQFA 193 Query: 182 PPPPKKSFWSKNTTKSKYAPAPA-PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 K + Y + A + DV ++ ++++ I + + + +++G Sbjct: 194 CHTGDAHTVGKTKYANNYEYSAAKTIKLRADVAGDAVRDVLDMIDDS--DSNHQRIKVGL 251 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN-TYPAMHHAYRELYNEKESSHNTI 299 + + LS + + T+ + L + + + Sbjct: 252 YSLGDTLTEVLTPTLSTDTARNRLADASYGLTSATSKAATYFDVSLATLKQKVGTGGDGT 311 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTL 326 S K V+ +TDG S + + Sbjct: 312 SSGSPLKLVLLLTDGVQSQREWVTDKV 338 >gi|149914292|ref|ZP_01902823.1| hypothetical protein RAZWK3B_19866 [Roseobacter sp. AzwK-3b] gi|149811811|gb|EDM71644.1| hypothetical protein RAZWK3B_19866 [Roseobacter sp. AzwK-3b] Length = 597 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 43/320 (13%), Positives = 83/320 (25%), Gaps = 51/320 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 ++ + V ID R +Q+ LD+AVL+G + +D Sbjct: 21 LSLFLFVVMLAVAGLGIDTMRHEMARTHLQATLDSAVLAGAGAPADATAADVKLIVEDYF 80 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 Q + + I + + A + + + FL Sbjct: 81 DAADLSQYLNTIDPETDIVASLNAKSVSASVELE---------------------MDTFL 119 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L L + G + + I + LDVS SM L K + Sbjct: 120 MRLSGVDT----LTTAGGATAAIAAPRMEIVLALDVSGSMAGERLTKMKSAAKQFVTDVM 175 Query: 181 LPPPPKKSFWSKNTTKSKYAPAP----APANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 + S P+ A + L + +K + Sbjct: 176 SASDQGTTTISIVPYSWSVTPSDEMFEALSVDVRHNYSTCIDFLESDFEKTAIDPARSYG 235 Query: 237 RIGTIAY----------------------NIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + + + P SN++ + ++++ L + Sbjct: 236 QTIYTSLTGSFGNIGIGDPTVTNTAYDRTCYTDEYFRILPYSNSVTALHNKIDSLKAAGS 295 Query: 275 TNTYPAMHHAYRELYNEKES 294 T+T+ M A L Sbjct: 296 TSTHLGMKWAAGLLDPAFAP 315 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 33/326 (10%), Positives = 83/326 (25%), Gaps = 30/326 (9%) Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 + + + ++ + GL+ A + ++ + + Sbjct: 277 SNSVTALHNKIDSLKAAGSTSTHLGMKWAAGLLDPAFAPVVSSLQQTRTKTDSSGNLVTY 336 Query: 152 MVLDVS--RSMEDLYLQKHNDNNN----MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 ++D S + + ++ P P Sbjct: 337 SIVDPSINNTPALYSTGQVLKVAIVMGDGANDWTYGLDDPNGLMNPDIVENHTQPDYRGP 396 Query: 206 ----ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS--NNL 259 + + V + + N + Y + N + N+ Sbjct: 397 DSNLYRVQYTDDVFKYRYFVFNPSFIAYSEMNCNTGYWVCVYESEDITNYYLYSTYWNDY 456 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++ + + L P + A+ +++ S F T Sbjct: 457 TDITNGV-YLTP----AQFDALPTTLPNFESQERLSWEEAWGLMTPDFYSRTTFDYAPDN 511 Query: 320 SAYQNTLNTLQ----------ICEYMRNAGMKIYSVAVSAP--PEGQDLLRKCTDSSGQF 367 +N ++ IC ++ G+ +Y++A D L C S Q Sbjct: 512 MFSKNGTGSIAPETKDDRMADICGATKSKGIVVYTIAFEMGEFDSAADRLENCASSPSQH 571 Query: 368 FAVNDSRELLESFDKITDKIQEQSVR 393 F + ++F I +Q+ + Sbjct: 572 FNATTLN-ISQAFGSIAANVQKLRLT 596 >gi|313159758|gb|EFR59115.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 330 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 66/206 (32%), Gaps = 43/206 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 + + + ++ +PL+ + +++ L ++ + T Sbjct: 108 DRITAAKEVAGSFIADRYGDRIGLVAFAGEAFTQSPLTTDQGTLQTLLARIRSGLIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A L S K +I +TDG N+ + E Sbjct: 168 AIGNGLATAINRL----------RESEAKSKVIILLTDGVNNRG-----EIAPQTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP-----------------------EGQDLLRKCTD-SSGQFFAVN 371 + G+++Y++ V + LR + + GQ+F Sbjct: 213 KAQGIRVYTIGVGTEGMAPYPAVDIYGTPTGGTVMAKVEIDEKTLRSIAEQTGGQYFRAT 272 Query: 372 DSRELLESFDKITDKIQEQSVRIAPN 397 D +L +D+I +++++ V + + Sbjct: 273 DKAKLKAIYDQI-NQLEKSKVEVTEH 297 >gi|296446920|ref|ZP_06888856.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] gi|296255595|gb|EFH02686.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] Length = 486 Score = 90.7 bits (223), Expect = 4e-16, Method: Composition-based stats. Identities = 63/478 (13%), Positives = 136/478 (28%), Gaps = 99/478 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + A+D A ++ Q+ + D+A L+ + +T T Sbjct: 24 IFALAAIPLLIAAGGAVDFAIASRVQTQLYAICDSATLAATTPAMMQQTTATAKTVA--- 80 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + Q + + N+ ++ + + AQ Sbjct: 81 ----TSMFAAQVAQINRLTYNSANLTVTVNDDTSASPVKTRTVTVSYLAQVGNA------ 130 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 S+ + +S I +VLD S SME + + Sbjct: 131 --FGSFYHVPTSIFTVKASSTASTARNIDFYLVLDNSPSMELPATTAGLASMTAATGCVF 188 Query: 181 LP-------PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI-QEKK 232 P + T S A +ID + E+A L ++ Q + Sbjct: 189 ACHENTYSDPENTVQYPGYGTIDSYTYAKNAGIALRIDNVREAAKRLASTSQAMMSANGA 248 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN----------PYENTNTYPAMH 282 + Y+ + + S N++ + + +N + P + TYP Sbjct: 249 TYRLAAYAFNYDTTQLQALTSTTSANVSAISTSINAMTPPLMEKNNYLPTGASYTYPTSA 308 Query: 283 HAYRELYNEKESSHNTIG-----------------------------STRLKKFVIFITD 313 + + + + + ++ V+ +TD Sbjct: 309 STWTTVTLGSDPTKTNYNVRDAMTDIEMTLTKVNAAMPNPGNGTTASGDKPQEVVMLVTD 368 Query: 314 GE-----------NSGASAYQNTLNT---------LQICEYMRNAGMKIYS---VAVSAP 350 G + AS+Y N+ T +C ++N G++I + P Sbjct: 369 GMVDGSFYTNTSCTNYASSYSNSYGTFYRCLRPLDTTLCTTIKNRGIRIAVLNLIYYPTP 428 Query: 351 PEG-------------QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 G L+ C + +F V+ ++ E+ + K+ + + Sbjct: 429 GYGFYDGAVAPFISTVSPALKSCASTD-LYFEVDTGSDISEAMTYLFQKVVTTASYLT 485 >gi|293359740|ref|XP_233802.5| PREDICTED: vitrin [Rattus norvegicus] Length = 648 Score = 90.4 bits (222), Expect = 4e-16, Method: Composition-based stats. Identities = 46/352 (13%), Positives = 94/352 (26%), Gaps = 31/352 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + ++ + +K K Q + ++ + G+ + + P Sbjct: 318 KTHMNSQDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGNRGGAPNVAVVLVDGGPT 377 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 I E N+F + +A S + +V + Sbjct: 378 DKIEEVSRVARESGINVFFVTVEGAAEREKQHVVEPNFA-SKAVCRTNGFYSFNVQSWLS 436 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 K S + + + VL + NL Sbjct: 437 LHKT---VQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTSNFRTVL-QFVANLS 492 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPA 280 + + + RIG + Y N+ +V S + ++ T+T A Sbjct: 493 KEFEISDTD-----TRIGAVQYTYEQRLEFGFDKYNSKADVLSAIRRVGYWSGGTSTGAA 547 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + +A +L+ +K +I ITDG + G+ Sbjct: 548 IQYALEQLF--------KKSKPNKRKVMILITDGRSYDDVRIP--------AMAAYQKGV 591 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 Y++ + QD L F V++ L + +I I + Sbjct: 592 ITYAIGI--AWAAQDELEVIATHPARDHSFFVDEFDNLYKFVPRIIRNICTE 641 >gi|301026928|ref|ZP_07190323.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] gi|299879508|gb|EFI87719.1| von Willebrand factor type A domain protein [Escherichia coli MS 196-1] Length = 575 Score = 90.4 bits (222), Expect = 4e-16, Method: Composition-based stats. Identities = 38/379 (10%), Positives = 107/379 (28%), Gaps = 48/379 (12%) Query: 21 HIMYIRNQMQSALDAA-VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIR 79 + +Q L A + A + T + Q KQ+ ++ + Sbjct: 68 QQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLD 127 Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + G A + P E Y + S + Sbjct: 128 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW-----DIKDKQSIPASKPIPFAM 182 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + L + + +N+ Sbjct: 183 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI--------------------D 222 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 223 TSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSHK 274 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR--------ILLATDGDFNVG 326 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + + + + R +G+ + + V + ++ + D +G + ++ L E Sbjct: 327 --IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT---LSE 381 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + ++++ + +A + Sbjct: 382 AQKVLNSEMRQMLITVAKD 400 >gi|86131264|ref|ZP_01049863.1| aerotolerance-related exported protein BatA [Dokdonia donghaensis MED134] gi|85818675|gb|EAQ39835.1| aerotolerance-related exported protein BatA [Dokdonia donghaensis MED134] Length = 334 Score = 90.4 bits (222), Expect = 4e-16, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 65/194 (33%), Gaps = 42/194 (21%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY----EN 274 N + +++K N TP++++ + V S L + Sbjct: 112 NRLEALKKVASSFINGRPNDRIGLVEYAGESFTKTPITSDKSIVLSALKGIQYNSIIEGG 171 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T + + S L K +I +TDGEN+ ++ E Sbjct: 172 TAIGMGLATGVNRI----------KDSKALSKVIILMTDGENNAGQ-----IDPRIAAEL 216 Query: 335 MRNAGMKIYSVAVSAPPE----------------------GQDLLRKCT-DSSGQFFAVN 371 + G+K+Y++ + ++LL + + GQ+F Sbjct: 217 AQEFGIKVYTIGMGTNGTALSPYARNPNGTFVYENIQVTIDEELLEEIAETTGGQYFRAT 276 Query: 372 DSRELLESFDKITD 385 ++++L E +D+I Sbjct: 277 NNKKLQEIYDEIDK 290 >gi|253584083|ref|ZP_04861281.1| BatA protein [Fusobacterium varium ATCC 27725] gi|251834655|gb|EES63218.1| BatA protein [Fusobacterium varium ATCC 27725] Length = 319 Score = 90.4 bits (222), Expect = 4e-16, Method: Composition-based stats. Identities = 31/161 (19%), Positives = 62/161 (38%), Gaps = 38/161 (23%) Query: 254 PLSNNLNEVKS-----RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 PL+ + N +K ++ + T + A L S K + Sbjct: 137 PLTFDHNVIKEMTRKLTVDDITSNTRTAIGMGIGVALNRL----------KDSEAKSKVI 186 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 I +TDGEN+ + + + G+KIY++ + A Sbjct: 187 ILLTDGENNSGEMSPS-----AAADIAKELGIKIYTIGIGAKEIKVPSFFGYKTVKNTEL 241 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 +++L+ + G++F +DS+E E F+KI D +++ + Sbjct: 242 DENMLKSIAETTGGEYFRASDSKEFKEIFNKI-DALEKTKI 281 >gi|197105075|ref|YP_002130452.1| hypothetical protein PHZ_c1612 [Phenylobacterium zucineum HLK1] gi|196478495|gb|ACG78023.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1] Length = 521 Score = 90.4 bits (222), Expect = 4e-16, Method: Composition-based stats. Identities = 57/493 (11%), Positives = 130/493 (26%), Gaps = 102/493 (20%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQ------------------SALDAAV------ 37 A + + AIDLA + Q+Q L AA Sbjct: 27 VAFAMVPLAIGTLGAIDLARGASAKVQLQDALDAAALGAARSSANSPDTLQAAGERLLRQ 86 Query: 38 -LS-------GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENA---GDIA 86 L+ +S K + + + + E + Sbjct: 87 NLALGGDFELVSSSFTFGPDNKVLARAQVRVEPYVAGLAGVNNMDIAAATEVVRAGMQLE 146 Query: 87 QKAQINITKDKN-NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ T N N Y ++ A+ + +G +SL R + Sbjct: 147 IALVLDNTGSMNQNNKLYHLKTAAKAFVTAMETAAEGNTVPNSIKISLVPFSHTVRVDSD 206 Query: 146 LAISICMVLDVSRSMEDL-----YLQKHNDN-----NNMTSNKYLLPPPPKKSFWSKNTT 195 + + S + N + + ++ + + ++T Sbjct: 207 AYRNAAWIDQNGSSPINNEIFPTATGTQWANRFTLFSQLGTSWRGCVESRQAPYDIQDTP 266 Query: 196 KSKYAPAPAPANR--------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA----- 242 + A P + + + + + ++ Sbjct: 267 PTTGATLFTPYFAPDEPDYPAEWYGTKFANSYVDDRTSSTNWRVRQGNLTKYVNTKGLGT 326 Query: 243 ---YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE-SSHNT 298 N G L+ + + ++ ++ L +TN + + L Sbjct: 327 SKGPNAGCGLRPIIRLTTDFDGLRDAVDDLVADGSTNIPMGLVWGWHTLAPMAPFPDGVP 386 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNT--------------------------------- 325 + + KK V+ +TDGEN+ Sbjct: 387 YLTEKHKKIVVLMTDGENTILYKDTPNGSDYSGVGHARQGRVLDPAGRPITESSSQRERT 446 Query: 326 ----LNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 L++C M+ ++IY++ V +L+ C S+ ++ V ++ ++ + Sbjct: 447 AALDDRLLKLCANMKAPAKDIEIYAIRVEVSSGSSSVLQTCASSADHYYDVQNAADMTMA 506 Query: 380 FDKITDKIQEQSV 392 F I +I + Sbjct: 507 FQSIAGQIAALHL 519 >gi|78776847|ref|YP_393162.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497387|gb|ABB43927.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 307 Score = 90.4 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 33/154 (21%), Positives = 63/154 (40%), Gaps = 21/154 (13%) Query: 245 IGIVGNQCTPLSNNLNEV---KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 G +P++ N + S LN+ +NT A+ + R S Sbjct: 138 YGDFAFIASPITYEKNIIIEMLSYLNQGMAGQNTAIGEAIAMSLRAF----------KHS 187 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA-PPEGQDLLRKC 360 K V+ +TDGE++ + +KIY++ + + LL+K Sbjct: 188 KAKSKIVVLLTDGEHNSGDIS-----PKDALVLAKEENIKIYTIGMGNRGEADEALLKKI 242 Query: 361 TD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 D S G+FF +++EL E ++ I D+++ ++ Sbjct: 243 ADESGGEFFYATNAKELKEIYEHI-DELESSKIK 275 >gi|16130205|ref|NP_416773.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|89109088|ref|AP_002868.1| hypothetical protein [Escherichia coli str. K-12 substr. W3110] gi|157161758|ref|YP_001459076.1| von Willebrand factor type A domain-containing protein [Escherichia coli HS] gi|238901445|ref|YP_002927241.1| hypothetical protein BWG_2044 [Escherichia coli BW2952] gi|256022046|ref|ZP_05435911.1| hypothetical protein E4_01620 [Escherichia sp. 4_1_40B] gi|300948978|ref|ZP_07163036.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|300956471|ref|ZP_07168759.1| von Willebrand factor type A domain protein [Escherichia coli MS 175-1] gi|301647634|ref|ZP_07247429.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|307138934|ref|ZP_07498290.1| hypothetical protein EcolH7_12533 [Escherichia coli H736] gi|331642908|ref|ZP_08344043.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli H736] gi|2495629|sp|P76481|YFBK_ECOLI RecName: Full=Uncharacterized protein yfbK gi|1788606|gb|AAC75330.1| conserved protein [Escherichia coli str. K-12 substr. MG1655] gi|85675335|dbj|BAE76678.1| conserved hypothetical protein [Escherichia coli str. K12 substr. W3110] gi|157067438|gb|ABV06693.1| von Willebrand factor type A domain protein [Escherichia coli HS] gi|238860346|gb|ACR62344.1| conserved protein [Escherichia coli BW2952] gi|260448637|gb|ACX39059.1| von Willebrand factor type A [Escherichia coli DH1] gi|300316719|gb|EFJ66503.1| von Willebrand factor type A domain protein [Escherichia coli MS 175-1] gi|300451549|gb|EFK15169.1| von Willebrand factor type A domain protein [Escherichia coli MS 116-1] gi|301074238|gb|EFK89044.1| von Willebrand factor type A domain protein [Escherichia coli MS 146-1] gi|309702582|emb|CBJ01910.1| putative lipoprotein [Escherichia coli ETEC H10407] gi|315136904|dbj|BAJ44063.1| hypothetical protein ECDH1ME8569_2207 [Escherichia coli DH1] gi|331039706|gb|EGI11926.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli H736] Length = 575 Score = 90.4 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 38/379 (10%), Positives = 107/379 (28%), Gaps = 48/379 (12%) Query: 21 HIMYIRNQMQSALDAA-VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIR 79 + +Q L A + A + T + Q KQ+ ++ + Sbjct: 68 QQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLD 127 Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + G A + P E Y + S + Sbjct: 128 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW-----DIKDKQSIPASKPIPFAM 182 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + L + + +N+ Sbjct: 183 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI--------------------D 222 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 223 TSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSHK 274 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR--------ILLATDGDFNVG 326 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + + + + R +G+ + + V + ++ + D +G + ++ L E Sbjct: 327 --IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT---LSE 381 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + ++++ + +A + Sbjct: 382 AQKVLNSEMRQMLITVAKD 400 >gi|323941033|gb|EGB37220.1| von Willebrand protein type A [Escherichia coli E482] Length = 565 Score = 90.4 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 38/379 (10%), Positives = 107/379 (28%), Gaps = 48/379 (12%) Query: 21 HIMYIRNQMQSALDAA-VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIR 79 + +Q L A + A + T + Q KQ+ ++ + Sbjct: 58 QQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLD 117 Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + G A + P E Y + S + Sbjct: 118 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW-----DIKDKQSIPASKPIPFAM 172 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + L + + +N+ Sbjct: 173 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI--------------------D 212 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 213 TSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSHK 264 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 265 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR--------ILLATDGDFNVG 316 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + + + + R +G+ + + V + ++ + D +G + ++ L E Sbjct: 317 --IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT---LSE 371 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + ++++ + +A + Sbjct: 372 AQKVLNSEMRQMLITVAKD 390 >gi|147921050|ref|YP_685140.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] gi|110620536|emb|CAJ35814.1| hypothetical protein RCIX370 [uncultured methanogenic archaeon RC-I] Length = 1310 Score = 90.4 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 26/134 (19%), Positives = 51/134 (38%), Gaps = 16/134 (11%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 N VK+ +N L+ T+ + A EL K S+ K+++I +TD Sbjct: 945 NSGTNKTTVKNAINSLSASGGTDISSGIKKAIAELDAHKRSTA--------KQYIIVLTD 996 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 G + L + + G I+++ + +D L+K ++ V Sbjct: 997 GYSQYPEFD------LIEADKAKAKGYTIFTIGMGM--ADEDTLKKIASKPEYYYRVLSP 1048 Query: 374 RELLESFDKITDKI 387 +L ++ I +I Sbjct: 1049 EQLEAAYYDIGQEI 1062 >gi|148256121|ref|YP_001240706.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] gi|146408294|gb|ABQ36800.1| hypothetical protein BBta_4775 [Bradyrhizobium sp. BTAi1] Length = 602 Score = 90.4 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 58/435 (13%), Positives = 115/435 (26%), Gaps = 56/435 (12%) Query: 13 ITYAID--LAHIMYIR-NQMQSAL-------DA-AVLSGCASIVSDRTIKDPTTKKDQTS 61 + A+D + + +Q+A+ D + L+ V I S Sbjct: 167 VALALDNTGSMAYSGKMTALQNAVAGSGGLIDQLSALAKSPGDVYISLIPFAKVVNVGAS 226 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + I Q + Q A N + ++ S + + Sbjct: 227 NYAQSWIDWTDWQNPPTIQPNNGSYQAAIPNASFTQSQWDMVGPGSSCPFTSGNGFPYFS 286 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 T S S S S +++ N + Y Sbjct: 287 CTSG-PATASSSASKVPSSGSYSGYICPGYDSASHSYYNGCWNSVQNSTRVNWCTGSYCS 345 Query: 182 PPPPKKSFWSKNTTKSKYAPAPA---------PANRKIDVLIESAGNLVNSIQKAIQEKK 232 P + + + + + + Sbjct: 346 CPTTGSNVPNNTCSCTGSGSSTVCKVNTFTHTWIANATSTWTGCVADRTQPNDANAVSPA 405 Query: 233 NLSVRI---------GTIAYNIGIVGN---QCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 + V + Y Q PLS N +KS +N + P TN Sbjct: 406 SSDVATLFPANQHMENNVQYCSSSASTKLGQIVPLSYNWTSLKSAVNAMEPTGGTNQAIG 465 Query: 281 MHHAYRELYNEKE-SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI-------- 331 M A + L + +T + +I ++DG N+ + Q Sbjct: 466 MAWAVQSLIPNGVLGAPAEDANTTYNRVIILLSDGLNTEDRWPDYGNGSTQASGNPIDAR 525 Query: 332 ----CEYMRNAG-------MKIYSVAVSAPPEGQ---DLLRKCTDSSGQFFAVNDSRELL 377 C ++N IY++ V+ +L+ C S +F+ + S +++ Sbjct: 526 QALLCSNLKNTKDSKGNAMYTIYTIQVNTSSPADPTSTVLQNCASSPDKFYMLTSSSQIV 585 Query: 378 ESFDKITDKIQEQSV 392 +F+ I + + V Sbjct: 586 TTFNSIGTALSKLRV 600 Score = 70.3 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 26/214 (12%), Positives = 52/214 (24%), Gaps = 25/214 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI + FI AID + R+ MQ ALD+ L + T Sbjct: 39 LFAIALLPILAFIGAAIDYSRANAARSAMQGALDSTALMLSR------DLSQGTITAADV 92 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + K + +T A + Sbjct: 93 AAKASTYFKALYTSTD-----------AQSVAVTASYTASTSSSASNIQLNASGQIVTQF 141 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM----TS 176 L+ + ++ + ++ + + + LD + SM N Sbjct: 142 MKLVGFPTMTFNTKA----TTTWGDVKMRVALALDNTGSMAYSGKMTALQNAVAGSGGLI 197 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 ++ + +K A + Sbjct: 198 DQLSALAKSPGDVYISLIPFAKVVNVGASNYAQS 231 >gi|260461186|ref|ZP_05809435.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259033220|gb|EEW34482.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 523 Score = 90.4 bits (222), Expect = 5e-16, Method: Composition-based stats. Identities = 61/494 (12%), Positives = 129/494 (26%), Gaps = 115/494 (23%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAVLS--------GCASIVSDRTIKDPTT--------- 55 + +A D++ +M + +Q++LDAA LS + D + Sbjct: 29 VGFAADVSSVMRAKVNLQNSLDAATLSSSHLSDDEAARRLAFDGYFQANVANHPELTNAK 88 Query: 56 -------------KKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQ 102 K S +I +AG + + + +N Sbjct: 89 LTLSVDKGFNYVKTKAIASADVNLYFAFLFGDNQHIEVDAGGVEATNNLEVVLVLDNTGS 148 Query: 103 YIA-ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV----- 156 + KA + L S ++ + + N +D+ Sbjct: 149 MAGAKIKALRDATKVLLDNLDGAKSPDRKVTAAIVPFVTAVNINGDKFDPSWIDMAGKSP 208 Query: 157 ---------------------------SRSMEDLYLQK---------------HNDNNNM 174 + + + N Sbjct: 209 NNGANFPLLPDGKRVNHMDLFRDLAQGTGWQGTGWKGCVEARPGSAAISDVAPDQADANT 268 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR---KIDVLIESAGNLVNSIQKAIQEK 231 Y P P + + + + + + + +K Sbjct: 269 LFVPYFAPDDPGDATGPSASYGNDAKVYNNSYLSDDVQDGTDTKGKDKKIAKYENPKAKK 328 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELY 289 N T+ PL+ +L+++++ ++ TN + R L Sbjct: 329 INDKYAPLTVGP-NRACPTPVVPLTADLDKLRTAAAQMQEWNGSGTNVSEGLSWGMRVLS 387 Query: 290 NEKESSHNTI-GSTRLKKFVIFITDGEN--------------------SGASAYQNTLN- 327 + + K V+ +TDGEN S + Sbjct: 388 PAPPYTDGAPWKTPNTSKIVVLLTDGENVVYGASAEPEKSDYTSYGYLSSGRFGTSNQTD 447 Query: 328 --------TLQICEYMRNAGMKIYSVAVSAP-PEGQDLLRKCTDSSGQFFAVNDSRELLE 378 TL +C+ ++ ++IY++ + + + L KC + ++AVND +L Sbjct: 448 AARSVDRWTLDVCDKLKAQQVQIYTITLQSDTAANRTLYGKCATNPADYYAVNDPSKLPN 507 Query: 379 SFDKITDKIQEQSV 392 F I K + Sbjct: 508 VFQTIAGKFTTLQL 521 >gi|330806846|ref|YP_004351308.1| lipoprotein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327374954|gb|AEA66304.1| Putative lipoprotein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 557 Score = 90.0 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 34/307 (11%), Positives = 93/307 (30%), Gaps = 29/307 (9%) Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 ++ + + + Y L L P L + S Sbjct: 102 SVAETPVSTFSVDVD-TGSYANVRRLLNQGSLPPEGAVRLEEMVNYFPYSYALPTDGSPF 160 Query: 152 MVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 V + S++ + P + + + S P Sbjct: 161 GVTTEVAPSPWNPHTRLLRIGIKASDRAVADLAPANLVFLVDVSGSMDRREGLP------ 214 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 ++ + LV+ ++ R+ + Y + +++ +++L+ Sbjct: 215 LVKSTLKLLVDQLRDQD--------RVSLVVYAGESRVVLKPTSGRDKVTIRNAIDQLDA 266 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 +T + AY+ ++ TDG+ + + ++L Q+ Sbjct: 267 GGSTAGASGIELAYQMARESFIDKGINR--------ILLATDGDFNVGVSDFDSL--KQM 316 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 R +G+ + ++ + L+ + D+ G + +++ LLE+ + D++ Sbjct: 317 AVDQRKSGVSLTTLGFGVDNYNEHLMEQLADAGDGNYAYIDN---LLEARKVLVDQLSST 373 Query: 391 SVRIAPN 397 +A + Sbjct: 374 LAVVARD 380 >gi|291386938|ref|XP_002709809.1| PREDICTED: vitrin [Oryctolagus cuniculus] Length = 869 Score = 90.0 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 49/383 (12%), Positives = 97/383 (25%), Gaps = 47/383 (12%) Query: 27 NQMQSALD---AAVLSGCASIVSDR-------------TIKDPTTKKDQTSTIFKKQIKK 70 + ALD A L G + +K K Q + Sbjct: 508 TDVAQALDIGPAGPLMGVVQYGDNPATQFSLKTHMNSRDLKAAIEKITQRGGLSNAGRAI 567 Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 ++ + G+ + + P + E+ N+F + + Sbjct: 568 SFVTKNFFSKVNGNRGGAPNVAVVLVDGWPTDKVEEASRLARESGINIFFITIEGAVENE 627 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 + +V K F Sbjct: 628 KQYVVEPNFANK-AVCRTNGFYSFNVQSWFGLHKT---VQPLVKRVCDTDRLACSKTCFN 683 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S + + VL + NL + + + RIG + Y Sbjct: 684 SADLGFVIDGSSSVGTGNFRTVL-QFVANLSKEFEISETD-----TRIGAVQYTYEQRLE 737 Query: 251 QCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 N ++ + + ++ T+T A+++A +L+ +K +I Sbjct: 738 FGFDKYNTKPDILNAIKRVGYWSGGTSTGAAINYALEQLF--------KKSKPNKRKLMI 789 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQF 367 ITDG + + G+ Y++ V QD L Sbjct: 790 LITDGRSYDDVRIP--------AMAAHHKGVITYAIGV--AWAAQDELEVIATYPAKDHS 839 Query: 368 FAVNDSRELLESFDKITDKIQEQ 390 F V++ L + +I I + Sbjct: 840 FFVDEFDNLYKFVPRIIQNICTE 862 >gi|241113143|ref|YP_002972978.1| hypothetical protein Rleg_4788 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861351|gb|ACS59017.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 461 Score = 90.0 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 45/327 (13%), Positives = 97/327 (29%), Gaps = 33/327 (10%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+ + + + + D +R +MQS LD A+++ I D K + S Sbjct: 39 VALTLVPMIVAVGASFDYIRTYNVRQRMQSDLDTALIAAVKEI----DTDDAVALKQKVS 94 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 F Q++ G + + + A +PT Sbjct: 95 DWFHAQVENSYTLGD-------------------INIDTSNHKITATASGTVPTT----- 130 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 L+ A + S + +++ +V+D S SM Sbjct: 131 -LMRIANIDTVDVSVASAVKGPATSYLNVYIVIDTSPSMLLAATTAGQATMYSGIGCQFA 189 Query: 182 PPPPKKSFWSKNTTKSKYAPAPA-PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 T + Y + A + DV ++ +++ I + + + +++G Sbjct: 190 CHTGDAHKIGNKTYNNNYEYSTAKNIKLRADVAGDAVKDVLALIDTS--DSNHQRIKVGL 247 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN-TYPAMHHAYRELYNEKESSHNTI 299 + + LS + + T+ + L + + + Sbjct: 248 YSLGDTLTEVLAPTLSTDTARNRLTDASYGLTSATSKAATYFDVSLTTLKQKVGTGGDGT 307 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTL 326 S K V+ +TDG S + + Sbjct: 308 ASNSPLKLVLLLTDGVQSQREWVTDKV 334 >gi|323936560|gb|EGB32847.1| von Willebrand type A protein [Escherichia coli E1520] Length = 565 Score = 90.0 bits (221), Expect = 6e-16, Method: Composition-based stats. Identities = 38/379 (10%), Positives = 107/379 (28%), Gaps = 48/379 (12%) Query: 21 HIMYIRNQMQSALDAA-VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIR 79 + +Q L A + A + T + Q KQ+ ++ + Sbjct: 58 QQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLD 117 Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + G A + P E Y + S + Sbjct: 118 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW-----DIKDKQSIPASKPIPFAM 172 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + L + + +N+ Sbjct: 173 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI--------------------D 212 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 213 TSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSHK 264 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 265 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR--------ILLATDGDFNVG 316 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + + + + R +G+ + + V + ++ + D +G + ++ L E Sbjct: 317 --IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT---LSE 371 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + ++++ + +A + Sbjct: 372 AQKVLNSEMRQMLITVAKD 390 >gi|291514853|emb|CBK64063.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 328 Score = 90.0 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 25/204 (12%), Positives = 70/204 (34%), Gaps = 41/204 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 + + + ++ +PL+ + + +++ L ++ + T Sbjct: 108 DRITAAKEVAGSFIADRYGDRIGLVAFAGEAFTQSPLTTDQSTLQTLLARIRSGLIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A L S K +I +TDG N+ + + E Sbjct: 168 AIGNGLATAINRL----------RESDAKSKVIILLTDGVNNQGQ-----IAPMTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP---------------------EGQDLLRKCTD-SSGQFFAVNDS 373 + G+++Y++ V + +L+ +D + G++F D Sbjct: 213 KAQGIRVYTIGVGTEGMAPYPAIDMFGNLTFVNQKVEIDEKVLKAISDMTGGRYFRATDK 272 Query: 374 RELLESFDKITDKIQEQSVRIAPN 397 +L +D+I +++++ + + + Sbjct: 273 EKLKAVYDEI-NQLEKSKIEVMEH 295 >gi|320158179|ref|YP_004190557.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933491|gb|ADV88354.1| BatA [Vibrio vulnificus MO6-24/O] Length = 323 Score = 90.0 bits (221), Expect = 7e-16, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 67/203 (33%), Gaps = 47/203 (23%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 ++ + + R+G + + TPL+ + V Sbjct: 107 GDYIDRLSAVKNVVTQFIEQ---------RQGDRLGLVLFADHAYLQ--TPLTADRQTVA 155 Query: 264 SRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++LN+ + T + A + + S ++ VI ++DG N+ Sbjct: 156 NQLNQTIIGLIGQKTAIGDGLALATKTFVD----------SEAPQRVVILLSDGSNTAG- 204 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD- 362 TL+ ++ + G+KIY++ + A + L K Sbjct: 205 ----TLDPIEAANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTSADLDEKTLTKIATM 260 Query: 363 SSGQFFAVNDSRELLESFDKITD 385 + GQ+F D++EL + I Sbjct: 261 TGGQYFRARDAQELQAIYQAINQ 283 >gi|315615538|gb|EFU96170.1| von Willebrand factor type A domain protein [Escherichia coli 3431] Length = 575 Score = 89.6 bits (220), Expect = 7e-16, Method: Composition-based stats. Identities = 38/379 (10%), Positives = 107/379 (28%), Gaps = 48/379 (12%) Query: 21 HIMYIRNQMQSALDAA-VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIR 79 + +Q L A + A + T + Q KQ+ ++ + Sbjct: 68 QQYSDKQALQGRLQEAPTFARAAKAKATHIANPGTARYQQFDDNPVKQVAQNPLATFSLD 127 Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + G A + P E Y + S + Sbjct: 128 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW-----DIKDKQSIPASKPIPFAM 182 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + L + + +N+ Sbjct: 183 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI--------------------D 222 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 223 TSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSHK 274 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 275 AEINAAIDSLDAEGSTNGGAGLELAYQQATKGFIKGGINR--------ILLATDGDFNVG 326 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + + + + R +G+ + + V + ++ + D +G + ++ L E Sbjct: 327 --IDDPKSIESMVKKQRESGVTLSTFGVGNSNYNEAMMVRIADVGNGNYSYIDT---LSE 381 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + ++++ + +A + Sbjct: 382 AQKVLNSEMRQMLITVAKD 400 >gi|218559186|ref|YP_002392099.1| hypothetical protein ECS88_2420 [Escherichia coli S88] gi|218365955|emb|CAR03699.1| conserved hypothetical protein [Escherichia coli S88] Length = 580 Score = 89.6 bits (220), Expect = 7e-16, Method: Composition-based stats. Identities = 37/380 (9%), Positives = 107/380 (28%), Gaps = 49/380 (12%) Query: 21 HIMYIRNQMQSALDAAVL--SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYI 78 + + L AA + + T + Q KQ+ ++ + Sbjct: 72 QQYSDKQALLGRLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPVKQVAQNPLATFSL 131 Query: 79 RENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGI 138 + G A + P E Y + S Sbjct: 132 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW-----DIKDKQSIPASKPIPFA 186 Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK 198 + + L + + +N+ Sbjct: 187 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI-------------------- 226 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 227 DTSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSH 278 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 279 KAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKGFIKGGINR--------ILLATDGDFNV 330 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 + + + + R +G+ + ++ V + ++ + D +G + ++ L Sbjct: 331 G--IDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDT---LS 385 Query: 378 ESFDKITDKIQEQSVRIAPN 397 E+ + ++++ + +A + Sbjct: 386 EAQKVLNSEMRQTLITVAKD 405 >gi|332227196|ref|XP_003262777.1| PREDICTED: vitrin isoform 1 [Nomascus leucogenys] Length = 694 Score = 89.6 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 36/323 (11%), Positives = 88/323 (27%), Gaps = 31/323 (9%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 ++ + G+ + + + P + E+ N+F + +A Sbjct: 393 SFVTKNFFSKANGNRSGAPNVVVVMVDGWPTDKVEEASRLARESGINIFFITIEGAAENE 452 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 + V + K Sbjct: 453 KQYVVEPNFANK-AVCRTNGFYSFHVQSWFGLHKTLQPL---VKRVCDTDRLACSKTCLN 508 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S + + VL + NL + + + RIG + Y Sbjct: 509 SADIGFVIDGSSSVGTGNFRTVL-QFVTNLTKEFEISDTD-----TRIGAVQYTYEQRLE 562 Query: 251 QCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 ++ ++ + + ++ T+T A++ A +L+ +K +I Sbjct: 563 FGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF--------KKSKPNKRKLMI 614 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQF 367 ITDG + + + +++ +++ A+ Q+ L Sbjct: 615 LITDGR------SYDDVRIPAMAAHLKG----VFTYAIGVAWAAQEELEVIATHPARDHS 664 Query: 368 FAVNDSRELLESFDKITDKIQEQ 390 F V++ L + +I I + Sbjct: 665 FFVDEFDNLYQYVPRIIQNICTE 687 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 18/163 (11%), Positives = 53/163 (32%), Gaps = 23/163 (14%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 348 MGVVQYGDNPATHFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 405 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 406 ---NRSGAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 454 Query: 357 L--------RKCTDSSGQF-FAVNDSRELLESFDKITDKIQEQ 390 + ++G + F V L ++ + ++ + Sbjct: 455 YVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDT 497 >gi|26248659|ref|NP_754699.1| hypothetical protein c2813 [Escherichia coli CFT073] gi|91211565|ref|YP_541551.1| hypothetical protein UTI89_C2553 [Escherichia coli UTI89] gi|117624462|ref|YP_853375.1| hypothetical protein APECO1_4292 [Escherichia coli APEC O1] gi|227887329|ref|ZP_04005134.1| von Willebrand factor type A domain protein [Escherichia coli 83972] gi|300983332|ref|ZP_07176546.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|301049025|ref|ZP_07196011.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|26109064|gb|AAN81267.1|AE016763_226 Hypothetical protein yfbK [Escherichia coli CFT073] gi|91073139|gb|ABE08020.1| hypothetical protein YfbK [Escherichia coli UTI89] gi|115513586|gb|ABJ01661.1| conserved hypothetical protein [Escherichia coli APEC O1] gi|227835679|gb|EEJ46145.1| von Willebrand factor type A domain protein [Escherichia coli 83972] gi|294491429|gb|ADE90185.1| von Willebrand factor type A domain protein [Escherichia coli IHE3034] gi|300299173|gb|EFJ55558.1| von Willebrand factor type A domain protein [Escherichia coli MS 185-1] gi|300408590|gb|EFJ92128.1| von Willebrand factor type A domain protein [Escherichia coli MS 45-1] gi|307554335|gb|ADN47110.1| von Willebrand factor type A domain protein [Escherichia coli ABU 83972] gi|307626191|gb|ADN70495.1| hypothetical protein UM146_05450 [Escherichia coli UM146] gi|315285863|gb|EFU45301.1| von Willebrand factor type A domain protein [Escherichia coli MS 110-3] gi|315292192|gb|EFU51544.1| von Willebrand factor type A domain protein [Escherichia coli MS 153-1] gi|323952065|gb|EGB47939.1| von Willebrand protein type A [Escherichia coli H252] gi|323956039|gb|EGB51792.1| von Willebrand protein type A [Escherichia coli H263] Length = 580 Score = 89.6 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 37/380 (9%), Positives = 107/380 (28%), Gaps = 49/380 (12%) Query: 21 HIMYIRNQMQSALDAAVL--SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYI 78 + + L AA + + T + Q KQ+ ++ + Sbjct: 72 QQYSDKQALLGRLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPVKQVAQNPLATFSL 131 Query: 79 RENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGI 138 + G A + P E Y + S Sbjct: 132 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW-----DIKDKQSIPASKPIPFA 186 Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK 198 + + L + + +N+ Sbjct: 187 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI-------------------- 226 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 227 DTSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSH 278 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 279 KAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKGFIKGGINR--------ILLATDGDFNV 330 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 + + + + R +G+ + ++ V + ++ + D +G + ++ L Sbjct: 331 G--IDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDT---LS 385 Query: 378 ESFDKITDKIQEQSVRIAPN 397 E+ + ++++ + +A + Sbjct: 386 EAQKVLNSEMRQTLITVAKD 405 >gi|293347920|ref|XP_001064219.2| PREDICTED: vitrin-like [Rattus norvegicus] Length = 648 Score = 89.6 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 46/352 (13%), Positives = 94/352 (26%), Gaps = 31/352 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + ++ + +K K Q + ++ + G+ + + P Sbjct: 318 KTHMNSQDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGNRGGAPNVAVVLVDGWPT 377 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 I E N+F + +A S + +V + Sbjct: 378 DKIEEVSRVARESGINVFFVTVEGAAEREKQHVVEPNFA-SKAVCRTNGFYSFNVQSWLS 436 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 K S + + + VL + NL Sbjct: 437 LHKT---VQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTSNFRTVL-QFVANLS 492 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPA 280 + + + RIG + Y N+ +V S + ++ T+T A Sbjct: 493 KEFEISDTD-----TRIGAVQYTYEQRLEFGFDKYNSKADVLSAIRRVGYWSGGTSTGAA 547 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + +A +L+ +K +I ITDG + G+ Sbjct: 548 IQYALEQLF--------KKSKPNKRKVMILITDGRSYDDVRIP--------AMAAYQKGV 591 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 Y++ + QD L F V++ L + +I I + Sbjct: 592 ITYAIGI--AWAAQDELEVIATHPARDHSFFVDEFDNLYKFVPRIIRNICTE 641 >gi|218672731|ref|ZP_03522400.1| hypothetical protein RetlG_14377 [Rhizobium etli GR56] Length = 323 Score = 89.6 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 42/318 (13%), Positives = 91/318 (28%), Gaps = 29/318 (9%) Query: 105 AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY 164 S E + + + + S + + L + ++ Sbjct: 7 VSSSMIEENRFTPMQTAVAGFLQAFSSNTSLVDKTKISIVPFSSRVNFGLANTAWLKSYN 66 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN----RKIDVLIESAGNL 220 K S+W T + DV + G + Sbjct: 67 GTAAVPKRWTDPKSVYTSSSYKLSYWIDGVTPVMSTSKNYYWMGCIEPRADVEVRDTGAI 126 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 + + A TPL+ + +KS + L +T Sbjct: 127 GDGMGDAPPGTAAFVAMDANPKSGTSFCPPPVTPLTGDFAYLKSVVKNLTSEGSTRLDAG 186 Query: 281 MHHAYRELYNEKESSHNTIG-----STRLKKFVIFITDGENSGASAYQN----------- 324 + + L + + S + K ++F+TDGE + + Sbjct: 187 VVAGWYTLSPKWQGVWGDETSPAEVSDSVHKVMVFMTDGEMNTKYDPNDKFDWICSQTQS 246 Query: 325 -------TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 T C M+ +G++IY+++ SA + ++ R C ++ FF + Sbjct: 247 SACNAFATAAMQTACTAMKKSGIEIYTLSYSADADVVNI-RNCATNTAHFFTA-SPATIK 304 Query: 378 ESFDKITDKIQEQSVRIA 395 ++ I I+ ++R+ Sbjct: 305 TVYETIAAAIRGDTLRLT 322 >gi|149371021|ref|ZP_01890616.1| aerotolerance-related membrane protein [unidentified eubacterium SCB49] gi|149355807|gb|EDM44365.1| aerotolerance-related membrane protein [unidentified eubacterium SCB49] Length = 334 Score = 89.6 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 35/178 (19%), Positives = 62/178 (34%), Gaps = 44/178 (24%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYN 290 + RIG + Y TPL+++ V S LN + T + A L Sbjct: 130 NDRIGLVEYAGESYTK--TPLTSDKTVVLSSLNSIEYNSIIEGGTAIGMGLATAVNRL-- 185 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 ST K +I +TDGEN+ ++ E G+K+Y++ + Sbjct: 186 --------KESTAKSKVIILLTDGENNSG-----FIDPKIASELAVEFGIKVYTIGLGTN 232 Query: 351 P----------------------EGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + LL++ + GQ+F + +L E +++I Sbjct: 233 GMASSPIGILPNGRFQYGNQPVKIDETLLKEIAKTTGGQYFRATSNTKLNEIYEEINK 290 >gi|37676036|ref|NP_936432.1| hypothetical protein VVA0376 [Vibrio vulnificus YJ016] gi|37200576|dbj|BAC96402.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 323 Score = 89.6 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 67/203 (33%), Gaps = 47/203 (23%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 ++ + + R+G + + TPL+ + V Sbjct: 107 GDYIDRLSSVKNVVTQFIEQ---------RQGDRLGLVLFADHAYLQ--TPLTADRQTVA 155 Query: 264 SRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++LN+ + T + A + + S ++ VI ++DG N+ Sbjct: 156 NQLNQTIIGLIGQKTAIGDGLALATKTFVD----------SEAPQRVVILLSDGSNTAG- 204 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD- 362 TL+ ++ + G+KIY++ + A + L K Sbjct: 205 ----TLDPIEAANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTSADLDEKTLTKIATM 260 Query: 363 SSGQFFAVNDSRELLESFDKITD 385 + GQ+F D++EL + I Sbjct: 261 TGGQYFRARDAQELQTIYQAINQ 283 >gi|116623631|ref|YP_825787.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226793|gb|ABJ85502.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 589 Score = 89.6 bits (220), Expect = 8e-16, Method: Composition-based stats. Identities = 33/324 (10%), Positives = 74/324 (22%), Gaps = 43/324 (13%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 T ++ SV + AID +R ++ SA+D L+ + S + Sbjct: 20 TLLVSSVLIPMVGLAIDGGRGYLVRLKLSSAVDGGALAAARLLGSGSNAAQQLSM---AK 76 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK-------NNPLQYIAESKAQYEIP 114 + + + + +G + Y + A Sbjct: 77 ATAAQFVNANFPAKFFGASLSGAANVCVDPGTDSSDPCGVGNGSGISTYKVRTVAVKATA 136 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 T +I +S T L + + + ++ + Sbjct: 137 TMPTLFMRIIGMPTVTVSGSGTASRRDVRVILVMDRSSSMG-TYYSGINQTPPSINDMAL 195 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 + A D + + ++ Sbjct: 196 KFVNSF---------------------SGAGEFGGRDEVGLVVYGGSGIVAYPPRDITKD 234 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + + + NT T A++ AY L + + Sbjct: 235 YTDYTKFTPPDNNFKAS--------GNIPKYIADITSGSNTGTAEALYLAYMTLRADAAT 286 Query: 295 SHNTIGSTRLKKFVIFITDGENSG 318 + ++ TDG +G Sbjct: 287 NP---DLATKLNVIVLFTDGIPNG 307 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 31/362 (8%), Positives = 86/362 (23%), Gaps = 34/362 (9%) Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + + K ++ + + Y+A + + T Sbjct: 230 DITKDYTDYTKFTPPDNNFKASGNIPKYIADITSGSNTGTAEALYLAYMTLRADAATNPD 289 Query: 119 FLKGL----IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 L + + + + + + + I ++ N Sbjct: 290 LATKLNVIVLFTDGIPNGVTAMANDKTIANQHYLMIPNCTNLGLGDTSRTPMLSGSPNPN 349 Query: 175 TSNKYLLPPPPKKSFWSKNTTK--SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 + + + S A A K D + N + K Q Sbjct: 350 IAGWFAQWGGNSYTDNSGPHGFHKPMMAYADTGYTGKGDDIDSYMKNPGHDGGKIDQMTG 409 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + + + NL+ ++ L P + A++ Sbjct: 410 TGCTADPMVGELNKLPDHDIYGNYLNLSAA-PAVSGLTPPVG--SAGALYKLGTLYSTST 466 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK----------I 342 + ++++ + + G A+ + I + + I Sbjct: 467 QCNNSSYNPSAPDNACQ-MGLGSWQAAAHQAWKIWNQIIWDKATQTNIPDPATNKASPVI 525 Query: 343 YSVAVSAPPEG---QDLLRKCTDSS-----------GQFFAVNDSRELLESFDKITDKIQ 388 +++ + LL+ + G+ + D + +F +I +I Sbjct: 526 FTIGFESTASDLPDMKLLQLIANDPSSPAPFSTRVQGKAYNAKDPNAVDAAFQQIRSEIL 585 Query: 389 EQ 390 Sbjct: 586 RL 587 >gi|86143679|ref|ZP_01062055.1| batA protein [Leeuwenhoekiella blandensis MED217] gi|85829722|gb|EAQ48184.1| batA protein [Leeuwenhoekiella blandensis MED217] Length = 334 Score = 89.6 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 33/178 (18%), Positives = 60/178 (33%), Gaps = 44/178 (24%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYN 290 S RIG + Y TP++++ + V S LN + T + + L Sbjct: 130 SDRIGLVEYAGESYTR--TPITSDKSIVLSSLNDIQYNSIIEGGTAIGMGLATSVNRL-- 185 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 S K +I +TDG N+ + E + G+K+Y++ + Sbjct: 186 --------KDSRAKSKVIILMTDGVNNAG-----FIEPSTASELAQEFGIKVYTIGLGTN 232 Query: 351 PE----------------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + LL++ D + G +F D+ L E + +I Sbjct: 233 GTALSPVALRPDGSFQYGSIPVEIDEALLQEIADKTGGLYFRATDNESLEEIYAEINK 290 >gi|148974876|ref|ZP_01811856.1| Flp pilus assembly protein TadG [Vibrionales bacterium SWAT-3] gi|145965385|gb|EDK30634.1| Flp pilus assembly protein TadG [Vibrionales bacterium SWAT-3] Length = 418 Score = 89.6 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 46/440 (10%), Positives = 119/440 (27%), Gaps = 77/440 (17%) Query: 8 VCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQ 67 + +F+ +++ ++ M ++ A + A L+ AS +D + + Sbjct: 1 MMVIFMAFSMQMSQQMLAHTRLLEAAEVASLALIASP-----REDEENNVKYARYLVDRY 55 Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA 127 + + + + + + + ++ + A+Y Sbjct: 56 VVDNTDDVDVAVYTSICEYKDGCVQASGELAPFSDFVVRATAKYTSWIAY------EDVN 109 Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND------NNNMTSNKYLL 181 L S + R + + + D S SM + + + Sbjct: 110 LKPEFSVSGRAVTRKYLPQPVDVYFIGDFSGSMGNPWKNGKMKLDVVKETIKRVVDDIEE 169 Query: 182 PPPPKKSFWSKNTT-------KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI------ 228 +KS + ++ A R + NS + Sbjct: 170 FNSEEKSRVALLGYNPLHVKQSNEIVRLNAYGYRASWRKKHAYDYARNSPATTVRRMFDE 229 Query: 229 -------QEKKNLSVRIGTIAYN------IGIVGNQCTPLSNNLNEVKSRLNK--LNPYE 273 E + R PL+ + + +++L L Sbjct: 230 PTLYNEIIEPSHGMSRYEVERLYKRNNDFDDYFKFYDIPLTEDYDNFRAQLMSAQLKAGG 289 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T+++ + A +E S ++ I ++DG + Y L +C+ Sbjct: 290 GTSSWNGIIAAAQEANKA--------TSLNPEQVFIVLSDG-SDSDKTYLQKLVDQGLCK 340 Query: 334 YMR-------------------NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 +R + + + ++ + D C + D Sbjct: 341 KLRSTISAKRNRFQSNAPTEAEKTKVTMGVIGINYRVQASDGFGDC-FGKKNIYHAKDGE 399 Query: 375 ELLESFDKITDKIQEQSVRI 394 + + I + I E++ ++ Sbjct: 400 ---DVYKYILNLINEETGKL 416 >gi|257469959|ref|ZP_05634051.1| hypothetical protein FulcA4_11506 [Fusobacterium ulcerans ATCC 49185] gi|317064188|ref|ZP_07928673.1| BatA protein [Fusobacterium ulcerans ATCC 49185] gi|313689864|gb|EFS26699.1| BatA protein [Fusobacterium ulcerans ATCC 49185] Length = 319 Score = 89.6 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 57/152 (37%), Gaps = 37/152 (24%) Query: 254 PLSNNLNEVKS-----RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 PL+ + N +K ++ + T + A L S K + Sbjct: 137 PLTFDHNVIKEMTGKLTVDDITSNTRTAIGMGIGVALNRL----------KDSEAKSKVI 186 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 I +TDGEN+ + + + G+KIY++ + A Sbjct: 187 ILLTDGENNSGEMSPS-----AAADIAKELGIKIYTIGIGAKEIKVPSFFGYTTVKNTEL 241 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKI 383 +++L+ + G++F +DS+E E F+KI Sbjct: 242 DENMLKSIAETTGGEYFRASDSKEFKEIFNKI 273 >gi|324006620|gb|EGB75839.1| von Willebrand factor type A domain protein [Escherichia coli MS 57-2] Length = 580 Score = 89.6 bits (220), Expect = 9e-16, Method: Composition-based stats. Identities = 37/380 (9%), Positives = 107/380 (28%), Gaps = 49/380 (12%) Query: 21 HIMYIRNQMQSALDAAVL--SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYI 78 + + L AA + + T + Q KQ+ ++ + Sbjct: 72 QQYSDKQALLGRLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPVKQVAQNPLATFSL 131 Query: 79 RENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGI 138 + G A + P E Y + S Sbjct: 132 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW-----DIKDKQSIPASKPIPFA 186 Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK 198 + + L + + +N+ Sbjct: 187 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI-------------------- 226 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 227 DTSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSH 278 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 279 KAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKGFIKGGINR--------ILLATDGDFNV 330 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 + + + + R +G+ + ++ V + ++ + D +G + ++ L Sbjct: 331 G--IDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDT---LS 385 Query: 378 ESFDKITDKIQEQSVRIAPN 397 E+ + ++++ + +A + Sbjct: 386 EAQKVLNSEMRQTLITVAKD 405 >gi|281179360|dbj|BAI55690.1| conserved hypothetical protein [Escherichia coli SE15] Length = 580 Score = 89.2 bits (219), Expect = 9e-16, Method: Composition-based stats. Identities = 37/380 (9%), Positives = 107/380 (28%), Gaps = 49/380 (12%) Query: 21 HIMYIRNQMQSALDAAVL--SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYI 78 + + L AA + + T + Q KQ+ ++ + Sbjct: 72 QQYSDKQALLGRLQAAPKYQHAAREKAASQIANPATARYQQFDDNPVKQVAQNPLATFSL 131 Query: 79 RENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGI 138 + G A + P E Y + S Sbjct: 132 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW-----DIKDKQSIPASKPIPFA 186 Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK 198 + + L + + +N+ Sbjct: 187 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI-------------------- 226 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 227 DTSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSH 278 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 279 KAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKGFIKGGINR--------ILLATDGDFNV 330 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 + + + + R +G+ + ++ V + ++ + D +G + ++ L Sbjct: 331 G--IDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDT---LS 385 Query: 378 ESFDKITDKIQEQSVRIAPN 397 E+ + ++++ + +A + Sbjct: 386 EAQKVLNSEMRQTLITVAKD 405 >gi|297565073|ref|YP_003684045.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296849522|gb|ADH62537.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 308 Score = 89.2 bits (219), Expect = 9e-16, Method: Composition-based stats. Identities = 32/220 (14%), Positives = 78/220 (35%), Gaps = 32/220 (14%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 +++ S A ++D +A + V + V++G +++ Sbjct: 83 DEQAGVVLAIDVSGSMMADDLKPSRLDAAKAAARSFVERMPAG--------VKVGLVSFA 134 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 G V + L+ + V R++ L NT + + + + Sbjct: 135 AGAVLE--SGLTADHQGVIERIDLLERRANTAIGEGLLESLKAFPTGAN------HQVAV 186 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP---------EGQD 355 VI ++DG N + + + + G+++Y++ V + + Sbjct: 187 PATVILLSDGRNRIG------IAPQEAAQEAKRRGVRVYTIGVGSDDPNASVDWAGFDEA 240 Query: 356 LLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 LR + + G++FA + + L E + ++ +I + R Sbjct: 241 ELRGIAEVTGGRYFAADSADRLQEIYRELGSQIGWKLERT 280 >gi|16126967|ref|NP_421531.1| hypothetical protein CC_2734 [Caulobacter crescentus CB15] gi|221235756|ref|YP_002518193.1| hypothetical protein CCNA_02820 [Caulobacter crescentus NA1000] gi|13424325|gb|AAK24699.1| hypothetical protein CC_2734 [Caulobacter crescentus CB15] gi|220964929|gb|ACL96285.1| conserved hypothetical protein [Caulobacter crescentus NA1000] Length = 629 Score = 89.2 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 40/365 (10%), Positives = 84/365 (23%), Gaps = 41/365 (11%) Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 + + ++ I+ N+P + Sbjct: 268 VLTSMASLNGAIYTVTRLDSNVVYLNGIDTRLKSNSPSGGTIRKCLTSDCNLVVTTSAAH 327 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 +S + G + + + + + TS Sbjct: 328 GFDTGDQISFANLGGLTTLNGQTYSITDLTSNTFDTGVPG-----LGTAAFTSGGTATCE 382 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 + S + A A ++ N Sbjct: 383 QSTTPGCRRLAYVSNWGTNEVRALSTCVSERTGADAYTDAAPSTAFVGTNYPSTSADSYS 442 Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY-------NEKESSH 296 + TPLS++ +K+++N + +T + + + Sbjct: 443 PNPCPSAKITPLSSDKTALKAQINNYSVGGSTAGQIGLAWGWYMVAPNFGYIWPSASQRP 502 Query: 297 NTIGSTRLKKFVIFITDGENSGA-------------------------SAYQNTLNTLQI 331 S L K VI +TDG + + + Sbjct: 503 AAYKSKDLMKVVIMMTDGAFNTPYCNGVIAANAGIGSGSDEDHINCNATNGDPFAQARAL 562 Query: 332 CEYMRN--AGMKIYSVAV--SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 C ++N + +Y+V + + L C S + F EL SF I +I Sbjct: 563 CTVIKNSANDITLYTVGFAVGSDYTAKTFLTDCASDSSKAFFPATGSELKASFTAIAREI 622 Query: 388 QEQSV 392 + Sbjct: 623 SSLRI 627 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 54/185 (29%), Gaps = 27/185 (14%) Query: 17 IDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGS 76 +D++ + R QMQ ALDAA L S + D T + Sbjct: 49 LDVSRLSLQRRQMQDALDAATLMAARSAATASADLDTT------------------GDAA 90 Query: 77 YIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST 136 ++ E AG + T + I + A + NL+ G T Sbjct: 91 FLAEIAGMNLGLTASSSTFSVGTGNRVIGTATATLKPIIANLWQAG---------DFTVT 141 Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK 196 E + + + +VLD++ SM + + + + Sbjct: 142 ATSEVVRSSKNLEVALVLDITGSMSGTRIADLKVAASDLVDIVIRDTQTPFYSKVALVPY 201 Query: 197 SKYAP 201 + Sbjct: 202 AAGVN 206 >gi|218690433|ref|YP_002398645.1| hypothetical protein ECED1_2737 [Escherichia coli ED1a] gi|218427997|emb|CAR08918.2| conserved hypothetical protein [Escherichia coli ED1a] Length = 580 Score = 89.2 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 37/380 (9%), Positives = 107/380 (28%), Gaps = 49/380 (12%) Query: 21 HIMYIRNQMQSALDAAVL--SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYI 78 + + L AA + + T + Q KQ+ ++ + Sbjct: 72 QQYSDKQALLGRLQAAPKYQHAAREKAASQIANPGTARYQQFDDNPIKQVAQNPLVTFSL 131 Query: 79 RENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGI 138 + G A + P E Y + S Sbjct: 132 DVDTGSYANVRRFLNQGLLPPPDAVRVEEVVNYFPSDW-----DIKDKQSIPASKPIPFA 186 Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK 198 + + L + + +N+ Sbjct: 187 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI-------------------- 226 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 227 DTSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSH 278 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 279 KAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKGFIKGGINR--------ILLATDGDFNV 330 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 + + + + R +G+ + ++ V + ++ + D +G + ++ L Sbjct: 331 G--IDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDT---LS 385 Query: 378 ESFDKITDKIQEQSVRIAPN 397 E+ + ++++ + +A + Sbjct: 386 EAQKVLNSEMRQTLITVAKD 405 >gi|315298071|gb|EFU57340.1| von Willebrand factor type A domain protein [Escherichia coli MS 16-3] Length = 581 Score = 89.2 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 38/379 (10%), Positives = 108/379 (28%), Gaps = 48/379 (12%) Query: 21 HIMYIRNQMQSALDAA-VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIR 79 + +Q L A + A + T + Q KQ+ ++ + Sbjct: 74 QQYSDKQTLQGRLQEAPTFARAAKANATHIANPGTARYQQFDDNPVKQVAQNPLVTFSLD 133 Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + G A + P E Y + S + Sbjct: 134 VDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW-----DIKDKQSIPASKPIPFAM 188 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + L + + +N+ Sbjct: 189 RYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI--------------------D 228 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 229 TSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSHK 280 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 281 AEINAAIDSLDAEGSTNGGAGLEMAYQQAAKGFIKGGINR--------ILLATDGDFNVG 332 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + + + + R +G+ + ++ V + ++ + D +G + ++ L E Sbjct: 333 --IDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDT---LSE 387 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + ++++ + +A + Sbjct: 388 AQKVLNSEMRQTLITVAKD 406 >gi|332227198|ref|XP_003262778.1| PREDICTED: vitrin isoform 2 [Nomascus leucogenys] Length = 679 Score = 89.2 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 36/323 (11%), Positives = 88/323 (27%), Gaps = 31/323 (9%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 ++ + G+ + + + P + E+ N+F + +A Sbjct: 378 SFVTKNFFSKANGNRSGAPNVVVVMVDGWPTDKVEEASRLARESGINIFFITIEGAAENE 437 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 + V + K Sbjct: 438 KQYVVEPNFANK-AVCRTNGFYSFHVQSWFGLHKTLQPL---VKRVCDTDRLACSKTCLN 493 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S + + VL + NL + + + RIG + Y Sbjct: 494 SADIGFVIDGSSSVGTGNFRTVL-QFVTNLTKEFEISDTD-----TRIGAVQYTYEQRLE 547 Query: 251 QCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 ++ ++ + + ++ T+T A++ A +L+ +K +I Sbjct: 548 FGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF--------KKSKPNKRKLMI 599 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQF 367 ITDG + + + +++ +++ A+ Q+ L Sbjct: 600 LITDGR------SYDDVRIPAMAAHLKG----VFTYAIGVAWAAQEELEVIATHPARDHS 649 Query: 368 FAVNDSRELLESFDKITDKIQEQ 390 F V++ L + +I I + Sbjct: 650 FFVDEFDNLYQYVPRIIQNICTE 672 Score = 44.9 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 18/163 (11%), Positives = 53/163 (32%), Gaps = 23/163 (14%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 333 MGVVQYGDNPATHFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 390 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 391 ---NRSGAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 439 Query: 357 L--------RKCTDSSGQF-FAVNDSRELLESFDKITDKIQEQ 390 + ++G + F V L ++ + ++ + Sbjct: 440 YVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDT 482 >gi|332227202|ref|XP_003262780.1| PREDICTED: vitrin isoform 4 [Nomascus leucogenys] Length = 657 Score = 89.2 bits (219), Expect = 1e-15, Method: Composition-based stats. Identities = 36/323 (11%), Positives = 88/323 (27%), Gaps = 31/323 (9%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 ++ + G+ + + + P + E+ N+F + +A Sbjct: 356 SFVTKNFFSKANGNRSGAPNVVVVMVDGWPTDKVEEASRLARESGINIFFITIEGAAENE 415 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 + V + K Sbjct: 416 KQYVVEPNFANK-AVCRTNGFYSFHVQSWFGLHKTLQPL---VKRVCDTDRLACSKTCLN 471 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S + + VL + NL + + + RIG + Y Sbjct: 472 SADIGFVIDGSSSVGTGNFRTVL-QFVTNLTKEFEISDTD-----TRIGAVQYTYEQRLE 525 Query: 251 QCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 ++ ++ + + ++ T+T A++ A +L+ +K +I Sbjct: 526 FGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF--------KKSKPNKRKLMI 577 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQF 367 ITDG + + + +++ +++ A+ Q+ L Sbjct: 578 LITDGR------SYDDVRIPAMAAHLKG----VFTYAIGVAWAAQEELEVIATHPARDHS 627 Query: 368 FAVNDSRELLESFDKITDKIQEQ 390 F V++ L + +I I + Sbjct: 628 FFVDEFDNLYQYVPRIIQNICTE 650 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 18/163 (11%), Positives = 53/163 (32%), Gaps = 23/163 (14%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 311 MGVVQYGDNPATHFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 368 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 369 ---NRSGAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 417 Query: 357 L--------RKCTDSSGQF-FAVNDSRELLESFDKITDKIQEQ 390 + ++G + F V L ++ + ++ + Sbjct: 418 YVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDT 460 >gi|126727880|ref|ZP_01743708.1| hypothetical protein RB2150_00467 [Rhodobacterales bacterium HTCC2150] gi|126702821|gb|EBA01926.1| hypothetical protein RB2150_00467 [Rhodobacterales bacterium HTCC2150] Length = 576 Score = 88.8 bits (218), Expect = 1e-15, Method: Composition-based stats. Identities = 46/373 (12%), Positives = 92/373 (24%), Gaps = 77/373 (20%) Query: 1 MTAIIISV---CFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK 57 MTA I + +ID R QMQ LD AVLS + + + Sbjct: 39 MTAFGIFIVAIMVTSAGLSIDFMRQERTRVQMQQNLDTAVLSAASLLQTLG--------- 89 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + + ++ + + + + IN ++ A + T Sbjct: 90 ------AEAVVTDYMSKANIDVDYNLSVNVSEGINF---------RAVDATATATLETLF 134 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 L GL+ +++ S + I +VLDVS SM + Sbjct: 135 L---GLLNIDSLGITVTSGAEERI----PNLEISLVLDVSGSMGSNSRLTNLKTAATQFV 187 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI------------- 224 ++ + S I + + Sbjct: 188 STIISGGSGGTVAMSIIPFSSSVTPSQSVIDAITMEDNHDYSTCIEFADDDFSSSSLDLD 247 Query: 225 -------------QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + + + S++ + +++ L Sbjct: 248 STYKRAVFTSRYSDTGSGDFDDADDFNQDWRSCYMDEYFELLAYSDDETVLYNKIQGLLA 307 Query: 272 YENTNTYPAMHHAYRELYNEKE-----------------SSHNTIGSTRLKKFVIFITDG 314 +T + M L E + T K ++F++DG Sbjct: 308 QGSTAGHTGMKWGTSLLDPEFQAVTNSMIAAGVVDAAHAGMPVAYSDTNTMKIIVFMSDG 367 Query: 315 ENSGASAYQNTLN 327 N + + Sbjct: 368 NNHTQRRFGSDYR 380 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 28/254 (11%), Positives = 67/254 (26%), Gaps = 10/254 (3%) Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP-KKSFWSKNTTKSKYAPAP 203 + + V + ++D N M ++ + + + Sbjct: 329 QAVTNSMIAAGVVDAAHAGMPVAYSDTNTMKIIVFMSDGNNHTQRRFGSDYRGDNSVVWK 388 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 A + V + + + + + + + Sbjct: 389 AEGGTGVWVEGSF--DRIYHRYSSWSSSNTGYEYACSWSNYYCTYTE---GFYADPDPYY 443 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF--ITD-GENSGAS 320 N T T+ E + + + + TD G S + Sbjct: 444 FEKNGNYYGVATETWYNSMTGMTFENLSWEEAWGLMSIEYYESVMGSGAATDWGSTSART 503 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 Q+ C ++ G+ I+++A AP + L C S ++ + + F Sbjct: 504 GSQSDTLMSANCTAAKDRGITIFTIAFEAPSNAETQLNNCATSDNHYYDAQGTS-ITSVF 562 Query: 381 DKITDKIQEQSVRI 394 I IQ+ + + Sbjct: 563 SSIATTIQKLKLTL 576 >gi|226951529|ref|ZP_03821993.1| von Willebrand factor type A domain-containing protein [Acinetobacter sp. ATCC 27244] gi|226837721|gb|EEH70104.1| von Willebrand factor type A domain-containing protein [Acinetobacter sp. ATCC 27244] Length = 536 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 35/362 (9%), Positives = 89/362 (24%), Gaps = 35/362 (9%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHL--KQGSYIRENAGDIAQKAQINITKDKN 98 + + + + E +T Sbjct: 28 SIQTSDVIMPAPIAARSHIAHKAAYNAVMPTMERPRLEQDTEKYQKNEVNPVHRVTDQAV 87 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD--V 156 + + Y L L P + + + + V V Sbjct: 88 STFSIDVD-TGSYTNTRRFLNDGRLPPVDAVRIEEMINYFDYQYPQPNGVHPFSVTTETV 146 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 ++ LPP A K+ ++ ++ Sbjct: 147 DSPWKENAKLIRIGIQAKDLALQQLPPANLVFLVD--------VSGSMSAADKLPLVKQT 198 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 L ++ ++ I Y G ++ + +N L T Sbjct: 199 LRILTEQLRAQD--------KVTIITYASGEKLVLEPTSGEQKEKILAVINGLRAGGATA 250 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ AY++ + ++ TDG+ + +TL + R Sbjct: 251 GEQAIQLAYKQAEKAFVKNGINR--------ILLATDGDFNVGITDFSTL--KGMVAEKR 300 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 +G+ + ++ + L+ + D+ G + +++ E + ++ +A Sbjct: 301 KSGISLTTLGFGTGNYNEQLMEQLADAGDGNYSYIDNKNEAKKVVQ---RQLSSTLATVA 357 Query: 396 PN 397 + Sbjct: 358 QD 359 >gi|16765642|ref|NP_461257.1| hypothetical protein STM2315 [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|167992650|ref|ZP_02573747.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|197262795|ref|ZP_03162869.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|16420855|gb|AAL21216.1| putative von Willebrand factor, vWF type A domain protein [Salmonella enterica subsp. enterica serovar Typhimurium str. LT2] gi|197241050|gb|EDY23670.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA23] gi|205329241|gb|EDZ16005.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar 4,[5],12:i:- str. CVM23701] gi|261247522|emb|CBG25349.1| lipoprotein [Salmonella enterica subsp. enterica serovar Typhimurium str. D23580] gi|267994407|gb|ACY89292.1| hypothetical protein STM14_2853 [Salmonella enterica subsp. enterica serovar Typhimurium str. 14028S] gi|301158873|emb|CBW18386.1| lipoprotein [Salmonella enterica subsp. enterica serovar Typhimurium str. SL1344] gi|312913305|dbj|BAJ37279.1| hypothetical protein STMDT12_C23360 [Salmonella enterica subsp. enterica serovar Typhimurium str. T000240] gi|321222984|gb|EFX48055.1| hypothetical protein SEE_04302 [Salmonella enterica subsp. enterica serovar Typhimurium str. TN061786] gi|323130645|gb|ADX18075.1| Putative von Willebrand factor, vWF type A domain protein [Salmonella enterica subsp. enterica serovar Typhimurium str. 4/74] gi|332989248|gb|AEF08231.1| hypothetical protein STMUK_2345 [Salmonella enterica subsp. enterica serovar Typhimurium str. UK-1] Length = 593 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 32/329 (9%), Positives = 89/329 (27%), Gaps = 28/329 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + +++ + Y L L P + Sbjct: 111 NVMGTARYEHYDENPIKQVSQAPLATFSLDVD-TGSYANVRRFLNQGQLPPPEAVRVEEM 169 Query: 135 STGIIERSSENLAISICM----VLDVSRSMEDLYLQKHNDNNNMTS-NKYLLPPPPKKSF 189 + + +++ + + Sbjct: 170 LNYFPAPQPVADKQDNTKPIAACIPMPFAVKYELAPSPWNAQRTLLKVDVQARDMQTRDL 229 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 N ++ ++ + LVN ++ I + Y G Sbjct: 230 PPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDN--------ITIVTYAGGTHV 281 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + NN +K+ ++ L+ Y +T + AY + ++ Sbjct: 282 ALASTAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYEQAEKGFIKGGVNR--------IL 333 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFF 368 TDG+ + + + + + R G+ + ++ V + ++ + D +G + Sbjct: 334 LTTDGDFNLG--ITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYS 391 Query: 369 AVNDSRELLESFDKITDKIQEQSVRIAPN 397 ++ L E+ + D++ + V +A + Sbjct: 392 YIDS---LSEAQKVLKDEMHQTLVTVAKD 417 >gi|86134839|ref|ZP_01053421.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821702|gb|EAQ42849.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 336 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 30/192 (15%), Positives = 65/192 (33%), Gaps = 42/192 (21%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + +++K + + TP++++ VK +N+L T Sbjct: 114 NRLEALKKVAVDFVDRRPNDRIGIVVYAGESFTQTPITSDKTIVKRTINRLQWGQLEGGT 173 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + + S K +I +TDG N+ + ++ E Sbjct: 174 AIGMGL----------GSRVNRLKDSKAKSKVIILLTDGVNNAGN-----IDPTTATELA 218 Query: 336 RNAGMKIYSVAVSAPP-----------------------EGQDLLRKCT-DSSGQFFAVN 371 + G+K+Y++ + +DLL+ ++ G++F Sbjct: 219 KELGIKVYTIGIGTNGMADFPWSKDPRTGMLNFRKQQVQIDEDLLKNIAEETQGKYFRAT 278 Query: 372 DSRELLESFDKI 383 D+ L E +D+I Sbjct: 279 DNTSLKEIYDEI 290 >gi|296108502|ref|YP_003620203.1| hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] gi|295650404|gb|ADG26251.1| Hypothetical protein lpa_04155 [Legionella pneumophila 2300/99 Alcoy] Length = 352 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 67/185 (36%), Gaps = 40/185 (21%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + + V R++ + T+ A+ A + L + Sbjct: 143 RIGLILFGTRAYLQ--TPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDVPS 200 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 + +I +TDG N+ L L+ E + G+KIY++ + + + Sbjct: 201 KG----------RVIILLTDGANNSGV-----LAPLKAAELAKQDGIKIYTIGLGSEADP 245 Query: 353 ----------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD--KIQEQSVR 393 + L K + G++F D L + I ++++ Sbjct: 246 RALTGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTINQLETVKQEQAT 305 Query: 394 IAPNR 398 + P + Sbjct: 306 VRPQK 310 >gi|148361167|ref|YP_001252374.1| Von Willebrand factor type A (vWA) domain-containing protein [Legionella pneumophila str. Corby] gi|148282940|gb|ABQ57028.1| conserved hypothetical protein [Legionella pneumophila str. Corby] Length = 344 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 67/185 (36%), Gaps = 40/185 (21%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + + V R++ + T+ A+ A + L + Sbjct: 135 RIGLILFGTRAYLQ--TPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDVPS 192 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 + +I +TDG N+ L L+ E + G+KIY++ + + + Sbjct: 193 KG----------RVIILLTDGANNSGV-----LAPLKAAELAKQDGIKIYTIGLGSEADP 237 Query: 353 ----------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD--KIQEQSVR 393 + L K + G++F D L + I ++++ Sbjct: 238 RALTGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTINQLETVKQEQAT 297 Query: 394 IAPNR 398 + P + Sbjct: 298 VRPQK 302 >gi|52843052|ref|YP_096851.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52630163|gb|AAU28904.1| hypothetical protein lpg2856 [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 352 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 67/185 (36%), Gaps = 40/185 (21%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + + V R++ + T+ A+ A + L + Sbjct: 143 RIGLILFGTRAYLQ--TPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDVPS 200 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 + +I +TDG N+ L L+ E + G+KIY++ + + + Sbjct: 201 KG----------RVIILLTDGANNSGV-----LAPLKAAELAKQDGIKIYTIGLGSEADP 245 Query: 353 ----------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD--KIQEQSVR 393 + L K + G++F D L + I ++++ Sbjct: 246 RALTGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTINQLETVKQEQAT 305 Query: 394 IAPNR 398 + P + Sbjct: 306 VRPQK 310 >gi|54295680|ref|YP_128095.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|53755512|emb|CAH17011.1| hypothetical protein lpl2768 [Legionella pneumophila str. Lens] gi|307611729|emb|CBX01432.1| hypothetical protein LPW_31221 [Legionella pneumophila 130b] Length = 344 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 67/185 (36%), Gaps = 40/185 (21%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + + V R++ + T+ A+ A + L + Sbjct: 135 RIGLILFGTRAYLQ--TPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDVPS 192 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 + +I +TDG N+ L L+ E + G+KIY++ + + + Sbjct: 193 KG----------RVIILLTDGANNSGV-----LAPLKAAELAKQDGIKIYTIGLGSEADP 237 Query: 353 ----------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD--KIQEQSVR 393 + L K + G++F D L + I ++++ Sbjct: 238 RALTGDFFAPTLSAELDEKTLEKMAKMTGGRYFRATDPESLQSIYQTINQLETVKQEQAT 297 Query: 394 IAPNR 398 + P + Sbjct: 298 VRPQK 302 >gi|168229682|ref|ZP_02654740.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] gi|194468558|ref|ZP_03074542.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|194454922|gb|EDX43761.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CVM29188] gi|205335570|gb|EDZ22334.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Kentucky str. CDC 191] Length = 596 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 34/355 (9%), Positives = 94/355 (26%), Gaps = 29/355 (8%) Query: 49 TIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK 108 + + +T K + +++ + Sbjct: 89 ATAEAPQHEMRTRASASKAFAAQ-GGNVMGTARYEHYDENPIKQVSQAPLATFSLDVD-T 146 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM----VLDVSRSMEDLY 164 Y L L P + + + +++ Sbjct: 147 GSYANVRRFLNQGQLPPPEAVRVEEMLNYFPAPQPVADKQDNTKPIAACIPMPFAVKYEL 206 Query: 165 LQKHNDNNNMTS-NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS 223 + + N ++ ++ + LVN Sbjct: 207 APSPWNAQRTLLKVDVQARDMQTRDLPPANLVFLIDTSGSMQPAERLPLIRSALKLLVND 266 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 ++ I + Y G + NN +K+ ++ L+ Y +T + Sbjct: 267 LRAQDN--------ITIVTYAGGTHVALASTAGNNTTAIKAAIDNLDAYGSTGGEAGLRL 318 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 AY + ++ TDG+ + + + + + R G+ + Sbjct: 319 AYEQAEKGFIKGGVNR--------ILLTTDGDFNLG--ITDPKDIEALVKKEREKGITLS 368 Query: 344 SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 ++ V + ++ + D +G + ++ L E+ + D++ + V +A + Sbjct: 369 TLGVGDDNFNEAMMVRIADVGNGNYSYIDS---LSEAQKVLKDEMHQTLVTVAKD 420 >gi|238913524|ref|ZP_04657361.1| von Willebrand factor type A domain protein [Salmonella enterica subsp. enterica serovar Tennessee str. CDC07-0191] Length = 596 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 34/355 (9%), Positives = 94/355 (26%), Gaps = 29/355 (8%) Query: 49 TIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK 108 + + +T K + +++ + Sbjct: 89 ATAEAPQHEMRTRASASKAFAAQ-GGNVMGTARYEHYDENPIKQVSQAPLATFSLDVD-T 146 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM----VLDVSRSMEDLY 164 Y L L P + + + +++ Sbjct: 147 GSYANVRRFLNQGQLPPPEAVRVEEMLNYFPAPQPVADKQDNTKPIAACIPMPFAVKYEL 206 Query: 165 LQKHNDNNNMTS-NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS 223 + + N ++ ++ + LVN Sbjct: 207 APSPWNAQRTLLKVDVQARDMQTRDLPPANLVFLIDTSGSMQPAERLPLIRSALKLLVND 266 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 ++ I + Y G + NN +K+ ++ L+ Y +T + Sbjct: 267 LRAQDN--------ITIVTYAGGTHVALASTAGNNTTAIKAAIDNLDAYGSTGGEAGLRL 318 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 AY + ++ TDG+ + + + + + R G+ + Sbjct: 319 AYEQAEKGFIKGGVNR--------ILLTTDGDFNLG--ITDPKDIEALVKKEREKGITLS 368 Query: 344 SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 ++ V + ++ + D +G + ++ L E+ + D++ + V +A + Sbjct: 369 TLGVGDDNFNEAMMVRIADVGNGNYSYIDS---LSEAQKVLKDEMHQTLVTVAKD 420 >gi|331647928|ref|ZP_08349020.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli M605] gi|330912098|gb|EGH40608.1| hypothetical protein ECAA86_02478 [Escherichia coli AA86] gi|331043652|gb|EGI15790.1| putative von Willebrand factor, vWF type A domain protein [Escherichia coli M605] Length = 580 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 37/380 (9%), Positives = 107/380 (28%), Gaps = 49/380 (12%) Query: 21 HIMYIRNQMQSALDAAVL--SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYI 78 + + L AA + + T + Q KQ+ ++ + Sbjct: 72 QQYSDKQALLGRLQAAPKYQHAAREKAAYQIANPGTARYQQFDDNPVKQVAQNPLATFSL 131 Query: 79 RENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGI 138 + G A + P E Y + S Sbjct: 132 DVDTGSYANVRRFLNQGLLPPPDAVRVEEIVNYFPSDW-----DIKDKQSIPASKPIPFA 186 Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK 198 + + L + + +N+ Sbjct: 187 MRYELAPAPWNEQRTLLKVDILAKDRKSEELPASNLVFLI-------------------- 226 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 ++ ++ ++ S LV +++ I + Y + ++ Sbjct: 227 DTSGSMISDERLPLIQSSLKLLVKELREQDN--------IAIVTYAGDSRIALPSISGSH 278 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 E+ + ++ L+ +TN + AY++ ++ TDG+ + Sbjct: 279 KAEINAAIDSLDAEGSTNGGAGLEMAYQQAAKGFVKGGVNR--------ILLATDGDFNV 330 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 + + + + R +G+ + ++ V + ++ + D +G + ++ L Sbjct: 331 G--IDDPKSIESMVKKQRESGVTLSTLGVGDSNYNEAMMVRIADVGNGNYSYIDT---LS 385 Query: 378 ESFDKITDKIQEQSVRIAPN 397 E+ + ++++ + +A + Sbjct: 386 EAQKVLNSEMRQTLITVAKD 405 >gi|54298847|ref|YP_125216.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] gi|53752632|emb|CAH14067.1| hypothetical protein lpp2914 [Legionella pneumophila str. Paris] Length = 344 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 67/185 (36%), Gaps = 40/185 (21%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + + V R++ + T+ A+ A + L + Sbjct: 135 RIGLILFGTRAYLQ--TPLTYDRHSVLMRIDDATAGLAGKTTSIGDAVGLAVKRLQDVPS 192 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 + +I +TDG N+ L L+ E + G+KIY++ + + + Sbjct: 193 KG----------RVIILLTDGANNSGV-----LAPLKAAELAKQDGIKIYTIGLGSEADP 237 Query: 353 ----------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD--KIQEQSVR 393 + L + + G++F D L + I ++++ Sbjct: 238 RALTGDFFAPTLSAELDEKTLEEMAKMTGGRYFRATDPESLQSIYQTINQLETVKQEQAT 297 Query: 394 IAPNR 398 + P + Sbjct: 298 VRPQK 302 >gi|307943468|ref|ZP_07658812.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] gi|307773098|gb|EFO32315.1| putative Flp pilus assembly protein TadG [Roseibium sp. TrichSKD4] Length = 479 Score = 88.4 bits (217), Expect = 2e-15, Method: Composition-based stats. Identities = 44/338 (13%), Positives = 97/338 (28%), Gaps = 50/338 (14%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLK 73 IDL + R++M +ALDA+ L + D+ +K +L Sbjct: 22 GSGIDLTSALNARSKMANALDASALKLAGKLSV------AKLSDDEIQAGLEKMFTANLS 75 Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSL 133 + ++ + K I + + + T + L GL ++ + Sbjct: 76 RFDLKASALSELE----FEVDWTKG-----ILDVWSDVSVKTHFIGLGGL-GPEKLDVGV 125 Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN--MTSNKYLLPPPPKKSFWS 191 S S + A+ + +VLDV+ SM+ + ++ S Sbjct: 126 TS----RVSFASQALELALVLDVTGSMDGDISSLKEASQLLFEALVPENAGRHDQRIRVS 181 Query: 192 KNTTKSKYAPAPAPA--NRKIDVLIESA------GNLVNSIQKAIQEKKNLSVRIGTIAY 243 + ++ + N V G + Y Sbjct: 182 IVPYSQGVNLGAKAWKVTNRQSDSSNCVATRGGPNAFTDAYYNYRGARSNFFVAPGALDY 241 Query: 244 N--------------IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 ++ PL+N+ + + ++ L T + ++ L Sbjct: 242 FVIRRGSNVSWYPPRNNCPESEILPLTNSRKTLLAAVDALEAQGGTAGQAGIAWGWKALS 301 Query: 290 NEKESSHNTIG------STRLKKFVIFITDGENSGASA 321 + S+++ K + +TDG+ + Sbjct: 302 WTWHPFWPSGSDPAKSFSSQVGKAAVIMTDGDFNVHYT 339 >gi|332227200|ref|XP_003262779.1| PREDICTED: vitrin isoform 3 [Nomascus leucogenys] Length = 658 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 36/323 (11%), Positives = 88/323 (27%), Gaps = 31/323 (9%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 ++ + G+ + + + P + E+ N+F + +A Sbjct: 357 SFVTKNFFSKANGNRSGAPNVVVVMVDGWPTDKVEEASRLARESGINIFFITIEGAAENE 416 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 + V + K Sbjct: 417 KQYVVEPNFANK-AVCRTNGFYSFHVQSWFGLHKTLQPL---VKRVCDTDRLACSKTCLN 472 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S + + VL + NL + + + RIG + Y Sbjct: 473 SADIGFVIDGSSSVGTGNFRTVL-QFVTNLTKEFEISDTD-----TRIGAVQYTYEQRLE 526 Query: 251 QCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 ++ ++ + + ++ T+T A++ A +L+ +K +I Sbjct: 527 FGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQLF--------KKSKPNKRKLMI 578 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQF 367 ITDG + + + +++ +++ A+ Q+ L Sbjct: 579 LITDGR------SYDDVRIPAMAAHLKG----VFTYAIGVAWAAQEELEVIATHPARDHS 628 Query: 368 FAVNDSRELLESFDKITDKIQEQ 390 F V++ L + +I I + Sbjct: 629 FFVDEFDNLYQYVPRIIQNICTE 651 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 18/163 (11%), Positives = 53/163 (32%), Gaps = 23/163 (14%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 312 MGVVQYGDNPATHFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 369 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 370 ---NRSGAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 418 Query: 357 L--------RKCTDSSGQF-FAVNDSRELLESFDKITDKIQEQ 390 + ++G + F V L ++ + ++ + Sbjct: 419 YVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDT 461 >gi|62180885|ref|YP_217302.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|62128518|gb|AAX66221.1| putative von Willebrand factor, vWF type A domain [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67] gi|322715363|gb|EFZ06934.1| von Willebrand factor type A domain-containing protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. A50] Length = 593 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 32/329 (9%), Positives = 89/329 (27%), Gaps = 28/329 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + +++ + Y L L P + Sbjct: 111 NVMGTARYEHYDENPIKQVSQAPLATFSLDVD-TGSYANVRRFLNQGQLPPPEAVRVEEM 169 Query: 135 STGIIERSSENLAISICM----VLDVSRSMEDLYLQKHNDNNNMTS-NKYLLPPPPKKSF 189 + + +++ + + Sbjct: 170 LNYFPAPQPVADKQDNTKPIAACIPMPFAVKYELAPSPWNAQRTLLKVDVQARDMQTRDL 229 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 N ++ ++ + LVN ++ I + Y G Sbjct: 230 PPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDN--------ITIVTYAGGTHV 281 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + NN +K+ ++ L+ Y +T + AY + ++ Sbjct: 282 ALASTAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYEQAEKGFIKGGANR--------IL 333 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFF 368 TDG+ + + + + + R G+ + ++ V + ++ + D +G + Sbjct: 334 LTTDGDFNLG--ITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYS 391 Query: 369 AVNDSRELLESFDKITDKIQEQSVRIAPN 397 ++ L E+ + D++ + V +A + Sbjct: 392 YIDS---LSEAQKVLKDEMHQTLVTVAKD 417 >gi|118468162|ref|YP_887464.1| hypothetical protein MSMEG_3149 [Mycobacterium smegmatis str. MC2 155] gi|118169449|gb|ABK70345.1| conserved hypothetical protein [Mycobacterium smegmatis str. MC2 155] Length = 327 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 55/152 (36%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N K+ L+KL + T T + A + + + G ++ ++DG+ Sbjct: 147 TTNREATKNGLDKLQLADRTATGEGIFTALQAIATV--GAVIGGGDEPPPARIVLMSDGK 204 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + S N ++ G+ I +V+ P P ++L K Sbjct: 205 ETVPSNPDNPKGAFTAARTAKDQGVPISTVSFGTPYGYVEINDQRQPVPVDDEMLEKIAQ 264 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G F + +L F + +I ++++ Sbjct: 265 LSGGDAFTASSLEQLKAVFTSLQQQIGYETIK 296 >gi|258654082|ref|YP_003203238.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258557307|gb|ACV80249.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 618 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 43/403 (10%), Positives = 111/403 (27%), Gaps = 30/403 (7%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQS---ALDAAVLSGCAS----IVSDRTIKDPTT 55 A I + DL R ++Q ++ AV+ + + + + D Sbjct: 226 AATIGTLVAATGTSSDLTEAALQRPEVQQYLKDVETAVIHYGDTTLTYLTNLQHADDSGA 285 Query: 56 KKDQTSTIF--KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES------ 107 S + +K + + ++ + ++S Sbjct: 286 ALGYVSAVAVEEKSVLDYNAGNPSGNPATLGDHAPPKVPLVAVYPKEGTLYSDSPFVILD 345 Query: 108 ----KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 A + ++ L+P + + + S ++ ++ Sbjct: 346 APWSTADKQAGAQDFMEFLLLPEQQKVFTEANFRTADHQPGEPITSSPYLIADGVTIALN 405 Query: 164 YLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS 223 + + P + + + S + + + K+D+ ++A + + Sbjct: 406 PPGPSVLRDVRALWTQVR--KPARVLVVMDVSGSMASESGYGSESKLDLAKKAATSALGQ 463 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + Q I PL+ + ++ L P T Y A Sbjct: 464 LTDTDQMGLWAFTTDLPTPDTITADLVGVGPLAQTRQPIIDAISSLTPLNGTPLYAATRE 523 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A + + +K+ V+ +TDG N + L G++++ Sbjct: 524 AAKAMNAQKDP--------NSINAVVVLTDGRNEYTDNDLDGLLRELNA-SAEEDGVRVF 574 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 ++A + L S + + + + F + Sbjct: 575 TIAYGPDADLATLQEISEASRAAAYDARNPTSIDKVFSDVLSN 617 >gi|163754426|ref|ZP_02161548.1| BatA (Bacteroides aerotolerance operon) [Kordia algicida OT-1] gi|161325367|gb|EDP96694.1| BatA (Bacteroides aerotolerance operon) [Kordia algicida OT-1] Length = 335 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 54/165 (32%), Gaps = 43/165 (26%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 TP++ + + +++ L + T + A L S Sbjct: 142 SFTKTPITTDKSIIQNALKDIKYKHGELIGGTAIGMGLATAVNRL----------KDSKA 191 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------ 351 K +I +TDG N+ + E G+K Y++ + Sbjct: 192 KSKVIILLTDGVNNAG-----FIEPQIASELAVEYGIKTYTIGIGTNGMASTPVALNPDG 246 Query: 352 ----------EGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + LL++ + G++F ++++L E +D+I Sbjct: 247 TILFRNMQVEIDEKLLQQIAKTTGGKYFRATNTKKLAEIYDEINK 291 >gi|222082657|ref|YP_002542022.1| hypothetical protein Arad_9368 [Agrobacterium radiobacter K84] gi|221727336|gb|ACM30425.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 405 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 52/349 (14%), Positives = 104/349 (29%), Gaps = 42/349 (12%) Query: 7 SVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKK 66 L + + D +R MQS LDAA+++ + D K + S F Sbjct: 1 MPMLLAVGASFDYIRAYNVRQSMQSDLDAALIAAVK----NVDAGDTDALKQKVSDWFHA 56 Query: 67 QIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPS 126 Q + G + + + + A +PT L+ Sbjct: 57 QTESSYSLGD-------------------IEIDTTNHRITATASGTVPTT------LMKL 91 Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 A N S + + +++ +V+D S SM Sbjct: 92 ANINTVPVSVASAVKGPASSYLNVYIVIDKSPSMLLAATTAGQQAMYNGIGCQFACHTGD 151 Query: 187 KSFWSKNTTKSKYAPAP-APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 T + YA + + DV +++ +++ I + RI Y++ Sbjct: 152 SHTIGTATYSNNYAYSTEKKIKLRADVAVDAVHEVIDMISASDTNH----ERIKVGLYSL 207 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 G + + + R++ + TY + L N+ + + S Sbjct: 208 GDTITEVLAPTLDTTAAGKRVDSDLTSATSTTYTYFDVSLAALKNKVGTGGDGSSSATPL 267 Query: 306 KFVIFITDGENSGASAYQNTLNTL--------QICEYMRNAGMKIYSVA 346 K V+ +TDG S + C+Y++ + + Sbjct: 268 KLVLLLTDGVQSQREWVTSGAKYQPKVAPLNPAWCDYIKKQSATMAVLY 316 >gi|319783082|ref|YP_004142558.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168970|gb|ADV12508.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 704 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 46/382 (12%), Positives = 96/382 (25%), Gaps = 40/382 (10%) Query: 30 QSALDAAVLS--------------GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQG 75 Q+AL+ VL+ G S S + P K ++ Sbjct: 168 QNALERGVLAEPAPPAPTGEFALDGAVSAPSTSRARMPAESKLMAPQQPSTLPADQMQPQ 227 Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 R+ D +D + + A Y +L L + Sbjct: 228 PENRDRIEDFKTNPVHAALEDPVSTFSIDVD-TASYSFVRSSLKQGTLPQVDTVRVEEMI 286 Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 + S VS + + PK + Sbjct: 287 NYFPYDWKGPESASTPFNSTVSVMPTPWNTHTRLMHVAIKGFDVKPTEQPKANLV----- 341 Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 K+ +L + LV+ ++ I + Y Sbjct: 342 FLIDVSGSMDEPDKLPLLKSAFRLLVSKLKADDT--------ISIVTYAGDAGTVLEPTK 393 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 ++ +++ S ++ L P +T + AYR V+ TDG+ Sbjct: 394 ASQKDKILSAIDNLTPGGSTAGEAGIKEAYRLAQKSFVKDGVNR--------VMLATDGD 445 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + + + + ++ E R G+ + +++ + Sbjct: 446 FNVGQSDDD--DLKRLIEKERKTGVFLSVFGFGRGNLNDQMMQTIAQNGNG--TAAYIDT 501 Query: 376 LLESFDKITDKIQEQSVRIAPN 397 L E+ + + IA + Sbjct: 502 LAEAEKVLVEDASSTLFTIAKD 523 >gi|161612960|ref|YP_001586925.1| hypothetical protein SPAB_00666 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] gi|161362324|gb|ABX66092.1| hypothetical protein SPAB_00666 [Salmonella enterica subsp. enterica serovar Paratyphi B str. SPB7] Length = 593 Score = 88.0 bits (216), Expect = 2e-15, Method: Composition-based stats. Identities = 32/329 (9%), Positives = 89/329 (27%), Gaps = 28/329 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + +++ + Y L L P + Sbjct: 111 NVMGTARYEHYDENPIKQVSQAPLATFSLDVD-TGSYANVRRFLNQGQLPPPEAVRVEEM 169 Query: 135 STGIIERSSENLAISICM----VLDVSRSMEDLYLQKHNDNNNMTS-NKYLLPPPPKKSF 189 + + +++ + + Sbjct: 170 LNYFPAPQPVADKQDNTKPIAACIPMPFAVKYELAPSPWNAQRTLLKVDVQARDMQTRDL 229 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 N ++ ++ + LVN ++ I + Y G Sbjct: 230 PPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDN--------ITIVTYAGGTHV 281 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + NN +K+ ++ L+ Y +T + AY + ++ Sbjct: 282 ALASTAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYEQAEKGFIKGGVNR--------IL 333 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFF 368 TDG+ + + + + + R G+ + ++ V + ++ + D +G + Sbjct: 334 LTTDGDFNLG--ITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYS 391 Query: 369 AVNDSRELLESFDKITDKIQEQSVRIAPN 397 ++ L E+ + D++ + V +A + Sbjct: 392 YIDS---LSEAQKVLKDEMHQTLVTVAKD 417 >gi|168465984|ref|ZP_02699854.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Newport str. SL317] gi|195631159|gb|EDX49719.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Newport str. SL317] Length = 593 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 71/190 (37%), Gaps = 22/190 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ ++ + LVN ++ I + Y G + NN +K+ ++ Sbjct: 249 RLPLIRSALKLLVNDLRAQDN--------ITIVTYAGGTHVALASTAGNNTTAIKAAIDN 300 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L+ Y +T + AY + ++ TDG+ + + + Sbjct: 301 LDAYGSTGGEAGLRLAYEQAEKGFIKGGVNR--------ILLTTDGDFNLG--ITDPKDI 350 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 + + R G+ + ++ V + ++ + D +G + ++ L E+ + D++ Sbjct: 351 EALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS---LSEAQKVLKDEM 407 Query: 388 QEQSVRIAPN 397 + V +A + Sbjct: 408 HQTLVTVAKD 417 >gi|170727657|ref|YP_001761683.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169813004|gb|ACA87588.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 640 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 33/199 (16%), Positives = 65/199 (32%), Gaps = 22/199 (11%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + K+ +L + L + + RI + Y N+ Sbjct: 260 VSGSMSSQDKLPLLKNALKMLSQQLDEGD--------RISIVVYAGASGVVLDGVKGNDT 311 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + L+KL +TN + AY+ + VI TDG+ + Sbjct: 312 LAISQALDKLKAGGSTNGGAGIELAYQLAQKHFIAGGVNR--------VILATDGDFNVG 363 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLE 378 + Q L + E R G+ + ++ L+ + D G + ++ L E Sbjct: 364 VSDQQAL--EDMIEEKRKQGIALTTLGFGQGNYNDHLMEQLADKGNGHYAYIDT---LNE 418 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + D+I + IA + Sbjct: 419 ARKVLVDEISATLLTIAKD 437 >gi|307943460|ref|ZP_07658804.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307773090|gb|EFO32307.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 320 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 27/244 (11%), Positives = 62/244 (25%), Gaps = 62/244 (25%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS-------NNLNE 261 + N ++ + Y + NNL Sbjct: 77 RFLGAKHCIEYKPEDFDLNDI-PLNSRAQLPHLYYWRKTNPWCPENFASRMYLNRNNLGG 135 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS-----------HNTIGSTRLKKFVIF 310 +K+ +++L + T + + L + ++ + +K ++ Sbjct: 136 LKAAVDRLTLSDGTGMDIGLLWEAKALSPKLRTAAALDGGLLPGHPTDWSDKQTQKVIVL 195 Query: 311 ITDG----------------------------------------ENSGASAYQNTLNTLQ 330 +TDG + S + + Sbjct: 196 MTDGGITAQYRPKDPWKGLNPKDMRRGIVNARRNVQYVTTRGNMNSPANSKHNSVAYMKT 255 Query: 331 ICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 +C+ + G+ IY+V L C S ++ V +L +F I I+ Sbjct: 256 MCDQAKAKGIIIYTVGFQIRRNTLPDLSLSYCATSPSHYYFVES-SDLSAAFKAIASSIK 314 Query: 389 EQSV 392 + Sbjct: 315 SLRI 318 >gi|198245970|ref|YP_002216383.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|197940486|gb|ACH77819.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853] gi|326624134|gb|EGE30479.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Dublin str. 3246] Length = 593 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 32/329 (9%), Positives = 89/329 (27%), Gaps = 28/329 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + +++ + Y L L P + Sbjct: 111 NVMGTARYEHYDENPIKQVSRAPLATFSLDVD-TGSYANVRRFLNQGQLPPPEAVRVEEM 169 Query: 135 STGIIERSSENLAISICM----VLDVSRSMEDLYLQKHNDNNNMTS-NKYLLPPPPKKSF 189 + + +++ + + Sbjct: 170 LNYFPAPQPVADKQDNTKPIAACIPMPFAVKYELAPSPWNAQRTLLKVDVQARDMQTRDL 229 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 N ++ ++ + LVN ++ I + Y G Sbjct: 230 PPANLVFLIDTSGSMQPAERLPLIRSALKLLVNDLRAQDN--------ITIVTYAGGTHV 281 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + NN +K+ ++ L+ Y +T + AY + ++ Sbjct: 282 ALASTAGNNTTAIKAAIDNLDTYGSTGGEAGLRLAYEQAEKGFIKGGVNR--------IL 333 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFF 368 TDG+ + + + + + R G+ + ++ V + ++ + D +G + Sbjct: 334 LTTDGDFNLG--ITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYS 391 Query: 369 AVNDSRELLESFDKITDKIQEQSVRIAPN 397 ++ L E+ + D++ + V +A + Sbjct: 392 YIDS---LSEAQKVLKDEMHQTLVTVAKD 417 >gi|307720603|ref|YP_003891743.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978696|gb|ADN08731.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 310 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 74/189 (39%), Gaps = 20/189 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 D + +A + ++ +++ + G +PL+ + + + +++L Sbjct: 101 FDPVNPAASRF-DVVKSIVKDFISQRTNDNMGLVVFGSYSFIASPLTYDKHILSRIVSQL 159 Query: 270 ---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + T Y A+ L S K I +TDG ++ + + Sbjct: 160 EVGMAGKYTALYEALAQGVNLL----------KMSKAKSKVAILLTDGYSTAG---ADKI 206 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPP--EGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 + + + G+K+Y + + P LL+ ++ G F +++ +L E + KI Sbjct: 207 PLDVVLDMAKKEGVKVYPIGIGGPDEYNRAVLLKIAKETGGVAFGASNASQLKEVYKKI- 265 Query: 385 DKIQEQSVR 393 D++++ ++ Sbjct: 266 DELEKSEIK 274 >gi|88798929|ref|ZP_01114511.1| hypothetical protein MED297_12762 [Reinekea sp. MED297] gi|88778409|gb|EAR09602.1| hypothetical protein MED297_12762 [Reinekea sp. MED297] Length = 322 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 30/150 (20%), Positives = 56/150 (37%), Gaps = 33/150 (22%) Query: 254 PLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ +LN ++S L L P T + A R+L ST + V+ Sbjct: 144 PLTPDLNAIQSLLADLRPGMADSRTAIGDGLALAVRQL----------RESTTEDRVVVL 193 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--------------QDL 356 ++DGEN+ + + E +++Y++ + + Sbjct: 194 LSDGENNSGEIRPDEATAVAAAE-----NIRVYTIGFGSAGRDSLLQSFGLRSSSLDEQT 248 Query: 357 LRKCTD-SSGQFFAVNDSRELLESFDKITD 385 LR+ + + G+++ S EL E F I Sbjct: 249 LREIAEQTQGRYYRATSSAELAEVFRDIER 278 >gi|300120207|emb|CBK19761.2| unnamed protein product [Blastocystis hominis] Length = 474 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 38/256 (14%), Positives = 80/256 (31%), Gaps = 34/256 (13%) Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 N + +N P F Y +++ L++S Sbjct: 67 WNGNRFDWWGTNGSSYFPQEGGGFGEDGNYGCYYNYTHYNNTGICQGVMDVV-FLIDSSS 125 Query: 226 KAIQEKKN---LSVRIGTIAYNIGIVGNQCT------------PLSNNLNEVKSRLNKLN 270 E V+ Y + + L+ + +V+ ++ Sbjct: 126 SITDENYRKEIDFVKSILDYYYLHPNYTLVSILEFSTDVRVLQELTYDACDVRKAIDSDR 185 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS-------GASAYQ 323 TN A+ A+R L N + + ++ ITDG + + Sbjct: 186 MSGLTNIAKAIEEAHRILKNSRSDIPDQ---------IVLITDGFQTVHSSINCNDHPHD 236 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFA-VNDSRELLESFD 381 ++ + ++IY++ V A +D LR+ S Q+F+ V+D + + Sbjct: 237 CNAYAIEKARAAKADDIQIYTIGVGAASYYEDDLRQIASSPSDQYFSLVDDYSSIQTVRE 296 Query: 382 KITDKIQEQSVRIAPN 397 K+ + +I P+ Sbjct: 297 KLQNSTCPLVTQILPD 312 >gi|88801581|ref|ZP_01117109.1| batA protein [Polaribacter irgensii 23-P] gi|88782239|gb|EAR13416.1| batA protein [Polaribacter irgensii 23-P] Length = 334 Score = 87.7 bits (215), Expect = 3e-15, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 71/201 (35%), Gaps = 43/201 (21%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENT 275 N + +++K + + TP++++ N VK +++L T Sbjct: 112 NRLEALKKVAIDFVDRRPNDRIGIVVYAGESFTQTPITSDKNIVKRTISELQWGQLDGGT 171 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + L ST K +I +TDG N+ + ++ E Sbjct: 172 AIGMGLGSGVNRL----------KESTAKSKVIILLTDGVNNAGN-----IDPRTATELA 216 Query: 336 RNAGMKIYSVAVSAPP-----------------------EGQDLLRKCTD-SSGQFFAVN 371 R +K+Y++ + + LL++ + G++F Sbjct: 217 RELEIKVYTIGIGTNGMADFPWSKDPRTGKLNFRKQQVEIDEKLLQEIATATDGKYFRAT 276 Query: 372 DSRELLESFDKITDKIQEQSV 392 D++ L E +D+I D +++ + Sbjct: 277 DNQSLKEIYDEI-DALEKTKI 296 >gi|262193845|ref|YP_003265054.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262077192|gb|ACY13161.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 346 Score = 87.3 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 68/218 (31%), Gaps = 52/218 (23%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +++V+ + V A ++ + + + PL+ N + + L+ Sbjct: 116 TRLEVVKDVFRAFV-----AGEDGLDGRSNDTIGLVSFAGFADTRCPLTLNHGSLLTILD 170 Query: 268 KL-----NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 L + T + A L S + +I +TDG N+ Sbjct: 171 DLEIVRERAEDGTAIGDGLGLAVERL----------RESEASSRVIILLTDGVNNAGI-- 218 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------------GQDLLRKC 360 L+ E G+K+Y++ + L Sbjct: 219 ---ETPLEAAELASRLGIKVYTIGAGTDGVAPVRVTNPLTGAEELRPMPVEIDEATLEAI 275 Query: 361 TD-SSGQFFAVNDSRELLESFDKITD----KIQEQSVR 393 + + G++F D L + +++I +I E+ +R Sbjct: 276 AEHTGGRYFRATDGDGLRQVYEQIDRLERTEISERRLR 313 >gi|218528924|ref|YP_002419740.1| hypothetical protein Mchl_0894 [Methylobacterium chloromethanicum CM4] gi|218521227|gb|ACK81812.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 477 Score = 87.3 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 63/461 (13%), Positives = 135/461 (29%), Gaps = 90/461 (19%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + + +D + + ++ +A DAA L+G + + + T Sbjct: 29 LFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGVVT-AKEFIAANAQQSDVMT 87 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + I + + + +Q+ I + ++ Y ++ Sbjct: 88 AGIKAGEYQALKAFNVNASKVPFATVSLSQLEIVRS-----GQTLDATVSYTATVQST-F 141 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L ++T L+ R + + + +++DVS SM D++ Sbjct: 142 GRLFGLSVTTLTNRVNASADIA---GYLDFYLMVDVSGSMGLPTT----DSDAALLASKS 194 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + F + + A + D + + L++ K I RIG Sbjct: 195 VEDQGNCQFACHFPNRKGWNLAAGKIQLRSDAVNNAVCALLDRASKPIV---PNQYRIGI 251 Query: 241 IAYNIG-IVGNQCTPLSNNLNEVKSRLN----------KLNPYENT-------------N 276 + + + +L +K+ + L +T + Sbjct: 252 YPFINRLATLAPLSDTTTSLASLKTTADCGKAWPLAFTNLLDTGSTQLFTNNDPTTGTGS 311 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA--------------- 321 A ++ + + N S K FV ITDG + + Sbjct: 312 GGTHFETALPQMKSTIRTFGNGSSSANPKPFVFLITDGMQNSQTYSSWKDKKTYPGNPSK 371 Query: 322 -------YQNTLNTLQI----CEYMRNAGMKIYSVAV----------------------S 348 + QI C ++ AG I + + Sbjct: 372 FAGYRYADWDGSQPAQIDPAKCADLKKAGATISILYIPYNYVKSYTNEGTIVWENNRVNG 431 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 P D LR+C G FF N ++++ S + D+ + Sbjct: 432 FSPTLADPLRQCAS-PGLFFTANSAKDITASLGAMFDQALK 471 >gi|163786711|ref|ZP_02181159.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] gi|159878571|gb|EDP72627.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] Length = 335 Score = 87.3 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 67/201 (33%), Gaps = 42/201 (20%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + + N + +++K + TP++++ + V + + Sbjct: 106 LAKDLLPNRLEALKKVAADFIEGRPNDRIGLVEYAGEAYTKTPITSDKSIVLRSMRDIKY 165 Query: 272 Y----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 T + + L S K +I +TDG N+G ++ Sbjct: 166 NTIIEGGTAIGMGLATSVNRL----------KDSRAKSKVIILLTDGVNNGG-----FID 210 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPE----------------------GQDLLRKCTD-SS 364 E G+K+Y++ + +DLL++ D + Sbjct: 211 PKIASELAVEYGIKVYTIGLGTNGTALSPVRINPNGSFQYGRQKVEIDEDLLKEIADVTG 270 Query: 365 GQFFAVNDSRELLESFDKITD 385 G++F ++++L + +D+I Sbjct: 271 GKYFRATNNKKLAQIYDEINK 291 >gi|260592520|ref|ZP_05857978.1| BatA protein [Prevotella veroralis F0319] gi|260535566|gb|EEX18183.1| BatA protein [Prevotella veroralis F0319] Length = 318 Score = 87.3 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 63/191 (32%), Gaps = 31/191 (16%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY------ 272 N + ++ + + P++ + + + L+ + Sbjct: 108 NRIEVAKEVASDFISGRPNDNIGLTIFAGEAFTQCPMTVDHAALLNLLHNVRTDLVVKGL 167 Query: 273 --ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 + T + ++ L S K +I +TDG N+ S + Sbjct: 168 IQDGTAIGMGLANSVSRL----------KDSKAKSKVIILLTDGSNNVGSIS-----PMT 212 Query: 331 ICEYMRNAGMKIYSVAVS------APPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKI 383 + G++IY++ + L+ ++G+F+ EL + + I Sbjct: 213 AASIAKKYGIRIYTIGLGKESEGDLGAIDYKTLQNIAVSTNGEFYRAQSQAELSKIYQDI 272 Query: 384 TDKIQEQSVRI 394 DK+++ +R+ Sbjct: 273 -DKLEKTKLRV 282 >gi|312882153|ref|ZP_07741902.1| Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] gi|309370199|gb|EFP97702.1| Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] Length = 498 Score = 87.3 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 46/468 (9%), Positives = 124/468 (26%), Gaps = 84/468 (17%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSG--------------------- 40 +++ +A++ + + ++++ + +AA ++ Sbjct: 22 MGLLLVPIMGVTFWAVEGSRYVQESSRLRDSAEAAAIAVTIEDRKGRADSMAEEYVRSYV 81 Query: 41 ----CASIVSDRTIKDPTTKK---------DQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ + R + K T K + ++ + Sbjct: 82 RDIESTALTTSRDYRARDDAKGIREYIEYTVDARTTHKSWFVSNFIPSFDRKQELAGRSV 141 Query: 88 K---------AQINITKDKNNPLQYIAE--SKAQYEIPTENLFLKGLIPSALTNLSLRST 136 I+I + + I ++ + P +S Sbjct: 142 AGKEPIQPGEDTIDIVFVSDFSTSMYGSKLRVLKSSIDQISIKILCESPRYDQQISSYIC 201 Query: 137 GIIERSSENLAISICMVL------------------------DVSRSMEDLYLQKHNDNN 172 + N + + VS + + Sbjct: 202 DDKRATGVNRIGFVPFNIRTREMIWSRDGRAVSQLVYLNGDASVSTYTYNDIDWDRWRMS 261 Query: 173 NMTSN-KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI--ESAGNLVNSIQKAIQ 229 + S K +N ++K D+ + + + + Sbjct: 262 SKESVEKCAQNYFNCNVPQWQNHQRAKRIFDVVGDYSDPDMFDYIDFDATVSDMFNDKSR 321 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTP--LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 K N + G + Q L+N+L ++ + + NT Y + + Sbjct: 322 SKSNFYRTTKGVRLYSGSLHAQFQNILLTNDLLDL-EAIQNMTAAGNTAAYQGILRGLQV 380 Query: 288 LYNEKE----SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA--GMK 341 L + K ++ ++DG+ + + +C+ R G+ Sbjct: 381 LKQGQPSIDADEEVQQAYQNKNKMLLILSDGQETDERIINELVKNK-MCDKAREEIPGLY 439 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 I + + Q+ ++C ++ + + D L E +KI + I + Sbjct: 440 IGVIGIKFSASDQNGFKECVENEDE--DIIDVSNLNELIEKIEELIMK 485 >gi|313207255|ref|YP_004046432.1| von willebrand factor type a [Riemerella anatipestifer DSM 15868] gi|312446571|gb|ADQ82926.1| von Willebrand factor type A [Riemerella anatipestifer DSM 15868] gi|315023479|gb|EFT36485.1| aerotolerance operon BatA [Riemerella anatipestifer RA-YM] gi|325335298|gb|ADZ11572.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Riemerella anatipestifer RA-GD] Length = 330 Score = 87.3 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 70/204 (34%), Gaps = 41/204 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 + + ++++ + PL+++ V+ L NP T Sbjct: 108 DRLTALKEIARTFIKQRTTDRIGLVEYSGEALMRVPLTSDHRVVEEELMSFNPMDLEGGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N + A L S K +I +TDG N+ +A L E Sbjct: 168 NIGDGLAVAVSHL----------RKSKAKSKIIILMTDGVNTIDNAMS----PLTAAELA 213 Query: 336 RNAGMKIYSVAVSAPP----------------------EGQDLLRKCTD-SSGQFFAVND 372 RN +K+Y++ + + + LLR + G++F Sbjct: 214 RNNDIKVYTIGIGSNGLALMPTQQDIFGNLVFTEEQVKIDEYLLRDVAQITGGKYFRATS 273 Query: 373 SRELLESFDKITDKIQEQSVRIAP 396 + L + +++I D +++ +++ + Sbjct: 274 NESLKQIYEEI-DTLEKSNIKTSK 296 >gi|228472814|ref|ZP_04057572.1| BatA protein [Capnocytophaga gingivalis ATCC 33624] gi|228275865|gb|EEK14631.1| BatA protein [Capnocytophaga gingivalis ATCC 33624] Length = 332 Score = 87.3 bits (214), Expect = 4e-15, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 74/211 (35%), Gaps = 53/211 (25%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +I+ L A + S RIG + Y+ P + + + V L Sbjct: 111 NRIEALKRVAAQFIQQ---------RASDRIGIVVYSGESYTK--VPATTDKSIVLQALK 159 Query: 268 KLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++ + T + A L S K +I +TDG N+ Sbjct: 160 EIRQGEIEDGTAIGMGLGTAINRL----------KDSKTKSKVIILMTDGVNNTGV---- 205 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPE----------------------GQDLLRKCTD 362 ++ L E + G+++Y++ + + + LL + + Sbjct: 206 -IDPLSAAELAKEYGIRVYTIGIGTNGKALSPVAYNPDGSFQYDMVPVEIDEKLLAEISK 264 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G++F D+ +L + + +I DK+++ + Sbjct: 265 ITGGKYFRATDNNKLAQIYTEI-DKLEKSKI 294 >gi|66793453|ref|NP_001019751.1| vitrin [Gallus gallus] gi|56744182|dbj|BAD81032.1| Akhirin [Gallus gallus] Length = 748 Score = 86.9 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 43/323 (13%), Positives = 81/323 (25%), Gaps = 31/323 (9%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 ++ + G+ + + P + E+ N+F + +A Sbjct: 447 SFVNKNFFSDANGNRGGAPNVVVVMVDGWPTDRVEEASRLARESGINIFFVTIEAAAQNE 506 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 + + V K Sbjct: 507 KQNVIEPNF-VDKAVCRTNGFYSITVPSWFSLHK---VVQPLVKRVCDIDRLVCSKTCLN 562 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S + + + VL A E + RIG I Y Sbjct: 563 SADIGFVIDGSSSVGTSNFRTVLQFVAN------ISKEFEISDTDTRIGAIQYTYEQRLE 616 Query: 251 QCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + +V S + ++N T+T A+ +A +L+ +K +I Sbjct: 617 FSFDKYSTKQDVLSAIKRINYWSGGTSTGAAISYASEQLFT--------KSKPNKRKIMI 668 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QF 367 ITDG + G+ YS+ V QD L Sbjct: 669 LITDGRSYDDVRMP--------ALTAHQNGVIAYSIGV--AWAAQDELEAIATDPDKEHS 718 Query: 368 FAVNDSRELLESFDKITDKIQEQ 390 F V++ L + + + I + Sbjct: 719 FFVDEFDNLYQFVNPLIQNICTE 741 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 15/126 (11%), Positives = 41/126 (32%), Gaps = 14/126 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 N +G + Y + E+++ + K+ +N A+ + ++ Sbjct: 396 NNAGPLMGIVQYGDDPSTEFNLKTYASPKELRNAIEKIPQKGGLSNVGKALSFVNKNFFS 455 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + V+ + DG + + R +G+ I+ V + A Sbjct: 456 DANGNRGGAPN-----VVVVMVDGWPTD--------RVEEASRLARESGINIFFVTIEAA 502 Query: 351 PEGQDL 356 + + Sbjct: 503 AQNEKQ 508 >gi|207857733|ref|YP_002244384.1| lipoprotein [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] gi|206709536|emb|CAR33881.1| lipoprotein [Salmonella enterica subsp. enterica serovar Enteritidis str. P125109] Length = 596 Score = 86.9 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 71/190 (37%), Gaps = 22/190 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ ++ + LVN ++ I + Y G + NN +K+ ++ Sbjct: 252 RLPLIQSALKLLVNDLRAQDN--------ITIVTYAGGTHVALASTAGNNTTAIKAAIDN 303 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L+ Y +T + AY + ++ TDG+ + + + Sbjct: 304 LDAYGSTGGEAGLRLAYEQAEKGFIKGGVNR--------ILLTTDGDFNLG--ITDPKDI 353 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 + + R G+ + ++ V + ++ + D +G + ++ L E+ + D++ Sbjct: 354 EALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS---LSEAQKVLKDEM 410 Query: 388 QEQSVRIAPN 397 + V +A + Sbjct: 411 HQTLVTVAKD 420 >gi|295132198|ref|YP_003582874.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980213|gb|ADF50678.1| von Willebrand factor(vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 334 Score = 86.9 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 57/168 (33%), Gaps = 42/168 (25%) Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIG 300 TP++++ + V L + T + + L Sbjct: 138 YSGESFTKTPITSDKSVVLRALEDVEFNNILESGTAIGSGLATSVNRL----------KD 187 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP--------- 351 S K +I +TDG N+ ++ E + G+K+Y++ V Sbjct: 188 SKAESKVIILLTDGVNNSG-----FIDPKVASELAKEFGIKVYTIGVGTNGMALTPVGIA 242 Query: 352 -------------EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 +DLL++ D + G++F ++ +L + +D+I Sbjct: 243 ANGRFQFGNRQVEIDEDLLKQIADETGGKYFRATNNEKLEDIYDEIDQ 290 >gi|293391324|ref|ZP_06635658.1| Flp pilus assembly protein TadG [Aggregatibacter actinomycetemcomitans D7S-1] gi|290951858|gb|EFE01977.1| Flp pilus assembly protein TadG [Aggregatibacter actinomycetemcomitans D7S-1] Length = 525 Score = 86.9 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 59/466 (12%), Positives = 132/466 (28%), Gaps = 99/466 (21%) Query: 15 YAIDLAHIMYIRNQMQSALDAAVLS----------------GCASIVSDRTIKDPTTKKD 58 + +D I+ + ++ A D A L VS + I K Sbjct: 43 FTVDGTGILLDKARLAQATDQAALLLIAEDNKYRKNKDHSDVSRQHVSQQDINREGNSKV 102 Query: 59 QT------STIFKKQIKKHLKQGS------YIRENAGDIAQKAQINITKDKNNPLQY--- 103 Q + + +K +L+ D + N Sbjct: 103 QAQWKKRNQELVQGLVKLYLRSDDKNGQKNSSPAIIKDPFLAECLEEKTQPKNKNGTAKS 162 Query: 104 ---IAESKAQYEIPT---ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 + + Q + + L + ++ T ++ + I + MV D+S Sbjct: 163 IACVVQGSVQRKFWLPWGQTLVSSSRLHDGRVGINSGKTYAVKDKQITIPIDLMMVTDLS 222 Query: 158 RSMEDLY------LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 SM D PK S + A A R+ Sbjct: 223 GSMVSPIDKRIPSSSIRIDALRDVVKDIEGILLPKDSRDDTSPYNRMGFVAFAGGARQKT 282 Query: 212 VLIESA-------------------------GNLVNSIQKAIQEKKNLSVRIGT-IAYNI 245 + L++ + + G+ I+Y+ Sbjct: 283 EKNDCVLPYYAQQSKKEEISNLYRNNKLDQASKLLDQYMDIERTINQIDQFNGSNISYDF 342 Query: 246 GIVGNQCTPLSNNLNE-----------VKSRLNKLNPYENTNTYPAMHHAYRELYNEK-- 292 +C S V + LN+++P T M + + Sbjct: 343 INTTKKCLGKSEGKETTRAWFDKKNLGVSNALNEIDPDGGTAVTSGMFIGTNLMTDTNKD 402 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQ-NTLNTLQICEYMR--------------N 337 + + +T ++ ++ ++DGE++ + L + +C ++ Sbjct: 403 PEAAPSKLNTNTRRILLVLSDGEDNRPTEGTLVKLMSAGLCNKIKRKIDSLQDTKYPKVE 462 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 A + ++ + P + ++ ++C Q++ V + LL++F +I Sbjct: 463 ARVAFVALGYNPPQDQVNVWKQCV--GKQYYTVFSKQGLLDAFRQI 506 >gi|32452632|gb|AAP43994.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 525 Score = 86.9 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 59/466 (12%), Positives = 132/466 (28%), Gaps = 99/466 (21%) Query: 15 YAIDLAHIMYIRNQMQSALDAAVLS----------------GCASIVSDRTIKDPTTKKD 58 + +D I+ + ++ A D A L VS + I K Sbjct: 43 FTVDGTGILLDKARLAQATDQAALLLIAEDNKYRKNKDHSDVSRQHVSQQDINREGNSKV 102 Query: 59 QT------STIFKKQIKKHLKQGS------YIRENAGDIAQKAQINITKDKNNPLQY--- 103 Q + + +K +L+ D + N Sbjct: 103 QAQWKKRNQELVQGLVKLYLRSDDKNGQKNSSPAIIKDPFLAECLEEKTQPKNKNGTAKS 162 Query: 104 ---IAESKAQYEIPT---ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 + + Q + + L + ++ T ++ + I + MV D+S Sbjct: 163 IACVVQGSVQRKFWLPWGQTLVSSSRLHDGRVGINSGKTYAVKDKQITIPIDLMMVTDLS 222 Query: 158 RSMEDLY------LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 SM D PK S + A A R+ Sbjct: 223 GSMVSPIDKRIPSSSIRIDALRDVVKDIEGILLPKDSRDDTSPYNRMGFVAFAGGARQKT 282 Query: 212 VLIESA-------------------------GNLVNSIQKAIQEKKNLSVRIGT-IAYNI 245 + L++ + + G+ I+Y+ Sbjct: 283 EKNDCVLPYYAQQSKKEEISNLYRNNKLDQASKLLDQYMDIERTINQIDQFNGSNISYDF 342 Query: 246 GIVGNQCTPLSNNLNE-----------VKSRLNKLNPYENTNTYPAMHHAYRELYNEK-- 292 +C S V + LN+++P T M + + Sbjct: 343 INTTKKCLGKSEGKETTRAWFDKKNLGVSNALNEIDPDGGTAVTSGMFIGTNLMTDTNKD 402 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQ-NTLNTLQICEYMR--------------N 337 + + +T ++ ++ ++DGE++ + L + +C ++ Sbjct: 403 PEAAPSKLNTNTRRILLVLSDGEDNRPTEGTLVKLMSAGLCNKIKRKIDSLQDTKYPKVE 462 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 A + ++ + P + ++ ++C Q++ V + LL++F +I Sbjct: 463 ARVAFVALGYNPPQDQVNVWKQCV--GKQYYTVFSKQGLLDAFRQI 506 >gi|323342275|ref|ZP_08082507.1| hypothetical protein HMPREF0357_10688 [Erysipelothrix rhusiopathiae ATCC 19414] gi|322463387|gb|EFY08581.1| hypothetical protein HMPREF0357_10688 [Erysipelothrix rhusiopathiae ATCC 19414] Length = 1466 Score = 86.9 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 42/365 (11%), Positives = 103/365 (28%), Gaps = 44/365 (12%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 +K++K + + + + + N +I N + Sbjct: 6 RKKMKTRIIKSGLTVLLITMVLLSINTSFVSAEGNSSSSEKTITNSIQIDNMNEGEVRVF 65 Query: 125 ----PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 P + + + + I +VLD S SM+ + D ++ K Sbjct: 66 KTAKPIPNSINRWEISIDVFGRLKREPSDIVLVLDTSGSMDPQKNPQGID--RISKAKRE 123 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS----V 236 + F + + + + + L+N I+ E + Sbjct: 124 AIHFVNEIFERDASARVALVSYGTKVSSNSFHTKQESNLLINEIKSLKAEGGTFTQGALY 183 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNE----VKSRLNK--LNPYEN---------------- 274 + LS+ +K+++N+ L N Sbjct: 184 EAKMLLNQSSAPNKTIVLLSDGQPTYRYPLKAKVNQDLLRYDGNVIVQKRYNGQQRPFDI 243 Query: 275 --TNTYPAMHHAY-------RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 T++ Y +++ + T L + + G + + Sbjct: 244 GITSSSNQAIPGYRFKSRPNTNVFDYNAMVYGTGNEYYLDELGELRSQGNQNYFVYMSSA 303 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS-RELLESFDKIT 384 + + + +Y++ G D+L++ S+ ++ + S L + F KI+ Sbjct: 304 DAAIIESNQIHQEQIHLYAIGFDTDARGTDILKRI--SNNNYYDASSSRDNLDDIFKKIS 361 Query: 385 DKIQE 389 + I Sbjct: 362 NNIYS 366 >gi|294651171|ref|ZP_06728503.1| conserved hypothetical protein [Acinetobacter haemolyticus ATCC 19194] gi|292822924|gb|EFF81795.1| conserved hypothetical protein [Acinetobacter haemolyticus ATCC 19194] Length = 446 Score = 86.9 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 36/362 (9%), Positives = 90/362 (24%), Gaps = 35/362 (9%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHL--KQGSYIRENAGDIAQKAQINITKDKN 98 + + + + E +T Sbjct: 28 SIQTSDVIMPAPIAARSHIAHKAAYNAVMPTMERPRLEQDTEKYQKNEVNPVHRVTDQAV 87 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD--V 156 + + Y L L P + + + + V V Sbjct: 88 STFSIDVD-TGSYTNTRRFLNDGRLPPVDAVRIEEMINYFDYQYPQPNGVHPFSVTTETV 146 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 ++ LPP A K+ ++ ++ Sbjct: 147 DSPWKENAKLIRIGIQAKDLALQQLPPANLVFLVD--------VSGSMSAADKLPLVKQT 198 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 L ++ ++ I Y G ++ + +N L +T Sbjct: 199 LRILTEQLRAQD--------KVTIITYASGEKLVLEPTSGEQKEKILAVINGLRARGSTA 250 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ AY++ + ++ TDG+ + NTL + R Sbjct: 251 GEQAIQLAYKQAEKAFVKNGINR--------ILLATDGDFNVGITDFNTL--KGMVAEKR 300 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 +G+ + ++ + L+ + D+ G + +++ E + ++ +A Sbjct: 301 KSGISLTTLGFGTGNYNEQLMEQLADAGDGNYSYIDNKNEAKKVVQ---RQLSSTLATVA 357 Query: 396 PN 397 + Sbjct: 358 QD 359 >gi|294670381|ref|ZP_06735263.1| hypothetical protein NEIELOOT_02099 [Neisseria elongata subsp. glycolytica ATCC 29315] gi|291307845|gb|EFE49088.1| hypothetical protein NEIELOOT_02099 [Neisseria elongata subsp. glycolytica ATCC 29315] Length = 553 Score = 86.9 bits (213), Expect = 5e-15, Method: Composition-based stats. Identities = 33/321 (10%), Positives = 86/321 (26%), Gaps = 29/321 (9%) Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 E + + ++ + + Y L L P + Sbjct: 84 DTERYQKQPDQPVKAVAQEPVSTFSIDVD-TGSYANVRRFLNNGRLPPKDTVRIEEIVNY 142 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 + + KK N Sbjct: 143 FPYNYPLPTGGHPFAI------HTQTIDSPWQHEAKLIKIGIQAQDLAKKELPPANLVFL 196 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + K+ ++ ++ L ++ ++ I Y G Sbjct: 197 VDISGSMNSPEKLPLVKKTLRILTEQLRPQD--------KVTLITYASGEELVLPPTSGR 248 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N +E+ +NKL +T A+ AY + + ++ TDG+ + Sbjct: 249 NKDEILRAINKLQAGGSTAGESALKMAYEQAQKAYVKNGINR--------ILLATDGDFN 300 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 + + L + R +G+ + ++ +D++ + D+ G + +++ +E Sbjct: 301 VGVSSTDAL--KSMVAEKRKSGISLTTLGFGTGNYNEDMMEQIADAGDGNYSYIDNEKEA 358 Query: 377 LESFDKITDKIQEQSVRIAPN 397 + ++ +A + Sbjct: 359 KKVLQ---HQLTSTLATVAQD 376 >gi|325279872|ref|YP_004252414.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] gi|324311681|gb|ADY32234.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] Length = 330 Score = 86.9 bits (213), Expect = 6e-15, Method: Composition-based stats. Identities = 26/207 (12%), Positives = 73/207 (35%), Gaps = 40/207 (19%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + + + + + ++ + + PL+ + + + L ++ Sbjct: 104 LARDFTPDRLEAAKEVATKFILERPQDKIGLVVFAGESFTQCPLTTDQAVLVNLLREVKS 163 Query: 272 Y---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + T + +A L S K VI +TDG N+ + Sbjct: 164 GMIQDGTAIGLGLANAVNRL----------KDSPGKSKVVILLTDGINNQG-----AIAP 208 Query: 329 LQICEYMRNAGMKIYSVAVSAPPE--------------------GQDLLRKCTD-SSGQF 367 + E + G+++Y++ V E + +L++ + + G++ Sbjct: 209 VTAAELAKAFGIRVYTIGVGTYGEAPYPVPTPFGVQLQNMPVEIDEGVLQQIANVTGGKY 268 Query: 368 FAVNDSRELLESFDKITDKIQEQSVRI 394 F D+ +L + + +I D++++ + + Sbjct: 269 FRATDNDKLQQIYSEI-DQLEKSKIEV 294 >gi|21228580|ref|NP_634502.1| putative chloride channel [Methanosarcina mazei Go1] gi|20907073|gb|AAM32174.1| putative chloride channel [Methanosarcina mazei Go1] Length = 1004 Score = 86.5 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 39/307 (12%), Positives = 95/307 (30%), Gaps = 36/307 (11%) Query: 90 QINITKDKNNPLQYIAESKAQYEIPT-------ENLFLKGLIPSALTNLSLRSTGIIERS 142 ++ + NNP +++ + E G +S G + S Sbjct: 210 SFSLVGNYNNPNLLTVDAEVKSENVPITGLNKDNFTIEIGSKKVNDVTVSDVGEGKYKLS 269 Query: 143 SENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA 202 D++ ++ + + N +Y + +S Sbjct: 270 FNPPKQDSNGNYDLNVYVKYKKVTLSDSELNA--VRYGEDNANANANVMLVIDRSGSMSG 327 Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 I SA ++ ++ Y++ + + N + Sbjct: 328 SP-----ISSAKNSANLFIDYMEAEDMAGVVSFSSSARYDYHLATLTPEV------KNSI 376 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 K ++N + T M + +L N + ++ ++ ++DG + Sbjct: 377 KQKINSIYASGVTAIGSGMRYGLNDLLNYGDPNNPW--------AIVLLSDGYQNSGENP 428 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFD 381 N + ++ + +++Y+V + P Q LL D + G+++ +L E ++ Sbjct: 429 NN------VIPSIKASNIQVYTVGLG-PAVDQKLLGNIADQTGGKYYYSPTDSQLQEIYN 481 Query: 382 KITDKIQ 388 I KI Sbjct: 482 DIVGKII 488 >gi|332519334|ref|ZP_08395801.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332045182|gb|EGI81375.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 334 Score = 86.5 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 69/205 (33%), Gaps = 53/205 (25%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +++ L A + K + RIG + Y TP++++ + V L Sbjct: 112 NRLEALKNVASEFI---------KGRPNDRIGLVEYAGESYTK--TPITSDKSIVLRSLQ 160 Query: 268 KLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++ T + A L S K +I +TDG N+ S Sbjct: 161 EIRYNNIIEGGTAIGMGLATAVNRL----------KDSKAKSKVIILLTDGVNNSGS--- 207 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPP----------------------EGQDLLRKCT 361 +N E G+K Y++ + + LL++ Sbjct: 208 --INPKIASELAVEFGIKTYTIGLGTNGMALSPIAIKQNGQFQYGRVKVEIDETLLKEIA 265 Query: 362 D-SSGQFFAVNDSRELLESFDKITD 385 + G++F ++++L E +D+I Sbjct: 266 QVTGGKYFRATNNKKLAEIYDEINK 290 >gi|167752252|ref|ZP_02424379.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] gi|167660493|gb|EDS04623.1| hypothetical protein ALIPUT_00495 [Alistipes putredinis DSM 17216] Length = 328 Score = 86.5 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 71/202 (35%), Gaps = 41/202 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 + + + ++ +PL+ + +++ L +L + T Sbjct: 108 DRIQAAKQVAGNFITDRPGDRIGLVAFAGEAFTQSPLTTDQGTLQTLLGRLRSGVVEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A L S K +I +TDGEN+ + L E Sbjct: 168 AIGNGLATAINRLRESNAKS----------KVIILLTDGENNRG-----EIAPLTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPPE---------------------GQDLLRKCTD-SSGQFFAVNDS 373 R+ G+++Y++ V + +L + D + G++F D+ Sbjct: 213 RDQGIRVYTIGVGTRGTAPYPTVDFFGNPTVVQAKVQIDEKILGEIADLTGGRYFRATDN 272 Query: 374 RELLESFDKITDKIQEQSVRIA 395 +L +D+I +++++ V I+ Sbjct: 273 AKLQSIYDEI-NQLEKSKVEIS 293 >gi|117618125|ref|YP_856000.1| hypothetical protein AHA_1462 [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559532|gb|ABK36480.1| conserved hypothetical protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 460 Score = 86.5 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 51/454 (11%), Positives = 117/454 (25%), Gaps = 71/454 (15%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDA--------------------AVLSGC 41 A++++ I+L ++ + +A DA A L+G Sbjct: 15 FALMLTGVLALTGVVIELVRGYSGQSLLSAAADAVLYSAADSDSAAEDAAALVRANLAGR 74 Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 V + + I + ++ + + +A A+ + + + + Sbjct: 75 HLQVGPPALSQNE---QEAQVILQGEVPALMALSAIGTSGDLPVAAAARASSARTRI-EI 130 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD------ 155 + + +GL S A + + D Sbjct: 131 ALVLDVSNSMSGAPMKAIKQGLAEFGEVLFGRERRNQDRVVSIIPATGLVNIGDHPELFH 190 Query: 156 ----------VSRSMEDLYLQKHNDNN----NMTSNKYLLPPPPKKSFWSKNTTKS---- 197 + + E + L ++ T Sbjct: 191 PESLTFPFGLQTLAHERGWSNLLTREVPGRQRKAFCARLPEHVDGIDRLAELTPGWIRKL 250 Query: 198 ----------KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 + + P + + +N ++G Sbjct: 251 ELAPRGEAQPRLHYSTKPPAIQQYEDGTPLRAFAPRENPLERYLENRRDKLGIFDDPDCG 310 Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH--------NTI 299 V LS + L+ L+ NTNT + +R L + + Sbjct: 311 VSPIQAHLST-RAAYRQALDTLHAAFNTNTAEGVMWGWRLLSPQWQGRWQQGAAELPRPY 369 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 G +K ++ +DGE+ G A L +C M+ G+++Y+VA + + Sbjct: 370 GQADNRKILVLFSDGEHMGPEAALRDRKQLLLCREMKRKGIQVYTVAFEGDAR---FVAQ 426 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 C + + ++ I + + Sbjct: 427 CASERSLAYKATS-GNIRTVLTRLASAINDVVLT 459 >gi|126730249|ref|ZP_01746060.1| hypothetical protein SSE37_10854 [Sagittula stellata E-37] gi|126708982|gb|EBA08037.1| hypothetical protein SSE37_10854 [Sagittula stellata E-37] Length = 666 Score = 86.5 bits (212), Expect = 7e-15, Method: Composition-based stats. Identities = 40/401 (9%), Positives = 100/401 (24%), Gaps = 74/401 (18%) Query: 46 SDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA 105 +D T T S + + + + + ++ Y Sbjct: 284 TDGTDTAATDTGGFFSPWPQPIANIVVYFDTNGDDIYNRAHKIINFPDGSTRDIDDIYQG 343 Query: 106 ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL 165 + + ++ ++ + N S D+ + + + Sbjct: 344 AVAFLIDRDPLLFHPDQFLGISVKGGQEKNRYFQVKGDGNGPFS-----DIGPTKNNGKI 398 Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 + + +++ + P Sbjct: 399 PGDVIDFA------FIDYDAWAAYYVAPVAPVEAVNVNMP--------SSCVEIYDTEFT 444 Query: 226 KAIQEKKNLSVRIGTIAYN------IGIVGNQCTPL---SNNLNEVKSRLNKLNPYENTN 276 + + V G + T + S++ + + ++ + ++ T Sbjct: 445 NTDLPQSDDYVPHFMFWPYVREVMDWGWCPGEDTAIQYYSDDAATLSAFIDNMRMHDGTG 504 Query: 277 TYPAMHHAYRELYNEKE-----------------SSHNTIGSTRLKKFVIFITDG----- 314 + +A L +KF++ ++DG Sbjct: 505 IQYGLKYALALLDPATGSAVTELISAGLVDSRFLGRPIAWEDEETEKFIVVMSDGAVTDQ 564 Query: 315 -----------------------ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + ++ N N C+ R+ G+ +++VA Sbjct: 565 YRPVDPFAPLNGETELQTQGSGSYTTFSTRGNNLDNLHTQCQLARDLGVTVFAVAFETTD 624 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D LR C S FF V E++++FD I +I + Sbjct: 625 ADADELRLCASSDSHFFHVQ-GTEIIDAFDTIARQINNLRL 664 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 26/261 (9%), Positives = 60/261 (22%), Gaps = 32/261 (12%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 ++ + + ++D+ + IR ++Q+ LD AVL+ + + Sbjct: 66 FMLVLILVITGASVDIMYQEAIRARLQATLDRAVLAAA-------DLDQQQDPVAVVNDY 118 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 K D A +T D + + Sbjct: 119 VTKAGLVEHLTDVIATPGLYDRTVAADAGLTLD---------------------TYFLRM 157 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 + ST ++ +++ V+D+S SM + N +L Sbjct: 158 SGWQTLPVIAASTAEERIANVEISL----VMDISGSMRWNNRITNARNAAKDFVTKVLTE 213 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 + + N Q+ S+ Sbjct: 214 DSAGVTTLNLIPFAGQVNPGDVMFDYFRGVRPKIQQGNNGWGNGDQDAPGGSLCTNNAEN 273 Query: 244 NIGIVGNQCTPLSNNLNEVKS 264 + + + Sbjct: 274 ADEGAIDPSCTDGTDTAATDT 294 >gi|196233777|ref|ZP_03132616.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196222139|gb|EDY16670.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 883 Score = 86.1 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 47/383 (12%), Positives = 104/383 (27%), Gaps = 38/383 (9%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQK 88 + SA + ++ + + K + + + I +N+ + + Sbjct: 246 LDSAEPQGGAALSKAVTPKDKLAEADASKAMPVAAWARVRRGFAATSGGIGDNSYGLDDR 305 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 I N A +P L + + +R A+ Sbjct: 306 GGIADKASNANSFD-TLTENAFLNVPENPLSTFSIDVDTASYAIVRRYLNDNHLPPTGAV 364 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW---------SKNTTKSKY 199 I +L+ + M P + + + Sbjct: 365 RIEELLNYFPYDYPQPQGAAPFSATMEVATCPWAPEHRLVRVGLKGREIPKDERPPSNLV 424 Query: 200 ----APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 K+ +L + LV + R+ + Y G Sbjct: 425 FLIDVSGSMNMPNKLPLLQKCFSLLVEQLGPKD--------RVSIVTYASGT--KLVLEP 474 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + +++ ++ L+ T+ + AYR VI TDG+ Sbjct: 475 TQDKEAMQTAIDGLHAGGGTHGSSGIDLAYRMAQQSFIPGGTNR--------VILATDGD 526 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSR 374 + Q+ L L + +G+ + + +L K D G + ++ + Sbjct: 527 WNIGITNQSEL--LSMITRKAKSGVFLTVLGFGLDNLKDSMLVKLADHGNGHYAYIDTEQ 584 Query: 375 ELLESFDKITDKIQEQSVRIAPN 397 E + F D++ V IA + Sbjct: 585 EARKVF---VDQLSSTLVTIAKD 604 >gi|167549689|ref|ZP_02343448.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] gi|205325282|gb|EDZ13121.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Saintpaul str. SARA29] Length = 593 Score = 86.1 bits (211), Expect = 8e-15, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 72/190 (37%), Gaps = 22/190 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ ++ + LVN ++ I + Y G + NN +K+ ++ Sbjct: 249 RLPLIQSALKLLVNDLRAQDN--------ITIVTYAGGTHVALASTAGNNTTAIKAAIDN 300 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L+ Y +T + AY + ++F TDG+ + + + Sbjct: 301 LDAYGSTGGEAGLRLAYEQAEKGFIKGGVNR--------ILFTTDGDFNLG--ITDPKDI 350 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 + + R G+ + ++ V + ++ + D +G + ++ L E+ + D++ Sbjct: 351 EALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIDS---LSEAQKVVKDEM 407 Query: 388 QEQSVRIAPN 397 + V +A + Sbjct: 408 HQTLVTVAKD 417 >gi|307942638|ref|ZP_07657986.1| hypothetical protein TRICHSKD4_1260 [Roseibium sp. TrichSKD4] gi|307774277|gb|EFO33490.1| hypothetical protein TRICHSKD4_1260 [Roseibium sp. TrichSKD4] Length = 403 Score = 86.1 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 56/393 (14%), Positives = 122/393 (31%), Gaps = 61/393 (15%) Query: 15 YAIDLAHIMYIRNQMQSALDAAVL-SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLK 73 ID++ R+Q Q D L + + + T +K + K Sbjct: 37 VGIDMSFAYNKRDQSQLVADEVSLFAVTTFRKYVADGMSKNQARKRAETDARKFLTARTK 96 Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSL 133 E INI + ++ ++E ++ + + + Sbjct: 97 SLDGTTEKFSIK-----INIVDREAKVVKANVNISGKHES-----YMTHAMGFDNIDYTA 146 Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 S S ++DVS SM + P S S+ Sbjct: 147 DSES--TISFGQGKYEFIFLVDVSPSMGIGASNRDRQIMQRAIGCQFACHEPWYSSVSR- 203 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 A A +IDV+ ++ +LV +++A + +R G +++ + T Sbjct: 204 -------AKSAGARLRIDVVKDALKSLVTQLEEA----TEVDLRTGLYSFSN--YLHIQT 250 Query: 254 PLSNNLNEVKSRLNKLN------PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 L+ +++ K NK+ TN + + + +K+ Sbjct: 251 GLNKGISKFKREANKIAIHREYLRGGGTNFHG-------VFSDFNGVLRSLKPKADVKQH 303 Query: 308 VIFITDGE--------------NSGASAYQNTLN-TLQICEYMRNAGM-KIYSVAVSAP- 350 +I I+DG N + + + C+ + + ++++ V Sbjct: 304 IIIISDGVNHLNLRSGTNRHLWNQTPNWRPYNYSFNPRWCDEFKKGEVRTVHTMLVEPDR 363 Query: 351 ----PEGQDLLRKCTDSSGQFFAVNDSRELLES 379 +R C S+ F++ N + E+ ++ Sbjct: 364 AHYVRASTSSMRACATSADFFYSANSAAEIDKA 396 >gi|168817956|ref|ZP_02829956.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|205345018|gb|EDZ31782.1| von Willebrand factor, type A [Salmonella enterica subsp. enterica serovar Weltevreden str. HI_N05-537] gi|320086747|emb|CBY96519.1| Inter-alpha-trypsin inhibitor heavy chain H3 Inter-alpha-inhibitor heavy chain 3; ITI heavy chain H3; ITI-HC3; Flags: Precursor [Salmonella enterica subsp. enterica serovar Weltevreden str. 2007-60-3289-1] Length = 604 Score = 86.1 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 60/159 (37%), Gaps = 14/159 (8%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y G + NN +K+ ++ L+ Y +T + AY + Sbjct: 283 IVTYAGGTHVALASTAGNNTTAIKAAIDNLDAYGSTGGEAGLRLAYEQAEKGFIKGGVNR 342 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 ++ TDG+ + + + + + R G+ + ++ V + ++ + Sbjct: 343 --------ILLTTDGDFNLG--ITDPKDIEALVKKEREKGITLSTLGVGDDNFNEAMMVR 392 Query: 360 CTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 D +G + ++ L E+ + D++ + V +A + Sbjct: 393 IADVGNGNYSYIDS---LSEAQKVLKDEMHQTLVTVAKD 428 >gi|223558081|gb|ACM91085.1| aerotolerance protein BatA [uncultured bacterium Rlip1] Length = 332 Score = 86.1 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 68/204 (33%), Gaps = 40/204 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 + + + + + PL+ + + + L ++ + T Sbjct: 112 DRLTAAKNVASDFVKGRPGDRMGLVIFSGETFTQVPLTTDHGVMLNMLAEMKNGLIDDGT 171 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A L S + K VI +TDG N+ S ++ E Sbjct: 172 AIGDGLATAISRL----------KDSEAISKVVILLTDGMNNAGS-----VDPYTAAEIA 216 Query: 336 RNAGMKIYSVAVSAPPE--------------------GQDLLRKCTD-SSGQFFAVNDSR 374 + G+++Y++ V + + LL + G++F ++ Sbjct: 217 KLYGIRVYTIGVGSYGTAPYPVQTPFGTQIQQMKVEIDEKLLASVASMTGGKYFRATSNQ 276 Query: 375 ELLESFDKITDKIQEQSVRIAPNR 398 +L E +++I DK++ + + R Sbjct: 277 KLDEIYEEI-DKLERSKIEVTEFR 299 >gi|16124454|ref|NP_419018.1| hypothetical protein CC_0199 [Caulobacter crescentus CB15] gi|221233138|ref|YP_002515574.1| hypothetical protein CCNA_00199 [Caulobacter crescentus NA1000] gi|13421322|gb|AAK22186.1| hypothetical protein CC_0199 [Caulobacter crescentus CB15] gi|220962310|gb|ACL93666.1| hypothetical protein CCNA_00199 [Caulobacter crescentus NA1000] Length = 626 Score = 86.1 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 66/243 (27%), Gaps = 41/243 (16%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG---- 246 S K +++ A + S V Sbjct: 382 STVPGCEKIRFTNVDGYERVNSQSTCATERIGSQAYTDAAPSTAYVGSHYPTAGSSSSTV 441 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE-------LYNEKESSHNTI 299 TPLS + +K+++N L T + + L+ Sbjct: 442 CPTATITPLSTDKTALKAQINGLTVGGATAGQIGLAWGWYMVAPNFGYLWPNASQRPAAY 501 Query: 300 GSTRLKKFVIFITDGENS--------------------------GASAYQNTLNTLQICE 333 + L K VI +TDG + A+ + ++C+ Sbjct: 502 KARDLMKVVILMTDGGFNMTYCNSVVARNIGSGTNIGDDERINCDATNGSSFDQAAELCD 561 Query: 334 YMRN--AGMKIYSVAV--SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 ++ + +Y+V ++ L C S+ + + EL SF I +I Sbjct: 562 SIKASANDITLYTVGFTVGNDQTARNFLTNCASSTDKAYFPATGSELKASFQAIAQEISN 621 Query: 390 QSV 392 + Sbjct: 622 LRI 624 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/260 (14%), Positives = 67/260 (25%), Gaps = 3/260 (1%) Query: 17 IDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGS 76 +D+ + R QMQ ALDAA L S + D T + I + + Sbjct: 49 LDVGRLSLQRRQMQDALDAATLMAARSTATSSADLDTTGDAAFLAEIAGMNLGLTASSST 108 Query: 77 YIRENAGDIAQKAQINITKDKNNPLQY-IAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 + + A + N Q A E+ + L+ + +T Sbjct: 109 FSAGTNNRVIGTATATLRPIIANLWQSGNFTVTASSEVVRASKNLEIALVLDITGSMGNG 168 Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 T I + + +V D + + + K T Sbjct: 169 TRIADLKVAAADLVDVLVRDTQTPFYS--KMALVPYSAGVNVGATYADAVRGPVPVKTIT 226 Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + +A A + I + VR T + + P Sbjct: 227 GAAWASGSARSITGITRANPAVVTASGHGLSTGDYVYITGVRGMTSVNDKIYRVTRSDPD 286 Query: 256 SNNLNEVKSRLNKLNPYENT 275 +LN + T Sbjct: 287 KVSLNSTNTSSASNYTNGGT 306 >gi|27367909|ref|NP_763436.1| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|27359482|gb|AAO08426.1| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 323 Score = 86.1 bits (211), Expect = 9e-15, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 67/203 (33%), Gaps = 47/203 (23%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 ++ + + R+G + + TPL+ + V Sbjct: 107 GDYIDRLSAVKNVVTQFIEQ---------RQGDRLGLVLFADHAYLQ--TPLTADRQTVA 155 Query: 264 SRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++LN+ + T + A + + S ++ VI ++DG N+ Sbjct: 156 NQLNQTIIGLIGQKTAIGDGLALATKTFVD----------SEAPQRVVILLSDGSNTAG- 204 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD- 362 TL+ ++ + G+KIY++ + A + L K Sbjct: 205 ----TLDPIEAANIAKKYGVKIYTIGIGAGEMEVKQFFMTRKVNTSADLDEKTLTKVATM 260 Query: 363 SSGQFFAVNDSRELLESFDKITD 385 + GQ+F D++EL + I Sbjct: 261 TGGQYFRARDAQELQTIYQAINQ 283 >gi|150024244|ref|YP_001295070.1| BatA protein [Flavobacterium psychrophilum JIP02/86] gi|149770785|emb|CAL42250.1| BatA protein [Flavobacterium psychrophilum JIP02/86] Length = 333 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 58/179 (32%), Gaps = 44/179 (24%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELY 289 S RIG + Y TP++++ V +N + + T + A L Sbjct: 128 QSDRIGVVVYTAEAYTK--TPVTSDKAVVLDAINTIKYDNVLQDGTGIGMGLATAVNRL- 184 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 S K +I +TDG N+ + + E+ + G+K+Y++ + Sbjct: 185 ---------KDSKAKSKVIILMTDGVNNAG-----FIEPVTAAEFAKEFGIKVYTIGIGT 230 Query: 350 PPE----------------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + L++ + G++F + L + +I Sbjct: 231 NGNAPFPYAIAPNGGFLYKMLPVEIDEQLMKDIAKKTGGKYFRAQSNSSLESIYSEINK 289 >gi|319425442|gb|ADV53516.1| lipoprotein with VWA and DUF3520 domains [Shewanella putrefaciens 200] Length = 638 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 40/363 (11%), Positives = 95/363 (26%), Gaps = 36/363 (9%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A L+ + I S + K ++ + ++ I +I I+ Sbjct: 108 ASLA-AKQRSAKHVINTHYVAAPIASDAW--YGIKQPERNRFEKQIQNGIMVAGEIPIS- 163 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + Y + L + + Sbjct: 164 ----TFSIDVD-TGSYSTLRRMIKEGSLPEKGTIRIEEMLNYFTYDY----PLPNKNAAP 214 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 S + E ++D + + S + K+ +L Sbjct: 215 FSVTTELAPSPYNDDMMLLRIGLKGYELTKSELGASNLVFL-LDVSGSMASADKLPLLQT 273 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + L + ++ + Y +++ + L +L +T Sbjct: 274 ALKMLTQQLSAQD--------KVSIVVYAGAAGVVLDGASGDDIQALTYALEQLRAGGST 325 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N + AY+ VI TDG+ + N + + E Sbjct: 326 NGSQGILQAYQLAQKHFIQGGINR--------VILATDGDFNVGV--TNFDQLISLIEKE 375 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + G+ + ++ L+ + D G + ++ L E+ + D++ + I Sbjct: 376 KQRGIGLTTLGFGMGNYNDQLMEQLADKGNGHYAYIDT---LNEARKVLVDELSSTLLTI 432 Query: 395 APN 397 A + Sbjct: 433 AKD 435 >gi|34558787|gb|AAQ75132.1| BatA protein [Alvinella pompejana epibiont 6C6] Length = 300 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 57/145 (39%), Gaps = 21/145 (14%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 G +PL+ +L + + ++ NT A+ A + L + Sbjct: 133 GSFSFSASPLTYDLKALLEMFDLMSDVGIAGNNTAIGDAIFEAIKNLESG---------- 182 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 K +I +TDG+++ + G+KIY+V + + LL K Sbjct: 183 EAKSKVIILLTDGKHNFGKKSPKEGVV-----EAKKRGIKIYTVGIGTD-YDKKLLEKMA 236 Query: 362 -DSSGQFFAVNDSRELLESFDKITD 385 +++ + F +S+EL E F +I + Sbjct: 237 KETNAKSFFAKNSKELEEVFKEIEE 261 >gi|332232505|ref|XP_003265445.1| PREDICTED: collagen alpha-5(VI) chain [Nomascus leucogenys] Length = 2526 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 64/167 (38%), Gaps = 13/167 (7%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN-KLNPYENTNTYPAMHHAYREL 288 + V+ G + Y+ +N + + L + + NT T A+ HA Sbjct: 845 DVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRMRRDTGGNTYTAKALKHANALF 904 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E H + + +K+ +I ITDG++ +T +R+ G+ I++V V Sbjct: 905 TEE----HGSRINQNVKQMLIVITDGKSHDHDQLNDT------ASELRDKGITIFAVGVG 954 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 Q L + V++ +L + F + + + ++ + Sbjct: 955 K--ANQKELEGMAGNKNNAIYVDNFDKLKDVFTLVQESMCTEASEVC 999 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 19/166 (11%), Positives = 58/166 (34%), Gaps = 16/166 (9%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHA 284 + VR+G + Y+ +N +++ + + T T A+ + Sbjct: 469 TEMFSIGPDKVRVGVVQYSDDTEVEFYITDYSNDIDLRKAILNIKQITGGTYTGRALDYI 528 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + + N + + + ++I +TDG + ++ + +R + I++ Sbjct: 529 LQIIKNGMKDRMSK-----VPCYLIVLTDGM--------SADRVVEPAKRLRAEQITIHA 575 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 V + + L++ + + L +++ +I + Sbjct: 576 VGIG--AANKIELQEIAGKEERVSFGQNFDALKSIKNEVVREICTE 619 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 52/165 (31%), Gaps = 17/165 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + +IG + ++ E+ +++++ E T T A++ Sbjct: 656 TKIQIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVG 715 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + K + KF+I ITDG +R + I+SV Sbjct: 716 QYFTHSKGARLGAK------KFLILITDGVARDDVRDP--------ARILRGKDVTIFSV 761 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 V + L + + F V + L K+ ++ Sbjct: 762 GV--YNANRSQLEEISGDGSLVFHVENFDHLKALERKLVFRVCAL 804 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 18/162 (11%), Positives = 50/162 (30%), Gaps = 17/162 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN-KLNPYENT-NTYPAMHHAYRELYNE 291 R+ Y+ + + + L + A+ A+R ++ Sbjct: 65 NKYRVALAQYSDEFHSEFHLSTFKGRSPMLNHLKKNFQFIGGSLQIGKALREAHRTYFSA 124 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + V+ + ++ + + ++ G+KI ++V Sbjct: 125 HTNGRDKK--QFPPILVVLAS---------AESEDEVEEASKALKKDGVKI--ISVGVQE 171 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ L+ S F + R+L F + +I + + Sbjct: 172 ASEENLKAMATSH-FHFNLRTIRDLST-FSQNMTQIIKDVTK 211 >gi|193214188|ref|YP_001995387.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193087665|gb|ACF12940.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 340 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 32/202 (15%), Positives = 67/202 (33%), Gaps = 42/202 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY----EN 274 N + + + + + + PL+ + + + +++L + Sbjct: 119 NRIEAAKSVATDFIHQRLSDRIGLVVFSGKSFTQCPLTLDYRLLTNFISELKAGTIEEDG 178 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T A+ A L ST K +I +TDG+N+ + + E Sbjct: 179 TAIGTAIATATNRL----------RESTAKSKVIILLTDGQNNAG-----EIEPVTAAEL 223 Query: 335 MRNAGMKIYSVAVSA----------------------PPEGQDLLRKCTDSSGQFFAVND 372 G+KIY+V + L R S G++F D Sbjct: 224 AAALGIKIYTVGAGTRGYARYPIPDPLFGKRYVQMKVDVDDSTLTRIARISGGRYFRATD 283 Query: 373 SRELLESFDKITDKIQEQSVRI 394 L +++ +I D++++ V + Sbjct: 284 LESLKKTYHEI-DELEKTKVEV 304 >gi|283778313|ref|YP_003369068.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283436766|gb|ADB15208.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 591 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 51/361 (14%), Positives = 100/361 (27%), Gaps = 40/361 (11%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 TA+++ V I +A+D+ ++ ++ Q+Q ++DAA L+G S+V I + Sbjct: 30 TAVLMVVMLGMIAFAVDVGYMYTMQTQLQRSVDAAALAGAGSLVEGTDIAQAKATEYLVR 89 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK--------AQYEI 113 + ++ + + + N E+ E Sbjct: 90 NPVGSSMTFVNEEEVPAKIAQFVAEHGDDFEVEAGEWNASTRSFETTNTLPSTLSVSMEY 149 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL---QKHND 170 PT F ++ ++ S + I +VLD S SM D Sbjct: 150 PTMPTFFGKILGKDSFSIRASSVAMY------QPRDIMVVLDFSGSMNDDSTFEAFGKLG 203 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + + SN + T + K+A D + Sbjct: 204 RSWVESNLQQCWADIGNPTYGSLTFEPKWANCKGAVPT--DGSKPQIY---VEYRNTSVY 258 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY----R 286 + + G + LS + +TN+ + + Sbjct: 259 VTSTLNLENVVLQFSGGTRQTFSGLSAKTGTFQ--------GSSTNSGKQITKVWVKSGN 310 Query: 287 ELYNEKESSHNTIG------STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 L E + + +KK + S Q +NAG Sbjct: 311 NLSGEGTNYGEPFDFSSSNMNNMVKKAFGLNSVSYPYNGSWDAYIDYCEQSSNSNKNAGY 370 Query: 341 K 341 + Sbjct: 371 R 371 >gi|170767616|ref|ZP_02902069.1| von Willebrand factor type A domain protein [Escherichia albertii TW07627] gi|170123950|gb|EDS92881.1| von Willebrand factor type A domain protein [Escherichia albertii TW07627] Length = 586 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 40/364 (10%), Positives = 106/364 (29%), Gaps = 24/364 (6%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 VL+ + + + + + + K A Sbjct: 69 VLAETTQNETQQYTDQQALQGQLQAAPAYESVAKAKATRISNLGTARYQQFDDNPVKQVA 128 Query: 97 KNNPLQYIAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 +N + + Y L L P + N SI Sbjct: 129 QNPLATFSLDVDTGSYANVRRFLNQGQLPPPDAVRVEEMVNYFPSDWVINDKQSIPASKP 188 Query: 156 VSRSMEDLYLQKHNDNNNMTS-NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + +M + L ++ + N ++ ++ ++ Sbjct: 189 IPFAMRYELAPAPWNEQRTLLKVDILAQDLKSEALPASNLVFLIDTSGSMYSDERLPLIQ 248 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 S LV +++ I + Y + N+ +E+ + ++ LN + Sbjct: 249 SSLKLLVKELREQDN--------ISIVTYAGDSRIALPSTSGNHKDEINAAIDSLNARGS 300 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 TN + AY++ ++ TDG+ + + + + + Sbjct: 301 TNGGAGLEMAYQQAAKGFIKGGVNR--------ILLATDGDFNVG--IDDPKSIESMVKK 350 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 R +G+ + ++ V + ++ + D +G + ++ L E+ + ++ + V Sbjct: 351 QRESGVTLSTLGVGRDNYNEAMMVRIADVGNGNYSYIDT---LSEAQKVLNSEMHQTLVT 407 Query: 394 IAPN 397 +A + Sbjct: 408 VAKD 411 >gi|163850366|ref|YP_001638409.1| hypothetical protein Mext_0933 [Methylobacterium extorquens PA1] gi|163661971|gb|ABY29338.1| conserved hypothetical protein [Methylobacterium extorquens PA1] Length = 473 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 63/461 (13%), Positives = 134/461 (29%), Gaps = 90/461 (19%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + + +D + + ++ +A DAA L+G + + + T Sbjct: 25 LFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGVVT-AKEFIAANAQQSDVMT 83 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + I + + + +Q+ I + ++ Y ++ Sbjct: 84 AGIKAGEYQALKAFNVNASKVPFATVSLSQLEIVRS-----GQTLDATVSYTATVQST-F 137 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L + T L+ R + + + +++DVS SM D++ Sbjct: 138 GRLFGLSATTLTNRVNASADIA---GYLDFYLMVDVSGSMGLPTT----DSDAALLASKS 190 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + F + + A + D + + L++ K I RIG Sbjct: 191 VEDQGNCQFACHFPNRKGWNLAAGKIQLRSDAVNNAVCALLDRASKPIV---PNQYRIGI 247 Query: 241 IAYNIG-IVGNQCTPLSNNLNEVKSRLN----------KLNPYENT-------------N 276 + + + +L +K+ + L +T + Sbjct: 248 YPFINRLATLAPLSDTTTSLASLKTTADCGKAWPLAFTNLLDTGSTQLFTNNDPTTGTGS 307 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA--------------- 321 A ++ + + N S K FV ITDG + + Sbjct: 308 GGTHFETALPQMKSTIRTFGNGSSSANPKPFVFLITDGMQNSQTYSSWKDKKTYPGNPSK 367 Query: 322 -------YQNTLNTLQI----CEYMRNAGMKIYSVAV----------------------S 348 + QI C ++ AG I + + Sbjct: 368 FAGYRYADWDGSQPAQIDPAKCADLKKAGATISILYIPYNYVKSYTNEGTIVWENNRVNG 427 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 P D LR+C G FF N ++++ S + D+ + Sbjct: 428 FSPTLADPLRQCAS-PGLFFTANSAKDITASLGAMFDQALK 467 >gi|120599917|ref|YP_964491.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|120560010|gb|ABM25937.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] Length = 638 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 40/363 (11%), Positives = 96/363 (26%), Gaps = 36/363 (9%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A L+ + I S + K ++ + ++ I +I I+ Sbjct: 108 ASLA-AKQRSAKHVINTHYVAAPIASDAW--YGIKQPERNRFEKQIQNGIMVAGEIPIS- 163 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + Y + L + + Sbjct: 164 ----TFSIDVD-TGSYSTLRRMIKEGSLPEKGTIRIEEMLNYFTYDY----PLPNKNAAP 214 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 S + E ++D + + S + K+ +L Sbjct: 215 FSVTTELAPSPYNDDMMLLRIGLKGYELTKSELGASNLVFL-LDVSGSMASADKLPLLQT 273 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + L + ++ + Y +++ + L +L +T Sbjct: 274 ALKMLTQQLSAQD--------KVSIVVYAGAAGVVLDGASGDDIQALTYALEQLRAGGST 325 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N + AY+ VI TDG+ + + L + + E Sbjct: 326 NGSQGILQAYQLAQKHFIQGGINR--------VILATDGDFNVGVTNFDLL--ISLIEKE 375 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + G+ + ++ L+ + D G + ++ L E+ + D++ + I Sbjct: 376 KQRGIGLTTLGFGMDNYNDQLMEQLADKGNGHYAYIDT---LNEARKVLVDELSSTLLTI 432 Query: 395 APN 397 A + Sbjct: 433 AKD 435 >gi|156616290|ref|NP_001096078.1| collagen alpha-6(VI) chain precursor [Homo sapiens] gi|189082902|sp|A6NMZ7|CO6A6_HUMAN RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2263 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 41/335 (12%), Positives = 100/335 (29%), Gaps = 31/335 (9%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 IKK K+ R + QI + + + ++ +F Sbjct: 305 KAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRDSEDNVTKAAVNLRREGVTIFT 364 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNNMTS 176 G+ ++ T L ++ E+ L + ++ L + + Sbjct: 365 LGIEGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHTVSVFSERTET 424 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 K + + + +V A + V Sbjct: 425 LKSGCVDTEEADIYLLIDGSGSTQ------ATDFHEMKTFLSEVVGMFNIAPHK-----V 473 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESS 295 R+G + Y +N ++ + + TNT A++ L K+ Sbjct: 474 RVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQR 533 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N + ++ +T+G + + L+ +R +++Y++ + Q Sbjct: 534 GNK-----VPCHLVVLTNGM--------SKDSILEPANRLREEHIRVYAIGI--KEANQT 578 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 LR+ + + V+D L + +++ +I + Sbjct: 579 QLREIAGEEKRVYYVHDFDALKDIRNQVVQEICTE 613 Score = 73.4 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 50/355 (14%), Positives = 106/355 (29%), Gaps = 36/355 (10%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + I++ ++ ++A Sbjct: 839 ADVGKNQVRFGALKYADDPEVLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHM 898 Query: 117 NL------FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L V D Sbjct: 899 FTEARGSRLNKGVPQVLIVITDGESHDADKLNATAKALRDKGIL-VLAVGIDGANPVELL 957 Query: 171 NNNMTSNKYLLPPPPKKSF--WSKNTTKSKYAPAPAPANRKIDVL---IESAGNLVNSIQ 225 +S+KY +S T + K+D++ S N + Sbjct: 958 AMAGSSDKYFFVETFGGLKGIFSDVTASVCNSSKVDCEIDKVDLVFLMDGSTSIQPNDFK 1017 Query: 226 KA---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENT 275 K + VRIG ++ E+ ++ + NT Sbjct: 1018 KMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNT 1077 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A+ RE+ + + +T + ++ +TDG++ Q E + Sbjct: 1078 HIGAAL----REVEHYFRPDMGSRINTGTPQVLLVLTDGQSQD--------EVAQAAEAL 1125 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 R+ G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 1126 RHRGIDIYSVGIG--DVDDQQLIQITGTAEKKLTVHNFDELKKVNKRIVRNICTT 1178 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + K Sbjct: 657 DRVQIGVVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTK 716 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE + +R G+ IYSV + Sbjct: 717 GARP------NIRKFLILITDGEAQD--------IVKEPAVVLRQEGVIIYSVGVFGSNV 762 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 763 ---TQLEEISGRPEMVFYVENFDILQRIEDDLVFGICSPR 799 >gi|311268548|ref|XP_003132103.1| PREDICTED: collagen alpha-5(VI) chain-like [Sus scrofa] Length = 2519 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 37/370 (10%), Positives = 104/370 (28%), Gaps = 34/370 (9%) Query: 29 MQSALDAA-VLSG--CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDI 85 + S D A LS ++ ++ + + ++K ++G + Sbjct: 274 LMSYSDRAETLSVLKSSTSQAEFQKQIQKLSLRAGKSNVGAAVEKMRREGFSASSGSRRA 333 Query: 86 AQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS--LRSTGIIERSS 143 QI + + E+ + +F G+ + T L + S Sbjct: 334 LGVPQIAVLVTNRPSDDEVREATLNLRLDDVTVFAMGIHGANKTQLEEIVSYPPRQTISM 393 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNN--MTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 + + + +K + + Sbjct: 394 LESYADLGNYTNNFLKKLQNEIWSQISTKAEQKELDKTGCVDTKEVDIYFLID------- 446 Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + D + ++ I+ VR+G + Y+ Sbjct: 447 --GSGSIRSDHFEQIKKFMLEVIENFDIGPDK--VRVGAVQYSDTREKEFDITDYTTDET 502 Query: 262 VKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++ ++ + T T A+ + + +E N + ++I +TDG Sbjct: 503 LRKAISNIRQLGGGTYTGEALDFILQIIKKGREQRINK-----VPCYLIVLTDGM----- 552 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + + L+ E +R + ++++ + + L++ + + L Sbjct: 553 ---SMDDVLEPAEKLRAENIAVHAIGIG--EANRTQLQQIAGKEERVSFGQNFDSLKNIK 607 Query: 381 DKITDKIQEQ 390 +++ I + Sbjct: 608 NEVLHSICTE 617 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 59/160 (36%), Gaps = 15/160 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + V+ G + Y+ + L++N + + L L+ T T A+ A Sbjct: 843 DVGPNRVQFGALRYSNE--PDIIFYLNSNRSAIMEYLRSLSAKGGDTYTAKALERANILF 900 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E H + +K+ +I ITDG++ + +R G+ IY+V V Sbjct: 901 TEE----HGSRIKQNVKQMLIIITDGKSHDHIHLSDK------ASKLRAKGIIIYAVGVG 950 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 Q+ L + V++ L + + + + + Sbjct: 951 --EANQEELETMAGNKHYTIHVSNFDSLKDVYQPLQESMC 988 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 31/158 (19%), Positives = 59/158 (37%), Gaps = 17/158 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 RIG + ++ E+ + +++P E NT T A+ + Sbjct: 661 DKTRIGVVQFSDVTKEEFKLDTYFTQKEISDAIERMSPIEQNTLTGKAL----TSIEPYF 716 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 S ++KF+I ITDGE + + +R+ G+ I++V V Sbjct: 717 TESKGARS--MVRKFLILITDGEAQD--------DVRNPAKVLRDKGVVIFAVGVFR--A 764 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L + + S F V +L E +K+ ++ Sbjct: 765 NRTQLEEISGDSSLVFQVESFSDLQEIENKLIFRVCAL 802 >gi|297671963|ref|XP_002814093.1| PREDICTED: collagen alpha-6(VI) chain-like [Pongo abelii] Length = 2291 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 42/335 (12%), Positives = 100/335 (29%), Gaps = 31/335 (9%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 IKK K+ R + QI + + + ++ +F Sbjct: 333 KAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRDSEDNVTKAAVNLRREGVTIFT 392 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNNMTS 176 G+ ++ T L ++ E+ L + ++ L + + Sbjct: 393 LGIEGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHTVSVFSERTET 452 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 K + + + +V A + V Sbjct: 453 LKSGCVDTEEADIYLLIDGSGSTQ------ATDFHEMKTFLSEVVGMFNIAPHK-----V 501 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESS 295 R+G + Y +N ++ + + TNT A++ L K+ Sbjct: 502 RVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQR 561 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N + ++ +T+G + + L+ +R +++Y++ V Q Sbjct: 562 GNK-----VPCHLVVLTNGM--------SKDSILEPANRLREEHIRVYAIGV--KEANQT 606 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 LR+ + + V+D L + +++ +I + Sbjct: 607 QLREIAGEEKRVYYVHDFDALKDIRNQVVQEICTE 641 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 48/355 (13%), Positives = 105/355 (29%), Gaps = 36/355 (10%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + I++ ++ ++A Sbjct: 867 ADVGKNQVRFGALKYADDPEVLFYLDDFGTKLEVISVLQNDQAMGGNTYTAEALGFSDHM 926 Query: 117 NL------FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L V Sbjct: 927 FTEARGSRLNKGVPQVLIVITDGESHDADKLNATAKALRDKGIL-VLAVGIAGANPVELL 985 Query: 171 NNNMTSNKYLLPPPPKKSF--WSKNTTKSKYAPAPAPANRKIDVL---IESAGNLVNSIQ 225 +S+KY +S T + K+D++ S N + Sbjct: 986 AMAGSSDKYFFVETFGGLKGIFSDVTASVCNSSKVDCEIDKVDLVFLMDGSTSIQQNDFK 1045 Query: 226 KA---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENT 275 K + VRIG ++ E+ ++ + NT Sbjct: 1046 KMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPLGAFIGEKEISFQIENIKQIFGNT 1105 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A+ R++ + + +T + ++ +TDG++ Q E + Sbjct: 1106 HIGAAL----RKVEHYFRPDMGSRINTGTPQVLLVLTDGQSQD--------EVAQAAEAL 1153 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 R+ G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 1154 RHRGIDIYSVGIG--DVDDQQLIQITGTAEKKLTVHNFDELKKVNKRIVRNICTT 1206 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + K Sbjct: 685 DRVQIGAVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTK 744 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE + +R G+ IYSV + Sbjct: 745 GARP------NVRKFLILITDGEAQD--------IVKEPAIALRQEGVIIYSVGVFGSNV 790 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 791 ---TQLEEISGRPEMVFYVENFDILQRVEDDLVFGICSPR 827 >gi|218778177|ref|YP_002429495.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218759561|gb|ACL02027.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 558 Score = 85.7 bits (210), Expect = 1e-14, Method: Composition-based stats. Identities = 41/305 (13%), Positives = 91/305 (29%), Gaps = 31/305 (10%) Query: 95 KDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 D + + A Y L + P + + + Sbjct: 105 YDPLSTFSIDVD-TASYSNVRRFLSYGNMPPVDAVRIEEMINYFHYDYPQPKGQDPFSI- 162 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 +ME + DN + K S + K+ ++ Sbjct: 163 ----TMEMSQCPWNRDNMLVHVGLQGRCLDYKDVKPSNLVFL-LDVSGSMNSENKLPLVK 217 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 S LV + R+ + Y + + N ++ + L++L + Sbjct: 218 RSMEMLVKELGAGD--------RVSIVTYAGSAGLVLPSTSARNKRKIITALDRLEAGGS 269 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T + AYR E+ + VI TDG+ + + L +++ E Sbjct: 270 TAGGEGIELAYRV---AWENLIPEGNN-----RVILCTDGDFNVGVSSTPEL--VRMIEE 319 Query: 335 MRNAGMKIYSV-AVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSV 392 R AG+ + ++ + + +++ G F+ ++ RE + F ++ Sbjct: 320 KRRAGIYL-TICGFGMGNYKDEKMEAISNAGNGNFYYIDSRREAHKVF---VQDMRANMF 375 Query: 393 RIAPN 397 +A + Sbjct: 376 TLAKD 380 >gi|146340337|ref|YP_001205385.1| hypothetical protein BRADO3364 [Bradyrhizobium sp. ORS278] gi|146193143|emb|CAL77155.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 470 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 50/473 (10%), Positives = 125/473 (26%), Gaps = 103/473 (21%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+++ + A+D + + Q+ + + Sbjct: 20 IFALMMVPTIYLLGMALDYTQALRKQGQL--------------DAAADAAAIAAVRPAML 65 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL-F 119 S +K + + +IT + + Q ++ Sbjct: 66 SVTDTSVVKNTAAAVFATKAAMNGLTAVPTPDITVTDSG-----LQRTIQVSYVARSINN 120 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 ++ S + +T ++ ++LD S SM Q DN ++K Sbjct: 121 FPSVLGSPSWQVKGSATAQAS---SAPNMNFYLLLDDSPSMAIAATQTDIDNLIAATSKQ 177 Query: 180 L-----------LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 P S S +T + +ID+++ + L+ Sbjct: 178 PGGSKNCGFACHEVHPNLDSGASSSTVDNLSIARSKSITLRIDLVVNAVKQLLVGPYTCP 237 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK---------SRLNKLN--------- 270 Q + V A N NLN ++ ++++ + Sbjct: 238 QAGISGGVMQCMSAINNTTYKAAIYTFDYNLNTIQTLTSPSSAGTKISNIQLLTVDHQNC 297 Query: 271 -------PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 T+ A+ + + + ++ V +TDG A Sbjct: 298 VTTAICNTDFGTDISGALGGVNAIMPD--PGTGTNQSGDTPQEVVFLVTDGVEDKLIASS 355 Query: 324 NTLNTL--------------------QICEYMRNAGMKI---YSVAVSAPPEG------- 353 + + +C+ ++ ++I Y+ + + Sbjct: 356 SGCDPKATYPLPAAGSQVRCQQPLNTAVCDTIKKRNIRIAILYTEYLQLTTDNWYNSRIA 415 Query: 354 ------------QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 L+ C S F V ++ ++ K+ ++ + ++ Sbjct: 416 QFNSPSSLSGTIAQRLQACASSPDFFATVQTGGDISDALTKLFLRVASSTAKL 468 >gi|254372185|ref|ZP_04987677.1| conserved hypothetical protein [Francisella tularensis subsp. novicida GA99-3549] gi|151569915|gb|EDN35569.1| conserved hypothetical protein [Francisella novicida GA99-3549] Length = 339 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 74/202 (36%), Gaps = 43/202 (21%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSV-----RIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 I + ++ G + + ++ R+G I + TPL+ ++ VK Sbjct: 112 IQDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQ--TPLTFDIATVKK 169 Query: 265 RLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 L+ + P T A+ A ++L S K +I +TDGEN+ Sbjct: 170 MLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS----------KALILLTDGENNSG-- 217 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVS-----------------APPEGQDLLRKCTD-S 363 TL LQ E + +KIY++ + + +L K + Sbjct: 218 ---TLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMT 274 Query: 364 SGQFFAVNDSRELLESFDKITD 385 G++F +S +L + ++ I Sbjct: 275 GGKYFRAQNSSDLKKVYESIDK 296 >gi|332817903|ref|XP_003310057.1| PREDICTED: collagen alpha-6(VI) chain [Pan troglodytes] Length = 2263 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 40/335 (11%), Positives = 100/335 (29%), Gaps = 31/335 (9%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 IKK K+ R + QI + + + ++ +F Sbjct: 305 KAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRDSEDNVTKAAVNLRREGVTIFT 364 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNNMTS 176 G+ ++ T L ++ E+ L + ++ L + + Sbjct: 365 LGIEGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHTVSVFSERTET 424 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 K + + + +V A + V Sbjct: 425 LKSGCVDTEEADIYLLIDGSGSTQ------ATDFHEMKTFLSEVVGMFNIAPHK-----V 473 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESS 295 R+G + Y +N ++ + + TNT A++ L K+ Sbjct: 474 RVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQR 533 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N + ++ +T+G + + L+ +R +++Y++ + Q Sbjct: 534 GNK-----VPCHLVVLTNGM--------SKDSILEPANRLREEHIRVYAIGI--KEANQT 578 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 LR+ + + V++ L + +++ +I + Sbjct: 579 QLREIAGEEKRVYYVHNFDALKDIRNQVVQEICTE 613 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 50/355 (14%), Positives = 106/355 (29%), Gaps = 36/355 (10%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + I++ ++ ++A Sbjct: 839 ADVGKNQVRFGALKYADDPEVLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHM 898 Query: 117 NL------FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L V D Sbjct: 899 FTEARGSRLNKGVPQVLIVITDGESHDADKLNATAKALRDKGIL-VLAVGIDGANPMELL 957 Query: 171 NNNMTSNKYLLPPPPKKSF--WSKNTTKSKYAPAPAPANRKIDVL---IESAGNLVNSIQ 225 +S+KY +S T + K+D++ S N + Sbjct: 958 AMAGSSDKYFFVETFGGLKGIFSDVTASVCNSSKVDCEIDKVDLVFLMDGSTSIQPNDFK 1017 Query: 226 KA---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENT 275 K + VRIG ++ E+ ++ + NT Sbjct: 1018 KMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNT 1077 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A+ RE+ + + +T + ++ +TDG++ Q E + Sbjct: 1078 HIGAAL----REVEHYFRPDMGSRINTGTPQVLLVLTDGQSQD--------EVAQAAEAL 1125 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 R+ G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 1126 RHRGIDIYSVGIG--DVDDQQLIQITGTAEKKLTVHNFDELKKVNKRIVRNICTT 1178 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + K Sbjct: 657 DRVQIGVVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTK 716 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE + +R G+ IYSV + Sbjct: 717 GARP------NIRKFLILITDGEAQD--------IVKEPAVVLRQEGVIIYSVGVFGSNV 762 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 763 ---TQLEEISGRPEMVFYVENFDILQRIEDDLVFGICSPR 799 >gi|114589213|ref|XP_516745.2| PREDICTED: hypothetical protein [Pan troglodytes] Length = 1859 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 40/335 (11%), Positives = 100/335 (29%), Gaps = 31/335 (9%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 IKK K+ R + QI + + + ++ +F Sbjct: 525 KAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRDSEDNVTKAAVNLRREGVTIFT 584 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNNMTS 176 G+ ++ T L ++ E+ L + ++ L + + Sbjct: 585 LGIEGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHTVSVFSERTET 644 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 K + + + +V A + V Sbjct: 645 LKSGCVDTEEADIYLLIDGSGSTQ------ATDFHEMKTFLSEVVGMFNIAPHK-----V 693 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESS 295 R+G + Y +N ++ + + TNT A++ L K+ Sbjct: 694 RVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQR 753 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N + ++ +T+G + + L+ +R +++Y++ + Q Sbjct: 754 GNK-----VPCHLVVLTNGM--------SKDSILEPANRLREEHIRVYAIGI--KEANQT 798 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 LR+ + + V++ L + +++ +I + Sbjct: 799 QLREIAGEEKRVYYVHNFDALKDIRNQVVQEICTE 833 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 50/355 (14%), Positives = 106/355 (29%), Gaps = 36/355 (10%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + I++ ++ ++A Sbjct: 1071 ADVGKNQVRFGALKYADDPEVLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHM 1130 Query: 117 NL------FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L V D Sbjct: 1131 FTEARGSRLNKGVPQVLIVITDGESHDADKLNATAKALRDKGIL-VLAVGIDGANPMELL 1189 Query: 171 NNNMTSNKYLLPPPPKKSF--WSKNTTKSKYAPAPAPANRKIDVL---IESAGNLVNSIQ 225 +S+KY +S T + K+D++ S N + Sbjct: 1190 AMAGSSDKYFFVETFGGLKGIFSDVTASVCNSSKVDCEIDKVDLVFLMDGSTSIQPNDFK 1249 Query: 226 KA---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENT 275 K + VRIG ++ E+ ++ + NT Sbjct: 1250 KMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNT 1309 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A+ RE+ + + +T + ++ +TDG++ Q E + Sbjct: 1310 HIGAAL----REVEHYFRPDMGSRINTGTPQVLLVLTDGQSQD--------EVAQAAEAL 1357 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 R+ G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 1358 RHRGIDIYSVGIG--DVDDQQLIQITGTAEKKLTVHNFDELKKVNKRIVRNICTT 1410 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + K Sbjct: 889 DRVQIGVVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTK 948 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE + +R G+ IYSV + Sbjct: 949 GARP------NIRKFLILITDGEAQD--------IVKEPAVVLRQEGVIIYSVGVFGSNV 994 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 995 ---TQLEEISGRPEMVFYVENFDILQRIEDDLVFGICSPR 1031 >gi|315650876|ref|ZP_07903919.1| von Willebrand factor type A domain protein [Eubacterium saburreum DSM 3986] gi|315486855|gb|EFU77194.1| von Willebrand factor type A domain protein [Eubacterium saburreum DSM 3986] Length = 526 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 34/348 (9%), Positives = 92/348 (26%), Gaps = 31/348 (8%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A L+GC+S + + + Y E I + +++ Sbjct: 23 ASLAGCSSAGRTYEATKSAFDSNDAGKMVEYYS----PNVGYNTEEYNIINESGFQSVST 78 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + + A Y + L P+ + + D Sbjct: 79 SPLSTFAADVD-TASYANIRRFITDGELPPADAVRIEEMLNYFYYDYPQPKD-------D 130 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 S+ + + L + N A K+ ++ Sbjct: 131 EPFSVTTEISSCPWNPDTKLMQIGLQAKNTDTTTKPSNLVFLIDVSASMDEPDKLPLVKN 190 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + L + +++ I + Y ++ + S + L +T Sbjct: 191 AFLLLCDELKENDT--------ISIVTYAGTDSVVLEGAKGSDKKSIMSAIEDLTAGGST 242 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + AY+ ++ N V+ TDG+ + + +++ + Sbjct: 243 AGSDGIKTAYKIAEKYFKTEGNNR--------VVLATDGDLNVG--ITSEGELIKLIKKE 292 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 + + + + + + ++ D+ G + ++ E + Sbjct: 293 KESNIFLSVLGFGTDNIKDNKMQSLADNGDGNYSYIDSRFEAKKVLSD 340 >gi|170743327|ref|YP_001771982.1| hypothetical protein M446_5224 [Methylobacterium sp. 4-46] gi|168197601|gb|ACA19548.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 478 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 54/455 (11%), Positives = 111/455 (24%), Gaps = 80/455 (17%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + + A+D + + A A++ + T K Sbjct: 28 LFALSLIPVLGLVGLAVDYGLAAADKTTLDHAA------DTAALAAVVTAKSYIAANQGQ 81 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + I L Q + + AQ+ + + + Y +N Sbjct: 82 ANLTANAIAAGLAQAANVFAVNAGSVPFAQVTLQPPQLVRSGQTLTATVSYGATIQNS-F 140 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L+ + T L T + + +++DVS SM + Sbjct: 141 GKLLGTPTTLLGNSVTASADL---PSYLDFYLLVDVSGSMGLPATPGGMTQLASVNKDMW 197 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 F + + A + D + + +L+ V I Sbjct: 198 SDYQQGCQFACHFPGFTGWGLAAGKIQLRSDAVNAAVCSLIQRA-STPAVPNQYRVGIYP 256 Query: 241 IAYN-----------------IGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAM 281 + +N + + + Sbjct: 257 FINQMATLVGITGSVASLNAAAQCALSWPLAFTNLLDTGTTQLFAYGDPTTGTASGGTHF 316 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ----------- 330 +L ++ + ST K FV ITDG +G T Sbjct: 317 EVVMPQLQAAIKAFGDGSSSTSPKPFVFLITDGMQNGQHYGAPANGTYAYPGNPSSFWGY 376 Query: 331 ----------------ICEYMRNAGMKIYSVAV----------------------SAPPE 352 +C +++AG I +++ P Sbjct: 377 ADAWWDGSQPSQIDPTVCAGLKSAGATISILSIPYNLITFVNNGGGVAWENNRVSGFSPT 436 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 L+ C G F N ++ S + + D+ Sbjct: 437 LATPLKACAS-PGFFATANTPADITASLNAMFDQA 470 >gi|328676285|gb|AEB27155.1| BatA in aerotolerance operon [Francisella cf. novicida Fx1] Length = 333 Score = 85.3 bits (209), Expect = 1e-14, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 74/202 (36%), Gaps = 43/202 (21%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSV-----RIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 I + ++ G + + ++ R+G I + TPL+ ++ VK Sbjct: 106 IQDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQ--TPLTFDIATVKK 163 Query: 265 RLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 L+ + P T A+ A ++L S K +I +TDGEN+ Sbjct: 164 MLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS----------KALILLTDGENNSG-- 211 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVS-----------------APPEGQDLLRKCTD-S 363 TL LQ E + +KIY++ + + +L K + Sbjct: 212 ---TLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMT 268 Query: 364 SGQFFAVNDSRELLESFDKITD 385 G++F +S +L + ++ I Sbjct: 269 GGKYFRAQNSSDLKKVYESIDK 290 >gi|328675375|gb|AEB28050.1| BatA in aerotolerance operon [Francisella cf. novicida 3523] Length = 333 Score = 85.3 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 39/205 (19%), Positives = 77/205 (37%), Gaps = 47/205 (22%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + D+++ A +++ + R+G I + TPL+ ++ Sbjct: 112 SNGQMESRFDLVMRVANQFLDTRK---------GDRVGLILFGTRAYLQ--TPLTFDIAT 160 Query: 262 VKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 VK L+ + P T A+ A ++L S K +I +TDGEN+ Sbjct: 161 VKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS----------KALILLTDGENNS 210 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVS-----------------APPEGQDLLRKCT 361 TL LQ E + +KIY++ + + +L K Sbjct: 211 G-----TLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLINTSEDLDTTVLEKIA 265 Query: 362 D-SSGQFFAVNDSRELLESFDKITD 385 + + G++F +S +L + ++ I Sbjct: 266 EMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|307941490|ref|ZP_07656845.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] gi|307775098|gb|EFO34304.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] Length = 611 Score = 85.3 bits (209), Expect = 2e-14, Method: Composition-based stats. Identities = 35/319 (10%), Positives = 88/319 (27%), Gaps = 26/319 (8%) Query: 79 RENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGI 138 E + + D + + A Y L G Sbjct: 126 TERFPEAESNPLKQVKTDPVSTFSVDVD-TASYAFVRSELVNGGKPNPDAVRAEEMINYF 184 Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK 198 + +VS + + L P ++ T Sbjct: 185 QYDYKVPDSREAPFSTNVSVVETPWNSDTKLLHIGLKGYTVPLDDLPPQNLVFLIDTSG- 243 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 K+ +L ++ L++++++ I + Y ++ Sbjct: 244 ----SMSDENKLPLLQQAFRLLLSTLREDDT--------IAIVTYAGNAGVLLEPTALSD 291 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 +++ + L +T + + AYR ++ K +I TDG+ + Sbjct: 292 KSKIAEAIAALTSGGSTAGHAGLKEAYRLAETMQD--------DDTKSRIILATDGDFNV 343 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + + + R++G+ + + +L++ + L E Sbjct: 344 GLSSAD--DMKRFVKEKRDSGITLSVLGFGRGNYNDELMQALAQNGNGV--AAYIDTLSE 399 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + D+I IA + Sbjct: 400 ARKVLVDQIVSSISTIAQD 418 >gi|85374662|ref|YP_458724.1| von Willebrand factor type A domain-containing protein [Erythrobacter litoralis HTCC2594] gi|84787745|gb|ABC63927.1| von Willebrand factor type A domain protein [Erythrobacter litoralis HTCC2594] Length = 580 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 38/368 (10%), Positives = 101/368 (27%), Gaps = 30/368 (8%) Query: 31 SALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQ 90 +A A V+SG D + + + + + + + RE Sbjct: 60 AAQQAVVVSGSRIASEAAVAPDTSGQPAEAAGREYRYVMPVIVPQPEDRERYDGEEVSPV 119 Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 + + + Y L + P A S Sbjct: 120 KIAAVEPLSTFSVDVD-TGAYANARRFLSQGQMPPKAAVRTEEFINYFRYDYDRPQDRSQ 178 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 ++ + + P + K+ Sbjct: 179 PFTVNFDAARTPWNEDTRLIRIGLAGYDIERSERPPANLV-----FLMDVSGSMGRPDKL 233 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 ++ + L +Q + +N+ ++++ LN+L Sbjct: 234 PLVKTALAGLAGELQ----------PQDKVSIVVYAGAAGLVLEPTNDTRKIRAALNQLQ 283 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 +T + AY+ + VI TDG+ + + ++ L ++ Sbjct: 284 AGGSTAGGAGIQLAYQIAEDNFIEGGVNR--------VILATDGDFNVGVSSRDAL--IE 333 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQE 389 + E R++G+ + ++ + ++ + + G + ++ + E + + D++ Sbjct: 334 MIEKKRDSGITLTTLGFGTGNYNEAMMEQIANHGNGNYAYIDSALEAKKV---LGDEMSS 390 Query: 390 QSVRIAPN 397 IA + Sbjct: 391 TLFTIAKD 398 >gi|205353430|ref|YP_002227231.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] gi|205273211|emb|CAR38174.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 287/91] Length = 499 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 22/190 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ ++ + LVN ++ I + Y G + NN +K+ ++ Sbjct: 155 RLPLIQSALKLLVNDLRAQDN--------ITIVTYAGGTHVALASTAGNNTTAIKAAIDN 206 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L+ Y +T + AY + ++ TDG+ + + + Sbjct: 207 LDAYGSTGGEAGLRLAYEQAEKGFIKGGVNR--------ILLTTDGDFNLG--ITDPKDI 256 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 + + R G+ + ++ V + ++ + D +G + + L E+ + D++ Sbjct: 257 EALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIYS---LSEAQKVLKDEM 313 Query: 388 QEQSVRIAPN 397 + V +A + Sbjct: 314 HQTLVTVAKD 323 >gi|326628521|gb|EGE34864.1| lipoprotein [Salmonella enterica subsp. enterica serovar Gallinarum str. 9] Length = 596 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 70/190 (36%), Gaps = 22/190 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ ++ + LVN ++ I + Y G + NN +K+ ++ Sbjct: 252 RLPLIQSALKLLVNDLRAQDN--------ITIVTYAGGTHVALASTAGNNTTAIKAAIDN 303 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L+ Y +T + AY + ++ TDG+ + + + Sbjct: 304 LDAYGSTGGEAGLRLAYEQAEKGFIKGGVNR--------ILLTTDGDFNLG--ITDPKDI 353 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 + + R G+ + ++ V + ++ + D +G + + L E+ + D++ Sbjct: 354 EALVKKEREKGITLSTLGVGDDNFNEAMMVRIADVGNGNYSYIYS---LSEAQKVLKDEM 410 Query: 388 QEQSVRIAPN 397 + V +A + Sbjct: 411 HQTLVTVAKD 420 >gi|329963581|ref|ZP_08301060.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528570|gb|EGF55541.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 327 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 64/205 (31%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + + L + + T Sbjct: 108 NRLEAAKDVAAEFINGRPNDNIGITLFAGESFTQCPLTVDHAVLLNLLKDMKCGLIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ L E Sbjct: 168 AIGMGIANAVTRL----------KDSKAKSKVIILLTDGVNNKGDIS-----PLTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP---------------------EGQDLLRKCTDSSGQFFAVNDSR 374 ++ G++IY++ V + + L + + G +F + Sbjct: 213 KSFGIRIYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|332232509|ref|XP_003265447.1| PREDICTED: collagen alpha-6(VI) chain [Nomascus leucogenys] Length = 2264 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 42/335 (12%), Positives = 100/335 (29%), Gaps = 31/335 (9%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 IKK K+ R + QI + + + ++ +F Sbjct: 306 KAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRDSEDNVTKAAVNLRREGVTIFT 365 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNNMTS 176 G+ ++ T L ++ E+ L + ++ L + + Sbjct: 366 LGIEGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHTVSVFSERTET 425 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 K + + + +V A + V Sbjct: 426 LKSGCVDTEEADIYLLIDGSGSTQ------ATDFHEMKTFLSAVVGMFDIAPHK-----V 474 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESS 295 R+G + Y +N ++ + + TNT A++ L K+ Sbjct: 475 RVGAVQYADSWDLEFEINKYSNRQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQR 534 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N + ++ +T+G + + L+ +R +++Y++ V Q Sbjct: 535 GNK-----VPCHLVVLTNGM--------SKDSILEPANRLREEHIRVYAIGV--KEANQT 579 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 LR+ + + V+D L + +++ +I + Sbjct: 580 QLREIAGEEKRVYYVHDFDALKDIRNQVVQEICAE 614 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 48/355 (13%), Positives = 105/355 (29%), Gaps = 36/355 (10%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + I++ ++ ++A Sbjct: 840 ADVGKNQVRFGALKYADDPEVLFYLEDFGTKLEVISVLQNDQAMGGNTYTAEALGFSDHM 899 Query: 117 NL------FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L V Sbjct: 900 FTEARGSRLNKGVPQVLIVITDGESHDADKLNATAKALRDKGIL-VLAVGIAGANPAELL 958 Query: 171 NNNMTSNKYLLPPPPKKSF--WSKNTTKSKYAPAPAPANRKIDVL---IESAGNLVNSIQ 225 +S+KY +S T + K+D++ S N + Sbjct: 959 AMAGSSDKYFFVETFGGLKGIFSDVTASVCNSSKVDCEIDKVDLVFLMDGSTSIQPNDFK 1018 Query: 226 KA---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENT 275 K + VRIG ++ E+ ++ + NT Sbjct: 1019 KMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNT 1078 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A+ R++ + + +T + ++ +TDG++ Q E + Sbjct: 1079 HIGAAL----RKVEHYFRPDMGSRINTGTPQVLLVLTDGQSQD--------EVAQAAEAL 1126 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 R+ G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 1127 RHRGIDIYSVGIG--DVDDQQLIQITGTAEKKLTVHNFDELKKVNKRIVRNICAT 1179 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + K Sbjct: 658 DQVQIGVVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTK 717 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE + +R G+ IYSV + Sbjct: 718 GARP------NIRKFLILITDGEAQD--------IVKEPAVVLRQEGVIIYSVGVFGSNV 763 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 764 ---TQLEEISGRPEMVFYVENFDILQRIEDDLVFGICSPR 800 >gi|154089852|emb|CAO81741.1| collagen type VI alpha 6 [Homo sapiens] Length = 840 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 41/335 (12%), Positives = 100/335 (29%), Gaps = 31/335 (9%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 IKK K+ R + QI + + + ++ +F Sbjct: 304 KAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRDSEDNVTKAAVNLRREGVTIFT 363 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNNMTS 176 G+ ++ T L ++ E+ L + ++ L + + Sbjct: 364 LGIEGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHTVSVFSERTET 423 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 K + + + +V A + V Sbjct: 424 LKSGCVDTEEADIYLLIDGSGSTQ------ATDFHEMKTFLSEVVGMFNIAPHK-----V 472 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESS 295 R+G + Y +N ++ + + TNT A++ L K+ Sbjct: 473 RVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQR 532 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N + ++ +T+G + + L+ +R +++Y++ + Q Sbjct: 533 GNK-----VPCHLVVLTNGM--------SKDSILEPANRLREEHIRVYAIGI--KEANQT 577 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 LR+ + + V+D L + +++ +I + Sbjct: 578 QLREIAGEEKRVYYVHDFDALKDIRNQVVQEICTE 612 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + K Sbjct: 656 DRVQIGVVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTK 715 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE + +R G+ IYSV + Sbjct: 716 GARP------NIRKFLILITDGEAQD--------IVKEPAVVLRQEGVIIYSVGVFGSNV 761 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 762 ---TQLEEISGRPEMVFYVENFDILQRIEDDLVFGICSPR 798 >gi|293361343|ref|XP_236593.5| PREDICTED: collagen type VI alpha 6 [Rattus norvegicus] Length = 2264 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 37/335 (11%), Positives = 100/335 (29%), Gaps = 31/335 (9%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 ++K K+ + + QI + + ++ +F Sbjct: 304 QAYTGAALRKTRKEVFSAQRGSRKNQGVPQIAVLVTHRASDDNVTKAAVNLRREGVTVFT 363 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNNMTS 176 G+ + L ++ E+ + L+ + ++ L + + Sbjct: 364 MGVEGANPEQLEKIASYPAEQFTSKLSNFSELATHNQTFLKKLRNQITHTVSVFSERTET 423 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 K + + P + +V+ A + V Sbjct: 424 LKSACVDTEEADIYLLIDGSGNTQP------TDFHEMKIFLSEVVDMFNIAPHK-----V 472 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESS 295 R+G + Y N ++ ++ + TNT A++ L K+ Sbjct: 473 RVGAVQYADTWDLEFEISKYTNKPDLGKAIDNIRQMGGNTNTGAALNFTLTLLQRAKKQR 532 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N + ++ +T+G + + L +R ++++++ V Q Sbjct: 533 GNK-----VPCHLVVLTNGMSQDS--------VLGPAHKLREENIRVHAIGV--KEANQT 577 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 LR+ + + V++ L + +++ +I + Sbjct: 578 QLREIAGDEKRVYYVHEFDALRDIRNQVVQEICAE 612 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 42/354 (11%), Positives = 102/354 (28%), Gaps = 34/354 (9%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + I++ ++ ++A Sbjct: 838 ADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDQPMGGNTYTAEALAFSDHM 897 Query: 117 NLFL------KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L ++ + + Sbjct: 898 FTEARGSRLHKGVPQVLIVITDGESHDAEKLNATAKALRDKGILVLAVGIAGANTWELLA 957 Query: 171 NNNMTSNKYLLPPPPK----KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 + Y + S S + S + ++ S + QK Sbjct: 958 MAGSSDKYYFVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQK 1017 Query: 227 A---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTN 276 + VRIG ++ E+ +++ + T+ Sbjct: 1018 MKEFLASVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQIFGYTH 1077 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ R++ + + + + ++ +TDG + Q E +R Sbjct: 1078 IGDAL----RKVKHYFRPDTGSRINAGTPQVLLVLTDGRSQD--------EVAQAAEELR 1125 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 1126 HKGVDIYSVGIG--DVDDQQLIQITGTAEKKLTVHNFDELKKVKKRIVRNICTS 1177 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + N++ + ++++ T T A+ + EK Sbjct: 656 DRVQIGVVQFSHENREEFQLNTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFSPEK 715 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE +R G+ IYSV + Sbjct: 716 GARP------NVRKFLILITDGEAQD--------IVKDPAVALRKDGVIIYSVGVFGSNV 761 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 762 ---TQLEEISGKPEMVFYVENFDILQHIEDDLVLGICSPR 798 >gi|293349448|ref|XP_002727144.1| PREDICTED: collagen type VI alpha 6-like [Rattus norvegicus] Length = 2264 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 37/335 (11%), Positives = 100/335 (29%), Gaps = 31/335 (9%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 ++K K+ + + QI + + ++ +F Sbjct: 304 QAYTGAALRKTRKEVFSAQRGSRKNQGVPQIAVLVTHRASDDNVTKAAVNLRREGVTVFT 363 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNNMTS 176 G+ + L ++ E+ + L+ + ++ L + + Sbjct: 364 MGVEGANPEQLEKIASYPAEQFTSKLSNFSELATHNQTFLKKLRNQITHTVSVFSERTET 423 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 K + + P + +V+ A + V Sbjct: 424 LKSACVDTEEADIYLLIDGSGNTQP------TDFHEMKIFLSEVVDMFNIAPHK-----V 472 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESS 295 R+G + Y N ++ ++ + TNT A++ L K+ Sbjct: 473 RVGAVQYADTWDLEFEISKYTNKPDLGKAIDNIRQMGGNTNTGAALNFTLTLLQRAKKQR 532 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N + ++ +T+G + + L +R ++++++ V Q Sbjct: 533 GNK-----VPCHLVVLTNGMSQDS--------VLGPAHKLREENIRVHAIGV--KEANQT 577 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 LR+ + + V++ L + +++ +I + Sbjct: 578 QLREIAGDEKRVYYVHEFDALRDIRNQVVQEICAE 612 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 42/354 (11%), Positives = 102/354 (28%), Gaps = 34/354 (9%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + I++ ++ ++A Sbjct: 838 ADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDQPMGGNTYTAEALAFSDHM 897 Query: 117 NLFL------KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L ++ + + Sbjct: 898 FTEARGSRLHKGVPQVLIVITDGESHDAEKLNATAKALRDKGILVLAVGIAGANTWELLA 957 Query: 171 NNNMTSNKYLLPPPPK----KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 + Y + S S + S + ++ S + QK Sbjct: 958 MAGSSDKYYFVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQK 1017 Query: 227 A---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTN 276 + VRIG ++ E+ +++ + T+ Sbjct: 1018 MKEFLASVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQIFGYTH 1077 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ R++ + + + + ++ +TDG + Q E +R Sbjct: 1078 IGDAL----RKVKHYFRPDTGSRINAGTPQVLLVLTDGRSQD--------EVAQAAEELR 1125 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 1126 HKGVDIYSVGIG--DVDDQQLIQITGTAEKKLTVHNFDELKKVKKRIVRNICTS 1177 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + N++ + ++++ T T A+ + EK Sbjct: 656 DRVQIGVVQFSHENREEFQLNTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFSPEK 715 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE +R G+ IYSV + Sbjct: 716 GARP------NVRKFLILITDGEAQD--------IVKDPAVALRKDGVIIYSVGVFGSNV 761 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 762 ---TQLEEISGKPEMVFYVENFDILQHIEDDLVLGICSPR 798 >gi|149018699|gb|EDL77340.1| rCG25821 [Rattus norvegicus] Length = 1513 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 37/335 (11%), Positives = 100/335 (29%), Gaps = 31/335 (9%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 ++K K+ + + QI + + ++ +F Sbjct: 304 QAYTGAALRKTRKEVFSAQRGSRKNQGVPQIAVLVTHRASDDNVTKAAVNLRREGVTVFT 363 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNNMTS 176 G+ + L ++ E+ + L+ + ++ L + + Sbjct: 364 MGVEGANPEQLEKIASYPAEQFTSKLSNFSELATHNQTFLKKLRNQITHTVSVFSERTET 423 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 K + + P + +V+ A + V Sbjct: 424 LKSACVDTEEADIYLLIDGSGNTQP------TDFHEMKIFLSEVVDMFNIAPHK-----V 472 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESS 295 R+G + Y N ++ ++ + TNT A++ L K+ Sbjct: 473 RVGAVQYADTWDLEFEISKYTNKPDLGKAIDNIRQMGGNTNTGAALNFTLTLLQRAKKQR 532 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N + ++ +T+G + + L +R ++++++ V Q Sbjct: 533 GNK-----VPCHLVVLTNGMSQDS--------VLGPAHKLREENIRVHAIGV--KEANQT 577 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 LR+ + + V++ L + +++ +I + Sbjct: 578 QLREIAGDEKRVYYVHEFDALRDIRNQVVQEICAE 612 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 42/354 (11%), Positives = 102/354 (28%), Gaps = 34/354 (9%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + I++ ++ ++A Sbjct: 838 ADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDQPMGGNTYTAEALAFSDHM 897 Query: 117 NLFL------KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L ++ + + Sbjct: 898 FTEARGSRLHKGVPQVLIVITDGESHDAEKLNATAKALRDKGILVLAVGIAGANTWELLA 957 Query: 171 NNNMTSNKYLLPPPPK----KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 + Y + S S + S + ++ S + QK Sbjct: 958 MAGSSDKYYFVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQK 1017 Query: 227 A---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTN 276 + VRIG ++ E+ +++ + T+ Sbjct: 1018 MKEFLASVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEKEISTQIEAIQQIFGYTH 1077 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ R++ + + + + ++ +TDG + Q E +R Sbjct: 1078 IGDAL----RKVKHYFRPDTGSRINAGTPQVLLVLTDGRSQD--------EVAQAAEELR 1125 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 1126 HKGVDIYSVGIG--DVDDQQLIQITGTAEKKLTVHNFDELKKVKKRIVRNICTS 1177 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + N++ + ++++ T T A+ + EK Sbjct: 656 DRVQIGVVQFSHENREEFQLNTFMSQNDIANAIDQMAHIGETTLTGSALTFVSQYFSPEK 715 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE +R G+ IYSV + Sbjct: 716 GARP------NVRKFLILITDGEAQD--------IVKDPAVALRKDGVIIYSVGVFGSNV 761 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 762 ---TQLEEISGKPEMVFYVENFDILQHIEDDLVLGICSPR 798 >gi|146343040|ref|YP_001208088.1| hypothetical protein BRADO6230 [Bradyrhizobium sp. ORS278] gi|146195846|emb|CAL79873.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 519 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 48/491 (9%), Positives = 122/491 (24%), Gaps = 107/491 (21%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSA----------LDAAVLSGCASIVSDRTIK 51 A+ + I D + M ++ ++QSA +++A + ++ SD ++ Sbjct: 29 FALALLPILTAIGCGTDYSMAMRLKVKLQSAADAASIASISVNSAGYAAAMAMTSDGSVT 88 Query: 52 DPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL---------- 101 + D + +Q+ T Sbjct: 89 AGVNEADNIFKGNASTFGGYTLTSETSTVTKTRSTLSSQVQFTAAVPTTFLTVIGYQSIT 148 Query: 102 -----------------QYIAESKAQYEIPTENLFLKGLIPSA----------------- 127 + +P+ + + + + Sbjct: 149 VSGSSSSSVTLPLYLDFYLTLDVSGSMGLPSTSAEAQRMQAISPDNYRQYPTGCTLACHF 208 Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 S + + N + VS+S L + +N + Sbjct: 209 SPQNSACTDSGTQGYPTNNYCLGYAISRVSQSGYKSLLTTNKNNPKGVQLPSSIVSGLPN 268 Query: 188 SFWSKNTTKSKYAPAPAP--ANRKIDVLIESAGNLVNSIQKAIQEKKNLS---VRIGTIA 242 S ++K T + ++D + + L + + Sbjct: 269 SLYNKLPTVANCPTDGTDDCIQLRLDAVGYAVNQLFTTANTTKKVANQFRIGLYPFIRYL 328 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----------YENTNTYPAMHHAYRELYNE 291 Y+ + + +++ + + L T+ A+ + + Sbjct: 329 YSYYPLTTNISGSTSDSSTINYAAANLATLLDTNTNASLGSGGTHIDTALSSVNSLITSV 388 Query: 292 KESSHNTIGSTRLKKFVIFITDGENS-----------GASAYQNTLNTLQICEYMRNAGM 340 + S T +V +TDG S + T+N C ++N G+ Sbjct: 389 GDGSAT----TNTLPYVFLVTDGAQDPQVKGVPNGSWSGSNHATTINPTTSCTPLKNRGI 444 Query: 341 KIYSVA------------VSAPPEGQDL---------LRKCTDSSGQFFAVNDSRELLES 379 I + + + L+ C G F+ N ++ + Sbjct: 445 IISVLYIPYQTINPVNASFAGDEDDYANNNIPNIPPSLQACAS-PGFFYTANTPADITSA 503 Query: 380 FDKITDKIQEQ 390 + + + + Sbjct: 504 LNAMFNHAVSE 514 >gi|224047663|ref|XP_002193801.1| PREDICTED: vitrin [Taeniopygia guttata] Length = 746 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 43/352 (12%), Positives = 93/352 (26%), Gaps = 31/352 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + V+ + +++ K Q + ++ + G+ + + P Sbjct: 416 KTYVNSKDLRNAIEKIQQKGGLSNVGKALSFVNKNFFLDANGNRGGAPNVVVVLVDGWPT 475 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + E+ N+F + +A + S ++V Sbjct: 476 DRVEEASRLARESGINIFFVTVAAAAQSEKQNVIEPNF-VDKAVCRTSGFYSINVPSWFS 534 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 K + + + VL A Sbjct: 535 LHK---VVQPLVKRVCDTDRLACSKTCLNAADIGFVIDGSSSVGTGNFRTVLQFVAN--- 588 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPA 280 E + RIG + Y + +V + + +++ T+T A Sbjct: 589 ---ISKEFEISDTDTRIGAVQYTYEQRLEFSFDKYSTKQDVLNAIKRISYWSGGTSTGAA 645 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + +A +L++ +K +I ITDG + + G+ Sbjct: 646 ISYASEQLFS--------KSKPNKRKIMILITDGRSYDDVSVP--------AMAAHQNGV 689 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQ 390 YSV V D L F V++ L +++ I + Sbjct: 690 IAYSVGV--AWAAPDELEAIASDPAKEHSFFVDEFDNLYRYVNQLIQNICTE 739 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 15/120 (12%), Positives = 40/120 (33%), Gaps = 14/120 (11%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y N ++++ + K+ +N A+ + + + + Sbjct: 400 MGIVQYGDDPSTEFNLKTYVNSKDLRNAIEKIQQKGGLSNVGKALSFVNKNFFLDANGNR 459 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ V V+A + + Sbjct: 460 GGAPN-----VVVVLVDGWPTD--------RVEEASRLARESGINIFFVTVAAAAQSEKQ 506 >gi|254414936|ref|ZP_05028700.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196178425|gb|EDX73425.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 576 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 46/365 (12%), Positives = 100/365 (27%), Gaps = 32/365 (8%) Query: 34 DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI 93 +AA L SD T Q RE I + + Sbjct: 65 EAASL---PETQSDAENAPATKFAPQAQPSPGSNSLNRQVPEESNRETYSTIPENPFLKT 121 Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 + + + ++ A Y + L P + + A + Sbjct: 122 SSNPLSTFSIDVDA-ASYSNVRRFINENRLPPPNAVRVEELINYFTYDYPQPQADKPFSI 180 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 K + PP + + + S K+ +L Sbjct: 181 TTEVAEAPWNSKHKLVHIGLQGKSISTENLPPSNLVFLLDVSGSMSDA------NKLPLL 234 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 E+ LV+ ++ ++ + Y N + + + ++KL Sbjct: 235 KEAFRLLVDQLRDED--------KVSIVVYAGAAGTVLPPTPGNQKDTILAAIDKLEAGG 286 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 +T + AY+ + S N VI TDG+ + + + + E Sbjct: 287 STAGGQGIKLAYKLAQDNFIESGNNR--------VILATDGDFNVG--ISSDEQLVSLIE 336 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSV 392 R + + + + K + G + +++ E + + ++I + Sbjct: 337 EKREQDIFLTVLGFGTGNLQDAKMEKIANKGNGNYAYIDNILEANKV---LVNEIGGTLL 393 Query: 393 RIAPN 397 IA + Sbjct: 394 TIAKD 398 >gi|194324498|ref|ZP_03058270.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] gi|194321333|gb|EDX18819.1| von Willebrand factor type A domain membrane protein [Francisella tularensis subsp. novicida FTE] Length = 339 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 74/202 (36%), Gaps = 43/202 (21%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSV-----RIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 I + ++ G + + ++ R+G I + TPL+ ++ VK Sbjct: 112 IQDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQ--TPLTFDIATVKK 169 Query: 265 RLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 L+ + P T A+ A ++L S K +I +TDGEN+ Sbjct: 170 MLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS----------KALILLTDGENNSG-- 217 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVS-----------------APPEGQDLLRKCTD-S 363 TL LQ E + +KIY++ + + +L K + Sbjct: 218 ---TLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMT 274 Query: 364 SGQFFAVNDSRELLESFDKITD 385 G++F +S +L + ++ I Sbjct: 275 GGKYFRAQNSSDLKKVYESIDK 296 >gi|163749961|ref|ZP_02157205.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161330235|gb|EDQ01216.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 648 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 39/333 (11%), Positives = 87/333 (26%), Gaps = 33/333 (9%) Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIP 125 + + ++ R A I +I ++ + Y + L Sbjct: 128 SAFSQASNRNNFARRTANGIMVVGEIPVSTFSIDTD------TGSYTTLRRWINQGRLPE 181 Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP 185 + + S + ++ + + Sbjct: 182 KGTVRVEEMINYFNYQYSTPSTVEQPFSVNTELAPSPYNE----HKMLLRIGLKGYEVDK 237 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + S + K+ +L S L + + + + Y Sbjct: 238 SQLGASNLVFL-LDVSGSMNSRDKLPLLKTSLKMLSQQLSEQD--------HVSIVVYAG 288 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 N+ + LN L +TN + AYR Sbjct: 289 ASGVVLDGVKGNDTQAINQALNSLKAGGSTNGGAGIQQAYRLAQKHFIQGGVNR------ 342 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS- 364 VI TDG+ + + L L + R+ G+ + ++ L+ + D Sbjct: 343 --VILATDGDFNVGTTDHQALMDLIAAK--RDQGIALTTLGFGQGNYNDHLMEQLADKGN 398 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 G + ++ L E+ + D++ + IA + Sbjct: 399 GHYAYIDT---LNEARKVLVDELSSTLLTIAKD 428 >gi|118496821|ref|YP_897871.1| von Willebrand factor type A domain-containing protein [Francisella tularensis subsp. novicida U112] gi|118422727|gb|ABK89117.1| von Willebrand factor type A domain protein [Francisella novicida U112] Length = 333 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 39/202 (19%), Positives = 74/202 (36%), Gaps = 43/202 (21%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSV-----RIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 I + ++ G + + ++ R+G I + TPL+ ++ VK Sbjct: 106 IQDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQ--TPLTFDIATVKK 163 Query: 265 RLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 L+ + P T A+ A ++L S K +I +TDGEN+ Sbjct: 164 MLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS----------KALILLTDGENNSG-- 211 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVS-----------------APPEGQDLLRKCTD-S 363 TL LQ E + +KIY++ + + +L K + Sbjct: 212 ---TLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMT 268 Query: 364 SGQFFAVNDSRELLESFDKITD 385 G++F +S +L + ++ I Sbjct: 269 GGKYFRAQNSSDLKKVYESIDK 290 >gi|325954650|ref|YP_004238310.1| von Willebrand factor type A [Weeksella virosa DSM 16922] gi|323437268|gb|ADX67732.1| von Willebrand factor type A [Weeksella virosa DSM 16922] Length = 338 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 52/159 (32%), Gaps = 37/159 (23%) Query: 254 PLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + +N L + T + A + S K +I Sbjct: 146 PLTTDREVLIREINALESGELEDGTAIGIGLATAINHI----------KDSKAKSKVIIL 195 Query: 311 ITDGENS-GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP------------------- 350 +TDG S + ++ E + G+K+Y++ + Sbjct: 196 MTDGVESINPTNDLMYISPQTAAEMATSRGIKVYTIGIGTRGLAPFPTAYDMYGNYIFDM 255 Query: 351 ---PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + LL+ D + G +F D++ L + + +I Sbjct: 256 MPVDIDEKLLQNIADLTGGLYFRATDNQSLQKIYQEIDR 294 >gi|331697176|ref|YP_004333415.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326951865|gb|AEA25562.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 327 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 18/159 (11%), Positives = 58/159 (36%), Gaps = 16/159 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + VK ++ L E+T T A+ A + + + + + ++ ++DG+ Sbjct: 144 TTDRTAVKQAVDGLKLSESTATGEAIFAALQSIDSFSRTVAASGTEGPPPARIVLMSDGK 203 Query: 316 NSGASAY--QNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKC 360 + + + + + + +++ +++ Sbjct: 204 QTVPGPDGENDPRGSFTAAKQAAAEKIPVSTISFGTDYGTIDIEGGRTRVAVDDASMQQI 263 Query: 361 TD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 S GQFF +L + + ++ ++I ++ ++ +R Sbjct: 264 ASLSGGQFFTAASESQLRQVYSELGEQIGYETRQVDTSR 302 >gi|270158235|ref|ZP_06186892.1| von Willebrand factor type A domain protein [Legionella longbeachae D-4968] gi|289163509|ref|YP_003453647.1| hypothetical protein LLO_0165 [Legionella longbeachae NSW150] gi|269990260|gb|EEZ96514.1| von Willebrand factor type A domain protein [Legionella longbeachae D-4968] gi|288856682|emb|CBJ10493.1| putative unknown protein [Legionella longbeachae NSW150] Length = 342 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 36/215 (16%), Positives = 79/215 (36%), Gaps = 49/215 (22%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 +++++ +A V +IG I + TPL+ + + + RL Sbjct: 114 TSRLNIVKSAAEQFVRE---------RSGDKIGLILFGTRAYLQ--TPLTYDRHSILLRL 162 Query: 267 NKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + T+ A+ A + L + + + +I +TDG N+ Sbjct: 163 EDATAGLAGKTTSIGDAVGLAVKRLDSAPKKG----------RVIILLTDGANNSGV--- 209 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPE-----------------GQDLLRKCTD-SSG 365 L L+ E + G+KIY++ + + + ++ L+K +D + G Sbjct: 210 --LAPLKAAELAKEEGIKIYTIGLGSEGDSRALVGDFLMQSPAADLDEETLKKMSDMTGG 267 Query: 366 QFFAVNDSRELLESFDKITD--KIQEQSVRIAPNR 398 ++F D+ L + I I ++ + P + Sbjct: 268 RYFRATDTESLHLIYKTINQLETINQEQATVRPQK 302 >gi|254373668|ref|ZP_04989152.1| conserved hypothetical protein [Francisella novicida GA99-3548] gi|151571390|gb|EDN37044.1| conserved hypothetical protein [Francisella novicida GA99-3548] Length = 339 Score = 85.0 bits (208), Expect = 2e-14, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 73/202 (36%), Gaps = 43/202 (21%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSV-----RIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 I + ++ G + + ++ R+G I + TPL+ ++ VK Sbjct: 112 IQDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQ--TPLTFDIATVKK 169 Query: 265 RLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 L+ + P T A+ A ++L K +I +TDGEN+ Sbjct: 170 MLDDASIALPGPQTAIGDAIGLAVKKL----------KKFPGDSKALILLTDGENNSG-- 217 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVS-----------------APPEGQDLLRKCTD-S 363 TL LQ E + +KIY++ + + +L K + Sbjct: 218 ---TLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMT 274 Query: 364 SGQFFAVNDSRELLESFDKITD 385 G++F +S +L + ++ I Sbjct: 275 GGKYFRAQNSSDLKKVYESIDK 296 >gi|303240108|ref|ZP_07326629.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] gi|302592377|gb|EFL62104.1| conserved hypothetical protein [Acetivibrio cellulolyticus CD2] Length = 323 Score = 84.6 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 41/193 (21%), Positives = 70/193 (36%), Gaps = 41/193 (21%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----N 274 N + +K IQ+ + PL+ + N V+ L ++ Sbjct: 102 NRLEVARKTIQDFVDQRPSDRIALIAFAGTAYTRVPLTLDHNVVRESLQDISFKSVNEEG 161 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T A+ L ST K +I +TDG+N+ S NT +TL Sbjct: 162 TAIGMAISVGLNRL----------KKSTSPSKIMILLTDGDNNAGSIDPNTASTL----- 206 Query: 335 MRNAGMKIYSVAVSAPPE---------------------GQDLLRKCT-DSSGQFFAVND 372 +++G+KIY++ V + +DLL+K ++GQ++ D Sbjct: 207 AKDSGIKIYTIGVGSDKTIIPGTNEFGQTVYQEYESGLLNEDLLKKIAETTNGQYYRAKD 266 Query: 373 SRELLESFDKITD 385 S L + F I Sbjct: 267 SNALSQVFANINK 279 >gi|256821501|ref|YP_003145464.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256795040|gb|ACV25696.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 582 Score = 84.6 bits (207), Expect = 2e-14, Method: Composition-based stats. Identities = 37/291 (12%), Positives = 81/291 (27%), Gaps = 27/291 (9%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 Y L L P L + ++ Sbjct: 130 TGSYSNVRRMLNDGYLPPEDAVRLEEFVNYFNYDYQTPDSTEQPFAVNTHVFSAPWNSNA 189 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 + M P ++ S + K+ ++ +S L Sbjct: 190 YL----MEIGIKGFEPEQQELPPSNLVYL-IDVSGSMNSEDKLGLVKKSLKLLAQE---- 240 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + RI + Y N+ ++ L++L+ +TN + AY+ Sbjct: 241 ----SSDQDRISIVVYAGASGVVLEPTKGNDRMAIEQALDRLSAGGSTNGGAGIELAYKL 296 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 VI TDG+ + + N + + E R +G+ ++ Sbjct: 297 AEQAFIKDGINR--------VILATDGDFNVGTI--NREQLIDLVERKRESGISFTTLGF 346 Query: 348 SAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + L+ + D G + ++ L E+ + ++ + IA + Sbjct: 347 GSGNYNEHLMEQLADKGNGNYGYIDS---LQEARKLLVEQRAGTLMTIAKD 394 >gi|299135165|ref|ZP_07028356.1| conserved hypothetical protein [Afipia sp. 1NLS2] gi|298590142|gb|EFI50346.1| conserved hypothetical protein [Afipia sp. 1NLS2] Length = 601 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 32/242 (13%), Positives = 85/242 (35%), Gaps = 20/242 (8%) Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 N + P + S T + P+ + +N + + Sbjct: 361 DWRNVLADPSATAPDPRVTPAPDSTLTQYAATNPSASVSN---GGWTGCVNDRDQNFDTT 417 Query: 228 IQ--EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 + ++ + TP+SN + +KS++N + P NTN + + Sbjct: 418 NDAMTGSGIPSKLPYAEQWADCLPATITPMSNQWSTLKSQINAMTPSGNTNQAVGLFWGW 477 Query: 286 RELYNEKES--SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ----ICEYMRN-- 337 + L + + + K +++ ++DG N+ Q + +C+ +++ Sbjct: 478 QTLNTTNDPFKAPAKDPNWVYKDYIVLLSDGLNTQNRWTQTVSDIDARQELLCKNIKDPA 537 Query: 338 ----AGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFA-VNDSRELLESFDKITDKIQEQ 390 + ++S+ V+ + +L+ C +F + S + ++F+ + I + Sbjct: 538 QNGGNQITVFSIQVNISSKDPTSKVLQDCATPGAGYFQMITQSSQTADAFNNVLATIAKL 597 Query: 391 SV 392 + Sbjct: 598 RI 599 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 28/209 (13%), Positives = 55/209 (26%), Gaps = 23/209 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI+ + A+D + R +QSALD+A L T Q Sbjct: 27 IFAIVSIPLVALVGAAVDYTRAVSDRTALQSALDSAALMISK--------DAATMSASQI 78 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +T ++ + + T N+ A T + Sbjct: 79 TTRARQYVDSLYTATDAPIQ---------NFTATYTPNSGSGASILLSAN---GTMPTYF 126 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ S L + ++ + + + +VLD + SM + L Sbjct: 127 MRVLGSNFNTLPVATSS--TTKWGSTRMRVALVLDNTGSMAQNGKMAALQSAATDMITKL 184 Query: 181 -LPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + +K Sbjct: 185 SAFNTTTGDVYISIVPFAKDVNVSTSNVS 213 >gi|85374104|ref|YP_458166.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] gi|84787187|gb|ABC63369.1| hypothetical protein ELI_06385 [Erythrobacter litoralis HTCC2594] Length = 623 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 31/264 (11%), Positives = 70/264 (26%), Gaps = 30/264 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A + +D++ +++Q A D+ VL+ ++ ++ I T Sbjct: 15 LIAAGLLPLLAMAGSGVDMSRAYLAESRLQQACDSGVLAARKALGTE--IATLTDIPTDA 72 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 T ++ + + G+Y +N Y A ++PT + Sbjct: 73 GTRGQEFFNSNFQDGNYGTQNRTFNMVLEN-----------DYSVSGTATVDVPTS---V 118 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ------KHNDNNNM 174 + + + I S + + MVLDV+ SM+ Sbjct: 119 MTVFGFTKIPVKVECQARISFS----DVDVMMVLDVTGSMKHTNSGDTLSKIDSLKATVR 174 Query: 175 TSNKYLLPPPPKKSFWSK-NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + + + + V + + S ++ Sbjct: 175 NFYDQMEGAKSAGTRIRYGFVPYASNVNVGHLLKDEWVVNSWAYQSRAISGTTTVEAGTK 234 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSN 257 AY G + Sbjct: 235 TRENW---AYKSGSRSAWIEESTY 255 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 38/278 (13%), Positives = 90/278 (32%), Gaps = 34/278 (12%) Query: 144 ENLAISICMVLDVSRSMEDLYL--QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 NL + + +DVS + ++ + L + + Sbjct: 349 ANLYNYLPVAMDVSNWRAEALGCMEERKSTVLTDFSSVDLSANLDLDINTVPVASDQDTQ 408 Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT-PLSNNLN 260 I V + A + + + + K ++ G ++ Q +++ Sbjct: 409 WRPRYPDMIYVRSKEADDKGSFSPAPVYDTKKEFIQTGNWWFSGCPAPAQKLKAMTSG-- 466 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNE--KESSHNTIGSTRLKKFVIFITDGENS- 317 E+ S L+ L P+ T M R L + +++ + +IF+TDG+ Sbjct: 467 ELDSYLDSLTPHGATYHDGGMIWGGRLLSQYGLFAAENSSKPGRTTSRHLIFLTDGQTEP 526 Query: 318 ----GASAYQNTLNTLQ-------------------ICEYMRNAGMKIYSVAVSAPPEGQ 354 S + ++ + C ++ G ++ VA + Sbjct: 527 YDLAYGSYGIDPIDERRWTQTSSLTLAQTVEERFLFACNEVKKLGATVWVVAFGTAANDK 586 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 ++ C S G++F ++ +L ++F I + + Sbjct: 587 --MKTCAGS-GRYFEAANASQLNDAFSTIAKSTGDLRI 621 >gi|262202333|ref|YP_003273541.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262085680|gb|ACY21648.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 325 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 28/222 (12%), Positives = 70/222 (31%), Gaps = 26/222 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 K+ S+ A A +I +A + + + I + Sbjct: 86 NKATVILVMDVSRSMNATDVAPSRIRAAQSAAKKFADDLTEGIN----------LGLISF 135 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + + K ++KL + T T + A ++ ++ Sbjct: 136 AGTPSTLVSPTPDHTATKKAVDKLVLADKTATGEGIFAALDQIRT--LNAVLGGPEAAPP 193 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PE 352 ++ ++DG+ + + + G+ + +++ P Sbjct: 194 AHIVLLSDGKQTVPDEPTDPRGAFTAARKAKEEGIPVSTISFGTAYGTVELDGDRVPVPV 253 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L++ + S G FF + EL E ++K+ +I ++ R Sbjct: 254 DDPSLKQIANLSGGNFFTASSLDELNEVYEKLQSEIGYETRR 295 >gi|73974730|ref|XP_539177.2| PREDICTED: similar to collagen, type XXII, alpha 1 [Canis familiaris] Length = 1628 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 68/195 (34%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ L + Sbjct: 53 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPDRTRVGVVRYSDRPTTAFELGLFGSRE 107 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 VK+ L NTNT A+ R ++ + G K+ I +TDG + Sbjct: 108 AVKAAARHLAYHGGNTNTGDALRFITRHSFS--RQAGGRPGDRAFKQVAILLTDGRSQD- 164 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 L AG++I++V V ++ L + F V+D + Sbjct: 165 -------LVLDAAATAHRAGIRIFAVGVG--AALREELEEIASEPKSAHVFHVSDFDAID 215 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 216 KIRGKLRRRLCENVL 230 >gi|312886236|ref|ZP_07745850.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] gi|311301261|gb|EFQ78316.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] Length = 335 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 25/192 (13%), Positives = 55/192 (28%), Gaps = 40/192 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENT 275 N + + + + PL+ + + + + + + T Sbjct: 114 NRLEAGKNIAIDFIKNRPDDRIGLVIFSGESFTQCPLTIDHDVLINLYHDIKNGMIEDGT 173 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A L GS K VI +TDG N+ S + E Sbjct: 174 AIGMGLATAVNRL----------RGSEAKSKVVILLTDGVNNAGSIP-----PITAAEIA 218 Query: 336 RNAGMKIYSVAVSAPP----------------------EGQDLLRKCTDSSGQFFAVNDS 373 + G+++Y+V + + L + + G++F ++ Sbjct: 219 KQFGIRVYTVGIGTQGYAPYPVPSPYGGVVYQRMEVQIDEPTLTKIAAITGGKYFRATNN 278 Query: 374 RELLESFDKITD 385 L + +I Sbjct: 279 DALTRIYKQIDQ 290 >gi|312139646|ref|YP_004006982.1| integral membrane protein [Rhodococcus equi 103S] gi|311888985|emb|CBH48298.1| putative integral membrane protein [Rhodococcus equi 103S] Length = 326 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 59/152 (38%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N +E K+ ++ L E T T A+ + + + ++ ++ ++DG+ Sbjct: 147 TPNRDETKAAIDNLTLSERTATGEAIFTSLQSIDT--LAAVLGGSEQAPPARIVLLSDGK 204 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + + ++ G+ I +++ P LR+ + Sbjct: 205 QTVPESPDDPRGGFTAARQAKDKGVPISTISFGTGYGTVEIEGDRIPVPVDDPSLREIAN 264 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G FF + EL + +D + ++I ++ R Sbjct: 265 LSGGNFFTASSLEELRDVYDTLEEQIGFENAR 296 >gi|325676908|ref|ZP_08156581.1| von Willebrand factor [Rhodococcus equi ATCC 33707] gi|325552456|gb|EGD22145.1| von Willebrand factor [Rhodococcus equi ATCC 33707] Length = 326 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 59/152 (38%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N +E K+ ++ L E T T A+ + + + ++ ++ ++DG+ Sbjct: 147 TPNRDETKAAIDNLTLSERTATGEAIFTSLQSIDT--LAAVLGGSEQAPPARIVLLSDGK 204 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + + ++ G+ I +++ P LR+ + Sbjct: 205 QTVPESPDDPRGGFTAARQAKDKGVPISTISFGTGYGTVEIEGDRIPVPVDDPSLREIAN 264 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G FF + EL + +D + ++I ++ R Sbjct: 265 LSGGNFFTASSLEELRDVYDTLEEQIGFENAR 296 >gi|296228120|ref|XP_002759672.1| PREDICTED: collagen alpha-5(VI) chain [Callithrix jacchus] Length = 2614 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 62/162 (38%), Gaps = 13/162 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN-KLNPYENTNTYPAMHHAYREL 288 + V+ G + Y+ +N + + L + + NT T A+ HA Sbjct: 845 DVGRDRVQFGALKYSDNPEILFYLNTYSNRSAIIENLRMRRDTGGNTYTAKALKHANALF 904 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E H + + +K+ +I ITDGE+ +T +R+ G+ I++V V Sbjct: 905 TEE----HGSRINQNVKQMLIVITDGESDDRVELNDT------AAKLRDKGITIFAVGVG 954 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q L + V++ +L + + + + + + Sbjct: 955 K--ADQKELEGMAGNKNNTIYVDNFDKLKDIYAPVQESMCTE 994 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 16/166 (9%), Positives = 60/166 (36%), Gaps = 16/166 (9%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHA 284 + VR+G + ++ + +N +++ + + +T+T A+ Sbjct: 469 TEMFSIGPDKVRVGVVQFSDKMRVEFSITDYSNDIDLRKAILNIQQLTGDTHTGEALDF- 527 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + ++ S + ++I +TDG++ + ++ + +R + +++ Sbjct: 528 ---ILPRIKNGIKERMSQ-VPCYLIVLTDGKSQYS--------VVEPAKRVRAEQITVHA 575 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + + L++ + + L +++ I + Sbjct: 576 IGIG--EANKKELQEIAGKEERVSFGQNFDALKSIKNEVVHGICTE 619 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 51/165 (30%), Gaps = 17/165 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + +IG + ++ E+ +++++ + T A++ Sbjct: 656 TKIQIGADKTQIGVVQFSDDPEEEFQLNTYFTQQEISDAIDRMSLIDKGTLMGKALNFVD 715 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + K + KF+I ITDG +R + I+SV Sbjct: 716 QYFTHSKGARFGAK------KFLILITDGVAQDDVRDP--------ARILRGKDVTIFSV 761 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 V + L + + F V + L K+ ++ Sbjct: 762 GV--YGADRSQLEEISGDGSLVFYVENFDHLQALEKKLVFRVCAL 804 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 66/186 (35%), Gaps = 20/186 (10%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N + +L+++ + ++ ++IG + S N + K+++ Sbjct: 1018 NSDFVTMTTFLSDLIDNF-----DIQSQRMKIGMAQFGSDYQNIIDLKSSLNKTQWKTQI 1072 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 ++ +P + +A +++ N + + + ++ IT G+ Sbjct: 1073 QTVSKSSG---FPRIDYALKKVGNMFNIRAGGRRNAGVPQTLVVITSGDP--------RY 1121 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL--ESFDKIT 384 N + +++ G+ + + + + LL T +S + L + +I Sbjct: 1122 NVAGAVKALKDLGICVLVLGIG-DVHKEQLL-PITGNSEKIITFQHFDRLKNVDVKKRIV 1179 Query: 385 DKIQEQ 390 +I + Sbjct: 1180 REICQS 1185 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 22/197 (11%), Positives = 63/197 (31%), Gaps = 22/197 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + ++NS+ + R+ Y+ + Sbjct: 35 VDSSDHLGTKSFPFVKTFINKMINSLP-----IEADKYRVALAQYSDKLHSEFHLSTFKG 89 Query: 259 LNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N + + L N + A+ A+R ++ + + V+ + Sbjct: 90 RNPMLNHLKKNFQFLGGSLQIGKALQEAHRTYFSAPTNGRDRK--QFPPILVVLAS---- 143 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 ++ + + + ++ G+KI ++V ++ L+ S F + R+L Sbjct: 144 -----AESEDDVEEASKALQEDGVKI--ISVGVQEASEENLKAMATSH-FHFNLRTVRDL 195 Query: 377 LESFDKITDKIQEQSVR 393 F + +I ++ + Sbjct: 196 ST-FSQNMTQIIKEVTK 211 >gi|146298482|ref|YP_001193073.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146152900|gb|ABQ03754.1| BatA-like protein [Flavobacterium johnsoniae UW101] Length = 334 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 56/178 (31%), Gaps = 44/178 (24%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYN 290 + RIG + Y TP++++ + + + + T + A L Sbjct: 130 NDRIGLVLYASEAYTK--TPVTSDKPIILEAIKGIRYDTVLQDGTGIGMGLATAVNRL-- 185 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 S + +I +TDG N+ + + + G+K+Y++ + Sbjct: 186 --------KDSKAKSRVIILLTDGVNNAG-----FIEPETAADIAKQYGIKVYTIGLGTN 232 Query: 351 P----------------------EGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + L++ + G +F + +L E ++ I Sbjct: 233 GMAESPYAYAPNGGFLFKMQKVEIDERLMKSIAKKTDGTYFRATSNDKLAEIYNSINK 290 >gi|33152377|ref|NP_873730.1| tight adherence protein G [Haemophilus ducreyi 35000HP] gi|21326716|gb|AAL92476.1| TadG [Haemophilus ducreyi] gi|33148600|gb|AAP96119.1| tight adherence protein G [Haemophilus ducreyi 35000HP] Length = 562 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 58/516 (11%), Positives = 125/516 (24%), Gaps = 136/516 (26%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVL---------------SGCASIVSD 47 A++ + ++++A I+ + ++ AL+ AVL + + ++ Sbjct: 27 ALLTLPIVALLFVSLEVAGIIQDKARLNDALEQAVLSLTAENNSGRKSYDYALTNAEKAN 86 Query: 48 RTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYI--- 104 + + S I K +K +L Q I IT Y Sbjct: 87 GKYLADSEAGKRDSQIVKTFVKLYLPQIDENTMKFEPICTTQNNAITPKNGKQYAYSSSH 146 Query: 105 AESKAQYEIPTENLFLKGLIPSALTNLSLR---STGIIERSSENLAISICMVLDVSRSME 161 I +LF + S + + + + ++ NL + + +V D+S SM+ Sbjct: 147 VTCTVTGSINHRSLFPMTVGKSKIIPEQVSLSSGSMAQKINNVNLPLDLMVVADLSGSMD 206 Query: 162 DLYLQKHNDNNNMT-----------------------------------------SNKYL 180 +N Sbjct: 207 YNINNHKVYSNTEASKLTLLKQVLEELTDKYLLSEEANPNNRISMIPFAMGAQHPIRNSC 266 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRK------IDVLIESAGNLVNSIQK-----AIQ 229 + P K +P N + + + L+++ + Sbjct: 267 VLPFEWNQSHIGYNDSQKVSPNEIEYNLRNLPIRSRTIFTHNLVYLLDTKKTLEKIGTRF 326 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 ++ + I Q N + + +L T + A + Sbjct: 327 NNYDVEYQKSAICLEGSDKFQQQWYEKNQKINFINEVKRLKAAGATLASSGLIVAVNNML 386 Query: 290 NEKESSHNTIGSTR------LKKFVIFITDGENSGA------------------------ 319 NE S TR D + Sbjct: 387 NEPARSDVLKQQTRRTILILSDGSDSIGDDSGENNWYQKEIPFMNFSRITENLILGKQEL 446 Query: 320 -------------------------SAYQNTLNTLQICEYM--------RNAGMKIYSVA 346 + + T +C+ + ++ KI V Sbjct: 447 FNKSPQSKNLENHIYGYRYNYPIYLTNNTEKIQTKGLCDVIRDKLNTKNKDNNTKIIFVE 506 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + +D C + +++ LLE+F + Sbjct: 507 LGYNSSSKDTWLHCVGGTQNYYSATSKESLLEAFKQ 542 >gi|10334988|gb|AAD46685.2| TadG [Aggregatibacter actinomycetemcomitans] gi|26000721|gb|AAN75217.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 60/481 (12%), Positives = 130/481 (27%), Gaps = 112/481 (23%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAVLS----------------GCASIVSDRTIKDPTTK 56 + + +D I+ + ++ A D A L VS + I + Sbjct: 41 VAFTVDGTGILLDKARLAQATDQAALLLIAEDNQYRKNKDHSDVTRQRVSQQDIDRESKD 100 Query: 57 ----------KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ------KAQINITKDKNNP 100 K + + + +K +L+ + + N Sbjct: 101 FSNAKVQAQWKKRNQELVQGLVKLYLRSDDSNGQKNSSPVTIKEPFLAECLEEKTQPRNK 160 Query: 101 LQY------IAESKAQYEIPT---ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 + + Q + + L + ++ T ++ + I + Sbjct: 161 NGTAKSIACVVQGSVQRKFWLPWGQTLVSSSQLHDGRVGINSGETYAVKEKQITIPIDLM 220 Query: 152 MVLDVSRSMEDLYL----------QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 MV D+SRSM + + D + PK + Sbjct: 221 MVTDLSRSMNWAIVSHRDVEVPPPNRRIDALREVVSNIQDILLPKAIRDDISPYNRIGFV 280 Query: 202 APAPANRKIDVLIESA-------------------GNLVNSIQKAIQEKKNLSVRIGTIA 242 + A R+ D G + ++ + Sbjct: 281 SFAAGARQKDETDNCVLPYYSKQNKQAEISNYFNSGQISQGFEELSRSMDIEKTINQITQ 340 Query: 243 YNIGIVGNQCTPLSN-----------------------NLNEVKSRLNKLNPYENTNTYP 279 + G + LS+ + V L ++ P T Sbjct: 341 FKNGEKKSYPFSLSSLSSRNFCLENNKGKATTQAWFSKSKPGVADALKEIEPLGGTAVTS 400 Query: 280 AMHHAYRELYNEKESSHNTIG--STRLKKFVIFITDGENSGASAYQ-NTLNTLQICEYMR 336 + + + + +T ++ ++ ++DGE++ S T +CE ++ Sbjct: 401 GIFIGTNLMTDTNKDPEAAPNKLNTNTRRVLLILSDGEDNRPSKNTLVTFMNSGMCEKIK 460 Query: 337 NA------------GMKIYSVAVSAPPEGQDL--LRKCTDSSGQFFAVNDSRELLESFDK 382 +I VA+ P L +KC Q++ VN + LL++F + Sbjct: 461 EKINSLQDSNYPQVEARIAFVALGFNPPQDQLIAWKKCV--GKQYYPVNSKQGLLDAFKQ 518 Query: 383 I 383 I Sbjct: 519 I 519 >gi|307548796|dbj|BAJ19118.1| TadG [Aggregatibacter actinomycetemcomitans] gi|307548811|dbj|BAJ19132.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 84.6 bits (207), Expect = 3e-14, Method: Composition-based stats. Identities = 60/481 (12%), Positives = 130/481 (27%), Gaps = 112/481 (23%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAVLS----------------GCASIVSDRTIKDPTTK 56 + + +D I+ + ++ A D A L VS + I + Sbjct: 41 VAFTVDGTGILLDKARLAQATDQAALLLIAEDNQYRKNKDHSDVTRQRVSQQDIDRESKD 100 Query: 57 ----------KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ------KAQINITKDKNNP 100 K + + + +K +L+ + + N Sbjct: 101 FSNAKVQAQWKKRNQELVQGLVKLYLRSDDSNGQKNSSPVTIKEPFLAECLEEKTQPRNK 160 Query: 101 LQY------IAESKAQYEIPT---ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 + + Q + + L + ++ T ++ + I + Sbjct: 161 NGTAKSIACVVQGSVQRKFWLPWGQTLVSSSQLHDGRVGINSGKTYAVKEKQITIPIDLM 220 Query: 152 MVLDVSRSMEDLYL----------QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 MV D+SRSM + + D + PK + Sbjct: 221 MVTDLSRSMNWAIVSHRDVEVPPPNRRIDALREVVSNIQDILLPKAIRDDISPYNRIGFV 280 Query: 202 APAPANRKIDVLIESA-------------------GNLVNSIQKAIQEKKNLSVRIGTIA 242 + A R+ D G + ++ + Sbjct: 281 SFAAGARQKDETDNCVLPYYSKQNKQAEISNYFNSGQISQGFEELSRSMDIEKTINQITQ 340 Query: 243 YNIGIVGNQCTPLSN-----------------------NLNEVKSRLNKLNPYENTNTYP 279 + G + LS+ + V L ++ P T Sbjct: 341 FKNGEKKSYPFSLSSLSSRNFCLENNKGKATTQAWFSKSKPGVADALKEIEPLGGTAVTS 400 Query: 280 AMHHAYRELYNEKESSHNTIG--STRLKKFVIFITDGENSGASAYQ-NTLNTLQICEYMR 336 + + + + +T ++ ++ ++DGE++ S T +CE ++ Sbjct: 401 GIFIGTNLMTDTNKDPEAAPNKLNTNTRRVLLILSDGEDNRPSKNTLVTFMNSGMCEKIK 460 Query: 337 NA------------GMKIYSVAVSAPPEGQDL--LRKCTDSSGQFFAVNDSRELLESFDK 382 +I VA+ P L +KC Q++ VN + LL++F + Sbjct: 461 EKINSLQDSNYPQVEARIAFVALGFNPPQDQLIAWKKCV--GKQYYPVNSKQGLLDAFKQ 518 Query: 383 I 383 I Sbjct: 519 I 519 >gi|326918160|ref|XP_003205359.1| PREDICTED: hypothetical protein LOC100539194 [Meleagris gallopavo] Length = 1584 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 70/200 (35%), Gaps = 22/200 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV + + R+G + Y+ Sbjct: 45 ASSSVGKEDFEKVRQWVSNLVETFE-----IGPDKTRVGVVRYSDRPTTEFDLGKYKTRE 99 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+K K+ NTNT A+ + ++ + + + +KK I +TDG + Sbjct: 100 EIKEAARKIRYYGGNTNTGDALRYINTYSFS--KEAGGRLSDRTVKKVAILLTDGRSQDY 157 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 R AG++I++V V ++ L + F V+D + Sbjct: 158 VLDP--------ANAARQAGIRIFAVGVG--EALKEELDEIASEPKSAHVFHVSDYNAID 207 Query: 378 ESFDKITDKIQEQSVRIAPN 397 + K+ ++ E + PN Sbjct: 208 KIRGKLRRRLCENV--LCPN 225 >gi|258624772|ref|ZP_05719703.1| putative Flp pilus assembly protein TadG [Vibrio mimicus VM603] gi|258582934|gb|EEW07752.1| putative Flp pilus assembly protein TadG [Vibrio mimicus VM603] Length = 419 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 36/427 (8%), Positives = 115/427 (26%), Gaps = 62/427 (14%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 ++ + + + + L+ ++ A + A L+ I P + + Sbjct: 17 MMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLAL---------IASPKEDDENNVSY 67 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 +K + +++ + + + + + ++ Sbjct: 68 ARKLVDRYVVDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHKSWISYE- 126 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 +L + + R + + ++D+S SM + + + + + Sbjct: 127 -NISLKPEFTVNGSSVTRKFLPQPVDVYFIVDMSASMRATWQNGKSQIDEVKNVIT-RVV 184 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 K F ++ ++ ++ +E+ + Q N+ + Sbjct: 185 NDLKDFDTEVKSRVALLGYHNFNIKQGGRSLEAYDYAL--YNTPQQTVSNMFFPPKRVNP 242 Query: 244 NIGI-VGNQCTPLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNT 298 ++ L+ N + +N N T ++ + + + Sbjct: 243 GDSALFSHRDIDLTQNYSSFLQIMNDRNFYPPRSACTESWQGII--------AAAQAADK 294 Query: 299 IGSTRLKKFVIFITDGENSGA------------SAYQNTLNTLQICEYMRNA-------- 338 ++ I ++DG + Y L +C+ ++ Sbjct: 295 ATDINPEQVFIILSDGADCPWQRQDRWGRLRTTEYYLKKLVDGGLCKNLKQRIRQKPNRF 354 Query: 339 -----------GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + + + V+ D C + + + I + I Sbjct: 355 QSSTPTENEKTKVTMGVIGVNYQVNPNDGFGDCV-GRENIYHATQGE---DVYKYILNLI 410 Query: 388 QEQSVRI 394 E++ R+ Sbjct: 411 NEETGRL 417 >gi|251779520|ref|ZP_04822440.1| von Willebrand factor, type A domain protein [Clostridium botulinum E1 str. 'BoNT E Beluga'] gi|243083835|gb|EES49725.1| von Willebrand factor, type A domain protein [Clostridium botulinum E1 str. 'BoNT E Beluga'] Length = 815 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 50/355 (14%), Positives = 99/355 (27%), Gaps = 85/355 (23%) Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME--DLYLQKHNDNNNMTSNKYL 180 ++ + E E I +VLD S SM + +N ++ Sbjct: 68 IVGEDILIKGKIIPKPFEMEMERQKKEIVLVLDTSGSMNEKVGKVCTNNRGWYCKTHNSS 127 Query: 181 LPPPPKKSFWSKNTTKS-------KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ---- 229 + F+ A + K++ L ++A N ++ ++ Sbjct: 128 DLYHRESLFYHNWINDYCEEHGKVGQHYASYSKSTKMEELKKAANNFIDKMKDVPDLKIC 187 Query: 230 -----------------EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 +K + SV L++N N + S +N L Sbjct: 188 IVNYSSEATINPCGYNGDKNSASVEEDRHHTIPNYKSLGTKFLNSNDNTLHSMINGLKAL 247 Query: 273 ENTNTYPAMHHA--------------------------------------YRELYNEKES 294 TNT + A Y L ++ Sbjct: 248 GGTNTGEGLRKAEYMLEQGDKDAKKTIVFMSDGLPTYYSVYKNHQNVQKYYWVLKYSWDN 307 Query: 295 SHNTIGSTRLKKFVIF----ITD-------GENSGASAYQNTLNTLQICEYMRNAGMKIY 343 ++ K+ + TD G + +I E +++ ++ Sbjct: 308 GYHWEKEYYWKEEDYWDYYTSTDDTFPNYAGTGNSDDQGCCKKYAKKIGEIIKSNNSNVF 367 Query: 344 SVAVSA---PPEGQDLLRKCTDSSGQF---FAVNDSRELLESFDKITDKIQEQSV 392 S+ + ++++ DS G F D+ + E F++I DKI E Sbjct: 368 SLGYGLGNKNSDANKIMKEIHDSMGGAEKDFFATDTGAIDEIFNQIADKIIESYT 422 >gi|115525407|ref|YP_782318.1| hypothetical protein RPE_3406 [Rhodopseudomonas palustris BisA53] gi|115519354|gb|ABJ07338.1| conserved hypothetical protein [Rhodopseudomonas palustris BisA53] Length = 580 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 39/262 (14%), Positives = 81/262 (30%), Gaps = 20/262 (7%) Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 + + + Y + S + + T A Sbjct: 317 YDSVQRTVASCTGYNSTECSCSGWPSVCKIYHTWRPADTVTPVTAADTAATPSTSTWNGC 376 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 + N + + + A I + T ++NN + + ++ L Sbjct: 377 VTDRGTYSGPSNDYDRNVSLPLSGVPASRFPAEQISSCAPKVTEMNNNWATMNTTVDGLF 436 Query: 271 PYENTNTYPAMHHAYRELYNE--KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 P TN + ++ L + + ++ ++DG N+ Y N +T Sbjct: 437 PVGGTNQPIGLVWGWQSLVGGGPFPTPPVKDEQYTYQDIIVLMSDGLNTVDRWYGNGWDT 496 Query: 329 -------------LQICEYMRNAGMKIYSVAVSAPPEGQ-DLLRKCTD---SSGQFFA-V 370 C ++ AG+K+Y+V V+ + LL+ C G+ F V Sbjct: 497 NTSVDNRMYASATTGTCVNVKAAGIKVYTVHVNTNGSPESTLLKNCASPADDGGKEFQMV 556 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + L +F+ I K+ + V Sbjct: 557 TSASGLNAAFNSIATKLTDLRV 578 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 28/252 (11%), Positives = 65/252 (25%), Gaps = 26/252 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + I F+ A+D + + R MQSALD+ L ++ K Sbjct: 27 LFGIACVPLITFVGAAVDYSRAVAARTAMQSALDSTALMVAKDYSLNKISASEIDGK--- 83 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + A ++ N + ++PT+ Sbjct: 84 ------------AKSIFSALYTNKSANSVEVVAVLTPNTGKGSTIKVDGTGKVPTD---F 128 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY- 179 L+ + ++ S + + + + +VLD + SM D Sbjct: 129 MKLVNISQIDIGASS----TTTWGSTRLRVALVLDTTGSMNDNGKIGALKTATQNLLTQL 184 Query: 180 -LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN--LVNSIQKAIQEKKNLSV 236 P+ + S + + +++ + + +V Sbjct: 185 KDAAGKPEDVYVSIIPFSKDVNVGASNYTANWIDWTDWKSQPPVLDYAKSGSKSGLVDNV 244 Query: 237 RIGTIAYNIGIV 248 + Sbjct: 245 AWKDVGPGSKCP 256 >gi|258620794|ref|ZP_05715829.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258586992|gb|EEW11706.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 419 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 37/427 (8%), Positives = 118/427 (27%), Gaps = 62/427 (14%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 ++ + + + + L+ ++ A + A L+ I P + + Sbjct: 17 MMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLAL---------IASPKEDDENNVSY 67 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 +K + +++ + + + + + ++ Sbjct: 68 ARKLVDRYVVDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHKSWISYE- 126 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 +L + + R + + ++D+S SM + + + + + Sbjct: 127 -NISLKPEFTVNGSSVTRKFLPQPVDVYFIVDMSASMRATWQNGKSQIDEVKNVIT-RVV 184 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 K F ++ ++ ++ +E+ + Q N+ + Sbjct: 185 NDLKDFDTEVKSRVALLGYHNFNIKQGGRSLEAYDYAL--YNTPQQTVSNMFFPPKRVNP 242 Query: 244 NI-GIVGNQCTPLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNT 298 + G+ ++ L+ N + +N N T ++ + + + Sbjct: 243 SDSGLFSHRDIDLTQNYSSFLQIMNDRNFYPPRSACTESWQGII--------AAAQAADK 294 Query: 299 IGSTRLKKFVIFITDGENSGA------------SAYQNTLNTLQICEYMRNA-------- 338 ++ I ++DG + Y L +C+ ++ Sbjct: 295 ATDINPEQVFIILSDGADCPWQRQDRWGRLRTTEYYLKKLVDGGLCKNLKQRIRQKPNRF 354 Query: 339 -----------GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + + + V+ D C + + + I + I Sbjct: 355 QSSTPTENEKTKVTMGVIGVNYQVNPNDGFGDCV-GRENIYHATQGE---DVYKYILNLI 410 Query: 388 QEQSVRI 394 E++ R+ Sbjct: 411 NEETGRL 417 >gi|269926132|ref|YP_003322755.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] gi|269789792|gb|ACZ41933.1| von Willebrand factor type A; type II secretion system protein [Thermobaculum terrenum ATCC BAA-798] Length = 643 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 37/275 (13%), Positives = 86/275 (31%), Gaps = 21/275 (7%) Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 L + L+ S + S ++ + + ++ S + D + Sbjct: 15 LVILTFYSYFTEALAANSGNTVRVSIREVSTTSQPKIVMTLSANNSKGLPVTDLSADDFI 74 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 K P P +D + + Q A N Sbjct: 75 ----VKENGKEQSDIAVYPFYQNPDPIDVVLALDTSASMNDDAFTAAQDAAYGLINGLSP 130 Query: 238 IGTIAYNIGIVGNQCT-PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + + PL+ + V+ + KL+ T Y + A +E+ + + Sbjct: 131 EDKVGLITFDKTARVIEPLAQDHARVQESIQKLSRSVGTALYQGLSLAAQEVAKGQNT-- 188 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 K ++ +TDG N+ + + + G +++V + Q L Sbjct: 189 ---------KAIVLMTDGFNTSRNTTL-----EEAVAKAQEVGASVFTVGFGKKVDTQGL 234 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + ++ G++F+ + +L F I+ K+ ++ Sbjct: 235 QKIANETGGEYFSAPTNAQLRRVFADISQKLHQEY 269 >gi|56707447|ref|YP_169343.1| hypothetical protein FTT_0293 [Francisella tularensis subsp. tularensis SCHU S4] gi|110669918|ref|YP_666475.1| hypothetical protein FTF0293 [Francisella tularensis subsp. tularensis FSC198] gi|115314141|ref|YP_762864.1| hypothetical protein FTH_0198 [Francisella tularensis subsp. holarctica OSU18] gi|254370860|ref|ZP_04986865.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254874284|ref|ZP_05246994.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] gi|56603939|emb|CAG44926.1| hypothetical membrane protein [Francisella tularensis subsp. tularensis SCHU S4] gi|110320251|emb|CAL08309.1| hypothetical membrane protein [Francisella tularensis subsp. tularensis FSC198] gi|115129040|gb|ABI82227.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica OSU18] gi|151569103|gb|EDN34757.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis FSC033] gi|254840283|gb|EET18719.1| conserved hypothetical protein [Francisella tularensis subsp. tularensis MA00-2987] Length = 339 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 76/205 (37%), Gaps = 47/205 (22%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 A + D+++ A +++ + R+G I + TPL+ ++ Sbjct: 118 ANGQMESRFDLVMRVANQFIDTRK---------GDRVGLILFGTRAYLQ--TPLTFDIAT 166 Query: 262 VKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 VK L+ + P T A+ A ++L S K +I +TDGEN+ Sbjct: 167 VKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS----------KALILLTDGENNS 216 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVS-----------------APPEGQDLLRKCT 361 TL LQ E + +KIY++ + + +L K Sbjct: 217 G-----TLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIA 271 Query: 362 D-SSGQFFAVNDSRELLESFDKITD 385 + G++F +S +L + ++ I Sbjct: 272 TMTGGKYFRAQNSSDLKKVYESIDK 296 >gi|296445280|ref|ZP_06887239.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] gi|296257235|gb|EFH04303.1| von Willebrand factor type A [Methylosinus trichosporium OB3b] Length = 575 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 38/412 (9%), Positives = 101/412 (24%), Gaps = 71/412 (17%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 +K + + + ++ + + ++ +N + Sbjct: 164 SAGGKSKIAALRDAATSFVNNIYSKTTDVKMSIVPFSAGVRVLDPSVSSNRTLSWIDVNG 223 Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 L+ + + + S+ D E + Sbjct: 224 NNSQHWLVFGDGSLVAATAKAAAKTAGFTSRFDIFTKLKSLNSSWDWGGCFEGPKYPLNV 283 Query: 170 DNNNMTSNKY------LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS 223 + + ++ L P + N + + L + Sbjct: 284 SDTAVDTSNAETLFVPFLAPDEPSTKDKYNNSLYTNNYLAETGGSCSGTVTGDWKLLTRA 343 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + K G + L+ + + ++++ L TN + Sbjct: 344 CKY--GKPKKDGSGAGPNSSCPTSSSQTVLQLTATQSTITTKISGLTENGYTNLHEGFMW 401 Query: 284 AYRELYNEKE-SSHNTIGSTRLKKFVIFIT------------------------------ 312 +R + ++ + K ++F+T Sbjct: 402 GWRTISPTGPFAAGRAYATKDNHKIIVFMTDGFNNWQSATSTVTGSAYQAAGYYSYNGTA 461 Query: 313 -----DGENSGASAYQN----------------------TLNTLQICEYMRNAGMKIYSV 345 DG + + TL+ C + AG++IY++ Sbjct: 462 NQRFPDGTATNGNGVNYQTTLEAAAGSSTDYHDTSRNMQDELTLEACTNAKTAGVEIYTI 521 Query: 346 AVSA-----PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 S +G +++ C + +FA D L +F I + + + Sbjct: 522 GFSVPVDPIDAQGLKMMQDCATDANHYFAATDVDSLNAAFASIGSGVGKLRL 573 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/269 (8%), Positives = 66/269 (24%), Gaps = 5/269 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKD-Q 59 + + L + +D + ++ +Q A D+A L+ +IV+ T + ++ Sbjct: 27 IFGLSFIPLVLMLGAGVDYGRAVSTKSNLQQATDSAALAVAKTIVATTTNQQAQSQAQVY 86 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL-QYIAESKAQYEIPTENL 118 T + + K + AQI T K + ++ + P Sbjct: 87 LLTNVRNAVAVVTKAEISADRLTLCLDSTAQIPTTIMKIAHIETITTKATTCAQTPGGMN 146 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 + + S+ + + L + ++ S + Sbjct: 147 GTYEIALVLDNSGSMSKSAGGKSKIAALRDAATSFVNNIYSKTTDVKMSIVPFSAGVRVL 206 Query: 179 YLLPPPPKKSFWSKNTTKSKY---APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + W + + + K S Sbjct: 207 DPSVSSNRTLSWIDVNGNNSQHWLVFGDGSLVAATAKAAAKTAGFTSRFDIFTKLKSLNS 266 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 + + + + + ++ Sbjct: 267 SWDWGGCFEGPKYPLNVSDTAVDTSNAET 295 >gi|56477526|ref|YP_159115.1| hypothetical protein ebA3711 [Aromatoleum aromaticum EbN1] gi|56313569|emb|CAI08214.1| hypothetical protein ebA3711 [Aromatoleum aromaticum EbN1] Length = 441 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 28/236 (11%), Positives = 64/236 (27%), Gaps = 8/236 (3%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +TA+ + V F A+D H+ + ++Q+ DA L+ + + T+ + Sbjct: 18 ITALSLVVLVGFAGLALDGGHLYLTKTELQNGADACALAASYELTGSPISPENFTRAENA 77 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK-NNPLQYIAESKAQYEIPTENLF 119 + + QG I D+ + + + + Sbjct: 78 GKTVGTENRVDF-QGGAIAAADIDVTFSTSLAGSWLPAGGATGNSKYVRCTITRNGIAPW 136 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS-MEDLYLQKHNDNNN-MTSN 177 ++ +S +T + S N AI + + S S Y++ + N S Sbjct: 137 FMQVMGFGDQTVSAIATATLAPSQNNCAIPMGLCTHPSSSAPHFGYVKGDWYSMNFKESG 196 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + W + +++ Sbjct: 197 GGTMENLTGDFRWVDFDPSTTTPNCSGKGAQEL----SCLFEGAGQCNLPPNGPST 248 >gi|89889805|ref|ZP_01201316.1| BatA, aerotolerance operon [Flavobacteria bacterium BBFL7] gi|89518078|gb|EAS20734.1| BatA, aerotolerance operon [Flavobacteria bacterium BBFL7] Length = 337 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 62/182 (34%), Gaps = 44/182 (24%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYR 286 K + RIG + Y TP++ + +N++ T + A Sbjct: 129 KGRPNDRIGVVVYAGESYTK--TPITTDEMISLRAINEIAFDGVLENGTAIGMGLATAVN 186 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 L S L K +I +TDG N+ ++ E +K+Y++ Sbjct: 187 RL----------KDSEALSKVIILMTDGVNNSG-----FIDPKIASELALEYDIKVYTIG 231 Query: 347 VSAPPE----------------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKI 383 + ++L+++ D+ G++F ++++L E + +I Sbjct: 232 IGTNGNAPSPVAQIGRNKFRMAMMPVEIDEELMKQIAVDTGGKYFRATNNKKLEEIYGEI 291 Query: 384 TD 385 Sbjct: 292 DK 293 >gi|187932172|ref|YP_001892157.1| protein of unknown function containing a von Willebrand factor type A (vWA) domain [Francisella tularensis subsp. mediasiatica FSC147] gi|187713081|gb|ACD31378.1| protein of unknown function containing a von Willebrand factor type A (vWA) domain [Francisella tularensis subsp. mediasiatica FSC147] Length = 333 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 76/205 (37%), Gaps = 47/205 (22%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 A + D+++ A +++ + R+G I + TPL+ ++ Sbjct: 112 ANGQMESRFDLVMRVANQFIDTRK---------GDRVGLILFGTRAYLQ--TPLTFDIAT 160 Query: 262 VKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 VK L+ + P T A+ A ++L S K +I +TDGEN+ Sbjct: 161 VKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS----------KALILLTDGENNS 210 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVS-----------------APPEGQDLLRKCT 361 TL LQ E + +KIY++ + + +L K Sbjct: 211 G-----TLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIA 265 Query: 362 D-SSGQFFAVNDSRELLESFDKITD 385 + G++F +S +L + ++ I Sbjct: 266 TMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|150005795|ref|YP_001300539.1| hypothetical protein BVU_3288 [Bacteroides vulgatus ATCC 8482] gi|149934219|gb|ABR40917.1| conserved hypothetical protein BatA [Bacteroides vulgatus ATCC 8482] Length = 332 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 31/252 (12%), Positives = 68/252 (26%), Gaps = 56/252 (22%) Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + + S A +++ + A + Sbjct: 72 ARPQTTDNWQNTEIEGIDIMLAVDVSTSMLAEDLKPNRLEAAKQVASEFI---------- 121 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--------ENTNTYPAMHH 283 N PL+ + + + N + + T + + Sbjct: 122 -NGRPNDNIGLTIFAGESFTQCPLTVDHGVLLNLFNSIKGDIAQRGLIEDGTAIGMGIAN 180 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A L S K +I +TDG N+ L E + G++IY Sbjct: 181 AVTRL----------KDSKAKSKVIILLTDGSNNRGDIS-----PLTAAEIAKQFGIRIY 225 Query: 344 SVAVSAPPE--------------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDK 382 ++ V + L + ++G +F + +L E + + Sbjct: 226 TIGVGTNGTAPYPMQTYAGTQYVNVPVEIDEKTLTEIAGTTNGNYFRATSNSKLKEVYQE 285 Query: 383 ITDKIQEQSVRI 394 I DK+++ + + Sbjct: 286 I-DKLEKTKLNV 296 >gi|89255637|ref|YP_512998.1| hypothetical protein FTL_0203 [Francisella tularensis subsp. holarctica LVS] gi|134302613|ref|YP_001122584.1| hypothetical protein FTW_1793 [Francisella tularensis subsp. tularensis WY96-3418] gi|156501587|ref|YP_001427652.1| hypothetical protein FTA_0219 [Francisella tularensis subsp. holarctica FTNF002-00] gi|167009921|ref|ZP_02274852.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella tularensis subsp. holarctica FSC200] gi|224456527|ref|ZP_03665000.1| hypothetical protein FtultM_01598 [Francisella tularensis subsp. tularensis MA00-2987] gi|254367031|ref|ZP_04983067.1| hypothetical protein FTHG_00206 [Francisella tularensis subsp. holarctica 257] gi|290953465|ref|ZP_06558086.1| hypothetical protein FtulhU_03745 [Francisella tularensis subsp. holarctica URFT1] gi|295313263|ref|ZP_06803900.1| hypothetical protein FtulhU_03730 [Francisella tularensis subsp. holarctica URFT1] gi|89143468|emb|CAJ78644.1| hypothetical membrane protein [Francisella tularensis subsp. holarctica LVS] gi|134050390|gb|ABO47461.1| conserved membrane protein with von Willebrand factor type A domain [Francisella tularensis subsp. tularensis WY96-3418] gi|134252857|gb|EBA51951.1| hypothetical protein FTHG_00206 [Francisella tularensis subsp. holarctica 257] gi|156252190|gb|ABU60696.1| conserved membrane protein with von Willebrand factor, type A domain [Francisella tularensis subsp. holarctica FTNF002-00] gi|282158589|gb|ADA77980.1| hypothetical protein NE061598_01650 [Francisella tularensis subsp. tularensis NE061598] Length = 333 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 76/205 (37%), Gaps = 47/205 (22%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 A + D+++ A +++ + R+G I + TPL+ ++ Sbjct: 112 ANGQMESRFDLVMRVANQFIDTRK---------GDRVGLILFGTRAYLQ--TPLTFDIAT 160 Query: 262 VKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 VK L+ + P T A+ A ++L S K +I +TDGEN+ Sbjct: 161 VKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS----------KALILLTDGENNS 210 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVS-----------------APPEGQDLLRKCT 361 TL LQ E + +KIY++ + + +L K Sbjct: 211 G-----TLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIA 265 Query: 362 D-SSGQFFAVNDSRELLESFDKITD 385 + G++F +S +L + ++ I Sbjct: 266 TMTGGKYFRAQNSSDLKKVYESIDK 290 >gi|262173885|ref|ZP_06041562.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus MB-451] gi|261891243|gb|EEY37230.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus MB-451] Length = 403 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 37/427 (8%), Positives = 118/427 (27%), Gaps = 62/427 (14%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 ++ + + + + L+ ++ A + A L+ I P + + Sbjct: 1 MMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLAL---------IASPKEDDENNVSY 51 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 +K + +++ + + + + + ++ Sbjct: 52 ARKLVDRYVVDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHKSWISYE- 110 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 +L + + R + + ++D+S SM + + + + + Sbjct: 111 -NISLKPEFTVNGSSVTRKFLPQPVDVYFIVDMSASMRATWQNGKSQIDEVKNVIT-RVV 168 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 K F ++ ++ ++ +E+ + Q N+ + Sbjct: 169 NDLKDFDTEVKSRVALLGYHNFNIKQGGRSLEAYDYAL--YNTPQQTVSNMFFPPKRVNP 226 Query: 244 NI-GIVGNQCTPLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNT 298 + G+ ++ L+ N + +N N T ++ + + + Sbjct: 227 SDSGLFSHRDIDLTQNYSSFLQIMNDRNFYPPRSACTESWQGII--------AAAQAADK 278 Query: 299 IGSTRLKKFVIFITDGENSGA------------SAYQNTLNTLQICEYMRNA-------- 338 ++ I ++DG + Y L +C+ ++ Sbjct: 279 ATDINPEQVFIILSDGADCPWQRQDRWGRLRTTEYYLKKLVDGGLCKNLKQRIRQKPNRF 338 Query: 339 -----------GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + + + V+ D C + + + I + I Sbjct: 339 QSSTPTENEKTKVTMGVIGVNYQVNPNDGFGDCV-GRENIYHATQGE---DVYKYILNLI 394 Query: 388 QEQSVRI 394 E++ R+ Sbjct: 395 NEETGRL 401 >gi|146292146|ref|YP_001182570.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|145563836|gb|ABP74771.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 633 Score = 84.2 bits (206), Expect = 3e-14, Method: Composition-based stats. Identities = 39/363 (10%), Positives = 94/363 (25%), Gaps = 41/363 (11%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A L+ I + + K ++ + ++ I +I I+ Sbjct: 108 ASLAAKHVINTHYVAAPIASDAW--------YGIKQPERNRFEKQIQNGIMVAGEIPIS- 158 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + Y + L + + Sbjct: 159 ----TFSIDVD-TGSYSTLRRMIKEGSLPEKGTIRIEEMLNYFTYDY----PLPNKNAAP 209 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 S + E ++D + + S + K+ +L Sbjct: 210 FSATTELAPSPYNDDMMLLRIGLKGYELTKSELGASNLVFL-LDVSGSMASADKLPLLQT 268 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + L + ++ + Y +++ + L +L +T Sbjct: 269 ALKMLTQQLSAQD--------KVSIVVYAGAAGVVLDGASGDDIQALTYALEQLRAGGST 320 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N + AY+ VI TDG+ + N + + E Sbjct: 321 NGSQGILQAYQLAQKHFIQGGINR--------VILATDGDFNVGV--TNFDQLISLIEKE 370 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + G+ + ++ L+ + D G + ++ L E+ + D++ + I Sbjct: 371 KQRGIGLTTLGFGMDNYNDQLMEQLADKGNGHYAYIDT---LNEARKVLVDELSSTLLTI 427 Query: 395 APN 397 A + Sbjct: 428 AKD 430 >gi|254882023|ref|ZP_05254733.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294776174|ref|ZP_06741663.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|319640969|ref|ZP_07995677.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] gi|254834816|gb|EET15125.1| BatA aerotolerance operon protein [Bacteroides sp. 4_3_47FAA] gi|294449997|gb|EFG18508.1| von Willebrand factor type A domain protein [Bacteroides vulgatus PC510] gi|317387414|gb|EFV68285.1| hypothetical protein HMPREF9011_01274 [Bacteroides sp. 3_1_40A] Length = 332 Score = 84.2 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 31/252 (12%), Positives = 68/252 (26%), Gaps = 56/252 (22%) Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + + S A +++ + A + Sbjct: 72 ARPQTTDNWQNTEIEGIDIMLAVDVSTSMLAEDLKPNRLEAAKQVASEFI---------- 121 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--------ENTNTYPAMHH 283 N PL+ + + + N + + T + + Sbjct: 122 -NGRPNDNIGLTIFAGESFTQCPLTVDHGVLLNLFNSIKGDIAQRGLIEDGTAIGMGIAN 180 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A L S K +I +TDG N+ L E + G++IY Sbjct: 181 AVTRL----------KDSKAKSKVIILLTDGSNNRGDIS-----PLTAAEIAKQFGIRIY 225 Query: 344 SVAVSAPPE--------------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDK 382 ++ V + L + ++G +F + +L E + + Sbjct: 226 TIGVGTNGTAPYPMQTYAGTQYVNVPVEIDEKTLTEIAGTTNGNYFRATSNSKLKEVYQE 285 Query: 383 ITDKIQEQSVRI 394 I DK+++ + + Sbjct: 286 I-DKLEKTKLNV 296 >gi|256820507|ref|YP_003141786.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] gi|256582090|gb|ACU93225.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] Length = 333 Score = 84.2 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 66/181 (36%), Gaps = 43/181 (23%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRE 287 K + RIG + Y TP++ + + + L++L + T + A Sbjct: 126 KDRPNDRIGLVIYAGESYTK--TPVTTDKGIILNALSELTYGQIEDGTAIGMGLATAVNR 183 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L S + +I +TDG N+ ++ E G+++Y++ + Sbjct: 184 L----------KESKAKSRVIILLTDGVNNTG-----FIDPQTAAELAAEYGIRVYTIGI 228 Query: 348 SAPPE----------------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 + + L++K + + G++F D+++L + +D+I Sbjct: 229 GSNGTALSPYALNPDGSIMYRMLQVEIDEPLMKKIAEVTHGRYFRATDNQKLQQIYDEIN 288 Query: 385 D 385 Sbjct: 289 K 289 >gi|301758046|ref|XP_002914871.1| PREDICTED: LOW QUALITY PROTEIN: vitrin-like [Ailuropoda melanoleuca] Length = 686 Score = 84.2 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 42/352 (11%), Positives = 92/352 (26%), Gaps = 31/352 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + ++ R +K K Q + ++ + G+ + + P Sbjct: 356 RTHMNSRDLKTAIDKITQRGGLSNAGRAISYVTKNFFSKANGNRGGAPNVAVVIVDGWPT 415 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + E+ N+F + + + + L+V Sbjct: 416 DKVEEASRLARESGINIFFITVEGATENEKQYVTEPNFSNK-AVCRTNGFYSLNVQSWFS 474 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 K + S + + VL A Sbjct: 475 LRKT---VQPLLKRVCDTDRLACSKTCWNSADIGFVIDGSSSVGTGNFRTVLQFVAN--- 528 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPA 280 E R+G + Y + +++ + + ++ T+T A Sbjct: 529 ---ISKEFEISETDTRVGAVQYTYEQRLEFGFDDYHTKSDILNAIKRVGYWSGGTSTGAA 585 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 +++A +L+ +K +I ITDG + + G+ Sbjct: 586 INYALEQLF--------KKSKPNKRKLMILITDGRSYDDVRIP--------AMVAHHKGV 629 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 Y++ V QD L F V++ L + K+ I + Sbjct: 630 TTYAIGV--AWAAQDELEVIATHPASDHSFFVDEFDNLYKFVPKVIRNICTE 679 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 16/156 (10%), Positives = 50/156 (32%), Gaps = 19/156 (12%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + R+ + + ++ ++ +G + Y N + Sbjct: 309 SSSIGKRRFRIQKQFLADVAQTLDIGPGGPL-----MGVVQYGDNPAAQFSLRTHMNSRD 363 Query: 262 VKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +K+ ++K+ +N A+ + + +++ + + + I DG + Sbjct: 364 LKTAIDKITQRGGLSNAGRAISYVTKNFFSKANGNRGGAPN-----VAVVIVDGWPTD-- 416 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + R +G+ I+ + V E + Sbjct: 417 ------KVEEASRLARESGINIFFITVEGATENEKQ 446 >gi|268316013|ref|YP_003289732.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] gi|262333547|gb|ACY47344.1| von Willebrand factor type A [Rhodothermus marinus DSM 4252] Length = 329 Score = 84.2 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 58/159 (36%), Gaps = 39/159 (24%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P + + + + L +L + T A+ A L S K Sbjct: 140 TQVPPTLDYRFLLTMLQRLQVGRLEDGTAIGTAIATAINRL----------KNSEARSKV 189 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA--------------VSAPPE- 352 +I +TDG+N+ ++ L E R AG++IY++ P+ Sbjct: 190 IILLTDGQNNRG-----EIDPLTAAELARQAGIRIYTIGLSGRGEAPYPVQTPFGTRPQP 244 Query: 353 -----GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + ++R+ + + G++F D+R L + +I Sbjct: 245 VPVEIDEAMMREVAEKTGGRYFRATDARTLEAIYAEIDR 283 >gi|281341943|gb|EFB17527.1| hypothetical protein PANDA_002811 [Ailuropoda melanoleuca] Length = 652 Score = 84.2 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 42/352 (11%), Positives = 92/352 (26%), Gaps = 31/352 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + ++ R +K K Q + ++ + G+ + + P Sbjct: 322 RTHMNSRDLKTAIDKITQRGGLSNAGRAISYVTKNFFSKANGNRGGAPNVAVVIVDGWPT 381 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + E+ N+F + + + + L+V Sbjct: 382 DKVEEASRLARESGINIFFITVEGATENEKQYVTEPNFSNK-AVCRTNGFYSLNVQSWFS 440 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 K + S + + VL A Sbjct: 441 LRKT---VQPLLKRVCDTDRLACSKTCWNSADIGFVIDGSSSVGTGNFRTVLQFVAN--- 494 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPA 280 E R+G + Y + +++ + + ++ T+T A Sbjct: 495 ---ISKEFEISETDTRVGAVQYTYEQRLEFGFDDYHTKSDILNAIKRVGYWSGGTSTGAA 551 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 +++A +L+ +K +I ITDG + + G+ Sbjct: 552 INYALEQLF--------KKSKPNKRKLMILITDGRSYDDVRIP--------AMVAHHKGV 595 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 Y++ V QD L F V++ L + K+ I + Sbjct: 596 TTYAIGV--AWAAQDELEVIATHPASDHSFFVDEFDNLYKFVPKVIRNICTE 645 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 16/156 (10%), Positives = 50/156 (32%), Gaps = 19/156 (12%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + R+ + + ++ ++ +G + Y N + Sbjct: 275 SSSIGKRRFRIQKQFLADVAQTLDIGPGGPL-----MGVVQYGDNPAAQFSLRTHMNSRD 329 Query: 262 VKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +K+ ++K+ +N A+ + + +++ + + + I DG + Sbjct: 330 LKTAIDKITQRGGLSNAGRAISYVTKNFFSKANGNRGGAPN-----VAVVIVDGWPTD-- 382 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + R +G+ I+ + V E + Sbjct: 383 ------KVEEASRLARESGINIFFITVEGATENEKQ 412 >gi|297286914|ref|XP_001113364.2| PREDICTED: collagen alpha-6(VI) chain-like [Macaca mulatta] Length = 2262 Score = 84.2 bits (206), Expect = 4e-14, Method: Composition-based stats. Identities = 42/335 (12%), Positives = 101/335 (30%), Gaps = 31/335 (9%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 IKK K+ R + QI + + + ++ +F Sbjct: 306 KAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRDSEDNVTKAAVNLRREGVTIFT 365 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNNMTS 176 G+ ++ T L ++ E+ L + ++ L + + Sbjct: 366 LGIKGASDTQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHTVSVFSERTET 425 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 K + + + +V A + V Sbjct: 426 LKSGCVDTEEADIYLLIDGSGSTQ------ATDFHEMKIFLSEVVGMFNIAPHK-----V 474 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESS 295 R+G + Y +N ++ + + NTNT A++ L K+ Sbjct: 475 RVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQLGGNTNTGAALNFTLSLLQKAKKQR 534 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N + ++ +T+G + + L+ +R +++Y++ + Q Sbjct: 535 GNK-----VPCHLVVLTNGM--------SKDSILEPANRLREEHIRVYAIGI--KEANQT 579 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 LR+ + + V+D L + +++ +I + Sbjct: 580 QLREIAGEEKRVYYVHDFDALKDIRNQVVQEICAE 614 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 63/164 (38%), Gaps = 15/164 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYR 286 + N VRIG ++ E+ ++ + NT+ A+ R Sbjct: 1030 DFDVSNKRVRIGAAQFSDAYRPEFPLGTFIGAKEISIQIENITQIFGNTHIGAAL----R 1085 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 ++ + + +T + ++ +TDG++ Q E +R G+ IYSV Sbjct: 1086 KVEHYFRPDMGSRINTGTPQVLLVLTDGQSQD--------EVAQAAEALRRRGIDIYSVG 1137 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + Q L++ T ++ + V++ EL + +I I Sbjct: 1138 IG-DVDDQQLMQ-ITGTAEKKLTVHNFDELKKVNKRIVRNICTT 1179 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 33/174 (18%), Positives = 61/174 (35%), Gaps = 15/174 (8%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TN 276 +++ ++KA + VR G + Y EV S L T Sbjct: 831 DFMIDIVRKA--DVGMNRVRFGALKYADDPEVLFYLDNFGTKPEVISVLQTDQAMGGNTY 888 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T A+ + + + + + + +I ITDG S + LN + +R Sbjct: 889 TAEALGFSDHMFTE----ARGSRLNKGVPQVLIVITDG----DSHDADKLNAT--AKALR 938 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ + +V + A +LL SS ++F V L +T + Sbjct: 939 DKGILVLAVGI-ADANPVELL-AMAGSSDKYFFVETFGGLKGIISDVTASVCNS 990 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + K Sbjct: 658 DRVQIGVVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTK 717 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 S ++KF+I ITDGE + +R G+ IYSV + Sbjct: 718 GSRP------SVRKFLILITDGEAQD--------IVKEPAVALRQEGVIIYSVGVFGSNV 763 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 764 ---TQLEEISGRPEMVFYVENFDILQRIEDDLVFGICSPR 800 >gi|294141918|ref|YP_003557896.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578715|dbj|BAG66042.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328387|dbj|BAJ03118.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 689 Score = 83.8 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 36/332 (10%), Positives = 90/332 (27%), Gaps = 33/332 (9%) Query: 67 QIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPS 126 ++ ++ R+ A I +I ++ + Y + L Sbjct: 154 AFREASSSDNFKRQTANGIMVAGEIPVSTFSIDTD------TGSYTTLRRWINQGRLPEK 207 Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 + + S + ++ + ++ + Sbjct: 208 GTVRVEEMINYFNYQYSTPSTVEQPFSVNTELAPSPY----NDHKMLLRIGLKGYEVDKS 263 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 + S + K+ +L S L + + + + Y Sbjct: 264 QLGASNLVFL-LDVSGSMNSRDKLPLLKTSLKMLSQQLSEQD--------HVSIVVYAGA 314 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 N++ + LN L +TN + AY Sbjct: 315 SGVVLDGVKGNDIYAINQALNNLKAGGSTNGGAGIQQAYGLAQKHFIQGGVNR------- 367 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-G 365 VI TDG+ + + + + + R+ G+ + ++ L+ + D G Sbjct: 368 -VILATDGDFNVGT--TDHQALMDLIASKRDQGIALTTLGFGQGNYNDHLMEQLADKGNG 424 Query: 366 QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + ++ L E+ + D++ + IA + Sbjct: 425 HYAYIDT---LNEARKVLVDELSSTLLTIAKD 453 >gi|254368552|ref|ZP_04984568.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] gi|157121455|gb|EDO65646.1| conserved hypothetical protein [Francisella tularensis subsp. holarctica FSC022] Length = 339 Score = 83.8 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 40/205 (19%), Positives = 75/205 (36%), Gaps = 47/205 (22%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 A + D+++ A +++ + R+G I + TPL+ ++ Sbjct: 118 ANGQMESRFDLVMRVANQFIDTRK---------GDRVGLILFGTRAYLQ--TPLTFDIAT 166 Query: 262 VKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 VK L+ + P T A+ A ++L S K +I +TDGEN+ Sbjct: 167 VKKMLDDASIALPGPQTAIGDAIGLAVKKLKKYPGDS----------KALILLTDGENNS 216 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCT 361 TL LQ E + +KIY++ + +L K Sbjct: 217 G-----TLQPLQAAEIAKQYHIKIYTIGLGGDQMIVETTFGQRLVNTSEDLDTTVLEKIA 271 Query: 362 D-SSGQFFAVNDSRELLESFDKITD 385 + G++F +S +L + ++ I Sbjct: 272 TMTGGKYFRAQNSSDLKKVYESIDK 296 >gi|332185631|ref|ZP_08387379.1| hypothetical protein SUS17_560 [Sphingomonas sp. S17] gi|332014609|gb|EGI56666.1| hypothetical protein SUS17_560 [Sphingomonas sp. S17] Length = 420 Score = 83.8 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 50/435 (11%), Positives = 116/435 (26%), Gaps = 64/435 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M A+ + V I +D A +++ + + + K+ D T Sbjct: 2 MFALALPVLTCSIGMGVDYARAAKAQSK--------LNAIADAAALLAVSKNAMRADDAT 53 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + F + + Q + + ++ Sbjct: 54 AAYFARSFFSLQSAALVKSDGITLSNVTVQAPTDGNGRRTAVVNYRATSE-------NVF 106 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH-NDNNNMTSNKY 179 ++ + +S +S + I M+LDVS SM N +++ Sbjct: 107 ARILGMSTLTISGKSETANAIA---PDIDFYMLLDVSASMALPTTSSGLNKVAQSNTSRC 163 Query: 180 LLPPPPKKSFWSKNTTKSKY-----APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 + + + K +ID ++ L + + K Sbjct: 164 VFACHTGEKRFRGYDAHGKQTDLYGVALSYGLPLRIDAEGDAVNQLTATARSMA-SKNGS 222 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----------------YENTNT 277 RI + + PL+N+L + L P + Sbjct: 223 DYRIAITTFRGARGFSVRQPLTNDLTAAGHKAANLKPPYYASIGCPTSACKSSEVGWNDR 282 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT---LQICEY 334 A ++ + + + V +TDG + S C+ Sbjct: 283 DTGSSDAMDQINAMIPQPGSGVNGQDPQAVVFMVTDGMRNEKSPKGARPEVAFDTAKCDM 342 Query: 335 MRNAGMKI---YSVAVSAPPEG---------------QDLLRKCTDSSGQFFAVNDSREL 376 +++ G++I Y+ + +G + L+ C G + V ++ Sbjct: 343 IKHRGIRIAVLYTEYLRDAVKGTTNLERSVEPYLYQVEPALQSCAS-PGLYTKVTTDGDI 401 Query: 377 LESFDKITDKIQEQS 391 + + + + Sbjct: 402 SAALNTLFQNAVATT 416 >gi|149918184|ref|ZP_01906676.1| hypothetical protein PPSIR1_11265 [Plesiocystis pacifica SIR-1] gi|149820944|gb|EDM80351.1| hypothetical protein PPSIR1_11265 [Plesiocystis pacifica SIR-1] Length = 522 Score = 83.8 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 41/334 (12%), Positives = 93/334 (27%), Gaps = 24/334 (7%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 E+ + + + + + + A Y L L Sbjct: 29 SSYQTHSSPLLEVGSEDYAPRRENPYFDAARVPLSTVSVDVD-TAAYSNVRRFLRDGHLP 87 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 PS + + A + + S + E ++ + Sbjct: 88 PSDAVRIEELINYFDYDYPQPAADAEGVAEPFSVTTEVGPCPWSDEARLVHIGLQGKS-I 146 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 P++ +N K+ +L +S LVN + + + Y Sbjct: 147 PERELPPRNLVFLLDVSGSMHDQDKLPLLTDSLRVLVNQLG--------ERDHVAIVVYA 198 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 ++ + + +++L +TN + AY + Sbjct: 199 GASGVVLPPTRGSDRGTILAAISELRAGGSTNGGEGIQKAYALAREHFDPQGINR----- 253 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 VI TDG+ + + ++ L + E R +G+ + + G + D Sbjct: 254 ---VILATDGDFNVGTTTESGL--ENLIERERESGVFLTVLGFGRGNLGDRTMEMLADKG 308 Query: 365 -GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 G + ++ L E+ + + V IA + Sbjct: 309 NGNYAYIDS---LAEARKVLGTEAGSTLVTIAKD 339 >gi|238020799|ref|ZP_04601225.1| hypothetical protein GCWU000324_00689 [Kingella oralis ATCC 51147] gi|237867779|gb|EEP68785.1| hypothetical protein GCWU000324_00689 [Kingella oralis ATCC 51147] Length = 554 Score = 83.8 bits (205), Expect = 4e-14, Method: Composition-based stats. Identities = 38/354 (10%), Positives = 95/354 (26%), Gaps = 30/354 (8%) Query: 47 DRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY-IA 105 +R+ + + + + +N + + NP+ Sbjct: 48 ERSNAAAVKEYAGAPAYAALEKTARIAPAPSLPQNTEKYGKIESNPVQAVARNPVSTFSI 107 Query: 106 ES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY 164 + Y L L P+ + + V + Sbjct: 108 DVDTGSYANARRFLNDNRLPPAHAVRIEELINYFDYGYAPPSDGKPFAVYTETIDSPWQA 167 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 K PP + + + S A K+ ++ ++ L + Sbjct: 168 DAKLIKIAIKAKEIRSSALPPANLVFLVDVSGSMQA------QDKLPLVKKTLRILTKRL 221 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 + ++ I Y + + + +N+L +T A+ A Sbjct: 222 RAED--------KVTLITYASNEKLVLPPTSGKDKDTILQAINQLEAGGSTAGEQALQMA 273 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 Y + + ++ TDG+ + NTL R AG+ + + Sbjct: 274 YAQAQKAYIKNGINR--------ILLATDGDFNVGITDFNTL--KDTVAEKRKAGISLTT 323 Query: 345 VAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + L+ + D+ G + +++ E + ++ +A + Sbjct: 324 LGFGTGNYNEQLMEQLADAGDGNYSYIDNETEAKKVLQ---RQLSSTLATVAQD 374 >gi|160889563|ref|ZP_02070566.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|317480055|ref|ZP_07939167.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] gi|156861080|gb|EDO54511.1| hypothetical protein BACUNI_01987 [Bacteroides uniformis ATCC 8492] gi|316903797|gb|EFV25639.1| von Willebrand factor type A domain-containing protein [Bacteroides sp. 4_1_36] Length = 327 Score = 83.8 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 64/205 (31%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + + L + + T Sbjct: 108 NRLEAAKDVAAEFINGRPNDNIGITLFAGESFTQCPLTVDHAVLLNLLKDMKCGLIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ L E Sbjct: 168 AIGMGIANAVTRL----------KDSKAKSKVIILLTDGVNNKGDIS-----PLTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP---------------------EGQDLLRKCTDSSGQFFAVNDSR 374 ++ G+++Y++ V + + L + + G +F + Sbjct: 213 KSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTEGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|225024147|ref|ZP_03713339.1| hypothetical protein EIKCOROL_01015 [Eikenella corrodens ATCC 23834] gi|224943172|gb|EEG24381.1| hypothetical protein EIKCOROL_01015 [Eikenella corrodens ATCC 23834] Length = 573 Score = 83.8 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 40/383 (10%), Positives = 98/383 (25%), Gaps = 39/383 (10%) Query: 18 DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY 77 D R +++ AAV S + + Sbjct: 50 DYYESRNARFSLRAMPQAAVKSAA-------PADLSANAAPMAEAAVGSVATRQMAPPRQ 102 Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 E G + + + ++ + I L P+ + Sbjct: 103 NTERYGHYEPNPVHAVAEQPVSTFSIDVDTGSYANIRRFLTQTGRLPPADAVRIEEIINY 162 Query: 138 IIERSSENLAISICMV--LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 ++ V V ++ LPP Sbjct: 163 FDYGYAKPTDGKPFAVHTETVDSPFRSGAKLIRIGIQAKEVSQAALPPANLVFLVD---- 218 Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + K+ ++ + L + + RI + Y G Sbjct: 219 ----VSGSMYSRDKLPMVKYTLCTLAHQTRAQD--------RITLVTYADGNKVVLPPTP 266 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N ++ + L+ L +T A+ + + ++ TDG+ Sbjct: 267 GNQRQKILAALDSLKAGGSTAGENAIQ--------QAYQAAQRAYIRNGINRILLATDGD 318 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSR 374 + NTL + + R +G+ + ++ + + L+ + D+ G + ++ Sbjct: 319 FNVGITDFNTLRS--MVAEKRKSGISLTTLGFGSGNYNERLMEQLADAGDGNYSYIDSPE 376 Query: 375 ELLESFDKITDKIQEQSVRIAPN 397 E + + ++ +A + Sbjct: 377 EAQKV---LHRQLSSTLATVAQD 396 >gi|87306401|ref|ZP_01088548.1| hypothetical protein DSM3645_08717 [Blastopirellula marina DSM 3645] gi|87290580|gb|EAQ82467.1| hypothetical protein DSM3645_08717 [Blastopirellula marina DSM 3645] Length = 578 Score = 83.8 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 38/334 (11%), Positives = 91/334 (27%), Gaps = 22/334 (6%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+++ V F+ ++D+ ++ +++Q+Q ++D+A L+G +++ + T + T Sbjct: 26 LAAVLMIVMMGFMALSVDVGYMFTMQSQLQRSVDSAALAGAGTLIEGEDVATGTVHEYLT 85 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA--------ESKAQYE 112 Q K+ + + + + +T + N + Sbjct: 86 HNPVGLQWKEFTEGNTADNVDKFLTKYGDGLQLTIGEWNDTSGQVVAAEKNPTTVSVRMT 145 Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 F L+ +++ S + I +VLD+S SM D Sbjct: 146 YENMPFFFGHLLGRDSFDITAESIATY------QSRDIMLVLDLSGSMNDDSEFNSIGKL 199 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKS---KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 +YA P + S NS+ Sbjct: 200 GFDHIYSNSQQMYADLGSPIFGNLQFDPQYAVVNGPTPQSSGQAKSSVTYRGNSVVVKSD 259 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + + + N N+ + + + Sbjct: 260 KTIKQISVKTSNGSTYNYYPGSSLNYTANPNKEIRYV--WVTSGKNSNNSD---QVQSFD 314 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + + + + L G + Y Sbjct: 315 FDGQRINTIKTALGLDNLAYPYPGGSWNDYVNYC 348 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 42/373 (11%), Positives = 86/373 (23%), Gaps = 63/373 (16%) Query: 67 QIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPS 126 G++ Q + A+S Y + + I Sbjct: 205 YSNSQQMYADLGSPIFGNLQFDPQYAVVNGPTPQSSGQAKSSVTYRGNSVVVKSDKTIKQ 264 Query: 127 ALTNLS---------LRSTGIIERSSENLAISICMVLDVSRS----MEDLYLQKHNDNNN 173 S S ++ + S + + + + Sbjct: 265 ISVKTSNGSTYNYYPGSSLNYTANPNKEIRYVWVTSGKNSNNSDQVQSFDFDGQRINTIK 324 Query: 174 MTSN-KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 L P P S+ R L+ + Sbjct: 325 TALGLDNLAYPYPGGSWNDYVNYCLGTGQNNNAGYRYRFGYFNWINYLLERQYSSNSTPD 384 Query: 233 ------------NLSVRIGTIAYNIGIVGNQC---------------TPLSNNLNEVKSR 265 SV + G ++ + L+ NL + ++ Sbjct: 385 LWKASAQPITAVKNSVDLFIHFMQEGDGRDRIGLAVYNAPNGDGLLESTLTENLPFIMTQ 444 Query: 266 LNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + TN M EL G K ++ +TDG+ + + Sbjct: 445 SRQRQAGHYHNYTNIGGGMTVGREELQT--------RGRKGAVKMMVLLTDGQANWVNGG 496 Query: 323 QNTLNTLQ-ICEYM---RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV-------N 371 N + + G I ++++ A + + + + G F V Sbjct: 497 VNNNAAKNYVLNEAYLCADQGFTIITISLGAGADKALMDQVAEITGGVHFNVPGGQTVDE 556 Query: 372 DSRELLESFDKIT 384 S +L E F ++ Sbjct: 557 YSEDLTEIFRQVA 569 >gi|90418447|ref|ZP_01226359.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] gi|90338119|gb|EAS51770.1| conserved hypothetical protein [Aurantimonas manganoxydans SI85-9A1] Length = 636 Score = 83.8 bits (205), Expect = 5e-14, Method: Composition-based stats. Identities = 36/290 (12%), Positives = 70/290 (24%), Gaps = 76/290 (26%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 P S + N + +P A + + A + Sbjct: 350 TLFVPTFAPDEYDDSDYGWNDYLDSGSGSPGSAKEAMAEQAKVAKYFDSGYSITTPSSNR 409 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 N TPL+ L V ++ + TN + +R L Sbjct: 410 SD-----WGPNSTCATTPITPLTKTLKTVTDAIDVMGAQGATNIPHGLAWGWRLLTARPP 464 Query: 294 -SSHNTIGSTRLKKFVIFITDGENSGA--------------------------------- 319 + + K ++ +TDG N+ Sbjct: 465 FTEGRSHDEPDNLKVLVLMTDGNNTYNLNSGGRPLEIRDYNRSTYGSYGYGAAYSHGSSS 524 Query: 320 -----SAYQNTLNTLQ----------------ICEYMRN--------AGMKIYSVAVSAP 350 T N +CE ++ G+ I+++A Sbjct: 525 RKPGRIYDGTTGNAKDYSVDSYVAAMDQNVAKVCENVKADGRKPGGTDGILIFTIAFDLR 584 Query: 351 PEG--QDLLRKCTDSS------GQFFAVNDSRELLESFDKITDKIQEQSV 392 + L+ C + ++ EL +F IT++I + Sbjct: 585 DGEPVKKLMEDCASNGLIDASEKLYYDAQSQEELAAAFQSITEQISSLRI 634 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 42/351 (11%), Positives = 84/351 (23%), Gaps = 66/351 (18%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 + + V L A+DL+ I +Q + ++ Sbjct: 38 FGLTLPVLALCFATAVDLSGIYGANRSLQQ---------------AADVAALAAGREYGR 82 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN-LFL 120 T + + + + + Q + I + A+ ++PT L Sbjct: 83 TQDADYL-SSVSEAFFFHNAGDETRGTTQFSYDGVFREDGLTILKVTARRQLPTFFGDAL 141 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + L E +N +I + +VLD S SM+D Sbjct: 142 MWVTGGKLDWRQFPLYAKSEIVVQNRSIELALVLDNSGSMQDRPR--------------- 186 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + KID++ ++A +L + + Sbjct: 187 ----------------------SGGSKSKIDIIKDAAEDLAKQFLSSDKGSTEEFPVQFA 224 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRL----NKLNPYENT--NTYPAMHHAYRELYNEKES 294 + V + + + + R L+ T + Sbjct: 225 VVPFSSSVNVGPQYKNADWMDTQGRSPIHHENLDWGGWLSGATSGGWEWIRDRGWVYTAP 284 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI------CEYMRNAG 339 S + + IT GE + C R G Sbjct: 285 SSGAPMARYNGSYWTRITTGEPLTRFYVYDNARYKSQFGTWRGCVEARPNG 335 >gi|119720657|ref|YP_921152.1| von Willebrand factor, type A [Thermofilum pendens Hrk 5] gi|119525777|gb|ABL79149.1| von Willebrand factor, type A [Thermofilum pendens Hrk 5] Length = 327 Score = 83.4 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 62/205 (30%), Gaps = 26/205 (12%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S P KI+V +A LV + + Sbjct: 103 VLVVDVSGSMEDSIPGGVKIEVARRAATLLVERMPGGVDVG----------LLAFSDRIV 152 Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P + + V + L P T + A L V+F Sbjct: 153 LSLPPTGDRRRVLDAIESLKPGGGTMYTYPLQAALSWLKP--------YKLFNASTLVVF 204 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCT-DSSGQFF 368 ++DG + + R+ G+ +Y+V + + G+ L+ + G+ + Sbjct: 205 VSDGLP------ADAATYRTLLSEFRSLGIPVYTVYIGPGGDEGERELKLIAGSTGGEEY 258 Query: 369 AVNDSRELLESFDKITDKIQEQSVR 393 + ELL++F + +K VR Sbjct: 259 TAGSAEELLKAFKTLAEKASSILVR 283 >gi|307565272|ref|ZP_07627765.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307345941|gb|EFN91285.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 318 Score = 83.4 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 67/196 (34%), Gaps = 31/196 (15%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY- 272 + N + ++ + + PL+ + + + LN + Sbjct: 103 NDVIPNRLEVAKEVASDFISGRPNDNIGLTIFAGEAFTQCPLTTDHASLINLLNSVRTDL 162 Query: 273 -------ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + T + +A L S K VI +TDG N+ S Sbjct: 163 VVKGLIQDGTAIGMGLINAVGRL----------KSSKAKSKVVILLTDGSNNVGSIS--- 209 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLE 378 + E + +++Y++ + LR+ + ++G+F++ EL + Sbjct: 210 --PMTAAEIAKKFNIRVYTIGLGTEQNNGYSDIDYTTLRQIANVTNGKFYSAQSQTELSQ 267 Query: 379 SFDKITDKIQEQSVRI 394 + I +K+++ ++I Sbjct: 268 IYKDI-NKLEKTKLKI 282 >gi|208780564|ref|ZP_03247903.1| von Willebrand factor type A domain protein [Francisella novicida FTG] gi|208743539|gb|EDZ89844.1| von Willebrand factor type A domain protein [Francisella novicida FTG] Length = 333 Score = 83.4 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 38/202 (18%), Positives = 73/202 (36%), Gaps = 43/202 (21%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSV-----RIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 I + ++ G + + ++ R+G I + TPL+ ++ VK Sbjct: 106 IQDMKKANGQMESRFDLVMRVANQFLDTRKGDRVGLILFGTRAYLQ--TPLTFDIATVKK 163 Query: 265 RLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 L+ + P T A+ A ++L K +I +TDGEN+ Sbjct: 164 MLDDASIALPGPQTAIGDAIGLAVKKL----------KKFPGDSKALILLTDGENNSG-- 211 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVS-----------------APPEGQDLLRKCTD-S 363 TL LQ E + +KIY++ + + +L K + Sbjct: 212 ---TLQPLQAAEIAKQYHIKIYTIGLGGGQMIVETTFGQRLVNTSEDLDTTVLEKIATMT 268 Query: 364 SGQFFAVNDSRELLESFDKITD 385 G++F +S +L + ++ I Sbjct: 269 GGKYFRAQNSSDLKKVYESIDK 290 >gi|192359934|ref|YP_001981670.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686099|gb|ACE83777.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 674 Score = 83.4 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 35/389 (8%), Positives = 90/389 (23%), Gaps = 31/389 (7%) Query: 10 FLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIK 69 + +L+ +++ ++ D + + P + Sbjct: 140 IVVTGMRAELSQAEERQHKAKAIADR-----QRRMAEAQMAAKPMAAAPTAAVHADAYAP 194 Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES-KAQYEIPTENLFLKGLIPSAL 128 + Q + I N + + A Y L L P Sbjct: 195 ADILQATTREYRDRFNQVDDNPVIATRDNPFSTFSIDVDTAAYSFTRRLLNQGQLPPKDA 254 Query: 129 TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 + + +++ + L +S Sbjct: 255 VRIEEMVNYFDYSYPLPSSAQTPFTTNITVLDSPWKPGNKL-LHIGIQGYQLPAGHIPQS 313 Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 K+ ++ +S L+++++ + + Y Sbjct: 314 NLV----FLLDVSGSMDEPSKLPLVKQSMELLLSTLKPEDT--------VAIVVYAGAAG 361 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 +++ + L+ L +T + AY+ + Sbjct: 362 TVLEPTKVREKSKILAALHNLQAGGSTAGGEGLALAYQLAEANFNPKGVNR--------I 413 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 I TDG+ + L E R G+ + + L++ + Sbjct: 414 ILATDGDFNVGQTGDEPL--QDFVERKRAKGIYLSVLGFGQGNYQDALMQTLAQNGNG-- 469 Query: 369 AVNDSRELLESFDKITDKIQEQSVRIAPN 397 L E+ + ++ IA + Sbjct: 470 TAAYIDTLSEAQKVLVNEATSTLFPIARD 498 >gi|153812017|ref|ZP_01964685.1| hypothetical protein RUMOBE_02410 [Ruminococcus obeum ATCC 29174] gi|149831916|gb|EDM87002.1| hypothetical protein RUMOBE_02410 [Ruminococcus obeum ATCC 29174] Length = 2099 Score = 83.4 bits (204), Expect = 5e-14, Method: Composition-based stats. Identities = 46/368 (12%), Positives = 107/368 (29%), Gaps = 31/368 (8%) Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY-IAESKA 109 + + T + ++ + + I T + A Sbjct: 1192 ETGSQVNGYTVETTQTVSGGDVQSDGKSTKIGEKDSATFTITNTYTPIDINSVIEYNKTA 1251 Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 + + L S+ T S+++ I + ++ ++ + Sbjct: 1252 TLLDWNQRTYKIDLTASSKTTQSMKTPYDIVLVLDQSGSMSQKFVEYNKINGSSMFWRKT 1311 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 + Y + WS + S P + V +S +++++ A Sbjct: 1312 YYIKTQNGIYQQLSWSWDNTWSYTDSYSGKTVTVDPNTTDVYVAQKSNQTKIDALKSAAT 1371 Query: 230 E------KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE-------VKSRLNKLNPYENTN 276 KN R+G + ++ T S L + + + ++ L +T Sbjct: 1372 TFVNNVANKNSDCRVGIVTFSNDGYIKPITNNSYTLAKVGTSKGDIINTIDGLKTGGDTY 1431 Query: 277 TYPAMHHAYRELYNEKESSHNT-IGSTRLKKFVIFITDGEN---SGASAYQNTLNT-LQI 331 + A +S T + KK V+F+TDG + + +N Sbjct: 1432 PAKGLDKANEIFSENSSNSWETVEQTDGRKKMVVFLTDGVPAPANTNNFDENLAGAGTNS 1491 Query: 332 CEYMRNAGMKIYSVA-VSAPPEG-----------QDLLRKCTDSSGQFFAVNDSRELLES 379 + + + G+ Y++ A ++ S ++ + L Sbjct: 1492 AKILHDQGVATYALGIFGAANSDGTMDNASVQRIDKYMQSIASSHEKYMTADSVDNLSSL 1551 Query: 380 FDKITDKI 387 F+ IT+ I Sbjct: 1552 FESITNNI 1559 >gi|256419476|ref|YP_003120129.1| hypothetical protein Cpin_0430 [Chitinophaga pinensis DSM 2588] gi|256034384|gb|ACU57928.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588] Length = 336 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 26/205 (12%), Positives = 72/205 (35%), Gaps = 41/205 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 + + + ++ + + P++ + +K+++ ++ + T Sbjct: 116 DRLEAAKRVAMNFVDSRISDRIGLVIFSGESFTQCPITTDHGVLKNQIAQVKSGMLQDGT 175 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + + L S K +I +TDG N+ ++ L E Sbjct: 176 AIGMGLATSVERL----------RTSKAKSKVIILLTDGVNNTGL-----IDPLTALEIA 220 Query: 336 RNAGMKIYSVAVSA-----------PPE----------GQDLLRKCT-DSSGQFFAVNDS 373 + +++Y++ V + L++K + ++ G++F + Sbjct: 221 KAFKIRVYTIGVGTIGKAPFPMTMPDGSIQMQMQDVQLDEPLMKKISVETGGKYFRATSN 280 Query: 374 RELLESFDKITDKIQEQSVRIAPNR 398 +EL + +I DK+++ V I + Sbjct: 281 KELENIYGEI-DKLEKTKVEITSYK 304 >gi|19031201|gb|AAL17974.1| proximal thread matrix protein 1 [Mytilus galloprovincialis] Length = 453 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 26/236 (11%), Positives = 72/236 (30%), Gaps = 18/236 (7%) Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + + + + + + A ++ Sbjct: 218 NYVLTTNSFSELSTLLKLVIDLACEVCVVDCAGHADIAFVFDASSSIN-ANNPNNYQLMK 276 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-- 272 ++V+ K + +V + + ++ ++K ++K++P Sbjct: 277 NFMKDIVDRFNKTGPDGTQFAV----VTFADRATKQFGLKDYSSKADIKGAIDKVSPSII 332 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T + +A E++ + ++K VI +TDG+N+G + ++ Sbjct: 333 GQTAIGDGLENARLEVFPNRNGGGR----EEVQKVVILLTDGQNNGHKSPEHES------ 382 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 +R G+ I ++ V + L S F + +L + + + Sbjct: 383 SLLRKEGVVIVAIGVGTG-FLKSELINIASSEEYVFTTSSFDKLSKIMEDVVKLAC 437 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 26/216 (12%), Positives = 60/216 (27%), Gaps = 14/216 (6%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + ++ + LV+S +N S Sbjct: 39 PGNTGKDAEECDVQADIIVLFDDSSSIQYDNKENYQMMKDFVKELVDSFTTVGVNGRNGS 98 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNEKE 293 + G + ++ G+ ++K + + P T + H ++ E Sbjct: 99 -QFGVVQFSQGVKTAFPLNKFKTKEDIKKGIQDMVPRNGGQTEIGTGLKHVRENSFSGAE 157 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 G+ +K VI +TDG+++ ++ G+ + ++ + Sbjct: 158 GG----GNPDKQKIVILMTDGKSNAG------APPQHEAHKLKAEGVTVIAIGIGQG-FV 206 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + L + N EL + D E Sbjct: 207 KTELEQIATMKNYVLTTNSFSELSTLLKLVIDLACE 242 >gi|218678237|ref|ZP_03526134.1| hypothetical protein RetlC8_04927 [Rhizobium etli CIAT 894] Length = 120 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 30/109 (27%), Positives = 57/109 (52%), Gaps = 2/109 (1%) Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT--LQICEYMRNAGMKIYSVA 346 + ++++H KK+++F+TDG+N+ S+ + +T + C+ ++ G++IY++A Sbjct: 11 NDAEDAAHKLKTGQIPKKYIVFMTDGDNNNDSSGGRSYDTATKKTCDDAKSKGIEIYTIA 70 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 AP GQ LL C +F +LL +F+ I K Q R+ Sbjct: 71 FMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFEAIGAKSAAQVTRLT 119 >gi|326672754|ref|XP_002664126.2| PREDICTED: collagen alpha-1(XXI) chain-like [Danio rerio] Length = 572 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 65/176 (36%), Gaps = 19/176 (10%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHA 284 ++G + Y+ + + +++ + + TNT A+ A Sbjct: 68 TMSFNIGQKFTQVGVVQYSDDPFLHIPLGKHFSSSDLIKAMESIEYMGGNTNTGRAIKFA 127 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 +L+ E N + K + +TDG++ L E R G+ +++ Sbjct: 128 NDKLFALSERGPN-----GIAKIAVVLTDGKSQD--------EVLAAAEAARKKGIILFA 174 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + + LR + F+V D + + + + I K+ E++V P + Sbjct: 175 IGVGSE-TEEAQLRAIANKPSSTYVFSVKDYKAIAKIREVIRQKLCEETV--CPAK 227 >gi|218462279|ref|ZP_03502370.1| hypothetical protein RetlK5_23628 [Rhizobium etli Kim 5] Length = 347 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 43/325 (13%), Positives = 100/325 (30%), Gaps = 45/325 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQM----------QSALDAAVLSGCASIVSDRTI 50 MTA+++ A+D AH + +R Q+ A + ++ ++ S+ T+ Sbjct: 19 MTALLMVPLLGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVAAAMAMNSNGTV 78 Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ 110 K IF Q+ L ++ I++TK N + Sbjct: 79 SLG---KTDARNIFMSQMSGELA----------EVQVDLGIDVTKTANKLNSQV------ 119 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 T ++ +S +T + ++LD + SM Sbjct: 120 SFTATVPTTFMQILGRDSITISGTATA---EYQTAAFMDFYILLDNTPSMGVGATPDDVS 176 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + K+ + KS +IDV+ ++ L ++ + Sbjct: 177 KLEAKAGCAFACHQMDKTINNYTIAKS------LGVAMRIDVVRQATQALTDTAKTERVS 230 Query: 231 KKNLSVRIGTIAYNIGIVG-NQCTPLSNNLNEVKSRLNKLNPY------ENTNTYPAMHH 283 + + T + L+++L +VK+ + ++ N + Sbjct: 231 SDQFRMGVYTFGTKAEDAKLTTISGLTSDLTKVKNYTDAVDLMTIPYQNYNNDQITNFDS 290 Query: 284 AYRELYNEKESSHNTIGSTRLKKFV 308 A ++ + + + + +K Sbjct: 291 AMTQMNTIIDQAGDGTSNISAEKSC 315 >gi|59713864|ref|YP_206639.1| hypothetical protein VF_A0681 [Vibrio fischeri ES114] gi|59482112|gb|AAW87751.1| hypothetical membrane spanning protein [Vibrio fischeri ES114] Length = 321 Score = 83.4 bits (204), Expect = 6e-14, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 71/204 (34%), Gaps = 47/204 (23%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + ++ + + + ++ + R+G + + TPL+ + N Sbjct: 102 TSNGDFVDRLTAVKQVVSDFIDQRK---------GDRLGLVLFGDHAYLQ--TPLTFDRN 150 Query: 261 EVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V+ +L++ LN T + A + ++ +I ++DG N+ Sbjct: 151 TVREQLDRTVLNLVGQRTAIGEGLGLATKTFIESN----------APQRTIILLSDGANT 200 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKC 360 L L+ + ++ KIY+V + A +D L K Sbjct: 201 AGV-----LEPLEAAQLAKDNHAKIYTVGIGAGEMQVRGFFGKQTVNTARDLDEDTLTKI 255 Query: 361 TD-SSGQFFAVNDSRELLESFDKI 383 + GQ+F ++ EL E + I Sbjct: 256 ATMTGGQYFRARNADELAEIYQTI 279 >gi|91216721|ref|ZP_01253686.1| batA protein [Psychroflexus torquis ATCC 700755] gi|91185190|gb|EAS71568.1| batA protein [Psychroflexus torquis ATCC 700755] Length = 334 Score = 83.0 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 60/176 (34%), Gaps = 44/176 (25%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYN 290 + RIG + Y TPL+ + + + + +N L T + + +L Sbjct: 130 NDRIGLVIYAGESYTK--TPLTTDKSIIFNAINDLEYSQNIEGGTAIGMGLATSVNKL-- 185 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 S K +I +TDGEN+ ++ + +K Y++ V + Sbjct: 186 --------KDSKAESKVIILLTDGENNAG-----FIDPKTATQLATEYDIKTYTIGVGSN 232 Query: 351 P----------------------EGQDLLRKCT-DSSGQFFAVNDSRELLESFDKI 383 + LL+ + G++F D+++ +++I Sbjct: 233 GMALSPVGIKANGQFEYRNIEVKIDEALLKTIAESNGGKYFRATDNQKFEAIYEEI 288 >gi|270296687|ref|ZP_06202886.1| conserved hypothetical protein [Bacteroides sp. D20] gi|270272674|gb|EFA18537.1| conserved hypothetical protein [Bacteroides sp. D20] Length = 327 Score = 83.0 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 64/205 (31%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + + L + + T Sbjct: 108 NRLEAAKDVAAEFINGRPNDNIGITLFAGESFTQCPLTVDHAVLLNLLKDMKCGLIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ L E Sbjct: 168 AIGMGIANAVTRL----------KDSKAKSKVIILLTDGVNNKGDIS-----PLTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP---------------------EGQDLLRKCTDSSGQFFAVNDSR 374 ++ G+++Y++ V + + L + + G +F + Sbjct: 213 KSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTEGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|300776751|ref|ZP_07086609.1| aerotolerance protein BatA [Chryseobacterium gleum ATCC 35910] gi|300502261|gb|EFK33401.1| aerotolerance protein BatA [Chryseobacterium gleum ATCC 35910] Length = 330 Score = 83.0 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 71/211 (33%), Gaps = 41/211 (19%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + + + + +++ + P++++ V + LN Sbjct: 101 LAKDLNPDRITALKDIAVKFVQKRPNDRIGVVAYAAEAFTKVPVTSDHQVVIDEIKNLNS 160 Query: 272 YE---NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T + A L K S K VI +TDG ++ QN + Sbjct: 161 AGLEPGTAIGEGLSVAVNHLVKSKAKS----------KVVILMTDGVSN----IQNAIPP 206 Query: 329 LQICEYMRNAGMKIYSVAVSAPPE----------------------GQDLLRKCT-DSSG 365 E +N +K+Y++ + ++ LR+ + G Sbjct: 207 QVAAELAKNNNIKVYAIGIGTNGYALMPTSQDIFGDLVFTETEVTIDENTLREIAQTTGG 266 Query: 366 QFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 ++F + L E +D+I +++++ V+++ Sbjct: 267 KYFRATSNSSLEEVYDEI-NQLEKSDVKVSK 296 >gi|237737388|ref|ZP_04567869.1| BatA protein [Fusobacterium mortiferum ATCC 9817] gi|229421250|gb|EEO36297.1| BatA protein [Fusobacterium mortiferum ATCC 9817] Length = 319 Score = 83.0 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 32/180 (17%), Positives = 63/180 (35%), Gaps = 40/180 (22%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKS-----RLNKLNPYENTNTYPAMHHAYRELY 289 + RI + + PL+ + N VK + + T + + L Sbjct: 120 NDRISLVVFGGDAYTK--VPLTFDHNVVKDITSKLTTDDITSNNRTAIGMGLGVSLNRL- 176 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 S K +I +TDGEN+ + E + G+KIY++ + A Sbjct: 177 ---------KDSEAKSKVIILMTDGENNSGEMS-----PMGASEIAKELGIKIYTIGIGA 222 Query: 350 ----------------PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 ++LL+ + G++F +E E F++I D +++ + Sbjct: 223 REIQIRVPFGHTTVKNTELDENLLKNIASTTGGEYFRAGSEKEFQEIFNRI-DSLEKTKI 281 >gi|305665951|ref|YP_003862238.1| BatA protein [Maribacter sp. HTCC2170] gi|88710726|gb|EAR02958.1| batA protein [Maribacter sp. HTCC2170] Length = 332 Score = 83.0 bits (203), Expect = 7e-14, Method: Composition-based stats. Identities = 25/193 (12%), Positives = 61/193 (31%), Gaps = 41/193 (21%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N ++++++ + TP++++ + V + L ++ + T Sbjct: 111 NRLSALKEVAADFIRQRPNDRIGLVAYAGEAFTKTPITSDKSIVLNSLREITYGQLNDGT 170 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + + L S + K +I +TDG N+ + + Sbjct: 171 AIGMGLATSVNRL----------KESKAISKIIILLTDGVNNSG-----FIEPQTAADLA 215 Query: 336 RNAGMKIYSVAVSAPPE----------------------GQDLLRKCT-DSSGQFFAVND 372 G+K Y++ + + LL + G++F D Sbjct: 216 VEYGIKSYTIGLGTNGNALSPIAYNADGSYRYGMRQVEIDEKLLEGIAETTGGKYFRATD 275 Query: 373 SRELLESFDKITD 385 + +L +D+I Sbjct: 276 NEKLEAIYDEINK 288 >gi|296228118|ref|XP_002759733.1| PREDICTED: collagen alpha-6(VI) chain [Callithrix jacchus] Length = 2267 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 40/335 (11%), Positives = 99/335 (29%), Gaps = 31/335 (9%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + IKK K+ R + QI + + ++ +F Sbjct: 304 NAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRASEDNVTKAAVNLRREGVAIFT 363 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNNMTS 176 G+ + + L ++ E+ L + ++ L + + Sbjct: 364 LGIEGARDSQLEKIASHPAEQYVSKLKTFADLAAHNQTFLKKLRNQITHTVSVFSERTET 423 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 K + + + +V A + V Sbjct: 424 LKSGCVDTEEADIYLLIDGSGSTQ------ATDFHEMKSFLSEVVGMFNIAPHK-----V 472 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESS 295 R+G + Y +N ++ + + TNT A++ L K+ Sbjct: 473 RVGAVQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKQQR 532 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N + ++ +T+G + + L+ +R +++Y++ V + Sbjct: 533 GNK-----VPCHLVVLTNGM--------SKDSILEPANRLREEHIRVYAIGV--KEANKT 577 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 LR+ + + V+D L + +++ +I + Sbjct: 578 QLREIAGEEKRVYYVHDFDALKDIRNQVVQEICAE 612 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 60/164 (36%), Gaps = 15/164 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYR 286 + VRIG ++ E+ ++ + NT+ A+ R Sbjct: 1028 DFDVSLNRVRIGAAQFSHNYRQEFPLGTFIGEKEISFQIENIQQLGGNTHIGDAL----R 1083 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 ++ + + +T + ++ +TDG++ Q E +R+ G+ IYSV Sbjct: 1084 QVGHYFRPDMGSRINTGTPQVLLVLTDGQSQD--------EVAQAAEALRHRGIDIYSVG 1135 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L + T ++ + V++ EL + +I I Sbjct: 1136 IG--DVDDQQLIQITGAAEKKLTVHNFDELKKVKKRIVRNICTT 1177 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 58/162 (35%), Gaps = 13/162 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR G + Y + EV S L P T T A+ + Sbjct: 839 DVGKNRVRFGALKYADDPEVLFYLGDFDTKLEVISVLQNDQPMGGNTYTSEALGFSDHMF 898 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + + +I ITDGE S + LN + +R+ G+ + +V + Sbjct: 899 TE----AQGSRLNKGVPQVLIVITDGE----SHDADKLNAT--AKALRDKGILVLAVGI- 947 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 A +LL SS ++F V L F +T + Sbjct: 948 AGANPVELL-AMAGSSDKYFFVETFGGLKGIFSDVTASVCNS 988 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 54/159 (33%), Gaps = 19/159 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + K Sbjct: 656 NRVQIGVVQFSGVNKEEFQLNRFMSQSDISNAIDQMVHIGETTLTGSALSFVSQYFSPTK 715 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE + +R G+ IYSV + Sbjct: 716 GARP------NVRKFLILITDGEAQD--------VVKEPAVALRQEGIIIYSVGVFGSNV 761 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L + + F V + L D + I Sbjct: 762 ---TQLEEISGRPEMVFYVENFDILQHIEDDLVFGICSP 797 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNE 291 R+ Y+ + N + + L N + A+ A+R ++ Sbjct: 61 DKYRVALAQYSDKLHSEFHLSTFKGRNPMLNHLKKNFQFLGGSLQIGKALQEAHRTYFSA 120 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + V+ + ++ + + + ++ G+KI ++V Sbjct: 121 PTNGRDRK--QFPPILVVLAS---------AESEDDVEEASKALQKDGVKI--ISVGVQD 167 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ L+ S F + R+L F + +I + + Sbjct: 168 ASEENLKAMATSH-FHFNLRTVRDLST-FSQNMTQIIKDVTK 207 >gi|313203640|ref|YP_004042297.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442956|gb|ADQ79312.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 327 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 40/170 (23%) Query: 249 GNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 PL+ + + + +N + T + +A + Sbjct: 138 SFTQCPLTTDHAVLVNLFNGVNNGMIEDGTAIGLGLANAVNRI----------KDGKSKS 187 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---------------- 349 K +I +TDG N+ + + E + G++IY++ V Sbjct: 188 KVIILLTDGSNNSG-----DIAPITAAEIAKTFGIRIYTIGVGTHGVINIPVSTPMGIQY 242 Query: 350 ----PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 L + + G++F D+ +L + +I DK+++ ++I Sbjct: 243 QRVQSEFDAKSLENIANLTGGKYFGATDNSKLRNIYQEI-DKLEKTRIKI 291 >gi|255693880|ref|ZP_05417555.1| BatA protein [Bacteroides finegoldii DSM 17565] gi|260620309|gb|EEX43180.1| BatA protein [Bacteroides finegoldii DSM 17565] Length = 327 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 27/205 (13%), Positives = 64/205 (31%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + ++ + + T Sbjct: 108 NRLEAAKDVAAEFINGRPNDNIGITLFAGESFTQCPLTVDHAVLLDMIHNIKCGLIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ + E Sbjct: 168 AVGMGIANAVTRL----------KDSKAKSKVIILLTDGTNNKGDIS-----PMTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP---------------------EGQDLLRKCTDSSGQFFAVNDSR 374 ++ G+++Y++ V + + L + + G +F + Sbjct: 213 KSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMPVEIDEKTLTQIAGTTDGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|226314068|ref|YP_002773964.1| hypothetical protein BBR47_44830 [Brevibacillus brevis NBRC 100599] gi|226097018|dbj|BAH45460.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 677 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 72/205 (35%), Gaps = 16/205 (7%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL--- 255 + + + K D + + I + RIG +AYN IV Q Sbjct: 49 FVVDTSNSMNKTDPGKTA----AEVMSMFIDMSEATRTRIGFVAYNDRIVQAQSPASMAE 104 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N ++K + L ++ + + K+ + TD Sbjct: 105 ARNREQLKRTIQGLRYSGYSDLGLGLRRGAEMIEKAKDPARKPFLILLSDG----GTDLR 160 Query: 316 NSGA--SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTD-SSGQFFAVN 371 + S + + + + G IY++ ++ ++ L+K + + G F Sbjct: 161 QNAGGRSVAASNKDVETVISKAKAQGYPIYTIGLNNDGSVQKEQLKKIAEATGGTSFVTQ 220 Query: 372 DSRELLESFDKITDK-IQEQSVRIA 395 + +L E F++I K IQ Q V +A Sbjct: 221 STDDLPEIFNQIFAKHIQSQLVSVA 245 >gi|73990557|ref|XP_853279.1| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Canis familiaris] Length = 1634 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 41/335 (12%), Positives = 99/335 (29%), Gaps = 31/335 (9%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I+K K+ R + QI + + ++ +F Sbjct: 303 KAYTGAAIRKIRKEVFSARNGSRKNQGVPQIAVLVTHRPSEDNVTKAAVNLRREGVTIFT 362 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND----NNNMTS 176 G+ ++ + L ++ E+ L + ++ L Q + + Sbjct: 363 MGIEGASDSQLEKIASHPAEQHVSKLKTFSELAAHNQTFLKKLRNQITLTVSVFSERTET 422 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 K + + + +V A V Sbjct: 423 LKSGCVDTEEADIYLLIDGSGSTQ------ATDFHEMKTFLSEVVGMFNIAP-----QKV 471 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESS 295 R+G + Y N +++ + + TNT A++ L K+ Sbjct: 472 RVGAVQYADSWDLEFEINKYTNKHDLGKAIENIRQMGGNTNTGAALNFTLGLLQKAKKQR 531 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N + ++ +T+G + + L+ +R +++Y++ V Q Sbjct: 532 GNR-----VPCHLVVLTNGM--------SKDSILEPANRLREELIRVYAIGV--KEANQT 576 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 LR+ + + V+D L + +++ +I + Sbjct: 577 QLREIAGEDKRVYYVHDFDALKDIRNQVVQEICAE 611 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 58/164 (35%), Gaps = 15/164 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYR 286 + VRIG ++ E+ ++ + T+ A+ R Sbjct: 1027 DFDVSVNRVRIGAAQFSHTYRPEFPLGTFIGKKEISFQIENIQQIFGYTHIGAAL----R 1082 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 E+ + + + + ++ +TDG++ Q E +R+ G+ IYSV Sbjct: 1083 EVGDYFRPDMGSRINAGTPQVLLVLTDGQSQD--------EVAQAAEELRHKGIDIYSVG 1134 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L + T ++ + V++ EL + +I I Sbjct: 1135 IG--DVDDQQLIQITGTADKKLTVHNFDELTKVKKRIVRNICTS 1176 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 36/174 (20%), Positives = 65/174 (37%), Gaps = 15/174 (8%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TN 276 +V+ ++KA + VR G + Y + EV S L K P T Sbjct: 828 DFMVDLVKKA--DVGKNQVRFGALKYADDPEVLFYLDDLSTKWEVISVLQKDQPMGGNTY 885 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T A+ + + + + + +I ITDGE+ A +T + +R Sbjct: 886 TAEALGFSDHMFTE----ARGSRLHKGVPQVLIVITDGESHDADKLNDT------AKALR 935 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ + +V + A +LL SS ++F V L F ++ + Sbjct: 936 DKGILVLAVGI-AGANPVELL-AMAGSSDKYFFVETFGGLKGIFSDVSASVCNS 987 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 53/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + NE+ + ++++ T T A+ + K Sbjct: 655 DRVQIGVVQFSDINKEEFQLNRYMSQNEISNAIDRMAHIGETTLTGSALTFVSQYFSPAK 714 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + +++F+I ITDGE +R G+ IYSV + Sbjct: 715 GARP------NVRRFLILITDGEAQD--------IVKDPAVALRQEGIIIYSVGVFGSNV 760 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 761 ---TQLEEISGRPEMVFYVENFDILQHIEDDLVFGICSPR 797 >gi|54025448|ref|YP_119690.1| hypothetical protein nfa34780 [Nocardia farcinica IFM 10152] gi|81374389|sp|Q5YU15|Y3478_NOCFA RecName: Full=UPF0353 protein NFA_34780 gi|54016956|dbj|BAD58326.1| hypothetical protein [Nocardia farcinica IFM 10152] Length = 335 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 64/198 (32%), Gaps = 27/198 (13%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 + K + + +G + + Q + N VK+ ++ + E T T Sbjct: 112 EVAQQAGKEFVDGLTQGINLGFVTFAGTASVMQSP--TTNREAVKAAIDNIKLAERTATG 169 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS--GASAYQNTLNTLQICEYMR 336 + A + + T ++ ++DG+ + N + + Sbjct: 170 EGILTALQSIETLATVLGGA--ETPPPARIVLMSDGKQTVPDDKDVDNPRHAFTAARLAK 227 Query: 337 NAGMKIYSVAVSAPPE--------------------GQDLLRKCTD-SSGQFFAVNDSRE 375 + G+ + +++ + LR+ S G+F+ + E Sbjct: 228 SKGIPVSTISFGTEWGSVEIPDQDGQGGSQRVKVPVDNESLREIAKLSGGEFYTASSLEE 287 Query: 376 LLESFDKITDKIQEQSVR 393 L +D + ++I ++ R Sbjct: 288 LTAVYDTLEEQIGYETTR 305 >gi|153806291|ref|ZP_01958959.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] gi|149130968|gb|EDM22174.1| hypothetical protein BACCAC_00547 [Bacteroides caccae ATCC 43185] Length = 327 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 64/205 (31%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + ++ + + T Sbjct: 108 NRLEAAKDVAAEFINGRPNDNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKCGLIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ L E Sbjct: 168 AVGMGIANAVTRL----------KDSKAKSKVIILLTDGTNNKGDIS-----PLTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP---------------------EGQDLLRKCTDSSGQFFAVNDSR 374 ++ G+++Y++ V + + L + + G +F + Sbjct: 213 KSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|260061451|ref|YP_003194531.1| aerotolerance operon BatA [Robiginitalea biformata HTCC2501] gi|88785583|gb|EAR16752.1| BatA (Bacteroides aerotolerance operon) [Robiginitalea biformata HTCC2501] Length = 333 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 61/177 (34%), Gaps = 43/177 (24%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNE 291 + RIG +AY TP++++ + V L ++ + T + + L Sbjct: 130 NDRIGLVAYAGESYTK--TPITSDKSIVLGALREITYGQLNDGTAIGMGLATSVNRL--- 184 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 ST + K +I +TDG N+ + + G+K Y++ + Sbjct: 185 -------KESTAISKVIILLTDGVNNAG-----FIEPQTAADLALEYGIKTYTIGLGTNG 232 Query: 352 E----------------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 ++LL + G++F D+ +L +++I Sbjct: 233 NALSPIGYNPDGSFRYGMRQVEIDEELLTDIATATGGEYFRATDNEKLEAIYEEINK 289 >gi|298207017|ref|YP_003715196.1| aerotolerance-related membrane protein [Croceibacter atlanticus HTCC2559] gi|83849651|gb|EAP87519.1| aerotolerance-related membrane protein [Croceibacter atlanticus HTCC2559] Length = 334 Score = 83.0 bits (203), Expect = 8e-14, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 71/205 (34%), Gaps = 53/205 (25%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +++ L + A + + R+G + Y TP++++ + V LN Sbjct: 112 NRLEALKDVASEFIQ---------GRPNDRVGIVLYAGESYTK--TPITSDKSIVLGALN 160 Query: 268 KLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + T + + L S L K +I +TDG N+ Sbjct: 161 DVKFSEVLENGTAIGMGLATSVNRL----------KDSKALSKVIILLTDGVNNSG---- 206 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPP----------------------EGQDLLRKCT 361 T++ E G+K Y++ + + +DLL++ Sbjct: 207 -TIDPKLASELAVEYGIKTYTIGIGSNGMALSPIGIKSNGQFQYGNQKVEIDEDLLKQIA 265 Query: 362 D-SSGQFFAVNDSRELLESFDKITD 385 + GQ+F ++++L +++I Sbjct: 266 TVTGGQYFRATNNQKLEAIYEEINK 290 >gi|297671961|ref|XP_002814089.1| PREDICTED: collagen alpha-5(VI) chain-like [Pongo abelii] Length = 2586 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 62/162 (38%), Gaps = 13/162 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN-KLNPYENTNTYPAMHHAYREL 288 + V+ G + Y+ +N + + L + + NT T A+ HA Sbjct: 845 DVGRNRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRMRRDTGGNTYTAKALKHANALF 904 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + H + +K+ +I ITDG++ +T +R+ G+ I++V V Sbjct: 905 TEK----HGSRIKQNVKQVLIVITDGKSHDHDQLNDT------ASELRDKGITIFAVGVG 954 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q L + V++ +L + F + +++ + Sbjct: 955 K--ANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTE 994 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 18/166 (10%), Positives = 59/166 (35%), Gaps = 16/166 (9%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHA 284 + VR+G + Y+ +N +++ + + T T A+ + Sbjct: 469 TEMFSIGPDKVRVGVVQYSDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYI 528 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + + N + + + ++I +TDG ++ ++ + +R + +++ Sbjct: 529 LQIIKNGTKDRMSK-----VPCYLIVLTDGMSTD--------RVVEPAKRLRAEQITVHA 575 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 V + + L++ + + L +++ +I + Sbjct: 576 VGIG--AANKIELQEIAGKEERVSFGQNFDALKSIKNEVVREICTE 619 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 52/165 (31%), Gaps = 17/165 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + +IG + ++ E+ +++++ E T T A++ Sbjct: 656 TKIQVGADKTQIGVVQFSDKTKEEFQLSRYFTQQEISDAIDRMSLINEGTLTGKALNFVS 715 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + K + KF+I ITDG +R + I+SV Sbjct: 716 QYFTHSKGARLGAK------KFLILITDGVAQDDVRDP--------ARILRGKDVTIFSV 761 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 V + L + + F V + L K+ ++ Sbjct: 762 GV--YNANRSQLEEISGDGSLVFHVENFDHLKALERKLVFRVCAL 804 >gi|149187170|ref|ZP_01865468.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] gi|148838706|gb|EDL55645.1| hypothetical protein VSAK1_16642 [Vibrio shilonii AK1] Length = 324 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 33/213 (15%), Positives = 72/213 (33%), Gaps = 47/213 (22%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 + + ++ + + V+ + R+G + + Sbjct: 94 SYSMSQQDMAYQDDYIDRLTAVKHVVSDFVDRRK---------GDRVGLVYFADHAYLQ- 143 Query: 252 CTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 TPL+ + VK++LN+ L T + A + + ++ + Sbjct: 144 -TPLTFDRETVKTQLNQTVLKLIGTQTAIGDGIGLATKTFVDSN----------APQRVM 192 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP----------------- 351 I ++DG N+ L+ +Q E + G IY++ V A Sbjct: 193 ILLSDGSNNAGV-----LDPVQAAEIAKKYGTTIYTIGVGAGEMQVKDFFMTRTVNTAED 247 Query: 352 -EGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + + L++ + GQ+F ++ EL +D I Sbjct: 248 LDEKTLIKIANITGGQYFRARNADELATIYDTI 280 >gi|315223608|ref|ZP_07865462.1| aerotolerance protein BatA [Capnocytophaga ochracea F0287] gi|314946389|gb|EFS98384.1| aerotolerance protein BatA [Capnocytophaga ochracea F0287] Length = 340 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 28/181 (15%), Positives = 66/181 (36%), Gaps = 43/181 (23%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRE 287 K + RIG + Y TP++ + + + L++L + T + A Sbjct: 133 KDRPNDRIGLVIYAGESYTK--TPVTTDKGIILNALSELTYGQIEDGTAIGMGLATAVNR 190 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L S + +I +TDG N+ ++ E G+++Y++ + Sbjct: 191 L----------KESKAKSRVIILLTDGVNNTG-----FIDPQTAAELAAEYGIRVYTIGI 235 Query: 348 SAPPE----------------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 + + L++K + + G++F D+++L + +++I Sbjct: 236 GSNGTALSPYALNPDGSIMYRMLQVEIDEPLMKKIAEVTHGRYFRATDNQKLQQIYNEIN 295 Query: 385 D 385 Sbjct: 296 K 296 >gi|256426121|ref|YP_003126774.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256041029|gb|ACU64573.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 462 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 68/189 (35%), Gaps = 19/189 (10%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 KI ++A L++ + + + Y+ + + N +K+ ++ Sbjct: 96 DKIKYARQAAKFLIDQLNSTD--------HLSIVNYDDRVEVTSPSQSVKNKEALKAAID 147 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 K++ +TN M Y ++ + ++ + V+ +TDG + L Sbjct: 148 KIHDRGSTNLSGGMLEGYTQVKSTRKEGYVNR--------VLLLTDGLANQGITDPLELK 199 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDK 386 L + G+ + + V A +DLL ++ ++ ++ ++ + F Sbjct: 200 RLAE-NKYKEDGIALSTFGVGAD-YNEDLLTMLAENGRANYYFIDSPDKIPQIFAGELKG 257 Query: 387 IQEQSVRIA 395 + + A Sbjct: 258 LLSVVAQNA 266 >gi|327399949|ref|YP_004340788.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] gi|327315457|gb|AEA46073.1| von Willebrand factor type A [Archaeoglobus veneficus SNP6] Length = 527 Score = 82.6 bits (202), Expect = 9e-14, Method: Composition-based stats. Identities = 26/214 (12%), Positives = 78/214 (36%), Gaps = 29/214 (13%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 + + + + N + +A + V+ + + +S Sbjct: 61 TGSEWTTSVPIDVVFALDSSGSMGWNDPSGLRKTAAKSFVDKLNSTTDQAGVVS------ 114 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 + L+NN + VKS+++ ++ T+ ++ A L K+++ + Sbjct: 115 ---WDNNIDFTQTLTNNFSLVKSKIDAVDSSGGTDLNVGLNAAISLLDTGKQANSSW--- 168 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA--VSAPPEGQDLLRK 359 +IF+++G+ + + + N G +Y++ +S + L+ Sbjct: 169 -----VIIFLSNGQGTYSHST---------AVVAANKGYTVYTIGLAISPGSTAESNLKD 214 Query: 360 CT-DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G++++ ++ L F+ I ++ ++ Sbjct: 215 IANTTGGKYYSSPNATNLDAVFNDIYKEVVTSTI 248 >gi|254459074|ref|ZP_05072497.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084345|gb|EDZ61634.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 279 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 34/154 (22%), Positives = 65/154 (42%), Gaps = 21/154 (13%) Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 G +P++ V L L +NT A+ R S Sbjct: 110 YGDFAFIASPITYEKEIVTQMLGYLTQGMAGQNTAIGEAIAMGVRSF----------KHS 159 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA-PPEGQDLLRKC 360 K ++ ++DGE++ S + E + G+KIY++A+ + LL Sbjct: 160 KAKTKVIVLLSDGEHNSGSVS-----PKEATELAKEQGIKIYTIAMGNKGEADEALLETI 214 Query: 361 T-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 DS+G+FF+ + ++EL +D+I DK++ +++ Sbjct: 215 AKDSNGEFFSASSAKELKNIYDEI-DKLESSNIK 247 >gi|78776855|ref|YP_393170.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] gi|78497395|gb|ABB43935.1| von Willebrand factor, type A [Sulfurimonas denitrificans DSM 1251] Length = 309 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 71/180 (39%), Gaps = 19/180 (10%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENT 275 + + +++ + + + G +PL+ ++N + L++L + T Sbjct: 109 SRFDVVKEIVSDFISQRKNDNMGLVVFGAYSFIASPLTYDVNILNKILSQLQIGMAGKYT 168 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 ++ L S K I +TDG ++ +T+ + + Sbjct: 169 ALNTSLAQGANLL----------KQSKSKTKIAILLTDGYSTPQV---DTITLDIALDMI 215 Query: 336 RNAGMKIYSVAVSAPP--EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + G+K+Y + + P + LL+ +S G F + + EL E + KI D +++ ++ Sbjct: 216 KKEGIKVYPIGIGMPHEYNTEALLKIANESGGVAFGASSAAELQEVYKKI-DSLEKSKIK 274 >gi|296228122|ref|XP_002759734.1| PREDICTED: collagen alpha-4(VI) chain-like [Callithrix jacchus] Length = 2348 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 57/168 (33%), Gaps = 16/168 (9%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + + VR+G YN I + V ++ L TNT A+ Sbjct: 263 SGLDVSSDRVRVGLAQYNDDIYPAFQLNQHPLKSMVLEQIQNLPYRTGGTNTGNALEFIR 322 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 E S R+ + VI +TDGE++ + + ++ G+ +Y V Sbjct: 323 TSYLTEGSGSRAK---DRVPQIVILVTDGESND--------EVQEAADQLKEDGIVVYVV 371 Query: 346 AVSAPPEGQDLLRKCTDSS-GQF-FAVNDSRELLESFDKITDKIQEQS 391 + + L+K +F F + + L + I + Sbjct: 372 GI--NVQDVQELQKIASEPFEKFLFNIENFNILQDFSGSILQTLCSAV 417 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 59/158 (37%), Gaps = 21/158 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEK 292 V+ G I Y+ I + E+K ++ + T T A+++ + + Sbjct: 878 DRVQFGVIQYSDEIKSKFVLSQYPTVAELKVAIDNIQQGGGGTTTGEALNNMTQVFADTA 937 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + +++I ITDG++S A E +R +G+ IY++ V Sbjct: 938 R--------INVARYLIVITDGKSSDPVAD--------AAEGLRASGVIIYAIGV--REA 979 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 D L++ + F V + L + ++ I Sbjct: 980 NIDELKEIAK--DKIFFVYEFDLLKDIQKEVVRDICSS 1015 Score = 54.5 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 59/162 (36%), Gaps = 17/162 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYREL 288 S++IG + ++ ++ ++ + + + + T T A++ Sbjct: 1055 NIGTDSIQIGLLQFSSIPKEEFRLNRYSSKVDIYNAIFAVQQMRDGTRTGKALNFTLPFF 1114 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + K +++++I ITDG N + + +R+ + I+++ V Sbjct: 1115 ESSKGGRP------SVQQYLIVITDGVAQD--------NVILPAKALRDKNIIIFAIGVG 1160 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + LL T+ + + D L +I K+ + Sbjct: 1161 -EAKKSQLLE-ITNDEDRVYYDVDFEVLQNLEKEILSKVCDP 1200 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 42/310 (13%), Positives = 96/310 (30%), Gaps = 31/310 (10%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 Q + + ++ + + G+ L + + + Sbjct: 535 PQYAVVITSGKSKDEVQDAAQRLREKGVKVMSVGVQDFDRRELEWMGSPDLVYDIQREDR 594 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 +V D++ ++ ++H + N + P + +++ P + Sbjct: 595 VRHVVEDMNVVIQGTGQREHRNTANKEAVGACATAIPADLVFLT----EEFSRVRQPNFQ 650 Query: 209 K-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + I L L + +VR G + Y+ N E+ L+ Sbjct: 651 QVISFLKTIVSLL---------SIRPDAVRFGLVFYSEEPRLEFSLDTFQNPAEILEHLD 701 Query: 268 KLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 KL T T A+ E++ + S +++ + IT+ + + + Sbjct: 702 KLTYRERRGRTKTGAALDFLRNEVF---IQEKGSRSSHGVQQIAVVITENFSQDSVSGP- 757 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 +R AG+ IY+V E +D L K + +L K Sbjct: 758 -------ASRLRRAGVTIYAVGTQDVSESKD-LEKMASYPPWKHSVPLESFLQLSIIGSK 809 Query: 383 ITDKIQEQSV 392 +T+++ + V Sbjct: 810 LTNQLCPEIV 819 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 54/171 (31%), Gaps = 16/171 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYREL 288 + ++R+G Y+ + N+V + + A+ L Sbjct: 63 NVSSKTIRVGLAQYSDVPHSEFLLSTYHRKNDVLRHIRQFQFKPGGKKMGLALQFI---L 119 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + S + + + ++ G E +R AG+ +Y+V V Sbjct: 120 DHHFQEAAGSRASQGVPQIAVVMSSGPAEDHVHGP--------GEALRRAGILVYAVGV- 170 Query: 349 APPEGQDLLRKCTDSSGQFF--AVNDSRELLESFDKITDKIQEQSVRIAPN 397 LR+ S + F V L K+ ++ + + A Sbjct: 171 -KDAVWAELREIASSPQENFTSFVPSLSGLSSLAQKLRQELCDTLAKAASR 220 >gi|152990152|ref|YP_001355874.1| von Willebrand factor A [Nitratiruptor sp. SB155-2] gi|151422013|dbj|BAF69517.1| von Willebrand factor type A domain protein [Nitratiruptor sp. SB155-2] Length = 305 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 30/139 (21%), Positives = 53/139 (38%), Gaps = 20/139 (14%) Query: 252 CTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 +PL+ N VK L+ L+ T A+ + R L S K V Sbjct: 141 ASPLTFNKEAVKKILDYLDIGVAGSKTAIDDALIESVRLL----------KESQAKSKIV 190 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-PEGQDLLRKCTDSS-GQ 366 I +TDG ++ + + + + G+KIY++ + + LR G Sbjct: 191 ILLTDGIDTASKTPPDV-----AVKMAKKYGVKIYTIGIGDKRGIDEAFLRWLAQQGHGY 245 Query: 367 FFAVNDSRELLESFDKITD 385 +F D+ L + +D+I Sbjct: 246 YFYAKDASMLRKIYDEINR 264 >gi|262164956|ref|ZP_06032694.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus VM223] gi|262027336|gb|EEY46003.1| protein TadG associated with Flp pilus assembly [Vibrio mimicus VM223] Length = 403 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 36/430 (8%), Positives = 116/430 (26%), Gaps = 68/430 (15%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 ++ + + + + L+ ++ A + A L+ I P + + Sbjct: 1 MMFPAMMMILAFTMQLSQQFLAHARLSEASEVASLAL---------IASPKEDDENNVSY 51 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 +K + +++ + + + + + ++ Sbjct: 52 ARKVVDRYVVDNIDDIKVTVKNKRCEYKDGCVQSSGEAAPFTDFTVAATAKHKSWISYE- 110 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 +L + + R + + ++D+S SM + + + + + Sbjct: 111 -NISLKPEFTVNGSSVTRKFLPQPVDVYFIVDMSASMRATWQNGKSQIDEVKEVITRVVN 169 Query: 184 PPKK---SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 K S+ + + + +R ++ + Q N+ + Sbjct: 170 DLKGFDTEVKSRVSLLAYHNYNIKQGSRTLEAYDYAL------YNTPQQTVSNMFLPPKR 223 Query: 241 IAYNIGI-VGNQCTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKESS 295 + + ++ L+ N + +N N T ++ + + Sbjct: 224 VTPSDSALFSHRDIDLTQNYSSFLQIMNDRNFYPPKSACTESWQGII--------AAAQA 275 Query: 296 HNTIGSTRLKKFVIFITDGENSGA------------SAYQNTLNTLQICEYMRNA----- 338 + ++ I ++DG + Y L +C+ ++ Sbjct: 276 ADKATDINPEQVFIILSDGADCSWQRQDRWGRWLTTKNYLKKLVDSGLCKNLKQRIRQKP 335 Query: 339 --------------GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 + + + V+ D C + + + I Sbjct: 336 NRFQSSTPTENEKTKVTMGVIGVNYQVNPNDGFGDCV-GRENIYHATQGE---DVYKYIL 391 Query: 385 DKIQEQSVRI 394 + I E++ R+ Sbjct: 392 NLINEETGRL 401 >gi|109071570|ref|XP_001110086.1| PREDICTED: collagen alpha-1(XXI) chain-like isoform 2 [Macaca mulatta] Length = 959 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 32/176 (18%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTRTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ + E R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------DVKDAAEAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + I K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVIKQKLCEESV--CPTR 220 >gi|212720733|ref|NP_001132911.1| collagen, type XXII, alpha 1 [Gallus gallus] Length = 1599 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 70/200 (35%), Gaps = 22/200 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV + + R+G + Y+ Sbjct: 45 ASSSVGKEDFEKVRQWVSNLVETFE-----IGPDKTRVGVVRYSDRPTTEFDLGKYKTCE 99 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+K K+ NTNT A+ + ++ + + + +KK I +TDG + Sbjct: 100 EIKEAARKIRYYGGNTNTGDALRYINTYSFS--KEAGGRLSDRTVKKVAILLTDGRSQDY 157 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 R AG++I++V V ++ L + F V+D + Sbjct: 158 VLDP--------ANAARQAGIRIFAVGVG--EALKEELDEIASEPKSAHVFHVSDYNAID 207 Query: 378 ESFDKITDKIQEQSVRIAPN 397 + K+ ++ E + PN Sbjct: 208 KIRGKLRRRLCENV--LCPN 225 >gi|212693197|ref|ZP_03301325.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|237709939|ref|ZP_04540420.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|237725394|ref|ZP_04555875.1| conserved hypothetical protein [Bacteroides sp. D4] gi|265753591|ref|ZP_06088946.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] gi|212664302|gb|EEB24874.1| hypothetical protein BACDOR_02707 [Bacteroides dorei DSM 17855] gi|229436081|gb|EEO46158.1| conserved hypothetical protein [Bacteroides dorei 5_1_36/D4] gi|229456032|gb|EEO61753.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA] gi|263235305|gb|EEZ20829.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA] Length = 332 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 31/252 (12%), Positives = 68/252 (26%), Gaps = 56/252 (22%) Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + + S A +++ + A + Sbjct: 72 ARPQTTDNWQNTEIEGIDIMLAVDVSTSMLAEDLKPNRLEAAKQVASEFI---------- 121 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--------ENTNTYPAMHH 283 N PL+ + + + N + + T + + Sbjct: 122 -NGRPNDNIGLTIFAGESFTQCPLTVDHGVLLNLFNSIKGDIAQRGLIEDGTAIGMGIAN 180 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A L S K +I +TDG N+ L E + G++IY Sbjct: 181 AVTRL----------KDSKAKSKVIILLTDGSNNRGDIS-----PLTAAEIAKQFGIRIY 225 Query: 344 SVAVSAPPE--------------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDK 382 ++ V + L + ++G +F + +L E + + Sbjct: 226 TIGVGTNGTAPYPMQTYAGTQYVNVPVEIDEKTLTEIAGTTNGNYFRATSNSKLKEVYQE 285 Query: 383 ITDKIQEQSVRI 394 I DK+++ + + Sbjct: 286 I-DKLEKTKLNV 296 >gi|332817900|ref|XP_526306.3| PREDICTED: collagen alpha-5(VI) chain isoform 2 [Pan troglodytes] Length = 2526 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 62/162 (38%), Gaps = 13/162 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHHAYREL 288 + V+ G + Y+ +N + + L + + NT T A+ HA Sbjct: 845 DVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHANALF 904 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E H + +K+ +I ITDGE+ +T +RN G+ I++V V Sbjct: 905 TEE----HGSRIKQNVKQMLIVITDGESHDHDQLNDT------ALELRNKGITIFAVGVG 954 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q L + V++ +L + F + +++ + Sbjct: 955 K--ANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTE 994 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 18/166 (10%), Positives = 59/166 (35%), Gaps = 16/166 (9%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHA 284 + VR+G + Y+ +N +++ + + T T A+ + Sbjct: 469 TEMFSIGPDKVRVGVVQYSDDTEVEFYVTDYSNDIDLRKAIFNIKQLTGGTYTGKALDYI 528 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + + N + + + ++I +TDG ++ ++ + +R + +++ Sbjct: 529 LQIIKNGMKDRMSK-----VPCYLIVLTDGMSTD--------RVVEPAKRLRAEQITVHA 575 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 V + + L++ + + L +++ +I + Sbjct: 576 VGIG--AANKTELQEIAGKEERVSFGQNFDALKSIKNEVIREICAE 619 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 49/153 (32%), Gaps = 17/153 (11%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + +IG + ++ E+ +++++ E T T A++ Sbjct: 656 TKIQIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVG 715 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + K + KF+I ITDG +R + I+SV Sbjct: 716 QYFTHSKGARLGAK------KFLILITDGVAQDDVRDP--------ARILRGKDVTIFSV 761 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 V + L + + S F V + L Sbjct: 762 GV--YNANRSQLEEISGDSSLVFHVENFDHLKA 792 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 66/186 (35%), Gaps = 20/186 (10%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N + +L+++ + ++ ++IG + S + K+++ Sbjct: 1018 NSDFVTMTTFLSDLIDNF-----DIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKTQI 1072 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 ++ +P + A +++ N + + + ++ IT G+ Sbjct: 1073 QNVSKSGG---FPRIDFALKKVSNMFNLHAGGRRNAGVPQTLVVITSGDP--------RY 1121 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL--ESFDKIT 384 + + +++ G+ + + + + LL T +S + D +L + +I Sbjct: 1122 DVADAVKTLKDLGICVLVLGIG-NVYKEQLL-PITGNSEKIITFQDFDKLKNVDVKKRII 1179 Query: 385 DKIQEQ 390 +I + Sbjct: 1180 REICQS 1185 >gi|332817898|ref|XP_003310056.1| PREDICTED: collagen alpha-5(VI) chain isoform 1 [Pan troglodytes] Length = 2615 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 62/162 (38%), Gaps = 13/162 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHHAYREL 288 + V+ G + Y+ +N + + L + + NT T A+ HA Sbjct: 845 DVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHANALF 904 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E H + +K+ +I ITDGE+ +T +RN G+ I++V V Sbjct: 905 TEE----HGSRIKQNVKQMLIVITDGESHDHDQLNDT------ALELRNKGITIFAVGVG 954 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q L + V++ +L + F + +++ + Sbjct: 955 K--ANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTE 994 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 18/166 (10%), Positives = 59/166 (35%), Gaps = 16/166 (9%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHA 284 + VR+G + Y+ +N +++ + + T T A+ + Sbjct: 469 TEMFSIGPDKVRVGVVQYSDDTEVEFYVTDYSNDIDLRKAIFNIKQLTGGTYTGKALDYI 528 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + + N + + + ++I +TDG ++ ++ + +R + +++ Sbjct: 529 LQIIKNGMKDRMSK-----VPCYLIVLTDGMSTD--------RVVEPAKRLRAEQITVHA 575 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 V + + L++ + + L +++ +I + Sbjct: 576 VGIG--AANKTELQEIAGKEERVSFGQNFDALKSIKNEVIREICAE 619 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 49/153 (32%), Gaps = 17/153 (11%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + +IG + ++ E+ +++++ E T T A++ Sbjct: 656 TKIQIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVG 715 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + K + KF+I ITDG +R + I+SV Sbjct: 716 QYFTHSKGARLGAK------KFLILITDGVAQDDVRDP--------ARILRGKDVTIFSV 761 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 V + L + + S F V + L Sbjct: 762 GV--YNANRSQLEEISGDSSLVFHVENFDHLKA 792 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 66/186 (35%), Gaps = 20/186 (10%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N + +L+++ + ++ ++IG + S + K+++ Sbjct: 1018 NSDFVTMTTFLSDLIDNF-----DIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKTQI 1072 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 ++ +P + A +++ N + + + ++ IT G+ Sbjct: 1073 QNVSKSGG---FPRIDFALKKVSNMFNLHAGGRRNAGVPQTLVVITSGDP--------RY 1121 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL--ESFDKIT 384 + + +++ G+ + + + + LL T +S + D +L + +I Sbjct: 1122 DVADAVKTLKDLGICVLVLGIG-NVYKEQLL-PITGNSEKIITFQDFDKLKNVDVKKRII 1179 Query: 385 DKIQEQ 390 +I + Sbjct: 1180 REICQS 1185 >gi|325286051|ref|YP_004261841.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321505|gb|ADY28970.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 332 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 59/193 (30%), Gaps = 41/193 (21%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 + + +++K + TP++++ + V S L ++ + T Sbjct: 111 DRLTALKKVAADFIKKRPNDRIGLVAYAAESYTKTPITSDKSIVLSSLRQITHGQLEDGT 170 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A L S K +I +TDG N+ + + Sbjct: 171 AIGMGLATAVNRL----------KDSKSKSKVIILLTDGVNNSG-----FIEPKTAADLA 215 Query: 336 RNAGMKIYSVAVSAPPE----------------------GQDLLRKCTD-SSGQFFAVND 372 +K Y++ + + LL + + GQ+F D Sbjct: 216 VEYKIKTYTIGLGTNGNALTPIAFNPDRTYRYGMRQVEIDEKLLEEIATVTGGQYFRATD 275 Query: 373 SRELLESFDKITD 385 + +L +++I Sbjct: 276 NEKLSAIYNEINK 288 >gi|183583553|ref|NP_694996.5| collagen alpha-5(VI) chain [Homo sapiens] Length = 2526 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 62/162 (38%), Gaps = 13/162 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHHAYREL 288 + V+ G + Y+ +N + + L + + NT T A+ HA Sbjct: 845 DVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHANALF 904 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E H + +K+ +I ITDGE+ +T +RN G+ I++V V Sbjct: 905 TEE----HGSRIKQNVKQMLIVITDGESHDHDQLNDT------ALELRNKGITIFAVGVG 954 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q L + V++ +L + F + +++ + Sbjct: 955 K--ANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTE 994 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 18/166 (10%), Positives = 59/166 (35%), Gaps = 16/166 (9%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHA 284 + VR+G + Y+ +N +++ + + T T A+ + Sbjct: 469 TEMFSIGPDKVRVGVVQYSDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYI 528 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + + N + + + ++I +TDG ++ ++ + +R + +++ Sbjct: 529 LQIIKNGMKDRMSK-----VPCYLIVLTDGMSTD--------RVVEPAKRLRAEQITVHA 575 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 V + + L++ + + L +++ +I + Sbjct: 576 VGIG--AANKIELQEIAGKEERVSFGQNFDALKSIKNEVVREICAE 619 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 49/153 (32%), Gaps = 17/153 (11%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + +IG + ++ E+ +++++ E T T A++ Sbjct: 656 TKIQIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVG 715 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + K + KF+I ITDG +R + I+SV Sbjct: 716 QYFTHSKGARLGAK------KFLILITDGVAQDDVRDP--------ARILRGKDVTIFSV 761 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 V + L + + S F V + L Sbjct: 762 GV--YNANRSQLEEISGDSSLVFHVENFDHLKA 792 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 66/186 (35%), Gaps = 20/186 (10%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N + +L+++ + ++ ++IG + S + K+++ Sbjct: 1018 NSDFVTMTTFLSDLIDNF-----DIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKTQI 1072 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 ++ +P + A +++ N + + + ++ IT G+ Sbjct: 1073 QNVSKSGG---FPRIDFALKKVSNMFNLHAGGRRNAGVPQTLVVITSGDP--------RY 1121 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL--ESFDKIT 384 + + +++ G+ + + + + LL T +S + D +L + +I Sbjct: 1122 DVADAVKTLKDLGICVLVLGIG-DVYKEHLL-PITGNSEKIITFQDFDKLKNVDVKKRII 1179 Query: 385 DKIQEQ 390 +I + Sbjct: 1180 REICQS 1185 >gi|189082691|sp|A8TX70|CO6A5_HUMAN RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; AltName: Full=von Willebrand factor A domain-containing protein 4; Flags: Precursor gi|158828630|gb|ABW81241.1| collagen XXIX alpha 1 [Homo sapiens] Length = 2615 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 62/162 (38%), Gaps = 13/162 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHHAYREL 288 + V+ G + Y+ +N + + L + + NT T A+ HA Sbjct: 845 DVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHANALF 904 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E H + +K+ +I ITDGE+ +T +RN G+ I++V V Sbjct: 905 TEE----HGSRIKQNVKQMLIVITDGESHDHDQLNDT------ALELRNKGITIFAVGVG 954 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q L + V++ +L + F + +++ + Sbjct: 955 K--ANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTE 994 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 18/166 (10%), Positives = 59/166 (35%), Gaps = 16/166 (9%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHA 284 + VR+G + Y+ +N +++ + + T T A+ + Sbjct: 469 TEMFSIGPDKVRVGVVQYSDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYI 528 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + + N + + + ++I +TDG ++ ++ + +R + +++ Sbjct: 529 LQIIKNGMKDRMSK-----VPCYLIVLTDGMSTD--------RVVEPAKRLRAEQITVHA 575 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 V + + L++ + + L +++ +I + Sbjct: 576 VGIG--AANKIELQEIAGKEERVSFGQNFDALKSIKNEVVREICAE 619 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 49/153 (32%), Gaps = 17/153 (11%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + +IG + ++ E+ +++++ E T T A++ Sbjct: 656 TKIQIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVG 715 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + K + KF+I ITDG +R + I+SV Sbjct: 716 QYFTHSKGARLGAK------KFLILITDGVAQDDVRDP--------ARILRGKDVTIFSV 761 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 V + L + + S F V + L Sbjct: 762 GV--YNANRSQLEEISGDSSLVFHVENFDHLKA 792 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 66/186 (35%), Gaps = 20/186 (10%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N + +L+++ + ++ ++IG + S + K+++ Sbjct: 1018 NSDFVTMTTFLSDLIDNF-----DIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKTQI 1072 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 ++ +P + A +++ N + + + ++ IT G+ Sbjct: 1073 QNVSKSGG---FPRIDFALKKVSNMFNLHAGGRRNAGVPQTLVVITSGDP--------RY 1121 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL--ESFDKIT 384 + + +++ G+ + + + + LL T +S + D +L + +I Sbjct: 1122 DVADAVKTLKDLGICVLVLGIG-DVYKEHLL-PITGNSEKIITFQDFDKLKNVDVKKRII 1179 Query: 385 DKIQEQ 390 +I + Sbjct: 1180 REICQS 1185 >gi|119599630|gb|EAW79224.1| hypothetical protein FLJ35880 [Homo sapiens] Length = 2531 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 62/162 (38%), Gaps = 13/162 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHHAYREL 288 + V+ G + Y+ +N + + L + + NT T A+ HA Sbjct: 790 DVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHANALF 849 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E H + +K+ +I ITDGE+ +T +RN G+ I++V V Sbjct: 850 TEE----HGSRIKQNVKQMLIVITDGESHDHDQLNDT------ALELRNKGITIFAVGVG 899 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q L + V++ +L + F + +++ + Sbjct: 900 K--ANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTE 939 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 18/166 (10%), Positives = 59/166 (35%), Gaps = 16/166 (9%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHA 284 + VR+G + Y+ +N +++ + + T T A+ + Sbjct: 414 TEMFSIGPDKVRVGVVQYSDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYI 473 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + + N + + + ++I +TDG ++ ++ + +R + +++ Sbjct: 474 LQIIKNGMKDRMSK-----VPCYLIVLTDGMSTD--------RVVEPAKRLRAEQITVHA 520 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 V + + L++ + + L +++ +I + Sbjct: 521 VGIG--AANKIELQEIAGKEERVSFGQNFDALKSIKNEVVREICAE 564 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 49/153 (32%), Gaps = 17/153 (11%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + +IG + ++ E+ +++++ E T T A++ Sbjct: 601 TKIQIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVG 660 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + K + KF+I ITDG +R + I+SV Sbjct: 661 QYFTHSKGARLGAK------KFLILITDGVAQDDVRDP--------ARILRGKDVTIFSV 706 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 V + L + + S F V + L Sbjct: 707 GV--YNANRSQLEEISGDSSLVFHVENFDHLKA 737 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 66/186 (35%), Gaps = 20/186 (10%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N + +L+++ + ++ ++IG + S + K+++ Sbjct: 963 NSDFVTMTTFLSDLIDNF-----DIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKTQI 1017 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 ++ +P + A +++ N + + + ++ IT G+ Sbjct: 1018 QNVSKSGG---FPRIDFALKKVSNMFNLHAGGRRNAGVPQTLVVITSGDP--------RY 1066 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL--ESFDKIT 384 + + +++ G+ + + + + LL T +S + D +L + +I Sbjct: 1067 DVADAVKTLKDLGICVLVLGIG-DVYKEHLL-PITGNSEKIITFQDFDKLKNVDVKKRII 1124 Query: 385 DKIQEQ 390 +I + Sbjct: 1125 REICQS 1130 >gi|237716505|ref|ZP_04546986.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262408103|ref|ZP_06084651.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|294645097|ref|ZP_06722823.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294809498|ref|ZP_06768201.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|298484179|ref|ZP_07002345.1| BatA protein [Bacteroides sp. D22] gi|229444152|gb|EEO49943.1| aerotolerance protein BatA [Bacteroides sp. D1] gi|262354911|gb|EEZ04003.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|292639603|gb|EFF57895.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294443316|gb|EFG12080.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|295084189|emb|CBK65712.1| von Willebrand factor type A domain. [Bacteroides xylanisolvens XB1A] gi|298269683|gb|EFI11278.1| BatA protein [Bacteroides sp. D22] Length = 327 Score = 82.6 bits (202), Expect = 1e-13, Method: Composition-based stats. Identities = 27/205 (13%), Positives = 63/205 (30%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + ++ + + T Sbjct: 108 NRLEAAKDVAAEFINGRPNDNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKCGLITDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ + E Sbjct: 168 AVGMGIANAVTRL----------KDSKAKSKVIILLTDGTNNKGDIS-----PMTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP---------------------EGQDLLRKCTDSSGQFFAVNDSR 374 ++ G+++Y++ V + + L + G +F + Sbjct: 213 KSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMPVEIDEKTLTEIAGTTDGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|323136144|ref|ZP_08071226.1| hypothetical protein Met49242DRAFT_0613 [Methylocystis sp. ATCC 49242] gi|322398218|gb|EFY00738.1| hypothetical protein Met49242DRAFT_0613 [Methylocystis sp. ATCC 49242] Length = 652 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 59/198 (29%), Gaps = 46/198 (23%) Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNEKESS-HNTIG 300 T L+NNL+ V + ++ +N T + A+R L +K + Sbjct: 454 PNASCPEPLTRLTNNLSTVTAAIDSMNYWLNGGTVISEGLMWAWRTLSPQKPYADGAAYT 513 Query: 301 STRLKKFVIFITDGEN-----------SGASAYQNTL----------------------- 326 + KK ++ +TDG N + + Sbjct: 514 DKKTKKVIVLMTDGVNGLADNGNAASANISDYSAYGYMGASRLSVADGVTTYAGLQTFLD 573 Query: 327 -NTLQICEYMRNAGMKIYSVAVSAP--------PEGQDLLRKCTDSSGQFFAVNDSRELL 377 + C+ + G+ IY+V + LL C F DS L Sbjct: 574 DRLKKACDNAKAKGISIYTVMFNHNGFLSATEQARSATLLSYCASKPEYAFLATDSAALN 633 Query: 378 ESFDKITDKIQEQSVRIA 395 +F +I +R+ Sbjct: 634 SAFGQIASSAAASPLRLT 651 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 25/200 (12%), Positives = 54/200 (27%), Gaps = 27/200 (13%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 + + AID +R+ +Q A + + + Sbjct: 4 FGLAMIPVTFMAGAAIDYGRATLLRSSLQK---------AADAGALAAGARTSLTQLARE 54 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 I K + +L + +++T + P + + + Q I T + + Sbjct: 55 QIAKNAVLANLGA------------KARNLSLTITETEPSAGVFQVQIQASIATSIMKVA 102 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + + S + I I + LD + SM D T + ++ Sbjct: 103 RF---DTIPVVVTSEARV-VGGSTNPIEIALALDNTGSMRDDM--PALKQAAKTLAQNVM 156 Query: 182 PPPPKKSFWSKNTTKSKYAP 201 S + P Sbjct: 157 SGSGGNVKVSVVPYVAAVNP 176 >gi|301784735|ref|XP_002927783.1| PREDICTED: collagen alpha-6(VI) chain-like [Ailuropoda melanoleuca] Length = 2267 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 41/339 (12%), Positives = 99/339 (29%), Gaps = 39/339 (11%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I+K K+ R + QI + + ++ +F Sbjct: 304 KAYTGAAIRKIRKEVFSARNGSRKNQGVPQIAVLVTHRPSEDNVTKAAVNLRREGVTIFT 363 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN-------- 172 G+ ++ + L ++ E+ L D++ + + N Sbjct: 364 MGIEGASDSQLEKIASHPAEQHVSKLKTFS----DLAAHNQTFLKKLRNQIMHTVSVFSE 419 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 + K + + + +V A Sbjct: 420 RTETLKSGCVDTEEADIYLLIDGSGSTQ------ATDFHEMKTFLSEVVGMFNIAP---- 469 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNE 291 VR+G + Y N +++ + + TNT A++ L Sbjct: 470 -QKVRVGAVQYADSWDLEFEINKYTNKHDLGKAIENIRQMGGNTNTGAALNFTLGLLQKA 528 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 K+ N + ++ +T+G + + L+ +R +++Y++ V Sbjct: 529 KKQRGNR-----VPCHLVVLTNGM--------SKDSILEPANRLREELIRVYAIGV--KE 573 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q LR+ + + V+D L + +++ +I + Sbjct: 574 ANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEICAE 612 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 59/164 (35%), Gaps = 15/164 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYR 286 + VRIG ++ E+ ++ + T+ A+ R Sbjct: 1028 DFDVSVNRVRIGAAQFSHTYRPEFPLGTFVGKKEISFQIENIQQIFGYTHIGAAL----R 1083 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 ++ + + + + ++ +TDG++ + E +R+ G+ IYSV Sbjct: 1084 QVGHYFRPDMGSRINAGTPQVLLVLTDGQSQD--------EVARAAEDLRHKGVDIYSVG 1135 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L + T ++G+ V++ EL + +I I Sbjct: 1136 IG--DVDDQQLIQITGTAGKKLTVHNFDELTKVKKRIVRNICTS 1177 Score = 66.1 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 53/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + NE+ + ++++ T T A+ + K Sbjct: 656 DRVQIGVVQFSDVNKEEFQLNRYMSQNEISNAIDRMTHIGETTLTGSALTFVSQYFSPAK 715 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + +++F+I ITDGE +R G+ IYSV + Sbjct: 716 GARP------NVRRFLILITDGEAQD--------IVKDPAVALRQEGIIIYSVGVFGSNV 761 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 762 ---TQLEEISGRPEMVFYVENFDILQHIEDDLVFGICSPR 798 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 37/174 (21%), Positives = 65/174 (37%), Gaps = 15/174 (8%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TN 276 +V+ ++KA + VR G + Y + EV S L K P T Sbjct: 829 DFMVDLVKKA--DVGKNQVRFGALKYADDPEVLFYLGDLGSKWEVISVLQKDQPMGGNTY 886 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T A+ + + + + + +I ITDGE S + LN + +R Sbjct: 887 TAEALGFSDHMFTE----ARGSRLQKGVPQVLIVITDGE----SHDADKLNAT--AKALR 936 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ + +V + A +LL SS ++F V L F ++ + Sbjct: 937 DKGILVLAVGI-AGANPVELL-AMAGSSDKYFFVETFGGLKGIFSDVSASVCNS 988 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 59/160 (36%), Gaps = 17/160 (10%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN-KLN-PYENTNTYPAMHHAYRELYNEKE 293 R+ Y+ + + N + + L L + T A+ A+R ++ Sbjct: 63 YRVAVAQYSDRLHSEFQLGTFKSRNPMLNHLKKNLGFLGGSLRTGHALREAHRTYFSAPA 122 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + V+ + Q+ + + + +R G++I SV + Sbjct: 123 GGRDKK--QFPPILVVLAS---------AQSEDDVEEASKALREDGVRIVSVGLQ--SAS 169 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ L+ ++ F + +R+L +F + +I +++ + Sbjct: 170 EEELKAMA-TAQFHFNLRSARDL-GAFSQNMTQIIKEATQ 207 >gi|281345782|gb|EFB21366.1| hypothetical protein PANDA_017603 [Ailuropoda melanoleuca] Length = 2245 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 41/339 (12%), Positives = 99/339 (29%), Gaps = 39/339 (11%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I+K K+ R + QI + + ++ +F Sbjct: 284 KAYTGAAIRKIRKEVFSARNGSRKNQGVPQIAVLVTHRPSEDNVTKAAVNLRREGVTIFT 343 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN-------- 172 G+ ++ + L ++ E+ L D++ + + N Sbjct: 344 MGIEGASDSQLEKIASHPAEQHVSKLKTFS----DLAAHNQTFLKKLRNQIMHTVSVFSE 399 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 + K + + + +V A Sbjct: 400 RTETLKSGCVDTEEADIYLLIDGSGSTQ------ATDFHEMKTFLSEVVGMFNIAP---- 449 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNE 291 VR+G + Y N +++ + + TNT A++ L Sbjct: 450 -QKVRVGAVQYADSWDLEFEINKYTNKHDLGKAIENIRQMGGNTNTGAALNFTLGLLQKA 508 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 K+ N + ++ +T+G + + L+ +R +++Y++ V Sbjct: 509 KKQRGNR-----VPCHLVVLTNGM--------SKDSILEPANRLREELIRVYAIGV--KE 553 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q LR+ + + V+D L + +++ +I + Sbjct: 554 ANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEICAE 592 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 59/164 (35%), Gaps = 15/164 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYR 286 + VRIG ++ E+ ++ + T+ A+ R Sbjct: 1008 DFDVSVNRVRIGAAQFSHTYRPEFPLGTFVGKKEISFQIENIQQIFGYTHIGAAL----R 1063 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 ++ + + + + ++ +TDG++ + E +R+ G+ IYSV Sbjct: 1064 QVGHYFRPDMGSRINAGTPQVLLVLTDGQSQD--------EVARAAEDLRHKGVDIYSVG 1115 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L + T ++G+ V++ EL + +I I Sbjct: 1116 IG--DVDDQQLIQITGTAGKKLTVHNFDELTKVKKRIVRNICTS 1157 Score = 66.1 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 53/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + NE+ + ++++ T T A+ + K Sbjct: 636 DRVQIGVVQFSDVNKEEFQLNRYMSQNEISNAIDRMTHIGETTLTGSALTFVSQYFSPAK 695 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + +++F+I ITDGE +R G+ IYSV + Sbjct: 696 GARP------NVRRFLILITDGEAQD--------IVKDPAVALRQEGIIIYSVGVFGSNV 741 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 742 ---TQLEEISGRPEMVFYVENFDILQHIEDDLVFGICSPR 778 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 37/174 (21%), Positives = 65/174 (37%), Gaps = 15/174 (8%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TN 276 +V+ ++KA + VR G + Y + EV S L K P T Sbjct: 809 DFMVDLVKKA--DVGKNQVRFGALKYADDPEVLFYLGDLGSKWEVISVLQKDQPMGGNTY 866 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T A+ + + + + + +I ITDGE S + LN + +R Sbjct: 867 TAEALGFSDHMFTE----ARGSRLQKGVPQVLIVITDGE----SHDADKLNAT--AKALR 916 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ + +V + A +LL SS ++F V L F ++ + Sbjct: 917 DKGILVLAVGI-AGANPVELL-AMAGSSDKYFFVETFGGLKGIFSDVSASVCNS 968 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 59/160 (36%), Gaps = 17/160 (10%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN-KLN-PYENTNTYPAMHHAYRELYNEKE 293 R+ Y+ + + N + + L L + T A+ A+R ++ Sbjct: 43 YRVAVAQYSDRLHSEFQLGTFKSRNPMLNHLKKNLGFLGGSLRTGHALREAHRTYFSAPA 102 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + V+ + Q+ + + + +R G++I SV + Sbjct: 103 GGRDKK--QFPPILVVLAS---------AQSEDDVEEASKALREDGVRIVSVGLQ--SAS 149 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ L+ ++ F + +R+L +F + +I +++ + Sbjct: 150 EEELKAMA-TAQFHFNLRSARDL-GAFSQNMTQIIKEATQ 187 >gi|218131126|ref|ZP_03459930.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|317476996|ref|ZP_07936238.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|217986646|gb|EEC52980.1| hypothetical protein BACEGG_02731 [Bacteroides eggerthii DSM 20697] gi|316906789|gb|EFV28501.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 327 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 69/252 (27%), Gaps = 51/252 (20%) Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + + S A +++ + A + Sbjct: 72 ARPQTTNSWQNSEIEGIDIMLAIDVSTSMLAEDLKPNRLEAAKDVAAEFI---------- 121 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYREL 288 N PL+ + + + + + + T + +A L Sbjct: 122 -NGRPNDNIGITLFAGESFTQCPLTVDHAVLLNLIKDVKCGLIEDGTAVGMGIANAVTRL 180 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 S K +I +TDG N+ L E ++ G+++Y++ V Sbjct: 181 ----------KDSKAKSKVIILLTDGTNNKGDIS-----PLTAAEIAKSFGIRVYTIGVG 225 Query: 349 APP---------------------EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD-K 386 + + L + + G +F + +L E +++I + Sbjct: 226 TNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDKLE 285 Query: 387 IQEQSVRIAPNR 398 + SV+ R Sbjct: 286 KTKLSVKEYSKR 297 >gi|316975135|gb|EFV58594.1| putative calcium binding EGF domain protein [Trichinella spiralis] Length = 2537 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 57/169 (33%), Gaps = 13/169 (7%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAY 285 + + R+G I Y + N+++++ + + T T A+ Sbjct: 511 TLFDINEDRTRVGIIQYTGQVKPEFYLNQHKNIDQLQQAIRNIRYVGGLTKTGAALQFMT 570 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + + K + ITDG N + R + +Y+V Sbjct: 571 KNTFTAQMGARTR--DPNVYKIGVVITDGRAQD--------NVKIPADEARRHNISLYAV 620 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD-KITDKIQEQSVR 393 V + L + S ++F V EL KI ++ + V+ Sbjct: 621 GV-TNHVLESELEQIAGSKDRYFIVGTFAELNTRLRAKIQKEMCKGIVK 668 >gi|19031199|gb|AAL17973.1| proximal thread matrix protein 1b [Mytilus edulis] Length = 444 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 27/236 (11%), Positives = 72/236 (30%), Gaps = 18/236 (7%) Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + + + + + + A ++ Sbjct: 209 NYVLTTNSFSELSTLLKLVIDLACEVCVVDCAGHADIAFVFDASSSIN-ANNPNNYGLMK 267 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-- 272 + ++V+ K + +V + + ++ E+K ++K+ P Sbjct: 268 DFMKDIVDRFNKTGPDGTQFAV----VTFADRATKQFGLKDYSSKAEIKGAIDKVTPSII 323 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T + +A E++ + ++K VI +TDG+N+G + ++ Sbjct: 324 GQTAIGDGLENARLEVFPNRNGGGR----EEVQKVVILLTDGQNNGHKSPEHES------ 373 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 +R G+ I ++ V + L S F + +L + + + Sbjct: 374 SLLRKEGVVIVAIGVGTG-FLKSELINIASSEEYVFTTSSFNKLSKIMENVVKLAC 428 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 26/216 (12%), Positives = 59/216 (27%), Gaps = 14/216 (6%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + ++ LV+S +N S Sbjct: 30 PGNTGKDAEECDVQADIIVLFDDSSSIQYDNKENYQMMKNFVKELVDSFTTVGVNGRNGS 89 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNEKE 293 + G + ++ G+ ++K + + P T + H ++ E Sbjct: 90 -QFGVVQFSQGVKTAFPLNKFKTKEDIKKGIQDMVPRNGGQTEIGTGLQHVRENSFSGAE 148 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 G+ +K VI +TDG+++ ++ G+ + ++ + Sbjct: 149 GG----GNPDKQKIVILMTDGKSNAG------APPQHEAHKLKAEGVTVIAIGIGQG-FV 197 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + L + N EL + D E Sbjct: 198 KTELEQIATMKNYVLTTNSFSELSTLLKLVIDLACE 233 >gi|319902109|ref|YP_004161837.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417140|gb|ADV44251.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 327 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 64/205 (31%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + + L + + T Sbjct: 108 NRLEAAKDVATEFINGRPNDNIGITLFAGESFTQCPLTVDHAVLLNLLKDMKCGFIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ L E Sbjct: 168 AIGMGLANAVTRL----------KDSKAKSKVIILLTDGVNNKGDIS-----PLTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP---------------------EGQDLLRKCTDSSGQFFAVNDSR 374 ++ G+++Y++ V + + L + + G +F + Sbjct: 213 KSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTEGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|160882770|ref|ZP_02063773.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|237720676|ref|ZP_04551157.1| BatA [Bacteroides sp. 2_2_4] gi|260170239|ref|ZP_05756651.1| aerotolerance protein BatA [Bacteroides sp. D2] gi|293373990|ref|ZP_06620331.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|299145608|ref|ZP_07038676.1| BatA protein [Bacteroides sp. 3_1_23] gi|315918602|ref|ZP_07914842.1| conserved hypothetical protein [Bacteroides sp. D2] gi|156111794|gb|EDO13539.1| hypothetical protein BACOVA_00731 [Bacteroides ovatus ATCC 8483] gi|229449511|gb|EEO55302.1| BatA [Bacteroides sp. 2_2_4] gi|292631066|gb|EFF49703.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|298516099|gb|EFI39980.1| BatA protein [Bacteroides sp. 3_1_23] gi|313692477|gb|EFS29312.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 327 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 27/205 (13%), Positives = 63/205 (30%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + ++ + + T Sbjct: 108 NRLEAAKDVAAEFINGRPNDNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKCGLITDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ + E Sbjct: 168 AVGMGIANAVTRL----------KDSKAKSKVIILLTDGTNNKGDIS-----PMTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP--------------------EGQDLLRKCT-DSSGQFFAVNDSR 374 ++ G+++Y++ V + L + + G +F + Sbjct: 213 KSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVSMPVEIDEKTLTEIAGTTDGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|19033105|gb|AAL83537.1|AF414454_1 proximal thread matrix protein 1 variant a [Mytilus edulis] Length = 441 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 27/236 (11%), Positives = 72/236 (30%), Gaps = 18/236 (7%) Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + + + + + + A ++ Sbjct: 206 NYVLTTNSFSELSTLLKLVIDLACEVCVVDCAGHADIAFVFDASSSIN-ANNPNNYGLMK 264 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-- 272 + ++V+ K + +V + + ++ E+K ++K+ P Sbjct: 265 DFMKDIVDRFNKTGPDGTQFAV----VTFADRATKQFGLKDYSSKAEIKGAIDKVTPSII 320 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T + +A E++ + ++K VI +TDG+N+G + ++ Sbjct: 321 GQTAIGDGLENARLEVFPNRNGGGR----EEVQKVVILLTDGQNNGHKSPEHES------ 370 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 +R G+ I ++ V + L S F + +L + + + Sbjct: 371 SLLRKEGVVIVAIGVGTG-FLKSELINIASSEEYVFTTSSFNKLSKIMENVVKLAC 425 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 26/216 (12%), Positives = 59/216 (27%), Gaps = 14/216 (6%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + ++ LV+S +N S Sbjct: 27 PGNTGKDAEECDVQADIIVLFDDSSSIQYDNKENYQMMKNFVKELVDSFTTVGVNGRNGS 86 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNEKE 293 + G + ++ G+ ++K + + P T + H ++ E Sbjct: 87 -QFGVVQFSQGVKTAFPLNKFKTKEDIKKGIQDMVPRNGGQTEIGTGLQHVRENSFSGAE 145 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 G+ +K VI +TDG+++ ++ G+ + ++ + Sbjct: 146 GG----GNPDKQKIVILMTDGKSNAG------APPQHEAHKLKAEGVTVIAIGIGQG-FV 194 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + L + N EL + D E Sbjct: 195 KTELEQIATMKNYVLTTNSFSELSTLLKLVIDLACE 230 >gi|262371960|ref|ZP_06065239.1| von Willebrand factor type A domain-containing protein [Acinetobacter junii SH205] gi|262311985|gb|EEY93070.1| von Willebrand factor type A domain-containing protein [Acinetobacter junii SH205] Length = 537 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 36/359 (10%), Positives = 100/359 (27%), Gaps = 29/359 (8%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 S + +K T + + E + + + Sbjct: 30 VAMSPPPTAHMSSEMARKMVTPSYIAAIPAPERVKLEANTEKYQKNEVNPVHRVAERAVS 89 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + Y L L P + + + + I V + Sbjct: 90 TFSIDVD-TGSYTNTRRFLNDGRLPPVDAIRVEEMINYFDYQYPQPVGIHPFSVTTETVD 148 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 K + + P + + + S P K+ ++ ++ Sbjct: 149 SPWKENAKLIKIGIQAKDLSVKQLPAANLVFLVDVSGSMDDP------DKLPLVKQTLRI 202 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 L ++ ++ I Y G + +++ +++L T+ Sbjct: 203 LTEQLRPQD--------KVTIITYASGEKLVLEPTSGDQKDKILRVIDELRAGGATSGEQ 254 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A+ AY++ + ++ TDG+ + +TL + R +G Sbjct: 255 AIQLAYKQAEKAFIKNGINR--------ILLATDGDFNVGITDFSTL--KGMVAEKRKSG 304 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + ++ ++L+ + D+ G + +++ E + ++ +A + Sbjct: 305 VSLTALGYGTGNYNEELMEQIADAGDGNYSYIDNKNEAKKVVQ---RQLSSTLATVAQD 360 >gi|197336671|ref|YP_002158318.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197313923|gb|ACH63372.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 321 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 70/204 (34%), Gaps = 47/204 (23%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + ++ + + + ++ + R+G + + TPL+ + N Sbjct: 102 TSNGDFVDRLTAVKQVVSDFIDQRK---------GDRLGLVLFGDHAYLQ--TPLTFDRN 150 Query: 261 EVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V+ +L++ L T + A + ++ +I ++DG N+ Sbjct: 151 TVREQLDRTVLRLVGQMTAMGEGLGLATKTFIESN----------APQRTIILLSDGANT 200 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKC 360 L L+ + ++ KIY+V + A +D L K Sbjct: 201 AGV-----LEPLEAAQLAKDNHAKIYTVGIGAGEMQVRGFFGKQTVNTARDLDEDTLTKI 255 Query: 361 TD-SSGQFFAVNDSRELLESFDKI 383 + GQ+F ++ EL E + I Sbjct: 256 ATMTGGQYFRARNADELAEIYQTI 279 >gi|163801668|ref|ZP_02195566.1| hypothetical protein 1103602000597_AND4_09447 [Vibrio sp. AND4] gi|159174585|gb|EDP59387.1| hypothetical protein AND4_09447 [Vibrio sp. AND4] Length = 524 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 30/319 (9%), Positives = 77/319 (24%), Gaps = 52/319 (16%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+++ +A++ + ++++ + +AA L+ P Sbjct: 22 MALLLVPIMGITFWAVEGTRYIQESSRLRDSAEAAALAVTIE-------DKPGAASVMAE 74 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + ++ ++ + + +QY + ++ N F+ Sbjct: 75 NYVRSYVRDIKSINVQAERREPGNSRNEEAA------DFIQYTVNATTTHDSWFANSFIP 128 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY-------LQKHNDNNNM 174 + + S I I V D S SM + Sbjct: 129 SFDETQDIAGRSLAR-KYLSSVGGKNIDIVFVSDFSGSMNFDWMDPNGNKKIDDLKTAIR 187 Query: 175 TSNKYLLP-----------------PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + + K + + P + K + + Sbjct: 188 AISNKFICQDVRNEFVEGELKPVCHDQEDGYTADKLKNRVGFVPFNSRTREKRGSSVYAT 247 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN-----------EVKSRL 266 L + ++S + Y + + N N E K + Sbjct: 248 SQLKYK-DNYKTDISSVSYKGVDWDYWTRFRSEEIKNCATNSNFCEAPRQERHLEAKRAI 306 Query: 267 NKLNPYENTNTYPAMHHAY 285 + L +T + + Sbjct: 307 DVLE--YHTLPSYGIDWGF 323 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 39/291 (13%), Positives = 85/291 (29%), Gaps = 28/291 (9%) Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 G +P R + + L D+S + K Sbjct: 228 GFVPFNSRTREKRGSSVYAT--SQLKYKDNYKTDISSVSYKGVDWDYWTRFRSEEIKNCA 285 Query: 182 PPPPKK--------SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + +Y P+ + ++ S+ +K Sbjct: 286 TNSNFCEAPRQERHLEAKRAIDVLEYHTLPSYGIDWGFMDSAEYVDVDESVNTMFIDKSK 345 Query: 234 LSVRIGTIAYN---------IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 +A + + L+N E+K ++ + T + + Sbjct: 346 EKSNYYRVAPDIKLFGVPEDFDELQFNNISLTNRSEEIK-AIDLMWANGATAAFQGILRG 404 Query: 285 YRELYNEKESSHNTIGSTRLK---KFVIFITDGENS-GASAYQNTLNTLQICEYMRNA-- 338 + LY+ K + K + ++DGE + + +L +C+ R+ Sbjct: 405 SQILYDGKPDGSDQQELQAYNDKLKIMFILSDGEETPHDNGILKSLVKAGMCDKARDKIP 464 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 G+ I + + QD ++C S Q + D ++ E +KI + I + Sbjct: 465 GLYIGFIGIDFHATQQDAFKQCVLDSEQ--DIIDVHDVDELIEKIEELIGK 513 >gi|254819550|ref|ZP_05224551.1| hypothetical protein MintA_06484 [Mycobacterium intracellulare ATCC 13950] Length = 335 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 55/152 (36%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N + K+ L+KL + T T + A + + + G ++ +DG+ Sbjct: 155 TTNRDSTKAALDKLQFADRTATGEGIFTALQAIATV--GAVIGGGDKPPPARIVLFSDGK 212 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + N ++ G+ I +++ P P + L+K Sbjct: 213 ETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLKKVAQ 272 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G + +EL + + +I ++++ Sbjct: 273 LSGGNAYNAASLQELKAVYATLQQQIGYETIK 304 >gi|209809314|ref|YP_002264852.1| hypothetical protein VSAL_II0524 [Aliivibrio salmonicida LFI1238] gi|208010876|emb|CAQ81278.1| putative membrane protein [Aliivibrio salmonicida LFI1238] Length = 320 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 32/203 (15%), Positives = 69/203 (33%), Gaps = 47/203 (23%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + ++ + + + + R+G + + TPL+ + N V+ Sbjct: 104 SGFVDRLTAVKRVVSDFIEKRK---------GDRLGLVLFGDHAYLQ--TPLTFDRNTVQ 152 Query: 264 SRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +LN+ L T + A + ++ +I ++DG N+ Sbjct: 153 EQLNRTVLGLVGQRTAIGEGLGLATKTFIESN----------APQRTIILLSDGANTAGV 202 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD- 362 L+ ++ + ++ KIY+V + A +D L K Sbjct: 203 -----LDPIEAAQLAKDNNAKIYTVGIGAGEMQVRGFFGNQTVNTARDLDEDTLTKIATM 257 Query: 363 SSGQFFAVNDSRELLESFDKITD 385 + GQ+F ++ EL E + I Sbjct: 258 TGGQYFRARNADELAEIYQTIDK 280 >gi|325678004|ref|ZP_08157643.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324110284|gb|EGC04461.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 812 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 37/335 (11%), Positives = 95/335 (28%), Gaps = 34/335 (10%) Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK---NNPLQYIAESKAQYEIP 114 T I + +L++ + + + + ++ Y I Sbjct: 110 AVTEVIVSMEATGNLQKTTTVESIMNKDMLCTGVVGLVGEPFSIETTSDYEKATLTYVID 169 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND--NN 172 L + L V + L + + Sbjct: 170 KNKLGDTEFDNLMFLWYDEK-KDEFVELDTILDEENSTVSINTPHFSKYMLVDKKEWFDA 228 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 ++ Y + + RK+ + N ++S+ Sbjct: 229 WKRASLYFQDEYEPLAAAICYDCSGSMSGNDPKGYRKL-----AIDNFIDSMTLTD---- 279 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + I++ + S+N E+K +N TN ++ A +L Sbjct: 280 ----KTALISFEDEA--KLVSEFSDNKEELKGLVNPYF-GGGTNVRASVEMAIEQLNTV- 331 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + +I ++DG+ + N + + + +KI+++ + + + Sbjct: 332 --------QHWYTRHIILLSDGDVNININLANNTVD-DLIKKAVDNNIKIHTIGLGSGAD 382 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 Q L+ C + + GQ+F + +L + ++ Sbjct: 383 NQK-LKDCAEYTGGQYFTAETAEKLDAIYKDLSKN 416 >gi|307721534|ref|YP_003892674.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306979627|gb|ADN09662.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 303 Score = 82.3 bits (201), Expect = 1e-13, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 68/217 (31%), Gaps = 24/217 (11%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + Y K+ +V L ++ I ++ N Sbjct: 65 ASPIIYDQKTSSKRKGRDLVFALDTSGSMAESGFNPENVQNRKFDALKELLRSFITKRYN 124 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYN 290 +V + G PLS ++ V L+ + ++T + A + L Sbjct: 125 DNVGVSIF----GTYAYPAIPLSYDMGSVAFLLDFFDVGIAGDSTAIGEGLAMALKIL-- 178 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA- 349 +K +I ITDG + + + + + +KIY++ + Sbjct: 179 --------KKGEAKEKVIILITDGYQNSGAVS-----VKEAVQKAKKQHVKIYTIGIGDR 225 Query: 350 PPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 +LL+ ++ + F + + L + + +I Sbjct: 226 SAFDANLLQLIAKNTDAKMFEAKNVKMLQDIYKEIDK 262 >gi|226366409|ref|YP_002784192.1| hypothetical protein ROP_70000 [Rhodococcus opacus B4] gi|226244899|dbj|BAH55247.1| hypothetical membrane protein [Rhodococcus opacus B4] Length = 328 Score = 81.9 bits (200), Expect = 1e-13, Method: Composition-based stats. Identities = 25/222 (11%), Positives = 66/222 (29%), Gaps = 26/222 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S A ++ ++A + + + I Sbjct: 89 NRATVILVIDVSLSMKATDVEPTRLAAAQDAAKSFADGLTPGINLGLVAF---------- 138 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + N K ++ L E T T A+ + + + ++ Sbjct: 139 AGTASVLVSPTTNREATKVAIDNLQLSERTATGEAIFTSLQSIDT--LAAVLGGSDQAPP 196 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA--------------PP 351 ++ ++DG+ + + ++ + I +++ P Sbjct: 197 ARIVLLSDGKQTVPENPDDPRGGFTAARQAKDKDVPISTISFGTSYGKVEIEDERIPVPV 256 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + L S G FF + EL + +D + ++I ++ R Sbjct: 257 DDPSLREIANLSGGSFFTASSLEELRDVYDTLEEQIGFETTR 298 >gi|41407305|ref|NP_960141.1| hypothetical protein MAP1207 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118463234|ref|YP_882479.1| hypothetical protein MAV_3297 [Mycobacterium avium 104] gi|81414471|sp|Q740Y5|Y1207_MYCPA RecName: Full=UPF0353 protein MAP_1207 gi|41395657|gb|AAS03524.1| hypothetical protein MAP_1207 [Mycobacterium avium subsp. paratuberculosis K-10] gi|118164521|gb|ABK65418.1| protein Nfa34780 [Mycobacterium avium 104] Length = 335 Score = 81.9 bits (200), Expect = 1e-13, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 68/222 (30%), Gaps = 26/222 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A A ++ E+A + + I Sbjct: 95 NRAVVMLVIDVSQSMRATDVAPNRMAAAQEAAKQFADELTPGIN----------LGLIAY 144 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + N K+ L+KL + T T + A + + + G Sbjct: 145 AGTATVLVSPTTNREATKNALDKLQFADRTATGEGIFTALQAIATV--GAVIGGGDKPPP 202 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PE 352 ++ +DG+ + + N ++ G+ I +++ P P Sbjct: 203 ARIVLFSDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPV 262 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + L+K S G + +EL + + +I ++++ Sbjct: 263 DDETLKKVAQLSGGNAYNAASLQELKSVYATLQQQIGYETIK 304 >gi|325268973|ref|ZP_08135594.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] gi|324988594|gb|EGC20556.1| aerotolerance protein BatA [Prevotella multiformis DSM 16608] Length = 318 Score = 81.9 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 50/147 (34%), Gaps = 30/147 (20%) Query: 254 PLSNNLNEVKSRLNKLNPY--------ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P++ + + + L+ + P + T + +A L + K S Sbjct: 143 PMTLDHAALLNLLHNVRPDLVTSGLMKDGTAIGMGLANAVSRLQDSKAKS---------- 192 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS------APPEGQDLLRK 359 K VI +TDG N+ S + + G+++Y++ L+ Sbjct: 193 KIVILLTDGSNNVGSIS-----PMTAAAIAKKFGIRVYTIGFGRETGEEIGAIDYRALQN 247 Query: 360 CT-DSSGQFFAVNDSRELLESFDKITD 385 ++G+F+ EL + I Sbjct: 248 IAVSTNGEFYRAQSQAELSRIYQDIDK 274 >gi|118349484|ref|XP_001008023.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289790|gb|EAR87778.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 646 Score = 81.9 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 22/123 (17%), Positives = 44/123 (35%), Gaps = 10/123 (8%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N ++S +N + T+ M A+ L + ++DG+++ Sbjct: 267 NTPNIQSIINSITADGGTDINSGMLMAFNILQ--------KRQFFNPVSSIFLLSDGQDN 318 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 GA + ++N I+S + +G + R C G F+ V ++ Sbjct: 319 GADEKIKKYINSN--QSLKNECFSIHSFGFGSDHDGPLMNRICQLKDGNFYYVEKINQVD 376 Query: 378 ESF 380 E F Sbjct: 377 EFF 379 >gi|329117975|ref|ZP_08246688.1| von Willebrand factor type A domain protein [Neisseria bacilliformis ATCC BAA-1200] gi|327465863|gb|EGF12135.1| von Willebrand factor type A domain protein [Neisseria bacilliformis ATCC BAA-1200] Length = 562 Score = 81.9 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 40/375 (10%), Positives = 97/375 (25%), Gaps = 34/375 (9%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGS-----YIRENAG 83 + A AA + ++ K T++ + K+L E Sbjct: 35 LYEAAPAAARDSVMYERAPVAMQSNLAAKRSGVTLYGQLHAKYLPAAEARPLRTDTERYQ 94 Query: 84 DIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSS 143 + + ++ + + Y L L P + Sbjct: 95 KQPENPVKAVAQEPVSTFSIDVD-TGSYANVRRFLNNGRLPPKDAVRIEEIVNYFPYSYP 153 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 V + KK N Sbjct: 154 LPQDGRPFAV------HTQTVDSPWQSEAKLIKIGIQAQDTAKKDLPPANLVFLVDVSGS 207 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K+ ++ ++ L ++ ++ I Y G + + + Sbjct: 208 MTDPDKLPLVKKTLRILTEQLRPQD--------KVTLITYASGEQLVLPPTSGKDKDTIL 259 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 LN L+ T+ A+ AY + + +I TDG+ + + Sbjct: 260 RALNALHAGGATSGERALRMAYEQAEKAYVKNGINR--------IILATDGDFNVGVSDT 311 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 TL + R +G+ + ++ + ++ + D+ G + ++ +E + Sbjct: 312 ETL--KSLVAEKRKSGISLSTLGYGTGNYNEAMMEQIADAGDGNYSYIDSEKEARKVLR- 368 Query: 383 ITDKIQEQSVRIAPN 397 ++ +A + Sbjct: 369 --HQLTSTLATVAQD 381 >gi|189465623|ref|ZP_03014408.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] gi|189437897|gb|EDV06882.1| hypothetical protein BACINT_01981 [Bacteroides intestinalis DSM 17393] Length = 327 Score = 81.9 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 63/205 (30%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + + + + T Sbjct: 108 NRLEAAKDVAAEFINGRPNDNIGITLFAGESFTQCPLTVDHAVLLNLFQGIKCGIIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ L E Sbjct: 168 AVGMGIANAVTRL----------KDSKAKSKVIILLTDGTNNKGDIS-----PLTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP---------------------EGQDLLRKCTDSSGQFFAVNDSR 374 ++ G+++Y++ V + + L + + G +F + Sbjct: 213 KSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVEIDEKTLTQIAATTEGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|284046349|ref|YP_003396689.1| von Willebrand factor A [Conexibacter woesei DSM 14684] gi|283950570|gb|ADB53314.1| von Willebrand factor type A [Conexibacter woesei DSM 14684] Length = 319 Score = 81.9 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 33/224 (14%), Positives = 77/224 (34%), Gaps = 40/224 (17%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 +++ + T S A ++ +A V+ + + V +G I++N Sbjct: 84 ERASIALVTDVSGSMLATDVQPNRMIAAKRAARRFVDEVPRT--------VNLGVISFNN 135 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 Q + N ++V + +++L T T A+ A L + R Sbjct: 136 TATVLQSP--TRNRSDVLTAIDRLAVSGGTATGEAIATATEML-----RNQPGENGRRPP 188 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--------------- 350 ++ I+DG ++ + ++ R + IY+VA Sbjct: 189 SAIVLISDGTSTNGR------DPIEAAAEARRLRIPIYTVAFGTDQGTITVPGRDGVERT 242 Query: 351 ---PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQ 390 P L + + + G+ F + + L F+++ ++ + Sbjct: 243 ERVPPDPTALAQIAEMTGGETFTADSADRLDTVFERLGSQLGTR 286 >gi|224539999|ref|ZP_03680538.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] gi|224518389|gb|EEF87494.1| hypothetical protein BACCELL_04911 [Bacteroides cellulosilyticus DSM 14838] Length = 327 Score = 81.9 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 63/205 (30%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + + + + T Sbjct: 108 NRLEAAKDVAAEFINGRPNDNIGITLFAGESFTQCPLTVDHAVLLNLFQGIKCGIIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ L E Sbjct: 168 AVGMGIANAVTRL----------KDSKAKSKVIILLTDGTNNKGDIS-----PLTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP---------------------EGQDLLRKCTDSSGQFFAVNDSR 374 ++ G+++Y++ V + + L + + G +F + Sbjct: 213 KSFGIRVYTIGVGTNGMAPYPYPVGNTVQYVNMPVEIDEKTLTQIAATTEGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|119599629|gb|EAW79223.1| hCG1743181 [Homo sapiens] Length = 1211 Score = 81.9 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 41/331 (12%), Positives = 98/331 (29%), Gaps = 31/331 (9%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 IKK K+ R + QI + + + ++ +F Sbjct: 258 KAYTGAAIKKLRKEVFSARNGSRKNQGVPQIAVLVTHRDSEDNVTKAAVNLRREGVTIFT 317 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 G+ ++ T L ++ E+ L + + K N + Sbjct: 318 LGIEGTSDTQLEKIASHPAEQYVSKLKTFADLAAHNQTFL------KKLRNQITHTVSVF 371 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + + + +V A + VR+G Sbjct: 372 SERTETLKSDIYL----LIDGSGSTQATDFHEMKTFLSEVVGMFNIAPHK-----VRVGA 422 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTI 299 + Y +N ++ + + TNT A++ L K+ N Sbjct: 423 VQYADSWDLEFEINKYSNKQDLGKAIENIRQMGGNTNTGAALNFTLSLLQKAKKQRGNK- 481 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + ++ +T+G + + L+ +R +++Y++ + Q LR+ Sbjct: 482 ----VPCHLVVLTNGM--------SKDSILEPANRLREEHIRVYAIGI--KEANQTQLRE 527 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + V+D L + +++ +I + Sbjct: 528 IAGEEKRVYYVHDFDALKDIRNQVVQEICTE 558 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 53/162 (32%), Gaps = 13/162 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR G + Y EV S L T T A+ + Sbjct: 818 DVGKNQVRFGALKYADDPEVLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHMF 877 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + + +I ITDGE S + LN + +R+ G+ + +AV Sbjct: 878 TE----ARGSRLNKGVPQVLIVITDGE----SHDADKLNAT--AKALRDKGILV--LAVG 925 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L SS ++F V L F +T + Sbjct: 926 IDGANPVELLAMAGSSDKYFFVETFGGLKGIFSDVTASVCNS 967 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 59/159 (37%), Gaps = 15/159 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYR 286 + VRIG ++ E+ ++ + NT+ A+ R Sbjct: 1024 DFDVSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNTHIGAAL----R 1079 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 E+ + + +T + ++ +TDG++ Q E +R+ G+ IYSV Sbjct: 1080 EVEHYFRPDMGSRINTGTPQVLLVLTDGQSQD--------EVAQAAEALRHRGIDIYSVG 1131 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + L + T ++ + V++ EL + +I Sbjct: 1132 IG--DVDDQQLIQITGTAEKKLTVHNFDELKKVNKRIVT 1168 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + K Sbjct: 635 DRVQIGVVQFSDINKEEFQLNRFMSQSDISNAIDQMAHIGQTTLTGSALSFVSQYFSPTK 694 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE + +R G+ IYSV + Sbjct: 695 GARP------NIRKFLILITDGEAQD--------IVKEPAVVLRQEGVIIYSVGVFGSNV 740 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 741 ---TQLEEISGRPEMVFYVENFDILQRIEDDLVFGICSPR 777 >gi|254776724|ref|ZP_05218240.1| hypothetical protein MaviaA2_18936 [Mycobacterium avium subsp. avium ATCC 25291] Length = 335 Score = 81.9 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 66/222 (29%), Gaps = 26/222 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ + ++ E+ + + AI Sbjct: 95 NRAVVMLVIDVSESMASTDVPPNRLAAAKEAGKQFADQLTPAINLGLVEF---------- 144 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + N + VK+ ++ L P T T + A + + S G Sbjct: 145 AANATLLVPPTTNRSAVKAGIDSLQPAPKTATGEGIFTALQAIATV--GSVMGGGEGPPP 202 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--------------P 351 ++ +DG + + G++I +++ P Sbjct: 203 ARIVLESDGAENVPLDPNAPQGAFTAARAAKAEGVQISTISFGTPYGTVDYEGATIPVPV 262 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + Q L + C + GQ F + L + + +I ++V+ Sbjct: 263 DDQTLQKICEITDGQAFHADSLDSLKNVYSTLQRQIGYETVK 304 >gi|326382237|ref|ZP_08203929.1| hypothetical protein SCNU_04806 [Gordonia neofelifaecis NRRL B-59395] gi|326198967|gb|EGD56149.1| hypothetical protein SCNU_04806 [Gordonia neofelifaecis NRRL B-59395] Length = 330 Score = 81.9 bits (200), Expect = 2e-13, Method: Composition-based stats. Identities = 23/227 (10%), Positives = 69/227 (30%), Gaps = 31/227 (13%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S + A ++ + + + I + Sbjct: 86 NRATVMLVVDVSNSMKSTDVAPSRLKAAQAAGKRFADDLTDGIN----------LGLISF 135 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + + + K+ L++L + T T + A +++ ++ S Sbjct: 136 AGTASTLVSPTPDHSATKNALDRLKLADKTATGEGIFAALQQIDT--LNAVLGGPSGAPP 193 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---------------- 349 ++ ++DG+ + + + + G+ + +++ Sbjct: 194 ARIVLLSDGKQTVPESPDDPRGGFTAARKAKEKGVPVSTISFGTLTGTVDLETPGGGVER 253 Query: 350 ---PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 P + + L + S G FF + EL + + + +I + R Sbjct: 254 VPVPVDDESLRKIANLSGGDFFTASSLDELNKVYSTLQKQIGYERER 300 >gi|319952789|ref|YP_004164056.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421449|gb|ADV48558.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 332 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 56/193 (29%), Gaps = 41/193 (21%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE---NT 275 N + S++K + TP++++ V + L ++ T Sbjct: 111 NRLASLKKVAADFIKKRPNDRIGLVVYAGESYTKTPITSDKGIVLNALKEITYGSLEDGT 170 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + + L S L K +I +TDG N+ + E Sbjct: 171 AIGMGLATSVNRL----------KESKALSKVIILLTDGINNSG-----FIEPQTAAELA 215 Query: 336 RNAGMKIYSVAVSAPPE----------------------GQDLLRKCT-DSSGQFFAVND 372 +K Y++ + + LL + + G +F + Sbjct: 216 VEYDIKTYTIGLGTNGNALSPIAINSDGSFRYGMKPVEIDEGLLEQIAKTTGGAYFRATN 275 Query: 373 SRELLESFDKITD 385 + L +D+I Sbjct: 276 NESLASIYDEINK 288 >gi|329954838|ref|ZP_08295855.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] gi|328526942|gb|EGF53953.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] Length = 327 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 64/205 (31%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + + + + + T Sbjct: 108 NRLEAAKDVAAEFINGRPNDNVGITLFAGESFTQCPLTVDHAVLLNLIKDVKCGLIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ L E Sbjct: 168 AVGMGIANAVTRL----------KDSKAKSKVIILLTDGTNNRGEIS-----PLTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP---------------------EGQDLLRKCTDSSGQFFAVNDSR 374 ++ G+++Y++ V + + L + + G +F + Sbjct: 213 KSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|254775742|ref|ZP_05217258.1| hypothetical protein MaviaA2_13890 [Mycobacterium avium subsp. avium ATCC 25291] Length = 335 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 54/152 (35%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N K+ L+KL + T T + A + + + G ++ +DG+ Sbjct: 155 TTNREATKNALDKLQFADRTATGEGIFTALQAIATV--GAVIGGGDKPPPARIVLFSDGK 212 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + N ++ G+ I +++ P P + L+K Sbjct: 213 ETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLKKVAQ 272 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G + +EL + + +I ++++ Sbjct: 273 LSGGNAYNAASLQELKSVYATLQQQIGYETIK 304 >gi|167626845|ref|YP_001677345.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25017] gi|167596846|gb|ABZ86844.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25017] Length = 333 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 76/205 (37%), Gaps = 47/205 (22%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + D+++ A +++ R+G I + TPL+ ++ Sbjct: 112 SNGKMESRFDLVMRVANEFLDT---------RQGDRVGLILFGTWAYLQ--TPLTFDIPT 160 Query: 262 VKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 VK L+ + P T A+ A ++L S K ++ +TDGEN+ Sbjct: 161 VKKMLDDASIALPGPQTAIGDAIGLAVKKLKRYPGDS----------KALVLLTDGENNS 210 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVS-----------------APPEGQDLLRKCT 361 L LQ E + +KIY++ + + ++L+K Sbjct: 211 G-----ALQPLQAAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNTSEDLDTEVLQKIA 265 Query: 362 D-SSGQFFAVNDSRELLESFDKITD 385 + G+FF +S +L + ++ I Sbjct: 266 TMTGGKFFRAQNSADLKQVYESIDQ 290 >gi|284040938|ref|YP_003390868.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283820231|gb|ADB42069.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 359 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 33/151 (21%), Positives = 59/151 (39%), Gaps = 24/151 (15%) Query: 254 PLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTR------ 303 PL+ + N + LN LN T A+ + + +S +T + Sbjct: 168 PLTTDYNLLNQYLNDLNDGMIRTSGTAIGDALARCINRMRDRPAASSDTTQAKTEQWKSE 227 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--------GQD 355 K +I ++DG+N+ + L+ + + +KIY++AV P + Sbjct: 228 RSKVIILLSDGDNTAGN-----LDPITAASLAKAFNIKIYTIAVGQPVASASEASTVDEG 282 Query: 356 LLRKCTDSS-GQFFAVNDSRELLESFDKITD 385 +L+K G FF DS L F +I+ Sbjct: 283 ILKKIATIGKGSFFRAVDSGRLKTVFAQISQ 313 >gi|226306560|ref|YP_002766520.1| hypothetical protein RER_30730 [Rhodococcus erythropolis PR4] gi|226185677|dbj|BAH33781.1| conserved hypothetical membrane protein [Rhodococcus erythropolis PR4] Length = 326 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 55/152 (36%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N + K ++ L E T T A+ + + + + ++ ++DG+ Sbjct: 147 TTNRDATKVAIDNLKLSERTATGEAIFTSLQSIDTLS--AVLGGSDQAPPARIVLLSDGK 204 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + ++ G+ I +++ P L++ + Sbjct: 205 QTVPENSDDPRGGFTAARQAKDKGVPISTISFGTTYGRVEIEGDRIPVPVDDASLKEIAN 264 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G FF + EL + +D + ++I ++ R Sbjct: 265 LSGGSFFTASSLEELRQVYDTLEEQIGFETTR 296 >gi|254875972|ref|ZP_05248682.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] gi|254841993|gb|EET20407.1| conserved hypothetical protein [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 339 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 76/205 (37%), Gaps = 47/205 (22%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + D+++ A +++ R+G I + TPL+ ++ Sbjct: 118 SNGKMESRFDLVMRVANEFLDT---------RQGDRVGLILFGTWAYLQ--TPLTFDIPT 166 Query: 262 VKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 VK L+ + P T A+ A ++L S K ++ +TDGEN+ Sbjct: 167 VKKMLDDASIALPGPQTAIGDAIGLAVKKLKRYPGDS----------KALVLLTDGENNS 216 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVS-----------------APPEGQDLLRKCT 361 L LQ E + +KIY++ + + ++L+K Sbjct: 217 G-----ALQPLQAAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNTSEDLDTEVLQKIA 271 Query: 362 D-SSGQFFAVNDSRELLESFDKITD 385 + G+FF +S +L + ++ I Sbjct: 272 TMTGGKFFRAQNSTDLKQVYESIDQ 296 >gi|241667423|ref|ZP_04755001.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Francisella philomiragia subsp. philomiragia ATCC 25015] Length = 333 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 38/205 (18%), Positives = 76/205 (37%), Gaps = 47/205 (22%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + D+++ A +++ R+G I + TPL+ ++ Sbjct: 112 SNGKMESRFDLVMRVANEFLDT---------RQGDRVGLILFGTWAYLQ--TPLTFDIPT 160 Query: 262 VKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 VK L+ + P T A+ A ++L S K ++ +TDGEN+ Sbjct: 161 VKKMLDDASIALPGPQTAIGDAIGLAVKKLKRYPGDS----------KALVLLTDGENNS 210 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVS-----------------APPEGQDLLRKCT 361 L LQ E + +KIY++ + + ++L+K Sbjct: 211 G-----ALQPLQAAELAKQYHIKIYTIGLGGGQMMVKTTFGERLVNTSEDLDTEVLQKIA 265 Query: 362 D-SSGQFFAVNDSRELLESFDKITD 385 + G+FF +S +L + ++ I Sbjct: 266 TMTGGKFFRAQNSTDLKQVYESIDQ 290 >gi|218662717|ref|ZP_03518647.1| hypothetical protein RetlI_26604 [Rhizobium etli IE4771] Length = 295 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 36/293 (12%), Positives = 90/293 (30%), Gaps = 41/293 (13%) Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 S + ++LD + SM + K+ + Sbjct: 9 SGTATAEYQTAAFMDFYILLDNTPSMGVGATPDDVSKLEAKAGCAFACHQMDKTINNYTI 68 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG-NQCT 253 KS +IDV+ ++ L ++ + + + T + Sbjct: 69 AKS------LGVAMRIDVVRQATQALTDTAKTERVSSDQFRMGVYTFGTKAEDAKLTTIS 122 Query: 254 PLSNNLNEVKSRLNKLNPY------ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 L+++L +VK+ + ++ N + A ++ + + + + +K Sbjct: 123 GLTSDLTKVKNYTDAVDLMTIPYQNYNNDQITNFDSAMTQMNTIIDQAGDGTSNISAEKI 182 Query: 308 VIFITDGENSGASAYQNTLNTLQ----------ICEYMRNAGMKI---YSVAVSAPPEG- 353 + F++DG T T C+ +++ G+KI Y+ + P Sbjct: 183 LFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLYTTYLPLPSNSW 242 Query: 354 ------------QDLLRKCTDSSGQFFAVNDSRELLESFDKIT-DKIQEQSVR 393 ++ C G +F V+ + + ++ + I+ + Sbjct: 243 YNTWIKPFQGEIPTKMQACAS-PGFYFEVSPTEGITDAMKALFLKVIRAPRIT 294 >gi|330447847|ref|ZP_08311495.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328492038|dbj|GAA05992.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 321 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 73/207 (35%), Gaps = 47/207 (22%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + ++ + + + + R+G + + TPL+ + Sbjct: 101 VTKNGQSIDRLTAVKHVLSDFIEKRK---------GDRLGLVLFADHAYLQ--TPLTFDR 149 Query: 260 NEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N V+ +L++ L +T + A + S ++ +I ++DG N Sbjct: 150 NTVEQQLDRTVLGLIGQSTAIGEGLGIATKTFI----------NSKAPQRVIILLSDGAN 199 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRK 359 + ++ L+ + + +G+KIY+V V A + L + Sbjct: 200 TSGV-----IDPLEAAKLAKESGVKIYTVGVGADQMVQKGFFGDRLVNPSQDLDEKTLTE 254 Query: 360 CTD-SSGQFFAVNDSRELLESFDKITD 385 + G++F + ++L + +D I Sbjct: 255 IAKMTGGEYFRARNPQQLEKIYDIINK 281 >gi|226314649|ref|YP_002774545.1| hypothetical protein BBR47_50640 [Brevibacillus brevis NBRC 100599] gi|226097599|dbj|BAH46041.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 513 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 39/345 (11%), Positives = 88/345 (25%), Gaps = 31/345 (8%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 S + + Q + K+ L Y ++ + KD+ Sbjct: 37 SASVEQGQSNQVASSPSPPSQLADYALKKSGDPLPNDMYFKDYGTNQFVSTA----KDRL 92 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + + A Y I + L P+ + + + D Sbjct: 93 STFAADVD-TASYTIMRHFIKDGNLPPAEAVRVEEFINFFPTSYPAPTNQTFAIQADSGP 151 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 S K+ + L P N +++++ +S Sbjct: 152 SPFQ----KNLQIVRIGIKGKELSPKE---RKPANLVFVIDVSGSMNQENRLELVKKSLH 204 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 LV+ +Q + Y + + + S +++L P +TN Sbjct: 205 VLVDQLQPTDSVG--------IVVYGSEGRVLLPPTSTEDKQAILSAIDELQPEGSTNAE 256 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 + Y + VI +DG + + L+ E Sbjct: 257 QGLVLGYEMAARSFKPPAINR--------VILCSDGVANVGETGAEGI--LRSIEDYARK 306 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 + + S ++ + + G + ++ E F + Sbjct: 307 DIYLSSFGFGMGNYNDVMMEQLANKGEGSYAYIDTFSEARRIFTE 351 >gi|121637412|ref|YP_977635.1| hypothetical protein BCG_1543 [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224989887|ref|YP_002644574.1| hypothetical protein JTY_1518 [Mycobacterium bovis BCG str. Tokyo 172] gi|166979775|sp|A1KIS1|Y1543_MYCBP RecName: Full=UPF0353 protein BCG_1543 gi|254800546|sp|C1ANC7|Y1518_MYCBT RecName: Full=UPF0353 protein JTY_1518 gi|121493059|emb|CAL71530.1| Probable membrane protein [Mycobacterium bovis BCG str. Pasteur 1173P2] gi|224773000|dbj|BAH25806.1| hypothetical protein JTY_1518 [Mycobacterium bovis BCG str. Tokyo 172] Length = 335 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 55/152 (36%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N K+ L+KL + T T A+ A + + + G T ++ +DG+ Sbjct: 155 TTNREATKNALDKLQFADRTATGEAIFTALQAIATV--GAVIGGGDTPPPARIVLFSDGK 212 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + N ++ G+ I +++ P P + ++K Sbjct: 213 ETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEIDDQRQPVPVDDETMKKVAQ 272 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G + EL + + +I ++++ Sbjct: 273 LSGGNSYNAATLAELRAVYSSLQQQIGYETIK 304 >gi|21539497|gb|AAM53301.1| putative protein [Arabidopsis thaliana] gi|23198328|gb|AAN15691.1| putative protein [Arabidopsis thaliana] Length = 704 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 58/181 (32%), Gaps = 22/181 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSR 265 K+ +L + G ++ ++ R+ I+++ N L E Sbjct: 266 TKLALLKRAMGFVIQNLG--------PFDRLSVISFSSTARRNFPLRLMTETGKQEALQA 317 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N L TN + R L + ++ ++DG+++ N Sbjct: 318 VNSLVSNGGTNIAEGLKKGARVLID--------RRFKNPVSSIVLLSDGQDTYTMTSPNG 369 Query: 326 LNTLQ----ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + + + + +++ A + + +S G F + + ++F Sbjct: 370 SRGTDYKALLPKEINGNRIPVHAFGFGADHDASLMHSIAENSGGTFSFIESETVIQDAFA 429 Query: 382 K 382 + Sbjct: 430 Q 430 >gi|15239414|ref|NP_200879.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|9759335|dbj|BAB09844.1| retroelement pol polyprotein-like [Arabidopsis thaliana] gi|332009986|gb|AED97369.1| C3H4 type zinc finger protein [Arabidopsis thaliana] Length = 704 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 58/181 (32%), Gaps = 22/181 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSR 265 K+ +L + G ++ ++ R+ I+++ N L E Sbjct: 266 TKLALLKRAMGFVIQNLG--------PFDRLSVISFSSTARRNFPLRLMTETGKQEALQA 317 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N L TN + R L + ++ ++DG+++ N Sbjct: 318 VNSLVSNGGTNIAEGLKKGARVLID--------RRFKNPVSSIVLLSDGQDTYTMTSPNG 369 Query: 326 LNTLQ----ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + + + + +++ A + + +S G F + + ++F Sbjct: 370 SRGTDYKALLPKEINGNRIPVHAFGFGADHDASLMHSIAENSGGTFSFIESETVIQDAFA 429 Query: 382 K 382 + Sbjct: 430 Q 430 >gi|229493542|ref|ZP_04387327.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] gi|229319503|gb|EEN85339.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] Length = 326 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 55/152 (36%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N + K ++ L E T T A+ + + + + ++ ++DG+ Sbjct: 147 TTNRDATKVAIDNLKLSERTATGEAIFTSLQSIDTLS--AVLGGSDQAPPARIVLLSDGK 204 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + ++ G+ I +++ P L++ + Sbjct: 205 QTVPENSDDPRGGFTAARQAKDKGVPISTISFGTTYGRVEIEGDRIPVPVDDASLKEIAN 264 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G FF + EL + +D + ++I ++ R Sbjct: 265 LSGGSFFTASSLEELRQVYDTLEEQIGFETTR 296 >gi|156975610|ref|YP_001446517.1| Flp pilus assembly protein TadG [Vibrio harveyi ATCC BAA-1116] gi|156527204|gb|ABU72290.1| hypothetical protein VIBHAR_03343 [Vibrio harveyi ATCC BAA-1116] Length = 502 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 49/482 (10%), Positives = 119/482 (24%), Gaps = 110/482 (22%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+ + F +A++ + ++++ A A L+ S++ Sbjct: 25 FALSLVPVFGMTFFAVEGTRYIQETSRLRDAAQTAALAITIDDKSNQ-----------AD 73 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + I +++ S++ ++ + D +QY ++ + + + Sbjct: 74 ALATMYINDYVRDISHVDIQTVRTYEEPTEDN--DNTEKIQYSVQAVTTHNSWFASNSIP 131 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ----KHNDNNNMTSN 177 + + I + +V D S SM + + + + Sbjct: 132 SFETQEKLAGQAVA-AKYPFYLGDKIIDLVLVTDFSGSMNNSWDGEIKIDLLKDAVKQIS 190 Query: 178 KYLLPP-----------------------------------------------PPKKSFW 190 +L P ++ W Sbjct: 191 NRILVPREGESEVLNRIAIIPFNLRVQEKINDNLYSTSQLRYKGNYRKSVSSVKYEQVNW 250 Query: 191 SK-NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY------ 243 + + A + + + N + Y Sbjct: 251 DYWSPYSEEAVEECANKRTDCPNKKSWERDQAKRVADVVNINNNRLEIPDYVGYSKSVRH 310 Query: 244 -NIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-------------------TNTYPAMHH 283 V N N K + + N T + Sbjct: 311 MFDDKVANNNLTFHFRSNNNKLYNSSMTRTGNSGFYTIPLTANKANLDKMQTMSSGGNTA 370 Query: 284 AYRELYNEKE----SSHNTIGSTRLKKF------VIFITDGENSGASAYQNTLNTLQICE 333 A++ + + N +++ +I I+DG + L +C+ Sbjct: 371 AHQGMLRGLQIMEAGRPNGGSEEETEQYNDRLKMLIVISDGMEYPYTEILPGLVNKGMCD 430 Query: 334 YMRNA------GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 R + I + V+ Q + C + + + D E + KI + I Sbjct: 431 KAREHFQTENGNLYIGVIGVNFSASSQSGFQDCVLNPDE--DIIDVTETEDFIKKIEELI 488 Query: 388 QE 389 Q+ Sbjct: 489 QK 490 >gi|15608619|ref|NP_215997.1| hypothetical protein Rv1481 [Mycobacterium tuberculosis H37Rv] gi|31792676|ref|NP_855169.1| hypothetical protein Mb1517 [Mycobacterium bovis AF2122/97] gi|148661274|ref|YP_001282797.1| hypothetical protein MRA_1491 [Mycobacterium tuberculosis H37Ra] gi|148822701|ref|YP_001287455.1| hypothetical protein TBFG_11510 [Mycobacterium tuberculosis F11] gi|167968028|ref|ZP_02550305.1| hypothetical membrane protein [Mycobacterium tuberculosis H37Ra] gi|215403336|ref|ZP_03415517.1| hypothetical protein Mtub0_06533 [Mycobacterium tuberculosis 02_1987] gi|215411140|ref|ZP_03419948.1| hypothetical protein Mtub9_07385 [Mycobacterium tuberculosis 94_M4241A] gi|215426820|ref|ZP_03424739.1| hypothetical protein MtubT9_10680 [Mycobacterium tuberculosis T92] gi|215430374|ref|ZP_03428293.1| hypothetical protein MtubE_06801 [Mycobacterium tuberculosis EAS054] gi|215445676|ref|ZP_03432428.1| hypothetical protein MtubT_06934 [Mycobacterium tuberculosis T85] gi|218753198|ref|ZP_03531994.1| hypothetical protein MtubG1_07054 [Mycobacterium tuberculosis GM 1503] gi|219557390|ref|ZP_03536466.1| hypothetical protein MtubT1_08827 [Mycobacterium tuberculosis T17] gi|253799469|ref|YP_003032470.1| hypothetical protein TBMG_02500 [Mycobacterium tuberculosis KZN 1435] gi|254231712|ref|ZP_04925039.1| hypothetical protein TBCG_01457 [Mycobacterium tuberculosis C] gi|254364352|ref|ZP_04980398.1| hypothetical membrane protein [Mycobacterium tuberculosis str. Haarlem] gi|254550498|ref|ZP_05140945.1| hypothetical protein Mtube_08557 [Mycobacterium tuberculosis '98-R604 INH-RIF-EM'] gi|260186427|ref|ZP_05763901.1| hypothetical protein MtubCP_10429 [Mycobacterium tuberculosis CPHL_A] gi|260204765|ref|ZP_05772256.1| hypothetical protein MtubK8_10713 [Mycobacterium tuberculosis K85] gi|289447084|ref|ZP_06436828.1| membrane protein [Mycobacterium tuberculosis CPHL_A] gi|289554729|ref|ZP_06443939.1| membrane protein [Mycobacterium tuberculosis KZN 605] gi|289569506|ref|ZP_06449733.1| membrane protein [Mycobacterium tuberculosis T17] gi|289574162|ref|ZP_06454389.1| membrane protein [Mycobacterium tuberculosis K85] gi|289745232|ref|ZP_06504610.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289750042|ref|ZP_06509420.1| membrane protein [Mycobacterium tuberculosis T92] gi|289753564|ref|ZP_06512942.1| hypothetical protein TBGG_00680 [Mycobacterium tuberculosis EAS054] gi|289757593|ref|ZP_06516971.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|289761639|ref|ZP_06521017.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|294993225|ref|ZP_06798916.1| hypothetical protein Mtub2_01637 [Mycobacterium tuberculosis 210] gi|297634047|ref|ZP_06951827.1| hypothetical protein MtubK4_07987 [Mycobacterium tuberculosis KZN 4207] gi|297731033|ref|ZP_06960151.1| hypothetical protein MtubKR_08072 [Mycobacterium tuberculosis KZN R506] gi|298524990|ref|ZP_07012399.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|306775670|ref|ZP_07414007.1| membrane protein [Mycobacterium tuberculosis SUMu001] gi|306779490|ref|ZP_07417827.1| membrane protein [Mycobacterium tuberculosis SUMu002] gi|306784220|ref|ZP_07422542.1| membrane protein [Mycobacterium tuberculosis SUMu003] gi|306788587|ref|ZP_07426909.1| membrane protein [Mycobacterium tuberculosis SUMu004] gi|306792930|ref|ZP_07431232.1| membrane protein [Mycobacterium tuberculosis SUMu005] gi|306797308|ref|ZP_07435610.1| membrane protein [Mycobacterium tuberculosis SUMu006] gi|306803189|ref|ZP_07439857.1| membrane protein [Mycobacterium tuberculosis SUMu008] gi|306967588|ref|ZP_07480249.1| membrane protein [Mycobacterium tuberculosis SUMu009] gi|306971779|ref|ZP_07484440.1| membrane protein [Mycobacterium tuberculosis SUMu010] gi|307079498|ref|ZP_07488668.1| membrane protein [Mycobacterium tuberculosis SUMu011] gi|307084057|ref|ZP_07493170.1| membrane protein [Mycobacterium tuberculosis SUMu012] gi|313658366|ref|ZP_07815246.1| hypothetical protein MtubKV_08092 [Mycobacterium tuberculosis KZN V2475] gi|54040185|sp|P64856|Y1517_MYCBO RecName: Full=UPF0353 protein Mb1517 gi|54042534|sp|P64855|Y1481_MYCTU RecName: Full=UPF0353 protein Rv1481/MT1528 gi|166979870|sp|A5U2I5|Y1491_MYCTA RecName: Full=UPF0353 protein MRA_1491 gi|3261503|emb|CAA16011.1| PROBABLE MEMBRANE PROTEIN [Mycobacterium tuberculosis H37Rv] gi|31618266|emb|CAD96184.1| PROBABLE MEMBRANE PROTEIN [Mycobacterium bovis AF2122/97] gi|124600771|gb|EAY59781.1| hypothetical protein TBCG_01457 [Mycobacterium tuberculosis C] gi|134149866|gb|EBA41911.1| hypothetical membrane protein [Mycobacterium tuberculosis str. Haarlem] gi|148505426|gb|ABQ73235.1| putative membrane protein [Mycobacterium tuberculosis H37Ra] gi|148721228|gb|ABR05853.1| hypothetical membrane protein [Mycobacterium tuberculosis F11] gi|253320972|gb|ACT25575.1| membrane protein [Mycobacterium tuberculosis KZN 1435] gi|289420042|gb|EFD17243.1| membrane protein [Mycobacterium tuberculosis CPHL_A] gi|289439361|gb|EFD21854.1| membrane protein [Mycobacterium tuberculosis KZN 605] gi|289538593|gb|EFD43171.1| membrane protein [Mycobacterium tuberculosis K85] gi|289543260|gb|EFD46908.1| membrane protein [Mycobacterium tuberculosis T17] gi|289685760|gb|EFD53248.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987] gi|289690629|gb|EFD58058.1| membrane protein [Mycobacterium tuberculosis T92] gi|289694151|gb|EFD61580.1| hypothetical protein TBGG_00680 [Mycobacterium tuberculosis EAS054] gi|289709145|gb|EFD73161.1| conserved hypothetical protein [Mycobacterium tuberculosis GM 1503] gi|289713157|gb|EFD77169.1| conserved hypothetical protein [Mycobacterium tuberculosis T85] gi|298494784|gb|EFI30078.1| conserved hypothetical protein [Mycobacterium tuberculosis 94_M4241A] gi|308215767|gb|EFO75166.1| membrane protein [Mycobacterium tuberculosis SUMu001] gi|308327531|gb|EFP16382.1| membrane protein [Mycobacterium tuberculosis SUMu002] gi|308330994|gb|EFP19845.1| membrane protein [Mycobacterium tuberculosis SUMu003] gi|308334816|gb|EFP23667.1| membrane protein [Mycobacterium tuberculosis SUMu004] gi|308338604|gb|EFP27455.1| membrane protein [Mycobacterium tuberculosis SUMu005] gi|308342306|gb|EFP31157.1| membrane protein [Mycobacterium tuberculosis SUMu006] gi|308350100|gb|EFP38951.1| membrane protein [Mycobacterium tuberculosis SUMu008] gi|308354737|gb|EFP43588.1| membrane protein [Mycobacterium tuberculosis SUMu009] gi|308358644|gb|EFP47495.1| membrane protein [Mycobacterium tuberculosis SUMu010] gi|308362622|gb|EFP51473.1| membrane protein [Mycobacterium tuberculosis SUMu011] gi|308366304|gb|EFP55155.1| membrane protein [Mycobacterium tuberculosis SUMu012] gi|323719929|gb|EGB29041.1| membrane protein [Mycobacterium tuberculosis CDC1551A] gi|326903107|gb|EGE50040.1| membrane protein [Mycobacterium tuberculosis W-148] gi|328459217|gb|AEB04640.1| membrane protein [Mycobacterium tuberculosis KZN 4207] Length = 335 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 55/152 (36%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N K+ L+KL + T T A+ A + + + G T ++ +DG+ Sbjct: 155 TTNREATKNALDKLQFADRTATGEAIFTALQAIATV--GAVIGGGDTPPPARIVLFSDGK 212 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + N ++ G+ I +++ P P + ++K Sbjct: 213 ETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQ 272 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G + EL + + +I ++++ Sbjct: 273 LSGGNSYNAATLAELRAVYSSLQQQIGYETIK 304 >gi|315649824|ref|ZP_07902907.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315274798|gb|EFU38179.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 1316 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 52/150 (34%), Gaps = 16/150 (10%) Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 PLS + VK+ +N L T T A+ A L N + + Sbjct: 112 YSSANNISSFPLSTDKEAVKNYVNGLRANGGTATGDAIKKARELLVNHRPDAQP------ 165 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD------LL 357 ++ +TDG+ + + L + G+ Y++A+ D LL Sbjct: 166 ---VIVLLTDGDATEPNGNAYNY-ALTNSNEAKQEGIVFYTIALLNTNANPDTSGPNLLL 221 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKI 387 ++ +S V S L + + I +I Sbjct: 222 KQMATTSHHHHFVLGSVGLGDIYAAIVQEI 251 >gi|294055316|ref|YP_003548974.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293614649|gb|ADE54804.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 730 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 42/363 (11%), Positives = 102/363 (28%), Gaps = 26/363 (7%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 L+ + +R + + + + + +GS+ E I + + + Sbjct: 213 SLALVDAEFGNRADAFTVSSELGGTGGVQVKCATVEPEGSFNTEAYDPIETTSFRSPLVE 272 Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + + A Y L L P+ + + Sbjct: 273 PLSTFSIDVD-TASYANVRRFLNQGQLPPADSVRIEELVNYFNYSDAAPTKSLEDGGAPF 331 Query: 157 SRSM-EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + + + + K P ++ + K+ +L E Sbjct: 332 AVHLEQMSAPWQPEHRLVRVGLKGYEMPWEERPASNLV--FLLDVSGSMSQPNKLPLLKE 389 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + L + R+ + Y + +NN ++ L +L +T Sbjct: 390 ALMLLTRRLDS--------RDRVAIVVYAGASGLVLPSTTANNTATIEHALTQLQAGGST 441 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N + AY+ N VI TDG+ + + L + + Sbjct: 442 NAGAGIELAYQVAREHFIEDGNNR--------VILCTDGDFNVGQTNRGDL-AQIVADQA 492 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRI 394 ++ G+ + + ++L + ++ G + V+ E + F + +I Sbjct: 493 KD-GVSLTVLGFGMGNYKDNMLEELSNKGKGTYAYVDSEAEARKVF---LQDLASNIFKI 548 Query: 395 APN 397 A + Sbjct: 549 AKD 551 >gi|15840942|ref|NP_335979.1| hypothetical protein MT1528 [Mycobacterium tuberculosis CDC1551] gi|13881148|gb|AAK45793.1| conserved hypothetical protein [Mycobacterium tuberculosis CDC1551] Length = 335 Score = 81.5 bits (199), Expect = 2e-13, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 55/152 (36%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N K+ L+KL + T T A+ A + + + G T ++ +DG+ Sbjct: 155 TTNREATKNALDKLQFADRTATGEAIFTALQAIATV--GAVIGGGDTXPPARIVLFSDGK 212 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + N ++ G+ I +++ P P + ++K Sbjct: 213 ETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQ 272 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G + EL + + +I ++++ Sbjct: 273 LSGGNSYNAATLAELRAVYSSLQQQIGYETIK 304 >gi|169629808|ref|YP_001703457.1| hypothetical protein MAB_2724c [Mycobacterium abscessus ATCC 19977] gi|169241775|emb|CAM62803.1| Conserved hypothetical protein [Mycobacterium abscessus] Length = 336 Score = 81.1 bits (198), Expect = 2e-13, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 53/152 (34%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N + + L+ L + T T + A + + + G ++ ++DG+ Sbjct: 156 TTNRDATVNALDNLQLADRTATGEGIFTALQAIATV--GAVIGGGDKPPPARIVLMSDGK 213 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------------EGQDLLRKCTD 362 + S N ++ + I ++A D++ K Sbjct: 214 ETVPSNPDNPKGAYTAARTAKDQQVPISTIAFGTKDGYVEINGQRQNVPYAPDMMEKVAK 273 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G+ + + +L E + + +I +++R Sbjct: 274 LSGGETYTASTLGQLKEVYANLQQQIGYETIR 305 >gi|240172225|ref|ZP_04750884.1| hypothetical protein MkanA1_23119 [Mycobacterium kansasii ATCC 12478] Length = 335 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 56/152 (36%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N + K+ L+KL + T T A+ A + + + G T ++ +DG+ Sbjct: 155 TTNRDATKNALDKLQFADRTATGEAIFTALQAIATV--GAVIGGGDTPPPARIVLFSDGK 212 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + N ++ G+ I +++ P P + L+K Sbjct: 213 ETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINGQRQPVPVDDETLKKVAQ 272 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G + EL + + +I ++++ Sbjct: 273 LSGGNAYNAATLAELKSVYASLQQQIGYETIK 304 >gi|254293317|ref|YP_003059340.1| von Willebrand factor A [Hirschia baltica ATCC 49814] gi|254041848|gb|ACT58643.1| von Willebrand factor type A [Hirschia baltica ATCC 49814] Length = 563 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 40/394 (10%), Positives = 101/394 (25%), Gaps = 32/394 (8%) Query: 7 SVCFLFITYAIDLAHIMYIRNQMQSALDAAVLS--GCASIVSDRTIKDPTTKKDQTSTIF 64 + I +A + + Q+ + AV + A P+ + + Sbjct: 6 IALLGSASILILVACSSSQKTE-QAQEEVAVTADEAYAEAPPPPPPPSPSMLSKHSLGMM 64 Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINI---TKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + + + ++D + + A Y L Sbjct: 65 AMSAPAPMPTSLPSQLDRDKYEDVDTNPVKLVSEDPVSTFSIDVD-TASYSNVRSFLNDG 123 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 L P + + + V+ K + + Sbjct: 124 NLPPKDAVRIEELINYFDYDYPIPASKDVPFATHVNVVPAPWAEGKQLMHVGIKGYDLDR 183 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P N T K+ + ++ L+ +K + I + Sbjct: 184 TEQPP-----LNLTLLVDVSGSMNHEDKLPLAKKALKLLI--------DKMDEDDHISVV 230 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 Y + +++ + L+ L+ +T + AY +++ Sbjct: 231 VYAGAAGTVLEPTKGSEKSKIFAALDNLSAGGSTAGGEGLRLAYSLAEQNYDAASVNR-- 288 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 V+ +TDG+ + + R +G+ + + +++K Sbjct: 289 ------VMLLTDGDFNVGV--TSDERLEDFVARKRESGVYLSVLGFGRGNYNDAMMQKIA 340 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + + L E+ + D + IA Sbjct: 341 QAGNG--MASYIDTLNEARKVLNDDLSGSMFTIA 372 >gi|254786433|ref|YP_003073862.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687231|gb|ACR14495.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 347 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 43/287 (14%), Positives = 85/287 (29%), Gaps = 57/287 (19%) Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 T + +++ VL V+ S ++ + + L Sbjct: 38 FSQVRNLQHQATGTPAQQHKISAGWLALIWVLLVAASARPQWVGEPVTLPATGRDLLLAV 97 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 + K+ +I V+ + S R+G I Sbjct: 98 D-------ISGSMKTPDMVVQDKQIARILVVKYVVNEFIER---------RESDRLGLIL 141 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTI 299 + PL+ + V + L++ E T A+ A + L Sbjct: 142 FGSQAYLQA--PLTFDRKTVSTLLDEAQLGFAGEQTAIGDAVGLAIKRL----------R 189 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--------- 350 ++ +I +TDG N+ + Q + + AG+KIY+V V A Sbjct: 190 ERPASQRVLILLTDGANTAG-----EVAPRQAADLAKQAGIKIYTVGVGADQMEQRMGLF 244 Query: 351 -----------PEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 +D LR + G +F + +EL ++++ Sbjct: 245 GGFSRTVNPSSDLDEDTLRYMAETTGGLYFRARNPQELQAIYEELDK 291 >gi|163760702|ref|ZP_02167782.1| hypothetical protein HPDFL43_12638 [Hoeflea phototrophica DFL-43] gi|162282024|gb|EDQ32315.1| hypothetical protein HPDFL43_12638 [Hoeflea phototrophica DFL-43] Length = 668 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 29/290 (10%), Positives = 75/290 (25%), Gaps = 25/290 (8%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 A Y + L + + ++ V+ + Sbjct: 226 TASYAMVRRALKQGVMPDPRTVRIEEMVNYFNYDYPAPESVETPFRATVTVTPTPWNANT 285 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 + + P+ + K+ +L + L+ ++ Sbjct: 286 RLLHIGVKGYDVKPAARPQANLVLLV-----DVSGSMQETDKLPLLKSAFRLLIQKLEPE 340 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + + Y +++ ++ L+ L P +T + AYR Sbjct: 341 DT--------VSIVTYAGDAGTVLEPTPASDKAKILDALDDLRPGGSTAGAAGIEEAYRL 392 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + V+ TDG+ + ++ + L + E R +G+ + Sbjct: 393 AEKARVNGGVNR--------VLLATDGDFNVGASDDDAL--KSLIEEKRESGVFLSIFGF 442 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L++ + L E+ + + IA + Sbjct: 443 GQGNYNDQLMQTLAQNGNGV--AAYIDTLAEAEKTLAQEATASLFPIASD 490 >gi|2811055|sp|O07395|Y335_MYCAV RecName: Full=UPF0353 protein MAV335 gi|2183263|gb|AAC46199.1| MAV335 [Mycobacterium avium] Length = 335 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 67/222 (30%), Gaps = 25/222 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A A ++ E+A + + I Sbjct: 95 NRAVVMLVIDVSQSMRATDVAPNRMAAAQEAAKQFADELTPGIN----------LGLIAY 144 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + N K+ L+KL + T T + A + + G Sbjct: 145 AGTATVLVSPTTNREATKNALDKLQFADRTATGEGIFTALQ-VQAIATVGAVIAGDKPPP 203 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PE 352 ++ +DG+ + + N ++ G+ I +++ P P Sbjct: 204 ARIVLFSDGKETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPV 263 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + L+K S G + +EL + + +I ++++ Sbjct: 264 DDETLKKVAQLSGGNAYNARSLQELKSVYATLQQQIGYETIK 305 >gi|326673138|ref|XP_001334803.4| PREDICTED: collagen alpha-4(VI) chain-like [Danio rerio] Length = 1356 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 53/161 (32%), Gaps = 16/161 (9%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 VRIG + Y+ N E+ + L T+T + L Sbjct: 67 DKVRIGLVQYSDTPRTEFSLNTYQNKEEILDYIRNLRYKTGGTHTGQGLEFI---LKQHF 123 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + + I ITDG++ Q + +R G+KI+++ + Sbjct: 124 IEEAGSRAQQNVPQIAIVITDGDSQDEVDLQ--------AQELRQRGIKIFAIGI--KDA 173 Query: 353 GQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQS 391 LLR+ + ++V+D L + ++ Sbjct: 174 DVRLLRQIANEPYDQYVYSVSDFAALQGISQSVVRELCTSV 214 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 57/152 (37%), Gaps = 16/152 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYREL 288 + ++I I Y+ + +N N++ +N L T T A+ A + Sbjct: 643 DVGKDKIQIAVILYSDFPRADVYLNTFSNKNDILRYINTLPYGRGKTYTGAALRFAKEHV 702 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + S +++ + ITDG+++ +A +R +G+ I+++ + Sbjct: 703 FTKARGSRRDK---YVQQVAVVITDGKSTDDAASAAA--------ELRRSGVSIFALGI- 750 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 +D LR+ V + +L Sbjct: 751 -KDTKEDDLREIASYPPKKFVLNVENFDQLNS 781 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 21/166 (12%), Positives = 53/166 (31%), Gaps = 20/166 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ ++ L+ + I R+G ++ + NE+ + + Sbjct: 434 ELSIIKNFLQKLIGQLNVGINGN-----RVGLAQFSENVKEEFLLNTHRTRNEMSTSIRN 488 Query: 269 LN--PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 L P A+ HA +N + + K+F++ I GE++ Sbjct: 489 LQLTPTGERRIGHAIEHARSNFFN---RDAGSRAAEGYKQFLLVIAAGESADG------- 538 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 +Q ++ + ++ A ++ S + V + Sbjct: 539 -VIQASRKIKKDAVTVF--AAGLNRADAYEMKDIASQSHNYKLVGN 581 >gi|167763116|ref|ZP_02435243.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] gi|167699456|gb|EDS16035.1| hypothetical protein BACSTE_01485 [Bacteroides stercoris ATCC 43183] Length = 327 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 64/205 (31%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + + + + + T Sbjct: 108 NRLEAAKDVAAEFINGRPNDNVGITLFAGESFTQCPLTVDHAVLLNLIKDVKCGLIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ L E Sbjct: 168 AVGMGIANAVTRL----------KDSKAKSKVIILLTDGTNNRGDIS-----PLTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP---------------------EGQDLLRKCTDSSGQFFAVNDSR 374 ++ G+++Y++ V + + L + + G +F + Sbjct: 213 KSFGIRVYTIGVGTNGMAPYPYPVGGTVQYVNMPVEIDEKTLTQIAGTTDGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|332664649|ref|YP_004447437.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332333463|gb|AEE50564.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 328 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 31/168 (18%), Positives = 65/168 (38%), Gaps = 42/168 (25%) Query: 254 PLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + +++ L +L + T + A L S K +I Sbjct: 142 PLTTDHKILETFLEQLECGNLEDGTAIGMGLAGAVNRL----------KKSPAKSKVIIL 191 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------------ 352 +TDG N+ L E + G+K+YS+ V E Sbjct: 192 LTDGVNNVGYF-----KPLTAGELAKELGIKVYSIGVGTIGEALTPVSRLSDGSFFLDYA 246 Query: 353 ----GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 ++LLR+ + GQ+F ++++L + ++ I D++++ +++ Sbjct: 247 QVEIDEELLREIARMTGGQYFRAKNNQDLRQIYNTI-DRLEKTEIQVT 293 >gi|240137440|ref|YP_002961911.1| hypothetical protein MexAM1_META1p0705 [Methylobacterium extorquens AM1] gi|240007408|gb|ACS38634.1| conserved hypothetical protein [Methylobacterium extorquens AM1] Length = 473 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 62/459 (13%), Positives = 128/459 (27%), Gaps = 90/459 (19%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + + +D + + ++ +A DAA L+G + + + Sbjct: 25 LFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGVVT-AKEFIAANAQQSDVTA 83 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S I + + + + +Q+ I + ++ Y ++ Sbjct: 84 SGIKAGESQALKAFNANASKVPFATVSLSQLEIVRT-----GQTLDATVSYTATVQST-F 137 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + + T L+ R ++ + + +++DVS SM D++ Sbjct: 138 GRIFGLSATTLTNRVNASVDLA---SYLDFYLMVDVSGSMG----LPTKDSDAEVLAMQS 190 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 F + A + D + + L+ RIG Sbjct: 191 KEKQGNCQFACHFPDSVGWTKAAGKIQLRSDAVNNAVCELLKRASTP---VVPNQYRIGI 247 Query: 241 IAYNIG-IVGNQCTPLSNNLNEVKSRLN----------KLNPYENT-------------N 276 + T + +L +++ L +T + Sbjct: 248 YPFINQLATLAPLTDTTTSLAALRTAAQCDKVWPLAFTNLLDTGSTQLFTNNDPKTGTGS 307 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS------------------- 317 A ++ + + N ST + FV ITDG + Sbjct: 308 GGTHFEAALPKMKSTIKPYGNGSASTNSRPFVFLITDGMQNSQSYSAWKDTKTFSGNPSK 367 Query: 318 ---GASAYQNTLNTLQI----CEYMRNAGMKIYSVAVSAP-------------------- 350 +A N QI C ++NAG I + + Sbjct: 368 FAGYPNADWNGSQPAQIDPSKCTDLKNAGATISVLYIPYNIVKNYNNDSYIVWENGRVNQ 427 Query: 351 --PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 P D LRKC G F+ N ++ S + + Sbjct: 428 FSPTLADPLRKCAS-PGFFYTANTQDDITASLGAMFKQA 465 >gi|188578240|ref|YP_001915169.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|188522692|gb|ACD60637.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 335 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 33/205 (16%), Positives = 66/205 (32%), Gaps = 46/205 (22%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + ++ G TPL+ +L Sbjct: 117 VLGGKVVDRLTAAKAVLSDFLDR-----------RDGDRVGLLVFGQRAYALTPLTADLT 165 Query: 261 EVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V+ +L T A+ + + L +K+ ++ V+ +TDG N+ Sbjct: 166 SVRDQLRDSVVGLAGRETAIGDAIALSVKRLREQKQG----------QRVVVLLTDGVNT 215 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAV----------------SAPPEGQDLLRKCT 361 L+ L+ E + G++IY++A +D LRK Sbjct: 216 AGV-----LDPLKAAELAKAEGVRIYTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIA 270 Query: 362 -DSSGQFFAVNDSRELLESFDKITD 385 + G+FF D+ EL + ++ Sbjct: 271 QQTGGRFFRARDTEELAGIYAELDR 295 >gi|118464548|ref|YP_883428.1| hypothetical protein MAV_4290 [Mycobacterium avium 104] gi|118165835|gb|ABK66732.1| conserved hypothetical protein [Mycobacterium avium 104] Length = 335 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 65/222 (29%), Gaps = 26/222 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ + ++ E+ + + AI Sbjct: 95 NRAVVMLVIDVSESMASTDVPPNRLAAAKEAGKQFADQLTPAINLGLVEF---------- 144 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + N VK+ ++ L P T T + A + + S G Sbjct: 145 AANATLLVPPTTNRAAVKAGIDSLQPAPKTATGEGIFTALQAIATV--GSVMGGGEGPPP 202 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--------------P 351 ++ +DG + + G++I +++ P Sbjct: 203 ARIVLESDGAENVPLDPNAPQGAFTAARAAKAEGVQISTISFGTPYGTVDYEGATIPVPV 262 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + Q L + C + GQ F + L + + +I ++V+ Sbjct: 263 DDQTLQKICEITDGQAFHADSLDSLKNVYSTLQRQIGYETVK 304 >gi|257454382|ref|ZP_05619644.1| von Willebrand factor, type A [Enhydrobacter aerosaccus SK60] gi|257448148|gb|EEV23129.1| von Willebrand factor, type A [Enhydrobacter aerosaccus SK60] Length = 550 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 36/309 (11%), Positives = 92/309 (29%), Gaps = 31/309 (10%) Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 T I Y L L P+ + S+ ++ Sbjct: 94 TATDAVATFSIDTDTGSYANVRRFLNNGQLPPTDAVRIEELINYFNYDFSQAKRLANAPF 153 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY----APAPAPANRK 209 L + ++ N + P + S + N K Sbjct: 154 LVSTETVA-----APWRTANRIIKVAIKADDPTITKQSTLPPANLVFLVDVSGSMSDNDK 208 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + ++ S L ++ I + Y ++ +++ + ++ L Sbjct: 209 LPLVKSSLKMLTKQLRPQDT--------ISIVTYAGRTQVTLPATRGSDTDKILAAIDSL 260 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + +TN A+ AY++ + ++ +TDG+ + + + L Sbjct: 261 DASGSTNGEAAIKLAYQQAKIHYKKDGINR--------ILMMTDGDFNVGVSDVDE--ML 310 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQ 388 I R++G+ + + ++ + D+ G + ++ L E+ + D++ Sbjct: 311 DIIRRERDSGVSLSTFGFGEGNLNDHMMEQVADNGNGNYSYIDS---LSEAKKALVDEMS 367 Query: 389 EQSVRIAPN 397 +A + Sbjct: 368 ATFNTVAKD 376 >gi|183982301|ref|YP_001850592.1| membrane protein [Mycobacterium marinum M] gi|226701243|sp|B2HPD3|Y2288_MYCMM RecName: Full=UPF0353 protein MMAR_2288 gi|183175627|gb|ACC40737.1| membrane protein [Mycobacterium marinum M] Length = 335 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 55/152 (36%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N K+ L+KL + T T A+ A + + + G T ++ +DG+ Sbjct: 155 TTNREATKAALDKLQFADRTATGEAIFTALQAIATV--GAVIGGGDTPPPARIVLFSDGK 212 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + N ++ G+ I +++ P P + ++K Sbjct: 213 ETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQ 272 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G + EL + + +I +++R Sbjct: 273 LSGGNSYNAATLAELNSVYASLQQQIGYETIR 304 >gi|60477748|gb|AAH90753.1| Matn4 protein [Danio rerio] Length = 261 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 65/174 (37%), Gaps = 20/174 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+ + + R+G + Y+ + + ++ +E+K + Sbjct: 44 NFELVKQFVNQVVDQL-----DVSAKGTRVGLVQYSSCVRTEFPLSMYHSKDEIKKAVMN 98 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + T T A+ H ++E E + + + + TDG + + Sbjct: 99 VEYMEKGTMTGLALKHMVENSFSEAEGARPAEKNI--PRVGLVFTDGRSQD--------D 148 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 + + + AG+ +Y+V V +D LR+ FF D + + Sbjct: 149 IQEWAKKAKEAGITMYAVGVGKAV--EDELREIASDPVEKHFFYSADFTAISQI 200 >gi|194221587|ref|XP_001495285.2| PREDICTED: similar to collagen type VI alpha 6 [Equus caballus] Length = 2301 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 38/352 (10%), Positives = 97/352 (27%), Gaps = 39/352 (11%) Query: 52 DPTTKKDQTSTIFKKQIKKHLKQGS--------YIRENAGDIAQKAQINITKDKNNPLQY 103 + + + + + + + QI + Sbjct: 322 NKSEVLQYIQNLSPQAGNAYTGAAIRKIRKEVFGAQNGSRKNQGVPQIAVLVTHRPSEDN 381 Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 + ++ +F G+ ++ T L ++ E+ L + ++ L Sbjct: 382 VTKAAVNLRRQGVTIFTMGIEGASDTQLEKIASHPAEQYVSKLKSFSDLAAHNQTFLKKL 441 Query: 164 YL----QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + + K + + + Sbjct: 442 RNQITHTLSVFSERTETLKSGCVDTEEADIYLLIDGSGSTQ------ATDFHEMKTFLSE 495 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTY 278 LV A VR G + Y +N +++ + + TNT Sbjct: 496 LVGMFNIAP-----QKVRFGAVQYADSWDLEFEINKYSNKHDLGKAIENIRQMGGNTNTG 550 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A++ L K+ N + ++ +T+G + + ++ +R Sbjct: 551 AALNFTLGLLQKAKKERGNK-----VPCHLVVLTNGV--------SKDSIVEPANRLREE 597 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + +Y++ V Q LR+ + + V+D L + +++ +I Q Sbjct: 598 LIHVYAIGV--REANQTQLREIAGEEKRMYYVHDFDALKDIRNQVVQEICAQ 647 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 56/164 (34%), Gaps = 15/164 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYR 286 + VRIG ++ E+ ++ K+ T+ A+ R Sbjct: 1063 DFDVTVNRVRIGAAQFSHNYQPEFPLGTFTGEEEISLQIEKIQQIFGYTHIGAAL----R 1118 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + + + + ++ +TDG++ Q E +R G+ IYSV Sbjct: 1119 RVGRYFRPDMGSRINAGTPQVLLVLTDGQSQD--------EVAQAAEDLRRKGINIYSVG 1170 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L + T ++ + V+D EL + +I I Sbjct: 1171 IG--DVDDQQLVQITGTADKKLTVHDFDELRKVKKRIVRHICTT 1212 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 66/191 (34%), Gaps = 18/191 (9%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + +++ + +LV + VR G + Y Sbjct: 850 SSGSIDYDEYNIMKDFMTDLVKKA-----DVGKNQVRFGALKYADDPEVLFYLDTLGTKW 904 Query: 261 EVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EV S L P T T A+ + + + + + +I ITDGE Sbjct: 905 EVISVLQNDQPMGGNTYTAEALAFSDHMFTE----ARGSRLQRGVPQVLIVITDGE---- 956 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 S + LN + +R+ G+ I +V + A +LL SS ++F V L Sbjct: 957 SHDADKLNAT--AKALRDKGILILAVGI-AGANPVELL-AMAGSSDKYFFVETFGGLKGI 1012 Query: 380 FDKITDKIQEQ 390 F ++ + Sbjct: 1013 FSDVSASVCNS 1023 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 52/160 (32%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +E+ ++++ T T A+ + K Sbjct: 691 DRVQIGVVQFSHVNKEEFQLNRYMSQSEISDAIDRMAHIGETTLTGHALTFVSQYFSPAK 750 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE +R G+ IYSV + Sbjct: 751 GARP------NVRKFLILITDGEAQD--------IVKDPAVALRQEGIIIYSVGVFGSNV 796 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 797 ---TQLEEISGRPEMVFYVENFDILQHIEDDLVFGICSPR 833 >gi|326797316|ref|YP_004315136.1| hypothetical protein Marme_4100 [Marinomonas mediterranea MMB-1] gi|326548080|gb|ADZ93300.1| hypothetical protein Marme_4100 [Marinomonas mediterranea MMB-1] Length = 528 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 58/494 (11%), Positives = 120/494 (24%), Gaps = 106/494 (21%) Query: 2 TAIIISVCFLFIT-----YAIDLAHIMYIRNQMQSALDAAVLSGC----ASIVSDRTIKD 52 A+ V L +A+D ++ Q++ A DAA L+ + D T D Sbjct: 27 AALPFIVLILVGALVGFSFALDTTRMVNTAGQLKRATDAAALAIGQIQLRNNNDDETDLD 86 Query: 53 PTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN------NPLQYIAE 106 + + + + I G +T N N Sbjct: 87 SIAQGYVLNNLGMDSGLIDQIDTTSIFVTQGTNDGHPTFTVTVTLNTQSDLLNAQTEDQV 146 Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY-- 164 + E+ + + L+P+ LT + + + D S S + ++ Sbjct: 147 ISSTVEVVSTPTEVALLLPNTLTEDEPELVALRKLGKSFARNLLGEDTDASNSAQKVWLS 206 Query: 165 -----------------LQKHNDNN------------NMTSNKYLLPPPPKKSFWSKNTT 195 + L P Sbjct: 207 LVPFSQAVNVYDADDPERISRWAAAGALNPPELRSLFKTGKVRSLADPRFPDRVAKLLCM 266 Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ--------------EKKNLSVRIGTI 241 + + Q +++ + I Sbjct: 267 YRGLGAGENFNWDQQPDSQFGVYYRHDLPQNGSPGATPISWVGPNPSLWPSSVAEDVRWI 326 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 + G PL+N+L+ + +RL++++ N N AM A L S S Sbjct: 327 VADKGCPNAPLLPLTNDLDAIDARLDEMSTRFNVNYAIAMGWAGHALSPNMRGSSGWGDS 386 Query: 302 TRL-------KKFVIFITDGENSGASAYQNTLNTL----------------------QIC 332 + + +G + N +C Sbjct: 387 ELPLDFSKSKSNVKVMVMLANTTGDWFDTDAYNFNRDQALDSTGPGSAKAFATQRFHDVC 446 Query: 333 EYMRNAGMKIYSVAVSAPPEGQ-----------DLLRKCTDSSGQFFAV------NDSRE 375 R+ +K + + V LR+C G + + Sbjct: 447 RSFRDKNIKFFFIGVRPGDPADFGRTLFTDIAGPGLRECAGGGGGLYFADASSFVEGKSQ 506 Query: 376 LLESFDKITDKIQE 389 + ++I ++I++ Sbjct: 507 IDSLLEEIAEEIRQ 520 >gi|148974032|ref|ZP_01811565.1| hypothetical protein VSWAT3_12932 [Vibrionales bacterium SWAT-3] gi|145965729|gb|EDK30977.1| hypothetical protein VSWAT3_12932 [Vibrionales bacterium SWAT-3] Length = 330 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 69/204 (33%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 ++ + + + + R+G + + TPL+ + + + Sbjct: 113 NGEYIDRLSAVKHVLSDFIERRK---------GDRVGLVLFADHAYLQ--TPLTLDRDTL 161 Query: 263 KSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +LN+ L N T + A + + S ++ ++ ++DG N+ Sbjct: 162 SQQLNQAVLRLIGNQTAIGDGIGLATKTFVD----------SDAPQRVMVLLSDGSNTAG 211 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------EGQDLLRKCT 361 L+ L+ + + IY+V V A + + L+ Sbjct: 212 V-----LDPLEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAK 266 Query: 362 DSSGQFFAVNDSRELLESFDKITD 385 + GQ+F DS+EL +D I Sbjct: 267 RTGGQYFRARDSKELATIYDTINQ 290 >gi|89072369|ref|ZP_01158948.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] gi|89051901|gb|EAR57353.1| hypothetical protein SKA34_06335 [Photobacterium sp. SKA34] Length = 321 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 70/199 (35%), Gaps = 47/199 (23%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ + + + + R+G + + TPL+ + V+ +L+ Sbjct: 109 DRLTAVKHVLSDFIEKRK---------GDRLGLVLFADHAYLQ--TPLTFDRKTVEKQLD 157 Query: 268 K--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + L +T + A + S ++ +I ++DG N+ Sbjct: 158 RTVLGLIGQSTAIGEGLGIATKTFI----------NSKAPQRVIILLSDGANTSGV---- 203 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD-SSGQ 366 ++ L+ + + +G+KIY+V V A + L + G+ Sbjct: 204 -IDPLEAAKLAKESGVKIYTVGVGADQMVQQGFFGDRIVNPSQDLDEKTLTDIAKMTGGE 262 Query: 367 FFAVNDSRELLESFDKITD 385 +F + ++L + +D I Sbjct: 263 YFRARNPQQLEKIYDIINK 281 >gi|284030499|ref|YP_003380430.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283809792|gb|ADB31631.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 317 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 23/194 (11%), Positives = 61/194 (31%), Gaps = 32/194 (16%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +++ +SA N VN + + N + P + + V+ ++ Sbjct: 106 NRLEAAKKSAKNFVNQLPS----------KFNVALVNFAGTASIIVPPTTDRATVQRSID 155 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L E+T T + + + L + + ++ ++DG+ + Sbjct: 156 GLELAESTATGEGIFTSLQALTQVPPDPEH--PNDPAPARIVLLSDGKRTVGRT------ 207 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPE-------------GQDLLRKCTD-SSGQFFAVNDS 373 + + + IY++ + LR + + G+ + + Sbjct: 208 AQEGAQAAKEKNTPIYTITFGTDSGFIEMDGIRQRVPPDRAELRSVAEITGGEAYTAESA 267 Query: 374 RELLESFDKITDKI 387 EL + + I + Sbjct: 268 GELEDVYKDIGSSV 281 >gi|213963729|ref|ZP_03391979.1| BatA protein [Capnocytophaga sputigena Capno] gi|213953609|gb|EEB64941.1| BatA protein [Capnocytophaga sputigena Capno] Length = 333 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 32/181 (17%), Positives = 64/181 (35%), Gaps = 43/181 (23%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRE 287 K + RIG + Y TP++ + + S L +L + T + A Sbjct: 126 KDRPNDRIGLVVYAGESYTK--TPVTTDKGIILSSLAELTYGQVEDGTAIGMGLATAVNR 183 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L S + +I +TDG N+ ++ L E G+K+Y+V + Sbjct: 184 L----------KESKAKSRVIILLTDGVNNTGV-----IDPLIAAELAAEYGIKVYTVGI 228 Query: 348 SAPP----------------------EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 + L++K + G++F ++++L + +D+I Sbjct: 229 GTNGMALSPYALNPDGSIMYRMLQVEIDESLMKKIAQVTHGRYFRATNNQKLQQIYDEIN 288 Query: 385 D 385 Sbjct: 289 K 289 >gi|291399639|ref|XP_002716220.1| PREDICTED: collagen, type VI, alpha 6 [Oryctolagus cuniculus] Length = 2273 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 36/307 (11%), Positives = 92/307 (29%), Gaps = 31/307 (10%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 QI + + + +F G+ ++ T L ++ E+ L Sbjct: 332 PQIVVLVTHRASEDNVTRAAVNLRRQGVTVFTLGVEGASATQLEKIASHPAEQYVSKLPT 391 Query: 149 SICMVLDVSRSMEDLYL----QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + ++ L + K + + Sbjct: 392 FSDLAAHNQTFVKKLRNQITHTVSVLAERTETLKSGCVDTEEADIYLLIDGSGSTQ---- 447 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 + + + VR+G + Y +N +++ Sbjct: 448 --ATDFQEMKTFLSEVASMFH-----IGPHKVRVGAVQYASSWDLEFEIGKYSNKHDLGR 500 Query: 265 RLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + + +TNT A++ R L K+ N + ++ +T+G Sbjct: 501 AIENIRQLGGDTNTGAALNFTLRLLQKAKQQRGNK-----VPCHLVVLTNGM-------- 547 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + + L+ + +R +++Y++ V Q LR+ + + V+D L + +++ Sbjct: 548 SKDSILEPAKRLREENIRVYAIGV--KEANQTQLREIAGDEKRVYYVHDFDALKDIRNQV 605 Query: 384 TDKIQEQ 390 +I + Sbjct: 606 VQEICAE 612 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 58/164 (35%), Gaps = 15/164 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYR 286 + V+IG ++ + EV + + T+ A+ Sbjct: 1028 DFDVSLNRVQIGAAQFSHTYQPEFPLGTFTDEKEVSFHIENIQQIFGYTHIGAAL----H 1083 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 ++ + + +T + ++ +TDG++ Q E +R+ G+ IYSV Sbjct: 1084 QVGRYFQPDMGSRINTGTPQVLLVLTDGQSQD--------EVAQAAEELRHNGVDIYSVG 1135 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L + T ++ + V++ EL + +I I Sbjct: 1136 IG--NVDHQQLIQITGTADKKLTVDNFDELKKIKKRIVRNICTS 1177 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 32/162 (19%), Positives = 56/162 (34%), Gaps = 13/162 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR G + Y + EV S L P T T A+ + Sbjct: 839 DVGKAQVRFGALKYADDPEVLFYLGDLDTKMEVISMLQNDQPMGGNTYTAEALAFSDHMF 898 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + +I ITDGE+ A T + +R+ G+ + +V + Sbjct: 899 TE----ARGSRLHKGVPQVLIVITDGESHDAEKLNGTT------KALRDKGILVLAVGI- 947 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 A +LL SS ++F V L F ++ + Sbjct: 948 AGANPVELL-AMAGSSDKYFFVETFGGLQGIFSDVSASVCNS 988 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 54/159 (33%), Gaps = 19/159 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 V+IG + ++ + +++ + ++++ T T A+ + K Sbjct: 657 RVQIGVVQFSHINKEEFQLDTFMSQSDISNAIDRMAHIGETTLTGGALTFVSQYFSPAKG 716 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPPE 352 + ++KF+I ITDGE + +R G+ IYSV + Sbjct: 717 ARP------NVRKFLILITDGEAQD--------VVKEPAVALRQEGVIIYSVGVFGSNV- 761 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 762 --TQLEEISGRPEMVFYVENFDILQRIEDDLVFGICSPR 798 >gi|29346317|ref|NP_809820.1| aerotolerance protein BatA [Bacteroides thetaiotaomicron VPI-5482] gi|29338212|gb|AAO76014.1| BatA [Bacteroides thetaiotaomicron VPI-5482] Length = 327 Score = 81.1 bits (198), Expect = 3e-13, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 64/205 (31%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + ++ + + T Sbjct: 108 NRLEAAKDVAAEFINGRPNDNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKCGLIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ L E Sbjct: 168 AVGMGVANAVTRL----------KDSKAKSKVIILLTDGTNNKGDIS-----PLTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP---------------------EGQDLLRKCTDSSGQFFAVNDSR 374 ++ G+++Y++ V + + L + + G +F + Sbjct: 213 KSFGIRVYTIGVGTNGMAPYPYPVGNTVQYINMPVEIDEKTLTQIAGTTDGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|218675994|ref|YP_002394813.1| hypothetical protein VS_II0212 [Vibrio splendidus LGP32] gi|218324262|emb|CAV25554.1| Conserved hypothetical protein [Vibrio splendidus LGP32] Length = 347 Score = 80.7 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 68/204 (33%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 ++ + + + + R+G + + TPL+ + + + Sbjct: 130 NGEYIDRLSAVKHVLSDFIERRK---------GDRVGLVLFADHAYLQ--TPLTLDRDTL 178 Query: 263 KSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +LN+ L T + A + + S ++ +I ++DG N+ Sbjct: 179 SQQLNQAVLKLIGTQTAIGDGIGLATKTFVD----------SDAPQRVMILLSDGSNTAG 228 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------EGQDLLRKCT 361 L+ L+ + + IY+V V A + + L+ Sbjct: 229 V-----LDPLEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAK 283 Query: 362 DSSGQFFAVNDSRELLESFDKITD 385 + GQ+F DS+EL +D I Sbjct: 284 RTGGQYFRARDSKELATIYDTINQ 307 >gi|148258701|ref|YP_001243286.1| hypothetical protein BBta_7530 [Bradyrhizobium sp. BTAi1] gi|146410874|gb|ABQ39380.1| hypothetical protein BBta_7530 [Bradyrhizobium sp. BTAi1] Length = 511 Score = 80.7 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 50/480 (10%), Positives = 110/480 (22%), Gaps = 112/480 (23%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + I A+D +R+++QSA DAA ++ + T Sbjct: 39 IFAMALLPILSAIGCAVDYTQATRLRSKLQSAADAASVASISQQSLGYNAALQMTSDGTV 98 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + G + K + Sbjct: 99 QVAVE---EATKLFNGNAANSLGYTNLSLNAQVMKTGVKL------AATVAFSADVPTTF 149 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ ++ S SS + ++LDVS SM + Sbjct: 150 MTVVGYRKLTVTGTSKS---TSSLPPYLDFYLMLDVSGSMGLPSTDAEQTRLAAINPDNY 206 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANR-------------------------------- 208 P +F T+ S A N Sbjct: 207 KQYPNGCTFACHFTSASSCPAANQKYNTNGSCMGYPMSRVSYSGVKKLLTNNGGKLPSSL 266 Query: 209 ---------------------KIDVLIESAGNLVNSIQKAIQEKKNLS---VRIGTIAYN 244 + D + + L+ + + Y Sbjct: 267 LSSLTAVTSCPTDGSDACIQLRADAVGAAVQQLLVTANATQKTPNQFRIGLYPFVRYLYA 326 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNP-----------YENTNTYPAMHHAYRELYNEKE 293 + + L T+ A + + + Sbjct: 327 YSPLTASINGSPTTPGTINHAAANLASQLDTGANASLGSGGTHFENAFPTMNGIITSVGD 386 Query: 294 SSHNTIGSTRLKKFVIFITDGENS---GASAYQNTLNTLQI-----CEYMRNAGMKIYSV 345 S + +V ITDG + + + N+ C +++ G+ + + Sbjct: 387 GSASNKTQP----YVFLITDGAQNPQVYWNGSWSGSNSATTMDTSKCTTLKSRGIIVSVL 442 Query: 346 AVSAP--------PEGQDL------------LRKCTDSSGQFFAVNDSRELLESFDKITD 385 + +D L+ C G F+ N ++ + + + + Sbjct: 443 YIPYQPIQNPTSFANSEDFYANANIPKIPPSLQACAS-PGYFYTANSPADITAALNAMFN 501 >gi|331006778|ref|ZP_08330044.1| BatA [gamma proteobacterium IMCC1989] gi|330419396|gb|EGG93796.1| BatA [gamma proteobacterium IMCC1989] Length = 364 Score = 80.7 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 70/200 (35%), Gaps = 40/200 (20%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 ++ + + + ++ RIG I + TPL+ + V L Sbjct: 118 ATRLAAVKKVVSDFIDQ---------RQGDRIGLILFGTQAYLQ--TPLTFDTQSVNQFL 166 Query: 267 NKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + ++T A+ + + L N+ +S + K +I +TDGEN+ Sbjct: 167 QEAQLGFAGKDTAIGDAIGLSVKRLKNQSSASSAKPSN---SKVIILLTDGENTAG---- 219 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCT-DSSG 365 + LQ + G KIY+V + A ++ L + G Sbjct: 220 -EVEPLQAAKLAEKIGAKIYTVGIGADEMIVRGFFGNRRVNPSASLDEETLTAIANTTGG 278 Query: 366 QFFAVNDSRELLESFDKITD 385 +F +++EL + ++ Sbjct: 279 LYFRARNTQELNNIYSELDK 298 >gi|269104787|ref|ZP_06157483.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] gi|268161427|gb|EEZ39924.1| protein BatA [Photobacterium damselae subsp. damselae CIP 102761] Length = 321 Score = 80.7 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 59/154 (38%), Gaps = 36/154 (23%) Query: 253 TPLSNNLNEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TPL+ + + V+ +L++ L +T + A + S ++ +I Sbjct: 143 TPLTFDRHTVEQQLDRTVLGLVGQSTAIGEGLGIATKTFIK----------SKAPQRVII 192 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PE 352 ++DG N+ ++ L+ + + +G+ IY+V + A Sbjct: 193 LLSDGANTAGV-----IDPLEAAKLAKESGVTIYTVGIGADEMLQRSIFGVQKVNPSQDL 247 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L K + G++F + +EL + + I Sbjct: 248 DEKTLTKIAQMTGGKYFRARNPQELDKIYQIINQ 281 >gi|262191198|ref|ZP_06049398.1| protein BatA [Vibrio cholerae CT 5369-93] gi|262032938|gb|EEY51476.1| protein BatA [Vibrio cholerae CT 5369-93] Length = 477 Score = 80.7 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 58/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + V ++LN+ L T + A + + Sbjct: 288 RIGLILFADHAYLQ--TPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFID--- 342 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 S ++ +I ++DG N+ L+ L+ + IY+V V A Sbjct: 343 -------SDAPQRVMILLSDGSNTAGV-----LDPLEAANIAKQYHTTIYTVGVGAGEMV 390 Query: 351 --------------PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + G +F + ++L +D I Sbjct: 391 VKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 440 >gi|255531385|ref|YP_003091757.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344369|gb|ACU03695.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 332 Score = 80.7 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 62/203 (30%), Gaps = 42/203 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + + PL+ + + + + + ++ + T Sbjct: 112 NRLEAGKNIAIDFIKGRPEDRIGLVIFSGESFTQCPLTIDHDVLINLFSDISNGMVEDGT 171 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A L S K VI +TDG N+ S L E Sbjct: 172 AIGMGLATAVNRL----------KDSEAKSKVVILLTDGSNTTGSIP-----PLTAAEIA 216 Query: 336 RNAGMKIYSVAVSAPPE--------------------GQDLLRKCTD-SSGQFFAVNDSR 374 + +++Y++ V + +L K + G++F ++ Sbjct: 217 KQMKVRVYTIGVGTKGYAPYPVKTPFGTQYQQVPVTIDEGVLSKIAGITGGKYFRATNNE 276 Query: 375 ELLESFDKITDKIQEQSVRIAPN 397 +L E + +I + +IA Sbjct: 277 KLKEIYQQIDKL---ERAKIAVT 296 >gi|134100328|ref|YP_001105989.1| hypothetical protein SACE_3793 [Saccharopolyspora erythraea NRRL 2338] gi|133912951|emb|CAM03064.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] Length = 327 Score = 80.7 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 23/220 (10%), Positives = 64/220 (29%), Gaps = 26/220 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S A +++ +A + + I +G +++ Sbjct: 88 NRATVMLTVDVSLSMKATDVEPNRLEAAKVAAKEFADQLTPGIN--------LGLVSFAG 139 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + VK ++ L E T T ++ A + + S Sbjct: 140 TATVLVMP--TTDRASVKQAIDNLKLSEATATGDGINAAMSAIDS--FGKMVGGPSGAPP 195 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------------- 351 ++ + DG + + + A + I +++ Sbjct: 196 ARIVLMADGGQTIPRELDAPRGAYTKAQEAKKANIPISTISFGTKHGSIEIEGEQEFVEV 255 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + + + S G+F + +L E + + ++I + Sbjct: 256 DDEAMQEIARLSGGEFHKAASAEQLREVYATLGEQIGYEI 295 >gi|85710455|ref|ZP_01041519.1| putative secreted protein [Erythrobacter sp. NAP1] gi|85687633|gb|EAQ27638.1| putative secreted protein [Erythrobacter sp. NAP1] Length = 576 Score = 80.7 bits (197), Expect = 3e-13, Method: Composition-based stats. Identities = 34/370 (9%), Positives = 103/370 (27%), Gaps = 31/370 (8%) Query: 30 QSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQ-IKKHLKQGSYIRENAGDIAQK 88 Q+ D V+ + + + ++ + I + RE Sbjct: 62 QALADNTVVVTASRRAGAPSRSRTGAVSAEVASAPRGYSIPPVVVPVDPGRERYDGEEVS 121 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 ++ + + + Y L P + Sbjct: 122 PVKLVSAEPVSTFSVDVD-TGAYANTRRFLRQGVTPPRDAVRTEEMINYFRYDYARPTTR 180 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + ++ + + + P + + Sbjct: 181 DVPFTTNIDVAKTPWNEDTYLMRIGLRGYDIERDERPPANLV-----FLMDVSGSMGSPD 235 Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K+ ++ + L + + R+ + Y +N+ ++++ L Sbjct: 236 KLPLVQTALSGLAGELGEQD--------RVSIVVYAGAA--GLVLEPTNDTAKIRAALMS 285 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L+ +T + AY + VI TDG+ + + ++ L Sbjct: 286 LSAGGSTAGGAGIQLAYNIAEDNFIEGGVNR--------VILATDGDFNVGVSDRDAL-- 335 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKI 387 +++ E R+ G+ + ++ + ++ + + G + ++ + E + ++D++ Sbjct: 336 VEMVEKNRDRGITLTTLGFGTGNFNEAMMEQIANKGNGNYAYIDSALEAKKV---LSDEM 392 Query: 388 QEQSVRIAPN 397 IA + Sbjct: 393 SSTLFTIAKD 402 >gi|253568262|ref|ZP_04845673.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298385671|ref|ZP_06995229.1| BatA protein [Bacteroides sp. 1_1_14] gi|251842335|gb|EES70415.1| aerotolerance protein BatA [Bacteroides sp. 1_1_6] gi|298261812|gb|EFI04678.1| BatA protein [Bacteroides sp. 1_1_14] Length = 327 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 64/205 (31%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + ++ + + T Sbjct: 108 NRLEAAKDVAAEFINGRPNDNIGITLFAGETFTQCPLTVDHAVLLDMIHNIKCGLIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ L E Sbjct: 168 AVGMGVANAVTRL----------KDSKAKSKVIILLTDGTNNKGDIS-----PLTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP---------------------EGQDLLRKCTDSSGQFFAVNDSR 374 ++ G+++Y++ V + + L + + G +F + Sbjct: 213 KSFGIRVYTIGVGTNGMAPYPYPVGNTVQYINMPVEIDEKTLTQIAGTTDGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|111024162|ref|YP_707134.1| hypothetical protein RHA1_ro07212 [Rhodococcus jostii RHA1] gi|110823692|gb|ABG98976.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 25/222 (11%), Positives = 66/222 (29%), Gaps = 26/222 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S A ++ ++A + + + I Sbjct: 87 NRATVILVIDVSLSMKATDVEPTRLAAAQDAAKSFADGLTPGINLGLVAF---------- 136 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + N K ++ L E T T A+ + + + ++ Sbjct: 137 AGTASVLVSPTTNREASKVAIDNLQLSERTATGEAIFTSLQSIDT--LAAVLGGSDQAPP 194 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA--------------PP 351 ++ ++DG+ + + ++ + I +++ P Sbjct: 195 ARIVLLSDGKQTVPENPDDPRGGFTAARQAKDKDVPISTISFGTSYGKVEIEDERIPVPV 254 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + L S G FF + EL + +D + ++I ++ R Sbjct: 255 DDPSLREIANLSGGSFFTASSLEELRDVYDTLEEQIGFETTR 296 >gi|41409533|ref|NP_962369.1| hypothetical protein MAP3435c [Mycobacterium avium subsp. paratuberculosis K-10] gi|81570936|sp|Q73UD3|Y3435_MYCPA RecName: Full=UPF0353 protein MAP_3435c gi|41398364|gb|AAS05985.1| hypothetical protein MAP_3435c [Mycobacterium avium subsp. paratuberculosis K-10] Length = 335 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 65/222 (29%), Gaps = 26/222 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ + ++ E+ + + AI Sbjct: 95 NRAVVMLVIDVSESMASTDVPPNRLAAAKEAGKQFADQLTPAINLGLVEF---------- 144 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + N VK+ ++ L P T T + A + + S G Sbjct: 145 AANATLLVPPTTNRAAVKAGIDSLQPAPKTATGEGIFTALQAIATV--GSVMGGGEGPPP 202 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--------------P 351 ++ +DG + + G++I +++ P Sbjct: 203 ARIVLESDGAENVPLDPNAPQGAFTAARAAKAEGVQISTISFGTPYGTVDYEGATIPVPV 262 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + Q L + C + GQ F + L + + +I ++V+ Sbjct: 263 DDQTLQKICEITDGQAFHADSLDSLKNVYSTLQRQIGYETVK 304 >gi|126341666|ref|XP_001379908.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2347 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 27/190 (14%), Positives = 67/190 (35%), Gaps = 21/190 (11%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + ++ A + VR+G + Y+ +N N+ Sbjct: 450 SGSIYPTDFQEMKAFLSEVIEMFTIAPYK-----VRVGAVQYSHIQELEFEINKYSNKND 504 Query: 262 VKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + ++ + NTNT A+ L K N + +I +TDG + Sbjct: 505 LGKAIDNIWQLGGNTNTGAALDFTLGLLQRAKTQRGNK-----VPCHLIVLTDGMSDD-- 557 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 N L+ + +++ + +Y++ V + L + + + + V + L + Sbjct: 558 ------NVLEPAKKLKDENINVYAIGV--KEANRTQLLEIAGTEKRVYYVYNFDSLKDIK 609 Query: 381 DKITDKIQEQ 390 +++ I + Sbjct: 610 NQVVQGICSK 619 Score = 76.1 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 63/160 (39%), Gaps = 13/160 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR G + Y+ + ++V S L +P T T A+ + Sbjct: 846 DVAKDRVRFGALKYSYDPTILFYLDEFDTRSKVISLLQNDSPKGGDTYTAKALAFSEHMF 905 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + ++ + +I ITDGE+ A+ + +R+ G+ I +V + Sbjct: 906 TE----ARGSRINQKVPQVLIVITDGESHDANQLN------ATAKALRDKGILILAVGI- 954 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 A ++LL S+ ++F V L F ++D I Sbjct: 955 AGANTEELL-AMAGSTDKYFFVETFGGLKGIFQNVSDSIC 993 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 40/357 (11%), Positives = 101/357 (28%), Gaps = 34/357 (9%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + + I++ ++ + +KA Sbjct: 845 ADVAKDRVRFGALKYSYDPTILFYLDEFDTRSKVISLLQNDSPKGGDTYTAKALAFSEHM 904 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED-----LYLQKHNDN 171 +G + L E N + L + + Sbjct: 905 FTEARGSRINQKVPQVLIVITDGESHDANQLNATAKALRDKGILILAVGIAGANTEELLA 964 Query: 172 NNMTSNKYLLPPPPKK-----SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 +++KY S + + +L S + +K Sbjct: 965 MAGSTDKYFFVETFGGLKGIFQNVSDSICGPSKVECKMEKADLVFLLDGSNSIYPENFKK 1024 Query: 227 A---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN-KLNPYENTN 276 + V IG ++ + EV +++ + NT+ Sbjct: 1025 MKDFLVSVVDDFDIGPSRVHIGLAQFSHVYRAEFFLGSFTSEGEVSTQIEMTQQVFGNTH 1084 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ ++ + + +++ ++ +TDG++ + E +R Sbjct: 1085 IGAALK----QVEQYFRPEMGSRINVGIQQVLLVLTDGQSQD--------EVAKAAEDLR 1132 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G+ IYS+ + + L + + +S + +++ EL + +I I R Sbjct: 1133 RKGIDIYSLGIG--DVDEQQLIQISGTSDKKLTIDNFDELKKIKKRIVRNICTPHGR 1187 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 52/161 (32%), Gaps = 17/161 (10%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V++G + ++ +E+ +++++ + T T A+ + Sbjct: 661 GEDQVQVGIVQFSDVNKEEFQLNRYWTQHEIFDAIDRMSNIDRETLTGSALKFVSDYFHP 720 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 K + KF+I ITDGE+ +R G+ IYSV V Sbjct: 721 SKGARPGVR------KFLILITDGESQDPVKDP--------AMALRQDGVIIYSVGV--Y 764 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + L + + F V L D + I Sbjct: 765 GANETQLVEISGKPEMIFYVETFDILKHIEDDLVFGICNPR 805 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 48/139 (34%), Gaps = 14/139 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN-TYPAMHHAY 285 + + K +RIG + Y N +EV + KL+P T AM Sbjct: 263 SSFDVKENCMRIGLVTYTDETKVIHSLSTGTNKSEVLQEIQKLSPKAGRAYTGAAMTKVR 322 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 +E+++ ++ S + + I + + N + +R G+ +++V Sbjct: 323 KEVFSVQKGSRRM---QGVPQIAILVAH--------RPSEDNVSEAALDLRREGVTVFTV 371 Query: 346 AVSAPPEGQDLLRKCTDSS 364 + L + + Sbjct: 372 GL--EGSDDTQLGQISSHP 388 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 14/157 (8%), Positives = 48/157 (30%), Gaps = 17/157 (10%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN-KLNPYENT-NTYPAMHHAYRELYNEKES 294 RI Y+ + N + + L + A+ ++ ++ + Sbjct: 64 RIALAQYSDDLHQEFLLSPFMTKNAILNHLKRNFTYMGGSLRIGNALEKVHKTYFSGPMN 123 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + + V+ + + + + ++ G+KI ++ + + Sbjct: 124 GRDK--NQFPQVLVVLTS---------AHSEDDVEGPAKALQRDGVKIITLGMQ--NASE 170 Query: 355 DLLRKCTDSSGQF--FAVNDSRELLESFDKITDKIQE 389 + L+ + + V D + I ++ + Sbjct: 171 ENLKTMATAQFHYNLRTVRDVGTFSTNMTSIIKEVAK 207 >gi|303235701|ref|ZP_07322308.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484148|gb|EFL47136.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 322 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 56/158 (35%), Gaps = 35/158 (22%) Query: 254 PLSNNLNEVKSRLNKLNPY--------ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P++ + + + L + + T + +A L + KE S Sbjct: 143 PMTTDHASLLNLLAGIRADLSVNHLIQDGTAIGMGLANAVGRLKDVKEGS---------- 192 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------GQD 355 K VI +TDG N+ L R G+++Y++ + + Sbjct: 193 KVVILLTDGSNNVGDIS-----PLTAASIARKFGVRVYTIGLGTDGKDIQGRPVGEIDYK 247 Query: 356 LLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 L+ + G+F+ EL + + I DK+++ + Sbjct: 248 TLQDIAMQTDGEFYRAQSRAELSQIYKDI-DKLEKTKI 284 >gi|296170658|ref|ZP_06852233.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295894647|gb|EFG74381.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 335 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 54/152 (35%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N + K L+KL + T T + A + + + G ++ +DG+ Sbjct: 155 TTNRDSTKRALDKLQFADRTATGEGIFTALQAIATV--GAVIGGGDAPPPARIVLFSDGK 212 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + N ++ G+ I +++ P P + L+K Sbjct: 213 ETMPTNPDNPKGAFTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETLKKVAQ 272 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G + +EL + + +I ++++ Sbjct: 273 LSGGNAYNAATLQELKSVYATLQQQIGYETIK 304 >gi|291008772|ref|ZP_06566745.1| hypothetical protein SeryN2_29978 [Saccharopolyspora erythraea NRRL 2338] Length = 324 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 23/220 (10%), Positives = 64/220 (29%), Gaps = 26/220 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S A +++ +A + + I +G +++ Sbjct: 85 NRATVMLTVDVSLSMKATDVEPNRLEAAKVAAKEFADQLTPGIN--------LGLVSFAG 136 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + VK ++ L E T T ++ A + + S Sbjct: 137 TATVLVMP--TTDRASVKQAIDNLKLSEATATGDGINAAMSAIDS--FGKMVGGPSGAPP 192 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------------- 351 ++ + DG + + + A + I +++ Sbjct: 193 ARIVLMADGGQTIPRELDAPRGAYTKAQEAKKANIPISTISFGTKHGSIEIEGEQEFVEV 252 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + + + S G+F + +L E + + ++I + Sbjct: 253 DDEAMQEIARLSGGEFHKAASAEQLREVYATLGEQIGYEI 292 >gi|326792960|ref|YP_004310781.1| von Willebrand factor A [Clostridium lentocellum DSM 5427] gi|326543724|gb|ADZ85583.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 903 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 49/353 (13%), Positives = 115/353 (32%), Gaps = 35/353 (9%) Query: 45 VSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYI 104 V + + T++ +FK +K + + ++ + L Sbjct: 139 VDEINFMELTSQAHNKRVVFKTTVKADGSSKGGFSRYSSSNVAASSADVVVVTQDGLTVT 198 Query: 105 AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY 164 +K + +K + + + +++RS + ++ L Sbjct: 199 KTAKELAAKNVWEIEVKVEGKNVVLQEATDVVLVLDRSGSMGQGVVDKNNPNAQKCTVLT 258 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 N + ++ Y K + T A + + D + +++ ++++ Sbjct: 259 CTNSNRWHRHNADCYDEEYYILKCTQNHTHTLPGDFIANSCYVSRADKVKDASYTFLDTL 318 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 Q+ V I + Y + L + + + L + TNT + A Sbjct: 319 QE------KEDVNISVVTYAGTASKVTNSNLKSGIESAYNVLG----TDGTNTGRGIEIA 368 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + L ST K ++ ++DGE++ ++ +N G +Y+ Sbjct: 369 SQILS----------NSTAPNKMIVVLSDGESNAGNS-------RTAANSAKNKGCIVYT 411 Query: 345 VAVS--APPEGQDLLRKCTD-----SSGQFFAVNDS-RELLESFDKITDKIQE 389 + + G L C + +F+ +D+ L E F +I +IQE Sbjct: 412 IGAGIASGSNGAKELFDCASVDQSTNKAKFYLADDTGNALNEIFAEIAGEIQE 464 >gi|90577284|ref|ZP_01233095.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] gi|90440370|gb|EAS65550.1| hypothetical protein VAS14_09574 [Vibrio angustum S14] Length = 321 Score = 80.7 bits (197), Expect = 4e-13, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 72/207 (34%), Gaps = 47/207 (22%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + ++ + + + + R+G + + TPL+ + Sbjct: 101 VTKNGQSIDRLTAVKHVLSDFIEKRK---------GDRLGLVLFADHAYLQ--TPLTFDR 149 Query: 260 NEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V+ +L++ L +T + A + S ++ +I ++DG N Sbjct: 150 KTVEQQLDRTVLGLIGQSTAIGEGLGIATKTFI----------NSKAPQRVIILLSDGAN 199 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRK 359 + ++ L+ + + +G+KIY+V V A + L + Sbjct: 200 TSGV-----IDPLEAAKLAKESGVKIYTVGVGADQMVQQGFFGDRIVNPSQDLDEKTLTE 254 Query: 360 CTD-SSGQFFAVNDSRELLESFDKITD 385 + G++F + ++L + +D I Sbjct: 255 IAKMTGGEYFRARNPQQLEKIYDIINK 281 >gi|308375589|ref|ZP_07444436.2| membrane protein [Mycobacterium tuberculosis SUMu007] gi|308345800|gb|EFP34651.1| membrane protein [Mycobacterium tuberculosis SUMu007] Length = 327 Score = 80.3 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 55/152 (36%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N K+ L+KL + T T A+ A + + + G T ++ +DG+ Sbjct: 147 TTNREATKNALDKLQFADRTATGEAIFTALQAIATV--GAVIGGGDTPPPARIVLFSDGK 204 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + N ++ G+ I +++ P P + ++K Sbjct: 205 ETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQ 264 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G + EL + + +I ++++ Sbjct: 265 LSGGNSYNAATLAELRAVYSSLQQQIGYETIK 296 >gi|86144576|ref|ZP_01062908.1| hypothetical protein MED222_09203 [Vibrio sp. MED222] gi|85837475|gb|EAQ55587.1| hypothetical protein MED222_09203 [Vibrio sp. MED222] Length = 330 Score = 80.3 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 68/204 (33%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 ++ + + + + R+G + + TPL+ + + + Sbjct: 113 NGEYIDRLSAVKHVLSDFIERRK---------GDRVGLVLFADHAYLQ--TPLTLDRDTL 161 Query: 263 KSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +LN+ L T + A + + S ++ +I ++DG N+ Sbjct: 162 SQQLNQAVLKLIGTQTAIGDGIGLATKTFVD----------SDAPQRVMILLSDGSNTAG 211 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------EGQDLLRKCT 361 L+ L+ + + IY+V V A + + L+ Sbjct: 212 V-----LDPLEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAK 266 Query: 362 DSSGQFFAVNDSRELLESFDKITD 385 + GQ+F DS+EL +D I Sbjct: 267 RTGGQYFRARDSKELATIYDTINQ 290 >gi|190339201|gb|AAI63867.1| Matn4 protein [Danio rerio] Length = 944 Score = 80.3 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 65/174 (37%), Gaps = 20/174 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+ + + R+G + Y+ + + ++ +E+K + Sbjct: 727 NFELVKQFVNQVVDQL-----DVSAKGTRVGLVQYSSRVRTEFPLSMYHSKDEIKKAVMN 781 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + T T A+ H ++E E + + + + TDG + + Sbjct: 782 VEYMEKGTMTGLALKHMVENSFSEAEGARPAEKNI--PRVGLVFTDGRSQD--------D 831 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 + + + AG+ +Y+V V +D LR+ FF D + + Sbjct: 832 IQEWAKKAKEAGITMYAVGVGKAV--EDELREIASDPVEKHFFYSADFTAISQI 883 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 48/147 (32%), Gaps = 18/147 (12%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKES 294 RIG + Y+ + + ++ +N++ P T T A+ +A ++ +E Sbjct: 72 TRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEEG 131 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + + +TDG A +G++IY+V V Sbjct: 132 AR-----PNVPHVAVIVTDGRPQDRVAEVAAAARE--------SGIEIYAVGV--ARADM 176 Query: 355 DLLRKCTDSS--GQFFAVNDSRELLES 379 LR F V + + Sbjct: 177 TSLRAMASPPFEDHVFLVESFDLIHQF 203 >gi|56797871|emb|CAG27569.1| matrilin-4 [Danio rerio] Length = 644 Score = 80.3 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 65/174 (37%), Gaps = 20/174 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+ + + R+G + Y+ + + ++ +E+K + Sbjct: 427 NFELVKQFVNQVVDQL-----DVSAKGTRVGLVQYSSRVRTEFPLSMYHSKDEIKKAVMN 481 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + T T A+ H ++E E + + + + TDG + + Sbjct: 482 VEYMEKGTMTGLALKHMVENSFSEAEGARPAEKNI--PRVGLVFTDGRSQD--------D 531 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 + + + AG+ +Y+V V +D LR+ FF D + + Sbjct: 532 IQEWAKKAKEAGITMYAVGVGKAV--EDELREIASDPVEKHFFYSADFTAISQI 583 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 48/147 (32%), Gaps = 18/147 (12%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKES 294 RIG + Y+ + + ++ +N++ P T T A+ +A ++ +E Sbjct: 59 TRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEEG 118 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + + +TDG A +G++IY+V V Sbjct: 119 AR-----PNVPHVAVIVTDGRPQDRVAEVAAAARE--------SGIEIYAVGV--ARADM 163 Query: 355 DLLRKCTDSS--GQFFAVNDSRELLES 379 LR F V + + Sbjct: 164 TSLRAMASPPFEDHVFLVESFDLIHQF 190 >gi|56797865|emb|CAG27566.1| matrilin-4 [Danio rerio] Length = 726 Score = 80.3 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 65/174 (37%), Gaps = 20/174 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+ + + R+G + Y+ + + ++ +E+K + Sbjct: 509 NFELVKQFVNQVVDQL-----DVSAKGTRVGLVQYSSRVRTEFPLSMYHSKDEIKKAVMN 563 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + T T A+ H ++E E + + + + TDG + + Sbjct: 564 VEYMEKGTMTGLALKHMVENSFSEAEGARPAEKNI--PRVGLVFTDGRSQD--------D 613 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 + + + AG+ +Y+V V +D LR+ FF D + + Sbjct: 614 IQEWAKKAKEAGITMYAVGVGKAV--EDELREIASDPVEKHFFYSADFTAISQI 665 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 48/147 (32%), Gaps = 18/147 (12%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKES 294 RIG + Y+ + + ++ +N++ P T T A+ +A ++ +E Sbjct: 59 TRIGAVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEEG 118 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + + +TDG A +G++IY+V V Sbjct: 119 AR-----PNVPHVAVIVTDGRPQDRVAEVAAAARE--------SGIEIYAVGV--ARADM 163 Query: 355 DLLRKCTDSS--GQFFAVNDSRELLES 379 LR F V + + Sbjct: 164 TSLRAMASPPFEDHVFLVESFDLIHQF 190 >gi|56797863|emb|CAG27565.1| matrilin-4 [Danio rerio] Length = 944 Score = 80.3 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 65/174 (37%), Gaps = 20/174 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+ + + R+G + Y+ + + ++ +E+K + Sbjct: 727 NFELVKQFVNQVVDQL-----DVSAKGTRVGLVQYSSRVRTEFPLSMYHSKDEIKKAVMN 781 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + T T A+ H ++E E + + + + TDG + + Sbjct: 782 VEYMEKGTMTGLALKHMVENSFSEAEGARPAEKNI--PRVGLVFTDGRSQD--------D 831 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 + + + AG+ +Y+V V +D LR+ FF D + + Sbjct: 832 IQEWAKKAKEAGITMYAVGVGKAV--EDELREIASDPVEKHFFYSADFTAISQI 883 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 48/147 (32%), Gaps = 18/147 (12%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKES 294 RIG + Y+ + + ++ +N++ P T T A+ +A ++ +E Sbjct: 72 TRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEEG 131 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + + +TDG A +G++IY+V V Sbjct: 132 AR-----PNVPHVAVIVTDGRPQDRVAEVAAAARE--------SGIEIYAVGV--ARADM 176 Query: 355 DLLRKCTDSS--GQFFAVNDSRELLES 379 LR F V + + Sbjct: 177 TSLRAMASPPFEDHVFLVESFDLIHQF 203 >gi|47087209|ref|NP_998714.1| matrilin-2 [Danio rerio] gi|45827653|gb|AAS78465.1| matrilin-4-like protein [Danio rerio] Length = 821 Score = 80.3 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 65/174 (37%), Gaps = 20/174 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+ + + R+G + Y+ + + ++ +E+K + Sbjct: 604 NFELVKQFVNQVVDQL-----DVSAKGTRVGLVQYSSRVRTEFPLSMYHSKDEIKKAVMN 658 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + T T A+ H ++E E + + + + TDG + + Sbjct: 659 VEYMEKGTMTGLALKHMVENSFSEAEGARPAEKNI--PRVGLVFTDGRSQD--------D 708 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 + + + AG+ +Y+V V +D LR+ FF D + + Sbjct: 709 IQEWAKKAKEAGITMYAVGVGKAV--EDELREIASDPVEKHFFYSADFTAISQI 760 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 48/147 (32%), Gaps = 18/147 (12%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKES 294 RIG + Y+ + + ++ +N++ P T T A+ +A ++ +E Sbjct: 72 TRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEEG 131 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + + +TDG A +G++IY+V V Sbjct: 132 AR-----PNVPHVAVIVTDGRPQDRVAEVAAAARE--------SGIEIYAVGV--ARADM 176 Query: 355 DLLRKCTDSS--GQFFAVNDSRELLES 379 LR F V + + Sbjct: 177 TSLRAMASPPFEDHVFLVESFDLIHQF 203 >gi|333030669|ref|ZP_08458730.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] gi|332741266|gb|EGJ71748.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] Length = 328 Score = 80.3 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 60/206 (29%), Gaps = 41/206 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + + N PL+ + + + + + T Sbjct: 108 NRLEAAKDVAAKFINDRPNDNIGITLFAGESFTQCPLTIDHTALLNLFGNIQTGVIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ L E Sbjct: 168 AIGMGVSNAVARL----------KDSQAKSKVIILLTDGSNNAGDIS-----PLTSAEIA 212 Query: 336 RNAGMKIYSVAVSAPPE---------------------GQDLLRKCT-DSSGQFFAVNDS 373 + G+++Y+V + + L+ + G +F D+ Sbjct: 213 KTYGIRVYTVGIGTRGTAPYPIQTMTGAIQRIQVEVDIDEPTLKDIARTTGGVYFRATDN 272 Query: 374 RELLESFDKITD-KIQEQSVRIAPNR 398 L + + +I + + +V+ R Sbjct: 273 TSLQDIYQEIDQLEKTKLNVKEYSKR 298 >gi|323135758|ref|ZP_08070841.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322398849|gb|EFY01368.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 588 Score = 80.3 bits (196), Expect = 4e-13, Method: Composition-based stats. Identities = 53/426 (12%), Positives = 116/426 (27%), Gaps = 80/426 (18%) Query: 44 IVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY 103 + SD ++ + K + K + + + ++ P Sbjct: 164 LSSDGSVTKISALKTAAKSFVDTMFAKAPDRVQFSVTPFAGAVVAVDPTVAANRTLPWID 223 Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 +Q+ + N L PS + N +D E Sbjct: 224 TEGDNSQHWLVFGN---GSLTPSTAKAAAAAQGFNNRFDIFNKLKQRNSAMDWRGCFEAP 280 Query: 164 YLQKHNDNNN-------MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR-----KID 211 K+ + YL P P + + D Sbjct: 281 AYPKNVQDIVVSSSDPETQFVPYLAPDEPSGYDNNNYIDDNGGVTTRTYYGSTTTYTCSD 340 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSV----RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 S L + + K+ + G A+ + L+ + +K++++ Sbjct: 341 TASGSWSKLTHVCKYKPTAAKSGNYGPTSFFGPNAFCPDHTTQRLLQLTTSQTTIKNKID 400 Query: 268 KLNPYENTNTYPAMHHAYRELYNE---------------------------KESSHNTIG 300 +L NTN +R + + NT+ Sbjct: 401 QLVANGNTNLQEGFMWGWRTISPNGPFAAGRPYATSNNRKVMVFMTDGFNHWGAYPNTVV 460 Query: 301 STRLKKFVIFITDGENSGASAYQNTLN-----------------------------TLQI 331 + + + +GE + + + TLQ Sbjct: 461 GSDYEALGYYTYNGEKNLRLPDGSRGDRVDYQNALKAARNSNSSYLATARDAQDELTLQA 520 Query: 332 CEYMRNAGMKIYSVAVSA-----PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 C +NAG++++++ S +G +LL+ C + +FAV ++ +L +F I Sbjct: 521 CTNAKNAGVEVFTIGFSTSTDPIDAQGLELLKSCATNVDHYFAVENANQLNAAFSSIGIG 580 Query: 387 IQEQSV 392 + + + Sbjct: 581 LGKLRL 586 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 21/215 (9%), Positives = 54/215 (25%), Gaps = 27/215 (12%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 + + + D R+ ++ A D AVL+ + + + T + Sbjct: 28 FGLGLVPVMFMLGATADYTRYATTRSALRQATDVAVLTVASKLTATTTDAQAKAQAQVIL 87 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + +IT + + ++ I + + Sbjct: 88 NA---------------------QPRMSTASITTASIATTKQTFCATSEVTIQNSFMQMA 126 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY----LQKHNDNNNMTSN 177 + +LT + ++ N + +V+D S SM + Sbjct: 127 RVT--SLTPSVTSCADLAWGANPNATYEVALVVDNSGSMLSSDGSVTKISALKTAAKSFV 184 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 + P + +S + Sbjct: 185 DTMFAKAPDRVQFSVTPFAGAVVAVDPTVAANRTL 219 >gi|269962784|ref|ZP_06177125.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269832474|gb|EEZ86592.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 353 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 66/206 (32%), Gaps = 47/206 (22%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + + V K R+G + + TPL+ + Sbjct: 134 NDNGEYIDRLTTVKRVLSDFVE---------KRQGDRLGVVLFGDHAYLQ--TPLTADRK 182 Query: 261 EVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V ++N+ + T + + + S ++ +I ++DG N+ Sbjct: 183 TVMQQINQTVIGLVGQRTAIGDGIGLGTKTFVD----------SDAPQRVMILLSDGSNT 232 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------EGQDLLRK 359 L+ L+ E + IY+V V A + Q L + Sbjct: 233 AGV-----LDPLEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVNTASDLDEQTLTKI 287 Query: 360 CTDSSGQFFAVNDSRELLESFDKITD 385 + G++F D++EL +D I Sbjct: 288 AEMTGGKYFRARDAKELETIYDTINQ 313 >gi|91773457|ref|YP_566149.1| von Willebrand factor, type A [Methanococcoides burtonii DSM 6242] gi|91712472|gb|ABE52399.1| hypothetical protein with von Willebrand factor type A domain and Invasin domain [Methanococcoides burtonii DSM 6242] Length = 892 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 54/145 (37%), Gaps = 15/145 (10%) Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 +S N + + + ++ + T AM A L N + + KK Sbjct: 672 DRQPVSLNISGNKDLLHNAIDSMVADGGTAIGDAMADANNLLINGRPDA---------KK 722 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTD-SS 364 +I +TDG + + + ++IYS+ + + + +L++ + Sbjct: 723 IMIVLTDGVATAG----SDRDGSDAISTANLNNIRIYSIGLGSSEYIDEPMLKRIASETG 778 Query: 365 GQFFAVNDSRELLESFDKITDKIQE 389 G ++ EL ++ I+ +I + Sbjct: 779 GSYYNAPSGSELQTVYNTISKEISD 803 >gi|313674519|ref|YP_004052515.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312941217|gb|ADR20407.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 345 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 67/206 (32%), Gaps = 42/206 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP----YEN 274 N + + ++ + + + +PL+ + +K+++ ++ Sbjct: 125 NRLEAAKQVANDFIDGRFQDRIGLTIFSGEAYSLSPLTTDYKMLKNQITDIDFKMMEASG 184 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T A+ + S K +I ++DG+N+ + ++ + Sbjct: 185 TAIGSALAVGTNRM----------RESDSKSKVLILLSDGDNNAGN-----IDPETSAKL 229 Query: 335 MRNAGMKIYSVAVS------------------APPEGQDLLRKCTD-SSGQFFAVNDSRE 375 G+KIY++A+ L+ GQF+ D++ Sbjct: 230 ANAYGIKIYTIAIGKEGKVPYGKDFFGRTRYIENSMDVTGLKNIAKIGEGQFYRATDNQA 289 Query: 376 LLESFDKITD----KIQEQSVRIAPN 397 L E F I +I+E + + Sbjct: 290 LEEVFSIIDQYEKAEIKETRYKNTKD 315 >gi|312881786|ref|ZP_07741560.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] gi|309370537|gb|EFP98015.1| hypothetical protein VIBC2010_06474 [Vibrio caribbenthicus ATCC BAA-2122] Length = 323 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 66/205 (32%), Gaps = 47/205 (22%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + ++ + + R+G + + TPL+ + N Sbjct: 105 SSGNYIDRLTAVKNVVSQFAQQRK---------GDRLGLVLFADHAYLQ--TPLTLDRNT 153 Query: 262 VKSRLNKLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + ++N L + T + A + + S ++ +I ++DG N+ Sbjct: 154 ISEQVNSLVLQLIGQKTAIGEGIGLATKTFID----------SDAPQRVMILLSDGSNTS 203 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------EGQDLLRKC 360 L+ ++ + IY++ V A + + L+ Sbjct: 204 GV-----LDPIEAANIAKKYNATIYTIGVGAGEMMVKDFFMTRKVNTAQDLDEKTLMSIA 258 Query: 361 TDSSGQFFAVNDSRELLESFDKITD 385 + GQ+F +++EL +D I Sbjct: 259 KITGGQYFRARNAQELATIYDTINS 283 >gi|260781663|ref|XP_002585923.1| hypothetical protein BRAFLDRAFT_90333 [Branchiostoma floridae] gi|229270991|gb|EEN41934.1| hypothetical protein BRAFLDRAFT_90333 [Branchiostoma floridae] Length = 2692 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 34/391 (8%), Positives = 95/391 (24%), Gaps = 47/391 (12%) Query: 11 LFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKK 70 + AID ++ D +++ + Sbjct: 918 VATGAAIDFVRQNSY---------------TRGNGDRTSVPDLLVVVTSSASTDDVASAQ 962 Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN---------LFLK 121 + I + + + ++ Sbjct: 963 ETAEKEGITIYTVGVTNSVSFAELTSTAGSFSRVLRANDFSDLSAIRQPLHETICQAAFC 1022 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMV-LDVSRSMEDLYLQKHND--NNNMTSNK 178 G + + + + D ++ D T Sbjct: 1023 GDPGTPANGFQQGTYFEGNTVTFGCNFGFLLSGTDNTQCQADGSWSNPLPVCIAITTPAP 1082 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 P ++ + + + D++ N+V + + + R+ Sbjct: 1083 TQAPACNDFPLFNGTDLVFLLDGSGSVGSNNFDLVKTFTKNVVQNF-----DISETATRV 1137 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 + Y+ + EV + ++ ++ T T A+ + ++ Sbjct: 1138 AVVQYSDQFSTEFSLNAFSTKTEVYNAIDNISYLTGGTFTGFAIDFVMQSVFTSISGER- 1196 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 ++ +TDG ++ + + R G+ IY+V V + + L Sbjct: 1197 ----DGYPDLLVVVTDGLSTDDVSGP--------ADTARAQGVTIYAVGVGSDIDFNT-L 1243 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + + + V+D L+ ++ I Sbjct: 1244 EQIAGLTSRVSQVSDFSSLVTLSQTLSQDIC 1274 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 59/191 (30%), Gaps = 19/191 (9%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + ++ + N+ + S R+G + Y+ + Sbjct: 306 SGSIGTDNFKLVKSFTERMANNF-----DISPNSTRVGVVQYSNFPGTEFSLNAFTDKAA 360 Query: 262 VKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 V ++K++ T T A+ + + +I ITDG + Sbjct: 361 VLDAISKIDYNGGSTFTGAAIDFVRNNEFTS-----VNGDRDDVPNILIVITDGNPNDDV 415 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + NAG+ Y+V + + + +L++ G+ D +L Sbjct: 416 SGP--------AISANNAGITTYAVGIGSNVDQANLVQMTAGRPGRVLQAADFTDLTTVV 467 Query: 381 DKITDKIQEQS 391 + + + + Sbjct: 468 GTLQENVCDAV 478 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 43/325 (13%), Positives = 95/325 (29%), Gaps = 26/325 (8%) Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA 127 + S AG+ A + I Q AQ I S Sbjct: 2130 AAIDFVRTSTFSTPAGNRAGQPDFLIVVTD-GLSQDNVAVPAQTARNNGISIFAVGIGSE 2188 Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 + +L S L I+ L + + + P Sbjct: 2189 VDADTLLQIA--GTPSRTLQINDFAGLVNAEEQLASIVCNTITTTPPATTPTPAPSCSSI 2246 Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 + + +P + ++ + ++V + + + +G + Y+ I Sbjct: 2247 TPVDLVFLLDGSSSITSPNFQ---IVKDFTADVVRTFN-----VSSAATNVGLVQYSDTI 2298 Query: 248 VGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 + + V + + + NT T A+ ++ + Sbjct: 2299 RTEFFLNSFDTKSGVLNAIGNIGYLQGNTRTGAAIDFVRISSFSVPAGNRGNQPD----- 2353 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 ++I +TDG + + RN G+ I++V + + + LL S + Sbjct: 2354 YLIVVTDGLSQDDVVVP--------AQTARNDGISIFAVGIGSEIDFATLL-NIAGSPNR 2404 Query: 367 FFAVNDSRELLESFDKITDKIQEQS 391 +ND L + +++TD + + Sbjct: 2405 ILQINDFAGLANAQEQLTDIVCNLT 2429 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 22/172 (12%), Positives = 60/172 (34%), Gaps = 16/172 (9%) Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTY 278 A + + S R+ Y+ +++++ + + + ++ T T Sbjct: 37 FTQQTTAKFDISDGSTRVAVAQYSSTPQVEFNLNTNSDVDTLSNAIEQITYMNGDSTFTG 96 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ + ++ + + ++ +TDG+++ + R Sbjct: 97 FAIEFVRQSAFSSFNGARDDKPD-----IMVVVTDGQSADSVTSS--------AATAREQ 143 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 G+ +++V V G L+ + + +ND +L +S D I + Sbjct: 144 GVTMFAVGVGTGV-GLSELQDIAGYTDRVLQLNDFVQLAQSADTIQSTLCGL 194 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 68/191 (35%), Gaps = 20/191 (10%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + D++ + A +V+ Q R+G + ++ + + + Sbjct: 1644 SGSVTAVNFDLVKDFASGVVSEFQ-----ISTTETRVGVVQFSDTLRTEFFMSSFSTKQQ 1698 Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 V ++ ++ NT T A+ A S F+I +TDG + + Sbjct: 1699 VLQAISDIDYIQGNTLTGAAITFA-----TASSFSTPAGNRANFPDFMIVVTDGLSQDS- 1752 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 +Q + R+ G+ I++V V + LL+ T V D +LL + Sbjct: 1753 -------VVQPAQSARDQGITIFAVGVGNEVDFATLLQ-ITGVPEYILQVTDFSDLLAAQ 1804 Query: 381 DKITDKIQEQS 391 ++ + + Sbjct: 1805 LQVAEIACNLT 1815 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/331 (10%), Positives = 84/331 (25%), Gaps = 24/331 (7%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP 100 + + L+Q + + ++ + + + Sbjct: 1212 TDDVSGPADTARAQGVTIYAVGVGSDIDFNTLEQIAGLTSRVSQVSDFSSLVTLSQTLSQ 1271 Query: 101 LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR-- 158 A P G + +++ +S + + C V Sbjct: 1272 DICTACYCGNPGAPVNGYTRGGFFGGSTVTFGCNDGYLLQGASSAVCQADCTWSAVPPVC 1331 Query: 159 ----SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP-APANRKIDVL 213 TS+ P + ++ + Sbjct: 1332 TDVCDPNPCLNGGTCQAVAFTSSCTCPPMYEGDVCQRYSPCYNRNIELDLVFLLDGSGSV 1391 Query: 214 IESAGNLVNSIQ---KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 + ++V + R+G + Y+ N V + + Sbjct: 1392 TTANFDIVKEFTRRLANNFDISLADTRVGVVQYSDSPTLEFNLNSFNTNELVDLAIRNIQ 1451 Query: 271 PY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 TNT A+ S N + + +I +TDG+ ++ + + Sbjct: 1452 YQQGGTNTGQAIDFVR-----VNSFSANNGDRSDVPNVMIVVTDGQ--------SSDDVV 1498 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + RNAG+ +Y+V + + +LL+ Sbjct: 1499 GPAQTARNAGISMYAVGIGNGVDTNELLQIA 1529 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 66/189 (34%), Gaps = 20/189 (10%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + ++ + ++V + + + +G + Y+ I + +E Sbjct: 2517 SSSITSPNFQIVKDFTADVVRTFN-----VSSAATNVGLVQYSDTIRTEFFLNSFDTKSE 2571 Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 V + + + NT T A+ ++ + ++I +TDG + Sbjct: 2572 VLNAIGNIGYLQGNTRTGAAIDFVRISSFSVPAGNRGNQPD-----YLIVVTDGLSQD-- 2624 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 L + R G+ I++V + + LL S + +ND L + Sbjct: 2625 ------EVLGPAQTARFEGINIFAVGIGNEIDFTTLLH-IAGSPNRVLQINDFAGLASAT 2677 Query: 381 DKITDKIQE 389 ++TD + Sbjct: 2678 GQLTDIVCN 2686 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/337 (9%), Positives = 83/337 (24%), Gaps = 25/337 (7%) Query: 48 RTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES 107 + I + + A + A + + Sbjct: 407 TDGNPNDDVSGPAISANNAGITTYAVGIGSNVDQANLVQMTAGRPGRVLQAADFTDLTTV 466 Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 + + G + + + + + + +V VS + Sbjct: 467 VGTLQENVCDAVYCGDPGTPTNGFQVGTYFENDLVTWGCNLGYQLVGAVSSVCQGNGAWT 526 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP----APAPANRKIDVLIESAGNLVNS 223 + P + + + + ++ ++ + Sbjct: 527 N--AVPTCVATTTPAPTAAPVCANLAYPGADLVFLLDGSGSIGTDNFQLVKAFTKEVIRN 584 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMH 282 + R+G + Y+ I N +E+ + ++ + T T A+ Sbjct: 585 FAISPTA-----TRVGLLQYSDTIDNEFFMNEFNTRDELYTAVDNVVYKTGGTFTGFAVE 639 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 + + + +I +TDG + + G+ I Sbjct: 640 FTRQIAFRTSAGTR-----DNYPDILIVVTDGN--------SEDVVTSAVASAIDQGILI 686 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 Y+V V + + LL + + V+D L + Sbjct: 687 YAVGVGSNVDFASLLELTGGVNSRVLQVSDFTGLSTA 723 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 20/118 (16%), Positives = 44/118 (37%), Gaps = 14/118 (11%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 NT T A+ ++ + F+I +TDG + A + Sbjct: 2125 NTLTGAAIDFVRTSTFSTPAGNRAGQPD-----FLIVVTDGLSQDNVAVP--------AQ 2171 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 RN G+ I++V + + + LL+ + + +ND L+ + +++ + Sbjct: 2172 TARNNGISIFAVGIGSEVDADTLLQ-IAGTPSRTLQINDFAGLVNAEEQLASIVCNTI 2228 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 37/401 (9%), Positives = 90/401 (22%), Gaps = 38/401 (9%) Query: 12 FITYAIDLAHIMYIRNQM---------------QSALDAAVLSGCASIVSDRTIKDPTTK 56 F +A++ + R ++ D + ++I I Sbjct: 633 FTGFAVEFTRQIAFRTSAGTRDNYPDILIVVTDGNSEDVVTSAVASAIDQGILIYAVGVG 692 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 + + + + + G + Y T Sbjct: 693 SNVDFASLLELTGGVNSRVLQVSDFTGLSTAAQTLPTVLCAAAYCGDPGAPTNGYRRGTF 752 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + L S+ + + + + LD + + Sbjct: 753 FVDSVVTFGCNDGYLLQGSSNTSCLGTGQWSNPVPVCLDRCDPN-PCFHGAACSVQGSSF 811 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYA-PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN-- 233 P + ++ K + + LV + Sbjct: 812 VCVCPPNYEGELCQFYTPCLNRSVEFDLVFLVDKSSSVGPANFELVKEFMYDFTNTFSVG 871 Query: 234 -LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYREL 288 RIG + + + + + + + T A+ + Sbjct: 872 LSDTRIGAVQFADAQTKDFDMDTFATKEQTLAGIQNIVYTDNQVGGVATGAAIDFVRQ-- 929 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + T + ++ +T + +T + E G+ IY+V V Sbjct: 930 ---NSYTRGNGDRTSVPDLLVVVT--------SSASTDDVASAQETAEKEGITIYTVGV- 977 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 L S + ND +L + + I + Sbjct: 978 TNSVSFAELTSTAGSFSRVLRANDFSDLSAIRQPLHETICQ 1018 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 45/119 (37%), Gaps = 14/119 (11%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P NT T A+ ++ L ++ +TDG + A Sbjct: 1952 PSRNTLTGAAIDFVRTSSFS-----IPAGNRLTLPDVLVVVTDGLSQDDVAGP------- 1999 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + R+ G+ IY+V + + + LL + +ND LL++ +++T+ + Sbjct: 2000 -AQIARDNGIAIYAVGIGSEVDFATLL-DIAGLQSRVLQINDFSSLLDAEEQLTEIVCN 2056 >gi|84385370|ref|ZP_00988402.1| hypothetical protein V12B01_16906 [Vibrio splendidus 12B01] gi|84379967|gb|EAP96818.1| hypothetical protein V12B01_16906 [Vibrio splendidus 12B01] Length = 319 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 68/204 (33%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 ++ + + + + R+G + + TPL+ + + + Sbjct: 102 NGEYIDRLSAVKHVLSDFIERRK---------GDRVGLVLFADHAYLQ--TPLTLDRDTL 150 Query: 263 KSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +LN+ L T + A + + S ++ +I ++DG N+ Sbjct: 151 SQQLNQAVLKLIGTQTAIGDGIGLATKTFVD----------SDAPQRVMILLSDGSNTAG 200 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------EGQDLLRKCT 361 L+ L+ + + IY+V V A + + L+ Sbjct: 201 V-----LDPLEAADIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDERTLMEIAK 255 Query: 362 DSSGQFFAVNDSRELLESFDKITD 385 + GQ+F DS+EL +D I Sbjct: 256 RTGGQYFRARDSKELATIYDTINQ 279 >gi|304312669|ref|YP_003812267.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] gi|301798402|emb|CBL46626.1| von Willebrand factor, type A protein [gamma proteobacterium HdN1] Length = 347 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 38/201 (18%), Positives = 67/201 (33%), Gaps = 49/201 (24%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ V+ + ++ + + RIG I + TPL+ + V++ LN Sbjct: 118 DRLTVIKSVVDDFISHRK---------NDRIGLILFGTQAYLQ--TPLTFDHKTVRTLLN 166 Query: 268 KLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + T A+ A + L K +I +TDG N+ S Sbjct: 167 ESRIGIAGGQTAIGDAIGLALKRL----------KNHKTGSKVLILLTDGANTAGSVS-- 214 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAP-------------------PEGQDLLRKCTD-SS 364 +Q E GMKIY+V V A + ++K + Sbjct: 215 ---PVQAAELAARQGMKIYTVGVGADEMRIPGVLGFGSQIVNPSADLDEVTMKKIASLTG 271 Query: 365 GQFFAVNDSRELLESFDKITD 385 Q+F ++ EL + I Sbjct: 272 AQYFRARNTDELRRIYQHIDK 292 >gi|113867117|ref|YP_725606.1| von Willebrand factor type A (vWA) domain-containing protein [Ralstonia eutropha H16] gi|113525893|emb|CAJ92238.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Ralstonia eutropha H16] Length = 566 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 56/162 (34%), Gaps = 14/162 (8%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 RI + Y G ++ + + +++L +T + AY+ + Sbjct: 227 RITLVTYASGTRVALPPTPGSDKTAISAAIDQLVAGGSTAGASGIALAYQAAQQSFIAGG 286 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 V+ TDG+ + + + E R +G+ + ++ + L Sbjct: 287 INR--------VLLATDGDFNVGV--TDFRQLKSMVEEKRKSGVSLSTLGFGTGNYNEQL 336 Query: 357 LRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + D+ G + +++ E + + +I IA + Sbjct: 337 MEQLADAGDGAYSYIDNLMEGNKV---LVSEISSTLATIARD 375 >gi|225012026|ref|ZP_03702463.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] gi|225003581|gb|EEG41554.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] Length = 334 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 38/245 (15%), Positives = 73/245 (29%), Gaps = 54/245 (22%) Query: 169 NDNNNMTSNKYLLPP-PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 + K S A ++ L A V+ Sbjct: 72 LAIARPQTVDISTRTKTNKGIDIVMAIDVSSSMLAQDLKPDRLSALKRVASAFVDD---- 127 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHH 283 LS RIG + Y TP++++ VK L +++ T + Sbjct: 128 -----RLSDRIGLVVYAGESY--TLTPITSDKGIVKGSLREISYQGLIEDGTAIGMGLAT 180 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + L S K +I +TDG N+ ++ E G+K Y Sbjct: 181 SVNRL----------KDSRAKSKVIILLTDGVNNSG-----FIDPKIATELAVEFGIKTY 225 Query: 344 SVAVSAPPE----------------------GQDLLRKCTD-SSGQFFAVNDSRELLESF 380 ++ + + + LL++ + G +F D+++L E + Sbjct: 226 TIGLGSNGTARAPVGILPNGSFQYAMTKVEIDEALLQEIATATGGIYFRATDNKKLEEIY 285 Query: 381 DKITD 385 ++I Sbjct: 286 EEINK 290 >gi|164688691|ref|ZP_02212719.1| hypothetical protein CLOBAR_02337 [Clostridium bartlettii DSM 16795] gi|164602167|gb|EDQ95632.1| hypothetical protein CLOBAR_02337 [Clostridium bartlettii DSM 16795] Length = 1508 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 65/202 (32%), Gaps = 15/202 (7%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 ++I + +SA VN + ++ + R+ + ++ N N++ Sbjct: 423 TDNTKKRITIAKDSAKQFVNQLFANNEDGSKSNNRVSVVIFSSSGYTNGILCSLKNVDNK 482 Query: 263 KSRLNKLN-----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 ++ ++ ++ P T+ AM A + L K+++ N +G+ Sbjct: 483 QTVIDAIDGISNNPTGGTDYDNAMTMAEQVLETVKDTTRNKAVLFMSDGAPENGYNGKTG 542 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR----------KCTDSSGQF 367 + E N G IY+V+ L + + Sbjct: 543 YDIYPDAFKAHEKSSEIKNNYGATIYTVSFGLKGSQYKELTEDRCRQILRDYMASNENCY 602 Query: 368 FAVNDSRELLESFDKITDKIQE 389 N +L +F I I++ Sbjct: 603 KNANSKEDLENAFTNIATAIRK 624 >gi|312126757|ref|YP_003991631.1| hypothetical protein Calhy_0520 [Caldicellulosiruptor hydrothermalis 108] gi|311776776|gb|ADQ06262.1| protein of unknown function DUF1355 [Caldicellulosiruptor hydrothermalis 108] Length = 909 Score = 80.3 bits (196), Expect = 5e-13, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 65/198 (32%), Gaps = 25/198 (12%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + K+++ +A +++ ++ + IA++ Sbjct: 415 SGSMGESNLGNINKLEIAKSAAAKMIDHLESSDSVG--------VIAFDHNFYWASKFGK 466 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + NEV ++ + T P + A L S K ++ +TD Sbjct: 467 LKSKNEVIENISGIQIGGGTAIIPPLTEAVNTL----------RKSKAKDKVIVLLTD-- 514 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 Y + +KI ++ V + L +SG+F+ V D+ Sbjct: 515 -----GYGEEGGYEYPASIAKRNNIKITTIGVGSSINAPILSWMAAYTSGRFYYVKDASN 569 Query: 376 LLESFDKITDKIQEQSVR 393 L++ F K I+ + ++ Sbjct: 570 LIDVFLKEAKIIKGKYIK 587 >gi|327542784|gb|EGF29248.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 264 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 36/301 (11%), Positives = 94/301 (31%), Gaps = 42/301 (13%) Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 D + + + + +F +L S + + +V+D Sbjct: 3 DDDEFGNAVRLTTLSLSNSPQPVFSPLF---PTMGTNLEIRPQRVAVSTQSTMDVALVID 59 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 S SM + + N Y P +R +D++ Sbjct: 60 RSGSMAYASDETPDPYVNPA----------------SAPPGWTYGDPVPPNSRWLDLV-A 102 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP---Y 272 S + + ++ Y ++ L++ E+ + L+ ++ Sbjct: 103 SVNAFNGFLVDSP-----QYEKLCLATY--SSTASRDCDLTHTYAEISNELDAISYQFDG 155 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+ + H L + + + ++ +TDG ++ + ++ + Sbjct: 156 GGTSVGYGLEHGLAVLTDATHAR------KFAVRVMVLMTDGHHNTGKSPESMMYH---- 205 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 ++N G+ ++++ S + + G+ F D+ +L +F KI K+ Sbjct: 206 --LQNHGVTLFTITFSDDADQSRMSNLANACGGENFHATDASQLQNAFQKIAKKLPSLMT 263 Query: 393 R 393 + Sbjct: 264 Q 264 >gi|83647467|ref|YP_435902.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83635510|gb|ABC31477.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 345 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 67/194 (34%), Gaps = 36/194 (18%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-- 270 L + ++ ++ + + + G PL+ +L V LN+ Sbjct: 113 LKGNQATRLDVVKSVVTDFIQVRQGDRLGLILFGAQPYIQAPLTYDLVTVGELLNEATLG 172 Query: 271 -PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T A+ + L S + ++ +TDG N+G ++ Sbjct: 173 IAGNATAIGDAIGLGIKRLRERPADS----------RVLVLLTDGANTGG-----EVSPE 217 Query: 330 QICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD-SSGQFFAVN 371 Q + +AG+KIY+V V A + LL+ D + GQ+F Sbjct: 218 QAAKLAADAGIKIYTVGVGADEIIRRGIFGYRKENPSADLDETLLQSIADETDGQYFRAR 277 Query: 372 DSRELLESFDKITD 385 ++ EL ++ I Sbjct: 278 NTGELELIYESINQ 291 >gi|149638912|ref|XP_001511941.1| PREDICTED: similar to Collagen alpha-1(XII) chain [Ornithorhynchus anatinus] Length = 3176 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ NE+ S + K+ NT T A+ + Sbjct: 220 SAFDVGEEKTRVGVVQYSSDTRTEFNLNQYYQRNELLSAIKKIPYKGGNTMTGDAIDYLI 279 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + K I ITDG++ +RN G++++S+ Sbjct: 280 KNTFTDAAGARIGF-----PKVAIVITDGKSQD--------EVEIPARELRNRGVEVFSL 326 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 327 GI--KAADAKELKQIASTPSLQHVFNVANFDAIVDIQNEIISQVCS 370 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 52/165 (31%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+I Y+ + + + L NT T A++ + Sbjct: 1282 DIGPKRVQIALAQYSGDPRTEWQLNAYKDKKNLLEAVANLPYKGGNTLTGMALNFIRQ-- 1339 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + + +R+ G+++++V + Sbjct: 1340 ---NNFRPQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLRDEGVELFAVGI- 1387 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 ++ L+ + V D L + +T + Sbjct: 1388 -KNADENELKAVATDPDDTHAYNVADFSFLSSIVEDLTANLCNSV 1431 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 63/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 525 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFV 584 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + S + + K +I ITDG+ ++ + +RN+ ++I++V V Sbjct: 585 ASKGSR-----SNVPKVMILITDGK--------SSDAFKEPAIKLRNSDVEIFAVGV--K 629 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L + V D ++F +I+ ++ + +RI Sbjct: 630 DAVRSELETIASPPAETHVYTVEDF----DAFQRISFELTQSICLRI 672 >gi|94498567|ref|ZP_01305122.1| hypothetical protein SKA58_08339 [Sphingomonas sp. SKA58] gi|94422010|gb|EAT07056.1| hypothetical protein SKA58_08339 [Sphingomonas sp. SKA58] Length = 678 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 35/331 (10%), Positives = 93/331 (28%), Gaps = 33/331 (9%) Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + ++ + P ++ + + ++ + + S Sbjct: 348 VITRTQYSNYVETKTDRSVQTPQKSWEYGKFPLTVSSFITGNAVANPTYDASVPTDSSGN 407 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS-FWSKNTTKSKYAPAPAPANRKID 211 V+ + + ++ D+N S P + ++ P ++ Sbjct: 408 VVASTSTWAGCIEERDTDSNITASTSTSSTPSNAQDLNIDALPDPTQKGTKWRPHWPDVE 467 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + G+ + K + + + N+ + + + S ++ L Sbjct: 468 FIRS--GSSSGTWLNTQSRKDGGWNACPSQSRRLTSYTNRTSTPTGQSSSFNSYIDNLIA 525 Query: 272 YENTNTYPAMHHAYRELYNE---KESSHNTIGSTRLKKFVIFITDGE------------- 315 T M R L + +++ + + ++F+TDG+ Sbjct: 526 VGGTYHDIGMLWGARFLSPKGIFASDNNSAPNGFNISRHIVFMTDGDMSAYQQVYGAYGY 585 Query: 316 ------------NSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--SAPPEGQDLLRKCT 361 + + +C ++ G+ I+ + + Q L+ C Sbjct: 586 QQLDARVAPGNTSDTDLTAIHNTRLQMLCNAIKAKGITIWVIGFRNQSEGNIQTPLQNCA 645 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 SS + D+ L + F I I V Sbjct: 646 TSSNHWTMAYDATSLSQKFKDIAKNIGGLRV 676 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 34/298 (11%), Positives = 77/298 (25%), Gaps = 65/298 (21%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLK 73 +D+ R +MQ A DAA L+G ++ + ++ KK + Sbjct: 39 GGGLDMGRAYMARARMQQACDAAALAGRRAMTT-------SSMTQANKDEAKKFFDFNFP 91 Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSL 133 QG++ Q A + A T + + LS+ Sbjct: 92 QGTF---------QAATFTPVIRSKPGETTTVQVTAS---TTMPTTVMKIFRYETLPLSV 139 Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 + + +VLD + SM Sbjct: 140 TCEARFDI----GNTDVMLVLDTTGSMAYAISDGK------------------------- 170 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 + ++ L ++ + +++ +R G + Y+ + Sbjct: 171 ----------GGSTTRLAALKQAVKDFYDTLGAGSNA--TGRIRYGFMPYSSTVNVGYQL 218 Query: 254 PLSN-----NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P + + + +L T T Y +++++ + Sbjct: 219 PTNYLVGGISGETWDYQTRRLLTTYGTATNETGSWIYTSGSVSAPTTYSSTSGGSASQ 276 >gi|187470892|sp|A6NF34|ANTRL_HUMAN RecName: Full=Anthrax toxin receptor-like; Flags: Precursor Length = 565 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 77/195 (39%), Gaps = 24/195 (12%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + + +L +++ + ++ ++R+ I Y+ G PL+++ N +K Sbjct: 78 YFILDKSGSVNNNWIDLYMWVEETVARFQSPNIRMCFITYSTD--GQTVLPLTSDKNRIK 135 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + L++L P +T A +++ + + ++ +I +TDGE Sbjct: 136 NGLDQLQKIVPDGHTFMQAGFRKAIQQIESFNSGN-------KVPSMIIAMTDGE----L 184 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 +TL+ + R G +Y++ V D + DS G FAV + F Sbjct: 185 VAHAFQDTLREAQKARKLGANVYTLGV--ADYNLDQITAIADSPGHVFAVENG------F 236 Query: 381 DKITDKIQEQSVRIA 395 + I + ++ Sbjct: 237 KALRSTIDALTSKVC 251 >gi|327269503|ref|XP_003219533.1| PREDICTED: collagen alpha-1(XXII) chain-like [Anolis carolinensis] Length = 1601 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 67/195 (34%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ Sbjct: 45 TSSSVGKEDFEKVRQWVANLVDTFE-----IGPDKTRVGVVRYSDRPTTEFDLGRYQTRE 99 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++K + NTNT A+ + ++E+ T + KK I +TDG + Sbjct: 100 QIKEAAKNIKYYGGNTNTGDALRYINTYSFSEEAGGRPTDSAI--KKVAILLTDGRSQDH 157 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 AG++I++V V ++ L + F V+D + Sbjct: 158 VLDP--------ATAAHKAGIRIFAVGVG--EALKEELDEIASEPKSAHVFHVSDYNAID 207 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 208 KIRGKLRRRLCENVL 222 >gi|261418348|ref|YP_003252030.1| von Willebrand factor A [Geobacillus sp. Y412MC61] gi|319767693|ref|YP_004133194.1| von Willebrand factor type A [Geobacillus sp. Y412MC52] gi|261374805|gb|ACX77548.1| von Willebrand factor type A [Geobacillus sp. Y412MC61] gi|317112559|gb|ADU95051.1| von Willebrand factor type A [Geobacillus sp. Y412MC52] Length = 1077 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 25/278 (8%), Positives = 72/278 (25%), Gaps = 31/278 (11%) Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 I + V+DVS SM + + + Y + ++ A Sbjct: 196 PPIDVVFVMDVSGSM-TAMKLQSAKSALQAAVNYFKSNYNQNDRFALIPFSDGVREASVV 254 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT---IAYNIGIVGNQCTPLSNNLNEV 262 K + ++N+ + + L++ + V Sbjct: 255 PFGKYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAKSYFTDPTRKKYIIFLTDGMPTV 314 Query: 263 KSRLNKLNP--------YENTNTY----PAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + ++ + + T ++ Y + + + + + Sbjct: 315 LNAVDTITYREVKQNFWGGYSYTGNRVTDSLSVTYELYSDGRTAGIRFTDNKGYSRRFYS 374 Query: 311 ITDGENSGASAYQNT--------------LNTLQICEYMRNAGMKIYSVAVSAPPE-GQD 355 +G + + + + + + +YS+ E D Sbjct: 375 DGQDYVNGWRVSWDNGYSFTYSSIEGKIRADATAVAKTLGMNNITLYSIGFGDNDEVDMD 434 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 LR + ++G ++ L F + + ++ Sbjct: 435 YLRSLSATAGGEARQGTTQNLTALFQQFSQLATTPAIT 472 >gi|290960274|ref|YP_003491456.1| lipoprotein [Streptomyces scabiei 87.22] gi|260649800|emb|CBG72916.1| putative lipoprotein [Streptomyces scabiei 87.22] Length = 537 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 39/319 (12%), Positives = 85/319 (26%), Gaps = 29/319 (9%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES-KAQYEIPTENLFLKGL 123 G E G+ + ++ + + A Y L GL Sbjct: 66 ADGFPAPAPDGPDSGEQRGEHQDDGSGDFAPAPDHLSTFALDVDTASYGYARRTLDDGGL 125 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 + + + +D +R+ + + PP Sbjct: 126 PDPSTVRPEEFVNSFRQDYERPDGDGFSVTVDGARTGPGNWSLVRVGLATRAAGDRQRPP 185 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 T ++D+ ++ + N ++ + + + Sbjct: 186 AAL--------TFVIDVSGSMAEPGRLDLAQDALRTMTNRLRDDDS--------VAVVTF 229 Query: 244 NIGIVGNQCTP-LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 + L N E++ + L P ++TN + Y Sbjct: 230 SDEAETVLPMTRLDGNRGEIREAVAGLEPTDSTNLAAGVETGYETAVEGLRKGATNR--- 286 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 V+ ++D + S +T+ E R G+ ++ V V + G L+ + D Sbjct: 287 -----VVLLSDALANTGSTDADTILERIAGER-REHGITLFGVGVGSD-YGDALMEQLAD 339 Query: 363 SS-GQFFAVNDSRELLESF 380 G V+ E + F Sbjct: 340 KGDGHTTYVSTEEEAEKVF 358 >gi|56421171|ref|YP_148489.1| hypothetical protein GK2636 [Geobacillus kaustophilus HTA426] gi|56381013|dbj|BAD76921.1| hypothetical protein [Geobacillus kaustophilus HTA426] Length = 960 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 25/278 (8%), Positives = 70/278 (25%), Gaps = 31/278 (11%) Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 I + V+DVS SM + + + Y + ++ A Sbjct: 79 PPIDVVFVMDVSGSM-TAMKLQSAKSALQAAVNYFKSNYNQNDRFALIPFSDGVREASVV 137 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT---IAYNIGIVGNQCTPLSNNLNEV 262 K + ++N+ + + L++ + V Sbjct: 138 PFGKYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAKSYFTDPTRKKYIIFLTDGMPTV 197 Query: 263 KSRLNKLNP--------YENTNTYP----AMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + ++ + T ++ Y + + + + Sbjct: 198 LNTVDTITYREVKPKFRGGYQYTGNRVTGSLSVTYELYSDGWTAGIRFTDNKGYSRQFYS 257 Query: 311 ITDGENSGASAYQNT--------------LNTLQICEYMRNAGMKIYSVAVSAPPE-GQD 355 +G + + + + + + +YS+ E D Sbjct: 258 DGQDYVNGWWVSWDNGYSFTYSSIERKIRADATAVAKTLGMNNITLYSIGFGDNDEVDMD 317 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 LR + ++G ++ L F + + ++ Sbjct: 318 YLRSLSATAGGEARQGTTQNLTALFQQFSQLATTPAIT 355 >gi|118617151|ref|YP_905483.1| hypothetical protein MUL_1490 [Mycobacterium ulcerans Agy99] gi|166979868|sp|A0PNU3|Y1490_MYCUA RecName: Full=UPF0353 protein MUL_1490 gi|118569261|gb|ABL04012.1| membrane protein [Mycobacterium ulcerans Agy99] Length = 335 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 55/152 (36%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N K+ L+KL + T T A+ A + + + G T ++ +DG+ Sbjct: 155 TTNREATKAALDKLQFADRTATGEAIFTALQAIATV--GAVIGGGDTPPPARIVLFSDGK 212 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + N ++ G+ I +++ P P + ++K Sbjct: 213 ETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQ 272 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G + EL + + +I +++R Sbjct: 273 LSGGNSYNAATLAELNSVYVSLQQQIGYETIR 304 >gi|53713710|ref|YP_099702.1| hypothetical protein BF2419 [Bacteroides fragilis YCH46] gi|60681981|ref|YP_212125.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|253565658|ref|ZP_04843113.1| BatA [Bacteroides sp. 3_2_5] gi|265764034|ref|ZP_06092602.1| BatA [Bacteroides sp. 2_1_16] gi|4838138|gb|AAD30858.1|AF116251_1 BatA [Bacteroides fragilis] gi|52216575|dbj|BAD49168.1| conserved hypothetical protein BatA [Bacteroides fragilis YCH46] gi|60493415|emb|CAH08201.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|251945937|gb|EES86344.1| BatA [Bacteroides sp. 3_2_5] gi|263256642|gb|EEZ27988.1| BatA [Bacteroides sp. 2_1_16] gi|301163419|emb|CBW22970.1| aerotolerance-related membrane protein [Bacteroides fragilis 638R] Length = 327 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 31/301 (10%), Positives = 80/301 (26%), Gaps = 51/301 (16%) Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 ++ T +L+ + + + ++ + + Sbjct: 23 IMKRKKTEPTLQISDARVYAHAPKSYKNYLLHVPFGLRIITLILIILVLARPQTTNSWQN 82 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 + S A +++ + A + N Sbjct: 83 SEIEGIDIMLAIDVSTSMLAEDLKPNRLEAAKDVAAEFI-----------NGRPNDNIGI 131 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTI 299 PL+ + + + + + T + +A L Sbjct: 132 TLFAGESFTQCPLTVDHAVLLNLFQGIQCDIIEDGTAVGMGIANAVTRL----------K 181 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------- 351 S K +I +TDG N+ L E ++ G+++Y++ V Sbjct: 182 DSKAKSKVIILLTDGTNNKGDIS-----PLTAAEIAKSFGIRVYTIGVGTNGMAPYPVRV 236 Query: 352 -------------EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD-KIQEQSVRIAPN 397 + + L + + G +F + +L E +++I + + +V+ Sbjct: 237 GGTTQYINTPVEIDEKTLTQIAGTTDGNYFRATSNSKLKEVYEEIDKLEKTKLNVKEYSK 296 Query: 398 R 398 R Sbjct: 297 R 297 >gi|254496635|ref|ZP_05109500.1| conserved hypothetical protein [Legionella drancourtii LLAP12] gi|254354157|gb|EET12827.1| conserved hypothetical protein [Legionella drancourtii LLAP12] Length = 342 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 36/213 (16%), Positives = 76/213 (35%), Gaps = 49/213 (23%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 ++ V+ +A V L +IG I + TPL+ + V R+ Sbjct: 114 ASRLTVVKNAAEQFVRD---------RLGDKIGLILFGSRAYLQ--TPLTYDRQTVLLRI 162 Query: 267 NKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + T+ A+ A + L + + +I +TDG N+ Sbjct: 163 EDATVGLAGKTTSIGDAVGLAVKRLDAVPQKG----------RVIILLTDGANNSGI--- 209 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPE-----------------GQDLLRKC-TDSSG 365 L L+ E ++ G+KIY++ + A + ++ L++ + G Sbjct: 210 --LEPLKAAELAKDEGIKIYTIGLGAATDPRALTNGFLMQAAAADLDEETLKEMSAMTGG 267 Query: 366 QFFAVNDSRELLESFDKITD--KIQEQSVRIAP 396 ++F D+ L + I ++ ++ + P Sbjct: 268 RYFRATDTATLNSIYKTINQLERVSQEQASVRP 300 >gi|327313515|ref|YP_004328952.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326945266|gb|AEA21151.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 318 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 25/147 (17%), Positives = 45/147 (30%), Gaps = 30/147 (20%) Query: 254 PLSNNLNEVKSRL----NKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P++ + + + L L T + +A L S Sbjct: 143 PMTLDHAALLNLLHGVRTDLVTSGLMQDGTAIGMGLANAVSRL----------KDSKAKS 192 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS------APPEGQDLLRK 359 K VI +TDG N+ S + R G++IY++ L+ Sbjct: 193 KIVILLTDGSNNAGSIS-----PMTAAAIARKFGIRIYTIGFGKETGEEIGAIDYKTLQD 247 Query: 360 CT-DSSGQFFAVNDSRELLESFDKITD 385 ++G+F+ EL + I Sbjct: 248 IAVSTNGEFYRAQSQAELSRIYQDIDK 274 >gi|47229020|emb|CAG09535.1| unnamed protein product [Tetraodon nigroviridis] Length = 680 Score = 80.0 bits (195), Expect = 6e-13, Method: Composition-based stats. Identities = 39/325 (12%), Positives = 81/325 (24%), Gaps = 33/325 (10%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 Y + G+ + + P + E+ N+F + Sbjct: 377 SYINKQYFSDANGNRGAAPNVAVVLVDGWPTDKVEEASRLARESGINIFFVTIEGPDGLE 436 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 S L VS K Sbjct: 437 KQKVVE-HDFVDKAVCRTSGFFSLPVSSWFALRK---LVQPLVKRVCDTDRLVCSKTCLN 492 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + + + VL + N+ + + + R+G + Y Sbjct: 493 ANDIAFVIDGSSSVGTGNFRTVL-QFVANITREFEISDTD-----TRVGAVQYTYEQRLE 546 Query: 251 QCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 N ++ + +++ T+T A+ +A +L++ +K +I Sbjct: 547 FSFSQYGNKADLLGAIKRISYWSGGTSTGAAISYASEQLFS--------KSKPNKRKIMI 598 Query: 310 FITDGENSGASAYQNTLNTLQI--CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-- 365 ITDG + Q C+++ QD L Sbjct: 599 VITDGRSYDDVRQPAAAVHRQGQRCDHLLR----------GHRWAAQDELEYIATDPDAE 648 Query: 366 QFFAVNDSRELLESFDKITDKIQEQ 390 F V++ L + +I I ++ Sbjct: 649 HSFFVDEFDNLYKFVPRIVSNICQE 673 >gi|332877593|ref|ZP_08445337.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332684442|gb|EGJ57295.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 333 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 64/181 (35%), Gaps = 43/181 (23%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRE 287 K + RIG + Y TP++ + + + L+++ + T + A Sbjct: 126 KDRPNDRIGLVIYAGESYTK--TPVTTDKLIILNALSEITYGQIEDGTAIGMGLATAVNR 183 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L S + +I +TDG N+ ++ E G+K+Y+V + Sbjct: 184 L----------KESKAKSRVIILLTDGVNNTG-----FIDPQTAAELAAEYGIKVYTVGI 228 Query: 348 SAP----------------------PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 + L++K + G++F ++++L + +D+I Sbjct: 229 GTNGMALSPYALNADGSIIYRMQQVDIDEPLMKKIAQVTKGRYFRATNNQKLQQIYDEIN 288 Query: 385 D 385 Sbjct: 289 Q 289 >gi|150007595|ref|YP_001302338.1| hypothetical protein BDI_0948 [Parabacteroides distasonis ATCC 8503] gi|255013876|ref|ZP_05286002.1| hypothetical protein B2_08207 [Bacteroides sp. 2_1_7] gi|256839782|ref|ZP_05545291.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|149936019|gb|ABR42716.1| conserved hypothetical protein BatA [Parabacteroides distasonis ATCC 8503] gi|256738712|gb|EEU52037.1| conserved hypothetical protein [Parabacteroides sp. D13] Length = 328 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 63/200 (31%), Gaps = 40/200 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + N PL+ + + + + + T Sbjct: 109 NRLEAAKDVAASFINGRPNDNIGLVVFSAESFTQCPLTTDHTVLLNLFKDIQSGMIQDGT 168 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A + S K +I +TDG N+ + + E Sbjct: 169 AIGLGLANAVSRI----------KDSHAKSKVIILLTDGSNNAG-----EIAPVTAAEIA 213 Query: 336 RNAGMKIYSVAVSAPP--------------------EGQDLLRKCTDS-SGQFFAVNDSR 374 + G+++Y++ V + L++ + GQ+F D+ Sbjct: 214 KTFGVRVYTIGVGTKGMAPYPFQTAFGVQYQNIPVEIDEATLKQIASTTGGQYFRATDNA 273 Query: 375 ELLESFDKITDKIQEQSVRI 394 L E + +I D++++ + + Sbjct: 274 SLKEIYSEI-DQMEKTKISV 292 >gi|126731914|ref|ZP_01747718.1| BatB protein, putative [Sagittula stellata E-37] gi|126707741|gb|EBA06803.1| BatB protein, putative [Sagittula stellata E-37] Length = 323 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 69/186 (37%), Gaps = 33/186 (17%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 ++ ++ E+A + ++ R+G + ++ PL+ + V+ L Sbjct: 115 ATRLSIVKETADDFISR---------RDGDRLGLVLFSDRAYLQA--PLTFDREAVRKLL 163 Query: 267 NKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++ + T A+ + + L + E + ++ +TDG N+ Sbjct: 164 DQAQVGLTGQKTAIGDAIAVSVKRLKDRPEDG----------RVLVLLTDGANNEGVMSP 213 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQDLLRKCTD-SSGQFFAVNDSRELLES 379 + + G++IY++ V + + LR+ D + G +F D + L + Sbjct: 214 DK-----AADLAAKLGIRIYTIGVGSARSRDLDERTLRQIADATGGAYFRATDVQGLAQI 268 Query: 380 FDKITD 385 + I Sbjct: 269 YRAIDR 274 >gi|261250853|ref|ZP_05943427.1| protein BatA [Vibrio orientalis CIP 102891] gi|260937726|gb|EEX93714.1| protein BatA [Vibrio orientalis CIP 102891] Length = 322 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 65/201 (32%), Gaps = 47/201 (23%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 ++ + + V R+G + + TPL+ + + + Sbjct: 108 YIDRLSAVKNVVSDFVKQ---------REGDRLGLVLFADHAYLQ--TPLTLDRETISDQ 156 Query: 266 LNKLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 +N L + T + A + + S ++ ++ ++DG N+ Sbjct: 157 VNSLVLRLIGDKTAIGEGIGLATKTFVD----------SEAPQRVMVLLSDGSNTSGV-- 204 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD-SS 364 L L+ + IY++ + A + L++ D + Sbjct: 205 ---LEPLEAARIAKKYNATIYTIGIGAGEMMVKEFFMTRKVNTAKDLDEKTLKQIADLTG 261 Query: 365 GQFFAVNDSRELLESFDKITD 385 GQ+F ++ EL +D I + Sbjct: 262 GQYFRARNADELATIYDTINN 282 >gi|319956579|ref|YP_004167842.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418983|gb|ADV46093.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 560 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 61/182 (33%), Gaps = 21/182 (11%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--EN- 274 G + +K + + + + P+S +K+ L L P Sbjct: 95 GGRFDIARKVLLDFIDRRPKDRIALEVFADYAYLAAPMSYEKKGLKTILAALEPGVVGGR 154 Query: 275 -TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T Y A+ R R + +I +TDG ++ + Sbjct: 155 DTALYEALFLGARLF---------KKEEGRSNRVMILLTDGIDTVGNIPL-----EAAIR 200 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 ++ A +++Y+V V + +L K + G+F+ L + +I D ++ + Sbjct: 201 ELKRAHIRVYTVGVG-DDFRRGVLEKIARSTGGRFYDARYPEALANIYRRI-DTLERTRI 258 Query: 393 RI 394 + Sbjct: 259 KT 260 >gi|224046761|ref|XP_002188607.1| PREDICTED: collagen, type XXII, alpha 1 [Taeniopygia guttata] Length = 1598 Score = 80.0 bits (195), Expect = 7e-13, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 69/200 (34%), Gaps = 22/200 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV + + R+G + Y+ Sbjct: 47 TSSSVGKEDFEKVRQWVSNLVETFE-----IGPDKTRVGVVRYSDRPSTEFDLGKYKTRE 101 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+K K+ NTNT A+ + ++ + + + +KK I +TDG + Sbjct: 102 EIKEAARKIQYYGGNTNTGDALRYITTYSFS--KEAGGRLSDRTVKKVAILLTDGRSQDF 159 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 AG++I++V V ++ L + F V+D + Sbjct: 160 VLDP--------ATAAHQAGIRIFAVGVG--EALKEELDEIASEPKSAHVFHVSDYNAID 209 Query: 378 ESFDKITDKIQEQSVRIAPN 397 + K+ ++ E + PN Sbjct: 210 KIRGKLRRRLCENV--LCPN 227 >gi|323493530|ref|ZP_08098652.1| hypothetical protein VIBR0546_14455 [Vibrio brasiliensis LMG 20546] gi|323312353|gb|EGA65495.1| hypothetical protein VIBR0546_14455 [Vibrio brasiliensis LMG 20546] Length = 322 Score = 79.6 bits (194), Expect = 7e-13, Method: Composition-based stats. Identities = 29/215 (13%), Positives = 70/215 (32%), Gaps = 47/215 (21%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 + K + ++ + + + R+G + + Sbjct: 94 SYSMSQKDMLSQDDYIDRLTAVKKVVSDFAQQ---------REGDRLGLVLFADHAYLQ- 143 Query: 252 CTPLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 TPL+ + N + ++ L + T + A + + S ++ + Sbjct: 144 -TPLTLDRNTIAKQVESLVLRLIGDKTAIGEGIGLATKTFID----------SDAPQRVM 192 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------P 351 I ++DG N+ L+ ++ + + IY++ V A Sbjct: 193 ILLSDGSNTSGV-----LDPIEAAKIAKKYNATIYTIGVGAGEMMVKEFFMTRKVNTAKD 247 Query: 352 EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 ++ L++ D + GQ+F ++ EL +D I Sbjct: 248 LDENTLQEIADLTGGQYFRARNADELATIYDTINS 282 >gi|166713250|ref|ZP_02244457.1| hypothetical protein Xoryp_17865 [Xanthomonas oryzae pv. oryzicola BLS256] Length = 335 Score = 79.6 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 66/205 (32%), Gaps = 46/205 (22%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + ++ G TPL+ +L Sbjct: 117 VLGGKVVDRLTAAKAVLSDFLDR-----------RDGDRVGLLVFGQRAYALTPLTADLT 165 Query: 261 EVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V+ +L T A+ + + L +K+ ++ V+ +TDG N+ Sbjct: 166 SVRDQLRDSVVGLAGRETAIGDAIALSVKRLREQKQG----------QRVVVLLTDGVNT 215 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAV----------------SAPPEGQDLLRKCT 361 L+ L+ E + G++I+++A +D LRK Sbjct: 216 AGV-----LDPLKAAELAKAEGVRIHTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIA 270 Query: 362 -DSSGQFFAVNDSRELLESFDKITD 385 + G+FF D+ EL + ++ Sbjct: 271 QQTGGRFFRARDTEELAGIYAELDR 295 >gi|58580793|ref|YP_199809.1| hypothetical protein XOO1170 [Xanthomonas oryzae pv. oryzae KACC10331] gi|58425387|gb|AAW74424.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 335 Score = 79.6 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 66/205 (32%), Gaps = 46/205 (22%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + ++ G TPL+ +L Sbjct: 117 VLGGKVVDRLTAAKAVLSDFLDR-----------RDGDRVGLLVFGQRAYALTPLTADLT 165 Query: 261 EVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V+ +L T A+ + + L +K+ ++ V+ +TDG N+ Sbjct: 166 SVRDQLRDSVVGLAGRETAIGDAIALSVKRLREQKQG----------QRVVVLLTDGVNT 215 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAV----------------SAPPEGQDLLRKCT 361 L+ L+ E + G++I+++A +D LRK Sbjct: 216 AGV-----LDPLKAAELAKAEGVRIHTIAFGGGGGYSLFGVPIPAGGNDDIDEDGLRKIA 270 Query: 362 -DSSGQFFAVNDSRELLESFDKITD 385 + G+FF D+ EL + ++ Sbjct: 271 QQTGGRFFRARDTEELAGIYAELDR 295 >gi|78049050|ref|YP_365225.1| hypothetical protein XCV3494 [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037480|emb|CAJ25225.1| putative membrane protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 451 Score = 79.6 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 67/205 (32%), Gaps = 46/205 (22%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + ++ G TPL+ +L Sbjct: 230 VLGGKVVDRLTAAKAVLSDFLDR-----------RDGDRVGLLVFGQRAYALTPLTADLT 278 Query: 261 EVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V+ +L+ T A+ + + L +K+ ++ V+ +TDG N+ Sbjct: 279 SVRDQLSDSVVGLAGRETAIGDAIALSVKRLREQKQG----------QRVVVLLTDGVNT 328 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVS----------------APPEGQDLLRKCT 361 LN L+ E + G++++++A +D LRK Sbjct: 329 AGV-----LNPLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEDGLRKIA 383 Query: 362 D-SSGQFFAVNDSRELLESFDKITD 385 + G+FF D+ EL + ++ Sbjct: 384 QQTGGRFFRARDTEELAGIYAELDR 408 >gi|325922265|ref|ZP_08184046.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] gi|325547218|gb|EGD18291.1| Mg-chelatase subunit ChlD [Xanthomonas gardneri ATCC 19865] Length = 335 Score = 79.6 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 71/194 (36%), Gaps = 35/194 (18%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN- 270 VL S + + + + + + + G TPL+ +L V+ +L Sbjct: 117 VLGGSVVDRLTAAKAVLSDFLDRREGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVV 176 Query: 271 --PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T A+ + + L +K+ ++ V+ +TDG N+ LN Sbjct: 177 GLAGRETAIGDAIALSVKRLREQKQG----------QRVVVLLTDGVNTAGV-----LNP 221 Query: 329 LQICEYMRNAGMKIYSVAV----------------SAPPEGQDLLRKCTD-SSGQFFAVN 371 L+ E + G++++++A + LRK + + G+FF Sbjct: 222 LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEAGLRKIAEQTGGRFFRAR 281 Query: 372 DSRELLESFDKITD 385 D+ EL + ++ Sbjct: 282 DTEELAGIYAELDR 295 >gi|226226933|ref|YP_002761039.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226090124|dbj|BAH38569.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 326 Score = 79.6 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 21/157 (13%), Positives = 51/157 (32%), Gaps = 34/157 (21%) Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V + ++ L + +++ S + ++ Sbjct: 139 TQVPLTTDYPVVLAAIDNLQVG-------QLEDGTAIGTAIATAANRLRNSPGRSRVMVL 191 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------- 351 +TDGEN+ ++ + G++IY++ V Sbjct: 192 LTDGENNRG-----AIDPRTAAQAAGTFGIRIYTIGVGTDGMAAVPVGRGLFGLRYENRP 246 Query: 352 --EGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + LL + + G++F D+ L +++I Sbjct: 247 VKIDEALLTEIANSTGGRYFRAKDAAALQSIYEQIDR 283 >gi|12052774|emb|CAB66559.1| hypothetical protein [Homo sapiens] Length = 957 Score = 79.6 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 66/176 (37%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L++ S L K + +TDG++ + + R++ + +++ Sbjct: 124 LDYLFD--------KSSRFLTKIAVVLTDGKSQD--------DVKDAAQAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|212635916|ref|YP_002312441.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212557400|gb|ACJ29854.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 333 Score = 79.6 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 55/170 (32%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I + +PL+ + V LN+ + T A+ + Sbjct: 130 RIGLILFADHAYLQ--SPLTQDRRTVAQYLNEAEIGLVGRQTAIGEAIALGVKRFDQV-- 185 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 + +I +TDG N+ S Q + G+ IY+V V A Sbjct: 186 --------ENSNRVLILLTDGSNNAGSIS-----PEQATDIAAKRGITIYTVGVGAEVME 232 Query: 351 --------------PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L++ + + G +F ++ EL + +I Sbjct: 233 RRTLFGKERVNPSMDLDETQLKQIAEKTGGSYFRARNTEELERIYQEIDK 282 >gi|262381906|ref|ZP_06075044.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|298375541|ref|ZP_06985498.1| BatA protein [Bacteroides sp. 3_1_19] gi|301310439|ref|ZP_07216378.1| BatA protein [Bacteroides sp. 20_3] gi|262297083|gb|EEY85013.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B] gi|298268041|gb|EFI09697.1| BatA protein [Bacteroides sp. 3_1_19] gi|300832013|gb|EFK62644.1| BatA protein [Bacteroides sp. 20_3] Length = 328 Score = 79.6 bits (194), Expect = 8e-13, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 63/200 (31%), Gaps = 40/200 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + N PL+ + + + + + T Sbjct: 109 NRLEAAKDVAASFINGRPNDNIGLVVFSAESFTQCPLTTDHTVLLNLFKDIQSGMIQDGT 168 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A + S K +I +TDG N+ + + E Sbjct: 169 AIGLGLANAVSRI----------KDSHAKSKVIILLTDGSNNAG-----EIAPVTAAEIA 213 Query: 336 RNAGMKIYSVAVSAPP--------------------EGQDLLRKCTDS-SGQFFAVNDSR 374 + G+++Y++ V + L++ + GQ+F D+ Sbjct: 214 KTFGVRVYTIGVGTKGMAPYPFQTAFGVQYQNIPVEIDEATLKQIASTTGGQYFRATDNA 273 Query: 375 ELLESFDKITDKIQEQSVRI 394 L E + +I D++++ + + Sbjct: 274 SLKEIYSEI-DQMEKTKISV 292 >gi|325860278|ref|ZP_08173400.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482157|gb|EGC85168.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 318 Score = 79.6 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 45/147 (30%), Gaps = 30/147 (20%) Query: 254 PLSNNLNEVKSRL----NKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P++ + + + L L T + +A L S Sbjct: 143 PMTLDHAALLNLLHGVRTDLVTSGLMQDGTAIGMGLANAVSRL----------KDSKAKS 192 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS------APPEGQDLLRK 359 K VI +TDG N+ S + R G+++Y++ L+ Sbjct: 193 KIVILLTDGSNNAGSIS-----PMTAAAIARKFGIRVYTIGFGKETGEEIGAIDYKTLQD 247 Query: 360 CT-DSSGQFFAVNDSRELLESFDKITD 385 ++G+F+ EL + I Sbjct: 248 IAVSTNGEFYRAQSQAELSRIYQDIDK 274 >gi|328951280|ref|YP_004368615.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451604|gb|AEB12505.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 320 Score = 79.6 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 20/158 (12%), Positives = 54/158 (34%), Gaps = 28/158 (17%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + + ++ ++ L+ T + A R L E + + Sbjct: 141 SSYATLLQPPTTDRERLEQAVDLLDLAHRTAIGDGLVAALRVLPLEDSDAPGGMS----- 195 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 V+ ++DG N+ ++ L+ G+++Y+V V Sbjct: 196 --VVLLSDGRNNYG------IDPLEAARQAEAQGVRVYTVGVGLSENTYVFANGYYIRAG 247 Query: 353 -GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ 388 ++ L++ + G ++ + + EL + + ++ Sbjct: 248 LDEETLQEIAALTGGAYYRASSADELRAVYQTLARAVR 285 >gi|261867447|ref|YP_003255369.1| TadG [Aggregatibacter actinomycetemcomitans D11S-1] gi|261412779|gb|ACX82150.1| TadG [Aggregatibacter actinomycetemcomitans D11S-1] Length = 545 Score = 79.6 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 58/488 (11%), Positives = 131/488 (26%), Gaps = 119/488 (24%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAVLSGCASIV-----------------------SDRT 49 + + +D I+ + ++ A D A L A R Sbjct: 41 VAFTVDGTGILLDKARLAQATDQAALLLIAEDNQYRKNKDHSDVKRQNVSQQEIEREGRD 100 Query: 50 IKDPTTKKD---QTSTIFKKQIKKHLKQGSYIRENAGDIAQ------KAQINITKDKNNP 100 + + + + +K +L+ + + N Sbjct: 101 FSSAKVQAQWKKRNQELVQGLVKLYLRSDDSKGQKNSSPVTIKEPFLAECLEEKTQPKNK 160 Query: 101 LQY------IAESKAQYEIPT---ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 + + Q + + L + ++ T ++ + I + Sbjct: 161 NGTAKSIACVVQGSVQRKFWLPWGQTLVSSSQLYDGRVGINSGKTYAVKEKQITIPIDLM 220 Query: 152 MVLDVSRSMEDLYLQKHNDN---------------NNMTSNKYLLPPPPKKSFWSKNTTK 196 MV D+SRSM N+ K LLP K N Sbjct: 221 MVTDLSRSMMWAINATGNNPPEVNYPNRRIDALREAVEGIEKILLPAQNKGDVSPYNRMG 280 Query: 197 -----------SKYAPAPAPANRKIDVLIESAG--------------------NLVNSIQ 225 + P K + + I Sbjct: 281 FVSFAAGTRQRDELTNCVLPYYVKSEDKKREISAKFKKGYNGGNHIAKGFELLDRDLDIP 340 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE-------------VKSRLNKLNPY 272 K I + + T + + ++ L +N+N+ V S L ++ P Sbjct: 341 KTIDQISQFDGQKRTYDFTLDSKTSRNYCLEDNVNKKTTQAWFDKNNRAVASALKQIIPR 400 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIG--STRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T + + + + T ++ ++ ++DGE++ S Sbjct: 401 GGTAVTSGIFIGTNLMMEKNKDFEAMPNKIGTNTRRILMILSDGEDNIPSKDTLVKLMEA 460 Query: 331 -ICEYMRNA--------------GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 +C ++ + + + P + Q++ +KC Q+++V+ + Sbjct: 461 GLCTRVKEKIDGLQDSNYPKVETRIAFVAFGFNPPQKQQEVWKKCV--GDQYYSVSSKQA 518 Query: 376 LLESFDKI 383 L ++F +I Sbjct: 519 LFDAFKQI 526 >gi|149632101|ref|XP_001514410.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 2392 Score = 79.6 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 46/371 (12%), Positives = 105/371 (28%), Gaps = 59/371 (15%) Query: 21 HIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRE 80 + + ++ A+D L G +++ T Sbjct: 675 NQYDTKTKISDAIDGLSLIGRGTLIGG------------ALTFVSDYF----------SV 712 Query: 81 NAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE 140 + G + + + E+ ++ G+ S + L S Sbjct: 713 SKGARPNVKKFLVLLTDGKSQDAVKEAAVALRQDGVIIYSVGVFGSEYSQLEEISGRSDM 772 Query: 141 RSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA 200 + V DV + + + S + ++Y Sbjct: 773 VFYVENFDILKPVEDVLVFGICSPYEVCKRIEVLDIVFVID--------SSGSIDSNEYN 824 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 A ID++ ++ + V+ G + Y+ ++ + Sbjct: 825 IMKAFM---IDLVKKA-------------DVGKNQVQFGALKYSDFPEVLFNLNEFSSKS 868 Query: 261 EVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+ S + +P T T A+ H+ + + + +I ITDGE+ A Sbjct: 869 EIISFIQNDHPRGGSTYTAKALAHSAHLFSESL----GSRMHRGVPQVLIVITDGESHDA 924 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 +R+ G+ + +AV + L S+ ++F V + L Sbjct: 925 HLLN------ATARALRDKGILV--LAVGIEGANHEELLSMAGSTDRYFFVENFEGLKGI 976 Query: 380 FDKITDKIQEQ 390 F+ ++ + Sbjct: 977 FENVSASVCNT 987 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 55/156 (35%), Gaps = 16/156 (10%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKES 294 VR G + Y+ +N N++ + + NT+T A+ ++ Sbjct: 471 VRFGAVQYSHLWEWEFEMDRYSNKNDLVKAVENIRQLGGNTDTGAALDKMLPLFQRARQQ 530 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + ++ +TDG + + + +R + +Y++ V Sbjct: 531 RARK-----VPQHLVVLTDGLSHDS--------VREPAGRLRGDNINVYAIGV--KEANH 575 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L + S + + V++ L + +++ I + Sbjct: 576 TQLEEIAGSDSRVYYVHNFDSLKDIKNRVVRSICSE 611 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 31/235 (13%), Positives = 80/235 (34%), Gaps = 29/235 (12%) Query: 171 NNNMTSNKYLLPPPPKKS-----FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 + ++++Y + S + + + ++ S L + Sbjct: 956 SMAGSTDRYFFVENFEGLKGIFENVSASVCNTSKVDCELGMADLVFLIDGSTSILEEDFK 1015 Query: 226 KA---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENT 275 K + + V +G ++ + EVK+++ ++ NT Sbjct: 1016 KMKDFLVTIVNDFDIRPGKVHVGLAQFSHEYRPEFSLIPFRDKIEVKNQIGRIQQIFGNT 1075 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 A+ R + + + + +++ ++ +TDG++ Q E + Sbjct: 1076 LIGAAL----RNVGSYFWPDFGSRINAGVQQVLLVLTDGQSQD--------EVAQAAEDL 1123 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 RN G+ IYS+ V L + + S+ + V++ EL + ++ + Sbjct: 1124 RNKGIDIYSLGVG--QVNDQQLIQISGSAKKKLTVDNFSELDKIKKRVVRDVCTS 1176 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 63/172 (36%), Gaps = 19/172 (11%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN- 276 G LV +I + K +RIG + Y+ N +++ +++ L+P Sbjct: 248 GFLVETIDS--FDVKENCMRIGLVMYSNETKLVSRLGTGTNKSDILQQIDGLSPKAGRAL 305 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T A++ +E+++ + S + + + IT + N + +R Sbjct: 306 TGAAINVTRKEIFS---RGAGSRKSQGVLQITVLITH--------RSSEDNVSEAALSLR 354 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDK 386 G+ +++V + + L + V ++ ++ +I K Sbjct: 355 REGVTVFAVGI--EGANETQLDQIASYPREQYVSMVKSYSDM-GAYYRIFQK 403 >gi|325927915|ref|ZP_08189139.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] gi|325541755|gb|EGD13273.1| Mg-chelatase subunit ChlD [Xanthomonas perforans 91-118] Length = 338 Score = 79.6 bits (194), Expect = 9e-13, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 67/205 (32%), Gaps = 46/205 (22%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + ++ G TPL+ +L Sbjct: 117 VLGGKVVDRLTAAKAVLSDFLDR-----------RDGDRVGLLVFGQRAYALTPLTADLT 165 Query: 261 EVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V+ +L+ T A+ + + L +K+ ++ V+ +TDG N+ Sbjct: 166 SVRDQLSDSVVGLAGRETAIGDAIALSVKRLREQKQG----------QRVVVLLTDGVNT 215 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVS----------------APPEGQDLLRKCT 361 LN L+ E + G++++++A +D LRK Sbjct: 216 AG-----ALNPLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEDGLRKIA 270 Query: 362 D-SSGQFFAVNDSRELLESFDKITD 385 + G+FF D+ EL + ++ Sbjct: 271 QQTGGRFFRARDTEELAGIYAELDR 295 >gi|297529200|ref|YP_003670475.1| von Willebrand factor A [Geobacillus sp. C56-T3] gi|297252452|gb|ADI25898.1| von Willebrand factor type A [Geobacillus sp. C56-T3] Length = 1077 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 25/278 (8%), Positives = 72/278 (25%), Gaps = 31/278 (11%) Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 I + V+DVS SM + + + Y + ++ A Sbjct: 196 PPIDVVFVMDVSGSM-TAMKLQSAKSALQAAVNYFKSNYNQNDRFALVPFSDGVREASVV 254 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT---IAYNIGIVGNQCTPLSNNLNEV 262 K + ++N+ + + L++ + V Sbjct: 255 PFGKYSNVASQLDAILNTGNSLTAGGGTNYSAALSLAKSYFTDPTRKKYIIFLTDGMPTV 314 Query: 263 KSRLNKLNP--------YENTNTY----PAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + ++ + + T ++ Y + + + + + Sbjct: 315 LNTVDTITYREVKQNFWGGYSYTGNRVTDSLSVTYELYSDGRTAGIRFTDNKGYSRRFYS 374 Query: 311 ITDGENSGASAYQNT--------------LNTLQICEYMRNAGMKIYSVAVSAPPE-GQD 355 +G + + + + + + +YS+ E D Sbjct: 375 DGQDYVNGWRVSWDNGYSFTYSSIEGKIRADATAVAKTLGMNNITLYSIGFGDNDEVDMD 434 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 LR + ++G ++ L F + + ++ Sbjct: 435 YLRSLSATAGGEARQGTTQNLTALFQQFSQLATTPAIT 472 >gi|156976371|ref|YP_001447277.1| hypothetical protein VIBHAR_05144 [Vibrio harveyi ATCC BAA-1116] gi|156527965|gb|ABU73050.1| hypothetical protein VIBHAR_05144 [Vibrio harveyi ATCC BAA-1116] Length = 334 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 66/206 (32%), Gaps = 47/206 (22%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + + V K R+G + + TPL+ + Sbjct: 115 NDNGEYIDRLTAVKRVLSDFVE---------KRQGDRLGVVLFGDHAYLQ--TPLTADRK 163 Query: 261 EVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V ++N+ + T + + + S ++ +I ++DG N+ Sbjct: 164 TVMQQINQTVIGLVGQRTAIGDGIGLGTKTFVD----------SDAPQRVMILLSDGSNT 213 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------EGQDLLRK 359 L+ L+ E + IY+V V A + Q L + Sbjct: 214 AGV-----LDPLEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVNTASELDEQTLTKI 268 Query: 360 CTDSSGQFFAVNDSRELLESFDKITD 385 + G++F D++EL +D I Sbjct: 269 AEMTGGKYFRARDAKELETIYDTINQ 294 >gi|73990553|ref|XP_853265.1| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Canis familiaris] Length = 1798 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 40/384 (10%), Positives = 108/384 (28%), Gaps = 31/384 (8%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAA---VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIK 69 IT ++D R + S D A L ++ + + + I+ Sbjct: 250 ITASMD-MRYNCTRLGLMSYSDGAKTISLLNSSTSQYEFQEQIQKLSFQAGKSHAGAAIE 308 Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 K + + QI + + ++ Q + +F + + T Sbjct: 309 KMRLEAFSESSGSRRAQGVPQIAVLVTHRPSTDEVRDAALQLRLQDVTVFAMNIQGANDT 368 Query: 130 NLS--LRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 L + S + + + + + Sbjct: 369 QLEEIVSYPPRQMVSMLKSYADLEAYSNNFQKKLQNEIWSQI-SVRAGQMDLDRTGCIDT 427 Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 +I + + + + V+ G + Y+ I Sbjct: 428 KEADIYFLIDGSTSINTEGFEQIKQFMLAVTGMF--------SIGSDKVQAGAVQYSDKI 479 Query: 248 VGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 S+N +++ + + NT+T A+ + +++ + + Sbjct: 480 RVEFYINASSNDMDLRKAILNIEQLQGNTHTGKALDFMLSIIKKDRKHRI-----SEIPC 534 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 +I +TDG++ L+ E +R+ + I++V + + L++ + Sbjct: 535 HLIVLTDGKSQD--------EVLKPAERLRDEQITIHAVGIG--EADKIQLQQIAGEEER 584 Query: 367 FFAVNDSRELLESFDKITDKIQEQ 390 + L +++ +I + Sbjct: 585 VNFGQNFDSLRNIKNEVVHRICTE 608 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 35/160 (21%), Positives = 62/160 (38%), Gaps = 15/160 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + VR+G + Y+ LS N + V L + +T T A+ HA Sbjct: 834 DVDSDRVRVGALKYSD--YPEVLFYLSGNKSAVIEHLRRRRYTSGHTYTARALEHANIMF 891 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E + + +K+ +I ITDG + +T +RN G+ IY+V V Sbjct: 892 TEE----YGSRIQQNVKQMLIIITDGVSHDRDNLSDT------ASKLRNKGINIYAVGVG 941 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 Q L + F V++ L + + + +K+ Sbjct: 942 --QANQLELETMAGNKSNTFHVDNFSNLKDIYLPLQEKMC 979 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 31/158 (19%), Positives = 59/158 (37%), Gaps = 17/158 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 S +IG + ++ NE +++++ T T A+ + K Sbjct: 652 DSTQIGVVQFSDINQEEFQLNKYFTQNETSDAIDRMSLINRGTLTGSALTFVGQYFTPTK 711 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + T++KKF+I ITDGE + +R+ G+ I+SV V Sbjct: 712 GAR------TKVKKFLILITDGEAQDPVRDP--------AKALRDKGVVIFSVGV--YGA 755 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L + + S F V + +L K+ ++ Sbjct: 756 NRTQLEEISGDSSLVFQVENFDDLKTVESKLVFRVCAL 793 >gi|118590977|ref|ZP_01548377.1| hypothetical protein SIAM614_19991 [Stappia aggregata IAM 12614] gi|118436499|gb|EAV43140.1| hypothetical protein SIAM614_19991 [Stappia aggregata IAM 12614] Length = 608 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 42/430 (9%), Positives = 105/430 (24%), Gaps = 49/430 (11%) Query: 5 IISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCA------SIVSDRTIKDPTTKKD 58 + A + ++ +AA + + VS + Sbjct: 15 ASLMTIAIAGIASQTIDWQDLGKKLTDTTEAAGRMTSSGKPDGDASVSTAELPKQEETHT 74 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + I + + + + A + + + S Q E P Sbjct: 75 VVAEIARPVATPQPAPAPALPQKQRSRSDGAGGGLMTFSSGAGGAVLNSGIQLEPPAMPA 134 Query: 119 FLK----GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK--HNDNN 172 + L S + S ++ + + + S L + Sbjct: 135 VQLEDRERFASAEANPLRRTSADPVSTFSVDVDTASYSYVRSTLSGGRLPNPDAVRVEEM 194 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI----------DVLIESAGNLVN 222 + P +S N + ++ D+ ++ L++ Sbjct: 195 VNYFDYNYPVPEKGGHPFSTNVSVVDTPWNEHTKLMQVGIQGYKVPLDDLPSQNLVFLID 254 Query: 223 SIQKAIQEKKNLSVRIG---------------TIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + K ++ + Y + + ++N Sbjct: 255 TSGSMADANKLPLLQQSFRLLLSSLRDEDEVAIVTYAGSSGVLLEPTKVADKTRILEKIN 314 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L +T + + AY T +I TDG+ + + ++L Sbjct: 315 ALTSGGSTAGHEGLKGAYALAETMTGDGEQTR--------IILATDGDFNVGLSDPDSL- 365 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + R G + + +L++ + L E+ + D++ Sbjct: 366 -KRYVAEQRENGTALSVLGFGRGNYNDELMQTLAQNGQGV--AAYIDTLSEARKVLVDQV 422 Query: 388 QEQSVRIAPN 397 IA + Sbjct: 423 VSSISMIAQD 432 >gi|255033973|ref|YP_003084594.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254946729|gb|ACT91429.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 625 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 38/328 (11%), Positives = 83/328 (25%), Gaps = 29/328 (8%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 + E+ I + +++ + + A Y L + P Sbjct: 135 TILAMPQATESYKPINENGFLSVGQQPVTTFSVDVDRAA-YSNVRRFLNNGQMPPEDAVR 193 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 + + ++ + K + Sbjct: 194 IEEMINYFDYDYPQPRGEHPVAIVAETTDSPWNPGLKLV-----HIGLQAKTVSAENLSA 248 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S K+ +L ++ L + ++ +I +AY Sbjct: 249 SNLVFL-IDVSGSMNEANKLPLLKQAFKLLADQLRVED--------KISIVAYAGSAGMV 299 Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + +K L+KL +T + AY N VI Sbjct: 300 LAPTSGSEKKTIKDALDKLEAGGSTAGGEGIELAYDLAKKHFLPKGNNR--------VIL 351 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFA 369 TDG+ + N ++ E R AG+ + + + D G + Sbjct: 352 ATDGDFNVG--ISNESELQKLIEEKRKAGIFLSVMGFGMGNYKDSHVETLADKGNGNYAY 409 Query: 370 VNDSRELLESFDKITDKIQEQSVRIAPN 397 +++ +E + F + IA + Sbjct: 410 IDNIQEARKVF---VQEFGGTLFTIAKD 434 >gi|325917650|ref|ZP_08179844.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] gi|325536114|gb|EGD07916.1| Mg-chelatase subunit ChlD [Xanthomonas vesicatoria ATCC 35937] Length = 335 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 71/194 (36%), Gaps = 35/194 (18%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN- 270 VL + + + + + + + + G TPL+ +L V+ +L Sbjct: 117 VLGGNVVDRLTAAKAVLSDFLDRRDGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVV 176 Query: 271 --PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T A+ + + L +K+ ++ V+ +TDG N+ LN Sbjct: 177 GLAGRETAIGDAIALSVKRLREQKQG----------QRVVVLLTDGVNTAGV-----LNP 221 Query: 329 LQICEYMRNAGMKIYSVAV----------------SAPPEGQDLLRKCTD-SSGQFFAVN 371 L+ E + G++++++A + LRK + + G+FF Sbjct: 222 LKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEAGLRKIAEQTGGRFFRAR 281 Query: 372 DSRELLESFDKITD 385 D+ EL + ++ Sbjct: 282 DTEELAGIYAELDR 295 >gi|194289206|ref|YP_002005113.1| lipoprotein, von willebrand factor type a domain [Cupriavidus taiwanensis LMG 19424] gi|193223041|emb|CAQ69046.1| putative lipoprotein, Von Willebrand factor type A domain [Cupriavidus taiwanensis LMG 19424] Length = 570 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 56/162 (34%), Gaps = 14/162 (8%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 RI + Y G ++ + + +++L +T + AY+ + Sbjct: 231 RITLVTYASGTRVALPPTPGSDKGAIVAAIDQLVAGGSTAGASGIALAYQAAQQSYIAGG 290 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 V+ TDG+ + + + E R +G+ + ++ + L Sbjct: 291 INR--------VLLATDGDFNVGV--TDFRQLKSMVEDKRKSGVSLSTLGFGTGNYNEQL 340 Query: 357 LRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + D+ G + +++ E + + +I IA + Sbjct: 341 MEQLADAGDGAYSYIDNLMEGNKV---LVSEISSTLATIARD 379 >gi|219520386|gb|AAI43866.1| COL21A1 protein [Homo sapiens] Length = 957 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ + + R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------DVKDAAQAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + L+ + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LKAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|332210152|ref|XP_003254169.1| PREDICTED: collagen alpha-1(XXI) chain [Nomascus leucogenys] Length = 957 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTRTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ + + R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------DVKDAAQAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|297678422|ref|XP_002817079.1| PREDICTED: collagen alpha-1(XXI) chain-like [Pongo abelii] Length = 612 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTRTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ + + R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------DVKDAAQAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|91201136|emb|CAJ74195.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 331 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 27/213 (12%), Positives = 66/213 (30%), Gaps = 52/213 (24%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ V+ + + +N + ++ PL+ + + L Sbjct: 111 NRLYVVKQVVKDFINKRSTDPIG---------LVVFSANAYTQ--CPLTLDYGILLQFLE 159 Query: 268 KLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 K + T A+ + L + K ++ +TDG N+ Sbjct: 160 KTEIGLLEDGTAIGSAIASSVDRL----------RNTKAQSKVIVLLTDGRNNSGQ---- 205 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSA----------------------PPEGQDLLRKCTD 362 ++ L E + +KIY++ + + + L Sbjct: 206 -IDPLTAAELAQAFNIKIYTIGAGSKGLVPYPARDLFGNRVMRQVKIDIDDESLAEIANI 264 Query: 363 SSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + G+++ D+ L E + +I D +++ I Sbjct: 265 TGGRYYRATDTGSLKEIYQQI-DALEKTETEIT 296 >gi|301626452|ref|XP_002942405.1| PREDICTED: collagen alpha-6(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 2615 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 63/190 (33%), Gaps = 22/190 (11%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + +V+S VRIG Y+ + ++ Sbjct: 1411 SASITSSNFTSAKTFMKEIVDSFT-----ISENRVRIGVAQYSANPKKEFFLNEYYSSSD 1465 Query: 262 VKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +K +++ + T T + + + +++I +TDG ++ + Sbjct: 1466 MKKQIDSISQLKATTYTGKGLRFVKQFFDPANGGR------KNVPQYLIVMTDGMSNDSV 1519 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 +R++G+KI+S+ + + ++ S + V + L Sbjct: 1520 NED--------AAALRSSGVKIFSIGIGLRNSFELVM--IAGSPKNVYEVETFQALDSIK 1569 Query: 381 DKITDKIQEQ 390 +I ++ E Sbjct: 1570 RQIVAQVCEP 1579 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 61/165 (36%), Gaps = 16/165 (9%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAY 285 ++ + VR G + Y+ N +K ++++ T T A+ Sbjct: 867 SMFQVGANRVRFGVVQYSDVRRTEFFISEHNTQKMLKDAISQIEQLGGGTLTGEALTSMK 926 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + N + + + + ++ ITDGE+ + +RN G+ I+++ Sbjct: 927 QLFVNAAKDRPHK-----VPQSLVVITDGESQD--------RVTEAAAEIRNDGITIFAI 973 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 V ++ +R S+ + F VN+ L + + ++ Sbjct: 974 GV--KNAVEEEIRDIAGSNEKMFFVNNFDSLKVIKNDLARELCTP 1016 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 36/333 (10%), Positives = 107/333 (32%), Gaps = 30/333 (9%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 L Q + QI + I E + L+ G+ + L+ L Sbjct: 120 LTQHFSEAAGSRAAEGVPQIAVVITDGQAQDSIREPAIAVKNAGIILYAIGIKDAVLSEL 179 Query: 132 SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS 191 + ++ ++ ++A + +S++M + + T + ++ Sbjct: 180 NEIASDPDDKHVYSVA-DFNALQSISQNMIQVLCTTVEEAARQTGQIAQVCRTANQADIV 238 Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 + ++ L + + + + +RIG + YN Sbjct: 239 LLVESTT--------RMGDATFEKAKNFLYDLV--SNLDVGINKIRIGLVTYNDETNPEF 288 Query: 252 CTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 ++ E+ + + T T A+ + + + S + + +I Sbjct: 289 LLNSYSSKTEILESIQNMKYVEGYTYTGRALEYVNTTYFTQAAGSRFE---ESVAQILII 345 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFF 368 +T+G++S + +++ G+ +Y V + L++ + F+ Sbjct: 346 VTEGDSSDTLTEP--------AKELKSRGISVYVVG--TNIKYDRQLQEASSKPDEKFFY 395 Query: 369 AVNDSRELLESFDKITDKIQ---EQSVRIAPNR 398 ++D + +++ + + ++++ R Sbjct: 396 QLDDFDDSENVTEQLLKNLCFSIDLNIQVYSKR 428 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 56/168 (33%), Gaps = 13/168 (7%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELY 289 +V+ G + Y + ++ + L T T A+ + Sbjct: 1244 VGKDNVQFGAVVYGTNPAEQFSLNTYSTKLDILKAVFSLPQVSGYTYTAKALEYTRIRFG 1303 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + +I +TDG + + N + + +++ G+ +++V V Sbjct: 1304 TSYGGRPG------ISHILILVTDGATTE----ADRPNLPIVSKALKDDGIIVFAVGVGK 1353 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L++ ++F V + + L D IT + ++S + Sbjct: 1354 AV--PQELQQIAGYPDRWFLVQNYKGLDNIHDNITQVVCDESKPACSH 1399 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 54/160 (33%), Gaps = 21/160 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEK 292 V+IG I ++ +E++S + + + T A+ + K Sbjct: 1060 DRVQIGLIQFSSETKEEFPLNRYKRKDEIQSAIRGIQQLSQGTLMGEALKYTLPYFSASK 1119 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 NT +++I ITDGE A + +R+ G+ IY++ V Sbjct: 1120 GGRVNTK------QYLIVITDGEAQDAVGNP--------AKAIRDHGVIIYAIGVQQ-AN 1164 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESF--DKITDKIQEQ 390 LL Q + + L +F I +I Sbjct: 1165 NTQLLE-IAGKQEQVYYEDSFDSL--AFLNKNIMFEICNP 1201 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 58/169 (34%), Gaps = 16/169 (9%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAY 285 + + +VR+G + Y+ N E+ + KL +T A+ Sbjct: 660 SALDISLSNVRVGLVLYSDEPRLELKLNTFNEKYEILDFITKLPYRGGKAHTGAALDFLR 719 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 ++++ ++ +++ + +T+G+ + N + +R +G+++++V Sbjct: 720 KKMFTKQNGG---RPHQGVQQIAVVMTNGQ--------SMDNFTKPAAKLRRSGVEVFAV 768 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSV 392 L V +L +I ++ + V Sbjct: 769 GFQ--NINDTELDIIASHPPRKHVTNVESFLQLSNLEFRIQKRLCNEIV 815 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 20/156 (12%), Positives = 48/156 (30%), Gaps = 16/156 (10%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYEN-TNTYPAMHHAYRELYNEKES 294 RIG Y+ E+ + + T A+ + E+ Sbjct: 469 RIGLAQYSGLPQTEFLLNHYETKEEILKHIKETFTYRGGPLKTGHALEFVRSTFFIEEAG 528 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 S G+ +F++ IT + ++ + E +++ G+ +++V + Sbjct: 529 SRINYGN---PQFLVVIT--------SSKSEDAVRRHAEELKSVGVT--TISVGIGNSDR 575 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L K F + + + + I + Sbjct: 576 KELEKIATDP-FVFQTTGLQHISNLQQDVANVIIAE 610 >gi|330959358|gb|EGH59618.1| von Willebrand factor, type A [Pseudomonas syringae pv. maculicola str. ES4326] Length = 353 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 28/150 (18%), Positives = 55/150 (36%), Gaps = 35/150 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ A + L + ++ Sbjct: 150 PLTYDRRTVRVWLDEAKIGIAGKNTAIGDAIGLALKRL----------RMRPANSRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+G ++ + + G+KIY+V + + P+ + Sbjct: 200 VTDGANNGGQ-----IDPITAARLAADEGVKIYTVGIGSDPDKNALQGVLGLNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 L+ S GQ+F D EL + + Sbjct: 255 PTLKDIASLSGGQYFRARDGAELDKIRTAL 284 >gi|85708696|ref|ZP_01039762.1| hypothetical protein NAP1_05635 [Erythrobacter sp. NAP1] gi|85690230|gb|EAQ30233.1| hypothetical protein NAP1_05635 [Erythrobacter sp. NAP1] Length = 640 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 35/266 (13%), Positives = 77/266 (28%), Gaps = 42/266 (15%) Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + + + N + ++ + P T + + Sbjct: 388 NGSNTNVTWDGCIEEANTVATDTFDPFPQDAHDLKINLTPSNVNEYWKPVLRNATWKRED 447 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPYEN 274 S+GN++ I + E + L++ + ++++ ++ L P N Sbjct: 448 SSGNVLGHITQTGNENRPGY-----------SCPAAAFKLTDISRTDLETYVDGLTPRSN 496 Query: 275 TNTYPAMHHAYRELYNE---KESSHNTIGSTRLKKFVIFITDGE---------------- 315 T M R + S+ + + ++F+TDG Sbjct: 497 TYHDFGMIWGARFISPNGIFAASNATAPNGDAISRHIVFMTDGLLVPNQEIYSMYGIEWW 556 Query: 316 ----NSGASAYQNTLNTLQ----ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 + S Q C R + ++ +A L C + G+ Sbjct: 557 DRRITNDGSGGQARDRHATRFQVACRAARQENISVWVIAFGTTLTQN--LIDCA-TPGRA 613 Query: 368 FAVNDSRELLESFDKITDKIQEQSVR 393 F ND+ L F++I +I + Sbjct: 614 FQANDTAALETRFEQIAQEIAALRLT 639 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 16/166 (9%), Positives = 42/166 (25%), Gaps = 25/166 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 ++A + + +D + ++Q + + + D Sbjct: 12 ISAASLVPLMAMVGGGVDASRYYMAETRLQ------------AACDAGALAARRSMADDN 59 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + +Y G + + Q E+ + Sbjct: 60 FSRADRITGEKFFDENYPDGTFG----LEDLERSFTATQSGQVNGEASGTL-----PTAI 110 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 +LS+ + S+ + V+DV+ SM Sbjct: 111 MAPFGYDEFSLSVTCEADVNISNT----DVLFVVDVTGSMNCAPDN 152 >gi|327403932|ref|YP_004344770.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327319440|gb|AEA43932.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 341 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 30/227 (13%), Positives = 70/227 (30%), Gaps = 51/227 (22%) Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 + S A +++V A V+ R+G + Sbjct: 97 DYKNGIDIILSIDASGSMLAQDFDPNRLEVAKRVAKKFVD---------SRKGDRVGLVV 147 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE---NTNTYPAMHHAYRELYNEKESSHNTI 299 Y P + + +K +++ + P T + A L Sbjct: 148 YEGEAYTA--CPATLDYKLLKEQISAIEPGHLEPGTAIGSGLGVAVTRL----------R 195 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------- 351 + + K +I +TDG ++ L++ E + ++Y++ V A Sbjct: 196 SDSLISKVIILLTDGSSNTG------PEPLEVAELAKAKKCRVYTIGVGADGMAPTPVNT 249 Query: 352 ------------EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + +L++ ++G++F D + L + + +I Sbjct: 250 PFGVVYQNLPVEIDEGVLKEIASATNGKYFRAQDEKSLEKIYAEIDK 296 >gi|196250158|ref|ZP_03148852.1| von Willebrand factor type A [Geobacillus sp. G11MC16] gi|196210342|gb|EDY05107.1| von Willebrand factor type A [Geobacillus sp. G11MC16] Length = 668 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 30/270 (11%), Positives = 78/270 (28%), Gaps = 27/270 (10%) Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY-A 200 + I + V+DVS SM + + + Y ++ A Sbjct: 191 APVRPPIDVVFVMDVSGSM-TTMKLQSAKSALQAAVNYFKTNYHPNDRFALIPFSDDVKA 249 Query: 201 PAPAPANRKIDVLI--ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + P K +V+ ++ + N + + ++ + +N L++ Sbjct: 250 TSVVPFGSKSNVISQLDAILDEGNRLTANGGTNYSAALSLAQSYFNDPERKKYIIFLTDG 309 Query: 259 LNEVKSRLNKLNP--------YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + V + + + + ++ Y + K +S + Sbjct: 310 MPTVLNTTSSITHKEIKKGFKDDGEKITASLPLIYGLYSDGKMTSIIFTDKDGYDRLFYN 369 Query: 311 ITDGENSGASAYQNTL--------------NTLQICEYMRNAGMKIYSVAVSAPPE-GQD 355 +G + + + + + + + +YS+ E D Sbjct: 370 NHIDYVNGWLFSNDNGYSFTYAWGEGKAYGDAVNVAKTLVMNNITLYSIGFGNNHEVDMD 429 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITD 385 LR + ++G + L E F + + Sbjct: 430 YLRALSTTAGGEVRQGTPQNLTELFQRFSQ 459 >gi|219804750|ref|NP_001137338.1| cartilage matrix protein [Bos taurus] gi|296490178|gb|DAA32291.1| matrilin 1, cartilage matrix protein [Bos taurus] Length = 497 Score = 79.2 bits (193), Expect = 1e-12, Method: Composition-based stats. Identities = 39/378 (10%), Positives = 112/378 (29%), Gaps = 52/378 (13%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKN 98 + S + + ST + + + G + I+ I Sbjct: 97 RAHSSKAELLQAVRRIQPLSTGTMTGLAIQFAITKALSDAEGGRPRSPDISKVVIVVTDG 156 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 P + + A+ LF G+ L ++ + + S ++ +S+ Sbjct: 157 RPQDSVRDVSARARAGGIELFAIGVGRVDKATLQQIASEPQD-EHVDYVESYSVIEKLSK 215 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF-------WSKNTTKSKYAPAPAPANR--- 208 ++ + + + + S+ ++ N+ Sbjct: 216 KFQEAFCLVSDLCATGDHDCEQVCVSSPGSYTCACREGFTLNSDGKTCNVCNGGGGSSAT 275 Query: 209 ---------------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 +++ + +V+++ + + ++G + Y+ + Sbjct: 276 DLVFLIDGSKSVRPENFELVKKFINQIVDTL-----DVSDKLAQVGLVQYSSSVRQEFPL 330 Query: 254 PLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + ++K+ + ++ + T T A+ + + + +K I T Sbjct: 331 GRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFT 385 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAV 370 DG + + ++ G K+++V V +D LR+ +F Sbjct: 386 DGRSQDYIND--------AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYT 435 Query: 371 NDSRELLESFDKITDKIQ 388 D + + + K+ +I Sbjct: 436 ADFKTINQIGKKLQKRIC 453 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + ++ E+ + ++ P T T A+ A + Sbjct: 73 DVGPNATRVGLVNYASSVKQEFPLRAHSSKAELLQAVRRIQPLSTGTMTGLAIQFAITKA 132 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E S + K VI +TDG + + R G++++++ V Sbjct: 133 LSDAEGGRPR--SPDISKVVIVVTDGRPQDS--------VRDVSARARAGGIELFAIGVG 182 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + L++ V +L + F + Sbjct: 183 R--VDKATLQQIASEPQDEHVDYVESYSVIEKLSKKFQE 219 >gi|254559618|ref|YP_003066713.1| hypothetical protein METDI1076 [Methylobacterium extorquens DM4] gi|254266896|emb|CAX22695.1| conserved hypothetical protein [Methylobacterium extorquens DM4] Length = 473 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 62/459 (13%), Positives = 127/459 (27%), Gaps = 90/459 (19%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + + +D + + ++ +A DAA L+G + + + Sbjct: 25 LFALAVLPTIGLVGLGVDYGMAISSKTRLDNAADAAALAGVVT-AKEFIAANAQQSDVTA 83 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S I + + + + +Q+ I + ++ Y ++ Sbjct: 84 SGIKAGESQALKAFNANASKVPFATVSLSQLEIVRS-----GQTLDATVSYTATVQST-F 137 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + T L+ R ++ + + +++DVS SM D++ Sbjct: 138 GRTFGLSATTLTNRVNASVDLA---SYLDFYLMVDVSGSMG----LPTKDSDAEALAMQS 190 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 F + A + D + + L+ RIG Sbjct: 191 KEKQGNCQFACHFPDSVGWTKAAGKIQLRSDAVNNAVCELLKRASTP---VVPNQYRIGI 247 Query: 241 IAYNIG-IVGNQCTPLSNNLNEVKSRLN----------KLNPYENT-------------N 276 + T + +L +++ L +T + Sbjct: 248 YPFINQLATLAPLTDTTTSLAALRTAAQCDKIWPLAFTNLLDTGSTQLFTNNDPKTGTGS 307 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS------------------- 317 A ++ + + N ST + FV ITDG + Sbjct: 308 GGTHFEAALPKMKSTIKPYGNGSASTNSRPFVFLITDGMQNSQSYSAWKDTKTFSGNPSK 367 Query: 318 ---GASAYQNTLNTLQI----CEYMRNAGMKIYSVAVSAP-------------------- 350 +A N QI C ++NAG I + + Sbjct: 368 FAGYPNADWNGSQPAQIDPSKCTDLKNAGATISVLYIPYNIVKNYNNDSYIVWENGRVNQ 427 Query: 351 --PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 P D LRKC G F+ N ++ S + + Sbjct: 428 FSPTLADPLRKCAS-PGFFYTANTQDDITASLGAMFKQA 465 >gi|189066649|dbj|BAG36196.1| unnamed protein product [Homo sapiens] Length = 957 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ + + R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------DVKDAAQAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|119624855|gb|EAX04450.1| collagen, type XXI, alpha 1, isoform CRA_c [Homo sapiens] Length = 552 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ + + R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------DVKDAAQAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|119624854|gb|EAX04449.1| collagen, type XXI, alpha 1, isoform CRA_b [Homo sapiens] Length = 567 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ + + R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------DVKDAAQAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|119624853|gb|EAX04448.1| collagen, type XXI, alpha 1, isoform CRA_a [Homo sapiens] Length = 429 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ + + R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------DVKDAAQAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|114607957|ref|XP_001157649.1| PREDICTED: collagen alpha-1(XXI) chain isoform 3 [Pan troglodytes] Length = 957 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ + + R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------DVKDAAQAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|114607959|ref|XP_001157475.1| PREDICTED: collagen, type XXI, alpha 1 isoform 1 [Pan troglodytes] gi|114607961|ref|XP_518554.2| PREDICTED: collagen, type XXI, alpha 1 isoform 4 [Pan troglodytes] gi|114607963|ref|XP_001157591.1| PREDICTED: collagen, type XXI, alpha 1 isoform 2 [Pan troglodytes] Length = 954 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ + + R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------DVKDAAQAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|18780273|ref|NP_110447.2| collagen alpha-1(XXI) chain precursor [Homo sapiens] gi|74752071|sp|Q96P44|COLA1_HUMAN RecName: Full=Collagen alpha-1(XXI) chain; Flags: Precursor gi|15593270|gb|AAL02227.1|AF414088_1 collagen XXI [Homo sapiens] gi|19310967|gb|AAL86699.1|AF438327_1 alpha 1 type XXI collagen precursor [Homo sapiens] gi|55665071|emb|CAH73913.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202573|emb|CAI22496.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202937|emb|CAI22395.1| collagen type XXI alpha 1 [Homo sapiens] gi|116496597|gb|AAI26109.1| Collagen, type XXI, alpha 1 [Homo sapiens] gi|215434893|gb|ACJ66843.1| alpha 1 type XXI collagen precursor [Homo sapiens] Length = 957 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ + + R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------DVKDAAQAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|17974510|gb|AAL50033.1|AF330693_1 alpha 1 chain-like collagen COLA1L precursor [Homo sapiens] gi|55665070|emb|CAH73912.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202574|emb|CAI22497.1| collagen type XXI alpha 1 [Homo sapiens] gi|56202938|emb|CAI22396.1| collagen type XXI alpha 1 [Homo sapiens] Length = 954 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLTAAVESILYLGGNTKTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ + + R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------DVKDAAQAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|301606205|ref|XP_002932732.1| PREDICTED: collagen alpha-1(XXI) chain-like [Xenopus (Silurana) tropicalis] Length = 1058 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 35/203 (17%), Positives = 71/203 (34%), Gaps = 24/203 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + D N+ +S + ++G + Y+ + + Sbjct: 65 VDGSWSVGYKDFDTAKNWLLNITSSF-----DIGPSYTQVGVVQYSDFPQLEIPLGHNTS 119 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 ++ S L + NTNT A+ A E++ + + + K I ITDG++ Sbjct: 120 YQQLLSALKSIKYLGGNTNTGRAIKFATEEVFPTSKRLNVSKN-----KIAIVITDGKSQ 174 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF--AVNDSRE 375 N + I R G+ +++V V + + L + + D Sbjct: 175 D--------NVVNISSSARAQGIILFAVGVGSEIT-KSELVAIANMPSTHYVLYAEDYTT 225 Query: 376 LLESFDKITDKIQEQSVRIAPNR 398 + + + KI E+SV P R Sbjct: 226 IDRIKETMRQKICEESV--CPTR 246 >gi|152993961|ref|YP_001359682.1| von Willebrand factor type A domain-containing protein [Sulfurovum sp. NBC37-1] gi|151425822|dbj|BAF73325.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 325 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 20/142 (14%), Positives = 54/142 (38%), Gaps = 25/142 (17%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P + +L+ ++ L+ L T ++ A + S + +I Sbjct: 154 PFTQDLDALEHLLDSLRVGMAGPQTAMGDSIGLAVKMF----------RESNVTDRMLIV 203 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-S 363 ++DG+++G + + E G+ ++++ + P D L++ + Sbjct: 204 MSDGDDTG-----SKVPPKTSAELAAKNGVNVFTIGIGDPKNAGEHPIDTDTLKEIAAIT 258 Query: 364 SGQFFAVNDSRELLESFDKITD 385 G+F+ + +L + + +I Sbjct: 259 GGKFYYAWNLDDLQDIYKQIDK 280 >gi|323499301|ref|ZP_08104278.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] gi|323315689|gb|EGA68723.1| hypothetical protein VISI1226_03745 [Vibrio sinaloensis DSM 21326] Length = 322 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 29/215 (13%), Positives = 72/215 (33%), Gaps = 47/215 (21%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 + + + ++ + + + K R+G + + Sbjct: 94 SYSMSQEDMSDGSDYVDRLTAVKKVVSDFA---------IKREGDRLGVVLFADHAYLQ- 143 Query: 252 CTPLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 TPL+ + V ++N+L + T + A + + S ++ + Sbjct: 144 -TPLTLDRTTVADQVNQLVLRLIGDKTAIGEGIGLATKTFID----------SDAPQRVM 192 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP----------------- 351 I ++DG N+ ++ ++ + + IY++ V A Sbjct: 193 ILLSDGSNTSGV-----IDPIEAAKIAKKYDATIYTIGVGAGEMMVKEFFMTRKVNTAQD 247 Query: 352 -EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + + L++ + GQ+F D++EL +D I Sbjct: 248 LDEKALMQIAQITGGQYFRARDAKELATIYDTINS 282 >gi|15827966|ref|NP_302229.1| hypothetical protein ML1808 [Mycobacterium leprae TN] gi|221230443|ref|YP_002503859.1| hypothetical protein MLBr_01808 [Mycobacterium leprae Br4923] gi|81536900|sp|Q9CBL9|Y1808_MYCLE RecName: Full=UPF0353 protein ML1808 gi|254800638|sp|B8ZS82|Y1808_MYCLB RecName: Full=UPF0353 protein MLBr01808 gi|13093519|emb|CAC30761.1| possible membrane protein [Mycobacterium leprae] gi|219933550|emb|CAR71903.1| possible membrane protein [Mycobacterium leprae Br4923] Length = 335 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 53/152 (34%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N K+ L+KL + T T A+ A + + + G ++ +DG+ Sbjct: 155 TTNRYATKNALDKLQFADRTATGEAIFTALQAIATV--GAVIGGGEMPPPARIVLFSDGK 212 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA--------------PPEGQDLLRKCT 361 + + N ++ G+ I +++ P + + + + Sbjct: 213 ETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTVYGFVEINGQRQPVPVDDETMKKVAQ 272 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G + EL + + +I ++++ Sbjct: 273 LSGGNSYNAATLAELKAVYASLQQQIGYETIK 304 >gi|282854077|ref|ZP_06263414.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|282583530|gb|EFB88910.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] Length = 318 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 21/222 (9%), Positives = 58/222 (26%), Gaps = 32/222 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A ++ +A + + + R Sbjct: 85 DRATVVVAIDVSRSMVATDVEPSRLSAAKTAAKDFLGDLP----------PRFNVSLVKF 134 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + + V + + L +T ++ + L + + Sbjct: 135 AASAQVVVPPTTDRAAVSTAITNLQVLPSTAIGEGIYSSLNALKLVPDD--PKHPGQKPP 192 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DG + L+ + + +Y++A Sbjct: 193 AAIVLLSDGATNVGRPS------LEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPV 246 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L S G+ F+ +L + + I + + V Sbjct: 247 NHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEKVF 288 >gi|153831781|ref|ZP_01984448.1| von Willebrand factor, type A [Vibrio harveyi HY01] gi|148872291|gb|EDL71108.1| von Willebrand factor, type A [Vibrio harveyi HY01] Length = 334 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 66/206 (32%), Gaps = 47/206 (22%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + + V K R+G + + TPL+ + Sbjct: 115 NDNGEYIDRLTAVKRVLSDFVE---------KRQGDRLGVVLFGDHAYLQ--TPLTADRK 163 Query: 261 EVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V ++N+ + T + + + S ++ +I ++DG N+ Sbjct: 164 TVMQQINQTVIGLVGQRTAIGDGIGLGTKTFVD----------SDAPQRVMILLSDGSNT 213 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------EGQDLLRK 359 L+ L+ E + IY+V V A + Q L + Sbjct: 214 AGV-----LDPLEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVNTASDLDEQTLTKI 268 Query: 360 CTDSSGQFFAVNDSRELLESFDKITD 385 + G++F D++EL +D I Sbjct: 269 AEMTGGKYFRARDAKELETIYDTINQ 294 >gi|15600942|ref|NP_232572.1| hypothetical protein VCA0172 [Vibrio cholerae O1 biovar eltor str. N16961] gi|229510539|ref|ZP_04400019.1| protein BatA [Vibrio cholerae B33] gi|229517329|ref|ZP_04406774.1| protein BatA [Vibrio cholerae RC9] gi|229605140|ref|YP_002875844.1| protein BatA [Vibrio cholerae MJ-1236] gi|254286663|ref|ZP_04961618.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|254850438|ref|ZP_05239788.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255746016|ref|ZP_05419963.1| protein BatA [Vibrio cholera CIRS 101] gi|262162145|ref|ZP_06031160.1| protein BatA [Vibrio cholerae INDRE 91/1] gi|9657562|gb|AAF96085.1| conserved hypothetical protein [Vibrio cholerae O1 biovar El Tor str. N16961] gi|150423247|gb|EDN15193.1| conserved hypothetical protein [Vibrio cholerae AM-19226] gi|229345365|gb|EEO10338.1| protein BatA [Vibrio cholerae RC9] gi|229352984|gb|EEO17924.1| protein BatA [Vibrio cholerae B33] gi|229371626|gb|ACQ62048.1| protein BatA [Vibrio cholerae MJ-1236] gi|254846143|gb|EET24557.1| conserved hypothetical protein [Vibrio cholerae MO10] gi|255735770|gb|EET91168.1| protein BatA [Vibrio cholera CIRS 101] gi|262028220|gb|EEY46878.1| protein BatA [Vibrio cholerae INDRE 91/1] Length = 318 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 58/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + V ++LN+ L T + A + + Sbjct: 129 RIGLILFADHAYLQ--TPLTLDRQTVANQLNQTVLKLIGTQTAIGEGIGLATKTFID--- 183 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 S ++ +I ++DG N+ L+ L+ + IY+V V A Sbjct: 184 -------SDAPQRVMILLSDGSNTAGV-----LDPLEAANIAKQYNTTIYTVGVGAGEMV 231 Query: 351 --------------PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + G +F + ++L +D I Sbjct: 232 VKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|126310280|ref|XP_001371684.1| PREDICTED: similar to collagen XXI [Monodelphis domestica] Length = 957 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 62/176 (35%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + NT T A+ A Sbjct: 64 SNNFDIGPKFIQVGVVQYSDYPVLEIPLGSHHSGENLMEAMESIQYLGGNTRTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ E R+ + +++ Sbjct: 124 LDHLF--------AKSSRFLTKIAVVLTDGKSQD--------EVKDAAEAARDNRITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + I K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVIKQKLCEESV--CPTR 220 >gi|163848731|ref|YP_001636775.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|163670020|gb|ABY36386.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] Length = 845 Score = 78.8 bits (192), Expect = 1e-12, Method: Composition-based stats. Identities = 42/331 (12%), Positives = 95/331 (28%), Gaps = 26/331 (7%) Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 D+ T + + + D + E + E+P Sbjct: 260 DRNDTFTANNVAGATTLVMPPPNILLIEDSGDEGAVLADALRRADMVIERSSASELPANL 319 Query: 118 LFLKGLIPS-----ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 L T LSL + + + ++ Y + + Sbjct: 320 DLLTRFDGFVLVDVPATQLSLEQMVALREVVRSEGKGLTVIGGNQSFTLGGYAETPLADA 379 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 P +S A K D+ E+A + ++Q + Sbjct: 380 LPLLMTPPPRPQRAPVSILFIIDRSASMSATFGI-SKFDMAKEAAILSLTTLQPGDRVGV 438 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 VG + L E++ ++ ++ TN A+ L Sbjct: 439 LAFDTETIWTVPFRTVGEGVS-----LVELQDQIATMSLGGGTNIERALSVGLPAL---- 489 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + +TDG + + + Q+ E R A + + ++A+ + + Sbjct: 490 ------ANEPYSTRHAVLLTDGRSYSNNYPRY----QQLVETARAAQITLSTIAIGSDSD 539 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLES-FDK 382 + L + + +G+++ V D+ +L F + Sbjct: 540 TELLNQLASWGNGRYYFVADATDLPRITFQE 570 >gi|138896202|ref|YP_001126655.1| hypothetical protein GTNG_2565 [Geobacillus thermodenitrificans NG80-2] gi|134267715|gb|ABO67910.1| Conserved hypothetical protein [Geobacillus thermodenitrificans NG80-2] Length = 668 Score = 78.8 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 30/270 (11%), Positives = 79/270 (29%), Gaps = 27/270 (10%) Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY-A 200 + I + V+DVS SM + + + Y ++ A Sbjct: 191 APVRPPIDVVFVMDVSGSM-TTMKLQSAKSALQAAVNYFKTNYHPNDRFALIPFSDDVKA 249 Query: 201 PAPAPANRKIDVLI--ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + P K +V+ ++ + N + + ++ + +N L++ Sbjct: 250 TSVVPFGSKSNVISQLDAILDEGNRLTANGGTNYSAALSLAQSYFNDPERKKYIIFLTDG 309 Query: 259 LNEVKSRLNKLNP--------YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + V + + + + ++ Y + K +S + + Sbjct: 310 MPTVLNTTSSITHKEIKKGFKDDGEKITASLPLIYGLYSDGKMTSISFTDKDGYDRLFYN 369 Query: 311 ITDGENSGASAYQNTL--------------NTLQICEYMRNAGMKIYSVAVSAPPE-GQD 355 +G + + + + + + + +YS+ E D Sbjct: 370 NHIDYVNGWLFSNDNGYSFTYAWGEGKAYGDAVNVAKTLVMNNITLYSIGFGNNHEVDMD 429 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITD 385 LR + ++G + L E F + + Sbjct: 430 YLRALSTTAGGEVRQGTPQNLTELFQRFSQ 459 >gi|296207278|ref|XP_002750612.1| PREDICTED: cartilage matrix protein [Callithrix jacchus] Length = 496 Score = 78.8 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 43/378 (11%), Positives = 112/378 (29%), Gaps = 52/378 (13%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKN 98 + VS + + ST + + G ++ I+ I Sbjct: 96 RAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFSDAEGGRSRSPDISKVVIVVTDG 155 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 P + + A+ LF G+ L ++ + S ++ +SR Sbjct: 156 RPQDSVRDVSARARASGVELFAIGVGRVDKATLQQIASEPQ-EEHVDYVESYNVIEKLSR 214 Query: 159 SMEDLY----LQKHNDNNNMTSNKYLLPPPPKKSF---WSKNTTKSKYAPAPAPANR--- 208 ++ + +++ P + ++ NT Sbjct: 215 KFQEAFCVVSDLCATGDHDCEQVCISSPGSYTCACREGFTLNTDGKTCNVCSGGGGSSAT 274 Query: 209 ---------------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 +++ + +V+++ + + ++G + Y+ + Sbjct: 275 DLVFLIDGSKSVRPENFELVKKFINQIVDTL-----DVSDKLAQVGLVQYSSSVRQEFPL 329 Query: 254 PLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + ++K+ + ++ + T T A+ + + + +K I T Sbjct: 330 GRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFT 384 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAV 370 DG + + ++ G K+++V V +D LR+ +F Sbjct: 385 DGRSQDYIND--------AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYT 434 Query: 371 NDSRELLESFDKITDKIQ 388 D + + + K+ KI Sbjct: 435 ADFKTINQIGKKLQKKIC 452 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 72 DVGPNATRVGVVNYASTVKQEFPLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKA 131 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 +++ E + S + K VI +TDG + + R +G++++++ V Sbjct: 132 FSDAEGGRSR--SPDISKVVIVVTDGRPQDS--------VRDVSARARASGVELFAIGVG 181 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + L++ V +L F + Sbjct: 182 R--VDKATLQQIASEPQEEHVDYVESYNVIEKLSRKFQE 218 >gi|20089145|ref|NP_615220.1| hypothetical protein MA0247 [Methanosarcina acetivorans C2A] gi|19914014|gb|AAM03700.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length = 589 Score = 78.8 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 34/277 (12%), Positives = 85/277 (30%), Gaps = 36/277 (12%) Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 ++F ++ L+ + + +++++ S + + S Sbjct: 14 SIFFASVLGVVAIVLTGAVSAQAIAEPAVSKTASPALINIAGSGVNEETTVTIEVTGAGS 73 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 P S + P+ + + +S A Sbjct: 74 TSTSAVPMD----VVFAIDSSGSMQSNDPSGLRKTAAKSFVDKMDSSRDTAGVV------ 123 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + + PL+N+ VK+ ++ ++ +TN + A L + Sbjct: 124 -------SWDDSIDFSLPLTNDFPLVKTNIDSVDSSGSTNLNVGLEEAIDILDANPRT-- 174 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + +IF+TDG+ + + + + G IYS+ + Sbjct: 175 -----ENSVEVIIFLTDGQGTYLHST---------AQEAADKGYVIYSIGLG--GVNPTP 218 Query: 357 LRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 L+ + G +++ D+ L FD I ++ ++ Sbjct: 219 LQDMATTTGGAYYSSPDATSLQAIFDDIFSEVTTSTI 255 >gi|118443684|ref|YP_877685.1| hypothetical protein NT01CX_1604 [Clostridium novyi NT] gi|118134140|gb|ABK61184.1| hypothetical protein NT01CX_1604 [Clostridium novyi NT] Length = 1252 Score = 78.8 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 33/231 (14%), Positives = 83/231 (35%), Gaps = 44/231 (19%) Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 N + + + L + + + ++K +K + IG ++++ + + Sbjct: 158 NYYIQGNKCYRQSSYNEKNRLKHAQESAIKFVKKFENDKN---ISIGLVSFDTRAIEQKE 214 Query: 253 TPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 L+++L+EVKS +N L TN + A + L G+ K+VI Sbjct: 215 --LTSSLSEVKSSINNLKVAYNGATNIEAGLKSAQKIL---------KKGNEDADKYVIL 263 Query: 311 ITDGENSGA----------------------SAYQNTLNTLQI------CEYMRNAGMKI 342 ++DG + + N + + ++ G+ Sbjct: 264 MSDGFPTAFDYAGEKFEENFNEHEVQDNTFINFGYNDYRGYAMKHSINQADSLKKVGINS 323 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + S + L + + G++ ++ L +++KI K++ ++ Sbjct: 324 FIIGFSDGANSEKLNKIAKAAGGEYEEARNTDALNGAYNKIETKVKAPLIK 374 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 34/265 (12%), Positives = 73/265 (27%), Gaps = 57/265 (21%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 L + +K +ID + + A + V+ + + + VR Sbjct: 695 QLASGNFTINGKKYYVKDNKVYEFNEQDRSRIDSVKKVANDFVDKFKDDENTEIAI-VRY 753 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL---------- 288 + A + N+ S + +K R+N L TN + +Y L Sbjct: 754 SSKADVVLDNSNKVFLSSKDNETIKKRINSLKADVATNIGDGIRKSYSILDKCDKDSEKY 813 Query: 289 ----YNEKESSHNTIGST----------------------------------RLKKFVIF 310 + +++ +T +K Sbjct: 814 MILMTDGVPTAYTCYANTIKTLNNCKYGEEELDLGYCPEGYIECYNRKYYYSEVKGDFKL 873 Query: 311 ITDGENSGASA---YQNTLNTLQICEYM----RNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 D + G + N L+ + ++ + V + L++ Sbjct: 874 ENDNRDEGYVIKFGDEYDKNALEYAKQAMQKSKSKNINNVIVGF-SDGIDTKKLKEIAGD 932 Query: 364 SGQFFAVNDSRELLESFDKITDKIQ 388 + Q+ D EL + +D+I I Sbjct: 933 NAQYKEAKDLGELSKQYDEIQKDIN 957 >gi|332255311|ref|XP_003276777.1| PREDICTED: collagen alpha-1(XXII) chain-like, partial [Nomascus leucogenys] Length = 695 Score = 78.8 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 68/195 (34%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ L + Sbjct: 45 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPDRTRVGVVRYSDRPTTAFELGLFGSQE 99 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK+ +L NTNT A+ + ++ + G K+ I +TDG + Sbjct: 100 EVKAAARRLAYHGGNTNTGDALRYITARSFS--PRAGGRPGDRAYKQVAILLTDGRSQDL 157 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 G++I++V V ++ L + F V+D + Sbjct: 158 VLDAAAAAHRA--------GIRIFAVGVG--EALKEELEEIASEPKSAHVFHVSDFNAID 207 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 208 KIRGKLRRRLCENVL 222 >gi|301784737|ref|XP_002927782.1| PREDICTED: collagen alpha-5(VI) chain-like [Ailuropoda melanoleuca] Length = 2524 Score = 78.8 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 38/384 (9%), Positives = 106/384 (27%), Gaps = 31/384 (8%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAA---VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIK 69 I ++D+ R + S D L ++ + + + I+ Sbjct: 261 IAASMDV-RYNCTRLGLMSYSDRTNIISLLNSSTTQYEFQEQIQKLSVQAGKSNAGAAIE 319 Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 K + + QI + + + Q + +F + + T Sbjct: 320 KMRLEAFSESSGSRKAQGVPQIAVLVTHRPSTDEVRDVAIQLRLQDVTVFGMNIQGADET 379 Query: 130 NLS--LRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 L S + + + Sbjct: 380 QLEEIASYPPRQMVSMLKSYADLETYSKNFQKKLRNEIWSQIST-RAEQMDLDRTGCIDT 438 Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 +I + + + V++G + Y+ + Sbjct: 439 KEADIYFLIDGSTSIQGKHFEQIKEFMLAVTGMF--------SIGPDKVQVGAVQYSDKM 490 Query: 248 VGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 ++N +K+ + + NT T A++ + +++ +++ Sbjct: 491 RVEFYINDNSNNVNLKNAILNIEQLQGNTYTGEALNFTLSIIKEDRKLRT-----SQVPC 545 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 ++I +TDG ++T + L+ E +R + I++V + + L++ + Sbjct: 546 YLIVLTDG--------RSTDDVLEPAERLRAEQVTIHAVGIGEAI--KVQLQQIAGGEER 595 Query: 367 FFAVNDSRELLESFDKITDKIQEQ 390 + L +++ +I + Sbjct: 596 VSFGQNFDSLRSIKNEVVHRICTE 619 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 32/160 (20%), Positives = 60/160 (37%), Gaps = 13/160 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + + VRIG + Y+ +N + V L + T T A+ H Sbjct: 845 DVGSDRVRIGALKYSDYPEILFHLGKYSNRSSVIEHLRRRRSTGGDTYTARALDHTNMMF 904 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E + + +K+ +I ITDG + + T +RN G+ IY+V V Sbjct: 905 TEE----YGSRIQQNVKQMLIVITDGVSHDRNLLNET------ALKLRNKGIDIYAVGVG 954 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 Q L + + F V++ +L + + + + + Sbjct: 955 --QADQLELEAMAGNKSKTFHVDNFNKLKDIYLPLQESMC 992 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 30/158 (18%), Positives = 55/158 (34%), Gaps = 17/158 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 IG I ++ NE+ +++++ +NT T A+ + K Sbjct: 663 DRTHIGVIQFSDKTREEFQLNKYFTQNEISDAIDRMSLIDKNTLTGNALISVDQYFTPAK 722 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + KF+I ITDGE A + +R+ G+ I+SV V Sbjct: 723 GARIGIK------KFLILITDGEAQDAVRDP--------AKALRDKGVVIFSVGV--YGA 766 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L + + F V +L K+ ++ Sbjct: 767 NRTQLEEISGDGNLVFQVESFDDLKAIESKLIFRVCAL 804 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 59/160 (36%), Gaps = 17/160 (10%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN-KLN-PYENTNTYPAMHHAYRELYNEKE 293 R+ Y+ + + N + + L L + T A+ A+R ++ Sbjct: 67 YRVAVAQYSDRLHSEFQLGTFKSRNPMLNHLKKNLGFLGGSLRTGHALREAHRTYFSAPA 126 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + V+ + Q+ + + + +R G++I SV + Sbjct: 127 GGRDKK--QFPPILVVLAS---------AQSEDDVEEASKALREDGVRIVSVGLQ--SAS 173 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ L+ ++ F + +R+L +F + +I +++ + Sbjct: 174 EEELKAMA-TAQFHFNLRSARDL-GAFSQNMTQIIKEATQ 211 >gi|148689164|gb|EDL21111.1| RIKEN cDNA E330026B02 [Mus musculus] Length = 1482 Score = 78.8 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 37/338 (10%), Positives = 98/338 (28%), Gaps = 31/338 (9%) Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 ++K K+ + + QI + + ++ Sbjct: 301 QVGQAYTGAALRKTRKEIFSAQRGSRKNQGVPQIAVLVTHRASEDNVTKAAVNLRREGVT 360 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNN 173 +F G+ + L ++ E+ + L + ++ L + Sbjct: 361 IFTMGIEGANPDELEKIASHPAEQFTSKLGNFSELATHNQTFLKKLRNQITHTVSVFSER 420 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + K + + P + +V A + Sbjct: 421 TETLKSACVDTEEADIYLLIDGSGSTQP------TDFHEMKTFLSEVVGMFNIAPHK--- 471 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 VR+G + Y +N ++ + + TNT A++ + L K Sbjct: 472 --VRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAK 529 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + + ++ +T+G + + L +R ++++++ V Sbjct: 530 KERGSK-----VPCHLVVLTNGMSRDS--------VLGPAHKLREENIRVHAIGV--KEA 574 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q LR+ + + V+D L +++ +I + Sbjct: 575 NQMQLREIAGEEKRVYYVHDFDALRNIRNQVVQEICAE 612 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 42/354 (11%), Positives = 102/354 (28%), Gaps = 34/354 (9%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + I++ ++ + ++A Sbjct: 838 ADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDHPMGGNTYTAEALAFSNHM 897 Query: 117 NLFL------KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L ++ + + Sbjct: 898 FTEARGSRLHKGVPQVLIVITDGESHDAEKLNATAKALRDKGILVLAVGIAGANSWELLA 957 Query: 171 NNNMTSNKYLLPPPPK----KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 Y + S S + S + ++ S + QK Sbjct: 958 MAGSGDKYYFVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQK 1017 Query: 227 A---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTN 276 + VRIG ++ E+ +++ + T+ Sbjct: 1018 MKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTH 1077 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ R++ + + + + ++ +TDG + Q E +R Sbjct: 1078 IGDAL----RKVKYYFQPDTGSRINAGTPQVLLVLTDGRSQD--------EVAQAAEELR 1125 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 1126 HKGVDIYSVGIG--DVDDQELVQITGTAEKKLTVHNFDELKKVKKRIVRNICTS 1177 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + +K Sbjct: 656 DRVQIGVVQFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDK 715 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE +R G+ IYSV + Sbjct: 716 GARP------NVRKFLILITDGEAQD--------IVRDPALALRKEGVIIYSVGVFGSNV 761 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 762 ---TQLEEISGKPEMVFYVENFDILQHIEDDLVLGICSPR 798 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 54/162 (33%), Gaps = 17/162 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNE 291 R+ Y+ + N N + + L N + A+ A+R ++ Sbjct: 61 NKYRVALAQYSDALHNEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSA 120 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + V+ + ++ + + + +R G+KI SV V Sbjct: 121 PTNGRDKK--QFPPILVVLAS---------AESEDDVEEAAKALREDGVKIISVGVQK-- 167 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ L+ S F + +R+L F +I + + Sbjct: 168 ASEENLKAMATSQ-FHFNLRTARDLS-VFAPNMTEIIKDVTQ 207 >gi|188994393|ref|YP_001928645.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] gi|188594073|dbj|BAG33048.1| aerotolerance-related membrane protein BatA [Porphyromonas gingivalis ATCC 33277] Length = 327 Score = 78.8 bits (192), Expect = 2e-12, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 60/198 (30%), Gaps = 40/198 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + N PL+ + + + + L + T Sbjct: 108 NRLEAAKDVAISFINNRPNDNIGMVTFAGESFTQCPLTTDHTVLLNMVQDLQMGVLDDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A L S + VI +TDG N+ + + Sbjct: 168 AIGMGLATAVNRL----------KDSKAKSRVVILLTDGSNNMG-----DITPRMAADIA 212 Query: 336 RNAGMKIYSVAVSAP--------------------PEGQDLLRKCTD-SSGQFFAVNDSR 374 R G+++Y+V V + L + S G++F D+ Sbjct: 213 RTFGIRVYTVGVGTRGEAPFPIQTEFGVRIQNVPVDIDEPTLDGIAEVSGGKYFRAVDNE 272 Query: 375 ELLESFDKITDKIQEQSV 392 L E + +I DK+++ + Sbjct: 273 TLNEIYKEI-DKLEKTRL 289 >gi|317508725|ref|ZP_07966378.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] gi|316252973|gb|EFV12390.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] Length = 350 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 29/233 (12%), Positives = 73/233 (31%), Gaps = 32/233 (13%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S ++D ++A V+ ++ +++G + + Sbjct: 105 NRATVVLVIDISLSMMCDDVRPTRVDAARQAAIKFVDEMEPT--------LQLGLVTFAG 156 Query: 246 GIVGNQCTPLSNNLNEVKSRLNK-LNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGS 301 S++ VK L+ + P T T ++ A +++ S Sbjct: 157 TA--QTLIAPSSDHEVVKRALDDAIRPDKLAARTATGEGIYTALQQIET--LKGILGGAS 212 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------------ 349 ++ +DG+ + + + + IYS++ Sbjct: 213 KAPPARIVLESDGKETVPDDLNAPRGAFTAAKEAKAKEVPIYSISFGTASPIPYVNIQGS 272 Query: 350 ----PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 P + L + S G+FF + +L + + + +I V+ +R Sbjct: 273 RVPVPADDASLQKVAELSGGKFFTASSLDQLTDVYSSLNAEIGYDLVKQESSR 325 >gi|109087573|ref|XP_001097885.1| PREDICTED: collagen alpha-1(XXII) chain-like [Macaca mulatta] Length = 232 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 69/200 (34%), Gaps = 20/200 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ L + Sbjct: 45 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPDRTRVGVVRYSDRPTTAFELGLFGSRE 99 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK+ +L NTNT A+ + ++ + G K+ I +TDG + Sbjct: 100 EVKAAARRLAYHGGNTNTGDALRYITALSFS--PRAGGRPGDRAYKQVAILLTDGRSQDL 157 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 G++I++V V ++ L + F V+D + Sbjct: 158 VLDAAAAAHRA--------GIRIFAVGVGK--ALKEELEEIASEPKSAHVFHVSDFNAID 207 Query: 378 ESFDKITDKIQEQSVRIAPN 397 + K+ ++ E AP Sbjct: 208 KIRGKLRRRLCESECSRAPR 227 >gi|305663382|ref|YP_003859670.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] gi|304377951|gb|ADM27790.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] Length = 323 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 69/206 (33%), Gaps = 25/206 (12%) Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 ++ T +D + + +V+ I IG + ++ I Sbjct: 99 FNARTPVVIIVDTSGSMADNMDSVKYALRTMVSLFNNTID--------IGLVEFSHSIKS 150 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 P + N + + ++++ T A+ A L +E + F + Sbjct: 151 A--IPPTPNRSYIDMVIDRMEAGGGTMYSFALSTALSWLRPYRE--------LNVSAFTV 200 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFF 368 FITDG + + + + G+ IY+V + P G D + + G+ F Sbjct: 201 FITDGLP------GDPQDYRPLLDEYNKLGIPIYTVFIGEDPRGIDETKLIASKTGGEQF 254 Query: 369 AVNDSRELLESFDKITDKIQEQSVRI 394 V L ++ + I KI I Sbjct: 255 TVESIDRLSDTLNTIASKINTIIANI 280 >gi|282858824|ref|ZP_06267969.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] gi|282588393|gb|EFB93553.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] Length = 318 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 61/185 (32%), Gaps = 31/185 (16%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--------ENT 275 + + N PL+ + + + LN + + T Sbjct: 113 AKNVASDFINGRPNDNIGLTIFAGEAFTQCPLTIDHATLINLLNNVRADLVVKGLIQDGT 172 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K VI +TDG N+ S + Sbjct: 173 AIGMGLANAVGRLKASNAKS----------KIVILLTDGSNNVGSIS-----PMTAATIA 217 Query: 336 RNAGMKIYSVAVSA------PPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQ 388 + +++Y++ + L++ ++G+F+ ELL+ ++ I +K++ Sbjct: 218 KKFNIRVYTIGLGTEQSGNYNDIDYTTLKQIALTTNGEFYRAQSQTELLQIYNDI-NKLE 276 Query: 389 EQSVR 393 + ++ Sbjct: 277 KTKLK 281 >gi|314923047|gb|EFS86878.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL001PA1] gi|314966819|gb|EFT10918.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA2] gi|315093261|gb|EFT65237.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL060PA1] gi|315103481|gb|EFT75457.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA2] gi|327327645|gb|EGE69421.1| von Willebrand factor, type A [Propionibacterium acnes HL103PA1] Length = 320 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 21/222 (9%), Positives = 58/222 (26%), Gaps = 32/222 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A ++ +A + + + R Sbjct: 87 DRATVVVAIDVSRSMVATDVEPSRLSAAKTAAKDFLGDLP----------PRFNVSLVKF 136 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + + V + + L +T ++ + L + + Sbjct: 137 AASAQVVVPPTTDRAAVSTAITNLQVLPSTAIGEGIYSSLNALKLVPDD--PKHPGQKPP 194 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DG + L+ + + +Y++A Sbjct: 195 AAIVLLSDGATNVGRPS------LEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPV 248 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L S G+ F+ +L + + I + + V Sbjct: 249 NHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEKVF 290 >gi|228997913|ref|ZP_04157515.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock3-17] gi|228761788|gb|EEM10732.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock3-17] Length = 474 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 69/196 (35%), Gaps = 23/196 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG----NQCTPLS-NNLN 260 K+D+ E+ V+ + +A+ + G+ L + Sbjct: 196 GKMKMDIAKEAIQQFVSDLPEAVNVSLRVYGHKGSNDEKDKTASCGAIENIYTLQKYDQT 255 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + L+ P T A+ + + KE+ K + ++DG + Sbjct: 256 TFRQSLDGFQPVGWTPLAEAIKKSTETFQSAKEND---------KNIIYIVSDGVETCGG 306 Query: 321 AYQNTLNTLQICEYMRNAGMK--IYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 N ++ + + N+ +K + + E + L++ + S G++ N ++EL Sbjct: 307 ------NPVEEAQKVSNSNIKPIMNIIGFQVDHEAEKQLKEIAEVSKGKYVLANSAKELQ 360 Query: 378 ESFDKITDKIQEQSVR 393 + F + I + ++ Sbjct: 361 DQFKETGKDITSRRLK 376 >gi|229005450|ref|ZP_04163163.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock1-4] gi|228755812|gb|EEM05144.1| D-amino acid dehydrogenase, large subunit [Bacillus mycoides Rock1-4] Length = 474 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 69/196 (35%), Gaps = 23/196 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG----NQCTPLS-NNLN 260 K+D+ E+ V+ + +A+ + G+ L + Sbjct: 196 GKMKMDIAKEAIQQFVSDLPEAVNVSLRVYGHKGSNDEKDKTASCGAIENIYTLQKYDQT 255 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + L+ P T A+ + + KE+ K + ++DG + Sbjct: 256 TFRQSLDGFQPVGWTPLAEAIKKSTETFQSAKEND---------KNIIYIVSDGVETCGG 306 Query: 321 AYQNTLNTLQICEYMRNAGMK--IYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 N ++ + + N+ +K + + E + L++ + S G++ N ++EL Sbjct: 307 ------NPVEEAQKVSNSNIKPIMNIIGFQVDHEAEKQLKEIAEVSKGKYVLANSAKELQ 360 Query: 378 ESFDKITDKIQEQSVR 393 + F + I + ++ Sbjct: 361 DQFKETGKDITSRRLK 376 >gi|21221175|ref|NP_626954.1| lipoprotein [Streptomyces coelicolor A3(2)] gi|6969217|emb|CAB75310.1| putative lipoprotein [Streptomyces coelicolor A3(2)] Length = 532 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 40/346 (11%), Positives = 98/346 (28%), Gaps = 28/346 (8%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 G + + + ++ G+ + I D ++ Sbjct: 38 GSNGAKDTAADSRGGSAPMPAPDRPRGEGEQRYDGGTGAPGSGEGEKNGDSREIAPDPDH 97 Query: 100 PLQYIAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + + A Y L L A + + +D +R Sbjct: 98 LSTFALDVDTASYGYARRTLSEGRLPDPATVRPEEFVNSFRQDYDRPDGDGFSVTVDGAR 157 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + ++ + + + PP +F ++D+ E+ G Sbjct: 158 TDDEDWSLVRVGLATRPAERQSERPPAALTFVIDI-------SGSMGEPGRLDLAQEALG 210 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNKLNPYENTNT 277 + + ++ + + ++ L ++ V ++ L P ++TN Sbjct: 211 TMTDRLRDDDS--------VALVTFSDEAETVLPMTRLGDHRGRVHDAIDGLEPTDSTNL 262 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 M Y + V+ ++D + +T+ +I R Sbjct: 263 GAGMETGYETAVEGRREGATNR--------VVLVSDALANTGDTDADTI-LERIATERRE 313 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 G+ ++ V V + G L+ + D G V+ + + E F + Sbjct: 314 HGITLFGVGVGSD-YGDALMERLADKGDGHTTYVSTTEDAREVFSE 358 >gi|329928736|ref|ZP_08282585.1| IPT/TIG domain protein [Paenibacillus sp. HGF5] gi|328937517|gb|EGG33935.1| IPT/TIG domain protein [Paenibacillus sp. HGF5] Length = 964 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 68/186 (36%), Gaps = 29/186 (15%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +I+ +A ++ + + + + PL+ + K ++ Sbjct: 85 NRINAAKNAAKGFIDLMDMTKHQVGIVGY--------SSVAETSSLPLTTDTAAAKQFID 136 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T T A+ A L + + + ++ +TDGE + ++ Sbjct: 137 PIVASGGTETGYAIDQAITLLSSHRPEAQP---------VIVIMTDGEAN------SSQA 181 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEG------QDLLRKCTDSSGQFFAVNDSRELLESFD 381 L+ + ++AG+ Y++A+ P + +LL++ ++ V S L E + Sbjct: 182 ALERAQAAKDAGIVFYTIALLGPNDNPDTSAPNELLKQMATTNSHHHFVLGSTGLAEIYA 241 Query: 382 KITDKI 387 I +I Sbjct: 242 AIVAEI 247 >gi|296393889|ref|YP_003658773.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] gi|296181036|gb|ADG97942.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] Length = 343 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 32/241 (13%), Positives = 75/241 (31%), Gaps = 32/241 (13%) Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 L P ++ S ++D ++A V+ ++ ++ Sbjct: 90 TTLARVPKNRATVVLVVDISLSMVCDDVRPTRVDAARQAAIKFVDEMEPT--------LQ 141 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLN-KLNPY---ENTNTYPAMHHAYRELYNEKE 293 +G + + S++ VK L+ + P T T ++ A +++ Sbjct: 142 LGLVTFAGTA--QTLIAPSSDHEIVKHALDEAIRPDKLAARTATGEGIYTALQQIETLS- 198 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---- 349 S S ++ +DG+ + + + + IYS++ Sbjct: 199 -SILGGKSKAPSARIVLESDGKETVPDDLNAPRGAFTAAKEAKAKEVPIYSISFGTTRPI 257 Query: 350 ------------PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 P + L + S G+FF +L + + + +I V+ + Sbjct: 258 PYVNIQGSRVPVPADDASLQKVAELSGGKFFTAGSLDQLSDVYSSLNAEIGYDLVKQDSS 317 Query: 398 R 398 R Sbjct: 318 R 318 >gi|327274976|ref|XP_003222250.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2088 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 27/161 (16%), Positives = 58/161 (36%), Gaps = 13/161 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + V+ G + Y+ + + + T T A+ H+ Sbjct: 831 DVSHDRVQFGAVKYSAEPETFFYLNRYTTKSAIIRAIQNDKSIGETTYTAKALRHSEGLF 890 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E H + + + +I ITDG+ + ++ + +R G+ IY++ + Sbjct: 891 SEE----HGSRKHRGVPQVLIVITDGD------SHDAAELDEVSKKLRANGIIIYAIGIE 940 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + L S ++F VN L + +I++KI Sbjct: 941 RARPDELL--TMAGSEDKYFYVNTFEGLKHLYPRISEKICS 979 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 73/191 (38%), Gaps = 22/191 (11%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + ++ + ++V +V++G Y+ + ++ E Sbjct: 998 SNSISDSDFTKMKNFLQDVVRPFDTGH------NVQVGIAQYSDRYRKEFSLNMFSHKLE 1051 Query: 262 VKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +++++ ++ E T A+ E + + +++ ++ ITDG Sbjct: 1052 LETQIGRIRQMEGLQTYIGAALDRVRNFFTPEG----GSRVNENIQQILLVITDG----- 1102 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 ++ ++ E +R G+ IY++ V L + SS + + V++ EL Sbjct: 1103 ---RSHDKVVKAAEDLRKKGVDIYAIGVGR--IDHLQLSQIAGSSDRKYTVDNFSELKVI 1157 Query: 380 FDKITDKIQEQ 390 ++ D I E+ Sbjct: 1158 KKRLVDDICEE 1168 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 52/163 (31%), Gaps = 17/163 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + +G I ++ +++ + +++ +T T A+ Sbjct: 644 DIGLDRMHVGAIQFSSRNKEEFRLSQYATKSDIIRAIGRMSLMGQSTLTGGALQFVLDYF 703 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 +KK +I ITDGE + E +R G+ +YSV V Sbjct: 704 ------RPIKGSRPYVKKILILITDGEAQD--------DVKTPAEALRQEGIIVYSVGVF 749 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + L + + F V D L ++I I Sbjct: 750 --NANRTQLVEISGKPEMVFYVEDFDILKHLENEILFGICSPY 790 Score = 60.7 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 22/172 (12%), Positives = 60/172 (34%), Gaps = 24/172 (13%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAY 285 + + K+ +RIG + Y+ ++ + V + ++P N +H Sbjct: 248 SSLDVKDKCMRIGLVTYSNKPQATSLLRMATDKTHVLQSIQSISPKGGKANLGSVIHFTK 307 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 ++++ S + S +++ I IT + + +R AG+ I+++ Sbjct: 308 EKVFS---KSAGSRKSQGVEQIAIVITH--------RSSEDDVSGAATALRRAGVTIFAI 356 Query: 346 AVSAPPEGQDLLRKCTDSS--------GQFFAVNDSREL--LESFDKITDKI 387 + L + F + + ++ + ++I K+ Sbjct: 357 GI--DAANTTQLAQIVAYPPEKHLTRLKTFSHLPNKTQIFHKKIMNQIQQKL 406 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 20/160 (12%), Positives = 51/160 (31%), Gaps = 16/160 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 VR G + Y +++ + + N +T A+ + L Sbjct: 458 MVGPNKVRFGVVQYAEINELEFGPEEYGKTSDILKAIENIRQLRGNPHTGKALKFIHPLL 517 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 S + ++ +TD + + + ++N + IY++ + Sbjct: 518 RKS-----QGQHSRNVPCHLVVLTDQI--------SEDPVKEPAKKLKNEMVSIYAIGI- 563 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + + + +S + + VND L +++ I Sbjct: 564 -RHANESQIYEIAESKDRAYFVNDFASLKHIRNEVVRDIC 602 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 19/154 (12%), Positives = 43/154 (27%), Gaps = 20/154 (12%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYEN-T 275 + +I R ++ + N++ + + L Sbjct: 45 TFISKTINSLPLGPNE--YRAALAQFSDQVYNEFHLDTYKGKNQMLNHIKSKLVFKGGPL 102 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T A+ +Y +I K +I +T + + + + + Sbjct: 103 KTGNALRKIQESVY------WKSITGQNRSKILIVMT--------PQPSEDDVKEAAQML 148 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 + +G+K +VA+ D L FF Sbjct: 149 QKSGVK--TVALGMENASHDELFLIATKPYSFFF 180 >gi|50842462|ref|YP_055689.1| aerotolerance protein BatA [Propionibacterium acnes KPA171202] gi|289427042|ref|ZP_06428758.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|295130539|ref|YP_003581202.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|50840064|gb|AAT82731.1| conserved protein, putative BatA (bacteroides aerotolerance operon) [Propionibacterium acnes KPA171202] gi|289159511|gb|EFD07699.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|291375600|gb|ADD99454.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|332675379|gb|AEE72195.1| hypothetical protein PAZ_c10200 [Propionibacterium acnes 266] Length = 318 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 21/222 (9%), Positives = 58/222 (26%), Gaps = 32/222 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A ++ +A + + + R Sbjct: 85 DRATVVVAIDVSRSMVATDVEPSRLSAAKTAAKDFLGDLP----------PRFNVSLVKF 134 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + + V + + L +T ++ + L + + Sbjct: 135 AASAQVVVPPTTDRAAVSTAITNLQVLPSTAIGEGIYSSLNALKLVPDD--PKHPGQKPP 192 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DG + L+ + + +Y++A Sbjct: 193 AAIVLLSDGATNVGRPS------LEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPV 246 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L S G+ F+ +L + + I + + V Sbjct: 247 NHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEKVF 288 >gi|314981157|gb|EFT25251.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA3] gi|315091980|gb|EFT63956.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA4] Length = 320 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 21/222 (9%), Positives = 58/222 (26%), Gaps = 32/222 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A ++ +A + + + R Sbjct: 87 DRATVVVAIDVSRSMVATDVEPSRLSAAKTAAKDFLGDLP----------PRFNVSLVKF 136 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + + V + + L +T ++ + L + + Sbjct: 137 AASAQVVVPPTTDRAAVSTAITNLQVLPSTAIGEGIYSSLNALKLVPDD--PKHPGQKPP 194 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DG + L+ + + +Y++A Sbjct: 195 AAIVLLSDGATNVGRPS------LEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPV 248 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L S G+ F+ +L + + I + + V Sbjct: 249 NHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEKVF 290 >gi|291399431|ref|XP_002716111.1| PREDICTED: matrilin 1, cartilage matrix protein [Oryctolagus cuniculus] Length = 497 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 38/382 (9%), Positives = 106/382 (27%), Gaps = 58/382 (15%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 AA+L I T T + +++ I Sbjct: 102 AALLQAVRRIRPLATGTMTGLAIQFAITKALSDAEG----------GRVTSPGISKVVIV 151 Query: 95 KDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 P + + + LF G+ + + + S ++ Sbjct: 152 VTDGRPQDSVRDVSERARASGVELFAIGVGGRVDKATLRQIASEPQDEHVDYVESYSVIE 211 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF-------WSKNTTKSKYAPAPAPAN 207 +S+ + + + + + S+ ++ N+ Sbjct: 212 KLSKKFREAFCVVSDLCATGDHDCEQVCVSSPGSYTCACRDGFTLNSDGKTCNVCSGGGG 271 Query: 208 R------------------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 +++ + +V+++ + + ++G + Y+ + Sbjct: 272 SSATDLVFLIDGSKSVRPENFELVKKFINQIVDTL-----DVSDRLAQVGLVQYSSSVRQ 326 Query: 250 NQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 + ++K+ + ++ + T T A+ + + + +K Sbjct: 327 EFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVG 381 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQ 366 I TDG + + ++ G K+++V V +D LR+ Sbjct: 382 IVFTDGRSQDYIND--------AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEH 431 Query: 367 FFAVNDSRELLESFDKITDKIQ 388 +F D + + + K+ +I Sbjct: 432 YFYTADFKTITQIGKKLQKRIC 453 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 56/159 (35%), Gaps = 17/159 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 72 DVGPNATRVGVVNYASAVRQEFPLRAHGSKAALLQAVRRIRPLATGTMTGLAIQFAITKA 131 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E T S + K VI +TDG + + E R +G++++++ V Sbjct: 132 LSDAEGGRVT--SPGISKVVIVVTDGRPQDS--------VRDVSERARASGVELFAIGVG 181 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L + F + Sbjct: 182 -GRVDKATLRQIASEPQDEHVDYVESYSVIEKLSKKFRE 219 >gi|34541234|ref|NP_905713.1| batA protein [Porphyromonas gingivalis W83] gi|34397550|gb|AAQ66612.1| batA protein [Porphyromonas gingivalis W83] Length = 327 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 60/198 (30%), Gaps = 40/198 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + N PL+ + + + + L + T Sbjct: 108 NRLEAAKDVAISFINNRPNDNIGMVTFAGESFTQCPLTTDHTVLLNMVQDLQMGVLDDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A L S + VI +TDG N+ + + Sbjct: 168 AIGMGLATAVNRL----------KDSKAKSRVVILLTDGSNNMG-----DITPRMAADIA 212 Query: 336 RNAGMKIYSVAVSAP--------------------PEGQDLLRKCTD-SSGQFFAVNDSR 374 R G+++Y+V V + L + S G++F D+ Sbjct: 213 RTFGIRVYTVGVGTRGEAPFPIQTEFGVRIQNVPVDIDEPTLDGIAEVSGGKYFRAVDNE 272 Query: 375 ELLESFDKITDKIQEQSV 392 L E + +I DK+++ + Sbjct: 273 TLNEIYKEI-DKLEKTRL 289 >gi|297462925|ref|XP_608567.5| PREDICTED: collagen type VI alpha 6-like [Bos taurus] Length = 2343 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 59/165 (35%), Gaps = 15/165 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYR 286 + N VRIG ++ + E+ ++ + T+ A+ R Sbjct: 1025 DFDISNNRVRIGAAQFSHTYQPEFPLGMFIGKEEISFQIENIKQIFGYTHIGAAL----R 1080 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 ++ + + + ++ +TDG++ Q E +R+ G+ IYSV Sbjct: 1081 QVGHYFRPDMGSRIHAGTPQVLLVLTDGQSQD--------EVAQAAEELRHKGIDIYSVG 1132 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + L + T ++ + V++ EL + +I I Sbjct: 1133 IG--DVDDQQLVQITGTANKKLTVHNFDELKKVKKRIVRNICSPR 1175 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 36/352 (10%), Positives = 97/352 (27%), Gaps = 31/352 (8%) Query: 44 IVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY 103 S+ + ++K K+ + + QI + + Sbjct: 284 NKSEVLQDIQSLAPQAGKAYTGAALRKIRKEVFSAQHGSRKNQGVPQIAVLVTHSPSQDN 343 Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 + ++ +F G+ ++ T L ++ E+ L + ++ L Sbjct: 344 VTKAAVNLRRQGVIVFTIGVEGASDTQLEKIASHPAEQYVSQLRSFSDLAAHNQTFLKKL 403 Query: 164 YL----QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + + K + + + Sbjct: 404 RNQITHTVSVISERTETLKAGCVDTEEADIYLLIDGSGSTQ------ATDFQEMKTFLSE 457 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT-NTY 278 + A VR+G + Y N ++V+ + + NT Sbjct: 458 VAGMFNIAP-----QKVRVGAVQYADRWDLEFEISKYTNKHDVRKAIENIRQMGGNRNTG 512 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A++ L K+ + ++ +T+G + + + + Sbjct: 513 AALNFTLGLLQRAKQQRGGR-----VPCHLVVLTNGASRDSVSGP--------ANRLSEE 559 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + +Y++ V Q LR+ + + V+D L + +++ +I + Sbjct: 560 LIHVYAIGV--REANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEICAE 609 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 58/162 (35%), Gaps = 13/162 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + VR G + Y + EV S L P NT T A+ + Sbjct: 836 DVGKNHVRFGALKYADDPEVLFYLDNLDTKWEVISVLQNDQPLGGNTYTAEALGFSDHMF 895 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + +I ITDGE S + LN + +R+ G+ + +V + Sbjct: 896 TE----ARGSRLHKGVPQVLIVITDGE----SHDADKLNAT--AKALRDKGILVLAVGI- 944 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 A +LL SS ++F V L F ++ + Sbjct: 945 AGANPVELL-AMAGSSDKYFFVETFGGLKGIFSDVSASVCNS 985 Score = 63.4 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 50/159 (31%), Gaps = 19/159 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + E+ ++++ T A+ + K Sbjct: 653 DRVQIGVVQFSDVNKEEFQLNRYTSQEEISDAIDRMAHIGETTLMGSALTFVSQYFSPAK 712 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE +R G+ IYSV + Sbjct: 713 GARP------NVRKFLILITDGEAQD--------IVKDPAVALREEGIIIYSVGVFGSNV 758 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L + + F V + L D + I Sbjct: 759 ---TQLEEISGRPEMVFYVENFDILKHIEDDLVFGICSP 794 >gi|297471452|ref|XP_002685218.1| PREDICTED: collagen, type VI, alpha 1-like [Bos taurus] gi|296490817|gb|DAA32930.1| collagen, type VI, alpha 1-like [Bos taurus] Length = 2268 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 59/165 (35%), Gaps = 15/165 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYR 286 + N VRIG ++ + E+ ++ + T+ A+ R Sbjct: 1025 DFDISNNRVRIGAAQFSHTYQPEFPLGMFIGKEEISFQIENIKQIFGYTHIGAAL----R 1080 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 ++ + + + ++ +TDG++ Q E +R+ G+ IYSV Sbjct: 1081 QVGHYFRPDMGSRIHAGTPQVLLVLTDGQSQD--------EVAQAAEELRHKGIDIYSVG 1132 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + L + T ++ + V++ EL + +I I Sbjct: 1133 IG--DVDDQQLVQITGTANKKLTVHNFDELKKVKKRIVRNICSPR 1175 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 36/352 (10%), Positives = 97/352 (27%), Gaps = 31/352 (8%) Query: 44 IVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY 103 S+ + ++K K+ + + QI + + Sbjct: 284 NKSEVLQDIQSLAPQAGKAYTGAALRKIRKEVFSAQHGSRKNQGVPQIAVLVTHSPSQDN 343 Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 + ++ +F G+ ++ T L ++ E+ L + ++ L Sbjct: 344 VTKAAVNLRRQGVIVFTIGVEGASDTQLEKIASHPAEQYVSQLRSFSDLAAHNQTFLKKL 403 Query: 164 YL----QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + + K + + + Sbjct: 404 RNQITHTVSVISERTETLKAGCVDTEEADIYLLIDGSGSTQ------ATDFQEMKTFLSE 457 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT-NTY 278 + A VR+G + Y N ++V+ + + NT Sbjct: 458 VAGMFNIAP-----QKVRVGAVQYADRWDLEFEISKYTNKHDVRKAIENIRQMGGNRNTG 512 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A++ L K+ + ++ +T+G + + + + Sbjct: 513 AALNFTLGLLQRAKQQRGGR-----VPCHLVVLTNGASRDSVSGP--------ANRLSEE 559 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + +Y++ V Q LR+ + + V+D L + +++ +I + Sbjct: 560 LIHVYAIGV--REANQTQLREIAGEEKRVYYVHDFDALKDIRNQVVQEICAE 609 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 34/162 (20%), Positives = 58/162 (35%), Gaps = 13/162 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + VR G + Y + EV S L P NT T A+ + Sbjct: 836 DVGKNHVRFGALKYADDPEVLFYLDNLDTKWEVISVLQNDQPLGGNTYTAEALGFSDHMF 895 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + +I ITDGE S + LN + +R+ G+ + +V + Sbjct: 896 TE----ARGSRLHKGVPQVLIVITDGE----SHDADKLNAT--AKALRDKGILVLAVGI- 944 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 A +LL SS ++F V L F ++ + Sbjct: 945 AGANPVELL-AMAGSSDKYFFVETFGGLKGIFSDVSASVCNS 985 Score = 63.4 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 50/159 (31%), Gaps = 19/159 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + E+ ++++ T A+ + K Sbjct: 653 DRVQIGVVQFSDVNKEEFQLNRYTSQEEISDAIDRMAHIGETTLMGSALTFVSQYFSPAK 712 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE +R G+ IYSV + Sbjct: 713 GARP------NVRKFLILITDGEAQD--------IVKDPAVALREEGIIIYSVGVFGSNV 758 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L + + F V + L D + I Sbjct: 759 ---TQLEEISGRPEMVFYVENFDILKHIEDDLVFGICSP 794 >gi|224081306|ref|XP_002190595.1| PREDICTED: matrilin 1, cartilage matrix protein [Taeniopygia guttata] Length = 493 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 46/395 (11%), Positives = 117/395 (29%), Gaps = 49/395 (12%) Query: 17 IDLAHIMYIRNQMQS-ALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQG 75 I+ A + +Q AA+L I T + + Sbjct: 81 INYASAVRNELSLQGPHSKAALLQAVRRIQPLSTGTMTGLAIQFAISRAFSAAEG----- 135 Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 G ++ I P + + A+ +F G+ + L + Sbjct: 136 -----GRGSAPNFKKVAIVVTDGRPQDGVQDVSARARAAGIEIFAIGVGRVDMGTLRQMA 190 Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF------ 189 + ++ + S ++ ++ ++ + + + + ++ Sbjct: 191 SEPLD-EHVDYVESYSVIEKLTHKFQEAFCVVSDLCATGDHDCEQVCVSTPGAYRCACRD 249 Query: 190 -WSKNTTKSKYAPAPAPANRKID---------VLIESAGNLVNSIQKAIQE---KKNLSV 236 +S N +D + LV I + + Sbjct: 250 GFSLNNDGKTCTACNGGLGSALDLVFLIDGSKSVRPENFELVKKFINQIVDSLEVSDKQA 309 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 ++G + Y+ + + ++K+ + K++ + T T A+ + ++ + Sbjct: 310 QVGLVQYSSSVRQEFPLGQFKSKKDIKAAVKKMSYMEKGTMTGQALKYLVDSSFSAINGA 369 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 + K I TDG + + + +++G ++++V V +D Sbjct: 370 R-----PGVPKVGIVFTDGRSQDYISD--------AAKKAKDSGFRMFAVGVG--NAVED 414 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 LR+ +F D R + + K+ KI Sbjct: 415 ELREIASEPVAEHYFYTADFRTISKIGKKLQMKIC 449 >gi|296168868|ref|ZP_06850540.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896485|gb|EFG76135.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 335 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 65/222 (29%), Gaps = 26/222 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A ++ E+ + + AI Sbjct: 95 NRAVVMLVIDVSESMAATDVPPDRLTAAKEAGKQFADELTPAINLGLVEF---------- 144 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + N VK+ ++ L P T T + A + + S G Sbjct: 145 AANASLLVSPTTNRAAVKAAIDSLKPAPKTATGEGLFTALQAIATV--GSVMGGGDGPPP 202 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--------------P 351 ++ +DG + + G++I +++ P Sbjct: 203 ARIVLESDGAENVPLDPNAPQGAFTAARAAKAEGVQISTISFGTPYGTVEYEGATIPVPV 262 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + Q L + C + GQ F + L + + +I ++V+ Sbjct: 263 DDQTLQKICEITDGQAFHADSLESLKNVYSTLQRQIGYETVK 304 >gi|289667993|ref|ZP_06489068.1| hypothetical protein XcampmN_05693 [Xanthomonas campestris pv. musacearum NCPPB4381] Length = 310 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 66/205 (32%), Gaps = 46/205 (22%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + ++ G TPL+ +L Sbjct: 92 VLGGKVVDRLTAAKAVLSDFLDR-----------RDGDRVGLLVFGQRAYALTPLTADLT 140 Query: 261 EVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V+ +L T A+ + + L +K+ ++ V+ +TDG N+ Sbjct: 141 SVRDQLRDSVVGLAGRETAIGDAIALSVKRLREQKQG----------QRVVVLLTDGVNT 190 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVS----------------APPEGQDLLRKCT 361 LN L+ E + G++++++A ++ LRK Sbjct: 191 AGV-----LNPLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEEGLRKIA 245 Query: 362 D-SSGQFFAVNDSRELLESFDKITD 385 + G+FF D+ EL + ++ Sbjct: 246 QQTGGRFFRARDTEELAGIYAELDR 270 >gi|289662175|ref|ZP_06483756.1| hypothetical protein XcampvN_03493 [Xanthomonas campestris pv. vasculorum NCPPB702] Length = 335 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 66/205 (32%), Gaps = 46/205 (22%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + ++ G TPL+ +L Sbjct: 117 VLGGKVVDRLTAAKAVLSDFLDR-----------RDGDRVGLLVFGQRAYALTPLTADLT 165 Query: 261 EVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V+ +L T A+ + + L +K+ ++ V+ +TDG N+ Sbjct: 166 SVRDQLRDSVVGLAGRETAIGDAIALSVKRLREQKQG----------QRVVVLLTDGVNT 215 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVS----------------APPEGQDLLRKCT 361 LN L+ E + G++++++A ++ LRK Sbjct: 216 AGV-----LNPLKAAELAKAEGVRVHTIAFGGSGGYSLFGVPIPAGGNDDIDEEGLRKIA 270 Query: 362 D-SSGQFFAVNDSRELLESFDKITD 385 + G+FF D+ EL + ++ Sbjct: 271 QQTGGRFFRARDTEELAGIYAELDR 295 >gi|260774144|ref|ZP_05883059.1| protein BatA [Vibrio metschnikovii CIP 69.14] gi|260611105|gb|EEX36309.1| protein BatA [Vibrio metschnikovii CIP 69.14] Length = 322 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 64/204 (31%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 ++ + + + + R+G I + TPL+ + V Sbjct: 105 GDDYIDRLTAVKQVLSDFIAQ---------RQGDRLGLIFFADHAYLQ--TPLTLDRTTV 153 Query: 263 KSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +LN+ L T + A + S ++ +I ++DG N+ Sbjct: 154 AQQLNQAVLRLIGTQTAIGDGIGLATKTFIE----------SDAPQRVMILLSDGSNNAG 203 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD 362 L+ ++ + IY+V V A ++ L+K D Sbjct: 204 V-----LDPIEAAHIAKQYHTTIYTVGVGAGEMMVRDFFMTRRINTAEDLDEETLQKIAD 258 Query: 363 -SSGQFFAVNDSRELLESFDKITD 385 + GQ+F D +L + I Sbjct: 259 LTGGQYFRARDKHDLQTIYQTIDK 282 >gi|332882611|ref|ZP_08450223.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332679411|gb|EGJ52396.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 547 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 27/207 (13%), Positives = 65/207 (31%), Gaps = 22/207 (10%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 N K+ +L S L+ ++ ++ + Y G Sbjct: 187 SNIVFLIDVSGSMDEENKLPLLQSSFKMLLGQLR--------PDDKVAIVTYANGTKVAL 238 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + + ++ L+ L T+ + AY + + N +I Sbjct: 239 PSTSVKDKEKIIKVLDNLYASGGTSGGKGIQLAYEQAQKSFIKNGNNR--------IILA 290 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAV 370 TDG+ + NT + + E R +G+ + + D+ D G + + Sbjct: 291 TDGDFNIG--INNTTDLEKFIEKQRESGIYMSVLGFGMGNYRDDMAETIADKGNGNYAYI 348 Query: 371 NDSRELLESFDKITDKIQEQSVRIAPN 397 ++ E + + +++ +A + Sbjct: 349 DNITEAKKV---LVNELSGTLFAVAKD 372 >gi|297678514|ref|XP_002817115.1| PREDICTED: collagen alpha-1(XII) chain-like isoform 1 [Pongo abelii] Length = 3115 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRNVGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 20/165 (12%), Positives = 52/165 (31%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V++ Y+ + + + L NT T A++ ++ Sbjct: 1227 DIGPKRVQVALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ- 1285 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1286 ----NFRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI- 1332 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 1333 -KNADEVELKMIATDPDDTHAYNVADFESLSRIVDDLTINLCNSV 1376 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|224024929|ref|ZP_03643295.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] gi|224018165|gb|EEF76163.1| hypothetical protein BACCOPRO_01660 [Bacteroides coprophilus DSM 18228] Length = 332 Score = 78.4 bits (191), Expect = 2e-12, Method: Composition-based stats. Identities = 29/252 (11%), Positives = 67/252 (26%), Gaps = 56/252 (22%) Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + + S A +++ + A + Sbjct: 72 ARPQTTDNWQNTEIEGIDIMLAVDVSTSMLAEDLKPNRLEAAKQVAAEFI---------- 121 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN--------PYENTNTYPAMHH 283 N PL+ + + + + + T + + Sbjct: 122 -NGRPNDNIGLTIFAGEAFTQCPLTVDHGVLLNLFQSIKCDIAQKGLIMDGTALGMGLAN 180 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A L S K +I +TDG N+ L E + G+++Y Sbjct: 181 AVSRL----------KDSKAKSKVIILLTDGVNNRGDIS-----PLTAAEIAKQFGIRVY 225 Query: 344 SVAVSAPPE--------------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDK 382 ++ V + L + ++G +F + +L E + + Sbjct: 226 TIGVGTNGTAPYPMQTYAGVQYVQVPVEIDEQTLTQIAGTTNGNYFRATSNSKLKEVYQE 285 Query: 383 ITDKIQEQSVRI 394 I DK+++ + + Sbjct: 286 I-DKLEKTKLNV 296 >gi|314918209|gb|EFS82040.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA1] Length = 320 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 21/222 (9%), Positives = 58/222 (26%), Gaps = 32/222 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A ++ +A + + + R Sbjct: 87 DRATVVVAIDVSRSMVATDVEPTRLSAAKTAAKDFLGDLP----------PRFNVSLVKF 136 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + + V + + L +T ++ + L + + Sbjct: 137 AASAQVVVPPTTDRAAVSTAITNLQVLPSTAIGEGIYSSLNALKLVPDD--PKHPGQKPP 194 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DG + L+ + + +Y++A Sbjct: 195 AAIVLLSDGATNVGRPS------LEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPV 248 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L S G+ F+ +L + + I + + V Sbjct: 249 NHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEKVF 290 >gi|146295744|ref|YP_001179515.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus DSM 8903] gi|145409320|gb|ABP66324.1| von Willebrand factor, type A [Caldicellulosiruptor saccharolyticus DSM 8903] Length = 909 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 65/198 (32%), Gaps = 25/198 (12%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + K+++ +A +++ ++ + IA++ Sbjct: 415 SGSMGGSNLRNINKLEIAKSAAAKMIDHLESSDSVG--------VIAFDHNFYWASKFGK 466 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + NEV ++ + T P + A L S K ++ +TD Sbjct: 467 LKSKNEVIENISTIQVGGGTAIIPPLTEAVNLL----------KKSKAKDKVIVLLTD-- 514 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 Y + +KI ++ V + L +SG+F+ V D+ Sbjct: 515 -----GYGEEGGYEYPASIAKRNNIKITTIGVGSSINAPILSWMAAYTSGRFYYVKDASN 569 Query: 376 LLESFDKITDKIQEQSVR 393 L++ F K I+ + ++ Sbjct: 570 LIDVFLKEAKIIKGKYIK 587 >gi|300783401|ref|YP_003763692.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] gi|299792915|gb|ADJ43290.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] Length = 535 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 45/378 (11%), Positives = 101/378 (26%), Gaps = 39/378 (10%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 +A++ +++ D T + + + L GS + + + + Sbjct: 164 SALVGVASALAGAGNAIDARQIASVTPQLTQFFSAQTLSAGSSGWLSDAYVRRATGPDAV 223 Query: 95 KDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 N + + A ++P + + L + + + L Sbjct: 224 DGLINYESVLLSANASGKLPQPLTLVYPSDGVVTADYPLTLLADAGSDARSAHQRLADHL 283 Query: 155 DVSRSMEDLYLQKHNDN----------------------NNMTSNKYLLPPPPKKSFWSK 192 + + + LL K Sbjct: 284 RTPAVQKRIMETTQRRPVVPGVALGPQFAQRDLVELPFPATQQAVDALLSAYFDKLRRPS 343 Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 T +ID L + L + + R Sbjct: 344 RTLYVLDTSGSMAGA-RIDSLRSALVGLTGADTSLTGRFRRFRSREEVTMLPFNTGPGAP 402 Query: 253 TPLS-------NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + L ++K+ L T Y ++ AY+ L + + S Sbjct: 403 RTFTVPEENPAAELAQIKTFAEGLVARGGTAIYDSLSRAYQVLEPLMAADPDRFTS---- 458 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 ++ +TDGEN+ S+ + L +L + +++V + L + T + G Sbjct: 459 --IVLMTDGENANGSSLPDFLTSLASLPPA-MKQVPVFTVLFGEGSSDE-LTQVATRTGG 514 Query: 366 QFFAVNDSRELLESFDKI 383 + F + +L F +I Sbjct: 515 KVFDARNV-QLSRVFQEI 531 >gi|113476847|ref|YP_722908.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167895|gb|ABG52435.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 477 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 24/145 (16%), Positives = 47/145 (32%), Gaps = 21/145 (14%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 T + E+K + L P TN + L N + Sbjct: 90 SSNSQVLTNFDADKTELKQAIANLTPSGGTNLSQGLKTVASLLRNSNTPN---------- 139 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 ++ TDG+ + + + I +R AG+ + +V + L T++ Sbjct: 140 --ILLFTDGQPN------DPRASKSIAREIREAGINLVTVG--TGDANSNYLTSLTENPD 189 Query: 366 QFFAVNDSRELLESFDKITDKIQEQ 390 F +S E+ ++F I + Sbjct: 190 LVFFA-NSGEIDQAFRAAEKAISQL 213 >gi|256787646|ref|ZP_05526077.1| lipoprotein [Streptomyces lividans TK24] gi|289771539|ref|ZP_06530917.1| lipoprotein [Streptomyces lividans TK24] gi|289701738|gb|EFD69167.1| lipoprotein [Streptomyces lividans TK24] Length = 532 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 40/346 (11%), Positives = 98/346 (28%), Gaps = 28/346 (8%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 G + + + ++ G+ + I D ++ Sbjct: 38 GSNGAKDTAADSRGGSAPMPAPDRPRGEGEQRYDGGTGAPGSGEGEKNGDSREIAPDPDH 97 Query: 100 PLQYIAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + + A Y L L A + + +D +R Sbjct: 98 LSTFALDVDTASYGYARRTLSEGRLPDPATVRPEEFVNSFRQDYDRPDGDGFAVTVDGAR 157 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + ++ + + + PP +F ++D+ E+ G Sbjct: 158 TDDEDWSLVRVGLATRPAERQSERPPAALTFVIDI-------SGSMGEPGRLDLAQEALG 210 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNKLNPYENTNT 277 + + ++ + + ++ L ++ V ++ L P ++TN Sbjct: 211 TMTDRLRDDDS--------VALVTFSDEAETVLPMTRLGDHRGRVHDAIDGLEPTDSTNL 262 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 M Y + V+ ++D + +T+ +I R Sbjct: 263 GAGMETGYETAVEGRREGATNR--------VVLVSDALANTGDTDADTI-LERIATERRE 313 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 G+ ++ V V + G L+ + D G V+ + + E F + Sbjct: 314 HGITLFGVGVGSD-YGDALMERLADKGDGHTTYVSTTEDAREVFSE 358 >gi|223462569|gb|AAI50696.1| RIKEN cDNA E330026B02 gene [Mus musculus] Length = 1182 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 37/338 (10%), Positives = 98/338 (28%), Gaps = 31/338 (9%) Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 ++K K+ + + QI + + ++ Sbjct: 301 QVGQAYTGAALRKTRKEIFSAQRGSRKNQGVPQIAVLVTHRASEDNVTKAAVNLRREGVT 360 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNN 173 +F G+ + L ++ E+ + L + ++ L + Sbjct: 361 IFTMGIEGANPDELEKIASHPAEQFTSKLGNFSELATHNQTFLKKLRNQITHTVSVFSER 420 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + K + + P + +V A + Sbjct: 421 TETLKSACVDTEEADIYLLIDGSGSTQP------TDFHEMKTFLSEVVGMFNIAPHK--- 471 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 VR+G + Y +N ++ + + TNT A++ + L K Sbjct: 472 --VRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAK 529 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + + ++ +T+G + + L +R ++++++ V Sbjct: 530 KERGSK-----VPCHLVVLTNGMSRDS--------VLGPAHKLREENIRVHAIGV--KEA 574 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q LR+ + + V+D L +++ +I + Sbjct: 575 NQMQLREIAGEEKRVYYVHDFDALRNIRNQVVQEICAE 612 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 42/354 (11%), Positives = 102/354 (28%), Gaps = 34/354 (9%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + I++ ++ + ++A Sbjct: 838 ADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVISVLQNDHPMGGNTYTAEALAFSNHM 897 Query: 117 NLFL------KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L ++ + + Sbjct: 898 FTEARGSRLHKGVPQVLIVITDGESHDAEKLNATAKALRDKGILVLAVGIAGANSWELLA 957 Query: 171 NNNMTSNKYLLPPPPK----KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 Y + S S + S + ++ S + QK Sbjct: 958 MAGSGDKYYFVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQK 1017 Query: 227 A---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTN 276 + VRIG ++ E+ +++ + T+ Sbjct: 1018 MKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTH 1077 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ R++ + + + + ++ +TDG + Q E +R Sbjct: 1078 IGDAL----RKVKYYFQPDTGSRINAGTPQVLLVLTDGRSQD--------EVAQAAEELR 1125 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 1126 HKGVDIYSVGIG--DVDDQELVQITGTAEKKLTVHNFDELKKVKKRIVRNICTS 1177 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + +K Sbjct: 656 DRVQIGVVQFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDK 715 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE +R G+ IYSV + Sbjct: 716 GARP------NVRKFLILITDGEAQD--------IVRDPALALRKEGVIIYSVGVFGSNV 761 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 762 ---TQLEEISGKPEMVFYVENFDILQHIEDDLVLGICSPR 798 Score = 43.0 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 54/162 (33%), Gaps = 17/162 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNE 291 R+ Y+ + N N + + L N + A+ A+R ++ Sbjct: 61 NKYRVALAQYSDALHNEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSA 120 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + V+ + ++ + + + +R G+KI SV V Sbjct: 121 PTNGRDKK--QFPPILVVLAS---------AESEDDVEEAAKALREDGVKIISVGVQK-- 167 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ L+ S F + +R+L F +I + + Sbjct: 168 ASEENLKAMATSQ-FHFNLRTARDLS-VFAPNMTEIIKDVTQ 207 >gi|289425049|ref|ZP_06426826.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|289154027|gb|EFD02715.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|313764512|gb|EFS35876.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA1] gi|313772105|gb|EFS38071.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL074PA1] gi|313801850|gb|EFS43084.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA2] gi|313807459|gb|EFS45946.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA2] gi|313809969|gb|EFS47690.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA1] gi|313812999|gb|EFS50713.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA1] gi|313816053|gb|EFS53767.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA1] gi|313818504|gb|EFS56218.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA2] gi|313820270|gb|EFS57984.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA1] gi|313822921|gb|EFS60635.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA2] gi|313825147|gb|EFS62861.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA1] gi|313827718|gb|EFS65432.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA2] gi|313830298|gb|EFS68012.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL007PA1] gi|313833672|gb|EFS71386.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL056PA1] gi|314915506|gb|EFS79337.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA4] gi|314920024|gb|EFS83855.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA3] gi|314925157|gb|EFS88988.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA3] gi|314932038|gb|EFS95869.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL067PA1] gi|314955908|gb|EFT00308.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA1] gi|314958220|gb|EFT02323.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA1] gi|314960059|gb|EFT04161.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA2] gi|314962858|gb|EFT06958.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA1] gi|314967774|gb|EFT11873.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA1] gi|314973303|gb|EFT17399.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA1] gi|314975981|gb|EFT20076.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL045PA1] gi|314978482|gb|EFT22576.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA2] gi|314988184|gb|EFT32275.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA2] gi|314989987|gb|EFT34078.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA3] gi|315078073|gb|EFT50124.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA2] gi|315084373|gb|EFT56349.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA2] gi|315085714|gb|EFT57690.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA3] gi|315088866|gb|EFT60842.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA1] gi|315096218|gb|EFT68194.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL038PA1] gi|315098476|gb|EFT70452.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA2] gi|315101164|gb|EFT73140.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA1] gi|315105440|gb|EFT77416.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA1] gi|315108385|gb|EFT80361.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA2] gi|327326130|gb|EGE67920.1| von Willebrand factor, type A [Propionibacterium acnes HL096PA2] gi|327330198|gb|EGE71947.1| von Willebrand factor, type A [Propionibacterium acnes HL097PA1] gi|327331995|gb|EGE73732.1| von Willebrand factor, type A [Propionibacterium acnes HL096PA3] gi|327443197|gb|EGE89851.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA2] gi|327445982|gb|EGE92636.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA2] gi|327448038|gb|EGE94692.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA1] gi|327450840|gb|EGE97494.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA3] gi|327453083|gb|EGE99737.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL092PA1] gi|327453814|gb|EGF00469.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA2] gi|328753528|gb|EGF67144.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL020PA1] gi|328754259|gb|EGF67875.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA1] gi|328754490|gb|EGF68106.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA2] gi|328760648|gb|EGF74215.1| von Willebrand factor, type A [Propionibacterium acnes HL099PA1] Length = 320 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 21/222 (9%), Positives = 58/222 (26%), Gaps = 32/222 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A ++ +A + + + R Sbjct: 87 DRATVVVAIDVSRSMVATDVEPSRLSAAKTAAKDFLGDLP----------PRFNVSLVKF 136 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + + V + + L +T ++ + L + + Sbjct: 137 AASAQVVVPPTTDRAAVSTAITNLQVLPSTAIGEGIYSSLNALKLVPDD--PKHPGQKPP 194 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DG + L+ + + +Y++A Sbjct: 195 AAIVLLSDGATNVGRPS------LEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPV 248 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L S G+ F+ +L + + I + + V Sbjct: 249 NHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEKVF 290 >gi|332244061|ref|XP_003271190.1| PREDICTED: collagen alpha-1(XII) chain [Nomascus leucogenys] Length = 3100 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRNVGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 52/165 (31%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+I Y+ + + + L NT T A++ ++ Sbjct: 1230 DIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ- 1288 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1289 ----NFRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI- 1335 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 1336 -KNADEVELKMIATDPDDTHAYNVADFESLSRIVDDLTVNLCNSV 1379 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 N + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPNKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|261415414|ref|YP_003249097.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261371870|gb|ACX74615.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302325633|gb|ADL24834.1| BatA protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 367 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 39/293 (13%), Positives = 83/293 (28%), Gaps = 67/293 (22%) Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 + + + I + LDVS SM L + + + Sbjct: 81 NAMEVEYTSTDGVDIMLALDVSGSMGTLDMLTRTEQAKLGVMNAEKI------------- 127 Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN--QCT 253 ++ + + R T G+ + Sbjct: 128 ---LKRGEYWKYSRLGYAQDVIAEFIGKRHSDRIGLSAFGARSFTQCPLTMDYGSLLEIL 184 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 S++L + +N T + +A L S + VI +TD Sbjct: 185 KASDDLAR-DTLVNNR-----TAIGDGLMNALARL----------KMSDAKSRVVILLTD 228 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--------------------- 352 G ++ + ++ E ++ G+K+Y+V V Sbjct: 229 GRDNASVVP-----PVRAAEVAKSLGVKVYTVGVGKKSGKILAFQQNPWTGEISWGERDI 283 Query: 353 ------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 +D+L+ + G+F+ + EL + + +I + + + IA R Sbjct: 284 TPEEGIDEDVLKAIASKTGGRFYRAENKAELEKIYSEIDELEKTEIETIAYAR 336 >gi|156616286|ref|NP_001096077.1| collagen alpha-6(VI) chain isoform 1 [Mus musculus] gi|189082903|sp|Q8C6K9|CO6A6_MOUSE RecName: Full=Collagen alpha-6(VI) chain; Flags: Precursor Length = 2265 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 36/338 (10%), Positives = 98/338 (28%), Gaps = 31/338 (9%) Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 ++K K+ + + QI + + ++ Sbjct: 301 QVGQAYTGAALRKTRKEIFSAQRGSRKNQGVPQIAVLVTHRASEDNVTKAAVNLRREGVT 360 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNN 173 +F G+ + L ++ E+ + L + ++ L + Sbjct: 361 IFTMGIEGANPDELEKIASHPAEQFTSKLGNFSELATHNQTFLKKLRNQITHTVSVFSER 420 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + K + + P + +V A + Sbjct: 421 TETLKSACVDTEEADIYLLIDGSGSTQP------TDFHEMKTFLSEVVGMFNIAPHK--- 471 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 VR+G + Y +N ++ + + TNT A++ + L K Sbjct: 472 --VRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAK 529 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + + ++ +T+G + + L +R ++++++ V Sbjct: 530 KERGSK-----VPCHLVVLTNGMSRDS--------VLGPAHKLREENIRVHAIGV--KEA 574 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q LR+ + + V++ L +++ +I + Sbjct: 575 NQTQLREIAGEEKRVYYVHEFDALRNIRNQVVQEICAE 612 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 41/354 (11%), Positives = 103/354 (29%), Gaps = 34/354 (9%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + +++ ++ + ++A Sbjct: 838 ADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHM 897 Query: 117 NLFL------KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L ++ + + Sbjct: 898 FTEARGSRLHKGVPQVLIVITDGESHDAEKLNTTAKALRDKGILVLAVGIAGANSWELLA 957 Query: 171 NNNMTSNKYLLPPPPK----KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 + Y + S S + S + ++ S + QK Sbjct: 958 MAGSSDKYYFVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQK 1017 Query: 227 A---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTN 276 + VRIG ++ E+ +++ + T+ Sbjct: 1018 MKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTH 1077 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ R++ + + + + ++ +TDG + Q E +R Sbjct: 1078 IGDAL----RKVKYYFQPDMGSRINAGTPQVLLVLTDGRSQD--------EVAQAAEELR 1125 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 1126 HKGVDIYSVGIG--DVDDQELVQITGTAEKKLTVHNFDELKKVKKRIVRNICTS 1177 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + +K Sbjct: 656 DRVQIGVVQFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDK 715 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE +R G+ IYSV + Sbjct: 716 GARP------NVRKFLILITDGEAQD--------IVRDPAIALRKEGVIIYSVGVFGSNV 761 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 762 ---TQLEEISGKPEMVFYVENFDILQHIEDDLVLGICSPR 798 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 54/162 (33%), Gaps = 17/162 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNE 291 R+ Y+ + N N + + L N + A+ A+R ++ Sbjct: 61 NKYRVALAQYSDALHNEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSA 120 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + V+ + ++ + + + +R G+KI SV V Sbjct: 121 PTNGRDKK--QFPPILVVLAS---------AESEDDVEEAAKALREDGVKIISVGVQK-- 167 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ L+ S F + +R+L F +I + + Sbjct: 168 ASEENLKAMATSQ-FHFNLRTARDLS-VFAPNMTEIIKDVTQ 207 >gi|296227162|ref|XP_002759255.1| PREDICTED: hypothetical protein LOC100397584 [Callithrix jacchus] Length = 1319 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 68/195 (34%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ L + Sbjct: 45 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPDRTRVGVVRYSDQPTTAFELGLFGSRE 99 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK+ +L NTNT A+ + ++ + G K+ I +TDG + Sbjct: 100 EVKAAARRLAYHGGNTNTGDALRYITARSFS--PHAGGRPGDRAYKQVAILLTDGRSQDL 157 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 G++I++V V ++ L + F V+D + Sbjct: 158 VLDAAAAAHRA--------GIRIFAVGVG--EALKEELEEIASEPKSAHVFHVSDFNAID 207 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 208 KIRGKLRRRLCENVL 222 >gi|221115612|ref|XP_002166292.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 484 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 43/279 (15%), Positives = 84/279 (30%), Gaps = 30/279 (10%) Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 Q E T++L+ G+I ++ + N S+S Sbjct: 74 QAEEKTQSLYTFGVIEEGDIKVTELFHELKAEKKTNFW---------SQSSYHWDRTAVP 124 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 N N L P S ++ D + L+N + Sbjct: 125 YKINSDINTNLKNPSCAGIVDVGFIIDSSWSL--------RDQYHQEIEFLINLAKTFNI 176 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 K G I ++ V N + + + ++++ T A+ A Sbjct: 177 SKNGA--HAGVITFSSRAVLNIKLNQYYDQEQFEFAIDEIPYMGYVTRIDLALRKALEMF 234 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + +TDGE + N + + +R+ G++I +V + Sbjct: 235 DEINGAR------KDIPQLLFLLTDGEQYAGDGVVDE-NPANVAQLVRDRGIEIIAVGIG 287 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSREL--LESFDKITD 385 + Q L SS + F + EL +I D Sbjct: 288 SGVN-QSELNSIAGSSEKVFLAENFDELINKNFLKRIKD 325 >gi|114608138|ref|XP_518589.2| PREDICTED: collagen, type XII, alpha 1 isoform 3 [Pan troglodytes] Length = 3119 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRNVGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 52/165 (31%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+I Y+ + + + L NT T A++ ++ Sbjct: 1230 DIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ- 1288 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1289 ----NFRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI- 1335 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 1336 -KNADEVELKMIATDPDDTHAYNVADFESLSRIVDDLTINLCNSV 1379 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|154488145|ref|ZP_02029262.1| hypothetical protein BIFADO_01716 [Bifidobacterium adolescentis L2-32] gi|154083618|gb|EDN82663.1| hypothetical protein BIFADO_01716 [Bifidobacterium adolescentis L2-32] Length = 835 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 43/370 (11%), Positives = 96/370 (25%), Gaps = 44/370 (11%) Query: 54 TTKKDQTSTIFKKQIKKHLKQ-GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYE 112 + + EN D T D Sbjct: 158 QPTAENADVPANGAAPAQDGDRNDHAGENRTDAVANGDAGATADDGKTGDADDADNDGNT 217 Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 + + N + + + + D Sbjct: 218 AEDADDAEHIMRDRFTFNRKTVMRAARNVAVPQPDHTKSITYNNGGKYTLNLNVVGKDTR 277 Query: 173 N--MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 T+ K + S +++ L E+A + +++ + Sbjct: 278 ESHETTEKIEVVLVLDTSGSMNYCMDGSQRGCNKSNPKRLTALKEAATSFIDATETTNDT 337 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 ++ + ++ G + L+++ +KS +++L+ T M A L Sbjct: 338 IQDENSKVRIAIAQFGQTSGVVSSLTSDTAALKSSVSRLSANGATPADKGMAAAQTALLR 397 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL--NTLQICEYMRNAGMKIYSVAVS 348 + + KK VIF DG + + + + + + M++AG IYS+ + Sbjct: 398 ARPGA---------KKVVIFFADGVPTTQNTFSTRVANDAVTTALAMKSAGTLIYSIGIF 448 Query: 349 APPE-------------GQDLLRKCT-----------------DSSGQFFAVNDSRELLE 378 + + + G + A N + +L + Sbjct: 449 EGANPEQQSFGNRENDQANQFMHAVSSNYPNATAYNKTNWGTGSNLGYYKATNSADDLTK 508 Query: 379 SFDKITDKIQ 388 FD I +I Sbjct: 509 IFDDIQKEIT 518 >gi|296198464|ref|XP_002746714.1| PREDICTED: collagen alpha-1(XXI) chain isoform 2 [Callithrix jacchus] Length = 954 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 64/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLMAAVESILYLGGNTRTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ E R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------EVKDAAEAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|296198462|ref|XP_002746713.1| PREDICTED: collagen alpha-1(XXI) chain isoform 1 [Callithrix jacchus] Length = 957 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 64/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + L NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSYDSGEHLMAAVESILYLGGNTRTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ E R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------EVKDAAEAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|288802180|ref|ZP_06407620.1| BatA protein [Prevotella melaninogenica D18] gi|288335147|gb|EFC73582.1| BatA protein [Prevotella melaninogenica D18] Length = 318 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 54/156 (34%), Gaps = 31/156 (19%) Query: 254 PLSNNLNEVKSRL----NKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P++ + + + L L T + ++ L S Sbjct: 143 PMTLDHAALLNLLHNVRTDLVTNGLMQDGTAIGLGLANSVSRL----------KDSKAKS 192 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS------APPEGQDLLRK 359 K VI +TDG N+ S + + G+++Y++ + L+ Sbjct: 193 KVVILLTDGSNNVGSIS-----PMTAATIAKKFGIRVYTIGLGRETGEDIGAIDYKTLQD 247 Query: 360 CT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 ++G+F+ EL + + I DK+++ + I Sbjct: 248 IAVLTNGEFYRAQSQAELSKIYQDI-DKLEKTKMNI 282 >gi|260769474|ref|ZP_05878407.1| protein BatA [Vibrio furnissii CIP 102972] gi|260614812|gb|EEX39998.1| protein BatA [Vibrio furnissii CIP 102972] gi|315182004|gb|ADT88917.1| von Willebrand factor type A domain protein [Vibrio furnissii NCTC 11218] Length = 322 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 31/215 (14%), Positives = 70/215 (32%), Gaps = 47/215 (21%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 + + + ++ + + + + K R+G I + Sbjct: 94 SYSMSQQDMKSGDQFIDRLSAVKQVLSDFIA---------KRQGDRLGLIFFADHAYLQ- 143 Query: 252 CTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 TPL+ + + +LN+ L T + A + + S ++ + Sbjct: 144 -TPLTLDRQTIAQQLNQAVLRLIGTQTAIGEGIGLATKTFID----------SDAPQRVM 192 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------P 351 I ++DG N+ L+ ++ + + IY+V V A Sbjct: 193 ILLSDGSNTSGV-----LDPMEAAKIAKKYHTTIYTVGVGAGEMMVKEFFMTRKINTAED 247 Query: 352 EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L+ D + GQ+F + ++L +D I Sbjct: 248 LDEKTLQAIADETGGQYFRARNQQDLQHIYDTINQ 282 >gi|254225237|ref|ZP_04918850.1| conserved hypothetical protein [Vibrio cholerae V51] gi|125622336|gb|EAZ50657.1| conserved hypothetical protein [Vibrio cholerae V51] Length = 318 Score = 78.0 bits (190), Expect = 2e-12, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 58/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + V ++LN+ L T + A + + Sbjct: 129 RIGLILFADHAYLQ--TPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFID--- 183 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 S ++ +I ++DG N+ L+ L+ + IY+V V A Sbjct: 184 -------SDAPQRVMILLSDGSNTAGV-----LDPLEAANIAKQYHTTIYTVGVGAGEMV 231 Query: 351 --------------PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + G +F + ++L +D I Sbjct: 232 VKDFLFSRKVNTSQDLDEKTLQSIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|297291177|ref|XP_001109727.2| PREDICTED: collagen alpha-1(XII) chain-like [Macaca mulatta] Length = 3095 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRNVGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 13/98 (13%), Positives = 34/98 (34%), Gaps = 12/98 (12%) Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 R +K + ITDG++ + + +++ G++++++ + + Sbjct: 1269 TQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNADEV 1318 Query: 356 LLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 L+ + V D L D +T + Sbjct: 1319 ELKMIATDPDDTHAYNVADFESLSRIVDDLTINLCNSV 1356 >gi|254421496|ref|ZP_05035214.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7335] gi|196188985|gb|EDX83949.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7335] Length = 410 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 71/195 (36%), Gaps = 15/195 (7%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + ++ + V I ++ + RI IA++ L+++ Sbjct: 41 NVCFVLDRSGSMMGTPLQTVKQAASRIVDRLSNRDRISIIAFDHKAEVLISNELASDPQA 100 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 +K R+N L T + +L + KE ++ +TDGE + Sbjct: 101 IKRRINSLRAGGGTCIDDGLKAGIEQLASGKEG---------YISQLLLLTDGE----NE 147 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 + + +++ + + + ++ QD+L + D G + + E + +F Sbjct: 148 HGDNSRAIKLADVAIGYNLTVNTLGFG-DHWNQDVLEQIADAGGGSLSYIEHAEEAIATF 206 Query: 381 DKITDKIQEQSVRIA 395 ++ ++Q S+ A Sbjct: 207 GRLFTRMQSVSLTNA 221 >gi|294678572|ref|YP_003579187.1| hypothetical protein RCAP_rcc03056 [Rhodobacter capsulatus SB 1003] gi|294477392|gb|ADE86780.1| conserved hypothetical protein [Rhodobacter capsulatus SB 1003] Length = 647 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 37/397 (9%), Positives = 88/397 (22%), Gaps = 95/397 (23%) Query: 5 IISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIF 64 + V + AIDL + R +Q+ +D AVL+ ++ Sbjct: 39 VFLVMLITTGIAIDLVRVEERRTLIQNTIDRAVLAAA-------SLTQKRDPTLVVKDYL 91 Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 K ++ S T + + L+ Sbjct: 92 TKAGLGYIASDSS---------------FTPKVEGSIALGWRRVSVEVDDDMPTIFGPLL 136 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN-----------NN 173 + + +T + + +++ + + ++ ++D N Sbjct: 137 GVSSLAATGDTTAMQAVGNVEISLVLDLSGSMTEYVKDNPSCTKNCTSSKTRFQYLQVAA 196 Query: 174 MTSNKYLLPPP----PKKSFWSKNTTKSKYAPAPAPANRKIDVLIE------------SA 217 + + S + + + A Sbjct: 197 KSFINTVFASSGSGVAAGRTSVSVVPYSTNVYLGSEMQEGYTLSSDFSVTGSSFAMPQCA 256 Query: 218 GNLVNSIQKAIQEKKNLSVRIG--------------------------TIAYNIGIVGNQ 251 + N + + R + N+ Sbjct: 257 DFVANDYNTMVIDGTGPLTRTMYGSSYKYSDSLSALVSDGSTSNNPGQDWHNCMNTPQNR 316 Query: 252 CTPLSNNLNEVKS----RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK-- 305 PLS++ + + ++KL T+ L + S Sbjct: 317 VIPLSSDPTFLAADKTGFIDKLTAGGWTSIDVGAKWGLALLDPSARDEVAKMTSVSSAFR 376 Query: 306 --------------KFVIFITDGENSGASAYQNTLNT 328 K ++ +TDG N+ + T Sbjct: 377 ETKPRPINYDGDTMKVLVLMTDGANTTNFSTLPGYRT 413 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 33/216 (15%), Positives = 64/216 (29%), Gaps = 6/216 (2%) Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 P + ++ + V + Q G Sbjct: 432 PSATTASGIYYYDATRSSTPYYKYETGTWVARSAITQTTQEQQVQTATCTQYKNSWGQWK 491 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN-----TYPAMHHAYRELYNEKESSHN 297 +N VG L ++ +K+ K N + H Y+ + Sbjct: 492 WNSCSVGTSDCTLISSTTSLKTYTCKKTTTVNVTTEAPLYDVSYDHLYKTKNWNLNTVAG 551 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 +G + N+ T ++C+ ++ G+ I+SVA AP G+ LL Sbjct: 552 LLGKPYGRSAGTQYELMANAVYDTSVKDARTKKLCDLAKSKGIYIFSVAADAPSGGKTLL 611 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + C+ + ++ V L +F I I + Sbjct: 612 KYCSSGTSYYYEVQ-GSNLSTAFASIAASISSLRLT 646 >gi|301164324|emb|CBW23882.1| conserved exported hypothetical protein [Bacteroides fragilis 638R] Length = 610 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 33/306 (10%), Positives = 86/306 (28%), Gaps = 26/306 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + E G+I + N++ + ++ A Y + L P Sbjct: 139 NAVNAEEYGEIQENGFKNVSDAPLSTFSIDVDA-ASYSNMRRFINKGKLPPVDAIRTEEL 197 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + + + + P + + N Sbjct: 198 VNYFSYDYPKPTGSDPVKITMEAGTCPWNADHRLVRIGLKAKEI------PTDNLPASNL 251 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 ++D++ S LVN+++ ++ + Y Sbjct: 252 VFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKD--------KVAIVTYAGNAGVKLEAT 303 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 +N +++ +++L +T + AY+ S N +I TDG Sbjct: 304 PGSNKQKIREAIDELEASGSTAGGEGIMLAYKIAQKNFISGGNNR--------IILCTDG 355 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + + L ++ E R +G+ + + ++ + G +++ Sbjct: 356 DFNVGVSSDKEL--EKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLAEKGNGNHAYIDNL 413 Query: 374 RELLES 379 +E Sbjct: 414 QEANRV 419 >gi|229514670|ref|ZP_04404131.1| protein BatA [Vibrio cholerae TMA 21] gi|229348650|gb|EEO13608.1| protein BatA [Vibrio cholerae TMA 21] Length = 318 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 57/170 (33%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + V ++LN+ L T + A + + Sbjct: 129 RIGLILFADHAYLQ--TPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSN- 185 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 ++ +I ++DG N+ L+ L+ + IY+V V A Sbjct: 186 ---------APQRVMILLSDGSNTAGV-----LDPLEAANIAKQYHTTIYTVGVGAGEMV 231 Query: 351 --------------PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + G +F + ++L +D I Sbjct: 232 VKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|254281808|ref|ZP_04956776.1| von Willebrand factor, type A [gamma proteobacterium NOR51-B] gi|219678011|gb|EED34360.1| von Willebrand factor, type A [gamma proteobacterium NOR51-B] Length = 328 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 70/195 (35%), Gaps = 30/195 (15%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENT 275 + + +++ + + V G PL+ + V + + E+T Sbjct: 116 SRIEAVKAIASDFTSQRVGDRVGLILFGTRAYVQAPLTFDTATVTRFIREAQLGFAGEDT 175 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 A+ A + L + +I +TDG+++ +T++ ++ Sbjct: 176 AIGDALGLAIKRL----------RERPAESRVLILLTDGQDTA-----STVDPMEATALA 220 Query: 336 RNAGMKIYSVAV---------SAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 +G+K+Y++ + + + LL + + G++F + EL + + Sbjct: 221 AESGIKVYTIGISRRIGARAGGSGEVDEALLNAIAEATGGEYFRARNPAELQSIYGVVDQ 280 Query: 386 K--IQEQSVRIAPNR 398 I++ + P R Sbjct: 281 LEPIEQNTSTFRPKR 295 >gi|115375477|ref|ZP_01462737.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] gi|310821370|ref|YP_003953728.1| von willebrand factor, type a [Stigmatella aurantiaca DW4/3-1] gi|115367520|gb|EAU66495.1| von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] gi|309394442|gb|ADO71901.1| Von Willebrand factor, type A [Stigmatella aurantiaca DW4/3-1] Length = 562 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 52/166 (31%), Gaps = 13/166 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 N + + + Y + + + + ++ L T M AYR Sbjct: 247 NENDTVAIVTYAGSTQDVLPPTPATEVQRIHTAIDLLQSGGGTAMGSGMELAYRHA---- 302 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 S VI +TDG+ + I E G+ + ++ Sbjct: 303 ----VKKASGNAISRVIVLTDGDANIGPNLSAESMLSGI-EKYVAEGVTLSTIGFGMGNY 357 Query: 353 GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 DL+ + D G F V+ +E + F+ ++ IA + Sbjct: 358 RDDLMERLADKGNGNCFYVDSYQEAKKVFEA---QLTGTLEVIAKD 400 >gi|95928343|ref|ZP_01311091.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] gi|95135614|gb|EAT17265.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] Length = 329 Score = 78.0 bits (190), Expect = 3e-12, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 64/198 (32%), Gaps = 45/198 (22%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSR 265 ++ L AG ++ RIG I + +N + Sbjct: 115 DRLTALKAVAGAFIDQ---------RQGDRIGLILFGEQPYIQAPLTFDHNTVTRLLHEA 165 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + L T A+ A + L + K +I +TDG ++ S Sbjct: 166 VVGL-AGNKTAIGDAIGLAVKRL----------RKDPQAKNVLILLTDGASNSGS----- 209 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPE-----------------GQDLLRKCT-DSSGQF 367 L+ L+ + G+K+Y++ + A + L+ + G++ Sbjct: 210 LDPLKAAKLAAQRGLKVYTIGIGAEAVEVGSFFFKRTVNPSLDLDEKTLKAIAETTGGRY 269 Query: 368 FAVNDSRELLESFDKITD 385 F D+ EL + + ++ Sbjct: 270 FRARDTEELAQIYQQLDQ 287 >gi|326681146|ref|XP_002665520.2| PREDICTED: collagen alpha-1(XXII) chain [Danio rerio] Length = 1623 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 68/200 (34%), Gaps = 22/200 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV S + R+ + Y+ L Sbjct: 45 TSSSVGKENFEKIRQWVANLVESF-----DVGVDKTRVAVVRYSDRPTVEFNLARYKTLE 99 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK + NT T A+ ++ E+ + + ++K I +TDG++ Sbjct: 100 EVKRAAGNIRYLGGNTKTGDAISFTTTNIFTERAGARPA--AKGIQKVAILLTDGQSQDF 157 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 + AG+++++V + ++ L + F V D + Sbjct: 158 VLEPSV--------AAAAAGIRLFAVGIG--EALKEELEEIAAEPKSAHVFHVTDFDAID 207 Query: 378 ESFDKITDKIQEQSVRIAPN 397 + ++ ++ E + PN Sbjct: 208 KIRGRLRRRLCENV--LCPN 225 >gi|254787807|ref|YP_003075236.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237686979|gb|ACR14243.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 689 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 38/387 (9%), Positives = 96/387 (24%), Gaps = 36/387 (9%) Query: 23 MYIRNQMQ-----SALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY 77 R+ +Q SA D L + + + P + + K+ Sbjct: 150 ETQRHALQEYDAISAKDIGALPSHEAARNLQRFASPAASSEAKREVLMKREASSFMPRKP 209 Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL----IPSALTNLSL 133 E + + + NP++ E + + Sbjct: 210 DLEPPHQLETADRDHFDTVATNPIKVTREEPVSTFSIDVDTASYSFVRRQLNRGQLPQKA 269 Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS---NKYLLPPPPKKSFW 190 + + + + K L P K+ Sbjct: 270 AVRLEEMVNYFPYDYPLPSAATAPFKPTITVIPAPWNQAKRLVHIGIKALPLAHPPKANL 329 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + K+ ++ +S L++ +Q + + Y Sbjct: 330 V----FLLDVSGSMGSPDKLPLVKQSMELLLSGLQPTDT--------VSIVVYAGAAGTV 377 Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 ++ + L++LN +T + AY+ + +I Sbjct: 378 LEPTPVAEQQKILAALDRLNAGGSTAGAQGIELAYQLAEANYQRDAVNR--------IIL 429 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV 370 TDG+ + + E R G+++ + + L+++ + Sbjct: 430 ATDGDFNVG--IADPEQLKGYVERKRANGIELSILGFGSGNYNDALMQQLAQNGNGV--A 485 Query: 371 NDSRELLESFDKITDKIQEQSVRIAPN 397 L E+ + ++ +A + Sbjct: 486 AYIDTLSEAQKVLVEQASGTLFTVAKD 512 >gi|229527849|ref|ZP_04417240.1| protein BatA [Vibrio cholerae 12129(1)] gi|229334211|gb|EEN99696.1| protein BatA [Vibrio cholerae 12129(1)] gi|327485392|gb|AEA79798.1| BatA aerotolerance operon [Vibrio cholerae LMA3894-4] Length = 318 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 57/170 (33%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + V ++LN+ L T + A + + Sbjct: 129 RIGLILFADHAYLQ--TPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSN- 185 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 ++ +I ++DG N+ L+ L+ + IY+V V A Sbjct: 186 ---------APQRVMILLSDGSNTAGV-----LDPLEAANIAKQYHTTIYTVGVGAGEMV 231 Query: 351 --------------PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + G +F + ++L +D I Sbjct: 232 VKDFLFSRKLNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|253565978|ref|ZP_04843432.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] gi|251945082|gb|EES85520.1| conserved hypothetical protein [Bacteroides sp. 3_2_5] Length = 610 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 32/306 (10%), Positives = 86/306 (28%), Gaps = 26/306 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + E G+I + N++ + ++ A Y + L P Sbjct: 139 NAVNAEEYGEIQENGFKNVSDAPLSTFSIDVDA-ASYSNMRRFINKGKLPPVDAIRTEEL 197 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + + + + P + + N Sbjct: 198 VNYFSYDYPKPTGSDPVKITMEAGTCPWNADHRLVRIGLKAKEI------PTDNLPASNL 251 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 ++D++ S LVN+++ ++ + Y Sbjct: 252 VFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKD--------KVAIVTYAGNAGVKLEAT 303 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 ++ +++ +++L +T + AY+ S N +I TDG Sbjct: 304 PGSDKQKIREAIDELEASGSTAGGEGIMLAYKIAQKNFISGGNNR--------IILCTDG 355 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + + L ++ E R +G+ + + ++ + G +++ Sbjct: 356 DFNVGVSSDKEL--EKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLAEKGNGNHAYIDNL 413 Query: 374 RELLES 379 +E Sbjct: 414 QEANRV 419 >gi|229522840|ref|ZP_04412254.1| protein BatA [Vibrio cholerae TM 11079-80] gi|229340057|gb|EEO05065.1| protein BatA [Vibrio cholerae TM 11079-80] Length = 318 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 58/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + V ++LN+ L T + A + + Sbjct: 129 RIGLILFADHAYLQ--TPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFID--- 183 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 S ++ +I ++DG N+ L+ L+ + IY+V V A Sbjct: 184 -------SDAPQRVMILLSDGSNTAGV-----LDPLEAANIAKQYHTTIYTVGVGAGEMV 231 Query: 351 --------------PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + G +F + ++L +D I Sbjct: 232 VKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|311244457|ref|XP_001927071.2| PREDICTED: collagen alpha-1(XII) chain [Sus scrofa] Length = 1894 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 58/166 (34%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ Sbjct: 148 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQFYQRDELLAAIKKIPYKGGNTMTGDAIDFLI 207 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 208 KNTFTESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRNIGVEVFSL 254 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 255 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 298 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 453 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 511 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 512 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 557 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 558 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 600 >gi|225010241|ref|ZP_03700713.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] gi|225005720|gb|EEG43670.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] Length = 330 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 29/181 (16%), Positives = 59/181 (32%), Gaps = 43/181 (23%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRE 287 K + RIG + Y TP++++ V L ++ + T + + Sbjct: 123 KDRPNDRIGLVVYAGESYTK--TPITSDKRLVLEALKEIKYGALTDGTAIGMGLATSVNR 180 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L S L K +I +TDG N+ + E +K Y++ + Sbjct: 181 L----------KDSKALSKVIILLTDGVNNAG-----FIEPATAAELAVEYDIKTYTIGL 225 Query: 348 SAPPE----------------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 + LL + + G++F D+++L +++I Sbjct: 226 GTNGNALSPIAYNPDGSFRYGMAEVQIDEALLEQIATLTGGKYFRATDNKKLEAIYNEIN 285 Query: 385 D 385 Sbjct: 286 S 286 >gi|47219516|emb|CAG09870.1| unnamed protein product [Tetraodon nigroviridis] Length = 1259 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 57/163 (34%), Gaps = 18/163 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + R+ + Y+ ++ +N L NT T A+ + + + Sbjct: 126 DIGEDKTRVAVVQYSTDTRTEFPLTRYTRRGDLLQAINSLPYKGGNTMTGDAIDYLLQNI 185 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E S K + ITDG++ + +RN G++I+ + + Sbjct: 186 FTEAGGSR-----KSFPKVAMIITDGKSQDP--------VEEHARRLRNIGVEIFVLGI- 231 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 +D LR+ + + V + ++ E KI ++ Sbjct: 232 -KGADEDELREIASTPHSKHMYNVPNFDKIQEVQKKIIREVCS 273 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 64/169 (37%), Gaps = 23/169 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + V+I + Y+ +++N V + TNT AM + ++ Sbjct: 426 DIGPSKVQISLVQYSRDPHTEFALNTHHDINAVVRAVRTFPYRGGSTNTGKAMKYVKDKI 485 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + ++ ITDG++S + +RN ++I++V V Sbjct: 486 FVASRGAR-----QNVPRVMVLITDGKSSDSF--------KDAATNLRNIDVEIFAVGV- 531 Query: 349 APPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L + + F V D ++F +I+ ++ + +RI Sbjct: 532 -KDAVRSELEAIANPPADNHVFEVEDF----DAFQRISKELTQSICLRI 575 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 60/204 (29%), Gaps = 21/204 (10%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 +KY + + L + + S IV Sbjct: 1064 GTKYTVSVFGMFDGGESLPLAGEERTTLSDGPDPTPYSPSDVTCKTKAQADIVLLLDGSW 1123 Query: 256 S---NNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 S N +++ ++++ + A + + +K + Sbjct: 1124 SIGRLNFKTIRTFISRMVEVFDIGPDKVQVGL--ALNYVLQNNFKENV-GMRRNSRKIGV 1180 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QF 367 +TDG++ + + +RN +++Y+V V ++ LR Sbjct: 1181 LVTDGKSQDDVHEK--------AQNLRNENIELYAVGV--KNAEENELRSIASDPDDIHM 1230 Query: 368 FAVNDSRELLESFDKITDKIQEQS 391 + V D LL+ D +T+ + Sbjct: 1231 YNVADFSFLLDIVDNLTNNLCNSV 1254 >gi|60682855|ref|YP_212999.1| hypothetical protein BF3393 [Bacteroides fragilis NCTC 9343] gi|60494289|emb|CAH09084.1| conserved exported hypothetical protein [Bacteroides fragilis NCTC 9343] Length = 610 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 32/306 (10%), Positives = 86/306 (28%), Gaps = 26/306 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + E G+I + N++ + ++ A Y + L P Sbjct: 139 NAVNAEEYGEIQENGFKNVSDAPLSTFSIDVDA-ASYSNMRRFINKGKLPPVDAIRTEEL 197 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + + + + P + + N Sbjct: 198 VNYFSYDYPKPTGSDPVKITMEAGTCPWNADHRLVRIGLKAKEI------PTDNLPASNL 251 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 ++D++ S LVN+++ ++ + Y Sbjct: 252 VFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKD--------KVAIVTYAGNAGVKLEAT 303 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 ++ +++ +++L +T + AY+ S N +I TDG Sbjct: 304 PGSDKQKIREAIDELEASGSTAGGEGIMLAYKIAQKNFISGGNNR--------IILCTDG 355 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + + L ++ E R +G+ + + ++ + G +++ Sbjct: 356 DFNVGVSSDKEL--EKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLAEKGNGNHAYIDNL 413 Query: 374 RELLES 379 +E Sbjct: 414 QEANRV 419 >gi|113475004|ref|YP_721065.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110166052|gb|ABG50592.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 412 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 63/180 (35%), Gaps = 21/180 (11%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ++ + ++A LV +++ R+ +A++ + N+ +K ++NKL Sbjct: 59 LETVKQAAVQLVEKLKEGD--------RLSVVAFDHQAQVIVPNQMINDSASIKGKINKL 110 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T + EL G +TDGE + + + L Sbjct: 111 RASGGTAIDKGLKLGIEELN---------KGRKESISQAFILTDGE----NEHGDNDLCL 157 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 ++ + + + + S+ L + G + + +E F K+ ++I+ Sbjct: 158 KLAKLATDYNITLNSLGFGDDWNQDVLEKIADAGGGNLSYIQQPEQAIEEFSKLFNRIKS 217 >gi|237735881|ref|ZP_04566362.1| predicted protein [Mollicutes bacterium D7] gi|229381626|gb|EEO31717.1| predicted protein [Coprobacillus sp. D7] Length = 965 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 28/251 (11%), Positives = 66/251 (26%), Gaps = 11/251 (4%) Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 S S D N + KK+ +S T + Y ++ + Sbjct: 105 NSTSSNDKKRLTKAKNAAIEFLNNSKISGNKKNRYS-IVTFNYYGTVEQNLTSNLETAKQ 163 Query: 216 SAGNL-VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + ++ + + + + T+ N LS+ +LN Sbjct: 164 AIRDVELGNNSDGGTNIQAGLYKARTVLKNAKSENGIIILLSDGGATGSYKLNNERNNGY 223 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI-----FITDGENSGASAYQNTLNTL 329 + A + + + F D + + N Sbjct: 224 LVNDYSEATATDKALGYSGRYTFGENAINYDSVIKGGRNDFTLDLYLNNSHYSLNNAAAT 283 Query: 330 QICEYM---RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + +G I+++ + L+ + + S +L ++ I ++ Sbjct: 284 LAENEALLAKKSGNTIFTIGYTTGSSVNSFLKNVATQGEGYAY-SSSSDLSGIYENIANE 342 Query: 387 IQEQSVRIAPN 397 I + I N Sbjct: 343 IVTRYETIIKN 353 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/337 (11%), Positives = 85/337 (25%), Gaps = 19/337 (5%) Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 + + K N N ++ KA E I + Sbjct: 539 GYAQTNLSSGNIIELSQNKDAANGYVTLTNSFGHVGFFKATDEKTNYLPSGSETINNYEY 598 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 N+S ++ +++ + + + SM H NN + L P Sbjct: 599 NVSGNVKSTTVKAPQDVVLLLDKSGSMDESMNGSSRLTHLKNNVIKFITKLYEHNPDSRV 658 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 S K+ + + K KN+ GT Sbjct: 659 SVITFAYSADGSITNNNFVKLSDIKSGNETWYTYLTKNNGGIKNIKASGGTQIDLGLYEV 718 Query: 250 NQCTPLSNNLNE---------------VKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + N + N + A++ A ++ + Sbjct: 719 RNQLSSATGENNRSVIVFTDGQPGNKGFNTSYNDYDDNGYRVGAEALNQADFIKFSGNLT 778 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM-RNAGMKIYSVAVSAPPEG 353 N + + + + + N+ + + G I+++ +++ Sbjct: 779 GINNYIESSNGSKYYGHKNDDITKNRSNNNSNDAGNRTNRSGKGLGKTIFTIGLNSNNSS 838 Query: 354 --QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 L + G + N+S + +F+ I I Sbjct: 839 LFDSFLTRLAS-EGHYTKANNSSAMENAFNSIFTSIT 874 >gi|156616288|ref|NP_766515.2| collagen alpha-6(VI) chain isoform 2 [Mus musculus] Length = 1182 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 36/338 (10%), Positives = 98/338 (28%), Gaps = 31/338 (9%) Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 ++K K+ + + QI + + ++ Sbjct: 301 QVGQAYTGAALRKTRKEIFSAQRGSRKNQGVPQIAVLVTHRASEDNVTKAAVNLRREGVT 360 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNN 173 +F G+ + L ++ E+ + L + ++ L + Sbjct: 361 IFTMGIEGANPDELEKIASHPAEQFTSKLGNFSELATHNQTFLKKLRNQITHTVSVFSER 420 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + K + + P + +V A + Sbjct: 421 TETLKSACVDTEEADIYLLIDGSGSTQP------TDFHEMKTFLSEVVGMFNIAPHK--- 471 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 VR+G + Y +N ++ + + TNT A++ + L K Sbjct: 472 --VRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAK 529 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + + ++ +T+G + + L +R ++++++ V Sbjct: 530 KERGSK-----VPCHLVVLTNGMSRDS--------VLGPAHKLREENIRVHAIGV--KEA 574 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q LR+ + + V++ L +++ +I + Sbjct: 575 NQTQLREIAGEEKRVYYVHEFDALRNIRNQVVQEICAE 612 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 41/354 (11%), Positives = 103/354 (29%), Gaps = 34/354 (9%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + +++ ++ + ++A Sbjct: 838 ADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHM 897 Query: 117 NLFL------KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L ++ + + Sbjct: 898 FTEARGSRLHKGVPQVLIVITDGESHDAEKLNTTAKALRDKGILVLAVGIAGANSWELLA 957 Query: 171 NNNMTSNKYLLPPPPK----KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 + Y + S S + S + ++ S + QK Sbjct: 958 MAGSSDKYYFVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQK 1017 Query: 227 A---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTN 276 + VRIG ++ E+ +++ + T+ Sbjct: 1018 MKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTH 1077 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ R++ + + + + ++ +TDG + Q E +R Sbjct: 1078 IGDAL----RKVKYYFQPDMGSRINAGTPQVLLVLTDGRSQD--------EVAQAAEELR 1125 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 1126 HKGVDIYSVGIG--DVDDQELVQITGTAEKKLTVHNFDELKKVKKRIVRNICTS 1177 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + +K Sbjct: 656 DRVQIGVVQFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDK 715 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE +R G+ IYSV + Sbjct: 716 GARP------NVRKFLILITDGEAQD--------IVRDPAIALRKEGVIIYSVGVFGSNV 761 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 762 ---TQLEEISGKPEMVFYVENFDILQHIEDDLVLGICSPR 798 Score = 43.0 bits (99), Expect = 0.079, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 54/162 (33%), Gaps = 17/162 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNE 291 R+ Y+ + N N + + L N + A+ A+R ++ Sbjct: 61 NKYRVALAQYSDALHNEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSA 120 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + V+ + ++ + + + +R G+KI SV V Sbjct: 121 PTNGRDKK--QFPPILVVLAS---------AESEDDVEEAAKALREDGVKIISVGVQK-- 167 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ L+ S F + +R+L F +I + + Sbjct: 168 ASEENLKAMATSQ-FHFNLRTARDLS-VFAPNMTEIIKDVTQ 207 >gi|153802375|ref|ZP_01956961.1| conserved hypothetical protein [Vibrio cholerae MZO-3] gi|124122094|gb|EAY40837.1| conserved hypothetical protein [Vibrio cholerae MZO-3] Length = 318 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 59/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + V ++LN+ L T + A + + Sbjct: 129 RIGLILFADHAYLQ--TPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFID--- 183 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 S ++ +I ++DG N+ L+ L+ + + IY+V V A Sbjct: 184 -------SDAPQRVMILLSDGSNTAGV-----LDPLEAADIAKQYHTTIYTVGVGAGEMV 231 Query: 351 --------------PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + G +F + ++L +D I Sbjct: 232 VKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|315080701|gb|EFT52677.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL078PA1] Length = 320 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 21/222 (9%), Positives = 58/222 (26%), Gaps = 32/222 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A ++ +A + + + R Sbjct: 87 DRATVVVAIDVSRSMVATDVEPSRLSAAKTAAKDFLGDLP----------PRFNVSLVKF 136 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + + V + + L +T ++ + L + + Sbjct: 137 AASAQVVVPPTTDRAAVSTAITNLQVLPSTAIGEGIYSSLNALKLVPDD--PKHPGQKPP 194 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DG + L+ + + +Y++A Sbjct: 195 AAIVLLSDGATNVGRPS------LEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPV 248 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L S G+ F+ +L + + I + + V Sbjct: 249 NHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEKVF 290 >gi|324499923|gb|ADY39979.1| Transmembrane cell adhesion receptor mua-3 [Ascaris suum] Length = 2123 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 63/170 (37%), Gaps = 12/170 (7%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + + R+ + Y+ I +++ V+ ++++ T T A+ H E Sbjct: 479 DISSDRTRVAVVQYSDRIRHEFDLNQYSSIQNVEDAIDRIQYMTGLTRTGAAIEHVRNEA 538 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 +NE+ + S ++ + I ITDG + + + R +++++V V Sbjct: 539 FNERRGARPL--SDKISRVTIVITDGRSQDNVSLP--------AQQARQQHIQLFAVGV- 587 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 L + ++F V ++L K+ + AP + Sbjct: 588 TNHVLDSELETIAGAKDRYFHVTGFKDLNARLRSAIQKVACPEGKPAPPK 637 >gi|313792199|gb|EFS40300.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA1] gi|314984000|gb|EFT28092.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA1] Length = 320 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 21/222 (9%), Positives = 58/222 (26%), Gaps = 32/222 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A ++ +A + + + R Sbjct: 87 DRATVVVAIDVSRSMVATDVEPSRLSAAKTAAKDFLGDLP----------PRFNVSLVKF 136 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + + V + + L +T ++ + L + + Sbjct: 137 AASAQVVVPPTPDRAAVSTAITNLQVLPSTAIGEGIYSSLNALKLVPDD--PKHPGQKPP 194 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DG + L+ + + +Y++A Sbjct: 195 AAIVLLSDGATNVGRPS------LEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPV 248 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L S G+ F+ +L + + I + + V Sbjct: 249 NHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEKVF 290 >gi|297579701|ref|ZP_06941628.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297535347|gb|EFH74181.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 318 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 58/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + V S+LN+ L T + A + + Sbjct: 129 RIGLILFADHAYLQ--TPLTLDRQTVASQLNQAVLKLIGTQTAIGEGIGLATKTFID--- 183 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 S ++ +I ++DG N+ L+ L+ + IY+V V A Sbjct: 184 -------SDAPQRVMILLSDGSNTAGV-----LDPLEAANIAKQYHTTIYTVGVGAGEMV 231 Query: 351 --------------PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + G +F + ++L +D I Sbjct: 232 VKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|332831234|ref|XP_528236.3| PREDICTED: collagen alpha-1(XXII) chain, partial [Pan troglodytes] Length = 695 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 67/195 (34%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ L + Sbjct: 45 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPDRTRVGVVRYSDRPTTAFELGLFGSQE 99 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK+ +L NTNT A+ + ++ + K+ I +TDG + Sbjct: 100 EVKAAARRLAYHGGNTNTGDALRYITARSFS--PRAGGRPRDRAYKQVAILLTDGRSQDL 157 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 G++I++V V ++ L + F V+D + Sbjct: 158 VLDAAAAAHRA--------GIRIFAVGVG--EALKEELEEIASEPKSAHVFHVSDFNAID 207 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 208 KIRGKLRRRLCENVL 222 >gi|289607418|emb|CBI60804.1| unnamed protein product [Sordaria macrospora] Length = 814 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 53/393 (13%), Positives = 98/393 (24%), Gaps = 62/393 (15%) Query: 55 TKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIP 114 + + Q S N I+ T N ++ Sbjct: 244 ATAYFNNKSACNSAYTYDSQASQADANYTGSRTPDWIDATAYFNGKSACNVAYTYDSQVS 303 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN--- 171 N G++ + S + + + +DV + + + Sbjct: 304 QANPNYTGIMGGWWLTSNRCSVIVQSNGNPDGYTYGRRSVDVRPFLASNLKATNVQSPTP 363 Query: 172 ---NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 T++ P KS W+ + K A ++ V+ + Sbjct: 364 IWQITGTNDPSDDRPYEFKSVWNGCIEERKTNSAAINGGSSTTAPSDAYDLDVDLVPYND 423 Query: 229 QE------KKNLSVRIGTIAYNIGIVGNQCTPL--------SNNLNEVKSRLNKLNPYEN 274 + +Y +G P NN + S LN L Sbjct: 424 DTRWRPMWNDVSYYPDWSWSYGVGRQPVAYCPTEAKRLQNYHNNRSGFVSYLNGLVARGG 483 Query: 275 TNTYPAMHHAYRELYNEK--------------ESSHNTIGSTRLKKFVIFITDGENSGAS 320 T M R L + I +KK++IF+TDG+ S Sbjct: 484 TYHDIGMIWGARFLSTTGLFKSATPETNDVNDPDNPAKIRGFSVKKYMIFMTDGDMSPTW 543 Query: 321 AYQNTLNTLQ--------------------------ICEYMRNAGMKIYSVAVSAPPEGQ 354 + + C + G+ I+ +A S Sbjct: 544 SDYSAYGIEYLDGRVMGSPTTDNTALLARHLQRFRMACNAAKAKGIDIWVIAFSTTLTAD 603 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + C Q ++ + L+ F +I KI Sbjct: 604 --MTNCASKPEQAAGLSSNAALIAKFKEIGSKI 634 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 41/364 (11%), Positives = 86/364 (23%), Gaps = 48/364 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + + +D+ +N+ + A DA L+G + T Sbjct: 21 LMTLALIPLVALMGSGLDMTRAYVAQNRFRQACDAGSLAGRRMLA-------GLTLPQAA 73 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 K QG T + P + +Q T L Sbjct: 74 RDEATKYFMFDFPQGYL-----------QSAPYTLTMSVPTAGTLQISSQ---TTVPTTL 119 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY--------------LQ 166 GL +S + + + I V D+S SM Sbjct: 120 MGLFGFDTLPISTTCSATQDFVNT----DIMFVFDLSGSMNCAPGVTGYCGDVEQSGSRM 175 Query: 167 KHNDNNNMTSNKYL------LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE--SAG 218 + + L L + + S + D +++ S Sbjct: 176 GALRSAATSFYDTLETAQSQLAANNLRLRYGFVNYNSTVNVGRILYEKNPDWMVQSWSYQ 235 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 + A N S Y+ + + N N Sbjct: 236 SRTPDWIDATAYFNNKSACNSAYTYDSQASQADANYTGSRTPDWIDATAYFNGKSACNVA 295 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ-ICEYMRN 337 + T + VI ++G G + + +++ + ++ Sbjct: 296 YTYDSQVSQANPNYTGIMGGWWLTSNRCSVIVQSNGNPDGYTYGRRSVDVRPFLASNLKA 355 Query: 338 AGMK 341 ++ Sbjct: 356 TNVQ 359 >gi|289178041|gb|ADC85287.1| Fibronectin-binding protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 2710 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 49/422 (11%), Positives = 114/422 (27%), Gaps = 84/422 (19%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 ++G S T+ + + T Sbjct: 38 AIAGVISAAMIVTVALVAIPAQAAGNMIAD---PTTFTQWEQGIGEPTDPRSTGRVWTDK 94 Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + + ++ + L+ + + + +I S+ + I +VLD Sbjct: 95 SVSTQEVTLKTYDDNHVTVAPKDGSFLVGLSAMS---SAQKLIGVSNVTKPLDIVLVLDT 151 Query: 157 SRSMEDLYLQKH---------NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN 207 S SM D + + + S N + Sbjct: 152 SGSMAWGMDGDDEYAYDPVYAADITTSKRYYVRVSGSMTRVYSSANGWYYDAGGSRHYVT 211 Query: 208 -------------------------RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 ++ L ++ ++ A + + + + Sbjct: 212 PKTSAADSDAAHTQFYSRRRLTTQDTRMYALKQAVNGFIDQTIAANAKVSDPNKKNRIGL 271 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 N + L+++L+ +KS ++ L T M A L N + + Sbjct: 272 VTYASDVNTRSGLTDSLSGLKSTVDDLKASGATRADLGMQTANTVLGNARADA------- 324 Query: 303 RLKKFVIFITDGENSGASAYQNTL--NTLQICEYMRNAGMKIYSVAVSAPPE-------- 352 K VIF TDG+ + ++ ++N + + + + M+ G +YSV + Sbjct: 325 --SKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGASVYSVGIFTGANPDANVSSV 382 Query: 353 ---------GQDLLRKCTD----------------SSGQFFAVNDSRELLESFDKITDKI 387 ++ + +S + A +D+ L F+ I ++ Sbjct: 383 TGKSDIELKSNAFMQGVSSNYPNATTYTNLGAKAPNSNYYLAASDADTLNAVFNTIWSEV 442 Query: 388 QE 389 Sbjct: 443 SS 444 >gi|167757049|ref|ZP_02429176.1| hypothetical protein CLORAM_02598 [Clostridium ramosum DSM 1402] gi|167703224|gb|EDS17803.1| hypothetical protein CLORAM_02598 [Clostridium ramosum DSM 1402] Length = 965 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 28/251 (11%), Positives = 66/251 (26%), Gaps = 11/251 (4%) Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 S S D N + KK+ +S T + Y ++ + Sbjct: 105 NSTSSNDKKRLTKAKNAAIEFLNNSKISGNKKNRYS-IVTFNYYGTVEQNLTSNLETAKQ 163 Query: 216 SAGNL-VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + ++ + + + + T+ N LS+ +LN Sbjct: 164 AIRDVELGNNSDGGTNIQAGLYKARTVLKNAKSENGIIILLSDGGATGSYKLNNERNNGY 223 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI-----FITDGENSGASAYQNTLNTL 329 + A + + + F D + + N Sbjct: 224 LVNDYSEATATDKALGYSGRYTFGENAINYDSVIKGGRNDFTLDLYLNNSHYSLNNAAAT 283 Query: 330 QICEYM---RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + +G I+++ + L+ + + S +L ++ I ++ Sbjct: 284 LAENEALLAKKSGNTIFTIGYTTGSSVNSFLKNVATQGEGYAY-SSSSDLSGIYENIANE 342 Query: 387 IQEQSVRIAPN 397 I + I N Sbjct: 343 IVTRYETIIKN 353 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 38/337 (11%), Positives = 85/337 (25%), Gaps = 19/337 (5%) Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 + + K N N ++ KA E I + Sbjct: 539 GYAQTNLSSGNIIELSQNKDAANGYVTLTNSFGHVGFFKATDEKTNYLPAGSETINNYEY 598 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 N+S ++ +++ + + + SM H NN + L P Sbjct: 599 NVSGNVKSTTVKAPQDVVLLLDKSGSMDESMNGSSRLTHLKNNVIKFITKLYEHNPDSRV 658 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 S K+ + + K KN+ GT Sbjct: 659 SVITFAYSADGSITNNNFVKLSDIKSGNETWYTYLTKNNGGIKNIKASGGTQIDLGLYEV 718 Query: 250 NQCTPLSNNLNE---------------VKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + N + N + A++ A ++ + Sbjct: 719 RNQLSSATGENNRSVIVFTDGQPGNKGFNTSYNDYDDNGYRVGAEALNQADFIKFSGNLT 778 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM-RNAGMKIYSVAVSAPPEG 353 N + + + + + N+ + + G I+++ +++ Sbjct: 779 GINNYIESSNGSKYYGHKNDDITKNRSNNNSNDAGNRTNRSGKGLGKTIFTIGLNSNNSS 838 Query: 354 --QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 L + G + N+S + +F+ I I Sbjct: 839 LFDSFLTRLAS-EGHYTKANNSSAMENAFNSIFTSIT 874 >gi|153214389|ref|ZP_01949360.1| conserved hypothetical protein [Vibrio cholerae 1587] gi|124115338|gb|EAY34158.1| conserved hypothetical protein [Vibrio cholerae 1587] Length = 318 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 58/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + V ++LN+ L T + A + + Sbjct: 129 RIGLILFADHAYLQ--TPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFID--- 183 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 S ++ +I ++DG N+ L+ L+ + IY+V V A Sbjct: 184 -------SDAPQRVMILLSDGSNTAGV-----LDPLEAANIAKQYHTTIYTVGVGAGEMV 231 Query: 351 --------------PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + G +F + ++L +D I Sbjct: 232 VKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|121586746|ref|ZP_01676529.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121728206|ref|ZP_01681240.1| conserved hypothetical protein [Vibrio cholerae V52] gi|147672023|ref|YP_001215942.1| hypothetical protein VC0395_1106 [Vibrio cholerae O395] gi|153816797|ref|ZP_01969464.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|227811796|ref|YP_002811806.1| hypothetical protein VCM66_A0168 [Vibrio cholerae M66-2] gi|229506663|ref|ZP_04396172.1| protein BatA [Vibrio cholerae BX 330286] gi|262167807|ref|ZP_06035508.1| protein BatA [Vibrio cholerae RC27] gi|298500027|ref|ZP_07009833.1| conserved hypothetical protein [Vibrio cholerae MAK 757] gi|121549043|gb|EAX59080.1| conserved hypothetical protein [Vibrio cholerae 2740-80] gi|121629529|gb|EAX61953.1| conserved hypothetical protein [Vibrio cholerae V52] gi|126512600|gb|EAZ75194.1| conserved hypothetical protein [Vibrio cholerae NCTC 8457] gi|146314406|gb|ABQ18946.1| conserved hypothetical protein [Vibrio cholerae O395] gi|227010938|gb|ACP07149.1| conserved hypothetical protein [Vibrio cholerae M66-2] gi|227014797|gb|ACP11006.1| conserved hypothetical protein [Vibrio cholerae O395] gi|229357014|gb|EEO21932.1| protein BatA [Vibrio cholerae BX 330286] gi|262023715|gb|EEY42415.1| protein BatA [Vibrio cholerae RC27] gi|297542008|gb|EFH78059.1| conserved hypothetical protein [Vibrio cholerae MAK 757] Length = 318 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 57/170 (33%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + V ++LN+ L T + A + + Sbjct: 129 RIGLILFADHAYLQ--TPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSN- 185 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 ++ +I ++DG N+ L+ L+ + IY+V V A Sbjct: 186 ---------APQRVMILLSDGSNTAGV-----LDPLEAANIAKQYHTTIYTVGVGAGEMV 231 Query: 351 --------------PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + G +F + ++L +D I Sbjct: 232 VKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|325297739|ref|YP_004257656.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317292|gb|ADY35183.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 332 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 30/301 (9%), Positives = 84/301 (27%), Gaps = 56/301 (18%) Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 + T +++ + + + + ++ ++ + Sbjct: 23 FMRRKKTEPTMQVSTTRMYMNAPQSWKVYLLHAPFLLRIVTFIMIVLVLARPQTTDNWQN 82 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 + S A +++ + A + N Sbjct: 83 TEIEGIDIMLAVDVSTSMLAEDLKPNRLEAAKQVAAQFI-----------NGRPNDNIGL 131 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--------ENTNTYPAMHHAYRELYNEKES 294 PL+ + + + + + T + ++ L Sbjct: 132 TIFAGEAFTQCPLTIDHGVLLNLFGSIKGDIAQRGLIEDGTAIGMGLANSISRL------ 185 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP--- 351 S K +I +TDG N+ L E + G+++Y++ V Sbjct: 186 ----KDSKAKSKVIILLTDGSNNRGDIS-----PLTAAEIAKQFGIRVYTIGVGTNGTAP 236 Query: 352 ------------------EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + Q L++ + ++G +F + +L E +++I DK+++ + Sbjct: 237 YPMPTYAGVQYVNVPVEIDEQTLIQIASTTNGNYFRATSNSKLKEVYEEI-DKLEKTKLN 295 Query: 394 I 394 + Sbjct: 296 V 296 >gi|296139788|ref|YP_003647031.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] gi|296027922|gb|ADG78692.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] Length = 327 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 21/149 (14%), Positives = 56/149 (37%), Gaps = 16/149 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + + ++KL + T T ++ + + + N ++ + +I +DG+ Sbjct: 148 TPDRTLALNAVDKLELAQRTATGEGIYTSIQSIKNIRDVLGGEDNA--PPARIILESDGK 205 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------GQDLLRKCTD 362 + + + + G+ I +++ L++ + Sbjct: 206 QTVPTDLDDPRGGFTAARKAKEEGIPISTISFGTTSGSVNIGGQNIPVPVDDASLKRIAE 265 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQ 390 S GQFFA + +L E++ + D+I + Sbjct: 266 LSGGQFFAASSLNDLNEAYGSLRDEIGWE 294 >gi|329850248|ref|ZP_08265093.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] gi|328840563|gb|EGF90134.1| von Willebrand factor type A [Asticcacaulis biprosthecum C19] Length = 575 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 48/398 (12%), Positives = 106/398 (26%), Gaps = 36/398 (9%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQK 88 ++S+LD AVL+ K + + + + + Sbjct: 180 LKSSLD-AVLASMLDSTGKNYAKTKVALVPFDTQVSLSNVAGMVDYAGDFSTVTPTYSCS 238 Query: 89 AQIN-----ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSS 143 + I+++ + A + T + G + S + RS Sbjct: 239 GYSSAQCQVISENASAMCNGNATCLSNNRNYTRSWTSNGNTYFGVFATSYYQSSNTYRSY 298 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 N + + N NN + + +S+ Y+ Sbjct: 299 GNTYYYTYIAWRQVVYRVNSSTLTLNSTNNGGDYYTYQAYYNQPNNYSRYYGAVTYSTPT 358 Query: 204 APANR----------------------KIDVLIESAGNLVNSIQKAIQEKKNLSVRI-GT 240 A + S + Sbjct: 359 AGGYNSTSTTIIKDNTTITANSDLLGVGTSNWTGCVIDRTQSYDVTSDAPVAGTPATLYP 418 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE-SSHNTI 299 A L+ ++ ++ ++ P NTN + L S Sbjct: 419 AAKCATNTLLPVMALTQDIAAARTYAARMAPAGNTNVTIGVQWGMEVLSPTAPFSEGGAF 478 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQ----ICEYMRNAGMKIYSVAVSAPPEGQD 355 + K++I +TDG N+ N C +N G+ +++V V Sbjct: 479 TDKAVLKYMIVLTDGINTQNRWTTNNSQINARLALACTNAKNLGITVFTVRVEQG--DST 536 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L+ C + ++ ++++ +L + KI I++ + Sbjct: 537 TLQNCASQTAYYYNLSNADQLPATMSKIMKSIRKVRLT 574 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 40/337 (11%), Positives = 94/337 (27%), Gaps = 26/337 (7%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A + + +D A+I R ++Q A+DA ++ + T + Sbjct: 22 IVAFSVIPIVAAVGGGLDFANIQAARAKLQDAVDAGAIAATIDPTATPTQTTREAVAKKA 81 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 KQ A ++ T NN + + A +L Sbjct: 82 FCGNIKQSGGLQNSFCNTTTLDTLGTASATLS-TATSNNIMTVTYSATA-----HVPTYL 135 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 GL+ ++ + + S+ +A VLD + SM + ++ + Sbjct: 136 LGLVGIDTVDIDAVAKSGVSTSTAEVAF----VLDNTGSMSSNNKMTYLKSSLDAVLASM 191 Query: 181 LPP---PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 L K+ + ++ + + AG+ + + Sbjct: 192 LDSTGKNYAKTKVALVPFDTQVSLSNVAGMVDY------AGDFSTVTPTYSCSGYSSAQC 245 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY---PAMHHAYRELYNEKES 294 + T LSNN ++ NT + + + + + Sbjct: 246 QVISENASAMCNGNATCLSNN----RNYTRSWTSNGNTYFGVFATSYYQSSNTYRSYGNT 301 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + T + R + + + + + + Sbjct: 302 YYYTYIAWRQVVYRVNSSTLTLNSTNNGGDYYTYQAY 338 >gi|224072363|ref|XP_002303700.1| predicted protein [Populus trichocarpa] gi|222841132|gb|EEE78679.1| predicted protein [Populus trichocarpa] Length = 587 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 30/281 (10%), Positives = 79/281 (28%), Gaps = 26/281 (9%) Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 Y +P + + + + + R + VS S Sbjct: 79 YNVPPVQIEAEHFSDDEVLSDVSPDQSLSSRPHAITVKTFTEYPAVSASESFSNFGVLVR 138 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + L K+ +L + ++ ++ + Sbjct: 139 ILAPPLDNTL-----PHHRARAPIDVVTVLDVSGSMASKLILLKRAVNFIIQNLGPSD-- 191 Query: 231 KKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 R+ + ++ + + S ++ ++ TN + + L Sbjct: 192 ------RLSIVTFSSSARRMLPLRRMSGSGREDATSVVDSISAIGGTNIVAGLKKGVQVL 245 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ + +I ++DG ++ + QN L+ L+ E + +Y+ Sbjct: 246 EERRQHNSVAT--------IILLSDGCDTQSHNAQNRLDYLK--EESKQPTFPVYTFGFG 295 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK-ITDKIQ 388 + + + S G F + L ++F + I I Sbjct: 296 SDHDSAAMHAISDASRGTFSFIESINILQDAFARCIGGLIS 336 >gi|257058175|ref|YP_003136063.1| von Willebrand factor type A [Cyanothece sp. PCC 8802] gi|256588341|gb|ACU99227.1| von Willebrand factor type A [Cyanothece sp. PCC 8802] Length = 418 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 58/184 (31%), Gaps = 13/184 (7%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + A V + + R+ I +N +L VK++ Sbjct: 47 ILDRSGSMRAQAMETVKEAANYLVDGLGPDDRLSVITFNHHAEVILPNQSVEDLQGVKNK 106 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N+L T M +E KE+ + +TDGE + + + Sbjct: 107 INRLTASGGTCIDEGMKLGIKEAALGKENR---------VSQIFLLTDGE----NEHGDN 153 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 L++ + + + ++ + L + + G + + L F ++ Sbjct: 154 ERCLKLAKVAAEYNITLNTLGFGSNWNQDILEQIADSAGGMLCYIEHPEQALTEFSRLFT 213 Query: 386 KIQE 389 + Q Sbjct: 214 RAQS 217 >gi|218245149|ref|YP_002370520.1| von Willebrand factor type A [Cyanothece sp. PCC 8801] gi|218165627|gb|ACK64364.1| von Willebrand factor type A [Cyanothece sp. PCC 8801] Length = 418 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 58/184 (31%), Gaps = 13/184 (7%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + A V + + R+ I +N +L VK++ Sbjct: 47 ILDRSGSMRAQAMETVKEAANYLVDGLGPDDRLSVITFNHHAEVILPNQSVEDLQGVKNK 106 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N+L T M +E KE+ + +TDGE + + + Sbjct: 107 INRLTASGGTCIDEGMKLGIKEAALGKENR---------VSQIFLLTDGE----NEHGDN 153 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 L++ + + + ++ + L + + G + + L F ++ Sbjct: 154 ERCLKLAKVAAEYNITLNTLGFGSNWNQDILEQIADSAGGMLCYIEHPEQALTEFSRLFT 213 Query: 386 KIQE 389 + Q Sbjct: 214 RAQS 217 >gi|302346571|ref|YP_003814869.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] gi|302150280|gb|ADK96541.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] Length = 318 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 54/156 (34%), Gaps = 31/156 (19%) Query: 254 PLSNNLNEVKSRL----NKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P++ + + + L L T + ++ L S Sbjct: 143 PMTLDHAALLNLLHNVRTDLVTNGLMQDGTAIGLGLANSVSRL----------KDSKAKS 192 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS------APPEGQDLLRK 359 K VI +TDG N+ S + + G+++Y++ + L+ Sbjct: 193 KVVILLTDGSNNVGSIS-----PMTAATIAKKFGIRVYTIGLGRETGEDIGAIDYKTLQD 247 Query: 360 CT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 ++G+F+ EL + + I DK+++ + I Sbjct: 248 IAVLTNGEFYRAQSQAELSKIYQDI-DKLEKTKMNI 282 >gi|229526203|ref|ZP_04415607.1| protein BatA [Vibrio cholerae bv. albensis VL426] gi|229336361|gb|EEO01379.1| protein BatA [Vibrio cholerae bv. albensis VL426] Length = 318 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 58/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + V ++LN+ L T + A + + Sbjct: 129 RIGLILFADHAYLQ--TPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSN- 185 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 ++ +I ++DG N+ L+ L+ + + IY+V V A Sbjct: 186 ---------APQRVMILLSDGSNTAGV-----LDPLEAADIAKQYHTTIYTVGVGAGEMV 231 Query: 351 --------------PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + G +F + ++L +D I Sbjct: 232 VKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|1846005|gb|AAC51244.1| collagen type XII alpha-1 [Homo sapiens] Length = 3063 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRNVGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 52/165 (31%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+I Y+ + + + L NT T A++ ++ Sbjct: 1230 DIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ- 1288 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1289 ----NFRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI- 1335 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 1336 -KNADEVELKMIATDPDDTHDYNVADFESLSRIVDDLTINLCNSV 1379 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|307354884|ref|YP_003895935.1| hypothetical protein Mpet_2754 [Methanoplanus petrolearius DSM 11571] gi|307158117|gb|ADN37497.1| conserved hypothetical protein [Methanoplanus petrolearius DSM 11571] Length = 316 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 35/289 (12%), Positives = 83/289 (28%), Gaps = 49/289 (16%) Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP 185 S + ++ + DVS+S L L P Sbjct: 20 WYYIRNSAKKRKQEAMVFSRVSFLKSALGDVSKSKRPKILVILILAAVGFIFIGLADPHI 79 Query: 186 ------KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 + S A + +++ +A L+N + Sbjct: 80 PLEQTKEGVNVVLVIDDSGSMQATDYSPNRLEATKSAAEELINDLDPKDYVG-------- 131 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNT 298 + + G LS + + V L + T + + Sbjct: 132 IVVFESGASTASY--LSPDKDSVIENLENIMEKDGATAIGDGLSLGINMADS-------- 181 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------ 352 KK VI ++DG N+ + ++ +++ ++++++ + + Sbjct: 182 --IPNRKKVVILLSDGVNNAGVISPDE-----AIQFAKDSDIQVFTIGMGSEQPVVMGYD 234 Query: 353 ----------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ D + G++F D + L E + I +I+ + Sbjct: 235 WFGNPQYAELDEATLKEIADETGGKYFKSVDDQTLNEIYSNINSEIKRE 283 >gi|114608140|ref|XP_001142912.1| PREDICTED: collagen alpha-1(XII) chain isoform 2 [Pan troglodytes] Length = 3063 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRNVGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 52/165 (31%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+I Y+ + + + L NT T A++ ++ Sbjct: 1230 DIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ- 1288 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1289 ----NFRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI- 1335 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 1336 -KNADEVELKMIATDPDDTHAYNVADFESLSRIVDDLTINLCNSV 1379 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|228991799|ref|ZP_04151737.1| D-amino acid dehydrogenase, large subunit [Bacillus pseudomycoides DSM 12442] gi|228767939|gb|EEM16564.1| D-amino acid dehydrogenase, large subunit [Bacillus pseudomycoides DSM 12442] Length = 453 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 70/196 (35%), Gaps = 23/196 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG----NQCTPLS-NNLN 260 K+D+ E+ V+ + +A+ + G+ L N Sbjct: 175 GKMKMDIAKEAIQQFVSDLPEAVNVSLRVYGHKGSNDEKDKTASCGAIENVYTLQKYNQT 234 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++ L+ P T A+ + + K + K + ++DG + Sbjct: 235 TLRQSLDGFQPVGWTPLAEAIKRSTETFQSAKAND---------KNIMYVVSDGVETCGG 285 Query: 321 AYQNTLNTLQICEYMRNAGMK--IYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 N ++ + + N+ +K + + E + L++ + S G++ N+++EL Sbjct: 286 ------NPVEEAQKVSNSNIKPIMNIIGFQVDHEAEKQLKEIAEVSKGKYVLANNAKELQ 339 Query: 378 ESFDKITDKIQEQSVR 393 + F + I + ++ Sbjct: 340 DQFKETGKDITSRRLK 355 >gi|93141047|ref|NP_004361.3| collagen alpha-1(XII) chain long isoform precursor [Homo sapiens] gi|146345397|sp|Q99715|COCA1_HUMAN RecName: Full=Collagen alpha-1(XII) chain; Flags: Precursor gi|55662663|emb|CAH71310.1| collagen, type XII, alpha 1 [Homo sapiens] gi|56203512|emb|CAI19898.1| collagen, type XII, alpha 1 [Homo sapiens] gi|56203521|emb|CAI19908.1| collagen, type XII, alpha 1 [Homo sapiens] Length = 3063 Score = 77.6 bits (189), Expect = 3e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRNVGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 52/165 (31%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+I Y+ + + + L NT T A++ ++ Sbjct: 1230 DIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ- 1288 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1289 ----NFRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI- 1335 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 1336 -KNADEVELKMIATDPDDTHAYNVADFESLSRIVDDLTINLCNSV 1379 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|194367004|ref|YP_002029614.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] gi|194349808|gb|ACF52931.1| von Willebrand factor type A [Stenotrophomonas maltophilia R551-3] Length = 334 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 66/186 (35%), Gaps = 34/186 (18%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENT 275 + + + + + + + G TPL+ +L V+ +L T Sbjct: 124 DRLTAAKAVLADFLDRRAGDRIGLLVFGDRAYTLTPLTADLASVRDQLRDSVVGLAGRET 183 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 A+ A + L ++ E ++ +I +TDG ++ L L+ E Sbjct: 184 AIGDAIGLAVKRLRSQPEG----------QRVLILLTDGVSNAGV-----LEPLRAAEVA 228 Query: 336 RNAGMKIYSVAVSAPPE---------------GQDLLRKCTD-SSGQFFAVNDSRELLES 379 R G++I++VA + L+K + GQFF D+ +L Sbjct: 229 RAEGVRIHTVAFGGDGSMRLFGIPISADQDPVDEATLKKIATMTGGQFFRARDTAQLAGI 288 Query: 380 FDKITD 385 + ++ Sbjct: 289 YAELDR 294 >gi|119569135|gb|EAW48750.1| collagen, type XII, alpha 1, isoform CRA_c [Homo sapiens] Length = 3063 Score = 77.6 bits (189), Expect = 4e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRNVGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 52/165 (31%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+I Y+ + + + L NT T A++ ++ Sbjct: 1230 DIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ- 1288 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1289 ----NFRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI- 1335 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 1336 -KNADEVELKMIATDPDDTHAYNVADFESLSRIVDDLTINLCNSV 1379 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|254228714|ref|ZP_04922137.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Vibrio sp. Ex25] gi|262396564|ref|YP_003288417.1| protein BatA [Vibrio sp. Ex25] gi|151938661|gb|EDN57496.1| IMP dehydrogenase/GMP reductase:von Willebrand factor, type A [Vibrio sp. Ex25] gi|262340158|gb|ACY53952.1| protein BatA [Vibrio sp. Ex25] Length = 334 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 66/205 (32%), Gaps = 47/205 (22%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 ++ + + + V + R+G + + TPL+ + Sbjct: 116 LNGEYIDRLTAVKKVLSDFVAKRK---------GDRLGVVLFGDHAYLQ--TPLTADRKT 164 Query: 262 VKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V ++N+ + T + + + S ++ +I ++DG N+ Sbjct: 165 VMQQINQTVIGLVGQRTAIGDGIGLGTKTFVD----------SDAPQRVMILLSDGSNTA 214 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------EGQDLLRKC 360 L L+ E + IY+V V A + Q L + Sbjct: 215 GV-----LEPLEAAEIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAADLDEQTLTKVA 269 Query: 361 TDSSGQFFAVNDSRELLESFDKITD 385 + GQ+F D+ +L + +D I Sbjct: 270 EMTGGQYFRARDTDQLEKIYDTINQ 294 >gi|294673502|ref|YP_003574118.1| BatA protein [Prevotella ruminicola 23] gi|294472594|gb|ADE81983.1| putative BatA protein [Prevotella ruminicola 23] Length = 332 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 57/180 (31%), Gaps = 45/180 (25%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPY--------ENTNTYPAMHHAYRELYNEKESSHNTIG 300 P++ + + + L + + T + +A L Sbjct: 138 SFTQCPMTTDHASLLNLLQNVRTDIAARGLIEDGTAIGMGLANAVSRL----------KD 187 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---------- 350 S K VI +TDG N+ +T E ++ G+++Y++ V Sbjct: 188 SKAKSKVVILLTDGSNNRGDISPST-----AAEIAKSLGIRVYTIGVGTNKVAPYPMPVA 242 Query: 351 ----------PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD-KIQEQSVRIAPNR 398 L + + G F+ ++ EL + + +I + + +V+ R Sbjct: 243 GGVQYVNVPVEIDTKTLSEIASITEGDFYRATNTNELRKIYKEIDQLEKSKLNVKTFSKR 302 >gi|284163331|ref|YP_003401610.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284012986|gb|ADB58937.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 1446 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 37/228 (16%), Positives = 78/228 (34%), Gaps = 22/228 (9%) Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 S ++ + ++ + + + + N + + S Sbjct: 496 SFFSVFRIEEWEDETSDTITLDGNETDGEIGNGSGIETADFVF----VNDESGSMSGSPT 551 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 + K S R G + Y G +Q PL+ + + V S L +L+ TNT Sbjct: 552 HYAELAGKRFVGALTDSERAGRVGYASGANLDQ--PLTTDHDAVNSSLERLSASGGTNTR 609 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 + L E G +I ++DG+ + + L + E A Sbjct: 610 AGLRVGLNHLEEE--------GWENRSAVMILLSDGK--------SGSDPLPVAEDAAEA 653 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 G++I +V + +L + G F+ V +L ++F+++ + Sbjct: 654 GVEISTVGLGNNINENELREIAAITGGDFYHVEREEDLPDTFERVAEN 701 >gi|118088945|ref|XP_419902.2| PREDICTED: similar to collagen XXI [Gallus gallus] Length = 964 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 61/176 (34%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + +++G + Y+ V + + + ++ NT T A+ A Sbjct: 71 TRNFDIGPKFIQVGVVQYSDYPVLEIPLGTHESTENLIKEMESIHYLGGNTKTGRAIQFA 130 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 Y L+ S L K + +TDG++ + R + +++ Sbjct: 131 YDHLF--------AKSSRFLTKIAVVLTDGKSQD--------EVKDVAAEARKNKITLFA 174 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + +D L+ + F V D + + I K+ E+SV P R Sbjct: 175 IGVGSEI-EEDELKAIANKPSSTYVFYVEDYIAISRIKEVIKQKLCEESV--CPTR 227 >gi|26344185|dbj|BAC35749.1| unnamed protein product [Mus musculus] Length = 1182 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 36/338 (10%), Positives = 98/338 (28%), Gaps = 31/338 (9%) Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 ++K K+ + + QI + + ++ Sbjct: 301 QVGQAYTGAALRKTRKEIFSAQRGSRKNQGVPQIAVLVTHRASEDNVTKAAVNLRREGVT 360 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNN 173 +F G+ + L ++ E+ + L + ++ L + Sbjct: 361 IFTMGIEGANPDELEKIASHPAEQFTSKLGNFSELATHNQTFLKKLRNQITHTVSVFSER 420 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + K + + P + +V A + Sbjct: 421 TETLKSACVDTEEADIYLLIDGSGSTQP------TDFHEMKTFLSEVVGMFNIAPHK--- 471 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 VR+G + Y +N ++ + + TNT A++ + L K Sbjct: 472 --VRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAK 529 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + + ++ +T+G + + L +R ++++++ V Sbjct: 530 KERGSK-----VPCHLVVLTNGMSRDS--------VLGPAHKLREENIRVHAIGV--KEA 574 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q LR+ + + V++ L +++ +I + Sbjct: 575 NQTQLREIAGEEKRVYYVHEFDALRNIRNQVVQEICAE 612 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 41/354 (11%), Positives = 103/354 (29%), Gaps = 34/354 (9%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + +++ ++ + ++A Sbjct: 838 ADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHM 897 Query: 117 NLFL------KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L ++ + + Sbjct: 898 FTEARGSRLHKGVPQVLIVITDGESNDAEKLNTTAKALRDKGILVLAVGIAGANSWELLA 957 Query: 171 NNNMTSNKYLLPPPPK----KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 + Y + S S + S + ++ S + QK Sbjct: 958 MAGSSDKYYFVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQK 1017 Query: 227 A---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTN 276 + VRIG ++ E+ +++ + T+ Sbjct: 1018 MKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTH 1077 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ R++ + + + + ++ +TDG + Q E +R Sbjct: 1078 IGDAL----RKVKYYFQPDMGSRINAGTPQVLLVLTDGRSQD--------EVAQAAEELR 1125 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 1126 HKGVDIYSVGIG--DVDDQELVQITGTAEKKLTVHNFDELKKVKKRIVRNICTS 1177 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + +K Sbjct: 656 DRVQIGVVQFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDK 715 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE +R G+ IYSV + Sbjct: 716 GARP------NVRKFLILITDGEAQD--------IVRDPAIALRKEGVIIYSVGVFGSNV 761 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 762 ---TQLEEISGKPEMVFYVENFDILQHIEDDLVLGICSPR 798 Score = 43.0 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 54/162 (33%), Gaps = 17/162 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNE 291 R+ Y+ + N N + + L N + A+ A+R ++ Sbjct: 61 NKYRVALAQYSDALHNEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSA 120 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + V+ + ++ + + + +R G+KI SV V Sbjct: 121 PTNGRDKK--QFPPILVVLAS---------AESEDDVEEAAKALREDGVKIISVGVQK-- 167 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ L+ S F + +R+L F +I + + Sbjct: 168 ASEENLKAMATSQ-FHFNLRTARDLS-VFAPNMTEIIKDVTQ 207 >gi|153830331|ref|ZP_01982998.1| conserved hypothetical protein [Vibrio cholerae 623-39] gi|148874174|gb|EDL72309.1| conserved hypothetical protein [Vibrio cholerae 623-39] Length = 318 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 57/170 (33%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + V ++LN+ L T + A + + Sbjct: 129 RIGLILFADHAYLQ--TPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSN- 185 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 ++ +I ++DG N+ L+ L+ + IY+V V A Sbjct: 186 ---------APQRVMILLSDGSNTAGV-----LDPLEAANIAKQYHTTIYTVGVGAGEMV 231 Query: 351 --------------PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + G +F + ++L +D I Sbjct: 232 VKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|111020122|ref|YP_703094.1| hypothetical protein RHA1_ro03133 [Rhodococcus jostii RHA1] gi|110819652|gb|ABG94936.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 56/160 (35%), Gaps = 17/160 (10%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 G +P++ + + L+ L E T T A+ A + + + GST Sbjct: 138 GTASMLVSPIT-DHTATDNALDHLQLAERTATGEAIFTALQAIDT--LAGVLGGGSTPPP 194 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------------- 351 ++ +DG+ + + + + G+ I +++ Sbjct: 195 ARIVLESDGKQTVPADLNDPRGAFTAARLAKEQGVPISTISFGTTHGAIDLNGSHIPVPV 254 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + + L R S G FF + EL S+ + +I ++ Sbjct: 255 DDESLRRIAELSGGSFFTATSADELQASYQNLQQQIGYET 294 >gi|194215132|ref|XP_001499245.2| PREDICTED: similar to Collagen alpha-1(XXII) chain [Equus caballus] Length = 1632 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 69/195 (35%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ + Sbjct: 54 TSSSVGKEDFEKVQQWVANLVDTFE-----VGADHTRVGVVRYSDQPTTAFELGHFRSRE 108 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 VK+ +L NTNT A+ + R ++ + G K+ I +TDG + Sbjct: 109 AVKAAARRLAYHGGNTNTGDALRYITRHSFS--PQAGGRPGDRAFKQVAILLTDGRSQD- 165 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 L AG++I++V V ++ L + F V+D + Sbjct: 166 -------LVLDAAATAHRAGIRIFAVGVG--EALKEELEEIASEPKSAHIFHVSDFNAID 216 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 217 KIRGKLRRRLCENVL 231 >gi|301780200|ref|XP_002925521.1| PREDICTED: collagen alpha-1(XII) chain-like [Ailuropoda melanoleuca] Length = 3172 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLI 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARAGF-----PKVAIIITDGKSQD--------EVEIPARELRNIGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 54/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ + Sbjct: 1235 RVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKT-- 1292 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1293 ---QAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 1339 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 +D L+ + V D L + D +T + Sbjct: 1340 EDELKMIATDPDDTHAYNVADFESLSKIVDDLTINLCNSV 1379 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFV 532 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + S + K +I ITDG++S A +RN+ ++I++V V Sbjct: 533 PSKGSRG-----NVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|149694147|ref|XP_001503972.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Equus caballus] Length = 495 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 42/378 (11%), Positives = 112/378 (29%), Gaps = 52/378 (13%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKN 98 + S + + ST + E G A+ I+ I Sbjct: 95 RAHGSKAALLQAVRRIQPLSTGTMTGLAIQFAITRAFSEGEGGRARSPDISKVVIVVTDG 154 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 P + + A+ LF G+ L ++ + + S ++ +S+ Sbjct: 155 RPQDSVRDVSARSRASGIELFAIGVGRVDKATLREIASEPQD-EHVDYVESYSVIEKLSK 213 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF-------WSKNTTKSKYAPAPAPANR--- 208 ++ + + + + S+ ++ N+ Sbjct: 214 KFQEAFCVVSDLCATGDHDCEQVCVSSPASYTCACHEGFTLNSDGKTCNVCSGGGGSSAT 273 Query: 209 ---------------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 +++ + +V+++ + + ++G + Y+ + Sbjct: 274 DLVFLIDGSKSVRPENFELVKKFINQIVDTL-----DVSDKLAQVGLVQYSSSVRQEFPL 328 Query: 254 PLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + ++K+ + ++ + T T A+ + + + +K I T Sbjct: 329 GRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFT 383 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAV 370 DG + + ++ G K+++V V +D LR+ +F Sbjct: 384 DGRSQDYIND--------AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYT 433 Query: 371 NDSRELLESFDKITDKIQ 388 D + + + K+ KI Sbjct: 434 ADFKTINQIGKKLQKKIC 451 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A Sbjct: 71 DVGPNATRVGLVNYASAVKQEFPLRAHGSKAALLQAVRRIQPLSTGTMTGLAIQFAITRA 130 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E S + K VI +TDG + + + R +G++++++ V Sbjct: 131 FSEGEGGRAR--SPDISKVVIVVTDGRPQDSVRDVSARS--------RASGIELFAIGVG 180 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L + F + Sbjct: 181 R--VDKATLREIASEPQDEHVDYVESYSVIEKLSKKFQE 217 >gi|167624593|ref|YP_001674887.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] gi|167354615|gb|ABZ77228.1| von Willebrand factor type A [Shewanella halifaxensis HAW-EB4] Length = 345 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 30/170 (17%), Positives = 59/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKE 293 RIG I + +PL+ + V L ++ T A+ A + ++ Sbjct: 129 RIGLILFADHAYLQ--SPLTQDRRTVAQYLKEAQIGLVGKQTAIGEAIALAVKRFDKVEQ 186 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 S+ + +I +TDG N+ + Q + G+ IY++ V A Sbjct: 187 SN----------RVLILLTDGSNNAGAIS-----PEQATQIAAKRGITIYTIGVGADVME 231 Query: 351 --------------PEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + L++ + GQ+F ++ EL + + I Sbjct: 232 RRTLFGKERVNPSMDLDESQLQEIAKTTGGQYFRARNTEELEQIYQVIDT 281 >gi|183601829|ref|ZP_02963198.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|241190320|ref|YP_002967714.1| hypothetical protein Balac_0261 [Bifidobacterium animalis subsp. lactis Bl-04] gi|241195726|ref|YP_002969281.1| hypothetical protein Balat_0261 [Bifidobacterium animalis subsp. lactis DSM 10140] gi|183218714|gb|EDT89356.1| hypothetical protein BIFLAC_06106 [Bifidobacterium animalis subsp. lactis HN019] gi|240248712|gb|ACS45652.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis Bl-04] gi|240250280|gb|ACS47219.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis DSM 10140] gi|295793307|gb|ADG32842.1| hypothetical fibronectin binding protein [Bifidobacterium animalis subsp. lactis V9] Length = 2696 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 49/422 (11%), Positives = 114/422 (27%), Gaps = 84/422 (19%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 ++G S T+ + + T Sbjct: 24 AIAGVISAAMIVTVALVAIPAQAAGNMIAD---PTTFTQWEQGIGEPTDPRSTGRVWTDK 80 Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + + ++ + L+ + + + +I S+ + I +VLD Sbjct: 81 SVSTQEVTLKTYDDNHVTVAPKDGSFLVGLSAMS---SAQKLIGVSNVTKPLDIVLVLDT 137 Query: 157 SRSMEDLYLQKH---------NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN 207 S SM D + + + S N + Sbjct: 138 SGSMAWGMDGDDEYAYDPVYAADITTSKRYYVRVSGSMTRVYSSANGWYYDAGGSRHYVT 197 Query: 208 -------------------------RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 ++ L ++ ++ A + + + + Sbjct: 198 PKTSAADSDAAHTQFYSRRRLTTQDTRMYALKQAVNGFIDQTIAANAKVSDPNKKNRIGL 257 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 N + L+++L+ +KS ++ L T M A L N + + Sbjct: 258 VTYASDVNTRSGLTDSLSGLKSTVDDLKASGATRADLGMQTANTVLGNARADA------- 310 Query: 303 RLKKFVIFITDGENSGASAYQNTL--NTLQICEYMRNAGMKIYSVAVSAPPE-------- 352 K VIF TDG+ + ++ ++N + + + + M+ G +YSV + Sbjct: 311 --SKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGASVYSVGIFTGANPDANVSSV 368 Query: 353 ---------GQDLLRKCTD----------------SSGQFFAVNDSRELLESFDKITDKI 387 ++ + +S + A +D+ L F+ I ++ Sbjct: 369 TGKSDIELKSNAFMQGVSSNYPNATTYTNLGAKAPNSNYYLAASDADTLNAVFNTIWSEV 428 Query: 388 QE 389 Sbjct: 429 SS 430 >gi|255009407|ref|ZP_05281533.1| aerotolerance-related membrane protein [Bacteroides fragilis 3_1_12] gi|313147166|ref|ZP_07809359.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] gi|313135933|gb|EFR53293.1| aerotolerance protein BatA [Bacteroides fragilis 3_1_12] Length = 327 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 62/205 (30%), Gaps = 40/205 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + E N PL+ + + + + + T Sbjct: 108 NRLEAAKDVAAEFINGRPNDNIGITLFAGESFTQCPLTVDHAVLLNLFQGIKCDIIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A L S K +I +TDG N+ L E Sbjct: 168 AVGMGIANAVTRL----------KDSKAKSKVIILLTDGTNNKGDIS-----PLTAAEIA 212 Query: 336 RNAGMKIYSVAVSAPP--------------------EGQDLLRKCTD-SSGQFFAVNDSR 374 ++ G+++Y++ V + L + + G +F + Sbjct: 213 KSFGIRVYTIGVGTNGMAPYPVPVGGTVQYINTPVEIDEKTLTQIAGITDGNYFRATSNS 272 Query: 375 ELLESFDKITD-KIQEQSVRIAPNR 398 +L E +++I + + +V+ R Sbjct: 273 KLKEVYEEIDKLEKTKLNVKEYSKR 297 >gi|219682744|ref|YP_002469127.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] gi|219620394|gb|ACL28551.1| Rhs family protein [Bifidobacterium animalis subsp. lactis AD011] Length = 2582 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 49/422 (11%), Positives = 114/422 (27%), Gaps = 84/422 (19%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 ++G S T+ + + T Sbjct: 38 AIAGVISAAMIVTVALVAIPAQAAGNMIAD---PTTFTQWEQGIGEPTDPRSTGRVWTDK 94 Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + + ++ + L+ + + + +I S+ + I +VLD Sbjct: 95 SVSTQEVTLKTYDDNHVTVAPKDGSFLVGLSAMS---SAQKLIGVSNVTKPLDIVLVLDT 151 Query: 157 SRSMEDLYLQKH---------NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN 207 S SM D + + + S N + Sbjct: 152 SGSMAWGMDGDDEYAYDPVYAADITTSKRYYVRVSGSMTRVYSSANGWYYDAGGSRHYVT 211 Query: 208 -------------------------RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 ++ L ++ ++ A + + + + Sbjct: 212 PKTSAADSDAAHTQFYSRRRLTTQDTRMYALKQAVNGFIDQTIAANAKVSDPNKKNRIGL 271 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 N + L+++L+ +KS ++ L T M A L N + + Sbjct: 272 VTYASDVNTRSGLTDSLSGLKSTVDDLKASGATRADLGMQTANTVLGNARADA------- 324 Query: 303 RLKKFVIFITDGENSGASAYQNTL--NTLQICEYMRNAGMKIYSVAVSAPPE-------- 352 K VIF TDG+ + ++ ++N + + + + M+ G +YSV + Sbjct: 325 --SKIVIFFTDGQPTKSNGFENDVANDAIGAAKTMKTNGASVYSVGIFTGANPDANVSSV 382 Query: 353 ---------GQDLLRKCTD----------------SSGQFFAVNDSRELLESFDKITDKI 387 ++ + +S + A +D+ L F+ I ++ Sbjct: 383 TGKSDIELKSNAFMQGVSSNYPNATTYTNLGAKAPNSNYYLAASDADTLNAVFNTIWSEV 442 Query: 388 QE 389 Sbjct: 443 SS 444 >gi|297683708|ref|XP_002819511.1| PREDICTED: collagen alpha-1(XXII) chain-like [Pongo abelii] Length = 259 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 67/193 (34%), Gaps = 20/193 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ L + Sbjct: 20 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPDRTRVGVVRYSDRPTTAFELGLFGSQE 74 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK+ +L NTNT A+ + ++ + G K+ I +TDG + Sbjct: 75 EVKAAARRLAYHGGNTNTGDALRYITARSFS--PRTGGRPGDRAYKQVAILLTDGRSQDL 132 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 G++I++V V ++ L + F V+D + Sbjct: 133 VLDAAAAAHRA--------GIRIFAVGVG--EALKEELEEIASEPKSAHVFHVSDFNAID 182 Query: 378 ESFDKITDKIQEQ 390 + K+ ++ E Sbjct: 183 KIRGKLRRRLCES 195 >gi|158425008|ref|YP_001526300.1| von Willebrand factor type A domain-containing protein [Azorhizobium caulinodans ORS 571] gi|158331897|dbj|BAF89382.1| von Willebrand factor type A domain protein [Azorhizobium caulinodans ORS 571] Length = 343 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 61/166 (36%), Gaps = 34/166 (20%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS--RLNKLNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I ++ PL+ + N V+ R + + T A+ A + L + Sbjct: 136 RIGLILFSTRAYVQA--PLTFDRNVVRDLLRTSSIGMTGQETAIGDAIALAVKTLRTRPQ 193 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 ++ ++ +TDG N+ L+ + E + G+KIY++ V A Sbjct: 194 E----------QRVLVLLTDGANNSGM-----LSPIPAAEIAKANGVKIYTIGVGADAFA 238 Query: 353 ------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L + + G++F D+ L ++ I Sbjct: 239 VGQRMVNPSFDLDEGALEQIAQMTGGRYFRARDAAGLAAIYNDIDR 284 >gi|149176865|ref|ZP_01855475.1| BatA [Planctomyces maris DSM 8797] gi|148844302|gb|EDL58655.1| BatA [Planctomyces maris DSM 8797] Length = 356 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 33/173 (19%), Positives = 64/173 (36%), Gaps = 38/173 (21%) Query: 248 VGNQCTPLSNNLNEVKSRLNKL-----NPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 + TP + + + S+LN + + T A+ A +L + Sbjct: 145 YADGITPPTLDHPYLVSQLNNIQIVTNRSEDGTAIGDAISLAVEKLN----ALDARRDEK 200 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------- 352 K +I +TDGEN+ + +Q E G+K+Y++ V E Sbjct: 201 VKSKVIILLTDGENNAG-----EVEPIQAAELAETLGIKVYTIGVGTKGEAPVPVTDPFS 255 Query: 353 ------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + L+K D + G++F D+ L + + +I D +++ V Sbjct: 256 GKQVVQWMPVNIDEATLQKVADLTHGKYFRATDTDSLEKIYHEI-DALEKTKV 307 >gi|84385675|ref|ZP_00988706.1| hypothetical protein V12B01_26114 [Vibrio splendidus 12B01] gi|84379655|gb|EAP96507.1| hypothetical protein V12B01_26114 [Vibrio splendidus 12B01] Length = 520 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 37/282 (13%), Positives = 86/282 (30%), Gaps = 18/282 (6%) Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 G +P + + S +S+ +VS + + + Sbjct: 232 GFVPFNVRTREIVSGNQARATSQLSYKDNYKT-NVSPYSYNDVNWDYWRTYSQNDVIRCA 290 Query: 182 P-------PPPKKSFWSKNTTKSKY--APAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 P + ++K Y A +D+ + + Sbjct: 291 DRQSRCSYPKSENQKYAKRIKDVIYQDNYLVADVYNYVDLSTSVSTMFTDKSGLQPDFYS 350 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + + LSN L+++ + +N + T + + + L++ Sbjct: 351 VSGTSLFNAHGSSNSSQFSNIRLSNKLSDL-NPINSMWADGGTAAFQGILRGSQVLHDGD 409 Query: 293 ESSHNTIGSTRLK---KFVIFITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAV 347 +S + K ++ ++DG+ S + L +C+ RN G+ I + + Sbjct: 410 PNSSDQEEQQVYNKKIKMLLILSDGQESPNNGILKGLVDRGMCDKARNEIPGLYIGVIGI 469 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 Q + C + + D L E +KI + I++ Sbjct: 470 DFRASQQSGFQDCVVDPNE--DIIDVSNLDELIEKIEELIRK 509 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 32/268 (11%), Positives = 81/268 (30%), Gaps = 29/268 (10%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +++ +A++ + ++++ + +AA ++ + DQ Sbjct: 27 MGLLLVPIMGMTFWAVEGTRYVQETSRLRDSAEAAAIAVTIE-----------DQPDQAR 75 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + K ++ +++ + + +D+ + + + ++ F Sbjct: 76 GLATKYVENYVRDIKSTN------LSADRFHQAEDEGAGVLEYIQYTVNAKTTHDSWFAS 129 Query: 122 GLIPSALTNLSLRSTG---IIERSSENLAISICMVLDVSRSMEDLY------LQKHNDNN 172 IPS L + I I V D S SM D + Sbjct: 130 SFIPSFDQQQDLAGRSLARKYPVYLGDNNIDIVFVSDFSGSMNDRWGSSRHIKIDDLKTA 189 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 + +L K+ + + KY + D L+ G + +++ Sbjct: 190 IDAISSKILCTSIKQDYVD---GEWKYVCDEPGEDTTGDKLLNRVGFVPFNVRTREIVSG 246 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 N + ++Y N N++N Sbjct: 247 NQARATSQLSYKDNYKTNVSPYSYNDVN 274 >gi|53714874|ref|YP_100866.1| putative outer membrane protein [Bacteroides fragilis YCH46] gi|52217739|dbj|BAD50332.1| putative outer membrane protein [Bacteroides fragilis YCH46] Length = 610 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 32/306 (10%), Positives = 88/306 (28%), Gaps = 26/306 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + E G+I + N++ + ++ A Y + L P Sbjct: 139 NAVNAEEYGEIQENGFKNVSDAPLSTFSIDVDA-ASYSNMRRFINKGKLPPVDAIRTEEL 197 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + + + + P + + N Sbjct: 198 VNYFSYDYPKPTGSDPVKITMEAGTCPWNADHRLVRIGLKAKEI------PTDNLPASNL 251 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 ++D++ S LVN+++ ++ + Y + Sbjct: 252 VFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKD--------KVAIVTYAGNAGEKLAST 303 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 ++ +++ +++L +T + AY+ S N +I TDG Sbjct: 304 PGSDKQKIREAIDELEASGSTAGGEGIMLAYKIAQKNFISGGNNR--------IILCTDG 355 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + + L ++ E R +G+ + + ++ ++ G +++ Sbjct: 356 DFNVGVSSDKEL--EKLIEQKRKSGIFLTVLGYGMGNYKDSKMQTLSEKGNGNHAYIDNL 413 Query: 374 RELLES 379 +E Sbjct: 414 QEANRV 419 >gi|301767086|ref|XP_002918971.1| PREDICTED: collagen alpha-1(XXI) chain-like [Ailuropoda melanoleuca] Length = 957 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + ++ NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSHDSGENLVAAMESIHYLGGNTRTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ E R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------EVKDAAEAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|219841908|gb|AAI44536.1| COL22A1 protein [Homo sapiens] Length = 1319 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 66/195 (33%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ L + Sbjct: 45 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPDRTRVGVVRYSDRPTTAFELGLFGSQE 99 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK+ +L NTNT A+ + ++ K+ I +TDG + Sbjct: 100 EVKAAARRLAYHGGNTNTGDALRYITARSFSPHAGGRPR--DRAYKQVAILLTDGRSQDL 157 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 G++I++V V ++ L + F V+D + Sbjct: 158 VLDAAAAAHRA--------GIRIFAVGVG--EALKEELEEIASEPKSAHVFHVSDFNAID 207 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 208 KIRGKLRRRLCENVL 222 >gi|298372684|ref|ZP_06982674.1| BatA protein [Bacteroidetes oral taxon 274 str. F0058] gi|298275588|gb|EFI17139.1| BatA protein [Bacteroidetes oral taxon 274 str. F0058] Length = 326 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 60/171 (35%), Gaps = 41/171 (23%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P+++N + + +++ + T + A + + Sbjct: 136 SFTQCPITSNHASLVNLFKQVDFGIIQDGTAIGLGLATAINRI----------KDAEGKS 185 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 K +I +TDG N+ + + + ++ G+++Y++ V Sbjct: 186 KVIILLTDGTNNTG-----DIAPISAAQIAQSYGIRVYTIGVGTQGIAEVPMLDQFGNIH 240 Query: 353 --------GQDLLRKCTDS-SGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + L++ + G++F + L + + +I DK+++ ++ Sbjct: 241 YTEAEVVIDETTLQQIASTTGGKYFRATNVSSLKQIYSEI-DKMEKTKLKT 290 >gi|292627386|ref|XP_001332035.3| PREDICTED: collagen alpha-1(XXI) chain-like [Danio rerio] Length = 1056 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 58/176 (32%), Gaps = 20/176 (11%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + + + ++G + Y+ ++ + K++ NT T A+ A Sbjct: 129 TSGFDVSSHYSQVGVVQYSDTPRLEIPLGQHKTTQQLIEAIEKISYLGGNTQTGRAIKFA 188 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 ++ + + + +TDG++ + R G+ +++ Sbjct: 189 VDHVFPSSRR------NDVKNRIAVVVTDGKSQDDVTDASLD--------ARTQGITVFA 234 Query: 345 VAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 V V + + L ++ D + D + K+ E+SV P R Sbjct: 235 VGVGSEITNSE-LVTIANTPAGDYVLFAEDYTNIERIRDAMEQKLCEESV--CPTR 287 >gi|126662671|ref|ZP_01733670.1| batA protein [Flavobacteria bacterium BAL38] gi|126626050|gb|EAZ96739.1| batA protein [Flavobacteria bacterium BAL38] Length = 334 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 22/196 (11%), Positives = 62/196 (31%), Gaps = 45/196 (22%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL------NKLNPY 272 N + +++K + TP++++ + L + + Sbjct: 111 NRLEALKKVAATFVQDRINDRIGLVVYAGESYTRTPVTSDKTIILQSLKSVEFDDSIIAD 170 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T + A + S + +I +TDG N+ T++ Sbjct: 171 G-TGIGVGLATAINRI----------KDSKAKSRIIILLTDGVNNSG-----TIDPRTAA 214 Query: 333 EYMRNAGMKIYSVAVSAPPE----------------------GQDLLRKCT-DSSGQFFA 369 + G+K+Y++ + + + L+++ ++ ++F Sbjct: 215 SIAKEYGIKVYTIGIGTNGKAMFPVAKDANGKLVFKMMPVEIDEKLMQEIAKNTDAKYFR 274 Query: 370 VNDSRELLESFDKITD 385 +++L +D+I Sbjct: 275 ATSNKKLQAIYDEINK 290 >gi|119612600|gb|EAW92194.1| collagen, type XXII, alpha 1, isoform CRA_b [Homo sapiens] Length = 1626 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 66/195 (33%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ L + Sbjct: 45 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPDRTRVGVVRYSDRPTTAFELGLFGSQE 99 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK+ +L NTNT A+ + ++ K+ I +TDG + Sbjct: 100 EVKAAARRLAYHGGNTNTGDALRYITARSFSPHAGGRPR--DRAYKQVAILLTDGRSQDL 157 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 G++I++V V ++ L + F V+D + Sbjct: 158 VLDAAAAAHRA--------GIRIFAVGVG--EALKEELEEIASEPKSAHVFHVSDFNAID 207 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 208 KIRGKLRRRLCENVL 222 >gi|40805823|ref|NP_690848.1| collagen, type XXII, alpha 1 [Homo sapiens] gi|296434458|sp|Q8NFW1|COMA1_HUMAN RecName: Full=Collagen alpha-1(XXII) chain; Flags: Precursor gi|225000822|gb|AAI72420.1| Collagen, type XXII, alpha 1 [synthetic construct] Length = 1626 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 66/195 (33%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ L + Sbjct: 45 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPDRTRVGVVRYSDRPTTAFELGLFGSQE 99 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK+ +L NTNT A+ + ++ K+ I +TDG + Sbjct: 100 EVKAAARRLAYHGGNTNTGDALRYITARSFSPHAGGRPR--DRAYKQVAILLTDGRSQDL 157 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 G++I++V V ++ L + F V+D + Sbjct: 158 VLDAAAAAHRA--------GIRIFAVGVG--EALKEELEEIASEPKSAHVFHVSDFNAID 207 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 208 KIRGKLRRRLCENVL 222 >gi|22652113|gb|AAN03620.1|AF406780_1 alpha 1 type XXII collagen [Homo sapiens] Length = 1626 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 66/195 (33%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ L + Sbjct: 45 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPDRTRVGVVRYSDRPTTAFELGLFGSQE 99 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK+ +L NTNT A+ + ++ K+ I +TDG + Sbjct: 100 EVKAAARRLAYHGGNTNTGDALRYITARSFSPHAGGRPR--DRAYKQVAILLTDGRSQDL 157 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 G++I++V V ++ L + F V+D + Sbjct: 158 VLDAAAAAHRA--------GIRIFAVGVG--EALKEELEEIASEPKSAHVFHVSDFNAID 207 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 208 KIRGKLRRRLCENVL 222 >gi|269965331|ref|ZP_06179451.1| hypothetical protein VMC_08810 [Vibrio alginolyticus 40B] gi|269829977|gb|EEZ84206.1| hypothetical protein VMC_08810 [Vibrio alginolyticus 40B] Length = 334 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 66/205 (32%), Gaps = 47/205 (22%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 ++ + + + V + R+G + + TPL+ + Sbjct: 116 LNGEYIDRLTAVKQVLSDFVAKRK---------GDRLGVVLFGDHAYLQ--TPLTADRKS 164 Query: 262 VKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V ++N+ + T + + + S ++ +I ++DG N+ Sbjct: 165 VMQQINQTVIGLVGQRTAIGDGIGLGTKTFVD----------SDAPQRVMILLSDGSNTA 214 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------EGQDLLRKC 360 L L+ E + IY+V V A + Q L + Sbjct: 215 GV-----LEPLEAAEIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAADLDEQTLTKVA 269 Query: 361 TDSSGQFFAVNDSRELLESFDKITD 385 + GQ+F D+ EL + +D I Sbjct: 270 EVTGGQYFRARDTEELEKIYDTINQ 294 >gi|218261917|ref|ZP_03476585.1| hypothetical protein PRABACTJOHN_02256 [Parabacteroides johnsonii DSM 18315] gi|218223693|gb|EEC96343.1| hypothetical protein PRABACTJOHN_02256 [Parabacteroides johnsonii DSM 18315] Length = 328 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 63/200 (31%), Gaps = 40/200 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + N PL+ + + + + P + T Sbjct: 109 NRLEASKDVASAFINGRPNDNIGLVVFAAESFTQCPLTTDHTVLLNLFKDVQPGIIQDGT 168 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A + S K +I +TDG N+ + + E Sbjct: 169 AIGLGLANAVSRI----------KDSQAKSKVIILLTDGVNNQG-----EIAPVTAAEIA 213 Query: 336 RNAGMKIYSVAVSA---------------------PPEGQDLLRKCTDSSGQFFAVNDSR 374 + G+++Y++ V + L + + GQ+F D+ Sbjct: 214 KTFGVRVYTIGVGTQGKAPYPFQTAFGVQYMDVDVEIDEPTLKQIAATTGGQYFRATDNA 273 Query: 375 ELLESFDKITDKIQEQSVRI 394 L E + +I DK+++ + + Sbjct: 274 SLKEIYSEI-DKMEKTKISV 292 >gi|120437734|ref|YP_863420.1| von Willebrand factor(vWA) type A domain-containing protein [Gramella forsetii KT0803] gi|117579884|emb|CAL68353.1| membrane protein containing von Willebrand factor(vWA) type A domain [Gramella forsetii KT0803] Length = 335 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 61/195 (31%), Gaps = 43/195 (22%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----N 274 N +++ + +E TP++++ V L + Sbjct: 112 NRLDATKNVAEEFIQDRPGDRIGLVVYAGESFTKTPITSDKAIVLDALEDIEYNNVLENG 171 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T + A + S K +I +TDG N+ ++ E Sbjct: 172 TAIGSGLATAVNRI----------KDSDAESKVIILLTDGVNNAG-----FIDPSTASEL 216 Query: 335 MRNAGMKIYSVAVSAPP-----------------------EGQDLLRKCTD-SSGQFFAV 370 G+K+Y++ V + +DLL++ + G++F Sbjct: 217 AVEFGIKVYTIGVGSNGMALSPVGVNPANGRLRFGNVQVEIDEDLLKEIAAATGGKYFRA 276 Query: 371 NDSRELLESFDKITD 385 ++ +L E + +I Sbjct: 277 TNNEKLEEIYAEIDS 291 >gi|293361231|ref|XP_243912.5| PREDICTED: collagen alpha-1(XII) chain [Rattus norvegicus] Length = 3119 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 58/166 (34%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ ++ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAIKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E S K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGSRAGF-----PKVAIIITDGKSQD--------EVEIPARELRNIGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 53/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ + Sbjct: 1235 RVQIALAQYSGDPRTEWHLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKT-- 1292 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1293 ---QAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 1339 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L + D +T + Sbjct: 1340 EVELKMIATDPDDIHAYNVADFESLSKIVDDLTINLCNSV 1379 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 63/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ N + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKEFNRVEDIIKAINNFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 N + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPNKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|154492261|ref|ZP_02031887.1| hypothetical protein PARMER_01895 [Parabacteroides merdae ATCC 43184] gi|154087486|gb|EDN86531.1| hypothetical protein PARMER_01895 [Parabacteroides merdae ATCC 43184] Length = 328 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 63/200 (31%), Gaps = 40/200 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 N + + + N PL+ + + + + P + T Sbjct: 109 NRLEASKDVASAFINGRPNDNIGLVVFAAESFTQCPLTTDHTVLLNLFKDVQPGIIQDGT 168 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + +A + S K +I +TDG N+ + + E Sbjct: 169 AIGLGLANAVSRI----------KDSQAKSKVIILLTDGVNNQG-----EIAPVTAAEIA 213 Query: 336 RNAGMKIYSVAVSA---------------------PPEGQDLLRKCTDSSGQFFAVNDSR 374 + G+++Y++ V + L + + GQ+F D+ Sbjct: 214 KTFGVRVYTIGVGTQGKAPYPFQTAFGVQYMDVDVEIDEPTLKQIAATTGGQYFRATDNA 273 Query: 375 ELLESFDKITDKIQEQSVRI 394 L E + +I DK+++ + + Sbjct: 274 SLKEIYSEI-DKMEKTKISV 292 >gi|116626306|ref|YP_828462.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116229468|gb|ABJ88177.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 310 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 28/193 (14%), Positives = 79/193 (40%), Gaps = 23/193 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + ++A + + A Q+ + ++++ + L + ++ Sbjct: 96 SNSIRDRFKFEQDAASEFIKGVVHANQD------KAMLVSFDTKA--ELVSDLIGDTEKL 147 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + L P T Y A+ A R+ ++ + H + ++ ++DG+++ + Sbjct: 148 DHAIRSLRPGGGTALYDAIFFACRDKLSQDQPKHKFR------RAIVIVSDGDDNQSQYT 201 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVS---APPEGQDLLR-KCTDSSGQFFAVNDSRELLE 378 ++ Q E + A + +YS++ + +G +L+ ++ G+ F +L + Sbjct: 202 RD-----QALEMAQKADVVLYSISTNISKIESDGDKVLKYYAAETGGKAFFPFKVEDLEQ 256 Query: 379 SFDKITDKIQEQS 391 SF+ I ++++ Q Sbjct: 257 SFENIANELRHQY 269 >gi|254505681|ref|ZP_05117827.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] gi|219551334|gb|EED28313.1| von Willebrand factor, type A [Vibrio parahaemolyticus 16] Length = 322 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 69/203 (33%), Gaps = 47/203 (23%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 ++ + + + + K R+G + + TPL+ + + Sbjct: 106 GDYVDRLTAVKKVVSDFAS---------KREGDRLGLVLFADHAYLQ--TPLTLDRKTIA 154 Query: 264 SRLNKLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++N+L E T + A + + S ++ ++ ++DG N+ Sbjct: 155 EQVNQLVLRLIGEKTAIGEGIGLATKTFVD----------SDAPQRVMVLLSDGSNTSGV 204 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------EGQDLLRKCTD 362 L+ L+ + + IY++ V A + + L+ Sbjct: 205 -----LDPLEAAKIAKKYNATIYTIGVGAGEMVVKEFFMTRKVNTAQDLDERTLMDIAQV 259 Query: 363 SSGQFFAVNDSRELLESFDKITD 385 + GQ+F D++EL +D I Sbjct: 260 TGGQYFRARDAKELATIYDTINS 282 >gi|296198593|ref|XP_002806760.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XII) chain-like [Callithrix jacchus] Length = 3113 Score = 77.3 bits (188), Expect = 4e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEDKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARVGF-----PKVAIVITDGKSQD--------EVEIPARELRNVGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLTHVFNVANFDAIVDIQNEIISQVCS 318 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 53/165 (32%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+I Y+ + + + L NT T A++ ++ Sbjct: 1230 DIGPKRVQIALAQYSGDPRTEWQLNTHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQS 1289 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + R +K + ITDG++ + + +++ G++++++ + Sbjct: 1290 F-----RTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI- 1335 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 1336 -KNADEVELKMIATDPDDTHAYNVADFESLSRIVDDLTINLCNSV 1379 Score = 66.5 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKXFTKVEDIIEAINNFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|310115525|ref|XP_003120135.1| PREDICTED: collagen alpha-1(XXII) chain-like [Homo sapiens] Length = 788 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 66/195 (33%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ L + Sbjct: 45 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPDRTRVGVVRYSDRPTTAFELGLFGSQE 99 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK+ +L NTNT A+ + ++ K+ I +TDG + Sbjct: 100 EVKAAARRLAYHGGNTNTGDALRYITARSFSPHAGGRPR--DRAYKQVAILLTDGRSQDL 157 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 G++I++V V ++ L + F V+D + Sbjct: 158 VLDAAAAAHRA--------GIRIFAVGVG--EALKEELEEIASEPKSAHVFHVSDFNAID 207 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 208 KIRGKLRRRLCENVL 222 >gi|149773091|emb|CAO01895.1| collagen type VI alpha 6 [Mus musculus] Length = 1162 Score = 77.3 bits (188), Expect = 5e-12, Method: Composition-based stats. Identities = 36/338 (10%), Positives = 98/338 (28%), Gaps = 31/338 (9%) Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 ++K K+ + + QI + + ++ Sbjct: 281 QVGQAYTGAALRKTRKEIFSAQRGSRKNQGVPQIAVLVTHRASEDNVTKAAVNLRREGVT 340 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNN 173 +F G+ + L ++ E+ + L + ++ L + Sbjct: 341 IFTMGIEGANPDELEKIASHPAEQFTSKLGNFSELATHNQTFLKKLRNQITHTVSVFSER 400 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + K + + P + +V A + Sbjct: 401 TETLKSACVDTEEADIYLLIDGSGSTQP------TDFHEMKTFLSEVVGMFNIAPHK--- 451 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 VR+G + Y +N ++ + + TNT A++ + L K Sbjct: 452 --VRVGAVQYADTWDLEFEISKYSNKPDLGKAIENIRQMGGNTNTGAALNFTLKLLQRAK 509 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + + ++ +T+G + + L +R ++++++ V Sbjct: 510 KERGSK-----VPCHLVVLTNGMSRDS--------VLGPAHKLREENIRVHAIGV--KEA 554 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q LR+ + + V++ L +++ +I + Sbjct: 555 NQTQLREIAGEEKRVYYVHEFDALRNIRNQVVQEICAE 592 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 41/354 (11%), Positives = 103/354 (29%), Gaps = 34/354 (9%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + +++ ++ + ++A Sbjct: 818 ADVGKNQVRFGALKYADDPEVLFYLDELGTKLEVVSVLQNDHPMGGNTYTAEALAFSDHM 877 Query: 117 NLFL------KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L ++ + + Sbjct: 878 FTEARGSRLHKGVPQVLIVITDGESHDAEKLNTTAKALRDKGILVLAVGIAGANSWELLA 937 Query: 171 NNNMTSNKYLLPPPPK----KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 + Y + S S + S + ++ S + QK Sbjct: 938 MAGSSDKYYFVETFGGLKGIFSDVSASVCNSSKVDCEIEKVDLVFLMDGSNSIHPDDFQK 997 Query: 227 A---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTN 276 + VRIG ++ E+ +++ + T+ Sbjct: 998 MKGFLVSVVQDFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTH 1057 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ R++ + + + + ++ +TDG + Q E +R Sbjct: 1058 IGDAL----RKVKYYFQPDMGSRINAGTPQVLLVLTDGRSQD--------EVAQAAEELR 1105 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 1106 HKGVDIYSVGIG--DVDDQELVQITGTAEKKLTVHNFDELKKVKKRIVRNICTS 1157 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 V+IG + ++ + +++ + ++++ T T A+ + +K Sbjct: 636 DRVQIGVVQFSHENKEEFQLNTFMSQSDIANAIDRMTHIGETTLTGSALTFVSQYFSPDK 695 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPP 351 + ++KF+I ITDGE +R G+ IYSV + Sbjct: 696 GARP------NVRKFLILITDGEAQD--------IVRDPAIALRKEGVIIYSVGVFGSNV 741 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + F V + L D + I Sbjct: 742 ---TQLEEISGKPEMVFYVENFDILQHIEDDLVLGICSPR 778 Score = 43.0 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 54/162 (33%), Gaps = 17/162 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNE 291 R+ Y+ + N N + + L N + A+ A+R ++ Sbjct: 41 NKYRVALAQYSDALHNEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSA 100 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + V+ + ++ + + + +R G+KI SV V Sbjct: 101 PTNGRDKK--QFPPILVVLAS---------AESEDDVEEAAKALREDGVKIISVGVQK-- 147 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ L+ S F + +R+L F +I + + Sbjct: 148 ASEENLKAMATSQ-FHFNLRTARDLS-VFAPNMTEIIKDVTQ 187 >gi|254525166|ref|ZP_05137221.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] gi|219722757|gb|EED41282.1| von Willebrand factor, type A [Stenotrophomonas sp. SKA14] Length = 334 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 66/186 (35%), Gaps = 34/186 (18%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENT 275 + + + + + + + G TPL+ +L V+ +L T Sbjct: 124 DRLTAAKAVLADFLDRRAGDRIGLLIFGDRAYTLTPLTADLASVRDQLRDSVVGLAGRET 183 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 A+ A + L ++ E ++ +I +TDG ++ L L+ E Sbjct: 184 AIGDAIGLAVKRLRSQPEG----------QRVLILLTDGVSNAGV-----LEPLRAAEVA 228 Query: 336 RNAGMKIYSVAVSAPPE---------------GQDLLRKCTD-SSGQFFAVNDSRELLES 379 R G++I++VA + L+K + GQFF D+ +L Sbjct: 229 RAEGVRIHTVAFGGDGSMRVFGISISADQDPVDEATLKKIAGMTGGQFFRARDTAQLAGI 288 Query: 380 FDKITD 385 + ++ Sbjct: 289 YAELDR 294 >gi|167032571|ref|YP_001667802.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166859059|gb|ABY97466.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 358 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 62/187 (33%), Gaps = 35/187 (18%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-- 270 + ++ ++ + + G PL+ + V++ L++ Sbjct: 109 WKNEDISRLDLVKALMGDFLQDREGDRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAKIG 168 Query: 271 -PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +NT A+ A + L + ++ ITDG N+G + L Sbjct: 169 IAGKNTAIGDAIGLAVKRL----------RQRPAQSRVLVLITDGANNGGQIH-----PL 213 Query: 330 QICEYMRNAGMKIYSVAVSAPPE----------------GQDLLRKCTD-SSGQFFAVND 372 G++IY++ + A PE + L++ D + G +F +D Sbjct: 214 TAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALKEIADITHGAYFRAHD 273 Query: 373 SRELLES 379 EL Sbjct: 274 GAELDAI 280 >gi|28901309|ref|NP_800964.1| hypothetical protein VPA1454 [Vibrio parahaemolyticus RIMD 2210633] gi|308125557|ref|ZP_05775735.2| von Willebrand factor type A [Vibrio parahaemolyticus K5030] gi|28809856|dbj|BAC62797.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308112309|gb|EFO49849.1| von Willebrand factor type A [Vibrio parahaemolyticus K5030] Length = 328 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 59/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKE 293 R+G + + TPL+ + V ++ + + T + + + Sbjct: 136 RLGVVLFGDHAYLQ--TPLTADRQTVIQQIKQTVIGLVGQRTAIGDGIGLGTKTFVD--- 190 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-- 351 S ++ +I ++DG N+ L+ ++ E + IY+V V A Sbjct: 191 -------SDAPQRVMILLSDGSNTAGV-----LDPIEAAEIAKKYNATIYTVGVGAGEMM 238 Query: 352 ----------------EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + Q L + + GQ+F D+ +L + +D I Sbjct: 239 VKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAEQLEKIYDTINK 288 >gi|148975971|ref|ZP_01812760.1| hypothetical protein VSWAT3_06656 [Vibrionales bacterium SWAT-3] gi|145964716|gb|EDK29969.1| hypothetical protein VSWAT3_06656 [Vibrionales bacterium SWAT-3] Length = 520 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 34/276 (12%), Positives = 80/276 (28%), Gaps = 17/276 (6%) Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 +G ++ L+ +VS + + + Sbjct: 237 NVRTREIVSGGQANATSQLSYKPNYKPNVSPYSYNDVNWDYWRAYSQNEVLNCANWQSYC 296 Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLI-ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI- 245 + K V + +L S+ +K L + Sbjct: 297 PNPKSDNQKYAKRIKDVIYLDNYHVADVYNYVDLSTSVATMFTDKSGLRPNFYGVNGTDL 356 Query: 246 -------GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 + LSN L+++ + ++ + T + + + L + +S + Sbjct: 357 FNAHGSSSSTQFKNIRLSNKLSDL-NPISSMWADGGTAAFQGILRGSQILKDGDPNSSDD 415 Query: 299 IGSTRLK---KFVIFITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEG 353 K ++ ++DG+ S + L +C+ R G+ I + + Sbjct: 416 EEQQAYNKKIKMLLILSDGQESPNNGILKGLVDRGMCDKAREEIPGLYIGVIGIDFRASQ 475 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 Q + C + + D L E +KI + I++ Sbjct: 476 QSGFQDCVIDPNE--DIIDVSNLDELIEKIEELIRK 509 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 35/374 (9%), Positives = 103/374 (27%), Gaps = 55/374 (14%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +++ +A++ + ++++ + +AA ++ + D Sbjct: 27 MGLLLVPIMGMTFWAVEGTRYVQETSRLRDSAEAAAIAVTIE-----------DQPDLAR 75 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + ++ +++ + + T+D+ + + + ++ F Sbjct: 76 GLATQYVENYVRDIKSTN------LSAQRFHQTEDEGAGILEYIQYTVNAKTTHDSWFAS 129 Query: 122 GLIPSALTNLSLRSTG---IIERSSENLAISICMVLDVSRSMEDLY------LQKHNDNN 172 IPS L + I I V D S SM+D + Sbjct: 130 SFIPSFDEQQDLAGRSLARKYPVYLGDNNIDIVFVSDFSGSMDDRWGSSRHKKIDDLKTA 189 Query: 173 NMTSNKYLLPPPPKKSFWS-----------KNTTK----SKYAPAPAPANRKIDVLIESA 217 + +L + ++TT ++ P + V A Sbjct: 190 IDQISSKILCTSTDLEYVDGEWKEVCDEPGEDTTGDKLLNRVGFVPFNVRTREIVSGGQA 249 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNI------------GIVGNQCTPLSNNLNEVKSR 265 K + + ++ + C ++ + R Sbjct: 250 NATSQLSYKPNYKPNVSPYSYNDVNWDYWRAYSQNEVLNCANWQSYCPNPKSDNQKYAKR 309 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + +N + ++ +L + R + + TD N+ S+ Sbjct: 310 IKDVIYLDNYHVADVYNYV--DLSTSVATMFTDKSGLRPNFYGVNGTDLFNAHGSSSSTQ 367 Query: 326 LNTLQICEYMRNAG 339 +++ + + Sbjct: 368 FKNIRLSNKLSDLN 381 >gi|73973418|ref|XP_532180.2| PREDICTED: similar to alpha 1 type XXI collagen precursor [Canis familiaris] Length = 961 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + ++ NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSHDSGKNLVAAMESIHYLGGNTRTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ E R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------EVKDAAEAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|51893456|ref|YP_076147.1| hypothetical protein STH2318 [Symbiobacterium thermophilum IAM 14863] gi|51857145|dbj|BAD41303.1| conserved hypothetical protein [Symbiobacterium thermophilum IAM 14863] Length = 414 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 22/174 (12%), Positives = 58/174 (33%), Gaps = 20/174 (11%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + ++ LV+ + + R+ + Y+ + + + V+ ++ + Sbjct: 60 LYFTKQALRFLVDQMAEED--------RLAIVTYDDQVHVPFPSQPVVQKDAVRLLVDGI 111 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 TN + +++ + V+ +TDG + + L Sbjct: 112 TAGGTTNLSGGLATGMQQIRPHAGPGRVSR--------VLLMTDGLANVGVTDPDVLA-- 161 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 R G+ + ++ V P +DLL + G F + + ++ F + Sbjct: 162 GWARAWREKGLAVSTMGVG-PHFSEDLLVALAEAGGGNFHYIANPDQIPRIFQE 214 >gi|87308834|ref|ZP_01090973.1| hypothetical protein DSM3645_11362 [Blastopirellula marina DSM 3645] gi|87288545|gb|EAQ80440.1| hypothetical protein DSM3645_11362 [Blastopirellula marina DSM 3645] Length = 616 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 31/333 (9%), Positives = 94/333 (28%), Gaps = 28/333 (8%) Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIP 125 I H + + + + + + ++ + +I + + L P Sbjct: 133 DGIAGHGEGPGVGGDKFAYVENNPFRAVADEPLSTFSIDVDTASYSKIRSYLIDYHQLPP 192 Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP 185 + + + + + P Sbjct: 193 QGAVRVEELINYFTYDYATPTDQKPFAANVEAAACPWNAEHRLVRIGIKGKEIANAERPA 252 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + + + S RK+ +L + LV+ + + ++ + Y Sbjct: 253 SNLVFLLDVSGSMNN------ARKLPLLKQGMKLLVDQLGEND--------KVAIVVYAG 298 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + ++ + + L++L +TN + AY+ Sbjct: 299 AAGMVLNSTNGDDKSTIMEALDRLQAGGSTNGGQGIELAYQAATENFIKGGVNR------ 352 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS-S 364 VI TDG+ + + L T+ + + +G+ + + ++ + + + Sbjct: 353 --VILCTDGDFNVGVTSTSDLVTMA-ADKAK-SGVFLSVMGFGTGNHNDAMMEELSGKAN 408 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 G + ++ E + + +++ IA + Sbjct: 409 GNYAFIDTITEAKKV---LVEQMSGTLTTIAKD 438 >gi|115373770|ref|ZP_01461063.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115369169|gb|EAU68111.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 420 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 67/191 (35%), Gaps = 20/191 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLN 267 K+D +A + + + R+ + Y + T + ++ + +N Sbjct: 62 KLDDAKRAAREFITRVSEED--------RVALVHYGTDVTVFPSTLATPETREQMLTFVN 113 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + +TN + A ++L + + +I ++DG+ + Sbjct: 114 AIEDEGSTNISGGLEAAAQQLQKNADQFRVSR--------IILLSDGQPTAGL--TREEQ 163 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + +R+ GM + ++ V ++L++ D G F S +L E F + ++ Sbjct: 164 LTALARNLRSQGMAVSALGVG-EDFNENLMQGIADQGGGFSGFLRSDQLAEVFTRELEQA 222 Query: 388 QEQSVRIAPNR 398 R R Sbjct: 223 TSTVARAVEVR 233 >gi|182414212|ref|YP_001819278.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841426|gb|ACB75678.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 377 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 35/182 (19%), Positives = 61/182 (33%), Gaps = 39/182 (21%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNE 291 S RIG + ++ PL+ + + S+L ++ + T + L Sbjct: 154 SDRIGIVLFSGRAYTMA--PLTFDHRWLGSQLERIKVGLIEDGTAIGDGLGVGLTRL--- 208 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP- 350 E + G R FV+ +TDG N+ S L Q E + G+ +Y++ Sbjct: 209 -EQAQRESGGKRQGAFVVLLTDGANNRGS-----LTPQQAAELAKARGIPVYTIGAGQDG 262 Query: 351 ---------------------PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKI--TDK 386 + LR + + G FF D + +F I K Sbjct: 263 IVPFPVFDDKGRKLGYRRIMSDLDEGALRDIAEMTGGHFFRAADVGTVESAFRAIDRAQK 322 Query: 387 IQ 388 I+ Sbjct: 323 IE 324 >gi|87310694|ref|ZP_01092822.1| BatA [Blastopirellula marina DSM 3645] gi|87286675|gb|EAQ78581.1| BatA [Blastopirellula marina DSM 3645] Length = 355 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 35/214 (16%), Positives = 77/214 (35%), Gaps = 42/214 (19%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ + + AG+ V + + + + TP + + + S+LN Sbjct: 110 DRLTAIKKVAGDFV-----TGGDNLDGRLSDLVGLITFAGYADGVTPPTLDHAFLVSQLN 164 Query: 268 --KLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 ++ + T A+ A +L + K +I +TDGEN+ Sbjct: 165 HSQIVTNRSEDGTAIGDAISLAVEKLN----ALDARRKEKIQSKIIILLTDGENNAG--- 217 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------------GQDLLRKCT 361 L +Q E + G+K+Y++ V ++ L+K Sbjct: 218 --DLEPIQAAELAQTMGIKVYTIGVGTKGRAPMPVTDMFGRQSMQWMSVNIDEETLQKVA 275 Query: 362 D-SSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + G++F D+ L + + +I D++++ V Sbjct: 276 SITGGKYFRATDTDSLAKIYGEI-DQLEKTKVET 308 >gi|91223292|ref|ZP_01258558.1| hypothetical protein V12G01_05596 [Vibrio alginolyticus 12G01] gi|91192105|gb|EAS78368.1| hypothetical protein V12G01_05596 [Vibrio alginolyticus 12G01] Length = 334 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 66/205 (32%), Gaps = 47/205 (22%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 ++ + + + V + R+G + + TPL+ + Sbjct: 116 LNGEYIDRLTAVKKVLSDFVAKRK---------GDRLGVVLFGDHAYLQ--TPLTADRKT 164 Query: 262 VKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V ++N+ + T + + + S ++ +I ++DG N+ Sbjct: 165 VMQQINQTVIGLVGQRTAIGDGIGLGTKTFVD----------SDAPQRVMILLSDGSNTA 214 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------EGQDLLRKC 360 L L+ E + IY+V V A + Q L + Sbjct: 215 GV-----LEPLEAAEIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAADLDEQTLTKVA 269 Query: 361 TDSSGQFFAVNDSRELLESFDKITD 385 + GQ+F D+ EL + +D I Sbjct: 270 EVTGGQYFRARDTEELEKIYDTINQ 294 >gi|258647263|ref|ZP_05734732.1| BatA protein [Prevotella tannerae ATCC 51259] gi|260852912|gb|EEX72781.1| BatA protein [Prevotella tannerae ATCC 51259] Length = 334 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 31/208 (14%), Positives = 67/208 (32%), Gaps = 47/208 (22%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLN-KLNPYE- 273 + + + ++ E N G PL+ + + + + ++N L Sbjct: 108 SRIETAKQVAYEFINNRPDDNIGLTVFGGEAYTQCPLTTDHSALLNMFKQVNCDLQKEGV 167 Query: 274 ---NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T + A L K S K +I +TDGEN+ L Sbjct: 168 ISPGTAIGMGLSSAVSHLEQSKSKS----------KVIILLTDGENNAGEIS-----PLT 212 Query: 331 ICEYMRNAGMKIYSVAVSAPP----------------------EGQDLLRKCT-DSSGQF 367 E + G++IY+++V L + G+F Sbjct: 213 AAEMAKRLGIRIYTISVGTDAAVNQTVATLPNGETYEAAIKQNTDPKTLEAIANSTGGKF 272 Query: 368 FAVNDSRELLESFDKITDKIQEQSVRIA 395 + +L + + I D++++ ++++ Sbjct: 273 YQARSKAKLRDIYQNI-DRLEKTKLKVS 299 >gi|72162840|ref|YP_290497.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71916572|gb|AAZ56474.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 609 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 27/205 (13%), Positives = 67/205 (32%), Gaps = 14/205 (6%) Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 P +T+ S P + ++++ E+A ++ + + + Sbjct: 408 KPANVLLVIDTSGSMQESVPGTGSTRLELAKEAAITSLDEFSDSDRVGLWMFSTDLEDNG 467 Query: 244 NIGIVGNQCTPL------SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 PL + E+ R++ L P T Y A+ + Sbjct: 468 QDWRELVPLGPLGASVNGTPRREELAERISNLPPGGGTGLYDTALAAHTLVAEHSRPDAI 527 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 V+F+TDG+N + I G++I++++ + + + Sbjct: 528 --------NAVVFLTDGKNEDLNGISLEKLLDSITPEPGQQGVRIFTISYGEDADLKTMT 579 Query: 358 RKCTDSSGQFFAVNDSRELLESFDK 382 + ++ + +D + + E F+ Sbjct: 580 QIAEATNAAAYDASDPQSIDEVFEA 604 >gi|153835956|ref|ZP_01988623.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|260880154|ref|ZP_05892509.1| von Willebrand factor type A [Vibrio parahaemolyticus AN-5034] gi|260895271|ref|ZP_05903767.1| von Willebrand factor type A [Vibrio parahaemolyticus Peru-466] gi|260900622|ref|ZP_05909017.1| von Willebrand factor type A [Vibrio parahaemolyticus AQ4037] gi|149750710|gb|EDM61455.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|308085798|gb|EFO35493.1| von Willebrand factor type A [Vibrio parahaemolyticus Peru-466] gi|308091801|gb|EFO41496.1| von Willebrand factor type A [Vibrio parahaemolyticus AN-5034] gi|308107055|gb|EFO44595.1| von Willebrand factor type A [Vibrio parahaemolyticus AQ4037] Length = 334 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 59/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKE 293 R+G + + TPL+ + V ++ + + T + + + Sbjct: 142 RLGVVLFGDHAYLQ--TPLTADRQTVIQQIKQTVIGLVGQRTAIGDGIGLGTKTFVD--- 196 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-- 351 S ++ +I ++DG N+ L+ ++ E + IY+V V A Sbjct: 197 -------SDAPQRVMILLSDGSNTAGV-----LDPIEAAEIAKKYNATIYTVGVGAGEMM 244 Query: 352 ----------------EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + Q L + + GQ+F D+ +L + +D I Sbjct: 245 VKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAEQLEKIYDTINK 294 >gi|292491521|ref|YP_003526960.1| hypothetical protein Nhal_1422 [Nitrosococcus halophilus Nc4] gi|291580116|gb|ADE14573.1| conserved hypothetical protein [Nitrosococcus halophilus Nc4] Length = 398 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 21/149 (14%), Positives = 39/149 (26%), Gaps = 9/149 (6%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + I + A+D+ H + ++Q+ALDAA LSG + + T Sbjct: 18 LFTIGMVAIIGMAGLALDMGHAYLNKTRLQNALDAAALSGAKVLNDMHDVGQATAA---A 74 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 T F ++ L + ++ + P + Sbjct: 75 LTTFNMHLEGELADAGLVPTVEVSETLSP-----FAPGGINPRYLRARVN-DFPMQVWLA 128 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAIS 149 + L T S Sbjct: 129 QVLPGIGNTQSVGGSAVAGPIPLGPKKCD 157 >gi|189461337|ref|ZP_03010122.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] gi|189431866|gb|EDV00851.1| hypothetical protein BACCOP_01987 [Bacteroides coprocola DSM 17136] Length = 332 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 68/252 (26%), Gaps = 56/252 (22%) Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + + S A +++ + A + Sbjct: 72 ARPQTTDNWQNTEIEGIDIMLAVDVSTSMLAEDLKPNRLEAAKQVAAEFI---------- 121 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--------ENTNTYPAMHH 283 N PL+ + + + N + + T + + Sbjct: 122 -NGRPNDNIGLTVFAGEAFTQCPLTVDHGVLLNLFNSIKGDIAQRGMIEDGTAIGMGLAN 180 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A L S K +I +TDG N+ L E + G+++Y Sbjct: 181 AISRL----------KDSKAKSKVIILLTDGSNNRGDIS-----PLTAAEIAKQFGIRVY 225 Query: 344 SVAVSAPPE--------------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDK 382 ++ V + L + ++G +F + +L E + + Sbjct: 226 TIGVGTNGTAPYPMQTYAGVQYVNVPVEIDEQTLTQIAGTTNGNYFRATSNSKLEEVYRE 285 Query: 383 ITDKIQEQSVRI 394 I DK+++ + + Sbjct: 286 I-DKLEKTKLNV 296 >gi|118353830|ref|XP_001010180.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291947|gb|EAR89935.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 544 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 25/149 (16%), Positives = 52/149 (34%), Gaps = 14/149 (9%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 + R+ I +N +NN N+ K +NK+ T+ M A+R L Sbjct: 154 DNDRLCLILFNSYSTRLCHLMKTNNSNKPAFKEIINKIQATGGTDINSGMELAFRVLKE- 212 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + ++DG++ + + ++ I+S + Sbjct: 213 -------RKYQNPVSSIFLLSDGQD----GSADLRVRQSLERHLPQECFTIHSFGFGSDH 261 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESF 380 +G + + C+ G F+ V ++ E F Sbjct: 262 DGPLMNKICSLKDGNFYYVEKINQVDEFF 290 >gi|145219382|ref|YP_001130091.1| hypothetical protein Cvib_0567 [Prosthecochloris vibrioformis DSM 265] gi|145205546|gb|ABP36589.1| conserved hypothetical protein [Chlorobium phaeovibrioides DSM 265] Length = 356 Score = 76.9 bits (187), Expect = 5e-12, Method: Composition-based stats. Identities = 27/254 (10%), Positives = 63/254 (24%), Gaps = 23/254 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIK-DPTTKKDQ 59 + A+++ V F A+DLA I ++ ++Q+A DAA L G S+ + + + Sbjct: 21 LFALVLPVLLGFAALAVDLARIHLVKVELQNAADAASLGGAHSLSDAGGQPYNWSAAVNA 80 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY-----------IAESK 108 + + + ++ Sbjct: 81 AQNVVQSNVAN--GAHIQDATIETGYWNLQNPSLGLRPAGTGSVPAAGDVPAVRTTVAIS 138 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH 168 A LF ++ A +++ + +I S + ++ Sbjct: 139 ANQNNGPLPLFFAPILGIAESSIQASAIAVIAPPSGGTGMFPFVIAAPMLDHYWDRDTNS 198 Query: 169 ------NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI---DVLIESAGN 219 + Y + TT P+ D+ Sbjct: 199 PVLENGVAPTIKLGSIYHFEDSEDGVLSGEWTTFQTEDGNPSGRFLWDLLEDLTTNGNDT 258 Query: 220 LVNSIQKAIQEKKN 233 ++ + Sbjct: 259 ALSIGDNTYIQPGT 272 >gi|260459671|ref|ZP_05807925.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] gi|259034473|gb|EEW35730.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] Length = 718 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 36/357 (10%), Positives = 83/357 (23%), Gaps = 26/357 (7%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP 100 S K T + R+ D +D + Sbjct: 207 ADGTTSRIARMPAAESKLMTPQQPATAPADQIAPQEENRDRVQDFKTNPVHAALEDPVST 266 Query: 101 LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM 160 + A Y +L + + + + S VS Sbjct: 267 FSIDVD-TASYSFVRRSLKEGFVPQADTVRVEEMINYFPYDWKGPDSASTPFNSTVSVMP 325 Query: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL 220 + + PK + K+ +L + L Sbjct: 326 TPWNTHTKLMHVAIKGFDVKPTEQPKANLV-----FLIDVSGSMDEPDKLPLLKSAFRLL 380 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 V+ ++ I + Y +++ + ++ L P +T Sbjct: 381 VSKLKADDT--------ISIVTYAGDAGTVLMPTKIAEKDKILNAIDNLQPGGSTAGEAG 432 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + AY+ V+ TDG+ + + + ++ E R G+ Sbjct: 433 IKEAYKLAQQSFIKDGVNR--------VMLATDGDFNVGQTDDD--DLKRLIEQERKTGV 482 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + ++++ + L E+ + + IA + Sbjct: 483 FLSVFGFGRGNLNDEMMQTIAQNGNG--TAAYIDTLAEAEKVLVEDASSTLFTIAKD 537 >gi|109000862|ref|XP_001094970.1| PREDICTED: cartilage matrix protein [Macaca mulatta] Length = 495 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 42/378 (11%), Positives = 114/378 (30%), Gaps = 52/378 (13%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKN 98 + VS + + ST + + + G ++ I+ I Sbjct: 95 RAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKALSDAEGGRSRSPDISKVVIVVTDG 154 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 P + + A+ LF G+ L ++ + + S ++ +SR Sbjct: 155 RPQDSVQDVSARARASGVELFAIGVGRVDKATLRQIASEPQD-EHVDYVESYSVIEKLSR 213 Query: 159 SMEDLY----LQKHNDNNNMTSNKYLLPPPPKKSF---WSKNTTKSKYAPAPAPANR--- 208 ++ + +++ P + ++ N+ Sbjct: 214 KFQEAFCVVSDLCATGDHDCEQVCISSPGSYTCACREGFTLNSDGKTCNVCSGGGGSSAT 273 Query: 209 ---------------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 +++ + +V+++ + + ++G + Y+ + Sbjct: 274 DLVFLIDGSKSVRPENFELVKKFINQIVDTL-----DVSDKLAQVGLVQYSSSVRQEFPL 328 Query: 254 PLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + ++K+ + ++ + T T A+ + + + +K I T Sbjct: 329 GRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFT 383 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAV 370 DG + + ++ G K+++V V +D LR+ +F Sbjct: 384 DGRSQDYIND--------AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYT 433 Query: 371 NDSRELLESFDKITDKIQ 388 D + + + K+ KI Sbjct: 434 ADFKTINQIGKKLQKKIC 451 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 54/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 71 DVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKA 130 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + S + K VI +TDG + + R +G++++++ V Sbjct: 131 LSDAEGGRSR--SPDISKVVIVVTDGRPQDS--------VQDVSARARASGVELFAIGVG 180 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L F + Sbjct: 181 R--VDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQE 217 >gi|297471458|ref|XP_002685231.1| PREDICTED: collagen type VI alpha 4-like [Bos taurus] gi|296490811|gb|DAA32924.1| collagen type VI alpha 4-like [Bos taurus] Length = 780 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 58/166 (34%), Gaps = 23/166 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 V+ G + Y+ + +++ ++ ++ + T A+ + + Sbjct: 204 DRVQFGVVQYSDEVSPQFTLSQHSSVAGLEVAVDSIQQKGGGTKMGEALGSMIQVFADSA 263 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 S+ ++I +TDG++ A E +R G+ IY+V V Sbjct: 264 RSNVPW--------YLIVVTDGQSMDPVAD--------AAEALRGHGVTIYAVGV--RDA 305 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 L++ + + F V+D L ++ I + NR Sbjct: 306 NIAELQEIAE--DRMFFVHDFESLKTIQQEVVQDICS--LETCKNR 347 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 19/154 (12%), Positives = 54/154 (35%), Gaps = 17/154 (11%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKES 294 ++IG + ++ ++ +V ++ + T T A++ + + Sbjct: 387 IQIGLLQFSSDPQEEFRLNRYSSKVDVHRAISDVKQINGGTYTGKALNFTLPFFGSSRGG 446 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + +++I +TDG + N + +R+ + I+++ V Q Sbjct: 447 RP------SVHQYLIVVTDGV--------SRDNVALPAKALRDRNIIIFAIGVGEVKFSQ 492 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 L + T+ + + L +I ++ Sbjct: 493 LL--EITNDQSKVYYEEKFESLQNLEKEILYQVC 524 >gi|258620051|ref|ZP_05715090.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258587409|gb|EEW12119.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 308 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 32/170 (18%), Positives = 59/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 R+G I + TPL+ + V S+LN+ L T + A + + Sbjct: 119 RMGLILFADHAYLQ--TPLTLDRQTVISQLNQAVLKLIGTQTAIGEGIGLATKTFID--- 173 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 S ++ +I ++DG N+ L+ L+ + IY+V V A Sbjct: 174 -------SDAPQRVMILLSDGSNTAGV-----LDPLEAANIAKQYQTTIYTVGVGAGEMI 221 Query: 351 --------------PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + GQ+F + ++L +D I Sbjct: 222 VKDFLFSRKVNTAQDLDEKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 271 >gi|326795817|ref|YP_004313637.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] gi|326546581|gb|ADZ91801.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] Length = 337 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 51/153 (33%), Gaps = 36/153 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PLS + V+ + + E T A+ ++L KK +I Sbjct: 153 PLSFDTKTVRQLIQETQIGFAGEKTAIGDAIGLGIKQLSELPSD----------KKVLIL 202 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEG 353 +TDG N+ LQ + G+ I+++ + A Sbjct: 203 MTDGANTAGRVS-----PLQAANFAAEQGVTIHTIGIGADEMEVQGFFGPQTVNPSEDLD 257 Query: 354 QDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + LL + G+++ + +L E + I + Sbjct: 258 EALLENVASLTGGKYYRAKSTSDLEEIYGDINN 290 >gi|190575666|ref|YP_001973511.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] gi|190013588|emb|CAQ47223.1| putative von Willebrand factor-like protein [Stenotrophomonas maltophilia K279a] Length = 334 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 66/186 (35%), Gaps = 34/186 (18%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENT 275 + + + + + + + G TPL+ +L V+ +L T Sbjct: 124 DRLTAAKAVLADFLDRRAGDRIGLLIFGDRAYTLTPLTADLASVRDQLRDSVVGLAGRET 183 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 A+ A + L ++ E ++ +I +TDG ++ L L+ E Sbjct: 184 AIGDAIGLAVKRLRSQPEG----------QRVLILLTDGVSNAGV-----LEPLRAAEVA 228 Query: 336 RNAGMKIYSVAVSAPPE---------------GQDLLRKCTD-SSGQFFAVNDSRELLES 379 + G++I++VA + L+K + GQFF D+ +L Sbjct: 229 QAEGVRIHTVAFGGDGSMRFLGIPISADQDPVDEATLKKIASLTGGQFFRARDTAQLAGI 288 Query: 380 FDKITD 385 + ++ Sbjct: 289 YAELDR 294 >gi|311260227|ref|XP_003128387.1| PREDICTED: collagen alpha-1(XXI) chain-like [Sus scrofa] Length = 957 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + +++G + Y+ V + ++ + + ++ NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGRHESGEKLLAAVGSIHYLGGNTRTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ E R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------EVKDAAEAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFHVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|293339632|gb|ADE44108.1| collagen type XXI alpha 1 [Sus scrofa] Length = 895 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 65/176 (36%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + +++G + Y+ V + ++ + + ++ NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGRHESGEKLLAAVGSIHYLGGNTRTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ E R++ + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------EVKDAAEAARDSKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + E + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSETEDAE-LRAIANKPSSTYVFHVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|328469247|gb|EGF40193.1| hypothetical protein VP10329_10201 [Vibrio parahaemolyticus 10329] Length = 334 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 59/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKE 293 R+G + + TPL+ + V ++ + + T + + + Sbjct: 142 RLGVVLFGDHAYLQ--TPLTADRQTVIQQIKQTVIGLVGQRTAIGDGIGLGTKTFVD--- 196 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-- 351 S ++ +I ++DG N+ L+ ++ E + IY+V V A Sbjct: 197 -------SDAPQRVMILLSDGSNTAGV-----LDPIEAAEIAKKYNATIYTVGVGAGEMM 244 Query: 352 ----------------EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + Q L + + GQ+F D+ +L + ++ I Sbjct: 245 VKDFFMTRKVDTAADLDEQTLTKIAEMTGGQYFRARDAEQLEKIYNTINK 294 >gi|262164788|ref|ZP_06032526.1| protein BatA [Vibrio mimicus VM223] gi|262027168|gb|EEY45835.1| protein BatA [Vibrio mimicus VM223] Length = 318 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 29/154 (18%), Positives = 54/154 (35%), Gaps = 36/154 (23%) Query: 253 TPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TPL+ + V S+LN+ L T + A + + S ++ +I Sbjct: 143 TPLTLDRQTVISQLNQAVLKLIGTQTAIGEGIGLATKTFID----------SDAPQRVMI 192 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PE 352 ++DG N+ L+ L+ + IY+V V A Sbjct: 193 LLSDGSNTAGV-----LDPLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDL 247 Query: 353 GQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + GQ+F + ++L +D I Sbjct: 248 DEKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 281 >gi|111025338|ref|YP_707758.1| hypothetical protein RHA1_ro08556 [Rhodococcus jostii RHA1] gi|110824317|gb|ABG99600.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 326 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 53/160 (33%), Gaps = 17/160 (10%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 G +P++ + + L+ L E T T A+ A + + T Sbjct: 138 GTASMLVSPIT-DHTATDNALDHLQLAERTATGEAIFTALQAIDTLAGVVGGGG--TPPP 194 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------------- 351 ++ +DG+ + + + + G+ I +++ Sbjct: 195 ARIVLESDGKQTVPTDLNDPRGAFTAARLAKEQGVPISTISFGTTHGAIDLNGSHIPVPV 254 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + + L R S G FF + EL S+ + +I ++ Sbjct: 255 DDESLRRIAELSGGSFFTATSADELQASYQNLQQQIGYET 294 >gi|78484419|ref|YP_390344.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362705|gb|ABB40670.1| Type A von Willebrand factor-like [Thiomicrospira crunogena XCL-2] Length = 349 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 39/210 (18%), Positives = 73/210 (34%), Gaps = 50/210 (23%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ + N + K R+G + + +PL+ +LN V++ LN Sbjct: 128 DRLTAVKSVVKNFIQ---------KRQGDRMGLVVFGSQAFLQ--SPLTYDLNTVETLLN 176 Query: 268 KL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + NT A+ A + L+ E K +I +TDG N+ Sbjct: 177 ETEIGMAGNNTAIGDAIGIALKHLHQNSEK----------KAVLILLTDGSNTAG----- 221 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPE--------------GQDLLRKCTD-SSGQFFA 369 + L + + G+KIY++ + L+K + + G+FF Sbjct: 222 AVQPLDAAKQAQEMGLKIYTIGIGQNQATGLDAFIFGPNRNMDTTTLQKIAELTQGRFFM 281 Query: 370 VNDSRELLESFDKITD------KIQEQSVR 393 D+ +L E + I I +R Sbjct: 282 AKDTNQLNEIYQLIDQLEASQHDINNYRLR 311 >gi|327286711|ref|XP_003228073.1| PREDICTED: cartilage matrix protein-like [Anolis carolinensis] Length = 565 Score = 76.9 bits (187), Expect = 6e-12, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 67/183 (36%), Gaps = 23/183 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+S+ + + + +G + Y+ + + ++K+ + + Sbjct: 358 NFELVKKFINQIVDSL-----DVSDRNAHVGLVQYSSSVRQEFPLGQYKDKKDIKAAVRR 412 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ + T T A+ + + + + K I TDG + Sbjct: 413 MSYMEKGTMTGQALQYLVDSSFAISSGAR-----PGVPKVAIVFTDGRSQDYINDS---- 463 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 + + G K+++V V +D L++ +F D R + + K+ + Sbjct: 464 ----AKKAKELGYKMFAVGVG--NAVEDELKEIASEPVAEHYFYTADFRTINQIGKKLQN 517 Query: 386 KIQ 388 KI Sbjct: 518 KIC 520 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 48/159 (30%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + ++ P T T A+ A Sbjct: 143 DVGPNATRVGVVNYASAVKSEFSLKTHRTKASLLQAVRRIEPLSTGTMTGLAIQFAINRA 202 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + K I +TDG + R G++I+++ V Sbjct: 203 FSEGEGARVRVPEIN--KVAIIVTDGRPQD--------AVKDVAARARALGIEIFAIGVG 252 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 LR V +L + F + Sbjct: 253 R--VDMHTLRLIASPPLEEHVDYVESYSVIEKLTKKFQE 289 >gi|301616677|ref|XP_002937788.1| PREDICTED: collagen alpha-6(VI) chain-like, partial [Xenopus (Silurana) tropicalis] Length = 1529 Score = 76.5 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 67/175 (38%), Gaps = 15/175 (8%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTN 276 +V+ + K+ N V+ G + Y+ N ++ + +NK NT Sbjct: 215 NFMVSLVNKSAVGPDN--VQFGALKYSDYNTELFYLNRYTNKVDIINHINKDTTQGGNTY 272 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T A+ + + H + + + + V+ ITDG+ + + + Sbjct: 273 TAGAVRFSKEFFTEK----HGSRKARGVPQIVMVITDGD------SHDKDKLNETARQLE 322 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 G+ IY++ + + L + G++F V + L + ++++ + ++ Sbjct: 323 QEGIIIYAIGI--DQANTNELETLAGTEGKWFMVANFSGLQDILVQVSEAMCNKT 375 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 60/178 (33%), Gaps = 20/178 (11%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PY 272 ++++ ++V I Y+ + S +K+ + + Sbjct: 403 KNFVVSVMDDFN-----VGPVNVHIAVSQYSESCIREINFDYSTERGTLKNEIINIRKTK 457 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 + A+ +Y+ S + + +K+ ++ IT G S A Sbjct: 458 GRRHIGAALDFTKSTVYS---PSSDNRLNQGVKQLLVVITAGNASDQVARP--------A 506 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + +R+ G+ IY+V + + L + T S + + +D L ++ + Sbjct: 507 KALRDRGVDIYAVGIG--NICKTQLTQITGSPEKIYT-DDVSGLKAIKKRLVRDTCSK 561 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 20/156 (12%), Positives = 53/156 (33%), Gaps = 17/156 (10%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN-TYPAMHHAYRELYNEKES 294 V+ + ++ I N + ++ +N N T A+ + + Sbjct: 44 VQFAAVQFSNSIKEEFQFNKHATKNAIWDSIDNMNLMGNVAKTGNALA----NVADYFTE 99 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + S+++ K ++ ITD + + +R+ G+ IYS+ + Sbjct: 100 AKGARPSSKVSKILLLITDNPSQD--------EVKVPADSLRSNGLIIYSIG-GFSAN-K 149 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L++ + ++ +L + + +I Sbjct: 150 KELQEISGKITPYY--QSFDKLQTIENDLLFRICNP 183 >gi|294054315|ref|YP_003547973.1| hypothetical protein Caka_0779 [Coraliomargarita akajimensis DSM 45221] gi|293613648|gb|ADE53803.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 345 Score = 76.5 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 41/226 (18%), Positives = 78/226 (34%), Gaps = 63/226 (27%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 ++D E + K RIG +A+ +PL+ N + +K Sbjct: 111 NIVTRLDAAKEVVQEFIG---------KRPHDRIGLVAFAADA--FVVSPLTLNHDWLKK 159 Query: 265 RLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + +L T A+ + L + VI +TDGEN+ Sbjct: 160 NVQRLELGDINLSGTAIGTALGASVNRL----------RDHESRSRIVILLTDGENNSG- 208 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVS-------------------------------A 349 TL+ L E ++ +K+Y++A Sbjct: 209 ----TLSPLSAAEAAKSLNVKVYTIATGRKGRVEVAEMSRDGRVIRDRNGNPLYRGRSEL 264 Query: 350 PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + LR+ + GQFF ++S +L +D+I D++++ +V + Sbjct: 265 SDYDESELREIAKLTGGQFFKASESGDLENIYDEI-DELEKTTVEL 309 >gi|114778216|ref|ZP_01453088.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] gi|114551463|gb|EAU54018.1| batB protein, putative [Mariprofundus ferrooxydans PV-1] Length = 355 Score = 76.5 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 28/154 (18%), Positives = 55/154 (35%), Gaps = 30/154 (19%) Query: 253 TPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TPL+ + V + L++ + T A+ A + L + +I Sbjct: 166 TPLTFDRKTVITLLDEAAVGLAGKATAIGDAIGLAVKRLEQSNRDKRIASKE----QVLI 221 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PE 352 +TDG N+ L+ Q E G+ IY++ + A Sbjct: 222 LLTDGVNTAGQ-----LSAPQAAELAAEHGLTIYTIGIGADAMTVQSFFGTQRVNPSADL 276 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + +L + G++F +D++EL + + I Sbjct: 277 DEKMLTDIATKTGGRYFRAHDTQELQKIYAMIDK 310 >gi|293349337|ref|XP_001060689.2| PREDICTED: collagen, type XII, alpha 1 [Rattus norvegicus] Length = 3064 Score = 76.5 bits (186), Expect = 6e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 58/166 (34%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ ++ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAIKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E S K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGSRAGF-----PKVAIIITDGKSQD--------EVEIPARELRNIGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 53/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ + Sbjct: 1235 RVQIALAQYSGDPRTEWHLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKT-- 1292 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1293 ---QAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 1339 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L + D +T + Sbjct: 1340 EVELKMIATDPDDIHAYNVADFESLSKIVDDLTINLCNSV 1379 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 63/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ N + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKEFNRVEDIIKAINNFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 N + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPNKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|332254526|ref|XP_003276380.1| PREDICTED: cartilage matrix protein [Nomascus leucogenys] Length = 496 Score = 76.5 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 42/378 (11%), Positives = 113/378 (29%), Gaps = 52/378 (13%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKN 98 + VS + + ST + + G ++ I+ I Sbjct: 96 RAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFSDAEGGRSRSPDISKVVIVVTDG 155 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 P + + A+ LF G+ L ++ + + S ++ +SR Sbjct: 156 RPQDSVQDVSARARASGVELFAIGVGRVDKATLRQIASEPQD-EHVDYVESYSVIEKLSR 214 Query: 159 SMEDLY----LQKHNDNNNMTSNKYLLPPPPKKSF---WSKNTTKSKYAPAPAPANR--- 208 ++ + +++ P + ++ N+ Sbjct: 215 KFQEAFCVVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSAT 274 Query: 209 ---------------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 +++ + +V+++ + + ++G + Y+ + Sbjct: 275 DLVFLIDGSKSVRPENFELVKKFINQIVDTL-----DVSDKLAQVGLVQYSSSVRQEFPL 329 Query: 254 PLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + ++K+ + ++ + T T A+ + + + +K I T Sbjct: 330 GRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFT 384 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAV 370 DG + + ++ G K+++V V +D LR+ +F Sbjct: 385 DGRSQDYIND--------AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYT 434 Query: 371 NDSRELLESFDKITDKIQ 388 D + + + K+ KI Sbjct: 435 ADFKTINQIGKKLQKKIC 452 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 72 DVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKA 131 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 +++ E + S + K VI +TDG + + R +G++++++ V Sbjct: 132 FSDAEGGRSR--SPDISKVVIVVTDGRPQDS--------VQDVSARARASGVELFAIGVG 181 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L F + Sbjct: 182 R--VDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQE 218 >gi|307825379|ref|ZP_07655598.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] gi|307733554|gb|EFO04412.1| von Willebrand factor type A [Methylobacter tundripaludum SV96] Length = 326 Score = 76.5 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 29/188 (15%), Positives = 66/188 (35%), Gaps = 36/188 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENT 275 + + + + + N V G TPL+ + V + LN+ +NT Sbjct: 114 DRLTAAKMVAADFINRRVGDRVGLILFGTQAYLQTPLTFDRKTVMTLLNEAVIGLAGDNT 173 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 A+ A + L + ++ +TDG N+ ++ L+ E Sbjct: 174 AIGDAIGLAVKRL----------KSEQVNSRVLVLMTDGANTAG-----EVSPLKAAELA 218 Query: 336 RNAGMKIYSVAVSAPP------------------EGQDLLRKCTDSSGQFFAVNDSRELL 377 +KIY++ + A + + L++ + GQ++ ++ EL Sbjct: 219 AANHLKIYTIGIGADEMIVRSFFGNRKINPSVDLDEKTLIKIAESTGGQYYRARNTDELN 278 Query: 378 ESFDKITD 385 + ++ + Sbjct: 279 NIYMRLDE 286 >gi|238750905|ref|ZP_04612402.1| hypothetical protein yrohd0001_16570 [Yersinia rohdei ATCC 43380] gi|238710819|gb|EEQ03040.1| hypothetical protein yrohd0001_16570 [Yersinia rohdei ATCC 43380] Length = 520 Score = 76.5 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 60/502 (11%), Positives = 137/502 (27%), Gaps = 117/502 (23%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK---- 57 + + I + +++ + + ++ A++ A L+ + + + Sbjct: 29 FIFFLPIFIGLIFLSFEISCFIQKKAKLSDAMEQATLALTVENNNIPSSEQEVKNNILIS 88 Query: 58 --------------------------DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQI 91 D + I K L + + + ++ I Sbjct: 89 SFAHAYLPEETFSEPVITINSSASHMDYHADITMSYPAKFLNKAFNLISISDIKLDESAI 148 Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 + Y F I S + + + I Sbjct: 149 AKKNTSITAIPTDVVFVTDYSGSMNRDFDGTDIDSTDISKVRIVALRRIFKNLHNEIQQN 208 Query: 152 MVLDVSRSMEDLYLQKH-NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA---PAN 207 +++ + + K DNNN T P K +KY + P Sbjct: 209 ENINLVGFVPFTWGTKRTIDNNNTTPTLLCHFPFVPKKHSPDGNYLTKYNISGLKELPGM 268 Query: 208 RKIDVLIE----SAGNLVN----SIQKAIQEKKNLSVRIGTIAYNIGIVG---------- 249 KI L + G L + SI + I +K N + R + Y I Sbjct: 269 DKIVSLNMINDINYGKLSDKEYISISEEITDKANYTDREKALRYLSRIYYVKLENVISPV 328 Query: 250 -------------------NQCTPLS-------------------NNLNEVK-SRLNKLN 270 PLS NN+++ + S++ Sbjct: 329 IEENIDYEKTIKSINGSDHTIDIPLSDVRVNDFCLRTSMAHTFDSNNMDDSQFSKILNSR 388 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL- 329 T + ++ K ++ ++DG+++ + + Sbjct: 389 ANGGTLISSGILSGNNLFKETNNNNR---------KIMVILSDGDDNDNTHAGDNRINKD 439 Query: 330 -------------QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 +CE +++ +++ +A+ P+ +KC G F+ ++++EL Sbjct: 440 APYLNITKKLIDNGMCERIKDNDIRMVFIAIGYTPDENIDWKKCV-GEGNFYLASNAQEL 498 Query: 377 L-ESFDKITDKIQEQSVRIAPN 397 + + + + R P Sbjct: 499 ELDINRALATE-DTEVGRNTPK 519 >gi|281348096|gb|EFB23680.1| hypothetical protein PANDA_015036 [Ailuropoda melanoleuca] Length = 3047 Score = 76.5 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 152 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLI 211 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 212 KNTFTESAGARAGF-----PKVAIIITDGKSQD--------EVEIPARELRNIGVEVFSL 258 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 259 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 302 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 54/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ + Sbjct: 1219 RVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKT-- 1276 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1277 ---QAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 1323 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 +D L+ + V D L + D +T + Sbjct: 1324 EDELKMIATDPDDTHAYNVADFESLSKIVDDLTINLCNSV 1363 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 457 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFV 516 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + S + K +I ITDG++S A +RN+ ++I++V V Sbjct: 517 PSKGSRG-----NVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 561 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 562 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 604 >gi|126341668|ref|XP_001379927.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2568 Score = 76.5 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 60/166 (36%), Gaps = 12/166 (7%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + V+ G + Y+ N + +L NT T A++ ++ Sbjct: 785 DVGRDRVQFGALKYSDTPEILFYLNDYQNKKSIIDKLKFQRKGGNTYTAKALNRSHELFT 844 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + H + +K+ +I ITDG++ L + + +R + IY V Sbjct: 845 EQ----HGSRIKRGVKQMLIVITDGKSHDY------LELETVGKALRAKNIIIY--GVGV 892 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 L T S +F V++ +L E + I K E ++ + Sbjct: 893 AEASDKELLDITGSKDNYFMVDNFEKLKEIYLPIEKKACENALDVC 938 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 56/157 (35%), Gaps = 16/157 (10%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNEKES 294 VR G + Y+ +N+ ++ + + +T NT A+ E+ + K+ Sbjct: 419 VRFGAVQYSSTHQQEFGISTYSNVVDLSKAILNIRQLGHTTNTGAALRFMLEEIKSSKKK 478 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 H + ++ +TDG ++ + + +R + I+++ V Sbjct: 479 QH-----DSVPCHLLVVTDGMSND--------EVSEPAKRLREEDVIIHAIGV--KGANI 523 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L+ S + V + L ++I I + Sbjct: 524 IELKDIAGSEDRVKFVYNFESLKGIKNEIVQTICRER 560 >gi|148657120|ref|YP_001277325.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569230|gb|ABQ91375.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 774 Score = 76.5 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 30/342 (8%), Positives = 88/342 (25%), Gaps = 28/342 (8%) Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIP 125 + D+ Q N+ + + P ++ FL Sbjct: 167 SMAGNKMVAAREASRQFVDLMQAGDGIGIVGFNDGVVMPLNLTTILDSPPQSAFLFS-DD 225 Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP-- 183 + +S + S + + + + P Sbjct: 226 MESGADQWNAQASWGLTSTARNSAWAWTDSPSGNYANNTNSALTIATPIAIPASMSNPAL 285 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRK---------IDVLIESAGNLVNSIQKAIQEKKNL 234 + + + + + L + + I+ + L Sbjct: 286 MFWHRYDIASPDRGEVEISTNNGATWQRLRLFTGTATSWRREVIRLDDYRGQTIRMRFRL 345 Query: 235 SVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 + I P + ++ ++ LN T+ + + R L Sbjct: 346 VTDAFGVRDGWYIDDVTIGPEWDDVRARAQAAIDTLNSRGATSIGGGLQSSQRMLDTA-- 403 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + L + +I ++DG+ + + + +R A ++++ + + Sbjct: 404 -------NPDLPRVIILLSDGQENTRPFVAD------VLPQIRAAQTTVHTIGLGRDADQ 450 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 Q +L + G + +L ++ I+ + + + Sbjct: 451 QLMLSIAAQTGGTYNYAPTPEQLSGIYNTISGAVSNRQTLVT 492 >gi|11498366|ref|NP_069594.1| hypothetical protein AF0760 [Archaeoglobus fulgidus DSM 4304] gi|2649856|gb|AAB90485.1| predicted coding region AF_0760 [Archaeoglobus fulgidus DSM 4304] Length = 959 Score = 76.5 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 36/291 (12%), Positives = 80/291 (27%), Gaps = 41/291 (14%) Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT---TKSK 198 VL +N P P + TK Sbjct: 540 QWVVGPSEKTYVLKRYYDGNYYGWYYAWNNYGYDFYSDFFVPNPTPGTYKIIIVPLTKES 599 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 P +++D +A N + + ++ + Sbjct: 600 IVIEPKVVMKRMDAAKLAAITFNNMLGEGDFVGLATFTTYAERISVNQTPLKY---MTKD 656 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELY--NEKESSHNTIGSTRLKKFVIFITDGEN 316 V + + L T+ A++ + ++ ++ T + +I +TDGE Sbjct: 657 KLRVNNEIEGLYAKLATDHADALYWGVKVFPIWPDETQNNCTECINNTRPLMILLTDGET 716 Query: 317 SG------------------ASAYQNTLNTLQICEYMRNA------GMKIYSVAVSAP-- 350 + + + L + +Y++ + I ++ Sbjct: 717 TTCDKNEDYFNNTCKNKCVRDNGHYGAQQALCVADYIKRNIKVNGFNIPICTIGFGTDIG 776 Query: 351 PEGQDLLRKCTD-----SSGQFFAVNDSRELLESFDKITD--KIQEQSVRI 394 +GQ LR +F S EL+E++ I + +I +++ I Sbjct: 777 SDGQAFLRDIASPRPDNGEACYFFATTSEELIEAYKTIFNIFQIAAKNISI 827 >gi|309266594|ref|XP_003086799.1| PREDICTED: collagen alpha-5(VI) chain-like [Mus musculus] Length = 2601 Score = 76.5 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 40/368 (10%), Positives = 98/368 (26%), Gaps = 30/368 (8%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKH---LKQGSYIRENAGDI 85 + S D A ++++ +K T K+G + Sbjct: 267 LMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAIEQMRKEGFSESSGSRKA 326 Query: 86 AQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS--LRSTGIIERSS 143 QI + + E+ + +F G+ + T L + S+ Sbjct: 327 QGVPQIAVLVTHRASDDVVREAALDLRLEGVTMFAMGIEGANNTQLEDIVSYPSRQSIST 386 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 + + + + + Sbjct: 387 HSSYSHLESYSGNFLKKIHNEIWTQVST-RAEQMELDKTGCVDTKEADIYFLIDGSSSIR 445 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 +I V + S ++ VR+G + Y+ + ++K Sbjct: 446 KKEFEQIQVFMSSVIDMFP--------IGPNKVRVGVVQYSHKNEVEFPVSRYTDGIDLK 497 Query: 264 SRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + T T A+ + K R ++I +TDG+++ + Sbjct: 498 KAVFNIKQLKGLTFTGKALDFILPLIKKGKTERT-----DRAPCYLIVLTDGKSNDS--- 549 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 L+ +R + I+++ + + LR+ + + L ++ Sbjct: 550 -----VLEPANRLRAEQITIHAIGIG--EANKTQLRQIAGKDERVNFGQNFDSLKSIKNE 602 Query: 383 ITDKIQEQ 390 I +I + Sbjct: 603 IVHRICSE 610 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 63/161 (39%), Gaps = 13/161 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYREL 288 + V+IG + Y+ ++ + + L + + T T A+ H+ Sbjct: 836 DIGRDRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHSNILF 895 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E H + + +++ +I ITDG S ++ L+ + +R+ G+ I++V V Sbjct: 896 TEE----HGSRLTQNVRQLMIVITDGV----SHDRDKLD--EAARELRDKGITIFAVGVG 945 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 QD L V++ +L + + + + + Sbjct: 946 --NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 984 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 61/193 (31%), Gaps = 22/193 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + NLV IQ ++G + ++ + Sbjct: 624 VDSSGSIGPTNFETMKTFMKNLVGKIQ-----IGADRSQVGVVQFSDYNREEFQLNKYST 678 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 E+ + +++++P NT T A+ K KF+I +TDG+ Sbjct: 679 HEEIYAAIDRMSPINRNTLTGGALTFVNEYFDLSKGGRPQVR------KFLILLTDGKAQ 732 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 +R+ + I+SV V + L + + F V + L Sbjct: 733 DEVGGP--------AMALRSKSVTIFSVGV--YGANRAQLEEISGDGSLVFHVENFDHLK 782 Query: 378 ESFDKITDKIQEQ 390 K+ ++ Sbjct: 783 AIESKLIFRVCAL 795 >gi|148689167|gb|EDL21114.1| mCG140659 [Mus musculus] Length = 1670 Score = 76.5 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 40/368 (10%), Positives = 98/368 (26%), Gaps = 30/368 (8%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKH---LKQGSYIRENAGDI 85 + S D A ++++ +K T K+G + Sbjct: 55 LMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAIEQMRKEGFSESSGSRKA 114 Query: 86 AQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS--LRSTGIIERSS 143 QI + + E+ + +F G+ + T L + S+ Sbjct: 115 QGVPQIAVLVTHRASDDVVREAALDLRLEGVTMFAMGIEGANNTQLEDIVSYPSRQSIST 174 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 + + + + + Sbjct: 175 HSSYSHLESYSGNFLKKIHNEIWTQVST-RAEQMELDKTGCVDTKEADIYFLIDGSSSIR 233 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 +I V + S ++ VR+G + Y+ + ++K Sbjct: 234 KKEFEQIQVFMSSVIDMFP--------IGPNKVRVGVVQYSHKNEVEFPVSRYTDGIDLK 285 Query: 264 SRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + T T A+ + K R ++I +TDG+++ + Sbjct: 286 KAVFNIKQLKGLTFTGKALDFILPLIKKGKTERT-----DRAPCYLIVLTDGKSNDS--- 337 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 L+ +R + I+++ + + LR+ + + L ++ Sbjct: 338 -----VLEPANRLRAEQITIHAIGIG--EANKTQLRQIAGKDERVNFGQNFDSLKSIKNE 390 Query: 383 ITDKIQEQ 390 I +I + Sbjct: 391 IVHRICSE 398 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 63/161 (39%), Gaps = 13/161 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYREL 288 + V+IG + Y+ ++ + + L + + T T A+ H+ Sbjct: 624 DIGRDRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHSNILF 683 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E H + + +++ +I ITDG S ++ L+ + +R+ G+ I++V V Sbjct: 684 TEE----HGSRLTQNVRQLMIVITDGV----SHDRDKLD--EAARELRDKGITIFAVGVG 733 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 QD L V++ +L + + + + + Sbjct: 734 --NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 772 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 61/193 (31%), Gaps = 22/193 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + NLV IQ ++G + ++ + Sbjct: 412 VDSSGSIGPTNFETMKTFMKNLVGKIQ-----IGADRSQVGVVQFSDYNREEFQLNKYST 466 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 E+ + +++++P NT T A+ K KF+I +TDG+ Sbjct: 467 HEEIYAAIDRMSPINRNTLTGGALTFVNEYFDLSKGGRPQVR------KFLILLTDGKAQ 520 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 +R+ + I+SV V + L + + F V + L Sbjct: 521 DEVGGP--------AMALRSKSVTIFSVGV--YGANRAQLEEISGDGSLVFHVENFDHLK 570 Query: 378 ESFDKITDKIQEQ 390 K+ ++ Sbjct: 571 AIESKLIFRVCAL 583 >gi|118353828|ref|XP_001010179.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291946|gb|EAR89934.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 511 Score = 76.5 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 53/149 (35%), Gaps = 14/149 (9%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 + R+ I +N +NN N+ K +NK+ T+ M A+R L + Sbjct: 121 DNDRLCLILFNSKATKLCHLMRTNNSNKPAFKEIINKIEANGGTDINSGMELAFRVLKD- 179 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 V ++DG++ + + ++ I+S + Sbjct: 180 -------RKYHNPVSSVFLLSDGQD----GSADLKVRQSLERHLPQECFTIHSFGFGSDH 228 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESF 380 +G + + C+ G F+ V ++ E F Sbjct: 229 DGPLMNKICSLKDGNFYYVEKINQVDEFF 257 >gi|310817544|ref|YP_003949902.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309390616|gb|ADO68075.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 470 Score = 76.5 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 67/191 (35%), Gaps = 20/191 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLN 267 K+D +A + + + R+ + Y + T + ++ + +N Sbjct: 112 KLDDAKRAAREFITRVSEED--------RVALVHYGTDVTVFPSTLATPETREQMLTFVN 163 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + +TN + A ++L + + +I ++DG+ + Sbjct: 164 AIEDEGSTNISGGLEAAAQQLQKNADQFRVSR--------IILLSDGQPTAGL--TREEQ 213 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + +R+ GM + ++ V ++L++ D G F S +L E F + ++ Sbjct: 214 LTALARNLRSQGMAVSALGVG-EDFNENLMQGIADQGGGFSGFLRSDQLAEVFTRELEQA 272 Query: 388 QEQSVRIAPNR 398 R R Sbjct: 273 TSTVARAVEVR 283 >gi|153876525|ref|ZP_02003802.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152067011|gb|EDN66198.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 180 Score = 76.5 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 57/158 (36%), Gaps = 25/158 (15%) Query: 242 AYNIGIVGNQCTPLSNNL--NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + L N + ++ + + +T A+ A ++L E S Sbjct: 1 MFADHAYLQAPLTLDNLAVQSLLQKAVIGM-AGRDTAIGDAIGLAVKKLRERPEGS---- 55 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------G 353 + +I +TDGEN+ L LQ E + ++IY++ V Sbjct: 56 ------RILILLTDGENNAG-----ALKPLQAAELAKQYDIRIYTIGVGGKGGMFSRGLN 104 Query: 354 QDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L+K ++G +F + L ++ I +Q+ Sbjct: 105 ETELKKIAQLTNGAYFPATNLGALNNVYEHIDKTLQKT 142 >gi|198435715|ref|XP_002125840.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 3908 Score = 76.5 bits (186), Expect = 7e-12, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 58/171 (33%), Gaps = 21/171 (12%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA-Y 285 E + R + Y+ + +V + ++ + NTNT A+ + Y Sbjct: 3454 PFEVGWDNSRFAVVQYSDDPRTEFLMNEHFTVTDVLNAIDAIPYKGGNTNTGKALAFSLY 3513 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 L + + K + +TDG + +R AG+K+ +V Sbjct: 3514 TALSPANGARP------YVNKVALVLTDGRSQDEVGNP--------ARELRQAGVKVLTV 3559 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE-QSVR 393 V ++ L+ + V+D + E + K+ E + +R Sbjct: 3560 GVG--DADKNELKSIASPPYDSSVYHVSDYDSISEIKAHLAAKLCEGEVLR 3608 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 37/334 (11%), Positives = 96/334 (28%), Gaps = 25/334 (7%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S +K H G + + + + N + + + + Sbjct: 270 SGALLGFVKTHPLPGQDPSDIQRVLVESSNDTNLNVDLNGGVVTSGGRVTTSHGGSSSWS 329 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + + ++ + + + + V + + +D ++ T Sbjct: 330 GTSVSTVGSSSRRVVSTSGLALTSSGGANEFDVSPSVGGSSAMGAGRTSDIDSDTQTGRF 389 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + K +KI LV+ + N V +G Sbjct: 390 VQDCKKAQKTDLVVLTDGSWSVGPQNFKKIQA------FLVSLVDAFSIGFNN--VLMGY 441 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTI 299 Y+ +++ +N++ NT T A+ + L+ + + Sbjct: 442 AQYSDDARTEFNLNEHVTKDDLIRAINQVQYKGGNTATGGALDYIRTNLFTSEGGTRR-- 499 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + K I ITDGE + + ++ G++++S+ V + L Sbjct: 500 ---GVLKTAIVITDGE-------SILDDVTEPARMLKEIGVEVFSIGV--AAALRSELED 547 Query: 360 CTDSS--GQFFAVNDSRELLESFDKITDKIQEQS 391 S F+V++ ++ + + + + Sbjct: 548 IASSPASDHVFSVDNFDDIKNIKNILLKETCKAV 581 >gi|90414549|ref|ZP_01222523.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] gi|90324356|gb|EAS40922.1| hypothetical protein P3TCK_02206 [Photobacterium profundum 3TCK] Length = 321 Score = 76.5 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 56/154 (36%), Gaps = 36/154 (23%) Query: 253 TPLSNNLNEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TPL+ + N VK +L + L +T + A + S ++ +I Sbjct: 143 TPLTFDRNTVKQQLERTVLGLIGQSTAIGEGLGIATKTFI----------NSEAPQRVII 192 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PE 352 ++DG N+ + L+ + + + IY+V V A Sbjct: 193 LLSDGANTAGV-----IEPLEAAKLAAESNVTIYTVGVGAEEMIQKSFFGNRKVNPSQDL 247 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + +L K D + GQ+F + +EL + I Sbjct: 248 DERMLTKIADMTGGQYFRARNPQELEHIYQLIDQ 281 >gi|54303502|ref|YP_133495.1| hypothetical protein PBPRB1845 [Photobacterium profundum SS9] gi|46916932|emb|CAG23695.1| conserved hypothetical protein [Photobacterium profundum SS9] Length = 321 Score = 76.5 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 30/154 (19%), Positives = 56/154 (36%), Gaps = 36/154 (23%) Query: 253 TPLSNNLNEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TPL+ + N VK +L + L +T + A + S ++ +I Sbjct: 143 TPLTFDRNTVKQQLERTVLGLIGQSTAIGEGLGIATKTFI----------NSEAPQRVII 192 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PE 352 ++DG N+ + L+ + + + IY+V V A Sbjct: 193 LLSDGANTAGV-----IEPLEAAKLAAESNVTIYTVGVGAEEMIQKSFFGNRKVNPSQDL 247 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + +L K D + GQ+F + +EL + I Sbjct: 248 DERMLTKIADMTGGQYFRARNPQELEHIYQLIDQ 281 >gi|291295701|ref|YP_003507099.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470660|gb|ADD28079.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 298 Score = 76.5 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 15/143 (10%), Positives = 52/143 (36%), Gaps = 16/143 (11%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 G P + + ++ ++ L+ + + A L Sbjct: 132 SGYGTLLLPPTTDRKAIRQAIDNLDLGGGFSFTYGLLAALEALPQTPPEGSR-------P 184 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--GQDLLRKCTD- 362 ++ + G + + + L+I + G++++++ V +++L+K D Sbjct: 185 GVIVLFSHGHDVSGN------DPLKIADQALERGIQVHAIGVGTHGHNFDEEMLKKVADR 238 Query: 363 SSGQFFAVNDSRELLESFDKITD 385 + G+++ + + +L ++ + Sbjct: 239 TGGRYYPIFSASDLSKAHADLGR 261 >gi|42524204|ref|NP_969584.1| hypothetical protein Bd2794 [Bdellovibrio bacteriovorus HD100] gi|39576412|emb|CAE80577.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100] Length = 336 Score = 76.5 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 68/212 (32%), Gaps = 50/212 (23%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----- 273 N + + ++ I + + P + + + R+N+++ Sbjct: 109 NRLEAAKETIAKFISARTSDRIGLVVFAGESFTMVPPTLDYQMILQRVNEISSASSAKIK 168 Query: 274 -NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T AM +A L S + +IF+TDGEN+ T++ Sbjct: 169 DGTALGVAMANAAGRL----------KDSQARSRVMIFMTDGENNSG-----TIDPETGL 213 Query: 333 EYMRNAGMKIYSVAVSAPPE-------------------------GQDLLRKCTD-SSGQ 366 E + G+K+YS+ + +DLL + + G+ Sbjct: 214 EIAKGYGIKVYSIGIGKDGPTRIPVYSRDIFGQKVKTYQPFESTVNEDLLGRMASDTGGK 273 Query: 367 FFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 ++ L + F I + +I N+ Sbjct: 274 YYRATTEGALQKVFSDIDTL---EKTKIDVNK 302 >gi|114762302|ref|ZP_01441760.1| von Willebrand factor type A domain protein [Pelagibaca bermudensis HTCC2601] gi|114544920|gb|EAU47924.1| von Willebrand factor type A domain protein [Roseovarius sp. HTCC2601] Length = 335 Score = 76.5 bits (186), Expect = 8e-12, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 65/169 (38%), Gaps = 27/169 (15%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKE 293 R+ I + PL+++L + + L++ +T A+ + R + Sbjct: 140 RMALIVFGTSAYLQA--PLTDDLETIIALLDRTEVGMAGPHTALGDAIGLSIRTFETSEI 197 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 ++ +I ++DG ++ + + E + G++IY++ V P Sbjct: 198 D----------QRLLILLSDGSDTASRMS-----PVNAAEIAADRGVEIYTIGVGDPDAT 242 Query: 353 -----GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 L++ + GQ+F D+ L +D+I + ++ ++ Sbjct: 243 GENRVDLTTLKEVAQRTGGQYFFAEDAASLEAVYDRIDELAPRETETLS 291 >gi|56797869|emb|CAG27568.1| matrilin-4 [Danio rerio] Length = 685 Score = 76.1 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 64/174 (36%), Gaps = 20/174 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+ + + R+G + Y+ + + ++ +E+K Sbjct: 468 NFELVKQFVNQVVDQL-----DVSAKGTRVGLVQYSSRVRTEFPLSMYHSKDEIKKAEMN 522 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + T T A+ H ++E E + + + + TDG + + Sbjct: 523 VEYMEKGTMTGLALKHMVENSFSEAEGARPAEKNI--PRVGLVFTDGRSQD--------D 572 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 + + + AG+ +Y+V V +D LR+ FF D + + Sbjct: 573 IQEWAKKAKEAGITMYAVGVGKAV--EDELREIASDPVEKHFFYSADFTAISQI 624 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 48/147 (32%), Gaps = 18/147 (12%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKES 294 RIG + Y+ + + ++ +N++ P T T A+ +A ++ +E Sbjct: 59 TRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEEG 118 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + + +TDG A +G++IY+V V Sbjct: 119 AR-----PNVPHVAVIVTDGRPQDRVAEVAAAARE--------SGIEIYAVGV--ARADM 163 Query: 355 DLLRKCTDSS--GQFFAVNDSRELLES 379 LR F V + + Sbjct: 164 TSLRAMASPPFEDHVFLVESFDLIHQF 190 >gi|296484270|gb|DAA26385.1| collagen, type XII, alpha 1 [Bos taurus] Length = 3115 Score = 76.1 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 58/166 (34%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQQEELLAAIKKIPYKGGNTMTGEAIDYLI 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRNIGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 53/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ + Sbjct: 1235 RVQIALAQYSGDPRTEWQLNAHKDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKT-- 1292 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1293 ---QAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 1339 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T+ + Sbjct: 1340 EVELKMIATDPDDIHAYNVADFDSLSRIVDDLTNNLCNSV 1379 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|332808244|ref|XP_001147912.2| PREDICTED: cartilage matrix protein [Pan troglodytes] Length = 717 Score = 76.1 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 42/378 (11%), Positives = 112/378 (29%), Gaps = 52/378 (13%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKN 98 + VS + + ST + + G ++ I+ I Sbjct: 96 RAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGHSRSPDISKVVIVVTDG 155 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 P + + A+ LF G L ++ + + S ++ +SR Sbjct: 156 RPHDSVQDVSARARASGVELFAIGFGRVDKATLRQIASEPQD-EHVDYVESYSVIEKLSR 214 Query: 159 SMEDLY----LQKHNDNNNMTSNKYLLPPPPKKSF---WSKNTTKSKYAPAPAPANR--- 208 ++ + +++ P + ++ N+ Sbjct: 215 KFQEAFCVVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSAT 274 Query: 209 ---------------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 +++ + +V+++ + + ++G + Y+ + Sbjct: 275 DLVFLIDGSKSVRPENFELVKKFINQIVDTL-----DVSDKLAQVGLVQYSSSVRQEFPL 329 Query: 254 PLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + ++K+ + ++ + T T A+ + + + +K I T Sbjct: 330 GRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFT 384 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAV 370 DG + + ++ G K+++V V +D LR+ +F Sbjct: 385 DGRSQDYIND--------AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYT 434 Query: 371 NDSRELLESFDKITDKIQ 388 D + + + K+ KI Sbjct: 435 ADFKTINQIGKKLQKKIC 452 Score = 63.4 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 54/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 72 DVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKA 131 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + E H+ S + K VI +TDG + + R +G++++++ Sbjct: 132 FGDAEGGHSR--SPDISKVVIVVTDGRPHDS--------VQDVSARARASGVELFAIGFG 181 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L F + Sbjct: 182 R--VDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQE 218 >gi|90021389|ref|YP_527216.1| BatB protein [Saccharophagus degradans 2-40] gi|89950989|gb|ABD81004.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 341 Score = 76.1 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 32/201 (15%), Positives = 72/201 (35%), Gaps = 50/201 (24%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-- 266 +I V+ G+ + + R+G + + +PL+ + VK L Sbjct: 117 RIAVVKHIVGDFIER---------RVGDRLGLVLFGTSAYLQ--SPLTFDRTTVKQLLVE 165 Query: 267 NKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +++ NT A+ + + L + VI +TDG+N+ Sbjct: 166 SQIGFAGPNTAIGDAIGLSIKRL----------RDRPAENRVVILLTDGQNTAG-----E 210 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAP--------------------PEGQDLLRKCT-DSS 364 ++ Q + + +G+K+Y++ V A +D L + Sbjct: 211 VSPRQAADLAKQSGVKVYTIGVGANEMIVSDGFFGNFQRKINPSRDLDEDTLTYIAETTG 270 Query: 365 GQFFAVNDSRELLESFDKITD 385 G++F + +EL + + + + Sbjct: 271 GRYFRAHSPQELNQIYQLLDE 291 >gi|73973308|ref|XP_539002.2| PREDICTED: similar to alpha 1 type XII collagen long isoform precursor isoform 1 [Canis familiaris] Length = 3065 Score = 76.1 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRTDELLAAIKKIPYKGGNTMTGDAIDYLI 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRNIGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 53/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ Sbjct: 1235 RVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ-----N 1289 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1290 FRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 1339 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 +D L+ + V D L + D +T + Sbjct: 1340 EDELKMIATDPDDTHAYNVADFESLSKIVDDLTINLCNSV 1379 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFV 532 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + S + K +I ITDG++S A +RN+ ++I++V V Sbjct: 533 PSKGSRG-----NVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|22299719|ref|NP_682966.1| hypothetical protein tlr2176 [Thermosynechococcus elongatus BP-1] gi|22295903|dbj|BAC09728.1| tlr2176 [Thermosynechococcus elongatus BP-1] Length = 415 Score = 76.1 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 67/181 (37%), Gaps = 23/181 (12%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + ++ ++A +LV+ + + R+ IA++ + +K+++ L Sbjct: 56 LAMVKQAAASLVDRLLPSD--------RLSVIAFDHKAKVLVPNQTVWDKEAIKAQIATL 107 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 P T M +E+ K+ + + +TDGE + + + L Sbjct: 108 EPGGGTAIDEGMKLGLKEIAAGKQGTI---------SQIFLLTDGE----NEHGDNQRCL 154 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ 388 ++ + + + ++ QD+L + D + G+ + + + + F + I Sbjct: 155 ELAKLAAEYNITLNALGFGV-HWNQDVLEQIADAAGGRLVFIEYAEQAIACFQSLFSHIS 213 Query: 389 E 389 Sbjct: 214 S 214 >gi|126341670|ref|XP_001379945.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 2439 Score = 76.1 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 32/188 (17%), Positives = 65/188 (34%), Gaps = 24/188 (12%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + + L+++ + VR G + Y+ +++ + Sbjct: 631 SGSINHDDFAEMKTFMIELISTFR-----VGADHVRFGVVQYSDSPTVEFDIRQHSSVAQ 685 Query: 262 VKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +KS + K+ T T A+ R ++ +F+I ITDG++ Sbjct: 686 LKSAITKIWQTGGGTRTGEALTFMKRLFSEVAR--------DKVLRFLIVITDGQSQD-- 735 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 Q E +R + IY++ V L + + S + F VND L Sbjct: 736 ------QVAQAAEELRQENITIYAIGV--KSAVTKELLEISGSQNRMFFVNDFDSLKPIQ 787 Query: 381 DKITDKIQ 388 ++ I Sbjct: 788 QEVIQDIC 795 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 29/161 (18%), Positives = 54/161 (33%), Gaps = 16/161 (9%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 VR+G +N I ++V ++ L T T A++ E + E Sbjct: 52 DQVRVGLAQFNDNIYKAFLLNQFPRKSDVLEQILSLPYRTGGTRTGSALNFLRTEFFTES 111 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 S + + VI +TDGE++ + ++ G+ IY V + + Sbjct: 112 AGSRAK---DNVPQIVILVTDGESND--------EVAEAASKLKGQGVSIYVVGI--NVQ 158 Query: 353 GQDLLRKCTDSS-GQF-FAVNDSRELLESFDKITDKIQEQS 391 L+ +F F++ D L I + Sbjct: 159 DVQELKTIASKPLEKFLFSIEDFNILEGLSGNILPTLCSAV 199 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 25/175 (14%), Positives = 63/175 (36%), Gaps = 16/175 (9%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYE 273 + + I + VRIG Y+ + E+ ++K+ Sbjct: 1203 TAMKTFMKQIVNS-FTIGKDRVRIGVAQYSTNPQKEFYLNTFYSGAEINQHIDKITQLRT 1261 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T + + + E ++ + + + + ++ ITDG ++ + ++ Sbjct: 1262 QTYTGKGLRF----VKSFFEPANGSRKNLHVLQSLVVITDGMSNDS--------VVEAAN 1309 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 +RN ++I+S+ + L+ + + F V D +L K+ ++ Sbjct: 1310 DLRNEKIQIFSIGIGV--INLFELQLIAGNVKRVFVVGDFGQLGSIERKVVRELC 1362 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 63/181 (34%), Gaps = 17/181 (9%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN- 274 + L I I VR+G + Y+ + +E+ S LNKL Sbjct: 434 AVELLRKIIHTLIIGPNK--VRVGLVLYSDEPRLEFGLNTFLSQSEILSHLNKLPFIGGK 491 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T T A+ ++ +++ S +++ + IT+G + Sbjct: 492 TKTGAALDFLRNTVFTQQKGSRYR---QGVQQLAVVITEGYSQDEVDRP--------ASL 540 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSV 392 +R AG+ +++V + G L K + +L +KI ++ + V Sbjct: 541 LRRAGVTVFAVG-TLKASGSRDLNKIASHPPRKHAIYLESFLQLSVITEKIKKRVCTEIV 599 Query: 393 R 393 + Sbjct: 600 Q 600 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 61/169 (36%), Gaps = 11/169 (6%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELY 289 +VR G I+Y+ +++ + L P +T A+ A Sbjct: 1025 VGKDNVRFGAISYSDNSEVLFSLDTYITKAQIRDAVFHLKPKVGKAHTATALKFAKERFS 1084 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 H S + + ++ IT + + + + + ++ AG+ ++++ + Sbjct: 1085 E----MHGGRQSLAVTQILVLIT----NKPTESEEKKYLQESAQTLQEAGIDVFAIGI-- 1134 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + L+ T + F V EL +K+T I +S I +R Sbjct: 1135 KNVKRPELQAITKHRDRSFMVQSYNELYNLHEKVTHIICNESKPICGHR 1183 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 19/167 (11%), Positives = 52/167 (31%), Gaps = 19/167 (11%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNT 277 ++ + K+ V+IG + ++ + ++ + + Sbjct: 828 FMMQMVNKSDLGP--EKVQIGLLQFSSNPQEEFRLNTYYSKVDILRAITGMVQIRAGARV 885 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ + + + +++I I G+ + +R+ Sbjct: 886 GSALSFSLPYFERSRGGR------LNVPQYLIIIISGKTGD--------AVKMPAKALRD 931 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 G+KI+++ V LL T + + + + LL +I Sbjct: 932 KGIKIFAIGV-HKANNSQLLE-ITGAQDKVYYEENFDSLLFLEKQIF 976 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 51/167 (30%), Gaps = 21/167 (12%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNP 271 + ++++++ +IG Y N E+ S + N L Sbjct: 232 KDFICRVIDTLE-----VGRDKDQIGLAQYGNQGHVEFLLNAYQNPVEMISHIQQNFLPR 286 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + + + E+ S + ++ + IT G+ + L+ Sbjct: 287 GGARKTGNGLQYIQETFFQEEAGSRFL---QGIPQYAVVITSGQ--------SEDLVLEK 335 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + ++ G+KI V + L+ F + + + Sbjct: 336 AQKLKERGVKIMVVGIQ--DFDSRELKAMATPP-LVFEIEGQDGIRQ 379 >gi|224048603|ref|XP_002193071.1| PREDICTED: collagen, type XXI, alpha 1 [Taeniopygia guttata] Length = 945 Score = 76.1 bits (185), Expect = 8e-12, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 61/176 (34%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + +++G + Y+ V + + + ++ NT T A+ A Sbjct: 64 TRNFDIGPKFIQVGVVQYSDYPVLEIPLGTHESTENLIREMESIHYLGGNTRTGRAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + L+ S L K + +TDG++ + R + +++ Sbjct: 124 FDHLF--------AKSSRFLTKIAVVLTDGKSQD--------EVKDVAAEARKNKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + +D L+ + F V D + + I K+ E+SV P R Sbjct: 168 IGVGSEI-EEDELKAIANKPSSTYVFYVEDYIAISRIKEVIKQKLCEESV--CPTR 220 >gi|291396486|ref|XP_002714579.1| PREDICTED: collagen, type XII, alpha 1 [Oryctolagus cuniculus] Length = 3117 Score = 76.1 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + ++ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKRIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFLESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRNIGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 53/165 (32%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+I Y+ + + + L NT T A++ ++ Sbjct: 1230 DIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ- 1288 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1289 ----NFKTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI- 1335 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L + D +T + Sbjct: 1336 -KNADEVELKMIATDPDDTHAYNVADFDSLSKIVDDLTINLCNSV 1379 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|119569133|gb|EAW48748.1| collagen, type XII, alpha 1, isoform CRA_a [Homo sapiens] Length = 821 Score = 76.1 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARVGF-----PKVAIIITDGKSQDEVEIP--------ARELRNVGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|327282764|ref|XP_003226112.1| PREDICTED: collagen alpha-1(XXI) chain-like [Anolis carolinensis] Length = 956 Score = 76.1 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 62/176 (35%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + NT T A+ A Sbjct: 64 TSNFNIGPKFIQVGVVQYSDYPVLEIPLGFHDSNENLVRGMEYIQYLGGNTQTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ + ++ R + +++ Sbjct: 124 LDHLF--------AKSSRFLTKIAVVLTDGKSQD--------DVKEVAAEARKNRITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + +D LR + F V D + + I K+ E+SV P R Sbjct: 168 IGVGSE-TEEDELRAIANKPSSTYVFYVEDYIAISRIREVIKQKLCEESV--CPTR 220 >gi|307352559|ref|YP_003893610.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307155792|gb|ADN35172.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 1022 Score = 76.1 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 32/354 (9%), Positives = 90/354 (25%), Gaps = 29/354 (8%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + + +L+ S + + G + + Y + A ++ + I Sbjct: 423 SQLAEPYLESTSALSDQDGLAVVNFVPGTFETDWDATDYSKLANASCQVRAQWNTTVRQI 482 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 N S V D + Sbjct: 483 DLEWKNYPYLSVKTNVTPETVAVNDTVTVTIQLIGDGWALQPDPIDVMLTADRSGSMLRD 542 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI-QEKKNLSVRIGTIAY 243 S + ++ + + I + Y Sbjct: 543 YPDRMVSLMDALEDFGIEMKEGWDRLGLASFGTYGNADIIDYGNRYWAGYDNSYYDDWEY 602 Query: 244 ----------NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 N L+ + ++ + + L P T ++++ + L + Sbjct: 603 ISEHYAGNDKNYNDYATIDLNLTEDFSDYNTEVKALVPDGGTPMRKGLYYSIKHLRDNGR 662 Query: 294 ---------------SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 + + + +D + + + + + ++ Sbjct: 663 DDAVKAVVVLSDGDYNYYGDPLARGSGGTKWDWSDMQEKYYTFSDLNSSEQDMRIFAKDN 722 Query: 339 GMKIYSVAV--SAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 +KI+S+A EG+ +L+ + G+++ +L E ++ I +++E Sbjct: 723 DIKIFSIAYADGISSEGKAVLQALAEGTGGKYYYAPSGEDLEEIYEDIAGELKE 776 >gi|255535987|ref|YP_003096358.1| aerotolerance operon BatA [Flavobacteriaceae bacterium 3519-10] gi|255342183|gb|ACU08296.1| BatA (Bacteroides aerotolerance operon) [Flavobacteriaceae bacterium 3519-10] Length = 334 Score = 76.1 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 55/158 (34%), Gaps = 40/158 (25%) Query: 254 PLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P++++ + L LNP T + A L S K +I Sbjct: 147 PVTSDHAVLLEELENLNPLELQPGTAIGEGLSVAVSHL----------RHSKAKSKIIIL 196 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------------ 352 +TDG N+ +A + + ++ +++YS+ + Sbjct: 197 MTDGVNTIENAMPAQVG----AQLAKSNDIRVYSIGIGTNGYALMPTQTDIFGDLVFTEV 252 Query: 353 ----GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + +LR+ + G++F ++ L E +++I Sbjct: 253 EVKIDEPVLREIAQTTGGKYFRATSNQSLEEVYEEINQ 290 >gi|255037594|ref|YP_003088215.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254950350|gb|ACT95050.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 339 Score = 76.1 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 51/151 (33%), Gaps = 34/151 (22%) Query: 254 PLSNNLNEVKSRLN----KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + + L+ L P T A+ A + + K I Sbjct: 158 PLTTDYELLYGFLDEVTPSLIPTPGTAIGSALAVAVNRM----------RDTAGESKVAI 207 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--------------GQD 355 I+DG+N+ + L + G+K+Y+++V P + Sbjct: 208 LISDGDNTSGN-----LGPTTSAQLANAFGVKVYTISVGKPKSASKADTTASAGALMDEG 262 Query: 356 LLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 L+ +G++F D+ L F +I Sbjct: 263 ELQNIAGIGNGKYFRATDNTALESVFKQIDQ 293 >gi|88601902|ref|YP_502080.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] gi|88187364|gb|ABD40361.1| von Willebrand factor, type A [Methanospirillum hungatei JF-1] Length = 316 Score = 76.1 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 67/201 (33%), Gaps = 43/201 (21%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +++ SA L+ S+ I + G LS + + V +L Sbjct: 108 TRLESSKRSAEILLKSLDPKDYAG--------IITFESGATSAAY--LSPDKDRVIRKLQ 157 Query: 268 KLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + P T + + KK VI ++DG N+ + Sbjct: 158 AIEPKEGATAIGDGLALGIDMAESMPNR----------KKVVILLSDGVNNAGVIH---- 203 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPE----------------GQDLLRKCT-DSSGQFFA 369 Q + R G++++++ + + + +L++ ++GQ+F Sbjct: 204 -PEQAAGFAREKGIQVFTIGMGSDSPVVLGYDWFGNPQYATLDEAMLQQIAASTNGQYFK 262 Query: 370 VNDSRELLESFDKITDKIQEQ 390 D R L E + + +I + Sbjct: 263 SVDDRTLSEIYSNLNKEIVRE 283 >gi|156308416|ref|XP_001617662.1| hypothetical protein NEMVEDRAFT_v1g225902 [Nematostella vectensis] gi|156195093|gb|EDO25562.1| predicted protein [Nematostella vectensis] Length = 273 Score = 76.1 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 31/245 (12%), Positives = 73/245 (29%), Gaps = 57/245 (23%) Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + + K S A ++D L A + Sbjct: 12 ARPRSVDVTAKSRTTKGIDIVMAIDVSGSMLAKDFKPNRLDALKRVASTFIED------- 64 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL------NKLNPYENTNTYPAMHHA 284 ++ RIG + Y TP++++ + L + + T + A Sbjct: 65 --RINDRIGLVVYAGESYTR--TPITSDKTVILQSLKTVEYDDSIIADG-TGIGVGLATA 119 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + S + +I +TDG N+ T++ + + G+K+Y+ Sbjct: 120 INRI----------KDSKAKSRVIILLTDGVNNAG-----TIDPRMAADIAKQYGIKVYT 164 Query: 345 VAVSAPP-----------------------EGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 + + + L+++ + + G++F D ++L + Sbjct: 165 IGIGTNGMALFPYAKDQETGKFLFRNMQVEIDEKLMKEIAEMTDGKYFRATDDKKLKAIY 224 Query: 381 DKITD 385 +I Sbjct: 225 AEINK 229 >gi|146325834|sp|Q60847|COCA1_MOUSE RecName: Full=Collagen alpha-1(XII) chain; Flags: Precursor Length = 3120 Score = 76.1 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 58/166 (34%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ ++ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E S K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGSRAGF-----PKVAIIITDGKSQD--------EVEIPARELRNIGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 53/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ + Sbjct: 1235 RVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKT-- 1292 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1293 ---QAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 1339 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L + D +T + Sbjct: 1340 EVELKMIATDPDDTHAYNVADFESLSKIVDDLTINLCNSV 1379 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 63/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ N + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKEFNRVEDIIKAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 N + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPNKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|317502942|ref|ZP_07961034.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] gi|315665941|gb|EFV05516.1| aerotolerance protein BatA [Prevotella salivae DSM 15606] Length = 332 Score = 76.1 bits (185), Expect = 9e-12, Method: Composition-based stats. Identities = 25/203 (12%), Positives = 60/203 (29%), Gaps = 45/203 (22%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY------ 272 N + + + E + P++ + + + L + Sbjct: 108 NRLEAAKNVASEFISDRPNDNIGLTIFAGEAFTQCPMTTDHASLINMLRSVRTDIAARGL 167 Query: 273 --ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 + T + +A L S K VI +TDG N+ L Sbjct: 168 ISDGTAIGMGLANAVSRL----------KDSKAKSKVVILLTDGSNNMGDIS-----PLT 212 Query: 331 ICEYMRNAGMKIYSVAVSAP--------------------PEGQDLLRKCT-DSSGQFFA 369 + ++ G+++Y++ V L+ + G ++ Sbjct: 213 SAQIAKSLGIRVYTIGVGTNKVAPYPMPVAGGVQYVNIPVEIDSKTLKNIAETTDGNYYR 272 Query: 370 VNDSRELLESFDKITDKIQEQSV 392 + +L + + I DK+++ + Sbjct: 273 ATSNNQLKQIYKDI-DKLEKSKI 294 >gi|153825062|ref|ZP_01977729.1| von Willebrand factor type A domain protein [Vibrio cholerae MZO-2] gi|149741387|gb|EDM55421.1| von Willebrand factor type A domain protein [Vibrio cholerae MZO-2] Length = 318 Score = 76.1 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 57/170 (33%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKE 293 RIG I + TPL+ + V ++LN+ L T + A + + Sbjct: 129 RIGLILFADHAYLQ--TPLTLDRQTVANQLNQAVLKLIGTQTAIGEGIGLATKTFIDSN- 185 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 ++ +I ++DG N+ L+ L+ + IY++ V A Sbjct: 186 ---------APQRVMILLSDGSNTAGV-----LDPLEAANIAKQYHTTIYTLGVGAGEMV 231 Query: 351 --------------PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + G +F + ++L +D I Sbjct: 232 VKDFLFSRKVNTAQDLDEKTLQTIATTTGGHYFRARNQQDLQNIYDTINQ 281 >gi|73973312|ref|XP_867438.1| PREDICTED: similar to alpha 1 type XII collagen long isoform precursor isoform 3 [Canis familiaris] Length = 2989 Score = 76.1 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRTDELLAAIKKIPYKGGNTMTGDAIDYLI 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARVGF-----PKVAIIITDGKSQDEVEIP--------ARELRNIGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 53/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ Sbjct: 1235 RVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ-----N 1289 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1290 FRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 1339 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 +D L+ + V D L + D +T + Sbjct: 1340 EDELKMIATDPDDTHAYNVADFESLSKIVDDLTINLCNSV 1379 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFV 532 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + S + K +I ITDG++S A +RN+ ++I++V V Sbjct: 533 PSKGSRG-----NVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|118086119|ref|XP_426008.2| PREDICTED: similar to alpha 3 type VI collagen [Gallus gallus] Length = 2533 Score = 76.1 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 52/160 (32%), Gaps = 19/160 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEK 292 +VR G + Y + + + + + T T A+ Sbjct: 685 NNVRFGVVQYASESKTEIIIGQHSQMMRLTEAIENINQIGGGTRTGNALRSMKSLFQMAY 744 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + +I ITDG+ + Q +R G+ IY++ + Sbjct: 745 R--------ENVPQILIVITDGK--------SEDKVNQAARDLRQQGIVIYAIGI--KDA 786 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 Q L + ++ + F VND L +I ++ +V Sbjct: 787 VQQELEEIAETKNRMFFVNDFDSLKHIKHEIVQEVCSTNV 826 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 68/178 (38%), Gaps = 20/178 (11%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPY 272 ++V+S + VR+G + Y+ + +K ++N++ Sbjct: 1231 KTFMKDVVDSF-----DISRDKVRLGVVQYSQEPQREFYLNEFYSDTIIKEQINRIEQLR 1285 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 +T T + + + ES++ + + + ++ ITDG ++ + Sbjct: 1286 SSTFTGKGLRF----VQSLFESANGGRKNQGVSQNLVVITDGYSADSVDD--------AA 1333 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +R+ G+ +++V V +LLR + + F V + L I ++I E Sbjct: 1334 MALRSNGIHVFAVGVGI-VNSFELLR-IAGDARRVFTVENFNALKTIKSTIINEICEP 1389 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 25/218 (11%), Positives = 63/218 (28%), Gaps = 17/218 (7%) Query: 164 YLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS 223 + + + + T + + + N ++S Sbjct: 12 FKMDDWKSLLVLLLVSTFGTIDAQQTACSKATVADVVFIVDTSTSIAQENFQKVKNFLSS 71 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMH 282 + + + +R+G Y+ ++V ++ L T T A+ Sbjct: 72 LVSS-LDIGLDMIRVGLAQYSDEAYQVFLLNQYLLKSDVLDQIGNLPYRGGETYTGRALD 130 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 + E S + + + IT GE++ Q + +R G+ I Sbjct: 131 FVSTRYFTESAGSRAKG---YVPQLAVLITSGESND--------EVEQPAKKLRYRGISI 179 Query: 343 YSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 Y V + + L++ +++ +L + Sbjct: 180 YVVGIGI--QNTTELQQIASKPFRRYLYSIGSFDDLPD 215 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 57/178 (32%), Gaps = 17/178 (9%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN-T 277 L+ + N VR+G + Y+ + E+ + L L T Sbjct: 464 FLLKIVNALDIAPSN--VRVGLVLYSNEPRLEFTLDTFKDKLEILNYLKNLPYRGGQAYT 521 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ +++ + + +++ + ITDG+ + + ++ +R Sbjct: 522 GIAIEFLRNKVFTQ---EAGSRKKQGVQQIAVVITDGQ--------SLDDYIEPASKLRR 570 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVR 393 + +Y+V V L K + +L KI ++ + V Sbjct: 571 ESVTVYAVGV-KNITEGSKLDKIATYPPRNHVTTLKYFLQLSNIRWKIKKQLCNEIVT 627 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 56/168 (33%), Gaps = 10/168 (5%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN-TYPAMHHAYRELY 289 +V+ G + Y+ + ++++ + L P T A+ A + Sbjct: 1049 VGKDNVQFGVLVYSSNPEVQFSLNSYASKSQIREAVFSLKPLSGQPFTARALSFARQTF- 1107 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + S + + ++ ITD + + N ++ + + +AV Sbjct: 1108 --GVNYGGRASSLAVARILVLITD----EPTVPSDRDNLPMAIRALKEDKIVL--IAVGV 1159 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 ++ L + T+ + F L + +T + E S + N Sbjct: 1160 SKASREELEEITEDQKRLFFAQSYDALENMHENLTQTVCESSKPVCSN 1207 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 57/162 (35%), Gaps = 20/162 (12%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + +VRIG I ++ I + + +V+ + ++ + T T A+ A Sbjct: 864 DVGLDNVRIGLIQFSSEIREEFQLDRYSTIADVQRAIQEMQQIKLGTLTGKALTFAASYF 923 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 K +++I ITDGE + +R+ G+ IY++ Sbjct: 924 DRPKGGRPELK------QYLIVITDGEAQDSVKSP--------ARAIRDKGITIYAI--D 967 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L + T + + F E+ S +I +I Sbjct: 968 MLQANNSQLVEITGAQDKVFF---ESEMNFSEKQILFEICNL 1006 >gi|319954909|ref|YP_004166176.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319423569|gb|ADV50678.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 703 Score = 76.1 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 59/184 (32%), Gaps = 19/184 (10%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 K+ +L + LV+ +++ R+ + Y N+ Sbjct: 353 VSGSMNNQNKLPLLKSAFKLLVHQLREKD--------RVSIVVYAGAAGVVLEPTGGNDK 404 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++ L+ L+ +T + AY + + N VI TDG+ + Sbjct: 405 EKIIKALDNLSAGGSTAGGEGIELAYALAEKNFKPNKNNR--------VIMATDGDFNVG 456 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLE 378 + + + E R +G+ + + L K D G ++ +E + Sbjct: 457 --ASSDKDMETLIEEKRKSGIFLSVLGFGMGNYKDSKLEKLADKGNGNHAYIDTMQEAQK 514 Query: 379 SFDK 382 F + Sbjct: 515 IFGE 518 >gi|94499146|ref|ZP_01305684.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] gi|94428778|gb|EAT13750.1| hypothetical protein RED65_10169 [Oceanobacter sp. RED65] Length = 340 Score = 76.1 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 70/195 (35%), Gaps = 38/195 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENT 275 N + +++ + + G TPL+ +L+ VK L++ T Sbjct: 113 NRLQTVKAVVTDFVEERKGDRLGLILFGEQAYIQTPLTFDLSTVKRLLDEAVVGLAGNKT 172 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 A+ + L + + +I +TDG+N+ + L+ E Sbjct: 173 AIGDAIGLGVKRLQD----------LPESNRVLILLTDGQNTAG-----EIEPLKAAELA 217 Query: 336 RNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD-SSGQFFAVNDSRELL 377 AG+KIY++ + A +D L + + GQ++ + EL Sbjct: 218 EKAGVKIYAIGIGADEMVIQGFFGPRRVNPSRDLDEDTLTAIAENTGGQYYRARNVNELE 277 Query: 378 ESFDKITD--KIQEQ 390 + +D + +I+ + Sbjct: 278 QIYDVLNQIEEIESE 292 >gi|300853773|ref|YP_003778757.1| hypothetical protein CLJU_c05730 [Clostridium ljungdahlii DSM 13528] gi|300433888|gb|ADK13655.1| hypothetical protein CLJU_c05730 [Clostridium ljungdahlii DSM 13528] Length = 419 Score = 76.1 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 66/176 (37%), Gaps = 17/176 (9%) Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-----NKLNPYENTN 276 + N++ + Y + P++ +K P NTN Sbjct: 134 ERFSSVLNLMDNMNTQNRVSIYKFDDTSKRIIPMTEVSESLKKNAEEELKQYEIPAGNTN 193 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ AY E+ + K VI ++DGE++ + + + Sbjct: 194 MGEAIDSAYNEINSTKRPGRKA--------AVILLSDGEDNFGL----NKKFDETLKPFK 241 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 ++ + IY++ +S L + D+ G+++ V ++ +L +F KI Q++ + Sbjct: 242 DSNISIYTIGMSNENNFTTLKKIAKDTHGEYYNVKNASDLKGTFSKIYYATQQRLL 297 >gi|104780882|ref|YP_607380.1| hypothetical protein PSEEN1727 [Pseudomonas entomophila L48] gi|95109869|emb|CAK14574.1| conserved hypothetical protein; Willebrand factor type A domain protein [Pseudomonas entomophila L48] Length = 358 Score = 76.1 bits (185), Expect = 1e-11, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 61/181 (33%), Gaps = 35/181 (19%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENT 275 + ++ ++ + + G PL+ + V++ L++ +NT Sbjct: 115 SRLDLVKALMGDFLQDRQGDRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAQIGIAGKNT 174 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 A+ A + L + +I ITDG N+G + L Sbjct: 175 AIGDAIGLAVKRL----------RQRPAQSRVLILITDGANNGGQIH-----PLTAARLA 219 Query: 336 RNAGMKIYSVAVSAPPE----------------GQDLLRKCTD-SSGQFFAVNDSRELLE 378 G++IY++ + A PE + L++ + G +F +D EL Sbjct: 220 AQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEASLKEIAGITHGTYFRAHDGAELNA 279 Query: 379 S 379 Sbjct: 280 I 280 >gi|117921591|ref|YP_870783.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117613923|gb|ABK49377.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 613 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 37/368 (10%), Positives = 88/368 (23%), Gaps = 31/368 (8%) Query: 34 DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI 93 DAA+ + + ++ L + Q+ Sbjct: 66 DAAIAMHEQASSAKLRTMSAESRAYIAQPTASISAAPALNGDWPGAVPPERNRFEKQVQN 125 Query: 94 TKDKNNPLQYIAES----KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAIS 149 + + Y L L + Sbjct: 126 GIMVAGEIPVSTFAIDVDTGSYTTLRRMLKEGRLPQKDTLRVEEMLNYFSYDYPLPGKND 185 Query: 150 ICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK 209 + + E ++D + + S + K Sbjct: 186 APFSV----TTELAPSPYNDDMMLLRIGLKGYEQSKAELGASNLVFL-LDVSGSMASPDK 240 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + +L + L + ++ + Y N+ + L +L Sbjct: 241 LPLLQTALKMLTQQLDAQD--------KVSIVVYAGAAGVVLDGAAGNDTQTLNYALEQL 292 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + +TN + AY+ VI TDG+ + + + L L Sbjct: 293 SAGGSTNGAQGIQLAYQLAQKHFVEGGINR--------VILATDGDFNVGTTNLDELIDL 344 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQ 388 + G+ + ++ L+ + D GQ+ ++ + E+ + + + Sbjct: 345 VSAR--KQQGIGLTTLGFGMGDYNDHLMEQLADKGNGQYAYIDS---INEARKVLVEHLS 399 Query: 389 EQSVRIAP 396 + IA Sbjct: 400 ATLLTIAK 407 >gi|254448210|ref|ZP_05061672.1| von Willebrand factor, type A [gamma proteobacterium HTCC5015] gi|198262077|gb|EDY86360.1| von Willebrand factor, type A [gamma proteobacterium HTCC5015] Length = 336 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 57/195 (29%), Gaps = 42/195 (21%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--------NKL 269 + + + + G TPL+ + V + + L Sbjct: 110 RSRIAVTKDVAMDFVKQREGDRIALVLFGTHPYLQTPLTFDHPTVMQHIYEAQLTMADDL 169 Query: 270 NPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T A+ A + L K +I +TDG ++ + + Sbjct: 170 QRGIHATAIGDAIGLAVKRL----------RDIDAPDKTLILLTDGSDNA-----SQVAP 214 Query: 329 LQICEYMRNAGMKIYSVAVSAPPE-----------------GQDLLRKCT-DSSGQFFAV 370 L+ + G+KIY++ + A + L+ + G++F Sbjct: 215 LKAAQIAAREGLKIYTIGLGAEQRQASLLGFDFGFGKNREIDEKTLKDIAKATDGRYFRA 274 Query: 371 NDSRELLESFDKITD 385 + EL E + I Sbjct: 275 RNPEELREIYQHIDR 289 >gi|289442929|ref|ZP_06432673.1| LOW QUALITY PROTEIN: membrane protein [Mycobacterium tuberculosis T46] gi|289415848|gb|EFD13088.1| LOW QUALITY PROTEIN: membrane protein [Mycobacterium tuberculosis T46] Length = 246 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 55/152 (36%), Gaps = 16/152 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N K+ L+KL + T T A+ A + + + G T ++ +DG+ Sbjct: 66 TTNREATKNALDKLQFADRTATGEAIFTALQAIATV--GAVIGGGDTPPPARIVLFSDGK 123 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + + N ++ G+ I +++ P P + ++K Sbjct: 124 ETMPTNPDNPKGAYTAARTAKDQGVPISTISFGTPYGFVEINDQRQPVPVDDETMKKVAQ 183 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G + EL + + +I ++++ Sbjct: 184 LSGGNSYNAATLAELRAVYSSLQQQIGYETIK 215 >gi|313838674|gb|EFS76388.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL086PA1] Length = 320 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 20/222 (9%), Positives = 57/222 (25%), Gaps = 32/222 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S+ A ++ +A + + + R Sbjct: 87 DRATVVVAIDVSRSMVATDVEPSRLSAAKTAAKDFLGDLP----------PRFNVSLVKF 136 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + V + + L +T ++ + L + + Sbjct: 137 AASAQVVVAPTTDRAAVSTAITNLQVLPSTAIGEGIYSSLNALKLVPDD--PKHPGQKPP 194 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DG + L+ + + +Y++A Sbjct: 195 AAIVLLSDGATNVGRPS------LEAAKEAGRQHVPVYTIAYGTAGGYVVEGGQRQPVPV 248 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L S G+ F+ +L + + I + + V Sbjct: 249 NHYELAAIAKASGGEKFSAESLGQLSDVYKSIAQSVGYEKVF 290 >gi|118355471|ref|XP_001010995.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89292762|gb|EAR90750.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 787 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 19/124 (15%), Positives = 42/124 (33%), Gaps = 9/124 (7%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N ++ + TN + A+ L N V ++DG++ Sbjct: 300 DNKENLQKITKSIQADGGTNITSGLQTAFSILQN--------RKQRNSVSSVFLLSDGQD 351 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + + L + ++ I+S +G + R G F+ V + ++ Sbjct: 352 NNSDSRIRNLLQTTY-QQLQEECFTIHSFGFGNDHDGPLMQRIAQIKDGSFYYVERNDQV 410 Query: 377 LESF 380 E F Sbjct: 411 DEFF 414 >gi|148548919|ref|YP_001269021.1| von Willebrand factor, type A [Pseudomonas putida F1] gi|148512977|gb|ABQ79837.1| von Willebrand factor, type A [Pseudomonas putida F1] Length = 358 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 61/187 (32%), Gaps = 35/187 (18%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-- 270 + ++ ++ + + G PL+ + V++ L + Sbjct: 109 WKNEDISRLDLVKALMGDFLQDREGDRVGLILFGSQAYLQAPLTFDRRTVRTFLIEAQIG 168 Query: 271 -PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +NT A+ A + L + ++ ITDG N+G + L Sbjct: 169 IAGKNTAIGDAIGLAVKRL----------RERPAQSRVLVLITDGANNGGQIH-----PL 213 Query: 330 QICEYMRNAGMKIYSVAVSAPPE----------------GQDLLRKCTD-SSGQFFAVND 372 G++IY++ + A PE + L++ D + G +F +D Sbjct: 214 TAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALKEIADITHGAYFRAHD 273 Query: 373 SRELLES 379 EL Sbjct: 274 GAELDAI 280 >gi|47208180|emb|CAF89812.1| unnamed protein product [Tetraodon nigroviridis] Length = 1636 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 31/253 (12%), Positives = 80/253 (31%), Gaps = 24/253 (9%) Query: 148 ISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN 207 +S+ + + S E L + ND + + SK Sbjct: 986 VSVFSIGVKNASREQLDIMAGNDPSRVFFVDTFDALETLYKNISKVLCNHTKPVCEKQKA 1045 Query: 208 RKIDVLIESAGNLVNSIQKAIQ---------EKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + +L +S + + + V +G ++ + Sbjct: 1046 DLVFLLDQSGSIQSDDYTTMKKFTIDLINKFQISRDLVHVGLAQFSSTFKDEFYLNKFFD 1105 Query: 259 LNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + + + T A+ + E+SH + + + + ++ ITDG++ Sbjct: 1106 EQAISAHIKDMQQEEGGTLIGLAL----NSIRKYFEASHGSRKAEGISQNLVLITDGDSQ 1161 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + +R G++++++ + +LL+ + F V + +L Sbjct: 1162 D--------DVEEAARLLRGLGVEVFAIGIG-NVHDLELLQ-IAGTPENVFTVKNFDKLE 1211 Query: 378 ESFDKITDKIQEQ 390 K+ D I + Sbjct: 1212 GIHQKVVDTICQS 1224 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 55/158 (34%), Gaps = 16/158 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEK 292 VRIG + Y ++ ++ + + T T A+ + Sbjct: 494 NHVRIGVVKYADSPTLEFDLHTYTDVKSLEKAITNIHQVGGGTETGKALDFMRPQFDRAV 553 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + +K++++ ITDG ++ + +R G+ +Y++ V Sbjct: 554 TTRGHK-----VKEYLVVITDGNSTD--------KVKDPADKLRAQGVVVYAIGV-KDAV 599 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++LL + + F VN+ L D I I Sbjct: 600 EKELLE-ISGEPQRTFYVNNFDALKPIKDDIITDICST 636 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 58/170 (34%), Gaps = 14/170 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 R G I ++ G+ ++ +V + + P NTNT A+ ++ + Sbjct: 885 SVGPELTRFGVITFSTGVQSIFTLKQYSSKRDVLQAVGAVTAPGGNTNTGDALDYSLQYF 944 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E + + ++ ITDG S E +R G+ ++S+ V Sbjct: 945 GKEHGGRAALK----VPQILMVITDGAAQEPSKLPGPS------EALRKQGVSVFSIGV- 993 Query: 349 APPEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 ++ L + + F V+ L + I+ + + + Sbjct: 994 -KNASREQLDIMAGNDPSRVFFVDTFDALETLYKNISKVLCNHTKPVCEK 1042 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 26/221 (11%), Positives = 66/221 (29%), Gaps = 20/221 (9%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + +T + + K++ +K N Sbjct: 626 KDDIITDICSTDGSDLSLLSTVCKDVPGDLIFLIDSSGSIYPEDYQKMKDFMKSLVQKSN 685 Query: 234 ---LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELY 289 V +G + Y+ +++ ++ + T+T A+ + Sbjct: 686 IGKDQVHVGVLQYSTEQKLVFPLIQYYTKDQLSKAIDDMQQIGGGTHTGEAIAVVSKYFD 745 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + + ++ +TDGE+ + E +R G+ +YS+ V Sbjct: 746 AQNGGRPDLK------QRLVVVTDGESQD--------DVKLPAEALRAKGVIVYSIGV-V 790 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 LL + + + +A D L + ++ +I + Sbjct: 791 AANTSQLLE-ISGDADRMYAERDFDALKDLEKQMALEICDP 830 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 18/119 (15%), Positives = 47/119 (39%), Gaps = 15/119 (12%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T A++ ++ + +K S R+ + + ITDG+++ Sbjct: 83 NGGTETGKAINFLRKQYFTKKAGS---RADQRVPQIAVVITDGDSTDDVVVP-------- 131 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-QF-FAVNDSRELLESFDKITDKIQ 388 +R G+ ++++ V Q L+ + +F F ++ + L +++ + + Sbjct: 132 ARELRKHGVIVFAIGVG--NANQGELKSIANRPSERFKFTIDSFQALKRLTERLLETMC 188 Score = 39.9 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 16/85 (18%), Positives = 35/85 (41%), Gaps = 4/85 (4%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELY 289 VR+G + Y+ + +E+ + L TNT A++ +++ Sbjct: 370 VSPNRVRVGIVVYHGEPKAEVFLNTFTDKSELLDFIRILPYHGGGTNTGAALNFTQHQVF 429 Query: 290 NEKESSHNTIGSTRLKKFVIFITDG 314 ++ S +G +++ + ITDG Sbjct: 430 VREKGSRIELG---VQQVAVVITDG 451 >gi|296227520|ref|XP_002759384.1| PREDICTED: matrilin-2 [Callithrix jacchus] Length = 973 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 68/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 667 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVRTEFTLRNFNSAKDMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRVAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ + F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTDKHLFYAEDFSTMDEISEK 829 Query: 383 ITDKIQE 389 + I E Sbjct: 830 LKKGICE 836 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ RN G+ I+++ V Sbjct: 148 FSEAEGARPLKEN--VPRVIMIVTDGRPQDS--------VAEVAAKARNTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKAIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|301604824|ref|XP_002932067.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 2881 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 55/165 (33%), Gaps = 15/165 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAY 285 + RI Y+ N + V + +L P NT A+ Sbjct: 844 NLDVGQDKTRIAVAQYSDSARPNFQLNTHRDKQGVLDAIQRLTPIGGPSLNTGAALDFVT 903 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 R ++ S + + +F+I +T G++ ++ G+ +++ Sbjct: 904 RNVFT---KSAGSRSDEGVPQFLILLTTGKSRDDVGRP--------ATSLKGQGVIPFAI 952 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L+ + F+V+D+ +L + + +K+ + Sbjct: 953 G--TNKADTTELQTISFVPDFTFSVSDASQLTDVYQNFANKVSQL 995 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 39/356 (10%), Positives = 92/356 (25%), Gaps = 24/356 (6%) Query: 37 VLS-GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 L+ + + + TTKKD + + K +I + ++ Sbjct: 275 ALAQYAQDVKPEFYLNSYTTKKDVAAQVKKLKILNSTPLNTGAALKYVQRNFFTASAGSR 334 Query: 96 DKNNPLQYIAESKA--QYEIPTENLFLKGLIPSALTNL-SLRSTGIIERSSENLAISICM 152 Q + + + L S R+ + + Sbjct: 335 VGEGVPQLLVLITGGPSRDDIGQVSHELKRGGILTFTLGSKRAVESELKEIAFDSTLAFR 394 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 V + L + + + P P A D Sbjct: 395 VDEFKALPLQGILPQLLTPLKTLTGTVVEAPAPVNPRDIVFLLDGSVNVGSANFPLVRDF 454 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 LI L + R+G ++ + E+ +R+ +L Sbjct: 455 LINVINGL---------GVSSEGTRVGLAQFSDTPRTEFYLNSLTSKPELLNRIAQLRLQ 505 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 Y L N S + + + ++ + G++S + Sbjct: 506 GGNALNIGSAIQY-VLENHFTSGAGSRIQENVPQLLVMLAAGKSSDSI--------QSAA 556 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + N+G+ + + A ++ L++ + + +D L + +I + Sbjct: 557 IQLVNSGVLTFCIG--AGNADKEELQRIAFNRQLVYETDDFSTLPQLSQEILTPLT 610 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 16/185 (8%), Positives = 58/185 (31%), Gaps = 18/185 (9%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-- 274 + + + + E +++G + Y+ + + ++ L L Sbjct: 50 IRDFLVNYVQRRLEVGRSRMQVGVVQYSDDVKTEFSLAQNPTKGQLVEALKNLRFIGGEE 109 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 N A+ + ++ S + + ++ IT G ++ + + Sbjct: 110 ANLGAALDYVVENVFTTAAGSRRE---EGVPQSLMIITAG--------PSSDDFREATNA 158 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT---DKIQEQS 391 ++ + + + L++ F+ ++L I + + +++ Sbjct: 159 LKLNSII--TFGIGVERADIAELQQIATDESFVFSTPQMQDLSTLERNIVPYINGVAQRT 216 Query: 392 VRIAP 396 + + P Sbjct: 217 IVLQP 221 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 64/198 (32%), Gaps = 18/198 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 N D E + + E +++I YN + + +++ Sbjct: 1422 GSINLGRDNFKEVLQFVSGIVDAVFDE--EDAIQIALAQYNSDVTDEFFLKDFTDRDQIM 1479 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYN-EKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + K A R L + + S+ + + IT G+ Sbjct: 1480 DAVTKAEYKGGRVASLGA--AIRHLQDKHFVKEAGSRVSSGVPQIAFVITGGK------- 1530 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + N G+K++++ V A + K + F V + +EL E ++ Sbjct: 1531 -SVDDGQSAAMALSNKGVKVFAIGVGAIDGDEVA--KIASDAPSAFRVPNVQELSELNEQ 1587 Query: 383 I---TDKIQEQSVRIAPN 397 I D + + + P+ Sbjct: 1588 ILITLDTALTKKMTLCPS 1605 >gi|317055486|ref|YP_004103953.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315447755|gb|ADU21319.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 1311 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 26/210 (12%), Positives = 65/210 (30%), Gaps = 32/210 (15%) Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 + + + N ++ ++A V+ + + Sbjct: 634 VDNSGKTVAMDIALVIDSSGSMTWNDPKNLRKDAAKEFVDKLSSIDEAAIIDF------- 686 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 L++N + S ++ ++ T+ + L + Sbjct: 687 ---DSSSKINRNLTSNRTLLYSAIDDIDSSGGTSLTAGVSKGLEALSKSND--------- 734 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCT 361 KK +I +TDG+ + NAG+ IY++ + + Q LL Sbjct: 735 --KKIMILLTDGKGPYD---------KSLTTQAINAGVTIYTIGLGTNNDIDQPLLNSIA 783 Query: 362 D-SSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+++ ++ SFD ++ + + Sbjct: 784 TETGGKYYHAKKDIDIQGSFDNVSGDLGNK 813 >gi|156742635|ref|YP_001432764.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233963|gb|ABU58746.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 777 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 27/313 (8%), Positives = 85/313 (27%), Gaps = 31/313 (9%) Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 ++ D + A+ +P+ LF + + +S + Sbjct: 196 VSFDHVIETPLSLTTIAEAPLPSAFLFSDSM---ETGASRWIAAPPWGLTSVARNSAQAW 252 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP--PPKKSFWSKNTTKSKYA----PAPAPA 206 + + + + + P + +N Sbjct: 253 TDSPAGNYANNTNSALEIATPIAIPASMTAPALMFWHRYDIENGFDRGEVEISANNGVTW 312 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY-------NIGIVGNQCTPLSNNL 259 R + G I + + VR + + Sbjct: 313 QRLRSFTGTALGWRREMISLDAYRGQTVRVRFRLVTNAFGVRDGWYIDDVALGPAWDDVR 372 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++ ++ LN T+ + + L + + + + ++ ++DG+ + + Sbjct: 373 ARAQAAIDTLNSRGATSIGGGLQRSQHLLTSANPA---------IPRAIVLLSDGQENTS 423 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + L +R+A ++++ V + + +L + G + +L Sbjct: 424 PYVADVLPP------IRDAQTTVHTIGVGQDADQRLMLSIAAQTGGTYNYAPTPDQLARI 477 Query: 380 FDKITDKIQEQSV 392 ++ I+ + + Sbjct: 478 YNTISGNVSNRQT 490 >gi|13471293|ref|NP_102862.1| hypothetical protein mll1222 [Mesorhizobium loti MAFF303099] gi|14022037|dbj|BAB48648.1| mll1222 [Mesorhizobium loti MAFF303099] Length = 638 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 34/329 (10%), Positives = 83/329 (25%), Gaps = 26/329 (7%) Query: 69 KKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSAL 128 + R D +D + + A Y +L + + Sbjct: 155 ADQIAPQEENRNRVQDFKTNPVHAALEDPVSTFSIDVD-TASYSFVRRSLKEGSVPQADT 213 Query: 129 TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 + ++S VS Q + + PK + Sbjct: 214 VRVEEMINYFPYDWKGPDSVSTPFNSTVSVMPTPWNAQTKLMHVAIKGFDIKPTEQPKAN 273 Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 K+ +L + LV+ ++ I + Y Sbjct: 274 LV-----FLIDVSGSMDEPDKLPLLKSAFRLLVSKLRADDT--------ISIVTYAGEAG 320 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 + +++ + ++ L P +T + AY+ V Sbjct: 321 TVLMPTRAAEKDKILNAIDNLTPGGSTAGEAGIKEAYKLAQQSFVKDGVNR--------V 372 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 + TDG+ + + + + ++ E R +G+ + +++ + Sbjct: 373 MLATDGDFNVGQSDDD--DLKRLIEQERKSGVFLSVFGFGHDNLNDQMMQTIAQNGNG-- 428 Query: 369 AVNDSRELLESFDKITDKIQEQSVRIAPN 397 L E+ + + IA + Sbjct: 429 TAAYIDTLAEAEKVLVEDASSTLFPIAKD 457 >gi|301777181|ref|XP_002924014.1| PREDICTED: cartilage matrix protein-like [Ailuropoda melanoleuca] Length = 495 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 37/352 (10%), Positives = 105/352 (29%), Gaps = 52/352 (14%) Query: 68 IKKHLKQGSYIRENAGDIAQKAQIN---ITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + G A+ I+ I P + + A+ LF G+ Sbjct: 121 LAIQFAITKAFSNTEGGRARSPDISKVVIVVTDGRPQDSVRDVSARARANGIELFAIGVG 180 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 L ++ + + S ++ +S+ ++ + + + + Sbjct: 181 RVDKATLRQIASEPQD-EHVDYVESYSVIEKLSKKFQEAFCVVSDLCATGDHDCEQVCLS 239 Query: 185 PKKSF-------WSKNTTKSKYAPAPAPANR------------------KIDVLIESAGN 219 S+ ++ N+ +++ + Sbjct: 240 SPGSYTCACREGFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFINQ 299 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTY 278 +V+++ + + ++G + Y+ + + ++K+ + ++ + T T Sbjct: 300 IVDTL-----DVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTG 354 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ + + + +K I TDG + + ++ Sbjct: 355 AALKYLIDNSFTVSSGAR-----PGAQKVGIVFTDGRSQDYIN--------NAAKKAKDL 401 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 G K+++V V +D LR+ +F D + + + ++ KI Sbjct: 402 GFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTINQIGKRLQKKIC 451 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 53/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 71 DVGPNATRVGVVNYASAVKQEFPLRAHGSKASLLRAVRRIQPLSTGTMTGLAIQFAITKA 130 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E S + K VI +TDG + + R G++++++ V Sbjct: 131 FSNTEGGRAR--SPDISKVVIVVTDGRPQDS--------VRDVSARARANGIELFAIGVG 180 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L + F + Sbjct: 181 R--VDKATLRQIASEPQDEHVDYVESYSVIEKLSKKFQE 217 >gi|300786826|ref|YP_003767117.1| hypothetical protein AMED_4949 [Amycolatopsis mediterranei U32] gi|299796340|gb|ADJ46715.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 326 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 19/150 (12%), Positives = 49/150 (32%), Gaps = 16/150 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + N V + L ++T T + A + + + ++ ++DG+ Sbjct: 147 TTDRNGVIKAIENLKLAQSTATGEGIFAALQSVESFSSLVGGA--DGPPPARIVLMSDGK 204 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP--------------EGQDLLRKCT 361 + + + AG+ I S++ + + L Sbjct: 205 QTVPEDLYAARGGYTAAQAAKQAGVPISSISFGTTHGSVTIDDKPQPVSVDDESLREIAR 264 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQS 391 S G F+ + EL + + + ++I + Sbjct: 265 LSGGDFYKAASAEELKKVYADLGEQIGYEL 294 >gi|281353337|gb|EFB28921.1| hypothetical protein PANDA_013248 [Ailuropoda melanoleuca] Length = 471 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 37/352 (10%), Positives = 105/352 (29%), Gaps = 52/352 (14%) Query: 68 IKKHLKQGSYIRENAGDIAQKAQIN---ITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + G A+ I+ I P + + A+ LF G+ Sbjct: 106 LAIQFAITKAFSNTEGGRARSPDISKVVIVVTDGRPQDSVRDVSARARANGIELFAIGVG 165 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 L ++ + + S ++ +S+ ++ + + + + Sbjct: 166 RVDKATLRQIASEPQD-EHVDYVESYSVIEKLSKKFQEAFCVVSDLCATGDHDCEQVCLS 224 Query: 185 PKKSF-------WSKNTTKSKYAPAPAPANR------------------KIDVLIESAGN 219 S+ ++ N+ +++ + Sbjct: 225 SPGSYTCACREGFTLNSDGKTCNVCSGGGGSSATDLVFLIDGSKSVRPENFELVKKFINQ 284 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTY 278 +V+++ + + ++G + Y+ + + ++K+ + ++ + T T Sbjct: 285 IVDTL-----DVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRNMSYMEKGTMTG 339 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ + + + +K I TDG + + ++ Sbjct: 340 AALKYLIDNSFTVSSGAR-----PGAQKVGIVFTDGRSQDYIN--------NAAKKAKDL 386 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 G K+++V V +D LR+ +F D + + + ++ KI Sbjct: 387 GFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTINQIGKRLQKKIC 436 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 53/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 56 DVGPNATRVGVVNYASAVKQEFPLRAHGSKASLLRAVRRIQPLSTGTMTGLAIQFAITKA 115 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E S + K VI +TDG + + R G++++++ V Sbjct: 116 FSNTEGGRAR--SPDISKVVIVVTDGRPQDS--------VRDVSARARANGIELFAIGVG 165 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L + F + Sbjct: 166 R--VDKATLRQIASEPQDEHVDYVESYSVIEKLSKKFQE 202 >gi|262172998|ref|ZP_06040675.1| protein BatA [Vibrio mimicus MB-451] gi|261890356|gb|EEY36343.1| protein BatA [Vibrio mimicus MB-451] Length = 318 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 29/154 (18%), Positives = 54/154 (35%), Gaps = 36/154 (23%) Query: 253 TPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TPL+ + V S+LN+ L T + A + + S ++ +I Sbjct: 143 TPLTMDRQTVISQLNQAVLKLIGTQTAIGEGIGLATKTFID----------SDAPQRVMI 192 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PE 352 ++DG N+ L+ L+ + IY+V V A Sbjct: 193 LLSDGSNTAGV-----LDPLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDL 247 Query: 353 GQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + GQ+F + ++L +D I Sbjct: 248 DEKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 281 >gi|55741484|ref|NP_001006980.1| cartilage matrix protein [Rattus norvegicus] gi|54035339|gb|AAH83869.1| Matrilin 1, cartilage matrix protein [Rattus norvegicus] gi|149024105|gb|EDL80602.1| matrilin 1, cartilage matrix protein [Rattus norvegicus] Length = 498 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 39/376 (10%), Positives = 107/376 (28%), Gaps = 49/376 (13%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKN 98 + S ++ + ST + + + G ++ + I+ I Sbjct: 98 RAHTSKASLLQAVHRIQPLSTGTMTGLALQFAITKALSDAEGGRSRSSDISKVVIVVTDG 157 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERS--SENLAISICMVLDV 156 P + + + LF G+ L ++ + I + Sbjct: 158 RPQDSVRDVSERARASGIELFAIGVGRVDKATLRQIASEPQDEHVDYVESYNVIEKLAKK 217 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF---WSKNTTKSKYAPAPAPAN------ 207 + + ++ P + ++ N+ N Sbjct: 218 FQEAFCVSDLCATGDHYCEQVCVSSPGSYTCACHEGFTLNSDGKTCNVCRGGGNGSATDL 277 Query: 208 ------------RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 +++ + +V+++ + + ++G + Y+ I Sbjct: 278 VFLIDGSKSVRPENFELVKKFINQIVDTL-----DVSDRLAQVGLVQYSSSIRQEFPLGR 332 Query: 256 SNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + ++K+ + ++ + T T A+ + + + +K I TDG Sbjct: 333 FHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFTDG 387 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND 372 + ++ G K+++V V ++ LR+ +F D Sbjct: 388 RSQDYIND--------AARKAKDLGFKMFAVGVG--NAVEEELREIASEPVADHYFYTAD 437 Query: 373 SRELLESFDKITDKIQ 388 + + + K+ KI Sbjct: 438 FKTINQIGKKLQKKIC 453 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 58/159 (36%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + ++++ P T T A+ A + Sbjct: 74 DVGPNATRVGLVNYASTVKPEFPLRAHTSKASLLQAVHRIQPLSTGTMTGLALQFAITKA 133 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + S+ + K VI +TDG + + E R +G++++++ V Sbjct: 134 LSDAEGGRSR--SSDISKVVIVVTDGRPQDS--------VRDVSERARASGIELFAIGVG 183 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L + F + Sbjct: 184 R--VDKATLRQIASEPQDEHVDYVESYNVIEKLAKKFQE 220 >gi|329849361|ref|ZP_08264207.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328841272|gb|EGF90842.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 505 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 27/361 (7%), Positives = 80/361 (22%), Gaps = 31/361 (8%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L V + + + + + G A + + Sbjct: 6 LCVSVLAVLTACQPKSAEMYTEAAEYDQTYVPAPTGERPNDERYDG-KAVSPIVRVADRP 64 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 + + Y L P+ + Sbjct: 65 VSTFAVDVD-TGAYANVRRLLNDGQRPPADAVRTEELLNYFRYDYPLPADKKQPFSITTE 123 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + + + P + K+ ++ + Sbjct: 124 VTTTPWNPNSRLLRVGLRAYDVPRSERPAANLV-----FLIDVSGSMDEKDKLPLVQHAL 178 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 + + + R + N +V+ L +L +T Sbjct: 179 RLVADDM----------RPRDRVSIVVYAGAAGIVLEPTANPAQVRRALGQLKAGGSTAG 228 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 + ++ VI TDG+ + + + ++ Sbjct: 229 GEGIA--------LAYATARAAYIDGGINRVILATDGDFNVG--ISDPEAIKDLVRKNKD 278 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 G+ + ++ + L+ D +G + ++ + E + + D++ +A Sbjct: 279 DGITLTALGFGTGNYNEALMEGIADVGNGNYAYIDSASEARKV---LDDELSSTLFTVAQ 335 Query: 397 N 397 + Sbjct: 336 D 336 >gi|27379052|ref|NP_770581.1| hypothetical protein blr3941 [Bradyrhizobium japonicum USDA 110] gi|27352202|dbj|BAC49206.1| blr3941 [Bradyrhizobium japonicum USDA 110] Length = 472 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 51/470 (10%), Positives = 116/470 (24%), Gaps = 98/470 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+++ + A+D + R Q+ +A DAA ++ + ++ + Sbjct: 23 IFALMMVPTIFLLGMALDYTLALRKREQLNAAADAAAIAAVRPAMLTQSDTTVVKATAEA 82 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 K L D I ++ + Sbjct: 83 -VFAAKANLPGLSAVPTPTVTIVDSGLARTITVSYTAQSTNN-----------------F 124 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS---- 176 G++ ++ +T ++ +++D S SM N + Sbjct: 125 PGVLGKQTWQVAGSATARAS---SAPNMNFYLLMDDSPSMGIGATTTDISNLIKYTAPAY 181 Query: 177 ------NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 T + +ID++ + L+NS Q Sbjct: 182 QSAGGSQNCGFACHETNIAHDGGTKDNLAIARQRNITLRIDLVTSAVNQLLNSWSNCPQS 241 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN---------------- 274 + V A N LN + + + Sbjct: 242 GVSGGVMQCMSALNNTTYKAALYTFDLGLNALATLTTPTSAGTQVSNIALMPVAYQNCVV 301 Query: 275 ------TNTYPAMHHAYRELYNEKESSHNTIG--STRLKKFVIFITDGENSG-------- 318 T+ + A + L + + ++ V +TDG Sbjct: 302 VTTNCKTDNGTDIAGALKSLNDVMPTPGLGSNASGDTPQEVVFLVTDGVEDKIVSGASTC 361 Query: 319 ------ASAYQNTLNTLQICEYMRNAGMKI---YSVAVSAPPEG---------------- 353 ++ IC ++N G+KI Y+ + Sbjct: 362 PNASLASNNRCQQPLDTTICTTIKNRGIKIAVLYTEYLQLKTPNIPVTNSWYMSWVDPYN 421 Query: 354 ---------QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 L+ C G + +V ++ ++ + K+ + + Sbjct: 422 EPTSLTGTIAQKLQSCAS-PGFYASVQTGGDISDALTNLFIKVASSTASL 470 >gi|114048546|ref|YP_739096.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113889988|gb|ABI44039.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 625 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 37/368 (10%), Positives = 90/368 (24%), Gaps = 34/368 (9%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIREN------AGDIAQKA 89 A ++ S + + + G + Q Sbjct: 79 AAIAIHEQATSTKLRTMNAEHRAYIAQPAATISAAPALNGDWPGAVPPERNRFEKQVQNG 138 Query: 90 QINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAIS 149 + + + + Y L L + Sbjct: 139 IMVAGETPVSTFAIDVD-TGSYTTLRRMLKEGRLPQKDTLRVEEMLNYFSYDYPLPSKNE 197 Query: 150 ICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK 209 + + E ++D + + S + K Sbjct: 198 APFSV----TTELAPSPYNDDMMLLRIGLKGYEQSKAELGASNLVFL-LDVSGSMASPDK 252 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + +L + L + ++ + Y N+ + L +L Sbjct: 253 LPLLQTALKMLTQQLGAQD--------KVSIVVYAGAAGVVLDGAAGNDSQTLNYALEQL 304 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + +TN + AY+ VI TDG+ + + + L L Sbjct: 305 SAGGSTNGAQGIQLAYQLAKKHLVEGGINR--------VILATDGDFNVGTTNLDELIDL 356 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQ 388 + + G+ + ++ L+ + D GQ+ ++ L E+ + +++ Sbjct: 357 VSAQ--KQLGIGLTTLGFGMGDYNDHLMEQLADKGNGQYAYIDS---LNEARKVLVEQLS 411 Query: 389 EQSVRIAP 396 + IA Sbjct: 412 ATLLTIAK 419 >gi|330417948|ref|NP_001193426.1| collagen alpha-1(XII) chain [Bos taurus] Length = 3065 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 58/166 (34%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQQEELLAAIKKIPYKGGNTMTGEAIDYLI 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRNIGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 53/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ + Sbjct: 1235 RVQIALAQYSGDPRTEWQLNAHKDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKT-- 1292 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1293 ---QAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 1339 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T+ + Sbjct: 1340 EVELKMIATDPDDIHAYNVADFDSLSRIVDDLTNNLCNSV 1379 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|319426861|gb|ADV54935.1| von Willebrand factor type A [Shewanella putrefaciens 200] Length = 339 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 64/204 (31%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + ++ + + + RIG I + PL+ + V Sbjct: 104 NGKTVDRFTLIQHVVSDFIERRK---------GDRIGLILFADHAYLQA--PLTQDRRSV 152 Query: 263 KSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 L ++ T A+ A + + +I +TDG N+ Sbjct: 153 AQFLKEAQIGLVGKQTAIGEAIALAVKRFDK----------IDESNRVLILLTDGSNNAG 202 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD 362 + + Q + N + IY+V V A ++ L+ D Sbjct: 203 N-----IEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAD 257 Query: 363 -SSGQFFAVNDSRELLESFDKITD 385 + G++F +S+EL + + +I Sbjct: 258 VTHGRYFRARNSQELDQIYQEIDK 281 >gi|120598362|ref|YP_962936.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|146293560|ref|YP_001183984.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] gi|120558455|gb|ABM24382.1| von Willebrand factor, type A [Shewanella sp. W3-18-1] gi|145565250|gb|ABP76185.1| von Willebrand factor, type A [Shewanella putrefaciens CN-32] Length = 339 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 64/204 (31%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + ++ + + + RIG I + PL+ + V Sbjct: 104 NGKTVDRFTLIQHVVSDFIERRK---------GDRIGLILFADHAYLQA--PLTQDRRSV 152 Query: 263 KSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 L ++ T A+ A + + +I +TDG N+ Sbjct: 153 AQFLKEAQIGLVGKQTAIGEAIALAVKRFDK----------IDESNRVLILLTDGSNNAG 202 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD 362 + + Q + N + IY+V V A ++ L+ D Sbjct: 203 N-----IEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAD 257 Query: 363 -SSGQFFAVNDSRELLESFDKITD 385 + G++F +S+EL + + +I Sbjct: 258 VTHGRYFRARNSQELDQIYQEIDK 281 >gi|212276002|ref|NP_001130333.1| hypothetical protein LOC100191428 [Zea mays] gi|194688870|gb|ACF78519.1| unknown [Zea mays] Length = 704 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 33/271 (12%), Positives = 81/271 (29%), Gaps = 30/271 (11%) Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + N T I E A+ LD + + + ++ S Sbjct: 191 RMEAFDDLNFGSSKTAEISSYPEFQAVPQSTCLDGFDILIHVKAPTSSSDDATGSLVNGS 250 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 + T + + A K+ +L ++ G ++ ++ + R+ I Sbjct: 251 SLRLSRRVPIDIVTVLDVSGSMAG--TKMALLKQAMGFVIQHLRPSD--------RLSVI 300 Query: 242 AYNIGIVGNQCTP--LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 A++ + + +N L TN A+ A + + + Sbjct: 301 AFSSTARRLFPLQRMSHHGRQQALQAINSLGAGGGTNIADALKKAVKVIAD--------R 352 Query: 300 GSTRLKKFVIFITDGENSGASAYQ--------NTLNTLQICEYMRNAGMKIYSVAVSAPP 351 +I ++DG+++ + +L + + +++ A Sbjct: 353 SYKNSVCSIILLSDGQDTYNISSNFQGTSAGRRSLVPSANPNELHM--VPLHTFGFGADH 410 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + L S G F + D + ++F + Sbjct: 411 DSDTLHSISEASGGTFSFIEDEGVMQDAFAQ 441 >gi|242042273|ref|XP_002468531.1| hypothetical protein SORBIDRAFT_01g047480 [Sorghum bicolor] gi|241922385|gb|EER95529.1| hypothetical protein SORBIDRAFT_01g047480 [Sorghum bicolor] Length = 650 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 22/189 (11%), Positives = 59/189 (31%), Gaps = 31/189 (16%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSRL 266 K+ +L ++ G +++++ R+ ++++ + KS + Sbjct: 200 KLALLKQAMGFVIDNLGPHD--------RLSVVSFSDRARRVTRLLRMSGDGKAAAKSAV 251 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-------A 319 L TN + A R L VI ++DG+++ Sbjct: 252 ESLVARGGTNIAEGLRTAARVLEE--------RRHRNTVSSVILLSDGQDTYTAPRWSRG 303 Query: 320 SAYQNTLNTLQ------ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 T N + R+ +++ + + ++G F + + Sbjct: 304 PGAGATPNYEALVPPSFMATSTRDWSAPVHTFGFGNDHDAAAMHVIAESTAGTFSYIGNE 363 Query: 374 RELLESFDK 382 + ++F + Sbjct: 364 AVIQDAFAQ 372 >gi|91792882|ref|YP_562533.1| von Willebrand factor, type A [Shewanella denitrificans OS217] gi|91714884|gb|ABE54810.1| von Willebrand factor, type A [Shewanella denitrificans OS217] Length = 330 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 52/153 (33%), Gaps = 36/153 (23%) Query: 254 PLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + + L + T A+ A + ES+ + ++ Sbjct: 144 PLTQDRRSIATFLADAQIGLVGKQTAIGEAIALAVKRFDQVSESN----------RVLVL 193 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEG 353 +TDG N+ + + E + IY+V V A Sbjct: 194 LTDGSNNAGNIEPDV-----AAEIAAKRNVTIYTVGVGAELMERRTIFGKERVNPSMDLD 248 Query: 354 QDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L++ ++G +F +S +L + + KI Sbjct: 249 EAQLQRLATMTNGYYFRAKNSEDLAQIYQKIDQ 281 >gi|84622723|ref|YP_450095.1| hypothetical protein XOO_1066 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|84366663|dbj|BAE67821.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] Length = 335 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 32/205 (15%), Positives = 66/205 (32%), Gaps = 46/205 (22%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + ++ G TPL+ +L Sbjct: 117 VLGGKVVDRLTAAKAVLSDFLDR-----------RDGDRVGLLVFGQRAYALTPLTADLT 165 Query: 261 EVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V+ +L T A+ + + L +K+ ++ V+ +TDG N+ Sbjct: 166 SVRDQLRDSVVGLAGRETAIGDAIALSVKRLREQKQG----------QRVVVLLTDGVNT 215 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAV----------------SAPPEGQDLLRKCT 361 L+ L+ E + G++I+++A +D LRK Sbjct: 216 AGV-----LDPLKAAELAKAEGVRIHTIAFGGGGGSSLFGVPIPAGGNDDIDEDGLRKIA 270 Query: 362 -DSSGQFFAVNDSRELLESFDKITD 385 + G+FF D+ EL + ++ Sbjct: 271 QQTGGRFFRARDTEELAGIYAELDR 295 >gi|198274642|ref|ZP_03207174.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] gi|198272089|gb|EDY96358.1| hypothetical protein BACPLE_00794 [Bacteroides plebeius DSM 17135] Length = 332 Score = 75.7 bits (184), Expect = 1e-11, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 48/142 (33%), Gaps = 37/142 (26%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T + +A L S K VI +TDG N+ L E Sbjct: 171 GTALGMGLANAVSRL----------KDSKAKSKVVILLTDGVNNRGDIS-----PLTAAE 215 Query: 334 YMRNAGMKIYSVAVSAPPE--------------------GQDLLRKCT-DSSGQFFAVND 372 + G+++Y++ V + + + ++G +F Sbjct: 216 IAKQFGIRVYTIGVGTNGTAPYPMQTYAGVQYVQMPVEIDEQTMSQIAGTTNGNYFRATS 275 Query: 373 SRELLESFDKITDKIQEQSVRI 394 + +L E + +I DK+++ + + Sbjct: 276 NTKLKEVYREI-DKLEKTKLNV 296 >gi|38505728|ref|NP_942348.1| hypothetical protein slr7060 [Synechocystis sp. PCC 6803] gi|38423752|dbj|BAD01962.1| slr7060 [Synechocystis sp. PCC 6803] Length = 588 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 24/211 (11%), Positives = 62/211 (29%), Gaps = 21/211 (9%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 PP + S N + KI ++ ++ + + Sbjct: 34 PPAMDQPRPSLNLGFVIDRSGSMEGHNKITYARQAVCYAIDQLSPGD--------HLSVT 85 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 ++ + + L + + K + +NP T+ + + Sbjct: 86 IFDDQVQTLIPSTLVKDKAQFKRLVQGINPGGCTDLHGGWLQG---------GIQVSQNL 136 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 + +I ++DG + + + T + G ++ + +DLL Sbjct: 137 SAELNRIILLSDGLANRGETNPDIIATD--VHGLAQRGASTTTLGLG-DDYNEDLLEAMA 193 Query: 362 -DSSGQFFAVNDSRELLESFDKITDKIQEQS 391 G ++ V D+ +L F++ + Sbjct: 194 RSGDGNYYYVADAEQLPTIFERELQGLAATY 224 >gi|111074529|ref|NP_031756.2| collagen alpha-1(XII) chain [Mus musculus] Length = 3061 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 58/166 (34%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ ++ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E S K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGSRAGF-----PKVAIIITDGKSQD--------EVEIPARELRNIGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 53/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ + Sbjct: 1235 RVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKT-- 1292 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1293 ---QAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 1339 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L + D +T + Sbjct: 1340 EVELKMIATDPDDTHAYNVADFESLSKIVDDLTINLCNSV 1379 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 63/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ N + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKEFNRVEDIIKAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 N + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPNKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|320333536|ref|YP_004170247.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] gi|319754825|gb|ADV66582.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] Length = 509 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 48/359 (13%), Positives = 111/359 (30%), Gaps = 26/359 (7%) Query: 34 DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI 93 DA + S + + + I+ + I + ++ + Sbjct: 164 DAITAADARSPELRAFFRGQALTAGSSGWLADAYIRDQARLNGLINYESVLLSLNRGGQL 223 Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG-IIERSSENLAISICM 152 + L Y + + P L P +LR+ + L + Sbjct: 224 --REPLTLIYPRDGLITADYPLLLLNPAQQAPFRKLVDALRAPAVQARIMRDTLRRPVNT 281 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 + ++R D L + + ++ ++ + + NT +ID Sbjct: 282 TVPLTRDFPDALLLELPFPRSASTLDAVVSTYLQDTRQPANTIFVLDVSGSMRGA-RIDA 340 Query: 213 LIESAGNL--VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS-----NNLNEVKSR 265 L + L ++ R+ I ++ + T L+ L +++++ Sbjct: 341 LKTALRGLSGADTTLTGRYATFANRERVTLIPFSSAPGAPRTTELTPATRGAALKQLRAQ 400 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++ L P TN Y A+ AY + + + ++ +TDGE + + Sbjct: 401 VDALTPDGGTNIYGALQAAYEQARAAPAGRYTS---------IVLMTDGERTEGPSADQF 451 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 T R +K ++V + + + G+ F + +L +F I Sbjct: 452 RATYAALPE-RARQVKTFTVLFG--DSDATEMNRIATLTGGRTFDGQN--DLRAAFKDI 505 >gi|297286916|ref|XP_002808380.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-5(VI) chain-like [Macaca mulatta] Length = 2604 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 62/168 (36%), Gaps = 13/168 (7%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN-KLNPYENTNTYPAMHHAYREL 288 + V+ G + Y+ +N + + L + + +T T A+ HA Sbjct: 845 DVGRNGVQFGALKYSDQPNILFYLNTYSNRSAIIENLRMRRDTGGSTYTAKALKHANALF 904 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E H + +K+ +I ITDG++ +T +R+ G+ I +V V Sbjct: 905 TEE----HGSRIKENVKQMLIVITDGKSHDHDQLNDT------ASELRDKGITILAVGVG 954 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 Q L + V++ +L + F + + + ++ + Sbjct: 955 K--ANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQESMCTEAPEVCS 1000 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 19/166 (11%), Positives = 56/166 (33%), Gaps = 16/166 (9%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHA 284 + VR+G + Y+ +N ++ + + T T A+ + Sbjct: 469 TEMFSIGPDKVRVGVVQYSDNAEVEFYISDYSNDIGLRKAIFNIKQLTGRTYTGKALDYI 528 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + N + + + ++I +TDG + ++ + +R + I++ Sbjct: 529 LQIXKNGSKDRMSK-----VPCYLIVLTDGM--------SMDRVVEPGKRLRAEQITIHA 575 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 V + + L++ + + L +++ +I + Sbjct: 576 VGIG--AANKIELQEIAGKEERVHFGQNFDALKSIKNEVVREICTE 619 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 52/165 (31%), Gaps = 17/165 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + +IG + ++ E+ +++++ + T T A++ Sbjct: 656 TKIQIGADKTQIGVVQFSDTTKEEFQLNRYFTRQEISDAIDRMSLINKGTLTGKALNFVG 715 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + K + KF+I ITDG +R + I+SV Sbjct: 716 QYFTHSKGARLGAK------KFLILITDGVARDYVRDP--------ARILRGKNVTIFSV 761 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 V + L + + F V + L K+ ++ Sbjct: 762 GV--YNANRSQLEEISGDGSLVFHVENFDHLKALERKLVFRVCAL 804 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 67/186 (36%), Gaps = 20/186 (10%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N + +L+++ + ++ ++IG + S + K+++ Sbjct: 1018 NSDFVTMTTFLSDLIDNF-----DIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKNQI 1072 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 ++ +P + A +++ N + + + ++ IT G+ Sbjct: 1073 QNVSKSSG---FPRIDFALKKVSNMFNLYAGGRRNAGVPQTLVVITSGDPH--------Y 1121 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL--ESFDKIT 384 N + +++ G+ + + + + Q LL T +S + D +L + +I Sbjct: 1122 NVADAVKTLKDLGICVLVLGIGDVSKEQLLL--ITGNSEKIITFQDFDKLKNVDVRKRIV 1179 Query: 385 DKIQEQ 390 +I + Sbjct: 1180 REICQS 1185 >gi|125548980|gb|EAY94802.1| hypothetical protein OsI_16587 [Oryza sativa Indica Group] Length = 708 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 43/133 (32%), Gaps = 17/133 (12%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + +N L TN A+ A + + + +I ++DG+++ Sbjct: 333 GRQQALQAINSLGASGGTNIADALKKAMKVIED--------RSYKNSVCSIILLSDGQDT 384 Query: 318 GA--------SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 S +L I R + +++ A + L S G F Sbjct: 385 YNISSSVQGASPDYKSLVPSSIINDAR-HTVPLHAFGFGADHDSDSLHSIAQASGGTFSF 443 Query: 370 VNDSRELLESFDK 382 + D + ++F + Sbjct: 444 IEDEGVMQDAFAQ 456 >gi|115459346|ref|NP_001053273.1| Os04g0508800 [Oryza sativa Japonica Group] gi|32489531|emb|CAE04734.1| OSJNBa0043L24.22 [Oryza sativa Japonica Group] gi|113564844|dbj|BAF15187.1| Os04g0508800 [Oryza sativa Japonica Group] gi|116310776|emb|CAH67569.1| OSIGBa0101P20.12 [Oryza sativa Indica Group] gi|125590953|gb|EAZ31303.1| hypothetical protein OsJ_15416 [Oryza sativa Japonica Group] Length = 708 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 43/133 (32%), Gaps = 17/133 (12%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + +N L TN A+ A + + + +I ++DG+++ Sbjct: 333 GRQQALQAINSLGASGGTNIADALKKAMKVIED--------RSYKNSVCSIILLSDGQDT 384 Query: 318 GA--------SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 S +L I R + +++ A + L S G F Sbjct: 385 YNISSSVQGASPDYKSLVPSSIINDAR-HTVPLHAFGFGADHDSDSLHSIAQASGGTFSF 443 Query: 370 VNDSRELLESFDK 382 + D + ++F + Sbjct: 444 IEDEGVMQDAFAQ 456 >gi|324997883|ref|ZP_08118995.1| hypothetical protein PseP1_03919 [Pseudonocardia sp. P1] Length = 329 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 21/161 (13%), Positives = 55/161 (34%), Gaps = 18/161 (11%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL--YNEKESSHNTIGSTRLKKFVIFITD 313 + + N VK+ ++ L E+T T A+ A + + ++ T V+ ++D Sbjct: 144 TTDRNAVKNGVDNLQLAESTATGEAIFTAMQSIDTFSRSLQGGPDQQGTPPPARVVLLSD 203 Query: 314 GENSGASAYQNT--LNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLR 358 G + + + G+ + +++ + Sbjct: 204 GTQTVPGPDGENEPRGSFTAASDAQRRGIPVSTISFGTSYGSIELDGGRTPVAVDDASME 263 Query: 359 KCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + S G+FF EL + +++++ + + +R Sbjct: 264 RIASLSGGRFFTAATESELRAVYSDLSEELGYEEREVDASR 304 >gi|312131680|ref|YP_003999020.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311908226|gb|ADQ18667.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 328 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 30/157 (19%), Positives = 58/157 (36%), Gaps = 25/157 (15%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 +PL+ + +K L +N T A+ +L Sbjct: 152 TATLSPLTTDYTALKEYLASINTNIIRTSGTALGMALSSCVNKL----------RDVAGK 201 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----GQDLLRKC 360 + I I+DG+N+ + E ++ G+++Y++A+ P + LR Sbjct: 202 SRIAIIISDGDNTAGTIP-----PETAVELAKSFGVRVYTIAIGKPGSEEGVDEKTLRML 256 Query: 361 TDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 G+FF D+ L + FD+I D +++ A Sbjct: 257 AGGPNGRFFKAADNSSLSKIFDEI-DHLEKTISETAS 292 >gi|293361345|ref|XP_236596.5| PREDICTED: collagen type VI alpha 4 [Rattus norvegicus] Length = 2327 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 59/170 (34%), Gaps = 16/170 (9%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + ++ V++G + Y+ I S+ + V ++ L +TNT A+ Sbjct: 263 SGLDVRSDQVQVGLVQYSDNIYPAFQLKQSSLKSVVLEQIRNLPYNMGSTNTGSALEFIR 322 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 E S + + VI +TDGE++ + ++ G+ +Y V Sbjct: 323 ANYLTEMSGSRAK---DGVPQIVILVTDGESND--------EVQDAADQLKRDGVFVYVV 371 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + L+K F + L E ++ + R Sbjct: 372 GI--NIQDVQELQKIASEPFEEFLFTTENFNILQELSGSLSQAVCSTVER 419 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 55/183 (30%), Gaps = 17/183 (9%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-E 273 +S N + + ++Q ++R+ Y +V + L Sbjct: 49 QSVRNFLYILANSLQVGG-DNIRVSLALYGDTPTTEFLLSAYPRKGDVLKHIRGLQFKPG 107 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 ++ + E S S + + + ++ G + E Sbjct: 108 GNRMGQSLQFILEHHFPEGAGS---RASQGVPQVALVMSTGVAEDHF--------REPAE 156 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQS 391 ++ G+ +Y++ V Q LR+ S F V + L K+ ++ Sbjct: 157 ALKREGILLYAIGV--KDAAQAELREIASSPKDNFTFFVPNFSGLPGLAQKLRPELCTTL 214 Query: 392 VRI 394 ++ Sbjct: 215 AKV 217 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 18/151 (11%), Positives = 47/151 (31%), Gaps = 20/151 (13%) Query: 244 NIGIVGNQCTPLSNNLNEVK---SRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + L NN + ++ + T T A++ + Sbjct: 1093 QFSSTPREEFTLKNNYSSKDEMCRAISNVTQINSGTETGKALNFTLPFFDISQGGRPG-- 1150 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + +++I ITDG++ + +R+ + I+++ V + L Sbjct: 1151 ----VHQYLIVITDGDSHDDIVSP--------AKALRDRNIIIFAIGVGK--IQRAQLLA 1196 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 T+ + + + L +I ++ Sbjct: 1197 ITNDQDKVYHEENFESLQNLEKEILYEVCTS 1227 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 57/161 (35%), Gaps = 15/161 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEK 292 VR+ + Y+ N +V + L++L T A+ +E++ Sbjct: 671 VRVSLVFYSEKPQLKFSLNTFQNAAQVLTSLDQLTFRARRGRTKAGAALDFLRKEVFL-- 728 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + +++ + I + + N ++R G+ IY+V + + E Sbjct: 729 -PEKGSRSIWGVQQIAVIIME--------SPSLDNVSTPASHLRRTGVTIYAVGIQSASE 779 Query: 353 GQDLLRKCTDSSG-QFFAVNDSRELLESFDKITDKIQEQSV 392 +DL + T G + +L + K+ +++ Sbjct: 780 SKDLEKIATYPPGKHTIHLESFLQLDVVRKILNKKLCPETL 820 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 36/88 (40%), Gaps = 12/88 (13%) Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 T + +++I ITDG++S A + +R+ G+ IY++ V L + Sbjct: 965 TNVAQYLIVITDGQSSDPVAD--------AAQGLRDTGINIYAIGV--RDANTTELEEIA 1014 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQE 389 + + F +D L ++ I Sbjct: 1015 --NNRVFFTDDFHFLKSIHQEVVRDICS 1040 >gi|293349452|ref|XP_002727145.1| PREDICTED: similar to procollagen, type VI, alpha 3 isoform 4 [Rattus norvegicus] Length = 2114 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 28/170 (16%), Positives = 59/170 (34%), Gaps = 16/170 (9%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + ++ V++G + Y+ I S+ + V ++ L +TNT A+ Sbjct: 263 SGLDVRSDQVQVGLVQYSDNIYPAFQLKQSSLKSVVLEQIRNLPYNMGSTNTGSALEFIR 322 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 E S + + VI +TDGE++ + ++ G+ +Y V Sbjct: 323 ANYLTEMSGSRAK---DGVPQIVILVTDGESND--------EVQDAADQLKRDGVFVYVV 371 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + L+K F + L E ++ + R Sbjct: 372 GI--NIQDVQELQKIASEPFEEFLFTTENFNILQELSGSLSQAVCSTVER 419 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 55/183 (30%), Gaps = 17/183 (9%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-E 273 +S N + + ++Q ++R+ Y +V + L Sbjct: 49 QSVRNFLYILANSLQVGG-DNIRVSLALYGDTPTTEFLLSAYPRKGDVLKHIRGLQFKPG 107 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 ++ + E S S + + + ++ G + E Sbjct: 108 GNRMGQSLQFILEHHFPEGAGS---RASQGVPQVALVMSTGVAEDHF--------REPAE 156 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQS 391 ++ G+ +Y++ V Q LR+ S F V + L K+ ++ Sbjct: 157 ALKREGILLYAIGV--KDAAQAELREIASSPKDNFTFFVPNFSGLPGLAQKLRPELCTTL 214 Query: 392 VRI 394 ++ Sbjct: 215 AKV 217 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 18/151 (11%), Positives = 47/151 (31%), Gaps = 20/151 (13%) Query: 244 NIGIVGNQCTPLSNNLNEVK---SRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + L NN + ++ + T T A++ + Sbjct: 890 QFSSTPREEFTLKNNYSSKDEMCRAISNVTQINSGTETGKALNFTLPFFDISQGGRPG-- 947 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + +++I ITDG++ + +R+ + I+++ V + L Sbjct: 948 ----VHQYLIVITDGDSHDDIVSP--------AKALRDRNIIIFAIGVGK--IQRAQLLA 993 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 T+ + + + L +I ++ Sbjct: 994 ITNDQDKVYHEENFESLQNLEKEILYEVCTS 1024 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 57/161 (35%), Gaps = 15/161 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEK 292 VR+ + Y+ N +V + L++L T A+ +E++ Sbjct: 468 VRVSLVFYSEKPQLKFSLNTFQNAAQVLTSLDQLTFRARRGRTKAGAALDFLRKEVFL-- 525 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + +++ + I + + N ++R G+ IY+V + + E Sbjct: 526 -PEKGSRSIWGVQQIAVIIME--------SPSLDNVSTPASHLRRTGVTIYAVGIQSASE 576 Query: 353 GQDLLRKCTDSSG-QFFAVNDSRELLESFDKITDKIQEQSV 392 +DL + T G + +L + K+ +++ Sbjct: 577 SKDLEKIATYPPGKHTIHLESFLQLDVVRKILNKKLCPETL 617 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 36/88 (40%), Gaps = 12/88 (13%) Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 T + +++I ITDG++S A + +R+ G+ IY++ V L + Sbjct: 762 TNVAQYLIVITDGQSSDPVAD--------AAQGLRDTGINIYAIGV--RDANTTELEEIA 811 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQE 389 + + F +D L ++ I Sbjct: 812 --NNRVFFTDDFHFLKSIHQEVVRDICS 837 >gi|291547618|emb|CBL20726.1| fibro-slime domain [Ruminococcus sp. SR1/5] Length = 1928 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 43/390 (11%), Positives = 103/390 (26%), Gaps = 48/390 (12%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 + T ++ + + + S + + Q N Sbjct: 1014 VTKENNNVTTTLKNSSGTAVKDNKILNETPEEIINSSMVTSKTAKVKDWDQRTYDITINA 1073 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + + + + + S +++ S + ++ A +L+ + + Sbjct: 1074 TSTSTSSIIETKTSVADIMLVLDVSGSMGEDITSYSYTFVANNTSE-ARDDKKLLNRNVT 1132 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN---TTKSKYAPAPAPANRKIDVLIES 216 + + +S P+ S K + ++D L + Sbjct: 1133 YYIEVDGSYKEMWYYSSYNKGWRVGPRGSSDDAAKDKYNNCKIYTRTSTTETRLDALKNA 1192 Query: 217 AGNLVNSIQK---------AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ K + + R +G T S + E+K+ + Sbjct: 1193 VNQFIDDTAKKSPNSKIGITVFSSTDDYNRPYGNHGTSVSLGEVGTADSAKVTELKNFVK 1252 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L T+ + A +L +++ K+V+ TDG+ +G N+ Sbjct: 1253 DLKANGGTDPAVGLEDAKNKLDAMVDTNP---------KYVVLFTDGKPTGGGNKWNSNA 1303 Query: 328 TLQICEYM-------------RNAGMKIYSVAVSAPPEGQ---DLLR----------KCT 361 +Y++ + EG L Sbjct: 1304 QKNAETQAGELKTGLRNNVDNAKNPYTVYTIGFALNDEGDRAKTFLSGGTYDGKKDPGIA 1363 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQS 391 SS +D+ L + F I+ I + Sbjct: 1364 SSSDCAKTADDAASLTQIFQSISSTINKNV 1393 >gi|319949307|ref|ZP_08023384.1| hypothetical protein ES5_07781 [Dietzia cinnamea P4] gi|319437027|gb|EFV92070.1| hypothetical protein ES5_07781 [Dietzia cinnamea P4] Length = 326 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 21/149 (14%), Positives = 51/149 (34%), Gaps = 16/149 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + V +++L+ E T T A++ A + + + ++ ++DG+ Sbjct: 147 TTDRGPVVRAVDRLSLDERTATGEAIYTATQAITT--FTESLGGPDQAPPARIVLLSDGK 204 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA--------------PPEGQDLLRKCT 361 + + E AG+ + +++ P + L Sbjct: 205 ETVPADPTEERGAFTAAERAAEAGIPVSTISFGTLYGTVDIQGRPQPVPVDDASLRTIAE 264 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQ 390 S G FF + EL + + ++I + Sbjct: 265 LSGGDFFTASTLEELDSVYRTLEEQIGYE 293 >gi|218458490|ref|ZP_03498581.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 220 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 38/234 (16%), Positives = 72/234 (30%), Gaps = 15/234 (6%) Query: 5 IISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIF 64 + V A+ + ++ + Q+Q A + + + + Sbjct: 1 MAPVLLGAAGMAVHVGDMLLSKQQLQEAA--DSAALATATALANGKIQTSEAEAYARNFV 58 Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 Q+ +L+ G I+ Q T Y Y++ L + Sbjct: 59 AGQMANYLQSGVDIKGGTSVNVQ------TSTSGKSTSYQVTVSPSYDLSVNPL--MQAV 110 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 +LS T + S +IS+ + LD S SM + N+++ + Y Sbjct: 111 GFKTQHLSTSGTTVGGHSQTQGSISMFLALDKSGSMGES-TATVNEDDPTETFTYDCNLH 169 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 K + KI+ L +AGNL + + A + VR Sbjct: 170 YNSKNNKWV--YDKCTGSRTNYYTKIEALKIAAGNLFSQLNSA--DPNAQYVRT 219 >gi|281420095|ref|ZP_06251094.1| BatA protein [Prevotella copri DSM 18205] gi|281405895|gb|EFB36575.1| BatA protein [Prevotella copri DSM 18205] Length = 332 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 52/196 (26%), Gaps = 44/196 (22%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN----KLNPYE- 273 N + + + E + P++ + + L + Sbjct: 108 NRMEAAKDVATEFISGRPNDNIGLTIFAGEAFTQCPMTTDHASLLRLLQATRTDIAARGL 167 Query: 274 ---NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T + +A L S K VI +TDG N+ + Sbjct: 168 IDDGTAVGMGLANAVSRL----------KDSKSKSKVVILLTDGSNNMGEIS-----PMT 212 Query: 331 ICEYMRNAGMKIYSVAVSAP--------------------PEGQDLLRKCT-DSSGQFFA 369 E ++ G+++Y++ V L + G F+ Sbjct: 213 AAEIAKSYGIRVYTIGVGTNKVAPYPMPVAGGVQYVNIPVEIDTKTLSDIAQTTDGNFYR 272 Query: 370 VNDSRELLESFDKITD 385 ++ EL + + I Sbjct: 273 ATNNNELKKIYRDIDK 288 >gi|218672263|ref|ZP_03521932.1| hypothetical protein RetlG_11787 [Rhizobium etli GR56] Length = 256 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 38/256 (14%), Positives = 76/256 (29%), Gaps = 20/256 (7%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTA+++ A+D AH + +R Q+ +A + SI + + Sbjct: 19 MTALLMVPLVGTAGMAVDFAHALSLRTQLYAAA---DAAAVGSIAEKSSAVAAAMAMNGN 75 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 TI + S + ++ I++TK N + T Sbjct: 76 GTISLGKTDARNIFMSQVSGELAEVHVDLGIDVTKTANKLNSQV------SFTATVPTTF 129 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + +S +T + ++LD + SM + Sbjct: 130 MQIFGRDSITISGTATA---EYQTAAFMDFYILLDNTPSMGVGATPSDVSKLEAKTGCAF 186 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 KS + KS +IDV+ ++ L ++ + R+G Sbjct: 187 ACHQMDKSTNNYTIAKS------LGVAMRIDVVRQATQALTDTAKTERISSDQF--RMGV 238 Query: 241 IAYNIGIVGNQCTPLS 256 + + T +S Sbjct: 239 YTFGTKAEDAKLTTIS 254 >gi|149732296|ref|XP_001503293.1| PREDICTED: collagen, type XXI, alpha 1 [Equus caballus] Length = 957 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 63/176 (35%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + +++G + Y+ V ++ + + + ++ NT T A+ A Sbjct: 64 TRNFDIGPKFIQVGVVQYSDYPVLEIPLGSHDSGENLMAAMESIHYLGGNTRTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ S L K + +TDG++ E R + + +++ Sbjct: 124 LDYLF--------AKSSRFLTKIAVVLTDGKSQD--------EVKDAAEAARESKIILFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSE-TEEAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|242076422|ref|XP_002448147.1| hypothetical protein SORBIDRAFT_06g022130 [Sorghum bicolor] gi|241939330|gb|EES12475.1| hypothetical protein SORBIDRAFT_06g022130 [Sorghum bicolor] Length = 697 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 33/272 (12%), Positives = 83/272 (30%), Gaps = 32/272 (11%) Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK-YLLPP 183 P +++E + + S ++ + H +SN Sbjct: 192 PLQHIEAFENLNSGSNKTAEISSYPESQAVSQSSCLDGFDILIHVKAPTSSSNDVTGSLV 251 Query: 184 PPKKSFWSKNTTKSKYAPAPAPAN---RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 S+ + K+ +L ++ G ++ ++ + R+ Sbjct: 252 NESSMRSSRRVPIDLVTVLDVSGSMAGTKLALLKQAMGFVIQHLRPSD--------RLSV 303 Query: 241 IAYNIGIVGNQCTP--LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 IA++ + + ++ L TN A+ A + + + Sbjct: 304 IAFSSTARRLFPLQRMSHHGRQQALQAISSLGAGGGTNIADALKKAVKVIED-------- 355 Query: 299 IGSTRLKKFVIFITDGENS--------GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 +I ++DG+++ G SA + +L I + + +++ A Sbjct: 356 RNYKNSVCSIILLSDGQDTFNISSNFQGTSAGRRSLVPPSILNELHM--VPLHTFGFGAD 413 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + L S G F + D + ++F + Sbjct: 414 HDSDTLHSISEASGGTFSFIEDEGVMQDAFAQ 445 >gi|109088926|ref|XP_001104056.1| PREDICTED: anthrax toxin receptor-like isoform 1 [Macaca mulatta] Length = 557 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 75/195 (38%), Gaps = 24/195 (12%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + + +L +++ + ++ +R+ I Y+ G PL+++ N +K Sbjct: 76 YFILDKSGSVNNNWIDLYMWVEETVARFQSSDIRMCFITYSTD--GQTVLPLTSDKNRIK 133 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + L++L P +T A +++ + ++ +I +TDGE Sbjct: 134 NGLDQLRKIVPDGHTFMQAGFRKAIQQIETFNSGN-------KVPSMIIAMTDGE----L 182 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 +TL+ + R G +Y+V V D + DS FAV + F Sbjct: 183 VAHAFQDTLREAQKARKLGANVYTVGV--ADYKLDQITAIADSPEHVFAVENG------F 234 Query: 381 DKITDKIQEQSVRIA 395 + D + + ++ Sbjct: 235 KAMRDTVDALTSKVC 249 >gi|109088928|ref|XP_001104141.1| PREDICTED: anthrax toxin receptor-like isoform 2 [Macaca mulatta] Length = 564 Score = 75.3 bits (183), Expect = 1e-11, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 75/195 (38%), Gaps = 24/195 (12%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + + +L +++ + ++ +R+ I Y+ G PL+++ N +K Sbjct: 76 YFILDKSGSVNNNWIDLYMWVEETVARFQSSDIRMCFITYSTD--GQTVLPLTSDKNRIK 133 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + L++L P +T A +++ + ++ +I +TDGE Sbjct: 134 NGLDQLRKIVPDGHTFMQAGFRKAIQQIETFNSGN-------KVPSMIIAMTDGE----L 182 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 +TL+ + R G +Y+V V D + DS FAV + F Sbjct: 183 VAHAFQDTLREAQKARKLGANVYTVGV--ADYKLDQITAIADSPEHVFAVENG------F 234 Query: 381 DKITDKIQEQSVRIA 395 + D + + ++ Sbjct: 235 KAMRDTVDALTSKVC 249 >gi|260912478|ref|ZP_05919014.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] gi|260633397|gb|EEX51551.1| aerotolerance protein BatA [Prevotella sp. oral taxon 472 str. F0295] Length = 332 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 59/192 (30%), Gaps = 37/192 (19%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YEN 274 +A ++ A+ + T L N L V++ + + Sbjct: 112 AAKDVAAEFIADRPNDNIGLTIFAGEAFTQCPMTTDHTSLLNMLQTVRTDIAAKGLIQDG 171 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T + +A L S K VI +TDG N+ L+ + Sbjct: 172 TAIGMGLANAVSRL----------KDSKAKSKVVILLTDGSNNMG-----DLSPMTSANI 216 Query: 335 MRNAGMKIYSVAVSAP--------------------PEGQDLLRKCT-DSSGQFFAVNDS 373 ++ G+++Y++ V +L+ + G F+ ++ Sbjct: 217 AKSLGIRVYTIGVGTNKVARYPMPVAGGVQYVNMPVEIDTKVLKDIAASTDGNFYRATNN 276 Query: 374 RELLESFDKITD 385 +EL + + I Sbjct: 277 QELKQIYKDIDK 288 >gi|73950493|ref|XP_544451.2| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Canis familiaris] Length = 562 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 66/183 (36%), Gaps = 23/183 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+++ + + ++G + Y+ + + ++K+ + Sbjct: 245 NFELVKKFINQIVDTL-----DVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRN 299 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ + T T A+ + + + +K I TDG + Sbjct: 300 MSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFTDGRSQDYIND----- 349 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 + ++ G K+++V V +D LR+ +F D + + + ++ Sbjct: 350 ---AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTINQIGKRLQK 404 Query: 386 KIQ 388 KI Sbjct: 405 KIC 407 >gi|953237|gb|AAA99719.1| collagen type XII alpha-1 precursor [Mus musculus] Length = 3067 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 58/166 (34%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ ++ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E S K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGSRAGF-----PKVGIIITDGKSQD--------EVEIPARELRNIGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 53/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ + Sbjct: 1239 RVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPNKGGNTLTGMALNFIRQQSFKT-- 1296 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1297 ---QAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 1343 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L + D +T + Sbjct: 1344 EVELKMIATDPDDTHAYNVADFESLSKIVDDLTINLCNSV 1383 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 31/156 (19%), Positives = 57/156 (36%), Gaps = 22/156 (14%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ N + ++ +N TNT AM + +++ Sbjct: 477 SPNRVQISLVQYSRDPHTEFTLKEFNRVEDIIKAINTFPYRGGSTNTGKAMTYVREKIF- 535 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 N + + K +I ITDGE+S A +RN+ ++I++V V Sbjct: 536 ----VPNKGSRSNVPKVMILITDGESSDAFRDP--------AIKLRNSDVEIFAVGV--K 581 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKIT 384 + L F V D ++F +I+ Sbjct: 582 DAVRSELEAIASPPAETHVFTVEDF----DAFQRIS 613 >gi|88800880|ref|ZP_01116434.1| hypothetical protein MED297_00315 [Reinekea sp. MED297] gi|88776393|gb|EAR07614.1| hypothetical protein MED297_00315 [Reinekea sp. MED297] Length = 555 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 42/380 (11%), Positives = 98/380 (25%), Gaps = 42/380 (11%) Query: 33 LDAAVL----------SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLK----QGSYI 78 D AVL + + ++ + + + S Sbjct: 29 TDPAVLQEPVHTEETRAIETDSADQVFLAASKSRVEVQESYVLPSSTPIIPMPNPPVSEN 88 Query: 79 RENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGI 138 REN + D + + A Y L P+ + Sbjct: 89 RENYPKTPISPIRQVATDPVSTFSTDVD-TASYTNARRFLNQGMRPPADSIRVEEFINYF 147 Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK 198 + + + R+ Q ++ S + P + Sbjct: 148 DYALPAPDTTNTPIQISTERTQTPWNPQTELVRVSLQSYRSDFKTLPPLNLV-----FLL 202 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + K+ ++ S LV+ ++ R+ Y + Sbjct: 203 DVSGSMNSPDKLPLMQRSFNLLVSQLRPQD--------RVAIAVYAGQSGVVLEPTSGDQ 254 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ +N+L T+ +H AY + TDG+ + Sbjct: 255 KAQINQAINQLRAGGGTHGSAGIHLAYDLAQANYLPDGINR--------IFIGTDGDFNV 306 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELL 377 + L + E R AG+ + + L+ + ++ G + ++ +E Sbjct: 307 GTTSLTEL--KALIERKREAGVFLSVLGFGTGNYNDALMEELSNHGNGTAYYLDSYQEAR 364 Query: 378 ESFDKITDKIQEQSVRIAPN 397 + F ++ +A + Sbjct: 365 KLF---ATQLAATLQTVAKD 381 >gi|118353826|ref|XP_001010178.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291945|gb|EAR89933.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 547 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 22/141 (15%), Positives = 50/141 (35%), Gaps = 20/141 (14%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 S+N + +N+++ + TN M A+R L + ++DG+ Sbjct: 181 SSNKPNLIKIINEIHCHGGTNINSGMELAFRVLKE--------RKYYNPVSSIFLLSDGQ 232 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + + + +++ I+S + +G + + C+ G F+ V + Sbjct: 233 D----GGADLRVRQSLEKHLSQECFTIHSFGFGSDHDGPLMNKICSLKDGNFYYVEKINQ 288 Query: 376 LLES--------FDKITDKIQ 388 + E F I +I Sbjct: 289 VDEFFVDALGGLFSVIAQEII 309 >gi|114563846|ref|YP_751360.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] gi|114335139|gb|ABI72521.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] Length = 334 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 50/153 (32%), Gaps = 36/153 (23%) Query: 254 PLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V + L+ + T A+ A + + +I Sbjct: 143 PLTLDRRSVATFLDDAQIGLVGKQTAIGEAIALAVKRFDKV----------DESNRVLIL 192 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEG 353 +TDG N+ + + + + IY++ V A Sbjct: 193 LTDGSNNAGN-----IEPEVAAQIAAKRNITIYTIGVGAEILERRTIFGKERINPSMDLD 247 Query: 354 QDLLRK-CTDSSGQFFAVNDSRELLESFDKITD 385 +D L+K + G++F +S EL + +I Sbjct: 248 EDQLKKLAAMTKGRYFRARNSEELASIYQEIDK 280 >gi|157819693|ref|NP_001101483.1| matrilin-3 [Rattus norvegicus] gi|149050910|gb|EDM03083.1| matrilin 3 (predicted) [Rattus norvegicus] Length = 463 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 57/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + +N +K + ++ P Sbjct: 97 VKTFVSRIIDTLDIGATD-----TRVAVVNYASTVKIEFQLNTYSNKQALKQAVARITPL 151 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + + K I +TDG ++ Sbjct: 152 STGTMSGLAIQTAMEEAFTVEAGARGPTSNI--PKVAIIVTDGRPQD--------QVNEV 201 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G+++Y+V V + L+ F V +L F + Sbjct: 202 AARARASGIELYAVGV--DRADMESLKMMASKPLEDHVFYVETYGVIEKLSARFQE 255 >gi|254458905|ref|ZP_05072328.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084176|gb|EDZ61465.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 309 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 59/170 (34%), Gaps = 18/170 (10%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENT 275 + ++ + G +PL+ + N + +++L + T Sbjct: 109 TRFDVVKDIVSNFIKERQNDNIGLVVFGAYSFIASPLTYDENILNKIVSQLYIGMAGKYT 168 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + ++ L S K I +TDG ++ L + + Sbjct: 169 ALFTSLAQGVNLL----------KMSESKSKVGILLTDGFSTPEVDKIPFDVALDMA--I 216 Query: 336 RNAGMKIYSVAVSAPP-EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 + +KIY + + P ++LRK + + G+ F + EL E + +I Sbjct: 217 KEK-IKIYPIGIGMPHEYNIEVLRKIAEKTGGKAFGAASATELKEVYKEI 265 >gi|328907235|gb|EGG27001.1| von Willebrand factor type A domain protein [Propionibacterium sp. P08] Length = 318 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 23/230 (10%), Positives = 60/230 (26%), Gaps = 32/230 (13%) Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 K P ++ S+ A ++ +A + + + R Sbjct: 77 KAYHNVPRDRATVVVAIDVSRSMVATDVDPSRLSAAKTAAKDFLGDLP----------PR 126 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 P + + V + + L +T ++ + L + Sbjct: 127 FNVSLVKFAASSQVVVPPTTDRAVVSTAIANLQVLPSTAIGEGIYSSLNALKLVPDD--P 184 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----- 352 + ++ ++DG + L+ + + +Y++A Sbjct: 185 KHPGQKPPAAIVLLSDGATNVGRPS------LEAAKEAGRQHVPVYTIAYGTAGGYVVEG 238 Query: 353 --------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L S G+ F+ +L + + I + + V Sbjct: 239 GQRQPVPVNHYELAAVAKASGGEKFSAESLGQLSDVYKSIAQSVGYEKVF 288 >gi|320352629|ref|YP_004193968.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121131|gb|ADW16677.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 577 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 40/308 (12%), Positives = 84/308 (27%), Gaps = 33/308 (10%) Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 IN +D + + A Y + L P + + + Sbjct: 107 INAARDPLSTFSIDVD-TASYTNVRRFIQGGHLPPVGAVRIEEMINYFTYAYPRPIGKAP 165 Query: 151 CMVLDVSRS--MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + YL K LPP Sbjct: 166 FALGAEVGPSPFHRDYLLARIGLAAKDLAKEHLPPSNLVFLID--------VSGSMQDGN 217 Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K+ +L ++ +V + R+ + Y + E+ + L++ Sbjct: 218 KLPLLKQALPLVVRQLGA--------RDRVALVVYAGADSVVLPPTPGDRQQEILAALDQ 269 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L +T+ + AY N VI +DG+ + ++ L Sbjct: 270 LQAGGSTHASSGIRTAYELARKSFIKGGNNR--------VILASDGDFNVGVTSRDEL-- 319 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKI 387 ++ E R G+ + + + D + D G + ++ LLE+ + ++ Sbjct: 320 TRLIEEERKDGIYLTVLGLGMGNYHDDTMEVLADKGNGNYAYIDS---LLEAKKVLVKEM 376 Query: 388 QEQSVRIA 395 +A Sbjct: 377 SGTLFALA 384 >gi|260437096|ref|ZP_05790912.1| putative von Willebrand factor type A domain protein [Butyrivibrio crossotus DSM 2876] gi|292810406|gb|EFF69611.1| putative von Willebrand factor type A domain protein [Butyrivibrio crossotus DSM 2876] Length = 623 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 83/237 (35%), Gaps = 26/237 (10%) Query: 148 ISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN 207 S MV+D E + N+N ++ S + + Sbjct: 68 FSKYMVVDKEEWFEAWNKEIKYPNSNNIVFDTVILIDCSGSMRTNDPDFEYSVKNTL-YP 126 Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + L K + + R G + + N L+N+ + + ++ Sbjct: 127 GSSYQITTCYRKL---ASKNYVKAQGNDDRTGIVLFTSEA--NTVCELTNSEYVLMNAID 181 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 K+ TN A+ + R L N + S +K ++ ++DGE+ +S+ Sbjct: 182 KIYSNGGTNFNNAIKESIRILTNTRNDS---------EKRILLVSDGESELSSS------ 226 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 + + +KI +V + +LL+ + + G++F + EL+ + +I Sbjct: 227 ---VIDLAIENNIKINTVYIG-GQNNNELLKNVAERTGGKYFKAVTADELINIYSEI 279 >gi|226314609|ref|YP_002774505.1| hypothetical protein BBR47_50240 [Brevibacillus brevis NBRC 100599] gi|226097559|dbj|BAH46001.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 947 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 32/192 (16%), Positives = 68/192 (35%), Gaps = 26/192 (13%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 KS + A K+ + E+A + IG IA++ Sbjct: 410 LVIDKSGSMSSDARGADKMALAREAAIRATTMMNAQDY--------IGVIAFDDTPWDVV 461 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+E++ +++++ T+ +PA+ Y + K VI + Sbjct: 462 APQSVTKLDEIQQQISRIQADGGTDIFPALQLGYERVKAMNTQR----------KHVILL 511 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAV 370 TDG+ + + + M + + +VA+ + LL + G+++ Sbjct: 512 TDGQ------SALDDDYEGLLQQMTAENITVSTVALG-DDSDRGLLEMIAELGKGRYYFA 564 Query: 371 NDSRELLESFDK 382 ND+ + + F K Sbjct: 565 NDAESIPKIFSK 576 >gi|171910783|ref|ZP_02926253.1| Protein containing von Willebrand factor (vWF) type A domain [Verrucomicrobium spinosum DSM 4136] Length = 917 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 42/397 (10%), Positives = 97/397 (24%), Gaps = 52/397 (13%) Query: 38 LSGC--ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 L+ + + + + + G I + K Sbjct: 343 LAAAPVMLDATPQLGTPAEGASARARQPAQPAAGAKAQPGVTITTGSNFYVGTDSTKADK 402 Query: 96 DKNNPLQYIAESKAQYEIPTENL-----FLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 D + TE L G + + + S+ ++ + Sbjct: 403 DAGRTGDTTRFGRGLPPEVTEPLRQRESVSAGETYTPIYENPFMAVAQEPLSTFSIDVDT 462 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 +V R + + Y + + A AP + Sbjct: 463 ASYANVRRFLNSGQRPPADAVRLEELINYFPYAYEGPT--DDRPFGVQVDMAEAPWKPEH 520 Query: 211 DVLIESAGNLVNSIQKAIQEKK-----------------------------NLSVRIGTI 241 + + VN ++A + R+ + Sbjct: 521 RLARIAIKGRVNQQERAPANFVFLVDVSGSMDEPDKLPLVKQSLRMLTERLSTKDRVAIV 580 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 Y + + + ++ L +TN + AY + + Sbjct: 581 TYAGSTAVILPSTAGTEKSRIIEAIDGLGAGGSTNGAGGIRLAYEQAQQHFQKEGVNR-- 638 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 VI TDG+ + + L I E ++ + + + A + Sbjct: 639 ------VILCTDGDFNVGISSPGELQ-KLIEEKAKSR-VFLSVLGFGAGNLKDRTMETLA 690 Query: 362 DSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 D G + ++ L E+ + +++ V IA + Sbjct: 691 DKGNGNYAYIDS---LSEARKVLVEQMNATLVTIAKD 724 >gi|119628048|gb|EAX07643.1| matrilin 1, cartilage matrix protein, isoform CRA_b [Homo sapiens] Length = 519 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 42/378 (11%), Positives = 113/378 (29%), Gaps = 52/378 (13%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKN 98 + VS + + ST + + G ++ I+ I Sbjct: 96 RAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDG 155 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 P + + A+ LF G+ L ++ + + S ++ +SR Sbjct: 156 RPQDSVQDVSARARASGVELFAIGVGSVDKATLRQIASEPQD-EHVDYVESYSVIEKLSR 214 Query: 159 SMEDLY----LQKHNDNNNMTSNKYLLPPPPKKSF---WSKNTTKSKYAPAPAPANR--- 208 ++ + +++ P + ++ N+ Sbjct: 215 KFQEAFCVVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSAT 274 Query: 209 ---------------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 +++ + +V+++ + + ++G + Y+ + Sbjct: 275 DLVFLIDGSKSVRPENFELVKKFISQIVDTL-----DVSDKLAQVGLVQYSSSVRQEFPL 329 Query: 254 PLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + ++K+ + ++ + T T A+ + + + +K I T Sbjct: 330 GRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFT 384 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAV 370 DG + + ++ G K+++V V +D LR+ +F Sbjct: 385 DGRSQDYIND--------AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYT 434 Query: 371 NDSRELLESFDKITDKIQ 388 D + + + K+ KI Sbjct: 435 ADFKTINQIGKKLQKKIC 452 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 53/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 72 DVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKA 131 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ S + K VI +TDG + + R +G++++++ V Sbjct: 132 F--GDAEGGRSRSPDISKVVIVVTDGRPQDS--------VQDVSARARASGVELFAIGVG 181 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L F + Sbjct: 182 --SVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQE 218 >gi|4505111|ref|NP_002370.1| cartilage matrix protein precursor [Homo sapiens] gi|115556|sp|P21941|MATN1_HUMAN RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|1732121|gb|AAB38702.1| cartilage matrix protein [Homo sapiens] gi|56205026|emb|CAI19322.1| matrilin 1, cartilage matrix protein [Homo sapiens] gi|182887817|gb|AAI60064.1| Matrilin 1, cartilage matrix protein [synthetic construct] gi|189066540|dbj|BAG35790.1| unnamed protein product [Homo sapiens] Length = 496 Score = 75.3 bits (183), Expect = 2e-11, Method: Composition-based stats. Identities = 42/378 (11%), Positives = 113/378 (29%), Gaps = 52/378 (13%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKN 98 + VS + + ST + + G ++ I+ I Sbjct: 96 RAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDG 155 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 P + + A+ LF G+ L ++ + + S ++ +SR Sbjct: 156 RPQDSVQDVSARARASGVELFAIGVGSVDKATLRQIASEPQD-EHVDYVESYSVIEKLSR 214 Query: 159 SMEDLY----LQKHNDNNNMTSNKYLLPPPPKKSF---WSKNTTKSKYAPAPAPANR--- 208 ++ + +++ P + ++ N+ Sbjct: 215 KFQEAFCVVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSAT 274 Query: 209 ---------------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 +++ + +V+++ + + ++G + Y+ + Sbjct: 275 DLVFLIDGSKSVRPENFELVKKFISQIVDTL-----DVSDKLAQVGLVQYSSSVRQEFPL 329 Query: 254 PLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + ++K+ + ++ + T T A+ + + + +K I T Sbjct: 330 GRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFT 384 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAV 370 DG + + ++ G K+++V V +D LR+ +F Sbjct: 385 DGRSQDYIND--------AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYT 434 Query: 371 NDSRELLESFDKITDKIQ 388 D + + + K+ KI Sbjct: 435 ADFKTINQIGKKLQKKIC 452 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 53/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 72 DVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKA 131 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ S + K VI +TDG + + R +G++++++ V Sbjct: 132 F--GDAEGGRSRSPDISKVVIVVTDGRPQDS--------VQDVSARARASGVELFAIGVG 181 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L F + Sbjct: 182 --SVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQE 218 >gi|301058342|ref|ZP_07199375.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447578|gb|EFK11310.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 331 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 30/200 (15%), Positives = 65/200 (32%), Gaps = 40/200 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENT 275 + +++K + + G +PL+ + + +N++ + T Sbjct: 112 TRLEAVKKVVADFIGKRETDRIGLVVFGEEAFTQSPLTIDKGLLLELVNRMKIGMAGDRT 171 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 A+ + L K +I +TDG N+ + Sbjct: 172 AIGSAIAIGGKRL----------KDLKSKSKILILLTDGRNNAGEIS-----PQAAARAV 216 Query: 336 RNAGMKIYSVAVSAPPE--------------------GQDLLRKCT-DSSGQFFAVNDSR 374 R G+K+Y++ V + LR G++F +S+ Sbjct: 217 REFGIKLYTIGVGGKGPAPFRMKTLFGTRLVPQHVDLDEVTLRNVAKTGGGKYFRAANSQ 276 Query: 375 ELLESFDKITDKIQEQSVRI 394 EL E + I D+ ++ V++ Sbjct: 277 ELQEIY-DIIDRAEKTDVKV 295 >gi|162147499|ref|YP_001601960.1| hypothetical protein GDI_1715 [Gluconacetobacter diazotrophicus PAl 5] gi|161786076|emb|CAP55658.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 571 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 38/385 (9%), Positives = 95/385 (24%), Gaps = 77/385 (20%) Query: 83 GDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERS 142 G + P N + +T Sbjct: 187 GGGTGADTVPNLWVSIVPFAGEMNIFGSTYGGPSNWQSMP-SGWLTAGSDISTTRYGSNG 245 Query: 143 SENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA 202 ++ + S + + N + Y K S S S Sbjct: 246 WMGCVMARYSGYNNSPAH-IYDVNDANPIQAPFTPFYWPSTYHKYSQSSWFGGTSWVVGD 304 Query: 203 PAPANRKIDVL-IESAGNLVNSIQKAI-------QEKKNLSVRIGTIAYNIGIVGNQCTP 254 V +A +L + ++ ++ + N+G + P Sbjct: 305 NDWILSGGVVTPSSAARSLYGQMAESPLITSFPTTSGSLVTESGLQVGPNLGCDPSPTLP 364 Query: 255 LSNNLNEVKSRLNKL--NPYENTNTYPAMHHAYRELYNEKESSHN------TIGSTRLKK 306 + + + V++ ++ + T A+ + + + + + K Sbjct: 365 ETASRSVVEAHISSMPMMSRGGTMLPQALQAGWFTISPNWQGFWPNPALPLAYNTPNMTK 424 Query: 307 FVIFITDGEN-------------------------------------------SGASAYQ 323 ++ +TDG N +G Y Sbjct: 425 VLVLMTDGNNQICPCFPVYNYYGPVAPPQSNGDTDMVAYGRLLQDELGVVSSYNGNGYYG 484 Query: 324 NT-----------LNTLQICEYMRNAGMKIYSV-----AVSAPPEGQDLLRKCTDSSGQF 367 + +C+ ++N+G+ IY + A Q +L+ C G + Sbjct: 485 SNGFSSNILPEMNSLVSTVCDNIKNSGITIYVILYTHEGEEADATTQAMLQNCASKPGNY 544 Query: 368 FAVNDSRELLESFDKITDKIQEQSV 392 + + + ++F + ++ + Sbjct: 545 YDAPTAASMKQAFSDLGGQLSALRI 569 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 33/278 (11%), Positives = 74/278 (26%), Gaps = 31/278 (11%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIV--SDRTIKDPTTKKDQ 59 A+ ++LA I ++ ++Q+ALDAA + + ++ + D Sbjct: 14 MAVCAFAMLAISMMGVELARIYIVQERLQTALDAASIVAAREMSAVNNVGTCTGSCASDT 73 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 T+ + H G + N ++ Q L Sbjct: 74 TAIFWANFSSAHQANGLGPFQAVSTGPVI-------TPQNASTITIQANVQL-----PLL 121 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 ++ + LS + N+ + + +VLD + S+E ++ + + Sbjct: 122 FTKILGVSQIALSEHAQA----VRSNMGMELALVLDNTDSLEAQGIEDLQCGAKILVDTV 177 Query: 180 LLPPPPKK-----------SFWSKNTTKSKYAPAPAPANRKIDVLIE--SAGNLVNSIQK 226 P + W + S S Sbjct: 178 YGVAAPGSCGGGTGADTVPNLWVSIVPFAGEMNIFGSTYGGPSNWQSMPSGWLTAGSDIS 237 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 + N + Y+ N+ N +++ Sbjct: 238 TTRYGSNGWMGCVMARYSGYNNSPAHIYDVNDANPIQA 275 >gi|148698183|gb|EDL30130.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] Length = 456 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 65/183 (35%), Gaps = 23/183 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+++ + + ++G + Y+ I + ++K+ + Sbjct: 250 NFELVKKFINQIVDTL-----DVSDRLAQVGLVQYSSSIRQEFPLGRFHTKKDIKAAVRN 304 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ + T T A+ + + + +K I TDG + Sbjct: 305 MSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFTDGRSQDYIND----- 354 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 ++ G K+++V V ++ LR+ +F D + + + K+ Sbjct: 355 ---AARKAKDLGFKMFAVGVG--NAVEEELREIASEPVADHYFYTADFKTINQIGKKLQK 409 Query: 386 KIQ 388 +I Sbjct: 410 QIC 412 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 32 DVGPNATRVGLVNYASTVKPEFPLRAHGSKASLLQAVRRIQPLSTGTMTGLALQFAITKA 91 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E S + K VI +TDG + + E R +G++++++ V Sbjct: 92 LSDAEGGRAR--SPDISKVVIVVTDGRPQDS--------VRDVSERARASGIELFAIGVG 141 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L + F + Sbjct: 142 R--VDKATLRQIASEPQDEHVDYVESYNVIEKLAKKFQE 178 >gi|126153367|gb|AAI31711.1| MATN1 protein [Homo sapiens] Length = 480 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 42/378 (11%), Positives = 113/378 (29%), Gaps = 52/378 (13%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKN 98 + VS + + ST + + G ++ I+ I Sbjct: 80 RAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDG 139 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 P + + A+ LF G+ L ++ + + S ++ +SR Sbjct: 140 RPQDSVQDVSARARASGVELFAIGVGSVDKATLRQIASEPQD-EHVDYVESYSVIEKLSR 198 Query: 159 SMEDLY----LQKHNDNNNMTSNKYLLPPPPKKSF---WSKNTTKSKYAPAPAPANR--- 208 ++ + +++ P + ++ N+ Sbjct: 199 KFQEAFCVVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSAT 258 Query: 209 ---------------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 +++ + +V+++ + + ++G + Y+ + Sbjct: 259 DLVFLIDGSKSVRPENFELVKKFISQIVDTL-----DVSDKLAQVGLVQYSSSVRQEFPL 313 Query: 254 PLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + ++K+ + ++ + T T A+ + + + +K I T Sbjct: 314 GRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFT 368 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAV 370 DG + + ++ G K+++V V +D LR+ +F Sbjct: 369 DGRSQDYIND--------AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYT 418 Query: 371 NDSRELLESFDKITDKIQ 388 D + + + K+ KI Sbjct: 419 ADFKTINQIGKKLQKKIC 436 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 53/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 56 DVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKA 115 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ S + K VI +TDG + + R +G++++++ V Sbjct: 116 F--GDAEGGRSRSPDISKVVIVVTDGRPQDS--------VQDVSARARASGVELFAIGVG 165 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L F + Sbjct: 166 --SVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQE 202 >gi|171846231|ref|NP_034899.2| cartilage matrix protein precursor [Mus musculus] gi|28503027|gb|AAH47140.1| Matrilin 1, cartilage matrix protein [Mus musculus] gi|66350790|emb|CAC79633.1| cartilage matrix protein [Mus musculus] gi|123295072|emb|CAM17812.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] gi|187466486|emb|CAQ51562.1| matrilin 1, cartilage matrix protein 1 [Mus musculus] Length = 500 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 65/183 (35%), Gaps = 23/183 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+++ + + ++G + Y+ I + ++K+ + Sbjct: 294 NFELVKKFINQIVDTL-----DVSDRLAQVGLVQYSSSIRQEFPLGRFHTKKDIKAAVRN 348 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ + T T A+ + + + +K I TDG + Sbjct: 349 MSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFTDGRSQDYIND----- 398 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 ++ G K+++V V ++ LR+ +F D + + + K+ Sbjct: 399 ---AARKAKDLGFKMFAVGVG--NAVEEELREIASEPVADHYFYTADFKTINQIGKKLQK 453 Query: 386 KIQ 388 +I Sbjct: 454 QIC 456 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 76 DVGPNATRVGLVNYASTVKPEFPLRAHGSKASLLQAVRRIQPLSTGTMTGLALQFAITKA 135 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E S + K VI +TDG + + E R +G++++++ V Sbjct: 136 LSDAEGGRAR--SPDISKVVIVVTDGRPQDS--------VRDVSERARASGIELFAIGVG 185 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L + F + Sbjct: 186 R--VDKATLRQIASEPQDEHVDYVESYNVIEKLAKKFQE 222 >gi|330973664|gb|EGH73730.1| von Willebrand factor, type A [Pseudomonas syringae pv. aceris str. M302273PT] Length = 352 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 53/146 (36%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ A + L + ++ Sbjct: 150 PLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL----------RMRPATSRALVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY++ + + P+ + Sbjct: 200 VTDGANNAGQ-----IDPVTAARLAAEEGVKIYAIGIGSDPDKDALQSVLGLNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 255 PTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|119628047|gb|EAX07642.1| matrilin 1, cartilage matrix protein, isoform CRA_a [Homo sapiens] Length = 496 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 42/378 (11%), Positives = 113/378 (29%), Gaps = 52/378 (13%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKN 98 + VS + + ST + + G ++ I+ I Sbjct: 96 RAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDG 155 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 P + + A+ LF G+ L ++ + + S ++ +SR Sbjct: 156 RPQDSVQDVSARARASGVELFAIGVGSVDKATLRQIASEPQD-EHVDYVESYSVIEKLSR 214 Query: 159 SMEDLY----LQKHNDNNNMTSNKYLLPPPPKKSF---WSKNTTKSKYAPAPAPANR--- 208 ++ + +++ P + ++ N+ Sbjct: 215 KFQEAFCVVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSAT 274 Query: 209 ---------------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 +++ + +V+++ + + ++G + Y+ + Sbjct: 275 DLVFLIDGSKSVRPENFELVKKFISQIVDTL-----DVSDKLAQVGLVQYSSSVRQEFPL 329 Query: 254 PLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + ++K+ + ++ + T T A+ + + + +K I T Sbjct: 330 GRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFT 384 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAV 370 DG + + ++ G K+++V V +D LR+ +F Sbjct: 385 DGRSQDYIND--------AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYT 434 Query: 371 NDSRELLESFDKITDKIQ 388 D + + + K+ KI Sbjct: 435 ADFKTINQIGKKLQKKIC 452 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 53/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 72 DVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKA 131 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ S + K VI +TDG + + R +G++++++ V Sbjct: 132 F--GDAEGGRSRSPDISKVVIVVTDGRPQDS--------VQDVSARARASGVELFAIGVG 181 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L F + Sbjct: 182 --SVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQE 218 >gi|66044963|ref|YP_234804.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] gi|63255670|gb|AAY36766.1| von Willebrand factor, type A [Pseudomonas syringae pv. syringae B728a] Length = 352 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 53/146 (36%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ A + L + ++ Sbjct: 150 PLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL----------RMRPATSRALVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY++ + + P+ + Sbjct: 200 VTDGANNAGQ-----IDPVTAARLAAEEGVKIYAIGIGSDPDKDALQSVLGLNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 255 PTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|329894014|ref|ZP_08270022.1| BatA [gamma proteobacterium IMCC3088] gi|328923357|gb|EGG30676.1| BatA [gamma proteobacterium IMCC3088] Length = 339 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 30/166 (18%), Positives = 64/166 (38%), Gaps = 38/166 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKE 293 R+G I + +PLS + + VK L++ T A+ A + L ++++ Sbjct: 136 RVGLILFGSRAYMQ--SPLSFDRDTVKQFLSEAQIGFAGSETAIGDALGLAVKRLRDKED 193 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 + VI +TDG+++ ++++ L N G+K+Y++ + A Sbjct: 194 G----------DRVVILLTDGQDTA-----SSVDPLDATALAANYGVKVYTIGIGADEML 238 Query: 353 ----------------GQDLLRKCT-DSSGQFFAVNDSRELLESFD 381 ++ L + G++F EL + +D Sbjct: 239 VPSLFGNRRVNPSAELDEETLSAMAESTGGRYFRARSPDELAKIYD 284 >gi|317419330|emb|CBN81367.1| Integrin alpha-M [Dicentrarchus labrax] Length = 1058 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 32/254 (12%), Positives = 75/254 (29%), Gaps = 34/254 (13%) Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK-SKYAPAPAPANRKIDVLIESA 217 S+ N + + + ++ + + +P +D A Sbjct: 2 SLGLTMTNDPTTQNTLACGPTIPKDCKSITMYNGLCFQIDRSNRVGSPIPSSLDECRSEA 61 Query: 218 G-----NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-----------NLNE 261 + S+ K V+ ++ SN ++N Sbjct: 62 DIAFLLDGSGSVASQDFTKMKDFVKNLVNSFQGKDTKFAIAQFSNAPLVHYYFDTFDINN 121 Query: 262 VKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +++++++ T T A+ H +++ S +KK +I ITDGE Sbjct: 122 WRTQIDRIRQLTGGTYTAAAIEHVVNNVFDPSRGSRL-----NVKKVLIVITDGE----- 171 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVS---APPEGQDLLRKCTDSS--GQFFAVNDSRE 375 + + + + +++ V + + L F V++ Sbjct: 172 -SHDRRDLPYAASLAQGKNIVRFAIGVGGAFSNVAAKQELDTIASDPPASHVFRVDNFGA 230 Query: 376 LLESFDKITDKIQE 389 L + + DKI Sbjct: 231 LEQIRQNLQDKIFS 244 >gi|148694466|gb|EDL26413.1| procollagen, type XII, alpha 1, isoform CRA_b [Mus musculus] gi|148694467|gb|EDL26414.1| procollagen, type XII, alpha 1, isoform CRA_b [Mus musculus] Length = 2886 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 58/166 (34%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ ++ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYRREDLLAAVKKIPYKGGNTMTGDAIDYLV 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E S K I ITDG++ +RN G++++S+ Sbjct: 228 KNTFTESAGSRAGF-----PKVAIIITDGKSQDEVEIP--------ARELRNIGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 53/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ + Sbjct: 1235 RVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKT-- 1292 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1293 ---QAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 1339 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L + D +T + Sbjct: 1340 EVELKMIATDPDDTHAYNVADFESLSKIVDDLTINLCNSV 1379 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 63/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ N + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKEFNRVEDIIKAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 N + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPNKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|120616160|gb|ABG80452.1| collagen [Hydra vulgaris] Length = 2439 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 53/159 (33%), Gaps = 12/159 (7%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYN 290 + VR+G + Y+ S + +VK ++ + T + A Sbjct: 1873 SSSGVRVGVLTYSDEAKIRIRFDYSFDKEDVKKAIDNIPYDSMGTRIDLGLEAAKELFLE 1932 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + ++ K +I +TDG+ + + + + + + G+ I+++ + Sbjct: 1933 KSGGRGSSK------KVLILLTDGQQT---YIPDAKDPVDYAKELAEYGVDIFAIGIGDE 1983 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSREL-LESFDKITDKIQ 388 D L F +D L + I+ + Sbjct: 1984 INKVD-LEDLISKPQHIFLSDDINSLITDLSKDISTALS 2021 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 50/132 (37%), Gaps = 5/132 (3%) Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++ + ++K+ + A ++ + E +S ++K ++ + DG SG Sbjct: 1315 TDLFNAIDKIKHS--QADESRLDLALKKAHEELFTSQGARSDKDIEKAIVILGDGYISGG 1372 Query: 320 SAYQNTL--NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 L + + +R G+ I+++ V A P LL+ + V D +L+ Sbjct: 1373 GNRSRDLIESAKKEAAKLRELGVLIFTIGVGAEPNS-LLLQNFASKKTYYITVKDYGQLI 1431 Query: 378 ESFDKITDKIQE 389 + I Sbjct: 1432 GKIGALKTSISS 1443 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 67/191 (35%), Gaps = 18/191 (9%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + A +V+S + R + Y ++ Sbjct: 11 SSSSVGELAYEEMKKFAHQVVDSFS-----ISQQNARFAALVYGSNASVEFNFVRYDSAL 65 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+K + L+ NT A+ A +L++ + + + DG + Sbjct: 66 EIKQAIQSLSYLKSNTRIDKALEVAKSDLFSLQGKVRSRR-----PMILYVFFDGTVT-- 118 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 ++ + + + +++ G+KI ++ V L+K ++ + F+ +EL Sbjct: 119 ---RSMSDLESVVQPLKDYGVKIIAIGVGPEVNRYQ-LKKISEDNA-IFSGKSFKELAPL 173 Query: 380 FDKITDKIQEQ 390 I ++ + Sbjct: 174 LYSIVEQSCSE 184 >gi|285019106|ref|YP_003376817.1| von willebrand factor, type a protein [Xanthomonas albilineans GPE PC73] gi|283474324|emb|CBA16825.1| putative von willebrand factor, type a protein [Xanthomonas albilineans] Length = 343 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 31/191 (16%), Positives = 67/191 (35%), Gaps = 37/191 (19%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYE 273 + + + + + + + G TPL+ +L V+ +L Sbjct: 128 VVDRLTAAKAVLADFLDRRDGDRIGLLVFGQQAYALTPLTADLATVRDQLRDSVVGLAGR 187 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T A+ A + L + + + +I +TDG N+ L+ L+ E Sbjct: 188 ETALGDAIALAVKRLREQPQGE----------RVLILLTDGVNTAGV-----LDPLKAAE 232 Query: 334 YMRNAGMKIYSVAVSAPPE------------------GQDLLRKCT-DSSGQFFAVNDSR 374 + +++Y++A+ +D LRK D+ G+FF D+ Sbjct: 233 LAKAEHVRVYTIALGGDGGGMSLFGMPIPGSGGDDEVDEDTLRKIAQDTGGRFFRARDTA 292 Query: 375 ELLESFDKITD 385 +L + ++ Sbjct: 293 QLASIYAELDR 303 >gi|255526268|ref|ZP_05393185.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296186262|ref|ZP_06854666.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] gi|255510048|gb|EET86371.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296049063|gb|EFG88493.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] Length = 422 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 32/177 (18%), Positives = 67/177 (37%), Gaps = 19/177 (10%) Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN----NLNEVKSRLNKLN-PYENTN 276 A+ N+ Y + P+S + EV +L + P NTN Sbjct: 134 ERFSAALNLIDNMDKNNRFSMYKFDDTAEKIIPMSQVTKQSREEVSGKLKDMQNPKGNTN 193 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ AY E+ + + VI ++DG G + + + + + Sbjct: 194 MRDALEKAYEEIKSS--------ETKDKNAMVIMLSDG---GDTYDLSKKFDETL-KPFK 241 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + IY++ + +L++ +S G ++ V + ++L F+KI Q++ + Sbjct: 242 EKNISIYTIG-MSNGNNFSMLKEIAKESGGNYYNVKEIKDLKNVFNKIYRDRQQRLL 297 >gi|239620965|ref|ZP_04663996.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239516066|gb|EEQ55933.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 816 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 58/391 (14%), Positives = 115/391 (29%), Gaps = 74/391 (18%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A+++ A + + D ++ + Q S + + + T Sbjct: 24 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPDDPTL 83 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + + N++ + N + I +VLD Sbjct: 84 SA-PAREKTVTANEDGTYTV------------ALNVTGAKSAGTGEIVTNQPLDIVLVLD 130 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 VS SM + N + S K + + T +I ++ Sbjct: 131 VSGSMAEKIASGWNQPTKIDSLKTAVN------KFINATAAENAKITDQSQRNRIALVKF 184 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + N R G +YN + L+ +++ + S +N L+ T Sbjct: 185 A--------GTEKTSVGNDFYREGWSSYN---YTQIVSNLTYDVSGLTSTVNGLSASGAT 233 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI--CE 333 + A + A L + + KK VIF TDGE + S + T+ + + Sbjct: 234 SADYAFNRAQAALTYQPRA--------NAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAK 285 Query: 334 YMRNAGMKIYSVAV-------SAPPEGQDLLRKC-------------------------- 360 +++AG IYS+ V + Sbjct: 286 SLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGD 345 Query: 361 -TDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++S + A D+ +L F+ I +I + Sbjct: 346 RAETSSYYKAATDAGQLNNIFESIYQEITKT 376 >gi|238789155|ref|ZP_04632943.1| hypothetical protein yfred0001_29950 [Yersinia frederiksenii ATCC 33641] gi|238722687|gb|EEQ14339.1| hypothetical protein yfred0001_29950 [Yersinia frederiksenii ATCC 33641] Length = 522 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 41/330 (12%), Positives = 99/330 (30%), Gaps = 14/330 (4%) Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 + I+ N +I + + N + + PS+ Sbjct: 200 NEIKYNNVNIDIIGFVPFSWGTKNFYSSTLANMEKESFCHFPFVPNQFSPSSDYLAKYNF 259 Query: 136 TGIIERSS--ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 + S + + ++D ++ E Y N N L + Sbjct: 260 SEKYNISDLKKFPELKNLDIVDRIKNGECTYNNYRKIIN--EINDKLHYEVDLDESYVNE 317 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 S +I ++ES + +++ + +++++ N I Sbjct: 318 IYDSLDLACNMAYFDEIADIVESNIDYTKTLESINKADNTINIKM-VDMPNNSICLRGSK 376 Query: 254 PLSNNL----NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + + N S++ T + ++S+N + Sbjct: 377 AFTFDRHNRNNTSISKILGTRATGGTLISSGILTGNNIFLET-DNSYNKLMIIISDGDDS 435 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 TD E + +N + +CE +++ G+K+ +A+ P R+C F+ Sbjct: 436 RQTDKEKRYYNISKNLIKD-GMCEKIKDNGIKMAFIAIGYVPREDIDWRRCV-GEENFYF 493 Query: 370 VNDSRELL-ESFDKITDKIQEQSVRIAPNR 398 ++ EL + + + + R P + Sbjct: 494 AKNAHELELDIRQALVG-VDSEVGRNIPKK 522 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 29/254 (11%), Positives = 70/254 (27%), Gaps = 14/254 (5%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 I+ + + + +++H + + ++ A++ A L+ + K++ S Sbjct: 29 FIFILPIFIGLVFLSFEISHFIQKKARLSDAIEQATLALTVDNDESPDDDNIK--KEKNS 86 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 K +L + + + IN Y + Y N + Sbjct: 87 KFIINYAKAYLPNEKFSKPVINITSHSDYIN----------YQVDMTIYYPTKILNKIFQ 136 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + S ++S + + ++++ + V D S SM + + + + + Sbjct: 137 TV--SPEVSISDNARALKYTTTDSKPTDVVFVADYSGSMNEYFDESDESDEKKIVALRRI 194 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 + N P K A S S + Sbjct: 195 FKDIQNEIKYNNVNIDIIGFVPFSWGTKNFYSSTLANMEKESFCHFPFVPNQFSPSSDYL 254 Query: 242 AYNIGIVGNQCTPL 255 A + L Sbjct: 255 AKYNFSEKYNISDL 268 >gi|217972770|ref|YP_002357521.1| von Willebrand factor type A [Shewanella baltica OS223] gi|217497905|gb|ACK46098.1| von Willebrand factor type A [Shewanella baltica OS223] Length = 340 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 31/204 (15%), Positives = 64/204 (31%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + ++ + + + RIG I + PL+ + V Sbjct: 104 NGKTVDRFTLIQHVVSDFIERRK---------GDRIGLILFADHAYLQA--PLTQDRRSV 152 Query: 263 KSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 L ++ T A+ A + + +I +TDG N+ Sbjct: 153 AQFLKEAQIGLVGKQTAIGEAIALAVKRFDKM----------DESNRVLILLTDGSNNAG 202 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD 362 + + Q + N + IY+V V A ++ L+ D Sbjct: 203 N-----IEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAD 257 Query: 363 -SSGQFFAVNDSRELLESFDKITD 385 + G++F +S+EL + + +I Sbjct: 258 VTHGRYFRARNSQELDQIYQEIDK 281 >gi|47217757|emb|CAG05979.1| unnamed protein product [Tetraodon nigroviridis] Length = 1380 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 69/200 (34%), Gaps = 22/200 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV+S + R+ + Y+ L Sbjct: 16 TSSSVGKENFEKIRQWVANLVDSF-----DVAPDKTRVAVVRYSDRPTTEFNLARYRTLE 70 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +VK + NT T A+ + ++ E+ + + +++ I +TDG + Sbjct: 71 DVKRAARNIRYLGGNTMTGDAISYTTSNIFTERNGARPI--ARGIQRVAILLTDGRSQDY 128 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 + + AG+++++V + + L + F V D + Sbjct: 129 VLEPS--------KAAAKAGIRMFAVGIG--EALKVELDEIAAEPKNAHVFHVTDFNAID 178 Query: 378 ESFDKITDKIQEQSVRIAPN 397 + ++ ++ E + PN Sbjct: 179 KIRGRLRKRLCENV--LCPN 196 >gi|306823858|ref|ZP_07457232.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309802423|ref|ZP_07696530.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] gi|304552856|gb|EFM40769.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221023|gb|EFO77328.1| conserved repeat protein [Bifidobacterium dentium JCVIHMP022] Length = 1136 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 39/286 (13%), Positives = 87/286 (30%), Gaps = 54/286 (18%) Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 + N+ + + + +VLDVS SM + ++ L Sbjct: 578 NTYTVNVDVTGAASSSTITTTQPVDFTLVLDVSGSMRENMGSVTKLQALQSAVNNFLDEA 637 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 K + ++ ++ + K Q+K N Sbjct: 638 AKINK----------GAQSGSEPVRVGLVKFAGNATKKIGNKTYQDKWNTYN-------- 679 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 L+ + + +K+ +NKL T HA+ + + + Sbjct: 680 ---YSQIVKKLTADTDGLKNEVNKLTAGGATRADYGFQHAFTVMSEARTEA--------- 727 Query: 305 KKFVIFITDGENSGASAYQNTL--NTLQICEYMRNAGMKIYSVAVSAPPE-------GQD 355 KK VIF TDG+ + + + + ++ + ++++G +YS+ V Sbjct: 728 KKVVIFFTDGKPTSEKTFDGKVANDAVEYAKQLKDSGAIVYSIGVFDGANPASTATSENK 787 Query: 356 LLRKCTDS---------------SGQFFAVNDSRELLESFDKITDK 386 + + + +G + D+ L F++I Sbjct: 788 FMHAVSSNYPNAANYEDLSEGSNAGYYKTATDASGLNSIFEEIRKS 833 >gi|313681552|ref|YP_004059290.1| von willebrand factor type a [Sulfuricurvum kujiense DSM 16994] gi|313154412|gb|ADR33090.1| von Willebrand factor type A [Sulfuricurvum kujiense DSM 16994] Length = 311 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 52/137 (37%), Gaps = 20/137 (14%) Query: 252 CTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 TP++ V + L+ +NT + R L S K + Sbjct: 148 ATPVTYEKEIVSEMIGYLSHGMAGQNTAIGEGIAMGVRAL----------RDSKAKSKVI 197 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS-APPEGQDLLRKCTDSS-GQ 366 I +TDGE++ S + + +++Y++ + LL++ G+ Sbjct: 198 ILLTDGEHNSGSIS-----PKEAVAMVGKEHIRLYTIGIGQKGEFDNALLKQLAHDGHGK 252 Query: 367 FFAVNDSRELLESFDKI 383 FFA + +EL +D+I Sbjct: 253 FFAAANEKELQSVYDEI 269 >gi|11125762|gb|AAC51260.2| matrilin-2 precursor [Homo sapiens] Length = 956 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 667 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVHTEFTLRNFNSAKDMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPF--STRVPRAAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 829 Query: 383 ITDKIQE 389 + I E Sbjct: 830 LKKGICE 836 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|260800527|ref|XP_002595180.1| hypothetical protein BRAFLDRAFT_240914 [Branchiostoma floridae] gi|229280424|gb|EEN51192.1| hypothetical protein BRAFLDRAFT_240914 [Branchiostoma floridae] Length = 419 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 38/395 (9%), Positives = 94/395 (23%), Gaps = 47/395 (11%) Query: 17 IDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGS 76 I ++ + DAA L + + + + Sbjct: 43 IQYSNQPQSEILLNDHQDAASL------------QQAISSINYLQGGTNTGKALRYLANN 90 Query: 77 YIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST 136 G A +++ I + L+ G+ S Sbjct: 91 AFSGKNGARAGVSKVAIVVTDGRSSDDVVRPALNAGKEGIVLYAVGIGGSVDYQELRDIA 150 Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP----------PPK 186 ++ S + + + N ++ P Sbjct: 151 SSDDKVVNVTDFSGLQSVGNTLPDQVCQSLCPNSDHRSDLQCRAAQPPLSGIFNLQCSRW 210 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ----------EKKNLSV 236 S + + A I +L++ + + + +A + + + Sbjct: 211 SYTSSNVSKHAHCCYAGCFTPLDIAILLDGSDGVSSDDFEAEKSFAKLFLNEFDIGQDNS 270 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNEKESS 295 R+ Y +V+ + N A+ + ++ + + Sbjct: 271 RVTVFQYGTEPRQEFALDTYETDQDVQDAIADTEYMGGDRNLGQAIRYMATYGFSGRNGA 330 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 + + I IT GE + R +G+ +Y++ V Sbjct: 331 RRS-----IPSVAIIITGGE--------SLDEVASAASKARRSGIILYAIGVG-NATVPA 376 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L ++ +A L + + D+I Sbjct: 377 ELAAIATTANTSYAAASFAALKDLRGALADEICTS 411 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 65/198 (32%), Gaps = 20/198 (10%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + + V + +IG I Y+ + Sbjct: 9 SGSIGTDNFERIKTFVSKAVTRFNIGPTQ-----TQIGVIQYSNQPQSEILLNDHQDAAS 63 Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++ ++ +N TNT A+ + ++ K + + K I +TDG Sbjct: 64 LQQAISSINYLQGGTNTGKALRYLANNAFSGKNGARA-----GVSKVAIVVTDG------ 112 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 +++ + ++ G+ +Y+V + LR S + V D L Sbjct: 113 --RSSDDVVRPALNAGKEGIVLYAVGIG-GSVDYQELRDIASSDDKVVNVTDFSGLQSVG 169 Query: 381 DKITDKIQEQSVRIAPNR 398 + + D++ + + +R Sbjct: 170 NTLPDQVCQSLCPNSDHR 187 >gi|312794604|ref|YP_004027527.1| von willebrand factor type a [Caldicellulosiruptor kristjanssonii 177R1B] gi|312181744|gb|ADQ41914.1| von Willebrand factor type A [Caldicellulosiruptor kristjanssonii 177R1B] Length = 900 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 65/188 (34%), Gaps = 25/188 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 K+++ ++ +V ++ + AY G + +V Sbjct: 425 GIPKLEIAKSASAKMVEHLESSDGVGVIAFDHNYYWAYKFGKLVR--------KEDVIES 476 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++ + T P + A + L S K V+ +TDG + Sbjct: 477 ISSIEVGGGTAIIPPLSEAVKTL----------KKSKAKNKLVVLLTDGMGEQSGY---- 522 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + + +KI ++ V L +SG+F+ V++ EL++ F K T Sbjct: 523 ---EIPADEAKRNNIKITTIGVGKFVNASVLSWIAAYTSGRFYLVSNPSELVDVFLKETK 579 Query: 386 KIQEQSVR 393 I+ + ++ Sbjct: 580 IIKGKYIK 587 >gi|110639040|ref|YP_679249.1| BatA-like protein [Cytophaga hutchinsonii ATCC 33406] gi|110281721|gb|ABG59907.1| BatA-like protein, aerotolerance-related protein [Cytophaga hutchinsonii ATCC 33406] Length = 351 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 66/198 (33%), Gaps = 37/198 (18%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL----NPYEN 274 + ++ ++ + N +PL+N+ +K++LN L + Sbjct: 132 SRFDAAKQICTDIINKRSNDRIGIVIFSGEAVTLSPLTNDYVLLKNQLNDLKQNKDLQSG 191 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T A+ A L + ++ ++ I+DGEN+ + + + Sbjct: 192 TAIGTALGTAINRL----------KNAETKERIIVLISDGENTSGLM-----DPITAADL 236 Query: 335 MRNAGMKIYSVAVSAPPE-----------------GQDLLRKC-TDSSGQFFAVNDSREL 376 +KIY + + ++ L+ + G+F+ D + L Sbjct: 237 CLEYNIKIYCIGLGKDGTHQFKDDNGTIQYVESKLDENTLKNISATTKGKFYRAYDKKSL 296 Query: 377 LESFDKITDKIQEQSVRI 394 + I + + V++ Sbjct: 297 DDVIANIDQLEKGKIVQL 314 >gi|288928458|ref|ZP_06422305.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] gi|288331292|gb|EFC69876.1| BatA protein [Prevotella sp. oral taxon 317 str. F0108] Length = 332 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 59/192 (30%), Gaps = 37/192 (19%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YEN 274 +A ++ A+ + T L N L V++ + + Sbjct: 112 AAKDVAAEFIADRPNDNIGLTIFAGEAFTQCPMTTDHTSLLNMLQTVRTDIAAKGLIQDG 171 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T + +A L S K VI +TDG N+ L+ + Sbjct: 172 TAIGMGLANAVSRL----------KDSKAKSKVVILLTDGSNNMG-----DLSPMTSANI 216 Query: 335 MRNAGMKIYSVAVSAP--------------------PEGQDLLRKCT-DSSGQFFAVNDS 373 ++ G+++Y++ V +L+ + G F+ ++ Sbjct: 217 AKSLGIRVYTIGVGTNKVARYPMPVAGGVQYVNMPVEIDTKVLKDIAATTDGNFYRATNN 276 Query: 374 RELLESFDKITD 385 +EL + + I Sbjct: 277 QELKQIYKDIDK 288 >gi|315649635|ref|ZP_07902720.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275108|gb|EFU38483.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 595 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 54/142 (38%), Gaps = 9/142 (6%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + +K +++L+ T+ + A + L E +H + D + + Sbjct: 100 DKTALKEFIDQLDRGPYTDISVGLDEAVKVLKQGMEPAHAPMIVVLADG----NNDLDPN 155 Query: 318 GASAYQNTLNTLQIC-EYMRNAGMKIYSVAVSAPP--EGQDLLRKCTDSSGQFFAVNDSR 374 Q + L + + +G+ IY++ ++A + L + G+ F + + Sbjct: 156 TGKTSQEASDHLNQAVQEAKGSGIPIYTIGLNADGKLNKEALAELANQTGGKSFTTSSAD 215 Query: 375 ELLESFDKITDKIQEQSVRIAP 396 +L + +I Q ++I P Sbjct: 216 DLPQILSEIF--ASHQQLKIVP 235 >gi|313837214|gb|EFS74928.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314927768|gb|EFS91599.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] gi|314971985|gb|EFT16083.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] Length = 320 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 23/230 (10%), Positives = 60/230 (26%), Gaps = 32/230 (13%) Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 K P ++ S+ A ++ +A + + + R Sbjct: 79 KAYHNVPRDRATVVVAIDVSRSMVATDVDPSRLSAAKTAAKDFLGDLP----------PR 128 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 P + + V + + L +T ++ + L + Sbjct: 129 FNVSLVKFAASSQVVVPPTTDRAVVSTAIANLQVLPSTAIGEGIYSSLNALKLVPDD--P 186 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----- 352 + ++ ++DG + L+ + + +Y++A Sbjct: 187 KHPGQKPPAAIVLLSDGATNVGRPS------LEAAKEAGRQHVPVYTIAYGTAGGYVVEG 240 Query: 353 --------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L S G+ F+ +L + + I + + V Sbjct: 241 GQRQPVPVNHYELAAVAKASGGEKFSAESLGQLSDVYKSIAQSVGYEKVF 290 >gi|290769676|gb|ADD61455.1| putative protein [uncultured organism] Length = 816 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 58/391 (14%), Positives = 115/391 (29%), Gaps = 74/391 (18%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A+++ A + + D ++ + Q S + + + T Sbjct: 24 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPDDPTL 83 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + + N++ + N + I +VLD Sbjct: 84 SA-PAREKTVTANEDGTYTV------------ALNVTGAKSAGTGEIVTNQPLDIVLVLD 130 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 VS SM + N + S K + + T +I ++ Sbjct: 131 VSGSMAEKIASGWNQPTKIDSLKTAVN------KFINATAAENAKITDQSQRNRIALVKF 184 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + N R G +YN + L+ +++ + S +N L+ T Sbjct: 185 A--------GTEKTSVGNDFYREGWSSYN---YTQIVSNLTYDVSGLTSTVNGLSASGAT 233 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI--CE 333 + A + A L + + KK VIF TDGE + S + T+ + + Sbjct: 234 SADYAFNRAQAALTYQPRA--------NAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAK 285 Query: 334 YMRNAGMKIYSVAV-------SAPPEGQDLLRKC-------------------------- 360 +++AG IYS+ V + Sbjct: 286 SLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGD 345 Query: 361 -TDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++S + A D+ +L F+ I +I + Sbjct: 346 RAETSSYYKAATDAGQLNNIFESIYQEITKT 376 >gi|225387166|ref|ZP_03756930.1| hypothetical protein CLOSTASPAR_00918 [Clostridium asparagiforme DSM 15981] gi|225046714|gb|EEG56960.1| hypothetical protein CLOSTASPAR_00918 [Clostridium asparagiforme DSM 15981] Length = 556 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 40/371 (10%), Positives = 101/371 (27%), Gaps = 26/371 (7%) Query: 32 ALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQI 91 AL A +LSGC + K + + + + + + E G Sbjct: 15 ALLAGLLSGCGAGGGKTASATEAEVKAEAGSYASETMAAQSQWDGAVMEAEGPPLSHNTE 74 Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 N +A + + + + + + + Sbjct: 75 EYNYIAENAFLAVANAPLSTFAADVDTASYANLRRKILEGNEVPADAVRIEEMLNYFTYD 134 Query: 152 M---VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF-WSKNTTKSKYAPAPAPAN 207 D S+ + N+ L P N + Sbjct: 135 YPEPTEDEPFSVTTYIGDCPWNENHKLLQIGLQAEKPDLENQKPSNLVFLIDVSGSMESA 194 Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ ++ + L +++ + + Y + + + Sbjct: 195 DKLGLVKRAFLLLTENLRPEDT--------VSIVTYASSDTVVLDGVSGEEKAAIMTAIE 246 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L +T+ + AYR + N VI TDG+ + + + Sbjct: 247 NLTAGGSTDGSKGIETAYRLAEEHFQKDGNNR--------VILATDGDLNLGL--TSEGD 296 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDK 386 ++ + + +G+ + + + + D+ GQ+ V+ L+E+ + ++ Sbjct: 297 LTRLIQKKKESGVFLSVMGFGTGNIKDNKMEALADNGNGQYAYVDS---LMEAKRVLVEE 353 Query: 387 IQEQSVRIAPN 397 + +A + Sbjct: 354 LGGTLFTVAKD 364 >gi|119946440|ref|YP_944120.1| von Willebrand factor, type A [Psychromonas ingrahamii 37] gi|119865044|gb|ABM04521.1| von Willebrand factor, type A [Psychromonas ingrahamii 37] Length = 327 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 20/154 (12%), Positives = 53/154 (34%), Gaps = 36/154 (23%) Query: 253 TPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TPL+ +L ++ +++ T ++ A + + ++ +I Sbjct: 145 TPLTFDLKTIQQMVDESEIGLAGTRTAIGESIAMAIKRFVE----------NKNEQRVLI 194 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PE 352 ++DG N+ S + +Q + + IY++ + A Sbjct: 195 LVSDGANNSGS-----IEPIQAAKQAAKNNITIYTIGMGAEQMIKRGLFGNQRINPSADL 249 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L + + + G++F + EL + + Sbjct: 250 DEKTLTEIANLTGGKYFRARNQTELQNIYQTLNK 283 >gi|301165481|emb|CBW25052.1| putative membrane protein (von Willebrand factor type A) [Bacteriovorax marinus SJ] Length = 329 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 65/206 (31%), Gaps = 54/206 (26%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +++V + + V S R PLS +L +K + Sbjct: 105 NRLEVAKDKISDFVALRPTDRIGLIMFSERAF-----------TLLPLSTDLKLIKQMVG 153 Query: 268 KLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++N TN A+ A S K +I +TDG ++ Sbjct: 154 EINVGGMLGSGTNIGDALGLAVA----------RGAQSLAKNKVIILLTDGVSNVG---- 199 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ-----------------------DLLRKC 360 L +Q E + G+K+Y++ + + + L++ Sbjct: 200 -FLTPIQAAEEAKKQGIKVYTIGIGGRGDAKIPYGKNIFGRQRYQNIPGGSIDFKTLKEI 258 Query: 361 TD-SSGQFFAVNDSRELLESFDKITD 385 D ++GQ F D + L E +I Sbjct: 259 ADKTNGQTFEAQDEKALAEVLSEIEK 284 >gi|91205150|ref|YP_537505.1| hypothetical protein RBE_0335 [Rickettsia bellii RML369-C] gi|91068694|gb|ABE04416.1| unknown [Rickettsia bellii RML369-C] Length = 516 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 42/354 (11%), Positives = 98/354 (27%), Gaps = 27/354 (7%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN-ITKDKNNP 100 +VS + K K ++ +K + + + + Sbjct: 121 KLLVSKGKLSFKNPIKKDIKEATLKLLQATIKDEKSLSKILVNEEDITPFQKAIYHPTDF 180 Query: 101 LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLA--ISICMVLDVSR 158 Q I + + E + G I ++ + I S I +L Sbjct: 181 SQLITQISSNEENSLNFIMNNGAIAQSVQVYTADGKAPIIASDLKDGFIIDKQYLLKYLL 240 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + + ++ + P + S + V + Sbjct: 241 PIFNGFIWNEEGKFPIMFAP--KNPKVLDGENNYAHNISLLIDISGSMEKDFSVYKNNIL 298 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 +++ + + + +I + +N + NN+ ++K +N L T Y Sbjct: 299 KILDKLAEIP------NWQINIVVFNDESTARSFSNQENNIEDIKVYINNLKANGYTKLY 352 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY-MRN 337 + A G +I TDG++ G ++ + + + ++N Sbjct: 353 GTIKEALESF----------KGKIDESSTLIVFTDGKDEGTNSNVTEKDVVDVTSEVIKN 402 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF----AVNDSRELLESFDKITDKI 387 +Y+V Q+ + G +L + D I K+ Sbjct: 403 PQFNMYTVGFGQ-YYNQEFFEQVATRGGFTHVSLNDPTGMHQLQQYIDNIEQKV 455 >gi|269315863|ref|NP_001161395.1| collagen alpha-5(VI) chain precursor [Mus musculus] Length = 2640 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 39/368 (10%), Positives = 98/368 (26%), Gaps = 30/368 (8%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKH---LKQGSYIRENAGDI 85 + S D A ++++ +K T K+G + Sbjct: 308 LMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAIEQMRKEGFSESSGSRKA 367 Query: 86 AQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS--LRSTGIIERSS 143 QI + + E+ + +F G+ + T L + S+ Sbjct: 368 QGVPQIAVLVTHRASDDMVREAALDLRLEGVTMFAMGIEGANNTQLEDIVSYPSRQSIST 427 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 + + + + + Sbjct: 428 HSSYSHLESYSGNFLKKIRNEIWTQVST-RAEQMELDKTGCVDTKEADIYFLIDGSSSIR 486 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 +I + + S ++ VR+G + Y+ + ++K Sbjct: 487 KKEFEQIQIFMSSVIDMFP--------IGPNKVRVGVVQYSHKNEVEFPVSRYTDGIDLK 538 Query: 264 SRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + T T A+ + K R ++I +TDG+++ + Sbjct: 539 KAVFNIKQLKGLTFTGKALDFILPLIKKGKTERT-----DRAPCYLIVLTDGKSNDS--- 590 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 L+ +R + I+++ + + LR+ + + L ++ Sbjct: 591 -----VLEPANRLRAEQITIHAIGIG--EANKTQLRQIAGKDERVNFGQNFDSLKSIKNE 643 Query: 383 ITDKIQEQ 390 I +I + Sbjct: 644 IVHRICSE 651 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 29/161 (18%), Positives = 65/161 (40%), Gaps = 13/161 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYREL 288 + V+IG + Y+ ++ + + L + + T T A+ H+ L Sbjct: 877 DVGRDRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NVL 935 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E H + + +++ +I ITDG S ++ L+ + +R+ G+ I++V V Sbjct: 936 FTE---EHGSRLTQNVRQLMIVITDGV----SHDRDKLD--EAARELRDKGITIFAVGVG 986 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 QD L V++ +L + + + + + Sbjct: 987 --NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 1025 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 61/193 (31%), Gaps = 22/193 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + NLV IQ ++G + ++ + Sbjct: 665 VDSSGSIGPTNFETMKTFMKNLVGKIQ-----IGADRSQVGVVQFSDYNREEFQLNKYST 719 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 E+ + +++++P NT T A+ K KF+I +TDG+ Sbjct: 720 HEEIYAAIDRMSPINRNTLTGGALTFVNEYFDLSKGGRPQVR------KFLILLTDGKAQ 773 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 +R+ + I+SV V + L + + F V + L Sbjct: 774 DEVGGP--------ATALRSKSVTIFSVGV--YGANRAQLEEISGDGSLVFHVENFDHLK 823 Query: 378 ESFDKITDKIQEQ 390 K+ ++ Sbjct: 824 AIESKLIFRVCAL 836 Score = 41.4 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 44/136 (32%), Gaps = 15/136 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNE 291 R+ Y+ + N N + + L N + A+ A+R ++ Sbjct: 65 NKYRVALAQYSDALHNEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSA 124 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + V+ + ++ + + + +R G+KI SV V Sbjct: 125 PTNGRDKK--QFPPILVVLAS---------AESEDDVEEAAKALREDGVKIISVGVQK-- 171 Query: 352 EGQDLLRKCTDSSGQF 367 ++ L+ S F Sbjct: 172 ASEENLKAMATSQFHF 187 >gi|189082901|sp|A6H584|CO6A5_MOUSE RecName: Full=Collagen alpha-5(VI) chain; AltName: Full=Collagen alpha-1(XXIX) chain; Flags: Precursor Length = 2640 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 39/368 (10%), Positives = 98/368 (26%), Gaps = 30/368 (8%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKH---LKQGSYIRENAGDI 85 + S D A ++++ +K T K+G + Sbjct: 308 LMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAIEQMRKEGFSESSGSRKA 367 Query: 86 AQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS--LRSTGIIERSS 143 QI + + E+ + +F G+ + T L + S+ Sbjct: 368 QGVPQIAVLVTHRASDDMVREAALDLRLEGVTMFAMGIEGANNTQLEDIVSYPSRQSIST 427 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 + + + + + Sbjct: 428 HSSYSHLESYSGNFLKKIRNEIWTQVST-RAEQMELDKTGCVDTKEADIYFLIDGSSSIR 486 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 +I + + S ++ VR+G + Y+ + ++K Sbjct: 487 KKEFEQIQIFMSSVIDMFP--------IGPNKVRVGVVQYSHKNEVEFPVSRYTDGIDLK 538 Query: 264 SRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + T T A+ + K R ++I +TDG+++ + Sbjct: 539 KAVFNIKQLKGLTFTGKALDFILPLIKKGKTERT-----DRAPCYLIVLTDGKSNDS--- 590 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 L+ +R + I+++ + + LR+ + + L ++ Sbjct: 591 -----VLEPANRLRAEQITIHAIGIG--EANKTQLRQIAGKDERVNFGQNFDSLKSIKNE 643 Query: 383 ITDKIQEQ 390 I +I + Sbjct: 644 IVHRICSE 651 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 29/161 (18%), Positives = 65/161 (40%), Gaps = 13/161 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYREL 288 + V+IG + Y+ ++ + + L + + T T A+ H+ L Sbjct: 877 DVGRDRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NVL 935 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E H + + +++ +I ITDG S ++ L+ + +R+ G+ I++V V Sbjct: 936 FTE---EHGSRLTQNVRQLMIVITDGV----SHDRDKLD--EAARELRDKGITIFAVGVG 986 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 QD L V++ +L + + + + + Sbjct: 987 --NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 1025 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 61/193 (31%), Gaps = 22/193 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + NLV IQ ++G + ++ + Sbjct: 665 VDSSGSIGPTNFETMKTFMKNLVGKIQ-----IGADRSQVGVVQFSDYNREEFQLNKYST 719 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 E+ + +++++P NT T A+ K KF+I +TDG+ Sbjct: 720 HEEIYAAIDRMSPINRNTLTGGALTFVNEYFDLSKGGRPQVR------KFLILLTDGKAQ 773 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 +R+ + I+SV V + L + + F V + L Sbjct: 774 DEVGGP--------ATALRSKSVTIFSVGV--YGANRAQLEEISGDGSLVFHVENFDHLK 823 Query: 378 ESFDKITDKIQEQ 390 K+ ++ Sbjct: 824 AIESKLIFRVCAL 836 Score = 41.4 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 44/136 (32%), Gaps = 15/136 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNE 291 R+ Y+ + N N + + L N + A+ A+R ++ Sbjct: 65 NKYRVALAQYSDALHNEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSA 124 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + V+ + ++ + + + +R G+KI SV V Sbjct: 125 PTNGRDKK--QFPPILVVLAS---------AESEDDVEEAAKALREDGVKIISVGVQK-- 171 Query: 352 EGQDLLRKCTDSSGQF 367 ++ L+ S F Sbjct: 172 ASEENLKAMATSQFHF 187 >gi|1705570|sp|P51942|MATN1_MOUSE RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|1163179|gb|AAB06521.1| cartilage matrix protein precursor [Mus musculus] Length = 500 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 67/183 (36%), Gaps = 23/183 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+++ + + ++G + Y+ I ++ ++K+R+ Sbjct: 294 NFELVKKFINQIVDTL-----DVSDRLAQVGLVQYSSSIRQEFPLGRFHSKKDIKARVRN 348 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ + T T A+ + + + +K I TDG + Sbjct: 349 MSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFTDGRSQDYIND----- 398 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 ++ G K+++V V ++ LR+ +F D + + + K+ Sbjct: 399 ---AARKAKDLGFKMFAVGVG--NAVEEELREIASEPVADHYFYTADFKTINQIGKKLQK 453 Query: 386 KIQ 388 +I Sbjct: 454 QIC 456 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 76 DVGPNATRVGLVNYASTVKPEFPLRAHGSKASLLQAVRRIQPLSTGTMTGLALQFAITKA 135 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E S + K VI +TDG + + E R +G++++++ + Sbjct: 136 LSDAEGGRAR--SPDISKVVIVVTDGRPQDS--------VRDVSERARASGIELFAIGLG 185 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L + F + Sbjct: 186 R--VDKATLRQIASEPQDEHVDYVESYNVIEKLAKKFQE 222 >gi|304411849|ref|ZP_07393460.1| von Willebrand factor type A [Shewanella baltica OS183] gi|307303383|ref|ZP_07583138.1| von Willebrand factor type A [Shewanella baltica BA175] gi|304349709|gb|EFM14116.1| von Willebrand factor type A [Shewanella baltica OS183] gi|306913743|gb|EFN44165.1| von Willebrand factor type A [Shewanella baltica BA175] Length = 339 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 64/204 (31%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + ++ + + + RIG I + PL+ + V Sbjct: 104 NGKTVDRFTLIQHVVSDFIERRK---------GDRIGLILFADHAYLQA--PLTQDRRSV 152 Query: 263 KSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 L ++ T A+ A + + +I +TDG N+ Sbjct: 153 AQFLKEAQIGLVGKQTAIGEAIALAVKRFDKM----------DESNRVLILLTDGSNNAG 202 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD 362 + + Q + N + IY+V V A ++ L+ + Sbjct: 203 N-----IEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAE 257 Query: 363 -SSGQFFAVNDSRELLESFDKITD 385 + G++F +S+EL + + +I Sbjct: 258 VTHGRYFRARNSQELDQIYQEIDK 281 >gi|73980586|ref|XP_540098.2| PREDICTED: similar to matrilin 3 precursor [Canis familiaris] Length = 481 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 58/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + ++ +K + ++ P Sbjct: 101 VKTFVSQIIDTL-----DIGAADTRVAVVNYASTVKIEFHLQTYSDKQSLKQAVARITPL 155 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + E+ + + K I +TDG ++ Sbjct: 156 STGTMSGLAIQTAMDEAFTEEAGARGPTSNI--PKVAIIVTDGRPQD--------QVNEV 205 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G+++Y+V V + L+ F V +L F + Sbjct: 206 AARARASGIELYAVGV--DRADMESLKIIASEPLDEHVFYVETYGVIEKLSSRFQE 259 >gi|194216197|ref|XP_001914777.1| PREDICTED: collagen, type XII, alpha 1 [Equus caballus] Length = 3120 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 59/166 (35%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +E+ + + K+ NT T A+ + Sbjct: 168 SAFDIGEEKTRVGVVQYSSDPRTEFNLNQYYQRDELLAAIKKIPYKGGNTMTGDAIDYLI 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +R+ G++++S+ Sbjct: 228 KNTFMESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRSIGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + L++ + F V + +++ ++I ++ Sbjct: 275 GI--KAADAKELKQIASTPSLNHVFNVANFDAIVDIQNEIISQVCS 318 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 52/160 (32%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ + Sbjct: 1235 RVQIALAQYSGDPRTEWQLNAHKDKQSLLEAVANLPYKGGNTLTGMALNFIRQQSFKT-- 1292 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G+++++V + Sbjct: 1293 ---QAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAVGI--KNAD 1339 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 1340 EVELKMIATDPDDTHAYNVADFESLSRIVDDLTINLCNSV 1379 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 473 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 531 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 532 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 577 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 578 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 620 >gi|147899676|ref|NP_001087858.1| collagen alpha-1(XXI) chain precursor [Xenopus laevis] gi|82234134|sp|Q641F3|COLA1_XENLA RecName: Full=Collagen alpha-1(XXI) chain; Flags: Precursor gi|51950065|gb|AAH82384.1| MGC81791 protein [Xenopus laevis] Length = 957 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 30/176 (17%), Positives = 61/176 (34%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + ++G + Y+ + +++++ R + NT T A+ A Sbjct: 64 TSNFNIGPKFTQVGVVQYSDYPILEIPLGSYESIDDLSRRTQSIQYLGGNTQTGNAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ L K I +TDG++ + I E R + +++ Sbjct: 124 IDNLF--------ARSLRPLTKIAIVLTDGKSQD--------DVKHIAEEARKNKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + + LR + F V D + + + K+ E+SV P R Sbjct: 168 IGVGSEI-EESELRAIANKPSSTYVFYVEDYIAISRIREIMKQKLCEESV--CPTR 220 >gi|332830871|ref|XP_003311907.1| PREDICTED: matrilin-2 isoform 2 [Pan troglodytes] Length = 915 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 626 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVRTEFTLRNFNSAKDMKKA 680 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 681 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 737 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 738 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 788 Query: 383 ITDKIQE 389 + I E Sbjct: 789 LKKGICE 795 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|332830869|ref|XP_528309.3| PREDICTED: matrilin-2 isoform 3 [Pan troglodytes] Length = 937 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 667 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVRTEFTLRNFNSAKDMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 829 Query: 383 ITDKIQE 389 + I E Sbjct: 830 LKKGICE 836 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|332830867|ref|XP_003311906.1| PREDICTED: matrilin-2 isoform 1 [Pan troglodytes] Length = 956 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 667 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVRTEFTLRNFNSAKDMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 829 Query: 383 ITDKIQE 389 + I E Sbjct: 830 LKKGICE 836 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|332238409|ref|XP_003268390.1| PREDICTED: matrilin-2 isoform 3 [Nomascus leucogenys] Length = 915 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 626 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVRTEFTLRNFNSAKDMKKA 680 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 681 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 737 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 738 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 788 Query: 383 ITDKIQE 389 + I E Sbjct: 789 LKKGICE 795 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKAIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|332238407|ref|XP_003268389.1| PREDICTED: matrilin-2 isoform 2 [Nomascus leucogenys] Length = 937 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 667 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVRTEFTLRNFNSAKDMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 829 Query: 383 ITDKIQE 389 + I E Sbjct: 830 LKKGICE 836 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKAIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|332238405|ref|XP_003268388.1| PREDICTED: matrilin-2 isoform 1 [Nomascus leucogenys] Length = 956 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 667 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVRTEFTLRNFNSAKDMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 829 Query: 383 ITDKIQE 389 + I E Sbjct: 830 LKKGICE 836 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKAIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|284990593|ref|YP_003409147.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] gi|284063838|gb|ADB74776.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] Length = 318 Score = 74.9 bits (182), Expect = 2e-11, Method: Composition-based stats. Identities = 22/148 (14%), Positives = 53/148 (35%), Gaps = 21/148 (14%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + +V + ++ L E+T A+ + + +SS + G ++ ++DG N+ Sbjct: 146 DRGQVSTAIDNLELAESTAIGEAVFTSLTAI-ENFQSSLDADGEEVPPARIVLLSDGYNT 204 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD-S 363 Q +AG+ + ++A P + L + D + Sbjct: 205 VGRPDT------QAVSAALDAGIPVSTIAFGTDYGTLDLDGERVPVPVDRATLEEIADQT 258 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQS 391 G + + EL + + + +I + Sbjct: 259 GGSYSEAASAAELEQVYQDLGSQIGYTT 286 >gi|298491708|ref|YP_003721885.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233626|gb|ADI64762.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 418 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 65/180 (36%), Gaps = 23/180 (12%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ++ + ++ LV+ + R+ + ++ + ++ N++K ++N+L Sbjct: 59 LETVKKAVSLLVDQLSSED--------RLSIVVFDHRAKILVPNQIISDRNQIKQQINRL 110 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T + EL G +TDGE + + + L Sbjct: 111 TADGGTAIDEGLRLGIEEL---------AKGKKDTISQAFLLTDGE----NEHGDNNRCL 157 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQ 388 + + + + + ++ QD+L K D+ G + + ++ F ++ ++Q Sbjct: 158 KFAQLAASYNLTLNTLGFGDNWN-QDILEKIADAGLGNLSHIEHPNQAVDKFSRLFSRMQ 216 >gi|180654|gb|AAA63904.1| cartilage matrix protein [Homo sapiens] Length = 340 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 67/183 (36%), Gaps = 23/183 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++++ + +V+++ + + ++G + Y+ + + ++K+ + Sbjct: 134 NLELVKKFISQIVDTL-----DVSDKLAQVGLVQYSSSVRQEFPLGRFHTKKDIKAAVRN 188 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ + T T A+ + + + +K I TDG + Sbjct: 189 MSYMEKGTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFTDGRSQDYIND----- 238 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 + ++ G K+++V V +D LR+ +F D + + + K+ Sbjct: 239 ---AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTINQIGKKLQK 293 Query: 386 KIQ 388 KI Sbjct: 294 KIC 296 >gi|113971308|ref|YP_735101.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113885992|gb|ABI40044.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 624 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 38/368 (10%), Positives = 90/368 (24%), Gaps = 34/368 (9%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIREN------AGDIAQKA 89 A ++ S + + + G + Q Sbjct: 79 AAIAIHEQATSTKLRTMNAEHRAYIAQPAATISAAPALNGDWPGAVPPERNRFEKQVQNG 138 Query: 90 QINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAIS 149 + + + + Y L L + Sbjct: 139 IMVAGETPVSTFAIDVD-TGSYTTLRRMLKEGRLPQKDTLRVEEMLNYFSYDYPLPSKNE 197 Query: 150 ICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK 209 + + E + D + + S + K Sbjct: 198 APFSV----TTELAPSPYNYDMMLLRIGLKGYEQSKAELGASNLVFL-LDVSGSMASPDK 252 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + +L + L + ++ + Y N+ + L +L Sbjct: 253 LPLLQTALKMLTQQLGAQD--------KVSIVVYAGAAGVVLDGAAGNDSQTLNYALEQL 304 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + +TN + AY+ VIF TDG+ + + + L L Sbjct: 305 SAGGSTNGAQGIQLAYQLAKKHLVEGGINR--------VIFATDGDFNVGTTNLDELIDL 356 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQ 388 + + G+ + ++ L+ + D GQ+ ++ L E+ + +++ Sbjct: 357 VSAQ--KQLGIGLTTLGFGMGDYNDHLMEQLADKGNGQYAYIDS---LNEARKVLVEQLS 411 Query: 389 EQSVRIAP 396 + IA Sbjct: 412 ATLLTIAK 419 >gi|306824220|ref|ZP_07457590.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|309801684|ref|ZP_07695804.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] gi|304552423|gb|EFM40340.1| conserved hypothetical protein [Bifidobacterium dentium ATCC 27679] gi|308221626|gb|EFO77918.1| von Willebrand factor type A domain protein [Bifidobacterium dentium JCVIHMP022] Length = 967 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 51/382 (13%), Positives = 111/382 (29%), Gaps = 54/382 (14%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 S + ++ ++ + + Sbjct: 136 ATTSATPTQKPTGTENPTTVERSVQSDDDDADTVANQNEAKDDETKDNADKTVHLGIASY 195 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + S + L + +++ I I +VLDVS S Sbjct: 196 RGMLKSASAGLSTPEHTKSIEYQGNGAYTLKLDVTGKDASTSTTDTTPIDIALVLDVSGS 255 Query: 160 MEDLY--LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 M D + + + + + K+ + +K A +I + Sbjct: 256 MNDDFGGRGSPSKISALKTAVNSFLDETAKTNDTIEDDNNKVKVALVKYANQIGTATGAD 315 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 G +++ +++ G L+ + +K+ +N L T Sbjct: 316 GCRISNSRQSD----------------TGNCTQIVQELTTDAGLLKTSVNGLQAAGATYA 359 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI--CEYM 335 AM A + L + + KK+VIF TDGE + S + + + I + + Sbjct: 360 DAAMEVAQQALAGGRAGA---------KKYVIFFTDGEPNHWSGFDDDVANAAIKKSQEL 410 Query: 336 RNAGMKIYSVAVSAPPEGQ----------DLLRKCTD---------------SSGQFFAV 370 +NAG +YS+ + + + S +++ Sbjct: 411 KNAGTTVYSIGIFDGANPSASVSSASNANKFMHGISSNYPNATGYRSLGDRASGDYYYSA 470 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + +L + F+ I I E+ V Sbjct: 471 SSATQLAQIFNDIQKTITEKHV 492 >gi|332299342|ref|YP_004441263.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] gi|332176405|gb|AEE12095.1| von Willebrand factor type A [Porphyromonas asaccharolytica DSM 20707] Length = 326 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 59/171 (34%), Gaps = 44/171 (25%) Query: 249 GNQCTPLSNNLNEVK-----SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 PL+ + N ++ + + +L + T + A L GS Sbjct: 137 SFTLCPLTVDHNVIQQMLETTEIGQLE--DGTAIGLGLATAINTL----------RGSDN 184 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------- 352 K +I +TDG N+ + E + G++IY+VA Sbjct: 185 KSKVIILLTDGSNNAG-----DITPSMAAELAQQYGIRIYTVAAGTNGVAKFPVQTAFGT 239 Query: 353 ---------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + LR + + G+++ D +L E + +I D +++ + Sbjct: 240 EYVEADVQIDEGTLRHIAEQTGGKYYRATDETKLHEIYKEI-DSLEKSRLT 289 >gi|311746225|ref|ZP_07720010.1| BatA protein [Algoriphagus sp. PR1] gi|126576455|gb|EAZ80733.1| BatA protein [Algoriphagus sp. PR1] Length = 347 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 66/206 (32%), Gaps = 42/206 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY----EN 274 N + + + + N PL+N+ + + ++ + Sbjct: 127 NRLEAAKATAIDFINGRFGDRIGMVVFAGEAYSLAPLTNDYKLLTDLIQDISFNMMEAKG 186 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T A+ A + S K +I ++DGE++ + ++ L + Sbjct: 187 TAIGSAIASATNRM----------KESESASKVLILLSDGESNAGN-----VDPLFAAQL 231 Query: 335 MRNAGMKIYSVAVSAPPE------------------GQDLLRKCTD-SSGQFFAVNDSRE 375 +KIY++AV + LR+ +G+FF +D Sbjct: 232 ASALDIKIYTIAVGKDGMVPYGTDFFGRPQMVESYLDETNLREIAKIGNGEFFRASDGGT 291 Query: 376 LLESFDKITD----KIQEQSVRIAPN 397 L FD+I +I E + + Sbjct: 292 LNNIFDRIDTMEKAEILENRYKETSD 317 >gi|313885991|ref|ZP_07819729.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] gi|312924521|gb|EFR35292.1| von Willebrand factor type A domain protein [Porphyromonas asaccharolytica PR426713P-I] Length = 326 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 59/171 (34%), Gaps = 44/171 (25%) Query: 249 GNQCTPLSNNLNEVK-----SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 PL+ + N ++ + + +L + T + A L GS Sbjct: 137 SFTLCPLTVDHNVIQQMLETTEIGQLE--DGTAIGLGLATAINTL----------RGSDN 184 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------- 352 K +I +TDG N+ + E + G++IY+VA Sbjct: 185 KSKVIILLTDGSNNAG-----DITPSMAAELAQQYGIRIYTVAAGTNGVAKFPVQTAFGT 239 Query: 353 ---------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + LR + + G+++ D +L E + +I D +++ + Sbjct: 240 EYVEADVQIDEGTLRHIAEQTGGKYYRATDETKLHEIYKEI-DSLEKSRLT 289 >gi|149559056|ref|XP_001512734.1| PREDICTED: similar to collagen, type XXI, alpha 1, partial [Ornithorhynchus anatinus] Length = 225 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 58/167 (34%), Gaps = 20/167 (11%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + +++G + Y+ V ++L + + + NT T A+ A Sbjct: 76 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSHDSLENLIQAMESIQYLGGNTRTGKAIQFA 135 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ L K + +TDG++ E R++ + +++ Sbjct: 136 LDHLF--------AKSQRFLTKIAVVLTDGKSQD--------EVKDAAEAARDSKITMFA 179 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 + V + E + LR + F V D + + + I K+ E Sbjct: 180 IGVGSETEDAE-LRAIANKPSYTYVFYVEDYIAISKIREVIKQKLCE 225 >gi|228471029|ref|ZP_04055873.1| BatA protein [Porphyromonas uenonis 60-3] gi|228307249|gb|EEK16272.1| BatA protein [Porphyromonas uenonis 60-3] Length = 326 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 28/169 (16%), Positives = 55/169 (32%), Gaps = 40/169 (23%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 PL+ + + + L+ + T + A L GS Sbjct: 137 SFTLCPLTVDHDVILQMLDATEIGQLEDGTAIGLGLATAINTL----------RGSDNKS 186 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 K +I +TDG N+ + E + G++IY+VA Sbjct: 187 KVIILLTDGSNNAG-----DITPSMAAELAQQYGIRIYTVAAGTNGVAKFPVQTASGIEY 241 Query: 353 -------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + LR + G+++ D +L E + +I D +++ + Sbjct: 242 VEADVQIDEGTLRHIAQQTGGKYYRATDETKLHEIYKEI-DSLEKSRLT 289 >gi|224048789|ref|XP_002188138.1| PREDICTED: matrilin 3 [Taeniopygia guttata] Length = 284 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 54/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+ + Y + + +K ++++ P T T A+ A E+ Sbjct: 64 DVGERTTRVAVMNYASTVKVEFPLRTYFDKASMKEAVSRIEPLSAGTMTGLAIQTAMEEV 123 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E+ + + + + VI +TDG + R AG++IY+V V Sbjct: 124 FTEEMGTRPA--AFNIPRVVIVVTDGRPQD--------QVQDVAASARTAGIEIYTVGVG 173 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 LR F V +L F + Sbjct: 174 R--ADMQALRIMASEPLDEHVFYVETYGVIEKLTSRFRE 210 >gi|3182939|sp|Q91145|COCA1_NOTVI RecName: Full=Collagen alpha-1(XII) chain gi|632648|gb|AAA80217.1| type XII collagen alpha-1 chain [Notophthalmus viridescens] Length = 929 Score = 74.6 bits (181), Expect = 2e-11, Method: Composition-based stats. Identities = 32/327 (9%), Positives = 78/327 (23%), Gaps = 30/327 (9%) Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA 127 L + R ++ +T + + E L + P Sbjct: 514 SGTTLSPFNAPRSIKTSEPTRSTFRVTWEPAPGEVKGYKITFHPEGDDGYLGEMMVGPYD 573 Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 T + + D + + + K Sbjct: 574 STVVLEELRARTSYKVNVFGVFD----DGQSPPLIGHETTTLRDAPRSPIPSSGLDCTTK 629 Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 + + + ++ +V + + V+I Y+ Sbjct: 630 AQADIVL---LVDGSWSIGRPNFKIVRNFISRVVEVF-----DIGSDRVQIAVSQYSGDP 681 Query: 248 VGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 + + L NTNT A+ + +K Sbjct: 682 RTEWQLNTHKTKKSLMDAVANLPYKGGNTNTGSALKFILE-----NNFRPGVGMREKARK 736 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG- 365 I +TDG++ + + G+++Y+V + ++ L++ Sbjct: 737 IAILLTDGKSQDDIVAPSKRYA--------DEGIELYAVGI--KNADENELKEIASDPDE 786 Query: 366 -QFFAVNDSRELLESFDKITDKIQEQS 391 + V D L + +T+ + Sbjct: 787 LYMYNVADFSLLTNIVNDLTENVCNSV 813 >gi|225621320|ref|YP_002722578.1| von Willebrand factor type A (vWA) domain-containing protein [Brachyspira hyodysenteriae WA1] gi|225216140|gb|ACN84874.1| von Willebrand factor type A (vWA) domain containing protein [Brachyspira hyodysenteriae WA1] Length = 289 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 25/187 (13%), Positives = 62/187 (33%), Gaps = 40/187 (21%) Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMH 282 +K + + + + + +P + + ++ + K+ +T+ + Sbjct: 73 KKTMIDFIKKRNFDKISLVSFALRASVLSPATFDYTSLEEEIKKIEIDEEGSTSIGLGIA 132 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 A L + KE + +K +I +TDGEN+ ++ E N +KI Sbjct: 133 TAVDMLRSVKEDN---------EKIIILLTDGENNSG-----EIDPKLASEIASNFNIKI 178 Query: 343 YSVAVSAPP------------------------EGQDLLRKCTDSSGQFFAVNDSRELLE 378 Y++ + + L+ + G++F ++ L Sbjct: 179 YTIGIGDANGSHAWVTYDDPNYGKRRIRADFTLNEESLIDIAATTGGKYFNAKNASALDN 238 Query: 379 SFDKITD 385 ++ I Sbjct: 239 VYNTIDR 245 >gi|108763557|ref|YP_631764.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108467437|gb|ABF92622.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 592 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 30/182 (16%), Positives = 59/182 (32%), Gaps = 19/182 (10%) Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 L K + N + + + Y + + + + L+ L T Sbjct: 264 LAREAIKVAVKNLNENDTVAIVTYAGNTRDVLPPTPATDAKSIHAALDSLTAGGGTAMGS 323 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR--- 336 M AYR S + V+ +TDG+ + ++ + + + Sbjct: 324 GMELAYRHA--------VKKASGSVVSRVVVLTDGDANIGR----NVSANAMLDSIHKYT 371 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 G+ + +V DL+ K D G F V+ RE + F+ ++ IA Sbjct: 372 AEGVTLTTVGFGMGNYRDDLMEKLADKGNGNCFYVDSLREAKKVFE---TQLTGTLEVIA 428 Query: 396 PN 397 + Sbjct: 429 KD 430 >gi|158260465|dbj|BAF82410.1| unnamed protein product [Homo sapiens] Length = 937 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 667 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVHTEFTLRNFNSAKDMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 829 Query: 383 ITDKIQE 389 + I E Sbjct: 830 LKKGICE 836 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDMGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|119612172|gb|EAW91766.1| matrilin 2, isoform CRA_b [Homo sapiens] gi|119612174|gb|EAW91768.1| matrilin 2, isoform CRA_b [Homo sapiens] Length = 922 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 667 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVHTEFTLRNFNSAKDMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 829 Query: 383 ITDKIQE 389 + I E Sbjct: 830 LKKGICE 836 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|119612170|gb|EAW91764.1| matrilin 2, isoform CRA_a [Homo sapiens] gi|119612171|gb|EAW91765.1| matrilin 2, isoform CRA_a [Homo sapiens] Length = 941 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 667 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVHTEFTLRNFNSAKDMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 829 Query: 383 ITDKIQE 389 + I E Sbjct: 830 LKKGICE 836 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|62548862|ref|NP_085072.2| matrilin-2 isoform b precursor [Homo sapiens] Length = 937 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 667 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVHTEFTLRNFNSAKDMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 829 Query: 383 ITDKIQE 389 + I E Sbjct: 830 LKKGICE 836 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|62548860|ref|NP_002371.3| matrilin-2 isoform a precursor [Homo sapiens] Length = 956 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 667 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVHTEFTLRNFNSAKDMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 829 Query: 383 ITDKIQE 389 + I E Sbjct: 830 LKKGICE 836 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|37182908|gb|AAQ89254.1| MATN2 [Homo sapiens] Length = 915 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 626 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVHTEFTLRNFNSAKDMKKA 680 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 681 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 737 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 738 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 788 Query: 383 ITDKIQE 389 + I E Sbjct: 789 LKKGICE 795 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|32425428|gb|AAH16394.1| MATN2 protein [Homo sapiens] Length = 715 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 426 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVHTEFTLRNFNSAKDMKKA 480 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 481 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 537 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 538 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 588 Query: 383 ITDKIQE 389 + I E Sbjct: 589 LKKGICE 595 >gi|14714613|gb|AAH10444.1| Matrilin 2 [Homo sapiens] gi|261858984|dbj|BAI46014.1| matrilin 2 [synthetic construct] Length = 937 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 667 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVHTEFTLRNFNSAKDMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 829 Query: 383 ITDKIQE 389 + I E Sbjct: 830 LKKGICE 836 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|62298084|sp|O00339|MATN2_HUMAN RecName: Full=Matrilin-2; Flags: Precursor Length = 956 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 667 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVHTEFTLRNFNSAKDMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 829 Query: 383 ITDKIQE 389 + I E Sbjct: 830 LKKGICE 836 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|52545626|emb|CAB70853.2| hypothetical protein [Homo sapiens] Length = 672 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 383 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVHTEFTLRNFNSAKDMKKA 437 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 438 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 494 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 495 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 545 Query: 383 ITDKIQE 389 + I E Sbjct: 546 LKKGICE 552 >gi|21739491|emb|CAD38787.1| hypothetical protein [Homo sapiens] Length = 1016 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 727 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVHTEFTLRNFNSAKDMKKA 781 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 782 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 838 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 839 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 889 Query: 383 ITDKIQE 389 + I E Sbjct: 890 LKKGICE 896 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 148 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 207 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 208 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 257 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 258 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 294 >gi|45384196|ref|NP_990403.1| matrilin-3 precursor [Gallus gallus] gi|14548115|sp|O42401|MATN3_CHICK RecName: Full=Matrilin-3; Flags: Precursor gi|2326444|emb|CAA03885.1| matrilin-3 [Gallus gallus] Length = 452 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 54/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+ + Y + + +K ++++ P T T A+ A E+ Sbjct: 85 DVGERTTRVAVMNYASTVKVEFPLRTYFDKASMKEAVSRIQPLSAGTMTGLAIQAAMDEV 144 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E+ + + + K VI +TDG + R AG++IY+V V Sbjct: 145 FTEEMGTRPA--NFNIPKVVIIVTDGRPQD--------QVENVAANARTAGIEIYAVGVG 194 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 LR F V +L F + Sbjct: 195 R--ADMQSLRIMASEPLDEHVFYVETYGVIEKLTSKFRE 231 >gi|14042702|dbj|BAB55358.1| unnamed protein product [Homo sapiens] Length = 537 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 248 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVHTEFTLRNFNSAKDMKKA 302 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 303 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 359 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 360 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 410 Query: 383 ITDKIQE 389 + I E Sbjct: 411 LKKGICE 417 >gi|11360063|pir||T46488 hypothetical protein DKFZp434J065.1 - human (fragment) Length = 741 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 452 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVHTEFTLRNFNSAKDMKKA 506 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 507 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 563 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ ++ F D + E +K Sbjct: 564 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTNKHLFYAEDFSTMDEISEK 614 Query: 383 ITDKIQE 389 + I E Sbjct: 615 LKKGICE 621 >gi|32474857|ref|NP_867851.1| chloride channel [Rhodopirellula baltica SH 1] gi|32445397|emb|CAD75398.1| conserved hypothetical protein-putative chloride channel [Rhodopirellula baltica SH 1] Length = 900 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 32/158 (20%), Positives = 63/158 (39%), Gaps = 19/158 (12%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 IA++ +++ + ++ + TN YPAM AY L Sbjct: 502 VIAFDGDSYTVSELRSTSDRGAISDAISTIEASGGTNMYPAMADAYEALL---------- 551 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 G+T K VI +TDG +S + M + + + +VA+ +DLL + Sbjct: 552 GATAKLKHVILMTDGVSSPGDF-------QGVAGDMSASRITLSTVALGQGS-SEDLLEE 603 Query: 360 CTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 G+++ +D + + + F K T + + ++ P Sbjct: 604 LAQIGGGRYYFCDDPQSVPQVFAKETVEASKSAINELP 641 >gi|218462234|ref|ZP_03502325.1| hypothetical protein RetlK5_23393 [Rhizobium etli Kim 5] Length = 66 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 20/64 (31%), Positives = 31/64 (48%) Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 C+ ++ G++IY++A AP GQ LL C +F +LL +F I K Q Sbjct: 2 CDTAKSKGIEIYTIAFMAPAGGQALLHYCASDDSHYFQAEKMEDLLAAFKAIGAKASSQL 61 Query: 392 VRIA 395 R+ Sbjct: 62 TRLT 65 >gi|88858061|ref|ZP_01132703.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] gi|88819678|gb|EAR29491.1| hypothetical protein PTD2_11764 [Pseudoalteromonas tunicata D2] Length = 328 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 31/158 (19%), Positives = 60/158 (37%), Gaps = 33/158 (20%) Query: 246 GIVGNQCTPLSNNLNEVKSRL--NKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TPL+ +LN V L ++ T A+ A + +++S+ Sbjct: 139 GDTAFLQTPLTRDLNTVSKMLEEAQIGLVGRATAIGDALGLAVKRFSQKQDSN------- 191 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------- 352 + ++ +TDGEN+ + L + R G+K+Y+V V + Sbjct: 192 ---RILVLLTDGENTAGN-----LAPEEALLLAREEGIKVYTVGVGSQGGNRFNLFSMSG 243 Query: 353 ----GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + LL+K + G +F D L + + ++ Sbjct: 244 SSSLDESLLQKIATETGGLYFRATDVASLQQIYQELDK 281 >gi|157962424|ref|YP_001502458.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157847424|gb|ABV87923.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 336 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 58/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKE 293 RIG I + +PL+ + V L ++ T A+ + ++ Sbjct: 129 RIGLILFADHAYLQ--SPLTQDRRSVAQYLKEAQIGLVGKQTAIGEAIALGVKRFDKVEQ 186 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 S+ + +I +TDG N+ + Q + G+ IY++ V A Sbjct: 187 SN----------RVLILLTDGSNNAG-----AITPEQASQIAAQRGITIYTIGVGADVME 231 Query: 351 --------------PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L++ + GQ+F ++ EL + + I Sbjct: 232 RRTLFGKERVNPSMDLDESQLQEIAKVTGGQYFRARNTEELEQIYQVIDT 281 >gi|149376601|ref|ZP_01894361.1| hypothetical protein MDG893_00577 [Marinobacter algicola DG893] gi|149359119|gb|EDM47583.1| hypothetical protein MDG893_00577 [Marinobacter algicola DG893] Length = 340 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 29/178 (16%), Positives = 57/178 (32%), Gaps = 36/178 (20%) Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKL-NPYENTNTYPAMHHAY 285 + G PL+ +L V++ L + T A+ A Sbjct: 124 DDFIQRREGDRLGLLLFGTEPYIQAPLTFDLATVRTLLHEAGIGMAGRATAIGDALGLAV 183 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + L + + ++ VI +TDG N+ + E + AG++IY++ Sbjct: 184 KRLRDRPQE----------QRVVILLTDGANTAGEIAPDK-----AAEIAKAAGVRIYTI 228 Query: 346 AVSAP-----------------PEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + A + LL + + G++F EL ++ I Sbjct: 229 GIGAETMVQRGLLGSRRVNPSRDLDEGLLTRIAQQTGGEYFRARSLPELELIYESINQ 286 >gi|126334040|ref|XP_001370580.1| PREDICTED: similar to Integrin, alpha M (complement component 3 receptor 3 subunit) [Monodelphis domestica] Length = 1156 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 31/180 (17%), Positives = 60/180 (33%), Gaps = 23/180 (12%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPY 272 +++ + + + Y+ + N K+ + + Sbjct: 176 KNFVMTVMDQFKGTDT-------QFSLMQYSDDFKTHFTFNNFKNDPTSKNLVGPIEQLN 228 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+ E + KK +I ITDG+ G ++LN + Sbjct: 229 GKTHTASGIRKVVRELFQEWNGAR-----KDAKKILIVITDGQIQG-----DSLNYRDVI 278 Query: 333 EYMRNAGMKIYSVAVSAP---PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 G+ Y++ V P + LR F VN+ L +++ +KI Sbjct: 279 PEAEKEGVIRYAIGVGYAFNTPSARQELRTIASQPAQEHVFQVNNFDALKNIQNQLQEKI 338 >gi|213961827|ref|ZP_03390093.1| von Willebrand factor, type A [Capnocytophaga sputigena Capno] gi|213955616|gb|EEB66932.1| von Willebrand factor, type A [Capnocytophaga sputigena Capno] Length = 607 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 40/344 (11%), Positives = 91/344 (26%), Gaps = 24/344 (6%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 V+ R + + + ++ + K+ + E I + TKD Sbjct: 100 VVVTSYETAKRRDVTSSSIRIRGAGSVSRNYRTKNSYTLNSSLETYKAIDEGGFKKTTKD 159 Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + A Y L + + + Sbjct: 160 PVTTFSADVDR-ASYSNVRRMLNYGQKPHKDAVRIEELINYFDYDYAPPAEGGKTPLKAT 218 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 + + + K + S S P K+ +L S Sbjct: 219 TTLSSCPWNPDNYLLRIGLQAKKIDFTKAPPSNIVFLIDTSGSMDEP----NKMPLLKAS 274 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 L+++++ RI + Y + + ++ ++ L +T Sbjct: 275 FKLLLDNLRPED--------RIAIVVYASQTGIALPSTPAKEKEKISKVIDDLVASGSTA 326 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + AY N +I TDG+ + + ++ L ++ E R Sbjct: 327 GGAGLQTAYEVAEKNFLPKGNNR--------IILATDGDFNVGISSRDEL--QRLVEEKR 376 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLES 379 N G+ I + D+ + G + +++ E + Sbjct: 377 NNGIYISVLGYGMGNYRDDMAETIANKGNGNYAYIDNFTEAKKV 420 >gi|119493582|ref|ZP_01624246.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] gi|119452572|gb|EAW33755.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] Length = 414 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 22/180 (12%), Positives = 64/180 (35%), Gaps = 21/180 (11%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ++ + ++AG L++ + RI + ++ ++ +K ++N+L Sbjct: 59 LETVKKAAGELIDRLNPGD--------RISVVVFDHRAKVLIPNQDIDDPESIKKQINRL 110 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ + EL K +TDGE + + + L Sbjct: 111 RTSGGTSIDEGLKLGIEELGKGKVERI---------SQAFLLTDGE----NEHGDNNRCL 157 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 ++ + + + + S+ L + + G + ++++ F ++ +++Q Sbjct: 158 KLAKLATDYNLTLNSLGFGNDWNQDILEKIADEGGGTLAYIEYPEQVIDEFSRLFNRMQS 217 >gi|146307954|ref|YP_001188419.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] gi|145576155|gb|ABP85687.1| von Willebrand factor, type A [Pseudomonas mendocina ymp] Length = 334 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 26/145 (17%), Positives = 56/145 (38%), Gaps = 34/145 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + V++ L++ +NT A+ A + L + ++ Sbjct: 150 PLTFDRHTVRTWLDEAMIGIAGKNTAIGDAIGLAVKRL----------RQRPAQSRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------GQD 355 ITDG N+G ++ + + G++IY++ + A P + Sbjct: 200 ITDGANNGG-----EIDPMVAAQLAAEEGVRIYAIGIGADPRQSGVLGAFGFSALDLDET 254 Query: 356 LLRKCTD-SSGQFFAVNDSRELLES 379 LR + + G++F + EL + Sbjct: 255 SLRAIAEATGGEYFRARNQAELTQI 279 >gi|166367777|ref|YP_001660050.1| hypothetical protein MAE_50360 [Microcystis aeruginosa NIES-843] gi|166090150|dbj|BAG04858.1| hypothetical protein MAE_50360 [Microcystis aeruginosa NIES-843] Length = 420 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 25/140 (17%), Positives = 55/140 (39%), Gaps = 15/140 (10%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 ++L ++S++ +L T + +E ++ GS + +TDGE Sbjct: 99 DDLTLIRSKIQQLQAGGGTAIDEGIKLGIQE---------SSTGSKGYVSHIFLLTDGE- 148 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + + N L++ E G+ + + QD+L K D + G + + Sbjct: 149 ---NEHGNNQRCLKLAEVAAEYGITLNTFGFG-DHWNQDILEKIADIAGGSLSYIERPEQ 204 Query: 376 LLESFDKITDKIQEQSVRIA 395 L F ++ +++Q + A Sbjct: 205 ALIEFTRLFNRLQSVRLTNA 224 >gi|296474583|gb|DAA16698.1| collagen, type XXI, alpha 1 [Bos taurus] Length = 507 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 63/176 (35%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + +++G + Y+ V + + + + ++ NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSHESGENLMAAMESIHYLGGNTRTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ + L K + +TDG++ E R++ + +++ Sbjct: 124 LDYLF--------AKSARFLTKIAVVLTDGKSQD--------EVKDAAEAARDSRITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSE-TEEAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|151556217|gb|AAI49226.1| COL21A1 protein [Bos taurus] Length = 518 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 63/176 (35%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + +++G + Y+ V + + + + ++ NT T A+ A Sbjct: 64 TKNFDIGPKFIQVGVVQYSDYPVLEIPLGSHESGENLMAAMESIHYLGGNTRTGKAIQFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ + L K + +TDG++ E R++ + +++ Sbjct: 124 LDYLF--------AKSARFLTKIAVVLTDGKSQD--------EVKDAAEAARDSRITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSE-TEEAELRAIANKPSSTYVFYVEDYIAISKIREVMKQKLCEESV--CPTR 220 >gi|126330546|ref|XP_001381755.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Monodelphis domestica] Length = 495 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 42/351 (11%), Positives = 96/351 (27%), Gaps = 50/351 (14%) Query: 68 IKKHLKQGSYIRENAGDIAQKAQIN---ITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + E G + +I+ I P + + A+ + LF G+ Sbjct: 121 LAIQFAINKAFSEVEGSRLKFPEISKVAIVVTDGRPQDGVKDVSARAKQSGIELFAIGVG 180 Query: 125 PSALTNLSLRSTGI----------IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 L ++ + VS N+ Sbjct: 181 RVDKHTLRQIASEPLDDHVDYVESYSVIEKLSKKFQEAFCVVSDLCATGDHDCQQICNSS 240 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR--------------KIDVLIESAGNL 220 + T + A A +++ + Sbjct: 241 PGSYTCSCRDGFTLNSDGKTCSACNGAAGGSATDLVFLIDGSKSVRPENFELVKRFINQI 300 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYP 279 V+S+ + + ++G + Y+ + ++K+ + K++ + T T Sbjct: 301 VDSL-----DVSDKLAQVGLVQYSSSVRQEFPLGRYKTKKDIKAAVKKMSYMEKGTMTGA 355 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A+ + + + +K I TDG + + ++ G Sbjct: 356 ALKYLIDNTFTISSGAR-----PGAQKVGIVFTDGRSQDYIND--------AAKKAKDLG 402 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 K+++V V +D LR+ +F D + + + K+ KI Sbjct: 403 FKMFAVGVG--NAVEDELREIASEPVAEHYFYTADFKTINQIGKKLQKKIC 451 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 53/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G I Y + + + + K+ P T T A+ A + Sbjct: 71 DVGPNTTRVGVINYASAVKHEFPLKAHRSKASLLQAVRKIEPLSTGTMTGLAIQFAINKA 130 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E S + K I +TDG + + +G++++++ V Sbjct: 131 FSEVEGSRLKF--PEISKVAIVVTDGRPQDG--------VKDVSARAKQSGIELFAIGVG 180 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L + F + Sbjct: 181 R--VDKHTLRQIASEPLDDHVDYVESYSVIEKLSKKFQE 217 >gi|209545606|ref|YP_002277835.1| hypothetical protein Gdia_3496 [Gluconacetobacter diazotrophicus PAl 5] gi|209533283|gb|ACI53220.1| conserved hypothetical protein [Gluconacetobacter diazotrophicus PAl 5] Length = 568 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 36/387 (9%), Positives = 89/387 (22%), Gaps = 81/387 (20%) Query: 83 GDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERS 142 G + P N + +T Sbjct: 184 GGGTGADTVPNLWVSIVPFAGEMNIFGSTYGGPSNWRSMP-SGWLTAGSDISTTRYGSNG 242 Query: 143 SENLAISICMVLDVSRSMEDLYLQKHNDNNN-------MTSNKYLLPPPPKKSFWSKNTT 195 ++ + S + + T KY + W Sbjct: 243 WMGCVMARYSGYNNSPAHIYDVNDANPIQAPFTPFYWPSTYQKYSQSSWFGGTSWVVGDN 302 Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSI---QKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S G + S ++ + N+G + Sbjct: 303 DW---ILSGGVVTPSSAARSSYGQMAESPLITSFPTTSGSLVTESGLQVGPNLGCDPSPT 359 Query: 253 TPLSNNLNEVKSRLNKL--NPYENTNTYPAMHHAYRELYNEKESSHN------TIGSTRL 304 P + + + V++ ++ + T A+ + + + + + Sbjct: 360 LPETASRSVVEAHISSMPMMSRGGTMLPQALQAGWFTISPNWQGFWPNPALPLAYNTPNM 419 Query: 305 KKFVIFITDGEN-------------------------------------------SGASA 321 K ++ +TDG N +G Sbjct: 420 TKVLVLMTDGNNQICPCFPVYNYYGPVAPPQSNGDTDMVAYGRLLQNELGVVSSYNGNGY 479 Query: 322 YQNT-----------LNTLQICEYMRNAGMKIYSV-----AVSAPPEGQDLLRKCTDSSG 365 Y + +C+ ++N+G+ IY + A Q +L+ C G Sbjct: 480 YGSNGFSSNILPEMNSLVSTVCDNIKNSGITIYVILYTHEGEEADATTQAMLQNCASKPG 539 Query: 366 QFFAVNDSRELLESFDKITDKIQEQSV 392 ++ + + ++F + ++ + Sbjct: 540 NYYDAPTAASMKQAFSDLGGQLSALRI 566 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/344 (11%), Positives = 91/344 (26%), Gaps = 44/344 (12%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIV--SDRTIKDPTTKKDQ 59 A+ ++LA I ++ ++Q+ALDAA + + ++ + D Sbjct: 11 MAVCAFAMLAISMMGVELARIYIVQERLQTALDAASIVAAREMSAVNNVGTCTGSCASDT 70 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 T+ + H G + N ++ Q L Sbjct: 71 TAIFWANFSSAHQANGLGPFQAVSTGPVI-------TPQNASTITIQANVQL-----PLL 118 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 ++ + LS + N+ + + +VLD + S+E ++ + + Sbjct: 119 FTKILGVSQIALSEHAQA----VRSNMGMELALVLDNTDSLEAQGIEDLQCGAKILVDTV 174 Query: 180 LLPPPPKK-----------SFWSKNTTKSKYAPAPAPANRKIDVLIE--SAGNLVNSIQK 226 P + W + S S Sbjct: 175 YGVAAPGSCGGGTGADTVPNLWVSIVPFAGEMNIFGSTYGGPSNWRSMPSGWLTAGSDIS 234 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + N + Y+ N+ N ++ + + Sbjct: 235 TTRYGSNGWMGCVMARYSGYNNSPAHIYDVNDANPIQ------------APFTPFYWPST 282 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 +SS S + ++ G + +SA +++ + Sbjct: 283 Y-QKYSQSSWFGGTSWVVGDNDWILSGGVVTPSSAARSSYGQMA 325 >gi|124006869|ref|ZP_01691699.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123987550|gb|EAY27259.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 351 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 31/190 (16%), Positives = 63/190 (33%), Gaps = 44/190 (23%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYN 290 RIG + ++ +PL+ + +K + + T A+ + Sbjct: 149 YDRIGLVIFSGEAYS--VSPLTTDYKLLKRYIEDIREDMIQENGTAIGSALGMGTIRMQE 206 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS-- 348 S K VI I+DG+N+ + L+ + +KIY++ V Sbjct: 207 SASRS----------KVVILISDGDNTAGN-----LDPITASRLATAHNIKIYTILVGRS 251 Query: 349 ----------------APPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD----KI 387 + +LR+ G+F+ +D++ L F +I +I Sbjct: 252 GKVPYGRDMFGQPQYVNNTVDESVLREIAKIGEGKFYRASDNQALKNVFAEINRLEKTEI 311 Query: 388 QEQSVRIAPN 397 E + + Sbjct: 312 IENRFKSIKD 321 >gi|21232653|ref|NP_638570.1| hypothetical protein XCC3224 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66767265|ref|YP_242027.1| hypothetical protein XC_0933 [Xanthomonas campestris pv. campestris str. 8004] gi|21114459|gb|AAM42494.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572597|gb|AAY48007.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 335 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 72/194 (37%), Gaps = 35/194 (18%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN- 270 VL + + + + + + + + G TPL+ +L V+ +L Sbjct: 117 VLGGNVVDRLTAAKAVLSDFLDRREGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVV 176 Query: 271 --PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T A+ + + L +++ ++ V+ +TDG N+ LN Sbjct: 177 GLAGRETAIGDAIALSVKRLREQRQG----------QRVVVLLTDGVNTAGV-----LNP 221 Query: 329 LQICEYMRNAGMKIYSVAVSAPPE----------------GQDLLRKCTD-SSGQFFAVN 371 L+ E + G++++++A +D LRK + + G+FF Sbjct: 222 LKAAELAKAEGVRVHTIAFGGSGSYSLFGVPIPAGGGDDIDEDGLRKIAEQTGGRFFRAR 281 Query: 372 DSRELLESFDKITD 385 D+ EL + ++ Sbjct: 282 DTEELAGIYAELDR 295 >gi|118394228|ref|XP_001029494.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89283721|gb|EAR81831.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 406 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 58/183 (31%), Gaps = 19/183 (10%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL--SN 257 KI+ + + L+ + N + R+ I +N + Sbjct: 169 NSGSMSGCSKIENVKNTILQLLEML--------NENDRLSLITFNTKAKQLCGLKKVNNQ 220 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N +++ + T+ + A++ L + + ++DG++ Sbjct: 221 NKESLQTITKSIKADGGTDITSGLEIAFQILQS--------RKQKNSVSSIFLLSDGQDD 272 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 GA L + ++ I+S +G + + G F+ V + ++ Sbjct: 273 GADIKIKNLLKTTY-QQLQEESFTIHSFGFGNDHDGPLMQKIAQIKDGSFYFVEKNDQVD 331 Query: 378 ESF 380 E F Sbjct: 332 EFF 334 >gi|134300085|ref|YP_001113581.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1] gi|134052785|gb|ABO50756.1| von Willebrand factor, type A [Desulfotomaculum reducens MI-1] Length = 416 Score = 74.6 bits (181), Expect = 3e-11, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 62/191 (32%), Gaps = 20/191 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K+D ++ V + +A++ + + N + +K + Sbjct: 58 KLDYTKKAVAFAVGHLSPQDY--------CSVVAFDDMVTMVASSHQVANKDALKMAVES 109 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + P +TN M RE+ + + V+ +TDG + + L Sbjct: 110 IYPGGSTNLSGGMLLGVREVKLAHKENQINR--------VLLLTDGMANVGVTDHSALVE 161 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 M G+ + + + +DLL+ + G F+ + ++ F++ + Sbjct: 162 KS--REMAAGGVNLSTFGLG-EDFEEDLLQAMVEAGGGNFYYIEKPDQIPGIFEQELTGL 218 Query: 388 QEQSVRIAPNR 398 + + Sbjct: 219 LSIVAQNLSVK 229 >gi|55380211|ref|YP_138060.1| calcium-binding protein-like [Haloarcula marismortui ATCC 43049] gi|55232936|gb|AAV48354.1| calcium-binding protein-like [Haloarcula marismortui ATCC 43049] Length = 1562 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 45/354 (12%), Positives = 91/354 (25%), Gaps = 52/354 (14%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 L S+ + I + TT Q F ++ + N+ A + Sbjct: 365 TLGVTLSLTGNGDIGNGTTIAPQDDPRFNTSRVGNMSASPVVELNSEQEFSSANV----- 419 Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 L Y + S ++E S V D+ Sbjct: 420 ---TLGYNETGVENESQDLAVFTYDPEAGIFVPLNSTVDATNNTATAETTHFSTFAVFDI 476 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 S + + + + Sbjct: 477 S----------NWATTYNATEPVRQTDDDGLRPVDVTLVMDTSGSMSSSVKLRNTAGQRF 526 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 L++ + A+ + + + L+++ S L+ L T+ Sbjct: 527 VAGLLDVDRAAVVDFDSSAY--------------VAQDLTSDFGAANSTLDNLGSGGGTD 572 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + A + + + + +I +TDG +G + + Sbjct: 573 IGSGLSTANSQFASN--------SNDSRAQVMILLTDGRGNGG---------ISEAQTAA 615 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQE 389 N +Y+V +D LR + + G+F V D EL F +I + E Sbjct: 616 NQNTTVYTVGF--DNANRDKLRDIANITDGEFNYVTDRSELPNVFSRIAENTTE 667 >gi|118353832|ref|XP_001010181.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89291948|gb|EAR89936.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 542 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 53/149 (35%), Gaps = 14/149 (9%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 + R+ I +N +NN N+ K +NK+ T+ M A+R L + Sbjct: 166 DNDRLCLILFNSYATRLCHLMKTNNSNKPAFKEIINKIYSTGGTDINSGMELAFRVLKD- 224 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 V ++DG++ + + ++ I+S + Sbjct: 225 -------RKYQNPVSSVFLLSDGQD----GSADLRVRQSLERHLPQECFTIHSFGFGSDH 273 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESF 380 +G + + C+ G F+ V ++ E F Sbjct: 274 DGPLMNKICSLKDGNFYYVEKINQVDEFF 302 >gi|26988754|ref|NP_744179.1| von Willebrand factor type A domain-containing protein [Pseudomonas putida KT2440] gi|24983548|gb|AAN67643.1|AE016394_4 von Willebrand factor type A domain protein [Pseudomonas putida KT2440] gi|313499848|gb|ADR61214.1| Von Willebrand factor type A domain-containing protein [Pseudomonas putida BIRD-1] Length = 358 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 61/187 (32%), Gaps = 35/187 (18%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-- 270 + ++ ++ + + G PL+ + V++ L + Sbjct: 109 WQNEDISRLDLVKALMGDFLQDREGDRVGLILFGSQAYLQAPLTFDRRTVRTFLIEAQIG 168 Query: 271 -PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +NT A+ A + L + ++ ITDG N+G + L Sbjct: 169 IAGKNTAIGDAIGLAVKRL----------RERPAQSRVLVLITDGANNGGQIH-----PL 213 Query: 330 QICEYMRNAGMKIYSVAVSAPPE----------------GQDLLRKCTD-SSGQFFAVND 372 G++IY++ + A PE + L++ D + G +F +D Sbjct: 214 TAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALKEIADITHGAYFRAHD 273 Query: 373 SRELLES 379 EL Sbjct: 274 GAELDAI 280 >gi|297560911|ref|YP_003679885.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296845359|gb|ADH67379.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 315 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 23/199 (11%), Positives = 57/199 (28%), Gaps = 34/199 (17%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +++ +SA V ++ R + +++ V + Sbjct: 106 NRLEAAKKSAQGFVETLPD----------RFNVGLVAFSSTATVVSSPTHDHQAVIGSIE 155 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L T + + + + E + ++ ++DGEN+ + Sbjct: 156 NLQLGPGTAIGEGVFASLESISSFDEDA----DVDPPPSAIVLLSDGENTSGR------D 205 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPE-------------GQDLLRKCTD-SSGQFFAVNDS 373 Q + + ++A ++ LR + G F+ Sbjct: 206 ISQAVAMAAEQEVPVSTIAFGTGAAMIEIDGYQVPADIDKEALRGLASDTGGHFYEAESE 265 Query: 374 RELLESFDKITDKIQEQSV 392 EL E ++ I + + V Sbjct: 266 TELDEVYEDIGSSLGTELV 284 >gi|163800205|ref|ZP_02194106.1| hypothetical protein 1103602000595_AND4_05979 [Vibrio sp. AND4] gi|159175648|gb|EDP60442.1| hypothetical protein AND4_05979 [Vibrio sp. AND4] Length = 334 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 65/206 (31%), Gaps = 47/206 (22%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + + V K R+G + + TPL+ + Sbjct: 115 NDNGEYIDRLTAVKRVLSDFVE---------KRQGDRLGVVLFGDHAYLQ--TPLTADRR 163 Query: 261 EVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V ++N+ + T + + + S ++ +I ++DG N+ Sbjct: 164 TVMQQINQAVIGLVGERTAIGDGIGLGTKTFVD----------SDAPQRVMILLSDGSNT 213 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------EGQDLLRK 359 L L+ E + IY+V V A + Q L + Sbjct: 214 AGV-----LEPLEATEIAQKYNATIYTVGVGAGEMMVKDFFMTRKVNTASDLDEQTLTKI 268 Query: 360 CTDSSGQFFAVNDSRELLESFDKITD 385 + G++F D++EL +D I Sbjct: 269 AEMTGGKYFRARDAKELEAIYDTINQ 294 >gi|149923979|ref|ZP_01912364.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] gi|149815157|gb|EDM74708.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] Length = 785 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 38/329 (11%), Positives = 84/329 (25%), Gaps = 24/329 (7%) Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 ++ + R A T + I A Y ++L + Sbjct: 234 ADMRGYVHERSVVEARQVAASFVATGEDRKSTFSIDVDTASYASVRQSLRNGWMPDPGSV 293 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 + + + H LP ++ Sbjct: 294 RTEEMINYFDYGYVAPSGGAGAPFAVHTEVGPCPWAPDHRLVQIGVQATRELPAQAQELR 353 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + K+ ++ LV + + + Y Sbjct: 354 TRNLVFL-LDVSGSMSSRGKLPLIKHGFTQLVEQLGAED--------HVSIVVYAGAAGV 404 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + + L++L TN + AY VI Sbjct: 405 VLPPTSGDQKETILGALDRLEAGGGTNGSAGIVEAYELAQANFVDGGVNR--------VI 456 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFF 368 TDG+ + + + L +++ E R +G+ + + V +L+ + D G + Sbjct: 457 LGTDGDFNVGLSDHDAL--VELIEQKRESGVFLSVLGVG-GHYDDELMEQLADHGNGNYA 513 Query: 369 AVNDSRELLESFDKITDKIQEQSVRIAPN 397 ++ RE + + ++I IA + Sbjct: 514 FLDGKREAEKV---LVEEIGGTLTTIAKD 539 >gi|327274978|ref|XP_003222251.1| PREDICTED: collagen alpha-6(VI) chain-like [Anolis carolinensis] Length = 2025 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 58/162 (35%), Gaps = 17/162 (10%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTN 276 +++ + K+ + N V+ G + Y L+ N ++ ++ + T Sbjct: 839 DFVIDIVNKS--DVGNNRVQFGAVKY--SAYPQILFNLNGNKADIIDKIKGDTLLNDTTY 894 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T A+ HA K + + + ++ ITDG + + +R Sbjct: 895 TAEALRHAENLFTESK----GSRKRRGVPQLLMVITDG------TSHDKDKLDAVSTRIR 944 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 N G+ IY++ + ++ L ++ F V+ L Sbjct: 945 NDGITIYAIGI--KDAKREELEIIAENKDHVFFVDTFDGLKN 984 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 52/152 (34%), Gaps = 18/152 (11%) Query: 242 AYNIGIVGNQCTPL-SNNLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNEKESSHNTI 299 ++ ++ +++ + ++ +T NT A+ + K++ Sbjct: 671 HFSDTSKEVFSLNKNTSKKSDIIQAVEDMSLIGSTTNTGGALRFVSKYFKLAKQARP--- 727 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + K ++ ITDGE S +RN G+ IYSV V + L + Sbjct: 728 ---SVNKILVLITDGEASD--------EVTAPATELRNDGIIIYSVGVF--NANKTQLEE 774 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + + F V + L + I I Sbjct: 775 ISGKPEKVFYVENFDILEDIKGDIIFGICSPY 806 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 54/162 (33%), Gaps = 18/162 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 VR G + Y+ + ++ ++ +N + +T T A+ Sbjct: 475 TVGPDHVRFGVVQYSNIHRTEFEIDKHSTISNLEKAINNIQYLTGDTYTGAALESMLGLF 534 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ N + ++I +TDG+ + + +R AG+ + ++ V Sbjct: 535 ESARKQRKNK-----VPTYLIVLTDGDPHD--------KVKEPADRLRKAGINVIAIGVG 581 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L++ + V+ L D I I + Sbjct: 582 --DIKWKGLQEI--GESNVYYVHQYASLKTIKDNIVQDICSE 619 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 69/186 (37%), Gaps = 23/186 (12%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 +++ + LV+SI + +++ G ++ + +E+K+++ Sbjct: 1015 KNDFEIVKDFLTKLVDSIS------FHDNIQFGMAQFSDIYSEEFPLGHYQSKSELKNKI 1068 Query: 267 NK--LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + +T + S + + + ++ TDGE + Sbjct: 1069 ANVSMQAGLHTYIGKGLKEV-----KAFFKSPRRRVARNVHQKLLIFTDGE--------S 1115 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 + Q E +R G++I++V V + L++ T S + + + L IT Sbjct: 1116 KDSFTQPAEDLRREGVEIHAVGVGKIEHAK--LQQITVSPERIYTTANYTGLPHITKGIT 1173 Query: 385 DKIQEQ 390 +++ ++ Sbjct: 1174 EEMCKE 1179 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 34/336 (10%), Positives = 88/336 (26%), Gaps = 43/336 (12%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEI----PTENLFLKGL----IPS 126 G +R T K+ + E P + L G+ + Sbjct: 119 GGSLRTGNAIQKVHEAFFKTTQKDRNQIVVVTVSGSSEDDVEGPAKMLQDAGIKIIALGI 178 Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 + + + S+ + ++ K Sbjct: 179 QDAQQHELQSMATKFYHFRTPRDLQAFSPNMSSVIESAVEMDIGTVADLFYKVCRSDSVA 238 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 + A + L S +L + +RIG + Y+ Sbjct: 239 DVVF---VVDESVGNANVEYIK--TFLQNSINSL---------DVTEECIRIGLVKYSTE 284 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 ++ R+ +P N A++ ++++ E + + +K Sbjct: 285 TQVVSFLSKETEKKDILQRIQSFSPRAGKANLGAAINITRKQVFTE---RAGSRKNQGVK 341 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 + IT + + + E + NA + ++++ + L + + Sbjct: 342 QIATIITHRPSDDSL--------TEAAEQLINADITVFAIGIE-GANISQLNQVVSYPPN 392 Query: 366 QFFAVNDSREL--------LESFDKITDKIQEQSVR 393 + +L + F++I + + + R Sbjct: 393 RNIIQVTFSDLPRESDTLKKKLFNEIQNMLYVKRER 428 >gi|189485266|ref|YP_001956207.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287225|dbj|BAG13746.1| aerotolerance-related cytoplasmic membrane protein BatA [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 333 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 64/194 (32%), Gaps = 42/194 (21%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----N 274 N + + +K I++ + PL+ + + + +N +N + Sbjct: 111 NRMEAAKKVIRDFMKERKYDRIGLVIFSGLAFTQCPLTTDKDSLAEFINNINIGDTGLDG 170 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T A+ + L S + +I +TDG N+ ++ L + Sbjct: 171 TAIGSAIMTSVNRL----------KDSRAKSRIIILVTDGNNNMG-----EIDPLTASKI 215 Query: 335 MRNAGMKIYSVAVSA----------------------PPEGQDLLRKCT-DSSGQFFAVN 371 R+ +KIY+V V + + +L++ ++SG +F Sbjct: 216 ARSYDIKIYAVGVGSLDGAIYEVDDPFLGKREIKYRKDAINESVLKEVAYNTSGGYFRAQ 275 Query: 372 DSRELLESFDKITD 385 D + +I Sbjct: 276 DVKSFENIMKQIDK 289 >gi|170727371|ref|YP_001761397.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] gi|169812718|gb|ACA87302.1| von Willebrand factor type A [Shewanella woodyi ATCC 51908] Length = 330 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 51/153 (33%), Gaps = 36/153 (23%) Query: 254 PLSNNLNEVKSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V L ++ T A+ + + + ++ Sbjct: 143 PLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALSVKRFDLV----------DESNRILVL 192 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEG 353 +TDG N+ S Q + G+KIYS+ V A Sbjct: 193 LTDGSNNSGSIS-----PEQAADIAAKRGIKIYSIGVGADVMERRTLFGKERVNPSMDLD 247 Query: 354 QDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 ++ L + G++F +++EL + + +I Sbjct: 248 EEQLTSLAQTTGGRYFRARNAQELEQIYQEIDK 280 >gi|126174972|ref|YP_001051121.1| von Willebrand factor type A [Shewanella baltica OS155] gi|125998177|gb|ABN62252.1| von Willebrand factor, type A [Shewanella baltica OS155] Length = 339 Score = 74.2 bits (180), Expect = 3e-11, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 64/204 (31%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + ++ + + + RIG I + PL+ + V Sbjct: 104 NGKTVDRFTLIQHVVSDFIERRK---------GDRIGLILFADHAYLQA--PLTQDRRSV 152 Query: 263 KSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 L ++ T A+ A + + +I +TDG N+ Sbjct: 153 AQFLKEAQIGLVGKQTAIGEAIGLAVKRFDKM----------DESNRVLILLTDGSNNSG 202 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD 362 + + Q + N + IY+V V A ++ L+ + Sbjct: 203 N-----IEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAE 257 Query: 363 -SSGQFFAVNDSRELLESFDKITD 385 + G++F +S+EL + + +I Sbjct: 258 VTHGRYFRARNSQELDQIYQEIDK 281 >gi|113969745|ref|YP_733538.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113884429|gb|ABI38481.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 338 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 63/204 (30%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + ++ + + RIG I + PL+ + V Sbjct: 104 NGKVVDRFTLIQHVVSEFIERRK---------GDRIGLILFADHAYLQA--PLTQDRRSV 152 Query: 263 KSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 L ++ T ++ A + + +I +TDG N+ Sbjct: 153 AQFLKEAQIGLVGKQTAIGESIALAVKRFDKM----------DESNRVLILLTDGSNNAG 202 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD 362 + + Q + N + IY+V V A ++ L+ + Sbjct: 203 NIDPD-----QAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAE 257 Query: 363 -SSGQFFAVNDSRELLESFDKITD 385 + G++F +S+EL + + +I Sbjct: 258 VTHGRYFRARNSQELEQIYQEIDK 281 >gi|302143246|emb|CBI20541.3| unnamed protein product [Vitis vinifera] Length = 630 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 20/178 (11%), Positives = 52/178 (29%), Gaps = 18/178 (10%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ +L + L+ ++ + + + + + + N +N Sbjct: 220 SKLSLLKRAVCFLIQNLGPSDRLSIVSFSSTARRIFPLRRMSD------NGREAAGLAIN 273 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L TN + R L +I ++DG+++ N Sbjct: 274 SLTSSGGTNIVEGLKKGVRVLEE--------RSEQNPVASIILLSDGKDTYNCDNVNRRQ 325 Query: 328 TLQICEYMRNAG----MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 T G + +++ + + + +S G F + + ++F Sbjct: 326 TSHCASSNPRQGRQAIIPVHTFGFGSDHDSTAMHAISDESGGTFSFIESVATVQDAFA 383 >gi|226315301|ref|YP_002775197.1| hypothetical protein BBR47_57160 [Brevibacillus brevis NBRC 100599] gi|226098251|dbj|BAH46693.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 597 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 21/148 (14%), Positives = 58/148 (39%), Gaps = 17/148 (11%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + N++K+ ++ L T+ + A + L + + ++ + DG N Sbjct: 101 DKNDIKAFIDSLQKGAYTDIAVGVTEAVKILDAGR--------NPNNAPIIVLLADGNNF 152 Query: 318 GASAYQ-----NTLNTLQICEYMRNAGMKIYSVAVSAPP--EGQDLLRKCTDSSGQFFAV 370 A + Q + ++ G +Y++ ++A L + +++G+FF Sbjct: 153 LNKASSRTQAKSDQELQQAVKEAKDKGYPVYTIGLNADGQLNRTTLQQIAAETNGKFFET 212 Query: 371 NDSRELLESFDKITDKIQEQSVRIAPNR 398 + + +L + +I +++ P + Sbjct: 213 STADKLPQILSEIF--ANHLKLKVVPVK 238 >gi|212702323|ref|ZP_03310451.1| hypothetical protein DESPIG_00334 [Desulfovibrio piger ATCC 29098] gi|212674201|gb|EEB34684.1| hypothetical protein DESPIG_00334 [Desulfovibrio piger ATCC 29098] Length = 1151 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 38/328 (11%), Positives = 92/328 (28%), Gaps = 18/328 (5%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + N D + + + + + ++ Sbjct: 477 APTSNADVIVNGNSGHDNFVGDPGGATESTQTTYTDLNVALVVDTSGSMDGTRMSETKEA 536 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + ++ E+ + L + L + ++ Y Sbjct: 537 LKDLCDQLKEHADEGADVNLSLIGFSGALNINLPVEDITGIEENYRKFTLRDGDTIIGKV 596 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + P + G ++S + R + G Sbjct: 597 GTEFSVTSYVPYDDWS-------GGQISSTTTYRITEDGTLQRWESGFMGWGGSWEDVRT 649 Query: 255 LSN----NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 S V ++++ + T A ++S +++ + + VIF Sbjct: 650 FSTESLTGYEAVLAQIDDMEAIGGTVYSDGYEAAKDWF--GGKTSPDSLQNNGGENIVIF 707 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS-APPEGQDLLRKCTDSSGQFFA 369 +TDGE + + +N N L + + +V ++ + DLL T ++ Sbjct: 708 VTDGEPNNEWSAKNAYNQLVAAV----DNITVETVGIAITDKDATDLLNGLTTNNNGAHF 763 Query: 370 VNDSRELLESFDKITDKIQEQSVRIAPN 397 + D+ +L + F +I I +V A + Sbjct: 764 IEDASKLGDVFGEIVSDITTSTVTPAAD 791 >gi|90417299|ref|ZP_01225225.1| batB protein, putative [marine gamma proteobacterium HTCC2207] gi|90330884|gb|EAS46147.1| batB protein, putative [marine gamma proteobacterium HTCC2207] Length = 330 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 29/154 (18%), Positives = 57/154 (37%), Gaps = 36/154 (23%) Query: 253 TPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TPL+ + +++ L + T A+ + + L E+ + VI Sbjct: 149 TPLTFDRKTMQTLLYEAQLGFAGNGTAIGDAIGLSVKRLQQRPEN----------HRVVI 198 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PE 352 +TDG N+ L+ L+ E +A +KIY++ V A Sbjct: 199 LLTDGANNAG-----ELDPLKAAELASSAKVKIYTIGVGAETQEAWGLFGKRVTNPSADL 253 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L + + GQ+F + EL+ + ++ Sbjct: 254 DEQTLTAIAEATGGQYFRARNPEELMAIYQELNR 287 >gi|315649108|ref|ZP_07902201.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275543|gb|EFU38898.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 983 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 32/153 (20%), Positives = 58/153 (37%), Gaps = 16/153 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 +A++ N EV S + + TN YPA+ A E+ K Sbjct: 446 VVAFDDQPWWVVPPQKLGNKEEVLSSIQSIPSAGGTNIYPAVSSALEEMLKIKSQR---- 501 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + +I +TDG+++ S YQ+ +T M + + SVAV + L Sbjct: 502 ------RHIILMTDGQSAMNSGYQDLTDT------MVENKITMSSVAVGTDADTHLLQSL 549 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G+++ V D L F + + + + Sbjct: 550 AEAAKGRYYFVEDETTLPAVFSREAVMLAKSYI 582 >gi|118349482|ref|XP_001008022.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289789|gb|EAR87777.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 632 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 54/153 (35%), Gaps = 12/153 (7%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSRLNKLNPYENTNTYPAMHHAYRE 287 + N + R+ I +N ++ N ++ +N + E T+ M A+ Sbjct: 238 DMLNSNDRLSLILFNSYPTLLCNLRKVDDKNTPNIQKIINSITAEEYTDINSGMLMAFNI 297 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L + ++DG+++GA + ++N I+S Sbjct: 298 LQ--------KRQFFNPVSSIFLLSDGQDNGADEKIKKYINSN--QSLKNECFSIHSFGF 347 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + +G + R C G F+ V ++ E F Sbjct: 348 GSDHDGPLMNRICQLKDGNFYYVEKINQVDEFF 380 >gi|300933821|ref|ZP_07149077.1| hypothetical protein CresD4_07088 [Corynebacterium resistens DSM 45100] Length = 676 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 69/194 (35%), Gaps = 31/194 (15%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV----GNQCTPLSN-N 258 ++D ++A L++++ + + + + A N +P+ N Sbjct: 68 GGGGTRLDAAKKAATGLIDALPDSANMGMVVYGQQESNAPNNRAAGCKDVETISPVGPIN 127 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 E+K R++ T ++ A EL E E + ++ ++DG Sbjct: 128 KGELKDRISNFKAKGYTPIGNSLLKAAEELGKEGE------------RSIVLVSDGH--- 172 Query: 319 ASAYQNTLNTLQICEYMRN-----AGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVND 372 +T +CE + + I++V A + + L S GQ+ + + Sbjct: 173 -----DTCAPPPVCEVAKKLAGEGYNLTIHTVGFHADRKARKELECIAKTSGGQYLSAEN 227 Query: 373 SRELLESFDKITDK 386 + EL S + + Sbjct: 228 ASELSNSMKFLATR 241 >gi|300869050|ref|ZP_07113652.1| von Willebrand factor, type A [Oscillatoria sp. PCC 6506] gi|300332961|emb|CBN58846.1| von Willebrand factor, type A [Oscillatoria sp. PCC 6506] Length = 411 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 64/180 (35%), Gaps = 21/180 (11%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 +D + ++AG LV+ ++ R+ IA++ ++ +K +++KL Sbjct: 59 LDTVKQAAGRLVDRLKPGD--------RLSVIAFDHKAKVIVPNQFIDDPGSIKKQIDKL 110 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T + E+ G + +TDGE + + + L Sbjct: 111 RSSGGTAIDEGLKLGIEEM---------GKGKSETVSQAFLLTDGE----NEHGDNNRCL 157 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 ++ + + M + S+ L + + G + + ++ F ++ ++Q Sbjct: 158 KLAKLAADYNMTLNSLGFGDDWNQDILEKIADAAGGTLAYIQRPEQAIDEFSRLFTRLQS 217 >gi|24374613|ref|NP_718656.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24349233|gb|AAN56100.1|AE015746_4 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 338 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 58/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKE 293 RIG I + PL+ + V L ++ T ++ A + Sbjct: 129 RIGLILFADHAYLQA--PLTQDRRSVAQFLKEAQIGLVGKQTAIGESIALAVKRFDKM-- 184 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 + ++ +TDG N+ + + Q + N + IY+V V A Sbjct: 185 --------DESNRVLVLLTDGSNNAGN-----IEPQQAAQIAANRKVTIYTVGVGADVME 231 Query: 351 --------------PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 ++ L++ D + G++F +S EL + + +I Sbjct: 232 RRTLFGRERVNPSMDLDENQLQQIADATHGRYFRARNSEELEQIYQEIDK 281 >gi|162448865|ref|YP_001611232.1| hypothetical protein sce0595 [Sorangium cellulosum 'So ce 56'] gi|161159447|emb|CAN90752.1| hypothetical protein sce0595 [Sorangium cellulosum 'So ce 56'] Length = 656 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 56/176 (31%), Gaps = 19/176 (10%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 KI++ +S L ++++ + Y + + ++ + L Sbjct: 315 DKIELAKKSLKMLTDTLKPGDT--------VALCTYAGSVREVLAPTGIESKGKILAALA 366 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L +T + AY H VI ++DG+ + + + Sbjct: 367 DLTAGGSTAMSSGIDLAYSLAERTLVKGHVNR--------VIVLSDGDANVGPTSHDEI- 417 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDK 382 L+ + R+ G+ + +V ++ + G + ++ + F + Sbjct: 418 -LKTIKRARDKGITLSTVGFGQGNYKDLMMEQLANQGDGNYAYIDSEAQARRVFSE 472 >gi|301610722|ref|XP_002934910.1| PREDICTED: collagen alpha-1(XXI) chain-like [Xenopus (Silurana) tropicalis] Length = 957 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 61/176 (34%), Gaps = 22/176 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + ++G + Y+ + + +++ R+ + NT T A+ A Sbjct: 64 TSNFNIGPKFTQVGVVQYSDYPILEIPLGSYESSDDLSRRMQSIQYLGGNTQTGNAIRFA 123 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L+ L K + +TDG++ + E R + +++ Sbjct: 124 IDNLF--------ARSLRPLTKIAVVLTDGKSQD--------EVKHVAEEARKNKITLFA 167 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + V + + LR + F V D + + + + K+ E+SV P R Sbjct: 168 IGVGSEI-EESELRAIANKPSSTYVFYVEDYIAISKIREIMKQKLCEESV--CPTR 220 >gi|148652289|ref|YP_001279382.1| von Willebrand factor, type A [Psychrobacter sp. PRwf-1] gi|148571373|gb|ABQ93432.1| von Willebrand factor, type A [Psychrobacter sp. PRwf-1] Length = 571 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 43/369 (11%), Positives = 100/369 (27%), Gaps = 34/369 (9%) Query: 33 LDAAV---LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKA 89 DAA + ++ R + TT + + KQ E + Sbjct: 62 ADAAAPVVVMATPAMAESRQLSKMTTNARIMPPPSQGYMAP--KQQENYAEIEPNAVNAT 119 Query: 90 QINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAIS 149 + + + + + L K + + N Sbjct: 120 SEQAFATLSIDTDTGSYANVRRFLNQGQLPPKDAVRVEELINYFNYDFTAAKKQANAPFL 179 Query: 150 ICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK 209 + + S + K PP F K Sbjct: 180 VSTEVVNSPWHPTNQIVKVGIKAEDLLTAKQKQPPANLVFLVD-------VSGSMDTEDK 232 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + + S L ++ I I Y + N ++ + ++ L Sbjct: 233 LQLAKSSLKMLTKQLRAQDS--------ITLITYAGNTKVVLPSTPGNQTQKILNAIDNL 284 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +TN A+ AY++ + ++ +TDG+ + + + L Sbjct: 285 TASGSTNGEAAIKLAYQQATEHFKKDGINR--------ILMLTDGDFNVGV--SSVKDML 334 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQ 388 QI R+ G+ + ++ ++ + D+ G + ++ E + + D++ Sbjct: 335 QIIRSNRDKGISLSTLGFGQGNYNDHMMEQVADNGNGNYSYIDSLSEAKKV---LIDEMS 391 Query: 389 EQSVRIAPN 397 +A + Sbjct: 392 ATFNTVAKD 400 >gi|329922540|ref|ZP_08278115.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328942084|gb|EGG38366.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 595 Score = 74.2 bits (180), Expect = 4e-11, Method: Composition-based stats. Identities = 20/142 (14%), Positives = 53/142 (37%), Gaps = 9/142 (6%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + +K +++L+ T+ + A + L + +H + D + + Sbjct: 100 DKTALKEFIDQLDRGPYTDMSVGLDEAVKVLKQGMDPAHAPMIVVLADG----NNDLDPN 155 Query: 318 GASAYQNTLNTLQIC-EYMRNAGMKIYSVAVSAPP--EGQDLLRKCTDSSGQFFAVNDSR 374 + L + + +G+ IY++ ++A + L + G+ F + + Sbjct: 156 TGRTSKEASEQLAQAVKEAKGSGIPIYTIGLNADGKLNKETLAELANQTGGKSFTTSSAD 215 Query: 375 ELLESFDKITDKIQEQSVRIAP 396 +L + +I Q ++I P Sbjct: 216 DLPQILSEIF--ASHQQLKIVP 235 >gi|253687077|ref|YP_003016267.1| hypothetical protein PC1_0676 [Pectobacterium carotovorum subsp. carotovorum PC1] gi|251753655|gb|ACT11731.1| conserved hypothetical protein [Pectobacterium carotovorum subsp. carotovorum PC1] Length = 543 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 57/495 (11%), Positives = 124/495 (25%), Gaps = 107/495 (21%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 + + + +D + Q++ A DAA L+ + Sbjct: 41 VLGAMALLVTAAFIVDTSTATGDATQIKRATDAAALAVGHQATINGEEYSQEETNKLAYE 100 Query: 63 IFKKQIKKH-------LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ----- 110 K + + + + E +K + P ++ Q Sbjct: 101 YVKNNLGMNKALSEKLVASDVSVAEGRNSATRKTYTVTVAFETKPSLMSLGARKQEVYST 160 Query: 111 -------YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLA----ISICMVLDVSRS 159 EI + + + +L S +ER + ++ S+S Sbjct: 161 SEVINRPTEIALVMPVTGDMSDADIRSLKSVSRSFVERMLSSADGKRDNLWLSLVPYSQS 220 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPP----------------------------------P 185 + + N ++ L PP Sbjct: 221 VNVYDAEDANRIRRWSTPSALNPPELRSLFASGVVSSLADRRFPDRRANLLCMYRGLGRE 280 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + FW + + + + + +V I + Sbjct: 281 ENFFWDEPPVGQFRVYYRHDLPQNGSPGAPPITWRGPNPDLYPWDNNSDAVDTRFIVADR 340 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL- 304 G PL+N +++ R+ + P NTN AM A L S +T Sbjct: 341 GCPNAALMPLTNEESKLNQRIAEFTPRFNTNYAIAMSWAGAALSPNMRGSDGWGDTTLPL 400 Query: 305 ----------KKFVIFITDGENSGASAYQNTLN--------------------TLQICEY 334 +K ++ + + + N +C Sbjct: 401 DFNLDGNGDGQKVIVMMANTIGNWFDTDSYNFNRNEFRGSTGTDPARSFAAQRFQDLCSS 460 Query: 335 MRNAGMKIYSVA--------VSAPPEGQDL---LRKCTDSSGQFFAVNDS--------RE 375 R +K Y V ++ L CT+ + ++ + + Sbjct: 461 FRARNIKFYFVGIRPGDPEDFGRNLFDREATPGLLVCTEGEKRMSFIDGAGFGAEGVEDQ 520 Query: 376 LLESFDKITDKIQEQ 390 L++ D+I +I+ + Sbjct: 521 LIQRLDRIAGQIETE 535 >gi|301064759|ref|ZP_07205139.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300441134|gb|EFK05519.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 332 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 33/218 (15%), Positives = 72/218 (33%), Gaps = 52/218 (23%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 ++DV+ + + RIG +A+ +PL+ + + + RL Sbjct: 111 ANRVDVVKKVVFRFIGE---------RPDDRIGLVAFAGRPYM--VSPLTLDHDWLGRRL 159 Query: 267 NKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++P + T A+ + L K VI +TDG N+ Sbjct: 160 QTIHPGMVEDGTAIGSAIGSSINRL----------RDQKAKSKVVILLTDGMNNAG---- 205 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAP---------------------PEGQDLLRKCTD 362 + + E G+KIY++ + + L K Sbjct: 206 -KILPVTAAEAAETLGIKIYTIGAGSRGEVPVPITDKFGNQKIVRAKVDIDEATLEKVAQ 264 Query: 363 -SSGQFFAVNDSRELLESFDKITD-KIQEQSVRIAPNR 398 + +++ D+ L + + +I + ++ +R +R Sbjct: 265 MTGAKYYRATDTDSLKKIYSEINKLETTKRKIRKFEHR 302 >gi|114567231|ref|YP_754385.1| chloride channel [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114338166|gb|ABI69014.1| conserved putative chloride channel [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 951 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 56/153 (36%), Gaps = 16/153 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 +A++ + + ++ + + T+ YPA+ AY L + Sbjct: 451 VVAFDDTAQWVVEFQAVKDKDAIQDDIATIRADGGTSIYPALALAYTALKDAHTKF---- 506 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 K +I +TDG+ T + + M AG+ + +VAV + L + Sbjct: 507 ------KHIILLTDGQ------SATTGDYYFLSRRMARAGITMSTVAVGEGADTLLLEQL 554 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 G+++ ++ + F K T K + + Sbjct: 555 AAWGQGRYYFSDEISNIPRIFTKETMKAIKSYL 587 >gi|108756796|ref|YP_635538.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108460676|gb|ABF85861.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 700 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 20/182 (10%), Positives = 54/182 (29%), Gaps = 19/182 (10%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ ++ + LVN + ++ + Y + + Sbjct: 342 VSGSMNLENRLGLVKRALHLLVNELD--------ERDQVSIVVYGSTARLVLEPTSAVHA 393 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + +++ ++ L+ +TN + Y + VI +DG + Sbjct: 394 HIIRAAIDSLHTEGSTNAQAGLEMGYSLAASHLVEGGINR--------VILCSDGVANTG 445 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLE 378 N++ + G+ + +V L+ + + G + V+ E Sbjct: 446 LTDANSIWERIRARAAK--GITLSTVGFGMGNYNDVLMERLSQVGEGNYAYVDRIEEAHR 503 Query: 379 SF 380 F Sbjct: 504 IF 505 >gi|301758388|ref|XP_002915048.1| PREDICTED: matrilin-3-like [Ailuropoda melanoleuca] Length = 466 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 57/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + ++ +K + ++ P Sbjct: 82 VKTFVSQIIDTL-----DIGAADTRVAVVNYASTVKTEFHLQTYSDKQSLKQAVARITPL 136 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + + K I +TDG ++ Sbjct: 137 STGTMSGLAIQTAMDEAFTVEAGARGPTSNI--PKVAIIVTDGRPQD--------QVNEV 186 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G+++Y+V V + L+ F V +L F + Sbjct: 187 AARARASGIELYAVGV--DRADMESLKMIASEPLDEHVFYVETYGVIEKLSSRFQE 240 >gi|281344227|gb|EFB19811.1| hypothetical protein PANDA_002988 [Ailuropoda melanoleuca] Length = 395 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 57/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + ++ +K + ++ P Sbjct: 29 VKTFVSQIIDTL-----DIGAADTRVAVVNYASTVKTEFHLQTYSDKQSLKQAVARITPL 83 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + + K I +TDG ++ Sbjct: 84 STGTMSGLAIQTAMDEAFTVEAGARGPTSNI--PKVAIIVTDGRPQD--------QVNEV 133 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G+++Y+V V + L+ F V +L F + Sbjct: 134 AARARASGIELYAVGV--DRADMESLKMIASEPLDEHVFYVETYGVIEKLSSRFQE 187 >gi|218506166|ref|ZP_03504044.1| hypothetical protein RetlB5_00485 [Rhizobium etli Brasil 5] Length = 205 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 27/205 (13%), Positives = 59/205 (28%), Gaps = 28/205 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + + + + D +R +MQS LDAA+++ I + K + Sbjct: 24 IVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAVKQINNTGDTD---ALKLKV 80 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + F Q++ + + + + + A +PT + Sbjct: 81 TDWFHAQVENSY-------------------TLGEIDIDTTNHNITATASGTVPTTFM-- 119 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 A + S + +++ +V+D S SM Sbjct: 120 ----KIANIDTVPVSVASAVKGPATSYLNVYIVIDTSPSMLLAATTAGQATMYSGIGCQF 175 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAP 205 T + Y + A Sbjct: 176 ACHTGDSHKVGNKTYANNYEYSTAK 200 >gi|119775307|ref|YP_928047.1| von Willebrand factor type A domain-containing protein [Shewanella amazonensis SB2B] gi|119767807|gb|ABM00378.1| von Willebrand factor type A domain protein [Shewanella amazonensis SB2B] Length = 327 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 25/195 (12%), Positives = 57/195 (29%), Gaps = 36/195 (18%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN--KL 269 VL + +Q + + V G +P++ + V L ++ Sbjct: 101 VLDNKTVDRFTLVQHVVSDFIERRVGDRIGLILFGDHAYLQSPMTQDRRSVAQYLREAQI 160 Query: 270 NPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T ++ A + + ++ +TDG N+ S + Sbjct: 161 GLVGKQTAIGESIALAVKRF----------ENLEESNRVLVLLTDGTNNAGSISPDK--- 207 Query: 329 LQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCT-DSSGQFFAV 370 + IY++ V A ++ L++ + G++F Sbjct: 208 --AAAIAAERKVTIYTIGVGAEMMERRSFFGRDRVNPSMDLDEEQLQRIANATQGKYFRA 265 Query: 371 NDSRELLESFDKITD 385 S +L + +I Sbjct: 266 RSSEDLAAIYQEIDK 280 >gi|153001301|ref|YP_001366982.1| von Willebrand factor type A [Shewanella baltica OS185] gi|151365919|gb|ABS08919.1| von Willebrand factor type A [Shewanella baltica OS185] Length = 340 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 64/204 (31%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + ++ + + + RIG I + PL+ + V Sbjct: 104 NGKTVDRFTLIQHVVSDFIERRK---------GDRIGLILFADHAYLQA--PLTQDRRSV 152 Query: 263 KSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 L ++ T A+ A + + +I +TDG N+ Sbjct: 153 AQFLKEAQIGLVGKQTAIGEAIGLAVKRFDKM----------DESNRVLILLTDGSNNAG 202 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD 362 + + Q + N + IY+V V A ++ L+ + Sbjct: 203 N-----IEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAE 257 Query: 363 -SSGQFFAVNDSRELLESFDKITD 385 + G++F +S+EL + + +I Sbjct: 258 VTHGRYFRARNSQELDQIYKEIDK 281 >gi|220908581|ref|YP_002483892.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219865192|gb|ACL45531.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 421 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 49/133 (36%), Gaps = 13/133 (9%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + +++K+R++ L T + EL K + + +TDGEN Sbjct: 98 TDRDKIKTRISHLAAMGGTAIDEGLQLGLTELIAAKAGAI---------SQIFLLTDGEN 148 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + LQ+ E + + ++ L + + G + +++ Sbjct: 149 EHGNNSRC----LQLAEEAAKENITLNTLGFGYHWNQDVLEQIADAAGGSLMFIEYPQDV 204 Query: 377 LESFDKITDKIQE 389 L F+++ ++I Sbjct: 205 LIGFERLFNQIIS 217 >gi|149773093|emb|CAO01896.1| collagen type VI alpha 6 [Mus musculus] Length = 1120 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 58/164 (35%), Gaps = 15/164 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYR 286 + VRIG ++ E+ +++ + T+ A+ R Sbjct: 40 DFDVSLNRVRIGVAQFSDSYRSEFLLGTFTGEREISTQIEGIQQIFGYTHIGDAL----R 95 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 ++ + + + + ++ +TDG + Q E +R+ G+ IYSV Sbjct: 96 KVKYYFQPDMGSRINAGTPQVLLVLTDGRSQD--------EVAQAAEELRHKGVDIYSVG 147 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L + T ++ + V++ EL + +I I Sbjct: 148 IG--DVDDQELVQITGTAEKKLTVHNFDELKKVKKRIVRNICTS 189 >gi|293361347|ref|XP_576462.3| PREDICTED: collagen type VI alpha 5 [Rattus norvegicus] Length = 1730 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 56/152 (36%), Gaps = 13/152 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYREL 288 + V+ G + Y+ ++ + + L + + +T T A+ HA Sbjct: 871 DVGRDRVQFGALMYSNNPEILFYLNTYSSRSAITEHLKRPRDTRGDTYTAKALQHANILF 930 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 H + +++ +I ITDGE+ +T +R+ G+ I++V V Sbjct: 931 ----MEEHGSRLKQNVRQLMIVITDGESHDRDKLNDT------ARELRDKGITIFAVGVG 980 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 QD L V++ +L + + Sbjct: 981 R--ANQDELETMAGKKENTIHVDNFDKLRDIY 1010 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 35/331 (10%), Positives = 91/331 (27%), Gaps = 31/331 (9%) Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 I+ +G + QI + + E+ + +F + Sbjct: 340 VGAAIEHMRTEGFSESSGSRKAQGVPQIAVLVTHRASDDAVREAALDLRLQGVTMFAMSV 399 Query: 124 IPSALTNLS--LRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM--TSNKY 179 + T L + S+ + + + + +K Sbjct: 400 QGANNTQLEDIVSYPSRQSISTHSSYKHLESYSGNFLKKIHNEIWTQVSTHAEQMELDKT 459 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 + + S +I + + S ++ N VR+G Sbjct: 460 GCVDTKEADIYFLIDGSS---SIRRKEFEQIQIFMSSVVDMF------PIGPNN--VRVG 508 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y+ E++ + + T T A+ + K + Sbjct: 509 VVQYSHRNEVEFPVSQYTKGIELRKAVWNIKQLKGGTFTGKALDFILPIIKKGKSERIHE 568 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 + ++I +TDG+++ + L+ +R + I++V + + L+ Sbjct: 569 -----VPCYLIVLTDGKSNDS--------VLEPANRLRAEHITIHAVGIG--EANKTQLQ 613 Query: 359 KCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + + + L ++I +I Sbjct: 614 QIAGKDERVSFGQNFDSLKYIKNEIVHRICS 644 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 51/158 (32%), Gaps = 17/158 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 ++G + ++ + EV + ++ ++P NT T A+ K Sbjct: 689 DRSQVGVVQFSDYNKEEFQLNKYSTREEVYAAIDGMSPINRNTLTGSALTFVNEYFDISK 748 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 KF+I +TDGE +R+ + I+SV V Sbjct: 749 GGRPQVR------KFLILLTDGEAQDEVGGP--------AMALRSKSVTIFSVGV--YGA 792 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L + + F V + L K+ ++ Sbjct: 793 NRTQLEEISGEGSLVFHVENFDHLKTIESKLIFRVCAL 830 >gi|293349450|ref|XP_001073278.2| PREDICTED: collagen type VI alpha 5-like [Rattus norvegicus] Length = 2640 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 27/152 (17%), Positives = 56/152 (36%), Gaps = 13/152 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYREL 288 + V+ G + Y+ ++ + + L + + +T T A+ HA Sbjct: 871 DVGRDRVQFGALMYSNNPEILFYLNTYSSRSAITEHLKRPRDTRGDTYTAKALQHANILF 930 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 H + +++ +I ITDGE+ +T +R+ G+ I++V V Sbjct: 931 ----MEEHGSRLKQNVRQLMIVITDGESHDRDKLNDT------ARELRDKGITIFAVGVG 980 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 QD L V++ +L + + Sbjct: 981 R--ANQDELETMAGKKENTIHVDNFDKLRDIY 1010 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 35/331 (10%), Positives = 91/331 (27%), Gaps = 31/331 (9%) Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 I+ +G + QI + + E+ + +F + Sbjct: 340 VGAAIEHMRTEGFSESSGSRKAQGVPQIAVLVTHRASDDAVREAALDLRLQGVTMFAMSV 399 Query: 124 IPSALTNLS--LRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM--TSNKY 179 + T L + S+ + + + + +K Sbjct: 400 QGANNTQLEDIVSYPSRQSISTHSSYKHLESYSGNFLKKIHNEIWTQVSTHAEQMELDKT 459 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 + + S +I + + S ++ N VR+G Sbjct: 460 GCVDTKEADIYFLIDGSS---SIRRKEFEQIQIFMSSVVDMF------PIGPNN--VRVG 508 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y+ E++ + + T T A+ + K + Sbjct: 509 VVQYSHRNEVEFPVSQYTKGIELRKAVWNIKQLKGGTFTGKALDFILPIIKKGKSERIHE 568 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 + ++I +TDG+++ + L+ +R + I++V + + L+ Sbjct: 569 -----VPCYLIVLTDGKSNDS--------VLEPANRLRAEHITIHAVGIG--EANKTQLQ 613 Query: 359 KCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + + + L ++I +I Sbjct: 614 QIAGKDERVSFGQNFDSLKYIKNEIVHRICS 644 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 51/158 (32%), Gaps = 17/158 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 ++G + ++ + EV + ++ ++P NT T A+ K Sbjct: 689 DRSQVGVVQFSDYNKEEFQLNKYSTREEVYAAIDGMSPINRNTLTGSALTFVNEYFDISK 748 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 KF+I +TDGE +R+ + I+SV V Sbjct: 749 GGRPQVR------KFLILLTDGEAQDEVGGP--------AMALRSKSVTIFSVGV--YGA 792 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L + + F V + L K+ ++ Sbjct: 793 NRTQLEEISGEGSLVFHVENFDHLKTIESKLIFRVCAL 830 >gi|296220039|ref|XP_002756140.1| PREDICTED: integrin alpha-X [Callithrix jacchus] Length = 1078 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 63/191 (32%), Gaps = 21/191 (10%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + ++ +++ Q+ + + + S+N Sbjct: 159 SGSILYNNFAMMKSFVRAVMSHFQR----PSTQFSLMQFSSKFKTHFTFEEFRESSNPLS 214 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + +++L T+T A+ +EL+ + K +I ITDG+ G Sbjct: 215 LLASVDQL--GGYTHTATAIQKVVKELFLASNGARREAT-----KILIVITDGKKEGDWL 267 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSSG--QFFAVNDSREL 376 + + + AG+ Y++ V + L F V D L Sbjct: 268 DYDDVIPM-----ADKAGIIRYAIGVGSAFQNRNSWKELNDIASKPSQEHIFKVEDFDAL 322 Query: 377 LESFDKITDKI 387 + +++ +KI Sbjct: 323 KDIQNQLKEKI 333 >gi|148749338|gb|ABR09545.1| CD11c protein [Callithrix jacchus] Length = 1161 Score = 73.8 bits (179), Expect = 4e-11, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 63/191 (32%), Gaps = 21/191 (10%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + ++ +++ Q+ + + + S+N Sbjct: 159 SGSILYNNFAMMKSFVRAVMSHFQR----PSTQFSLMQFSSKFKTHFTFEEFRESSNPLS 214 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + +++L T+T A+ +EL+ + K +I ITDG+ G Sbjct: 215 LLASVDQL--GGYTHTATAIQKVVKELFLASNGARREAT-----KILIVITDGKKEGDWL 267 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSSG--QFFAVNDSREL 376 + + + AG+ Y++ V + L F V D L Sbjct: 268 DYDDVIPM-----ADKAGIIRYAIGVGSAFQNRNSWKELNDIASKPSQEHIFKVEDFDAL 322 Query: 377 LESFDKITDKI 387 + +++ +KI Sbjct: 323 KDIQNQLKEKI 333 >gi|322689979|ref|YP_004209713.1| cell surface protein [Bifidobacterium longum subsp. infantis 157F] gi|320461315|dbj|BAJ71935.1| putative cell surface protein [Bifidobacterium longum subsp. infantis 157F] Length = 794 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 58/391 (14%), Positives = 115/391 (29%), Gaps = 74/391 (18%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A+++ A + + D ++ + Q S + + + T Sbjct: 2 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPDDPTL 61 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + + N++ + N + I +VLD Sbjct: 62 SA-PAREKTVTANEDGTYTV------------ALNVTGAKSAGTGEIVTNQPLDIVLVLD 108 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 VS SM + N + S K + + T +I ++ Sbjct: 109 VSGSMAEKIASGWNQPTKIDSLKTAVN------KFINATAAENAKITDQSQRNRIALVKF 162 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + N R G +YN + L+ +++ + S +N L+ T Sbjct: 163 A--------GTEKTSVGNDFYREGWSSYN---YTQIVSNLTYDVSGLTSTVNGLSASGAT 211 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI--CE 333 + A + A L + + KK VIF TDGE + S + T+ + + Sbjct: 212 SADYAFNRAQAALTYQPRA--------NAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAK 263 Query: 334 YMRNAGMKIYSVAV-------SAPPEGQDLLRKC-------------------------- 360 +++AG IYS+ V + Sbjct: 264 SLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGD 323 Query: 361 -TDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++S + A D+ +L F+ I +I + Sbjct: 324 RAETSSYYKAATDAGQLNNIFESIYQEITKT 354 >gi|254820232|ref|ZP_05225233.1| hypothetical protein MintA_09906 [Mycobacterium intracellulare ATCC 13950] Length = 327 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 55/160 (34%), Gaps = 19/160 (11%) Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P + L KL+ ++T T A+ A + T G Sbjct: 140 TPYLLVPPTPQHQATIDALKKLDFADSTATGQAIFTALHAIGATA----VTGGDNPPPAR 195 Query: 308 VIFITDGENSGASAYQNTLN-TLQICEYMRNAGMKIYSVAVSAPPE-------------G 353 ++ ++DG + S + + ++ G+ I +++ Sbjct: 196 IVLLSDGRENKPSNPSDPHDGVYTAARLAKDEGVPISTISFGTKGGEIEMDGQRVAVPVS 255 Query: 354 QDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D ++ S GQ + + EL +S++ I ++I ++V Sbjct: 256 TDQMKTIARLSGGQPYTATNIGELNKSYNAIENEIGYRTV 295 >gi|225028486|ref|ZP_03717678.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] gi|224954191|gb|EEG35400.1| hypothetical protein EUBHAL_02763 [Eubacterium hallii DSM 3353] Length = 538 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 42/274 (15%), Positives = 88/274 (32%), Gaps = 25/274 (9%) Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP 185 N L ++ + S K + + +N + Sbjct: 147 YQWNNGELLIYTKWAKNEPDNTFSDYGYYVRFNENAKDGTWKVDTFSGGETNFNNVFLCE 206 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIE----SAGNLVNSIQKAIQEKKNLSVRIGTI 241 + K +D+ + K + N + IG + Sbjct: 207 WGDYSVTGNDGLKVTSKKRDIVLTLDISASMDGIPLDETKKAAAKFVDSILNKNSNIGLV 266 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 +Y+ + + +N +K+ + L+ ENTN + AY L Sbjct: 267 SYSDEATS--LSGICSNDVFLKNTITSLSSAENTNIEDGLSRAYSMLQLG---------- 314 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV-----SAPPEGQDL 356 KK ++ ++DG + + ++ E +++ G+ IY++ EGQ L Sbjct: 315 QSKKKLIVLMSDGLPTLGK---DGEELIKYAEKIKDQGVLIYTLGFFQNTEEYKAEGQYL 371 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + K G + V+ S +L+ F+ + +I Q Sbjct: 372 MEKIAS-EGCHYEVSSSEDLVFFFEDVAGQIGGQ 404 >gi|330504126|ref|YP_004380995.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] gi|328918412|gb|AEB59243.1| von Willebrand factor, type A [Pseudomonas mendocina NK-01] Length = 334 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 26/145 (17%), Positives = 58/145 (40%), Gaps = 34/145 (23%) Query: 254 PLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + V++ L++ +NT A+ A + L + ++ Sbjct: 150 PLTFDRHTVRTWLDEALIGIAGKNTAIGDAIGLAVKRL----------RQRPAQSRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------GQD 355 ITDG N+G ++ + + + G++IY++ + A P+ + Sbjct: 200 ITDGANNGG-----EIDPMVAAQLAADEGVRIYTIGIGADPQQSGAFGSFGFSALDLDET 254 Query: 356 LLRKCT-DSSGQFFAVNDSRELLES 379 LR + + G++F + EL + Sbjct: 255 SLRAISDTTGGEYFRARNQAELEQI 279 >gi|297683362|ref|XP_002819353.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-2-like [Pongo abelii] Length = 935 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 67/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +V+ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 665 GEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVRTEFTLRNFNSAKDMKKA 719 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 720 VAHMKYMGKGSMTGLALKHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 776 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ F D + E +K Sbjct: 777 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTTKHLFYAEDFSTMDEISEK 827 Query: 383 ITDKIQE 389 + I E Sbjct: 828 LKKGICE 834 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKAIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|260777338|ref|ZP_05886232.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] gi|260607004|gb|EEX33278.1| protein BatA [Vibrio coralliilyticus ATCC BAA-450] Length = 271 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 71/204 (34%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 ++ + + + ++ K R+G + + TPL+ + + V Sbjct: 54 NGDYIDRLSAVKQVLSDFIS---------KRQGDRLGLVLFADHAYLQ--TPLTLDRHTV 102 Query: 263 KSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +LN+ L T + A + + S ++ +I ++DG N+ Sbjct: 103 AEQLNQTVLRLIGTKTAIGEGIGLATKTFVD----------SDAPQRVMILLSDGSNTAG 152 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------EGQDLLRKCT 361 L+ ++ + + IY+V V A + + L+ Sbjct: 153 V-----LDPIEAAKIAKKYNATIYTVGVGAGEMMVKEFFMTRKVNTAQDLDEKSLMEIAK 207 Query: 362 DSSGQFFAVNDSRELLESFDKITD 385 + GQ+F DS+EL +D I + Sbjct: 208 LTGGQYFRARDSKELATIYDTINN 231 >gi|159898662|ref|YP_001544909.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159891701|gb|ABX04781.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 610 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 34/366 (9%), Positives = 98/366 (26%), Gaps = 40/366 (10%) Query: 32 ALDAAVLSGCASIVSDRTIKDPTTKKDQTST--------------IFKKQIKKHLKQGSY 77 +A ++ + + + + Q + Sbjct: 99 TAEATSVAPAPQPMPTQAADAGQPVPNPAAGKPLVDTWELPTQPIDPNPNYAYEQDQEIF 158 Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 + + D + + A Y + ++ L P+ + Sbjct: 159 DSMYFKNYGTNPFVRTETDPLSTFAMDID-SASYSLMRSSINQGLLPPADSVRVEEYLNA 217 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 + + +V+ S + + + P + +T+ S Sbjct: 218 FDYEYPQPEDGDFAIYSEVAPSPFGGPNYELVQIGIQARSIEVADRKPAALTFVIDTSGS 277 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + +++++ + L + + +A+N G+ Sbjct: 278 MAQ------DNRLEMVKNALIYLAGQL--------EPDDSLAIVAFNDGMRVVLNPTSGE 323 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N ++ + +N L P +TN ++ + + + ++ +DG + Sbjct: 324 NQMDIITAINSLEPAGSTNAEAGLYKGFELAWQAFKPEGINR--------ILLCSDGVAN 375 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 L + +AG+++ + V LL + D G + + + E Sbjct: 376 SG--MTEPSQLLATFQQYLDAGVQLSTYGVGMGNYNDILLEQLADKGDGNYAYFDSADEA 433 Query: 377 LESFDK 382 F + Sbjct: 434 QRLFGE 439 >gi|24431113|gb|AAN61407.1|AF486289_1 matrilin [Biomphalaria glabrata] Length = 394 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 61/189 (32%), Gaps = 19/189 (10%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS--NNLNEVKSRLNKLNP 271 + L + + VR+ I+Y GI L+ + +EV + ++ Sbjct: 71 KTAIKFLQEFLSQYEISSDPNGVRVSIISYGKGIYPEIGFNLTTYDTKDEVIEAIGRIPH 130 Query: 272 YEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+T A+ + + + K I ITDG + T Sbjct: 131 KAGLRTDTGRAIQYMHEAQLANGVVRPG------VTKVSIVITDGNSQEW------KLTK 178 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ- 388 + E R + ++++ V +LL + V++ +L + + + Sbjct: 179 EAAEEARKDNIVMFAIGVGTDIRNSELL-NIAGDQSRVTKVDNYNQLSSIKESLAHQTCF 237 Query: 389 -EQSVRIAP 396 ++ P Sbjct: 238 VQEKTTTTP 246 >gi|229590954|ref|YP_002873073.1| hypothetical protein PFLU3509 [Pseudomonas fluorescens SBW25] gi|229362820|emb|CAY49730.1| putative exported protein [Pseudomonas fluorescens SBW25] Length = 362 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 29/146 (19%), Positives = 52/146 (35%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V++ L++ +NT A+ A + L + +I Sbjct: 150 PLTFDRRTVRTWLDEARIGIAGKNTAIGDAIGLALKRL----------RQRPAQSRVLIL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ L G+KIY + + A PE + Sbjct: 200 VTDGANNAGQ-----IDPLTAARLAAEEGVKIYPIGIGADPEQTGSLGILGVNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L+ + + GQ+F D EL Sbjct: 255 PALKAIAEATGGQYFRARDGEELQAI 280 >gi|229596191|ref|XP_001012539.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|225565545|gb|EAR92294.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 703 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 69/206 (33%), Gaps = 24/206 (11%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL--SN 257 KI+ + + L+ + N + R+ I +N + Sbjct: 219 NSGSMNDFSKIENVKNTILQLLEML--------NENDRLSLITFNTKAKQLCGLKNVNNQ 270 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N +++ + T+ + A++ L + + ++DG+++ Sbjct: 271 NKKSLQTITKSIKADGGTDIIRGIEIAFQILQS--------RKQKNSVSSIFLLSDGQDN 322 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 A A L + ++ I+S +G + + G F+ V + ++ Sbjct: 323 LADAGIKNLLKTTY-KQLQEESFTIHSFGFGNDHDGPLMQKIAQIKDGSFYFVEKNDQVD 381 Query: 378 ESF-DKITDK----IQEQSVRIAPNR 398 E F D + Q+ +++I NR Sbjct: 382 EFFIDALGGLFSVVAQDLTIKIEINR 407 >gi|23466092|ref|NP_696695.1| hypothetical protein BL1539 [Bifidobacterium longum NCC2705] gi|322691915|ref|YP_004221485.1| cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] gi|23326823|gb|AAN25331.1| hypothetical protein with gram positive cell wall anchoring domain [Bifidobacterium longum NCC2705] gi|320456771|dbj|BAJ67393.1| putative cell surface protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 794 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 58/391 (14%), Positives = 115/391 (29%), Gaps = 74/391 (18%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A+++ A + + D ++ + Q S + + + T Sbjct: 2 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPDDPTL 61 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + + N++ + N + I +VLD Sbjct: 62 SA-PAREKTVTANEDGTYTV------------ALNVTGAKSAGTGEIVTNQPLDIVLVLD 108 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 VS SM + N + S K + + T +I ++ Sbjct: 109 VSGSMAEKIASGWNQPTKIDSLKTAVN------KFINATAAENAKITDQSQRNRIALVKF 162 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + N R G +YN + L+ +++ + S +N L+ T Sbjct: 163 A--------GTEKTSVGNDFYREGWSSYN---YTQIVSNLTYDVSGLTSTVNGLSASGAT 211 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI--CE 333 + A + A L + + KK VIF TDGE + S + T+ + + Sbjct: 212 SADYAFNRAQAALTYQPRA--------NAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAK 263 Query: 334 YMRNAGMKIYSVAV-------SAPPEGQDLLRKC-------------------------- 360 +++AG IYS+ V + Sbjct: 264 SLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGD 323 Query: 361 -TDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++S + A D+ +L F+ I +I + Sbjct: 324 RAETSSYYKAATDAGQLNNIFESIYQEITKT 354 >gi|55729600|emb|CAH91529.1| hypothetical protein [Pongo abelii] Length = 955 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 68/191 (35%), Gaps = 20/191 (10%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + +V+ + +++S+ + + R+G + Y+ + N+ + Sbjct: 663 SKGLGEENFEVVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVRTEFTLRNFNSAKD 717 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +K + + + T A+ H + + + E + STR+ + I TDG Sbjct: 718 MKKAVAHMKYMGKGSMTGLALRHMFERSFTQGEGARPL--STRVPRAAIVFTDGRAQDDV 775 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + + G+ +Y+V V ++ L++ F D + E Sbjct: 776 SEW--------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTTKHLFYAEDFSTMDE 825 Query: 379 SFDKITDKIQE 389 +K+ I E Sbjct: 826 ISEKLKKGICE 836 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAFGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKAIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|160875970|ref|YP_001555286.1| von Willebrand factor type A [Shewanella baltica OS195] gi|160861492|gb|ABX50026.1| von Willebrand factor type A [Shewanella baltica OS195] gi|315268165|gb|ADT95018.1| von Willebrand factor type A [Shewanella baltica OS678] Length = 339 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 64/204 (31%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + ++ + + + RIG I + PL+ + V Sbjct: 104 NGKTVDRFTLIQHVVSDFIERRK---------GDRIGLILFADHAYLQA--PLTQDRRSV 152 Query: 263 KSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 L ++ T A+ A + + +I +TDG N+ Sbjct: 153 AQFLKEAQIGLVGKQTAIGEAIGLAVKRFDKM----------DESNRVLILLTDGSNNAG 202 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD 362 + + Q + N + IY+V V A ++ L+ + Sbjct: 203 N-----IEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAE 257 Query: 363 -SSGQFFAVNDSRELLESFDKITD 385 + G++F +S+EL + + +I Sbjct: 258 VTHGRYFRARNSQELDQIYQEIDK 281 >gi|46190503|ref|ZP_00121395.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium longum DJO10A] gi|189440499|ref|YP_001955580.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] gi|189428934|gb|ACD99082.1| von Willebrand factor (vWF) domain containing protein [Bifidobacterium longum DJO10A] Length = 794 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 58/391 (14%), Positives = 115/391 (29%), Gaps = 74/391 (18%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A+++ A + + D ++ + Q S + + + T Sbjct: 2 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPDDPTL 61 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + + N++ + N + I +VLD Sbjct: 62 SA-PAREKTVTANEDGTYTV------------ALNVTGAKSAGTGEIVTNQPLDIVLVLD 108 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 VS SM + N + S K + + T +I ++ Sbjct: 109 VSGSMAEKIASGWNQPTKIDSLKTAVN------KFINATAAENAKITDQSQRNRIALVKF 162 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + N R G +YN + L+ +++ + S +N L+ T Sbjct: 163 A--------GTEKTSVGNDFYREGWSSYN---YTQIVSNLTYDVSGLTSTVNGLSASGAT 211 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI--CE 333 + A + A L + + KK VIF TDGE + S + T+ + + Sbjct: 212 SADYAFNRAQAALTYQPRA--------NAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAK 263 Query: 334 YMRNAGMKIYSVAV-------SAPPEGQDLLRKC-------------------------- 360 +++AG IYS+ V + Sbjct: 264 SLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGD 323 Query: 361 -TDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++S + A D+ +L F+ I +I + Sbjct: 324 RAETSSYYKAATDAGQLNNIFESIYQEITKT 354 >gi|312133821|ref|YP_004001160.1| von willebrand factor (vwf) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] gi|311773110|gb|ADQ02598.1| Von Willebrand factor (VWF) domain containing protein [Bifidobacterium longum subsp. longum BBMN68] Length = 794 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 58/391 (14%), Positives = 115/391 (29%), Gaps = 74/391 (18%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A+++ A + + D ++ + Q S + + + T Sbjct: 2 AIVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAPLSTEGTNGVPDDPTL 61 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + + N++ + N + I +VLD Sbjct: 62 SA-PAREKTVTANEDGTYTV------------ALNVTGAKSAGTGEIVTNQPLDIVLVLD 108 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 VS SM + N + S K + + T +I ++ Sbjct: 109 VSGSMAEKIASGWNQPTKIDSLKTAVN------KFINATAAENAKITDQSQRNRIALVKF 162 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + N R G +YN + L+ +++ + S +N L+ T Sbjct: 163 A--------GTEKTSVGNDFYREGWSSYN---YTQIVSNLTYDVSGLTSTVNGLSASGAT 211 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI--CE 333 + A + A L + + KK VIF TDGE + S + T+ + + Sbjct: 212 SADYAFNRAQAALTYQPRA--------NAKKVVIFFTDGEPNHGSGFDPTVAATAVNKAK 263 Query: 334 YMRNAGMKIYSVAV-------SAPPEGQDLLRKC-------------------------- 360 +++AG IYS+ V + Sbjct: 264 SLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWNANLGD 323 Query: 361 -TDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++S + A D+ +L F+ I +I + Sbjct: 324 RAETSSYYKAATDAGQLNNIFESIYQEITKT 354 >gi|329928982|ref|ZP_08282792.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328937234|gb|EGG33661.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 899 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 54/331 (16%), Positives = 106/331 (32%), Gaps = 39/331 (11%) Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + A A + T+ + P I E G+I + + L S Sbjct: 273 DGDEASANNAAFDFTRVEGPPNVLIVEGTPGTSGNITAALESGMIGTEVIPPELLSLEAA 332 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN---TTK 196 + + + I +VS S + + + + + SF T Sbjct: 333 KYAVYDSIIFN----NVSGSDVGGKQMELIEQAVRSFGIGFMMAGGEDSFGMGGYFKTPI 388 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK--------------KNLSVRIGTIA 242 K P K ++ +++ K +G +A Sbjct: 389 EKALPVSMELEGKREIPSLGLILVIDRSGSMDGNKIELAKESAMRTVELMRAKDTVGVVA 448 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 ++ + EV S + + TN YPA+ A E+ Sbjct: 449 FDDQPWWVVPPQKLGDKEEVLSSIQSIPSAGGTNIYPAVSSALEEMLK----------ID 498 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 ++ +I +TDG+++ S YQ+ +T M + + SVAV +LL+ D Sbjct: 499 AQRRHIILMTDGQSAMNSGYQDLTDT------MVENKITMSSVAVGMDA-DTNLLQSLAD 551 Query: 363 SS-GQFFAVNDSRELLESFDKITDKIQEQSV 392 ++ G+++ V D L F + + + + Sbjct: 552 AAKGRYYFVEDETTLPAVFSREAVMLAKSYI 582 >gi|256420242|ref|YP_003120895.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256035150|gb|ACU58694.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 639 Score = 73.8 bits (179), Expect = 5e-11, Method: Composition-based stats. Identities = 40/364 (10%), Positives = 88/364 (24%), Gaps = 37/364 (10%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 + + + K + T + Sbjct: 126 VVVNAMIVPEAPKTVAGSPVVNAYMKSASPAFYGSRAPQFNTEDYSPVNENRFHTVASDP 185 Query: 100 PLQYIAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + + +A Y L + P + + S + V Sbjct: 186 LSTFSIDVDRASYSNVRRFLNEGNMPPVDAVRVEEMINYFDYKYSNPTGNTPVAV----- 240 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY----APAPAPANRKIDVLI 214 + + L K + +K+ ++ Sbjct: 241 --RTDMAICPWNTAHQLVRIAL---KGKDVAKDNLPPSNLVFLIDVSGSMSDAKKLPLVK 295 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 ++ LVN ++ R+ + Y + ++ + L+KL + Sbjct: 296 QAFKLLVNQLRPVD--------RVAIVVYAGAAGLVLPSTSGDHKTAILDALDKLEAGGS 347 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T + AY+ S N VI TDG+ + + +I E Sbjct: 348 TAGGEGVQLAYKTATEYLLKSGNNR--------VIIATDGDFNVG--PSSDGELQRIIEK 397 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVR 393 R G+ + + + L D G + +++ E +F + Sbjct: 398 KREKGIFLSVLGFGMGNYKDNKLELLADKGNGNYAYIDNFEEARRTF---ATEFGGTLFT 454 Query: 394 IAPN 397 IA + Sbjct: 455 IAKD 458 >gi|310823567|ref|YP_003955925.1| Bata protein [Stigmatella aurantiaca DW4/3-1] gi|309396639|gb|ADO74098.1| BatA protein [Stigmatella aurantiaca DW4/3-1] Length = 302 Score = 73.4 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 30/209 (14%), Positives = 68/209 (32%), Gaps = 48/209 (22%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL-NEVKSRL 266 ++ V E ++ ++ RIG + + L + EV ++ Sbjct: 79 NRLHVAKEVLAEFIS---------NRVNDRIGLVVFAGAAYTQAPLTLDYGVVREVLKQI 129 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + T A+ + L S + V+ ITDG+N+ + L Sbjct: 130 RTRVLEDGTAIGDALATSLNRL----------RDSEAKSRVVVLITDGDNNAGKI--SPL 177 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPE----------------------GQDLLRKCTD-S 363 + + E ++ + IY++ V + +LL+ + Sbjct: 178 DAASMAESLK---IPIYTILVGKGGKVPFPQGQDLFGNTVWRDTEIPINPELLQDIASRT 234 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQSV 392 G+++ D L + K+ D ++ + Sbjct: 235 GGEYYRATDPEGLKQGLQKVLDSLERSKL 263 >gi|296125842|ref|YP_003633094.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296017658|gb|ADG70895.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 328 Score = 73.4 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 25/192 (13%), Positives = 66/192 (34%), Gaps = 41/192 (21%) Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMH 282 +K + + + + +P + + ++ + + +T+ + Sbjct: 112 KKTMIDFIKKRNFDKISLVAFALRASVLSPSTFDYTLLEEEIKNIKIDEEGSTSIGLGIA 171 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 A L + K + +K +I +TDGEN+ ++ E N +KI Sbjct: 172 TAVDMLRSVKGDN---------EKIIILLTDGENNSG-----EIDPKLASEIASNFNIKI 217 Query: 343 YSVAVSAPP------------------------EGQDLLRKCTDSSGQFFAVNDSRELLE 378 Y++ + + L+ + + G++F ++ L Sbjct: 218 YTIGIGDANGSHAWVTYDDPNYGKRRIRADFSLNEEALIDIASTTGGKYFNAQNASALDN 277 Query: 379 SFDKITDKIQEQ 390 ++ I D+I+++ Sbjct: 278 VYNTI-DRIEKK 288 >gi|149180101|ref|ZP_01858606.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] gi|148852293|gb|EDL66438.1| hypothetical protein BSG1_03760 [Bacillus sp. SG-1] Length = 931 Score = 73.4 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 25/140 (17%), Positives = 53/140 (37%), Gaps = 16/140 (11%) Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 ++ T + +V ++N L TN +P++ AY +L Sbjct: 450 FDDRPWQIIDTEPIKDKEKVIEKINGLTSGGGTNIFPSLELAYEQLTP----------LE 499 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +K +I +TDG+ + + L + + + + +VA+ + L + Sbjct: 500 LQRKHIILLTDGQ------SATSPDYLTTIQEGKENNITLSTVAIGEGSDSVLLEELSDE 553 Query: 363 SSGQFFAVNDSRELLESFDK 382 G+F+ VNDS + + Sbjct: 554 GGGRFYDVNDSSTIPSILSR 573 >gi|219850571|ref|YP_002465004.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219544830|gb|ACL26568.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 418 Score = 73.4 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 18/184 (9%), Positives = 55/184 (29%), Gaps = 13/184 (7%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + V I + + + +N + L+ E+K + Sbjct: 49 VIDRSSSMRGERLQQVKQAAMQILDLLGDNESFALVTFNDRAEVVVSSQLARARAEIKRQ 108 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++ + T + +EL ++ +TDG Y + Sbjct: 109 ISAIEAAGGTEMATGLALGVQELQRAMMPRAIHR--------LLLLTDG-----RTYGDE 155 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 ++I + G+ I ++ + + L + + + + ++ + F + Sbjct: 156 SRCVEIARRAQARGIGITALGIGSEWNEDLLETIAARENSRTHYITSAADITKIFTAEVE 215 Query: 386 KIQE 389 ++ Sbjct: 216 RMHS 219 >gi|297665730|ref|XP_002811194.1| PREDICTED: cartilage matrix protein-like [Pongo abelii] Length = 495 Score = 73.4 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 40/382 (10%), Positives = 102/382 (26%), Gaps = 61/382 (15%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKN 98 + +S + + ST + + G ++ I+ I Sbjct: 96 RAHISKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFSDAEGGRSRSPDISKVVIVVTDG 155 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 P + + A+ LF G+ L ++ + + S ++ +SR Sbjct: 156 RPQDNVQDVSARARASGVELFAIGVGRVDKATLRQIASEPQD-EHVDYVESYSVIEKLSR 214 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES-- 216 ++ + + ++ S D Sbjct: 215 KFQEAF---------CVVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDG-KTCNG 264 Query: 217 -----------AGNLVNSIQKAIQ----------------EKKNLSVRIGTIAYNIGIVG 249 + + + + ++G + Y+ + Sbjct: 265 QWGGHRLALERVWRGGDGASATPRNHSXXXXXXNQIVDTLDVSDKLAQVGLVQYSSSVRQ 324 Query: 250 NQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 + ++K+ + ++ + T T A+ + + + +K Sbjct: 325 EFPLGRFHTKKDIKAAVRNMSYMEKGTMTGAALKYLIDNSFTVSSGARPRA-----QKVG 379 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQ 366 I TDG + + ++ G K+++V V +D LR+ Sbjct: 380 IVFTDGRSQDYIND--------AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEH 429 Query: 367 FFAVNDSRELLESFDKITDKIQ 388 +F D + + + K+ KI Sbjct: 430 YFYTADFKTINQIGKKLQKKIC 451 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 72 DVGPNATRVGMVNYASTVKQEFSLRAHISKAALLQAVRRIQPLSTGTMTGLAIQFAITKA 131 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 +++ E + S + K VI +TDG N + R +G++++++ V Sbjct: 132 FSDAEGGRSR--SPDISKVVIVVTDGRPQD--------NVQDVSARARASGVELFAIGVG 181 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L F + Sbjct: 182 R--VDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQE 218 >gi|332884779|gb|EGK05035.1| hypothetical protein HMPREF9456_03188 [Dysgonomonas mossii DSM 22836] Length = 327 Score = 73.4 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 59/191 (30%), Gaps = 39/191 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 + + +K E N PL+ + + + L+++ + T Sbjct: 108 TRLEAAKKVASEFINDRQSDRIGLVIFAGESFTQCPLTTDHRVLLNLLSEVKFGMIEDGT 167 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + ++ L S + VI +TDG N+ + L E Sbjct: 168 AIGLGLANSVNRL----------KDSQSKSRVVILLTDGSNNAGQ-----IAPLTAAELA 212 Query: 336 RNAGMKIYSVAVSA--------------------PPEGQDLLRKCTD-SSGQFFAVNDSR 374 + G+++Y++ + + + L + + G +F D+ Sbjct: 213 ASYGIRVYTIGIGSRGTSVARVMTPYGMQSMNVSGDFDERTLTEIASKTGGSYFRATDNT 272 Query: 375 ELLESFDKITD 385 L +D+I Sbjct: 273 SLSGIYDEIDQ 283 >gi|317483048|ref|ZP_07942050.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915549|gb|EFV36969.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 813 Score = 73.4 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 60/396 (15%), Positives = 115/396 (29%), Gaps = 67/396 (16%) Query: 31 SALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQ 90 +A+ AA +V+ S S + + A Sbjct: 9 AAVGAAAGRSGKRLVAVVAAVAMLGGVAGVSATAMADDGNASTTQSQTTDEKAAASAPAP 68 Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 ++ P + A+ + T N + N++ + N + I Sbjct: 69 LSTEGTNGVPDDPTLSAPAREKTVTANGDGTYKV---ALNVTGAKSAGTGAIVTNQPLDI 125 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 +VLDVS SM D + + + +I Sbjct: 126 VLVLDVSGSMADNLSGGPKKIDALKTAVNGFIDATADENAKITDQSQ---------RNRI 176 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 ++ + N R G +YN + L+ +++ + S +N L+ Sbjct: 177 ALVKFA--------GTEKTSVGNDFYREGWSSYN---YTQIVSNLTYDVSGLTSTVNGLS 225 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ A + A L + + KK VIF TDGE + S + T+ Sbjct: 226 ASGATSADYAFNRAQAALTYQPRA--------NAKKVVIFFTDGEPNHGSGFDPTVAATA 277 Query: 331 I--CEYMRNAGMKIYSVAV-------SAPPEGQDLLRKC--------------------- 360 + + +++AG IYS+ V + Sbjct: 278 VNKAKSLKDAGTTIYSIGVVSGANPGDTSSNLNKYMHGISSNYPDATATSSEHLWGKSWN 337 Query: 361 ------TDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++S + A D+ +L F+ I +I + Sbjct: 338 ANLGDRAETSSYYKAATDAGQLNNIFESIYQEITKT 373 >gi|218781310|ref|YP_002432628.1| hypothetical protein Dalk_3472 [Desulfatibacillum alkenivorans AK-01] gi|218762694|gb|ACL05160.1| conserved hypothetical protein [Desulfatibacillum alkenivorans AK-01] Length = 308 Score = 73.4 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 58/182 (31%), Gaps = 30/182 (16%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ + + + V RIG + + PL+ + + + + Sbjct: 111 NRLTAVKKVVHDFVKRRDT---------DRIGLVVFGDYAFTQA--PLTLDKGLLLNLIE 159 Query: 268 KLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L T A+ A + + + K VI ++DGEN+ Sbjct: 160 NLRIGMAGRKTAIGDALGVAGKRI----------KDIPAMSKVVILLSDGENTAGDMTPQ 209 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKI 383 ++ IY++ + G L + G+++ +++ +L + +I Sbjct: 210 GAAEALAALGIK-----IYTIGMGTEQAGSKELAQIAAIGQGKYYHASNTEQLDSIYKEI 264 Query: 384 TD 385 Sbjct: 265 DK 266 >gi|154486447|ref|ZP_02027854.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] gi|154084310|gb|EDN83355.1| hypothetical protein BIFADO_00261 [Bifidobacterium adolescentis L2-32] Length = 882 Score = 73.4 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 45/287 (15%), Positives = 90/287 (31%), Gaps = 52/287 (18%) Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 S N+ ++ + I +VLDVS SM+D + L Sbjct: 168 GSYTVNVDVKGAVNSTTVTTTQPIDFTLVLDVSGSMDDPMS----KTDRTRRLDALKEAV 223 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 + NT + K + + R G YN Sbjct: 224 KAFLDEAANTNTEAGSELVHVGLVKFAGDKT-------------DKIGDDMYRSGGYTYN 270 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 + L+ ++N +K++++KL T + A + + + T Sbjct: 271 ---YSQIVSNLTADMNGLKNKVSKLKAAGATRADNGFNRAVKVMGSASAR-------TDA 320 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQ--ICEYMRNAGMKIYSVAV-------SAPPEGQD 355 KK VIF DG + +S ++ + + +++ G +YS+ + S Sbjct: 321 KKVVIFFADGSPTSSSGFEGKVANKAVEAAKELKDGGAAVYSIGIFASANPSSLSSNENQ 380 Query: 356 LLRKCTDS----------------SGQFFAVNDSRELLESFDKITDK 386 + + + +G + + ++ EL FD+I Sbjct: 381 FMHAVSSNFPKATKYNQLGEGNIEAGYYKSATNASELNTIFDEIEKS 427 >gi|149911739|ref|ZP_01900346.1| von Willebrand factor type A domain protein [Moritella sp. PE36] gi|149805212|gb|EDM65230.1| von Willebrand factor type A domain protein [Moritella sp. PE36] Length = 330 Score = 73.4 bits (178), Expect = 5e-11, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 64/199 (32%), Gaps = 47/199 (23%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ ++ + + + R+G I + PL+ +L V + Sbjct: 110 DRLSLVKTVVADFIQQRK---------GDRVGLIFFADNAYLQA--PLTFDLKTVSGYMQ 158 Query: 268 K--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + L T + A + +K +I +TDG+NS Sbjct: 159 QAVLGLVGEQTAIGEGIGLALKRFDAA----------DNPQKVLILLTDGQNSAG----- 203 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPE-----------------GQDLLRKCTD-SSGQ 366 + L ++ + G+KIY++ V A + L+ + GQ Sbjct: 204 EVKPLDAAKFAQEQGVKIYTIGVGADAYYKRTLFGNQKVDPSRDLDEVTLKTIAAQTGGQ 263 Query: 367 FFAVNDSRELLESFDKITD 385 +F D+ L + ++ Sbjct: 264 YFRARDASSLAAIYAELDK 282 >gi|158334872|ref|YP_001516044.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158305113|gb|ABW26730.1| von Willebrand factor type A domain protein [Acaryochloris marina MBIC11017] Length = 419 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 65/187 (34%), Gaps = 23/187 (12%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + + E+A +L++ + RI +A++ L + ++K+ + +L Sbjct: 63 LTTVKEAAQSLIDRLNPGD--------RIAVVAFDHHAKVLVPNQLVEDPEQIKALIQRL 114 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 P T M EL K+ + +TDGE + + + Sbjct: 115 EPKGGTAIDDGMKLGIEELAVGKQGTI---------SQAFLLTDGE----NEHGDNQRCQ 161 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQ 388 Q E + + ++ + +D+L G + ++ F+ + +I+ Sbjct: 162 QFAELAAGYNITLNTLGFGS-HWNEDVLEGIADSGGGSLSFIEKPENAVDVFNSLFTRIE 220 Query: 389 EQSVRIA 395 S+ A Sbjct: 221 TVSLTNA 227 >gi|312196190|ref|YP_004016251.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311227526|gb|ADP80381.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 319 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 24/219 (10%), Positives = 60/219 (27%), Gaps = 31/219 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 +++ S A + ++ + A V+ + +I + Sbjct: 85 ERATIVLAIDVSNSMAATDISPSRLAAAKQGAQAFVDQLP----------PKINLGLVSF 134 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + + + VK+ +N L T ++ + G Sbjct: 135 SGTAAVLVPPTTDRDAVKAGINGLQLGPATAIGEGIYAGLSAINTVSSQ-FVNSGQAVPP 193 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------------- 351 ++ ++DGE + Q + ++A + + ++A P Sbjct: 194 AAIVLLSDGETTRGRPNN------QAAQAAKDAHIPVSTIAYGTPNGTLDVGGQLIPVPV 247 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L + + G EL + + I + Sbjct: 248 NEPALSQIAEQTGGSHHRATSGDELTSIYKGLGSSIGYR 286 >gi|148665999|gb|EDK98415.1| matrilin 3 [Mus musculus] Length = 482 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 58/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + ++ +K + ++ P Sbjct: 97 VKTFVSRIIDTLDIGATD-----TRVAVVNYASTVKIEFQLNTYSDKQALKQAVARITPL 151 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + + + K I +TDG ++ Sbjct: 152 STGTMSGLAIQTAMEEAFTVEAGARGPMSNI--PKVAIIVTDGRPQD--------QVNEV 201 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G+++Y+V V + L+ F V +L F + Sbjct: 202 AARARASGIELYAVGV--DRADMESLKMMASKPLEEHVFYVETYGVIEKLSARFQE 255 >gi|306518578|ref|NP_034900.4| matrilin-3 precursor [Mus musculus] gi|6918887|emb|CAB72265.1| matrilin-3 [Mus musculus] gi|26339344|dbj|BAC33343.1| unnamed protein product [Mus musculus] Length = 481 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 58/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + ++ +K + ++ P Sbjct: 97 VKTFVSRIIDTLDIGATD-----TRVAVVNYASTVKIEFQLNTYSDKQALKQAVARITPL 151 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + + + K I +TDG ++ Sbjct: 152 STGTMSGLAIQTAMEEAFTVEAGARGPMSNI--PKVAIIVTDGRPQD--------QVNEV 201 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G+++Y+V V + L+ F V +L F + Sbjct: 202 AARARASGIELYAVGV--DRADMESLKMMASKPLEEHVFYVETYGVIEKLSARFQE 255 >gi|14548114|sp|O35701|MATN3_MOUSE RecName: Full=Matrilin-3; Flags: Precursor gi|2342635|emb|CAA71532.1| matrilin-3 [Mus musculus] Length = 481 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 58/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + ++ +K + ++ P Sbjct: 97 VKTFVSRIIDTLDIGATD-----TRVAVVNYASTVKIEFQLNTYSDKQALKQAVARITPL 151 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + + + K I +TDG ++ Sbjct: 152 STGTMSGLAIQTAMEEAFTVEAGARGPMSNI--PKVAIIVTDGRPQD--------QVNEV 201 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G+++Y+V V + L+ F V +L F + Sbjct: 202 AARARASGIELYAVGV--DRADMESLKMMASKPLEEHVFYVETYGVIEKLSARFQE 255 >gi|170720775|ref|YP_001748463.1| von Willebrand factor type A [Pseudomonas putida W619] gi|169758778|gb|ACA72094.1| von Willebrand factor type A [Pseudomonas putida W619] Length = 358 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 62/187 (33%), Gaps = 35/187 (18%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-- 270 + + ++ ++ + + G PL+ + V++ L++ Sbjct: 109 WKDEEVSRLDLVKALLGDFLQDREGDRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAQIG 168 Query: 271 -PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +NT A+ A + L + ++ ITDG N+G + L Sbjct: 169 IAGKNTAIGDAIGLAVKRL----------RQRPAQSRVLVLITDGANNGGRIH-----PL 213 Query: 330 QICEYMRNAGMKIYSVAVSAPPE----------------GQDLLRKCTD-SSGQFFAVND 372 ++IY++ + A PE + L++ D + G +F +D Sbjct: 214 TAARLAAQEDVRIYTIGIGANPEASGTPGLLGLNPSLDLDEASLKEIADLTHGAYFRAHD 273 Query: 373 SRELLES 379 EL Sbjct: 274 GAELDAI 280 >gi|118375014|ref|XP_001020694.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89302461|gb|EAS00449.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 610 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 74/208 (35%), Gaps = 28/208 (13%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG----IVGNQCTPL 255 KI+ + + L+ + N + R+ I +N + + L Sbjct: 164 NSESMSGCSKIENVKNTILQLLEML--------NENDRLSLITFNSYAKQLCGLKKVSNL 215 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N +++ N + Y TN + A++ L + + ++DG+ Sbjct: 216 --NKETLQAITNSIKAYGGTNITSGLEIAFQILQS--------RKKKNSVSSIFLLSDGQ 265 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + GA L + + ++ I+S + + + + + G F+ V + + Sbjct: 266 DDGADTKIKNLLKITY-QQLQEESFTIHSFSFGSDHDCPLMQKIAQIKDGSFYFVEKNDQ 324 Query: 376 LLESF-DKITDK----IQEQSVRIAPNR 398 + E F D + Q+ +++I NR Sbjct: 325 VDEFFIDALGGLFSVVAQDLTIKIEINR 352 >gi|148975506|ref|ZP_01812377.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] gi|145964934|gb|EDK30185.1| hypothetical protein VSWAT3_03061 [Vibrionales bacterium SWAT-3] Length = 357 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 62/152 (40%), Gaps = 17/152 (11%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEK--ESSHNTIG 300 G TP + + + E+ ++ + ++T+ A+ A + S+ Sbjct: 154 GDAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSGKQMSAEQAQN 213 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQ 354 +K VI +TDG ++G + + + + + G++I+ +A+ P Sbjct: 214 DIEREKVVIVLTDGNDTG-----SFVEPIDAAKVAKAKGVRIHVIAMGDPQTVGEVALDM 268 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + +++ S G+ F + EL ++++I Sbjct: 269 ETIKRIASESGGEAFEALNRDELSTAYEQIGQ 300 >gi|152995759|ref|YP_001340594.1| von Willebrand factor type A [Marinomonas sp. MWYL1] gi|150836683|gb|ABR70659.1| von Willebrand factor type A [Marinomonas sp. MWYL1] Length = 342 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 56/168 (33%), Gaps = 38/168 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKE 293 RIG I + PLS + + + + E T A+ + L ++ Sbjct: 133 RIGIIVFGSKAYLQA--PLSFDTKTINQLVQEAQIGFAGEQTAIGDAIGLGIKRLEDKPS 190 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 KK +I +TDG N+ + Q + + +KI+++ + A Sbjct: 191 D----------KKVLILMTDGANTAGR-----VQPQQAATFAASQNVKIHTIGIGADSMI 235 Query: 351 --------------PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 + LL+ + G++F + +L + + Sbjct: 236 VQSFFGPKAINPSSDLDETLLKNIAAQTGGEYFRAKSTEDLQAIYQTL 283 >gi|119476361|ref|ZP_01616712.1| batB protein, putative [marine gamma proteobacterium HTCC2143] gi|119450225|gb|EAW31460.1| batB protein, putative [marine gamma proteobacterium HTCC2143] Length = 354 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 56/158 (35%), Gaps = 41/158 (25%) Query: 254 PLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + V+ + + NT A+ + + L + H +I Sbjct: 150 PLTFDQTTVQRFMREAQIGFAGEENTAIGDAIGLSVKRLRDRPGDRH----------VMI 199 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP------------------- 350 +TDG+N+G +N + + N G+ IY++ V A Sbjct: 200 LLTDGQNNGG-----KINPIPASKIAANNGIIIYTIGVGADEMVMPGVLGSSFGSRRVNP 254 Query: 351 --PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L++ + GQ+F + +EL + + + Sbjct: 255 SADLDEKTLQQVATATGGQYFRARNPQELEKIYRLLDQ 292 >gi|159900556|ref|YP_001546803.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893595|gb|ABX06675.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 423 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 65/182 (35%), Gaps = 23/182 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + E+ N+VN + + +++N N+ +++K + Sbjct: 63 RLYQVKEACNNVVNQLNRQDY--------FSVVSFNDRAEVVVPCQRPNDKDQIKRAIGM 114 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + T M +E+ S + + +TDG Y + Sbjct: 115 IEAKGGTEMATGMMMGLQEISRPMMSRGISRM--------VLLTDG-----RTYGDESRC 161 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-QFFAVNDSRELLESFDKITDKI 387 ++I ++ G+ I ++ + +DLL + + + ++++++ F ++ Sbjct: 162 VEIARRAQSKGIGITALGIGDEWN-EDLLETIASAENSRTEYITNAQQIVNVFSDEIKRL 220 Query: 388 QE 389 Q Sbjct: 221 QN 222 >gi|218548279|ref|YP_002382070.1| hypothetical protein EFER_0898 [Escherichia fergusonii ATCC 35469] gi|218355820|emb|CAQ88433.1| conserved hypothetical protein; putative exported protein [Escherichia fergusonii ATCC 35469] Length = 543 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 34/302 (11%), Positives = 85/302 (28%), Gaps = 42/302 (13%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI--------SICMVLDVSRS 159 Y L L + + + + I + ++ + Sbjct: 95 TGSYANVRRFLKTGSLPGADVVRVEELVNYFPLTEATKKNIPGCKGCEENSPFSINYELT 154 Query: 160 MEDLY---LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 D + LPP ++ ++ ++ S Sbjct: 155 PAPWNEKHTLLRLDIAANDIARSKLPPTNLVFLID--------TSGSMNSDERLPLIKSS 206 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 LVN ++ RI + Y + N + + + L TN Sbjct: 207 LKLLVNELRDQD--------RISIVTYAGSARLLLSSTSGAEKNTILNAIANLQAGGGTN 258 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + AY + ++ TDG+ + + + + + R Sbjct: 259 GGAGVAMAYEQAQAGYIKGGVNR--------ILLATDGDFNIG---DDPSSVEDLVKKQR 307 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 +G+ + ++ V + ++ K D+ G + L E+ +++++ + V +A Sbjct: 308 ESGITLSTLGVGDNNYNEAMMVKIADTGNGNYSY---LDSLSEAQKVLSNEMNQTLVTVA 364 Query: 396 PN 397 + Sbjct: 365 KD 366 >gi|21244101|ref|NP_643683.1| hypothetical protein XAC3376 [Xanthomonas axonopodis pv. citri str. 306] gi|21109728|gb|AAM38219.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306] Length = 323 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 66/206 (32%), Gaps = 48/206 (23%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + ++ G TPL+ +L Sbjct: 105 VLGGKVVDRLTAAKAVLSDFLDR-----------RDGDRVGLLVFGQRAYALTPLTADLT 153 Query: 261 EVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V+ +L T A+ + + L +K+ ++ V+ +TDG N+ Sbjct: 154 SVRDQLRDSVVGLAGRETAIGDAIALSVKRLREQKQG----------QRVVVLLTDGVNT 203 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL-----------------LRKC 360 LN L+ E + G++++++A G L LRK Sbjct: 204 AGV-----LNPLKAAELAKAEGVRVHTIAFG-GSGGYSLFGVPIPAGGDDDIDEDGLRKI 257 Query: 361 TD-SSGQFFAVNDSRELLESFDKITD 385 + G+FF D+ EL + ++ Sbjct: 258 AQQTGGRFFRARDTDELAGIYAELDR 283 >gi|281357358|ref|ZP_06243847.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] gi|281316389|gb|EFB00414.1| von Willebrand factor type A [Victivallis vadensis ATCC BAA-548] Length = 342 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 42/317 (13%), Positives = 97/317 (30%), Gaps = 72/317 (22%) Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 ++ A++ + + + + L+ G + + I I + LD+S S Sbjct: 36 TVRPFGAVVAKHRPTFRLVAMMLGLAVLIVALARPRYGDEKVLIRSQGIDIVLALDMSGS 95 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 ME + ++ ++ A +I+V + Sbjct: 96 MEAYDVPRNINDARTLIA----------------------AVKNKEVENRIEVAKKEIRR 133 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE---NTN 276 + + RIG I + L + + + L +L P T Sbjct: 134 FIEQ---------RPNDRIGLIGFADQAYSFAPPTL--DHAWLLAHLEQLEPGMIGQQTG 182 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + L S ++ ++ TDG N+ N L Q + Sbjct: 183 IAAPLASGVNRL----------KKSDAPRRVLVLFTDGRNNVD----NRLTPEQAAALGK 228 Query: 337 NAGMKIYSVAVSA--------------------PPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + I++V + + + LLR + + G +F D+ Sbjct: 229 EFDVVIHTVGIGSRNAFVLVTDPFGRQQFQGIEDEFDEKLLRSLAEITGGTYFHAADADG 288 Query: 376 LLESFDKITDKIQEQSV 392 + + D+I +++++ ++ Sbjct: 289 MKQVMDEI-NQLEKTTI 304 >gi|282863310|ref|ZP_06272369.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282561645|gb|EFB67188.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 624 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 23/197 (11%), Positives = 65/197 (32%), Gaps = 13/197 (6%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 S + + +++ + G +V+++ + Sbjct: 22 TDDAGGSLVMVLDSSGSMGEDDGTGSTRMESARRAVGAVVDALPDGYPTGLRVYGADRPQ 81 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 + PL + VKS + + P +T ++ A +L ++ + T Sbjct: 82 GCADTRLVRPVRPL--DRAAVKSAVAGVRPTGDTPIGLSLRKAAEDLPAPRDGAARTRT- 138 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 ++ ++DGE++ + + + G++I +V ++ L Sbjct: 139 ------IVLVSDGEDTCGTPPPCEVAARLAGQGA---GLRIDTVGFQVKGAAREQLECVA 189 Query: 362 DSS-GQFFAVNDSRELL 377 ++ G+++ D+ L Sbjct: 190 EAGNGRYYDAPDADALA 206 >gi|330975134|gb|EGH75200.1| von Willebrand factor, type A [Pseudomonas syringae pv. aptata str. DSM 50252] Length = 352 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 52/146 (35%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ A + L + ++ Sbjct: 150 PLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL----------RMRPATSRALVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY + + + P+ + Sbjct: 200 VTDGANNAGQ-----IDPITAARLAAEEGVKIYPIGIGSDPDKDALQSVLGLNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 255 PTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|330945007|gb|EGH46785.1| von Willebrand factor, type A [Pseudomonas syringae pv. pisi str. 1704B] Length = 258 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 52/146 (35%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ A + L + ++ Sbjct: 56 PLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL----------RMRPATSRALVL 105 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY + + + P+ + Sbjct: 106 VTDGANNAGQ-----IDPITAARLAAEEGVKIYPIGIGSDPDKDALQSVLGLNPSLDLDE 160 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 161 PTLKEIASLSGGQYFRARDGDQLEKI 186 >gi|313139523|ref|ZP_07801716.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] gi|313132033|gb|EFR49650.1| conserved hypothetical protein [Bifidobacterium bifidum NCIMB 41171] Length = 835 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 40/374 (10%), Positives = 108/374 (28%), Gaps = 46/374 (12%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + + T + + I E++ + ++ + Sbjct: 161 TATGTKDGETRTTDSNGVITLKAGQYAVLLGSDAKRITESSKYKVTEINVDQDTYAVSAN 220 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE-NLAISICMVLDVSRSM 160 + + + T G +P ++ + + + N + + L+V+ + Sbjct: 221 GGQVKVTQEKDSATTEPVSVGEVPRITVTNTVVTAPRYRKYIKANNDGTYDLSLNVTGTQ 280 Query: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL 220 + + + S P N +++V + ++ Sbjct: 281 SGSSQTTVSPADIVVVFDT-----------------SGSMSNPMGHNSRLEVAKTAVNSM 323 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT--- 277 + + + K+ ++R+ + + + ++N ++ S +N L TN Sbjct: 324 AQHLLTSENQGKDSNIRMALVPF--STTVGNVSNFTDNAMDIVSAVNGLRADGGTNWEAA 381 Query: 278 --------YPAMHHAYREL------YNEKESSHNTIGSTRLKKFVIFITD------GENS 317 + + +S G+ + D G + Sbjct: 382 LKAANAKLTSGRKGVKKYIVFMSDGDPTFRTSSVRTGTDWWGRPTYDDDDRRGLPAGVHG 441 Query: 318 GASAYQNTLNTLQICEYMRNAG-MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 S+ Q N G ++SV VS+ P + + G +++ + EL Sbjct: 442 SGSSDQYGANLSSAVAEANRRGDATLFSVGVSSDPT--KMRGFADQTKGSYYSATSTDEL 499 Query: 377 LESFDKITDKIQEQ 390 ++F I +I + Sbjct: 500 NKAFADIIGQINRK 513 >gi|171914502|ref|ZP_02929972.1| von Willebrand factor type A domain protein [Verrucomicrobium spinosum DSM 4136] Length = 424 Score = 73.4 bits (178), Expect = 6e-11, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 57/159 (35%), Gaps = 15/159 (9%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 +AY+ + + + VK+ ++++ +T + + EL K + Sbjct: 79 VVAYDDAVSLISPATDLTDRDRVKAAIDRIQAGGSTALFSGISKGAEELRRNKRPNQVNR 138 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 V+ ++DG + + L L + G+ + ++ + +DL+ + Sbjct: 139 --------VVLLSDGMANVGPSSPQDLGRLGA--SLAKEGITVTTLGLGLG-YNEDLMTE 187 Query: 360 CTD-SSGQFFAVNDSRELLESFDKITDKIQE---QSVRI 394 S G + +S+ L F I Q +R+ Sbjct: 188 LALRSDGNHAFIENSQNLAGIFQTEFGDILSVVAQRIRV 226 >gi|116751034|ref|YP_847721.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] gi|116700098|gb|ABK19286.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] Length = 479 Score = 73.4 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 62/182 (34%), Gaps = 22/182 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI-VGNQCTPLS-NNLNEVKSRL 266 K+ ++ NL++ + + R ++Y+ + P++ N ++ + Sbjct: 109 KLTHARQAVLNLLSRLSETD--------RFALVSYSDHVQRHGGLLPITPANRATLERIV 160 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + P TN + +L +++ + +I I+DG + + L Sbjct: 161 RGIQPGGATNLGGGLQEGISQLAELQQNGRLSR--------LILISDGLANRGVTDPSAL 212 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITD 385 T + G + +V V + L+ D G + + + + FDK Sbjct: 213 GT--MASVAAERGYAVSTVGVGLD-FNEHLMTSIADKGAGNYTFMESASAFAQVFDKEFR 269 Query: 386 KI 387 Sbjct: 270 DA 271 >gi|47220812|emb|CAG00019.1| unnamed protein product [Tetraodon nigroviridis] Length = 1557 Score = 73.4 bits (178), Expect = 7e-11, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 59/169 (34%), Gaps = 18/169 (10%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 ++ ++ +N + + K++ NT T A+ H +++ Sbjct: 925 GTDGTQVAIAQFSDDARTEFQLSSHSNKEALLEAIQKISYKGGNTKTGRAIKHVKESIFS 984 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K ++ +TDG + ++ + M+ G I+++ + Sbjct: 985 LEAGARR-----GVPKVLVVLTDGRSQDDVN--------KVSKEMQMDGYIIFAIGFADA 1031 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 G+ L F V+D + + +++ + E + P+ Sbjct: 1032 DYGE--LVNIASKPSDRHVFFVDDLDAVKKIEEQLITFVCEAATATCPS 1078 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 28/155 (18%), Positives = 54/155 (34%), Gaps = 18/155 (11%) Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIG 300 Y+ + + V + L NT T A++ + + S Sbjct: 89 QYSGDPRIEWHLNAYSTKDAVIDAVRNLPYKGGNTLTGLALNFILENCFKPESGSR---- 144 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 L K I ITDG++ E +RNAG++++++ V ++ L+ Sbjct: 145 -EGLPKIGILITDGKSQDDVVPP--------AESLRNAGIELFAIGV--KNADENELQSI 193 Query: 361 TDSSG--QFFAVNDSRELLESFDKITDKIQEQSVR 393 + V D + + +T + EQ V+ Sbjct: 194 ASPPEDTHVYNVADFSVMNSIVEALTRTVCEQVVQ 228 >gi|168186710|ref|ZP_02621345.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] gi|169295292|gb|EDS77425.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] Length = 693 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 43/270 (15%), Positives = 85/270 (31%), Gaps = 66/270 (24%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 KK ++ K A A + KID L ++A N VN + + +IG ++Y Sbjct: 160 NKKRYYLKYDYSDGNWYAIAYSEPKIDELQKAAKNFVNKFE------IKANTKIGLVSYG 213 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYRELYNEKE---------- 293 G + L+N L+ + S ++ L+ TN + A L N + Sbjct: 214 NN--GKEVHSLTNELDRINSSIDSGLSIGGGTNVGDGIRMANGILNNGSDADKYIVLMTD 271 Query: 294 ---------SSHNTIGSTRLKKF------------------------------------V 308 + + K+ Sbjct: 272 GMPTAATCYNDIYYKNNRFYSKYGEILNGNGNPLSYFNYIGNYKYKFEYNPNDYTHEDEK 331 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 I + G+N + N + + + +G+ + + S + L T+ +G + Sbjct: 332 IIMNYGDNDYGNIALN--YSKEALKRASESGVNNFVIGFSNGINREKLNGIATEGNGYYR 389 Query: 369 AVNDSRELLESFDKITDKIQEQSVRIAPNR 398 EL + + +I D+I V+ + Sbjct: 390 EAMHGDELTDVYKRIADEINNPVVKNIEFK 419 >gi|186681467|ref|YP_001864663.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186463919|gb|ACC79720.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 418 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 68/180 (37%), Gaps = 23/180 (12%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ++ + ++A LV+ + N S R+ + ++ + + ++K+++N+L Sbjct: 59 LETVKKAANRLVDRL--------NPSDRLSVVVFDHRAKVLVPSQSVEDPEKIKNQINRL 110 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T + EL G +TDGE + + + L Sbjct: 111 AADGGTAIDEGLRLGIEEL---------AKGKKDTVSQAFLLTDGE----NEHGDNNRCL 157 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQ 388 + + + + + ++ QD+L K D+ G + E ++ F+++ +IQ Sbjct: 158 KFAQLAASYNLTLNTLGFGDNWN-QDVLEKIADAGLGTLSYIQKPEEAVDEFNRLFSRIQ 216 >gi|218191186|gb|EEC73613.1| hypothetical protein OsI_08104 [Oryza sativa Indica Group] Length = 709 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 18/139 (12%), Positives = 44/139 (31%), Gaps = 27/139 (19%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + + +N L TN A+ A + + + +I ++DG++ Sbjct: 323 SGRQQALQAVNLLGAGGGTNIADALKKAAKVIED--------RNYKNPVCSIILLSDGQD 374 Query: 317 SGASAYQ-------------NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 + + +++ IC + ++ A + L S Sbjct: 375 TYNISSNVRGTRPDYRSLVPSSILNHTICT------VPVHGFGFGADHDSDALHSIAESS 428 Query: 364 SGQFFAVNDSRELLESFDK 382 G F + D + ++F + Sbjct: 429 GGTFSFIEDESVIQDAFAQ 447 >gi|261409467|ref|YP_003245708.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285930|gb|ACX67901.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 595 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 20/142 (14%), Positives = 53/142 (37%), Gaps = 9/142 (6%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + +K +++L+ T+ + A + L + +H + D + + Sbjct: 100 DKTALKEFIDQLDRGPYTDMSVGLDEAVKVLKQGMDPAHAPMIVVLADG----NNDLDPN 155 Query: 318 GASAYQNTLNTLQIC-EYMRNAGMKIYSVAVSAPP--EGQDLLRKCTDSSGQFFAVNDSR 374 + L + + +G+ IY++ ++A + L + G+ F + + Sbjct: 156 TGRTSKEASEQLNQAVKEAKGSGIPIYTIGLNADGKLNKETLAELAKQTGGKSFTTSSAD 215 Query: 375 ELLESFDKITDKIQEQSVRIAP 396 +L + +I Q ++I P Sbjct: 216 DLPQILSEIF--ASHQQLKIVP 235 >gi|119504633|ref|ZP_01626712.1| BatB protein, putative [marine gamma proteobacterium HTCC2080] gi|119459655|gb|EAW40751.1| BatB protein, putative [marine gamma proteobacterium HTCC2080] Length = 332 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 26/184 (14%), Positives = 65/184 (35%), Gaps = 28/184 (15%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---P 271 S + + +++ + + G PL+ ++ VK + + Sbjct: 111 NSLVSRITAVKAIAADFASRRTGDRVGLILFGTRAYVQAPLTFDVKTVKQFIEEAQLGFA 170 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 E+T A+ A + L S + +I +TDG+++ +T++ ++ Sbjct: 171 GEDTAIGDALGLAVKRLRERPADS----------RVLILLTDGQDTA-----STVDPMEA 215 Query: 332 CEYMRNAGMKIYSVAV----SAPPE-----GQDLLRKCT-DSSGQFFAVNDSRELLESFD 381 +KIY++ + + LL + G++F +EL + + Sbjct: 216 AALASEMNVKIYTIGISRRLGTSSNSSGEVDEALLTAIAQATGGRYFRARTPKELQDIYQ 275 Query: 382 KITD 385 + + Sbjct: 276 VLDE 279 >gi|115447343|ref|NP_001047451.1| Os02g0619600 [Oryza sativa Japonica Group] gi|47847560|dbj|BAD21612.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|47847788|dbj|BAD21564.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|113536982|dbj|BAF09365.1| Os02g0619600 [Oryza sativa Japonica Group] gi|215701433|dbj|BAG92857.1| unnamed protein product [Oryza sativa Japonica Group] gi|222623257|gb|EEE57389.1| hypothetical protein OsJ_07557 [Oryza sativa Japonica Group] Length = 709 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 18/139 (12%), Positives = 44/139 (31%), Gaps = 27/139 (19%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + + +N L TN A+ A + + + +I ++DG++ Sbjct: 323 SGRQQALQAVNLLGAGGGTNIADALKKAAKVIED--------RNYKNPVCSIILLSDGQD 374 Query: 317 SGASAYQ-------------NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 + + +++ IC + ++ A + L S Sbjct: 375 TYNISSNVRGTRPDYRSLVPSSILNHTICT------VPVHGFGFGADHDSDALHSIAESS 428 Query: 364 SGQFFAVNDSRELLESFDK 382 G F + D + ++F + Sbjct: 429 GGTFSFIEDESVIQDAFAQ 447 >gi|194220937|ref|XP_001501805.2| PREDICTED: matrilin 3 [Equus caballus] Length = 450 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 58/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + ++ +K + ++ P Sbjct: 66 VKTFVSQIIDTL-----DIGPADTRVAVVNYASTVKIEFHLNTHSDKQSLKQAVARITPL 120 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + S + K I +TDG ++ Sbjct: 121 STGTMSGLAIQTAMDEAFTVEAGARV--PSYNIPKVAIIVTDGRPQD--------QVNEV 170 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G+++Y+V V + L+ F V +L F + Sbjct: 171 AARARASGIELYAVGV--DRADMESLKVIASEPLDEHVFYVETYGVIEKLSSRFQE 224 >gi|117919904|ref|YP_869096.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117612236|gb|ABK47690.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 338 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 63/204 (30%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + ++ + + RIG I + PL+ + V Sbjct: 104 NGKVVDRFTLIQHVVSEFIERRK---------GDRIGLILFADHAYLQA--PLTQDRRSV 152 Query: 263 KSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 L ++ T ++ A + + +I +TDG N+ Sbjct: 153 AQFLKEAQIGLVGKQTAIGESIALAVKRFDKM----------DESNRVLILLTDGSNNAG 202 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD 362 + + Q + N + IY+V V A ++ L+ D Sbjct: 203 N-----IEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAD 257 Query: 363 -SSGQFFAVNDSRELLESFDKITD 385 + G++F +S+EL + + +I Sbjct: 258 VTHGRYFRARNSQELEQIYQEIDK 281 >gi|114046974|ref|YP_737524.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113888416|gb|ABI42467.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 338 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 63/204 (30%), Gaps = 47/204 (23%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + ++ + + RIG I + PL+ + V Sbjct: 104 NGKVVDRFTLIQHVVSEFIERRK---------GDRIGLILFADHAYLQA--PLTQDRRSV 152 Query: 263 KSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 L ++ T ++ A + + +I +TDG N+ Sbjct: 153 AQFLKEAQIGLVGKQTAIGESIALAVKRFDKM----------DESNRVLILLTDGSNNAG 202 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD 362 + + Q + N + IY+V V A ++ L+ D Sbjct: 203 N-----IEPEQAAQIAANRKVTIYTVGVGADVMERRTLFGRERVNPSMDLDENQLKHIAD 257 Query: 363 -SSGQFFAVNDSRELLESFDKITD 385 + G++F +S+EL + + +I Sbjct: 258 VTHGRYFRARNSQELEQIYQEIDK 281 >gi|312961300|ref|ZP_07775805.1| von Willebrand factor, type A [Pseudomonas fluorescens WH6] gi|311284958|gb|EFQ63534.1| von Willebrand factor, type A [Pseudomonas fluorescens WH6] Length = 362 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 60/187 (32%), Gaps = 35/187 (18%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-- 270 + + ++ ++ + + G PL+ + V++ L++ Sbjct: 109 WRDEDVSRLSLVKHLLGDFLQQREGDRVGLILFGSQAYLQAPLTFDRRTVRTWLDEARIG 168 Query: 271 -PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +NT A+ A + L + +I ITDG N+ ++ L Sbjct: 169 IAGKNTAIGDAIGLALKRL----------RLRPAQSRVLILITDGANNAGQ-----IDPL 213 Query: 330 QICEYMRNAGMKIYSVAVSAPPE----------------GQDLLRKCTD-SSGQFFAVND 372 G+KIY + + A PE + L+ + GQ+F D Sbjct: 214 TAARLAAEEGVKIYPIGIGADPEQTGSLGILGVNPSLDLDEPALKAIAAATGGQYFRARD 273 Query: 373 SRELLES 379 EL Sbjct: 274 GEELQAI 280 >gi|218672104|ref|ZP_03521773.1| von Willebrand factor type A [Rhizobium etli GR56] Length = 366 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 57/198 (28%), Gaps = 20/198 (10%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 K+ +L + LVN ++ + + Y Sbjct: 84 VSGSMDEPDKLPLLKSAFRLLVNRLKPDDT--------VSIVTYAGNAGTVLEPTRVAEK 135 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +++ S +++L +T + AY V+ TDG+ + Sbjct: 136 SKILSAIDRLEAGGSTGGAEGIEAAYDLAKKAFVKDGVNR--------VMLATDGDFNVG 187 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + +I E R G+ + + L++ + + L E+ Sbjct: 188 --PSSDEDLKRIIEDKREEGIFLTVLGFGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEA 243 Query: 380 FDKITDKIQEQSVRIAPN 397 + ++ IA + Sbjct: 244 QKTLVEEAGSTLFPIAKD 261 >gi|188990358|ref|YP_001902368.1| hypothetical protein xccb100_0962 [Xanthomonas campestris pv. campestris str. B100] gi|167732118|emb|CAP50310.1| putative membrane protein [Xanthomonas campestris pv. campestris] Length = 335 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 33/194 (17%), Positives = 71/194 (36%), Gaps = 35/194 (18%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN- 270 VL + + + + + + + + G TPL+ +L V+ +L Sbjct: 117 VLGGNVVDRLTAAKAVLSDFLDRREGDRVGLLVFGQRAYALTPLTADLTSVRDQLADSVV 176 Query: 271 --PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T A+ + + L ++ ++ V+ +TDG N+ LN Sbjct: 177 GLAGRETAIGDAIALSVKRLREQRHG----------QRVVVLLTDGVNTAGV-----LNP 221 Query: 329 LQICEYMRNAGMKIYSVAVSAPPE----------------GQDLLRKCTD-SSGQFFAVN 371 L+ E + G++++++A +D LRK + + G+FF Sbjct: 222 LKAAELAKAEGVRVHTIAFGGSGSYSLFGVPIPAGGGDDIDEDGLRKIAEQTGGRFFRAR 281 Query: 372 DSRELLESFDKITD 385 D+ EL + ++ Sbjct: 282 DTEELAGIYAELDR 295 >gi|326532158|dbj|BAK01455.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 674 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 44/134 (32%), Gaps = 17/134 (12%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + +N L TN A+ A + + + +I ++DG++ Sbjct: 320 YGRQQALQAINSLGAGGGTNIADALKKATKVIED--------RSYKNSVCSIILLSDGQD 371 Query: 317 SG--------ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 + S ++L I R + I++ A + L S G F Sbjct: 372 TYNICSNVRGGSKDYSSLVPPSILSDTRRM-LPIHAFGFGADHDSDSLHSIAEASGGTFS 430 Query: 369 AVNDSRELLESFDK 382 + D + ++F + Sbjct: 431 FIEDEGVMQDAFAQ 444 >gi|326513050|dbj|BAK03432.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326519604|dbj|BAK00175.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326532408|dbj|BAK05133.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 700 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 44/134 (32%), Gaps = 17/134 (12%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + +N L TN A+ A + + + +I ++DG++ Sbjct: 320 YGRQQALQAINSLGAGGGTNIADALKKATKVIED--------RSYKNSVCSIILLSDGQD 371 Query: 317 SG--------ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 + S ++L I R + I++ A + L S G F Sbjct: 372 TYNICSNVRGGSKDYSSLVPPSILSDTRRM-LPIHAFGFGADHDSDSLHSIAEASGGTFS 430 Query: 369 AVNDSRELLESFDK 382 + D + ++F + Sbjct: 431 FIEDEGVMQDAFAQ 444 >gi|68536401|ref|YP_251106.1| hypothetical protein jk1316 [Corynebacterium jeikeium K411] gi|260577533|ref|ZP_05845473.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] gi|68264000|emb|CAI37488.1| hypothetical protein jk1316 [Corynebacterium jeikeium K411] gi|258604337|gb|EEW17574.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] Length = 663 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 35/229 (15%), Positives = 83/229 (36%), Gaps = 25/229 (10%) Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 N+ ++ +++ K + + S ++D ++A + ++++ Sbjct: 45 DDNNGSDGSTSSNDATGGDSKVAVVLDASDSMAEKDTGDGGTRMDAAKKAANDTIDTLAD 104 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTP----LSNNL-NEVKSRLNKLNPYENTNTYPAM 281 + Q + A + G Q L NN +++ ++N L P T A+ Sbjct: 105 SAQTAVIAYGSEESNAPDNRDKGCQDITTLASLGNNKPEDLEDKINGLEPKGYTPIGNAI 164 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM- 340 A EL + K+ +I ++DG ++ + E + G+ Sbjct: 165 KKAAEELGSSG------------KRNIILVSDGIDT-----CAPPPVCDVAEDIAGDGID 207 Query: 341 -KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 I++V + Q L ++ S G + + +D+ L E+ ++ Sbjct: 208 LAIHTVGFKVDDKAQKELECISEVSGGTYTSADDTEALTEALTDAAQRV 256 >gi|257064432|ref|YP_003144104.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Slackia heliotrinireducens DSM 20476] gi|256792085|gb|ACV22755.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Slackia heliotrinireducens DSM 20476] Length = 629 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 34/380 (8%), Positives = 98/380 (25%), Gaps = 31/380 (8%) Query: 8 VCFLFITYAI----DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 + + I + ++ + ++A + ++ + T Sbjct: 94 LLLVAIGVGVGTNLLGSNAEMPVAETKAASE--DTMAGSANSYAPDGGLAYETDEAYETF 151 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 + + + E I + ++ + ++ + + L Sbjct: 152 --DTLDEGAPMEDFNTEEYAAIEENGFVSTVTRPLSTCSADVDTASYCNLRRMINDGYSL 209 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 + + + + S + ND + + Sbjct: 210 DEIPDGAVRIEEMLNYFHYDSGEPEGNDLFAVRAESARCPW----NDQTQLLVMTFTASD 265 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 + + N K+D+L +S G L+ ++ + R+ + Y Sbjct: 266 KAQTASKGSNLVFLIDISGSMDEPDKLDLLKDSFGTLLENLG--------PNDRVSIVTY 317 Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 G ++ ++ LN+L +TN + AY Sbjct: 318 AAGEDVLLEGASGDDTRKIMRALNRLEADGSTNGEAGLEMAYEVAERNYIEGGVNR---- 373 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 ++ +DG+ + + + E R G+ + + + + D Sbjct: 374 ----IVMASDGDLNVG--ITSESDLYDFVEEKRETGVYLSVLGFGSGNYKDTKMETLADH 427 Query: 364 S-GQFFAVNDSRELLESFDK 382 G + ++ E + Sbjct: 428 GNGTYHYIDCVEEAERVLGE 447 >gi|119512059|ref|ZP_01631153.1| hypothetical protein N9414_12318 [Nodularia spumigena CCY9414] gi|119463285|gb|EAW44228.1| hypothetical protein N9414_12318 [Nodularia spumigena CCY9414] Length = 435 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 57/194 (29%), Gaps = 17/194 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NK 268 I +I++ L++ +Q + E RI +A+ + +K+++ K Sbjct: 59 IATVIQAVEQLLDRLQPSDSETPTSGDRISVVAFAGEAQVIIPNQTLQDTASIKAQIHKK 118 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ----- 323 L T + EL G+ +TDG + Sbjct: 119 LKASGGTAIAEGLQLGITEL---------MKGTKGAVSQAFLLTDGHGESSLKIWKFEIG 169 Query: 324 --NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + L+ + I ++ L + G + + LE F Sbjct: 170 KDDNKRCLEFAHKATKINLTINTLGFGNDWNQDLLEKIADAGGGTLAYIERPEQALEQFR 229 Query: 382 KITDKIQEQSVRIA 395 + +IQ + A Sbjct: 230 HLLQRIQSVRLTNA 243 >gi|78186535|ref|YP_374578.1| hypothetical protein Plut_0657 [Chlorobium luteolum DSM 273] gi|78166437|gb|ABB23535.1| putative membrane protein [Chlorobium luteolum DSM 273] Length = 356 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 29/248 (11%), Positives = 65/248 (26%), Gaps = 17/248 (6%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIK-DPTTKKDQ 59 + AI++ V F A+DLA I ++ ++Q+A DAA L G S+ + + + Sbjct: 21 LFAIVLPVLLGFAALAVDLARIHLVKVELQNAADAASLGGARSLSDPGGQPYNWSAASIK 80 Query: 60 TSTIFKKQIKKHLKQGSY---------IRENAGDIAQKAQINITKDKNNPLQYIAESKAQ 110 + + + + + G ++ A Sbjct: 81 ALDVARSNVANGGQIQDAAIETGYWNILNPALGMRPAGTPGVPATGDVPAVRVTTAISAT 140 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH-- 168 L ++ ++ + +I S + ++ Sbjct: 141 QNNGPLQLLFAPILGITERSIQASAIAVIAPPSGGTGMFPFVIATPMLDHYWDRDTNSPV 200 Query: 169 ----NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + Y + TT P+ D+L + N ++ Sbjct: 201 LENGVAPTIKLGSIYHFEDSDDGVLSGEWTTFQTEDGNPSGRFL-WDLLKDLTTNGNDTA 259 Query: 225 QKAIQEKK 232 Sbjct: 260 LYIGDNTY 267 >gi|256376278|ref|YP_003099938.1| hypothetical protein Amir_2147 [Actinosynnema mirum DSM 43827] gi|255920581|gb|ACU36092.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 321 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 57/157 (36%), Gaps = 21/157 (13%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + + V ++ L ++T T A+ A + + + V+ +TDG+ Sbjct: 147 TTDRSAVSQGIDGLKLAQSTATGDAIVAALSAIDSFGKVVGGA--DGPPPARVVLMTDGK 204 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 + T + AG+ I +++ P + +++ Sbjct: 205 ETVG-----TRKATDAAGDAKEAGIPISTISFGTERGSVDINGKAQEVPVDDESMKEIAK 259 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 S G+FF + EL +D + ++I + + +R Sbjct: 260 ISGGEFFKAASAEELRRVYDTLGEQIGYEKKQADASR 296 >gi|149773083|emb|CAO01891.1| collagen typeVI alpha 5 [Mus musculus] Length = 1212 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 39/368 (10%), Positives = 98/368 (26%), Gaps = 30/368 (8%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKH---LKQGSYIRENAGDI 85 + S D A ++++ +K T K+G + Sbjct: 308 LMSFSDRAQTISSLRSSANQSEFQQQIQKLSLQTGASNVGAAIEQMRKEGFSESSGSRKA 367 Query: 86 AQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS--LRSTGIIERSS 143 QI + + E+ + +F G+ + T L + S+ Sbjct: 368 QGVPQIAVLVTHRASDDMVREAALDLRLEGVTMFAMGIEGANNTQLEDIVSYPSRQSIST 427 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 + + + + + Sbjct: 428 HSSYSHLESYSGNFLKKIRNEIWTQVST-RAEQMELDKTGCVDTKEADIYFLIDGSSSIR 486 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 +I + + S ++ VR+G + Y+ + ++K Sbjct: 487 KKEFEQIQIFMSSVIDMFP--------IGPNKVRVGVVQYSHKNEVEFPVSRYTDGIDLK 538 Query: 264 SRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + T T A+ + K R ++I +TDG+++ + Sbjct: 539 KAVFNIKQLKGLTFTGKALDFILPLIKKGKTERT-----DRAPCYLIVLTDGKSNDS--- 590 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 L+ +R + I+++ + + LR+ + + L ++ Sbjct: 591 -----VLEPANRLRAEQITIHAIGIG--EANKTQLRQIAGKDERVNFGQNFDSLKSIKNE 643 Query: 383 ITDKIQEQ 390 I +I + Sbjct: 644 IVHRICSE 651 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 29/161 (18%), Positives = 65/161 (40%), Gaps = 13/161 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYREL 288 + V+IG + Y+ ++ + + L + + T T A+ H+ L Sbjct: 877 DVGRDRVQIGALTYSNHPEILFYLNTYSSGSAIAEHLRRPRDTGGETYTAKALQHS-NVL 935 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E H + + +++ +I ITDG S ++ L+ + +R+ G+ I++V V Sbjct: 936 FTE---EHGSRLTQNVRQLMIVITDGV----SHDRDKLD--EAARELRDKGITIFAVGVG 986 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 QD L V++ +L + + + + + Sbjct: 987 --NANQDELETMAGKKENTVHVDNFDKLRDIYLPLQETLCN 1025 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/193 (15%), Positives = 61/193 (31%), Gaps = 22/193 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + NLV IQ ++G + ++ + Sbjct: 665 VDSSGSIGPTNFETMKTFMKNLVGKIQ-----IGADRSQVGVVQFSDYNREEFQLNKYST 719 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 E+ + +++++P NT T A+ K KF+I +TDG+ Sbjct: 720 HEEIYAAIDRMSPINRNTLTGGALTFVNEYFDLSKGGRPQVR------KFLILLTDGKAQ 773 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 +R+ + I+SV V + L + + F V + L Sbjct: 774 DEVGGP--------ATALRSKSVTIFSVGV--YGANRAQLEEISGDGSLVFHVENFDHLK 823 Query: 378 ESFDKITDKIQEQ 390 K+ ++ Sbjct: 824 AIESKLIFRVCAL 836 Score = 40.7 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 44/136 (32%), Gaps = 15/136 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNE 291 R+ Y+ + N N + + L N + A+ A+R ++ Sbjct: 65 NKYRVALAQYSDALHNEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSA 124 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + V+ + ++ + + + +R G+KI SV V Sbjct: 125 PTNGRDKK--QFPPILVVLAS---------AESEDDVEEAAKALREDGVKIISVGVQK-- 171 Query: 352 EGQDLLRKCTDSSGQF 367 ++ L+ S F Sbjct: 172 ASEENLKAMATSQFHF 187 >gi|323136279|ref|ZP_08071361.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] gi|322398353|gb|EFY00873.1| von Willebrand factor type A [Methylocystis sp. ATCC 49242] Length = 577 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 49/433 (11%), Positives = 102/433 (23%), Gaps = 78/433 (18%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 ++ K Q+ + S + + Sbjct: 143 IALALDNSGSMNESAGGATKIQSLKTAATNFVNSMFAKSPGKVKIAITPFAGLVIPVDPT 202 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM-VLDV 156 + + + G + + R S+ Sbjct: 203 VAANRALPWIDVNGLSSQHWITFGGKANANAAGFTSRFNVFSNLKSQRADWDFGGCYEPQ 262 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE- 215 M N YL P P S + N L Sbjct: 263 PYPMNVTETAPTAGNAETLFVPYLAPDEPDSSAYENNYLNDDGGGCSLWTFGGWTDLTRT 322 Query: 216 --------SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI-------VGNQCTPLSNNLN 260 + + V G L+ + Sbjct: 323 CKYKPATGWTSGIWSWFGATAGNGWTGGVFASRGGAFNGPNGMCPNAATQTALQLTPTQS 382 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE-SSHNTIGSTRLKKFVIFITD------ 313 + +++ +L +TN + + A+R + S+ + + ++K ++ +TD Sbjct: 383 TITAKIAQLTAAGDTNLHEGVMWAWRSISPNPPFSAGSAYNTAGVRKILVLMTDGYNNWT 442 Query: 314 -----------------------------GENSGASAYQNTLN----------------- 327 G YQ+ L+ Sbjct: 443 SNTNTVGGSYYEALGYYSYNGAKNRRLPDGTQGNGVDYQSQLDGAANSWTDYKSVSRQAQ 502 Query: 328 ---TLQICEYMRNAGMKIYSVAVSA-----PPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 T Q CE + G++IYS+A S G +LL+ C ++ + DS ++ + Sbjct: 503 DELTRQSCENAKAKGIEIYSIAFSVSTNPIDAAGINLLKSCATNADHYLLATDSTQIDRA 562 Query: 380 FDKITDKIQEQSV 392 F +I + + + Sbjct: 563 FSQIAMNLSKLRL 575 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 24/213 (11%), Positives = 53/213 (24%), Gaps = 29/213 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + + A+D A + +Q D A L+ + I + + D + Sbjct: 19 LFGLSVIPVMMMAGAAVDYARGVTTHKVLQQGADTAALAVASRITAATSTADAIKQAQNV 78 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +++ + I + AQ++I + Sbjct: 79 LRSASQRLAAATISNATISADRKTFCIDAQVSI-----------------------PTMI 115 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ----KHNDNNNMTS 176 + ++ S E + I + LD S SM + + Sbjct: 116 MKIARIDSMAPAVMSCA--EIGGGSTNYEIALALDNSGSMNESAGGATKIQSLKTAATNF 173 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK 209 + P K + P Sbjct: 174 VNSMFAKSPGKVKIAITPFAGLVIPVDPTVAAN 206 >gi|256823198|ref|YP_003147161.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256796737|gb|ACV27393.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 348 Score = 73.0 bits (177), Expect = 7e-11, Method: Composition-based stats. Identities = 33/156 (21%), Positives = 60/156 (38%), Gaps = 38/156 (24%) Query: 253 TPLSNNLNEVKSRLN----KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 TPL+ +L V++ L+ L T + A + L + + Sbjct: 145 TPLTFDLKTVQTMLDETTIGLAGSSRTAIGDGIGLAVKRL----------RERDANNRVL 194 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP----------------- 351 I +TDG+N+ LN LQ E +AG+ IY++ V A Sbjct: 195 ILLTDGQNNTG-----ALNPLQAAELAEHAGITIYTIGVGADEMIVKNRFFGNRRINPSL 249 Query: 352 --EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + + L+ + G++F D++E+ E + I + Sbjct: 250 ELDEESLIAVAEKTGGRYFRARDTKEMEEIYQIIDE 285 >gi|254692857|ref|NP_081450.1| collagen, type XXII, alpha 1 [Mus musculus] Length = 1613 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 69/195 (35%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ N+ Sbjct: 54 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPGHTRVGVVRYSDRPTTAFELGHFNSRE 108 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK+ ++ NTNT A+ + ++ + G+ K+ I +TDG + Sbjct: 109 EVKAAARRITYHGGNTNTGDALRYITSRSFSA--QAGGRPGNRAFKQVAILLTDGRSQDL 166 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 G++I++V V ++ L + F V+D + Sbjct: 167 VLDAAAAAHAA--------GIRIFAVGVG--AALKEELDEIASEPKSAHVFHVSDFNAID 216 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 217 KIRGKLRRRLCENVL 231 >gi|28374313|gb|AAH45465.1| Matn1 protein [Danio rerio] Length = 507 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 47/381 (12%), Positives = 108/381 (28%), Gaps = 26/381 (6%) Query: 21 HIMYIRNQMQ---SALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKK--HLKQG 75 + ++N++ AA++ + I T + + + Sbjct: 100 YASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVAFSEAEGGRKSPDI 159 Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 S + D + I + + + L P + S Sbjct: 160 SKVAIIVTDGRPQDNIRDIAARAREAGIEIFAIGVGRVDMTTLRQMASEPLEDHVDYVES 219 Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 +IE+ ++ + C V+ + D + + + + + Sbjct: 220 YSLIEKLTKKFQEAFCAVVSDLCATGDHDCEHICISTPGSFKCACREGFTLMNDSRSCSA 279 Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK---KNLSVRIGTIAYNIGIVGNQC 252 S A + LV I +K + +G + Y+ + Sbjct: 280 CSNAATDVVFLIDGSKSVRPENFELVKKWINLIIDKLDVSETNTHVGLVQYSSTVKQEFP 339 Query: 253 TPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 N+ +K + ++N E T T A+ + + + + K I Sbjct: 340 LGRHNSKRSLKEAVKRMNYMERGTMTGHALSFLVDNSFGPNQGAR-----PGVPKVGIVF 394 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFA 369 TDG + + + G K+Y+V V +D LR+ +F Sbjct: 395 TDGRSQDYIGD--------AAKKAKALGFKMYAVGVG--NAVEDELREIASEPIADHYFY 444 Query: 370 VNDSRELLESFDKITDKIQEQ 390 D + + + K+ + ++ Sbjct: 445 TADFKTMNQIAKKLQINVCQE 465 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 51/159 (32%), Gaps = 19/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + R+G + Y + + ++K+ P T T A+ A Sbjct: 87 SVGPDATRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVA 146 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E S + K I +TDG N I R AG++I+++ V Sbjct: 147 FSEAEGG---RKSPDISKVAIIVTDGRPQD--------NIRDIAARAREAGIEIFAIGVG 195 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 LR+ V +L + F + Sbjct: 196 R--VDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 232 >gi|160894031|ref|ZP_02074810.1| hypothetical protein CLOL250_01586 [Clostridium sp. L2-50] gi|156864409|gb|EDO57840.1| hypothetical protein CLOL250_01586 [Clostridium sp. L2-50] Length = 1391 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 40/373 (10%), Positives = 111/373 (29%), Gaps = 38/373 (10%) Query: 18 DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY 77 D +++ A+D + + + + + + ++ + Sbjct: 382 DTGFYELQLSKLYLAMDDRDTA-NEHLHNVIDNSAKISDSSLLKDAIDEVVTQYNQISDD 440 Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 + A ++ + + + N ++ + N+ + Sbjct: 441 SYNAELNAAVNDMVD------KQSSQVVPVSEETINGSFNSYVATTLKYDRINIHISRID 494 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 S I+I D + D + ++ + + ++ N+ Sbjct: 495 TSAYPSIQAYININGTKDSKEELADQFT-------KEDF--TVIDTQYEITDFTLNSGAE 545 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 A + K + +A E S ++ ++Y+ Q L++ Sbjct: 546 SEAVSIGIVMDKSGSMEGAAIANAKQAATEAVEHIT-SEKMMIVSYDNEAYLEQ--SLTS 602 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 +K+ + ++ TN ++ A L EK + VI ++DG++ Sbjct: 603 RSGTLKNSIAAISDGGGTNISAGLNLALDNLEAEKG-----------SRAVILMSDGQD- 650 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSREL 376 + + + G+ +Y+V ++ + + G+F + S EL Sbjct: 651 ----GGSEEDMQAATDRAAKLGISVYTVGFG--ECDDAYMQAIAEVTGGKFVKASASTEL 704 Query: 377 LESFDKITDKIQE 389 + + + I Sbjct: 705 SDIYLYLQKYIVN 717 >gi|149624862|ref|XP_001517471.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1) [Ornithorhynchus anatinus] Length = 238 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 68/183 (37%), Gaps = 23/183 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+S+ + + ++G + Y+ + + ++K+ + K Sbjct: 32 NFELVKKFINQIVDSL-----DVSEQNAQVGLVQYSSSVRQEFPLGRFTSKRDIKAAVKK 86 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + T T A+++ + + +K I TDG + + Sbjct: 87 MTYMEKGTMTGTALNYLIDNTFAISSGAR-----PGAQKVGIVFTDGRSQDYISD----- 136 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 + ++ G K+++V V +D LR+ +F D + + + K+ Sbjct: 137 ---AAKKAKDLGFKMFAVGVG--NAVEDELREIASDPVAEHYFYTADFKTINQIGKKLQK 191 Query: 386 KIQ 388 KI Sbjct: 192 KIC 194 >gi|33602243|ref|NP_889803.1| hypothetical protein BB3267 [Bordetella bronchiseptica RB50] gi|33576682|emb|CAE33759.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 571 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 31/361 (8%), Positives = 85/361 (23%), Gaps = 35/361 (9%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 + + DQ + + + REN + + Sbjct: 66 AAQTYMPPAPHAAARAHADQAR---MRIMPAPMPVIQEDRENYAGYQDNPVQLAQEQPVS 122 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + Y L L P+ ++ L + Sbjct: 123 TFGLDVD-TGSYSNVRRLLNDGRLPPADAVRAEAFINYFDYGYPAPATPAVPFSLTTEIA 181 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK--IDVLIESA 217 N + + + +L + Sbjct: 182 PAPW-------NAQRQLLLVGIQGYRVAPQDIPAVNLVLLIDTSGSMADRAKLPLLKSAL 234 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 LV ++ R+ +AY + + ++ + ++ L +TN Sbjct: 235 RQLVTQMRAQD--------RVAIVAYAGSAGLVLPSTPGDRHAQILAAIDGLQASGSTNG 286 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 + AY E ++ +DG+ + + R Sbjct: 287 GAGLELAYAEAAKGLVKDGVNR--------IVLASDGDFNVGR--TDLAQLKDYVGSQRK 336 Query: 338 AGMKIYSVAVSAPPEGQDL-LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 G+ + ++ + + + ++ G + ++ + + F ++ + IA Sbjct: 337 RGIALTTLGLGSGNYNDAMAMQLANAGDGSYHYIDSLLQARKVF---ASELSATLLTIAK 393 Query: 397 N 397 + Sbjct: 394 D 394 >gi|261408991|ref|YP_003245232.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285454|gb|ACX67425.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 1007 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 53/331 (16%), Positives = 105/331 (31%), Gaps = 39/331 (11%) Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + A A + T+ + P I E G+I + + L Sbjct: 273 DGDEASANNAAFDFTRVEGPPNVLIVEGTPGTSGNITAALQSGMIGTEVIPPELLPLEAA 332 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN---TTK 196 + + + I +VS S + + + + + SF T Sbjct: 333 KYAVYDSIIFN----NVSGSDVGGKQMELIEQAVRSFGIGFMMAGGEDSFGMGGYFKTPI 388 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK--------------KNLSVRIGTIA 242 K P K ++ +++ K +G +A Sbjct: 389 EKALPVSMELEGKREIPSLGLILVIDRSGSMDGNKIELAKESAMRTVELMRAKDTVGVVA 448 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 ++ + EV S + + TN YPA+ A E+ Sbjct: 449 FDDQPWWVVPPQKLGDKEEVLSSIQSIPSAGGTNIYPAVSSALEEMLK----------ID 498 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 ++ +I +TDG+++ S YQ+ +T M + + SVAV +LL+ D Sbjct: 499 AQRRHIILMTDGQSAMNSGYQDLTDT------MVENKITMSSVAVGMDA-DTNLLQSLAD 551 Query: 363 SS-GQFFAVNDSRELLESFDKITDKIQEQSV 392 ++ G+++ V D L F + + + + Sbjct: 552 AAKGRYYFVEDETTLPAVFSREAVMLAKSYI 582 >gi|332977607|gb|EGK14375.1| von Willebrand factor type A domain protein [Psychrobacter sp. 1501(2011)] Length = 556 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 36/381 (9%), Positives = 103/381 (27%), Gaps = 29/381 (7%) Query: 18 DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY 77 D+++ +++ + + + + S+ + + I + Sbjct: 32 DISNANEPSSKIIQSPEVVEAAPVILVESNMSYSKMAAPTSIRAPSPMPSIGTSISNSEN 91 Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 E + + + + + + +L + Sbjct: 92 YAEIERNAVHATHEQAFATLSIDTDTGSYANVRRFLNNGSLPPTDAVRVEELINYFNYDF 151 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 + N + + S + K P F Sbjct: 152 KNAKKQGNAPFLVTTEMVKSPWHATNRIVKVGIKAEDVLAAKQNQPAANLVFLVD----- 206 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 ++ K+ + S L ++ I I Y N Sbjct: 207 --VSGSMNSDDKLQLAKASLKMLTKQLRAQDT--------ITLITYAGNTEVVLPATSGN 256 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 ++ + ++ L+ +TN A+ AY++ + ++ +TDG+ + Sbjct: 257 QTQKILNAIDNLSANGSTNGEAAIKLAYQQAEENFKKQGINR--------ILMLTDGDFN 308 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N + L I R+ G+ + ++ ++ + D+ G + ++ E Sbjct: 309 VGV--SNVKDMLDIIRNNRDKGISLSTLGFGQGNYNDHMMEQVADNGNGNYSYIDSLSEA 366 Query: 377 LESFDKITDKIQEQSVRIAPN 397 + + D++ +A + Sbjct: 367 KKV---LIDEMSSTFNTVAKD 384 >gi|332185455|ref|ZP_08387203.1| hypothetical protein SUS17_655 [Sphingomonas sp. S17] gi|332014433|gb|EGI56490.1| hypothetical protein SUS17_655 [Sphingomonas sp. S17] Length = 530 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 45/397 (11%), Positives = 96/397 (24%), Gaps = 55/397 (13%) Query: 32 ALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ-KAQ 90 AL +AVL+ + + + + + + N Q Sbjct: 153 ALRSAVLALYDQLAPVQNQLAASGMRLRYGVVPYASAVNIGAAIRAANPNYMLSGSWTYQ 212 Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 +N Y+ + +P A S ++ + + Sbjct: 213 SRQVVTENMTSWSCGYRSGSYDFGSGICTYFRYLPRAFDTSKYVSGASVDVAELVGTAAY 272 Query: 151 CMVLDVSRSMEDLYLQK---HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN 207 V S + ++SN +PP T P Sbjct: 273 NGVTPTRSSANKVTWAGCVEERQTARVSSNDTAIPPGATDLDIDLIPTNDATKWKPYWPE 332 Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + + + + +++K LN Sbjct: 333 VEYASYQVEI------------------YKNDPYKPQFACPSPAASMQGWSRDDLKKYLN 374 Query: 268 KLNPYENTNTYPAMHHAYRELYNEK---ESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L P T M R + ++ +KK++IF+TDG + Sbjct: 375 TLTPDGGTYHDNGMMWGARWASSGGIFGGNNPEKYNMMPVKKYIIFMTDGLFETGYSRLY 434 Query: 325 TLNTLQ----------------------------ICEYMRNAGMKIYSVAVSAPPEGQDL 356 + ++ +C ++ G I+ + + + Sbjct: 435 SSYGVEQLDARATPGGAYSNQDDQLARHKQRFNLLCSKAKSMGYSIWVLGFATTLDAS-- 492 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L C + Q ++ L+ F +I I + Sbjct: 493 LTNCASTPSQASTSSNQAALMARFVEIGKNIGALRLT 529 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 42/255 (16%), Positives = 70/255 (27%), Gaps = 30/255 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + + IDL R QMQ A DAAVL+G ++ + D Sbjct: 2 LWALFLIPLVALVGSGIDLGTRYVTRKQMQIACDAAVLAGRRAMTNGI-------VDDGV 54 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 K + +QG + + A + T N T L Sbjct: 55 RAEATKFFNFNFQQGMFGSKPFTPSISSATTSKTTVVINA------------ATTVPTSL 102 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH-------NDNNN 173 + S +S+ S + + I VLD + SM D + Sbjct: 103 MRIFGSDELPVSVSCNA----SQDFVNTDIVFVLDTTGSMRDKATSSDSQTKIEALRSAV 158 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + L P + + A A L S ++ Sbjct: 159 LALYDQLAPVQNQLAASGMRLRYGVVPYASAVNIGAAIRAANPNYMLSGSWTYQSRQVVT 218 Query: 234 LSVRIGTIAYNIGIV 248 ++ + Y G Sbjct: 219 ENMTSWSCGYRSGSY 233 >gi|115379116|ref|ZP_01466240.1| von Willebrand factor type A domain, putative [Stigmatella aurantiaca DW4/3-1] gi|115363899|gb|EAU63010.1| von Willebrand factor type A domain, putative [Stigmatella aurantiaca DW4/3-1] Length = 284 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 30/209 (14%), Positives = 68/209 (32%), Gaps = 48/209 (22%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL-NEVKSRL 266 ++ V E ++ ++ RIG + + L + EV ++ Sbjct: 61 NRLHVAKEVLAEFIS---------NRVNDRIGLVVFAGAAYTQAPLTLDYGVVREVLKQI 111 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + T A+ + L S + V+ ITDG+N+ + L Sbjct: 112 RTRVLEDGTAIGDALATSLNRL----------RDSEAKSRVVVLITDGDNNAGKI--SPL 159 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPE----------------------GQDLLRKCTD-S 363 + + E ++ + IY++ V + +LL+ + Sbjct: 160 DAASMAESLK---IPIYTILVGKGGKVPFPQGQDLFGNTVWRDTEIPINPELLQDIASRT 216 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQSV 392 G+++ D L + K+ D ++ + Sbjct: 217 GGEYYRATDPEGLKQGLQKVLDSLERSKL 245 >gi|73669697|ref|YP_305712.1| BatA [Methanosarcina barkeri str. Fusaro] gi|72396859|gb|AAZ71132.1| BatA [Methanosarcina barkeri str. Fusaro] Length = 317 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 31/223 (13%), Positives = 71/223 (31%), Gaps = 43/223 (19%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + S A +++ SA L+NS++ + + Sbjct: 87 EGVNVVLVMDVSGSMQAQDYTPSRLEAAKSSAEILINSLKSKDYAG--------IVTFES 138 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 G LS +V +L + P +T + + Sbjct: 139 GATTAAY--LSPYKEKVIEKLRNVAPKEGSTAIGDGLSLGIDMASS----------IPNK 186 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------ 352 KK +I ++DG N+ + +Y + +++Y++ + + Sbjct: 187 KKVIILLSDGVNNAGYISPDE-----AIQYAKANNIQVYTIGMGSNGNVLLGYDWFGNPQ 241 Query: 353 ----GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L+ D+ G++F D + L E + I++ I+ + Sbjct: 242 YAELDEATLQAIANDTGGKYFKSIDDKTLDEIYKNISENIKRE 284 >gi|332982109|ref|YP_004463550.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] gi|332699787|gb|AEE96728.1| von Willebrand factor type A [Mahella australiensis 50-1 BON] Length = 948 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 76/237 (32%), Gaps = 25/237 (10%) Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 + M Y+ + KS K+++ E+ Sbjct: 374 NSYMLGGYMGTQLEKMLPVDMDLSKKADIPSLGLVLVIDKSGSMTDGQYGITKLEMAKEA 433 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 A +++ I ++ +++L E++ + + P TN Sbjct: 434 AIRSTEALRPTDSVG--------VICFDDAASWVVGMRQADDLAEIQDSIGTIRPGGGTN 485 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 YPA+ AY+ L K +I +TDG+++ I M Sbjct: 486 MYPALDLAYKALEEA----------DTKLKHIIVLTDGQSATGDFD-------GIAHRMA 528 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G+ + SVAV + L R +G+++ ++ + + K T + ++ Sbjct: 529 EDGITLSSVAVGMDADKNLLSRLAEIGNGRYYYTDEFSNIPKILTKETYLATQSYLQ 585 >gi|294664114|ref|ZP_06729507.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] gi|292606114|gb|EFF49372.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 10535] Length = 451 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 66/206 (32%), Gaps = 48/206 (23%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + ++ G TPL+ +L Sbjct: 230 VLGGKVVDRLTAAKAVLSDFLDR-----------RDGDRVGLLVFGQRAYALTPLTADLT 278 Query: 261 EVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V+ +L T A+ + + L +K+ ++ V+ +TDG N+ Sbjct: 279 SVRDQLRDSVVGLAGRETAIGDAIALSVKRLREQKQG----------QRVVVLLTDGVNT 328 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL-----------------LRKC 360 LN L+ E + G++++++A G L LRK Sbjct: 329 AGV-----LNPLKAAELAKAEGVRVHTIAFG-GSGGYSLFGVPIPAGGDDDIDEDGLRKI 382 Query: 361 TD-SSGQFFAVNDSRELLESFDKITD 385 + G+FF D+ EL + ++ Sbjct: 383 AQQTGGRFFRARDTEELAGIYAELDR 408 >gi|294627092|ref|ZP_06705680.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] gi|292598525|gb|EFF42674.1| conserved hypothetical protein [Xanthomonas fuscans subsp. aurantifolii str. ICPB 11122] Length = 451 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 33/206 (16%), Positives = 66/206 (32%), Gaps = 48/206 (23%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ + ++ G TPL+ +L Sbjct: 230 VLGGKVVDRLTAAKAVLSDFLDR-----------RDGDRVGLLVFGQRAYALTPLTADLT 278 Query: 261 EVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V+ +L T A+ + + L +K+ ++ V+ +TDG N+ Sbjct: 279 SVRDQLRDSVVGLAGRETAIGDAIALSVKRLREQKQG----------QRVVVLLTDGVNT 328 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL-----------------LRKC 360 LN L+ E + G++++++A G L LRK Sbjct: 329 AGV-----LNPLKAAELAKAEGVRVHTIAFG-GSGGYSLFGVPIPAGGDDDIDEDGLRKI 382 Query: 361 TD-SSGQFFAVNDSRELLESFDKITD 385 + G+FF D+ EL + ++ Sbjct: 383 AQQTGGRFFRARDTEELAGIYAELDR 408 >gi|153792263|ref|NP_001093210.1| matrilin 1 [Danio rerio] gi|148726249|emb|CAN88321.1| matrilin 1 [Danio rerio] gi|148726497|emb|CAN88267.1| matrilin 1 [Danio rerio] Length = 489 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 45/364 (12%), Positives = 102/364 (28%), Gaps = 23/364 (6%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKK--HLKQGSYIRENAGDIAQKAQIN 92 AA++ + I T + + + S + D + I Sbjct: 99 AALVKAVSKIEPLSTGTMTGLAIQFAMNVAFSEAEGGRKSPDISKVAIIVTDGRPQDNIR 158 Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + + + L P + S +IE+ ++ + C Sbjct: 159 DIAARAREAGIEIFAIGVGRVDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQEAFCA 218 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 V+ + D + + + + + S A Sbjct: 219 VVSDLCATGDHDCEHICISTPGSFKCACREGFTLMNDSRSCSACSNAATDVVFLIDGSKS 278 Query: 213 LIESAGNLVNSIQKAIQEK---KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + LV I +K + +G + Y+ + N+ +K + ++ Sbjct: 279 VRPENFELVKKWINLIIDKLDVSETNTHVGLVQYSSTVKQEFPLGRHNSKRSLKEAVKRM 338 Query: 270 NPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + E T T A+ + + + + K I TDG + Sbjct: 339 DYMERGTMTGHALSFLVDNSFGPNQGAR-----PGVPKVGIVFTDGRSQDYIGD------ 387 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDK 386 + + G K+Y+V V +D LR+ +F D + + + K+ Sbjct: 388 --AAKKAKALGFKMYAVGVG--NAVEDELREIASEPIADHYFYTADFKTMNQIAKKLQIN 443 Query: 387 IQEQ 390 + ++ Sbjct: 444 VCQE 447 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 51/159 (32%), Gaps = 19/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + R+G + Y + + ++K+ P T T A+ A Sbjct: 69 SVGPDATRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVA 128 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E S + K I +TDG N I R AG++I+++ V Sbjct: 129 FSEAEGG---RKSPDISKVAIIVTDGRPQD--------NIRDIAARAREAGIEIFAIGVG 177 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 LR+ V +L + F + Sbjct: 178 R--VDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 >gi|238755460|ref|ZP_04616800.1| hypothetical protein yruck0001_3370 [Yersinia ruckeri ATCC 29473] gi|238706301|gb|EEP98678.1| hypothetical protein yruck0001_3370 [Yersinia ruckeri ATCC 29473] Length = 465 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 48/428 (11%), Positives = 118/428 (27%), Gaps = 72/428 (16%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 I + I D ++ + ++ AL+ L+ A + + + + Sbjct: 39 ISLPFFIAIIMLLFDFTQLINNKIKLSDALEQGALALTAENNAKND--------TRNNEL 90 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 I +L + + Q QY ++ E PT F L Sbjct: 91 ISAYINFYLGHRHQLTQYNNITVNYQQNPDRLYHTQLSQYHIDANI--EQPTLFPFTSLL 148 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY------------------- 164 I + + I + A+ + V D S SME + Sbjct: 149 IDHDNFIIGGSAAAIKDV----PAMDVVFVTDFSGSMEGDFHNPDDPEVLSKLDELKRIF 204 Query: 165 --LQKHNDNNNMTSNKYLLPPPPKKSFW-----------------SKNTTKSKYAPAPAP 205 + N + P + ++ Sbjct: 205 FKIADDIYTANKD-STISFSPFSWGTKSADNKKCSLHFMPKEKNKIYPIPSNEIERNTEA 263 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS--NNLNEVK 263 K + I + + +I+ + + + + + + + + P+S +++++ Sbjct: 264 HQEKYMIAITENIDYLATIENIGTNNEKIVIPLDHVHDELCLYSSNAYPISLAKDIDDL- 322 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + + +N +T + L + +HN + T + Sbjct: 323 NVIKTMNEGGSTLVSSGIMQGADLLMDG--VNHNKLMIILSDGHDYPTTAVVHDKTITSA 380 Query: 324 NTLNTL----------QICEYMRNAGMKIYSVAVSAPPEGQDLL---RKCTDSSGQFFAV 370 + +C+ +R +I + + P + C D+ F+ Sbjct: 381 KEVRVNVDISRQLVQHGMCKKIRETVGRIVFIGIGYNPSANHYINWAEDCVDTE-NFYLA 439 Query: 371 NDSRELLE 378 +++EL + Sbjct: 440 MNTKELED 447 >gi|254458660|ref|ZP_05072084.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] gi|207084426|gb|EDZ61714.1| von Willebrand factor, type A [Campylobacterales bacterium GD 1] Length = 308 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 30/189 (15%), Positives = 64/189 (33%), Gaps = 19/189 (10%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 A K + +++ + + G +PL+ + N Sbjct: 92 ASESMKAKGFDEKNRDLTRFDVVKEIVSNFISSRKNDNMGIVVFGAYSFIASPLTYDSNI 151 Query: 262 VKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 +K ++ L + T + ++ L S K I +TDG N+ Sbjct: 152 LKGVVSNLYIGMAGKFTALFESLAQGVNLL----------KTSKSKTKIAILLTDGYNTP 201 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVS-APPEGQDLLRKCTD-SSGQFFAVNDSREL 376 S + ++ G+K+Y + + + Q +L K + + G F +++ EL Sbjct: 202 DSEFPFD----AAIDFANKQGVKVYPIGIGKSDEYNQKMLEKIAEQTGGVAFGASNASEL 257 Query: 377 LESFDKITD 385 + KI + Sbjct: 258 AIVYAKINE 266 >gi|301168170|emb|CBW27759.1| hypothetical protein BMS_2997 [Bacteriovorax marinus SJ] Length = 605 Score = 73.0 bits (177), Expect = 8e-11, Method: Composition-based stats. Identities = 33/322 (10%), Positives = 87/322 (27%), Gaps = 29/322 (9%) Query: 63 IFKKQIKKHLKQGSYIR-ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + + E I + ++ + + A Y + Sbjct: 127 FADNFSDSQVHSTNLLSREKYDHIEASSYSRVSDNPLTTFSIDVD-TASYANVRRFINNG 185 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + + + + + + + L Sbjct: 186 VRPNKGAVRVEELINYFSYDYTFSNPEHPIDL-------KLDLTNSPWNKDRKVVRVALK 238 Query: 182 PPPP-KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 P SKN + K+ +L ES L+ +++ ++ Sbjct: 239 ADTPKTAINSSKNLVFLLDVSGSMSSPNKLPLLKESIKLLLRNLK--------GDDKVSI 290 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + Y ++ ++ LN+L +TN + AY+ E + Sbjct: 291 VVYAGSSGVVLEPTSVSDKVKIHKALNQLQSGGSTNGGAGIVAAYKLAEEEFIKNGVNR- 349 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 VI TDG+ + + + L + + + + + + + LL + Sbjct: 350 -------VILATDGDFNVGTTSRYEL--VDLIQEKAKKNIYLTVLGLGMGNYSDSLLEEI 400 Query: 361 TDSS-GQFFAVNDSRELLESFD 381 ++ G + ++ E + + Sbjct: 401 SNKGNGNYAYIDSLSEANKILN 422 >gi|332534652|ref|ZP_08410484.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] gi|332035932|gb|EGI72413.1| protein BatA [Pseudoalteromonas haloplanktis ANT/505] Length = 328 Score = 73.0 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 37/292 (12%), Positives = 86/292 (29%), Gaps = 52/292 (17%) Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 P + L + +L S I + + + L + + + + Sbjct: 24 PAASNASTRLRIPSFAKHNLTSQSIEPHARRLNPLEWIIWLLLVTAAANPTWLDEPISLP 83 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 +L S + ++ ++ + + Sbjct: 84 NEGRDIMLAVDLSGSM------TEQDMAYNGQYVDRLTMVKAVLSDFIEQ---------R 128 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN--KLNPYE-NTNTYPAMHHAYRELYN 290 R+G I + TPL+ ++ V L+ ++ T A+ + + Sbjct: 129 QGDRLGLILFGDTAFLQ--TPLTRDVKTVSKMLSEAQIGLVGRATAIGDALGLSVKRF-- 184 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + V+ +TDG+N+ + LN R G+K+Y++ V + Sbjct: 185 --------ANKDESNRIVVLLTDGQNTAGN-----LNPEDALLLAREEGIKVYTIGVGSD 231 Query: 351 PE----------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + LL+K + + G +F D L + + ++ Sbjct: 232 NPRGFSLFNVGGSSGSNLDESLLKKIAEQTGGLYFRAKDVAGLQQIYAELDK 283 >gi|225449026|ref|XP_002273050.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 710 Score = 73.0 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 18/185 (9%), Positives = 56/185 (30%), Gaps = 24/185 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ +L + G ++ ++ + + + + + + + +N Sbjct: 276 TKLALLKRAMGFVIQNLGSSDRLSVIAFSSTARRLFPLRRMTDA------GRQQALQAVN 329 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG---ASAYQN 324 L TN + + + + +I ++DG+++ S+ Sbjct: 330 SLVANGGTNIAEGLRKGAKVMED--------RKERNPVSSIILLSDGQDTYTVNGSSGNQ 381 Query: 325 TLNTLQICEYMRNAG-------MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 Q+ + G + ++S + + S G F + + Sbjct: 382 PQPNYQLLLPLSMHGSQNTGFQIPVHSFGFGTDHDASSMHTISEISGGTFSFIETESVIQ 441 Query: 378 ESFDK 382 ++F + Sbjct: 442 DAFAQ 446 >gi|170079457|ref|YP_001736094.1| von Willebrand factor type A domain-containing protein [Synechococcus sp. PCC 7002] gi|169887126|gb|ACB00840.1| Protein containing von Willebrand factor (vWF) type A domain [Synechococcus sp. PCC 7002] Length = 545 Score = 73.0 bits (177), Expect = 9e-11, Method: Composition-based stats. Identities = 36/370 (9%), Positives = 96/370 (25%), Gaps = 42/370 (11%) Query: 30 QSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKA 89 Q D + + + ++ T + ++ + Sbjct: 37 QRTSDTGMDAMAPQVAETESMPVMPTPDGEIPQEVPDLSQEDYNLIRDNPFQLVRTEPLS 96 Query: 90 QINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAIS 149 + D Y L L P+ L Sbjct: 97 TFALDVDSGA-----------YSNVRRFLDDGQLPPADAVRLEEMINYFTYDYPAPDNQP 145 Query: 150 ICMVLDVSRSM-EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + ++S++ + + LPP + Sbjct: 146 FAINTELSQAPWQPQHQLLRIGIKGQEIENEALPPSNLVFLFD--------VSGSMNDPD 197 Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K+ +L + LVN ++ R+ + Y + + + L+ Sbjct: 198 KLPLLKSAFRLLVNELRPED--------RVSIVVYAGAAGLVLPSTSGAEKETILAALDN 249 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L +T + AY+E + + N +I TDG+ + + Sbjct: 250 LEAGGSTAGGEGIELAYQEAADNFLDNGNNR--------IILATDGDFNVG--MSSDAEL 299 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKI 387 +++ E R + + + + + ++ G + +++ E + + ++ Sbjct: 300 IRLIEQKREQDIFLTVLGFGTGNLKDAKMEQLANNGNGNYAYIDNILEAKKV---LVTEM 356 Query: 388 QEQSVRIAPN 397 + +A + Sbjct: 357 GGTLLTLAKD 366 >gi|307292639|ref|ZP_07572485.1| hypothetical protein SphchDRAFT_0111 [Sphingobium chlorophenolicum L-1] gi|306880705|gb|EFN11921.1| hypothetical protein SphchDRAFT_0111 [Sphingobium chlorophenolicum L-1] Length = 540 Score = 72.6 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 48/388 (12%), Positives = 106/388 (27%), Gaps = 37/388 (9%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 VL S+ D S + + + +++ E +++ + Sbjct: 156 VLDTTGSMACKPERNDSDCSTWAGSRYVTQWVAGLGRDATFVPEEMNSGVNVSRMQGLRT 215 Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI--CMVL 154 LQ + + + + + A+ S + S + Sbjct: 216 ALANLQSQM-ATIETQFNMTEESKRKRVRWAIVPFSQMVNAGFSQGSAGTTLYSRHSDWF 274 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + + + N ++ T ++ S T + P D L Sbjct: 275 NRTGKYYNSGYIYDNPTHSDTWLANTWDGCVEERRTSNAITLTSGHSIPNNLPNTADDLK 334 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + ++ + + + S + + + + N + N P Sbjct: 335 FDSTPTDSNTRWTVADPTRASGQYACP----KAMRELQQMTATDFNNYFTFNNGFIPNGG 390 Query: 275 TNTYPAMHHAYRELYNE---KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A R L + + + + ++VIF+TDG S S+ Sbjct: 391 TWLDVGLLWAARLLSRDGLWSTENDELYHTYPVSRYVIFMTDGYMSIGSSNYAAYAQEDY 450 Query: 332 ----------------------CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS-----S 364 C ++N KIY+++ A L C+ S Sbjct: 451 WRRVAAAGASKNDNHYARMLMTCTAIKNMDTKIYTISFGAGSTLDSNLINCSSSTNTTNP 510 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQSV 392 + + S +L F I + I + Sbjct: 511 EFAYKADSSSDLNRVFRDIGENIGSLRL 538 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 34/232 (14%), Positives = 72/232 (31%), Gaps = 25/232 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A + + A+D++ + ++Q A DA VL+G + S + D Sbjct: 26 IVAAAMLPLAGMVGGALDISRGYLAKTRLQQACDAGVLAGRKVMGSSGVLSDS------V 79 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +K + + G A NP + A + Sbjct: 80 RDEVRKYVSFNYPSGYLGSTLA------------TTDINPTLGSNDQIALSLTTAIPTAV 127 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L +++ T + S I I +VLD + SM + +D + ++Y+ Sbjct: 128 MRLFGRNNMSITASCTARNDYS----NIDIVLVLDTTGSMACKPERNDSDCSTWAGSRYV 183 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 + ++ T ++ L + NL + + + Sbjct: 184 ---TQWVAGLGRDATFVPEEMNSGVNVSRMQGLRTALANLQSQMATIETQFN 232 >gi|159027742|emb|CAO89612.1| unnamed protein product [Microcystis aeruginosa PCC 7806] Length = 416 Score = 72.6 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 23/140 (16%), Positives = 54/140 (38%), Gaps = 15/140 (10%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 ++L ++S++ +L T + +E ++ GS + +TDGE Sbjct: 99 DDLTLIRSKIQQLRAGGGTAIDEGIKLGIQE---------SSSGSKGYVSHIFLLTDGE- 148 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + + + L++ G+ + + QD+L K D + G + + Sbjct: 149 ---NEHGDNQRCLKLAAVAAEYGITLNTFGFG-DHWNQDILEKIADIAGGSLSYIERPEQ 204 Query: 376 LLESFDKITDKIQEQSVRIA 395 L F ++ +++Q + A Sbjct: 205 ALIEFTRLFNRLQSVRLTNA 224 >gi|47219514|emb|CAG09868.1| unnamed protein product [Tetraodon nigroviridis] Length = 1450 Score = 72.6 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 31/183 (16%), Positives = 62/183 (33%), Gaps = 18/183 (9%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENT 275 V S+ A + +++ + Y+ N V S L + NT Sbjct: 714 VIQFVTSMIGAFEVISPNGMQVSLVQYSDDAKTEFKLNTYYNKGIVISALKSVRYRGGNT 773 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T A+ H Y +++ ++ + K ++ +TDG + + + E + Sbjct: 774 KTGIALKHVYEKVFTS-----DSGMRRNVPKVLVVLTDGRSQD--------DVKKSAEKL 820 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVR 393 +++G ++ V V LR F V+D + D + I E + Sbjct: 821 QHSGYSVFVVGV--ADVDMTELRIIGSKPSERHVFVVDDYDAFAKIQDNLITFICETATS 878 Query: 394 IAP 396 P Sbjct: 879 TCP 881 >gi|291388471|ref|XP_002710799.1| PREDICTED: collagen, type XIV, alpha 1 [Oryctolagus cuniculus] Length = 1796 Score = 72.6 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 65/201 (32%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 1037 VDGSWSIGDDNFNKIINFLYSTVGALDKIGTDGT----QVAMVQFTDDPRTEFKLNAYET 1092 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + +++ NT T A+ H L+ + + + K ++ ITDG + Sbjct: 1093 KETLLDAIKRISYKGGNTKTGKAIKHVRDTLFTAESGTRR-----GIPKVIVVITDGRSQ 1147 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M++ G I++V V L F V+D Sbjct: 1148 DDVN--------KISREMQSDGYNIFAVGV--ADADYSELVNIGSKPSARHVFFVDDF-- 1195 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1196 --DAFKKIEDELITFVCETAS 1214 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 56/168 (33%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + R+G Y+ + +EV + L NT T A+++ + Sbjct: 186 TAFNVGSEKTRVGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIF 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + T + K I ITDG++ + +R +G++++++ Sbjct: 246 ENSFK-----PEAGSRTGVPKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V + + + +T + + Sbjct: 293 GV--KNADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 338 >gi|327402260|ref|YP_004343098.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327317768|gb|AEA42260.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 473 Score = 72.6 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 33/240 (13%), Positives = 74/240 (30%), Gaps = 29/240 (12%) Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNK----YLLPPPPKKSFWSKNTTKSKYAPAPAPANRK 209 LD + + + + D+ + F N + K Sbjct: 249 LDAEKKLNEQMNNEQVDSITEKIIPELAAIPSDNFDESYFKDVNVIFVIDISSSMKNGEK 308 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ++++ S LV+ ++ + Y Q +N +KS + L Sbjct: 309 MNLMKYSLNQLVSKLRPNDDMG--------MVTYANTADVFQAPTSGSNKESLKSSITSL 360 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 P T + Y+E+ + + VI ITDG + S Sbjct: 361 KPSGMTAGGKGIKLGYKEVMKNYDPA--------KANMVIIITDGAFNKDSDDYQKTVQK 412 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ 388 G+ + + LL++ +G++ ++L+++ +T++I+ Sbjct: 413 YA-----KKGVVFSVLGIETRERDAKLLQEAAAFGNGRYV---SIQKLVDAHSNLTEEIR 464 >gi|193786838|dbj|BAG52161.1| unnamed protein product [Homo sapiens] Length = 496 Score = 72.6 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 42/378 (11%), Positives = 114/378 (30%), Gaps = 52/378 (13%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKN 98 + VS + + ST + + G ++ I+ I Sbjct: 96 RAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKAFGDAEGGRSRSPDISKVVIVVTDG 155 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 P + + A+ LF G+ L ++ + + S ++ +SR Sbjct: 156 RPQDSVQDVSARARASGVELFAIGVGSVDKATLRQIASEPQD-EHVDYVESYSVIEKLSR 214 Query: 159 SMEDLY----LQKHNDNNNMTSNKYLLPPPPKKSF---WSKNTTKSKYAPAPAPANR--- 208 ++ + +++ P + ++ N+ Sbjct: 215 KFQEAFCVVSDLCATGDHDCEQVCISSPGSYTCACHEGFTLNSDGKTCNVCSGGGGSSAT 274 Query: 209 ---------------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 +++ + +V+++ + + ++G + Y+ + Sbjct: 275 DLVFLIDGSKSVRPENFELVKKFISQIVDTL-----DVSDKLAQVGLVQYSSSVRQEFPL 329 Query: 254 PLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + ++K+ + ++ ++T T A+ + + + +K I T Sbjct: 330 GRFHTKKDIKAAVRNMSYMEKSTMTGAALKYLIDNSFTVSSGAR-----PGAQKVGIVFT 384 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAV 370 DG + + ++ G K+++V V +D LR+ +F Sbjct: 385 DGRSQDYIND--------AAKKAKDLGFKMFAVGVG--NAVEDELREIASEPVAEHYFYT 434 Query: 371 NDSRELLESFDKITDKIQ 388 D + + + K+ KI Sbjct: 435 ADFKTINQIGKKLQKKIC 452 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 53/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + ++ P T T A+ A + Sbjct: 72 DVGPNATRVGMVNYASTVKQEFSLRAHVSKAALLQAVRRIQPLSTGTMTGLAIQFAITKA 131 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ S + K VI +TDG + + R +G++++++ V Sbjct: 132 F--GDAEGGRSRSPDISKVVIVVTDGRPQDS--------VQDVSARARASGVELFAIGVG 181 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L F + Sbjct: 182 --SVDKATLRQIASEPQDEHVDYVESYSVIEKLSRKFQE 218 >gi|39933805|ref|NP_946081.1| hypothetical protein RPA0728 [Rhodopseudomonas palustris CGA009] gi|39647652|emb|CAE26172.1| conserved unknown protein [Rhodopseudomonas palustris CGA009] Length = 468 Score = 72.6 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 55/468 (11%), Positives = 110/468 (23%), Gaps = 93/468 (19%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+++ + +D + + Q+ +D Sbjct: 20 IAALVMIPIIFLLGMTLDFTQALRKKQQL-------------DAAADAAAIAAVRPAMLM 66 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 T Q + S A + IT K Y + N F Sbjct: 67 QTDAVAQNTAYAIFMSTANRLASGLTSVPTPTITITDVGL---QRTVKVSYNAASLNNFP 123 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS---- 176 + L+ + +S ST ++ +++D S SM N ++ Sbjct: 124 QLLMNNVSWAISGASTAQAS---SAPNMNFYLLMDDSPSMGIGATATDISNLIASTAPKY 180 Query: 177 ------NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 T + +ID++ + L+N+ Q Sbjct: 181 QKASSSQNCGFACHETNIAHDGGTKDNLAIARANNITLRIDLVTSAVNQLLNTWSNCPQS 240 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN---------------- 274 + V A N LN + S Sbjct: 241 GVSGGVMQCMSALNNTTYRAALYTFDLGLNTLASLTTPTTAGAQVSNIALMPVAYQNCVV 300 Query: 275 ------TNTYPAMHHAYRELYN--EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 T+ + A L + ++ V +TDG S+ Sbjct: 301 PTTNCKTDNGTDIAGALTSLNGIMPSPGLGSNASGDTPQEVVFLVTDGVEDKISSSCPNG 360 Query: 327 NT-----------LQICEYMRNAGMKI---YSVAVSA----------------PPEGQ-- 354 + IC ++ G+KI Y+ + + Sbjct: 361 SYASYSRCQQPLDTAICTTIKKRGIKIAILYTEYLQLKTPNVPVTDTWYMSWIDAYDEPT 420 Query: 355 -------DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 L+ C G F V + ++ + K+ + + Sbjct: 421 SSTGAIAKNLQACAS-PGFFSNVQTGGNITQALTDLFLKVASSTASLT 467 >gi|319956032|ref|YP_004167295.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319418436|gb|ADV45546.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 306 Score = 72.6 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 75/212 (35%), Gaps = 37/212 (17%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S ++ P K DV+ E G+ ++ + + RIG I + Sbjct: 91 SSDSMNQWGFDPGDPNKSKFDVVKEVVGDFIDKRK---------NDRIGLINFASVAFVA 141 Query: 251 QCTPLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 + ++ ++ + T A+ Y L S K Sbjct: 142 SPLTFEKD---FLRKILQMQEPGIAGKRTAINDALLQTYNILSK----------SDAKSK 188 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA-PPEGQDLLRKCTDSS- 364 I +TDG ++ + + + + ++ +K+Y++ + + L+ + Sbjct: 189 IAILLTDGIDNASRISFDEIR-----RLISDSDIKLYTIGIGSYRDFDAPYLKALAQAGH 243 Query: 365 GQFFAVNDSRELLESFDKITD----KIQEQSV 392 G+FFA +D R L + ++ I KI+ + V Sbjct: 244 GRFFAASDRRSLQKIYEAIDRLETSKIKSKRV 275 >gi|326677363|ref|XP_691404.5| PREDICTED: collagen alpha-1(XII) chain-like [Danio rerio] Length = 3085 Score = 72.6 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 19/165 (11%), Positives = 57/165 (34%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + R+G + Y+ ++ +N L NT T A+ + + + Sbjct: 150 DLGEDKTRVGVVQYSTDTRTEFNLNQHFRRVDLLRAINNLPYKGGNTMTGEALDYLLKNM 209 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E + + + ITDG++ + ++NAG++++++ + Sbjct: 210 FTEAAGAR-----KGFPRVAVVITDGKSQDP--------VEGYAKKLKNAGVELFTLGI- 255 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQS 391 ++ L++ + + + V + + ++ Sbjct: 256 -KEADEEELKQMSSTPYRTHVYTVPNFDMIKAVEKSFIAQVCSSV 299 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 43/331 (12%), Positives = 98/331 (29%), Gaps = 29/331 (8%) Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 ++ + L R K +T D + +L + Sbjct: 1057 ARQGVGTTLSPYKAPRNLQTSEPTKTSFRVTWDPAPGDVRGYKVTFHPSENDIDLGELLV 1116 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 P T + + ++++ + D +SM +K ++ S Sbjct: 1117 GPYDNTVVLEELRAGTKY-----SVAVFGMFDGGQSMPLAGEEKTTLSDAPDSPPVKYSG 1171 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 K+ + + + + G +V + V+IG Y Sbjct: 1172 NECKTSAKADIVL-LVDGSWSIGRLNFKTIRAFIGRMVGVF-----DIGPDKVQIGLAQY 1225 Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGST 302 + + + L NT T A+++ + N Sbjct: 1226 SGDPKTEWHLNAHPTRASLLDAVANLPYKGGNTMTGMALNYILQ-----NNFRPNVGMRP 1280 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +K + +TDG++ + + +R++G+++Y++ V ++ LR Sbjct: 1281 DSRKIGVLVTDGKSQDEIVVNS--------QRLRDSGIELYAIGV--KNADENELRSIAT 1330 Query: 363 SSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + VND LL+ D +T+ + Sbjct: 1331 DPDEIHMYNVNDFSFLLDIVDDLTENLCNSV 1361 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 38/327 (11%), Positives = 98/327 (29%), Gaps = 39/327 (11%) Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESK----AQYEIPTENLFLKGLIPSALTNL 131 + + + + + +T+ + ++ ++ Y++ + Sbjct: 304 NSLASGEEVVEPASNLQVTEVASKSMRVTWDASIGEVTGYKVQMVPMLAGSKRQELYVGP 363 Query: 132 SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS 191 + S + + S + L E + + ++ L Sbjct: 364 TQTSVNVRDLSPDTEYEISLFALKGLTPSEAVMAMEKTQPLKVSLECSLGVDVQADIVL- 422 Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 + + + LVN+ + V+I + Y+ Sbjct: 423 ------LVDGSYSIGITNFAKVRAFLEVLVNTF-----DIGPNKVQISLVQYSRDPYTEF 471 Query: 252 CTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 ++LN V + TNT AM + ++ + + + I Sbjct: 472 YLNTHHDLNAVVKAVRTFPYRGGSTNTGKAMTYVRERIFIATRGAR-----ENVPRVTIL 526 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFF 368 ITDG+ ++ +RN ++I++V V + L + + Sbjct: 527 ITDGK--------SSDAFKDPAAKLRNTDVEIFAVGV--KDAVRSELEAIANPPAETHVY 576 Query: 369 AVNDSRELLESFDKITDKIQEQS-VRI 394 V D ++F +I++++ + +RI Sbjct: 577 TVEDF----DAFQRISNELTQSICLRI 599 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 50/164 (30%), Gaps = 18/164 (10%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKES 294 ++I + Y+ + S L + NT T A+ H Y ++ + Sbjct: 2342 MQISFVQYSDDANTEFRLNTYKDKGTALSALKLIRYQGGNTKTGVALKHVYEKVITVENG 2401 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + K V+ +TDG + +++AG ++ V V Sbjct: 2402 -----MRRNVPKVVVAVTDGRSQDDVHKN--------AAKLQHAGYSVFVVGV--ADVDF 2446 Query: 355 DLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 L+ F V+D D + I E + P Sbjct: 2447 VELQNIASKPSERHVFVVDDFDAFSTIQDNLVTFICETATSSCP 2490 >gi|265766730|ref|ZP_06094559.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] gi|263253107|gb|EEZ24583.1| conserved hypothetical protein [Bacteroides sp. 2_1_16] Length = 608 Score = 72.6 bits (176), Expect = 9e-11, Method: Composition-based stats. Identities = 29/291 (9%), Positives = 79/291 (27%), Gaps = 30/291 (10%) Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 + + ++ A Y L L P+ + Sbjct: 154 QVLDTPLSTFSIDVDA-ASYSNMRRFLNKGELPPTNAIRTEELINYFSYNYARPTGNDPV 212 Query: 152 MVLDVSRSMEDLY--LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK 209 + + LP + Sbjct: 213 RITTEVGTCPWNEQHRLVRIGLKAKEIPTENLPASNLVFLID--------VSGSMYGPER 264 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 +D++ S LVN++++ ++ + Y+ + ++ +++ +++L Sbjct: 265 LDLVKSSLKLLVNNLREKD--------KVAIVIYSGAAGEKLASTPGSDKQKIREAIDEL 316 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +T + AY+ S N +I TDG+ + + L Sbjct: 317 EASGSTAGGEGIMLAYKIAQKNFISGGNNR--------IILCTDGDFNVGVSSDKEL--E 366 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLES 379 ++ E R +G+ + + + ++ + G +++ +E Sbjct: 367 KLIEQKRKSGIFLTVLGYGMGNYKDNKMQTLAEKGNGNHAYIDNLQEANRV 417 >gi|189912860|ref|YP_001964749.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|189913185|ref|YP_001964414.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|167777536|gb|ABZ95836.1| BatA [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781253|gb|ABZ99550.1| Hypothetical BatA protein; putative von Willebrand factor, type A domain containing protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 317 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 38/223 (17%), Positives = 74/223 (33%), Gaps = 30/223 (13%) Query: 171 NNNMTSNKYLLPPPPKKSFWS---KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 +KY L P K + + S ++ V + V Sbjct: 73 AAAGPGSKYKLSPDSTKGVDIMIALDISGSMVNSYDFLPRNRLSVSKDLLREFV------ 126 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 KK L RIG + + +PLS++ + L++L T++ Sbjct: 127 ---KKRLYDRIGIVVFAGAAYLQ--SPLSSD----RFALDELIA--GTSSEDIEEQGTAV 175 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 SS+ S K +I +TDG ++ +T + G+K+Y + + Sbjct: 176 GDALVLSSYRLKNSEAKSKVIILLTDGVSNTGKLDPDTAAYTT-----KTMGIKVYCIGI 230 Query: 348 SAPPEG-----QDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + L + ++++G+FF L ++I Sbjct: 231 GKEEGQYEINYESLQKISSNTNGKFFRAESPEVLESVLNEIDQ 273 >gi|189219434|ref|YP_001940075.1| hypothetical protein Minf_1423 [Methylacidiphilum infernorum V4] gi|189186292|gb|ACD83477.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Methylacidiphilum infernorum V4] Length = 334 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 57/162 (35%), Gaps = 35/162 (21%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 PL+ + N +K ++++L + T A+ A L +KES + F Sbjct: 144 TVCPLTFDHNWLKRKIDQLQAGTIEDGTAIGDALGLALSRLEGKKESGERKK----IGSF 199 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--------------- 352 +I +TDG N+ + L + ++++ E Sbjct: 200 LILLTDGANNCGNLTPIEAARLAAHAA-----VPVFTIGAGINGEVTMPVMDEERRKIGS 254 Query: 353 -------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 + LLR + G++F DS ++ +F I + Sbjct: 255 QTVVSEVDEGLLRNIAQLTGGEYFRATDSNAIVSAFQAIDAQ 296 >gi|126310411|ref|XP_001373784.1| PREDICTED: similar to collagen type XII alpha 1 [Monodelphis domestica] Length = 3116 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 57/166 (34%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ ++ + + K+ NT T A+ + Sbjct: 160 SAFDVGEGKTRVGVVQYSSDTRTEFNLNQYYQRKDLLAAIKKIPYKGGNTMTGDAIDYLI 219 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG++ +RN G++++S+ Sbjct: 220 KNTFTESAGARVGF-----PKVAIIITDGKSQD--------EVEIPARELRNIGVEVFSL 266 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 + LR+ F V + +++ ++I ++ Sbjct: 267 GI--KAADAKELRQIASPPSLKHVFNVANFDAIVDIQNEIISQVCS 310 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 50/160 (31%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ + Sbjct: 1227 RVQIALAQYSGDPRTEWQLNAHKDRRSLLDSVANLPYKGGNTLTGMALNFIRQ-----NN 1281 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1282 FRPQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDDGVELFAIGI--KNAD 1331 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 1332 ESELKMIATDPDDTHAYNVADFDSLSRIVDDLTVNLCNSV 1371 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 63/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 465 SPNRVQISLVQYSRDPHTEFTLKKFTRVEDIIEAINTFPYRGGSTNTGKAMTYVREKIFV 524 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + S + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 525 ASKGSR-----SNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 569 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 570 DAVRSELEAIASPPPETHVFTVEDF----DAFQRISFELTQSICLRI 612 >gi|48734898|gb|AAH71224.1| Matrilin 3 [Mus musculus] Length = 481 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 58/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + ++ +K + ++ P Sbjct: 97 VKTFVSRIIDTLGIGATD-----TRVAVVNYASTVKIEFQLNTYSDKQALKQAVARITPL 151 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + + + K I +TDG ++ Sbjct: 152 STGTMSGLAIQTAMEEAFTVEAGARGPMSNI--PKVAIIVTDGRPQD--------QVNEV 201 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G+++Y+V V + L+ F V +L F + Sbjct: 202 AARARASGIELYAVGV--DRADMESLKMMASKPLEEHVFYVETYGVIEKLSARFQE 255 >gi|304406204|ref|ZP_07387861.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304344788|gb|EFM10625.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 762 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 43/381 (11%), Positives = 112/381 (29%), Gaps = 46/381 (12%) Query: 16 AIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQG 75 AID ++ I+ ++ + + +K + Sbjct: 196 AIDASNYPTIKVKL----------AVEDGSEQSDLSSGQVAIKE-----NTVAQKTAEVN 240 Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN-LFLKGLIPSALTNLSLR 134 + + +I ++ T N + + ++ + + + + Sbjct: 241 ANTADKTYEIVYDTTVSNTNPPNGEQRVVDLVIGDNKLSESYKSPSQKKLHIDDVSYNTD 300 Query: 135 STGIIERSSENLAISICMVLD-VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 + + +V D + + N + +K P + Sbjct: 301 EYPKVNVYFSLYDENNQLVEDMNPVKTAFTVKEGDKETKNASFSKLTEKPQA------IS 354 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 T + K+ + ++A ++ A + + + + Sbjct: 355 TNLVIDVSDSMSEDNKLTKVKDAATQFLSHASFASNDV-------VGLMSFSDASNIRQS 407 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 + + +KS + + T Y A++ A S+ S K+V+ TD Sbjct: 408 DFTTEIESIKSSIAGMQTSGCTALYEALNQAV---------SNTAYNSVEGSKYVVVFTD 458 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV--N 371 G+ + N ++ + G+ IY++ V + L++ + + + V N Sbjct: 459 GK-NTICDGTNWVSPSTVINNALQWGVPIYAIGVEEDAD----LQQIAEQTNGQYHVLGN 513 Query: 372 DSRELLESFDKITDKIQEQSV 392 D +L + I ++Q V Sbjct: 514 DFTDLNAIYSDIYTNKKKQYV 534 >gi|108762540|ref|YP_633801.1| BatA protein [Myxococcus xanthus DK 1622] gi|108466420|gb|ABF91605.1| batA protein [Myxococcus xanthus DK 1622] Length = 336 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 30/211 (14%), Positives = 73/211 (34%), Gaps = 52/211 (24%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ V E + ++ RIG + + PL+ + +K + Sbjct: 113 NRMHVAKEVLSEFIA---------NRVNDRIGLVVFAGAAYTQA--PLTLDYGVLKEVVK 161 Query: 268 KLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 +L + T A+ + L S + V+ ITDG+N+ + Sbjct: 162 QLRTRVLEDGTAIGDALATSLNRL----------RDSEAKSRVVVLITDGDNNSGKI--S 209 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPE----------------------GQDLLRKCTD 362 +++ + + ++ + IY++ V + +L++ D Sbjct: 210 PMDSANMAQALK---VPIYTILVGKGGKVPFPQGTDLFGNTVWRDTEIPINPELMQDIAD 266 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G+++ D +L E K+ D ++ + Sbjct: 267 RTGGEYYRATDPEQLREGLQKVLDSLERSKL 297 >gi|148657117|ref|YP_001277322.1| peptidase M23B [Roseiflexus sp. RS-1] gi|148569227|gb|ABQ91372.1| peptidase M23B [Roseiflexus sp. RS-1] Length = 982 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 37/371 (9%), Positives = 105/371 (28%), Gaps = 25/371 (6%) Query: 32 ALDAAVLS--GCASIVSDRTIKDPTTKKDQ----TSTIFKKQIKKHLKQGSYIRENAGDI 85 A +A+ +S G ++ T+ + + +Q ++ G + D Sbjct: 329 ATEASAVSYNGASNANVILTLDRSGSMSTDNKMPAAHNAARQFVDLMQVGDGVGVVGFDD 388 Query: 86 AQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 +T + P E T + + + N Sbjct: 389 RVTTAFPLTVITDPPPLSSLIFTDTMESGTGKWIPDPPWGLTSVAYRGSA-AWTDSPAGN 447 Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 A + VL ++ + + + S + + + A Sbjct: 448 YANNANSVLAIADPIVLPASLTTPALSFWHRYDIENYFDYGRVEVSTDNGATWQSLAAYT 507 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKS 264 + +L + I+ + L+ I P + + + Sbjct: 508 GVN--TTWSRAVIDLSPYRGQTIRLRFRLTTNAYLTRDGWYIDDVTVGPKWVDARADAIA 565 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + L P +T+ + + + L + ++ ++DG+ + A + Sbjct: 566 AIGTLTPRGSTSIGGGLQRSQQLLSASAPGRT---------RAIVLLSDGQENTAPYVSD 616 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 + +R + + ++++ + + Q +L + G + +L ++ I+ Sbjct: 617 ------VLPQIRASQITVHTIGLGTDADQQLMLSIAAQTGGTYNYAPRPDQLAGIYNTIS 670 Query: 385 DKIQEQSVRIA 395 + + I Sbjct: 671 GAVSNRQTLIT 681 >gi|302188504|ref|ZP_07265177.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. syringae 642] Length = 352 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 52/146 (35%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ A + L + ++ Sbjct: 150 PLTYDRRTVRIWLDEARIGIAGKNTALGDAIGLALKRL----------RLRPATSRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY + + + P+ + Sbjct: 200 VTDGANNAGQ-----IDPITAARLAAEEGVKIYPIGIGSDPDKDALQSVLGLNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 255 PTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|154089854|emb|CAO81739.1| collagen type VI alpha 6 [Homo sapiens] Length = 631 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 50/355 (14%), Positives = 106/355 (29%), Gaps = 36/355 (10%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 D + K+ + + I++ ++ ++A Sbjct: 82 ADVGKNQVRFGALKYADDPEVLFYLDDFGTKLEVISVLQNDQAMGGSTYTAEALGFSDHM 141 Query: 117 NL------FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 KG+ + S + ++ A+ +L V D Sbjct: 142 FTEARGSRLNKGVPQVLIVITDGESHDADKLNATAKALRDKGIL-VLAVGIDGANPVELL 200 Query: 171 NNNMTSNKYLLPPPPKKSF--WSKNTTKSKYAPAPAPANRKIDVL---IESAGNLVNSIQ 225 +S+KY +S T + K+D++ S N + Sbjct: 201 AMAGSSDKYFFVETFGGLKGIFSDVTASVCNSSKVDCEIDKVDLVFLMDGSTSIQPNDFK 260 Query: 226 KA---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENT 275 K + VRIG ++ E+ ++ + NT Sbjct: 261 KMKEFLASVVQDFDVSLNRVRIGAAQFSDTYHPEFPLGTFIGEKEISFQIENIKQIFGNT 320 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A+ RE+ + + +T + ++ +TDG++ Q E + Sbjct: 321 HIGAAL----REVEHYFRPDMGSRINTGTPQVLLVLTDGQSQD--------EVAQAAEAL 368 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 R+ G+ IYSV + L + T ++ + V++ EL + +I I Sbjct: 369 RHRGIDIYSVGIG--DVDDQQLIQITGTAEKKLTVHNFDELKKVNKRIVRNICTT 421 >gi|149922008|ref|ZP_01910450.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] gi|149817173|gb|EDM76653.1| hypothetical protein PPSIR1_18327 [Plesiocystis pacifica SIR-1] Length = 996 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 37/188 (19%), Positives = 72/188 (38%), Gaps = 25/188 (13%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + ++D++ E+A ++ + + IG IA++ + N Sbjct: 537 SGSMSSGDRLDLVKEAARATARTLDPSDE--------IGVIAFDNSPQVLVRLQPAANRL 588 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + S + +L+ TN PA+ AY +L GS L K VI ++DGE+ Sbjct: 589 RISSSIRRLSAGGGTNAMPALREAYLQL----------AGSKALVKHVILLSDGESPENG 638 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + MR + + + SV V L+R G++F D ++ F Sbjct: 639 IN-------ALLGDMRQSDITVSSVGVGDGAGKDFLIRVAERGRGRYFYSEDGTDVPRIF 691 Query: 381 DKITDKIQ 388 + +++ Sbjct: 692 SREAREVK 699 >gi|126322632|ref|XP_001381040.1| PREDICTED: similar to collagen, type XIV, alpha 1 [Monodelphis domestica] Length = 1892 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 65/201 (32%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I N V ++ K + +S + + Sbjct: 1184 VDGSWSIGDDNFNKIINFLYNTVGALDKIGVDGTQVS----IVQFTDDPRTEFMLNTYKT 1239 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + + L+ NT T A+ H L+ + + + K ++ ITDG + Sbjct: 1240 KDTLLEGIKNLSYKGGNTKTGKALKHVRDALFTAEGGTRR-----GIPKVIVVITDGRSQ 1294 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I ++ G+ I+++ V L F V+D Sbjct: 1295 DDVN--------KISRELQLEGISIFAIGV--ADADYAELVSIGSQPSARHVFFVDDF-- 1342 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1343 --DAFKKIEDELITFVCETAS 1361 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 56/168 (33%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 RIG Y+ + + V + L NT T A+++ + Sbjct: 333 TAFNVGANKTRIGLAQYSGDPRIEWHLNTFSTKDAVIDAVRNLPYKGGNTLTGLALNYIF 392 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + T + K I ITDG++ + +R++G++++++ Sbjct: 393 ENSFKPEAGAR-----TGVSKIGILITDGKSQDDIIP--------ASKTLRDSGVELFAI 439 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V L++ + V + + + +T + + Sbjct: 440 GV--KNADVAELQEIASEPDSTHVYNVAEFDLMHTVVEGLTKTVCTRV 485 >gi|229495742|ref|ZP_04389470.1| BatA protein [Porphyromonas endodontalis ATCC 35406] gi|229317316|gb|EEN83221.1| BatA protein [Porphyromonas endodontalis ATCC 35406] Length = 325 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 54/161 (33%), Gaps = 39/161 (24%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYE---NTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 PL+ + + +RL+++ T + A L S Sbjct: 136 SFTQCPLTTDHATLLNRLSEVEIGYLEDGTAIGLGIATACNRLKESHAKS---------- 185 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 K ++ +TDG N+ S + + G++IY+VAV E Sbjct: 186 KIIVLLTDGTNNAGS-----IAPSMAASLAESLGIRIYTVAVGTRGEAPYPHATAFGTVI 240 Query: 353 -------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + L++ + G +F D+ L + +D+I Sbjct: 241 DNVKVEIDEASLKEIAQTTGGSYFRATDNESLNQIYDEIDS 281 >gi|239827908|ref|YP_002950532.1| hypothetical protein GWCH70_2571 [Geobacillus sp. WCH70] gi|239808201|gb|ACS25266.1| Ig domain protein group 2 domain protein [Geobacillus sp. WCH70] Length = 942 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 36/344 (10%), Positives = 86/344 (25%), Gaps = 33/344 (9%) Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 +L + I + + A ++ T S+ +Y P Sbjct: 11 FIALFFSFYLGDATNIVFGESNDSNNATLDFTIT---------SSQLEYAKPPNGDAQGR 61 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 L + I + V DVS SM + ++ Y Sbjct: 62 L-------DVTLIPKGRVDNIVRPPIDVVFVFDVSGSMTP-LKLQSAKYALQSAVDYFKA 113 Query: 183 PPPKKSFWSKNTTKSKYAPA-PAPANRKIDVLIES---AGNLVNSIQKAIQEKKNLSVRI 238 ++ S P + + N+ N ++ S++ Sbjct: 114 NANPNDRFALIPFSSDVQYNKVVPFPTGAYDVKQHLERIANVANDLRAYGGTNYTQSLQQ 173 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 +N L++ + V + K +++ + + + Sbjct: 174 AQSFFNDPTRKKYIIFLTDGMPTV--SIAKEPITYKVCEGILFWRTCKQVTEDLDVQYIL 231 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLN---------TLQICEYMRNAGMKIYSVAVS- 348 + I+ D + + I + + + +YS+ Sbjct: 232 YSNGITAARTIYYPDEPETKTYWDRTKYREFEEKIRLHGTNIAKTLGMNNIVLYSIGFGN 291 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 L K + ++G + L E F + + + + Sbjct: 292 YQEVDMGYLEKLSSTAGGQAKQGTPQNLTEIFQQFSKLANDPVL 335 >gi|147815707|emb|CAN70517.1| hypothetical protein VITISV_016246 [Vitis vinifera] Length = 715 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 18/185 (9%), Positives = 56/185 (30%), Gaps = 24/185 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ +L + G ++ ++ + + + + + + + +N Sbjct: 281 TKLALLKRAMGFVIQNLGSSDRLSVIAFSSTARRLFPLRRMTDA------GRQQALQAVN 334 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG---ASAYQN 324 L TN + + + + +I ++DG+++ S+ Sbjct: 335 SLVANGGTNIAEGLRKGAKVMED--------RKERNPVSSIILLSDGQDTYTVNGSSGNX 386 Query: 325 TLNTLQICEYMRNAG-------MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 Q+ + G + ++S + + S G F + + Sbjct: 387 PQPNYQLLLPLSMHGSQNTGFQIPVHSFGFGTDHDASSMHTISEISGGTFSFIETESVIQ 446 Query: 378 ESFDK 382 ++F + Sbjct: 447 DAFAQ 451 >gi|291388325|ref|XP_002710627.1| PREDICTED: matrilin 3-like [Oryctolagus cuniculus] Length = 1109 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + +++S+ + + R+G + Y+ + + ++K Sbjct: 820 GEDNFEIVKQFLTAIIDSLAVSPKAA-----RVGLLQYSTQVRAEFTLRSFSTAKDMKKA 874 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + S R+ + I TDG + Sbjct: 875 VAHMKYMGKGSMTGQALKHMFERSFTQLEGARPL--SARVPRVAIVFTDGRAQDDVSEW- 931 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ + F D + E +K Sbjct: 932 -------ANKAKANGITMYAVGVGKAI--EEELQEIASEPTDKHLFYAEDFSTMNEISEK 982 Query: 383 ITDKIQE 389 + I E Sbjct: 983 LKMGICE 989 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 241 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 300 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 301 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 350 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 351 --QVDLNTLKAIGSEPHEDHVFLVANFSQIESLTSVFQN 387 >gi|194221585|ref|XP_001495200.2| PREDICTED: collagen, type XXIX, alpha 1 [Equus caballus] Length = 2617 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 32/361 (8%), Positives = 98/361 (27%), Gaps = 33/361 (9%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYI--------RENAGDIAQKAQIN 92 +I S ++ + +Q + + K + + + QI Sbjct: 282 AKTISSLKSSTTQSEFLEQIQKLSLQAGKSNAGAAIDMMRREAFSESSGSRRAQGVPQIA 341 Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS--LRSTGIIERSSENLAISI 150 + + E+ + +F + + T L + S+ + Sbjct: 342 VLVTNRPSDDEVREAAQKLLREDVIVFAMSIQGANNTQLEEIVSYPSRQRVSTLKSYADL 401 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 + + + +I Sbjct: 402 ERYSRIFQKKLQNEIWDQIST-RAEQRNLDETGCLDTKEADLYFLIDGSTSIQNEQFEQI 460 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL- 269 + + V++G + Y+ I + +N +++ + + Sbjct: 461 KRFMLEVTEVF--------SIGPGRVQVGVVQYSHEIREEFSIGVYSNDVDLRKAVLNIK 512 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +T+T A+ ++ ++ + + ++I +TDG++ L Sbjct: 513 QLTGDTHTGAALAFMLPKIREGRKQRPSK-----VPCYLIVLTDGQSQDHH------RIL 561 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + +R G+ I++V + + L++ + + + L ++ I Sbjct: 562 ETANRIRAEGVTIHAVGIG--EADKTELQQIAGNEERVHFGQNFDSLKSIKSEVVHSICT 619 Query: 390 Q 390 + Sbjct: 620 E 620 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 29/154 (18%), Positives = 54/154 (35%), Gaps = 13/154 (8%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKES 294 V+ G + Y+ + + L + T T A+ HA + E Sbjct: 852 VQFGALKYSKEPEDLFYLNTFSKGAAITENLRRRRDTYGETYTAKALEHANSQFTEE--- 908 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 H + +K+ +I ITDG + +T +RN G+ IY+V V Q Sbjct: 909 -HGSRIKQNVKQMLIVITDGVSHDREQLSDT------ALKLRNKGIIIYAVGVG--EADQ 959 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 L V++ +L + + + + + Sbjct: 960 YELEAIAGDKNNTRHVDNFDKLKDIYQFLQEGMC 993 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 53/158 (33%), Gaps = 17/158 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 +IG + ++ E+ +++++ +NT T A+ + Sbjct: 664 DKTQIGVVQFSTDAKEEFQLNKYFTQKEISDAIDRMSLINQNTYTGHALEFVDQYF---- 719 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 SH +KKF+I ITDGE +R + I+SV V Sbjct: 720 --SHLKGARPGVKKFLILITDGEARDGVRDP--------ARALREKEVVIFSVGV--YGA 767 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L + + F V +L K+ ++ Sbjct: 768 NRTQLEEISGDGSLVFQVEKFDDLKAIESKLIFRVCAL 805 >gi|327542237|gb|EGF28726.1| BatA aerotolerance operon protein [Rhodopirellula baltica WH47] Length = 345 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 60/167 (35%), Gaps = 37/167 (22%) Query: 247 IVGNQCTPLSNNLNEVKSRLN-----KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 + TP + + + V SRLN + T A+ + +L + S Sbjct: 133 AYADAETPPTLDHSFVVSRLNQTEIVSRRDEDGTAIGDAIALSVEKLNALDARQERKVQS 192 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--------- 352 K +I +TDGEN+ L+ +Q E G+KIY++ V + Sbjct: 193 ----KILILLTDGENTAG-----ELDPIQAAELAETLGIKIYAIGVGTKGKAPVPVRDPF 243 Query: 353 -------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L+K + + G++F D+ L + +I Sbjct: 244 TGRQRLHYMEVNIDEATLQKVAEITGGKYFRATDTDSLDAIYREIDQ 290 >gi|291613312|ref|YP_003523469.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] gi|291583424|gb|ADE11082.1| von Willebrand factor type A [Sideroxydans lithotrophicus ES-1] Length = 321 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 54/147 (36%), Gaps = 25/147 (17%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ +LN V L++ T A+ A + L + S + +I Sbjct: 145 PLTTDLNTVGQFLDEAMIGVAGTQTAIGDAIGLAIKRLRDATNVSGRKGET-----VLIL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG-----------QDLLRK 359 +TDG N + + + AG++IY++ V + +D L+ Sbjct: 200 LTDGSNDAGAMPPDEAAKMAA-----AAGLRIYTIGVGSDQTDPFGMGGANDLDEDTLKL 254 Query: 360 CT-DSSGQFFAVNDSRELLESFDKITD 385 + G++F D L + + +I Sbjct: 255 IAKTTGGEYFRATDVENLQQVYTRIDR 281 >gi|20093632|ref|NP_613479.1| Mg-chelatase subunit ChlI /Chld [Methanopyrus kandleri AV19] gi|19886501|gb|AAM01409.1| Mg-chelatase subunit ChlI and Chld (MoxR-like ATPase and vWF domain) [Methanopyrus kandleri AV19] Length = 818 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 44/133 (33%), Gaps = 10/133 (7%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 +++++ E+ +++ L P T+ A+ + Sbjct: 680 NTKAEIVVDITSDVEEIITKVMSLKPGGATDIGDAIRVGTELFRRCGRPDRDW------- 732 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SS 364 +I +TDG + T L G+ I ++ + P EG L+ S Sbjct: 733 -HMILLTDGVPTKGEPDPETK-ALSEATAASRMGVTISTIGIKLPEEGIRLIEHIAGISG 790 Query: 365 GQFFAVNDSRELL 377 G+ + D EL Sbjct: 791 GRSHHITDPEELT 803 >gi|33596464|ref|NP_884107.1| hypothetical protein BPP1839 [Bordetella parapertussis 12822] gi|33566233|emb|CAE37141.1| putative exported protein [Bordetella parapertussis] Length = 571 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 29/291 (9%), Positives = 76/291 (26%), Gaps = 27/291 (9%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 Y L L P+ ++ L + + + Sbjct: 130 TGSYSNVRRLLNDGRLPPADAVRAEAFINYFDYGYPAPATPAVPFSL-TTEIAPAPWNPQ 188 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 + P S A K+ +L + LV ++ Sbjct: 189 RQLLLVGIQGYRVAPQDIPAVNLVLLIDTSGSMADRA----KLPLLKSALRQLVTQMRAQ 244 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 R+ +AY + + ++ + ++ L +TN + AY E Sbjct: 245 D--------RVAIVAYAGSAGLVLPSTPGDRHAQILAAIDGLQASGSTNGGAGLELAYAE 296 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ +DG+ + + R G+ + ++ + Sbjct: 297 AAKGLVKDGVNR--------IVLASDGDFNVGR--TDLAQLKDYVGSQRKRGIALTTLGL 346 Query: 348 SAPPEGQDL-LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + ++ G + ++ + + F ++ + IA + Sbjct: 347 GSGNYNDAMAMQLANAGDGSYHYIDSLLQARKVF---ASELSATLLTIAKD 394 >gi|224370037|ref|YP_002604201.1| hypothetical protein HRM2_29500 [Desulfobacterium autotrophicum HRM2] gi|223692754|gb|ACN16037.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 332 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 70/198 (35%), Gaps = 40/198 (20%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-- 270 L + N +++++ +++ G PL+ + + + L++L Sbjct: 106 LDGAIVNRLDAVKNVVKDFIMKRSGDRIGMVVFGSEAFTQMPLTRDYDTIAFVLSRLKIG 165 Query: 271 -PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +T AM + + L + K S VI +TDG+++ + Sbjct: 166 AAGPSTAIGDAMGISLKRLEDVKSKS----------NIVILLTDGKSNSG-----EITPG 210 Query: 330 QICEYMRNAGMKIYSVAVSAPPE---------------------GQDLLRKCTD-SSGQF 367 + R G+K+Y++ V + + L++ D + G F Sbjct: 211 AAADIARERGVKVYTIGVGQRGKAPFLVNDPLFGQRYVYQMVDMDHEALKEIADKTGGAF 270 Query: 368 FAVNDSRELLESFDKITD 385 FA D+ L + +D I Sbjct: 271 FAAADTDSLKKIYDMIDS 288 >gi|91225506|ref|ZP_01260628.1| hypothetical protein V12G01_09265 [Vibrio alginolyticus 12G01] gi|91189869|gb|EAS76142.1| hypothetical protein V12G01_09265 [Vibrio alginolyticus 12G01] Length = 356 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 60/150 (40%), Gaps = 15/150 (10%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + E+ ++ + ++T+ A+ A + +S Sbjct: 159 GDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSDQSRGALEQDQ 218 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------EGQD 355 +K I +TDG ++G + + + + + G++++ +A+ P + Sbjct: 219 NREKVAIVLTDGNDTG-----SFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDT 273 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + R +S G+ F + EL ++D+I Sbjct: 274 IHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|154244802|ref|YP_001415760.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] gi|154158887|gb|ABS66103.1| von Willebrand factor type A [Xanthobacter autotrophicus Py2] Length = 345 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 57/167 (34%), Gaps = 33/167 (19%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNL-NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 RIG I ++ L N+ ++ + + NT+ A+ A + L Sbjct: 136 RIGLILFSTRAYVQAPLTLDRNVVRQLLAEASIGMTGRNTSIGDAIGLAVKTL------- 188 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--- 352 + +I +TDG N+ L+ ++ ++I+++ V A Sbjct: 189 ---RDRPAKDRVLILLTDGANTSGV-----LDPMEAAAIAAKENVRIHTIGVGADSNFTD 240 Query: 353 -------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 ++ L+K + GQ+F + + L + I Sbjct: 241 IQPGMLMNPSGDLDEEALKKIAGLTGGQYFRARNDKGLAAIYADIDR 287 >gi|323138937|ref|ZP_08073998.1| hypothetical protein Met49242DRAFT_3386 [Methylocystis sp. ATCC 49242] gi|322395783|gb|EFX98323.1| hypothetical protein Met49242DRAFT_3386 [Methylocystis sp. ATCC 49242] Length = 482 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 55/467 (11%), Positives = 128/467 (27%), Gaps = 87/467 (18%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + L A D A + ++++ ++ DAA L+ + + T + Sbjct: 21 IMGLAVIPLVLASGLAADYAIVQAAKSRLDASADAAALAAIKTAQTTIAELSATNPNPRP 80 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I + + + A D+ K I++ + Y Sbjct: 81 QAIAAAMSQAEKSFYAQAGKRAADLLGKPAIDV-----QIKGQEVTANVAYS-AAMPSNF 134 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + L N + + + + ++LDVS SM + L Sbjct: 135 GRIAGVKLMNYNGGAGAQLT---MAKFLDFYLLLDVSGSMGLPSTPAGEAALAAKNPDDL 191 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPA-PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 P F Y + A +ID + + L+ + K V + Sbjct: 192 AQYPTGCRFACHFAGSQGYNVSRANNIQLRIDAVGAAVAQLMEKAKDTATLPKQYRVGVY 251 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT------------------------ 275 + + L + V+S +N T Sbjct: 252 PFVTHANAFVDLTDNLRGDQYSVESAINYDPATRTTDFGRLLDAGKDWVFARDLNPNYKA 311 Query: 276 ------------NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS------ 317 +H+ ++++ + S + G++ + FV F++DG + Sbjct: 312 NPNIPADVTPMGAGGSHIHNIFQDINAKIPSVGDGSGASSPQPFVFFVSDGMQNSQSFVS 371 Query: 318 -GASAYQNTLNTL-------------QICEYMRNAGMKIYSVAVSAP------------- 350 + T +C ++ G+ + + + P Sbjct: 372 ATGTWPGVTPYPTPPGQTVSIRAMDPTLCNVLKARGITVSVLEIPYPTFTNPKPFAAAQE 431 Query: 351 -------PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 P +R C +F + + ++ K+ ++ + Sbjct: 432 FKANDAVPNLSGAMRACAS-PNFYFMADTPEGIADAMKKMFEQAVQS 477 >gi|312621140|ref|YP_004022753.1| yd repeat protein [Caldicellulosiruptor kronotskyensis 2002] gi|312201607|gb|ADQ44934.1| YD repeat protein [Caldicellulosiruptor kronotskyensis 2002] Length = 2994 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 38/372 (10%), Positives = 110/372 (29%), Gaps = 62/372 (16%) Query: 37 VLSGCASIVSDR-TIKDPTTKKDQTSTIFKKQIKKHL---------KQGSYIRENAGDIA 86 V+ G + + T + + K ++ +I G + Sbjct: 623 VVMGTDPLAKNPLTSAEKYAVSEDGKVFVKALSDANILIAPLQVKRSDNVFINSLKGIVG 682 Query: 87 QKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENL 146 + +I + + + L + + + + Sbjct: 683 KAIEITAGGFDIKRAEIVVNYDEAELNGVDENNLMLYYVNYDKKI------LEPLEDVVV 736 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 V + L N +++ + S S + Sbjct: 737 DTVYNRVSGKTEHFSTFLLGDKNMPVDLSKVDIVFVLDNSGSMSSNDPNYY--------- 787 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 +I+ + N+ L+ R+G + ++ + + L+++ N++ L Sbjct: 788 --RIEATKKFIQNI-----------DELNNRVGLVDFDSSVYVR--SNLTSDKNKLLQAL 832 Query: 267 NKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 N + +TN + A + KK ++ ++DG ++ + Sbjct: 833 NAMRWTGGSTNIGGGLKAALELF-----------DQEQSKKIIVLLSDGYHNTGIHPND- 880 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 + + + + ++A+ ++LL D + G +F V+++ L + + + Sbjct: 881 -----VLPELIKQEIVVNTIALGKD-CDRELLHDIADKTKGDYFYVDNTGGLSQ--EDVD 932 Query: 385 DKIQEQSVRIAP 396 +I+ ++ Sbjct: 933 KQIELIYEKLTK 944 >gi|156347845|ref|XP_001621774.1| hypothetical protein NEMVEDRAFT_v1g221583 [Nematostella vectensis] gi|156208029|gb|EDO29674.1| predicted protein [Nematostella vectensis] Length = 357 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 32/277 (11%), Positives = 76/277 (27%), Gaps = 46/277 (16%) Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSN-----------KYLLPPPPKKSFWSKNTT 195 I D++ + N N + N Sbjct: 66 DIDWSQWPDMTHVISACASWAKRKNYNDKFGVQFYGECWADADSSRFNDHGLATNCINGV 125 Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVN---SIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 ++ L + L++ SI + + ++ I Sbjct: 126 GEHWSNFVYSMKGVCPPLKMNLVFLIDNSGSINDTEFDNFKEFAKKLAESFTISATYTHV 185 Query: 253 ------------TPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTI 299 L ++N +K+ ++ L T+ A+ + ++ Sbjct: 186 AAVYFNTLANFGFNLKYDINVIKTAIDNLPNIGGGTHIGKALTYTLDNVFKVAPR----- 240 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDLL 357 +K ++ +TDG++ + +RN G +++++V V A L Sbjct: 241 --QNVKNVLVVLTDGKSHDSVTLP--------AAAVRNYGPGVEVFAVGVGAGDSFVAQL 290 Query: 358 RKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSV 392 F V ++ + + D+I + +V Sbjct: 291 NVIASDPDEDHVFHVEHFSQIESTTGAVEDEICKDTV 327 >gi|327261831|ref|XP_003215731.1| PREDICTED: collagen alpha-1(XII) chain-like [Anolis carolinensis] Length = 3118 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 54/166 (32%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ ++ + + NT T A+ + Sbjct: 169 SAFDIGEDKTRVGIVQYSSDTRTEFNLNQYYRQRDLIEAIKNIPYKGGNTMTGEAIEYLM 228 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 R + E S K I ITDG++ +R AG++++S+ Sbjct: 229 RNTFVESAGSR-----KDFPKVAIIITDGKSQD--------EVEIPARELRAAGVEVFSL 275 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 + L+ F V + +++ ++I ++ Sbjct: 276 GI--KAADAKELKLIASQPSLTHVFNVANFDGIVDIQNEIVSQVCS 319 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 62/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ N ++++ +N TNT AM + +++ Sbjct: 474 SPEKVQISLVQYSRDPHTEFTLNRYNRIDDIIQAINTFPYRGGSTNTGKAMTYVREKIFV 533 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + +I ITDG+ ++ +RN+ ++I++V V Sbjct: 534 TGRGAR-----PNVPRVMILITDGK--------SSDAFKDPAIKLRNSDVEIFAVGV--K 578 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L + V D ++F +I+ ++ + +RI Sbjct: 579 DAVRTELEAIATPPAETHVYTVEDF----DAFQRISFELTQSVCLRI 621 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 53/165 (32%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+IG Y+ + + L NT T A++ R Sbjct: 1230 DIGPDKVQIGLAQYSGDPRTEWQLNSHKTKQSLMDAVANLPYKGGNTLTGMALNFILR-- 1287 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 +K + ITDG++ + + +++ G+++Y+V + Sbjct: 1288 ---NNFKPEAGMRPGARKIGVLITDGKSQDDIVAPS--------QRLKDLGVELYAVGI- 1335 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 ++ L++ + V D L+ D +T + Sbjct: 1336 -KNADENELKQIASDPDETHAYNVGDFTLLVNIVDDLTVNLCNSV 1379 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 30/351 (8%), Positives = 77/351 (21%), Gaps = 27/351 (7%) Query: 55 TKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIP 114 + + + ++ N+ D S + Y + Sbjct: 2182 STSYDVNVYAQYDAGLSAPLVDRGTTLYLNVTDLTSYNVGWDTFCVRWAAHRSASSYRLK 2241 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 + T + + ++E + Sbjct: 2242 LNPADGSR---GQEITVRGTETSHCFTGLSPDTEYDATIFVQTPNLEGPPVSTRERTLIK 2298 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 + P P N ++ Sbjct: 2299 PTEPPTEAPTPPPPPTIPPARDVCKGAKADIVFLTDASWSIGDDNFNKVVKFIFNTVGGF 2358 Query: 235 S------VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRE 287 +++ + Y+ ++ L + NT T A+ + Sbjct: 2359 DLINPAGIQVSFVQYSDDPKPEFNLNRYDDKALALGALQNIRYKGGNTKTGKALTFIKNK 2418 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + + K ++ +TDG + ++ ++++G ++ V V Sbjct: 2419 VLT-----WESGMRKGVPKVLVVVTDGRSQD--------EVMKAASVIQHSGFSVFVVGV 2465 Query: 348 SAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 L K F V+D + D + + E + P Sbjct: 2466 --ADVDYHELAKIASKPSERHVFIVDDFDAFEKIEDNLITFVCETATSSCP 2514 >gi|32475535|ref|NP_868529.1| BatA [Rhodopirellula baltica SH 1] gi|32446077|emb|CAD75906.1| BatA [Rhodopirellula baltica SH 1] Length = 357 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 33/167 (19%), Positives = 60/167 (35%), Gaps = 37/167 (22%) Query: 247 IVGNQCTPLSNNLNEVKSRLN-----KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 + TP + + + V SRLN + T A+ + +L + S Sbjct: 145 AYADAETPPTLDHSFVVSRLNQTEIVSRRDEDGTAIGDAIALSVEKLNALDARQERKVQS 204 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--------- 352 K +I +TDGEN+ L+ +Q E G+KIY++ V + Sbjct: 205 ----KILILLTDGENTAG-----ELDPVQAAELAETLGIKIYAIGVGTTGKAPVPVRDPF 255 Query: 353 -------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L+K + + G++F D+ L + +I Sbjct: 256 TGRQRLHYMEVNIDEATLQKVAEITGGKYFRATDTDSLDAIYREIDQ 302 >gi|293348732|ref|XP_001072793.2| PREDICTED: collagen, type XXII, alpha 1 [Rattus norvegicus] gi|293360639|ref|XP_243609.5| PREDICTED: collagen, type XXII, alpha 1 [Rattus norvegicus] Length = 1613 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 69/195 (35%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ ++ Sbjct: 54 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPGHTRVGVVRYSDRPTTAFELGHFSSRE 108 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK+ ++ NTNT A+ + ++ + G+ K+ I +TDG + Sbjct: 109 EVKAAARRITYHGGNTNTGDALRYITSRSFSA--QAGGRPGNRAFKQVAILLTDGRSQDL 166 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 G++I++V V ++ L + F V+D + Sbjct: 167 VLDAAAAAHAA--------GIRIFAVGVG--AALKEELDEIASEPKSAHVFHVSDFNAID 216 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 217 KIRGKLRRRLCENVL 231 >gi|258544594|ref|ZP_05704828.1| von Willebrand factor type A domain protein [Cardiobacterium hominis ATCC 15826] gi|258520172|gb|EEV89031.1| von Willebrand factor type A domain protein [Cardiobacterium hominis ATCC 15826] Length = 563 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 32/382 (8%), Positives = 95/382 (24%), Gaps = 35/382 (9%) Query: 23 MYIRNQ--MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRE 80 R+ +Q A A+ + +S+R + + + + Sbjct: 35 YAARSAPVLQKAA-ASPQAQAIPDLSNRLGVNMPVPAGANPQWALNKSVARMGIRGTVEV 93 Query: 81 NAGDIAQKAQIN----ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST 136 + + N ++ + ++ + I L P+ + Sbjct: 94 QNRERYAHSDANPVHRVSDAPVSTFSIDVDTGSYSNIRRMLTRENRLPPADAVRVEEILN 153 Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK 196 V + + + N Sbjct: 154 YFAYGYPLPQDGKPFAV------HTQTVDSPWQADAKLIRIAIQAADLAPEKRPPANLVF 207 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 K+ ++ ++ + +++ RI I Y+ Sbjct: 208 LIDTSGSMDDPDKLPLVKKTVCHFAEALRADD--------RISLITYSGSTAEILPPTAG 259 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + + + L L + T A+ AY ++ TDG+ Sbjct: 260 DQKETIIAALKPLRAHGATAGGEALRMAYDAAAKNYRKDGINR--------ILLATDGDF 311 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRE 375 + + R +G+ + ++ + +++ + D+ G + ++ E Sbjct: 312 NVG--ISDPATLKNYVADKRKSGISLTTLGYGSGNYNDEMMEQLADAGDGNYSYIDSEAE 369 Query: 376 LLESFDKITDKIQEQSVRIAPN 397 + + ++ +A + Sbjct: 370 AKKV---LVRQLTSTLATVARD 388 >gi|297265499|ref|XP_002799207.1| PREDICTED: matrilin-3-like [Macaca mulatta] Length = 445 Score = 72.6 bits (176), Expect = 1e-10, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 58/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + + +K + ++ P Sbjct: 103 VKTFVSRIIDTL-----DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGQITPL 157 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E++ + + S+ + K I +TDG ++ Sbjct: 158 STGTMSGLAIQTAMDEVFTAEAGARG--PSSNIPKVAIIVTDGRPQD--------QVNEV 207 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G+++Y+V V L+ F V +L F + Sbjct: 208 AARARASGIELYAVGV--DRADMQSLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 261 >gi|152993979|ref|YP_001359700.1| von Willebrand factor type A domain-containing protein [Sulfurovum sp. NBC37-1] gi|151425840|dbj|BAF73343.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 307 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 34/213 (15%), Positives = 63/213 (29%), Gaps = 26/213 (12%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 Y K + D ++ + Sbjct: 69 YDAAGNQHKKGRDLVLAIDASGSMAQSGFDEKDRFKTKYETTLD----LSADFIKHRFDD 124 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKES 294 G +PL+ +L ++S L K+ +T A+ A R L + Sbjct: 125 NMGVVIFGTFAYTASPLTYDLEAMESML-KMTTVGIAGESTAIGDALMQAMRTLSYGEAQ 183 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS-APPEG 353 S K +I +TDG ++ + G+KIY++ V + Sbjct: 184 S----------KAIILLTDGYHNAGRSSPKAAVAKAK-----EKGIKIYTIGVGKSSDYD 228 Query: 354 QDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 LL +S G+ +A + +L E + +I Sbjct: 229 AALLDTIAKESGGKSYAAASAAQLKEVYKEIDK 261 >gi|224048537|ref|XP_002190467.1| PREDICTED: similar to collagen, type XII, alpha 1 [Taeniopygia guttata] Length = 3122 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 55/166 (33%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +++ + ++ NT T A+ + Sbjct: 167 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYFRRSDLIDAIKRIPYKGGNTMTGEAIDYLV 226 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG+ +RN G++++S+ Sbjct: 227 QNTFTESAGAR-----KGFPKVAIVITDGKAQDDVEIP--------ARELRNIGVEVFSL 273 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 + L+ F V + +++ ++I ++ Sbjct: 274 GI--KAADAKELKLIASQPSLKHVFNVANFDGIVDIQNEIILQVCS 317 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 63/163 (38%), Gaps = 23/163 (14%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKES 294 V+I + Y+ N + ++ +N TNT AM + +++ + Sbjct: 476 VQISLVQYSRDPHMEFSLNRYNRVEDIIQAINTFPYRGGSTNTGKAMTYVREKVFVTSKG 535 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 S + + +I ITDG+ ++ + +R+A ++I++V V + Sbjct: 536 SR-----PNVPRVMILITDGK--------SSDAFKEPAIKLRDADVEIFAVGV--KDAVR 580 Query: 355 DLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 L + V D ++F +I+ ++ + +RI Sbjct: 581 TELEAIASPPADTHVYTVEDF----DAFQRISFELTQSVCLRI 619 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 51/165 (30%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+IG Y+ + + L NT T A+ R Sbjct: 1228 DIGPDRVQIGLAQYSGDPRTEWNLNAYRTKQSLLEAVANLPYKGGNTLTGMALDFILR-- 1285 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + +R+ G+++Y++ + Sbjct: 1286 ---NNFKPVAGLRPRARKIGVLITDGKSQDDVVAPS--------RKLRDEGVELYAIGI- 1333 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 ++ L++ + V D L D +T + Sbjct: 1334 -KNADENELKQIATDPDDIHAYNVADFSFLATIVDNVTTNLCNSV 1377 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 23/196 (11%), Positives = 60/196 (30%), Gaps = 19/196 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 A + D + + +++ +++ + Y+ ++ + Sbjct: 2332 ASWSIGDDNFNKVVKFVFSTVGAFDL-INPAGIQVSFVQYSDEAKSEFKLNTFDDKAQAL 2390 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 L + NT T A+ ++ + + K ++ +TDG + Sbjct: 2391 GALQNIQYRGGNTRTGKALTFIKEKVLT-----WESGMRRGVPKVLVVVTDGRSQD---- 2441 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESF 380 + ++++G ++ V V + L K F V+D + Sbjct: 2442 ----EVRKAATVIQHSGFSVFVVGV--ADVDYNELAKIASKPSERHVFIVDDFDAFEKIQ 2495 Query: 381 DKITDKIQEQSVRIAP 396 D + + E + P Sbjct: 2496 DNLVTFVCETATSTCP 2511 >gi|168702184|ref|ZP_02734461.1| hypothetical protein GobsU_21830 [Gemmata obscuriglobus UQM 2246] Length = 638 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 19/149 (12%), Positives = 43/149 (28%), Gaps = 11/149 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 R+ + Y + + + L TN + AY+ + Sbjct: 305 TEKDRVSVVTYAGDSRVALPPTSGADKKAILDVVTGLQANGGTNGEGGIKKAYQFARDTF 364 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 VI TDG+ + + +++ E R + + + + Sbjct: 365 LDGGVNR--------VILCTDGDFNVGVV--DNGELVKLIEEQRKSKVFLTVLGYGMGNY 414 Query: 353 GQDLLRKCTDSS-GQFFAVNDSRELLESF 380 D L++ + G ++ E + F Sbjct: 415 KDDRLKELANHGNGHHAYIDTLDEAKKVF 443 >gi|119961201|ref|YP_948618.1| hypothetical protein AAur_2909 [Arthrobacter aurescens TC1] gi|119948060|gb|ABM06971.1| hypothetical protein AAur_2909 [Arthrobacter aurescens TC1] Length = 354 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 18/153 (11%), Positives = 43/153 (28%), Gaps = 14/153 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTA+++ +A+D+A + Q+Q+ D++ + + + D Sbjct: 17 MTAMLMVALLGMTAFAVDVAMMYSEHAQLQNGADSSAIGIAQACAQNAASADCAAPTSAA 76 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +++ + N G + +S+ L Sbjct: 77 TSLAGLNALDGVSNAPQASVNLG--------------TGTVDVTTQSRNTSGDNHFTLVF 122 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 + N+ + S AI + Sbjct: 123 ARALGVETANIQASAQAKFGGFSATDAIPLTFS 155 >gi|50119743|ref|YP_048910.1| hypothetical protein ECA0798 [Pectobacterium atrosepticum SCRI1043] gi|49610269|emb|CAG73712.1| putative membrane protein [Pectobacterium atrosepticum SCRI1043] Length = 543 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 53/503 (10%), Positives = 121/503 (24%), Gaps = 123/503 (24%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 A+ + + +D + Q++ A DAA L+ + ++ + Sbjct: 41 ALGAMALLVTAAFIVDTSTATGDATQIKRATDAAALAVGHQATING--------EEYSQE 92 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN----- 117 K +++K + + + + +++T+ +N+ + A +E Sbjct: 93 DTNKLAYEYVKNNLGMNKALSEKLVASDVSVTEGRNSATRKTYTVTAAFETKPSLLSLGA 152 Query: 118 ------------------------LFLKGLIPSALTNLSLRSTGIIERSSENLAISICM- 152 RS SS + Sbjct: 153 KKQEVYSTSEVINRPTEIALVMPVTGDMSEGDIRSLKSVSRSFVERMLSSADGKRDNLWL 212 Query: 153 -VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP--------------------------- 184 ++ S+S+ + N ++ L PP Sbjct: 213 SLVPYSQSVNVYDAEDANRIRRWSTPSALNPPELRSLFASGVVSSLADRRFPDRRANLLC 272 Query: 185 -------PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 + FW + + + + + +V Sbjct: 273 MYRGLGREEDFFWDEPPAGQFSIYYRHDLPQNGSPGAPPISWRGPNPDLYPWDNNSNAVD 332 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 I + G PL+N + + R+ + + N N M A L S Sbjct: 333 TRFIVADRGCPNAALMPLTNEESRLNQRIEQFSARFNVNYAIGMSWAGAALSPNMRGSDG 392 Query: 298 TIGSTRL---------KKFVIFITDGENSGASAYQNTLNTL------------------- 329 + + + G + N Sbjct: 393 WGDAKLPLDFNLDGNGDGQKVIVMMANTIGNWFDTDGYNFKRNQFSGSTGSDVAREFAQQ 452 Query: 330 ---QICEYMRNAGMKIYSVAV---SAPPEGQDLL---------------RKCTDSSGQFF 368 +C +K Y V + G++L ++ + G F Sbjct: 453 RFRDLCSSFHARNIKFYFVGIRPGDPEDFGRNLFDTEATPGLLVCTEGEKRMSFIDGSGF 512 Query: 369 AVNDS-RELLESFDKITDKIQEQ 390 +L++ D+I +I+ + Sbjct: 513 GAEGVEDQLIQRLDRIAGQIETE 535 >gi|75076662|sp|Q4R7B7|ANTRL_MACFA RecName: Full=Anthrax toxin receptor-like; Flags: Precursor gi|67969305|dbj|BAE01005.1| unnamed protein product [Macaca fascicularis] Length = 557 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 74/195 (37%), Gaps = 24/195 (12%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + + +L +++ + ++ +R+ I Y+ G PL+++ N +K Sbjct: 76 YFILDKSGSVNNNWIDLYMWVEETVARFQSSDIRMCFITYSTD--GQTVLPLTSDKNRIK 133 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + L++L P +T A +++ + ++ +I +TDGE Sbjct: 134 NGLDQLRKIVPDGHTFMQAGFRKAIQQIETFNSGN-------KVPSMIIAMTDGE----L 182 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 +TL+ + R G +Y+V D + DS FAV + F Sbjct: 183 VAHAFQDTLREAQKARKLGANVYTV--DVADYKLDQITAIADSPEHVFAVENG------F 234 Query: 381 DKITDKIQEQSVRIA 395 + D + + ++ Sbjct: 235 KAMRDTVDALTSKVC 249 >gi|115450663|ref|NP_001048932.1| Os03g0142500 [Oryza sativa Japonica Group] gi|108706121|gb|ABF93916.1| zinc finger family protein, putative, expressed [Oryza sativa Japonica Group] gi|113547403|dbj|BAF10846.1| Os03g0142500 [Oryza sativa Japonica Group] gi|125584872|gb|EAZ25536.1| hypothetical protein OsJ_09360 [Oryza sativa Japonica Group] gi|215712380|dbj|BAG94507.1| unnamed protein product [Oryza sativa Japonica Group] Length = 694 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 16/179 (8%), Positives = 58/179 (32%), Gaps = 19/179 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K+ +L + ++ ++ + + + + + L+ + ++ Sbjct: 262 KLSLLKRAMSFVIQTLG-----PNDRLSVVAFSSTAQRLFPLRRMTLT-GRQQALQAISS 315 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L TN A+ + + + + +I ++DG+++ + Sbjct: 316 LVASGGTNIADALKKGAKVVKDRRR--------KNPVSSIILLSDGQDTHSFLSGEADIN 367 Query: 329 LQI-----CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 I + ++I++ + + S+G F ++ + ++F + Sbjct: 368 YSILVPPSILPGTSHHVQIHTFGFGTDHDSAAMHAIAETSNGTFSFIDAEGSIQDAFAQ 426 >gi|261880541|ref|ZP_06006968.1| BatA protein [Prevotella bergensis DSM 17361] gi|270332764|gb|EFA43550.1| BatA protein [Prevotella bergensis DSM 17361] Length = 332 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 58/177 (32%), Gaps = 45/177 (25%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPY--------ENTNTYPAMHHAYRELYNEKESSHNTIG 300 PL+ + + + LN + + T + +A L G Sbjct: 138 SFTQCPLTIDHGSLLNLLNNVRTDIAARGLIQDGTAVGMGLANAVSRL----------KG 187 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---------- 350 S K VI +TDG N+ L + R+ G+++Y++ V Sbjct: 188 SKAKSKVVILLTDGSNNMGDIS-----PLTAAQIARSLGIRVYTIGVGTNKVAPYPMPVA 242 Query: 351 ----------PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 L + G F+ ++REL + + I DK+++ + + Sbjct: 243 GGVQYVNMPVEIDTKTLSDIAAITEGNFYRATNNRELKQIYRDI-DKLEKTKMNVTK 298 >gi|237808477|ref|YP_002892917.1| von Willebrand factor type A [Tolumonas auensis DSM 9187] gi|237500738|gb|ACQ93331.1| von Willebrand factor type A [Tolumonas auensis DSM 9187] Length = 316 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 56/142 (39%), Gaps = 25/142 (17%) Query: 254 PLSNNLNE---VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P + + + +N + T A+ A ++ +E + N K +I Sbjct: 147 PFTQDWQAAGLLLDEVNIGLAGKFTAIGEAITLAVKKTLHEPKPIQN--------KTLIL 198 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--------GQDLLRKCTD 362 ++DG++ NT+ + +G+KIY++ + + + L + + Sbjct: 199 LSDGKD-----SINTIQPTDAAALAKASGLKIYTIGIGSDSTDAEAESDLDETTLEEIAN 253 Query: 363 -SSGQFFAVNDSRELLESFDKI 383 + GQ+F ++L E + +I Sbjct: 254 MTGGQYFRARSEQDLSEIYQQI 275 >gi|150375951|ref|YP_001312547.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] gi|150030498|gb|ABR62614.1| von Willebrand factor type A [Sinorhizobium medicae WSM419] Length = 334 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 64/193 (33%), Gaps = 33/193 (17%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 ++D + + V+ R+G +A+ P + + V+ Sbjct: 116 GNLQARVDAVKTVVADFVDR---------RPYDRLGLVAFGDAPYP--LVPFTMDHATVR 164 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN-TIGSTRLKKFVIFITDGENSGASAY 322 S L T P M L + S S K ++ +TDG ++ + Sbjct: 165 SML--------TGALPGMAGPKTALGDALGLSIKLFQQSQAPDKVLVVLTDGNDTASKMP 216 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRE 375 + E ++I++V + P + L+K + G++F D + Sbjct: 217 PDK-----AAEIASQNHIRIHTVGIGNPDAQGEEKLDTETLQKIATATGGRYFFGQDQQA 271 Query: 376 LLESFDKITDKIQ 388 L E + + D I Sbjct: 272 LAEIYT-LLDSIT 283 >gi|325273881|ref|ZP_08140055.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] gi|324100983|gb|EGB98655.1| von Willebrand factor type A [Pseudomonas sp. TJI-51] Length = 311 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 62/187 (33%), Gaps = 35/187 (18%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-- 270 + ++ ++ + + G PL+ + V++ L++ Sbjct: 62 WKNEDISRLDLVKALLGDFLQDREGDRVGLILFGSQAYLQAPLTFDRRTVRTFLDEAQIG 121 Query: 271 -PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +NT A+ A + L + ++ ITDG N+G + L Sbjct: 122 IAGKNTAIGDAIGLAVKRL----------RQRPAQSRVLVLITDGANNGGQIH-----PL 166 Query: 330 QICEYMRNAGMKIYSVAVSAPPE----------------GQDLLRKCTD-SSGQFFAVND 372 G++IY++ + A PE + LR+ + + G +F +D Sbjct: 167 TAARLAAQEGVRIYTIGIGANPEASGTPGLLGLNPSLDLDEAALREIGEITHGAYFRAHD 226 Query: 373 SRELLES 379 EL Sbjct: 227 GAELDAI 233 >gi|91773456|ref|YP_566148.1| hypothetical protein Mbur_1491 [Methanococcoides burtonii DSM 6242] gi|91712471|gb|ABE52398.1| hypothetical protein with von Willebrand factor type A domain [Methanococcoides burtonii DSM 6242] Length = 1258 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 63/202 (31%), Gaps = 24/202 (11%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S + K+ ++++ + + + + T Sbjct: 840 SSDINVQKWNGDSWNVPYEVEISSSRYYEVSDIV-----YPWKDTTYSSTELNWKEWRAT 894 Query: 251 QCTPLSNN-LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + LSNN L + + ++ + T ++ A EL +S V+ Sbjct: 895 VTSSLSNNSLVHLSNSIDTITADGLTAIDEGLYEANNELSAITGNST-----------VV 943 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--GQDLLRKCTDSSGQF 367 ++DG ++ ++ + IY+V + + L + G++ Sbjct: 944 LMSDGLDNAGHHSL-----IEEALRAKEHNTVIYTVGLGNNEDEVDPILCEIANITGGKY 998 Query: 368 FAVNDSRELLESFDKITDKIQE 389 + +S L + F I +I Sbjct: 999 YFAPNSTVLEDIFIGIASEITN 1020 >gi|297567412|ref|YP_003686384.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296851861|gb|ADH64876.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 319 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 20/154 (12%), Positives = 55/154 (35%), Gaps = 25/154 (16%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + + N + + + T + + + L ++ S +I Sbjct: 138 TEVVAPTTNRQRLLDSVELIGLEFGTAIGEGILTSLQALPPLEQRKDAKDPSELAT--II 195 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS----------------APPEG 353 +TDG + ++ L+ ++I+++ V A Sbjct: 196 LLTDGR------SISGIDPLEAARIAAEQKVRIHTIGVGRVTEGPVPGLESVYQWAAYFD 249 Query: 354 QDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 +D+L++ + G++F VN + +L E++ +++ Sbjct: 250 EDVLKQIAAITGGKYFFVNSAGKLRETYQQLSQS 283 >gi|148226222|ref|NP_001089834.1| hypothetical protein LOC734900 [Xenopus laevis] gi|80477144|gb|AAI08519.1| MGC130922 protein [Xenopus laevis] Length = 840 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 69/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + +++S++ + R+G I Y+ + ++ +VK Sbjct: 583 GEDNFEIVKQFVKGILDSLE-----ISQKAARVGLIQYSTHVRTEFTMAQYSSAKDVKKA 637 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++++ + T A+ + + ++E + + R+ + I TDG Sbjct: 638 VSQIKYMGRGSMTGLALKLMHEKSFSEAQGARARPM--RVPRVAIVFTDGRAQD------ 689 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + E + +G+ IY++ + + + L++ + D + +K Sbjct: 690 --EVSEYAEKAKQSGITIYAIGIGKAIDEE--LQEIASAPQEKHVIYAEDFSAMGYIMEK 745 Query: 383 ITDKIQE 389 + + E Sbjct: 746 LKSSMCE 752 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 53/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + + R+G + Y + +++ + ++ T T A+ +A Sbjct: 83 DIGPDNTRVGLLQYGSTVKNEFSLKTYKRKPDIERAVKRMMHLATGTMTGLAIQYAMNIA 142 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + + +TDG +I RN+G+ I+++ V Sbjct: 143 FSEAEGARPL--NQYVPRIAMIVTDGRPQDP--------VAEIAAKARNSGILIFAIGVG 192 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 L+ F V + + L F Sbjct: 193 R--VDMSTLKTIGSQPHSEHVFLVANFSQIETLTSVFQN 229 >gi|224282379|ref|ZP_03645701.1| hypothetical protein BbifN4_00972 [Bifidobacterium bifidum NCIMB 41171] Length = 1153 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 40/374 (10%), Positives = 108/374 (28%), Gaps = 46/374 (12%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + + T + + I E++ + ++ + Sbjct: 479 TATGTKDGETRTTDSNGVITLKAGQYAVLLGSDAKRITESSKYKVTEINVDQDTYAVSAN 538 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE-NLAISICMVLDVSRSM 160 + + + T G +P ++ + + + N + + L+V+ + Sbjct: 539 GGQVKVTQEKDSATTEPVSVGEVPRITVTNTVVTAPRYRKYIKANNDGTYDLSLNVTGTQ 598 Query: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL 220 + + + S P N +++V + ++ Sbjct: 599 SGSSQTTVSPADIVVVFDT-----------------SGSMSNPMGHNSRLEVAKTAVNSM 641 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT--- 277 + + + K+ ++R+ + + + ++N ++ S +N L TN Sbjct: 642 AQHLLTSENQGKDSNIRMALVPF--STTVGNVSNFTDNAMDIVSAVNGLRADGGTNWEAA 699 Query: 278 --------YPAMHHAYREL------YNEKESSHNTIGSTRLKKFVIFITD------GENS 317 + + +S G+ + D G + Sbjct: 700 LKAANAKLTSGRKGVKKYIVFMSDGDPTFRTSSVRTGTDWWGRPTYDDDDRRGLPAGVHG 759 Query: 318 GASAYQNTLNTLQICEYMRNAG-MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 S+ Q N G ++SV VS+ P + + G +++ + EL Sbjct: 760 SGSSDQYGANLSSAVAEANRRGDATLFSVGVSSDPT--KMRGFADQTKGSYYSATSTDEL 817 Query: 377 LESFDKITDKIQEQ 390 ++F I +I + Sbjct: 818 NKAFADIIGQINRK 831 >gi|47847422|dbj|BAD21383.1| mFLJ00114 protein [Mus musculus] Length = 1188 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 63/180 (35%), Gaps = 16/180 (8%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PY 272 +++ ++ + + + S R + ++ + + + S L+ + Sbjct: 184 STDFEKMLDFVKAVMSQLQRPSTRFSLMQFSDYFRVHFTFNNFISTSSPLSLLDSVRQLR 243 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T T A+ H EL+ + + K +I ITDG G + +++ + Sbjct: 244 GYTYTASAIKHVITELFTTQSGAR-----QDATKVLIVITDGRKQGDNLSYDSVIPMAEA 298 Query: 333 EYMRNAGMKIYSVAVS---APPEGQDLLRKCTDSSGQ--FFAVNDSRELLESFDKITDKI 387 + Y++ V + L+ F+V + L + +++ +KI Sbjct: 299 ASIIR-----YAIGVGKAFYNEHSKQELKAIASMPSHEYVFSVENFDALKDIENQLKEKI 353 >gi|326674791|ref|XP_001922046.3| PREDICTED: collagen alpha-1(XIV) chain [Danio rerio] Length = 1852 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 56/156 (35%), Gaps = 22/156 (14%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKES 294 ++ ++ ++ + + +++ NT T A+ H +++E Sbjct: 1058 TQVAIAQFSDDARTEFKLNSYSDKEALLDAVQRISYKGGNTKTGRAIKHVKEAIFSEDAG 1117 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + K ++ +TDG + +I + M+ G I+++ + G+ Sbjct: 1118 VRR-----GIPKVLVVLTDGRSQDDVN--------KISKEMQMEGYIIFAIGFADADYGE 1164 Query: 355 DLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQ 388 L F V+D ++F KI +++ Sbjct: 1165 --LVNIASKPSERHVFFVDDL----DAFKKIEEQLI 1194 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 49/164 (29%), Gaps = 20/164 (12%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + RIG Y+ + V + L NT + Y L Sbjct: 189 DVGIDKTRIGLAQYSGDPRIEWHLNGFSTKEAVIDAVKNLPYKGG-NTLTGLALTY-VLE 246 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 N + + K I ITDG++ + +R++G++++++ V Sbjct: 247 NSFKPESGARDNI--PKIGILITDGKSQDDVISP--------AQTLRSSGVELFAIGV-- 294 Query: 350 PPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 ++ L+ + V D I + + Sbjct: 295 KNADENELKAIASEPEDTHVYNVADFS----IMGTIVEGLTRTV 334 >gi|55251329|emb|CAH69127.1| novel protein similar to vertebrate matrilin 3 (MATN3) [Danio rerio] Length = 454 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 56/170 (32%), Gaps = 20/170 (11%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNT 277 L + + + + R+ + Y + + +K + ++ P T T Sbjct: 86 FLADMVDT--LDVGPDATRVAVVNYASTVKIEFLLKSHLTKDTIKQAITRIEPLAAGTMT 143 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ A E + EK + S + K I +TDG ++ R Sbjct: 144 GMAIKKAMDEAFTEKSGARPK--SKNISKVAIIVTDGRPQD--------QVEEVSAAARA 193 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 +G++IY+V V L+ + F V +L F + Sbjct: 194 SGIEIYAVGV--DRADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRE 241 >gi|86360183|ref|YP_472072.1| hypothetical protein RHE_PC00139 [Rhizobium etli CFN 42] gi|86284285|gb|ABC93345.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 671 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 40/388 (10%), Positives = 94/388 (24%), Gaps = 36/388 (9%) Query: 17 IDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPT-------TKKDQTSTIFKKQIK 69 D I ++++ + + AA L S + + Sbjct: 130 FDAGEIAALKSKSEDS--AAALGMAKRAAPAAPGVVAQGQLLAEPMAAVAPSPVPPTDGR 187 Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 L+ RE + A ++ D + + A Y +L + Sbjct: 188 AQLQLDPS-RERFANAAVNPIKSVAADPVSTFSADVD-SASYSFVRRSLTAGAMPDPQSV 245 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 + V+ + + P + Sbjct: 246 RVEEMINYFPYDWPGPEKADQPFKATVTVMPTPWNHDTQLMHVAIKGYDIAPATAPHANL 305 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 K+ +L + LVN ++ + + Y Sbjct: 306 V-----FLIDVSGSMDEPDKLPLLKSAFRLLVNKLKADDT--------VSIVTYAGNAGT 352 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 +++ S +++L +T + AY V+ Sbjct: 353 VLEPTRVAEKSKILSAIDRLEAGGSTGGAEGIAAAYDLAKKAFVKDGVNR--------VM 404 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 TDG+ + + + +I E R G+ + + L++ + + Sbjct: 405 LATDGDFNVGPSID--EDLKRIIEEKRKDGIFLTVLGFGRGNLNDSLMQTLAQNGNG--S 460 Query: 370 VNDSRELLESFDKITDKIQEQSVRIAPN 397 L E+ + ++ IA + Sbjct: 461 AAYIDTLAEAQKTLVEEAGSTLFPIAKD 488 >gi|281423276|ref|ZP_06254189.1| BatA protein [Prevotella oris F0302] gi|281402612|gb|EFB33443.1| BatA protein [Prevotella oris F0302] Length = 332 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 24/199 (12%), Positives = 54/199 (27%), Gaps = 39/199 (19%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +++ A + + T + L + ++ Sbjct: 108 NRLEAAKNVASEFIADRPNDNIGLTIFAGEAFTQCPMTTDHVSLINLLQSVRTDI--AAR 165 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L T + +A L S K VI +TDG N+ Sbjct: 166 GLISDG-TAVGMGLANAVSRL----------KDSKAKSKVVILLTDGSNNMGDIS----- 209 Query: 328 TLQICEYMRNAGMKIYSVAVSAP--------------------PEGQDLLRKCT-DSSGQ 366 + + R+ G+++Y++ + L+ + G Sbjct: 210 PMTSAQIARSFGIRVYTIGIGTNKVAPYPMPVAGGIQYVNIPVEIDSKTLKDIAATTEGN 269 Query: 367 FFAVNDSRELLESFDKITD 385 F+ ++R+L + + I Sbjct: 270 FYRATNNRQLKQIYKDIDQ 288 >gi|156396520|ref|XP_001637441.1| predicted protein [Nematostella vectensis] gi|156224553|gb|EDO45378.1| predicted protein [Nematostella vectensis] Length = 177 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 A + + + + + + RIG + Y+ G N + K Sbjct: 8 ASGSVRANRFKMCLNFINKLVNSFHIGP--HNTRIGIVRYSTRPSGIFRFTSYRNKHSTK 65 Query: 264 SRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 R+N++ T T A+++A R LY ++K +I +TDG++ + Sbjct: 66 HRVNRIRYTGGWTRTGAAINYARRYLYQHNRRR-------GVRKVLIVMTDGKSQDS--- 115 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + ++ G++++++ + + L + R+L + K Sbjct: 116 -----VVGASRSVKRMGIEVFAIGIGRG-YRRSELNQMATDRNHVLTAR-FRDLHKIIGK 168 Query: 383 ITDKIQE 389 I + + Sbjct: 169 IKYRACK 175 >gi|148697327|gb|EDL29274.1| procollagen, type XIV, alpha 1, isoform CRA_c [Mus musculus] Length = 1802 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 1043 VDGSWSIGDDNFNKIINFLYSTVGALDKIGADGT----QVAMVQFTDDPRTEFKLDSYKT 1098 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ H L+ + + K ++ ITDG + Sbjct: 1099 KETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTSDSGTRR-----GIPKVIVVITDGRSQ 1153 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L + F V+D Sbjct: 1154 DDVN--------KISREMQADGFNIFAIGV--ADADYSELVQIGSKPSSRHVFFVDDF-- 1201 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1202 --DAFKKIEDELITFVCETAS 1220 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 54/168 (32%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + RIG Y+ N +EV + L NT T A++ + Sbjct: 192 TAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIF 251 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + K I ITDG++ + +R +G++++++ Sbjct: 252 ENSFK-----PEAGSRSGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 298 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V L++ + V + + + +T + + Sbjct: 299 GV--KNADLSELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 344 >gi|226423922|ref|NP_851794.3| collagen alpha-1(XIV) chain [Mus musculus] gi|148697325|gb|EDL29272.1| procollagen, type XIV, alpha 1, isoform CRA_a [Mus musculus] gi|219519338|gb|AAI45245.1| Collagen, type XIV, alpha 1 [Mus musculus] gi|223459902|gb|AAI38346.1| Collagen, type XIV, alpha 1 [Mus musculus] Length = 1794 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 1035 VDGSWSIGDDNFNKIINFLYSTVGALDKIGADGT----QVAMVQFTDDPRTEFKLDSYKT 1090 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ H L+ + + K ++ ITDG + Sbjct: 1091 KETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTSDSGTRR-----GIPKVIVVITDGRSQ 1145 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L + F V+D Sbjct: 1146 DDVN--------KISREMQADGFNIFAIGV--ADADYSELVQIGSKPSSRHVFFVDDF-- 1193 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1194 --DAFKKIEDELITFVCETAS 1212 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 54/168 (32%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + RIG Y+ N +EV + L NT T A++ + Sbjct: 187 TAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIF 246 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + K I ITDG++ + +R +G++++++ Sbjct: 247 ENSFK-----PEAGSRSGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 293 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V L++ + V + + + +T + + Sbjct: 294 GV--KNADLSELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 339 >gi|148697326|gb|EDL29273.1| procollagen, type XIV, alpha 1, isoform CRA_b [Mus musculus] Length = 1093 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 386 VDGSWSIGDDNFNKIINFLYSTVGALDKIGADGT----QVAMVQFTDDPRTEFKLDSYKT 441 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ H L+ + + K ++ ITDG + Sbjct: 442 KETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTSDSGTRR-----GIPKVIVVITDGRSQ 496 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L + F V+D Sbjct: 497 DDVN--------KISREMQADGFNIFAIGV--ADADYSELVQIGSKPSSRHVFFVDDF-- 544 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 545 --DAFKKIEDELITFVCETAS 563 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 54/168 (32%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + RIG Y+ N +EV + L NT T A++ + Sbjct: 187 TAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIF 246 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + K I ITDG++ + +R +G++++++ Sbjct: 247 ENSFK-----PEAGSRSGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 293 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V L++ + V + + + +T + + Sbjct: 294 GV--KNADLSELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 339 >gi|146345398|sp|Q80X19|COEA1_MOUSE RecName: Full=Collagen alpha-1(XIV) chain; Flags: Precursor Length = 1797 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 1038 VDGSWSIGDDNFNKIINFLYSTVGALDKIGADGT----QVAMVQFTDDPRTEFKLDSYKT 1093 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ H L+ + + K ++ ITDG + Sbjct: 1094 KETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTSDSGTRR-----GIPKVIVVITDGRSQ 1148 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L + F V+D Sbjct: 1149 DDVN--------KISREMQADGFNIFAIGV--ADADYSELVQIGSKPSSRHVFFVDDF-- 1196 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1197 --DAFKKIEDELITFVCETAS 1215 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 54/168 (32%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + RIG Y+ N +EV + L NT T A++ + Sbjct: 187 TAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIF 246 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + K I ITDG++ + +R +G++++++ Sbjct: 247 ENSFK-----PEAGSRSGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 293 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V L++ + V + + + +T + + Sbjct: 294 GV--KNADLSELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 339 >gi|30420885|gb|AAO64442.1| collagen type XIV [Mus musculus] Length = 1797 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 1038 VDGSWSIGDDNFNKIINFLYSTVGALDKIGADGT----QVAMVQFTDDPRTEFKLDSYKT 1093 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ H L+ + + K ++ ITDG + Sbjct: 1094 KETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTSDSGTRR-----GIPKVIVVITDGRSQ 1148 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L + F V+D Sbjct: 1149 DDVN--------KISREMQADGFNIFAIGV--ADADYSELVQIGSKPSSRHVFFVDDF-- 1196 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1197 --DAFKKIEDELITFVCETAS 1215 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 54/168 (32%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + RIG Y+ N +EV + L NT T A++ + Sbjct: 187 TAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIF 246 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + K I ITDG++ + +R +G++++++ Sbjct: 247 ENSFK-----PEAGSRSGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 293 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V L++ + V + + + +T + + Sbjct: 294 GV--KNADLSELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 339 >gi|262196446|ref|YP_003267655.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262079793|gb|ACY15762.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 903 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 34/160 (21%), Positives = 64/160 (40%), Gaps = 20/160 (12%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 +A++ ++N + + + +L TN YPA+ AY L Sbjct: 497 VVAFDNQPTTIVRLQRASNRMRIATDIARLQAGGGTNIYPALREAYEILQGANAKV---- 552 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 K VI ++DG+ +C+ MR+A + + +V + ++LL Sbjct: 553 ------KHVIVLSDGQAPYDGIAD-------LCQEMRSARITVSAVGIG--DADRNLLNL 597 Query: 360 CTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 TD+ G+ + +D L F K T + Q ++ +P R Sbjct: 598 ITDNGDGRLYMTDDLAALPRIFMKETTEAQRSALVESPVR 637 >gi|77459433|ref|YP_348940.1| von Willebrand factor, type A [Pseudomonas fluorescens Pf0-1] gi|77383436|gb|ABA74949.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 359 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 28/146 (19%), Positives = 55/146 (37%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ A + L + +I Sbjct: 150 PLTFDRRTVRVWLDEARIGIAGKNTAIGDAIGLALKRL----------RMRPAQSRVLIL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+G ++ L + + G+KIY + + A PE + Sbjct: 200 VTDGANNGG-----EIDPLTAAKLAASEGVKIYPIGIGADPEESGATALLGGNPTLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L+ + + G++F D ++L Sbjct: 255 PALKAIAEVTGGRYFRARDGKQLQAI 280 >gi|301617277|ref|XP_002938060.1| PREDICTED: matrilin-4-like [Xenopus (Silurana) tropicalis] Length = 721 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 28/147 (19%), Positives = 55/147 (37%), Gaps = 15/147 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEK 292 + R+G + Y+ + +N ++++ +N++ P T T A+ +A + E+ Sbjct: 67 STTRVGVVQYSSQVQTVFSLKTFSNKSDMEKAINEIIPLAQGTMTGLAIQYAMNVAFTEE 126 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 E + S + + I +TDG ++ R AG++IY+V V Sbjct: 127 EGARPL--SKNIPRVAIIVTDGRPQD--------RVTEVAVQAREAGIEIYAVGVQRADV 176 Query: 353 GQDLLRKCTDSS--GQFFAVNDSRELL 377 LR F V + Sbjct: 177 SS--LRAMASHPLDDHVFHVESFDLIQ 201 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 56/165 (33%), Gaps = 18/165 (10%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYN 290 IG + Y+ + N ++K+ + + + T T A+ H + ++ Sbjct: 499 SAQGTHIGLVQYSSRVRTEFPLSQYTNGQDIKTAVKNIQYMEKGTMTGLALKHMVEQSFS 558 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 E E + + K + TDG + + + + AG+ +Y+V V Sbjct: 559 EAEGAR-----KNVPKIGLVFTDGRSQDDISEW--------AKKAKEAGITMYAVGVGKA 605 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVR 393 +D L + F D + + + I + ++ Sbjct: 606 V--EDELNEIASDPVNKHSFYTADFSTMNLIAEDLKLNICPEEIK 648 >gi|85374478|ref|YP_458540.1| hypothetical protein ELI_08255 [Erythrobacter litoralis HTCC2594] gi|84787561|gb|ABC63743.1| hypothetical protein ELI_08255 [Erythrobacter litoralis HTCC2594] Length = 626 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 41/290 (14%), Positives = 76/290 (26%), Gaps = 44/290 (15%) Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL--LPPPPKKSFWSKNT 194 + + + + + + LP Sbjct: 345 AMDISGFKTGTSVSTRTGSNGGWVSSSWTNGCIEERQTIAASSFDPLPTGAHDLDIDLVP 404 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 SK P +I +++ + R + Sbjct: 405 DSSKPETQWYPMWPQITYDR----GGPATLETTDDKPT----RGYNCPNRTHKLQEYLLN 456 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE---KESSHNTIGSTRLKKFVIFI 311 S + SR+N L+P T M A R + + + + + + VIF+ Sbjct: 457 GSARNADFVSRINALSPKGGTMHDIGMIWAGRLISPDGIFAADNASAPNGDPISRHVIFM 516 Query: 312 TDGE--------NSGASAYQN---------------------TLNTLQICEYMRNAGMKI 342 TDGE + + + L IC+ +RN + I Sbjct: 517 TDGEMGASPSNTTAYGNYDMDGRMAGFAASGSWTENQLAAIHNLRLEAICKAIRNKNVTI 576 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 +S+A P + C + + +S EL F I I E + Sbjct: 577 WSIAFGLPHSAYT--QGCATGTSRALTAANSSELDSRFRDIAGSIAELRL 624 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 32/265 (12%), Positives = 67/265 (25%), Gaps = 42/265 (15%) Query: 9 CFLFITYAI-----------DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK 57 +I D + + ++++Q A DAA L+ + Sbjct: 1 MMAIGAASIIPLVGVVGGGVDASRMYLAKSRLQQACDAATLAARKELAGSSISNGTIPA- 59 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + G Y N G T Q + A Sbjct: 60 -NIQDKADNFFDTNFPSGMYGTTNVGY---------TLSAGTATQMDGAATASV-----P 104 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ-----KHNDNN 172 L + ++++ + ++ I + +VLD+S SM N Sbjct: 105 TTLMKVFNVPQIDIAVNCSAELDL----PNIDVVLVLDMSGSMNSNGTTGSKRITALKNA 160 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 + ++ P + A + + L +S +E Sbjct: 161 VFSFYDVVMAAKPAGTRVRIGIVPYNGAVSVGDELLTLST-TTGIDYLADSWDYQTREP- 218 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSN 257 + ++ N G+ +N Sbjct: 219 ----KWKQVSNNDGVEEGDILSETN 239 >gi|326676330|ref|XP_003200547.1| PREDICTED: collagen alpha-1(XIV) chain-like [Danio rerio] Length = 1164 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 66/189 (34%), Gaps = 23/189 (12%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + D + L ++ Q ++ ++ ++ E+ Sbjct: 456 GSWSIGDDNFQKIIRFLHSTAGALDQ-IGPDGTQVAIAQFSDDARTEFSLSSHSSKEELL 514 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + +++ NT T AM H ++ + + K ++ +TDG + Sbjct: 515 TAIQRVSYKGGNTKTGRAMKHVKDSVF-----APVGGARRGVPKVLVVLTDGRSQD---- 565 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESF 380 + LQ+ + ++ G ++++ + G+ L G F V+D ++F Sbjct: 566 ----DVLQVSQELQAEGYIVFAIGFADADYGELL--SIASRPGDRHVFFVDDL----DAF 615 Query: 381 DKITDKIQE 389 I + I + Sbjct: 616 RTIVENILQ 624 >gi|41409532|ref|NP_962368.1| hypothetical protein MAP3434 [Mycobacterium avium subsp. paratuberculosis K-10] gi|81570937|sp|Q73UD4|Y3434_MYCPA RecName: Full=UPF0353 protein MAP_3434 gi|41398363|gb|AAS05984.1| hypothetical protein MAP_3434 [Mycobacterium avium subsp. paratuberculosis K-10] Length = 330 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P + L KL+ ++T T A+ A + G T Sbjct: 143 TPYLLVPPTPQHQATIDALKKLDFADSTATGEAIFTALHAISATA----VAGGDTPPPAR 198 Query: 308 VIFITDGENSGASAYQNTLN-TLQICEYMRNAGMKIYSVAVSAPPE-------------G 353 ++ ++DG + S + + ++ G+ I ++ Sbjct: 199 IVLLSDGGENKPSNPSDPHDGVYTAARLAKDEGVPISTITFGTKGGEIEMDGQKVAVPVS 258 Query: 354 QDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D ++ S GQ + + EL +S++ I ++I ++V Sbjct: 259 TDQMKMVAKLSGGQSYTATNLGELQKSYNAIENEIGYRTV 298 >gi|311106403|ref|YP_003979256.1| von Willebrand factor type A domain-containing protein 2 [Achromobacter xylosoxidans A8] gi|310761092|gb|ADP16541.1| von Willebrand factor type A domain protein 2 [Achromobacter xylosoxidans A8] Length = 340 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 67/200 (33%), Gaps = 44/200 (22%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK---S 264 ++ + + ++ R+G I + PL+ + +K Sbjct: 118 DRLSGVKAVVADFIDR---------RQDDRLGLIVFGTAAYPQA--PLTQDHATLKLLLG 166 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 +++ NT A+ A ++ + E + +I +TDG ++G++ + Sbjct: 167 QVSTRMAGPNTAIGDAIGVAIKQFEHAGE----------HDQVLILLTDGNDTGSAVPPD 216 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELL 377 + +++V + P D LR + G+FF D L Sbjct: 217 R-----AASMAAARHIVVHTVGIGDPQAEGEEKVDFDALRAIAAKTGGRFFPAQDQASLR 271 Query: 378 ESFDKITDKIQEQSVRIAPN 397 + + ++ RI P+ Sbjct: 272 QVYAELD--------RITPH 283 >gi|118355467|ref|XP_001010993.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89292760|gb|EAR90748.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 2033 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 17/124 (13%), Positives = 42/124 (33%), Gaps = 9/124 (7%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N ++ + TN + A+ L + + ++DG++ Sbjct: 1662 DNKENLQKITKSIYANGGTNITSGLQTAFSILQS--------RKQRNSVSSIFLLSDGQD 1713 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + + L + ++ I+S +G + R G F+ V + ++ Sbjct: 1714 NNSDSRIRNLLQTTY-QQLQEECFTIHSFGFGNDHDGPLMQRIAQIKDGSFYYVERNDQV 1772 Query: 377 LESF 380 E F Sbjct: 1773 DEFF 1776 >gi|296227292|ref|XP_002759309.1| PREDICTED: collagen alpha-1(XIV) chain [Callithrix jacchus] Length = 1796 Score = 72.2 bits (175), Expect = 1e-10, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 66/201 (32%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 1037 VDGSWSIGDDNFNKIINFLYSTVGALNKIGTDGT----QVAMVQFTDDPRTEFKLNAYKT 1092 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + +++ NT T A+ + L+ + + + K ++ ITDG + Sbjct: 1093 KETLLDAIKRISYKGGNTKTGKAIKYVRDTLFTAESGTRR-----GIPKVIVVITDGRSQ 1147 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I + M++ G I++V V L F V+D Sbjct: 1148 DDVN--------KISKEMQSDGYSIFAVGV--ADADYSELVSIGSKPSARHVFFVDDF-- 1195 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1196 --DAFKKIEDELITFVCETAS 1214 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 57/168 (33%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + RIG Y+ + +EV + L NT T A+++ + Sbjct: 186 TAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIF 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + T + K I ITDG++ + +R +G++++++ Sbjct: 246 ENSFK-----PEAGSRTGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V + + + +T + + Sbjct: 293 GV--KNADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 338 >gi|299140484|ref|ZP_07033622.1| BatA protein [Prevotella oris C735] gi|298577450|gb|EFI49318.1| BatA protein [Prevotella oris C735] Length = 332 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 24/199 (12%), Positives = 54/199 (27%), Gaps = 39/199 (19%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +++ A + + T + L + ++ Sbjct: 108 NRLEAAKNVASEFIADRPNDNIGLTIFAGEAFTQCPMTTDHVSLINLLQSVRTDI--AAR 165 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L T + +A L S K VI +TDG N+ Sbjct: 166 GLISDG-TAVGMGLANAVSRL----------KDSKAKSKVVILLTDGSNNMGDIS----- 209 Query: 328 TLQICEYMRNAGMKIYSVAVSAP--------------------PEGQDLLRKCT-DSSGQ 366 + + R+ G+++Y++ + L+ + G Sbjct: 210 PMTSAQIARSFGIRVYTIGIGTNKVAPYPMPVAGGIQYVNIPVEIDSKTLKDIAATTEGN 269 Query: 367 FFAVNDSRELLESFDKITD 385 F+ ++R+L + + I Sbjct: 270 FYRATNNRQLKQIYKDIDQ 288 >gi|320323259|gb|EFW79347.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. B076] Length = 352 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 51/146 (34%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ + L + ++ Sbjct: 150 PLTYDRRTVRFWLDEAKIGIAGKNTALGDAIGLGLKRL----------RLRPATSRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY + + A P+ + Sbjct: 200 VTDGANNAGQ-----IDPITAARLAAEEGVKIYPIGIGADPDKDALQSVLGLNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 255 PTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|312135597|ref|YP_004002935.1| von willebrand factor type a [Caldicellulosiruptor owensensis OL] gi|311775648|gb|ADQ05135.1| von Willebrand factor type A [Caldicellulosiruptor owensensis OL] Length = 667 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 40/303 (13%), Positives = 84/303 (27%), Gaps = 41/303 (13%) Query: 106 ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL 165 S + + K I + + L+ T + +S+ +V+D S SM Sbjct: 34 TSSVSISLSGGPVKSKSYIGND-IEVKLKLTPAGSINVSRSPVSVVLVIDSSGSMSASSK 92 Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 N + + + S Sbjct: 93 MTAAKNAAKNLIDSFKNSAKSGDKLGIVDFDTFVNDNSNFYVKGFYLQNGSWQ------- 145 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 + + ++ N ++ +N K+ ++ +N TN A++ A Sbjct: 146 ----KGNSSTIYGPYSLPNTCTSSLLDLTNTSAINSAKNLIDNMNASGGTNMEAALNKAK 201 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN-----------------T 328 L + K+VIFITDG + + T Sbjct: 202 NLLNASPSGN---------DKYVIFITDGMPTFYLNGTHNGYPLVDGPGLQPNNTTKSET 252 Query: 329 LQICEYMRNAGMKIYSVAVSA--PPEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 L + + +G K++ V V + + + +G+ + ++ + L I Sbjct: 253 LSAVQSLSQSGTKLFVVGVDTTGADVDKTFIELMASTANGKSYYISSTNALNSILQDIFK 312 Query: 386 KIQ 388 I Sbjct: 313 IIN 315 >gi|269968855|ref|ZP_06182838.1| hypothetical protein VMC_42680 [Vibrio alginolyticus 40B] gi|269826535|gb|EEZ80886.1| hypothetical protein VMC_42680 [Vibrio alginolyticus 40B] Length = 356 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 26/150 (17%), Positives = 62/150 (41%), Gaps = 15/150 (10%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + E+ ++ + ++T+ A+ A + +SS Sbjct: 159 GDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSDKSSGALEQDQ 218 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------EGQD 355 +K I +TDG ++G + + + + + G++++ +A+ P + + Sbjct: 219 NREKVAIVLTDGNDTG-----SFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMET 273 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + R +S G+ F + EL ++D+I Sbjct: 274 IHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|327269537|ref|XP_003219550.1| PREDICTED: matrilin-2-like [Anolis carolinensis] Length = 809 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 67/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ E +++S+ + + RIG + Y+ + + ++K Sbjct: 589 GVNNFEIVKEFVLGILDSLTISPKAA-----RIGLLQYSTQVRTEFTLKQFSTATDMKKA 643 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++++ + T A+ + E E + + S ++ + + TDG + Sbjct: 644 VSQMKYMGKGSMTGLALKQMTERSFTEAEGARHL--SAKVPRVCVVFTDGRAQDEVSEW- 700 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ +Y++ + ++ LR+ F D + E +K Sbjct: 701 -------AAKAKQRGITMYAIGIGKAI--EEELREIASDPPEKHLFYAEDFSAMGEITEK 751 Query: 383 ITDKIQE 389 + ++ E Sbjct: 752 LQKRMCE 758 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 54/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G I Y + +++ + ++ T T A+ A Sbjct: 50 DIRPDVTRVGLIQYGSTVKNEFSLKTFARKQDMERAVRRMMYLSTGTMTGLAIQFAVNIA 109 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + S + + ++ +TDG +I RN+G+ I+++ V Sbjct: 110 FSETEGARPL--SQNVPRVIMIVTDGRPQDP--------VAEIAAKARNSGILIFAIGVG 159 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 160 R--VDMNTLKSIGSEPYEDHVFLVANFSQIETLTSVFQN 196 >gi|156408065|ref|XP_001641677.1| predicted protein [Nematostella vectensis] gi|156228817|gb|EDO49614.1| predicted protein [Nematostella vectensis] Length = 1418 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 73/197 (37%), Gaps = 16/197 (8%) Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 +T+ S + D +L++ I+ + + V GT A Sbjct: 9 DTSGSLQYWSGGGWKNGFDDEKVFVNSLLSHIRVSYKSTYVSVVLFGTSATIDINYIFNP 68 Query: 253 TPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 P +N+ + + L TN + A AY ++ + S + + ++K V + Sbjct: 69 HP-NNHKCNFRRDFSNLRFRSGMTNMHDAFQAAYDIIF--GKYSGHKRPTHQVKTAVFLL 125 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDG+ + I + +++ G++I+++ V + LR +F N Sbjct: 126 TDGQWNWNGDPWP------IAKRLKDRGIEIFTIGV-TNGVNVNTLRSLAS-PNNYFHYN 177 Query: 372 DSRELLESFDKITDKIQ 388 D + F ++ I+ Sbjct: 178 DFTQ----FRELATCIR 190 >gi|317406818|gb|EFV86930.1| von Willebrand factor type A domain-containing protein [Achromobacter xylosoxidans C54] Length = 252 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 65/195 (33%), Gaps = 37/195 (18%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK-----SRLNKL 269 + +V + R+G I + PL+ + ++ + + Sbjct: 13 SAVQAVVADFITRRGD-----DRLGLIVFGTAAYPQA--PLTLDHAALQLLLRHTAVG-- 63 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 NT A+ A R L K +I +TDG ++G + + Sbjct: 64 MAGPNTAIGDAIGLAIRMLDAV----------DEPDKVLILLTDGNDTG-----SAVPPQ 108 Query: 330 QICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCT-DSSGQFFAVNDSRELLESFDK 382 + ++I+++ + P DLL + G+FF ND L + + Sbjct: 109 RAATLAAQHHIRIHTIGMGDPQARGDDKVDFDLLEHIAQATGGRFFQANDRESLQQVYAT 168 Query: 383 ITDKIQEQSVRIAPN 397 + D+I + V + Sbjct: 169 L-DQITPRRVDTLSH 182 >gi|291388581|ref|XP_002710673.1| PREDICTED: collagen, type XXII, alpha 1 [Oryctolagus cuniculus] Length = 1571 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 73/205 (35%), Gaps = 24/205 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ ++ Sbjct: 45 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPEHTRVGVVRYSDRPSTAFELGHFSSRE 99 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK+ ++ NTNT A+ ++ + G ++ I +TDG + Sbjct: 100 EVKAAARRIAYHGGNTNTGDALRFITARSFS--PQAGGRPGDRAFQQVAILLTDGRSQDL 157 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELL 377 G++I++V V ++ L + +S F V+D + Sbjct: 158 VRDAAAAAHAA--------GIRIFAVGVG--EALREELHEIASEPTSAHVFHVSDFDAID 207 Query: 378 ESFDKITDKIQEQS----VRIAPNR 398 + K+ ++ E VRI +R Sbjct: 208 KIRGKLRRRLCENVLCPSVRIEGHR 232 >gi|260778728|ref|ZP_05887620.1| hypothetical protein VIC_004132 [Vibrio coralliilyticus ATCC BAA-450] gi|260604892|gb|EEX31187.1| hypothetical protein VIC_004132 [Vibrio coralliilyticus ATCC BAA-450] Length = 463 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 48/457 (10%), Positives = 113/457 (24%), Gaps = 78/457 (17%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQM-------------------QSALDAAVLSGC 41 ++ I L AID ++ ++ +A DAA+ Sbjct: 20 ISVIAAPFLILATGTAIDSGRAFLVKAKLFAAVDAAGIAAARAVAEGEDAARDAAIKFYN 79 Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 A++ +D + I L + + + + + I+ Sbjct: 80 ANLPTDYHDSTTASPTVTFGYDSFGNISIDLSASAEVSTTFLGVFGHSSLEISATAQ--- 136 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + +N L + + ++ N + ++ + E Sbjct: 137 --TVRRPVDLVLVVDNTTSLRLGSIGDVTDDVVARSKDFITNFNESFDRIALVKYAYGAE 194 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK-----------YAPAPAPANRKI 210 + + ++ + + S S T S PA + Sbjct: 195 VPVAFQSSRGHSRSDITTEIDAFDFGSLSSLQYTNSSEGIYLALDALRNVTDPANLKVIV 254 Query: 211 DVLIESAGNLVNSIQKAIQEKK--------NLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + + S + + Q +++ Sbjct: 255 FFTDGAPNTFASEFDFVGTSTNYTGSIRSSDESSGTPRGLWYHDAIATQLPGSGYEGSDI 314 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L P T + A + L T + D Sbjct: 315 DDYIAEL-PDYYTA-HSASATEFNVLNPSHPDRPVTQYNPSTHSA----NDLYIRVNRVA 368 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSA---------PPEGQDLLRKCTDSS--------- 364 +N L I R + ++++ + + +G+DLL + + Sbjct: 369 RNLL--EDIATAARGEDIYVFTLGLGSSLTSSTGPDNEQGEDLLMRMANDPRMLDDADLA 426 Query: 365 ---------GQFFAVNDSRELLESFDKITDKIQEQSV 392 G + D L FD++ D I ++ Sbjct: 427 GDFRADQLQGVYCHAVDEEALGPCFDEMLDVIIRLTL 463 >gi|153833207|ref|ZP_01985874.1| conserved hypothetical protein [Vibrio harveyi HY01] gi|148870478|gb|EDL69393.1| conserved hypothetical protein [Vibrio harveyi HY01] Length = 515 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 25/265 (9%), Positives = 76/265 (28%), Gaps = 23/265 (8%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +++ F +A++ + ++++ + +AA ++ + + Sbjct: 22 MGLLLVPIMGFTFWAVEGTRYVQETSRLRDSAEAAAMAVTIE-------DQAGSASTLAA 74 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + ++ + +A + +QY +K ++ + F+ Sbjct: 75 KYVESYVRDIKSMNVSAQRYYRAADDRAGV------LEYIQYTVNAKTTHDSWFASSFIP 128 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY------LQKHNDNNNMT 175 + + I I V D S SM + + Sbjct: 129 SFDEQQDLAGRSLAR-KYPAYLADNNIDIVFVSDFSGSMREQWGFNRHIKIDDLKTAISQ 187 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + +L ++ + + K + D L+ G + +++ N + Sbjct: 188 ISNNILCTSTRQEYVD---GEWKDVCDEPGEDTTSDKLLNRVGFVPYNVRTREIIGWNQA 244 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLN 260 + Y G + N+++ Sbjct: 245 NTTSQLNYTNGYNTHLSPYTYNDID 269 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 33/266 (12%), Positives = 79/266 (29%), Gaps = 17/266 (6%) Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 ++ L + +S + + + T N + Sbjct: 242 NQANTTSQLNYTNGYNTHLSPYTYNDIDWDYWSRFSQTQVFNCANNQIFCPRPGWNDQQY 301 Query: 198 KYAPAPAPANRKIDVLI-ESAGNLVNSIQKAIQEKKNLSVRIGTIAY--------NIGIV 248 A + V + +L +S+ +K + Sbjct: 302 ARRIADVIRLDQYQVADVYNYVDLPSSVSTMFTDKSARRSNYYGVNQSQLFNGHGRNDSR 361 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK--- 305 L+N L+++ + +N + T + + + L+ +S + Sbjct: 362 QFYNLRLTNKLSDL-NPINAMWADGGTAAFQGILRGAQILHEGDPNSSDQEEQQAYNKKI 420 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCTDS 363 K ++ ++DG+ S + L +C+ R G+ I + + Q + C Sbjct: 421 KMLLILSDGQESPNNGILKGLVDWGMCDKARQEIPGLYIGVIGIDFRASQQSGFQDCVVD 480 Query: 364 SGQFFAVNDSRELLESFDKITDKIQE 389 + + D L E +KI + I++ Sbjct: 481 PRE--DIIDVSNLDELIEKIEELIRK 504 >gi|77920224|ref|YP_358039.1| von Willebrand factor type A domain-containing protein [Pelobacter carbinolicus DSM 2380] gi|77546307|gb|ABA89869.1| von Willebrand factor type A domain protein [Pelobacter carbinolicus DSM 2380] Length = 442 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 56/154 (36%), Gaps = 10/154 (6%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y+ + +++ ++++R+ ++ P +T + A+ E+ ++ + Sbjct: 105 LVVYDDSVETLVPAQPVSDIGDIEARIRRIRPGGSTALFGAVSQGAAEVRKHSDAPYVNR 164 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 V+ ++DG + + L L + G+ + +V V + + Sbjct: 165 --------VVLLSDGLANVGPSRPADLARLGAA--LLKEGISVTTVGVGTDFNEDLMTQL 214 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G + V SR+L F + R Sbjct: 215 AERSDGNHYFVESSRDLPRIFAAELGDVLSVVAR 248 >gi|77359908|ref|YP_339483.1| von Willebrand factor type A [Pseudoalteromonas haloplanktis TAC125] gi|76874819|emb|CAI86040.1| conserved protein of unknown function; putative Von Willebrand factor type A domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 328 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 32/272 (11%), Positives = 82/272 (30%), Gaps = 52/272 (19%) Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 S + + + L + + + + +L S Sbjct: 44 TSQSVEAHARRLTPFEWVIWLLLVIAAANPTWLDDPISMPNEGRDIMLAVDLSGSM---- 99 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 + ++ ++ + + R+G I + T Sbjct: 100 --TEQDMAYNGQYVDRLTMVKAVLTDFIEQ---------RQGDRLGLILFGDTAFLQ--T 146 Query: 254 PLSNNLNEVKSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ ++ V L+ ++ T A+ + + + + V+ Sbjct: 147 PLTRDVKTVSKMLSEAQIGLVGRATAIGDALGLSVKRF----------ANKDKSNRIVVL 196 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG+N+ + L + R+AG+K+Y++ V + + Sbjct: 197 LTDGQNTAGN-----LKPEEALLLARDAGIKVYTIGVGSDNPRGFSLFNMGGMSGDTIDE 251 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 LL++ + + G +F D L + + ++ Sbjct: 252 GLLKRIAEQTGGLYFRAKDVAGLQQIYAELDK 283 >gi|192289227|ref|YP_001989832.1| hypothetical protein Rpal_0799 [Rhodopseudomonas palustris TIE-1] gi|192282976|gb|ACE99356.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 468 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 55/468 (11%), Positives = 112/468 (23%), Gaps = 93/468 (19%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +TA+++ + +D + + Q+ +D Sbjct: 20 ITALVMIPIIFLLGMTLDFTQALRKKQQL-------------DAAADAAAIAAVRPAMLM 66 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 T Q + S A + IT K Y + N F Sbjct: 67 QTDAVAQNTAYAIFMSTANRLASGLTSVPTPTITITDVGL---QRTVKVSYNAASLNNFP 123 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS---- 176 + L+ + +S ST ++ +++D S SM N ++ Sbjct: 124 QLLMNNVSWAISGASTAQAS---SAPNMNFYLLMDDSPSMGIGATATDISNLIASTAPKY 180 Query: 177 ------NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 T + +ID++ + L+N+ Q Sbjct: 181 QKASSSQNCGFACHETNIAHDGGTKDNLAIARANNITLRIDLVTSAVNQLLNTWSNCPQS 240 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN---------------- 274 + V A N +LN + S Sbjct: 241 GVSGGVMQCMSALNNTTYRAALYTFDLSLNTLASLTTPTTAGAQVSNIALMPVAYQNCVV 300 Query: 275 ------TNTYPAMHHAYRELYN--EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 T+ + A L + ++ V +TDG ++ Sbjct: 301 PTTNCKTDNGTDIAGALTSLNGIMPSPGLGSNASGDTPQEVVFLVTDGVEDKIASSCPNG 360 Query: 327 NT-----------LQICEYMRNAGMKI---YSVAVSA----------------PPEGQ-- 354 + IC ++ G+KI Y+ + + Sbjct: 361 SYASYSRCQQPLDTAICTTIKKRGIKIAILYTEYLQLKTPNVPVTDTWYMSWIDAYDEPT 420 Query: 355 -------DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 L+ C G F V + ++ + K+ + + Sbjct: 421 SSTGAIAKNLQACAS-PGFFSNVQTGGNITQALTDLFLKVASSTASLT 467 >gi|238060187|ref|ZP_04604896.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237881998|gb|EEP70826.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 779 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 46/346 (13%), Positives = 89/346 (25%), Gaps = 11/346 (3%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + + S I ++ + + G + + + + Sbjct: 443 RDLAEADLGGAGDARARDMSAIVVQEELLYQYNEGQLSPIPGQKPRVPLVAVHPSEGTFN 502 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + + + + RS + S +V V E Sbjct: 503 LDHPYVVLPSADQAQQAAAQDFLAFLQDDPQQRSFTDLGFRDHERRASDALVAAVRGGPE 562 Query: 162 DLYLQ-KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL 220 + + KK+ S A + V +A Sbjct: 563 GQLTYFDPPAPEVVDAILRGWSTLRKKANILLAVDTSGSMNARVGGRTRFQVATTAADRA 622 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 V + A + T + + RL L NT Y Sbjct: 623 VGLLNSADRVALWSFS-SETDQRRGKPYSEEIRLGPYDRAAFTRRLTGLRVGGNTALYAT 681 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + A+R L + + V+ +TDG+ + + L E N + Sbjct: 682 VRAAHRRLLDNHDPDRI--------NAVVVLTDGK-NEYPRDNDLDRLLADIELDPNRPV 732 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 K++ VA + L R S+G+ F D + E+F K+ Sbjct: 733 KVFCVAFDRESDLAALDRIAGASAGKAFDATDPATIDEAFVKLVSS 778 >gi|293334601|ref|NP_001168718.1| hypothetical protein LOC100382510 [Zea mays] gi|223950381|gb|ACN29274.1| unknown [Zea mays] Length = 629 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 57/185 (30%), Gaps = 26/185 (14%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSR 265 K+ +L + G ++ ++ + R+ IA++ + + Sbjct: 198 TKLALLKRAMGFVIQNLGSSD--------RLSVIAFSSSARRLFPLRRMTESGRQQSLLA 249 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N L TN + + + S +I ++DG+++ + Sbjct: 250 VNSLTANGGTNIAEGLRKGSKVIEE--------RQSKNPVCSIILLSDGQDTYTVSPTAG 301 Query: 326 --LNTLQICEYM------RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + C + + + ++ A + L S G F + + Sbjct: 302 VHKGATEYCALLPSTTTNGSQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEATIQ 361 Query: 378 ESFDK 382 ++F + Sbjct: 362 DAFAQ 366 >gi|311253580|ref|XP_003125597.1| PREDICTED: matrilin-2-like [Sus scrofa] Length = 423 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + +++S+ + + R+G + Y+ + + ++K Sbjct: 127 GEENFEIVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVRTEFTLRNFGSAKDMKKA 181 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + S R+ + I TDG + Sbjct: 182 VASMKYMGKGSMTGLALKHMFERSFTQIEGARPL--SARVPRVAIVFTDGRAQDDVSEW- 238 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ + F D + E DK Sbjct: 239 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTDKHLFYAEDFSTMGEISDK 289 Query: 383 ITDKIQE 389 + I E Sbjct: 290 LQKGICE 296 >gi|311253578|ref|XP_001926459.2| PREDICTED: matrilin-2 [Sus scrofa] Length = 707 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + +++S+ + + R+G + Y+ + + ++K Sbjct: 411 GEENFEIVKQFVTGIIDSLTISPKAA-----RVGLLQYSTQVRTEFTLRNFGSAKDMKKA 465 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + S R+ + I TDG + Sbjct: 466 VASMKYMGKGSMTGLALKHMFERSFTQIEGARPL--SARVPRVAIVFTDGRAQDDVSEW- 522 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ + F D + E DK Sbjct: 523 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPTDKHLFYAEDFSTMGEISDK 573 Query: 383 ITDKIQE 389 + I E Sbjct: 574 LQKGICE 580 >gi|288800165|ref|ZP_06405624.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] gi|288333413|gb|EFC71892.1| BatA protein [Prevotella sp. oral taxon 299 str. F0039] Length = 323 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 57/196 (29%), Gaps = 44/196 (22%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY------ 272 N + + +K E + P++ + + + L + Sbjct: 99 NRMEAAKKVAAEFISDRANDNIGLTIFAGEAFTQCPMTTDHASLLNLLQGVRTDIASRGL 158 Query: 273 --ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 + T + +A L S K +I +TDG N+ L Sbjct: 159 IADGTAVGMGLANAVSRL----------KESKAKSKVIILLTDGSNNMGDIS-----PLT 203 Query: 331 ICEYMRNAGMKIYSVAVSAP--------------------PEGQDLLRKCT-DSSGQFFA 369 + ++ G+++Y++ V L+ + G F+ Sbjct: 204 AAQIAKSLGIRVYTIGVGTNTVAPYPVTVGGTTQYVNVPAEIDTKTLKDIAQSTDGGFYR 263 Query: 370 VNDSRELLESFDKITD 385 ++ EL E ++ I Sbjct: 264 ATNNAELKEIYNDIDR 279 >gi|296156498|ref|ZP_06839336.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] gi|295893097|gb|EFG72877.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] Length = 446 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 44/431 (10%), Positives = 116/431 (26%), Gaps = 83/431 (19%) Query: 17 IDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGS 76 +D I+ ++ +A D AV++ ++ + T + + + G Sbjct: 34 VDSGLGYVIKARLDAATDGAVIAAGEAVTRG---SNQTEQTNNAQQAATAFFTANYPAG- 89 Query: 77 YIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST 136 + A + ++A + ++ + N+S S Sbjct: 90 -----FLGSSVAAGTPSIVFDAGTVTIGMTAQASV-----PVSFSKVLGFKVLNVSSTSQ 139 Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW------ 190 I + + + V+D + S+ + +N + + + Sbjct: 140 AIRKT------LDMAFVIDTTGSLNTSGVPAAVRSNAVAFLNNFDVTNDRVALMHFAYGT 193 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIE----------SAGNLVNSIQKAIQEKKNLSVRIGT 240 + S A D+ + N N + I + +L V Sbjct: 194 VVDVPFSGNARGFDRTTMTADINKYTFNGSTNSAEAIWNARNQLNTVISQPSSLRV---I 250 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMH------------------ 282 + ++ G + + + N + + ++T T ++ Sbjct: 251 VFFSDGAPNSFSSFFTTNQSGCNKSAGTIASPDSTGTMSGLYNMNALNQTLGSPCFQSNV 310 Query: 283 ----HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A + YN + + T + + + R+ Sbjct: 311 TKLVTAMPKWYNAHDVNEKTFPIWPVTAPRAVPNGNVTYVNVNRASRNLLEAMASAARDE 370 Query: 339 GMKIYSVAVS---------APPEGQDLLRKCTDSS-------------GQFFAVNDSREL 376 G ++++ GQD+L+ +++ G + +L Sbjct: 371 GTYVFTLGYGPELVQPAGPDNELGQDVLKCMANTADSLSRCYNPKQPVGVYCYAATPADL 430 Query: 377 LESFDKITDKI 387 F ++ +I Sbjct: 431 KPCFSQLASQI 441 >gi|159045656|ref|YP_001534450.1| von Willebrand factor type A domain-containing protein [Dinoroseobacter shibae DFL 12] gi|157913416|gb|ABV94849.1| von Willebrand factor type A domain protein [Dinoroseobacter shibae DFL 12] Length = 328 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 52/140 (37%), Gaps = 25/140 (17%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P + +L+ V LN++ NT A+ A R + S ++ +I Sbjct: 156 PFTEDLDSVVELLNQVQTGMAGPNTAIGDAIGLAIRSFED----------SEIEERLLIL 205 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-S 363 ++DG ++ +T+ + + G+ IY++ V P L Sbjct: 206 LSDGADTA-----STMTPINAAQIAAQEGITIYTIGVGNPDGSGEERLDPATLEDIATRG 260 Query: 364 SGQFFAVNDSRELLESFDKI 383 G F+ +D L E + +I Sbjct: 261 GGAFYFADDVEGLSEIYAEI 280 >gi|291295702|ref|YP_003507100.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470661|gb|ADD28080.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 318 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 21/151 (13%), Positives = 56/151 (37%), Gaps = 25/151 (16%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 PL+ + + ++ L T A+ + + L E + + R +I + Sbjct: 139 VVPLTTDRGRLLEAVDFLQLNLGTAIGDAILESIQALPPLSERAED--PDPRRLATIILL 196 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQD 355 TDG + G ++ + + ++++++ + ++ Sbjct: 197 TDGRSLGG------VDPVVAAQEAARQQIRVHTIGIGRTTSGPVPGLPEVYAQAALFDEE 250 Query: 356 LLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 L++ GQ+F V+ + +L E++ +T Sbjct: 251 TLKEVARVGDGQYFYVDSAEKLKEAYRDLTR 281 >gi|260823583|ref|XP_002606160.1| hypothetical protein BRAFLDRAFT_126487 [Branchiostoma floridae] gi|229291499|gb|EEN62170.1| hypothetical protein BRAFLDRAFT_126487 [Branchiostoma floridae] Length = 515 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 68/198 (34%), Gaps = 20/198 (10%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + + + ++V+ + RIG + Y+ + N + Sbjct: 185 SGSVGADNFETVKDFVVSVVDGFE-----IGQSRTRIGVVQYSDEVQNEFNLTEYGNKAD 239 Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 V+S ++ + T T A+ + ++E+ + + K I +TDGE + Sbjct: 240 VQSAISNITYLQGRTYTGAALRYMTDVSFSEEAGARPPY--QAIPKVGIVVTDGEATD-- 295 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLE 378 N AG+ ++++ + LR+ FAV++ Sbjct: 296 ------NVQGPASSAHEAGVNVFAIGIG--GYDVRELRQIATDPDATHVFAVDNFAATDY 347 Query: 379 SFDKITDKIQEQSVRIAP 396 D + + ++ +A Sbjct: 348 IKDALEGRTCKEPAEVAV 365 >gi|330878848|gb|EGH12997.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. morsprunorum str. M302280PT] Length = 352 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 26/146 (17%), Positives = 55/146 (37%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ A + L +S + ++ Sbjct: 150 PLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRLRLRPANS----------RVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + + G+KIY + + + P+ + Sbjct: 200 VTDGANNAGQ-----IDPITAARLAADEGVKIYPIGIGSDPDKDALQSALGLNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 255 PTLKEIASISGGQYFRARDGDQLEKI 280 >gi|260901770|ref|ZP_05910165.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ4037] gi|308108909|gb|EFO46449.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ4037] gi|328470487|gb|EGF41398.1| protein BatA [Vibrio parahaemolyticus 10329] Length = 356 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 60/150 (40%), Gaps = 15/150 (10%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + E+ ++ + ++T+ A+ A + +S Sbjct: 159 GDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSDKSRGALEQDQ 218 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------EGQD 355 +K I +TDG ++G + + + + + G++++ +A+ P + Sbjct: 219 NREKVAIVLTDGNDTG-----SFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDT 273 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + R +S G+ F + EL ++D+I Sbjct: 274 IHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|153836342|ref|ZP_01989009.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] gi|149750244|gb|EDM60989.1| von Willebrand factor, type A [Vibrio parahaemolyticus AQ3810] Length = 356 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 60/150 (40%), Gaps = 15/150 (10%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + E+ ++ + ++T+ A+ A + +S Sbjct: 159 GDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSDKSRGALEQDQ 218 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------EGQD 355 +K I +TDG ++G + + + + + G++++ +A+ P + Sbjct: 219 NREKVAIVLTDGNDTG-----SFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDT 273 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + R +S G+ F + EL ++D+I Sbjct: 274 IHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|28900543|ref|NP_800198.1| hypothetical protein VPA0688 [Vibrio parahaemolyticus RIMD 2210633] gi|260365425|ref|ZP_05777962.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus K5030] gi|260877490|ref|ZP_05889845.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AN-5034] gi|260894838|ref|ZP_05903334.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus Peru-466] gi|28808923|dbj|BAC62031.1| conserved hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] gi|308085296|gb|EFO34991.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus Peru-466] gi|308090935|gb|EFO40630.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AN-5034] gi|308114289|gb|EFO51829.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus K5030] Length = 356 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 60/150 (40%), Gaps = 15/150 (10%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + E+ ++ + ++T+ A+ A + +S Sbjct: 159 GDAAFVQTPFTADQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSDKSRGALEQDQ 218 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------EGQD 355 +K I +TDG ++G + + + + + G++++ +A+ P + Sbjct: 219 NREKVAIVLTDGNDTG-----SFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDT 273 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + R +S G+ F + EL ++D+I Sbjct: 274 IHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|224372482|ref|YP_002606854.1| von Willebrand factor, type A [Nautilia profundicola AmH] gi|223588580|gb|ACM92316.1| von Willebrand factor, type A [Nautilia profundicola AmH] Length = 288 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 62/145 (42%), Gaps = 20/145 (13%) Query: 252 CTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 +PL+ + + L ++ T Y A+ + N +K + Sbjct: 125 ASPLTFDKKTFEDILKRIYVSIAGGKTAIYDALFLSSNLFKNANG-----------EKII 173 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 I +TDG ++ + + + + ++ +K+YS+A+ + L + +++G+F+ Sbjct: 174 ILLTDGMDNMSITPLDV-----VIKKLKKEHIKVYSIAIGGDADLSVLKKISKETNGKFY 228 Query: 369 AVNDSRELLESFDKITDKIQEQSVR 393 + +L + + I +K+ + +++ Sbjct: 229 IASSLEDLKKIYSDI-NKLTKSNIK 252 >gi|320160918|ref|YP_004174142.1| hypothetical protein ANT_15140 [Anaerolinea thermophila UNI-1] gi|319994771|dbj|BAJ63542.1| hypothetical protein ANT_15140 [Anaerolinea thermophila UNI-1] Length = 486 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 66/185 (35%), Gaps = 22/185 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++D++ SA NL+ +K I +A++ +L + R++ Sbjct: 133 RMDMVKSSALNLLKQFRKQDL--------ISVVAFSDRAEVVIPPTRVPDLAKDDHRISM 184 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L T Y + +L + + +I +TDG Y + Sbjct: 185 LQVGGGTEIYQGLQLGIEQLRSIDPR---------FMRQLILLTDG-----HTYGDDEAC 230 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 +++ E G++I ++ + + L + T S V ++L + F++ + Sbjct: 231 IELAEEAAQDGIQINTMGIGHEWNDELLDKIATISGANSIFVTSPKDLNKFFEQKLQLLN 290 Query: 389 EQSVR 393 + R Sbjct: 291 DTYAR 295 >gi|293356227|ref|XP_574569.3| PREDICTED: rCG40584-like [Rattus norvegicus] Length = 1182 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 56/159 (35%), Gaps = 22/159 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 + + +PL + +N+L +T T A+ H EL+ ++ Sbjct: 192 QFSHTFRTHFTFNDFISTSSPL-----RLLDFVNQLR--GSTRTASAIKHVITELFTTQK 244 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---AP 350 + K +I ITDG G + + AG+ Y++ V Sbjct: 245 GAR-----KDATKILIVITDGRKEGDRLDYGDVIPMAEA-----AGIIRYAIGVGQAFYQ 294 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 + + L+ S F+V + L + +++ +KI Sbjct: 295 AQSRQELKDIASSPSREYVFSVENFDALKDIQNQLKEKI 333 >gi|293344414|ref|XP_001080404.2| PREDICTED: rCG40584-like [Rattus norvegicus] gi|149067645|gb|EDM17197.1| rCG40584, isoform CRA_b [Rattus norvegicus] Length = 1182 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 56/159 (35%), Gaps = 22/159 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 + + +PL + +N+L +T T A+ H EL+ ++ Sbjct: 192 QFSHTFRTHFTFNDFISTSSPL-----RLLDFVNQLR--GSTRTASAIKHVITELFTTQK 244 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---AP 350 + K +I ITDG G + + AG+ Y++ V Sbjct: 245 GAR-----KDATKILIVITDGRKEGDRLDYGDVIPMAEA-----AGIIRYAIGVGQAFYQ 294 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 + + L+ S F+V + L + +++ +KI Sbjct: 295 AQSRQELKDIASSPSREYVFSVENFDALKDIQNQLKEKI 333 >gi|149067644|gb|EDM17196.1| rCG40584, isoform CRA_a [Rattus norvegicus] Length = 1084 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 56/159 (35%), Gaps = 22/159 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 + + +PL + +N+L +T T A+ H EL+ ++ Sbjct: 192 QFSHTFRTHFTFNDFISTSSPL-----RLLDFVNQLR--GSTRTASAIKHVITELFTTQK 244 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---AP 350 + K +I ITDG G + + AG+ Y++ V Sbjct: 245 GAR-----KDATKILIVITDGRKEGDRLDYGDVIPMAEA-----AGIIRYAIGVGQAFYQ 294 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 + + L+ S F+V + L + +++ +KI Sbjct: 295 AQSRQELKDIASSPSREYVFSVENFDALKDIQNQLKEKI 333 >gi|125535226|gb|EAY81774.1| hypothetical protein OsI_36948 [Oryza sativa Indica Group] Length = 633 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 34/244 (13%), Positives = 73/244 (29%), Gaps = 32/244 (13%) Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 ++L V + S + Sbjct: 40 TIPRGQTNKDFQVLLHVEAPPAANLKGH-------VPIDVVAVLDVSGSMNDPVAAAAAA 92 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL---- 255 +P ++DVL S ++ + R+ +A+N G V + L Sbjct: 93 SPESNLQASRLDVLKASMKFIIRKLDDGD--------RLSIVAFNDGPVKEYSSGLLDVS 144 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + ++++L T PA+ A + L GS F++ +TDG+ Sbjct: 145 GDGRSIAGKKIDRLQARGGTALMPALEEAVKILDE------RQGGSRNHVGFILLLTDGD 198 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 ++ + + + +++ + A + + LL S G + V+D Sbjct: 199 DTTGFRWTRDAIHGAVAK------YPVHTFGLGASHDPEALLHIAQGSRGTYSFVDD-DN 251 Query: 376 LLES 379 L Sbjct: 252 LANI 255 >gi|332970884|gb|EGK09861.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 440 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 68/216 (31%), Gaps = 22/216 (10%) Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 P + S S A RK+DV ++ V++ + G+ + Sbjct: 123 PKFDQYNVSILLDASGSMAAQVSGGRKMDVAKDAVETFVSTFPEEANVSLIAYGHKGSNS 182 Query: 243 YNI----GIVGNQCTPL-SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 + PL + + + + L ++ T A+ + + L + Sbjct: 183 QKDKVLSCSEIEEIYPLSTYDQSTFSNALKTVDATGWTPLADAIKKSGKTLKANAD---- 238 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQD 355 K V ++DG + E ++ G+ + + Q Sbjct: 239 ----KNSKNVVYIVSDGLETCGGHPAKE------AEALQKDGISATVNIIGFDVNNAEQQ 288 Query: 356 LLRKCTDSSGQFFA-VNDSRELLESFDKITDKIQEQ 390 L++ ++ G F N +L F+ D+++ + Sbjct: 289 ALKEVAEAGGGTFTSANSKSDLESYFESEYDELKRE 324 >gi|237800421|ref|ZP_04588882.1| von Willebrand factor, type A [Pseudomonas syringae pv. oryzae str. 1_6] gi|331023280|gb|EGI03337.1| von Willebrand factor, type A [Pseudomonas syringae pv. oryzae str. 1_6] Length = 352 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 54/146 (36%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ A + L + ++ Sbjct: 150 PLTYDRQTVRVWLDEAKIGIAGKNTAVGDAIGLALKRL----------RMRPANSRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ L + G+KIY++ + + PE + Sbjct: 200 VTDGANNAGQ-----IDPLTAARLAADEGVKIYTIGIGSDPEKNALQSALGLSASLDLDE 254 Query: 355 DLLRKCT-DSSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 255 PTLKEIARLSGGQYFRTRDGDQLEKI 280 >gi|166033217|ref|ZP_02236046.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] gi|166027574|gb|EDR46331.1| hypothetical protein DORFOR_02942 [Dorea formicigenerans ATCC 27755] Length = 1465 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 52/372 (13%), Positives = 104/372 (27%), Gaps = 58/372 (15%) Query: 62 TIFKKQIKKHLKQGSYIR---ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + + I + + + ++ + Sbjct: 33 NAAENAARTTTTDNVTINNWHDANALDDSTKNVGRIWTDKSVSAGDVTLTSREKESGTAT 92 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 KG L LS S+ + + I +VLDVS SM+D N Sbjct: 93 IKKGADSDFLVGLSALSSTAKITGQTTVPLDIVLVLDVSGSMDDPMGSGDNTKRIDAL-- 150 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 + + + K A +I V+ + K + + + Sbjct: 151 -----KAAVNSFIDGSAKVNDQRADVNKQNRIAVVKFAGN-------KTDKIGNDQYSQN 198 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 +V S N +E ++ +N L P T AM + K ++N Sbjct: 199 RYWYNYTQVVSGYKAYTSGNKSEWETTVNALKPAGCTAADYAMDLTKTLVDQSKTDANNN 258 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQI--CEYMRNAGMKIYSVAVSAPPEG--- 353 +K+ VIF TDGE + S + + + I + ++ IY++ + + + Sbjct: 259 ADRKNVKRVVIFFTDGEPNHQSGFDDDVANDAITSAKTIK-TDADIYTIGIFSGADVSIT 317 Query: 354 --------------QDLLRKCTDS--------------------SGQFFA-VNDSRELLE 378 + + + ++ + EL Sbjct: 318 GHSGSGSWSAKEKFNAFMHGLSSNYPDAERYKKLGTRAKDSKGQDATYYKVATKADELKN 377 Query: 379 SFDKITDKIQEQ 390 F +I D+I Sbjct: 378 IFTQIEDEIISS 389 >gi|222528069|ref|YP_002571951.1| YD repeat-containing protein [Caldicellulosiruptor bescii DSM 6725] gi|222454916|gb|ACM59178.1| YD repeat protein [Caldicellulosiruptor bescii DSM 6725] Length = 3027 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 39/376 (10%), Positives = 112/376 (29%), Gaps = 63/376 (16%) Query: 34 DAA-VLSGCASIVSDR-TIKDPTTKKDQTSTIFKKQIKKHL---------KQGSYIRENA 82 D A V+ G + + T + + + ++ +I Sbjct: 619 DGAEVVMGTDPLTKNPLTSAEKYAVSEDGKVFVRALSDANILIAPLQVKRSDNVFINSLK 678 Query: 83 GDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERS 142 G + + +I + + E L + + + Sbjct: 679 GIVGKAIEITAGGFDIKKAEIVVNYDEAELNGVEENNLMLYYVNYDKKI------LEPLE 732 Query: 143 SENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA 202 + V + L N +++ + S S + Sbjct: 733 DVVVDTVYNRVSGKTEHFSTFLLGDKNMPVDLSKVDIVFVLDNSGSMSSNDPNYY----- 787 Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 +I+ + N+ L+ R+G + ++ + + L+++ +++ Sbjct: 788 ------RIEATKKFIQNI-----------DELNNRVGLVDFDSSVSVR--SNLTSDKSKL 828 Query: 263 KSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 LN + +TN + A + KK ++ ++DG ++ Sbjct: 829 LQALNAMRWTGGSTNIGGGLKAALGLF-----------DQEQSKKIIVLLSDGYHNTGIH 877 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 + + + + + ++A+ ++LL D + G +F V+++ L + Sbjct: 878 PND------VLPELIKQEIVVNTIALGKD-CDRELLHDIADKTKGGYFYVDNTGGLSQ-- 928 Query: 381 DKITDKIQEQSVRIAP 396 + + +I+ ++ Sbjct: 929 EDVDKQIELIYEKLTK 944 >gi|157374763|ref|YP_001473363.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] gi|157317137|gb|ABV36235.1| von Willebrand factor, type A [Shewanella sediminis HAW-EB3] Length = 330 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 47/153 (30%), Gaps = 36/153 (23%) Query: 254 PLSNNLNEVKSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V L ++ T A+ A + + ++ Sbjct: 143 PLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALAVKRFDRV----------DESNRILVL 192 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEG 353 +TDG N+ S Q G+KIYS+ V A Sbjct: 193 LTDGSNNSGSIS-----PEQAAAIAAKRGVKIYSIGVGAEVMERRTLFGKERVNPSMDLD 247 Query: 354 QDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + L + G +F +++EL + +I Sbjct: 248 ETQLTALAQTTGGLYFRARNAQELESIYQEIDK 280 >gi|332140758|ref|YP_004426496.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] gi|327550780|gb|AEA97498.1| von Willebrand factor, type A [Alteromonas macleodii str. 'Deep ecotype'] Length = 349 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 60/187 (32%), Gaps = 38/187 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS----RLNKLNPYEN 274 N + + + + V PL+ + + V + + L E Sbjct: 113 NRLTMTKSVVYDFIQRRVGDRLGLILFADTAYVQAPLTYDRDTVSTLLSEAVIGLV-GEQ 171 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T A+ A + +I +TDG+N+ + + Q E Sbjct: 172 TAIGDAIGLAVKRFDER----------DESNNVLILLTDGQNTAGN-----ITPEQAKEL 216 Query: 335 MRNAGMKIYSVAVSAP-----------------PEGQDLLRKCT-DSSGQFFAVNDSREL 376 N G+K+Y++ V A + +L + GQ+F +++EL Sbjct: 217 AINKGVKVYTIGVGADKMLIQSFFGSREINPSQELDEGMLTDIATSTGGQYFRARNAQEL 276 Query: 377 LESFDKI 383 + ++ Sbjct: 277 EAIYQQL 283 >gi|327193756|gb|EGE60633.1| hypothetical protein RHECNPAF_136001 [Rhizobium etli CNPAF512] Length = 433 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 57/198 (28%), Gaps = 20/198 (10%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 K+ +L + LVN ++ + + Y Sbjct: 73 VSGSMDEPDKLPLLKSAFRLLVNRLKADDT--------VSIVTYAGNAGTVLEPTRVAEK 124 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +++ S ++KL +T + AY V+ TDG+ + Sbjct: 125 SKILSAIDKLEAGGSTGGAEGIEAAYDLAKKAFVKDGVNR--------VMLATDGDFNVG 176 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + +I E R G+ + + L++ + + L E+ Sbjct: 177 --PSSDEDLKRIIEEKRKDGIFLTVLGFGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEA 232 Query: 380 FDKITDKIQEQSVRIAPN 397 + ++ IA + Sbjct: 233 QKTLVEEAGSTLFPIAKD 250 >gi|56797855|emb|CAG27023.1| matrilin-3a [Danio rerio] Length = 460 Score = 71.9 bits (174), Expect = 2e-10, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 56/170 (32%), Gaps = 20/170 (11%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNT 277 L + + + + R+ + Y + + +K + ++ P T T Sbjct: 86 FLADMVDT--LDVGPDATRVAVVNYASTVKIESLLKSHLTKDTIKQAITRIEPLAAGTMT 143 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ A E + EK + S + K I +TDG ++ R Sbjct: 144 GMAIKKAMDEAFTEKSGARPK--SKNISKVAIIVTDGRPQD--------QVEEVSAAARA 193 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 +G++IY+V V L+ + F V +L F + Sbjct: 194 SGIEIYAVGV--DRADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRE 241 >gi|110626529|gb|ABG79013.1| TadG [Yersinia ruckeri] Length = 478 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 48/428 (11%), Positives = 118/428 (27%), Gaps = 72/428 (16%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 I + I D ++ + ++ AL+ L+ A + + + + Sbjct: 52 ISLPFFIAIIMLLFDFTQLINNKIKLSDALEQGALALTAENNAKND--------TRNNEL 103 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 I +L + + Q QY ++ E PT F L Sbjct: 104 ISAYINFYLGHRHQLTQYNNITVNYQQNPDRLYHTQLSQYHIDANI--EQPTLFPFTSLL 161 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY------------------- 164 I + + I + A+ + V D S SME + Sbjct: 162 IDHDNFIIGGSAAAIKDV----PAMDVVFVTDFSGSMEGDFHNPDDPEVLSKLDELKRIF 217 Query: 165 --LQKHNDNNNMTSNKYLLPPPPKKSFW-----------------SKNTTKSKYAPAPAP 205 + N + P + ++ Sbjct: 218 FKIADDIYTANKD-STISFSPFSWGTKSADNKKCSLHFMPKEKNKIYPIPSNEIERNTEA 276 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS--NNLNEVK 263 K + I + + +I+ + + + + + + + + P+S +++++ Sbjct: 277 HQEKYMIAITENIDYLATIENIGTNNEKIVIPLDHVHDELCLYSSNAYPISLAKDIDDL- 335 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + + +N +T + L + +HN + T + Sbjct: 336 NVIKTMNEGGSTLVSSGIMQGADLLMDG--VNHNKLMIILSDGHDYPTTAVVHDKTITSA 393 Query: 324 NTLNTL----------QICEYMRNAGMKIYSVAVSAPPEGQDLL---RKCTDSSGQFFAV 370 + +C+ +R +I + + P + C D+ F+ Sbjct: 394 KEVRVNVDISRQLVQHGMCKKIRETVGRIVFIGIGYNPSANHYINWAEDCVDTE-NFYLA 452 Query: 371 NDSRELLE 378 +++EL + Sbjct: 453 MNTKELED 460 >gi|308070278|ref|YP_003871883.1| hypothetical protein PPE_03528 [Paenibacillus polymyxa E681] gi|305859557|gb|ADM71345.1| Conserved hypothetical protein [Paenibacillus polymyxa E681] Length = 695 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 67/174 (38%), Gaps = 20/174 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPL---SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 R+G +AYN +V ++ + ++++ + LN T+ + L Sbjct: 81 DRTRVGFVAYNHNVVASKPLTSIAVAAQKSQIQQDIRTLNRSGYTDLGLGLRRGSELLAA 140 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGA------SAYQNTLNTLQICEYMRNAGMKIYS 344 S + F+I ++DGE S + + + + + G +Y+ Sbjct: 141 GA--------SQGRQPFMILLSDGETDFGASSGSRSKGDSNNDVSSVIKSAQTKGYPVYT 192 Query: 345 VAVSAPPE-GQDLLRKCTD-SSGQFFAVNDSRELLESFDKI-TDKIQEQSVRIA 395 + ++ + L + + G F + + +L E ++I +I+ + V IA Sbjct: 193 IGLNHDGTVNRQELERIASQTGGASFITSSAEDLPEILNRIFASQIRSKLVPIA 246 >gi|301780322|ref|XP_002925578.1| PREDICTED: collagen alpha-1(XIV) chain-like [Ailuropoda melanoleuca] Length = 1796 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 66/201 (32%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + N Sbjct: 1037 VDGSWSIGDENFNKIINFLYSTVGALNKIGADGT----QVAMVQFTDDPRTEFKLNAYNT 1092 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + +++ NT T A+ H L+ + + + K ++ ITDG + Sbjct: 1093 KETLLDAIKRISYKGGNTKTGKAIKHVRDSLFTAESGTRR-----GIPKVIVVITDGRSQ 1147 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M++ G I++V V L F V+D Sbjct: 1148 DDVN--------KISGEMQSNGYNIFAVGV--ADADYSELVSIGSKPSSRHVFFVDDF-- 1195 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1196 --DAFKKIEDELITFVCETAS 1214 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 58/168 (34%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + RIG Y+ N +EV + L NT T A+++ + Sbjct: 186 TAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIEAVRNLPYKGGNTLTGLALNYIF 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + K I ITDG++ + +R +G++++++ Sbjct: 246 ENSFKPEAGARA-----GVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V ++ LR+ + V + + + +T + + Sbjct: 293 GV--KNADENELREIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 338 >gi|281350861|gb|EFB26445.1| hypothetical protein PANDA_015099 [Ailuropoda melanoleuca] Length = 1741 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 66/201 (32%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + N Sbjct: 1009 VDGSWSIGDENFNKIINFLYSTVGALNKIGADGT----QVAMVQFTDDPRTEFKLNAYNT 1064 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + +++ NT T A+ H L+ + + + K ++ ITDG + Sbjct: 1065 KETLLDAIKRISYKGGNTKTGKAIKHVRDSLFTAESGTRR-----GIPKVIVVITDGRSQ 1119 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M++ G I++V V L F V+D Sbjct: 1120 DDVN--------KISGEMQSNGYNIFAVGV--ADADYSELVSIGSKPSSRHVFFVDDF-- 1167 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1168 --DAFKKIEDELITFVCETAS 1186 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 58/168 (34%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + RIG Y+ N +EV + L NT T A+++ + Sbjct: 158 TAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIEAVRNLPYKGGNTLTGLALNYIF 217 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + K I ITDG++ + +R +G++++++ Sbjct: 218 ENSFKPEAGARA-----GVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 264 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V ++ LR+ + V + + + +T + + Sbjct: 265 GV--KNADENELREIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 310 >gi|194671641|ref|XP_591137.3| PREDICTED: matrilin 3 [Bos taurus] gi|297480578|ref|XP_002691564.1| PREDICTED: matrilin 3 [Bos taurus] gi|296482389|gb|DAA24504.1| matrilin 3 [Bos taurus] Length = 574 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 61/180 (33%), Gaps = 23/180 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 + + +++++ + + R+ + Y + ++ +K + + Sbjct: 186 QFTKVKTFVSKIIDTL-----DIGPMDTRVAVVNYASTVKIEFHLQTHSDKQSLKRAVAR 240 Query: 269 LNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + P T + A+ A E + + + S+ + K I +TDG Sbjct: 241 ITPLSTGTMSGLAIQTAMDEAFTVEAGARG--PSSNIPKVAIIVTDGRPQD--------Q 290 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 ++ R +G+++Y+V V + L+ F V +L F + Sbjct: 291 VNEVAARARASGIELYAVGV--DRADMESLKMMASEPLDEHVFYVETYGVIEKLSSRFQE 348 >gi|125975554|ref|YP_001039464.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|125715779|gb|ABN54271.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] Length = 536 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 63/199 (31%), Gaps = 22/199 (11%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 K+ +L + LV+ + + R+ + Y + N Sbjct: 189 VSGSMDEPNKLPLLKSAFKLLVDELDEDD--------RVSIVVYAGAAGLVLDSTPGNEK 240 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +++ L L +T + AY S N VI TDG+ + Sbjct: 241 DKILDALMNLEAGGSTAGAEGIKLAYDVAKKNFIKSGNNR--------VILATDGDFNVG 292 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLE 378 + +++ E R+ G+ + + + D G + +++ + E Sbjct: 293 --ISSEAELVRLIEKKRDEGIFLTVLGFGTGNYKDSKMESLADKGNGNYAYIDN---IAE 347 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + +++ +A + Sbjct: 348 ARKVLVNEMGATLNTVAKD 366 >gi|301133566|gb|ADK63405.1| C3HC4 type zinc finger protein [Brassica rapa] Length = 677 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 18/147 (12%), Positives = 46/147 (31%), Gaps = 17/147 (11%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 E +N TN + + L + ++ ++DG++ Sbjct: 303 TGKQEALQAVNSFVSNGGTNIAEGLTKGAKVLID--------RRFKNSVSSIVLLSDGQD 354 Query: 317 SGASAYQNTLNTLQI-CEYM--RN-AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 + NT + + + + +++ A + + +S G F + Sbjct: 355 TYTMTSPTGSNTKGADYKTLLPKEVNRIPVHAFGFGADHDASLMHSIAENSGGTFSFIES 414 Query: 373 SRELLESFDK-ITDKIQ----EQSVRI 394 + ++F + I + E V+I Sbjct: 415 ETVIQDAFAQCIGGLLSVMVQELRVKI 441 >gi|330963348|gb|EGH63608.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. actinidiae str. M302091] Length = 352 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 26/146 (17%), Positives = 54/146 (36%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ A + L +S + ++ Sbjct: 150 PLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRLRLRPANS----------RVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY + + + P+ + Sbjct: 200 VTDGANNAGQ-----IDPITAARLAAEEGVKIYPIGIGSDPDKDALQSVLGLNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 255 PTLKEIASISGGQYFRARDGDQLEKI 280 >gi|225465131|ref|XP_002271188.1| PREDICTED: hypothetical protein [Vitis vinifera] Length = 768 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 56/192 (29%), Gaps = 24/192 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ +L + L+ ++ + + + + + + N +N Sbjct: 300 SKLSLLKRAVCFLIQNLGPSDRLSIVSFSSTARRIFPLRRMSD------NGREAAGLAIN 353 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L TN + R L E + + D N +++ + N Sbjct: 354 SLTSSGGTNIVEGLKKGVRVLEERSEQNPVASIILLSDGKDTYNCDNVNRRQTSHCASSN 413 Query: 328 TLQ-----------ICEYMRNAG-------MKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 Q IC R +G + +++ + + + +S G F Sbjct: 414 PRQVLEYLNLLPASICPRNRESGDEGRQAIIPVHTFGFGSDHDSTAMHAISDESGGTFSF 473 Query: 370 VNDSRELLESFD 381 + + ++F Sbjct: 474 IESVATVQDAFA 485 >gi|213968792|ref|ZP_03396933.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato T1] gi|213926395|gb|EEB59949.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato T1] Length = 328 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 26/146 (17%), Positives = 54/146 (36%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ A + L +S + ++ Sbjct: 126 PLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRLRLRPANS----------RVLVL 175 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY + + + P+ + Sbjct: 176 VTDGANNAGQ-----IDPITAARLAAEEGVKIYPIGIGSDPDKDALQSALGLSPSLDLDE 230 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 231 PTLKEIASISGGQYFRARDGDQLEKI 256 >gi|153873859|ref|ZP_02002297.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152069676|gb|EDN67702.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 367 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 50/138 (36%), Gaps = 14/138 (10%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++ L +L +T+ +H AY + ++ TDG+ + + Sbjct: 65 KINGALERLTAGGSTHGSAGIHLAYNLAEQAFIKNGINR--------ILLATDGDFNVGT 116 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLES 379 + + E R +G+ + ++ L+ + D+ G + ++ L E+ Sbjct: 117 V--DFEALKNLVEEKRKSGISLTTLGFGRGNYNDQLMEQLADAGNGNYAYIDT---LNEA 171 Query: 380 FDKITDKIQEQSVRIAPN 397 + D++ IA + Sbjct: 172 QKVLVDEMSSTLNTIAKD 189 >gi|28870917|ref|NP_793536.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tomato str. DC3000] gi|301385766|ref|ZP_07234184.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato Max13] gi|302061830|ref|ZP_07253371.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato K40] gi|302134226|ref|ZP_07260216.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato NCPPB 1108] gi|28854166|gb|AAO57231.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. tomato str. DC3000] gi|331018299|gb|EGH98355.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. lachrymans str. M302278PT] Length = 352 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 26/146 (17%), Positives = 54/146 (36%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ A + L +S + ++ Sbjct: 150 PLTYDRRTVRVWLDEAKIGIAGKNTAVGDAIGLALKRLRLRPANS----------RVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY + + + P+ + Sbjct: 200 VTDGANNAGQ-----IDPITAARLAAEEGVKIYPIGIGSDPDKDALQSALGLSPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 255 PTLKEIASISGGQYFRARDGDQLEKI 280 >gi|47216147|emb|CAG10021.1| unnamed protein product [Tetraodon nigroviridis] Length = 1453 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 30/214 (14%), Positives = 71/214 (33%), Gaps = 45/214 (21%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +++ + + + +G + Y+ + NN ++K + K Sbjct: 788 NFELVKKWINQIIDKL-----DVSDNKAHVGLVQYSSAVKQEFPLGRYNNKKDLKEAVKK 842 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ + + + + + K I TDG + Sbjct: 843 MAYMERGTMTGQALRYLTDNSFGPGQGAR-----PGVTKVGIVFTDGRSQDYIGD----- 892 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDS--------RELL 377 + ++ G K+Y+V V +D L++ ++ +F D EL Sbjct: 893 ---AAKKAKDQGFKMYAVGVG--NAVEDELKEIASEPTAEHYFYTADFKTDGQRREDELK 947 Query: 378 ES--------------FDKITDKIQEQSVRIAPN 397 E F +T ++ + I + Sbjct: 948 EIASEPTAEHYFYTADFKTMTQIAKKLQINICQD 981 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 54/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + K+ P T T A+ A Sbjct: 575 DVGPNATRVGVVNYASRVKNEVSLKTHRTKAGLIKAVTKIEPLSTGTMTGLAIQFAMNVA 634 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + S + K I +TDG N + + R+AG++I+++ V Sbjct: 635 FSEAEGARLR--SPDISKVAIVVTDGRPQD--------NVKDVAQRARDAGIEIFAIGVG 684 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 LR+ V +L + F + Sbjct: 685 R--VEMSTLRQMASDPLDDHVDYVESYSVIEKLTKKFQE 721 >gi|127513358|ref|YP_001094555.1| von Willebrand factor, type A [Shewanella loihica PV-4] gi|126638653|gb|ABO24296.1| von Willebrand factor, type A [Shewanella loihica PV-4] Length = 339 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 53/153 (34%), Gaps = 36/153 (23%) Query: 254 PLSNNLNEVKSRL--NKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V L ++ T A+ A + K+S+ + +I Sbjct: 150 PLTQDRRSVAQFLTEAQIGLVGKQTAIGEAIALAVKRFDKAKQSN----------RVLIL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEG 353 +TDG N+ S + Q + G+ IY++ V A Sbjct: 200 LTDGSNNSGS-----ITPEQAADIAAKRGVTIYTIGVGAEVMERRTLFGKERVNPSMDLD 254 Query: 354 QDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L + G++F +S EL + + +I Sbjct: 255 EAQLTLLAQKTKGRYFRARNSDELEQIYQEIDK 287 >gi|255522879|ref|NP_001157342.1| collagen alpha-1(XIV) chain [Equus caballus] Length = 1796 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 67/201 (33%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + N Sbjct: 1037 VDGSWSIGDENFNKIINFLYSTVGALNKIGADGT----QVAIVQFTDDPRTEFKLDAYKN 1092 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ H L+ E+ + + K ++ ITDG + Sbjct: 1093 KETLLDAIKHISYKGGNTKTGKAIKHVRDTLFTEESGTRR-----GIPKVIVVITDGRSQ 1147 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I + M++ G I++V V L F V+D Sbjct: 1148 DDVN--------KISKEMQSDGYSIFAVGV--ADADYSELVSIGSKPSARHVFFVDDF-- 1195 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1196 --DAFKKIEDELITFVCETAS 1214 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 60/168 (35%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + RIG Y+ N +EV + L NT T A+++ + Sbjct: 186 TAFDVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRNLPYKGGNTLTGLALNYIF 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + + K I ITDG++ + +R +G++++++ Sbjct: 246 ENSFKPEAGAR-----SGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V ++ L++ + V + + + +T + + Sbjct: 293 GV--KNADENELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 338 >gi|242066912|ref|XP_002454745.1| hypothetical protein SORBIDRAFT_04g036560 [Sorghum bicolor] gi|241934576|gb|EES07721.1| hypothetical protein SORBIDRAFT_04g036560 [Sorghum bicolor] Length = 737 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 57/183 (31%), Gaps = 24/183 (13%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSR 265 K+ +L + G ++ ++ + R+ IA++ + + Sbjct: 308 TKLALLKRAMGFVIQNLGSSD--------RLSVIAFSSSARRLFPLRRMTESGRQQSLLA 359 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N L TN + + + + +I ++DG+++ + Sbjct: 360 VNSLTSNGGTNIAEGLRKGSKVIEE--------RQAKNPVCSIILLSDGQDTYTVSPTAG 411 Query: 326 --LNTLQICEYM----RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + C + N + ++ A + L S G F + + ++ Sbjct: 412 VHKGAPEYCALLPSTNGNQQIPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAIQDA 471 Query: 380 FDK 382 F + Sbjct: 472 FAQ 474 >gi|281416565|ref|ZP_06247585.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|281407967|gb|EFB38225.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|316939671|gb|ADU73705.1| Protein of unknown function DUF3520 [Clostridium thermocellum DSM 1313] Length = 538 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 63/199 (31%), Gaps = 22/199 (11%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 K+ +L + LV+ + + R+ + Y + N Sbjct: 191 VSGSMDEPNKLPLLKSAFKLLVDELDEDD--------RVSIVVYAGAAGLVLDSTPGNEK 242 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +++ L L +T + AY S N VI TDG+ + Sbjct: 243 DKILDALMNLEAGGSTAGAEGIKLAYDVAKKNFIKSGNNR--------VILATDGDFNVG 294 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLE 378 + +++ E R+ G+ + + + D G + +++ + E Sbjct: 295 --ISSEAELVRLIEKKRDEGIFLTVLGFGTGNYKDSKMESLADKGNGNYAYIDN---IAE 349 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + +++ +A + Sbjct: 350 ARKVLVNEMGATLNTVAKD 368 >gi|312877126|ref|ZP_07737097.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] gi|311796100|gb|EFR12458.1| von Willebrand factor type A [Caldicellulosiruptor lactoaceticus 6A] Length = 900 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 67/189 (35%), Gaps = 27/189 (14%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 K+++ ++ +V ++ + AY G + +V Sbjct: 425 GIPKLEIAKSASAKMVEHLESSDGVGVIAFDHNYYWAYKFGKLVR--------KEDVIES 476 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++ + T P + A + L S K V+ +TDG + Sbjct: 477 ISSIEVGGGTAIIPPLSEAVKTL----------KKSKAKNKLVVLLTDGMGEQSGY---- 522 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 + + +KI ++ V +L D +SG+F+ V++ EL++ F K T Sbjct: 523 ---EIPADEAKRNNIKITTIGVG-KFVNASVLSWIADYTSGRFYLVSNPSELVDVFLKET 578 Query: 385 DKIQEQSVR 393 I+ + ++ Sbjct: 579 KIIKGKYIK 587 >gi|88857796|ref|ZP_01132439.1| von Willebrand factor type A domain protein [Pseudoalteromonas tunicata D2] gi|88820993|gb|EAR30805.1| von Willebrand factor type A domain protein [Pseudoalteromonas tunicata D2] Length = 608 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 37/326 (11%), Positives = 86/326 (26%), Gaps = 32/326 (9%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 REN Q + + + + Y + + P+ Sbjct: 131 NEQARENYLKNEQNPVKQVMLEPVSTFSIDVD-TGSYSNSRRMIKMGKRPPADAVREEAF 189 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQ-KHNDNNNMTSNKYLLPPPPKKSFWSKN 193 S ++ + + Q + + + + Sbjct: 190 INYFDYHYSAPKSLETPFNVHTEVAPAPWNNQRQLLKIGIKGFDIEKAELKAANLVFLLD 249 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 + S AP K+ +L S L + + + + Y Sbjct: 250 VSGSMNAP------DKLPLLKSSLTMLTKQLDENDS--------VAIVVYAGAAGLVLPA 295 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 N + + LN L+ +TN + AY+ + VI TD Sbjct: 296 TKGNEYQVISNALNNLSAGGSTNGAQGIELAYQIASQNFKKEGINR--------VILATD 347 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVN 371 G+ + + + L ++ R G+ + ++ L+ + + + Sbjct: 348 GDFNVGMSSVDAL--KKLIANKRKTGIALTTLGFGQGNYNDGLMEQLANIGNGQHAY--- 402 Query: 372 DSRELLESFDKITDKIQEQSVRIAPN 397 + E+ + D++ IA + Sbjct: 403 -IDTINEARKVLVDELSSTMQIIAKD 427 >gi|24375056|ref|NP_719099.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24349804|gb|AAN56543.1|AE015791_7 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 621 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 38/362 (10%), Positives = 95/362 (26%), Gaps = 46/362 (12%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A + +S+ + D F+KQ++ + I + I Sbjct: 101 AQYAASSSVAAPGLNDDWQGAVLPERNQFEKQVQNGIMVAGEIPVSTFSIDVD------- 153 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 Y L L + + + Sbjct: 154 ------------TGSYTTLRRMLKEGRLPQKDTLRVEEMLNYFSYNYPQPNKNEAPFSV- 200 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + E ++D + + S ++ K+ +L Sbjct: 201 ---TTELAPSPYNDDMMLLRIGLKGYEQSKAELGASNLVFL-LDVSGSMASDDKLPLLQT 256 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + L + + ++ + Y N++ + L +L +T Sbjct: 257 ALKMLTQQLDEQD--------KVSIVVYAGAAGVVLDGAAGNDIKILTYALEQLTAGGST 308 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N + AY+ VI TDG+ + + + L + + E Sbjct: 309 NGAEGIQLAYQLAQKHFVKGGINR--------VILATDGDFNVGTTNLDEL--VDLVEVQ 358 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + G+ + ++ L+ + + GQ+ ++ E + + +++ + I Sbjct: 359 KKHGIGLTTLGFGMGNYNDHLMEQLANKGNGQYAYIDSVNEARKV---LVEQLGATLLTI 415 Query: 395 AP 396 Sbjct: 416 VK 417 >gi|257056239|ref|YP_003134071.1| hypothetical protein Svir_22360 [Saccharomonospora viridis DSM 43017] gi|256586111|gb|ACU97244.1| Mg-chelatase subunit ChlD [Saccharomonospora viridis DSM 43017] Length = 326 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 19/150 (12%), Positives = 47/150 (31%), Gaps = 16/150 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + V + L ++T T + A + + + G ++ +TDG+ Sbjct: 147 TTEREGVVHAIENLKLAQSTATGEGIFAALQAIESFSAVIGGAEG--PPPARIVLMTDGK 204 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCT- 361 + + G+ I +++ +R+ Sbjct: 205 QTVPQDEYAPRGAFTAAGVAKQKGIPITTISFGTSYGSVEIDGTRVPVEVDDASMREIAR 264 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQS 391 S G F+ + EL + +D + ++I + Sbjct: 265 LSGGDFYKAATAEELKQVYDSLGEQIGYEI 294 >gi|209524446|ref|ZP_03272995.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209495237|gb|EDZ95543.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 541 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 25/199 (12%), Positives = 61/199 (30%), Gaps = 22/199 (11%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ +L E LV+ + + + Y N Sbjct: 186 VSGSMNQPNRLPLLKEGFKLLVDQLTEQDT--------VAIAVYAGAAGVVLPPTPGNEK 237 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++ + ++ L +T + AY + S VI TDG+ + Sbjct: 238 QKIIAAIDGLQAQGSTAGGEGIKLAYELAT--------RMLSEGKNNRVILATDGDFNVG 289 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLE 378 + L +++ E R+ G+ + + + K ++ G + +++ E + Sbjct: 290 VSSDAEL--VRLIESYRDRGIYLTVLGFGMGNYKDSKMEKLSNHGNGNYAYIDNLMEAKK 347 Query: 379 SFDKITDKIQEQSVRIAPN 397 ++ IA + Sbjct: 348 VMS---TELTGTLFTIAKD 363 >gi|320102039|ref|YP_004177630.1| VWFA-like domain-containing protein [Isosphaera pallida ATCC 43644] gi|319749321|gb|ADV61081.1| VWFA-related domain protein [Isosphaera pallida ATCC 43644] Length = 784 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 76/187 (40%), Gaps = 23/187 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRL 266 +I L E+ G + ++ + I ++ +V + ++VKS++ Sbjct: 545 NRIGALKEAVGVFLGTLP-----PGSKVAVIEFNSFVNPLVFGPANEIFTTRFDDVKSQV 599 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 N+ T+ Y A+ A + + ++ V+ +TDGE++ + Sbjct: 600 NRFRANGGTSYYDAVDRALELIA-----------NQTGRRAVLALTDGEDTSSR----LA 644 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQ--DLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 + RN G+ ++++ V E + +L R ++ G++F D+ +L F ++ Sbjct: 645 GLDSVILKARNLGLPVHTLGVGREDEIEVGELQRLARETRGRYFPARDATKLRVIFAELA 704 Query: 385 DKIQEQS 391 ++E Sbjct: 705 QSLRESY 711 >gi|124002443|ref|ZP_01687296.1| OmpA family protein [Microscilla marina ATCC 23134] gi|123992272|gb|EAY31640.1| OmpA family protein [Microscilla marina ATCC 23134] Length = 756 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 15/116 (12%), Positives = 44/116 (37%), Gaps = 13/116 (11%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA--SAYQNTLN 327 +T Y L N + + K ++ TDGE + + + Sbjct: 471 RYGGSTALYAGADEGLESLKNAQNN-----------KVMLLFTDGEENSSLQYFGKRAFR 519 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 ++ + R G++++++A + L + G+ + + + E+ + ++++ Sbjct: 520 ASEVVKKAREKGIRVFTIAYGTGVNNKTLNALSMLTDGKTYFIENPDEIKQVYEEL 575 >gi|298488105|ref|ZP_07006142.1| von Willebrand factor type A domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] gi|298157384|gb|EFH98467.1| von Willebrand factor type A domain protein [Pseudomonas savastanoi pv. savastanoi NCPPB 3335] Length = 352 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 51/146 (34%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ + L + ++ Sbjct: 150 PLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRL----------RLRPATSRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY + + + P+ + Sbjct: 200 VTDGANNAGQ-----IDPITAARLAAEEGVKIYPIGIGSDPDKDALQSVLGLNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 255 PTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|289624057|ref|ZP_06457011.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. NCPPB3681] gi|289650363|ref|ZP_06481706.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. 2250] gi|330866187|gb|EGH00896.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. aesculi str. 0893_23] Length = 352 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 51/146 (34%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ + L + ++ Sbjct: 150 PLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRL----------RLRPATSRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY + + + P+ + Sbjct: 200 VTDGANNAGQ-----IDPITAARLAAEEGVKIYPIGIGSDPDKDALQSVLGLNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 255 PTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|171742038|ref|ZP_02917845.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283456833|ref|YP_003361397.1| hypothetical protein BDP_2000 [Bifidobacterium dentium Bd1] gi|171277652|gb|EDT45313.1| hypothetical protein BIFDEN_01142 [Bifidobacterium dentium ATCC 27678] gi|283103467|gb|ADB10573.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium dentium Bd1] Length = 967 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 52/382 (13%), Positives = 112/382 (29%), Gaps = 54/382 (14%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 AS + ++ ++ + + Sbjct: 136 ATASATPTQKPTGTENPTTVERSVQSDDDDADTVANQNEAKDDETKNNADKTVRLGIASY 195 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + S + + L++ +++ I I +VLDVS S Sbjct: 196 RGMLKSASSGLSTPEHTKSIEYQGNGAYILKLNVIGKDASTSTTDTTPIDIALVLDVSGS 255 Query: 160 MEDLY--LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 M D + + + + + K+ + K A +I + Sbjct: 256 MNDDFGGRGSPSKISALKTAVNSFLDETAKTNDTIEDDNDKVKVALVKYANQIGTATGAD 315 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 G +++ +++ G L+ + +K+ +N L T Sbjct: 316 GCRISNSRQSD----------------TGNCTQIVQELTTDAGLLKTSVNGLQAAGATYA 359 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI--CEYM 335 AM A + L + + KK+VIF TDGE + S + + I + + Sbjct: 360 DAAMEVAQQALAGGRAGA---------KKYVIFFTDGEPNHWSGFDGDVANAAIKKSQEL 410 Query: 336 RNAGMKIYSVAVSAPPEGQ----------DLLRKCTD---------------SSGQFFAV 370 +NAG +YS+ + + + S +++ Sbjct: 411 KNAGTTVYSIGIFDGANPSASVSSASNANKFMHGISSNYPNATGYWNLGDRASGDYYYSA 470 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + +L + F+ I I E+ V Sbjct: 471 SSATQLAQIFNDIQKTITEKHV 492 >gi|209546584|ref|YP_002278502.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537828|gb|ACI57762.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 698 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 31/318 (9%), Positives = 78/318 (24%), Gaps = 26/318 (8%) Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 E + A ++ D + + A Y +L + + Sbjct: 225 ERFANAAANPIKSVATDPVSTFSADVD-SASYSFVRRSLTGGAMPDPQSVRVEEMINYFP 283 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + V+ + + P + Sbjct: 284 YDWAGPEKADQPFKATVTVMPTPWNHDTELMHVAIKGYDIAPATAPHANLV-----FLID 338 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 K+ +L + LV+ ++ + + Y Sbjct: 339 VSGSMDEPDKLPLLKSAFRLLVSKLKADDT--------VSIVTYAGNAGTVLEPTRVAEK 390 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +++ S +++L +T + AY V+ TDG+ + Sbjct: 391 SKILSAIDRLEAGGSTGGAEGIEAAYNLAKQAFVKDGVNR--------VMLATDGDFNVG 442 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + +I E R G+ + + L++ + + L E+ Sbjct: 443 --PSSDEDLKRIIEEKRKDGIFLTVLGFGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEA 498 Query: 380 FDKITDKIQEQSVRIAPN 397 + ++ IA + Sbjct: 499 QKTLVEEAGSTLFPIAKD 516 >gi|110598614|ref|ZP_01386881.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] gi|110339783|gb|EAT58291.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] Length = 336 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 30/153 (19%), Positives = 50/153 (32%), Gaps = 36/153 (23%) Query: 254 PLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + + + ++ ++P E T A+ A GST L+K +I Sbjct: 155 PLTIDHDVLAMLIDHISPQVIQDEGTAIGSAILIATNRF----------KGSTSLQKVII 204 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV--AVSAPPE--------------G 353 ITDGEN+ + G++IY V + Sbjct: 205 LITDGENNTG-----DVGPATAATLAAQNGIRIYVVNAGFKSGGSAGNLSAESSAHAAMD 259 Query: 354 QDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + LR + G +F D L + I Sbjct: 260 EASLRGIARTTGGGYFRAEDPSVLDNTIKTIGR 292 >gi|88704964|ref|ZP_01102676.1| conserved hypothetical protein [Congregibacter litoralis KT71] gi|88700659|gb|EAQ97766.1| conserved hypothetical protein [Congregibacter litoralis KT71] Length = 344 Score = 71.5 bits (173), Expect = 2e-10, Method: Composition-based stats. Identities = 32/207 (15%), Positives = 69/207 (33%), Gaps = 55/207 (26%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 R+ID + + + ++ R+G I + +PLS ++ V+ Sbjct: 107 MARRIDAVKQLGSDFMSR---------RSGDRLGLILFGSRAYLQ--SPLSFDIQTVQRF 155 Query: 266 LNKLNPYEN-----TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 L L T A+ A + L + ++ +TDG+++ Sbjct: 156 L--LEAQIGFAGQETAIGDAIGLAVKRLQER----------PASSRVLVLLTDGQDTA-- 201 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPP----------------------EGQDLLR 358 +T++ L+ + G++IY++ + A + L+ Sbjct: 202 ---STVDPLEAANLAADLGVRIYTIGIGADSLTLPGLLGSPLGARTVNPSADLDESTLIE 258 Query: 359 KCTDSSGQFFAVNDSRELLESFDKITD 385 + GQ+F D EL + + Sbjct: 259 IARSTGGQYFRARDPEELATVYRLLDQ 285 >gi|149066378|gb|EDM16251.1| procollagen, type XIV, alpha 1 (predicted), isoform CRA_b [Rattus norvegicus] Length = 1127 Score = 71.5 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 386 VDGSWSIGDDNFNKIINFLYSTVGALDKIGADGT----QVAMVQFTDDPRTEFKLDAYKT 441 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ H L+ + + K ++ ITDG + Sbjct: 442 KETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTADSGTRR-----GIPKVIVVITDGRSQ 496 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L + F V+D Sbjct: 497 DDVN--------KISREMQADGYNIFAIGV--ADADYSELVRIGSKPSSRHVFFVDDF-- 544 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 545 --DAFKKIEDELITFVCETAS 563 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 54/168 (32%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + RIG Y+ N +EV + L NT T A++ + Sbjct: 187 TAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIF 246 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + K I ITDG++ + +R AG++++++ Sbjct: 247 ENSFK-----PEAGSRSGVSKIGILITDGKSQDDIIPPS--------RNLREAGVELFAI 293 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V L++ + V + + + +T + + Sbjct: 294 GV--KNADLSELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 339 >gi|78048779|ref|YP_364954.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria str. 85-10] gi|78037209|emb|CAJ24954.1| putative secreted protein [Xanthomonas campestris pv. vesicatoria str. 85-10] Length = 602 Score = 71.5 bits (173), Expect = 3e-10, Method: Composition-based stats. Identities = 38/293 (12%), Positives = 75/293 (25%), Gaps = 32/293 (10%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV---LDVSRSMEDLY 164 Y L L P+ + V L + +D Sbjct: 155 TGSYSNVRRFLNAGSLPPADAVRVEELINYFRYDDPAPTNGQPFAVRTELATTPWNKDSL 214 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 LPP A K+ +L S LV + Sbjct: 215 -LLRVGIAGRDIATADLPPANLVFLVD--------VSGSMDAPDKLPLLQSSLKLLVRQL 265 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 + RI + Y + + ++ L +T + A Sbjct: 266 RAQD--------RITLVTYAGNTSVVLPPTPGDQQGRIVEAIDALQSGGSTAGASGIELA 317 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 Y+ ++ TDG+ + N + R +G+ + + Sbjct: 318 YKAAQQGYLRGGINR--------ILLATDGDFNVGV--TNFDQLKGMVAEKRRSGIALST 367 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + +L+ + D+ +A D+ LE+ +T ++ IA + Sbjct: 368 LGFGTGNYNDNLMEQLADAGDGAYAYIDTA--LEARKVLTHELGATLATIARD 418 >gi|326675803|ref|XP_002665305.2| PREDICTED: collagen alpha-1(XII) chain [Danio rerio] Length = 3039 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 22/163 (13%), Positives = 54/163 (33%), Gaps = 18/163 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+ + Y+ +V + L NT T AM + + Sbjct: 139 DIEEDKTRVAVVQYSSDTRTEFSLNTHFRRPDVLRAIKNLPYKGGNTMTGDAMDYLVKNT 198 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + K + ITDG++ + E +RN G++I+ + + Sbjct: 199 FTQAAGAR-----KGFPKVAMIITDGKSQDP--------VEEYAERLRNIGVEIFVLGI- 244 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 +D L++ + V + + + + ++ Sbjct: 245 -KGADEDELKEIASRPHSKHVYNVPNFDMINQVQKALITEVCS 286 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 57/165 (34%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+IG Y+ ++ + L NT T A+++ + Sbjct: 1200 DIGPDRVQIGLAQYSGDPKTEWHLNAHRTRTQLLDAVANLPYKGGNTLTGLALNYILQ-- 1257 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 N +K + +TDG++ + + +R+ G+++Y++ V Sbjct: 1258 ---NNFKPNVGMRPNSRKIGVLVTDGKSQDDIVANS--------QNLRDQGIELYAIGV- 1305 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 ++ LR + V D LL+ D +T+ + Sbjct: 1306 -KNADENELRTIASDPDDIHMYNVADFSFLLDIVDDLTNNLCNSV 1349 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 61/169 (36%), Gaps = 23/169 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + V+I + Y+ ++ + + TNT AM + ++ Sbjct: 439 DIGPNKVQISLVQYSRDPHTEFALNKFDDNAAMVKAVRTFPYRGGSTNTGKAMTYVREKI 498 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + ++ ITDG++S + +R+ ++I++V V Sbjct: 499 FVSGRGAR-----DNVPRVMVLITDGKSSDSFKDP--------ANKLRDTDVEIFAVGV- 544 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L + F V D ++F++I+ ++ +RI Sbjct: 545 -KDAVRSELEAIANVPADNHVFEVEDF----DAFERISKELTASICLRI 588 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 62/183 (33%), Gaps = 18/183 (9%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENT 275 V S+ A + +++ + Y+ ++ V S L +N NT Sbjct: 2315 VRQFVFSMIGAFDVVSHEGMQVSFVQYSDDAKTEFKLNTYHDKGMVLSALQMVNYRGGNT 2374 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T A+ H +++ + + K ++ +TDG + + + + Sbjct: 2375 KTGAALKHVGEKVFTS-----DNGMRRTVPKVLVVVTDGRSQD--------DVKKSAAKL 2421 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVR 393 ++AG ++ V V + LR F V++ + D + I E + Sbjct: 2422 QHAGYSVFVVGV--ADIDVNELRNIGSKPSDRHVFIVDNFDAFAKIQDNLITFICETATS 2479 Query: 394 IAP 396 P Sbjct: 2480 TCP 2482 >gi|45361321|ref|NP_989238.1| matrilin 2 [Xenopus (Silurana) tropicalis] gi|39645939|gb|AAH63920.1| matrilin 2 [Xenopus (Silurana) tropicalis] Length = 839 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 67/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + +++S++ + +G I Y+ I ++ +VK Sbjct: 583 GENNFEIVKQFVNGILDSLE-----ISQKAAHVGLIQYSTHIRTEFTMAQYSSAKDVKKA 637 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++++ + T A+ + + ++E + + + + I TDG Sbjct: 638 VSQIKYMGRGSMTGLALKLMHEKSFSEVQGARPRAMG--VPRVAIVFTDGRAQD------ 689 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + + + +G+ IY++ V + + L++ + D + +K Sbjct: 690 --EVSEYAKKAKQSGITIYAIGVGKAIDEE--LQEIASAPQEKHVIYAEDFSAMGYITEK 745 Query: 383 ITDKIQE 389 + I E Sbjct: 746 LKSSICE 752 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 53/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + + R+G + Y + +++ + ++ T T A+ +A Sbjct: 83 DIGPDTTRVGLLQYGSTVKNEFSLKTYKKKMDIERAVKRMMHLATGTMTGLAIQYAMNIA 142 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + + +TDG +I RN+G+ I++V V Sbjct: 143 FSESEGARPL--NQHVPRIAMIVTDGRPQDP--------VAEIAAKARNSGILIFAVGVG 192 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 L+ F V + + L F Sbjct: 193 R--VDMSTLKTIGSEPHTEHVFLVANFSQIETLTTVFQN 229 >gi|194474004|ref|NP_001124020.1| collagen alpha-1(XIV) chain [Rattus norvegicus] gi|149066377|gb|EDM16250.1| procollagen, type XIV, alpha 1 (predicted), isoform CRA_a [Rattus norvegicus] Length = 1794 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 1035 VDGSWSIGDDNFNKIINFLYSTVGALDKIGADGT----QVAMVQFTDDPRTEFKLDAYKT 1090 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ H L+ + + K ++ ITDG + Sbjct: 1091 KETLLDAIRHISYKGGNTKTGKAIKHVRDTLFTADSGTRR-----GIPKVIVVITDGRSQ 1145 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L + F V+D Sbjct: 1146 DDVN--------KISREMQADGYNIFAIGV--ADADYSELVRIGSKPSSRHVFFVDDF-- 1193 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1194 --DAFKKIEDELITFVCETAS 1212 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 54/168 (32%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + RIG Y+ N +EV + L NT T A++ + Sbjct: 187 TAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIDAVRSLPYKGGNTLTGLALNFIF 246 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + K I ITDG++ + +R AG++++++ Sbjct: 247 ENSFK-----PEAGSRSGVSKIGILITDGKSQDDIIPPS--------RNLREAGVELFAI 293 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V L++ + V + + + +T + + Sbjct: 294 GV--KNADLSELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 339 >gi|331012285|gb|EGH92341.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 352 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 51/146 (34%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ + L + ++ Sbjct: 150 PLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRL----------RLRPATSRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY + + A P+ + Sbjct: 200 VTDGANNAGQ-----IDPITAARLAAEEGVKIYPIGIGADPDKDALQSVLGLNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 255 PTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|330989218|gb|EGH87321.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. lachrymans str. M301315] Length = 352 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 51/146 (34%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ + L + ++ Sbjct: 150 PLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRL----------RLRPATSRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY + + A P+ + Sbjct: 200 VTDGANNAGQ-----IDPITAARLAAEEGVKIYPIGIGADPDKDALQSVLGLNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 255 PTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|325496693|gb|EGC94552.1| hypothetical protein ECD227_0790 [Escherichia fergusonii ECD227] Length = 496 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 34/302 (11%), Positives = 87/302 (28%), Gaps = 42/302 (13%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI--------SICMVLDVSRS 159 Y L L + + + + I + ++ + Sbjct: 48 TGSYANVRRFLKTGSLPGADVVRVEELVNYFPLTEATKKNIPGCKGCEENSPFSINYELT 107 Query: 160 MEDLY---LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 D + LPP ++ ++ ++ S Sbjct: 108 PAPWNEKHTLLRLDIAANDIARSKLPPANLVFLID--------TSGSMNSDERLPLIKSS 159 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 LVN ++ RI + Y + + N + + + L TN Sbjct: 160 LKLLVNELRDQD--------RISIVTYAGSARLLLSSTSGSEKNTILNAIANLQAGGGTN 211 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + AY + ++ TDG+ + + + + + R Sbjct: 212 GGAGVAMAYEQAQAGYIKGGVNR--------ILLATDGDFNIG---DDPASVEDLVKKQR 260 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 +G+ + ++ V ++++ K D+ G + L E+ +++++ + V +A Sbjct: 261 ESGITLSTLGVGDNNYNEEMMVKIADTGNGNYSY---LDSLSEAQKVLSNEMNQTLVTVA 317 Query: 396 PN 397 + Sbjct: 318 KD 319 >gi|257482758|ref|ZP_05636799.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. tabaci ATCC 11528] Length = 265 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 51/146 (34%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ + L + ++ Sbjct: 63 PLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRL----------RLRPATSRVLVL 112 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY + + A P+ + Sbjct: 113 VTDGANNAGQ-----IDPITAARLAAEEGVKIYPIGIGADPDKDALQSVLGLNPSLDLDE 167 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 168 PTLKEIASLSGGQYFRARDGDQLEKI 193 >gi|71737462|ref|YP_275714.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. phaseolicola 1448A] gi|71558015|gb|AAZ37226.1| von Willebrand factor type A domain protein [Pseudomonas syringae pv. phaseolicola 1448A] gi|320329710|gb|EFW85699.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] gi|330882170|gb|EGH16319.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. glycinea str. race 4] Length = 352 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 25/146 (17%), Positives = 51/146 (34%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ + L + ++ Sbjct: 150 PLTYDRRTVRVWLDEAKIGIAGKNTALGDAIGLGLKRL----------RLRPATSRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+ ++ + G+KIY + + A P+ + Sbjct: 200 VTDGANNAGQ-----IDPITAARLAAEEGVKIYPIGIGADPDKDALQSVLGLNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ S GQ+F D +L + Sbjct: 255 PTLKEIASLSGGQYFRARDGDQLEKI 280 >gi|224057976|ref|XP_002299418.1| predicted protein [Populus trichocarpa] gi|222846676|gb|EEE84223.1| predicted protein [Populus trichocarpa] Length = 595 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 64/192 (33%), Gaps = 30/192 (15%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL----SNN 258 K+ +L + ++ ++ + R+ + ++ + PL + Sbjct: 164 SGSMAGKLILLKRAVNFIIQNLGPSD--------RLSIVTFSSSAR--RILPLRTMSGSG 213 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + S +N L+ TN + R L ++ + +I ++DG ++ Sbjct: 214 REDAISVVNSLSATGGTNIVAGLRKGVRVLEERRQHNSVAS--------IILLSDGCDTQ 265 Query: 319 ASAYQNTLNTLQIC--------EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV 370 + + N L L++ E R I++ + + S G F + Sbjct: 266 SHSTHNRLEYLKLIFPSNNASGEESRQPTFPIHTFGFGLDHDSAAMHAISDVSGGTFSFI 325 Query: 371 NDSRELLESFDK 382 L ++F + Sbjct: 326 ESIDILQDAFAR 337 >gi|73973310|ref|XP_867431.1| PREDICTED: similar to alpha 1 type XII collagen short isoform precursor isoform 2 [Canis familiaris] Length = 1901 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 53/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ Sbjct: 71 RVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ-----N 125 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 126 FRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 175 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 +D L+ + V D L + D +T + Sbjct: 176 EDELKMIATDPDDTHAYNVADFESLSKIVDDLTINLCNSV 215 >gi|150019021|ref|YP_001311275.1| von Willebrand factor, type A [Clostridium beijerinckii NCIMB 8052] gi|149905486|gb|ABR36319.1| von Willebrand factor, type A [Clostridium beijerinckii NCIMB 8052] Length = 962 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 42/292 (14%), Positives = 76/292 (26%), Gaps = 40/292 (13%) Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP------------ 185 I +VLD S SM D Y + + Sbjct: 70 PQPFKISIPPKEIVLVLDSSGSMADNYKLTNLKKAATDFITKMSTVKNLKIAIVDFDTQA 129 Query: 186 ------KKSFWSKNTT---KSKYAPAPAPANRKIDVLIESAGNLVNS-----------IQ 225 S N T +S + L ++A L NS I Sbjct: 130 TIINKLTDVSSSTNVTALKRSINNLTAGGGTNTGEGLRQAAYLLSNSSEQNPLASKNIIF 189 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTP---LSNNLNEVKSRLNKLNPYENTNTYPAMH 282 + E + + + G S N N + T Y + Sbjct: 190 MSDGEPTYYNWQTANSGWIWGDYNGTYYTYYDWSFNRVNYGFDSNHMPSGYRTGPYTGNY 249 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 Y+ + + + I + + V G + N + E ++ G + Sbjct: 250 TGYKYQGSSYSNYYTDITQSNVASNVSQSGTGYSDTDGKSLN--YAKIMGEIIKGKGYNV 307 Query: 343 YSVAVSAPPEGQDLLRKC---TDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 +S+ +G +++ ++ F DS + F I D I + Sbjct: 308 FSIGYGLDSDGNTKMQQIHSSMSTNNSNFYETDSGAIDAVFSNIGDDIADSY 359 >gi|57524519|ref|NP_001004007.1| matrilin 3a [Danio rerio] gi|51330145|gb|AAH80220.1| Matrilin 3a [Danio rerio] gi|123233072|emb|CAM15633.1| novel protein similar to vertebrate matrilin 3 (MATN3) (zgc:101120) [Danio rerio] gi|158254363|gb|AAI54374.1| Matn3a protein [Danio rerio] Length = 337 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 56/170 (32%), Gaps = 20/170 (11%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNT 277 L + + + + R+ + Y + + +K + ++ P T T Sbjct: 86 FLADMVDT--LDVGPDATRVAVVNYASTVKIEFLLKSHLTKDTIKQAITRIEPLAAGTMT 143 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ A E + EK + S + K I +TDG ++ R Sbjct: 144 GMAIKKAMDEAFTEKSGARPK--SKNISKVAIIVTDGRPQD--------QVEEVSAAARA 193 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 +G++IY+V V L+ + F V +L F + Sbjct: 194 SGIEIYAVGV--DRADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRE 241 >gi|116624819|ref|YP_826975.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227981|gb|ABJ86690.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 837 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 18/136 (13%), Positives = 51/136 (37%), Gaps = 17/136 (12%) Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 ++ + + +K ++ + P T PA+ AY+ + + Sbjct: 435 FDNSFQWAVPIRKAEDRATIKKLISGITPDGGTQIAPALTEAYQRILPQTAMY------- 487 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 K ++ +TDG + +++ + + + + I +V + L + ++ Sbjct: 488 ---KHIVLLTDGISEEG-------DSMTLTKEAQANHVTISTVGLGQDVNRAFLEKVASN 537 Query: 363 SSGQFFAVNDSRELLE 378 + G+ + +ND L + Sbjct: 538 ADGKAYFLNDPSGLEQ 553 >gi|116623628|ref|YP_825784.1| hypothetical protein Acid_4540 [Candidatus Solibacter usitatus Ellin6076] gi|116226790|gb|ABJ85499.1| hypothetical protein Acid_4540 [Candidatus Solibacter usitatus Ellin6076] Length = 543 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 41/317 (12%), Positives = 89/317 (28%), Gaps = 47/317 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +T ++ S+ + AID + ++ ++Q+A+D A + + + Sbjct: 19 ITLLLPSIMIPLVGLAIDASVARLVQLRLQAAVDGAAMGAGRLLGTP----------AVP 68 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 T+ + + + + I ++ AQ L Sbjct: 69 ETLAAEFLASNFRTDGSAGTWGAHDLHS---TIVYTPGITKIIDIDATAQV-----PLLF 120 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ + R +G S L I +DVS +N + Sbjct: 121 LRILGKTSATVRARGSGTRTDSRVMLVIDRSGTMDVSDGTGL---PTRIENAKTVAQTLF 177 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 +P + + + G + + + T Sbjct: 178 IPAFTEGA--------------------------DEIGLVAFDGSAYVAYPPSQPGWDPT 211 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + + NN N + +++N ++ T T A+ AY EL G Sbjct: 212 TTSSSRGGPDMYFKDPNNPNNMINQVNAIDAGSYTGTAEALWMAYIELQKAHLKDLAQDG 271 Query: 301 STRLKKFVIFITDGENS 317 ++ +TDG Sbjct: 272 VDLRMNSILLLTDGLPQ 288 >gi|89098674|ref|ZP_01171556.1| hypothetical protein B14911_00755 [Bacillus sp. NRRL B-14911] gi|89086636|gb|EAR65755.1| hypothetical protein B14911_00755 [Bacillus sp. NRRL B-14911] Length = 920 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 22/141 (15%), Positives = 53/141 (37%), Gaps = 18/141 (12%) Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 ++ T + + ++ + P T + ++ AY EL Sbjct: 445 FDDRPWVIVETGPLEDKKDAVDKIGSVTPGGGTEIFTSLEKAYEEL----------ENLK 494 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +K +I +TDG+ + + + E + + + +VA+ + ++LL + Sbjct: 495 LQRKHIILLTDGQ------SARSTDYESMIETGKENNITLSTVALGSDA-DRNLLEELAG 547 Query: 363 -SSGQFFAVNDSRELLESFDK 382 +G+F+ V DS + + Sbjct: 548 LGAGRFYDVTDSSVIPSILSR 568 >gi|158316887|ref|YP_001509395.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158112292|gb|ABW14489.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 319 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 27/220 (12%), Positives = 66/220 (30%), Gaps = 31/220 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 +++ S A +++ + A V+ + RI + Sbjct: 85 ERATIILAIDVSNSMAATDITPTRLEAAKQGAQAFVDQLP----------PRINLGLVSF 134 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P S + V+S + L T ++ + + + + + G Sbjct: 135 AGSAAVLVPASTDRESVRSGIRGLQLGPATAVGEGIYASLQAIATAGQ-RLSDEGQPPPP 193 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DGE + Q R+A + + ++A Sbjct: 194 AAIVLLSDGETTRGRP------NTQAATAARDAEIPVDTIAYGTSDGTLDVGGQQIPVPV 247 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQS 391 ++ LR+ D + G + EL + + I ++ Sbjct: 248 NEEALRELADQTGGSYHRATSGDELQSVYRGLGSSIGYRT 287 >gi|149197810|ref|ZP_01874859.1| hypothetical protein LNTAR_04966 [Lentisphaera araneosa HTCC2155] gi|149139031|gb|EDM27435.1| hypothetical protein LNTAR_04966 [Lentisphaera araneosa HTCC2155] Length = 833 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 36/178 (20%), Positives = 69/178 (38%), Gaps = 19/178 (10%) Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 L KA E + ++G IA++ + N EV S+++ + TN YP Sbjct: 431 LAREASKAAAELLSSRDQVGVIAFDGSAKLVTDLTSAANKGEVLSQIDGIGAGGGTNLYP 490 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 AM L ++ K +I ++DG++ G I + G Sbjct: 491 AMVMGRDML----------GIASAKIKHMIVLSDGQSQGGDF-------EGISSELAQMG 533 Query: 340 MKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + I +V++ DL+ +G+ + N++ E+ F K T + +++ P Sbjct: 534 VTISTVSLGQGA-AVDLMAAIAQIGNGRAYVTNNAEEMPRIFTKETMEASRSAIKEEP 590 >gi|313225346|emb|CBY06820.1| unnamed protein product [Oikopleura dioica] Length = 369 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 19/154 (12%), Positives = 52/154 (33%), Gaps = 18/154 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 R+ + Y+ + + + ++ + NT T A+ + ++++ + Sbjct: 125 DYTRVSVLQYSDDPRIEFYLKDYQDKTTLLNAIDAITYKGGNTRTGEAIRYMMGQIFSVE 184 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 S +KK ++ +TDG++ +N ++ +++ V Sbjct: 185 AGSR-----PYVKKHMVLLTDGQSQDDVGAP--------ARAAKNFNIRTFAIGVG--DA 229 Query: 353 GQDLLRKCTDSS--GQFFAVNDSRELLESFDKIT 384 +D L+ + V D + D + Sbjct: 230 IEDELKLVATPPFSDTLYHVEDYDGIRHLQDTLA 263 >gi|260808371|ref|XP_002598981.1| hypothetical protein BRAFLDRAFT_221835 [Branchiostoma floridae] gi|229284256|gb|EEN54993.1| hypothetical protein BRAFLDRAFT_221835 [Branchiostoma floridae] Length = 193 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 56/160 (35%), Gaps = 11/160 (6%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKES 294 R+G I Y N + + +N ++ T T A+ A ++ Sbjct: 43 TRVGLIEYTDSPTVEFKLADHTNKASLATAINNVSYQSGGTQTGRALDAARTQMD--WRQ 100 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 L + I +TDG + N Q + +R+ + Y V + Sbjct: 101 PPVPNVCFSLLQAAIVVTDGM--------SGDNVQQPAKALRDNDISAYGVGIGPAINAN 152 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 +L +G F + + +L + +KI++ + ++I Sbjct: 153 ELNEIAGGDAGHVFYIPNYDKLEKEMEKISNSVCSGVLQI 192 >gi|225435353|ref|XP_002285265.1| PREDICTED: hypothetical protein isoform 1 [Vitis vinifera] Length = 729 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 57/185 (30%), Gaps = 24/185 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ +L + G ++ S+ + + + + + + +N Sbjct: 292 TKLALLKRAMGFVIQSLGPCDRLSVISFSSTARRLFPLRRMTDT------GRQQALQAVN 345 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-----ASAY 322 L TN + + + + +I ++DG+++ A+ Sbjct: 346 SLVSNGGTNIAEGLRKGAKVMLD--------RKWKNPVSSIILLSDGQDTYTVCSPGGAH 397 Query: 323 QNTLNTLQICEYMRNAG-----MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 T +L + + G + +++ + + S G F + + Sbjct: 398 SRTDYSLLLPFSIHRNGGTGFQIPVHAFGFGTDHDATSMHAISETSGGTFSFIEAEGVIQ 457 Query: 378 ESFDK 382 ++F + Sbjct: 458 DAFAQ 462 >gi|300871001|ref|YP_003785873.1| aerotolerance-like membrane protein [Brachyspira pilosicoli 95/1000] gi|300688701|gb|ADK31372.1| aerotolerance-related membrane protein [Brachyspira pilosicoli 95/1000] Length = 328 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 60/187 (32%), Gaps = 40/187 (21%) Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMH 282 +K + + + + +P + + ++ + + +T+ + Sbjct: 112 KKTMADFIKKRNFDKISLVAFALRASVLSPSTFDYTSLEKEIGNIKIDEEGSTSIGLGIA 171 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 A L + K+ + +K +I +TDGEN+ ++ E N +KI Sbjct: 172 TAVDMLRSVKDDA---------EKVIILLTDGENNSG-----EIDPKLASEIASNFNIKI 217 Query: 343 YSVAVSAPP------------------------EGQDLLRKCTDSSGQFFAVNDSRELLE 378 Y++ + + L+ + + G++F S L Sbjct: 218 YTIGIGDAAGSHAWVTYTDPNYGKRRIRADFTLNEKALIEIASITGGKYFNAKTSSALDN 277 Query: 379 SFDKITD 385 ++ I Sbjct: 278 VYNTIDR 284 >gi|241554201|ref|YP_002979414.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240863507|gb|ACS61169.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 706 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 38/365 (10%), Positives = 94/365 (25%), Gaps = 25/365 (6%) Query: 35 AAVLSGC--ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN 92 AA L A+ + + + + + + N A A Sbjct: 182 AAALGATKRAAPAAPGIVPQRQFAEPMAAIAPSPVPPAEGRMQMQLDPNRERFANAAANP 241 Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 I +P+ + L G ++ + + Sbjct: 242 IKSVATDPVSTFSADVDSASYAFVRRSLTGGAMPDPLSVRVEEMINYFPYDWPGPNNADQ 301 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 + ++ + + ++ Y + P N K+ + Sbjct: 302 PFKATVTVMPTPWNRDTELMHVAIKGYDIAPATTP---RANLVFLIDVSGSMDEPDKLPL 358 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 L + +VN ++ + + Y +++ S +++L P Sbjct: 359 LKSAFRLMVNRLKADDT--------VSIVTYAGNAGTVLAPTRVAEKSKILSAIDRLEPG 410 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 +T + AY V+ TDG+ + + + +I Sbjct: 411 GSTGGAEGIEAAYDLAKQGFVKDGVNR--------VMLATDGDFNVG--PSSDGDLKRII 460 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 E R G+ + + L++ + + L E+ + ++ Sbjct: 461 EEKRKDGIFLTVLGFGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEAQKTLVEEAGSTLF 518 Query: 393 RIAPN 397 IA + Sbjct: 519 PIASD 523 >gi|327280282|ref|XP_003224881.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XIV) chain-like [Anolis carolinensis] Length = 1885 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 38/285 (13%), Positives = 83/285 (29%), Gaps = 22/285 (7%) Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 + ++GLIP + + + A + D+ + E K Sbjct: 74 TLQNNANKAIIQGLIPDQSYTVQVVAFAKD--QESKPAQGQFRIKDIEKKKETSGKPKVK 131 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 D + P+K+ + T + I L A Sbjct: 132 DVGKW--KPTVPTEDPEKNQFKCTTPAIADIVILVDGSWSIGRFNFRLVRLFLENLVAAF 189 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + RIG Y+ + + V + L NT T A+ Sbjct: 190 NVGSEKTRIGLAQYSGDPRIEWHLNTYSTKDAVLDAVRNLPYKGGNTLTGLALTFILENN 249 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + K I ITDG++ + +++AG++++++ V Sbjct: 250 FKSEAGAR-----PGVPKIGILITDGKSQDDVIPP--------AKNLKDAGIELFAIGV- 295 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L++ + V D + + +T + + Sbjct: 296 -KNADETELKEIASEPDNTHVYNVADFSFMNSIVEGLTKTVCSRV 339 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 64/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ ++ Sbjct: 1044 VDGSWSIGDDNFNKIIGFLYSTVGALDKIGPDGT----QVAIAQFSDDPRTEFKLNAYRT 1099 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ H + L+ ++ + K ++ ITDG + Sbjct: 1100 KETLLEAIQQIAYKGGNTKTGKAIKHVQQVLFT-----IDSGTRKGIPKVLVVITDGRSQ 1154 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 ++ M+ G I+++ V L F V+D Sbjct: 1155 DDVN--------KVSREMQLDGFSIFAIGV--ADADYSELLNIGSKPSERHVFFVDDF-- 1202 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1203 --DAFKKIEDELVTFVCETAS 1221 >gi|331003698|ref|ZP_08327192.1| hypothetical protein HMPREF0491_02054 [Lachnospiraceae oral taxon 107 str. F0167] gi|330412081|gb|EGG91476.1| hypothetical protein HMPREF0491_02054 [Lachnospiraceae oral taxon 107 str. F0167] Length = 528 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 33/338 (9%), Positives = 86/338 (25%), Gaps = 31/338 (9%) Query: 46 SDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA 105 + + D + + + +Y E I + + + + Sbjct: 34 NSGAVYDGAKTAENAAPYVS-----YEPNVNYNTEEYNAIKESSFHAVATSPLSTFAADV 88 Query: 106 ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL 165 + A Y + L P+ + + V + + Sbjct: 89 D-TASYANIRRFINSGELPPADSVRIEEMLNYFRYDYPQPKDGEPFSV--STEISACPWN 145 Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 K + K S K+ ++ + L ++ Sbjct: 146 PDTKLMMIGMQAKKVEESEKKPSNLVFLID----VSGSMDEPDKLPLVKNAFLLLCEELK 201 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 + I + Y ++ E+ S + L +T + AY Sbjct: 202 ENDT--------ISIVTYAGYDQVVLEGASGSDSKEIMSAIEDLEAAGSTAGSDGIKTAY 253 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + +S N VI TDG+ + + ++ + + +G+ + + Sbjct: 254 KIAKKYFKSDGNNR--------VILATDGDLNVG--ITSEGKLTRLIKKEKESGVFLSVL 303 Query: 346 AVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 + + D+ G + ++ E + + Sbjct: 304 GFGTENIKDNKMEALADNGNGNYSYIDSRFEAKKVLSE 341 >gi|297668115|ref|XP_002812300.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-3-like [Pongo abelii] Length = 493 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 59/176 (33%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + + +K + ++ P Sbjct: 109 VKTFVSRIIDTL-----DIGPADTRVAVVNYASTVKIEFQLQTYTDKQSLKQAVGRITPL 163 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E++ + + S+ + K I +TDG ++ Sbjct: 164 STGTMSGLAIQTAMDEIFTVEAGARG--PSSNIPKVAIIVTDGRPQD--------QVNEV 213 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + +G+++Y+V V + L+ F V +L F + Sbjct: 214 AARAQASGIELYAVGV--DRADMESLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 267 >gi|241113476|ref|YP_002973311.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861684|gb|ACS59350.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 329 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 54/150 (36%), Gaps = 25/150 (16%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYEN---TNTYPAMHHAYRELYNEKESSHNTIGST 302 G P + + V++ + P T+ A+ A + +T Sbjct: 147 GDAPYPLAPFTMDHELVRTMIADTVPGMAGPRTSLGDALGLAIKMF----------GKTT 196 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------EGQD 355 +K +I +TDG ++ + L+ E ++ G+ ++V + P + Sbjct: 197 APEKVLIVLTDGNDTASRMP-----PLKAAEIAKSKGVIFHTVGIGDPAATGEDKLDTAT 251 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITD 385 L + + G++F D +L ++ + Sbjct: 252 LQKIAASTGGRYFFGGDQSQLAAIYEVLDQ 281 >gi|242034231|ref|XP_002464510.1| hypothetical protein SORBIDRAFT_01g019870 [Sorghum bicolor] gi|241918364|gb|EER91508.1| hypothetical protein SORBIDRAFT_01g019870 [Sorghum bicolor] Length = 647 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 24/209 (11%), Positives = 69/209 (33%), Gaps = 35/209 (16%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN--LNEVK 263 K+++L ++ G +++++ R+ ++++ G ++ + + Sbjct: 189 VGTKLELLKQAMGFVIDNLG--------PRDRLCVVSFSSGANRLMRLARMSDAGKSLAR 240 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG----- 318 + L TN A+ A + + V+ ++DG+++ Sbjct: 241 RAVQSLAAGGGTNIGEALRRAAKVIDE--------RMHRNAVASVVLLSDGQDTYTVPRR 292 Query: 319 ASAYQNTLNTLQICE---YMRNAG----MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 N + AG +++ + + + G F + Sbjct: 293 GGYGGRDANYDALVPPSFAFTGAGGRPAAPVHTFGFGTDHDAAAMHTIAEATGGTFSFIE 352 Query: 372 DSRELLESFDK-----ITDKIQEQSVRIA 395 D + ++F + ++ +QE + IA Sbjct: 353 DEAAIQDAFAQCIGGLLSVTVQELRLDIA 381 >gi|288919019|ref|ZP_06413360.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288349559|gb|EFC83795.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 319 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 28/220 (12%), Positives = 65/220 (29%), Gaps = 31/220 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 +++ S A +++ + A V+ + RI + Sbjct: 85 ERATIILAIDVSNSMAATDIQPTRLEAAKQGAQAFVDQLP----------PRINLGLVSF 134 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P S + V+S + L T + + + + E + G Sbjct: 135 AGSAAVLVPASTDRESVRSGIRGLQLGPATAVGEGIFASLQAITTAGERMSDE-GQPPPP 193 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DGE + Q R+A + + ++A Sbjct: 194 AAIVLLSDGETTRGRP------NTQAATAARDAEVPVDTIAYGTSDGTLDVGGQQIPVPV 247 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQS 391 +D LR+ + + G + EL + + I ++ Sbjct: 248 NEDALRELAEQTGGSYHRATTGDELQSVYRGLGSSIGYRT 287 >gi|296224421|ref|XP_002758053.1| PREDICTED: matrilin-3 isoform 2 [Callithrix jacchus] Length = 445 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 58/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + + +K + ++ P Sbjct: 103 VKTFVSRIIDTL-----DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGRITPL 157 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + S+ + K I +TDG ++ Sbjct: 158 STGTMSGLAIQTAMDEAFTLEAGARG--PSSNVPKVAIIVTDGRPQD--------QVNEV 207 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G+++Y+V V + L+ F V +L F + Sbjct: 208 AARARASGIELYAVGV--DRADMESLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 261 >gi|296224419|ref|XP_002758052.1| PREDICTED: matrilin-3 isoform 1 [Callithrix jacchus] Length = 487 Score = 71.1 bits (172), Expect = 3e-10, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 58/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + + +K + ++ P Sbjct: 103 VKTFVSRIIDTL-----DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGRITPL 157 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + S+ + K I +TDG ++ Sbjct: 158 STGTMSGLAIQTAMDEAFTLEAGARG--PSSNVPKVAIIVTDGRPQD--------QVNEV 207 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G+++Y+V V + L+ F V +L F + Sbjct: 208 AARARASGIELYAVGV--DRADMESLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 261 >gi|326932831|ref|XP_003212516.1| PREDICTED: cartilage matrix protein-like [Meleagris gallopavo] Length = 493 Score = 70.7 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 64/183 (34%), Gaps = 23/183 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V S++ ++G + Y+ + N ++K+ + K Sbjct: 287 NFELVKKFINQIVESLE-----VSEKQAQVGLVQYSSSVRQEFPLGQFKNKKDIKAAVKK 341 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + T T A+ + ++ + + K I TDG + Sbjct: 342 MAYMEKGTMTGQALKYLVDNSFSIANGAR-----PGVPKVGIVFTDGRSQDYITD----- 391 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 + ++ G ++++V V +D LR+ +F D R + K+ Sbjct: 392 ---AAKKAKDLGFRMFAVGVG--NAVEDELREIASEPVAEHYFYTADFRTISNIGKKLQM 446 Query: 386 KIQ 388 KI Sbjct: 447 KIC 449 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 47/152 (30%), Gaps = 15/152 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + S R+G I Y + + + ++ P T T A+ A Sbjct: 70 DVGPNSTRVGVINYASAVKNEFSLKTHQTKAGLLQAVRRIEPLSTGTMTGLAIQFAISRA 129 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 +++ E + + K I +TDG + R AG++I+++ V Sbjct: 130 FSDAEGARLRSSNIN--KVAIVVTDGRPQDG--------VQDVSARARQAGIEIFAIGVG 179 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 LR+ V + + Sbjct: 180 R--VDMHTLRQIASEPLDDHVDYVESYSVIEK 209 >gi|221108467|ref|XP_002170770.1| PREDICTED: similar to collagen, type XXIX, alpha 1, partial [Hydra magnipapillata] Length = 592 Score = 70.7 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 56/183 (30%), Gaps = 19/183 (10%) Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + D L + S+ R I ++ + + Sbjct: 410 SSGSLRKYYQNEKDFLKSAISAFGVSVNGT---------RAAVITFSYHAQLSIKLNKYS 460 Query: 258 NLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NLN K ++ + +T A+ A +E++ + + + K + +TDG Sbjct: 461 NLNSFKEAVDNIVLMGSTTRIDKALRLAQKEVFELENGAR-----PGVAKILFLLTDGSQ 515 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + N + I +R+AG+ I + + L + + +L Sbjct: 516 TQERGS---ENPVAIANELRSAGVTIIVIGI-TNAVDVSELFDIAGGEENAYFADSFEKL 571 Query: 377 LES 379 + Sbjct: 572 KDV 574 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 60/199 (30%), Gaps = 21/199 (10%) Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + D L S R I ++ + + Sbjct: 214 SSGSLRKYYQNEKDFLKSVISAFGVSFNGT---------RAAVITFSYHAQLSIKLNKYS 264 Query: 258 NLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NLN K ++ + +T A+ A +E++ + + + K + +TDG Sbjct: 265 NLNSFKEAVDNIVLMGSTTRIDKALRLAQKEVFELENGAR-----PGVAKILFLLTDGSQ 319 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + N + I +R+AG+ I + + L +A +L Sbjct: 320 TQERGS---ENPVAIANELRSAGVTIIVIGI-TNAVNVSELSDIAGGEENAYAAESFEKL 375 Query: 377 LES--FDKITDKIQEQSVR 393 + D I K+ E + Sbjct: 376 KDVNFLDVIKTKMCETATF 394 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 59/196 (30%), Gaps = 21/196 (10%) Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + D L S R I ++ + + Sbjct: 18 SSGSLRKYYQNEKDFLKSVISAFGVSFNGT---------RAAVITFSYHAQLSIKLNKYS 68 Query: 258 NLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NLN K ++ + +T A+ A +E++ + + + K + +TDG Sbjct: 69 NLNSFKEAVDNIVLMGSTTRIDKALRLAQKEVFELENGAR-----PGVAKILFLLTDGSQ 123 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + N + I +R+AG+ I + + L +A +L Sbjct: 124 TQERGS---ENPVAIANELRSAGVTIIVIGI-TNAVNVSELSDIAGGEENAYATESFEKL 179 Query: 377 LES--FDKITDKIQEQ 390 + D I K+ E Sbjct: 180 KDVNFLDVIKTKMCET 195 >gi|254452693|ref|ZP_05066130.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198267099|gb|EDY91369.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 173 Score = 70.7 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 47/165 (28%), Gaps = 34/165 (20%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 +++ + F A+DL R ++Q +LD A L+ + D Sbjct: 28 MMMIMILWFGGMAVDLMRYETTRAKLQGSLDRATLAAA-------DLDQIMPPADVVRDY 80 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 K H QG + ++A + L + Sbjct: 81 LDKAGMLHFLQGEP-----------------TVSQGINYRVVSAQASAPMALFFYDLPRI 123 Query: 124 I------PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 N+S ST + + +VLDVS SM Sbjct: 124 FTSPFSPGMTAINVSGASTAEERV----TDVEVSLVLDVSSSMNS 164 >gi|119899154|ref|YP_934367.1| hypothetical protein azo2864 [Azoarcus sp. BH72] gi|119671567|emb|CAL95480.1| conserved hypothetical membrane protein [Azoarcus sp. BH72] Length = 339 Score = 70.7 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 29/236 (12%), Positives = 69/236 (29%), Gaps = 50/236 (21%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 ++ S AP ++ +A + V + VRIG +++ Sbjct: 83 SEQRTIILAIDVSLSMSAPDVLPDRLSAAQAAARDFVR--------NQPPDVRIGIVSFA 134 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN--EKESSHNTIGST 302 Q ++N ++ +++L +T + A L+ + + S Sbjct: 135 GTATVVQAP--TDNREDLLGAIDRLQLARHTAIGSGIIVALSALFPEESFDPDPTMMSSA 192 Query: 303 RLKKF-----------------VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + VI +TDG + + G+++++V Sbjct: 193 EPGRAPNAPREEVAPGSNGSAAVILLTDGRRTSG------PEPVDAARMAAVRGIRVFTV 246 Query: 346 AVSAPPE--------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 + LR D + ++F + EL + + + + Sbjct: 247 GFGTAEGATIQNEGWSVFMRFDEGTLRAIADLTQAKYFHAGTAAELQQIYHDLNAR 302 >gi|94313457|ref|YP_586666.1| hypothetical protein Rmet_4532 [Cupriavidus metallidurans CH34] gi|93357309|gb|ABF11397.1| conserved hypothetical protein, (Von Willebrand factor, type A); putative membrane protein [Cupriavidus metallidurans CH34] Length = 334 Score = 70.7 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 60/189 (31%), Gaps = 33/189 (17%) Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENTN 276 V+++++ + + G P + + V+ L + +T+ Sbjct: 121 RVDAVRQVVADFVARRTGDRIGLIVFGDAPYPLAPFTLDHALVRELLADMVPGMAGASTS 180 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ + K +I +TDG ++ + Q + + Sbjct: 181 LGDAIGLGIKMFDQSHAQE----------KVMILLTDGNDTASRMP-----PAQAADIAK 225 Query: 337 NAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQE 389 G+ +++V + P D L+ + G++F D L + + Sbjct: 226 TRGVVVHTVGIGDPATTGEQKVDLDALKHIASTTGGRYFFGADQTSLASIYATLD----- 280 Query: 390 QSVRIAPNR 398 R+ P+R Sbjct: 281 ---RVTPHR 286 >gi|221104447|ref|XP_002170122.1| PREDICTED: similar to tyrosine kinase receptor, partial [Hydra magnipapillata] Length = 898 Score = 70.7 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 54/157 (34%), Gaps = 11/157 (7%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 N G + ++ + + + + ++ + T A+ + Sbjct: 124 SNNGAHAGVVTFSSIAELSIKLNQYYDQEQFERAVDDIPYMGYVTRIDLALRKSLEMFDE 183 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + + +TDGE + N + I + +R+ G+ I+++ + + Sbjct: 184 INGAR------KSIPQILFLLTDGEQYAGKGVVDE-NPVSIAKLLRDKGIVIFAIGIGS- 235 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSREL--LESFDKITD 385 Q L SS + F + EL + KI + Sbjct: 236 AVRQSQLNDIAGSSEKAFLAKNFNELVNSDFLKKIKE 272 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 55/185 (29%), Gaps = 32/185 (17%) Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 + + L+N K G + ++ + N + + + ++ Sbjct: 732 NHYQQEVEFLINLASTFNISKYGA--HAGVVTFSYDAFLSIKLNDYFNQAQFNNAVKDIS 789 Query: 271 P-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T A+ + + + + +TDGE SG N + Sbjct: 790 YLNGGTRIDLALEKSLEMFDELNGAR------KNTPQILFLLTDGEQSGD------KNPV 837 Query: 330 QICEYMRNAGMKI-------YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL--LESF 380 I + +R+ G+ I Y + L S+ + F + EL + Sbjct: 838 DIAKRLRDRGIIIFAIGIGSYV--------NKTELNNIVGSNDKAFLAENFNELVNNDFL 889 Query: 381 DKITD 385 KI + Sbjct: 890 KKIKE 894 >gi|58425974|gb|AAW75011.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC10331] Length = 365 Score = 70.7 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 39/327 (11%), Positives = 85/327 (25%), Gaps = 29/327 (8%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 + + RE I A + + + + Y L L P + Sbjct: 51 IPAPAEDREGYQHIDDNAIVQAAQQPISTFSIDVD-TGSYSNVRRFLSAGSLPPVDAVRV 109 Query: 132 SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS 191 V +L N ++ + Sbjct: 110 EELINYFRYDHPAPTNDKPFAV------RTELATTPWNKDSLLLRVGIAGRDIATADLPP 163 Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 N A K+ +L S LV ++K RI + Y Sbjct: 164 ANLVFQVDVSGSMDAPDKLPLLRSSLKLLVRQLRKQD--------RITLVTYAGNTAVVL 215 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + + ++ L +T + AY+ ++ Sbjct: 216 PPTPGDQQGRIVEAIDSLQSGGSTAGASGIELAYKAAQQGYLRGGINR--------ILLA 267 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAV 370 TDG+ + + L + R +G+ + ++ +L+ + D+ G + + Sbjct: 268 TDGDFNVGVTDFDAL--KGMVSEKRRSGVALSTLGFGTGNYNDNLMEQSADAGDGAYAYI 325 Query: 371 NDSRELLESFDKITDKIQEQSVRIAPN 397 + E + +T ++ IA + Sbjct: 326 DTPLEARKV---LTHELGATLATIARD 349 >gi|218516298|ref|ZP_03513138.1| hypothetical protein Retl8_22869 [Rhizobium etli 8C-3] Length = 432 Score = 70.7 bits (171), Expect = 3e-10, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 56/198 (28%), Gaps = 20/198 (10%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 K+ +L + LVN ++ + + Y Sbjct: 72 VSGSMDEPDKLPLLKSAFRLLVNRLKADDT--------VSIVTYAGNAGTVLEPTRVAEK 123 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++ S +++L +T + AY V+ TDG+ + Sbjct: 124 PKILSAIDRLEAGGSTGGAEGIEAAYDLAKKAFVQDGVNR--------VMLATDGDFNVG 175 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + +I E R G+ + + L++ + + L E+ Sbjct: 176 --PSSDEDLKRIIEEKRKDGIFLTVLGFGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEA 231 Query: 380 FDKITDKIQEQSVRIAPN 397 + ++ IA + Sbjct: 232 QKTLVEQAGSTLFPIAKD 249 >gi|253701795|ref|YP_003022984.1| von Willebrand factor A [Geobacter sp. M21] gi|251776645|gb|ACT19226.1| von Willebrand factor type A [Geobacter sp. M21] Length = 368 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 19/156 (12%), Positives = 53/156 (33%), Gaps = 13/156 (8%) Query: 1 MTAIIISVCFLF--ITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKD 58 + + + F A+D ++ ++ +A+DAA ++ D ++ Sbjct: 13 LLVVAAVMFFGIFLAALAVDAGRAYGVKAKLHAAVDAASYEAAKALAHGEDEDD---MEE 69 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + S + + + + + +G ++ +++ K+ A T Sbjct: 70 KASEAARDYFRANFPAAYFGAQCSG-----PELELSERKSGKKMRALTVSATA---TLPN 121 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 G++ +L +S + + L + VL Sbjct: 122 VFAGILGWDSIDLPAQSRAVRTDADVVLVLESSDVL 157 >gi|239995770|ref|ZP_04716294.1| von Willebrand factor, type A [Alteromonas macleodii ATCC 27126] Length = 358 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 64/187 (34%), Gaps = 38/187 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS----RLNKLNPYEN 274 N + + + + V PL+ + + V + + L E Sbjct: 113 NRLTMTKSVVYDFIQRRVGDRIGLILFADTAYVQAPLTYDRDTVSTLLSEAVIGLV-GEQ 171 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T A+ A + +ES+ +I +TDG+N+ + + Q E Sbjct: 172 TAIGDAIGLAVKRFDEREESN----------NVLILLTDGQNTAGN-----ITPEQAKEL 216 Query: 335 MRNAGMKIYSVAVSAP-----------------PEGQDLLRKCT-DSSGQFFAVNDSREL 376 + G+K+Y++ V A + +L + GQ+F +++EL Sbjct: 217 AISKGVKVYTIGVGADKMLIQSFFGSRQINPSQELDEGMLTNIATSTGGQYFRARNAQEL 276 Query: 377 LESFDKI 383 + ++ Sbjct: 277 QAIYQQL 283 >gi|254229828|ref|ZP_04923234.1| von Willebrand factor, type A [Vibrio sp. Ex25] gi|262395606|ref|YP_003287459.1| protein BatA [Vibrio sp. Ex25] gi|151937664|gb|EDN56516.1| von Willebrand factor, type A [Vibrio sp. Ex25] gi|262339200|gb|ACY52994.1| protein BatA [Vibrio sp. Ex25] Length = 356 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 60/150 (40%), Gaps = 15/150 (10%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + E+ ++ + ++T+ A+ A + +S Sbjct: 159 GDAAFVQTPFTVDQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSDKSRGALEQDQ 218 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------EGQD 355 +K I +TDG ++G + + + + + G++++ +A+ P + Sbjct: 219 NREKVAIVLTDGNDTG-----SFVEPIDAAKVAKAKGVRVHVIAMGDPETIGETALDMDT 273 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + R +S G+ F + EL ++D+I Sbjct: 274 IHRIAKESGGEAFEALNRDELSAAYDEIGK 303 >gi|320102588|ref|YP_004178179.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] gi|319749870|gb|ADV61630.1| Heat shock protein 70 [Isosphaera pallida ATCC 43644] Length = 688 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 53/383 (13%), Positives = 116/383 (30%), Gaps = 44/383 (11%) Query: 11 LFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPT-TKKDQTSTIFKKQI- 68 + + AI + + A+ LSG I + T + + Sbjct: 338 VALGAAIQA--ALETGTHAEDAMPRFTLSGARVIRDVMSHSLGTVAVSADGTRYVNDVVV 395 Query: 69 --KKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPS 126 + + + + + + + + + Sbjct: 396 PRNQPIPAANTRSYLHATHEGRNTRLEVYLTQGESERPLDCQILGKYVFNGIQPTQAEVM 455 Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 ++S G+++ + + + + EDL + P Sbjct: 456 VDVSISYDENGMVQVEARQRDCDTPLAMTIEPVPEDLSWLDRPPIDVTER----HQVEPL 511 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 + + S P +D E+A + ++ + R+G I+Y+ Sbjct: 512 AILLLIDVSSSMAGPP-------LDEAREAARSFLDQCDFT-------TTRVGLISYSDQ 557 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 +V T L++N+ +V++ L +L TN A+ R+L K Sbjct: 558 VVLQ--TDLTDNVRKVEAGLARLEADGTTNLAGALELGRRKLATVPTG---------HVK 606 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 +++ +TDG + N L + + +G++I VA+ Q L + + Sbjct: 607 YLVVLTDGYPD------DPDNALLEAAHAKGSGIEI--VAIGTGEADQAYLDRIASTQAG 658 Query: 367 FFAVNDSRELLESFDKITDKIQE 389 EL+ +F I I E Sbjct: 659 SIFAR-KGELVRAFGHIARVIAE 680 >gi|304382530|ref|ZP_07365025.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] gi|304336361|gb|EFM02602.1| aerotolerance protein BatA [Prevotella marshii DSM 16973] Length = 332 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 49/161 (30%), Gaps = 44/161 (27%) Query: 254 PLSNNLNEVKSRLNK----LNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P++ + + + L L+ T + +A L S Sbjct: 143 PMTTDHQSLLNLLQNVRTDLSARGLIEDGTAVGMGLANAVSRL----------KDSKAKS 192 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--------------- 350 K VI +TDG N+ L+ + ++ G+++Y++ V Sbjct: 193 KVVILLTDGSNNRG-----DLSPMTSANIAKSLGIRVYTIGVGTNKVAPYPMPVAGGIQY 247 Query: 351 -----PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 L + G F+ ++ EL + + I Sbjct: 248 VNIPVEIDTKTLSGIASVTHGNFYRATNNNELKQIYKDIDK 288 >gi|147776143|emb|CAN69721.1| hypothetical protein VITISV_014218 [Vitis vinifera] Length = 686 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 56/192 (29%), Gaps = 24/192 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ +L + L+ ++ + + + + + + N +N Sbjct: 218 SKLSLLKRAVCFLIQNLGPSDRLSIVSFSSTARRIFPLRRMSD------NGREAAGLAIN 271 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L TN + R L E + + D N +++ + N Sbjct: 272 SLXSSGGTNIVEGLKKGVRVLEERSEQNPVASIILLSDGKDTYNCDNVNRRQTSHCASSN 331 Query: 328 TLQ-----------ICEYMRNAG-------MKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 Q IC R +G + +++ + + + +S G F Sbjct: 332 PRQVLEYLNLLPASICPRNRESGDEGRQAIIPVHTFGFGSDHDSTAMHAISDESGGTFSF 391 Query: 370 VNDSRELLESFD 381 + + ++F Sbjct: 392 IESVAXVQDAFA 403 >gi|147834997|emb|CAN61381.1| hypothetical protein VITISV_037547 [Vitis vinifera] Length = 1324 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 57/185 (30%), Gaps = 24/185 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ +L + G ++ S+ + + + + + + +N Sbjct: 292 TKLALLKRAMGFVIQSLGPCDRLSVISFSSTARRLFPLRRMTDT------GRQQALQAVN 345 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-----ASAY 322 L TN + + + + +I ++DG+++ A+ Sbjct: 346 SLISNGGTNIAEGLRKGAKVMLD--------RKWKNPVSSIILLSDGQDTYTVCSPGGAH 397 Query: 323 QNTLNTLQICEYMRNAG-----MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 T +L + + G + +++ + + S G F + + Sbjct: 398 SRTDYSLLLPFSIHRNGGTGFQIPVHAFGFGTDHDATSMHAISETSGGTFSFIEAEGVIQ 457 Query: 378 ESFDK 382 ++F + Sbjct: 458 DAFAQ 462 >gi|163849338|ref|YP_001637382.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222527332|ref|YP_002571803.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163670627|gb|ABY36993.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222451211|gb|ACM55477.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 418 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 19/182 (10%), Positives = 54/182 (29%), Gaps = 13/182 (7%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + V I + + +N L+ E+K + Sbjct: 49 VIDRSSSMRGERLQQVKQAAMQILDLLGDHESFALVTFNDRAEVVVSAQLARARAEIKRQ 108 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++ + T + +EL ++ +TDG Y + Sbjct: 109 ISAIEAAGGTEMATGLALGVQELQRAMMPRAVHR--------LLLLTDG-----RTYGDE 155 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 ++I ++ G+ I ++ + + L + + + + E+ + F + Sbjct: 156 GRCVEIARRAQSRGIGITALGIGSEWNEDLLETIAARENSRTHYITSAAEITKIFTAEVE 215 Query: 386 KI 387 ++ Sbjct: 216 RM 217 >gi|226504618|ref|NP_001148048.1| protein binding protein [Zea mays] gi|195615516|gb|ACG29588.1| protein binding protein [Zea mays] Length = 696 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 19/135 (14%), Positives = 43/135 (31%), Gaps = 19/135 (14%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + + +N L TN A+ A + + + +I ++DG++ Sbjct: 315 SGRQQALQTVNSLVASGGTNIADALKKAAKVIED--------RSHQNPVCSIILLSDGQD 366 Query: 317 SGASAY--------QNTLNTLQICEYMRNAG-MKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 + ++L I R G + ++ + L S G F Sbjct: 367 TYNIPSNIRGARPEYSSLVPSSILN--RTFGLVPVHGFGFGVDHDSDALHSIAEASGGTF 424 Query: 368 FAVNDSRELLESFDK 382 + D + ++F + Sbjct: 425 SFIEDEGVIQDAFAQ 439 >gi|75907530|ref|YP_321826.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701255|gb|ABA20931.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 418 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 73/196 (37%), Gaps = 30/196 (15%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ++++ ++A LV+ ++ R+ +A++ + +N ++K ++N+L Sbjct: 59 LEIVKQAAIRLVDRLKTGD--------RLSVVAFDHRAKVLVPNQVIDNPEQIKKQINRL 110 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T + EL G +TDGE + + + L Sbjct: 111 AADGGTAIDEGLRLGIEEL---------AKGKKETISQAFLLTDGE----NEHGDNNRCL 157 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQ 388 + + + + ++ QD+L K D+ G + + + ++ F ++ +IQ Sbjct: 158 KFAQLAAGYNLTLNTLGFGDNWN-QDVLEKIADAGLGSLSYIQKAEQAVDEFGRLFSRIQ 216 Query: 389 E-------QSVRIAPN 397 + +APN Sbjct: 217 TVGLTNAYLLLSLAPN 232 >gi|84623314|ref|YP_450686.1| hypothetical protein XOO_1657 [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|122879130|ref|YP_200396.6| hypothetical protein XOO1757 [Xanthomonas oryzae pv. oryzae KACC10331] gi|188577378|ref|YP_001914307.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] gi|84367254|dbj|BAE68412.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF 311018] gi|188521830|gb|ACD59775.1| von Willebrand factor type A domain protein [Xanthomonas oryzae pv. oryzae PXO99A] Length = 350 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 39/327 (11%), Positives = 85/327 (25%), Gaps = 29/327 (8%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 + + RE I A + + + + Y L L P + Sbjct: 36 IPAPAEDREGYQHIDDNAIVQAAQQPISTFSIDVD-TGSYSNVRRFLSAGSLPPVDAVRV 94 Query: 132 SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS 191 V +L N ++ + Sbjct: 95 EELINYFRYDHPAPTNDKPFAV------RTELATTPWNKDSLLLRVGIAGRDIATADLPP 148 Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 N A K+ +L S LV ++K RI + Y Sbjct: 149 ANLVFQVDVSGSMDAPDKLPLLRSSLKLLVRQLRKQD--------RITLVTYAGNTAVVL 200 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + + ++ L +T + AY+ ++ Sbjct: 201 PPTPGDQQGRIVEAIDSLQSGGSTAGASGIELAYKAAQQGYLRGGINR--------ILLA 252 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAV 370 TDG+ + + L + R +G+ + ++ +L+ + D+ G + + Sbjct: 253 TDGDFNVGVTDFDAL--KGMVSEKRRSGVALSTLGFGTGNYNDNLMEQSADAGDGAYAYI 310 Query: 371 NDSRELLESFDKITDKIQEQSVRIAPN 397 + E + +T ++ IA + Sbjct: 311 DTPLEARKV---LTHELGATLATIARD 334 >gi|87121300|ref|ZP_01077190.1| batB protein, putative [Marinomonas sp. MED121] gi|86163457|gb|EAQ64732.1| batB protein, putative [Marinomonas sp. MED121] Length = 333 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 68/198 (34%), Gaps = 47/198 (23%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 ++D + + RIG I + PLS +L+ + + Sbjct: 115 ANRLDAAKQVLNRFITE---------RQGDRIGIIVFGSKAYLQA--PLSYDLDTIAQLV 163 Query: 267 NKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 N+ ENT A+ + L K+ +I +TDG N+ Sbjct: 164 NETQIGFAGENTAIGDAIGLGIKRL----------ANIDADKRVMILMTDGANTAGRVKP 213 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD-SSG 365 + Q ++ G+KI+++ + A ++LL+K D + G Sbjct: 214 D-----QAAQFAAKQGVKIHTIGIGAEQMVSQGFFGPRVINPSTDLDEELLQKVADLTQG 268 Query: 366 QFFAVNDSRELLESFDKI 383 Q+F ++EL + + Sbjct: 269 QYFRAKSTQELASIYATL 286 >gi|256005895|ref|ZP_05430841.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|255990131|gb|EEU00267.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] Length = 524 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 63/199 (31%), Gaps = 22/199 (11%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 K+ +L + LV+ + + R+ + Y + N Sbjct: 177 VSGSMDEPNKLPLLKSAFKLLVDELDEDD--------RVSIVVYAGAAGLVLDSTPGNEK 228 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +++ L L +T + AY S N VI TDG+ + Sbjct: 229 DKILDALMNLEAGGSTAGAEGIKLAYDVAKKNFIKSGNNR--------VILATDGDFNVG 280 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLE 378 + +++ E R+ G+ + + + D G + +++ + E Sbjct: 281 --ISSEAELVRLIEKKRDEGIFLTVLGFGTGNYKDSKMESLADKGNGNYAYIDN---IAE 335 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + +++ +A + Sbjct: 336 ARKVLVNEMGATLNTVAKD 354 >gi|148685682|gb|EDL17629.1| mCG133494, isoform CRA_a [Mus musculus] gi|148685684|gb|EDL17631.1| mCG133494, isoform CRA_a [Mus musculus] Length = 828 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 54/159 (33%), Gaps = 22/159 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 + + + +PLS + + +L T T A+ H EL+ + Sbjct: 193 QFSDYFRVHFTFNNFISTSSPLS-----LLGSVRQLR--GYTYTASAIKHVITELFTTQS 245 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---AP 350 + K +I ITDG G + +++ + + Y++ V Sbjct: 246 GAR-----QDATKVLIVITDGRKQGDNLSYDSVIPMAEAASIIR-----YAIGVGKAFYN 295 Query: 351 PEGQDLLRKCTDSSGQ--FFAVNDSRELLESFDKITDKI 387 + L+ F+V + L + +++ +KI Sbjct: 296 EHSKQELKAIASMPSHEYVFSVENFDALKDIENQLKEKI 334 >gi|126334034|ref|XP_001370526.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 1247 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 31/182 (17%), Positives = 56/182 (30%), Gaps = 23/182 (12%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGT-IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 ++N ++ + I + S N + + + L Sbjct: 223 KSFVRAVINQFKETNTLFSLMQYSNRLKIHFTFADFQR-----STNWGNLVNPI--LQLR 275 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T T A+ EL+ + + K +I ITDGE YQ+ L + Sbjct: 276 GLTYTATAIRKVVTELFQSRNGAR-----KNATKILIVITDGE-----KYQDKLQYKDVI 325 Query: 333 EYMRNAGMKIYSVAVSAP---PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 AG+ Y++ V Q+ L F V++ L + + +KI Sbjct: 326 PEAEQAGIIRYAIGVGDAFEYASAQEELNIIASQPAKEHVFQVDNFSALKTIQEDLQEKI 385 Query: 388 QE 389 Sbjct: 386 FS 387 >gi|10946646|ref|NP_067309.1| integrin alpha-X precursor [Mus musculus] gi|48428495|sp|Q9QXH4|ITAX_MOUSE RecName: Full=Integrin alpha-X; AltName: Full=CD11 antigen-like family member C; AltName: Full=Leukocyte adhesion glycoprotein p150,95 alpha chain; AltName: Full=Leukocyte adhesion receptor p150,95; AltName: CD_antigen=CD11c; Flags: Precursor gi|6684131|gb|AAF23492.1|AF211864_1 leukocyte adhesion glycoprotein p150,95 alpha integrin subunit [Mus musculus] gi|189442099|gb|AAI67225.1| Integrin alpha X [synthetic construct] Length = 1169 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 54/159 (33%), Gaps = 22/159 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 + + + +PLS + + +L T T A+ H EL+ + Sbjct: 193 QFSDYFRVHFTFNNFISTSSPLS-----LLGSVRQLR--GYTYTASAIKHVITELFTTQS 245 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---AP 350 + K +I ITDG G + +++ + + Y++ V Sbjct: 246 GAR-----QDATKVLIVITDGRKQGDNLSYDSVIPMAEAASIIR-----YAIGVGKAFYN 295 Query: 351 PEGQDLLRKCTDSSGQ--FFAVNDSRELLESFDKITDKI 387 + L+ F+V + L + +++ +KI Sbjct: 296 EHSKQELKAIASMPSHEYVFSVENFDALKDIENQLKEKI 334 >gi|74215118|dbj|BAE41792.1| unnamed protein product [Mus musculus] Length = 1169 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 54/159 (33%), Gaps = 22/159 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 + + + +PLS + + +L T T A+ H EL+ + Sbjct: 193 QFSDYFRVHFTFNNFISTSSPLS-----LLGSVRQLR--GYTYTASAIKHVITELFTTQS 245 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---AP 350 + K +I ITDG G + +++ + + Y++ V Sbjct: 246 GAR-----QDATKVLIVITDGRKQGDNLSYDSVIPMAEAASIIR-----YAIGVGKAFYN 295 Query: 351 PEGQDLLRKCTDSSGQ--FFAVNDSRELLESFDKITDKI 387 + L+ F+V + L + +++ +KI Sbjct: 296 EHSKQELKAIASMPSHEYVFSVENFDALKDIENQLKEKI 334 >gi|255011031|ref|ZP_05283157.1| putative outer membrane protein [Bacteroides fragilis 3_1_12] gi|313148836|ref|ZP_07811029.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313137603|gb|EFR54963.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 608 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 32/284 (11%), Positives = 83/284 (29%), Gaps = 26/284 (9%) Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + + A Y L L P+ ++ + Sbjct: 159 PLSTFSIDVDV-ASYSNMRRFLNKGELPPADAIRTEELINYFSYNYAQPTGNDPVRIT-- 215 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 S + ++H K + S S P ++D++ S Sbjct: 216 SEIGTCPWNEQHRLVRIGLKAKEIPTENLPASNLIFLIDVSGSMYGP----ERLDLVKSS 271 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 LVN+++ ++ + Y+ + ++ +++ +++L +T Sbjct: 272 LKLLVNNLRDKD--------KVAIVVYSGAAGEKLASTPGSDKQKIREAIDELEAGGSTA 323 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + AY+ + N +I TDG+ + + L ++ E R Sbjct: 324 GGEGIKLAYKIARKNFITGGNNR--------IILCTDGDFNMGVSSDQEL--KKLIEQKR 373 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLES 379 +G+ + + ++ + G +++ +E Sbjct: 374 KSGVFLTVLGYGMGNYKDSKMQTLAEKGNGNHAYIDNLQEANRV 417 >gi|171913221|ref|ZP_02928691.1| hypothetical protein VspiD_18615 [Verrucomicrobium spinosum DSM 4136] Length = 868 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 52/381 (13%), Positives = 115/381 (30%), Gaps = 39/381 (10%) Query: 22 IMYIRNQMQSALDAAVLS----------------GCASIVSDRTIKDPTTKKDQTSTIFK 65 + ++ +++S +D A L S V++ ++ P T+ Sbjct: 216 ALQLKVEVESTMDGAGLLKLFENGVEVERRKVKVVSGSTVTETFVRHPDTRNIYKYRAVL 275 Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIP 125 + + D+ + ++ + N QY+ ++ A+ I E + Sbjct: 276 EGFAGDAIPANNEALTLVDVRGRLRLLYVEGDMNEGQYLVQAMAKEGIELELRAPNSIPN 335 Query: 126 SA-LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 + + + + ++ + D + ++ N+ Y P Sbjct: 336 TPQELSGFDGVILSDVPAHQVGETAMVAIRDYVDKLGGGFIMLG-GPNSFGVGGYYRTPI 394 Query: 185 PKKSFWSKNTTKS--KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 + K + A A + + + S A E + IG A Sbjct: 395 EEVLPVRLKAPDEEEKQSSALALVIDRSGSMSGEKLEMAKSAAIATAEVLTRNDSIGVYA 454 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 ++ + + V ++ L TN +PA A L K Sbjct: 455 FDSEAHVVVPMTRLTSSSAVAGQIAGLTSGGGTNLHPAFTEARNALQRTKAK-------- 506 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 K +I +TDG+ + + R G+ I +VA+ LL+ Sbjct: 507 --IKHMIILTDGQ-------TSGQGYEALASQCRAEGVTISTVAIGDGAH-VGLLQAIAS 556 Query: 363 -SSGQFFAVNDSRELLESFDK 382 G+ + D+ ++ F + Sbjct: 557 LGGGKSYTTLDAANIVRIFTQ 577 >gi|332559488|ref|ZP_08413810.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides WS8N] gi|332277200|gb|EGJ22515.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides WS8N] Length = 651 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 46/367 (12%), Positives = 98/367 (26%), Gaps = 41/367 (11%) Query: 32 ALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQI 91 A D A+ K + + + Sbjct: 154 AADEAMPMAVPPAPDFALSKQAAEAPARALPQGDSEAFANAPDNPLRVTAE--------- 204 Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 D + + A Y I +L L P + + + Sbjct: 205 ----DPVSTFSIDVD-TASYAILRSSLRAGQLPPREAVRIEEMINYFPYDYPAPESGTPP 259 Query: 152 MVLDVSRSMEDLYLQKH-NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 +S + + + PP + +T+ S PA K+ Sbjct: 260 FRPSLSVTRTPWNPETRLVHVALQGRMPAIEDRPPLNLVFLIDTSGSMQDPA------KL 313 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 +L +S G ++ ++ ++ + Y +N + + S L++L+ Sbjct: 314 PLLKQSFGLMLGRLRPED--------QVAIVTYAGSAGEVLAPTAANQRSTILSALDRLD 365 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 +T + AYR + T V+ TDG+ + + L L Sbjct: 366 AGGSTAGEEGLALAYRTASEMAGAGEVTR--------VVLATDGDFNLGISDPEELARLV 417 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 E R+ G+ + + ++ + L E+ + D++ Sbjct: 418 AHE--RDTGVYLSVLGFGRGNLDDATMQALAQNGNG--QAAYIDSLNEAQKVLVDQLSGA 473 Query: 391 SVRIAPN 397 IA + Sbjct: 474 LFPIADD 480 >gi|242247116|ref|NP_081039.2| collagen alpha-4(VI) chain precursor [Mus musculus] gi|189082905|sp|A2AX52|CO6A4_MOUSE RecName: Full=Collagen alpha-4(VI) chain; Flags: Precursor Length = 2309 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 54/158 (34%), Gaps = 21/158 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 VR G + Y+ I+ ++ + + ++ + T T A+ N Sbjct: 884 DRVRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGTTTGKALSKMVPVFQNTA 943 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + +++I ITDG+++ + + +R+ G+ IY++ V Sbjct: 944 R--------IDVARYLIVITDGQSTDP--------VAEAAQGLRDIGVNIYAIGV--RDA 985 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L + + F + + L ++ I Sbjct: 986 NTTELEEIASK--KMFFIYEFDSLKSIHQEVIRDICSS 1021 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 61/184 (33%), Gaps = 17/184 (9%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-E 273 S N + + ++Q ++R+G Y+ + + +V + L Sbjct: 49 HSVRNFLYILANSLQ-VGRDNIRVGLAQYSDTPTSEFLLSVYHRKGDVLKHIRGLQFKPG 107 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 A+ L + + S + + + ++ G + + E Sbjct: 108 GNRMGQALQFI---LEHHFREGAGSRASQGVPQVAVVVSSGLT--------EDHIREPAE 156 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQS 391 +R AG+ +Y++ V Q LR+ + S F V + L K+ ++ Sbjct: 157 ALRRAGILVYAIGV--KDASQAELREISSSPKDNFTFFVPNFPGLPGLAQKLRPELCSTL 214 Query: 392 VRIA 395 + A Sbjct: 215 GKAA 218 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 43/119 (36%), Gaps = 16/119 (13%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + T+T A++ + + + +++I ITDG + A Sbjct: 1104 SDGTHTGKALNFTLPFFDSSRGGRPR------VHQYLIVITDGVSQDNVAPP-------- 1149 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + +R+ + I+++ V + L + T+ + F + L +I ++ Sbjct: 1150 AKALRDRNIIIFAIGVG--NVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEVCSS 1206 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 37/302 (12%), Positives = 88/302 (29%), Gaps = 30/302 (9%) Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + + E ++ GL L ++ + + Sbjct: 539 VVITSGKSEDEVGEVAQILRKRGVDIVSVGLQDFDRAELEGIGPVVLVSDLQGEDRIRQL 598 Query: 153 VLDVSRSMEDLYLQKHNDNN-NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 +LDV+ ++ + + + L P P + ++ A P +++ Sbjct: 599 MLDVNMFIQGSPKPPRVMTDVAKDAVEECLVPVPADLVFL----VEDFSSARQPNFQRV- 653 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN- 270 L ++ + R+ + Y+ + + +V L++L Sbjct: 654 -----VHFLTTTVHSLNIHP--DTTRVSLVFYSEKPRLEFSLDMYQSAAQVLRHLDRLTF 706 Query: 271 --PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 A+ +E++ ++ S G ++ +I + N Sbjct: 707 RARRGRAKAGAALDFLRKEVFLPEKGSRPHRGVQQIAVVII-----------ESPSLDNV 755 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDK 386 Y+R AG+ IY+ E +D L K + +L +K+ K Sbjct: 756 STPASYLRRAGVTIYAAGTQPASESKD-LEKIVTYPPWKHAIRLESFLQLSVVGNKLKKK 814 Query: 387 IQ 388 + Sbjct: 815 LC 816 >gi|123718338|emb|CAJ77152.1| collagen type VI alpha 4 [Mus musculus] Length = 1451 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 54/158 (34%), Gaps = 21/158 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 VR G + Y+ I+ ++ + + ++ + T T A+ N Sbjct: 26 DRVRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQVGGGTTTGKALSKMVPVFQNTA 85 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + +++I ITDG+++ + + +R+ G+ IY++ V Sbjct: 86 R--------IDVARYLIVITDGQSTDP--------VAEAAQGLRDIGVNIYAIGV--RDA 127 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L + + F + + L ++ I Sbjct: 128 NTTELEEIASK--KMFFIYEFDSLKSIHQEVIRDICSS 163 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 43/119 (36%), Gaps = 16/119 (13%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + T+T A++ + + + +++I ITDG + A Sbjct: 246 SDGTHTGKALNFTLPFFDSSRGGRPR------VHQYLIVITDGVSQDNVAPP-------- 291 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + +R+ + I+++ V + L + T+ + F + L +I ++ Sbjct: 292 AKALRDRNIIIFAIGVG--NVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEVCSS 348 >gi|327290735|ref|XP_003230077.1| PREDICTED: collagen alpha-1(XIV) chain-like, partial [Anolis carolinensis] Length = 562 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 55/168 (32%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 A + RIG Y+ + + V + L NT T A+ Sbjct: 80 AAFNVGSEKTRIGLAQYSGDPRIEWHLNTYSTKDAVLDAVRNLPYKGGNTLTGLALTFIL 139 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + K I ITDG++ + +++AG++++++ Sbjct: 140 ENNFKSEAGAR-----PGVPKIGILITDGKSQDDVIPP--------AKNLKDAGIELFAI 186 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V D + + +T + + Sbjct: 187 GV--KNADETELKEIASEPDNTHVYNVADFSFMNSIVEGLTKTVCSRV 232 >gi|322419943|ref|YP_004199166.1| hypothetical protein GM18_2437 [Geobacter sp. M18] gi|320126330|gb|ADW13890.1| Protein of unknown function DUF2134, membrane [Geobacter sp. M18] Length = 351 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 55/188 (29%), Gaps = 12/188 (6%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDP----TTKKD 58 A+++ V F++ A+D+ ++ + Q+Q+A DA L+G A + T + + Sbjct: 21 ALMLVVLVAFVSLAVDMGYMFVAKGQLQNAADAGALAGVAKLSDTVTARQSAKLFAERNK 80 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT--- 115 K + + I D + ++ E T Sbjct: 81 AAGESVKVALNETNSADGDIVVGYWDKVTRTMSATVPTGKVANAVKVVARRTTETGTGIS 140 Query: 116 -----ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 +LF ++ + + + + + +Y + Sbjct: 141 AENKQVDLFFGQVLNWGQMSAKAEAIACRPAKPSAPIVLCQDLCSSTTFPFKVYFNQTIA 200 Query: 171 NNNMTSNK 178 + + Sbjct: 201 TDPSGALN 208 >gi|211546|gb|AAA48695.1| cartilage matrix protein [Gallus gallus] Length = 416 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 64/183 (34%), Gaps = 23/183 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V S++ ++G + Y+ + N ++K+ + K Sbjct: 210 NFELVKKFINQIVESLE-----VSEKQAQVGLVQYSSSVRQEFPLGQFKNKKDIKAAVKK 264 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + T T A+ + ++ + + K I TDG + Sbjct: 265 MAYMEKGTMTGQALKYLVDSSFSIANGAR-----PGVPKVGIVFTDGRSQDYITD----- 314 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 + ++ G ++++V V +D LR+ +F D R + K+ Sbjct: 315 ---AAKKAKDLGFRMFAVGVG--NAVEDELREIASEPVAEHYFYTADFRTISNIGKKLQM 369 Query: 386 KIQ 388 KI Sbjct: 370 KIC 372 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 43/142 (30%), Gaps = 15/142 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNT 298 I Y + E+ + ++ P T T A+ A +++ E + Sbjct: 3 VINYASAVKNEFSLKTHQTKAELLQAVQRIEPLSTGTMTGLAIQFAISRAFSDTEGARLR 62 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 + K I +TDG + R AG++I+++ V LR Sbjct: 63 SPNIN--KVAIVVTDGRPQDG--------VQDVSARARQAGIEIFAIGVGR--VDMHTLR 110 Query: 359 KCTDSS--GQFFAVNDSRELLE 378 + V + + Sbjct: 111 QIASEPLDDHVDYVESYSVIEK 132 >gi|115555|sp|P05099|MATN1_CHICK RecName: Full=Cartilage matrix protein; AltName: Full=Matrilin-1; Flags: Precursor gi|833607|emb|CAA30915.1| cartilage matrix protein [Gallus gallus] Length = 493 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 64/183 (34%), Gaps = 23/183 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V S++ ++G + Y+ + N ++K+ + K Sbjct: 287 NFELVKKFINQIVESLE-----VSEKQAQVGLVQYSSSVRQEFPLGQFKNKKDIKAAVKK 341 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + T T A+ + ++ + + K I TDG + Sbjct: 342 MAYMEKGTMTGQALKYLVDSSFSIANGAR-----PGVPKVGIVFTDGRSQDYITD----- 391 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 + ++ G ++++V V +D LR+ +F D R + K+ Sbjct: 392 ---AAKKAKDLGFRMFAVGVG--NAVEDELREIASEPVAEHYFYTADFRTISNIGKKLQM 446 Query: 386 KIQ 388 KI Sbjct: 447 KIC 449 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 25/152 (16%), Positives = 48/152 (31%), Gaps = 15/152 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + S R+G I Y + E+ + ++ P T T A+ A Sbjct: 70 DVGPNSTRVGVINYASAVKNEFSLKTHQTKAELLQAVQRIEPLSTGTMTGLAIQFAISRA 129 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 +++ E + + K I +TDG + R AG++I+++ V Sbjct: 130 FSDTEGARLRSPNIN--KVAIVVTDGRPQDG--------VQDVSARARQAGIEIFAIGVG 179 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 LR+ V + + Sbjct: 180 R--VDMHTLRQIASEPLDDHVDYVESYSVIEK 209 >gi|324112804|gb|EGC06780.1| von Willebrand protein type A [Escherichia fergusonii B253] Length = 530 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 33/302 (10%), Positives = 85/302 (28%), Gaps = 42/302 (13%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI--------SICMVLDVSRS 159 Y L L + + + + I + ++ + Sbjct: 82 TGSYANVRRFLKTGSLPGADVVRVEELVNYFPLTEATKKNIPGCKGCEENSPFSINYELT 141 Query: 160 MEDLY---LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 D + LPP ++ ++ ++ Sbjct: 142 PAPWNEKHTLLRLDIAANDIARSKLPPANLVFLID--------TSGSMNSDERLPLIKSL 193 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 LVN ++ RI + Y + + N + + + L TN Sbjct: 194 LKLLVNELRDQD--------RISIVTYAGSARLLLSSTSGSEKNTILNAIANLQAGGGTN 245 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + AY + ++ TDG+ + + + + + R Sbjct: 246 GGAGVAMAYEQAQAGYIKGGVNR--------ILLATDGDFNIG---DDPSSVEDLVKKQR 294 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 +G+ + ++ V + ++ K D+ G + L E+ +++++ + V +A Sbjct: 295 ESGITLSTLGVGDNNYNEAMMVKIADTGNGNYSY---LDSLSEAQKVLSNEMNQTLVTVA 351 Query: 396 PN 397 + Sbjct: 352 KD 353 >gi|284052693|ref|ZP_06382903.1| von Willebrand factor type A domain-containing protein [Arthrospira platensis str. Paraca] gi|291571888|dbj|BAI94160.1| von Willebrand factor type A [Arthrospira platensis NIES-39] Length = 541 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 34/306 (11%), Positives = 83/306 (27%), Gaps = 29/306 (9%) Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + + + + A Y + + P + + Sbjct: 86 VAANPLSTFSIDVD-TASYSNVRRFINQRQRPPIDAVRIEELINYFSYDYPQPQGEEPFS 144 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 V S + + PP + + + S P ++ + Sbjct: 145 VTTEVSSAPWNPQHQLVHIGLQGKTLAIEELPPSNLVFLLDVSGSMNQP------NRLPL 198 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 L E LV+ + + + Y N ++ + ++ L Sbjct: 199 LKEGFKLLVDQLSEQDT--------VAIAVYAGAAGVVLPPTPGNEKQKIIAAIDGLQAQ 250 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 +T + AY + S VI TDG+ + + L +++ Sbjct: 251 GSTAGGEGIKLAYELAT--------RMLSEGKNNRVILATDGDFNVGVSSDAEL--VRLI 300 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQS 391 E R+ G+ + + + K ++ G + +++ E + ++ Sbjct: 301 ESYRDRGIYLTVLGFGMGNYKDSKMEKLSNHGNGNYAYIDNLMEAKKVMS---TELTGTL 357 Query: 392 VRIAPN 397 IA + Sbjct: 358 FTIAQD 363 >gi|261212659|ref|ZP_05926943.1| protein BatA [Vibrio sp. RC341] gi|260837724|gb|EEX64401.1| protein BatA [Vibrio sp. RC341] Length = 232 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 28/154 (18%), Positives = 53/154 (34%), Gaps = 36/154 (23%) Query: 253 TPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TPL+ + V +LN+ L T + A + + S ++ +I Sbjct: 57 TPLTLDRETVTQQLNQAVLKLIGTQTAIGEGIGLATKIFID----------SDAPQRVII 106 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PE 352 ++DG N+ L+ L+ + IY+V V A Sbjct: 107 LLSDGSNTAGV-----LDPLEAANIAKQYHSTIYTVGVGAGEMVVKDFLFSRKVNTAQDL 161 Query: 353 GQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + GQ+F + ++L +D I Sbjct: 162 DEKTLQTIASTTGGQYFRARNQQDLQNIYDTINQ 195 >gi|74205761|dbj|BAE23197.1| unnamed protein product [Mus musculus] Length = 751 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 56/183 (30%), Gaps = 23/183 (12%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENT 275 L +++ + ++ + + + + ++ NT Sbjct: 7 INFLYSTVGALD-KIGADGTQVAMVQFTDDPRTEFKLDSYKTKETLLDAIRHISYKGGNT 65 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T A+ H L+ + + K ++ ITDG + +I M Sbjct: 66 KTGKAIKHVRDTLFTSDSGTRR-----GIPKVIVVITDGRSQDDVN--------KISREM 112 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVR 393 + G I+++ V L + F V+D ++F KI D++ Sbjct: 113 QADGFNIFAIGV--ADADYSELVQIGSKPSSRHVFFVDDF----DAFKKIEDELITFVCE 166 Query: 394 IAP 396 A Sbjct: 167 TAS 169 >gi|323488845|ref|ZP_08094085.1| hypothetical protein GPDM_05856 [Planococcus donghaensis MPA1U2] gi|323397543|gb|EGA90349.1| hypothetical protein GPDM_05856 [Planococcus donghaensis MPA1U2] Length = 857 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 60/144 (41%), Gaps = 18/144 (12%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 IA++ T ++ + ++ + P T Y ++ AY EL + Sbjct: 445 VIAFDDQPWEILPTGKVDDPKKAADKILSITPGGGTEIYRSLEQAYTELED--------- 495 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 +K +I +TDG++S ++ + + E ++ + + +V++ ++LL + Sbjct: 496 -LELQRKHIILLTDGQSSTSN------DYDALIENGKDHNITLSTVSIGQDA-DRNLLEQ 547 Query: 360 CTDSS-GQFFAVNDSRELLESFDK 382 + G+F+ V D+ + + Sbjct: 548 LAGTGSGRFYDVTDATTIPAILSR 571 >gi|332827795|gb|EGK00530.1| hypothetical protein HMPREF9455_03173 [Dysgonomonas gadei ATCC BAA-286] Length = 603 Score = 70.7 bits (171), Expect = 4e-10, Method: Composition-based stats. Identities = 33/309 (10%), Positives = 82/309 (26%), Gaps = 28/309 (9%) Query: 73 KQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS 132 E+ + ++ + + + KA Y + + P+ + Sbjct: 130 PAYFDNTESYRAFTDNKFVKVSAEPLSTFSIDVD-KASYSNMRRFINKGEVPPADAIRVE 188 Query: 133 LRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN-NNMTSNKYLLPPPPKKSFWS 191 + + P F Sbjct: 189 ELVNYFKYDYPKPEGDDPVAFGIEVGKCPWDGKHRLVKIGLKAREIDTDNLPATNFVFLI 248 Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 + K+D++ S LVN+++ R+ + Y Sbjct: 249 D-------VSGSMDWDGKLDLVKSSMKLLVNNLRPID--------RVAIVVYAGAAGQVL 293 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + + +++ LN L +T + AY+ N +I Sbjct: 294 PSTPGSEKSKILESLNGLTAGGSTAGGEGIVLAYKIAKENLIEGGNNR--------IILC 345 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAV 370 TDG+ + + + L ++ E R +G+ + + D ++ + G + Sbjct: 346 TDGDFNVGVSSNDGL--EKLIENERKSGVFLSILGYGMGNYKDDKMQTLAQAGNGNHAYI 403 Query: 371 NDSRELLES 379 ++ +E + Sbjct: 404 DNMQEANKV 412 >gi|282900568|ref|ZP_06308510.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] gi|281194368|gb|EFA69323.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] Length = 426 Score = 70.3 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 53/183 (28%), Gaps = 25/183 (13%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYE 273 + L++ +Q RI +A+ + + +KS+L NKL Sbjct: 64 NAVEQLIDQLQSGD--------RIAIVAFAGSGEVIIPNQIIKDPKTIKSQLHNKLKAGG 115 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ-------NTL 326 T + EL GS +TDG + Sbjct: 116 GTIIGEGLSLGITELL---------KGSKGACSHAFLLTDGYGDNGFKIWRLQIGPNDNQ 166 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 L++ + + I S L + G + + +E F++I + Sbjct: 167 RCLELAQKAAKLNLTINSFGFGDEWNQDLLEKIADAGGGTLAYIETPQNAIEQFNRIFKR 226 Query: 387 IQE 389 IQ Sbjct: 227 IQS 229 >gi|169338033|ref|ZP_02621346.2| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] gi|169295279|gb|EDS77412.1| von Willebrand factor type A domain protein [Clostridium botulinum C str. Eklund] Length = 1242 Score = 70.3 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 37/256 (14%), Positives = 82/256 (32%), Gaps = 42/256 (16%) Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 N + N L K F+ T + Y + + S K + Sbjct: 129 IKNKSYLVNTAFLQGSRHKLFYITIGTTNYYIQGNKCYRQSSYNEKNRLQHAKESAIKFV 188 Query: 229 QEKKNL-SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAY 285 Q+ +N ++ IG ++++ + ++ LNEV+ +N L TN + A Sbjct: 189 QKFENDKNISIGLVSFDTTANSQKDI--TSKLNEVEDSINSLKVADNGATNIEAGLKSAQ 246 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGEN-------------------------SGAS 320 + L G+ K+VI ++DG + Sbjct: 247 QLL---------KKGNKDADKYVILMSDGFPTAFDYAGEKVEKNFNYHEIQDNTFINFGY 297 Query: 321 AYQNTLNTL---QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + ++ G+ + + S + L + G++ ++ L Sbjct: 298 YDYSGYAMKHSINQANSLKKDGINSFIIGFSEGANSEKLNNIAKAAGGEYEEAKNTDTLN 357 Query: 378 ESFDKITDKIQEQSVR 393 ++DK+ K++ ++ Sbjct: 358 GAYDKLETKVKAPLIK 373 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 30/252 (11%), Positives = 71/252 (28%), Gaps = 57/252 (22%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 +K ++D + + A + V+ + + + VR + A + N+ Sbjct: 707 YYVKDNKVYEFNEKDRSRLDSVKKVANDFVDKFKNDENTEIAI-VRYSSKANIVLDGSNK 765 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL---------------------YN 290 + +K R+N L TN + +Y L Y Sbjct: 766 IFLNGKDNEIIKKRINSLKADGGTNIGDGIRKSYSILDKCDKDSEKYMILMTDGVPTAYT 825 Query: 291 EKESSHNTIGSTRLKKFVI---FITDGENSGAS--------------------------- 320 ++ + + K + + +G + Sbjct: 826 CYANTIKASNNCKYSKDNLDFGYCPEGYIDCYNRKYYYSEVKGNFKLENNNRDEGYVIKF 885 Query: 321 AYQNTLNTLQICEYM----RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + N L+ + ++ + V + + L++ + Q+ D EL Sbjct: 886 GDEYDKNALEYAKQAMQKSKSKNINNVIVGF-SDGIDTEKLKEIAGDNAQYKEAKDLGEL 944 Query: 377 LESFDKITDKIQ 388 + +D+I I Sbjct: 945 SKQYDEIQKDIN 956 >gi|268611865|ref|ZP_06145592.1| von Willebrand factor type A [Ruminococcus flavefaciens FD-1] Length = 550 Score = 70.3 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 60/199 (30%), Gaps = 22/199 (11%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + + K+ ++ + L + K RI + Y +N Sbjct: 197 SSGSMNSYDKLPLVQSAFSMLAEQLDKND--------RISIVTYAGSSAVLLDGEKGSNT 248 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +E+ +L + +TN + AY N VI TDG+ + Sbjct: 249 DEILEQLYSITASGSTNGEGGIKTAYELAEEHFIKGGNNR--------VILATDGDLNVG 300 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLE 378 + ++ E R+ G+ + + + D+ G F ++ E Sbjct: 301 --ASSEEELTRLIETKRDNGIYLSVLGFGEGNYKDARMEALADNGNGNFSYIDSEDEAER 358 Query: 379 SFDKITDKIQEQSVRIAPN 397 + ++ IA + Sbjct: 359 V---LVQEMSGTLYTIAKD 374 >gi|51244490|ref|YP_064374.1| hypothetical protein DP0638 [Desulfotalea psychrophila LSv54] gi|50875527|emb|CAG35367.1| conserved hypothetical membrane protein (BatA) [Desulfotalea psychrophila LSv54] Length = 328 Score = 70.3 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 23/193 (11%), Positives = 56/193 (29%), Gaps = 40/193 (20%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---EN 274 N + ++ + + + P + + N + RL+ L+ + Sbjct: 111 TNRLEVVKDVMAKFISQRPNDSIGLVAFAGRPYVVCPPTLDHNWLTLRLHSLSIGMIEDG 170 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T A+ L + +I +TDG N+ L E Sbjct: 171 TAIGSAIGTGVNRL----------REKKSPSQIIILLTDGINNAGKVP-----PLIAAEA 215 Query: 335 MRNAGMKIYSVAVSA----------------------PPEGQDLLRKCTDSSGQFFAVND 372 ++ +K+Y++ + + L + + ++F D Sbjct: 216 AKSFKVKVYTIGAGTRGEAPIPITDAFGRRQLVRARVDIDDKTLSKVAQITGARYFRATD 275 Query: 373 SRELLESFDKITD 385 + L + + +I Sbjct: 276 TESLEKVYAEINS 288 >gi|73542573|ref|YP_297093.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72119986|gb|AAZ62249.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 340 Score = 70.3 bits (170), Expect = 4e-10, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 52/162 (32%), Gaps = 33/162 (20%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G P + + V++ + L +T A+ + S Sbjct: 147 GDAPYPLAPFTLDHALVQTMIRDLLPGMAGPSTALGDAVGLGIKMFDQ----------SP 196 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD------- 355 +K +I +TDG ++ + + + + + ++++ + P + Sbjct: 197 APEKVLIVLTDGNDTASKMP-----PERAADIAKQRHVTVHTIGIGDPSAEGEQRVDLGV 251 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L R + G++F D L + + RI P+ Sbjct: 252 LQRMAAQTGGRYFFGADQNSLESIYATLD--------RITPH 285 >gi|332307030|ref|YP_004434881.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] gi|332174359|gb|AEE23613.1| von Willebrand factor type A [Glaciecola agarilytica 4H-3-7+YE-5] Length = 338 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 33/170 (19%), Positives = 63/170 (37%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKE 293 R+G I + PL+ + V L + + T A+ A + ++KE Sbjct: 132 RLGLIFFADTAYLQA--PLTYDRETVSQLLGESLIGLVGEQTAIGDAIGLAIKRFQSKKE 189 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 S+ K +I +TDG+N+ + N E N G+ +Y++ V A Sbjct: 190 SN----------KVLILLTDGQNTAGNISPQQAN-----ELAINNGVTLYTIGVGADQMM 234 Query: 351 --------------PEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + +L + + G++F D+ L +DK+ + Sbjct: 235 VQSIFGSRQVNPSQELDESMLTQLAESTGGRYFRARDAESLKAIYDKLDE 284 >gi|120436991|ref|YP_862677.1| von Willebrand factor type A domain-containing protein [Gramella forsetii KT0803] gi|117579141|emb|CAL67610.1| secreted protein containing von Willebrand factor type A domain [Gramella forsetii KT0803] Length = 592 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 32/345 (9%), Positives = 86/345 (24%), Gaps = 34/345 (9%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ +S+ K + ++ + + + + + D Sbjct: 89 LADTSSLDEVIITGYSAEMKIRGTSNIQTSVTANESYNKREDNQFKLVKASPLSTFSIDV 148 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 + I IP + + + + Sbjct: 149 DKAGYSNIRRMINNGIQ---------IPKDAVKIEEMINYFNYDYKQPTGKHPFSI--QT 197 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + + K + S K+ +L + Sbjct: 198 EYAQTPWNSDTKLVKIGLQGKTIPLENVPASNLV----FLLDVSGSMGQQNKLPLLKSAF 253 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 L N++++ +I + Y + +++ L+KL+ +T Sbjct: 254 KLLTNNLREQD--------KISIVVYAGSSGVVLEPTSGDQKTKIEEALDKLSAGGSTAG 305 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 + AY+ + + N VI TDG+ + + + + R Sbjct: 306 GEGIELAYKIAKDNFIKNGNNR--------VILATDGDFNVGL--SSDKAMEDLIKEKRE 355 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFD 381 +G+ + ++ L + G ++ +E + Sbjct: 356 SGIFLTALGFGMGNYKDSKLESLAQTGNGNHAYIDSMQEAQRVLE 400 >gi|30687725|ref|NP_850306.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|330254526|gb|AEC09620.1| C3HC4-type RING finger-containing protein [Arabidopsis thaliana] Length = 692 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 19/183 (10%), Positives = 59/183 (32%), Gaps = 25/183 (13%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN--LNEVKSR 265 K+ +L + G ++ ++ R+ IA++ ++ Sbjct: 267 TKLALLKRAMGFVIQNLGSND--------RLSVIAFSSTARRLFPLTKMSDAGRQRALQA 318 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N + TN + + + + ++ + +I ++DG ++ + Sbjct: 319 VNSVVANGGTNIAEGLRKGVKVMEDRRDKNPVAS--------IILLSDGRDTYTMNQADP 370 Query: 326 LNTLQI------CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 L + CE + + ++S + + + S G F + + ++ Sbjct: 371 NYKLLLPLSMHGCES-KRFQIPVHSFGFGSDHDASLMHSVSETSGGTFSFIESESVIQDA 429 Query: 380 FDK 382 + Sbjct: 430 LAQ 432 >gi|3928084|gb|AAC79610.1| putative retroelement pol polyprotein [Arabidopsis thaliana] Length = 689 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 19/183 (10%), Positives = 59/183 (32%), Gaps = 25/183 (13%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN--LNEVKSR 265 K+ +L + G ++ ++ R+ IA++ ++ Sbjct: 264 TKLALLKRAMGFVIQNLGSND--------RLSVIAFSSTARRLFPLTKMSDAGRQRALQA 315 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N + TN + + + + ++ + +I ++DG ++ + Sbjct: 316 VNSVVANGGTNIAEGLRKGVKVMEDRRDKNPVAS--------IILLSDGRDTYTMNQADP 367 Query: 326 LNTLQI------CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 L + CE + + ++S + + + S G F + + ++ Sbjct: 368 NYKLLLPLSMHGCES-KRFQIPVHSFGFGSDHDASLMHSVSETSGGTFSFIESESVIQDA 426 Query: 380 FDK 382 + Sbjct: 427 LAQ 429 >gi|301613500|ref|XP_002936246.1| PREDICTED: collagen alpha-1(XII) chain-like [Xenopus (Silurana) tropicalis] Length = 3127 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 54/166 (32%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ E+ S + ++ NT T A+ Sbjct: 168 SAFDIGEDKTRVGVVQYSSDTRTEFNLNSYYKKEELVSAIKRIPYKGGNTMTGDALDFLI 227 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + I ITDG++ +R+ G++++S+ Sbjct: 228 KNSF-----VKTAGSRKGFPRIAIIITDGKSQD--------EVEIPARELRSLGVEVFSL 274 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 + L++ F V + ++E D+I ++ Sbjct: 275 GI--KAADAKELKQIASLPSLKHVFNVANFDSIVEVQDEIITQVCS 318 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 49/374 (13%), Positives = 113/374 (30%), Gaps = 42/374 (11%) Query: 32 ALDA------AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDI 85 A DA A L + + +D+ T +++ L + E Sbjct: 278 AADAKELKQIASLPSLKHVFNVANFDSIVEVQDEIITQVCSGVEEQLSELVSGEEAVEPP 337 Query: 86 AQKAQINITKDKNN-PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + I+ + Y I + S S + + S + Sbjct: 338 SDMTVTEISSKSMKITWRPSPSQITGYRIQLLPMLAGSKQHSLNLGPQTTSINVKDLSPD 397 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + E + + + + + S A Sbjct: 398 TEYQINLYAMKGLTASEPISTLEKTQAVKIKVECSGGVDIKADVVLLVDGSYSIGVANFA 457 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 ++VL++S + V+I + Y+ +++ ++ Sbjct: 458 KVRAFLEVLVKSF------------DISPSKVQISLVQYSRDPFTEFTLNRYDSIEDILK 505 Query: 265 RLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +N TNT AM + +++ E + + + + +I ITDG+ Sbjct: 506 AVNTFPYRGGSTNTGKAMTYVREKVFVEIKGAR-----PNVPRVMILITDGK-------- 552 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFD 381 ++ +RN+ ++I++V V + L S + V D ++F Sbjct: 553 SSDAFKDPAIKLRNSDVEIFAVGV--KDAVRSELEAIASPPSDTHVYTVEDF----DAFQ 606 Query: 382 KITDKIQEQS-VRI 394 +I+ ++ + +RI Sbjct: 607 RISFELTQSICLRI 620 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 55/165 (33%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + V+I Y+ + + + L NT T A++ + Sbjct: 1230 DIGSDRVQIALAQYSGDPRSEWQLNAHSTKKSLMDAVANLPYKGGNTLTGMALNFILQNN 1289 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + KK + ITDG++ + + +R+ G+ ++++ + Sbjct: 1290 FKAEAGMRAK-----SKKIGVLITDGKSQDDIVAPS--------KKLRDQGIDLFAIGI- 1335 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 ++ LR+ F V D L+ D +T + Sbjct: 1336 -KNADENELRQIGSDPDETYVFNVADFSLLVNIVDDLTTNLCNSV 1379 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 19/199 (9%), Positives = 63/199 (31%), Gaps = 22/199 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + + N + + +++ + ++ + + Sbjct: 2343 SSWSIGDDNYNKVRQFIFNTIGAFDVINPSG----IQVSFVQFSDDAQTEFKLNTYADKS 2398 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + L + NT T A+ + ++ + + + ++ +TDG + Sbjct: 2399 QALGALANIRYKGGNTKTGKALKFVHEKVMTVENG-----MRRSVPRVLVVVTDGRSQD- 2452 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELL 377 + + ++ +G ++ + V + LR F V+D Sbjct: 2453 -------DVKKSALDIQQSGFSVFVIGV--ADVDYNELRNIGSKPSERHVFIVDDFDAFE 2503 Query: 378 ESFDKITDKIQEQSVRIAP 396 + D + + + E + P Sbjct: 2504 KIEDNLINFVCETATSTCP 2522 >gi|125532271|gb|EAY78836.1| hypothetical protein OsI_33941 [Oryza sativa Indica Group] Length = 645 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 21/192 (10%), Positives = 66/192 (34%), Gaps = 31/192 (16%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG-IVGNQCTPLSN-NLNEVK 263 K+ +L ++ G +++++ A R+ I+++ G + + +++ K Sbjct: 187 VGNKLALLKQAMGFVIDNLGPAD--------RLCVISFSSGASRLMRLSRMTDAGKAHAK 238 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG----- 318 + L+ TN A+ A + L + + VI ++DG+++ Sbjct: 239 RAVGSLSARGGTNIGAALRKAAKVLDD--------RLYRNAVESVILLSDGQDTYTVPPR 290 Query: 319 ASAYQNTLNTLQICEYMRNAGM--------KIYSVAVSAPPEGQDLLRKCTDSSGQFFAV 370 ++ + + + +++ + + + G F + Sbjct: 291 GGYDRDANYDALVPPSLVRSDAGGGGGRAPPVHTFGFGKDHDAAAMHTIAEVTGGTFSFI 350 Query: 371 NDSRELLESFDK 382 + + + F + Sbjct: 351 ENEAAIQDGFAQ 362 >gi|293335787|ref|NP_001168683.1| hypothetical protein LOC100382472 [Zea mays] gi|223948855|gb|ACN28511.1| unknown [Zea mays] gi|223949305|gb|ACN28736.1| unknown [Zea mays] gi|223949981|gb|ACN29074.1| unknown [Zea mays] gi|223950189|gb|ACN29178.1| unknown [Zea mays] gi|224028553|gb|ACN33352.1| unknown [Zea mays] Length = 731 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 58/183 (31%), Gaps = 24/183 (13%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSR 265 K+ +L + G ++ ++ + R+ IA++ + + Sbjct: 302 TKLALLKRAMGFVIQNLGSSD--------RLSVIAFSSSARRLFPLRRMTESGRQQSLLA 353 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N L TN A+ + + + +I ++DG+++ + Sbjct: 354 VNSLTSNGGTNIAEALRKGSKVIEE--------RQAKNPVCSIILLSDGQDTYTVSPTAG 405 Query: 326 --LNTLQICEYM----RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + C + N + ++ A + L S G F + + ++ Sbjct: 406 VHKGAPEYCALLPSTNGNQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAIQDA 465 Query: 380 FDK 382 F + Sbjct: 466 FAQ 468 >gi|172037673|ref|YP_001804174.1| hypothetical protein cce_2760 [Cyanothece sp. ATCC 51142] gi|171699127|gb|ACB52108.1| unknown [Cyanothece sp. ATCC 51142] Length = 423 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 33/246 (13%), Positives = 81/246 (32%), Gaps = 25/246 (10%) Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 I + L+ S + + ++++ P + + + Sbjct: 2 KIQLTSALNDSHIDANQSNTQRQVAISLSAVTESSAPQSRSLRDRTLPLNLGLILDHSGS 61 Query: 207 NRK--IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 I + E+A LV+ + + R+ +A++ +++ VK Sbjct: 62 MTGKPIKTVKEAAMRLVDGLGASD--------RLSVVAFDHRAKVIVPNQPVDDIERVKQ 113 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + +L P T+ M +E+ K+ + +TDGE + + + Sbjct: 114 AIERLKPEGGTSIDEGMKLGIKEVALGKDDR---------VSQIFLLTDGE----NEHGD 160 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKI 383 L++ + + + ++ QD+L D+ G + + L F ++ Sbjct: 161 NERCLKLAQVAAEYNITVNTLGFG-NHWNQDVLESIADAVGGTLCYIEQPEQALTEFSRL 219 Query: 384 TDKIQE 389 +IQ Sbjct: 220 FTRIQS 225 >gi|284046352|ref|YP_003396692.1| hypothetical protein Cwoe_4905 [Conexibacter woesei DSM 14684] gi|283950573|gb|ADB53317.1| conserved hypothetical protein [Conexibacter woesei DSM 14684] Length = 317 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 60/201 (29%), Gaps = 40/201 (19%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +I +A ++ + I+ V + + + + ++ + Sbjct: 106 DRITAAKRAASRFLDQLPPGIRVGVTTF----------SDVPDGTQTPTYDHDLIRRTIE 155 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 T T A+ A L R ++ ++DG + + Sbjct: 156 AQIADGGTATGDALQVALDTL------ERLEQNGERTPAAMVLLSDGATTTGR------D 203 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPE-----------------GQDLLRKCTD-SSGQFFA 369 + + A + IY+VA+ + L+ D S G+ F Sbjct: 204 PVMVARAAGEARIPIYTVALGTRDATVPNPGPTGPPLLPVAPDPETLQAIADASGGRAFQ 263 Query: 370 VNDSRELLESFDKITDKIQEQ 390 D +EL ++ + ++ + Sbjct: 264 AQDDQELSSIYETLGSRLGTR 284 >gi|254460998|ref|ZP_05074414.1| von Willebrand factor, type A [Rhodobacterales bacterium HTCC2083] gi|206677587|gb|EDZ42074.1| von Willebrand factor, type A [Rhodobacteraceae bacterium HTCC2083] Length = 627 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 47/365 (12%), Positives = 102/365 (27%), Gaps = 27/365 (7%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN 92 D AV + + + + K Q+ T+ I + A A + + Sbjct: 123 ADMAVGAIASELAPSSVPVLRMSAKSQSRTLVAGSIADASVVAVEVDTEAFASASENTLK 182 Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 +T++ + + A Y I +L L P+A + + Sbjct: 183 VTRETSVSTFSVDVDTASYAIVRSSLMNGQLPPAAAVRIEEMVNYFPYDYAAPTQGVFAT 242 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 L + + + + PP + +T+ S P P ++ Sbjct: 243 SLAM-MATPWNENTQLLRIALQGQMPEVEARPPLNLVFLIDTSGSMNQPNKLPLLKQSLR 301 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 L+ + + +AY ++ + + L+ L Sbjct: 302 LLLGQLRAEDQVA--------------IVAYAGSAGQVLEPTAAHERDTILGALDNLAAG 347 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TN + AY +E + V+ TDG+ + N Sbjct: 348 GETNGQAGLQQAYALAKQMQEEGEVSR--------VLLATDGDFNVGL--SNAEALKGYI 397 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 R++G + + + ++ + N L E+ + D++ Sbjct: 398 ADKRDSGTFLSVLGFGRGNLDDETMQALAQNGNG--QANYIDNLPEAQKVLVDQLTGALF 455 Query: 393 RIAPN 397 IA + Sbjct: 456 PIASD 460 >gi|196231436|ref|ZP_03130294.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196224289|gb|EDY18801.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 341 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 69/200 (34%), Gaps = 41/200 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 + V+ +++ Q+ +PL+ + + L+++ + T Sbjct: 119 SRVDVVKQVTQKFIEARPNDRIGMIAFAARPYLVSPLTLDHGWLIQNLDRVKLGLVEDGT 178 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 A+ L K+ + V+ +TDG+N+ L E Sbjct: 179 AIGSAIASCTTRLIERKD---------SKSRIVVLLTDGDNNAGKVS-----PLTAAEAA 224 Query: 336 RNAGMKIYSVAVSAP----------------------PEGQDLLRKCTD-SSGQFFAVND 372 G+K+Y++ +D L+K D + +F+ D Sbjct: 225 SALGVKVYTIGAGTKGFAPMPVGRDVFGRKVYQNVKVDVDEDTLKKIADMTKAKFYRATD 284 Query: 373 SRELLESFDKITDKIQEQSV 392 ++ L + +++I D++++ V Sbjct: 285 TKSLTQIYEEI-DQLEKTKV 303 >gi|45384318|ref|NP_990352.1| collagen alpha-1(XII) chain precursor [Gallus gallus] gi|2506307|sp|P13944|COCA1_CHICK RecName: Full=Collagen alpha-1(XII) chain; AltName: Full=Fibrochimerin; Flags: Precursor gi|222811|dbj|BAA00701.1| alpha 1 chain of type XII collagen [Gallus gallus] gi|2326442|emb|CAA43358.1| collagen type XII alpha 1 chain [Gallus gallus] Length = 3124 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 55/166 (33%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +++ + ++ NT T A+ + Sbjct: 167 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYFRRSDLLDAIKRIPYKGGNTMTGEAIDYLV 226 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG+ +RN G++++S+ Sbjct: 227 KNTFTESAGAR-----KGFPKVAIVITDGKAQD--------EVEIPARELRNIGVEVFSL 273 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 + L+ F V + +++ ++I ++ Sbjct: 274 GI--KAADAKELKLIASQPSLKHVFNVANFDGIVDIQNEIILQVCS 317 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 63/163 (38%), Gaps = 23/163 (14%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKES 294 V+I + Y+ N + ++ +N TNT AM + +++ + Sbjct: 476 VQISLVQYSRDPHMEFSLNRYNRVKDIIQAINTFPYRGGSTNTGKAMTYVREKVFVTSKG 535 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 S + + +I ITDG+ ++ + +R+A ++I++V V + Sbjct: 536 SR-----PNVPRVMILITDGK--------SSDAFKEPAIKLRDADVEIFAVGV--KDAVR 580 Query: 355 DLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 L + V D ++F +I+ ++ + +RI Sbjct: 581 TELEAIASPPAETHVYTVEDF----DAFQRISFELTQSVCLRI 619 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 24/196 (12%), Positives = 60/196 (30%), Gaps = 19/196 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 A + D + + N++ +++ + Y+ ++ + Sbjct: 2334 ASWSIGDDNFNKVVKFVFNTVGAFDL-INPAGIQVSLVQYSDEAQSEFKLNTFDDKAQAL 2392 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 L + NT T A+ ++ + + K ++ +TDG + Sbjct: 2393 GALQNVQYRGGNTRTGKALTFIKEKVLT-----WESGMRRGVPKVLVVVTDGRSQD---- 2443 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESF 380 + ++++G ++ V V + L K F V+D + Sbjct: 2444 ----EVRKAATVIQHSGFSVFVVGV--ADVDYNELAKIASKPSERHVFIVDDFDAFEKIQ 2497 Query: 381 DKITDKIQEQSVRIAP 396 D + + E + P Sbjct: 2498 DNLVTFVCETATSTCP 2513 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 49/166 (29%), Gaps = 20/166 (12%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY-RE 287 + V+IG Y+ + + L NT T A+ Sbjct: 1230 DIGPDKVQIGLAQYSGDPRTEWNLNAYRTKEALLDAVTNLPYKGGNTLTGMALDFILKNN 1289 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 E K + ITDG++ + +R+ G+++Y++ + Sbjct: 1290 FKQEAGLRPRAR------KIGVLITDGKSQDDVVTPS--------RRLRDEGVELYAIGI 1335 Query: 348 SAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 ++ L++ + V D L + +T + Sbjct: 1336 --KNADENELKQIATDPDDIHAYNVADFSFLASIGEDVTTNLCNSV 1379 >gi|294141682|ref|YP_003557660.1| von Willebrand factor type A domain-containing protein [Shewanella violacea DSS12] gi|194578720|dbj|BAG66046.1| von Willebrand factor typeA domain protein [Shewanella violacea] gi|293328151|dbj|BAJ02882.1| von Willebrand factor type A domain protein [Shewanella violacea DSS12] Length = 334 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 47/153 (30%), Gaps = 36/153 (23%) Query: 254 PLSNNLNEVKSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V L ++ T A+ + + + ++ Sbjct: 143 PLTQDRRSVAQFLKEAQIGLVGKQTAIGEAIALGVKRFD----------MVDKSNRILVL 192 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEG 353 +TDG N+ S Q G+KIY++ V A Sbjct: 193 LTDGSNNSGSIS-----PEQAAAIAAKRGVKIYAIGVGADVMERRSIFGTERVNPSMDLD 247 Query: 354 QDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L + G +F S++L + + +I Sbjct: 248 EAQLSSLAKITGGLYFRARSSQDLQQIYQEIDK 280 >gi|220675931|emb|CAX12090.1| matrilin 3b [Danio rerio] Length = 434 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 58/164 (35%), Gaps = 15/164 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + + R+ + Y + + EVK ++++P T T A+ A ++ Sbjct: 235 DIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQV 294 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E + + K I +TDG ++ R +G++IY+V V Sbjct: 295 FTENAGARPLK--KGIGKVAIIVTDGRPQD--------KVEEVSAAARASGIEIYAVGV- 343 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 L++ F V + + K + + E+ Sbjct: 344 -DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRETLCEE 386 >gi|157827515|ref|YP_001496579.1| hypothetical protein A1I_06110 [Rickettsia bellii OSU 85-389] gi|157802819|gb|ABV79542.1| hypothetical protein A1I_06110 [Rickettsia bellii OSU 85-389] Length = 446 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 38/329 (11%), Positives = 91/329 (27%), Gaps = 23/329 (6%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN-ITKDKNNP 100 +VS + K K ++ +K + + + + Sbjct: 121 KLLVSKGKLSFKNPIKKDIKEATLKLLQATIKDEKSLSKILVNEEDITPFQKAIYHPTDF 180 Query: 101 LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLA--ISICMVLDVSR 158 Q I + + E + G I ++ + I S I +L Sbjct: 181 SQLITQISSNEENSLNFIMNNGAIAQSVQVYTADGKAPIIASDLKDGFIIDKQYLLKYLL 240 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + + ++ + P + S + V + Sbjct: 241 PIFNGFIWNEEGKFPIMFAP--KNPKVLDGENNYAHNISLLIDISGSMEKDFSVYKNNIL 298 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 +++ + + + +I + +N + NN+ ++K +N L T Y Sbjct: 299 KILDKLAEIP------NWQINIVVFNDESTARSFSNQENNIEDIKVYINNLKANGYTKLY 352 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY-MRN 337 + A G +I TDG++ G ++ + + + ++N Sbjct: 353 GTIKEALESF----------KGKIDESSTLIVFTDGKDEGTNSNVTEKDVVDVTSEVIKN 402 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 +Y+V Q+ + G Sbjct: 403 PQFNMYTVGFGQ-YYNQEFFEQVATRGGF 430 >gi|56797851|emb|CAF33338.1| matrilin-3a [Danio rerio] Length = 295 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 31/179 (17%), Positives = 60/179 (33%), Gaps = 17/179 (9%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNT 277 L + + + + R+ + Y + + +K + ++ P T T Sbjct: 86 FLADMVDT--LDVGPDATRVAVVNYASTVKIEFLLKSHLTKDTIKQAITRIEPLAAGTMT 143 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ A E + EK + S + K I +TDG ++ R Sbjct: 144 GMAIKKAMDEAFTEKSGARPK--SKNISKVAIIVTDGRPQD--------QVEEVSAAARA 193 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVRI 394 +G++IY+V V L+ + F V + + K + + E V I Sbjct: 194 SGIEIYAVGV--DRADMRSLKLMASNPLEDHVFYVETYGVIEKLTSKFRETLCEVKVEI 250 >gi|109897980|ref|YP_661235.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] gi|109700261|gb|ABG40181.1| von Willebrand factor, type A [Pseudoalteromonas atlantica T6c] Length = 343 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 58/170 (34%), Gaps = 38/170 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKE 293 R+G I + PL+ + V LN+ + T A+ A + Sbjct: 132 RLGLIFFADTAYLQA--PLTYDRETVSQLLNESLIGLVGEQTAIGDAIGLAIKRF----- 184 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--- 350 K +I +TDG+N+ + N E N G+ +Y++ V A Sbjct: 185 -----KSKEESNKVLILLTDGQNTAGNITPEQAN-----ELAINNGVTLYTIGVGADQML 234 Query: 351 --------------PEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + +L + G++F D++ L E + K+ + Sbjct: 235 VQSIFGSRQVNPSQELDEGMLTTLAESTGGRYFRARDAQSLTEIYSKLDE 284 >gi|307133505|dbj|BAJ19017.1| TadG [Aggregatibacter actinomycetemcomitans] Length = 538 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 49/481 (10%), Positives = 117/481 (24%), Gaps = 112/481 (23%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAV-LSGCASIVSDRTIK-------------------- 51 + + +D I+ + ++ A D A L + Sbjct: 41 VAFTVDGTGILLDKARLAQATDQAALLLIAEDNQYRKNKDHSDVKRQNVSQQEIEREGRN 100 Query: 52 --DPTTKKD---QTSTIFKKQIKKHLKQGSYIRENAGDIAQ--------KAQINITKDKN 98 + + + + + K +L+ + + Sbjct: 101 FSNAKVQAQWKKRNQELVQGVCKLYLRSDDSKGQKNSSPVTIKEPFLAECLEEKTQPKNQ 160 Query: 99 NPLQYIAESKAQYEI-------PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 N Q + + L + ++ T ++ + I + Sbjct: 161 NGTAKSVACVVQGSVQRKFWLPWGQTLVSSNQLHDGRVGINSGKTYAVKEKQITIPIDLM 220 Query: 152 MVLDVSRSMEDLY--LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR- 208 MV D+S SM+ + N + + + K K + Sbjct: 221 MVTDLSGSMKWYIDRKGDAHKPNRRIDALVEVVGEVQNILVTPRKIKEKCESLQSNGLLY 280 Query: 209 -------KIDVLIESAGNLVNSIQKAIQ-------------------------------- 229 K K Sbjct: 281 LLQQVLAKKGDTSGCVLPYYVQQSKTEYISELLRGRRGRTIREGLKEIERHMDIAKTVNQ 340 Query: 230 --EKKNLSVRIGTIAYNIGI--------VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 N + + ++N G V L K+ P T Sbjct: 341 IKNFNNGEKQSYSFSFNNGDFCLGGNEGKETTQAWFDQKKPNVSEALGKIEPLGGTAVTS 400 Query: 280 AMHHAYRELY--NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ-NTLNTLQICEYMR 336 M + N + + + +T ++ ++ ++DGE++ S L +C ++ Sbjct: 401 GMLIGINLMTSKNSEPEAAPSKLNTNTRRVLLILSDGEDNQPSEKTLVNLMGAGLCREIK 460 Query: 337 NA----------GM--KIYSVAVSAPPEGQDL--LRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ +A L ++C +++V + LL++F + Sbjct: 461 DKMNSLQDPKYGQVEPRVAFIAFGTNLPDNQLNAWKQCV--GKHYYSVFSKQGLLDAFKQ 518 Query: 383 I 383 I Sbjct: 519 I 519 >gi|332828718|gb|EGK01410.1| hypothetical protein HMPREF9455_02243 [Dysgonomonas gadei ATCC BAA-286] Length = 330 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 59/191 (30%), Gaps = 39/191 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENT 275 + + +K E N G PL+ + + + L ++ + T Sbjct: 111 TRLEAAKKVAAEFINDRPNDRIGLVIFGGESFTQCPLTTDHKVLLNLLTEVKFGMIEDGT 170 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + ++ L S + VI +TDG N+ + L E Sbjct: 171 AIGLGLANSVNRL----------KDSKSKSRVVILLTDGSNNAGQ-----IAPLTAAELA 215 Query: 336 RNAGMKIYSVAVSA---------------------PPEGQDLLRKCTDSSGQFFAVNDSR 374 + +++Y++ + + + + L + GQ+F D+ Sbjct: 216 ASYDIRVYTIGIGSRGTSTARIMTPYGLQTMQVSGDFDERTLTEIAAITKGQYFRATDNT 275 Query: 375 ELLESFDKITD 385 L +D+I Sbjct: 276 SLSAIYDEIDQ 286 >gi|159901412|ref|YP_001547659.1| hypothetical protein Haur_4901 [Herpetosiphon aurantiacus ATCC 23779] gi|159894451|gb|ABX07531.1| conserved hypothetical membrane protein [Herpetosiphon aurantiacus ATCC 23779] Length = 330 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 38/216 (17%), Positives = 68/216 (31%), Gaps = 51/216 (23%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL-NEVKSR 265 +I V E V K RIG + ++ L + + + Sbjct: 108 KDRITVAKEVIAEFV---------KGRKDDRIGLVVFSGHAFTQVPLTLDYDFLQNLLGQ 158 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + + T A+ H+ L ST K VI +TDG N+ Sbjct: 159 VQTVRRPDGTAIGLALAHSVNGL----------RNSTTKSKVVILLTDGSNNRG-----D 203 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPE----------------------GQDLLRKCTD- 362 + Q E R +++Y++ V P + LR + Sbjct: 204 IEPAQAAEIARALDVRVYTILVGKPGNGEYPVHDPWRDETYLIPAPTAEDEVALRDIAEQ 263 Query: 363 SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + G FF D + L + +D I + ++A + Sbjct: 264 TGGIFFRAGDEQGLRDVYDTIDKM---ERSQVASEK 296 >gi|297460736|ref|XP_599315.5| PREDICTED: collagen, type XXII, alpha 1 [Bos taurus] Length = 1605 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 70/195 (35%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ + Sbjct: 45 SSSSVGKENFEKVRQWVANLVDTFE-----VGPERTRVGVVRYSDRPATAFELGRFGSRA 99 Query: 261 EVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 V++ +L + T+T A+ R + + G K+ I +TDG + Sbjct: 100 AVRAAARQLAYHGGHTHTGDALRFITRHSFT--PRAGGRPGDRAFKQVAILLTDGRSQD- 156 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELL 377 L R AG++I++V V ++ L + ++ F V+D + Sbjct: 157 -------LVLPAATAARRAGIRIFAVGVG--EALREELEEIASEPTAAHVFHVSDFDAID 207 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 208 KIRGKLRRRLCENVL 222 >gi|297482250|ref|XP_002692646.1| PREDICTED: collagen, type XXII, alpha 1-like [Bos taurus] gi|296480820|gb|DAA22935.1| collagen, type XXII, alpha 1-like [Bos taurus] Length = 1605 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 31/195 (15%), Positives = 70/195 (35%), Gaps = 20/195 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ + Sbjct: 45 SSSSVGKENFEKVRQWVANLVDTFE-----VGPERTRVGVVRYSDRPATAFELGRFGSRA 99 Query: 261 EVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 V++ +L + T+T A+ R + + G K+ I +TDG + Sbjct: 100 AVRAAARQLAYHGGHTHTGDALRFITRHSFT--PRAGGRPGDRAFKQVAILLTDGRSQD- 156 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELL 377 L R AG++I++V V ++ L + ++ F V+D + Sbjct: 157 -------LVLPAATAARRAGIRIFAVGVG--EALREELEEIASEPTAAHVFHVSDFDAID 207 Query: 378 ESFDKITDKIQEQSV 392 + K+ ++ E + Sbjct: 208 KIRGKLRRRLCENVL 222 >gi|297626137|ref|YP_003687900.1| ChlD, Mg-chelatase subunit ChlD [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921902|emb|CBL56462.1| ChlD, Mg-chelatase subunit ChlD [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 324 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 20/166 (12%), Positives = 52/166 (31%), Gaps = 24/166 (14%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 N P + + V + L + T ++ L + + Sbjct: 143 SGHPNTLVPPTTDRAPVNQGIKTLELADGTAIASSIDVGLEALK----QAPAGDDGKQAP 198 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DG +G + + + + + IY++A Sbjct: 199 GLMVLLSDGSETGGG------DPVASADKAKQQNVPIYTIAFGTQNGYVDLDGQRFNVAP 252 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 D+L++ D SSG+ + +L + + +T + ++ Sbjct: 253 DTDMLKRIADASSGKALDAASASQLDDVYKTLTSDVGYETAHTEVT 298 >gi|254443725|ref|ZP_05057201.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198258033|gb|EDY82341.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 339 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 60/162 (37%), Gaps = 35/162 (21%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 PL+ + + + +L + T ++ A L + F Sbjct: 143 TVAPLTFDHKWLARQTERLQIGLIEDGTAIGDSLAVATSRLLEGAKERAGEREG----AF 198 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP---------------- 351 ++ +TDGEN+ + ++ ++AG+++Y++A Sbjct: 199 IVLLTDGENTAGMM-----DPMEGATLAKDAGIRVYTIAAGKNGYVPFPRRNERGERIGT 253 Query: 352 -------EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + + L++ +++G+FF +S + ++F+KI Sbjct: 254 TQEFLRVDTETLMKIANETNGEFFRAENSDTIDQAFEKIDAS 295 >gi|153842534|ref|ZP_01993517.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ3810] gi|149745366|gb|EDM56617.1| von Willebrand factor type A domain protein [Vibrio parahaemolyticus AQ3810] Length = 223 Score = 70.3 bits (170), Expect = 5e-10, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 54/154 (35%), Gaps = 36/154 (23%) Query: 253 TPLSNNLNEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TPL+ + V ++ + + T + + + S ++ +I Sbjct: 48 TPLTADRQTVIQQIKQTVIGLVGQRTAIGDGIGLGTKTFVD----------SDAPQRVMI 97 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------ 351 ++DG N+ L+ ++ E + IY+V V A Sbjct: 98 LLSDGSNTAGV-----LDPIEAAEIAKKYNATIYTVGVGAGEMMVKDFFMTRKVDTAADL 152 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + Q L + + GQ+F D+ +L + +D I Sbjct: 153 DEQTLTKIAEMTGGQYFRARDAEQLEKIYDTINK 186 >gi|56797859|emb|CAG27402.1| matrilin-3b [Danio rerio] Length = 434 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 58/164 (35%), Gaps = 15/164 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + + R+ + Y + + EVK ++++P T T A+ A ++ Sbjct: 235 DIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMDQV 294 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E + + K I +TDG ++ R +G++IY+V V Sbjct: 295 FTENAGARPLK--KGIGKVAIIVTDGRPQD--------KVEEVSAAARASGIEIYAVGV- 343 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 L++ F V + + K + + E+ Sbjct: 344 -DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRETLCEE 386 >gi|18700173|gb|AAL77698.1| At2g38970/T7F6.14 [Arabidopsis thaliana] Length = 692 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 19/183 (10%), Positives = 59/183 (32%), Gaps = 25/183 (13%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN--LNEVKSR 265 K+ +L + G ++ ++ R+ IA++ ++ Sbjct: 267 TKLALLKRAMGFVIQNLGSND--------RLSVIAFSSTARRLFPLTKMSDAGRQRALQA 318 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N + TN + + + ++++ + +I ++DG + + Sbjct: 319 VNSVVANGGTNIAEGLRKGVKVMEDQRDKNPVAS--------IILLSDGRATYTMNQADP 370 Query: 326 LNTLQI------CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 L + CE + + ++S + + + S G F + + ++ Sbjct: 371 NYKLLLPLSMHGCES-KRFQIPVHSFGFGSDHDASLMHSVSETSGGTFSFIESESVIQDA 429 Query: 380 FDK 382 + Sbjct: 430 LAQ 432 >gi|221109964|ref|XP_002168937.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 221 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 53/154 (34%), Gaps = 10/154 (6%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT-NTYPAMHHAY 285 + S R I ++ + +NLN K ++ + +T A+ A Sbjct: 59 SAFGVSINSTRAAVITFSYHAQLSIKLNKYSNLNSFKEAVDNIVLMGSTTRIDKALRLAQ 118 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 +E++ + + + K + +TDG + +N + I +R+AG+ I + Sbjct: 119 KEVFELENGAR-----PGVAKILFLLTDGSQTQERGSENPVV---IANELRSAGVTIIVI 170 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + L + + +L + Sbjct: 171 GI-TNAVDVSELFDIAGGEENAYFADSFEKLKDV 203 >gi|209527269|ref|ZP_03275780.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492336|gb|EDZ92680.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 414 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 66/181 (36%), Gaps = 23/181 (12%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ++ + ++A L++ + N+ RI +A++ + + +K +++ L Sbjct: 59 LETVKQAAKELIDRL--------NVGDRISVVAFDHRAKVLVPNQDLTDPDGIKKKIDGL 110 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T + EL K+ + +TDGE + + + L Sbjct: 111 RCSGGTAIDEGIKLGIEELGKGKQDRISQG---------FLLTDGE----NEHGDNKRCL 157 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF-AVNDSRELLESFDKITDKIQ 388 ++ + + I S+ QD+L K D+ G + + + F ++ ++Q Sbjct: 158 KLAKLATEYKLTINSLGFGDDWN-QDILEKIADAGGGALGYIEYPEQAIAEFGRLFTRMQ 216 Query: 389 E 389 Sbjct: 217 S 217 >gi|156741949|ref|YP_001432078.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233277|gb|ABU58060.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 847 Score = 70.3 bits (170), Expect = 6e-10, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 53/140 (37%), Gaps = 21/140 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 +L +++ R++ L T+ Y A+ EL + + + +TDG Sbjct: 456 SLADIQRRISTLPLGGGTDIYNALQTGLPELARQPGRVRHA----------VLLTDGR-- 503 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSREL 376 S + + E R+ + + ++A+ DLL+ +G+++ + ++ Sbjct: 504 --SFTDDRQAYQALIEEARSRNITLSTIAIGTDA-DIDLLQTLARWGAGRYYFAAEPGDI 560 Query: 377 LESFDKITDKIQEQSVRIAP 396 ++ + VR P Sbjct: 561 PR-----LTLLESEIVRTEP 575 >gi|323345325|ref|ZP_08085548.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] gi|323093439|gb|EFZ36017.1| aerotolerance protein BatA [Prevotella oralis ATCC 33269] Length = 332 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 54/161 (33%), Gaps = 44/161 (27%) Query: 254 PLSNNLNEVKSRLNKLNPY--------ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P++ + + + L + + T + +A L S Sbjct: 143 PMTTDHASLLNLLQNVRTDIAARGLIQDGTAVGMGLANAVSRL----------KDSKAKS 192 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--------------- 350 K VI +TDG N+ L+ + + ++ G+++Y++ V Sbjct: 193 KVVILLTDGSNNMG-----DLSPMTSAQIAKSLGIRVYTIGVGTNKVARYPMPVTGGIQY 247 Query: 351 ------PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + + L + G F+ ++REL + ++ I Sbjct: 248 VNIPVEIDTKTLSDIAATTDGNFYRATNNRELKQIYNDIDK 288 >gi|281354485|gb|EFB30069.1| hypothetical protein PANDA_020540 [Ailuropoda melanoleuca] Length = 1096 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 63/193 (32%), Gaps = 20/193 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + NLV++ + R+G + Y+ L + Sbjct: 557 TSSSVGKEDFEKVRQWVANLVDTFE-----VGPERTRVGVVRYSDQPTTAFELGLFGSRE 611 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 VK+ L NTNT A+ R ++ + G K+ I + G + Sbjct: 612 AVKAAARHLAYHGGNTNTGDALRFITRHSFS--PQAGGRPGDRAFKQVAILLPAGRSQDL 669 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 G++I++V V ++ L + F V+D + Sbjct: 670 VLDAAAAAHRA--------GIRIFAVGVG--AALKEELEEIASEPKSAHVFHVSDFNAID 719 Query: 378 ESFDKITDKIQEQ 390 + K+ ++ E Sbjct: 720 KIRGKLRRRLCES 732 >gi|254514588|ref|ZP_05126649.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] gi|219676831|gb|EED33196.1| von Willebrand factor, type A [gamma proteobacterium NOR5-3] Length = 347 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 40/308 (12%), Positives = 91/308 (29%), Gaps = 53/308 (17%) Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS---RSMEDLYL 165 A + + P T +R + V+ VS ++ + Sbjct: 8 AMLLLLPLPWVFRRFAPPRSTERPALRAPFFQRWQSLASQDADSVVRVSAKIPAIALWTI 67 Query: 166 QKHNDNNNMTSNKYLLP---PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 P + ++ I++ L + Sbjct: 68 WLCMLLAAARPLWVGEAIELPNSGRDLMLAVDISGSMRVEDMQVGNRMARRIDAVKQLGS 127 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLN-PYENTNTYP 279 + R+G I + +PLS ++ V+ L +++ + T Sbjct: 128 DFMS-----RRSGDRLGLILFGSRAYLQ--SPLSFDIQTVQRFLLESQIGFAGQETAIGD 180 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A+ A + L +S + +I +TDG+++ +T++ L+ + G Sbjct: 181 AIGLAVKRLQERPATS----------RVLILLTDGQDTA-----STVDPLEAANLAADLG 225 Query: 340 MKIYSVAVSAPP----------------------EGQDLLRKCTDSSGQFFAVNDSRELL 377 ++IY++ + A + L+ + + GQ+F D EL Sbjct: 226 VRIYTIGIGADSLTLPGLLGSPLGARTVNPSADLDENSLIAIASSTGGQYFRARDPEELA 285 Query: 378 ESFDKITD 385 + + Sbjct: 286 TVYRLLEK 293 >gi|83816834|ref|YP_446668.1| von Willebrand factor type A domain-containing protein [Salinibacter ruber DSM 13855] gi|83758228|gb|ABC46341.1| von Willebrand factor type A domain protein [Salinibacter ruber DSM 13855] Length = 289 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 26/155 (16%), Positives = 54/155 (34%), Gaps = 38/155 (24%) Query: 254 PLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + ++ L + + T A+ A L S K I Sbjct: 106 PLTLDYSFLQRMLEDVEVGAVEDGTAVGTALATAVNRL----------KDSEAESKVAIL 155 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------------ 352 +TDG N+ ++ E R G+++Y++ V + + Sbjct: 156 LTDGRNNRGQ-----IDPRTAAEVARTMGVRVYAIGVGSSEDRDTWEEPLPQGQRDESAG 210 Query: 353 -GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 ++LR + + GQ+F+ + L + +I Sbjct: 211 VDAEMLRSVSVSTGGQYFSATNRDALERIYAEIDT 245 >gi|260782980|ref|XP_002586557.1| hypothetical protein BRAFLDRAFT_106340 [Branchiostoma floridae] gi|229271674|gb|EEN42568.1| hypothetical protein BRAFLDRAFT_106340 [Branchiostoma floridae] Length = 534 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 30/161 (18%), Positives = 58/161 (36%), Gaps = 21/161 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYREL 288 + + R+G + Y+ + V+ + + TNT A+ E+ Sbjct: 178 DISATTTRVGMVQYSTNVTPEFMLKEHTTKKSVEKAIGDVKRLGGGTNTGKALKFVRTEM 237 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 K+ I +TDG++ +R AG+ +Y+V V Sbjct: 238 DWRDPP---------TKRVAIVVTDGKSQDDVGTP--------ATALRQAGVVLYAVGVG 280 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 P D L++ T + +A+N EL + I++ + E Sbjct: 281 LP---TDELKEITGDPTKVYALNSYDELQDIIQDISNSVVE 318 >gi|161788949|dbj|BAF95091.1| double von Willebrand factor A domains [Mus musculus] Length = 2309 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 54/158 (34%), Gaps = 21/158 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 VR G + Y+ I+ ++ + + ++ + T T A+ N Sbjct: 884 DRVRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQEGGGTTTGKALSKMVPVFQNTA 943 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + +++I ITDG+++ + + +R+ G+ IY++ V Sbjct: 944 RV--------DVARYLIVITDGQSTDP--------VAEAAQGLRDIGVNIYAIGV--RDA 985 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L + + F + + L ++ I Sbjct: 986 NTTELEEIASK--KMFFIYEFDSLKSIHQEVIRDICSS 1021 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 60/184 (32%), Gaps = 17/184 (9%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-E 273 S N + + ++Q ++R+G Y + + +V + L Sbjct: 49 HSVRNFLYILANSLQ-VGRDNIRVGLAQYGDTPTSEFLLSVYHRKGDVLKHIRGLQFKPG 107 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 A+ L + + S + + + ++ G + + E Sbjct: 108 GNRMGQALQFI---LEHHFREGAGSRASQGVPQVAVVVSSGL--------AEDHIREPAE 156 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQS 391 +R AG+ +Y++ V Q LR+ + S F V + L K+ ++ Sbjct: 157 ALRRAGILVYAIGV--KDASQAELREISSSPKDNFTFFVPNFPGLPGLAQKLRPELCSTL 214 Query: 392 VRIA 395 + A Sbjct: 215 AKAA 218 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 43/119 (36%), Gaps = 16/119 (13%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + T+T A++ + + + +++I ITDG + A Sbjct: 1104 SDGTHTGKALNFTLPFFDSSRGGRPR------VHQYLIVITDGVSQDNVAPP-------- 1149 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + +R+ + I+++ V + L + T+ + F + L +I ++ Sbjct: 1150 AKALRDRNIIIFAIGVG--NVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEVCSS 1206 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 38/302 (12%), Positives = 91/302 (30%), Gaps = 30/302 (9%) Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + + E ++ GL L ++ + + Sbjct: 539 VVITSGKSEDEVGEVAQILRKRGVDIVSVGLQDFDRAELEGIGPVVLVSDLQGEDRIRQL 598 Query: 153 VLDVSRSMEDLYLQKHNDNN-NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 +LDV+ ++ + + + L P P + ++ A P +++ Sbjct: 599 MLDVNMFIQGSPKPPRVMTDVAKDAVEECLVPVPADLVFL----VEDFSSAGQPNFQRV- 653 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN- 270 L ++ + R+ + Y+ + + +V S L++L Sbjct: 654 -----VHFLTTTVHSLNIHP--DTTRVSLVFYSEKPRLEFSLDMYQSAAQVLSHLDRLTF 706 Query: 271 --PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T A+ +E++ ++ S G ++ ++ + N Sbjct: 707 QARRGRTKAGAALDFLRKEVFLPEKGSRPHRGVQQIAVVIM-----------ESPSLDNV 755 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDK 386 Y+R AG+ IY+ + E +D L K + +L +K+ K Sbjct: 756 STPASYLRRAGVTIYAAGIQPASESKD-LEKIVTYPPWKHAIRLESFLQLSVVGNKLKKK 814 Query: 387 IQ 388 + Sbjct: 815 LC 816 >gi|148689169|gb|EDL21116.1| mCG140660 [Mus musculus] Length = 2242 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 54/158 (34%), Gaps = 21/158 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 VR G + Y+ I+ ++ + + ++ + T T A+ N Sbjct: 884 DRVRFGVVQYSDKIISQFFLTQYASMAGLSAAIDNIQQEGGGTTTGKALSKMVPVFQNTA 943 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + +++I ITDG+++ + + +R+ G+ IY++ V Sbjct: 944 RV--------DVARYLIVITDGQSTDP--------VAEAAQGLRDIGVNIYAIGV--RDA 985 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L + + F + + L ++ I Sbjct: 986 NTTELEEIASK--KMFFIYEFDSLKSIHQEVIRDICSS 1021 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 60/184 (32%), Gaps = 17/184 (9%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-E 273 S N + + ++Q ++R+G Y + + +V + L Sbjct: 49 HSVRNFLYILANSLQ-VGRDNIRVGLAQYGDTPTSEFLLSVYHRKGDVLKHIRGLQFKPG 107 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 A+ L + + S + + + ++ G + + E Sbjct: 108 GNRMGQALQFI---LEHHFREGAGSRASQGVPQVAVVVSSGLT--------EDHIREPAE 156 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQS 391 +R AG+ +Y++ V Q LR+ + S F V + L K+ ++ Sbjct: 157 ALRRAGILVYAIGV--KDASQAELREISSSPKDNFTFFVPNFPGLPGLAQKLRPELCSTL 214 Query: 392 VRIA 395 + A Sbjct: 215 GKAA 218 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 43/119 (36%), Gaps = 16/119 (13%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + T+T A++ + + + +++I ITDG + A Sbjct: 1104 SDGTHTGKALNFTLPFFDSSRGGRPR------VHQYLIVITDGVSQDNVAPP-------- 1149 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + +R+ + I+++ V + L + T+ + F + L +I ++ Sbjct: 1150 AKALRDRNIIIFAIGVG--NVQRAQLLEITNDQDKVFQEENFESLQSLEKEILSEVCSS 1206 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 38/302 (12%), Positives = 91/302 (30%), Gaps = 30/302 (9%) Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + + E ++ GL L ++ + + Sbjct: 539 VVITSGKSEDEVGEVAQILRKRGVDIVSVGLQDFDRAELEGIGPVVLVSDLQGEDRIRQL 598 Query: 153 VLDVSRSMEDLYLQKHNDNN-NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 +LDV+ ++ + + + L P P + ++ A P +++ Sbjct: 599 MLDVNMFIQGSPKPPRVMTDVAKDAVEECLVPVPADLVFL----VEDFSSAGQPNFQRV- 653 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN- 270 L ++ + R+ + Y+ + + +V S L++L Sbjct: 654 -----VHFLTTTVHSLNIHP--DTTRVSLVFYSEKPRLEFSLDMYQSAAQVLSHLDRLTF 706 Query: 271 --PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T A+ +E++ ++ S G ++ ++ + N Sbjct: 707 QARRGRTKAGAALDFLRKEVFLPEKGSRPHRGVQQIAVVIM-----------ESPSLDNV 755 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDK 386 Y+R AG+ IY+ + E +D L K + +L +K+ K Sbjct: 756 STPASYLRRAGVTIYAAGIQPASESKD-LEKIVTYPPWKHAIRLESFLQLSVVGNKLKKK 814 Query: 387 IQ 388 + Sbjct: 815 LC 816 >gi|119470787|ref|ZP_01613398.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] gi|119446014|gb|EAW27293.1| hypothetical protein ATW7_05591 [Alteromonadales bacterium TW-7] Length = 328 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 62/194 (31%), Gaps = 35/194 (18%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN--KL 269 + + ++ + + G TPL+ +L V LN ++ Sbjct: 105 AYNGQYVDRLTMVKAVLSDFIEQRTGDRLGLILFGDTAFLQTPLTRDLKTVTKMLNEAQI 164 Query: 270 NPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T A+ + + + + V+ +TDG+N+ + LN Sbjct: 165 GLVGRATAIGDALGLSVKRFAS----------KDDSNRIVVLLTDGQNTAGN-----LNP 209 Query: 329 LQICEYMRNAGMKIYSVAVSAPPE----------------GQDLLRKCT-DSSGQFFAVN 371 R G+K+Y++ V + + LL+ D+ G +F Sbjct: 210 DDALLLAREEGIKVYTIGVGSDNPRGFSLFNMGGSGGSNLDERLLKNIADDTGGLYFRAK 269 Query: 372 DSRELLESFDKITD 385 D L + + ++ Sbjct: 270 DVAGLKQIYAELDK 283 >gi|74214868|dbj|BAE33446.1| unnamed protein product [Mus musculus] Length = 1169 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 54/159 (33%), Gaps = 22/159 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 + + + +PLS + + +L T T A+ H EL+ + Sbjct: 193 QFSDYFRVHFTFNNFISTSSPLS-----LLGSVRQLR--GYTYTASAIKHVITELFTTQS 245 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---AP 350 + K +I ITDG G + +++ + + Y++ V Sbjct: 246 GAR-----QDATKVLIVITDGRKQGDNLSYDSVIPMAEAASIIR-----YAIGVGKAFYN 295 Query: 351 PEGQDLLRKCTDSSGQ--FFAVNDSRELLESFDKITDKI 387 + L+ F+V + L + +++ +KI Sbjct: 296 EHSKQELKAIASMPSHEYVFSVENFDTLKDIENQLKEKI 334 >gi|326435125|gb|EGD80695.1| hypothetical protein PTSG_01285 [Salpingoeca sp. ATCC 50818] Length = 1006 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 23/184 (12%), Positives = 53/184 (28%), Gaps = 24/184 (13%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + ++ + + V+ + +VR+ + Y+ + S + Sbjct: 315 VDVSGSVGAANFALVRDFIASTVDMLP-----VGENTVRVALMTYHSSNMPQFDFDDSFD 369 Query: 259 LNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 V S ++ L T T A++ + V TDG Sbjct: 370 RATVVSAISSLVYDDSRQYGTATGSALNFFADNMLQASAGYRGG------PAIVYVFTDG 423 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 + ++ G ++ ++ V + L++ S F V D Sbjct: 424 ASQDDVTPG--------AAALQATGAQVVTIGV-TAAVNEAELQEIASSPSDVFIVADFD 474 Query: 375 ELLE 378 L + Sbjct: 475 SLTD 478 >gi|291295619|ref|YP_003507017.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470578|gb|ADD27997.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 412 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 16/133 (12%), Positives = 46/133 (34%), Gaps = 12/133 (9%) Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 TP+++ + + + + +T + ++ +E+ V+ Sbjct: 101 PSTPVADGRAAIANLIRTIRTGGSTALHAGWLEGATQVAAYQEAGRLNR--------VVL 152 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFA 369 ++DG + N + + G+ ++ V +DL+ D+ G ++ Sbjct: 153 LSDGLANRGE--TNPGVIAEQVRELARRGVSTSTLGVGLD-YNEDLMTTMADAGEGNYYF 209 Query: 370 VNDSRELLESFDK 382 + +L F + Sbjct: 210 IESPADLPRIFAQ 222 >gi|222616410|gb|EEE52542.1| hypothetical protein OsJ_34771 [Oryza sativa Japonica Group] Length = 654 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 68/197 (34%), Gaps = 25/197 (12%) Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 S N + +P ++DVL S ++ + R+ +A+N G Sbjct: 77 DVSGSMNDPVAAASPKSNLQGSRLDVLKASMKFVIRKLADGD--------RLSIVAFNDG 128 Query: 247 IVGNQCTPL----SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 V + L + + ++++L T PA+ A + L S Sbjct: 129 PVKEYSSGLLDVSGDGRSIAGKKIDRLQARGGTALMPALEEAVKILDE------RQGSSR 182 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 F++ +TDG+++ + + + +++ + A + + LL Sbjct: 183 NRVGFILLLTDGDDTTGFRWTRDAIHGAVAK------YPVHTFGLGASHDPEALLHIAQG 236 Query: 363 SSGQFFAVNDSRELLES 379 S G + V+D L Sbjct: 237 SRGTYSFVDD-DNLANI 252 >gi|329922584|ref|ZP_08278159.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328942128|gb|EGG38410.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 421 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 24/206 (11%), Positives = 68/206 (33%), Gaps = 28/206 (13%) Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 S P + + + ++ + + + Sbjct: 115 IVLVIDNSGSMNETDPNQDRYTAAKNLINRMDRDNRVSVIMFDHATTLLQPFTRVNNQET 174 Query: 250 NQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 +E+ + ++ L T+ A+ + +++ + V Sbjct: 175 ---------KDEIIAEIDGLATTDGGTDISLALEDTMSHIQESRDAGRSA--------MV 217 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS-APPEGQDLLRKCT-DSSGQ 366 I ++DG + + ++ + + + ++ +S P+G LL+ ++ GQ Sbjct: 218 IMLSDG--------FSETDHDRVLAEYKQQQIAVNTIGLSLVNPDGAQLLQTIAAETGGQ 269 Query: 367 FFAVNDSRELLESFDKITDKIQEQSV 392 ++ V + +L F KI D + ++S+ Sbjct: 270 YYDVQHAEDLSFVFQKIYDDVGDRSL 295 >gi|170578661|ref|XP_001894496.1| Transmembrane cell adhesion receptor mua-3 precursor [Brugia malayi] gi|158598882|gb|EDP36664.1| Transmembrane cell adhesion receptor mua-3 precursor, putative [Brugia malayi] Length = 1742 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 61/185 (32%), Gaps = 13/185 (7%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + V + + + + R+G I Y+ I + V Sbjct: 512 GSGSIGSYVFKNEVLRFIKEFVEL-FDIGLDNTRVGLIQYSDQIRHEFDLSQYTDKASVI 570 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 S ++++ T T A+ H E ++E+ + + + I ITDG + Sbjct: 571 SAISQVQYLTGLTRTGAAIQHMVMEGFSERRGAR--KEGDDVARVSIVITDGRSQD---- 624 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 N + R + + ++SV V L S ++F V+ ++L Sbjct: 625 ----NVTEPAXNARKSHINMFSVGV-TDHVLGPELEAIAGSPLRWFHVDKFKDLDTRLRS 679 Query: 383 ITDKI 387 + K Sbjct: 680 LIQKA 684 >gi|53802771|ref|YP_115472.1| batB protein [Methylococcus capsulatus str. Bath] gi|53756532|gb|AAU90823.1| putative batB protein [Methylococcus capsulatus str. Bath] Length = 328 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 54/153 (35%), Gaps = 36/153 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ + T A+ A + L + ++ +I Sbjct: 150 PLTFDRKTVEKLLDEAAIGLAGDKTAIGDAIGLAIKRL----------RDNPADQRVLIL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEG 353 ++DG N+ + LQ E G+KIY++ V A Sbjct: 200 LSDGANTAGQ-----VQPLQAAELAAREGLKIYTIGVGADEMIVRDFFGTRRVNPSEDLD 254 Query: 354 QDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + + + + G++F ++ EL + + Sbjct: 255 EAAMTAIAEKTGGRYFRARNTEELDRIYALLDR 287 >gi|139439379|ref|ZP_01772820.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] gi|133775158|gb|EBA38978.1| Hypothetical protein COLAER_01839 [Collinsella aerofaciens ATCC 25986] Length = 2432 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 47/334 (14%), Positives = 101/334 (30%), Gaps = 62/334 (18%) Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS-TGIIERSSENLAISICMVLD 155 ++ I T+ + +L + + + + + I MVLD Sbjct: 60 AGGYNGKEVTTQNVGRIWTDKTVKAVESGDSDFLTTLSAISSTSDTTISGKPLDIVMVLD 119 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 S SM+D N + + + A ++ ++ Sbjct: 120 ASGSMDDPMGTGDNTKRIDALKT-------AANTFIDAIAAQNQSITDASKQHRVAIVKF 172 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNKLNPYEN 274 + +K + N + R G YN T + + +K + +NP + Sbjct: 173 AG-------KKKTDKVGNDTYRDGRYTYNYSQTMKNLTSCKGKDADSLKDTVGNINPAGS 225 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC-- 332 T + A E+ G KK V+F TDG + +S +Q ++ I Sbjct: 226 TQADYGLELA--------ENITINSGRADAKKIVVFFTDGSPTSSSGFQASVADSAIASA 277 Query: 333 EYMRNAGMKIYSVAVSAPPEG------------QDLLRKC-------------------- 360 + ++ G IY++ + + + Sbjct: 278 KSLKANGADIYTIGIFSGANPSADPTAEGTSKVNKFMHAVSSNYPGATSSISFWGEWVID 337 Query: 361 ----TDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++S + + + EL + F++I+ I + Sbjct: 338 YGTRAENSDYYKSATSASELEKIFEEISGSIIQT 371 >gi|73974565|ref|XP_532319.2| PREDICTED: similar to collagen, type XIV, alpha 1 [Canis familiaris] Length = 1796 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 65/201 (32%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + N Sbjct: 1037 VDGSWSIGDENFNKIINFLYSTVGALNKIGADGT----QVAMVQFTDDPRTEFKLNAYNT 1092 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ H L+ + + + K ++ ITDG + Sbjct: 1093 KETLLDAIKHISYKGGNTKTGKAIKHVRDSLFTAESGTRR-----GIPKVIVVITDGRSQ 1147 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M++ G I++V V L F V+D Sbjct: 1148 DDVN--------KISGEMQSNGYSIFAVGV--ADADYSELVSIGSKPSSRHVFFVDDF-- 1195 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1196 --DAFKKIEDELITFVCETAS 1214 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 59/168 (35%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + RIG Y+ N +EV + L NT T A+++ + Sbjct: 186 TAFNVGSEKTRIGLAQYSGDPRIEWHLNAFNTKDEVIEAVRNLPYKGGNTLTGLALNYIF 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + T + K I ITDG++ + +R +G++++++ Sbjct: 246 ENSFKPEAGAR-----TGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V ++ LR+ + V + + + +T + + Sbjct: 293 GV--KNADENELREIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 338 >gi|22761666|dbj|BAC11648.1| unnamed protein product [Homo sapiens] Length = 451 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 57/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ P T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHPSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|59939916|ref|NP_001012385.1| matrilin 3b [Danio rerio] gi|56797875|emb|CAG30518.1| matrilin-3b precursor [Danio rerio] gi|220675929|emb|CAX12088.1| matrilin 3b [Danio rerio] Length = 478 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + + R+ + Y + + EVK ++++P T T A+ A ++ Sbjct: 235 DIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQV 294 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E + + K I +TDG ++ R +G++IY+V V Sbjct: 295 FTENAGARPLK--KGIGKVAIIVTDGRPQD--------KVEEVSAAARASGIEIYAVGV- 343 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 L++ F V +L F + Sbjct: 344 -DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRE 381 >gi|56460106|ref|YP_155387.1| von Willebrand factor type A (vWA) domain-containing protein [Idiomarina loihiensis L2TR] gi|56179116|gb|AAV81838.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Idiomarina loihiensis L2TR] Length = 327 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 66/199 (33%), Gaps = 47/199 (23%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ ++ + + R+G I + TP++ + N VK LN Sbjct: 111 NRLTMVKHVLSDFIER---------REGDRLGLILFADTAYLQ--TPMTYDRNTVKQMLN 159 Query: 268 K--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + L T A+ + + + + ++ +TDG+N+ + Sbjct: 160 ESVLGLVGERTAIGDAIALSVKRF----------RDDEKSNRVLVLLTDGQNTAGN---- 205 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCT-DSSGQ 366 L Q E + + IY +AV A L++ + G+ Sbjct: 206 -LPPEQALELAQAYDVTIYPIAVGAEEVVVDSFFGQRRVNPSRDLDVPLMQSIAKQTGGK 264 Query: 367 FFAVNDSRELLESFDKITD 385 +F + EL E + ++ Sbjct: 265 YFRARSTNELEEIYQRLDK 283 >gi|291290987|ref|NP_001167519.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 3 [Mus musculus] Length = 1150 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 36/341 (10%), Positives = 92/341 (26%), Gaps = 21/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K L G +++ +D Sbjct: 140 LADAAENFQKAHRWQDNIKEEDIMYYDAK-ADAELDDPESEDMERGSKTSALRLDFIEDP 198 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L IE ++ + + + Sbjct: 199 NFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSAT 257 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + P + S + + + + ++ S Sbjct: 258 GVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 317 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T Sbjct: 318 CEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQGMVAKGTTG 373 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG N Sbjct: 374 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 420 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N +++++ +V L+ ++ G +F + + Sbjct: 421 NRTVRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|291290985|ref|NP_001167518.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 1 [Mus musculus] Length = 1156 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 36/341 (10%), Positives = 92/341 (26%), Gaps = 21/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K L G +++ +D Sbjct: 140 LADAAENFQKAHRWQDNIKEEDIMYYDAK-ADAELDDPESEDMERGSKTSALRLDFIEDP 198 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L IE ++ + + + Sbjct: 199 NFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSAT 257 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + P + S + + + + ++ S Sbjct: 258 GVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 317 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T Sbjct: 318 CEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQGMVAKGTTG 373 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG N Sbjct: 374 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 420 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N +++++ +V L+ ++ G +F + + Sbjct: 421 NRTVRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|291290989|ref|NP_001167520.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 4 [Mus musculus] gi|187957756|gb|AAI58059.1| Cacna2d2 protein [Mus musculus] Length = 1147 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 36/341 (10%), Positives = 92/341 (26%), Gaps = 21/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K L G +++ +D Sbjct: 140 LADAAENFQKAHRWQDNIKEEDIMYYDAK-ADAELDDPESEDMERGSKTSALRLDFIEDP 198 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L IE ++ + + + Sbjct: 199 NFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSAT 257 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + P + S + + + + ++ S Sbjct: 258 GVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 317 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T Sbjct: 318 CEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQGMVAKGTTG 373 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG N Sbjct: 374 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 420 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N +++++ +V L+ ++ G +F + + Sbjct: 421 NRTVRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|148689241|gb|EDL21188.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_b [Mus musculus] Length = 1149 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 36/341 (10%), Positives = 92/341 (26%), Gaps = 21/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K L G +++ +D Sbjct: 141 LADAAENFQKAHRWQDNIKEEDIMYYDAK-ADAELDDPESEDMERGSKTSALRLDFIEDP 199 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L IE ++ + + + Sbjct: 200 NFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSAT 258 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + P + S + + + + ++ S Sbjct: 259 GVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 318 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T Sbjct: 319 CEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQGMVAKGTTG 374 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG N Sbjct: 375 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 421 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N +++++ +V L+ ++ G +F + + Sbjct: 422 NRTVRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 462 >gi|40737976|gb|AAR89454.1| voltage-gated calcium channel alpha2-delta2 subunit [Mus musculus] Length = 1186 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 36/341 (10%), Positives = 92/341 (26%), Gaps = 21/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K L G +++ +D Sbjct: 179 LADAAENFQKAHRWQDNIKEEDIMYYDAK-ADAELDDPESEDMERGSKTSALRLDFIEDP 237 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L IE ++ + + + Sbjct: 238 NFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSAT 296 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + P + S + + + + ++ S Sbjct: 297 GVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 356 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T Sbjct: 357 CEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQGMVAKGTTG 412 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG N Sbjct: 413 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 459 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N +++++ +V L+ ++ G +F + + Sbjct: 460 NRTVRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 500 >gi|161353447|ref|NP_064659.2| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 2 [Mus musculus] gi|81892698|sp|Q6PHS9|CA2D2_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Protein ducky; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|38614142|gb|AAH56389.1| Cacna2d2 protein [Mus musculus] gi|148689240|gb|EDL21187.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_a [Mus musculus] Length = 1154 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 36/341 (10%), Positives = 92/341 (26%), Gaps = 21/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K L G +++ +D Sbjct: 140 LADAAENFQKAHRWQDNIKEEDIMYYDAK-ADAELDDPESEDMERGSKTSALRLDFIEDP 198 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L IE ++ + + + Sbjct: 199 NFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSAT 257 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + P + S + + + + ++ S Sbjct: 258 GVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 317 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T Sbjct: 318 CEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQGMVAKGTTG 373 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG N Sbjct: 374 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 420 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N +++++ +V L+ ++ G +F + + Sbjct: 421 NRTVRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|291290991|ref|NP_001167521.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform 5 [Mus musculus] gi|26336631|dbj|BAC31998.1| unnamed protein product [Mus musculus] Length = 1148 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 36/341 (10%), Positives = 92/341 (26%), Gaps = 21/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K L G +++ +D Sbjct: 140 LADAAENFQKAHRWQDNIKEEDIMYYDAK-ADAELDDPESEDMERGSKTSALRLDFIEDP 198 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L IE ++ + + + Sbjct: 199 NFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSAT 257 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + P + S + + + + ++ S Sbjct: 258 GVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 317 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T Sbjct: 318 CEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQGMVAKGTTG 373 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG N Sbjct: 374 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 420 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N +++++ +V L+ ++ G +F + + Sbjct: 421 NRTVRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|15553133|gb|AAL01650.1|AF247141_1 voltage-dependent calcium channel alpha-2-delta-2 mutant subunit 2 [Mus musculus] Length = 1084 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 36/341 (10%), Positives = 92/341 (26%), Gaps = 21/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K L G +++ +D Sbjct: 68 LADAAENFQKAHRWQDNIKEEDIMYYDAK-ADAELDDPESEDMERGSKTSALRLDFIEDP 126 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L IE ++ + + + Sbjct: 127 NFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSAT 185 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + P + S + + + + ++ S Sbjct: 186 GVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 245 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T Sbjct: 246 CEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQGMVAKGTTG 301 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG N Sbjct: 302 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 348 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N +++++ +V L+ ++ G +F + + Sbjct: 349 NRTVRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 389 >gi|12044402|gb|AAG47846.1|AF247139_1 voltage-dependent calcium channel alpha-2-delta-2 subunit [Mus musculus] Length = 1156 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 36/341 (10%), Positives = 92/341 (26%), Gaps = 21/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K L G +++ +D Sbjct: 140 LADAAENFQKAHRWQDNIKEEDIMYYDAK-ADAELDDPESEDMERGSKTSALRLDFIEDP 198 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L IE ++ + + + Sbjct: 199 NFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSAT 257 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + P + S + + + + ++ S Sbjct: 258 GVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 317 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T Sbjct: 318 CEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQGMVAKGTTG 373 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG N Sbjct: 374 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 420 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N +++++ +V L+ ++ G +F + + Sbjct: 421 NRTVRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|26006175|dbj|BAC41430.1| mKIAA0558 protein [Mus musculus] Length = 1098 Score = 69.9 bits (169), Expect = 6e-10, Method: Composition-based stats. Identities = 36/341 (10%), Positives = 92/341 (26%), Gaps = 21/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K L G +++ +D Sbjct: 88 LADAAENFQKAHRWQDNIKEEDIMYYDAK-ADAELDDPESEDMERGSKTSALRLDFIEDP 146 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L IE ++ + + + Sbjct: 147 NFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSAT 205 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + P + S + + + + ++ S Sbjct: 206 GVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 265 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T Sbjct: 266 CEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQGMVAKGTTG 321 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG N Sbjct: 322 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 368 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N +++++ +V L+ ++ G +F + + Sbjct: 369 NRTVRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 409 >gi|226310161|ref|YP_002770055.1| hypothetical protein BBR47_05740 [Brevibacillus brevis NBRC 100599] gi|226093109|dbj|BAH41551.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 477 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 70/201 (34%), Gaps = 25/201 (12%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL- 255 S A + ++D E+ S+ + + G+ + + + L Sbjct: 181 SGSMAAKSNGKTRMDAAKEAIQAFAESLPEQANVALRVYGHKGSGKESDKTLSCGSSELV 240 Query: 256 ----SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + N ++ LN+ P T ++ A ++L + + + + Sbjct: 241 YGMQTYNKEKLTQSLNQFQPTGYTPIAYSLQEAKKDLS--------KLPGDKNTNMIFLV 292 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMK--IYSVAVSAPPEGQDLLRKCT-DSSGQFF 368 +DG + + ++ + + + + I + GQ L++ + G++ Sbjct: 293 SDGIET------CDGDPVEAAKQLAQSEITPIINVIGFGVDGPGQQQLKEVAKAAGGRYV 346 Query: 369 AVNDSRELLESF---DKITDK 386 + D +EL + F +I +K Sbjct: 347 LIQDQKELQDEFNRGKEIANK 367 >gi|238796988|ref|ZP_04640492.1| Flp pilus assembly protein TadG [Yersinia mollaretii ATCC 43969] gi|238719248|gb|EEQ11060.1| Flp pilus assembly protein TadG [Yersinia mollaretii ATCC 43969] Length = 536 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 40/501 (7%), Positives = 98/501 (19%), Gaps = 107/501 (21%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 I + + + + +++ + + ++ A++ A L+ +D + Sbjct: 37 FVIFLPIFIGLLFLSFEISQFLQKKAKLSDAIEQATLALTVENNDIPDANQSQKNRDLVT 96 Query: 62 TIFKKQIKKH-------------------LKQGSYIRENAGDIAQKAQINITKDKNNPLQ 102 + + + A I+ + + Sbjct: 97 HYATAYLPSEKFSTPIIDISNNKGHLLYKAETTMSYPAQFLANSPLANTKISIADSGAAR 156 Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 + + + S I+ + +L Sbjct: 157 KDVAVGPSELTDVVFVVDYSSSMINPFHGTYGSRSKIDELRDIFYKLNGNILKNDNINTI 216 Query: 163 LYLQKHNDNNNMT-----SNKYLLPPP--------------------------PKKSFWS 191 ++ + S Y P ++ Sbjct: 217 GFIPFSWGIKKIVGTGQQSKTYCHFPYAPIKHKSTGDYLRQYTASNLKQFLAPENFNYVD 276 Query: 192 KNTTKSKYAPAPAPANRKI-DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 KI D + S NL N + + + Sbjct: 277 NIEYGELSNRNNRVNYLKIKDEISHSKSNLANEFMIKTHYINKYYIISNILT-SNIDYDK 335 Query: 251 QCTPLSNNLNEVKSRLNKL-------------------------------NPYENTNTYP 279 +S + ++ + P T Sbjct: 336 TINLMSKKYKSIDIPIDDVLDGNICLSSSTTYSLEFNKVSDESITESLATEPVGLTLVSS 395 Query: 280 AMHHAYRELYNEKESS----HNTIGSTRLKKFVIFITDGE------------NSGASAYQ 323 + A + + F DG + Sbjct: 396 GILAANNLFKEANDKNKKLMIVLSDGEDSDNTTTFDKDGNLIAVDDYIKNDEDRKPFRIT 455 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL------LRKCTDSSGQFFAVNDSRELL 377 L +CE + +++ +A+ P KC F+ D+ EL Sbjct: 456 KNLIDKGMCEAIAANKIRMVFIAIGYTPVNDAYSPTYIDWEKCVGKD-NFYLAKDAHELE 514 Query: 378 -ESFDKITDKIQEQSVRIAPN 397 + + + R P Sbjct: 515 ADLQQALGGENIRDVGRNTPK 535 >gi|313225343|emb|CBY06817.1| unnamed protein product [Oikopleura dioica] Length = 321 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 51/158 (32%), Gaps = 18/158 (11%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNT 298 Y+ + + + +N++ NT T A+ A ++ Sbjct: 176 IAQYSDNPRLEFGLNEHYDFPSLNAAVNRMKYKGGNTATGKALTFALDHVF-------GR 228 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 +K V+ ITDGE+ + + +R G++I+S+ V + L+ Sbjct: 229 SSRPNAQKVVLIITDGESLQDT-------VTEPARRLRENGVEIFSIGVG-DEINLEELK 280 Query: 359 KCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVRI 394 F V + ++ I VR+ Sbjct: 281 DMATDPDSNHVFQVGGYNAITGITTQVLKDICRIKVRV 318 >gi|187736265|ref|YP_001878377.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] gi|187426317|gb|ACD05596.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] Length = 328 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 35/195 (17%), Positives = 61/195 (31%), Gaps = 42/195 (21%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ V+ RIG + + PL+ + V S + Sbjct: 115 NRLVAAKHVITQFVD---------SRPDDRIGIVGFAGKTKSF--CPLTLDHALVNSIIR 163 Query: 268 KLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +P + T A+ A L + KE+ K +I +TDG ++ Sbjct: 164 DFHPRMIQADGTAIGSAIAAAATRLDDRKET---------KSKIIILVTDGASNSGQIS- 213 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPE------------GQDLLRKCTD-SSGQFFAV 370 L E G+KIY++AV + LRK + G+ F Sbjct: 214 ----PLVAAENAAKLGIKIYTIAVGTEEGTLANGMVVQSEFDEPTLRKIAQLTGGEHFRA 269 Query: 371 NDSRELLESFDKITD 385 + ++F I Sbjct: 270 TNMASFNKAFTSIGK 284 >gi|332654605|ref|ZP_08420348.1| von Willebrand factor, type A [Ruminococcaceae bacterium D16] gi|332516569|gb|EGJ46175.1| von Willebrand factor, type A [Ruminococcaceae bacterium D16] Length = 472 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 62/176 (35%), Gaps = 22/176 (12%) Query: 219 NLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 +L++ + Q+ + S R+G ++++ V + S + +K+ ++ L+ +TN Sbjct: 82 DLIDEATDSSQDGQIGSGSRMGVVSFSNTAVADTQLITSVD--ALKAAVDNLSAGGSTNH 139 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A A + ++ K ++ TDG + + + Sbjct: 140 ADAFAKAIQLFDPASANA----------KVMVMFTDGNTTIGAPPAPVAAAARA------ 183 Query: 338 AGMKIYSVA-VSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQ 390 G+ IY + + + L ++ +L E F ++ I + Sbjct: 184 QGIIIYCIGLIGSDGLDITALNDWATDPDASHVAVTPNAADLEELFAELAANISKP 239 >gi|332665830|ref|YP_004448618.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332334644|gb|AEE51745.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 630 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 32/199 (16%), Positives = 67/199 (33%), Gaps = 22/199 (11%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 A K+ ++ S L ++ R+ + Y + NN Sbjct: 268 VSGSMSAANKLPLVQASYKLLAEQLRPQD--------RVAIVVYAGAAGLVLESTTGNNK 319 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++K ++KL +T + AY+ N VI +DG+ + Sbjct: 320 TKIKEAIDKLQAGGSTAGGEGILLAYKTAKENFIKGGNNR--------VILASDGDFNVG 371 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLE 378 + L ++I E R +G+ + + + ++K DS G +++ L E Sbjct: 372 VSSDGEL--VRIIEEERKSGVYLTILGYGMGNYKDNKMQKLADSGNGNHAYIDN---LDE 426 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + + IA + Sbjct: 427 ARRVLVSEFGGTMYTIAKD 445 >gi|308270599|emb|CBX27211.1| hypothetical protein N47_A12400 [uncultured Desulfobacterium sp.] Length = 330 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 65/195 (33%), Gaps = 40/195 (20%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPA 280 ++K + + G +PL+ + + S ++K+ ++T A Sbjct: 116 VKKVVADFIKERETDRIGLVVFGQEAFTQSPLTMDKGLLLSLVDKMEIGMAGDSTAIGNA 175 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + A + L K +I +TDG ++ G+ Sbjct: 176 IAVAGKRL----------KDLKAKSKIMIILTDGRSNTGDITPEEAAGAAA-----ALGI 220 Query: 341 KIYSVAVSAPPE--------------------GQDLLRKCTDSS-GQFFAVNDSRELLES 379 KIY++ V + L++ G++F DS+EL Sbjct: 221 KIYTIGVGGTGPAPFKVNTFFGPRIVNQSVDLDEKTLKEIAAIGKGKYFRATDSKELANI 280 Query: 380 FDKITDKIQEQSVRI 394 + +I +K ++ V++ Sbjct: 281 Y-EIINKAEKTEVKV 294 >gi|242065788|ref|XP_002454183.1| hypothetical protein SORBIDRAFT_04g026250 [Sorghum bicolor] gi|241934014|gb|EES07159.1| hypothetical protein SORBIDRAFT_04g026250 [Sorghum bicolor] Length = 703 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 31/316 (9%), Positives = 91/316 (28%), Gaps = 31/316 (9%) Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSL 133 + + G +A Q+ + + L + + P + + + S + Sbjct: 152 NQVRLPQQDGYMALLRQVPNRQREGPVLVTSEPADFNDDEPLQKMEAANIGSSRTVEIKT 211 Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 S + S ++ + L + + + + P Sbjct: 212 YSEFSAIQQSSQDDFAVLIHLKAPYANPEQVTGRSVSATS--------VGYPTSRAPVDL 263 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 T + + A K+ +L + G ++ + + + +++ + + Sbjct: 264 VTVLDVSGSMAG--TKLALLKRAMGFVIQHLGPSDRLSVIAFSSTARRLFHLRRMSH--- 318 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 + + +N L TN A+ A + + + +I ++D Sbjct: 319 ---SGRQQALQAVNSLGASGGTNIADALKKAAKVIED--------RSHQNPVCSIILLSD 367 Query: 314 GENSGASAYQ----NTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 G+++ + + + N + ++ + L S G Sbjct: 368 GQDTYNIPSNIRGARPDYSSLVPSSILNHTFRLVPVHGFGFGVDHDSDALHSIAEASGGT 427 Query: 367 FFAVNDSRELLESFDK 382 F + D + ++F + Sbjct: 428 FSFIEDEGVIQDAFAQ 443 >gi|325927536|ref|ZP_08188772.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas perforans 91-118] gi|325542075|gb|EGD13581.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas perforans 91-118] Length = 501 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 39/293 (13%), Positives = 76/293 (25%), Gaps = 32/293 (10%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV---LDVSRSMEDLY 164 Y L L P+ + V L + +D Sbjct: 54 TGSYSNVRRFLNAGSLPPADAVRVEELINYFRYDDPAPTNGQPFAVRTELATTPWNKDSL 113 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 LPP A K+ +L S LV + Sbjct: 114 -LLRVGIAGRDIATADLPPANLVFLVD--------VSGSMDAPDKLPLLQSSLKLLVRQL 164 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 + RI + Y I + + ++ L +T + A Sbjct: 165 RAQD--------RITLVTYAGNISVVLPPTPGDQQGRIVEAIDALQSGGSTAGASGIELA 216 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 Y+ ++ TDG+ + N + R +G+ + + Sbjct: 217 YKAAQQGYLRGGINR--------ILLATDGDFNVGV--TNFDQLKGMVAEKRRSGIALST 266 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + +L+ + D+ +A D+ LE+ +T ++ IA + Sbjct: 267 LGFGTGNYNDNLMEQLADAGDGAYAYIDTA--LEARKVLTHELGATLATIARD 317 >gi|170076505|ref|YP_001733144.1| hypothetical protein SYNPCC7002_G0035 [Synechococcus sp. PCC 7002] gi|169887367|gb|ACB01075.1| conserved hypothetical protein (von Willebrand factor type A domain) [Synechococcus sp. PCC 7002] Length = 420 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 59/187 (31%), Gaps = 20/187 (10%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+D ++ + + + R+ ++ + + L+ + + + Sbjct: 59 NKLDYAKQAIAYAIEQLLPSD--------RLSLTLFDTQVETKIPSTLATDKQRLLETIK 110 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + +T + ++ + H VI ++DG + + + Sbjct: 111 LIRSGSSTALHDGWVQGGIQVGQYLNNDHLNR--------VILLSDGLANVGETNPD-VI 161 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDK 386 + M+ G+ ++ V +DLL G FF + +L + F+ Sbjct: 162 ASDVHGLMK-TGISTSALGVGRD-YDEDLLEAIARSGDGNFFHIASPEDLPQIFETELQG 219 Query: 387 IQEQSVR 393 + R Sbjct: 220 LATTIGR 226 >gi|325919326|ref|ZP_08181363.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas gardneri ATCC 19865] gi|325550197|gb|EGD21014.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas gardneri ATCC 19865] Length = 520 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 39/364 (10%), Positives = 86/364 (23%), Gaps = 36/364 (9%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 + + + K + +N + I + N Sbjct: 3 AASIALESFVAQRAPAAKLVARRPMLDNLMPMPPILPAPADNRETYQTLSDNPIVQTAAN 62 Query: 100 PLQY-IAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV---L 154 P+ + Y L L P + V L Sbjct: 63 PVSTFSIDVDTGSYSNVRRFLSAGTLPPVDAVRVEELINYFRYDDPAPADGKPFAVRTEL 122 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + D LPP + K+ +L Sbjct: 123 APTPWNNDSL-LLRVGVAGRAIATADLPPANLVFLVD--------VSGSMESPDKLPLLQ 173 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 S LV ++ RI + Y + + ++ + + Sbjct: 174 SSLKLLVRQLRAKD--------RITLVTYAGNTAVVLPPTPGDQQGRIIEAIDTVQSGGS 225 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T + AY+ ++ TDG+ + + L + Sbjct: 226 TAGASGIELAYKAAQQGYLRGGINR--------ILLATDGDFNVGVTDFDQL--KGMVAE 275 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVR 393 R +G+ + ++ L+ + D+ G + ++ E + +T ++ Sbjct: 276 KRRSGVALSTLGFGTGNYNDTLMEQLADAGDGAYAYIDSPLEARKV---LTHELGATLAT 332 Query: 394 IAPN 397 IA + Sbjct: 333 IARD 336 >gi|311063719|ref|YP_003970444.1| cell surface protein [Bifidobacterium bifidum PRL2010] gi|310866038|gb|ADP35407.1| Cell surface protein with gram positive anchor domain [Bifidobacterium bifidum PRL2010] Length = 1176 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 40/374 (10%), Positives = 108/374 (28%), Gaps = 46/374 (12%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + + T + + I E++ + ++ + Sbjct: 502 TATGTKDGETRTTDSNGVITLKAGQYAVLLGSDAKRITESSKYKVTEINVDQDTYAVSAN 561 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE-NLAISICMVLDVSRSM 160 + + + T G +P ++ + + + N + + L+V+ + Sbjct: 562 GGQVKVTQEKDSATTEPVSVGEVPRTTVTNTVVTAPRYRKYIKANNDGTYDLSLNVTGTQ 621 Query: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL 220 + + + S P N +++V + ++ Sbjct: 622 SGSSQTTVSPADIVVVFDT-----------------SGSMSNPMGHNSRLEVAKTAVNSM 664 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT--- 277 + + + K+ ++R+ + + + ++N ++ S +N L TN Sbjct: 665 AQHLLTSENQGKDSNIRMALVPF--STTAGNVSNFTDNAMDIVSAVNGLGADGGTNWEAA 722 Query: 278 --------YPAMHHAYREL------YNEKESSHNTIGSTRLKKFVIFITD------GENS 317 + + +S G+ + D G + Sbjct: 723 LKAANAKLTSGRKGVKKYIVFMSDGDPTFRTSSVRTGTDWWGRPTYDDDDRRGLPAGVHG 782 Query: 318 GASAYQNTLNTLQICEYMRNAG-MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 S+ Q N G ++SV VS+ P + + G +++ + EL Sbjct: 783 SGSSDQYGANLSSAVAEANRRGDATLFSVGVSSDPT--KMRGFADQTKGSYYSATSTDEL 840 Query: 377 LESFDKITDKIQEQ 390 ++F I +I + Sbjct: 841 NKAFADIIGQINRK 854 >gi|297816770|ref|XP_002876268.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] gi|297322106|gb|EFH52527.1| zinc finger family protein [Arabidopsis lyrata subsp. lyrata] Length = 672 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 71/202 (35%), Gaps = 36/202 (17%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS----NNLNEVK 263 K+ +L + G ++ ++ + R+ IA+ + PL+ + Sbjct: 257 TKLALLKRAMGFVIQNLGSSD--------RLSVIAF--SSTARRLFPLTRMSDAGRQQAL 306 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +N L TN + + + + + +E + +I ++DG ++ + + Sbjct: 307 QAVNSLVANGGTNIFDGLRKGAKVMEDRRERNSVAS--------IILLSDGRDTYTTNHP 358 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK- 382 + + + + ++S + + + S G F + + ++ + Sbjct: 359 DPSYKAMLPQ------IPVHSFGFGSDHDASVMHSVSEFSGGTFSFIESESVIQDALAQC 412 Query: 383 ----ITDKIQEQSVRI---APN 397 ++ +QE V I PN Sbjct: 413 IGGLLSVAVQELRVEIEGVCPN 434 >gi|254776723|ref|ZP_05218239.1| hypothetical protein MaviaA2_18931 [Mycobacterium avium subsp. avium ATCC 25291] Length = 309 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 54/160 (33%), Gaps = 19/160 (11%) Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P + L KL+ ++T T A+ A + G T Sbjct: 122 TPYLLVPPTPQHQATIDALKKLDFADSTATGEAIFTALHAISATA----VAGGDTPPPAR 177 Query: 308 VIFITDGENSGASAYQNTLN-TLQICEYMRNAGMKIYSVAVSAPPE-------------G 353 ++ ++DG + S + + ++ G+ I ++ Sbjct: 178 IVLLSDGGENKPSNPSDPHDGVYTAARLAKDEGVPISTITFGTKGGEIEMDGQKVAVPVS 237 Query: 354 QDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D ++ S GQ + + EL +S++ I ++I ++V Sbjct: 238 TDQMKMVAKLSGGQSYTATNLGELQKSYNAIENEIGYRTV 277 >gi|296168869|ref|ZP_06850541.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295896486|gb|EFG76136.1| von Willebrand factor [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 327 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 55/160 (34%), Gaps = 19/160 (11%) Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P + L KL+ ++T T A+ A + T G T Sbjct: 140 TPYLLVPPTPQHQATIDALKKLDFADSTATGEAIFTALHAVSATA----ITGGDTPPPAR 195 Query: 308 VIFITDGENSGASAYQNTLN-TLQICEYMRNAGMKIYSVAVSAP-------------PEG 353 ++ ++DG + S + + R+ G+ I +++ P Sbjct: 196 IVLLSDGRENKPSNPSDPHDGVYTAARLARDEGVPISTISFGTKTGEIEMDGQRVAVPVS 255 Query: 354 QDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D ++ S GQ + + EL +S++ I I ++V Sbjct: 256 TDQMKTIAKLSGGQSYTAGNLAELNKSYNAIEKDIGYRTV 295 >gi|47208832|emb|CAF90336.1| unnamed protein product [Tetraodon nigroviridis] Length = 443 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 39/281 (13%), Positives = 75/281 (26%), Gaps = 28/281 (9%) Query: 109 AQYEIPTENLFLK-GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 A F + P T + LA VL V S Sbjct: 140 ASCPKNLVPTFSSAPVAPLPTIPSPPNPTVQLTTPPAPLATITPEVLPVETSAPATLS-- 197 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 T + + + + + + + L + + Sbjct: 198 -----VSTFTTTVSAVETETDSSCLSRPLDLVFIIDSSRSVRPSEFEKVKIFLADMVDT- 251 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYR 286 + + R+ + Y + N +K ++++ P T T A+ A Sbjct: 252 -LDVGADATRVAVVNYASTVKTEFLLKDHFNKPNLKKAISRIEPLATGTMTGLAIKTAVS 310 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 E + E+ + + K I +TDG ++ R +G++IY+V Sbjct: 311 EAFTEQSGARPRPRNI--AKVAIIVTDGRPQD--------QVEEVSAAARASGVEIYAVG 360 Query: 347 VSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 V L+ F V +L F + Sbjct: 361 V--DRADMRSLQLMASVPLEDHVFYVETYGVIEKLTSKFRE 399 >gi|282879637|ref|ZP_06288368.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] gi|281306585|gb|EFA98614.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] Length = 332 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 55/192 (28%), Gaps = 37/192 (19%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YEN 274 +A N+ A+ + T L N L V++ + + Sbjct: 112 AAKNVAAEFISGRPNDNIGLTIFAGEAFTQCPMTTDHTSLLNLLRNVRTDIAARGLISDG 171 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T + +A L S K VI +TDG N+ + + Sbjct: 172 TAVGMGLANAVSRL----------KDSKTKSKVVILLTDGSNNMGDIS-----PMTSAQI 216 Query: 335 MRNAGMKIYSVAVSAP--------------------PEGQDLLRKCTD-SSGQFFAVNDS 373 ++ +++Y++ V L + G F+ ++ Sbjct: 217 AKSLDIRVYTIGVGTNKVAPYPMSVGGGTQYINIPVEIDSKTLSDIAAVTEGNFYRATNN 276 Query: 374 RELLESFDKITD 385 ++L + + I Sbjct: 277 QQLKQIYKDIDK 288 >gi|258651542|ref|YP_003200698.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258554767|gb|ACV77709.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 681 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 57/182 (31%), Gaps = 25/182 (13%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL-----SNNLNEVK 263 +ID + +L+ ++ Q + G Q + N + Sbjct: 78 RIDAAKAAVTDLLGTLPAPTQVGLMVYGTSTGSTDAERAAGCQDIKTLAPVGTLNAATLT 137 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 S++ + T A+ A + L NE + ++ ++DGE++ Sbjct: 138 SQVAGITASGYTPIGNALRAAAQALPNEGP------------RSIVLVSDGEDT-----C 180 Query: 324 NTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESF 380 + + G+ +++V +D L + G + ++ L ++ Sbjct: 181 APPAPCDVARELHEQGVDLTVHTVGFKVDATARDQLSCVAQATGGTYSDAGNATGLTDAL 240 Query: 381 DK 382 Sbjct: 241 QA 242 >gi|222612967|gb|EEE51099.1| hypothetical protein OsJ_31818 [Oryza sativa Japonica Group] Length = 600 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 26/273 (9%), Positives = 78/273 (28%), Gaps = 33/273 (12%) Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 + T L + + DV + + Sbjct: 92 GFSTTTSPLVRAPRLFGDRRQGGGAASWASDVGGGTPFERTSAVLVHARDPTGVADGDDA 151 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 + T + + K+ +L ++ G +++++ R+ I+++ Sbjct: 152 EAQRAPLDLVTVLDVSGSMVG--NKLALLKQAMGFVIDNLGPGD--------RLCVISFS 201 Query: 245 IG-IVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G + + +++ K + L+ TN A+ A + L + Sbjct: 202 SGASRLMRLSRMTDAGKAHAKRAVGSLSARGGTNIGAALRKAAKVLDD--------RLYR 253 Query: 303 RLKKFVIFITDGENSG-----ASAYQNTLNTLQICEYMRNAGM--------KIYSVAVSA 349 + VI ++DG+++ ++ + + A +++ Sbjct: 254 NAVESVILLSDGQDTYTVPPRGGYDRDANYDALVPPSLVRADAGGGGGRAPPVHTFGFGK 313 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + G F + + + + F + Sbjct: 314 DHDAAAMHTIAEVTGGTFSFIENEAAIQDGFAQ 346 >gi|218661390|ref|ZP_03517320.1| von Willebrand factor type A [Rhizobium etli IE4771] Length = 370 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 26/198 (13%), Positives = 58/198 (29%), Gaps = 20/198 (10%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 K+ +L S LVN ++ + + Y + Sbjct: 10 VSGSMDELDKLPLLKSSFRLLVNRLKADDT--------VAIVTYAGNAGTVLEPTRVSEK 61 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +++ S +++L +T + AY V+ TDG+ + Sbjct: 62 SKILSAIDRLEAGGSTGGAEGIEAAYDLAQKAFVKDGVNR--------VMLATDGDFNVG 113 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + +I E R G+ + + L++ + + L E+ Sbjct: 114 --PSSDEDLKRIIEEKRKEGIFLTVLGFGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEA 169 Query: 380 FDKITDKIQEQSVRIAPN 397 + D+ IA + Sbjct: 170 QKTLVDEAGSTLFPIAKD 187 >gi|146276888|ref|YP_001167047.1| hypothetical protein Rsph17025_0838 [Rhodobacter sphaeroides ATCC 17025] gi|145555129|gb|ABP69742.1| hypothetical protein Rsph17025_0838 [Rhodobacter sphaeroides ATCC 17025] Length = 563 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 37/380 (9%), Positives = 88/380 (23%), Gaps = 96/380 (25%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHL 72 +A+D+ R +Q A+D ++ + ++ K K Sbjct: 52 GGFAVDVMSFEAKRTDLQQAVDRC-------ALTAAALAQTRDPEEVVEDCMLKAGKADY 104 Query: 73 KQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS 132 +T ++ E + PT+ L ++ Sbjct: 105 --------------------VTLIDHDEGLNYREVVVTAQQPTKPL-FAHMLGIDSLTAP 143 Query: 133 LRSTGIIERSSENLAISICMVLDVSRSM------EDLYLQKHNDNNNMTSNKYLLPPPPK 186 + + + I MVLDVS SM K+ +L Sbjct: 144 AATKA----EQKVTNVEIVMVLDVSGSMVRDSYSRPTDKLKNLKAAAKEFVDTMLAKDLN 199 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK------------AIQEKKNL 234 + +K ++ + ++ + ++ Sbjct: 200 HRISIAIVPYNGQVNLGKSLRQKFNIYDNNGVTYMDCVDMPASVYASTGLSRTLKMPMTA 259 Query: 235 SVRIGTIAYNIGIVGNQCTPL-----------------------------SNNLNEVKSR 265 + + A++ G + +++ +++ Sbjct: 260 NADTFSAAFSNARTGGTPPDTYSGPKTSEAQPTPGNRWCLPTAANVVRLPTGSISSLQAS 319 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKE-----------------SSHNTIGSTRLKKFV 308 ++ L T+ M L + K + Sbjct: 320 IDGLEGNGATSINAGMKVGLSLLDPSARPMFSEFVGSGEIQSYFHGRPFDYTDEEVMKVM 379 Query: 309 IFITDGENSGASAYQNTLNT 328 I +TDGE+ + Sbjct: 380 IVMTDGEHFEEERVNDGYRV 399 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 54/173 (31%), Gaps = 4/173 (2%) Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 + + + R Y + N + S +P + T + + Sbjct: 391 ERVNDGYRVGDSPIYRSSDGEYLSLKLSNGKFYWPYDNTTTNSAAKGKSPTQLT--WQQV 448 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL--QICEYMRNAG 339 +YR Y + G + + +T++ +C ++ Sbjct: 449 WASYRTSYIAWQLYSRRPGFRSSDRLAAYTAQMNAFRTLTPISTMDAQLQALCNLAKSNN 508 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + I+ +A AP G+ ++ C+ S + E+ +F I +I + Sbjct: 509 VTIFGIAFEAPANGKTQIQNCSTSRSSHYFDASGLEIQTAFRAIASQISYLRL 561 >gi|32475925|ref|NP_868919.1| hypothetical protein RB9502 [Rhodopirellula baltica SH 1] gi|32446468|emb|CAD76304.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 368 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 52/136 (38%), Gaps = 23/136 (16%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T A+ + E + K +I +TDG ++ + + + Sbjct: 195 MAGPRTAFGDAIGLGVNLFDEDTERA----------KTIIALTDGNDTK-----SKVPPV 239 Query: 330 QICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 + +KIY+VA+ P + L+ + G++F D L +D+ Sbjct: 240 EAARVATQRDIKIYTVAIGDPTTVGEDKLDEQSLKDVASETGGKYFFAADREHLAGIYDE 299 Query: 383 ITDKIQEQSVRIAPNR 398 + DKI+ Q+++ +R Sbjct: 300 L-DKIETQTIQTISHR 314 >gi|307153048|ref|YP_003888432.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306983276|gb|ADN15157.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 413 Score = 69.9 bits (169), Expect = 7e-10, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 50/133 (37%), Gaps = 15/133 (11%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 +LN V ++ L T + +E + +G + +TDGE Sbjct: 98 DLNTVIEQIKALRAAGGTAIDEGLKLGIKE---------SALGKQERVSQIFLLTDGE-- 146 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS-SGQFFAVNDSREL 376 + + + L++ + + + + ++ QD+L K DS G + + + Sbjct: 147 --NEHGDNERCLKLAQVASDYNITLNTLGFG-NHWNQDVLEKIADSAGGSLSYIENPEKA 203 Query: 377 LESFDKITDKIQE 389 LE F ++ + Q Sbjct: 204 LEEFSRLFSRAQS 216 >gi|13529371|gb|AAH05429.1| Matn2 protein [Mus musculus] Length = 956 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + +++S+ + + R+G + Y+ + ++ E+K Sbjct: 667 GEENFETVKHFVTGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLRGFSSAKEMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + ST++ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQVEGARP--PSTQVPRVAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPIDKHLFYAEDFSTMGEISEK 829 Query: 383 ITDKIQE 389 + + I E Sbjct: 830 LKEGICE 836 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ RN G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRIIMIVTDGRPQDS--------VAEVAAKARNTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 198 --QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 >gi|297727663|ref|NP_001176195.1| Os10g0464900 [Oryza sativa Japonica Group] gi|22758314|gb|AAN05518.1| hypothetical protein [Oryza sativa Japonica Group] gi|31432565|gb|AAP54180.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|255679473|dbj|BAH94923.1| Os10g0464900 [Oryza sativa Japonica Group] Length = 646 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 21/192 (10%), Positives = 65/192 (33%), Gaps = 31/192 (16%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG-IVGNQCTPLSN-NLNEVK 263 K+ +L ++ G +++++ R+ I+++ G + + +++ K Sbjct: 187 VGNKLALLKQAMGFVIDNLGPGD--------RLCVISFSSGASRLMRLSRMTDAGKAHAK 238 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG----- 318 + L+ TN A+ A + L + + VI ++DG+++ Sbjct: 239 RAVGSLSARGGTNIGAALRKAAKVLDD--------RLYRNAVESVILLSDGQDTYTVPPR 290 Query: 319 ASAYQNTLNTLQICEYMRNAGM--------KIYSVAVSAPPEGQDLLRKCTDSSGQFFAV 370 ++ + + A +++ + + + G F + Sbjct: 291 GGYDRDANYDALVPPSLVRADAGGGGGRAPPVHTFGFGKDHDAAAMHTIAEVTGGTFSFI 350 Query: 371 NDSRELLESFDK 382 + + + F + Sbjct: 351 ENEAAIQDGFAQ 362 >gi|332706285|ref|ZP_08426352.1| hypothetical protein LYNGBM3L_16440 [Lyngbya majuscula 3L] gi|332354933|gb|EGJ34406.1| hypothetical protein LYNGBM3L_16440 [Lyngbya majuscula 3L] Length = 413 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 53/132 (40%), Gaps = 13/132 (9%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 NL+++K ++N+L T + +EL K+ V +TDGE Sbjct: 99 NLDQIKRKINRLGADGGTAIDEGLKLGVKELIKAKQ---------DTVSQVFLLTDGE-- 147 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + N + +++ E + I S+ A L + ++G + + + L Sbjct: 148 --NEHGNNESCIKLAELAAENNLTINSLGFGANWNQDILEKIADIATGSLSYIEEPEQAL 205 Query: 378 ESFDKITDKIQE 389 F ++ +++Q Sbjct: 206 SEFARLFNRMQS 217 >gi|326424188|ref|NP_762140.2| aerotolerance operon protein BatA [Vibrio vulnificus CMCP6] gi|319999572|gb|AAO07130.2| BatA (Bacteroides aerotolerance operon) [Vibrio vulnificus CMCP6] Length = 362 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 40/295 (13%), Positives = 99/295 (33%), Gaps = 34/295 (11%) Query: 101 LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM 160 + Y + + + F L+ + S ++ + + + + +L VS Sbjct: 31 VYYFVPAYRTKQTAIKVPFFHQLVEAMGETPSEGASQLTPSGWQRATLVLSWLLVVSALA 90 Query: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL 220 + L ++ + ++ S + + + A ++D E + Sbjct: 91 KPTILGAPQIRESLGRDVMVVVD------LSGSMAEQDFTSASGAKISRLDATKEVLADF 144 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNT 277 K R+G I + TP + + + ++ + ++T+ Sbjct: 145 A---------KTRQGDRLGLILFGDAAFVQ--TPFTADQKVWLALLNQTDVAMAGQSTHL 193 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ A + + S S +K I +TDG ++G + + + + + Sbjct: 194 GDAIGLAIKVFEQSEPSQAAF--SKPRQKVAIVLTDGNDTG-----SFVEPIDAAKVAKA 246 Query: 338 AGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 G++I+ +A+ P + + S G+ F + EL ++D I Sbjct: 247 KGVRIHVIAMGDPSTVGESALDLQTIERIASESGGKAFQALNRDELASAYDDIGK 301 >gi|190149857|ref|YP_001968382.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|307263180|ref|ZP_07544801.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 13 str. N273] gi|189914988|gb|ACE61240.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 7 str. AP76] gi|306871542|gb|EFN03265.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 13 str. N273] Length = 530 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 61/487 (12%), Positives = 128/487 (26%), Gaps = 123/487 (25%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKD---------------PTTKKD 58 +++ A I+ + ++ +L+ AVLS A S R D + Sbjct: 36 FVSLESAGIIQDQARLSDSLEQAVLSLTAENNSGRKANDYKLGGSNPNDDSFNISSEVGK 95 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + I +K L Q + + N I +N T K + + ++ Sbjct: 96 RDHAIVTTFVKTFLPQTNDDKMNL--IPVCKTVNNTSGKGHTSSSEVTCTVSGTVEHKSW 153 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENL---AISICMVLDVSRSMEDLYLQKHNDNNNM- 174 F + + + + +N I + +V D+S SM Sbjct: 154 FPLKVGNLEVIPKQVDVASKSKAFKKNTFNIPIDLMVVADLSGSMNFDLDNNETKKTGKP 213 Query: 175 ------------TSNKYLLPPPPKKSFWSKNTTKS-----KYAPAPAPA----------- 206 ++K LL + T + P Sbjct: 214 SKISILKEVLVELADKTLLSEDANQHNRIYVTPFALGAEINKNSCALPYSWDIESSSKIE 273 Query: 207 ------------NRKIDVLIESAGNLVNSIQKAIQEK---KNLSVRIGTIAYNIGIVGNQ 251 + D++ + N++ T NQ Sbjct: 274 NIKKILNKENSQYNRADLINNLVYRISTKETLNNINGKQKYNVTFPKNTFCLKDMKTSNQ 333 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 ++ ++ S + + T + A + K+ S + K+ ++ + Sbjct: 334 GWYTRSDKSKFTSYVQSIKASGATLASSGVLVAANNMI--KDGSRTEQLKEQTKRVILVL 391 Query: 312 TDGENSGASAYQNTL--------------------------------------------- 326 +DG + + N+ Sbjct: 392 SDGNDEIIKSDPNSKVPFLNYTRITENLIYGKQEVFLSQKQKISLSLSHSTIETYLTDTQ 451 Query: 327 ---NTLQICEYMRNA--------GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 T +C+ +R+ KI V + + + C +G +++ ND Sbjct: 452 PRNETDGMCKVIRDRLDTLNNDKNTKIVFVEFGYASKAKQAWQHCV-GNGNYYSANDKES 510 Query: 376 LLESFDK 382 LL SF + Sbjct: 511 LLNSFKQ 517 >gi|125559999|gb|EAZ05447.1| hypothetical protein OsI_27661 [Oryza sativa Indica Group] Length = 704 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 16/174 (9%), Positives = 56/174 (32%), Gaps = 6/174 (3%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K+ +L + G +++ + + + + +YN V + KS + Sbjct: 247 KLTLLKRAMGFVIDKLGPGD------RLAVVSFSYNAQRVIRLTRMSDDGKASAKSAVES 300 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L TN + A + + + + +G +++ ++ Sbjct: 301 LAAGGGTNILKGLVEAAKVFDGRRYRNAVASVILLSDGQDTYNVNGGWGASNSKNYSVLV 360 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + +++ + + ++ G F + + + ++F + Sbjct: 361 PPSFKRSGDRRLSVHTFGFGTDHDAAAMHAIAEETGGTFSFIENQAVVQDAFAQ 414 >gi|288942712|ref|YP_003444952.1| Pyrrolo-quinoline quinone [Allochromatium vinosum DSM 180] gi|288898084|gb|ADC63920.1| Pyrrolo-quinoline quinone [Allochromatium vinosum DSM 180] Length = 1059 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 33/281 (11%), Positives = 81/281 (28%), Gaps = 43/281 (15%) Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 ++ +LD S SM+D N+ T + + + T + N Sbjct: 49 NVLFILDESGSMDDYNRMTDLKNSMKTLLDNPMMDHVQAAIMGYTTANTDIYYNNRGWNT 108 Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +V A L + + + +++ + L + SR + Sbjct: 109 YANVNGSLAIKLHSDFKTVGEGSNRSAMKAIVDGLSPRASTPTVWALKLGMEWFDSRATQ 168 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + T + + + ++ + ++ +TDGE + L + Sbjct: 169 IE---GTPSDRTRATSPTYSSPMSDPAY-----WCVPNHLVLLTDGEPNSNDPNYYGLTS 220 Query: 329 LQ-----------------ICEYMR---------------NAGMKIYSVAVSA--PPEGQ 354 + E N + +++A+ + Sbjct: 221 YKGTTCVKDSTSIEEKGRCAAEIAAWGYNNDLRTDSVWEDNQNVTTHTIALGDSLGSNDK 280 Query: 355 DLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 L+ G ++ +++ LL +F+ I D + + I Sbjct: 281 AFLKNIAIKGGGGYYEADNASTLLNAFETIVDDAKSEVEYI 321 >gi|322435250|ref|YP_004217462.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] gi|321162977|gb|ADW68682.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 515 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 58/496 (11%), Positives = 125/496 (25%), Gaps = 104/496 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASI-VSDRTIKDPTTKKDQ 59 + +++ V I +D+ HI + ++Q++ DAA L+G I + Sbjct: 22 IAGLMMFVITAMIGLVVDVGHIYLCQRELQASSDAAALAGAEIIPTATTAAAVYAKATAY 81 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 +ST + K++ + + PL Y S + L+ Sbjct: 82 SSTTGAANVYKNMTNITMVSGYPILKCLSTMQTQGISCVGPLSYN--SIQVMQQAVVPLY 139 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN-----NM 174 +I + +S ST + + ++ +VLD + S N + Sbjct: 140 FARIIGRSSMTISATSTAAKG-GASSRPYNVALVLDTTYSEISYDSDCGNSQMLCTLQGV 198 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAP-APANRKIDVLIESAGNLVNSIQKAIQEKKN 233 L P T+ ++ + + + Sbjct: 199 QILLNQLDPCGTSVTTCSVTSGQATNSVVRVGIFTFPQMVTSTVSSDYDCSSNTPANTVY 258 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNL------NEVKSRLNKLNPYENTNTYPAMHHAYRE 287 G Y + ++ + N + ++ A Sbjct: 259 TFPIPGAGTYAPSSSTYRVLDFQSDYRVGDTSTSLNQASNLTKAVGGFSGCTGIYPATSA 318 Query: 288 LYNEKESSHNTIGSTRL--------------------KKFVIFITDGENS---------- 317 + +++ G+ + +I I DG + Sbjct: 319 SQSNFQATSGQYGTYYPSTIYAAQSSLIHEQTLFPDSQNVMIIIGDGNATAPQTNNGYPV 378 Query: 318 -----------------------GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 S + + + G ++Y+VA +P G Sbjct: 379 MSTTASVSATPGASTLAGTSSGLYPSWNGECGQAITAANFATSQGTRVYTVAYGSPSAGC 438 Query: 355 D--------------------LLRKCTDSSGQFFA-------------VNDSRELLESFD 381 + + +S FF+ EL + F Sbjct: 439 ASDQSGQGKIPGLYPNVLPCNEMAQMASASYYFFSDFKQSGSGSVCTAAQVMVELSDIFL 498 Query: 382 KITDKIQEQSVRIAPN 397 +I + R+ PN Sbjct: 499 QIAGDL--TVARLIPN 512 >gi|320158501|ref|YP_004190879.1| BatA [Vibrio vulnificus MO6-24/O] gi|319933813|gb|ADV88676.1| BatA [Vibrio vulnificus MO6-24/O] Length = 362 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 39/291 (13%), Positives = 96/291 (32%), Gaps = 34/291 (11%) Query: 105 AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY 164 + + + F L+ + S ++ + + + + +L VS + Sbjct: 35 VPAYRTKQTAIKVPFFHQLVEAMGETPSEGASQLTPSGWQRATLVLSWLLVVSALAKPTI 94 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 L ++ + ++ S + + + A ++D E Sbjct: 95 LGAPQIRESLGRDVMVVVD------LSGSMAEQDFTSASGANISRLDATKEVLAEFA--- 145 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAM 281 K R+G I + TP + + + ++ + ++T+ A+ Sbjct: 146 ------KTRQGDRLGLILFGDAAFVQ--TPFTADQKVWLALLNQTDVAMAGQSTHLGDAI 197 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 A + + S S +K I +TDG ++G + + + + + G++ Sbjct: 198 GLAIKVFEQSEPS--QAASSKPRQKVAIVLTDGNDTG-----SFVEPIDAAKVAKAKGVR 250 Query: 342 IYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 I+ +A+ P + + S G+ F + EL ++D I Sbjct: 251 IHVIAMGDPSTVGESALDLQTIERIASESGGKAFQALNRDELASAYDDIGK 301 >gi|290769918|gb|ADD61688.1| putative protein [uncultured organism] Length = 570 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 37/352 (10%), Positives = 103/352 (29%), Gaps = 32/352 (9%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLK---QGSYIRENAGDIAQKAQINITKDKNNPLQYIAE 106 I + T+ +L+ + + +N+ + + Sbjct: 62 IMQESATNMATADCESGFYDDYLQCPDTEEWNTNEYSYTEENPWMNVQTSPLSTFAADVD 121 Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 + + +I + + PS + + + + + + + + Sbjct: 122 TASYTQIRSAIENGYDIDPSMVRIEEMLNYFHYDYPLPKDDEKFAVYTEYMDCPWNEDTK 181 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 + N + + P F + K+ ++ ++ L ++ + Sbjct: 182 LALVSMNTQAIDFKSAPASNLVFLID-------VSGSMFDDNKLPLVQQALTMLAENLTE 234 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 R+ + Y ++ +E+ S + L Y +TN + AY Sbjct: 235 KD--------RVSIVTYAGSDEVVLQGVSGDDYHEISSAIEGLEAYGSTNGSAGIETAYA 286 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 N VI TDG+ + + ++ +++G+ + + Sbjct: 287 LAKKYFIKGGNNR--------VILCTDGDLNVGL--TSEGQLEKLITEKKDSGVFLSTFG 336 Query: 347 VSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 V + L D G + ++ E ++ + D++ V +A + Sbjct: 337 VGYGNYKDNKLELLADKGNGNYAYIDSMFEAKKA---LVDELGANMVTVAKD 385 >gi|255570576|ref|XP_002526245.1| protein binding protein, putative [Ricinus communis] gi|223534439|gb|EEF36142.1| protein binding protein, putative [Ricinus communis] Length = 540 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 49/152 (32%), Gaps = 22/152 (14%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + R+ + ++ G + ++ ++ +N LN TN + A + L Sbjct: 134 TDRLSIVTFSGGANRLCPLRQTTGKSQEEFENLINGLNADGATNITAGLQTALKVLKGRS 193 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + +G ++ ++DGE + S + I++ Sbjct: 194 FNGERVVG-------IMLMSDGEQNAGSDATGVSVG----------NVPIHTFGFGINH- 235 Query: 353 GQDLLRKCT--DSSGQFFAVNDSRELLESFDK 382 L+ G F V + L ++F + Sbjct: 236 EPKGLKAIAHNSIGGTFSDVQNIDSLTKAFAQ 267 >gi|224097862|ref|XP_002311085.1| predicted protein [Populus trichocarpa] gi|222850905|gb|EEE88452.1| predicted protein [Populus trichocarpa] Length = 713 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 58/185 (31%), Gaps = 24/185 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ +L + G ++ ++ N + + + + + +N Sbjct: 279 TKLALLKRAMGFVIQNL------GSNDRLSVIAFSSTARRLFSLRRMSDAGRQHALQAVN 332 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L TN + + + +E + +I ++DG+++ + + Sbjct: 333 SLVANGGTNIAEGLRKGAKVMEERREKNPVAS--------IILLSDGQDTYTVSGSSGNQ 384 Query: 328 TLQICEYM--------RNAG--MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + NAG + +++ A + + S G F + + Sbjct: 385 PQPNYRLLLPLSIHGGDNAGFQIPVHAFGFGADHDASSMHSISEISGGTFSFIETEAVIQ 444 Query: 378 ESFDK 382 ++F + Sbjct: 445 DAFAQ 449 >gi|311268857|ref|XP_003132237.1| PREDICTED: collagen alpha-1(VII) chain-like [Sus scrofa] Length = 2945 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 57/162 (35%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR + Y+ + +V + +L+ NT T A+ H +++ Sbjct: 73 SAQGVRFAAVQYSDDPRTEFSLDTLGSGGDVIRAIRELSYKGGNTRTGAAILHVADQVFL 132 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 133 PQLARPG------VPKVCILITDGKSQD--------LVDAAAQRLKEQGVKLFAVGI--K 176 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 177 NADPEELKRIASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|296269770|ref|YP_003652402.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296092557|gb|ADG88509.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 315 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 27/220 (12%), Positives = 62/220 (28%), Gaps = 34/220 (15%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S A A ++ E+A V + + R + Sbjct: 84 DRATIIIAVDVSLSMEARDVAPNRLIAAKEAAQQFVRDLPERFNVGVVAFARTAAV---- 139 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + V + + L T+ A+ +A + S + Sbjct: 140 ------VISPTTDHAAVTNAIAGLTTRPGTSIGEAVFNAL----DSIRSFDREAATDPPP 189 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------------- 351 ++ ++DG+N+ + E NA + + ++A P Sbjct: 190 AAIVLLSDGDNTSGRP------VSEAIEAAANAKVPVSTIAYGTPDGYVMIDNRPVQVPV 243 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + G+ + + EL E + +I + Sbjct: 244 NKAALQELSEGTGGRAYTAESASELREVYQQIGTSLGYTI 283 >gi|149184581|ref|ZP_01862899.1| hypothetical protein ED21_27723 [Erythrobacter sp. SD-21] gi|148831901|gb|EDL50334.1| hypothetical protein ED21_27723 [Erythrobacter sp. SD-21] Length = 528 Score = 69.6 bits (168), Expect = 8e-10, Method: Composition-based stats. Identities = 39/316 (12%), Positives = 89/316 (28%), Gaps = 27/316 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ V F +D+A + ++Q A+D L+G S + + T+ Q Sbjct: 21 LVAMGAPVLFGSAGLGVDMAQYYMWKREIQYAVDQGALAGAWSRGNGDMGLEYKTRAKQE 80 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I + K +L S + T D + ++P Sbjct: 81 FYINLSETKDYLLTHSIELQ-------------TFDGTPDSAVYMHATVSAQLP-----F 122 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ + +++++ E + A LD S + + + Sbjct: 123 TKVMINEGMTIAVQARATWETQKQFTAC--LYSLDPSSTRTMWFNGGPTVDAACGVGARS 180 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES-----AGNLVNSIQ--KAIQEKKN 233 + + A N + + N+V+ + Sbjct: 181 NADNAIVTNGGSGAQNINWVVAGGTINDGAGAFVNAEVVENYDNMVDPWEGLTPPDNATP 240 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 +V G+ N Q ++ K++ + + + + P AY Y Sbjct: 241 RTVTCGSADANWQADEAQLDAITFKYYRGKNKNDAKSAGAISYSGPGSESAYDVTYATNV 300 Query: 294 SSHNTIGSTRLKKFVI 309 + T + + Sbjct: 301 GAMFTSEPNDYNQAPV 316 >gi|148676906|gb|EDL08853.1| matrilin 2, isoform CRA_b [Mus musculus] Length = 941 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + +++S+ + + R+G + Y+ + ++ E+K Sbjct: 667 GEENFETVKHFVTGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLRGFSSAKEMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + ST++ + I TDG + Sbjct: 722 VTHMKYMGKGSMTGLALKHMFERSFTQVEGARP--PSTQVPRVAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPIDKHLFYAEDFSTMGEISEK 829 Query: 383 ITDKIQE 389 + + I E Sbjct: 830 LKEGICE 836 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ RN G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRIIMIVTDGRPQDS--------VAEVAAKARNTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 198 --QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 >gi|120407045|ref|NP_058042.2| matrilin-2 [Mus musculus] Length = 937 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + +++S+ + + R+G + Y+ + ++ E+K Sbjct: 667 GEENFETVKHFVTGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLRGFSSAKEMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + ST++ + I TDG + Sbjct: 722 VTHMKYMGKGSMTGLALKHMFERSFTQVEGARP--PSTQVPRVAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPIDKHLFYAEDFSTMGEISEK 829 Query: 383 ITDKIQE 389 + + I E Sbjct: 830 LKEGICE 836 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ RN G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRIIMIVTDGRPQDS--------VAEVAAKARNTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 198 --QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 >gi|20136122|gb|AAM11539.1| matrilin-2 [Mus musculus] Length = 956 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + +++S+ + + R+G + Y+ + ++ E+K Sbjct: 667 GEENFETVKHFVTGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLRGFSSAKEMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + ST++ + I TDG + Sbjct: 722 VTHMKYMGKGSMTGLALKHMFERSFTQVEGARP--PSTQVPRVAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPIDKHLFYAEDFSTMGEISEK 829 Query: 383 ITDKIQE 389 + + I E Sbjct: 830 LKEGICE 836 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ RN G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRIIMIVTDGRPQDS--------VAEVAAKARNTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 198 --QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 >gi|62185620|gb|AAH92298.1| Matrilin 2 [Mus musculus] Length = 937 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + +++S+ + + R+G + Y+ + ++ E+K Sbjct: 667 GEENFETVKHFVTGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLRGFSSAKEMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + ST++ + I TDG + Sbjct: 722 VTHMKYMGKGSMTGLALKHMFERSFTQVEGARP--PSTQVPRVAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPIDKHLFYAEDFSTMGEISEK 829 Query: 383 ITDKIQE 389 + + I E Sbjct: 830 LKEGICE 836 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ RN G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRIIMIVTDGRPQDS--------VAEVAAKARNTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 198 --QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 >gi|74202868|dbj|BAE37504.1| unnamed protein product [Mus musculus] Length = 928 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + +++S+ + + R+G + Y+ + ++ E+K Sbjct: 667 GEENFETVKHFVTGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLRGFSSAKEMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + ST++ + I TDG + Sbjct: 722 VTHMKYMGKGSMTGLALKHMFERSFTQVEGARP--PSTQVPRVAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPIDKHLFYAEDFSTMGEISEK 829 Query: 383 ITDKIQE 389 + + I E Sbjct: 830 LKEGICE 836 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ RN G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRIIMIVTDGRPQDS--------VAEVAAKARNTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 198 --QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 >gi|7387906|sp|O08746|MATN2_MOUSE RecName: Full=Matrilin-2; Flags: Precursor gi|2072792|gb|AAC53163.1| matrilin-2 precursor [Mus musculus] Length = 956 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + +++S+ + + R+G + Y+ + ++ E+K Sbjct: 667 GEENFETVKHFVTGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLRGFSSAKEMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + ST++ + I TDG + Sbjct: 722 VTHMKYMGKGSMTGLALKHMFERSFTQVEGARP--PSTQVPRVAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPIDKHLFYAEDFSTMGEISEK 829 Query: 383 ITDKIQE 389 + + I E Sbjct: 830 LKEGICE 836 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ RN G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRIIMIVTDGRPQDS--------VAEVAAKARNTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 198 --QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 >gi|171741586|ref|ZP_02917393.1| hypothetical protein BIFDEN_00672 [Bifidobacterium dentium ATCC 27678] gi|171277200|gb|EDT44861.1| hypothetical protein BIFDEN_00672 [Bifidobacterium dentium ATCC 27678] Length = 1256 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 34/391 (8%), Positives = 99/391 (25%), Gaps = 56/391 (14%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 D + T + + + + ++ Sbjct: 446 KDTYPDADSHTFNVGDEVPYTFVVRNSGTTTLNNVAVNDPNITNVSCGTDTLAANQQTTC 505 Query: 102 QYIAESKAQ-YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI---SICMVLDVS 157 + + ++ S + + + Sbjct: 506 SGTLTLTEDMVDSEGHFTNTATASGTDDEGNAVNSPQASVTIKAIKPLGAPEKHKRIKKN 565 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 N ++ ++ + K+ L + Sbjct: 566 SDNTYTVNVDVTGAANSSTITTTQSVDFTLVLDVSSSMSDEMDSDQGSIK-KMTALKSAV 624 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIG-----------------IVGNQCTPLSNNLN 260 N + + ++ + +R+G + + +PL+ +++ Sbjct: 625 NNFLGEAAEINEQSGSELIRVGLVKFAGKESSKVGNETYTEGRFVYNYSQIVSPLTADMS 684 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++K++++ L T HA + + + K+ VIF TDG + S Sbjct: 685 DLKNKVSALRHNGATRADLGFKHASTVMSGARTDA---------KRVVIFFTDGTPTKVS 735 Query: 321 AYQNTLNTLQI--CEYMRNAGMKIYSVAVSAPPE--------GQDLLRKCT--------- 361 + + + + ++++G +YS+ V + + Sbjct: 736 DFDKDVANSAVTYAKSLKDSGATVYSIGVFDGANPSSIEEDQKNQFMNAVSSNYPHATAY 795 Query: 362 ------DSSGQFFAVNDSRELLESFDKITDK 386 ++G + V++ +L F+KI Sbjct: 796 DKLGTGSNAGYYKVVSNVSDLKSIFEKIQTN 826 >gi|71278376|ref|YP_269691.1| von Willebrand factor type A domain-containing protein [Colwellia psychrerythraea 34H] gi|71144116|gb|AAZ24589.1| von Willebrand factor type A domain protein [Colwellia psychrerythraea 34H] Length = 364 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 41/257 (15%), Positives = 86/257 (33%), Gaps = 43/257 (16%) Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK 198 + E A + + +D+S SM N L K S T S Sbjct: 85 APINQEKSARDLMIAVDLSGSMAVEDFTLPIATN-------ELTNRAKNDTDSSATKSST 137 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 ++ + V K R+G I + P +++ Sbjct: 138 NDTGKGEKVNRLVAVKHVLNAFV---------KSREHDRLGLILFGDAPYLQA--PFTDD 186 Query: 259 LNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + ++ LN+ + ++T A+ A S + +I +TDG Sbjct: 187 IATWQALLNESDIGMAGQSTAFGDAIGLAISVFQQ----------SDTQNRVLIVLTDGN 236 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFF 368 ++ + + ++ + +KIY++A+ P ++L+ + + G+ F Sbjct: 237 DTA-----SKVPPVEAAKVAAARDIKIYTIAIGDPSAVGEEKVDLEVLQAMAEITQGKSF 291 Query: 369 AVNDSRELLESFDKITD 385 +S ELL+ + +I Sbjct: 292 QALNSEELLKVYAEIDR 308 >gi|283455087|ref|YP_003359651.1| fimbriae protein with LPXTG motif and von Willebrand factor typeA domain [Bifidobacterium dentium Bd1] gi|283101721|gb|ADB08827.1| Fimbriae protein with LPXTG motif and von Willebrand factor typeA domain [Bifidobacterium dentium Bd1] Length = 1256 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 34/391 (8%), Positives = 99/391 (25%), Gaps = 56/391 (14%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 D + T + + + + ++ Sbjct: 446 KDTYPDADSHTFNVGDEVPYTFVVRNSGTTTLNNVAVNDPNITNVSCGTDTLAANQQTTC 505 Query: 102 QYIAESKAQ-YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI---SICMVLDVS 157 + + ++ S + + + Sbjct: 506 SGTLTLTEDMVDSEGHFTNTATASGTDDEGNAVNSPQASVTIKAIKPLGAPEKHKRIKKN 565 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 N ++ ++ + K+ L + Sbjct: 566 SDNTYTVNVDVTGAANSSTITTTQSVDFTLVLDVSSSMSDEMDSDQGSIK-KMTALKSAV 624 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIG-----------------IVGNQCTPLSNNLN 260 N + + ++ + +R+G + + +PL+ +++ Sbjct: 625 NNFLGEAAEINEQSGSELIRVGLVKFAGKESSKVGNETYTEGRFVYNYSQIVSPLTADMS 684 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++K++++ L T HA + + + K+ VIF TDG + S Sbjct: 685 DLKNKVSALRHNGATRADLGFKHASTVMSGARTDA---------KRVVIFFTDGTPTKVS 735 Query: 321 AYQNTLNTLQI--CEYMRNAGMKIYSVAVSAPPE--------GQDLLRKCT--------- 361 + + + + ++++G +YS+ V + + Sbjct: 736 DFDKDVANSAVTYAKSLKDSGATVYSIGVFDGANPSSIEENQKNQFMNAVSSNYPHATAY 795 Query: 362 ------DSSGQFFAVNDSRELLESFDKITDK 386 ++G + V++ +L F+KI Sbjct: 796 DKLGTGSNAGYYKVVSNVSDLKSIFEKIQTN 826 >gi|223462031|gb|AAI46869.1| Collagen, type XII, alpha 1 [Homo sapiens] Length = 1899 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 52/165 (31%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+I Y+ + + + L NT T A++ ++ Sbjct: 66 DIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ- 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + + +++ G++++++ + Sbjct: 125 ----NFRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI- 171 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 172 -KNADEVELKMIATDPDDTHAYNVADFESLSRIVDDLTINLCNSV 215 >gi|160858157|emb|CAP19998.1| collagen type VI alpha 5 [Homo sapiens] Length = 609 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 18/166 (10%), Positives = 59/166 (35%), Gaps = 16/166 (9%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHA 284 + VR+G + Y+ +N +++ + + T T A+ + Sbjct: 122 TEMFSIGPDKVRVGVVQYSDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYI 181 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + + N + + + ++I +TDG ++ ++ + +R + +++ Sbjct: 182 LQIIKNGMKDRMSK-----VPCYLIVLTDGMSTD--------RVVEPAKRLRAEQITVHA 228 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 V + + L++ + + L +++ +I + Sbjct: 229 VGIG--AANKIELQEIAGKEERVSFGQNFDALKSIKNEVVREICAE 272 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/120 (22%), Positives = 47/120 (39%), Gaps = 11/120 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHHAYREL 288 + V+ G + Y+ +N + + L + + NT T A+ HA Sbjct: 498 DVGRDRVQFGALKYSDQPNILFYLNTYSNRSAIIENLRKRRDTGGNTYTAKALKHANALF 557 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E H + +K+ +I ITDGE+ +T +RN G+ I++V V Sbjct: 558 TEE----HGSRIKQNVKQMLIVITDGESHDHDQLNDT------ALELRNKGITIFAVGVG 607 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 49/153 (32%), Gaps = 17/153 (11%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + +IG + ++ E+ +++++ E T T A++ Sbjct: 309 TKIQIGADKTQIGVVQFSDKTKEEFQLNRYFTQQEISDAIDRMSLINEGTLTGKALNFVG 368 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + K + KF+I ITDG +R + I+SV Sbjct: 369 QYFTHSKGARLGAK------KFLILITDGVAQDDVRDP--------ARILRGKDVTIFSV 414 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 V + L + + S F V + L Sbjct: 415 GV--YNANRSQLEEISGDSSLVFHVENFDHLKA 445 >gi|332970881|gb|EGK09858.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 442 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 32/227 (14%), Positives = 75/227 (33%), Gaps = 24/227 (10%) Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 N T + + P ++ S K+D+ ++ + + + Q Sbjct: 115 NEKTPDNKIDVPSIEQMNVEILLDASGSMAGRVDGGVKMDLAKQAIRAFASDVPEGAQVS 174 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPL-----SNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + G+ V Q L S + ++ + LN+ P T A+ A Sbjct: 175 LRVYGHKGSNQKKDKAVSCQSNELVYPLKSYDSSQFEQSLNQFKPTGWTPLASAIQAARE 234 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK--IYS 344 +L + V ++DG + + ++ + + +G++ + Sbjct: 235 DL----------KEWAGARNIVYVVSDGVETCGG------DPVREAKKLGESGIEPMVKI 278 Query: 345 VAVSAPPEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITDKIQEQ 390 + GQ L+K ++ G + V +L E + I+++ Sbjct: 279 IGFDVDDAGQQQLKKVAEAADGSYQTVTSGDDLKEYLKGEKESIKQE 325 >gi|301756400|ref|XP_002914037.1| PREDICTED: matrilin-2-like isoform 3 [Ailuropoda melanoleuca] Length = 957 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 68/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 668 GEENFEIVKQFVTGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLRNFNSAKDMKKA 722 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 723 VAHMKYMGKGSMTGLALKHMFERSFTQVEGARPL--STRVPRVAIVFTDGRAQDDVSEW- 779 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ + F D + E +K Sbjct: 780 -------ASKAQANGITMYAVGVGKAI--EEELQEIASEPTDKHLFYAEDFSTMGEISEK 830 Query: 383 ITDKIQE 389 + I E Sbjct: 831 LKKGICE 837 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 89 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 148 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 149 FSEAEGARPLREN--VLRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 198 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 199 --QVDLNTLKAIGSEPHEDHVFLVANFSQMESLTSVFQN 235 >gi|301756398|ref|XP_002914036.1| PREDICTED: matrilin-2-like isoform 2 [Ailuropoda melanoleuca] Length = 938 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 68/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 668 GEENFEIVKQFVTGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLRNFNSAKDMKKA 722 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 723 VAHMKYMGKGSMTGLALKHMFERSFTQVEGARPL--STRVPRVAIVFTDGRAQDDVSEW- 779 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ + F D + E +K Sbjct: 780 -------ASKAQANGITMYAVGVGKAI--EEELQEIASEPTDKHLFYAEDFSTMGEISEK 830 Query: 383 ITDKIQE 389 + I E Sbjct: 831 LKKGICE 837 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 89 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 148 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 149 FSEAEGARPLREN--VLRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 198 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 199 --QVDLNTLKAIGSEPHEDHVFLVANFSQMESLTSVFQN 235 >gi|301756396|ref|XP_002914035.1| PREDICTED: matrilin-2-like isoform 1 [Ailuropoda melanoleuca] Length = 957 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 68/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 668 GEENFEIVKQFVTGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLRNFNSAKDMKKA 722 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 723 VAHMKYMGKGSMTGLALKHMFERSFTQVEGARPL--STRVPRVAIVFTDGRAQDDVSEW- 779 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ + F D + E +K Sbjct: 780 -------ASKAQANGITMYAVGVGKAI--EEELQEIASEPTDKHLFYAEDFSTMGEISEK 830 Query: 383 ITDKIQE 389 + I E Sbjct: 831 LKKGICE 837 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 89 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 148 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 149 FSEAEGARPLREN--VLRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 198 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 199 --QVDLNTLKAIGSEPHEDHVFLVANFSQMESLTSVFQN 235 >gi|281350435|gb|EFB26019.1| hypothetical protein PANDA_001886 [Ailuropoda melanoleuca] Length = 942 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 68/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + +++S+ + + R+G + Y+ + N+ ++K Sbjct: 668 GEENFEIVKQFVTGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLRNFNSAKDMKKA 722 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + STR+ + I TDG + Sbjct: 723 VAHMKYMGKGSMTGLALKHMFERSFTQVEGARPL--STRVPRVAIVFTDGRAQDDVSEW- 779 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ + F D + E +K Sbjct: 780 -------ASKAQANGITMYAVGVGKAI--EEELQEIASEPTDKHLFYAEDFSTMGEISEK 830 Query: 383 ITDKIQE 389 + I E Sbjct: 831 LKKGICE 837 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 89 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 148 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 149 FSEAEGARPLREN--VLRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 198 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 199 --QVDLNTLKAIGSEPHEDHVFLVANFSQMESLTSVFQN 235 >gi|167034052|ref|YP_001669283.1| von Willebrand factor type A [Pseudomonas putida GB-1] gi|166860540|gb|ABY98947.1| von Willebrand factor type A [Pseudomonas putida GB-1] Length = 324 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 31/212 (14%), Positives = 72/212 (33%), Gaps = 44/212 (20%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y A + ++ + + + + RIG I + G PL+ Sbjct: 105 TDYTDANGAKSDRLSAVKSVVRDFIARRKD---------DRIGLIVFGTGAYPQA--PLT 153 Query: 257 NNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 + + L+++ NT A+ + L E K +I +TD Sbjct: 154 LDHASLLLLLDEVGIGMAGPNTALGDAIGLTIKALEKTPEQE----------KVLILLTD 203 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCT-DSSGQ 366 G ++ ++ + + G+ ++++ + P L+ + GQ Sbjct: 204 GNDTSSAITPD-----HAAHLAQANGIVVHTIGIGDPQATGDAKVDLTTLQAIARTTGGQ 258 Query: 367 FFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 FF +D + L + + + R+ P++ Sbjct: 259 FFRADDRQALQQVYATLD--------RLTPHK 282 >gi|310643461|ref|YP_003948219.1| protein [Paenibacillus polymyxa SC2] gi|309248411|gb|ADO57978.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2] Length = 696 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 67/174 (38%), Gaps = 20/174 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPL---SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 R+G +AYN +V ++ + ++++ + LN T+ + L Sbjct: 81 DRTRVGFVAYNHHVVASKPLTSIGVAAQKSQIQQEIRMLNRSGYTDLGLGLRKGSELLAA 140 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAY------QNTLNTLQICEYMRNAGMKIYS 344 S + F+I ++DGE + + + + + + G +Y+ Sbjct: 141 GA--------SQGRQPFMILLSDGETDFGVSSGSRSKGDSNNDVSSVIKSAQTKGYPVYT 192 Query: 345 VAVSAPPE-GQDLLRKCTD-SSGQFFAVNDSRELLESFDKI-TDKIQEQSVRIA 395 + ++ + L + + G F + + +L E ++I +I+ + V +A Sbjct: 193 IGLNHDGTVNRQELERIASQTGGASFITSSAEDLPEILNRIFASQIRSKLVPVA 246 >gi|47214619|emb|CAG01460.1| unnamed protein product [Tetraodon nigroviridis] Length = 1723 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 29/169 (17%), Positives = 63/169 (37%), Gaps = 23/169 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + V+I + Y+ +NL V + L TNT AM + + Sbjct: 469 DIGPDKVQISLVQYSRDPHTEFYLDSHHNLEAVVTALRTFPYRGGSTNTGRAMTYVRETV 528 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + I ITDG+ ++ + +RN+ ++I++V V Sbjct: 529 FQASRGARAH-----VPRVTILITDGK--------SSDAFQEPAANLRNSDVEIFAVGV- 574 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L ++ + V D ++F +I+ ++ + +RI Sbjct: 575 -KDAVRSELEAIANAPAETHVYTVEDF----DAFQRISTELTQSICLRI 618 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 55/168 (32%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 ++ V+IG Y+ + + L NT T A+++ Sbjct: 1247 SVFNIGPGRVQIGLAQYSGDPKTEWHLNAHPTKESLLDAVANLPYKGGNTMTGMALNYIL 1306 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + N +K + ITDG++ + + + +R+ +++Y++ Sbjct: 1307 Q-----NNFKTNVGMRPGARKIGVLITDGKSQDDVVFNS--------QNLRDNDIELYAI 1353 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + ++ LR + V D + L + +T + Sbjct: 1354 GI--KNADENQLRSIASDPEQIHMYNVRDFKFLADIVKNVTSNLCNSV 1399 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 15/182 (8%), Positives = 47/182 (25%), Gaps = 32/182 (17%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYP----AMHH-AY 285 R+G + Y + + + L T T Sbjct: 147 GQDRTRVGVVQYGSDARVEFRLDAHPSRPALLRAIGTLPYMGGDTRTGSRRSPGRCWLGS 206 Query: 286 RELYNEKESSHN--------------TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + + S K ++ +TD ++ Sbjct: 207 NYVVHPFSRSPGHALKFLLEKSLTEEAGARKDFPKVLVVVTD--------SKSVDPVENS 258 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 +R+AG++++ + V + +++ + ++V + + ++ ++ Sbjct: 259 AGRLRSAGVEVFVLGVG--QADEAEMKQIASTPYRNHVYSVATFQTIKSVQRELISQLCA 316 Query: 390 QS 391 Sbjct: 317 AV 318 >gi|85374101|ref|YP_458163.1| hypothetical protein ELI_06370 [Erythrobacter litoralis HTCC2594] gi|84787184|gb|ABC63366.1| hypothetical protein ELI_06370 [Erythrobacter litoralis HTCC2594] Length = 435 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 29/329 (8%), Positives = 84/329 (25%), Gaps = 29/329 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + Y +D+A ++ ++Q A+D A ++G S+ + T D + + +Q Sbjct: 22 ILALGLPALVGGAGYGLDMAQWYMLKRELQYAVDQAAVAGAYSLSYNGTAGDWSARAEQE 81 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + + + +P Sbjct: 82 YDANRSITTGYATANDSTKGVTDYGSFTQN-----------SVTVSATMDVSLP-----F 125 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ S T +++ S + E++ I N + Sbjct: 126 SSILLSTPTTINVNSQAMFEKTDGATGCMI----------ALKPNDTAITINGNVTINAP 175 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + + + L++ + V Sbjct: 176 CGMAVSSTSSISINKSGGSGSINPGWVYTGGGVNDKFQELIDKFNATAPAADQVEVNEDA 235 Query: 241 IAYNIGIVGNQCTPLSNNLNE--VKSRLNKLNPYENTNTYPAMHHAYR-ELYNEKESSHN 297 + ++ + ++ +++ + +H + + E +N Sbjct: 236 SGFVDPFASVTVPTVTGSGSKNCSTGKVSPGVFSGGIDVDCNLHFEPGVYVIDGGELKNN 295 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTL 326 T +F G+ + L Sbjct: 296 NKTITGSDVLFVFKNGGKVNFGGNSDAKL 324 >gi|224113057|ref|XP_002316375.1| predicted protein [Populus trichocarpa] gi|222865415|gb|EEF02546.1| predicted protein [Populus trichocarpa] Length = 714 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 16/178 (8%), Positives = 51/178 (28%), Gaps = 10/178 (5%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ +L + G ++ ++ N + + + + +N Sbjct: 280 TKLALLKRAMGFVIQNL------GSNDRLSVIAFSSTARRLFPLRRMSDTGRQHALQAVN 333 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIG---STRLKKFVIFITDGENSGASAYQN 324 L TN + + + + +E + S + + G + Sbjct: 334 ALVANGGTNIAEGLRKGAKVMEDRREKNPVASIILLSDGQDTYTVSGNGGNQPQPNYQLL 393 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 ++ + + +++ A + + S G F + + ++F + Sbjct: 394 LPVSIHGGDNA-GFQIPVHAFGFGADHDASSMHSISEISGGTFSFIETEAVIQDAFAQ 450 >gi|220910752|ref|YP_002486062.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219867524|gb|ACL47861.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 411 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 62/192 (32%), Gaps = 26/192 (13%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K++V ++A V + + R+ ++ + + L + + + Sbjct: 59 KMEVARQAACFAVEQLLPSD--------RLSVTIFDDRVECPVPSTLVRDKATIIRTIQG 110 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 ++ +T + ++ +H VI ++DG + + Sbjct: 111 IHSRGSTALHDGWVQGGIQVSQHLNPAHLNR--------VILLSDGLANVGETNPD---- 158 Query: 329 LQICEY---MRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKIT 384 I ++ + G+ ++ + G+DLL S G F+ + + +L F Sbjct: 159 -AIAQHVHGLAQRGVSTSTMGIG-EDYGEDLLEAMARSGAGSFYHIERTEQLAAIFQAEL 216 Query: 385 DKIQEQSVRIAP 396 + + Sbjct: 217 QGLMGTLGQTVS 228 >gi|310641811|ref|YP_003946569.1| von willebrand factor type a [Paenibacillus polymyxa SC2] gi|309246761|gb|ADO56328.1| von Willebrand factor type A [Paenibacillus polymyxa SC2] Length = 429 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 30/177 (16%), Positives = 71/177 (40%), Gaps = 23/177 (12%) Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIV-GNQCTPLSNN--LNEVKSRLNKLNP--YENTNT 277 + K++ + + R+ +++ TP+ + V S+++ + T Sbjct: 137 TAAKSLIGQMDGDKRVAIVSFESTAQLVQPFTPIGTDAEKQAVYSKIDSMQTIMSGGTEI 196 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ +E+ + G+ VI ++DG + L+T Sbjct: 197 GLALDETIKEIETQ--------GNAEKGSLVIMLSDG--------FSELDTQTALAPYIA 240 Query: 338 AGMKIYSVAVS-APPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + I ++ + A EG LL+ + G + V +++ L ++F KI +KI ++++ Sbjct: 241 RQIPINTIGLKLAESEGIALLQNIASLTGGTYSNVANAQGLTQAFGKIYNKIGDRTL 297 >gi|292624276|ref|XP_002665574.1| PREDICTED: collagen alpha-1(XXVIII) chain [Danio rerio] gi|225310547|emb|CAQ19234.1| collagen type XXVIII alpha 1 c precursor [Danio rerio] Length = 1170 Score = 69.6 bits (168), Expect = 9e-10, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 69/197 (35%), Gaps = 26/197 (13%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + +V+ + +L++ + + R+G + Y+ V + Sbjct: 809 SSESVGPDNYEVVKDFVNSLIDHVS-----VSREATRVGVVLYSHVEVVVASLQQLYDQA 863 Query: 261 EVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 VK+ + ++ T T A+ A + + ++K + +TDG Sbjct: 864 AVKTAVRRMPYLGEGTFTGSAIRRATQLFQAARPG---------VRKVAVVLTDGLADN- 913 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ-----DLLRKCTDSS--GQFFAVND 372 ++ ++ E +AG++I+ V + + Q + + + +D Sbjct: 914 ---RDAVSLKDAAEGAHSAGIEIFVVGIVNNSDSQYAEFKNEMNILASDPDENYVYLTDD 970 Query: 373 SRELLESFDKITDKIQE 389 +L ++ + I E Sbjct: 971 FLKLHALESRLLNHICE 987 >gi|332524448|ref|ZP_08400660.1| von Willebrand factor type A [Rubrivivax benzoatilyticus JA2] gi|332107769|gb|EGJ08993.1| von Willebrand factor type A [Rubrivivax benzoatilyticus JA2] Length = 343 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 22/194 (11%), Positives = 64/194 (32%), Gaps = 46/194 (23%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE---- 291 VR+G +++ Q S + +V + + + T + + ++ + Sbjct: 126 VRVGVVSFAGTAAVVQAPTTSRD--DVFAAIERFQLQRGTAIGSGIVLSLATIFPDAGID 183 Query: 292 -------------------KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 K + +I +TDG+ + + + Sbjct: 184 IQQITGQRTMPRMLGDPEKKAEFTPVPPGSYASAAMILLTDGQRTTG------PDPIDAA 237 Query: 333 EYMRNAGMKIYSVAVSAPPE--------------GQDLLRKCTD-SSGQFFAVNDSRELL 377 + + G+++Y+V V +D LR+ ++G++F + +L Sbjct: 238 KMAADRGIRVYTVGVGTTQGEIIGFEGWSMRVRLDEDTLRQIAQMTTGEYFYAGTAEDLK 297 Query: 378 ESFDKITDKIQEQS 391 + + ++ ++ + Sbjct: 298 KVYQRLGSRMVVER 311 >gi|326675074|ref|XP_003200270.1| PREDICTED: collagen alpha-1(XIV) chain-like [Danio rerio] Length = 164 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 29/162 (17%), Positives = 58/162 (35%), Gaps = 20/162 (12%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNT 298 + YN N N++ + + + T T A+ A L+ +K S Sbjct: 1 MVLYNDRPSAEFYLDTFANKNDIMNYIKIIPYRGGGTATGAALKFAQNNLFTQKRGSRKA 60 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 +G +K+ I +TDGE + + +R +G+ +Y++ V + L+ Sbjct: 61 LG---VKQIAIVMTDGE--------SEDDVTTTAAELRRSGVTVYALGV--KNASVEELK 107 Query: 359 KCTDSSGQ--FFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 K F V L + +++ ++ NR Sbjct: 108 KIGSYPEHEFVFNVGSFLML----SSLEKSLRKSLCKVVVNR 145 >gi|153871328|ref|ZP_02000529.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] gi|152072210|gb|EDN69475.1| von Willebrand factor type A domain protein [Beggiatoa sp. PS] Length = 280 Score = 69.6 bits (168), Expect = 1e-09, Method: Composition-based stats. Identities = 41/286 (14%), Positives = 89/286 (31%), Gaps = 26/286 (9%) Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 + + L + + S ++D+ + + + + +T Sbjct: 1 MEVYILQGESEVPMECVILNQYTFSQITHVPDKLAIIDIEYAYDKNGVVTVSATERVTGQ 60 Query: 178 KYLLPPPPKKSFW--SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK--KN 233 S + IDV G+ + ++A QE K+ Sbjct: 61 TLPKQAQIPDDLSWLSLPPSPESVTLVHQSVFLLIDVSYSMDGSALAEAKQAAQEFVRKS 120 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 G + L+ N + +N+L +TN + AY +L N + Sbjct: 121 DLAHTAIGLIEFGSKAKIISGLTQNAKHLYKAINRLKTNGSTNMTEGLTTAYLKLKNVDD 180 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +F+I +TDG + NT QI + + G+++ ++ Sbjct: 181 P-----------RFIILLTDGLPNH------PKNTQQIAQEICADGIELITIG--TGDAD 221 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE--QSVRIAPN 397 + L+ F + ++ +F +I + E ++I N Sbjct: 222 KTYLQSLACYDQNSFFAK-AGTMVSTFSRIAQVLTESGSYIQITQN 266 >gi|222528098|ref|YP_002571980.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] gi|222454945|gb|ACM59207.1| von Willebrand factor type A [Caldicellulosiruptor bescii DSM 6725] Length = 902 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 63/189 (33%), Gaps = 27/189 (14%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 K+++ ++ ++ ++ + IA++ + +V Sbjct: 425 GIPKLEIAKSASAKMIEHLESSDGVG--------VIAFDHNYYWAYKFGKISKKEDVIES 476 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++ + T + E+ S K ++ +TDG Sbjct: 477 ISSIEVGGGTAI----------IPPLSEAVKTLKKSKAKSKLIVLLTDGMGEQGGY---- 522 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 + +KI ++ V +L +SG+F+ V++ EL++ F K T Sbjct: 523 ---EIPANEAKRNNIKITTIGVGKYVNA-TVLSWIASFTSGRFYLVSNPSELVDVFLKET 578 Query: 385 DKIQEQSVR 393 I+ + ++ Sbjct: 579 KIIKGKYIK 587 >gi|77464595|ref|YP_354099.1| von Willebrand (VWA) domain-containing protein [Rhodobacter sphaeroides 2.4.1] gi|77389013|gb|ABA80198.1| Putative membrane protein with von Willebrand (VWA) domain [Rhodobacter sphaeroides 2.4.1] Length = 651 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 45/366 (12%), Positives = 104/366 (28%), Gaps = 29/366 (7%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 A + A+ + + + + + + ++ A + Sbjct: 141 RARSAEGAAPQTFAADEAMPMAAPPAPDLALSKQAAEAPARALPQGDSEAFANAPDNPLR 200 Query: 95 KDKNNPLQY-IAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 +P+ + A Y I +L L P + + Sbjct: 201 VTAEDPVSTFSIDVDTASYAILRSSLRAGQLPPREAVRIEEMINYFPYDYPAPENGTPPF 260 Query: 153 VLDVSRSMEDLY-LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 +S + + + PP + +T+ S PA K+ Sbjct: 261 RPTLSVTRTPWNPETQLVHVALQGRMPAIEDRPPLNLVFLIDTSGSMQDPA------KLP 314 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 +L +S G ++ ++ ++ + Y +N + + S L++L+ Sbjct: 315 LLKQSFGLMLGRLRPED--------QVAIVTYAGSAGEVLAPTAANQRSTILSALDRLDA 366 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 +T + AYR + T V+ TDG+ + + L L Sbjct: 367 GGSTAGEEGLALAYRTASEMAGAGEVTR--------VVLATDGDFNLGISDPEELARLVA 418 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 E R+ G+ + + ++ + L E+ + D++ Sbjct: 419 HE--RDTGIYLSVLGFGRGNLDDATMQALAQNGNG--QAAYIDSLNEAQKVLVDQLSGAL 474 Query: 392 VRIAPN 397 IA + Sbjct: 475 FPIADD 480 >gi|331090683|ref|ZP_08339532.1| hypothetical protein HMPREF9477_00175 [Lachnospiraceae bacterium 2_1_46FAA] gi|330400097|gb|EGG79748.1| hypothetical protein HMPREF9477_00175 [Lachnospiraceae bacterium 2_1_46FAA] Length = 3699 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 41/213 (19%), Positives = 75/213 (35%), Gaps = 37/213 (17%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPL--SNNLN 260 + K+ L + K + R+ + ++ Q SN ++ Sbjct: 219 NSQSKMGALKIAVNQFAQETAKRNDSITDAAKQHRMSIVTFSSESYIRQSLKAYNSNTVS 278 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 E + +N LN T M A L N +E + +K VIF TDG + Sbjct: 279 EFERTINGLNANGATYANLGMEKAKESLKNVREKA---------QKVVIFFTDGTPGRSG 329 Query: 321 AYQNTLN-TLQICEYMRNAGMKIYSVAVSAPPEGQ-----------------------DL 356 +T N T+Q + +++ KIYS+ V Sbjct: 330 FDDDTANNTIQAAKSLKDDLTKIYSIGVFDQANPDNTSSSFNAYMHGVSSNYPNATKWTE 389 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 L + ++S + A D+ EL + F++I +++ Sbjct: 390 LGERAENSNYYKAAQDADELNKIFEEIFEEMNS 422 >gi|324499530|gb|ADY39800.1| Transmembrane cell adhesion receptor mua-3 [Ascaris suum] Length = 3675 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 29/185 (15%), Positives = 61/185 (32%), Gaps = 13/185 (7%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + V V + + + R+G I Y+ I + V Sbjct: 1199 GSGSIGSYVFKHEVLRFVKEFVEL-FDIGLDNTRVGLIQYSDQIRHEFDLSQYRDKASVI 1257 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 +++++ T T A+ H E ++E+ + S + + I ITDG + Sbjct: 1258 QAISQVHYLTGLTRTGAAIQHMVMEGFSERRGARAE--SDDVARVAIVITDGRSQD---- 1311 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 N + R + ++++ V L S ++F V+ ++L Sbjct: 1312 ----NVTEPAIAARRLHVNMFAIGV-TDHVLASELESIAGSPSRWFYVDRFKDLDTRLRS 1366 Query: 383 ITDKI 387 + K Sbjct: 1367 LIQKA 1371 >gi|311253435|ref|XP_001924360.2| PREDICTED: collagen alpha-1(XIV) chain [Sus scrofa] Length = 1795 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 62/201 (30%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + + + V ++ K + ++ + + Sbjct: 1036 VDGSWSIGDENFNKITNFLYSTVGALDKIGADGT----QVAMVQFTDDPRTEFKLNTYKT 1091 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ H L+ + + K ++ ITDG + Sbjct: 1092 KETLLDAIKHISYKGGNTKTGKAIKHVRDNLFTAESGIRR-----GIPKVIVVITDGRSQ 1146 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I + M+ G I++V V L F V+D Sbjct: 1147 DDVN--------KISKEMQLDGYSIFAVGV--ADADYSELVSIGSKPSARHVFFVDDF-- 1194 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1195 --DAFKKIEDELITFVCETAS 1213 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 25/168 (14%), Positives = 59/168 (35%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + RIG Y+ + +EV + L NT T A+++ + Sbjct: 185 TAFNVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIF 244 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + T + K I ITDG++ + +R +G++++++ Sbjct: 245 ENSFKPEAGAR-----TGVSKIGILITDGKSQDDVIPPS--------RNLRESGVELFAI 291 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V ++ LR+ + V + + + +T + + Sbjct: 292 GV--KNADENELREIASEPDNTHVYNVAEFDLMHTVVESLTRTVCSRV 337 >gi|73974062|ref|XP_548552.2| PREDICTED: similar to matrilin 2 isoform a precursor [Canis familiaris] Length = 978 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y I +EV+ + ++ T T A+ +A Sbjct: 124 DISPDLTRVGLLQYGSTIKNEFSLKTFKKKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 183 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 184 FSEAEGARPLREN--VLRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 233 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 234 --QVDLNTLKAIGSEPHEDHVFLVANFSQMESLTSVFQK 270 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + +++S+ + + R+G + Y+ + ++ ++K Sbjct: 703 GEENFEIVKQFVAGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLGDFSSARDMKKA 757 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + +N E + S + + I TDG + Sbjct: 758 VAHMKYMGKGSMTGLALKHMFERSFNPVEGARPV--SPGVSRVAIVFTDGRAQDDVSAW- 814 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 R G+ +Y+V V ++ L++ + F D + E +K Sbjct: 815 -------ARRARAGGITMYAVGVGKAI--EEELQEIASEPTDKHLFYAEDFSTMGEISEK 865 Query: 383 ITDKIQE 389 + I E Sbjct: 866 LKKGICE 872 >gi|291399641|ref|XP_002716222.1| PREDICTED: collagen, type XXIX, alpha 1 [Oryctolagus cuniculus] Length = 2738 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 58/160 (36%), Gaps = 13/160 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYREL 288 + V+ G + Y+ +N + + L + + +T T A+ + Sbjct: 845 DVGRDRVQFGALRYSDDPDILFYLNTYSNRSAIIEHLRRRRDTGGSTFTAKALGRSATLF 904 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + H + +K+ +I ITDGE+ +T +RN G+ I +AV Sbjct: 905 EEQ----HGSRIKQNVKQMLIIITDGESHDRHLLNDT------ALKLRNKGITI--IAVG 952 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 Q+ L + V D +L + + + + + Sbjct: 953 VGKANQEELEAMAGNKENTIHVKDFDKLKDVYLPLQESMC 992 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 29/147 (19%), Positives = 51/147 (34%), Gaps = 17/147 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 +IG ++ E+ ++++ NT T A+ K Sbjct: 663 DKTQIGVAQFSDYNKEEFPLNKYFTQKEISDAIDRMLLITGNTLTGSALKFIDTYFTQSK 722 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + H +KKF+I ITDGE + + +R+ G+ I SV V Sbjct: 723 GARHG------VKKFLILITDGEAQD--------DVREPAVALRDKGVIILSVGV--YGA 766 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLES 379 + L + + F V + +L E Sbjct: 767 NRTQLEEISGDGSLVFHVENFEDLKEI 793 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 27/307 (8%), Positives = 86/307 (28%), Gaps = 31/307 (10%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS--LRSTGIIERSSENL 146 QI + + ++ + +F + + T L + S + Sbjct: 339 PQIAVLVTSRPSDDEVRDAALDLRLEGVTVFAVNIQGANSTQLEEIVSYPPGQSVSVMSS 398 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNM--TSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +K + + S Sbjct: 399 YADLGNYTTKFMKKLQNEIWAQVSTVAEQMELDKTGCVDTKEADIYFLIDGSS---SIRD 455 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +I + + ++ N VR+G + Y+ + ++ + ++ Sbjct: 456 KQFLQIKEFMLAVTDMFN--------IGPDKVRVGVVQYSNDRAVEFDIDVYHDNSVLRK 507 Query: 265 RLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + + T T A+ + +++ ++++ +++ +TDG+ Sbjct: 508 AIYNIKQLKGGTLTGKALDFILPIMKKGRKTR-----ASQVPCYLLVLTDGK-------- 554 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + L E +R + +++ + + L + + ++ L +I Sbjct: 555 SEDEVLGPAERIRAEQISTHAIGIGKAH--KKELLQIAGEEERVNFGQNTDALKSIKKEI 612 Query: 384 TDKIQEQ 390 I + Sbjct: 613 VHSICTE 619 >gi|114608142|ref|XP_001142833.1| PREDICTED: collagen alpha-1(XII) chain isoform 1 [Pan troglodytes] Length = 1899 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 52/165 (31%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+I Y+ + + + L NT T A++ ++ Sbjct: 66 DIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ- 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + + +++ G++++++ + Sbjct: 125 ----NFRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI- 171 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 172 -KNADEVELKMIATDPDDTHAYNVADFESLSRIVDDLTINLCNSV 215 >gi|170744425|ref|YP_001773080.1| hypothetical protein M446_6382 [Methylobacterium sp. 4-46] gi|168198699|gb|ACA20646.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 482 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 46/467 (9%), Positives = 108/467 (23%), Gaps = 94/467 (20%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A + I +D + + ++ A A++ + T K + Sbjct: 26 IVAFSLIPLIGMIGLGVDYGMAVGAKTKLDHAA------DAAALAAVVTAKAYVAANARN 79 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 ++ + + + + + + E+ Y T + Sbjct: 80 WNVWDIAVAEGQARAANAFAVNAGSVPFTHFALDPIQLTRSGQTFEATVTY-TATVSNNF 138 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN-NNMTSNKY 179 L T +S R+ + +++DVS SM + + Sbjct: 139 GPLFGIRTTAVSGRAVATTSV---PSYLDFYLLIDVSGSMGLPSTTDGQAQLAALNRIDF 195 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAP-APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 F Y A + + + NL+ + + Sbjct: 196 FKVYQQGCQFACHFPGFVGYDLAVFNNIQLRSGAVNVAVCNLIKRAAQPEVAN-----QY 250 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRL-----------------------NKLNPYENT 275 Y L+ + + + + N Sbjct: 251 RVGLYPFITQMGTLQDLTADTSALNLKAGCAASNPMVFTQLLDTGATQLDSNGDPSTGVG 310 Query: 276 NTYPAMHHAYRELYNE--KESSHNTIGSTRLKKFVIFITDGENS---------------- 317 + + + + + K FV ITDG + Sbjct: 311 SGGTHFETSLTSMLATIKANGYGDGSTQIKPKPFVFLITDGMQNNQWYSIQINGKRYYSG 370 Query: 318 ---------GASAYQNTLNTL----QICEYMRNAGMKIYSVAV----------------- 347 A+ + + C+ +R AG+ + + + Sbjct: 371 SPSKFSAYPDANWSPGGSDPVPMDWGYCDTLRQAGVTVSVLLIPYIKIDFTYVKSDIADE 430 Query: 348 -----SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 D+ R+C G F + + + D + K + Sbjct: 431 NNKVNGFSSGLPDVARQCAS-PGYFQMADTPEAIDRALDAMFMKATQ 476 >gi|297488656|ref|XP_002697119.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Bos taurus] gi|296474935|gb|DAA17050.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Bos taurus] Length = 1192 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 37/350 (10%), Positives = 97/350 (27%), Gaps = 24/350 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + T + +D + + L G A Sbjct: 176 LDRKVQALKRLADAAETFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSKAST 235 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L +E ++ + Sbjct: 236 LRLDFVEDSNFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTL 294 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 295 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 354 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 355 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 410 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + K ++ TDG N Sbjct: 411 GMVAKGTTGYKAGFEYAFDQLQNPNITRA------NCNKMIMMFTDGGEDRVQDVFEKYN 464 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N +++++ +V L+ ++ G +F + + Sbjct: 465 WP-------NRTVRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 507 >gi|169826904|ref|YP_001697062.1| hypothetical protein Bsph_1324 [Lysinibacillus sphaericus C3-41] gi|168991392|gb|ACA38932.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41] Length = 825 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 66/194 (34%), Gaps = 19/194 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + S L E +G IA++ T NN E Sbjct: 370 VIVLDRSGSMSGSKLELAKEAAARSVEMLRDEDTLGFIAFDDRPWEIIETGPLNNKEEAV 429 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + + P T Y ++ AY L + K K +I +TDG++ + Sbjct: 430 DTILSVTPGGGTEIYGSLAKAYENLADMKLQR----------KHIILLTDGQSQPGNYDD 479 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 + E ++ G+ + +VA+ +LL ++ G+F+ V D + + + Sbjct: 480 -------LIEQGKDNGITLSTVAIGQDA-DANLLEALSEMGSGRFYNVIDEQTIPSILSR 531 Query: 383 ITDKIQEQSVRIAP 396 T I + P Sbjct: 532 ETAMISRTYIEDNP 545 >gi|163801617|ref|ZP_02195515.1| hypothetical protein 1103602000597_AND4_09192 [Vibrio sp. AND4] gi|159174534|gb|EDP59336.1| hypothetical protein AND4_09192 [Vibrio sp. AND4] Length = 367 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 27/150 (18%), Positives = 59/150 (39%), Gaps = 18/150 (12%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + E+ ++ + ++T+ AM A + K Sbjct: 159 GDAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHLGDAMGLAIKVFEQSKSRIGVEENKE 218 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDL 356 K I +TDG ++G + + ++ + + G++I+ +A+ P Sbjct: 219 ---KVAIVLTDGNDTG-----SFVEPIEAAKVAKAKGVRIHVIAMGDPQTLGEAALDMKT 270 Query: 357 LRKCT-DSSGQFFAVNDSRELLESFDKITD 385 +R+ +S G+ F + EL +++D I Sbjct: 271 IRRIAKESGGKAFEAMNRDELAKAYDDIGR 300 >gi|152993581|ref|YP_001359302.1| von Willebrand factor A [Sulfurovum sp. NBC37-1] gi|151425442|dbj|BAF72945.1| von Willebrand factor type A domain protein [Sulfurovum sp. NBC37-1] Length = 305 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 33/197 (16%), Positives = 70/197 (35%), Gaps = 30/197 (15%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 P K DV+ E + + KK + RIG + + + + Sbjct: 105 PYKNKFDVVKEVVADFI---------KKRKNDRIGMVTFADVAFIASPLTFEKDFLTNIT 155 Query: 265 RLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + KL + T A+ AY + K S K +I +TDG ++ + Sbjct: 156 EMQKLGMAGKRTAINDALVQAYNLMSKSKAKS----------KIIILLTDGRDNMSKIPL 205 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAP-PEGQDLLRKCTDSS-GQFFAVNDSRELLESFD 381 + + + + +K+Y++ + P L+ + GQ +A + L + +D Sbjct: 206 SDVKHM-----IEKRDVKLYTIGIGGPRDYDAQYLKTLAKAGKGQAYAARSAAMLSKIYD 260 Query: 382 KITDKIQEQSVRIAPNR 398 +I + ++ + Sbjct: 261 EINKL---EVTKLDSKK 274 >gi|116329598|ref|YP_799317.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116332487|ref|YP_802204.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116122491|gb|ABJ80384.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116127354|gb|ABJ77446.1| BatA [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 312 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 48/141 (34%), Gaps = 24/141 (17%) Query: 254 PLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + + L + + T A+ + L S K ++ Sbjct: 143 PLTGDRESLNEILGTIEEETVAEQGTAIGDAIILSTYRL----------RASQARSKVIV 192 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG-----QDLLRKCTDSS 364 ITDG ++ + + + E++ G+KIYSV + + L + Sbjct: 193 LITDGVSNTGKI--DPVTATDLAEHI---GVKIYSVGIGKEDGSYEINFEILRELSASTG 247 Query: 365 GQFFAVNDSRELLESFDKITD 385 G+FF D E+ I Sbjct: 248 GKFFRAEDPEEMKAVLTSIDS 268 >gi|93141049|ref|NP_542376.2| collagen alpha-1(XII) chain short isoform precursor [Homo sapiens] Length = 1899 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 52/165 (31%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+I Y+ + + + L NT T A++ ++ Sbjct: 66 DIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ- 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + + +++ G++++++ + Sbjct: 125 ----NFRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI- 171 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 172 -KNADEVELKMIATDPDDTHAYNVADFESLSRIVDDLTINLCNSV 215 >gi|120554865|ref|YP_959216.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120324714|gb|ABM19029.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 339 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 66/199 (33%), Gaps = 47/199 (23%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL- 266 ++ + + ++ R+G I + PL+ + V++ L Sbjct: 114 NRLQAVKRVLDDFIDQ---------REGDRLGLILFGTEPYVQA--PLTFDRETVRTLLF 162 Query: 267 -NKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L T A+ + + L + ++ VI +TDG N+ + Sbjct: 163 EAGLGMAGRATAIGDAIGLSVKRLRERPQE----------QRVVILLTDGANTAGQVSPD 212 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD-SSGQ 366 E + AG+++Y++ + A ++LL + + + G+ Sbjct: 213 K-----ATEIAQAAGVRLYTIGIGADTMIQRGLLGSRRVNPSRDLDEELLTRMAEQTGGR 267 Query: 367 FFAVNDSRELLESFDKITD 385 +F EL +D I Sbjct: 268 YFRARSLPELEMIYDSINQ 286 >gi|297678516|ref|XP_002817116.1| PREDICTED: collagen alpha-1(XII) chain-like isoform 2 [Pongo abelii] Length = 1899 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 20/165 (12%), Positives = 52/165 (31%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V++ Y+ + + + L NT T A++ ++ Sbjct: 66 DIGPKRVQVALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ- 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + + +++ G++++++ + Sbjct: 125 ----NFRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI- 171 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 172 -KNADEVELKMIATDPDDTHAYNVADFESLSRIVDDLTINLCNSV 215 >gi|168705263|ref|ZP_02737540.1| hypothetical protein GobsU_37375 [Gemmata obscuriglobus UQM 2246] Length = 987 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 42/370 (11%), Positives = 93/370 (25%), Gaps = 41/370 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + F F+ A+DL + R Q Q+ D A L G ++ + + T Sbjct: 19 LLGVCLIGLFGFVALAVDLGMLAVSRTQSQNGADVAALVGTRTLNNRDGVAYNNLPAAVT 78 Query: 61 STIFKKQIKKHLKQG---------------SYIRENAGDIAQKAQINITKDKNNPLQYIA 105 + HL + Q+ + P Sbjct: 79 AAQASVTSNPHLSTNFVSGEVSKMEVGQYLYDPTSQTFQVQNWTQVTGGGAMSAPGGNSW 138 Query: 106 ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL 165 + ++ + + +T + I VLD++ SM Sbjct: 139 TAMRVTLGVSQPTYFMRVFGVNSMPSGAVATAVYR------PRDIAFVLDMTGSMAFSST 192 Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS-- 223 + +S+ + P P TT A + + + + Sbjct: 193 FNSGNAQGSSSDYQSMNPDPLVPKAGHYTTVQSRIVAADNLANSSNEALPRNNFTITTPG 252 Query: 224 ----IQKAIQEKKNLSVRIGTIAYNI--GIVGNQCTPLSNNLNEVKSRLNK---LNPYEN 274 ++ + N + G P + + +S + + P N Sbjct: 253 GPPIVRSYYYDPSNWGTPSTVASPVTTKGDGSANLLPAFHRWSPPESGADSDNYIAPTYN 312 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF---------ITDGENSGASAYQNT 325 Y A H + +T G+ + + + + + N Sbjct: 313 FAGYNAFHKGNETTPQGPTPAPDTYGTMTDASGLTYTGDRWRRRDGRIDKTTTDWSSTNN 372 Query: 326 LNTLQICEYM 335 + + Sbjct: 373 RAAYHAADLL 382 >gi|119569134|gb|EAW48749.1| collagen, type XII, alpha 1, isoform CRA_b [Homo sapiens] Length = 1899 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 52/165 (31%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + V+I Y+ + + + L NT T A++ ++ Sbjct: 66 DIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQ- 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 R +K + ITDG++ + + +++ G++++++ + Sbjct: 125 ----NFRTQAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI- 171 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L D +T + Sbjct: 172 -KNADEVELKMIATDPDDTHAYNVADFESLSRIVDDLTINLCNSV 215 >gi|254481548|ref|ZP_05094792.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214038176|gb|EEB78839.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 345 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 55/160 (34%), Gaps = 44/160 (27%) Query: 253 TPLSNNLNEVKSRLNKLNPYEN-----TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 +PLS + VK L L T A+ A + L + + Sbjct: 150 SPLSFDTATVKRFL--LEAQIGFAGQDTAIGDAIGLAVKRLKERPAEN----------RV 197 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP---------------- 351 +I ++DG+++ +++ L + + G++IY++ + A Sbjct: 198 LILLSDGKDTA-----SSVQPLNAAKLAADLGIRIYTIGIGADSLTMPGLFGSSFGARQV 252 Query: 352 -----EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L++ + G++F + EL + + Sbjct: 253 NPSAELDEAGLQQIAKITDGKYFRARNPEELANIYQLLDQ 292 >gi|289640775|ref|ZP_06472946.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] gi|289509351|gb|EFD30279.1| von Willebrand factor type A [Frankia symbiont of Datisca glomerata] Length = 319 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 28/219 (12%), Positives = 64/219 (29%), Gaps = 31/219 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 +++ S A A ++ + A ++ + RI + Sbjct: 85 ERATIILAIDVSNSMAATDVAPNRLAAAKDGADAFIDQLP----------PRINLGLVSF 134 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + + V+S ++ L +T + + + E G T Sbjct: 135 SGSAALLVPPTTDRQSVRSGIHGLQLGPSTAIGEGIFAGLQAITTAGEQ-LAADGGTPPP 193 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DGE R+AG+ + ++A Sbjct: 194 AAIVLLSDGETQRGRPNAQAAQAA------RDAGVPVDTIAYGTADGSLDVGGQEIPVPV 247 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +D LR+ + G + EL ++ + I + Sbjct: 248 NEDALREIARATDGSYHRAASGDELRSVYENLGSSIGYR 286 >gi|126733209|ref|ZP_01748956.1| hypothetical protein RCCS2_03619 [Roseobacter sp. CCS2] gi|126716075|gb|EBA12939.1| hypothetical protein RCCS2_03619 [Roseobacter sp. CCS2] Length = 632 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 27/266 (10%), Positives = 58/266 (21%), Gaps = 43/266 (16%) Query: 9 CFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQI 68 + A+D R ++QS D AVL+ + D + F + Sbjct: 80 MLVVGGMAVDFMRFESERTKLQSVADRAVLAAA-------NLNQEREAADVITDFFTAE- 131 Query: 69 KKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSAL 128 +I + + I + F L+ Sbjct: 132 -------------------GFGGSIVGTPSIQKNRNGSTIRLESIVDVDTFYLRLVGIDT 172 Query: 129 TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 + +T I + + +VLD+S SM ++ + P Sbjct: 173 LSAPANATAIEGT----GNVEVSLVLDISGSMGSRMTGDAYLYDSDGEIRLDPDGNPLTE 228 Query: 189 F------------WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 +K + + + + ++ Sbjct: 229 RRTEDRMFFLIQEANKFIGDLLKDEYRDRVSINLVAYSQHVRLGDDLYTALNTTPDSIDE 288 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEV 262 + I P + Sbjct: 289 DDNLGSSYGSITDGYTAPFTYTWVNA 314 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 18/140 (12%), Positives = 45/140 (32%), Gaps = 15/140 (10%) Query: 267 NKLNPYENTNTYPAM-HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 N L+ Y+ + + + ++ T ++ + + + + + Sbjct: 496 NGLDMYDTYAEWEGFRDNGGQSFVDDDGDPTTTWEYQQVNGRWVPSGNAHETIGTVDELN 555 Query: 326 LNTLQICEYMRNAG------MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN---DSRE- 375 +C R ++++++ D + C +G F D E Sbjct: 556 AKLTSLCNLARTKNGDTNDRYTVFTISMGL---VNDTMTTCATDTGDAFTSTITNDPDEP 612 Query: 376 -LLESFDKITDKIQEQSVRI 394 L E F I+D+I + + Sbjct: 613 GLQEIFKTISDQITALRLSL 632 >gi|330810109|ref|YP_004354571.1| hypothetical protein PSEBR_a3255 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327378217|gb|AEA69567.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 359 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 30/146 (20%), Positives = 55/146 (37%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ A + L + +I Sbjct: 150 PLTFDRRTVRHWLDEARIGIAGKNTAIGDAIGLALKRL----------RQRPAHSRVLIL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+G ++ L + G+KIY + + A PE + Sbjct: 200 VTDGANNGG-----EIDPLTAARLAADEGVKIYPIGIGAAPEQSGTTGSLGVNPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 L++ + + GQ+F D +LL Sbjct: 255 PTLKEIAEVTGGQYFRAQDGEQLLGI 280 >gi|126649837|ref|ZP_01722073.1| hypothetical protein BB14905_16605 [Bacillus sp. B14905] gi|126593556|gb|EAZ87501.1| hypothetical protein BB14905_16605 [Bacillus sp. B14905] Length = 865 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/194 (18%), Positives = 67/194 (34%), Gaps = 19/194 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 A + + S L E +G IA++ T NN E Sbjct: 410 AIVLDRSGSMSGSKLELAKEAAARSVEMLRDEDTLGFIAFDDRPWEIIETGPLNNKEEAV 469 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + + P T Y ++ AY L + +K +I +TDG++ + Sbjct: 470 DTILSVTPGGGTEIYGSLAKAYENLAD----------IKLQRKHIILLTDGQSQPGNY-- 517 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 + E ++ G+ + +VA+ +LL ++ G+F+ V D + + + Sbjct: 518 -----EDLIEQGKDNGITLSTVAIGQDA-DANLLEALSEMGSGRFYNVIDEQTIPSILSR 571 Query: 383 ITDKIQEQSVRIAP 396 T I + P Sbjct: 572 ETAMISRTYIEDNP 585 >gi|310286822|ref|YP_003938080.1| von Willebrand factor type A domain [Bifidobacterium bifidum S17] gi|309250758|gb|ADO52506.1| conserved hypothetical protein containing von Willebrand factor type A domain [Bifidobacterium bifidum S17] Length = 1156 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 39/374 (10%), Positives = 110/374 (29%), Gaps = 46/374 (12%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + + T + + I E++ ++ ++ + Sbjct: 482 TATGTKDGETRTTDSNGVITLKADQYAVLLGSDAKRITESSKYKVKEINVDQDTYAVSAN 541 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE-NLAISICMVLDVSRSM 160 + + + T G +P ++ + + + N + + L+V+ + Sbjct: 542 GGQVKVTQEKDSATTEPVSVGEVPRTTVTNTVVTAPRYRKYIKANNDGTYDLSLNVTGTQ 601 Query: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL 220 + + + S P N +++V + ++ Sbjct: 602 SGSSQTTVSPADIVVVFDT-----------------SGSMSNPMGHNSRLEVAKTAVNSM 644 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT--- 277 + + + K+ ++R+ + + + ++N ++ S +N L TN Sbjct: 645 AQHLLTSENQGKDSNIRMALVPF--STTAGNVSNFTDNAMDIVSAVNGLGADGGTNWEAA 702 Query: 278 --------YPAMHHAYREL------YNEKESSHNTIGSTRLKKFVIFITDG------ENS 317 + + +S G+ L + + DG + Sbjct: 703 LKAANAKLTSGRKGVKKYIVFMSDGDPTYRTSSVRTGTDWLGRPIYDADDGWGLPAGVHG 762 Query: 318 GASAYQNTLNTLQICEYMRNAG-MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + N G ++SV VS+ P + + G +++ + EL Sbjct: 763 SGLSDRYGANLSSAVAEANRRGDATLFSVGVSSDPT--KMRGFADQTKGSYYSATSTDEL 820 Query: 377 LESFDKITDKIQEQ 390 ++F I +I + Sbjct: 821 NKAFADIIGQINRK 834 >gi|332253805|ref|XP_003276022.1| PREDICTED: matrilin-3 [Nomascus leucogenys] Length = 488 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 58/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + + ++ + ++ P Sbjct: 104 VKTFVSRIIDTL-----DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSLRQAVGRITPL 158 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + + S+ + K I +TDG ++ Sbjct: 159 STGTMSGLAIQTAMDEAFTVEAGARD--PSSNIPKVAIIVTDGRPQD--------QVNEV 208 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G+++Y+V V L+ F V +L F + Sbjct: 209 AARARASGIELYAVGV--DRADLKSLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 262 >gi|149725809|ref|XP_001495640.1| PREDICTED: similar to leukocyte adhesion glycoprotein [Equus caballus] Length = 1140 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 66/191 (34%), Gaps = 21/191 (10%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + E ++N +K+ K S+ + + + + N Sbjct: 158 SGSIYENDFQKMKEFVTIVMNQFKKS---KTLFSLMQYSDTFQTHFTFKEFAN-NPNPGS 213 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + +N+L T+T + REL++ + + K ++ ITDGE Sbjct: 214 LVRPINQL--GGRTHTATGIRKVVRELFHSRNGAR-----KNALKILVVITDGE-----K 261 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQDLLRKCTDSS--GQFFAVNDSREL 376 + + L + G+ Y + V + ++ L F VN+ L Sbjct: 262 FGDRLEYEDVIPEADQEGIIRYVIGVGIAFSIEKSREELNTIASKPARDHVFRVNNFEAL 321 Query: 377 LESFDKITDKI 387 +++ +KI Sbjct: 322 KTIQNQLQEKI 332 >gi|26349121|dbj|BAC38200.1| unnamed protein product [Mus musculus] Length = 280 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 60/166 (36%), Gaps = 15/166 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ RN G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRIIMIVTDGRPQDS--------VAEVAAKARNTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSV 392 + L+ F V + ++ +K+ +SV Sbjct: 198 --QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQNKLCSKSV 241 >gi|242034233|ref|XP_002464511.1| hypothetical protein SORBIDRAFT_01g019880 [Sorghum bicolor] gi|241918365|gb|EER91509.1| hypothetical protein SORBIDRAFT_01g019880 [Sorghum bicolor] Length = 584 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 73/196 (37%), Gaps = 22/196 (11%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP--LSNNLNEVKSR 265 K+ +L ++ G +++++ A R+ ++++ + K Sbjct: 163 SKLALLKQAMGFVIDNLGPAD--------RLSIVSFSNDASREIRLTRMSGDGKASAKEA 214 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + L +TN + A L + + + T VI ++DG+++ + +N Sbjct: 215 VESLVADGSTNISRGLLVASEVLADRRYRNAVTS--------VILLSDGQDNQSGVGRNH 266 Query: 326 LNTLQ-ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK-I 383 N + + + I++ + + + + G F V + + +SF + I Sbjct: 267 QNLVPPLFRDADSRPGSIHTFGFGSDHDAAAMHAIAEVARGTFSFVENLAVIQDSFAQCI 326 Query: 384 TDKIQ--EQSVRIAPN 397 + Q+ RIA + Sbjct: 327 GGLLSVVAQNARIAVD 342 >gi|297626138|ref|YP_003687901.1| Von Willebrand factor, type A [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] gi|296921903|emb|CBL56463.1| Von Willebrand factor, type A [Propionibacterium freudenreichii subsp. shermanii CIRM-BIA1] Length = 321 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 23/216 (10%), Positives = 64/216 (29%), Gaps = 31/216 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 +++ S+ A ++D A + V+S+ A Sbjct: 87 ERATIVVTIDVSRSMEATDVTPNRLDAAKSGAKDFVDSLPSA----------FNVALVTF 136 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 N P + + ++K+ ++ + +T ++ + L + Sbjct: 137 AGTANVKMPPTTDRTQLKAAIDAIRLAPSTAIGEGIYTSLDVL-EKLAPQDPDHPDDPAP 195 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PE 352 ++ ++DG + + + + IY++A Sbjct: 196 GAIVLLSDGATNMGRDSAD------AATEAKKKNVPIYTIAYGTSTGYVVENGQRQTVAV 249 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 L + S G+ ++ + + L + I+ +I Sbjct: 250 NHAELSQVAKLSGGKKYSADSMKNLQAVYQTISRQI 285 >gi|255557538|ref|XP_002519799.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] gi|223541038|gb|EEF42595.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] Length = 514 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 30/244 (12%), Positives = 68/244 (27%), Gaps = 27/244 (11%) Query: 141 RSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA 200 ++ +V+ + + + L + Sbjct: 11 IPQDSGPRPTPIVVGRVQLISRNNNTAPLQESKFKVMLELTGGDSSNDRPGLDLVAVLDV 70 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 KI+ + + ++ + LSV + N Q T N+ Sbjct: 71 SGSMAG-DKIEKVKTAMLFVIKKLSPID----RLSVVTFSADANRLCPLRQITE--NSQK 123 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +++ +N LN TN + + L + S +G ++ ++DGE + Sbjct: 124 DLEKLINGLNADGATNITAGLQTGLKVLSDRSLSGGRVVG-------IMLMSDGEQNAGG 176 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSRELLE 378 + +Y+ +L+ G F V D+ L + Sbjct: 177 DAAQVPVG----------NVPVYTFGFGINH-EPRVLKAIAHNSIGGTFSDVQDTNNLSK 225 Query: 379 SFDK 382 +F + Sbjct: 226 AFSQ 229 >gi|198421751|ref|XP_002123463.1| PREDICTED: similar to cartilage matrix protein [Ciona intestinalis] Length = 272 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 59/174 (33%), Gaps = 10/174 (5%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENT 275 + V + + +++G I + G+ +E+ ++ + T Sbjct: 53 VKDYVKNFTDIFEAFGPNDMQVGVIQFGSGVREEILLNQFYVRHELMEAIDNIRYMETGT 112 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T A+ E + + + + V+ ITDG++ S T T + Sbjct: 113 MTGLALRKLVTETLTVEHGARV--DNPIVHTVVVIITDGKSQDYSRGGVTKWT----KEA 166 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 + G +I+++ + ++LL F V + + + D+I Sbjct: 167 KARGFEIFAIGIGRKANRKELLE-MASEPKELHTFRVQNFNAIKRVDVNLKDRI 219 >gi|224046544|ref|XP_002198814.1| PREDICTED: matrilin 2 [Taeniopygia guttata] Length = 902 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 54/158 (34%), Gaps = 18/158 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + +G I Y + +++ + ++ T T A+ +A Sbjct: 46 DISPDATHVGLIQYGSTVKQEFSLKTFRRKQDIERAVKRMMHLGTGTMTGLALQYAVNIA 105 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + ++ +TDG +I RN+G+ I+++ V Sbjct: 106 FSETEGARPLR--QNVPRIIMIVTDGRPQDP--------VAEIAAKARNSGILIFAIGVG 155 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFD 381 + L+ F V + + L +F Sbjct: 156 R--VDMNTLKSIGSEPHEEHVFLVANFSQIETLTSAFQ 191 Score = 45.3 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 22/195 (11%), Positives = 66/195 (33%), Gaps = 21/195 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + ++++++ + + R+G + Y+ + ++ ++K Sbjct: 625 GEDNFEIVKQFVSGILDTLEISPKAA-----RVGLLQYSTEVRTEFTLRQFSSAKDMKKA 679 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++++ + T A+ + E E + S + + I TDG + Sbjct: 680 VSQMKYMGRGSMTGLALRQMSERSFTETEGARPF--SANVPRISIVFTDGRAQDEVSEWA 737 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDK 382 T + ++LL F D L + ++ Sbjct: 738 TRAKQRGIIIYAIG---------IGKAIEEELLE-IASEPSYKHLFYAEDFTALEDISEE 787 Query: 383 ITDKIQEQSVRIAPN 397 + +I E +++ +P+ Sbjct: 788 LRAQICE-ALKESPH 801 >gi|89094518|ref|ZP_01167457.1| hypothetical protein MED92_09161 [Oceanospirillum sp. MED92] gi|89081254|gb|EAR60487.1| hypothetical protein MED92_09161 [Oceanospirillum sp. MED92] Length = 445 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 56/162 (34%), Gaps = 12/162 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + + + ++Y+ + ++ N + +N++ NT + + EL Sbjct: 102 SQNDIVSVVSYDSRVNVVVPATKVSDTNTIARAINRIQANGNTALFAGVSKGANEL---- 157 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 VI ++DG + + N L L + + GM + ++ + Sbjct: 158 ----RKFLDLNKVNRVILLSDGLANIGPSTPNELGKLGL--SLAKEGMSVTTIGLGLG-Y 210 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +DL+ + S G V ++ +L F + + Sbjct: 211 NEDLMTQLAGFSDGNHAFVENADDLARVFQYEFGDVLSVVAQ 252 >gi|332970976|gb|EGK09950.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 441 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 27/227 (11%), Positives = 73/227 (32%), Gaps = 23/227 (10%) Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 + + P K+ + S A K+ V E+ + + + + Sbjct: 114 KISGDQPGTEVNGPEEKQHNVTILLDASGSMAARVSGGEKMQVAKEAVRSFTSQMPEGTN 173 Query: 230 EKKNLSVRIGTIAYNIGIVG----NQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHA 284 + G+ + + L N + ++S+L+ + T AM+ A Sbjct: 174 VSLIVYGHKGSNSKADQAESCKGIEEIVELGPYNESTLQSKLDPIRATGWTPLAGAMNQA 233 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KI 342 + L + + + + ++DG + + ++ + + + + + Sbjct: 234 GQRLKETEGQA---------ENVIYVVSDGLETCGG------DPVKEAKSLNQSNIKATV 278 Query: 343 YSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ 388 + + L+K + G++F+ EL F K++ Sbjct: 279 NIIGFDVGNKEHQALKKVAEAGGGKYFSATSKTELDLYFRNEYAKLK 325 >gi|156523281|ref|NP_001096038.1| anthrax toxin receptor 2b [Danio rerio] gi|151335854|gb|ABS00409.1| capillary morphogenesis protein 2B [Danio rerio] Length = 487 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 74/198 (37%), Gaps = 21/198 (10%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + ++ + +++ + +R+ I ++ PL+ + ++ Sbjct: 41 YFVLDRSGSVSDNWLEIYGFVEQLTNRFVSPKMRVSFIVFSSSA--EIILPLTGDRVDID 98 Query: 264 S---RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 S +L+K+ P +T + + A ++ ++ + +I +TDG+ Sbjct: 99 SGLQQLSKIRPAGDTYMHEGLKKAIEQMTSQGARA---------SSIIIALTDGKLEVFM 149 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA-VNDSRELLES 379 ++ + R G ++Y V V + L + D+ Q F V+ + L Sbjct: 150 ----NELAIKEADLARQYGARVYCVGV--KDFDANQLTEIADNKDQVFPVVDGFQALKNI 203 Query: 380 FDKITDKIQEQSVRIAPN 397 + I K + ++ P+ Sbjct: 204 VNSILQKSCVEVFKLEPS 221 >gi|149920662|ref|ZP_01909127.1| hypothetical protein PPSIR1_01724 [Plesiocystis pacifica SIR-1] gi|149818449|gb|EDM77898.1| hypothetical protein PPSIR1_01724 [Plesiocystis pacifica SIR-1] Length = 540 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 59/141 (41%), Gaps = 15/141 (10%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + E+ + + L P+ +TN Y + A+ + + + V+ ++DG + Sbjct: 214 DSVELATAIAALVPWGSTNLYAGLRTAFEQ--------TDLYAQEGWQNRVLLVSDGVPT 265 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSREL 376 + + + E G + +V + +L+R ++ SG F+ V D + Sbjct: 266 TGIVNSDKI--EGLAEAWSGMGYGLTTVGIG-NDFDIELMRNLSELGSGSFYYVEDPDAV 322 Query: 377 LESFDKITDKIQEQSVRIAPN 397 +E F + ++Q +V +A + Sbjct: 323 IEVFSE---EVQAFTVPLAED 340 >gi|118359890|ref|XP_001013183.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89294950|gb|EAR92938.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 2138 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 22/125 (17%), Positives = 42/125 (33%), Gaps = 9/125 (7%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N+ +K+ +N+L TN M A+ L V ++DG N Sbjct: 1505 DNIKSIKNEINRLVAKGGTNICQGMQLAFDVLK--------QRRYKNPITSVFLLSDGLN 1556 Query: 317 SG-ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 G + ++ L L + I + + + + G F+ + D Sbjct: 1557 DGAENKIRDLLKQLNFYQNYNEENFTIQTFGFGKDHDPNLMDKISQLMDGNFYYIGDIHR 1616 Query: 376 LLESF 380 + E F Sbjct: 1617 IDECF 1621 >gi|51476525|emb|CAH18248.1| hypothetical protein [Homo sapiens] Length = 637 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 54/175 (30%), Gaps = 11/175 (6%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 ++ I + + N+N+ +S + TN AM A + Sbjct: 303 DLSPRDQFNLIVFSTEATQWRPSLVPASAENVNKARSFAAGIQALGGTNINDAMLMAVQL 362 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L + + GS +I +TDG+ + N + + G ++ + Sbjct: 363 LDSSNQEERLPEGS---VSLIILLTDGDPTVGE--TNPRSIQNNVREAVSGGYSLFCLGF 417 Query: 348 SAPPEGQDLLRKCTDSSGQ-----FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L K +G + + +L + + ++ + + P+ Sbjct: 418 GFDV-SYAFLEKLALDNGGLARRIHEDSDSALQLQDFYQEVANPLLTAVTFEYPS 471 >gi|89096888|ref|ZP_01169779.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89088268|gb|EAR67378.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 459 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 56/183 (30%), Gaps = 18/183 (9%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL-SNN 258 + + E +L + ++ ++ ++ PL + Sbjct: 169 VSGGNKMMLAKETIKEFTSSLEDDASVSLMAYGHVGTGNDEDKAESCSRIDEVFPLGAYE 228 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ T A+ A L ST K + ++DG + Sbjct: 229 KTAFNKSMDSFEASGWTPLAGAIDKARELLS--------AYNSTDYKNTLYIVSDGVET- 279 Query: 319 ASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + ++ + ++ + + K+ + EGQ L++ + G + V D E Sbjct: 280 -----CDGDPVEAAQQLQGSNIEAKVNIIGFDVDDEGQKQLKEVAEAGGGTYATVRDKDE 334 Query: 376 LLE 378 L + Sbjct: 335 LED 337 >gi|162454179|ref|YP_001616546.1| hypothetical protein sce5902 [Sorangium cellulosum 'So ce 56'] gi|161164761|emb|CAN96066.1| hypothetical protein sce5902 [Sorangium cellulosum 'So ce 56'] Length = 940 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 27/126 (21%), Positives = 48/126 (38%), Gaps = 19/126 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N + + + ++ P T + A+ AY+++ + KK VI +TDG+ S Sbjct: 531 NRSRIAGEIARIQPGGGTEIFSALDAAYQDMT----------VTQARKKHVILLTDGKAS 580 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSREL 376 + M + + +V + + LL+ D G+F AV D L Sbjct: 581 TGGI-------RDLVSAMIAESITVTTVGLG-NDLDEQLLKMIADVGGGRFHAVPDPNNL 632 Query: 377 LESFDK 382 F K Sbjct: 633 PRIFTK 638 >gi|311253076|ref|XP_003125384.1| PREDICTED: matrilin-3-like [Sus scrofa] Length = 488 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 59/176 (33%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + ++ +K + ++ P Sbjct: 104 VKTFVSRIIDNL-----DIGAEDTRVAVVNYASTVKIEFHLQTHSDKQALKRAVARIAPL 158 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + ++ + K I +TDG ++ Sbjct: 159 STGTMSGLAIQTAMDEAFTVEAGARG--PNSNIPKVAIIVTDGRPQD--------QVNEV 208 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G+++Y+V V + LR F V +L F + Sbjct: 209 AARARASGIELYAVGV--DRADMESLRLMASEPLDEHVFYVETYGVIEKLSSRFQE 262 >gi|171058998|ref|YP_001791347.1| von Willebrand factor type A [Leptothrix cholodnii SP-6] gi|170776443|gb|ACB34582.1| von Willebrand factor type A [Leptothrix cholodnii SP-6] Length = 350 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 21/194 (10%), Positives = 64/194 (32%), Gaps = 46/194 (23%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK--- 292 VR+G +++ Q +++ ++V + +++ T + + L+ E Sbjct: 133 VRVGVVSFAGTAAVVQAP--THSRDDVFAAIDRFQLQRGTAIGSGIVLSLATLFPEAGID 190 Query: 293 --------------------ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 + + VI +TDG+ + + + Sbjct: 191 LSDITGERRMPKGIGESDKQDDFKPVAPGSYGSAAVILLTDGQRTTG------PDPMDAA 244 Query: 333 EYMRNAGMKIYSVAVSAPPE--------------GQDLLRKCTD-SSGQFFAVNDSRELL 377 + + G+K+Y+V + L+ + + ++F + +L Sbjct: 245 KMAADRGVKVYTVGFGTTSGEIIGFEGWSMRVRLDEATLKNIANLTQAEYFYAGSATDLQ 304 Query: 378 ESFDKITDKIQEQS 391 + +D ++ ++ + Sbjct: 305 KVYDTLSSRLVFER 318 >gi|56797994|emb|CAG27564.2| matrilin-3b [Danio rerio] gi|220675932|emb|CAX12091.1| matrilin 3b [Danio rerio] Length = 343 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + + R+ + Y + + EVK ++++P T T A+ A ++ Sbjct: 100 DIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQV 159 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E + + K I +TDG ++ R +G++IY+V V Sbjct: 160 FTENAGARPLK--KGIGKVAIIVTDGRPQD--------KVEEVSAAARASGIEIYAVGV- 208 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 L++ F V +L F + Sbjct: 209 -DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRE 246 >gi|149018615|gb|EDL77256.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Rattus norvegicus] Length = 1157 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/341 (10%), Positives = 92/341 (26%), Gaps = 21/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K L G +++ ++ Sbjct: 140 LADAAENFQKAHRWQDNIKEEDIMYYDAK-ADAELDDPESEDMERGSKTSALRLDFIEEP 198 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L IE ++ + + + Sbjct: 199 NFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSAT 257 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + P + S + + + + ++ S Sbjct: 258 GVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 317 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T Sbjct: 318 CEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQGMVAKGTTG 373 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG N Sbjct: 374 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 420 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N +++++ +V L+ ++ G +F + + Sbjct: 421 NRTVRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|70730104|ref|YP_259843.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344403|gb|AAY92009.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 358 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 28/152 (18%), Positives = 54/152 (35%), Gaps = 35/152 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + V+ L++ +NT A+ A + L + +I Sbjct: 150 PLTFDRHTVRVWLDEAKIGIAGKNTAIGDAIGLALKRL----------RQRPAQSRVLIL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 +TDG N+G + + G+KIY + + A PE + Sbjct: 200 VTDGANNGG-----EIAPITAARLAAEEGVKIYPIGIGADPEQSATLGVLGINPSLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 L++ + G++F D +EL + Sbjct: 255 PALKELAQVTGGRYFRARDGQELQAIKQTLDQ 286 >gi|28212250|ref|NP_783182.1| voltage-dependent calcium channel subunit alpha-2/delta-2 [Rattus norvegicus] gi|81871226|sp|Q8CFG6|CA2D2_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|27450706|gb|AAO14653.1|AF486277_1 calcium channel alpha-2 delta-2 subunit [Rattus norvegicus] Length = 1157 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 35/341 (10%), Positives = 92/341 (26%), Gaps = 21/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K L G +++ ++ Sbjct: 140 LADAAENFQKAHRWQDNIKEEDIMYYDAK-ADAELDDPESEDMERGSKTSALRLDFIEEP 198 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L IE ++ + + + Sbjct: 199 NFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSAT 257 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + P + S + + + + ++ S Sbjct: 258 GVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 317 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T Sbjct: 318 CEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQGMVAKGTTG 373 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG N Sbjct: 374 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 420 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N +++++ +V L+ ++ G +F + + Sbjct: 421 NRTVRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 461 >gi|330829762|ref|YP_004392714.1| von Willebrand factor, type A [Aeromonas veronii B565] gi|328804898|gb|AEB50097.1| von Willebrand factor, type A [Aeromonas veronii B565] Length = 347 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 58/142 (40%), Gaps = 15/142 (10%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P + +L ++ L + + ++T+ A+ A + N S + +K I Sbjct: 159 PFTADLETWQTLLQETDVAMAGQSTHLGDAIGLAIKVFNNSDRHGQQDQNSAKREKVAII 218 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-S 363 +TDG ++G + ++ G++++++A+ P D L++ + Sbjct: 219 LTDGNDTG-----SFVSPRDAARVAAVNGVRLHTIAMGDPATVGEQALDLDTLQQLATLT 273 Query: 364 SGQFFAVNDSRELLESFDKITD 385 GQ F D +L ++ I + Sbjct: 274 GGQLFQALDEAQLTRAYQVIGE 295 >gi|326335929|ref|ZP_08202106.1| aerotolerance-related membrane protein [Capnocytophaga sp. oral taxon 338 str. F0234] gi|325691893|gb|EGD33855.1| aerotolerance-related membrane protein [Capnocytophaga sp. oral taxon 338 str. F0234] Length = 348 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 25/184 (13%), Positives = 67/184 (36%), Gaps = 50/184 (27%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVK---SRLN-KLNPYENTNTYPAMHHAYRELYNEK 292 RI I Y PL+++ + K +N + + T A+ A + Sbjct: 130 RIAFIPYAAQAYPQ--LPLTSDYSAAKIFLEAINTDMLSSQGTAIGEAIQTAINYFEDSN 187 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + K +I ++DGE+ ++ + ++ G++I+++ + Sbjct: 188 Q----------SSKILIILSDGEDHQ-------QGATEMIQEVKEKGIRIFTIGLGTTQG 230 Query: 353 --------------------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 Q LL + + G++F +++++++++ K+ + Sbjct: 231 TTIPIKENGQTFPKRDKDGEVVITKLNQALLEEIAQEGDGKYFDGSNTQQVIDNLQKVLN 290 Query: 386 KIQE 389 I++ Sbjct: 291 NIEK 294 >gi|238793630|ref|ZP_04637253.1| hypothetical protein yinte0001_2490 [Yersinia intermedia ATCC 29909] gi|238727045|gb|EEQ18576.1| hypothetical protein yinte0001_2490 [Yersinia intermedia ATCC 29909] Length = 480 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 28/241 (11%), Positives = 73/241 (30%), Gaps = 13/241 (5%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 I + + I ++ +LAH + + ++ A++ A L+ V + + + + Sbjct: 29 FIIFLPLIIGLIFFSFELAHFLQKKTKLSDAMEQATLALT---VENNNSTPSAAQITKNA 85 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 I + +L ++ I N ++Y AE Y + Sbjct: 86 EIVSSYAQAYLPAETFSTPTINIIYN----------NGRIEYGAEINMSYSAKFLSNIQV 135 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + + + + S+ + V D S SM++ + N+ + + + + Sbjct: 136 TNLSTIINATDRGAARKNIISAPIEKTDVVFVADYSNSMDEYFYHDENEPKKIVALREIF 195 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 +KN + P +++ +R + Sbjct: 196 NRLNDNVLKNKNIHTIGFIPFSWGTKNRVENGTRIIEYCHLPFVPKKHSPNGDYLRKYIL 255 Query: 242 A 242 + Sbjct: 256 S 256 >gi|108760371|ref|YP_628476.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108464251|gb|ABF89436.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 422 Score = 69.2 bits (167), Expect = 1e-09, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 59/164 (35%), Gaps = 13/164 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNN-LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 R+ I Y + ++ E+ + ++ L +TN A+ A L Sbjct: 83 RLAFIDYGTDVRVQPSRRMTEEAREELLTLISGLQDDGSTNISGALDAAANALRPHMREY 142 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 + I ++DG+ + + L L +R G+ + ++ V + Sbjct: 143 RVSR--------AILLSDGQPTTGIVSEPGL--LDQVRQLRRDGITVSALGVGRD-YQET 191 Query: 356 LLRKCTDSSGQFF-AVNDSRELLESFDKITDKIQEQSVRIAPNR 398 L+R + G F ++DS L E F + D+ R+ R Sbjct: 192 LMRGMAEQGGGFSGFIDDSARLAEVFSRELDQATSTVARMVELR 235 >gi|254409659|ref|ZP_05023440.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196183656|gb|EDX78639.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 413 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 33/180 (18%), Positives = 71/180 (39%), Gaps = 23/180 (12%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ++ + ++A L+ +++ RI IA++ +NLN +KS++ +L Sbjct: 59 LETVKKAAMQLIERLKEGD--------RICVIAFDHRAKVLVPNQAIDNLNTIKSQIRQL 110 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + T + E+ G V +TDGE + + + L Sbjct: 111 SADGGTAIDEGLKLGIEEV---------AKGKADAVSQVFLLTDGE----NEHGDNERCL 157 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQ 388 ++ + + I ++ A QD+L K DS G + + ++ F ++ ++IQ Sbjct: 158 KLAHFAVEHKLTINTLGFGASWN-QDVLEKIADSGSGTLCYIEQPEQAVQEFGRLFNRIQ 216 >gi|218442094|ref|YP_002380423.1| von Willebrand factor A [Cyanothece sp. PCC 7424] gi|218174822|gb|ACK73555.1| von Willebrand factor type A [Cyanothece sp. PCC 7424] Length = 412 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 25/134 (18%), Positives = 53/134 (39%), Gaps = 15/134 (11%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +NLN + ++N L P T + +E N K+ + +TDGE Sbjct: 97 DNLNTIIEQINSLKPAGGTAIDEGLKLGIQESANGKKDR---------VSQIFLLTDGE- 146 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS-SGQFFAVNDSRE 375 + + + L++ + + + ++ QD+L K +DS G + + Sbjct: 147 ---NEHGDNERCLKLAHVASDYNITLNTLGFG-NHWNQDVLEKISDSAGGTLCYIETPDK 202 Query: 376 LLESFDKITDKIQE 389 +E F ++ ++ Q Sbjct: 203 AIEEFSRLFNRAQS 216 >gi|260914303|ref|ZP_05920772.1| Flp pilus assembly protein TadG [Pasteurella dagmatis ATCC 43325] gi|260631404|gb|EEX49586.1| Flp pilus assembly protein TadG [Pasteurella dagmatis ATCC 43325] Length = 584 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 54/148 (36%), Gaps = 18/148 (12%) Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK--ESSHNTIGSTRLKKFVIFIT 312 + + +V L+K+ P +T L ++ + T ++ ++ ++ Sbjct: 423 FTKSKPKVAEALSKIKPTGSTAASSGFIIGANLLMDKNTVPEAQPAKLGTNTQRILMVLS 482 Query: 313 DGENSGASAYQ-NTLNTLQICEYMRNA--------------GMKIYSVAVSAPPEGQDLL 357 DGE++ + TL +C+ +R + + PPE + Sbjct: 483 DGEDNRPTFDTLTTLLNAGLCDNIRKKADSLQDPKFNTLPTKIAFAAFGFQPPPEQKAAW 542 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITD 385 +KC ++ + LL++F +I Sbjct: 543 QKCV-GENNYYEPSSKEALLDAFKQILS 569 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 42/377 (11%), Positives = 108/377 (28%), Gaps = 50/377 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVS---DRTIKDPTTK- 56 MTA++ + I + +D I+ + ++ A D A L A + + D T + Sbjct: 38 MTALLSFPLLVLIAFTVDGTGIILDKVRLAQATDQAALLLVAENNAYRKNPMHDDVTKQS 97 Query: 57 ------------------KDQTSTIFKKQIKKHLKQGS---------------YIRENAG 83 + + + K +L+ + + + Sbjct: 98 VSKEELSKFSGDKLSAQKDKRNQELIQGLAKMYLRSENKAQKDNHLPVTIDQPFDYKCEE 157 Query: 84 DIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSS 143 + ++ K ++ IP +K + ++ + ++ + Sbjct: 158 LDLINPKNQYSRRKPVTCYVQGSVNREFWIPLSADLVKTHTKNGRLPINSGISYAVKEKA 217 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHN--------DNNNMTSNKYLLPPPPKKSFWSKNTT 195 + + + +V D S SM N D + P K + Sbjct: 218 IVIPVDLMLVSDFSGSMLWDLKNNENAQYPNRKIDILRSVVSDIQNILFPTKLSEDASPY 277 Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 A A R+ + ++ + + + ++ + Y + T + Sbjct: 278 NRMGFAAFAGGTRQRGDKNSCV--MPYYLKSGVHDFRVAYWQLDSFNYRGSPWDCKDTNV 335 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 ++ + +N N A+ A + + + K + + + + Sbjct: 336 LDDRGNPR-PVNACLIKG--NPEDALRTALNDRHLSTSMKLIFEDVLDVDKTIKQVENFD 392 Query: 316 NSGASAYQNTLNTLQIC 332 + + Y+ T N C Sbjct: 393 GNRVNDYKLTYNNPDHC 409 >gi|73695954|gb|AAZ80783.1| matrilin [Biomphalaria glabrata] Length = 418 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 61/189 (32%), Gaps = 19/189 (10%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS--NNLNEVKSRLNKLNP 271 + L + + VR+ I+Y GI L+ + +EV + ++ Sbjct: 65 KTAIKFLQEFLSQYEISSDPNGVRVSIISYGKGIYPEIGFNLTTYDTKDEVIEAIGRIPH 124 Query: 272 YEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+T A+ + + + K I ITDG + T Sbjct: 125 KAGLRTDTGRAIQYMHEAQLANGVVRPG------VTKVSIVITDGNSQEW------KLTK 172 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ- 388 + E R + ++++ V +LL + V++ +L + + + Sbjct: 173 EAAEEARKDNIVMFAIGVGTDIRNSELL-NIAGDQSRVTKVDNYNQLSSIKESLAHQTCF 231 Query: 389 -EQSVRIAP 396 ++ P Sbjct: 232 VQEKTTTTP 240 >gi|37676326|ref|NP_936722.1| hypothetical protein VVA0666 [Vibrio vulnificus YJ016] gi|37200868|dbj|BAC96692.1| conserved hypothetical protein [Vibrio vulnificus YJ016] Length = 362 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 38/291 (13%), Positives = 95/291 (32%), Gaps = 34/291 (11%) Query: 105 AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY 164 + + + F L+ + S + + + + + +L VS + Sbjct: 35 VPAYRTKQTAIKVPFFHQLVEAMGETPSEGVSQLTPSGWQRATLVLSWLLVVSALAKPTI 94 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 L ++ + ++ S + + + A ++D E Sbjct: 95 LGAPQIRESLGRDVMVVVD------LSGSMAEQDFTSASGANISRLDATKEVLAEFA--- 145 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAM 281 K R+G I + TP + + + ++ + ++T+ A+ Sbjct: 146 ------KTRQGDRLGLILFGDAAFVQ--TPFTADQKVWLALLNQTDVAMAGQSTHLGDAI 197 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 A + + + S +K I +TDG ++G + + + + + G++ Sbjct: 198 GLAIKVFEQSESN--QAASSKPRQKVAIVLTDGNDTG-----SFVEPIDAAKVAKAKGVR 250 Query: 342 IYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 I+ +A+ P + + S G+ F + EL ++D I Sbjct: 251 IHVIAMGDPSTVGESALDLQTIERIASESGGKAFQALNRDELARAYDDIGK 301 >gi|73695956|gb|AAZ80784.1| matrilin [Biomphalaria glabrata] Length = 418 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 61/189 (32%), Gaps = 19/189 (10%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS--NNLNEVKSRLNKLNP 271 + L + + VR+ I+Y GI L+ + +EV + K+ Sbjct: 65 KTAIKFLQEFLSQYEISSDPNGVRVSIISYGKGIYPEIGFNLTTYDTKDEVIEAIGKIPH 124 Query: 272 YEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+T A+ + + + K I ITDG + T Sbjct: 125 KAGLRTDTGRAIQYMHEAQLANGVVRPG------VTKVSIVITDGNSQEW------KLTK 172 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ- 388 + E R + ++++ V +LL + V++ +L + + + Sbjct: 173 EAAEEARKDNIVMFAIGVGTDIRNSELL-NIAGDQSRVTKVDNYNQLSSIKESLAHQTCF 231 Query: 389 -EQSVRIAP 396 ++ P Sbjct: 232 VQEKTTTTP 240 >gi|319787647|ref|YP_004147122.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] gi|317466159|gb|ADV27891.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] Length = 585 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 42/363 (11%), Positives = 91/363 (25%), Gaps = 30/363 (8%) Query: 34 DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI 93 +AA L T + L + E Sbjct: 66 EAAALH--TPFAPSPASASAPTAALAYRIAPPPPPVRPLPRPETNTETYEAREDNPVRRA 123 Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 + + + Y L P+ + + + Sbjct: 124 REVPVSTFSVDVD-TGSYANVRRMLRDGYRPPADSVRVEEMLNYFDYGHPAPASREVPFK 182 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 + + + + P + T K+ +L Sbjct: 183 VTTELAPAPWNPARQLLMVGIKGYDVDKRELPPANLVLLVDTSGSMDDPA-----KLPLL 237 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + LV ++ R+ +AY N E+ + L L+ Sbjct: 238 KRAFAQLVPQLRAKD--------RVSIVAYAGHAGLVLPPTPGNRHGEILAALEGLHAAG 289 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 +TN + AY ++ TDG+ + +N L L + Sbjct: 290 STNGGEGLRLAYAMARQGHVEGGVNR--------ILLATDGDFNVGITDRNAL--LTLVA 339 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSV 392 R +G+ + ++ + + + D+ GQ ++ L E+ + ++Q + Sbjct: 340 DQRRSGIALSTLGFGSGNYNDAMAERLADAGNGQHLYIDT---LDEARRALVQQMQATLL 396 Query: 393 RIA 395 IA Sbjct: 397 TIA 399 >gi|295399398|ref|ZP_06809380.1| Ig domain protein group 2 domain protein [Geobacillus thermoglucosidasius C56-YS93] gi|294978864|gb|EFG54460.1| Ig domain protein group 2 domain protein [Geobacillus thermoglucosidasius C56-YS93] Length = 929 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 24/262 (9%), Positives = 61/262 (23%), Gaps = 17/262 (6%) Query: 146 LAISICMVLDVSRSM-EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA-P 203 I + V DVS SM ++ Y ++ + Sbjct: 78 PPIDVVFVFDVSGSMVMPSLKLDSAKYALQSAVDYFKANANPNDRFALVPFSDDVQYSKV 137 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR---IGTIAYNIGIVGNQCTPLSNNLN 260 P + + + + +N L++ + Sbjct: 138 VPFPSGTYDVKQHLNWIATVANSLRANGGTNYTQALQQAQSFFNDPARKKYIIFLTDGMP 197 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 V + K ++ + + I+ D + Sbjct: 198 TV--SIAKEPITYKVCEGILFWKTCNQVTENLNVQYILYSNGITAARTIYYPDHPETKTY 255 Query: 321 AYQNTLN---------TLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSSGQFFAV 370 + + + + + + +YS+ E L K + ++G Sbjct: 256 SDREKYREFEEKIRLHGTNVAKTLGMNNITLYSIGFGNNQEVDMGYLEKLSSTAGGQAKK 315 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + L E F + + + + Sbjct: 316 GTPQNLTEIFQQFSKLANDPVL 337 >gi|226358120|ref|YP_002787859.1| hypothetical protein Deide_2p00900 [Deinococcus deserti VCD115] gi|226319763|gb|ACO47757.1| Conserved hypothetical protein [Deinococcus deserti VCD115] Length = 418 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 56/165 (33%), Gaps = 16/165 (9%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 R+ +A++ + + L+ + V + ++ +TN + ++ Sbjct: 78 DDRVSVVAFDDRVDVIVPSQLATSREAVIQAIGTIDDRGSTNLHGGWLEGATQVAQHLTP 137 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR---NAGMKIYSVAVSAPP 351 VI ++DG+ + + +I +R G+ ++ + + Sbjct: 138 GALNR--------VILLSDGQANVGVTDR-----REIARQVRGLTERGISTTTIGLGSHY 184 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + + LL G F V D L F++ + + RI Sbjct: 185 DEELLLAIANAGDGNFEHVEDPSRLPTFFEEELQGLTRTTGRIVS 229 >gi|108760959|ref|YP_630756.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108464839|gb|ABF90024.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 476 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 22/163 (13%), Positives = 54/163 (33%), Gaps = 13/163 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 N R+ I Y + + N + ++ + TN + +L Sbjct: 127 NDQDRLAIIHYGSDVKSLPSLEATAANRERMFQYVDGIWDEGGTNIGAGLSAGRYQLSTA 186 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + +I ++DG+ + L ++ +R G+ + ++ V Sbjct: 187 QRTYGVNR--------LILMSDGQPTEGLTADEEL--TRMARELRATGLTLSAIGVGTD- 235 Query: 352 EGQDLLRKCTDSSGQFF-AVNDSRELLESFDKITDKIQEQSVR 393 +DL++ + + + D+ +L F K + R Sbjct: 236 FNEDLMQAFAEYGAGAYGFLEDAAQLSTLFQKDLQQAGTTVAR 278 >gi|15602708|ref|NP_245780.1| hypothetical protein PM0843 [Pasteurella multocida subsp. multocida str. Pm70] gi|12721152|gb|AAK02927.1| TadG [Pasteurella multocida subsp. multocida str. Pm70] Length = 588 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 43/305 (14%), Positives = 89/305 (29%), Gaps = 42/305 (13%) Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 G + + S+ +I + + + + + + Sbjct: 271 FTTFSGGVRQRDVTEGCVFPYEGKVSTTAQYFTIPYWITGNNTPWRRGDNGRWERSTVRF 330 Query: 177 NK-----YLLPPPPKKSFWSKNTTKSKYAPAP--APANRKIDVLIESAGNLVNSIQKAIQ 229 Y NT + YA KI+ + N+ Sbjct: 331 EDHYKGYYERFDRRSCRGSGNNTKCTIYAYPKKIMDYALKINDWTT-VREIFNNYMNTEG 389 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNL-------------NEVKSRLNKLNPYENTN 276 +S G + ++ L N+ ++ LN + P T+ Sbjct: 390 TLAQISEFNGKNRNYDMVFTDEDRCLGGNIGRRTTRAWFDQKNKDITRELNIVRPSGWTS 449 Query: 277 TYPAMHHAYRELYNEK--ESSHNTIGSTRLKKFVIFITDGENSGASAYQ-NTLNTLQICE 333 + + +E + + T +++ ++ ++DGE++ + TL +C+ Sbjct: 450 ASSGLLVGANIMMDENKSPDAKPSKLGTNIQRVILVLSDGEDNWPTYSTLTTLLNNGMCD 509 Query: 334 YMRNA-------------GMKIYSVAVSA--PPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 +R G +I VA P +KC Q++ ELLE Sbjct: 510 KIREQLGKLQDPNLRELPG-RIAFVAFGYSPPANQVAAWKKCV--GDQYYTAYSKEELLE 566 Query: 379 SFDKI 383 SF +I Sbjct: 567 SFKQI 571 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 31/288 (10%), Positives = 70/288 (24%), Gaps = 44/288 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIV--------------- 45 MTA++ + I + +D ++ + ++ +D A L+ A Sbjct: 28 MTALLAFPLLVLIGFTVDGTGVVLDKARLAQGMDQAALALVAENNDYRENKKHGDVNRQV 87 Query: 46 -----------SDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 ++ K ++ I K ++ S D Sbjct: 88 VSPQDKAKFGGNEFMAKQEKRNQELIQGIAKLYLRSENANASSDAPITIDKPFHYSCEEL 147 Query: 95 KDKNNP---------LQYIAESKAQYEIPTE-NLFLKGLIPSALTNLSLRSTGIIERSSE 144 + ++ +P +L + + ++ I+ Sbjct: 148 DLPTGNEYARRKPIVCEVQGGVNRKFWLPVSESLVSADKLKQDRIRMESDTSYAIKEKGI 207 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP--------PKKSFWSKNTTK 196 + + + +V D S SM K+ + T L P+ + Sbjct: 208 VIPVELMLVSDFSGSMNSHLQDKNGRSLGKTKITILREVVSEISKILLPEDVSEGVSPFN 267 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 + R+ DV + Q G Sbjct: 268 RIGFTTFSGGVRQRDVTEGCVFPYEGKVSTTAQYFTIPYWITGNNTPW 315 >gi|32474636|ref|NP_867630.1| hypothetical protein RB7099 [Rhodopirellula baltica SH 1] gi|32445175|emb|CAD75177.1| conserved hypothetical protein-containing vWFA domain [Rhodopirellula baltica SH 1] Length = 885 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 25/286 (8%), Positives = 81/286 (28%), Gaps = 31/286 (10%) Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 ++ S + + + + + + + ++ + + Sbjct: 418 SIDVDTASYAKVRSYLQRGQLPRPDSV-RIEELINYFDYQYTPPSAEDPVPFSSAMAVAS 476 Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ---------------EKK 232 W++N + A + + + L+++ + ++ Sbjct: 477 CPWNENNRLVRVGI-QAKDIDRKERPRCNLVFLIDTSGSMKRPNKLPLVIEGMKVLLDQL 535 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 R+ + Y + ++ L+ L+ +TN + AY+ Sbjct: 536 KNRDRVAIVVYAGSSGLVLDSTPVKQKKKIIRALSALSAGGSTNGGAGLQLAYQTARENF 595 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 VI +DG+ + + L +G ++ + Sbjct: 596 IEDGVNR--------VILCSDGDFNVGMTGTDQLVAE--ATRQSKSGTELTVLGFGMGNH 645 Query: 353 GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 ++ + ++S G + V+ E + + D++ +A + Sbjct: 646 NDAMMERISNSGAGNYAFVDTIAEAKKV---LADQVAGTLFTVAKD 688 >gi|301627723|ref|XP_002943019.1| PREDICTED: complement C2-like [Xenopus (Silurana) tropicalis] Length = 678 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 40/300 (13%), Positives = 91/300 (30%), Gaps = 18/300 (6%) Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 ++ I L+ S S ++ D+ Sbjct: 86 PPGAMKTGVRYDMDNSIKYACSRGMSLVGSPH----RTCLESRRWSGTEISCQYPYSFDL 141 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 +++ + + N+ K + + D+ E Sbjct: 142 PEDVQEQFKASLSGILNIKERSASFGRTIKIKRDGILNVYFLLDASRSVGEANFDIYKEC 201 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYE-- 273 + LV+ + + + T+ I + ++ S+N V + N L + Sbjct: 202 SVYLVDELASFDMTIQFGIISYATVPKVIIPIYDEN---SDNDAHVFEVIENDLKYSDHK 258 Query: 274 ---NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 TN A+ Y + +KE+ N + +I +TDG+ + +T+ ++ Sbjct: 259 DKTGTNIKTALEEVYNMMSFQKETYKNESVWNSIHHIIILLTDGKANIGGRPADTIKHIE 318 Query: 331 ICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 ++ + +Y+ + + DL + G F + + E+ F KI D Sbjct: 319 EFLDIKKKREDYLDVYTFGIGPDVDMADLSEIASKKDGESHVFRMESANEMKTVFQKIVD 378 >gi|162457601|ref|YP_001619968.1| hypothetical protein sce9315 [Sorangium cellulosum 'So ce 56'] gi|161168183|emb|CAN99488.1| conserved hypothetical protein with a vWF type A domain [Sorangium cellulosum 'So ce 56'] Length = 617 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 18/147 (12%), Positives = 47/147 (31%), Gaps = 18/147 (12%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 +K+ ++ + TN + Y + K V+ ++DG + Sbjct: 271 RAAIKATIDGIREGGGTNIGQGLALGYAQAARPGIPEGAV-------KVVLLLSDGRANA 323 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELL 377 ++ ++ G++ + + A L+ G ++ + D ++ Sbjct: 324 G--ITSSERLSRLALDAFQGGVQTSTFGLGAD-YDGALMSAIASDGAGGYYYLRDPDQIA 380 Query: 378 ESFDKITDK-------IQEQSVRIAPN 397 + D+ E VR+ P+ Sbjct: 381 PALATELDRRLDPVATAVELRVRLKPD 407 >gi|124008506|ref|ZP_01693199.1| von Willebrand factor, type A [Microscilla marina ATCC 23134] gi|123986014|gb|EAY25864.1| von Willebrand factor, type A [Microscilla marina ATCC 23134] Length = 425 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 19/189 (10%), Positives = 69/189 (36%), Gaps = 19/189 (10%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K++ + ++ ++++++ + + Y+ I + N + ++ Sbjct: 60 DKLNYVKKAVDFVIDNLKSDDV--------LSIVQYDDEIDVVASSAKVTNKKALHEKVK 111 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + TN M Y ++ + + + + V+ ++DG + L Sbjct: 112 GIQARNMTNLSGGMMEGYAQVKSTQSNGYVNR--------VLLLSDGLANAGITAPEQLQ 163 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-QFFAVNDSRELLESFDKITDK 386 + + R AG+ + + V + + L+ ++ G ++ ++ ++ + F + + Sbjct: 164 QIAQ-KKFREAGIALSTFGVGSD-FNEVLMTNLSEYGGANYYFIDMPDKIPQIFAQELEG 221 Query: 387 IQEQSVRIA 395 + + Sbjct: 222 LLSVVAQNT 230 >gi|170672288|gb|ACB29772.1| matrilin-3 alternative transcript [Homo sapiens] Length = 444 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 57/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + + +K + ++ P Sbjct: 102 VKTFVSRIIDTL-----DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGRITPL 156 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + S+ + K I +TDG ++ Sbjct: 157 STGTMSGLAIQTAMDEAFTVEAGAR--EPSSNIPKVAIIVTDGRPQD--------QVNEV 206 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + +G+++Y+V V L+ F V +L F + Sbjct: 207 AARAQASGIELYAVGV--DRADMASLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 260 >gi|158255148|dbj|BAF83545.1| unnamed protein product [Homo sapiens] Length = 486 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 57/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + + +K + ++ P Sbjct: 102 VKTFVSRIIDTL-----DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGRITPL 156 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + S+ + K I +TDG ++ Sbjct: 157 STGTMSGLAIQTAMDEAFTVEAGAR--EPSSNIPKVAIIVTDGRPQD--------QVNEV 206 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + +G+++Y+V V L+ F V +L F + Sbjct: 207 AARAQASGIELYAVGV--DRADMASLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 260 >gi|73990549|ref|XP_542778.2| PREDICTED: similar to alpha 3 type VI collagen isoform 4 precursor [Canis familiaris] Length = 1320 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 57/166 (34%), Gaps = 23/166 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEK 292 V+ G + Y+ GI +++ E+K+ ++ + T T A+ + + Sbjct: 878 DRVQFGVVQYSDGINIQFALSQYSSMAELKAAIDDIQQRKGGTMTGEALSRMAQVFVDTA 937 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 S+ ++I ITDG+ + + E +R G+ IY++ V Sbjct: 938 RSNVPW--------YLIIITDGK--------SEDPVAEPAEALRGEGVIIYAIGV--KNA 979 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 L++ + F + L + I NR Sbjct: 980 NVMELKEIAK--DKTFFTPEFDSLKVIQRDVVQDICSS--ETCKNR 1021 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 17/154 (11%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKES 294 ++IG + ++ ++ ++ ++K+ + T T A+ + + Sbjct: 1061 IQIGLLQFSSSPQEEFRLNQYSSKADIHRAISKVVQMNDGTRTGKALTFTLPFFDSSRGG 1120 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + +++I ITDG A + +R+ + I+++ V + Sbjct: 1121 RP------NVHQYLIVITDGVAQDDVAIP--------AKALRDRNIVIFAIGVG-EAKSA 1165 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 LL+ TD + + + L KI K+ Sbjct: 1166 QLLQ-ITDDVQKVYYEENFESLQNLEKKILLKVC 1198 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 38/314 (12%), Positives = 87/314 (27%), Gaps = 29/314 (9%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 Q + + ++ + G+ L +T + + Sbjct: 535 PQYAVVMTSGKSEDEVWDAAQTLREKGVKVMSVGVQDFDRKELEGMATPPLIYEMQGEDG 594 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 ++ DVS ++ + + + + Sbjct: 595 VRQLMQDVSVVIQGTGKPQFGIASEKETRVACPMA----------IPADLVFLVEEFSRD 644 Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K + L ++ VRIG + Y+ ++ L+K Sbjct: 645 KQWNFQQVIDFLKTTVSSLNVHP--DGVRIGLVFYSEEPRLEFSLDTFQTPAKMLEHLDK 702 Query: 269 LNP---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 L T T A+ E++ E + + + + IT+G + Sbjct: 703 LTYRRRSGRTKTGAALDFLRNEVFVE---ERGSRSKQGVLQMAVVITEGFSQD------- 752 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKI 383 + +R AG+ IY+V E +D L ++ +L KI Sbjct: 753 -QLSEPASLLRRAGVTIYAVGTHRASESKD-LENIASYPPWKHVISLESFLQLSVVGSKI 810 Query: 384 TDKIQEQSVRIAPN 397 +++ ++V + + Sbjct: 811 KNQLCPETVDTSVS 824 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 58/169 (34%), Gaps = 16/169 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 S+R+G Y +V + KL P + M A + L Sbjct: 63 NVSKDSIRVGLAQYGDVPRSEFLLSTYPRKGDVLKHIQKLQPK---SWGHKMGLALQFLL 119 Query: 290 NE-KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + +++ + S + + + I++ + + ++ AG+ +Y+V V Sbjct: 120 DHHFQATAGSRASQGVPQMAMVISN--------SPAEDPVQEAAKALKRAGVLLYTVGV- 170 Query: 349 APPEGQDLLRKCTDSSGQFF--AVNDSRELLESFDKITDKIQEQSVRIA 395 L++ S + F V + +L K+ ++ + + A Sbjct: 171 -KDAVLAELKEIASSPAEKFTSFVPNFPDLGSHAQKLRQQVCDTLAKAA 218 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 29/165 (17%), Positives = 54/165 (32%), Gaps = 16/165 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + VR+G YN I + V ++ L +TNT A+ Sbjct: 266 DISSDQVRVGLAQYNDNIYPAFQLNQYPLKSVVLEQIQNLPYRTGDTNTGSALEFIRMHY 325 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E S + + VI +TDGE++ + ++ G+ +Y V Sbjct: 326 LTEAAGSRAK---DSVPQIVILVTDGESND--------EVQEAANKLKEDGVVVYV--VG 372 Query: 349 APPEGQDLLRKCTDSS-GQF-FAVNDSRELLESFDKITDKIQEQS 391 + L+K +F F + + L + I + Sbjct: 373 VNVQDVQELQKIASEPFEKFLFNIENFNILQDFSGGILQTLCSAV 417 >gi|11321565|ref|NP_002372.1| matrilin-3 precursor [Homo sapiens] gi|14548113|sp|O15232|MATN3_HUMAN RecName: Full=Matrilin-3; Flags: Precursor gi|3647275|emb|CAA12110.1| matrilin-3 [Homo sapiens] gi|62630192|gb|AAX88937.1| unknown [Homo sapiens] gi|119621242|gb|EAX00837.1| matrilin 3 [Homo sapiens] gi|146218451|gb|AAI39908.1| Matrilin 3 [Homo sapiens] Length = 486 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 57/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + + +K + ++ P Sbjct: 102 VKTFVSRIIDTL-----DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGRITPL 156 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + S+ + K I +TDG ++ Sbjct: 157 STGTMSGLAIQTAMDEAFTVEAGAR--EPSSNIPKVAIIVTDGRPQD--------QVNEV 206 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + +G+++Y+V V L+ F V +L F + Sbjct: 207 AARAQASGIELYAVGV--DRADMASLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 260 >gi|307720884|ref|YP_003892024.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978977|gb|ADN09012.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 304 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 34/228 (14%), Positives = 77/228 (33%), Gaps = 38/228 (16%) Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 D + K + +L S + + KY +++V+ E N + Sbjct: 69 DPIITKKIKAVKSNAVDIVLALDTSDSMSTYGFNEKKY------KQSRLNVVKEVVQNFI 122 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTY 278 N V+ G +PLS + K+ + +N ++T Sbjct: 123 -----------NSRVKDRIGLVVFGTTAGIASPLSFDKEAQKNIVGNINVGVLGKSTALI 171 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 + + + L S K +I ++DGE+S + + + Sbjct: 172 DGIVSSIQLL----------KNSKSKSKIIILLSDGEDSASKIPLEF-----ALKLAKKY 216 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSSG-QFFAVNDSRELLESFDKITD 385 +KIY++ + +++ + +G + F V + +L++ + I Sbjct: 217 NIKIYTITIDKSYSD--MMKVIANKNGAKNFEVQNKEDLVKVYKSIDS 262 >gi|163748339|ref|ZP_02155613.1| hypothetical protein OIHEL45_20491 [Oceanibulbus indolifex HEL-45] gi|161378385|gb|EDQ02880.1| hypothetical protein OIHEL45_20491 [Oceanibulbus indolifex HEL-45] Length = 405 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 32/212 (15%), Positives = 65/212 (30%), Gaps = 10/212 (4%) Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 KN + + P A + K+ +A +L ++ N Y Sbjct: 196 TDGKNDNEYRLQPEYASGDAKVIRTTSTASHLFYTVDAPETNGTNDGSWPNNERYFYATH 255 Query: 249 GNQCTPLSNNLNEVKS------RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 + + + + ++ N+L ++ + E Y Sbjct: 256 PFEEERM-WDEHTLRDNPRLERYANQLE-EKHLSWPEVWAEMSPEYYGYNLYGRQGNSWW 313 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-EGQDLLRKCT 361 + +S + IC + AG+ IY++ + DLL++C Sbjct: 314 SWNSRLQSFWQNMHSTIGTDEKDRRLRNICAAAQRAGIVIYAIGMDVDSQNSLDLLKECA 373 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + +F V D E+ +FD I I + Sbjct: 374 STEAHYFDV-DGLEIQTAFDMIAASISMLRLT 404 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 20/220 (9%), Positives = 47/220 (21%), Gaps = 37/220 (16%) Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP-----APAN 207 + S + +L + S A Sbjct: 1 SMAWSARGGGGAKIDLLKGAASDFVETILNDSEEDRVSISIIPYSTKVNAGEDLLEQYTV 60 Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ--------------CT 253 + + + R G + + T Sbjct: 61 TQEHDYSHCVDFNADDYTDLAISPRTELQRTGHFQFQQMSTSDPRSGQWVCRHDGGFSIT 120 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES-----------------SH 296 PLS +++++K ++ L P +T+ L Sbjct: 121 PLSKSVSDLKRQIAALTPEGSTSIDMGAKWGLALLDPSARGPVSALIASGQVDASFRERP 180 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + + K ++ +TDG+ + + +R Sbjct: 181 HPHDAENSMKVLVLMTDGK-NDNEYRLQPEYASGDAKVIR 219 >gi|307941757|ref|ZP_07657112.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307775365|gb|EFO34571.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 358 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 26/217 (11%), Positives = 66/217 (30%), Gaps = 21/217 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +TA + V + I +D + + ++ ++ +LD A L+ + +D + Sbjct: 3 ITAFVFFVLIVAIGVGVDYSRALTLKTRVLGSLDTAALAAAVEFSKLGSEQDARKAAKKA 62 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + A+ ++NI D ++ ++ + L Sbjct: 63 FDA-------------QVSQLNLHGAKLKKLNIVTDDE-TMKVSVDAVFEL-----PTTL 103 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + ++ RS + + + + I M +D + SM + + +N Sbjct: 104 MQIAGFKTLEVATRSDAVGG--GQEVILDIVMCIDATGSMGATLRSVQRNALSFEANLKN 161 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + Y + L +S Sbjct: 162 RLKELGRQVDIIRVRPIYYWDYDYDGWSRSYGLKKST 198 >gi|71983551|ref|NP_509176.2| hypothetical protein C16E9.1 [Caenorhabditis elegans] gi|34366001|gb|AAC47957.2| Hypothetical protein C16E9.1 [Caenorhabditis elegans] Length = 565 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 57/153 (37%), Gaps = 20/153 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYN 290 + + R+G + + S N +++K+ +++ T T A+ Sbjct: 423 SDNATRVGIVQFAGKTKVRVLANFSQNKSQLKTIIDRSPFYSGTTFTNQALKKMAALYEE 482 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 K + ++ TDG + +T + E +++ G+ +Y+V +S Sbjct: 483 SKRPNAKLK--------LMLFTDGY--------SAEDTSEGEEALKSQGVVVYTVGISTD 526 Query: 351 PE---GQDLLRKCTDSSGQFFAVNDSRELLESF 380 LR SS ++ +D +LL+ F Sbjct: 527 KSAGLNMKELRGMATSSEHYYDSSDFADLLKHF 559 >gi|126463435|ref|YP_001044549.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] gi|126105099|gb|ABN77777.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] Length = 651 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 45/366 (12%), Positives = 104/366 (28%), Gaps = 29/366 (7%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 A + A+ + + + + + + ++ A + Sbjct: 141 RARSAEGAAPQTFAADEAMPMAAPPAPDLALSKQAAEAPARALPQGDSEAFANAPDNPLR 200 Query: 95 KDKNNPLQY-IAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 +P+ + A Y I +L L P + + Sbjct: 201 VTAEDPVSTFSIDVDTASYAILRSSLRAGQLPPREAVRIEEMINYFPYDYPAPENGTPPF 260 Query: 153 VLDVSRSMEDLYLQKH-NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 +S + + + PP + +T+ S PA K+ Sbjct: 261 RPTLSITRTPWNPETRLVHVALQGRMPAIEDRPPLNLVFLIDTSGSMQDPA------KLP 314 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 +L +S G ++ ++ ++ + Y +N + + S L++L+ Sbjct: 315 LLKQSFGLMLGRLRPED--------QVAIVTYAGSAGEVLAPTAANQRSTILSALDRLDA 366 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 +T + AYR + T V+ TDG+ + + L L Sbjct: 367 GGSTAGDEGLALAYRTASEMAGAGEVTR--------VVLATDGDFNLGISDPEELARLVA 418 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 E R+ G+ + + ++ + L E+ + D++ Sbjct: 419 HE--RDTGVYLSVLGFGRGNLDDATMQALAQNGNG--QAAYIDSLNEAQKVLVDQLSGAL 474 Query: 392 VRIAPN 397 IA + Sbjct: 475 FPIADD 480 >gi|332667371|ref|YP_004450159.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332336185|gb|AEE53286.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 425 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 66/176 (37%), Gaps = 19/176 (10%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 KI ++A +V+++ R+ + Y+ + + N E++ R+ Sbjct: 59 DKIAYAKKAAQFIVDNLSPED--------RVSIVQYDDIVEVLSPSAPVLNKQELRQRIA 110 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + TN M Y ++ K++ V+ ++DG + L Sbjct: 111 LMEARNMTNLSGGMLAGYEQVERTKQARFVNR--------VLLLSDGLANHGITDPTVLQ 162 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-QFFAVNDSRELLESFDK 382 + + E RNAG+ + + V A + L+ ++ G ++ + ++ F + Sbjct: 163 QM-VQEKFRNAGIAVSTFGVGAD-FNELLMTSLSEYGGANYYFIESPDKIPGIFAE 216 >gi|149410544|ref|XP_001506183.1| PREDICTED: similar to protocadherin 9, partial [Ornithorhynchus anatinus] Length = 588 Score = 68.8 bits (166), Expect = 1e-09, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 54/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYREL 288 + R+G I Y + +EV+ + + T T A+ +A Sbjct: 42 DVAPDVTRVGLIQYGSTVKNEFSLKTYGRKSEVERAVKVMKRLGTGTMTGLAIQYAVNIA 101 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG ++ RN+G+ I+++ V Sbjct: 102 FSESEGARPLREN--VPRIIMIVTDGRPQDP--------VAEVAAKARNSGILIFAIGVG 151 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 152 --QVDYNTLKSIGSKPHQDHVFLVANFSQIESLTSVFQN 188 >gi|89095762|ref|ZP_01168656.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89089508|gb|EAR68615.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 432 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 29/228 (12%), Positives = 72/228 (31%), Gaps = 23/228 (10%) Query: 175 TSNKYLLPPPPKKSFWSKN-TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 +Y +++ S K+++ E+ N + Sbjct: 120 AFGEYGAEEEEQQTKNISIQIDSSGSMNGQVSGGVKMNLAKEAVENFAAGFPEDTIMTLR 179 Query: 234 LSVRIGTIAYNIGIVGNQCT-----PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 GT + T + + K+ L K P T ++ Y +L Sbjct: 180 TYGHKGTGDDKDKAMSCASTEVMYDANTYDQAAFKAALEKFKPSGWTPLAASIKAGYEDL 239 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVA 346 + + ++DG + + ++ + + ++ MK++ + Sbjct: 240 K--------KKAGEDTENILYIVSDGIET------CEGDPVKEAKALADSDLNMKVHIIG 285 Query: 347 VSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVR 393 GQD L+K ++ G+++ VN EL + +++ + + Sbjct: 286 FDVDDAGQDQLKKTAEAGNGKYYTVNSKLELTNTLNELMGEAISSIRK 333 >gi|332214177|ref|XP_003256207.1| PREDICTED: collagen alpha-1(XIV) chain [Nomascus leucogenys] Length = 1796 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 1037 VDGSWSIGDDNFNKIISFLYSTVGALNKIGTDGT----QVAMVQFTDDPRTEFKLNAYKT 1092 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ + L+ + + + K ++ ITDG + Sbjct: 1093 KETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRR-----GIPKVIVVITDGRSQ 1147 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L F V+D Sbjct: 1148 DDVN--------KISREMQLDGYSIFAIGV--ADADYSELVSIGSKPSARHVFFVDDF-- 1195 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1196 --DAFKKIEDELITFVCETAS 1214 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 57/168 (33%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + RIG Y+ + +EV + L NT T A+++ + Sbjct: 186 TAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIF 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + T + K I ITDG++ + +R +G++++++ Sbjct: 246 ENSFK-----PEAGSRTGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V + + + +T + + Sbjct: 293 GV--KNADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 >gi|149200035|ref|ZP_01877061.1| hypothetical protein LNTAR_03619 [Lentisphaera araneosa HTCC2155] gi|149136908|gb|EDM25335.1| hypothetical protein LNTAR_03619 [Lentisphaera araneosa HTCC2155] Length = 307 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 65/196 (33%), Gaps = 33/196 (16%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 S A + +V +++ G P+ Sbjct: 96 SSGSMRADFGGKNRYEVAMQAVKEF-----------TEYREGDAFGLTVFGTEYINWVPV 144 Query: 256 SNNLNEVKSRLNKLNPY------ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + + + + L P TN A+ + ++L +++ + +I Sbjct: 145 TKDTSAIALATPFLAPDRMSKWFGGTNIAKALRGSQQQLLQQEDG----------DRMII 194 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 ++DG + + +T+ + + +RN + Y + + + ++ + GQ F Sbjct: 195 LVSDGVSGSPN------DTVDMAQELRNNKIVAYCIYIGSGNGSPEMNALAAITGGQVFG 248 Query: 370 VNDSRELLESFDKITD 385 VN+ + L E+F I Sbjct: 249 VNNPKALDETFRFIDK 264 >gi|307353371|ref|YP_003894422.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] gi|307156604|gb|ADN35984.1| von Willebrand factor type A [Methanoplanus petrolearius DSM 11571] Length = 317 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 69/205 (33%), Gaps = 43/205 (20%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +I E+ G L+N + I ++ G LS + V +L Sbjct: 109 DRITAAKEAIGTLINQLDLKDYAG--------IITFDSGASTAAY--LSPDKQRVIEKLG 158 Query: 268 KLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + ++T + A + K VI ++DGE++ Sbjct: 159 MIAASDDSTAIGDGLALAVDM----------SKSIPNRKSVVILLSDGESNAGYVS---- 204 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPE----------------GQDLLRKCT-DSSGQFFA 369 E+ + +G+++++VA+ + + ++ L + G F++ Sbjct: 205 -PETAAEFAKESGVQVFTVAMGSSEKVLVGYDWANNPQYATVDEETLEYIADSTGGGFYS 263 Query: 370 VNDSRELLESFDKITDKIQEQSVRI 394 D + L + ++ D I + + Sbjct: 264 SVDEKTLGNIYSQLDDAIVHEKEKT 288 >gi|170079352|ref|YP_001735990.1| von Willebrand factor type A domain-containing protein [Synechococcus sp. PCC 7002] gi|169887021|gb|ACB00735.1| von Willebrand factor type A domain protein [Synechococcus sp. PCC 7002] Length = 414 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 68/192 (35%), Gaps = 28/192 (14%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + E+A LV+ +++ R+ IA++ + +K+++++L Sbjct: 62 TVKEAAIQLVDQLREGD--------RLSVIAFDHKAKVIVPNQDVTDKAHIKAQIDRLEA 113 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + +EL + V +TDGE + + + L+I Sbjct: 114 AGGTCIDDGIKLGLQELASSPGKRAAQ---------VFMLTDGE----NEHGDNGRCLEI 160 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ--- 388 G+ + S+ L + ++G + + L F+++ + Q Sbjct: 161 AAVAAEHGVTLNSLGFGENWNQDVLEKIADAANGSLAYIETPNQALTEFERLLQRAQSVG 220 Query: 389 ----EQSVRIAP 396 + ++++P Sbjct: 221 LTNAQLLLQLSP 232 >gi|89899605|ref|YP_522076.1| hypothetical protein Rfer_0795 [Rhodoferax ferrireducens T118] gi|89344342|gb|ABD68545.1| conserved hypothetical protein [Rhodoferax ferrireducens T118] Length = 424 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 31/309 (10%), Positives = 75/309 (24%), Gaps = 25/309 (8%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 ++++V FI A+D H+ + ++Q++ DA L+ + +I + + + + Sbjct: 26 VGLMMAVLVGFIGLALDGGHLYLTKTELQNSADACALAASYELTGAPSIAPASFARAEAA 85 Query: 62 TIFKKQIK-----KHLKQGSYIRENAGDIAQKAQINITKDKNN---PLQYIAESKAQYEI 113 Q+ S I + G I P Sbjct: 86 GQAVGQMNKVDFQNSAIASSDIVVSFGTDLSAGNAAIKWVNAGAALPSSKYVRCTITRSN 145 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN-- 171 + +S +T + + N I + + + + Sbjct: 146 IMPWFMQVLMPSLDTLTVSSLATATLAPAQNNCGIPMAIC-SKGSAPSYGMTPGQWVSGF 204 Query: 172 ------NNMTSNKYLLPPPPKKSFW------SKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + N PP + ++ Sbjct: 205 FGAGGGVTGSFNWIDFTPPAGGTSELAALLTGNGVCTLNVPTPVGEPGALGAAAAKAWNT 264 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNT 277 Q +Y ++ L++ + + NT T Sbjct: 265 RFGLYQTGSTNVTTAPPDFTGYSYTPTNWPSKANALADFLSRRSAHASYGATVSVGNTIT 324 Query: 278 YPAMHHAYR 286 ++++Y Sbjct: 325 GLGINNSYN 333 >gi|304407684|ref|ZP_07389335.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] gi|304343167|gb|EFM09010.1| von Willebrand factor type A [Paenibacillus curdlanolyticus YK9] Length = 966 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 38/265 (14%), Positives = 84/265 (31%), Gaps = 24/265 (9%) Query: 129 TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 +SL +++ N A++ + + + + + + + + Sbjct: 12 VTMSLVLGLMVQAPLVNAAVNDYVSVTKTVNPTTITTEDEAEVTLNVTGIPPANVVVPND 71 Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 AP+ K+ E+A V+ + I Sbjct: 72 VVLIIDKSGSMAPSYNNGEDKMLNAKEAAKGFVDLMDLTKHRV--------AIVDFSSSN 123 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P + N E K+ ++ +N +T T A+ A L N + + + Sbjct: 124 MIGNLPFTTNPTEAKNYIDTINANGSTATGDAIDSAIALLANHRPEAQP---------VI 174 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG------QDLLRKCTD 362 + +TDG+ + S Q ++ G+ Y++A+ + LL++ Sbjct: 175 VIMTDGDATQPSTDPYGY-AKQKALLAKDNGIIFYTIALLKSTDDPVTSGPNILLKEMAT 233 Query: 363 SSGQFFAVNDSRELLESFDKITDKI 387 +S V S L + + I +I Sbjct: 234 TSDHHHFVLGSTGLSQIYAAIVKEI 258 >gi|224077994|ref|XP_002192008.1| PREDICTED: similar to matrilin 4 [Taeniopygia guttata] Length = 580 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 58/164 (35%), Gaps = 15/164 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYR 286 + + R+G I Y+ + +++ +N + P T T A+ +A Sbjct: 63 NLDVGPNATRVGVIQYSSQVQNIFSLKTFFTRADMERAINSIIPLAQGTMTGLAIQYAMN 122 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + +E + R+ + I +TDG ++ RNAG++IY+V Sbjct: 123 VAFTTQEGARPL--HKRIPRIAIVVTDGRPQD--------RVTEVATQARNAGIEIYAVG 172 Query: 347 VSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 + + LR F V + + + DK+ Sbjct: 173 IQR--ADMNSLRAMASPPLEEHVFLVESFELIQQFAKQFQDKLC 214 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 64/183 (34%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+ ++ + R+G + Y+ + ++ +E+K + Sbjct: 360 NFELVKQFVNRIVDLLEVSPHG-----TRVGLVQYSSRVRTEFPLNKYHSADEIKKAVMD 414 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + T T A+ H ++E E + S + + + TDG + + Sbjct: 415 VEYMEKGTMTGLALKHMVEHSFSELEGARPL--SYNIPRIGLVFTDGRSQDDISEW---- 468 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 + +G+ +++V V ++ LR F D + + + Sbjct: 469 ----ARRAKESGIVMFAVGVGKAV--EEELRAIASEPVEQHFSYSADFTTMTHLVENFSL 522 Query: 386 KIQ 388 I Sbjct: 523 NIC 525 >gi|116254826|ref|YP_770662.1| hypothetical protein pRL100386 [Rhizobium leguminosarum bv. viciae 3841] gi|115259474|emb|CAK10612.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 644 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 32/318 (10%), Positives = 77/318 (24%), Gaps = 26/318 (8%) Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 E + A ++ D + + A Y +L + + Sbjct: 170 ERFANAAANPIKSVATDPVSTFSADVD-SASYAFVRRSLTGGAMPDPLSVRVEEMINYFP 228 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 V+ + + P + Sbjct: 229 YDWPGPDNAEQPFKATVTVMPTPWNHDTELMHVAIKGYDIAPATTPHANLV-----FLID 283 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 K+ +L + LVN ++ + + Y Sbjct: 284 VSGSMDEPDKLPLLKSAFRLLVNRLKPDDT--------VSIVTYAGNAGTVLTPTRVAEK 335 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +++ S +++L +T + AY V+ TDG+ + Sbjct: 336 SKILSAIDRLEAGGSTGGAEGIEAAYDLAKQGFVKDGVNR--------VMLATDGDFNVG 387 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + +I E R G+ + + L++ + + L E+ Sbjct: 388 --PSSDEDLKRIIEERRKDGIFLTVLGFGRGNLNDSLMQTLAQNGNG--SAAYIDTLAEA 443 Query: 380 FDKITDKIQEQSVRIAPN 397 + ++ IA + Sbjct: 444 QKTLVEEAGSTLFPIASD 461 >gi|294508603|ref|YP_003572662.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] gi|294344932|emb|CBH25710.1| von Willebrand factor type A domain protein [Salinibacter ruber M8] Length = 317 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 54/155 (34%), Gaps = 38/155 (24%) Query: 254 PLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + ++ L + + T A+ A L S K I Sbjct: 134 PLTLDYSFLQRMLEDVEVGAVEDGTAVGTALATAVNRL----------KDSEAESKVAIL 183 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------------ 352 +TDG N+ ++ E + G+++Y++ V + + Sbjct: 184 LTDGRNNRGQ-----IDPRTAAEVAQTMGVRVYAIGVGSSEDRDTWEEPLPQGQRDESAG 238 Query: 353 -GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 ++LR + + GQ+F+ + L + +I Sbjct: 239 VDAEMLRSVSTSTGGQYFSATNRDALERIYAEIDT 273 >gi|297300028|ref|XP_002805526.1| PREDICTED: collagen alpha-1(XIV) chain-like [Macaca mulatta] Length = 1717 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 1037 VDGSWSIGDDNFNKIISFLYSTVGALNKIGTDGT----QVAMVQFTDDPRTEFKLNAYKT 1092 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ + L+ + + + K ++ ITDG + Sbjct: 1093 KETLLDAIKHISYKGGNTKTGKAIKYVRDSLFTAESGTRR-----GIPKVIVVITDGRSQ 1147 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L F V+D Sbjct: 1148 DDVN--------KISREMQLDGYSIFAIGV--ADADYSELVSIGSKPSARHVFFVDDF-- 1195 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1196 --DAFKKIEDELITFVCETAS 1214 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 57/168 (33%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + RIG Y+ + +EV + L NT T A+++ + Sbjct: 186 TAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIF 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + T + K I ITDG++ + +R +G++++++ Sbjct: 246 ENSFK-----PEAGSRTGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V + + + +T + + Sbjct: 293 GV--KNADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTVCSRV 338 >gi|157819015|ref|NP_001100328.1| procollagen, type VII, alpha 1 [Rattus norvegicus] gi|149018489|gb|EDL77130.1| procollagen, type VII, alpha 1 (predicted) [Rattus norvegicus] Length = 2588 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 55/162 (33%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR T+ Y+ + + + +L+ NT T A+ H ++ Sbjct: 74 NAQGVRFATVQYSDDPQTEFGLDTLGSGGDTIRAIRELSYKGGNTRTGAALLHVSDRVFL 133 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + K I ITDG++ + ++ G+K+++V + Sbjct: 134 PHLTRPG------IPKVCILITDGKSQD--------LVDTAAQKLKRQGVKLFAVGI--K 177 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L I+ ++ Sbjct: 178 NADPEELKRIASQPTSDFFFFVNDFSILRTLLPLISRRVCTT 219 >gi|327540682|gb|EGF27254.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 887 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 34/332 (10%), Positives = 84/332 (25%), Gaps = 44/332 (13%) Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + E + P + S+ ++ + V Sbjct: 372 AGGLPGMVVPPTPDGEGRGPGMSGDKFEPIQENKFRRVADDDDALSTFSIDVDTASYAKV 431 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 ++ L + + Y S S A A P N ++ Sbjct: 432 RSYLQRGQLPRPDSVRIEELINYFDYQYTPPSAEDPVPFSSAMAVASCPWNENNRLVRVG 491 Query: 217 AG---------------NLVNSIQKAIQ---------------EKKNLSVRIGTIAYNIG 246 L+++ + ++ N R+ + Y Sbjct: 492 IQAKDIDRKKRPRCNLVFLIDTSGSMKRPNKLPLVIEGMKVLLDQLNKKDRVAIVVYAGS 551 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 + ++ L+ L+ +TN + AY+ Sbjct: 552 SGLVLDSTPVKQKKKIIRALSALSAGGSTNGGAGLQLAYQTARENFIEDGVNR------- 604 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-G 365 VI +DG+ + + L +G ++ + ++ + ++S G Sbjct: 605 -VILCSDGDFNVGMTGTDQLVAE--ATRQSKSGTELTVLGFGMGNHNDAMMERISNSGAG 661 Query: 366 QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + V+ E + + D++ +A + Sbjct: 662 NYAFVDTIAEAKKV---LADQVAGTLFTVAKD 690 >gi|296268803|ref|YP_003651435.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296091590|gb|ADG87542.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 607 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 68/205 (33%), Gaps = 15/205 (7%) Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 P + + S A P K+++ ++A N + + + Sbjct: 415 KPAHVLMVIDVSGSMGADVPGTGQTKLELAKQAAINALPQLG-----PNDQVGLWMFSTN 469 Query: 244 NIGIV-GNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 G + P+ NN + +K R+ L P T Y AYR + S Sbjct: 470 QDGGKDYRELVPMGRNNRDLLKKRIQGLIPGGGTGLYDTTRAAYRTVLE--------RHS 521 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 + V+ +TDG+N ++ ++ ++++++A + + L + Sbjct: 522 NDVINAVVVLTDGKNEDDNSISLEDLLAELRTETGQETVRVFTIAYGNDADLEVLRQISQ 581 Query: 362 DSSGQFFAVNDSRELLESFDKITDK 386 + + + + + F + Sbjct: 582 VTDAAAYDSREPGSIDQVFTAVLSN 606 >gi|149019071|gb|EDL77712.1| procollagen, type XII, alpha 1, isoform CRA_c [Rattus norvegicus] Length = 1721 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 53/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ + Sbjct: 71 RVQIALAQYSGDPRTEWHLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKT-- 128 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 129 ---QAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 175 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L + D +T + Sbjct: 176 EVELKMIATDPDDIHAYNVADFESLSKIVDDLTINLCNSV 215 >gi|198417199|ref|XP_002122571.1| PREDICTED: similar to MGC81791 protein, partial [Ciona intestinalis] Length = 847 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 63/200 (31%), Gaps = 22/200 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + ++ + + +LV + + R+G + Y + + Sbjct: 42 ASSSVGDQDFGRVRKWVSDLVATF-----DIGPDYTRVGVVVYAEEPEMAIALNQYTDRD 96 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + + + NT T A+ E ++ + + + I +TDG Sbjct: 97 SLIQAVGNITYLNGNTRTGKAIRFMNEESFSIANGARDI--EFGYNRLAIVLTDGRAQDN 154 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELL 377 + +N G+++Y+V VS + L + V+D + + Sbjct: 155 VFNPSL--------EAQNNGIQLYAVGVSTAVVEE--LNEIASDPDSRHVMQVDDFQAIE 204 Query: 378 ESFDKITDKIQEQSVRIAPN 397 + + I I P+ Sbjct: 205 RIRELLRQIIC--VDAICPD 222 >gi|162452621|ref|YP_001614988.1| hypothetical protein sce4345 [Sorangium cellulosum 'So ce 56'] gi|161163203|emb|CAN94508.1| hypothetical protein sce4345 [Sorangium cellulosum 'So ce 56'] Length = 521 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 58/186 (31%), Gaps = 22/186 (11%) Query: 208 RKIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 ++ +++A V+ + R + + + + + Sbjct: 121 TRLTNAVQAATTAVSRLNDGDVVSVVTFDTRTSVVVPPTTVGPET-------RGRILASV 173 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 ++ +T + L ++ ++DG+ + ++ Sbjct: 174 RGISLGGDTCISCGIEEGLSLLGQTSAGV----------SRMLVLSDGDANHGV--RDVP 221 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + + R+ G+ I ++ V + +L DS+G+ + V + L F+ + Sbjct: 222 GFRAMAQRARDRGVAITTIGVDVD-YNEKILSAIALDSNGRHYFVENDAALARIFEAEAE 280 Query: 386 KIQEQS 391 ++ Sbjct: 281 QLTTSV 286 >gi|118085865|ref|XP_418677.2| PREDICTED: similar to collagen, type XXVIII [Gallus gallus] Length = 1144 Score = 68.8 bits (166), Expect = 2e-09, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 71/206 (34%), Gaps = 26/206 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + +++ + + + RIG I ++ + Sbjct: 782 SSESVGPDNFNSTKTFMKTVIDEVSA-----NHATTRIGIINFSHKVELVSSLETYTTKE 836 Query: 261 EVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +KS ++K+ T T A+ A + + ++K + +TDG+ Sbjct: 837 SLKSAVDKMLYLGEGTYTASAIKKAISLFQAARPA---------VRKVALVVTDGQADN- 886 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAV--SAPPEGQDLLRK---CTDSSG--QFFAVND 372 ++ ++ + + A ++I+ + + P + L++ F+ + D Sbjct: 887 ---RDKVHLDLVVKEAHAANIEIFVIGIVQKTDPHYHNFLKEMHLIATDPDEEHFYQIED 943 Query: 373 SRELLESFDKITDKIQEQSVRIAPNR 398 + L DK+ KI E I + Sbjct: 944 FKTLSALADKLITKICENVSEIYSRK 969 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 58/156 (37%), Gaps = 15/156 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 D+ +L ++I + K V++ + ++ + + N+ K ++ Sbjct: 63 FDLQKNFVLSLTDNIFQMKPVKSQNYNVKLAGMQFSSTVSIDHPFTAWKNVQNFKEKIRA 122 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L T +Y A+ +A + E K +TDG + + N+ N Sbjct: 123 LVYIGQGTYSYYAISNATQLFKTEGRERSI--------KVAFLMTDGVD-----HPNSPN 169 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 I R+ G+ +++ +S ++ LR +D Sbjct: 170 VQGIATAARSLGIHFFTIGLSKKNVKEEKLRLISDD 205 >gi|291299992|ref|YP_003511270.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] gi|290569212|gb|ADD42177.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] Length = 316 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 16/151 (10%), Positives = 50/151 (33%), Gaps = 24/151 (15%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + N+VKS + + T + A + + + S ++ ++DG Sbjct: 145 TKDRNQVKSAIKSMKLDRATAIGEGIFSALQAIQSVPPDGA----SEPAPARILLLSDGY 200 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP--------------EGQDLLRKCT 361 + ++ + + + + ++A + + L + Sbjct: 201 RTSGRLVEDGAKAAKAAK------VPVSTIAFGTDTGTVEIEGETQEVPVDRETLSQTAE 254 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G+F+ +L ++ + I ++V Sbjct: 255 TTGGKFYEAASVDDLKGVYEDMGSSIGHRTV 285 >gi|255578117|ref|XP_002529928.1| protein binding protein, putative [Ricinus communis] gi|223530558|gb|EEF32436.1| protein binding protein, putative [Ricinus communis] Length = 731 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 56/185 (30%), Gaps = 24/185 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ +L + G ++ ++ + + + + + E +N Sbjct: 295 TKLALLKRAMGFVIQNLGPSDRLSVIAFSSTARRLFPLRCMTEA------GRQEALLSVN 348 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L TN + + + + +I ++DG+++ + +N Sbjct: 349 SLVSNGGTNIAEGLRKGAKVIVD--------RKWKNPVASIILLSDGQDTYTVTSPSGMN 400 Query: 328 TLQ-----ICEYMRNAG-----MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + G + ++S A + + S G F + + Sbjct: 401 PRADYKSLLPISIHRNGGTGLKIPVHSFGFGADHDAASMHSISEISGGTFSFIEAEGVIQ 460 Query: 378 ESFDK 382 ++F + Sbjct: 461 DAFAQ 465 >gi|291569213|dbj|BAI91485.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 412 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 66/181 (36%), Gaps = 23/181 (12%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ++ + ++A L++ + N+ RI +A++ + + +K +++ L Sbjct: 59 LETVKQAAKELIDRL--------NVGDRISVVAFDHRAKVLVPNQDIADPDGIKKKIDGL 110 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T + EL K+ + +TDGE + + + L Sbjct: 111 RCSGGTAIDEGLKLGIEELGKGKQDRISQG---------FLLTDGE----NEHGDNKRCL 157 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF-AVNDSRELLESFDKITDKIQ 388 ++ + + I S+ QD+L K D+ G + + + F ++ ++Q Sbjct: 158 KLAKLATEYKLTINSLGFGNDWN-QDILEKIADAGGGALGYIEYPEQAIAEFGRLFTRMQ 216 Query: 389 E 389 Sbjct: 217 S 217 >gi|291393629|ref|XP_002713393.1| PREDICTED: alpha 1 type VII collagen [Oryctolagus cuniculus] Length = 2937 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 57/162 (35%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR + Y+ + EV + +L+ NT T A+ H ++ Sbjct: 73 SAQGVRFAAVQYSDDPRTEFGLDTLASGGEVIRAIRELSYKGGNTRTGAAIRHVADHIFL 132 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 133 PQLTRPG------IPKVCILITDGKSQD--------QVDAAAQRLKGQGVKLFAVGI--K 176 Query: 351 PEGQDLLRKCTD--SSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ + FF VND L ++ ++ Sbjct: 177 NADPEELKRVASQPTGDFFFFVNDFSILRTLLPLVSRRVCTT 218 >gi|282877523|ref|ZP_06286341.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] gi|281300347|gb|EFA92698.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] Length = 332 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 50/166 (30%), Gaps = 44/166 (26%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPY--------ENTNTYPAMHHAYRELYNEKESSHNTIG 300 P++ + + + L + + T + +A L Sbjct: 138 SFTQCPMTTDHASLLNLLRNVRTDIAARGLISDGTAVGMGLANAVSRL----------KD 187 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---------- 350 S K VI ITDG N+ + + ++ G+++Y++ V Sbjct: 188 SKAKSKVVILITDGSNNMGDIS-----PMTSAQIAQSLGIRVYTIGVGTNKVAPYPMNVG 242 Query: 351 ----------PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 L + G F+ +++EL + ++ I Sbjct: 243 GTTQYVNIPVEIDSKTLSDIAAVTEGNFYRATNNKELKQIYNDIDK 288 >gi|194221273|ref|XP_001915997.1| PREDICTED: similar to Voltage-dependent calcium channel subunit alpha-2/delta-2 precursor (Voltage-gated calcium channel subunit alpha-2/delta-2) [Equus caballus] Length = 1127 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 37/341 (10%), Positives = 92/341 (26%), Gaps = 21/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K L G A +++ +D Sbjct: 143 LADAAENFQKAHRWQDNIKEEDIMYYDAK-ADAELDDPESEDVERGSKASNLRLDFVEDS 201 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L IE ++ + + + Sbjct: 202 NFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTLLWQVFGSAT 260 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + P + S + + + + ++ S Sbjct: 261 GVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 320 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T Sbjct: 321 CEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQGMVAKGTTG 376 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG N Sbjct: 377 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 423 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 424 NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 464 >gi|328712316|ref|XP_001943179.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Acyrthosiphon pisum] Length = 830 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 50/353 (14%), Positives = 101/353 (28%), Gaps = 43/353 (12%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 + D T + + I K ++ K + K QI T + Sbjct: 191 VPDIKTANEIETKISKNKLAKITHESGNKVTITWSPTVKDQIKFT-----ERGVKGQFIV 245 Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 QY++ ++ + LI + L + +LDVS SM + + Sbjct: 246 QYDVDHKSAPNQVLIDDGYF-----VHFFAPTDLKPLKTHVIFILDVSGSMNGQKITQVK 300 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 + ++ +S +N D Sbjct: 301 GAMSQILSEIDSEDFFTLILFSSLAQIWTINATQNTSNYWDDRGRN-------------- 346 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 N +G + N+ K + L P TN A++ A Sbjct: 347 --LNNFETMGENHFIFSANEQ-------NIQYAKKFIQALEPDSTTNMEDALNKALSIAK 397 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 K + K ++F+TDGE + N ++ + IYS+ Sbjct: 398 LGKM--RFKDSAKTPKPIIVFLTDGEMNEG--ITNPQALMKYVSDINVDNYPIYSLGFGK 453 Query: 350 PPEGQDLLRKCTDSSGQF----FAVNDSR-ELLESFDKITDKIQEQSVRIAPN 397 + L+K + ++ F + +D+ +L + +I+ + + Sbjct: 454 GA-DIEFLKKLSLNNTGFARVIYEASDASLQLHNFYKEISSPVLSNVTFQYVD 505 >gi|163857470|ref|YP_001631768.1| putative lipoprotein [Bordetella petrii DSM 12804] gi|163261198|emb|CAP43500.1| putative lipoprotein [Bordetella petrii] Length = 582 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 32/356 (8%), Positives = 94/356 (26%), Gaps = 28/356 (7%) Query: 43 SIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQ 102 + ++ + + + ++ + + REN + + + Sbjct: 77 AALTRQYAPQAYSAQPAAVSLLPAPSGYYAPPQAEERENYARYRDNPVVAAQEQPVSTFG 136 Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 + Y L L P + + + S Sbjct: 137 ADVD-TGSYTNVRRLLNEGRLPPPDAVRAEEFINYFDYGYATPDSRQQPFSIITEVSAAP 195 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 Q+ + P+ + + + A K+ ++ + LV Sbjct: 196 WNPQRQL----LKIGIQGYRVAPQDIPAANLVFLVDTSGSMAE-RDKLPLIKGALKQLVA 250 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMH 282 ++ R+ + Y + + + + +++L +TN + Sbjct: 251 QLRPQD--------RVAIVTYAGQASMTLDSTPGDQKARINAAIDELRAAGSTNGGAGLD 302 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 AY + ++ +DG+ + + + R G+ + Sbjct: 303 LAYAQAAKGFVKGGVNR--------ILLASDGDFNVG--ATDLEDLKDKIARQRQGGIAL 352 Query: 343 YSVAVSAPPEGQDL-LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 ++ V L ++ +G + L E+ + ++ + IA + Sbjct: 353 TTLGVGGGNFNDALAMQLADAGNGSYHY---LDSLREARKVLAAQMSSTLLTIARD 405 >gi|148694465|gb|EDL26412.1| procollagen, type XII, alpha 1, isoform CRA_a [Mus musculus] Length = 1722 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 53/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ + Sbjct: 71 RVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKT-- 128 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 129 ---QAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 175 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L + D +T + Sbjct: 176 EVELKMIATDPDDTHAYNVADFESLSKIVDDLTINLCNSV 215 >gi|145587695|ref|NP_001038174.2| anthrax toxin receptor 2a [Danio rerio] gi|141796884|gb|AAI39637.1| Anthrax toxin receptor 2a [Danio rerio] Length = 478 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 74/214 (34%), Gaps = 21/214 (9%) Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 S + T A + + + + ++ + + ++R+ I ++ Sbjct: 22 SSFKAETPSCHGAYDLYFVLDRSGSVSTDWSEIYDFVKNLTERFVSPNLRVSFIVFSSRA 81 Query: 248 VGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 PL+ + +E+ L L+ P T + + A ++ E + S Sbjct: 82 --EIVLPLTGDRSEINKGLKTLSEVNPAGETYMHEGIKLATEQMKKEPKKS--------- 130 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 ++ +TDG+ T+ + R G ++Y V V ++ L DS Sbjct: 131 SSIIVALTDGKLETYIHQL----TIDEADSARKYGARVYCVGV--KDFDEEQLADVADSK 184 Query: 365 GQFFAVN-DSRELLESFDKITDKIQEQSVRIAPN 397 Q F V + L + I + + + + P+ Sbjct: 185 EQVFPVKGGFQALKGIVNSILKQSCTEILTVEPS 218 >gi|89513613|gb|ABD74633.1| capillary morphogenesis protein 2A [Danio rerio] gi|122891370|emb|CAM13145.1| novel protein similar to vertebrate anthrax toxin receptor family protein [Danio rerio] Length = 478 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 74/214 (34%), Gaps = 21/214 (9%) Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 S + T A + + + + ++ + + ++R+ I ++ Sbjct: 22 SSFKAETPSCHGAYDLYFVLDRSGSVSTDWSEIYDFVKNLTERFVSPNLRVSFIVFSSRA 81 Query: 248 VGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 PL+ + +E+ L L+ P T + + A ++ E + S Sbjct: 82 --EIVLPLTGDRSEINKGLKTLSEVNPAGETYMHEGIKLATEQMKKEPKKS--------- 130 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 ++ +TDG+ T+ + R G ++Y V V ++ L DS Sbjct: 131 SSIIVALTDGKLETYIHQL----TIDEADSARKYGARVYCVGV--KDFDEEQLADVADSK 184 Query: 365 GQFFAVN-DSRELLESFDKITDKIQEQSVRIAPN 397 Q F V + L + I + + + + P+ Sbjct: 185 EQVFPVKGGFQALKGIVNSILKQSCTEILTVEPS 218 >gi|288925756|ref|ZP_06419687.1| BatA protein [Prevotella buccae D17] gi|315608294|ref|ZP_07883284.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] gi|288337411|gb|EFC75766.1| BatA protein [Prevotella buccae D17] gi|315250075|gb|EFU30074.1| aerotolerance protein BatA [Prevotella buccae ATCC 33574] Length = 332 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 53/161 (32%), Gaps = 44/161 (27%) Query: 254 PLSNNLNEVKSRLNKLNP--------YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P++ + + + L + + T + +A L S Sbjct: 143 PMTTDHASLLTLLQDVRTDMATRGLINDGTAIGMGLANAVSRL----------KDSKTKS 192 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---------------- 349 + VI +TDG N+ L + ++ G+++Y++AV Sbjct: 193 RVVILLTDGANNAGDIS-----PLTAAQMAKSLGIRVYTIAVGTSKVAPYPIEVGGRVQY 247 Query: 350 -----PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + + L + G F++ N++ +L + + I Sbjct: 248 ISRPADIDTKTLREIAAVTEGNFYSANNTAQLKQIYHDIDQ 288 >gi|219850594|ref|YP_002465027.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219544853|gb|ACL26591.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 958 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 53/360 (14%), Positives = 97/360 (26%), Gaps = 60/360 (16%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKK-HLKQGSYIRENAGDIAQKAQINITK 95 VL + R + T D + I + L S + + Sbjct: 298 VLLVARNAADARPLATALTAADIVAEIIAPEAAPRSLADLSAYDALVLVNTPARALPVGL 357 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + P + I E F G +L + ++I V+D Sbjct: 358 MQAIPGYVRDLGRGLLMIGGEESFGVGGYGRTAVEEALPVYMDVRNRELRPDLAIVFVID 417 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 S SM T+ S+ RKID+ + Sbjct: 418 KSGSM-------------------DACHCANPDRGGPITSSSE---------RKIDIAKD 449 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + + + ++ + +V ++ + P T Sbjct: 450 AVAQATALLSPQDTVG--------VVTFDGAAFPTFVATRGATVEQVMDAVSGVEPRGPT 501 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N + A L K +I +TDG SG + L I + Sbjct: 502 NIRAGLLRAEEMLQQV----------DARIKHMILLTDGWGSGG-------DQLDIAARL 544 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF-----DKITDKIQEQ 390 R G+ + VA + L + + G+++ D ++ + F I + I EQ Sbjct: 545 REQGITLTVVAAGSGSATY-LQQLAAEGGGRYYPAADMADVPQIFVQETITAIGNYIVEQ 603 >gi|156523168|ref|NP_001095998.1| matrilin-2 [Bos taurus] gi|146186873|gb|AAI40517.1| MATN2 protein [Bos taurus] gi|296480475|gb|DAA22590.1| matrilin 2 [Bos taurus] Length = 958 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 90 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 149 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 150 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 199 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 200 --QVDFNTLKAIGSEPHEDHVFLVANFSQIETLTSVFQK 236 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 68/187 (36%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + +++S+ + + R+G + Y+ + ++ ++K Sbjct: 669 GEDNFEIVKQFVTGIIDSLAISPKAA-----RVGLLQYSTLVRTEFTLRNFSSAKDMKKA 723 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + S R+ + I TDG + Sbjct: 724 VAHMKYMGKGSMTGLALKHMFERSFTQVEGARPL--SARVPRVAIVFTDGRAQDDVSEW- 780 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDK 382 + +G+ +Y+V V ++ L++ + F D + E DK Sbjct: 781 -------ASKAQASGITMYAVGVGKAI--EEELQEIASEPTEKHLFYAEDFSTMGEISDK 831 Query: 383 ITDKIQE 389 + I E Sbjct: 832 LQKGICE 838 >gi|268573054|ref|XP_002641504.1| C. briggsae CBR-MUA-3 protein [Caenorhabditis briggsae] gi|187031289|emb|CAP29350.1| CBR-MUA-3 protein [Caenorhabditis briggsae AF16] Length = 3770 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 52/153 (33%), Gaps = 12/153 (7%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKES 294 R+G I Y+ I + + + +++ T T A+ H +E ++E+ Sbjct: 1275 TRVGLIQYSDQIRHEFDLDQYGDRSSLLKGISETQYLTGLTRTGAAIQHMVQEGFSERRG 1334 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + I +TDG + N + R + +++ V Sbjct: 1335 ARPQQNDI--ARVAIILTDGRSQD--------NVTGPADAARKLSINTFAIGV-TDHVLA 1383 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 L S ++F V+ ++L + K Sbjct: 1384 SELESIAGSPNRWFYVDKFKDLDTRLRSMIQKA 1416 >gi|84498148|ref|ZP_00996945.1| putative secreted protein [Janibacter sp. HTCC2649] gi|84381648|gb|EAP97531.1| putative secreted protein [Janibacter sp. HTCC2649] Length = 533 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 16/159 (10%), Positives = 46/159 (28%), Gaps = 14/159 (8%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y + + + + +++L +TN + Y + + Sbjct: 225 IVTYQTDATPLLEPTPVRDTDTILAAIDRLEAGGSTNLEAGLLLGYDQAREAYKQGAT-- 282 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 V+ +DG + + G+ + +V L+ + Sbjct: 283 ------NVVLLASDGVANVGV--TDGGRLATAIRDNGRRGIHLVTVGYGMGNYSDHLMEQ 334 Query: 360 CTDSSGQFF-AVNDSRELLESFDKITDKIQEQSVRIAPN 397 D F+ ++ E + F + ++ +A + Sbjct: 335 LADQGDGFYEYIDTFEEARKLF---VEDLRATLTPVAKD 370 >gi|293348660|ref|XP_001058523.2| PREDICTED: matrilin 2 [Rattus norvegicus] Length = 922 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 58/165 (35%), Gaps = 15/165 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + ++V+ + ++ T T A+ +A Sbjct: 87 DIGPDITRVGLLQYGSTVKNEFSLKTFKRKSDVERAVKRMRHLSTGTMTGLAIQYALNIA 146 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ RN G+ I+++ V Sbjct: 147 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVASKARNTGILIFAIGVG 196 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQS 391 + L+ F V + ++ +K+ + Sbjct: 197 --QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQNKLCSKY 239 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ +++S+ + + R+G + Y+ + ++ ++K Sbjct: 653 GEENFEIVKHFVTGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLKGFSSAKDMKKA 707 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H Y + + E + STR+ + I TDG + Sbjct: 708 VAHMKYMGKGSMTGLALKHMYERSFTQVEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 764 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ +Y+V + ++ L++ F D + E +K Sbjct: 765 -------ARKAKANGITMYAVGIGKAI--EEELQEIASEPIDKHLFYAEDFSTMGEISEK 815 Query: 383 ITDKIQE 389 + I E Sbjct: 816 LKLGICE 822 >gi|229825155|ref|ZP_04451224.1| hypothetical protein GCWU000182_00506 [Abiotrophia defectiva ATCC 49176] gi|229790527|gb|EEP26641.1| hypothetical protein GCWU000182_00506 [Abiotrophia defectiva ATCC 49176] Length = 1109 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 36/295 (12%), Positives = 91/295 (30%), Gaps = 29/295 (9%) Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS-RSMEDLYLQKHNDNNNMTSN 177 + ++ + G + E + + +V+D S +M Sbjct: 58 ITAKNYNNGKFDVEMVVKGDGSTTIEQKNLDVVLVVDRSYDNMRKNGRMAAAKAEAAKLV 117 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPA------NRKIDVLIESAGNLVNS-------- 223 Y L K + +P+P + D L + + Sbjct: 118 DYFLQSGNNKIRVGLVSFAGNNGGSPSPVLGVTQLTQDADELKNAIRGYNTAGWNNSPVL 177 Query: 224 --------IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + KA + + I G L+ + N+ + L+ P E Sbjct: 178 REAFTQAGLIKANEMFGASNTNKKIIVLISGGAPTISYGLTLDFNQREEALSN-TPKEGY 236 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + + ++ + ++ + + + NT+ + Sbjct: 237 EIWSLTRWLDPKYHSSWKKGYHKSNEWTPYYVKLVKGLIDVPMGKTAEVKTNTIAEANKI 296 Query: 336 R-NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + ++G++I+SV + A + ++L+K + V++ L + DK+ + ++E Sbjct: 297 KTDSGVEIFSVGIKADGDAAEVLKKIAS--DNRYYVDNI--LKDVIDKVIESLKE 347 >gi|225418703|ref|ZP_03761892.1| hypothetical protein CLOSTASPAR_05927 [Clostridium asparagiforme DSM 15981] gi|225041758|gb|EEG52004.1| hypothetical protein CLOSTASPAR_05927 [Clostridium asparagiforme DSM 15981] Length = 1360 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 15/141 (10%), Positives = 46/141 (32%), Gaps = 11/141 (7%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 T + + ++S + TN + + H + Sbjct: 592 NSSVQGLTDFTVQPDVIRSTAESMTANGGTNIFDTVVHGLESFP---------KNGPEVL 642 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 ++ ++DG+ + +A+ I + ++ + ++ + + + + L + G Sbjct: 643 NTLVVMSDGQEN--NAHSAEEIQTAIGQAAKDKSILVHCLGLGSEVDANYLQTIAQSAGG 700 Query: 366 QFFAVNDSRELLESFDKITDK 386 + V DS L + + + Sbjct: 701 TYQYVTDSSSLAVFYQNLASQ 721 >gi|56797861|emb|CAG27403.1| matrilin-3b [Danio rerio] gi|220675930|emb|CAX12089.1| matrilin 3b [Danio rerio] Length = 299 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 58/164 (35%), Gaps = 15/164 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + + R+ + Y + + EVK ++++P T T A+ A ++ Sbjct: 100 DIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQV 159 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E + + K I +TDG ++ R +G++IY+V V Sbjct: 160 FTENAGARPLK--KGIGKVAIIVTDGRPQD--------KVEEVSAAARASGIEIYAVGV- 208 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 L++ F V + + K + + E+ Sbjct: 209 -DRAEMRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRETLCEE 251 >gi|308050346|ref|YP_003913912.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] gi|307632536|gb|ADN76838.1| von Willebrand factor type A [Ferrimonas balearica DSM 9799] Length = 322 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 24/203 (11%), Positives = 62/203 (30%), Gaps = 49/203 (24%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + ++ + + R+G I + PL+ + V Sbjct: 107 VVDRFTMVRHVLSDFIER---------RDGDRLGLILFADQAYLQA--PLTFDRFAVARF 155 Query: 266 LNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 L++ L T A+ + + ++ + ++ +TDGEN+ Sbjct: 156 LDEAVLGLVGQQTAIGDAIALGVKRFNDLEQ----------SSRVLVLLTDGENNAGRFT 205 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSA--------------------PPEGQDLLRKCTD 362 Q R +G+K+Y++ + + + ++ Sbjct: 206 -----PAQAVSLARQSGVKLYTIGIGSAEIRRRGLLGTRTVNPSSDLDQAEKSFIQLSES 260 Query: 363 SSGQFFAVNDSRELLESFDKITD 385 + G++F + EL + ++ Sbjct: 261 TGGRYFRARSTEELESIYQELDQ 283 >gi|47207527|emb|CAF87062.1| unnamed protein product [Tetraodon nigroviridis] Length = 409 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 24/207 (11%), Positives = 64/207 (30%), Gaps = 31/207 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + D + N+ + + + ++G + Y+ +E Sbjct: 19 SWSVGVSDFDTAKQWLINITSQF-----DISSHYTQVGVVQYSDAPRLEIPLGKHQGQDE 73 Query: 262 VKSRLNKLNPYENTNTY--------PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 + + ++ PA+ A +++ + + + + +TD Sbjct: 74 LIRAIQSISYLGGNTQARRRSERHLPAIKFAVDHVFSSSQRASQVKN-----RIAVVVTD 128 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF--AVN 371 G++ R G+ +++V V + +L+ + + Sbjct: 129 GKSQDDVVD--------ASMEARTQGVTVFAVGVGSEITTSELI-AIANKPSSTYVLYAE 179 Query: 372 DSRELLESFDKITDKIQEQSVRIAPNR 398 D + D + K+ E+SV P R Sbjct: 180 DYTTIHHIRDAMEQKLCEESV--CPTR 204 >gi|257052678|ref|YP_003130511.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691441|gb|ACV11778.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 1100 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 46/377 (12%), Positives = 104/377 (27%), Gaps = 39/377 (10%) Query: 31 SALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQ 90 + +A V S + T + L E + + Sbjct: 333 ATTEATVTVVARDESSGEEVS--KTITLDAPGFADEVYDIELTDPESGAEISVTGEGIVK 390 Query: 91 INITKDKNNP----------------LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + D + + + + G + + Sbjct: 391 SDFVVDPVPAEENRSFYAGPFIHIRNFSDFESATVEMPLDDDVDPSDGNLSVYKWDQHDE 450 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + ++ + S S ++ + ++ + P + + T Sbjct: 451 KPWHAVETDVHVENGTAVATVDSFSYFSVFWVDNWNDAITDTVNLAEHPE----YVANET 506 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 S A + + + + K R +++ G Q Sbjct: 507 EGSIEPIDLAFVIDESGSMGGARIQDAKASAKRFVGGLYEDDRAALVSFAGGATLGQ--S 564 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 L+ + V + +++LN TNT + A EL + E + +I + DG Sbjct: 565 LTTDHGAVNASIDQLNAGGGTNTGAGLQKAVDELTSNGEGDT---------QEIILLADG 615 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 + + I + + I ++ + + Q+L + G+F+ V+DS Sbjct: 616 GTGLG------PDPVTIAQTADEHRITINTIGMGTGIDAQELTSIADATGGEFYQVSDSS 669 Query: 375 ELLESFDKITDKIQEQS 391 EL E FD++ Sbjct: 670 ELPEVFDRVEQNRISLV 686 >gi|288942396|ref|YP_003444636.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288897768|gb|ADC63604.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 346 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 54/162 (33%), Gaps = 37/162 (22%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G +PL+ + + + L+ + P T A+ +L E S Sbjct: 151 GSQAFILSPLTLDRHAARQLLDGVVPSIAGPATALGDAIALGVSKLRERPEGS------- 203 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------- 352 + +I I DG+N+ S + R G +IY + V + Sbjct: 204 ---RVMIVIADGDNNAGSF-----APKEAARLARATGTRIYVIGVGSKQPSIPILEEGSV 255 Query: 353 --------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L++ D + G +F D+R L E +I Sbjct: 256 RYRDDLTMDEGTLQEIADLTGGGYFRATDTRALEEISSRIGQ 297 >gi|194291603|ref|YP_002007510.1| hypothetical protein RALTA_B0837 [Cupriavidus taiwanensis LMG 19424] gi|193225507|emb|CAQ71453.1| conserved hypothetical protein, Von Willebrand factor type A domain (vwa), putative membrane protein [Cupriavidus taiwanensis LMG 19424] Length = 353 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 26/229 (11%), Positives = 69/229 (30%), Gaps = 66/229 (28%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +I ++A +LV + + VR+G +++ P ++N ++ + Sbjct: 106 NRISAAQQAARDLVVGLPAS--------VRLGIVSFAGTAAV--VLPPTDNRQDMLDAIE 155 Query: 268 KLNPYENTNTYPAMHHAYRELYNE-----------------------------------K 292 + T T + A L+ E + Sbjct: 156 RFQLQRGTATGSGLFQALAVLFPEDGIDLEVILFGSRSDRAGRGTSLDEAAAADAARRRE 215 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + VI ++DG + + L G+++Y+V + Sbjct: 216 QGQQAAQPGSYRHGAVILLSDGRRTTG------PDPLDAARMAAQRGVRVYTVGFGSQQV 269 Query: 353 --------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 + LR + G+++ + +L + + +++ + Sbjct: 270 TSAPESSLSYFMQLDEPALRAVASITGGEYYHAGSAADLSQVYRQLSAR 318 >gi|73958268|ref|XP_848527.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 2 [Canis familiaris] Length = 1153 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 54/181 (29%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQ--EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 E +++ + + S N + + +L Sbjct: 171 KEFVSTVMDQFKNSKTLFSLMQFSEDFQIHFTFNEFKKNPKPSF------LVKSIKQL-- 222 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T+T + REL++ + K ++ ITDGE Y + L+ + Sbjct: 223 LGRTHTATGIRKVVRELFHSSSGAR-----ENALKILVVITDGE-----KYGDPLDYKDV 272 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDK 386 G+ Y + V ++ L F VN+ L +++ +K Sbjct: 273 IPEADREGIIRYVIGVGDAFNHLKNREELNIIASKPPRDHVFRVNNFEALKTIQNQLQEK 332 Query: 387 I 387 I Sbjct: 333 I 333 >gi|73958266|ref|XP_856286.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 3 [Canis familiaris] Length = 789 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 54/181 (29%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQ--EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 E +++ + + S N + + +L Sbjct: 171 KEFVSTVMDQFKNSKTLFSLMQFSEDFQIHFTFNEFKKNPKPSF------LVKSIKQL-- 222 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T+T + REL++ + K ++ ITDGE Y + L+ + Sbjct: 223 LGRTHTATGIRKVVRELFHSSSGAR-----ENALKILVVITDGE-----KYGDPLDYKDV 272 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDK 386 G+ Y + V ++ L F VN+ L +++ +K Sbjct: 273 IPEADREGIIRYVIGVGDAFNHLKNREELNIIASKPPRDHVFRVNNFEALKTIQNQLQEK 332 Query: 387 I 387 I Sbjct: 333 I 333 >gi|73958270|ref|XP_856370.1| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 4 [Canis familiaris] Length = 1036 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 54/181 (29%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQ--EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 E +++ + + S N + + +L Sbjct: 171 KEFVSTVMDQFKNSKTLFSLMQFSEDFQIHFTFNEFKKNPKPSF------LVKSIKQL-- 222 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T+T + REL++ + K ++ ITDGE Y + L+ + Sbjct: 223 LGRTHTATGIRKVVRELFHSSSGAR-----ENALKILVVITDGE-----KYGDPLDYKDV 272 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDK 386 G+ Y + V ++ L F VN+ L +++ +K Sbjct: 273 IPEADREGIIRYVIGVGDAFNHLKNREELNIIASKPPRDHVFRVNNFEALKTIQNQLQEK 332 Query: 387 I 387 I Sbjct: 333 I 333 >gi|73958264|ref|XP_547048.2| PREDICTED: similar to Integrin alpha-M precursor (Cell surface glycoprotein MAC-1 alpha subunit) (CR-3 alpha chain) (CD11b) (Leukocyte adhesion receptor MO1) (Neutrophil adherence receptor) isoform 1 [Canis familiaris] Length = 1165 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 54/181 (29%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQ--EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 E +++ + + S N + + +L Sbjct: 171 KEFVSTVMDQFKNSKTLFSLMQFSEDFQIHFTFNEFKKNPKPSF------LVKSIKQL-- 222 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T+T + REL++ + K ++ ITDGE Y + L+ + Sbjct: 223 LGRTHTATGIRKVVRELFHSSSGAR-----ENALKILVVITDGE-----KYGDPLDYKDV 272 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDK 386 G+ Y + V ++ L F VN+ L +++ +K Sbjct: 273 IPEADREGIIRYVIGVGDAFNHLKNREELNIIASKPPRDHVFRVNNFEALKTIQNQLQEK 332 Query: 387 I 387 I Sbjct: 333 I 333 >gi|262403351|ref|ZP_06079911.1| protein BatA [Vibrio sp. RC586] gi|262350850|gb|EEY99983.1| protein BatA [Vibrio sp. RC586] Length = 248 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 52/154 (33%), Gaps = 36/154 (23%) Query: 253 TPLSNNLNEVKSRLNK--LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TPL+ + V +LN+ L T + A + + ++ +I Sbjct: 73 TPLTLDRQTVTEQLNQAVLKLIGTQTAMGEGIGLATKTFIDSA----------APQRVMI 122 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PE 352 ++DG N+ L+ L+ + IY+V V A Sbjct: 123 LLSDGSNTAGV-----LDPLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQDL 177 Query: 353 GQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + GQ+F + ++L +D I Sbjct: 178 DEKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 211 >gi|86740090|ref|YP_480490.1| von Willebrand factor, type A [Frankia sp. CcI3] gi|86566952|gb|ABD10761.1| von Willebrand factor, type A [Frankia sp. CcI3] Length = 319 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 26/220 (11%), Positives = 66/220 (30%), Gaps = 31/220 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 +++ S A A ++ + A V+ + + +G +++ Sbjct: 85 ERATIILAIDVSNSMAATDIAPTRLAAAKQGASAFVDQLP--------PRINLGLVSFAG 136 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P S + V++ + L T + + + + + G + Sbjct: 137 SATV--LVPASADRESVRAGIRGLQLGPATAVGEGIFASLQAITTAG-KRFSDTGQSAPP 193 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DGE + Q E R A + + ++A Sbjct: 194 AAIVLLSDGETTRGRPNN------QAIEAARQARIPVDTIAYGTADGTLDVGGQEVPVPV 247 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQS 391 + LR + + G + EL + + I ++ Sbjct: 248 NEQALRDIAEQTGGSYHRATSGDELRSVYRGLGSSIGYRT 287 >gi|293360567|ref|XP_216941.5| PREDICTED: matrilin 2 [Rattus norvegicus] Length = 900 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ +++S+ + + R+G + Y+ + ++ ++K Sbjct: 625 GEENFEIVKHFVTGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLKGFSSAKDMKKA 679 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H Y + + E + STR+ + I TDG + Sbjct: 680 VAHMKYMGKGSMTGLALKHMYERSFTQVEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 736 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ +Y+V + ++ L++ F D + E +K Sbjct: 737 -------ARKAKANGITMYAVGIGKAI--EEELQEIASEPIDKHLFYAEDFSTMGEISEK 787 Query: 383 ITDKIQE 389 + I E Sbjct: 788 LKLGICE 794 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + ++V+ + ++ T T A+ +A Sbjct: 87 DIGPDITRVGLLQYGSTVKNEFSLKTFKRKSDVERAVKRMRHLSTGTMTGLAIQYALNIA 146 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ RN G+ I+++ V Sbjct: 147 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVASKARNTGILIFAIGVG 196 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 197 --QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQN 233 >gi|149066552|gb|EDM16425.1| matrilin 2 (predicted) [Rattus norvegicus] Length = 898 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 66/187 (35%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ +++S+ + + R+G + Y+ + ++ ++K Sbjct: 625 GEENFEIVKHFVTGIIDSLAVSPKAA-----RVGLLQYSTQVRTEFTLKGFSSAKDMKKA 679 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H Y + + E + STR+ + I TDG + Sbjct: 680 VAHMKYMGKGSMTGLALKHMYERSFTQVEGARPL--STRVPRAAIVFTDGRAQDDVSEW- 736 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ +Y+V + ++ L++ F D + E +K Sbjct: 737 -------ARKAKANGITMYAVGIGKAI--EEELQEIASEPIDKHLFYAEDFSTMGEISEK 787 Query: 383 ITDKIQE 389 + I E Sbjct: 788 LKLGICE 794 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + ++V+ + ++ T T A+ +A Sbjct: 87 DIGPDITRVGLLQYGSTVKNEFSLKTFKRKSDVERAVKRMRHLSTGTMTGLAIQYALNIA 146 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ RN G+ I+++ V Sbjct: 147 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVASKARNTGILIFAIGVG 196 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 197 --QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQN 233 >gi|126657060|ref|ZP_01728231.1| von Willebrand factor type A domain protein [Cyanothece sp. CCY0110] gi|126621603|gb|EAZ92313.1| von Willebrand factor type A domain protein [Cyanothece sp. CCY0110] Length = 328 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 32/193 (16%), Positives = 71/193 (36%), Gaps = 33/193 (17%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 +++ + + + RIG I + P + +L + Sbjct: 116 GNKIDRLEAVKLVVDDFIER---------REGDRIGLILFGTKAYLQ--VPFTQDLETAR 164 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT-IGSTRLKKFVIFITDGENSGASAY 322 L++ M A L + + T S + +I +TDG ++G+ Sbjct: 165 FLLDE--------AQIGMAGAQTMLGDAIGLAIQTFEDSKTENRVLILLTDGNDTGSQVP 216 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRE 375 + Q ++ + IY++A+ P ++ L+ D + GQFF +D + Sbjct: 217 PD-----QAAKFAAQRNIVIYTIAIGNPETEGTEKIDEETLQLIADQTGGQFFRGSDRQG 271 Query: 376 LLESFDKITDKIQ 388 L++ + I D+++ Sbjct: 272 LIQIY-DILDQLE 283 >gi|327481077|gb|AEA84387.1| von Willebrand factor type A domain-containing protein [Pseudomonas stutzeri DSM 4166] Length = 339 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 54/147 (36%), Gaps = 35/147 (23%) Query: 253 TPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 +PL+ + V+ L++ + NT A+ A + L +S + ++ Sbjct: 149 SPLTFDRRTVRVWLDEASVGIAGSNTAIGDAIGLALKRLRERPANS----------RVLV 198 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------G 353 +TDG N+G + L ++I+++ + A PE Sbjct: 199 LVTDGANNGG-----EIEPLLAATLAAEENVRIHTIGIGAVPEEGGVLSRFGFNPGLDLD 253 Query: 354 QDLLRKCTD-SSGQFFAVNDSRELLES 379 + LR + + G++F S +L Sbjct: 254 EPTLRAIAEQTGGEYFRAASSEQLQAI 280 >gi|297285706|ref|XP_001090735.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Macaca mulatta] Length = 1417 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 95/350 (27%), Gaps = 24/350 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + + +D + + L G A Sbjct: 394 LDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGFKAST 453 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L +E ++ + Sbjct: 454 LRLDFIEDPNFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTL 512 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 513 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 572 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 573 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 628 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + K ++ TDG N Sbjct: 629 GMVAKGTTGYKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYN 682 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 683 WP-------NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 725 >gi|301770299|ref|XP_002920606.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Ailuropoda melanoleuca] Length = 1081 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 95/350 (27%), Gaps = 24/350 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + + +D + + + L G Sbjct: 62 LDRKVQALKRLADAAENFQKAHRWQDNIKEEDILYYDAKAEAELDDPESEDVERGSKVST 121 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L IE ++ + Sbjct: 122 LRLDFIEDSNFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTL 180 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 181 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 240 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 241 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 296 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + K ++ TDG N Sbjct: 297 GMVAKGTTGYKAGFEYAFDQLQNSNITRA------NCNKVIMMFTDGGEDRVQDVFEKYN 350 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 351 WP-------NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 393 >gi|281338317|gb|EFB13901.1| hypothetical protein PANDA_009310 [Ailuropoda melanoleuca] Length = 1046 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 95/350 (27%), Gaps = 24/350 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + + +D + + + L G Sbjct: 60 LDRKVQALKRLADAAENFQKAHRWQDNIKEEDILYYDAKAEAELDDPESEDVERGSKVST 119 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L IE ++ + Sbjct: 120 LRLDFIEDSNFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFIENRRQDPTL 178 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 179 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 238 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 239 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 294 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + K ++ TDG N Sbjct: 295 GMVAKGTTGYKAGFEYAFDQLQNSNITRA------NCNKVIMMFTDGGEDRVQDVFEKYN 348 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 349 WP-------NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 391 >gi|312110043|ref|YP_003988359.1| hypothetical protein GY4MC1_0934 [Geobacillus sp. Y4.1MC1] gi|311215144|gb|ADP73748.1| Ig domain protein group 2 domain protein [Geobacillus sp. Y4.1MC1] Length = 932 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 24/262 (9%), Positives = 60/262 (22%), Gaps = 17/262 (6%) Query: 146 LAISICMVLDVSRSM-EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA-P 203 I + V DVS SM ++ Y ++ Sbjct: 78 PPIDVVFVFDVSGSMVMPSLKLDSAKYALQSAVDYFKANANPNDRFALVPFSDGVQSDKV 137 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR---IGTIAYNIGIVGNQCTPLSNNLN 260 P + + + + +N L++ + Sbjct: 138 VPFPSGTYDVKQHLNWIATVANSLRANGGTNYTQALQQAQSFFNDPARKKYIIFLTDGMP 197 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 V + K ++ + + I+ D + Sbjct: 198 TV--SIAKEPITYKVCEGILFWKTCNQVTENLNVQYVLYSNGITAARTIYYPDHPETKTY 255 Query: 321 AYQNTLN---------TLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSSGQFFAV 370 + + + + + + +YS+ E L K + ++G Sbjct: 256 SDREKYREFEEKIRLHGTNVAKTLGMNNITLYSIGFGNNQEVDMGYLEKLSSTAGGQAKK 315 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + L E F + + + + Sbjct: 316 GTPQNLTEIFQQFSKLANDPVL 337 >gi|261820223|ref|YP_003258329.1| hypothetical protein Pecwa_0904 [Pectobacterium wasabiae WPP163] gi|261604236|gb|ACX86722.1| conserved hypothetical protein [Pectobacterium wasabiae WPP163] Length = 539 Score = 68.4 bits (165), Expect = 2e-09, Method: Composition-based stats. Identities = 44/444 (9%), Positives = 101/444 (22%), Gaps = 100/444 (22%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 A+ + + +D + Q++ A DAA L+ + ++ + Sbjct: 41 ALGAMALLVTAAFIVDTSTATGDATQIKRATDAAALAVGHQATING--------EEYSQE 92 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA------------- 109 +++K + + + +++T+ +N+ + Sbjct: 93 DINTLAYEYVKSNLGMNKALSEKLVAGDVSVTEGRNSATRKTYTVTVAFATKPSLLSLGA 152 Query: 110 ----------------QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM- 152 + + S S + Sbjct: 153 RAQEVYSTSEVINRPTEVALVMPVTGDMSAGDIRSLKSVSHSFVERLLGSADTKRDNLWL 212 Query: 153 -VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT---------------- 195 ++ S+S+ + N + L PP + F S + Sbjct: 213 SLVPYSQSVNVYDAEDANRIRRWAAPGALNPPELRSLFASGVVSSLADRRFPDRRANLLC 272 Query: 196 -KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI--------- 245 + V + Q +S R ++ Sbjct: 273 MYRGLGREENFFWDEPPVGQFRVYYRHDLPQNGSPGAPPISWRGPNPDFDDNDAVDTRWI 332 Query: 246 ----GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 G PL+N +++ R+++ + N N M A L S + Sbjct: 333 VADKGCPNAALMPLTNEASKLNQRIDQFSARFNVNYAIGMSWAGAALSPNMRGSDGWGDT 392 Query: 302 TRL---------KKFVIFITDGENSGASAYQNTLN----------------------TLQ 330 T + + G ++ N Sbjct: 393 TLPLDFNLDGNGDGQKVIVMMANTIGDWFDTDSYNFNRNEFRGGTGNDLARTFAAQRFRD 452 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQ 354 +C R +K Y V + Sbjct: 453 LCSSFRARNIKFYFVGIRPGDPED 476 >gi|239926966|ref|ZP_04683919.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] gi|291435315|ref|ZP_06574705.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] gi|291338210|gb|EFE65166.1| lipoprotein [Streptomyces ghanaensis ATCC 14672] Length = 527 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 40/291 (13%), Positives = 84/291 (28%), Gaps = 27/291 (9%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 A Y L L A + + +D +R+ ED + Sbjct: 90 TASYGYARRALAEGRLPDPATVRPEEFVNSFRQEYERPGGDGFTVTVDGARTDEDGWSLV 149 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 ++ PP +F ++D+ +S + + ++ Sbjct: 150 RVGLATRSAAPGGERPPAALTFVVDT-------SGSMAEPGRLDLARKSLAAMTDRLRDD 202 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + + ++ L N +EV +++L P +TN + Y Sbjct: 203 DS--------VALVTFSDEAETVLPMTRLGGNRDEVHDAVDRLEPDRSTNLGAGVEAGYE 254 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 V+ ++D + + +I R G+ ++ V Sbjct: 255 TAVEGLREGATNR--------VVLVSDALANTGETDAD-AILERISRSRRAHGITLFGVG 305 Query: 347 VSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 V + G L+ + D G V+ + E F + + E + R A Sbjct: 306 VGSD-YGDALMERLADRGDGHTVYVSGEEDAREVFGERLPRAVELTARDAK 355 >gi|296225305|ref|XP_002758277.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Callithrix jacchus] Length = 1251 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 36/341 (10%), Positives = 92/341 (26%), Gaps = 21/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K L G A +++ +D Sbjct: 107 LADAAENFQKAHHWQDNIKEEDILYYDAK-ADAELDDPESEDVERGSKASTLRLDFIEDP 165 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L +E ++ + + + Sbjct: 166 NFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTLLWQVFGSAT 224 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + P + S + + + + ++ S Sbjct: 225 GVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 284 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T Sbjct: 285 CEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQGMVAKGTTG 340 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG N Sbjct: 341 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 387 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 388 NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 428 >gi|17554334|ref|NP_498645.1| MUscle Positioning family member (mup-4) [Caenorhabditis elegans] gi|15789306|gb|AAA28092.5| Muscle positioning protein 4 [Caenorhabditis elegans] Length = 2104 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 58/176 (32%), Gaps = 14/176 (7%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + V + + + R+ + Y+ I ++ ++ Sbjct: 444 GSGSIGSYVFQTEVLRFLAEFTEL-FDIAPQKTRVSVVQYSDQIRHEFGLDNYSDRKSLQ 502 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + + T T A+ H E ++E+ + ++ + I ITDG + Sbjct: 503 NAIRNIEYLTGLTRTGAAIEHVANEAFSERRGARPVG---QVSRVAIVITDGRSQDNVTR 559 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + R +++++V V L + + S + F V+ +L Sbjct: 560 PS--------DNARRQDIQLFAVGV-TNHVLDAELEEISGSKDRTFHVSGFEDLNT 606 >gi|14579227|gb|AAK69172.1|AF289202_1 transmembrane matrix receptor MUP-4 [Caenorhabditis elegans] Length = 2104 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 58/176 (32%), Gaps = 14/176 (7%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + V + + + R+ + Y+ I ++ ++ Sbjct: 444 GSGSIGSYVFQTEVLRFLAEFTEL-FDIAPQKTRVSVVQYSDQIRHEFGLDNYSDRKSLQ 502 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + + T T A+ H E ++E+ + ++ + I ITDG + Sbjct: 503 NAIRNIEYLTGLTRTGAAIEHVANEAFSERRGARPVG---QVSRVAIVITDGRSQDNVTR 559 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + R +++++V V L + + S + F V+ +L Sbjct: 560 PS--------DNARRQDIQLFAVGV-TNHVLDAELEEISGSKDRTFHVSGFEDLNT 606 >gi|149199796|ref|ZP_01876826.1| hypothetical protein LNTAR_23599 [Lentisphaera araneosa HTCC2155] gi|149137084|gb|EDM25507.1| hypothetical protein LNTAR_23599 [Lentisphaera araneosa HTCC2155] Length = 333 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/101 (23%), Positives = 48/101 (47%), Gaps = 13/101 (12%) Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLR 358 K +I +TDG ++G + + + + R+ G+ I++VAV P + LR Sbjct: 198 DKVLILLTDGNDTG-----SLVAPEKAAQIARDKGIVIHTVAVGDPAAAGEQALDEATLR 252 Query: 359 KCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + + G+++ + EL +D+I DKI + + +R Sbjct: 253 SISSLTKGKYYWAGNREELAGIYDEI-DKIGVRELDTVSHR 292 >gi|6469599|gb|AAF13350.1|AF121336_1 unknown [Eufolliculina uhligi] Length = 494 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 26/213 (12%), Positives = 61/213 (28%), Gaps = 22/213 (10%) Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 T N + S + KI ++ + +V + A Sbjct: 70 PCTINLESPAQTSEASRSGVDIVCVIDVSGSMQGE-KIQLVQTTLNFMVERLSPAD---- 124 Query: 233 NLSVRIGTIAYNIGIV--GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 RI I+++ ++KS + +L TN + + + L Sbjct: 125 ----RICLISFSNDATKISRLVQMSPKGKKQLKSMIPRLVASGGTNIVGGLEYGLQAL-- 178 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + +I ++DG+++ + + + +++ Sbjct: 179 ------RQRRTINQLSSIILLSDGQDNNGTTVLQRAKA-TMDSIVIRDDYSVHTFGYGHG 231 Query: 351 PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 LL + +G F+ V D + +F Sbjct: 232 H-DSTLLNALAEPKNGAFYYVKDEETIATAFAN 263 >gi|87308177|ref|ZP_01090319.1| hypothetical protein DSM3645_21307 [Blastopirellula marina DSM 3645] gi|87289259|gb|EAQ81151.1| hypothetical protein DSM3645_21307 [Blastopirellula marina DSM 3645] Length = 1032 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 30/140 (21%), Positives = 60/140 (42%), Gaps = 17/140 (12%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N +++ KL+ TN P + +R+L K +I ++DG+ Sbjct: 513 DNPGMFVAQVRKLSASGGTNMTPGVALGFRDLQ----------NVDAGVKHMIVLSDGQT 562 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 N QI M+ GM + +VAV + + + + + G+F+AVN+ + + Sbjct: 563 EPG-------NVAQIASDMKKMGMTVSAVAVGSDADQKLMATVARNGGGKFYAVNNPKAI 615 Query: 377 LESFDKITDKIQEQSVRIAP 396 F + ++ + V+ AP Sbjct: 616 PRIFMREARRVAQPLVKEAP 635 >gi|326916308|ref|XP_003204450.1| PREDICTED: collagen alpha-1(XII) chain-like, partial [Meleagris gallopavo] Length = 902 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 55/166 (33%), Gaps = 18/166 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G + Y+ +++ + ++ NT T A+ + Sbjct: 167 SAFDIGEEKTRVGVVQYSSDTRTEFNLNQYFRRSDLLDAIKRIPYKGGNTMTGEAIDYLV 226 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + E + K I ITDG+ +RN G++++S+ Sbjct: 227 KNTFTESAGAR-----KGFPKVAIVITDGKAQD--------EVEIPARELRNIGVEVFSL 273 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 + L+ F V + +++ ++I ++ Sbjct: 274 GI--KAADAKELKLIASQPSLKHVFNVANFDGIVDIQNEIILQVCS 317 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 60/159 (37%), Gaps = 22/159 (13%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKES 294 V+I + Y+ N + ++ +N TNT AM + +++ + Sbjct: 476 VQISLVQYSRDPHMEFSLNRYNRVEDIIQAINTFPYRGGSTNTGKAMTYVREKVFVTSKG 535 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 S + + +I ITDG+ ++ + +R+A ++I++V V + Sbjct: 536 SR-----PNVPRVMILITDGK--------SSDAFKEPAIKLRDADVEIFAVGV--KDAVR 580 Query: 355 DLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 L + V D ++F +I+ ++ + Sbjct: 581 TELEAIASPPAETHVYTVEDF----DAFQRISFELTQSV 615 >gi|297299828|ref|XP_002805471.1| PREDICTED: matrilin-2-like [Macaca mulatta] Length = 897 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F + Sbjct: 198 --QVDFNTLKAIGSEPHEDHVFLVANFSQIETLTSVFQR 234 >gi|119612173|gb|EAW91767.1| matrilin 2, isoform CRA_c [Homo sapiens] Length = 451 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 56/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F K Sbjct: 198 --QVDFNTLKSIGSEPHEDHVFLVANFSQIETLTSVFQK 234 >gi|116695550|ref|YP_841126.1| hypothetical protein H16_B1611 [Ralstonia eutropha H16] gi|113530049|emb|CAJ96396.1| conserved hypothetical protein [Ralstonia eutropha H16] Length = 354 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 26/230 (11%), Positives = 71/230 (30%), Gaps = 67/230 (29%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +I ++A +L+ + + VR+G +++ P ++N ++ ++ Sbjct: 106 TRIGASKQAARDLIVGLPAS--------VRLGMVSFAATATV--VLPPTDNRQDMLDAID 155 Query: 268 KLNPYENTNTYPAMHHAYRELYNE------------------------------------ 291 + T T + A L+ + Sbjct: 156 RFQLQLGTATGSGLIQALAVLFPDDGIDLEAILFSGESLAPGPGGRSLTEAAAADAVRKR 215 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 ++ + VI ++DG + + L G+++Y+V P Sbjct: 216 EQERPAAQPGSYRHGAVILLSDGRRTTG------PDPLDAARMAAQRGVRVYTVGFGTPQ 269 Query: 352 E--------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 + LR ++G++F + +L + + +++ + Sbjct: 270 GGAAAESGLSYYMQLDEPALRAVAAITNGEYFQAGSAADLSQVYRQLSAR 319 >gi|326433400|gb|EGD78970.1| hypothetical protein PTSG_11807 [Salpingoeca sp. ATCC 50818] Length = 2673 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 17/162 (10%), Positives = 49/162 (30%), Gaps = 12/162 (7%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHA 284 R+ + G N + + ++ + + P T T + Sbjct: 693 TTYFTIGEHDTRVAVATFASGATVNIRLNDHFDGDALRDAIADIPYPQGQTYTSLGLRAV 752 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 +++ E S + + ++ +TDG Q + + + + + +++ Sbjct: 753 RQDILTEANGMRPA--SEGVPRVLVVLTDGN------SQPSYDPATEASILHDQNVNVFA 804 Query: 345 VAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKIT 384 + V + Q L F + + + D ++ Sbjct: 805 IGVGS-SISQSQLEDIASDPDARHVFNLRSFSLIGDIVDAMS 845 >gi|149721558|ref|XP_001490961.1| PREDICTED: matrilin 2 isoform 1 [Equus caballus] Length = 956 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 198 --QVDFNTLKAIGSEPHEDHVFLVANFSQMESLTSVFQN 234 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 65/187 (34%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + +++S+ + + R+G + Y+ + + ++K Sbjct: 667 GEENFEIVKQFVTGIIDSLAISPKAA-----RVGLLQYSTQVRTEFTLRNFGSAKDMKKA 721 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + S R+ + I TDG + Sbjct: 722 VAHMKYMGKGSMTGLALKHMFERSFTQVEGARPL--SARVPRVAIVFTDGRAQDDVSEW- 778 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ F D + E +K Sbjct: 779 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPIDKHLFYAEDFSTMGEINEK 829 Query: 383 ITDKIQE 389 + I E Sbjct: 830 LKKGICE 836 >gi|149721562|ref|XP_001490991.1| PREDICTED: matrilin 2 isoform 2 [Equus caballus] Length = 915 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ R+ G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRVIMIVTDGRPQDS--------VAEVAAKARDTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 198 --QVDFNTLKAIGSEPHEDHVFLVANFSQMESLTSVFQN 234 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 65/187 (34%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + +++S+ + + R+G + Y+ + + ++K Sbjct: 626 GEENFEIVKQFVTGIIDSLAISPKAA-----RVGLLQYSTQVRTEFTLRNFGSAKDMKKA 680 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T A+ H + + + E + S R+ + I TDG + Sbjct: 681 VAHMKYMGKGSMTGLALKHMFERSFTQVEGARPL--SARVPRVAIVFTDGRAQDDVSEW- 737 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ +Y+V V ++ L++ F D + E +K Sbjct: 738 -------ASKAKANGITMYAVGVGKAI--EEELQEIASEPIDKHLFYAEDFSTMGEINEK 788 Query: 383 ITDKIQE 389 + I E Sbjct: 789 LKKGICE 795 >gi|222478562|ref|YP_002564799.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] gi|222451464|gb|ACM55729.1| von Willebrand factor type A [Halorubrum lacusprofundi ATCC 49239] Length = 491 Score = 68.0 bits (164), Expect = 2e-09, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 59/183 (32%), Gaps = 33/183 (18%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +I A L +I A + + R + L+++L++V++ ++ Sbjct: 241 SRIANTKSGAKQLAETILDANPDNQVGVTRF-------NNGASTPQQLTDDLDDVEAAID 293 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L+ TN + EL N + + ++ DG+ + + Sbjct: 294 GLSASGGTNAQAGVDAGQAELENCPHDN----------RVMVVFGDGDINTDGS------ 337 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + AG +I+++ V L F D + + F ++ + Sbjct: 338 ------AAKVAGTEIFAIGVGGASFSD--LEDLASDPADEHVFFAIDDGAIEQIFGQVAE 389 Query: 386 KIQ 388 I Sbjct: 390 TIT 392 >gi|148689378|gb|EDL21325.1| procollagen, type VII, alpha 1 [Mus musculus] Length = 2944 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 58/162 (35%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR T+ Y+ + ++ + +L+ NT T A+HH ++ Sbjct: 74 SAQGVRFATVQYSDDPQTEFGLDTLGSGSDTIRAIRELSYKGGNTRTGAALHHVSDRVFL 133 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 134 PRLTRPG------VPKVCILITDGKSQD--------LVDTAAQKLKGQGVKLFAVGI--K 177 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L I+ ++ Sbjct: 178 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLISRRVCTT 219 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 68/187 (36%), Gaps = 21/187 (11%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPY 272 + ++ + A+ + ++G + Y+ S++L + ++ + +P Sbjct: 1072 AVRRVLERLVSALGPLGPQAAQVGLLTYSHRPSPLFPLNSSHDLGIILRKIRDIPYVDPS 1131 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 N N A+ A+R L ++ ++ + D G Sbjct: 1132 GN-NLGTAVTTAHRYLLASNAPGRR----QQVPGVMVLLVDEPLRGDILSP--------I 1178 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDKIQE 389 + +G+K+ +A+S + LR+ + FFAV++ L + + + + Sbjct: 1179 REAQTSGLKV--MALSLVGADPEQLRRLAPGTDPIQNFFAVDNGPGLDRAVSDLAVALCQ 1236 Query: 390 QSVRIAP 396 +V I P Sbjct: 1237 AAVTIEP 1243 >gi|115647999|ref|NP_031764.2| collagen alpha-1(VII) chain precursor [Mus musculus] gi|143955303|sp|Q63870|CO7A1_MOUSE RecName: Full=Collagen alpha-1(VII) chain; AltName: Full=Long-chain collagen; Short=LC collagen; Flags: Precursor Length = 2944 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 58/162 (35%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR T+ Y+ + ++ + +L+ NT T A+HH ++ Sbjct: 74 SAQGVRFATVQYSDDPQTEFGLDTLGSGSDTIRAIRELSYKGGNTRTGAALHHVSDRVFL 133 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 134 PRLTRPG------VPKVCILITDGKSQD--------LVDTAAQKLKGQGVKLFAVGI--K 177 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L I+ ++ Sbjct: 178 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLISRRVCTT 219 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 68/187 (36%), Gaps = 21/187 (11%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPY 272 + ++ + A+ + ++G + Y+ S++L + ++ + +P Sbjct: 1072 AVRRVLERLVSALGPLGPQAAQVGLLTYSHRPSPLFPLNSSHDLGIILRKIRDIPYVDPS 1131 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 N N A+ A+R L ++ ++ + D G Sbjct: 1132 GN-NLGTAVTTAHRYLLASNAPGRR----QQVPGVMVLLVDEPLRGDILSP--------I 1178 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDKIQE 389 + +G+K+ +A+S + LR+ + FFAV++ L + + + + Sbjct: 1179 REAQTSGLKV--MALSLVGADPEQLRRLAPGTDPIQNFFAVDNGPGLDRAVSDLAVALCQ 1236 Query: 390 QSVRIAP 396 +V I P Sbjct: 1237 AAVTIEP 1243 >gi|2326168|gb|AAB66593.1| type VII collagen [Mus musculus] Length = 2944 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 58/162 (35%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR T+ Y+ + ++ + +L+ NT T A+HH ++ Sbjct: 74 SAQGVRFATVQYSDDPQTEFGLDTLGSGSDTIRAIRELSYKGGNTRTGAALHHVSDRVFL 133 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 134 PRLTRPG------VPKVCILITDGKSQD--------LVDTAAQKLKGQGVKLFAVGI--K 177 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L I+ ++ Sbjct: 178 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLISRRVCTT 219 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 68/187 (36%), Gaps = 21/187 (11%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPY 272 + ++ + A+ + ++G + Y+ S++L + ++ + +P Sbjct: 1072 AVRRVLERLVSALGPLGPQAAQVGLLTYSHRPSPLFPLNSSHDLGIILRKIRDIPYVDPS 1131 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 N N A+ A+R L ++ ++ + D G Sbjct: 1132 GN-NLGTAVTTAHRYLLASNAPGRR----QQVPGVMVLLVDEPLRGDILSP--------I 1178 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDKIQE 389 + +G+K+ +A+S + LR+ + FFAV++ L + + + + Sbjct: 1179 REAQTSGLKV--MALSLVGADPEQLRRLAPGTDPIQNFFAVDNGPGLDRAVSDLAVALCQ 1236 Query: 390 QSVRIAP 396 +V I P Sbjct: 1237 AAVTIEP 1243 >gi|323699770|ref|ZP_08111682.1| von Willebrand factor type A [Desulfovibrio sp. ND132] gi|323459702|gb|EGB15567.1| von Willebrand factor type A [Desulfovibrio desulfuricans ND132] Length = 2034 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 17/134 (12%), Positives = 45/134 (33%), Gaps = 14/134 (10%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE--N 316 + L + T A+ L K + ++ TDG N Sbjct: 468 RDAALKGLAGIKANGATCLNDAVLLGLHMLAGAKRPA------------LLVFTDGFDAN 515 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + ++ + ++ G+ ++++ + L R + S G+++ +D L Sbjct: 516 FNDTGPGSKATRREVLDAVKTGGVPVFTIGFGKGHDVSTLDRIASLSGGRYYPASDPGAL 575 Query: 377 LESFDKITDKIQEQ 390 ++F + + Sbjct: 576 DKAFAVVNANLANT 589 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 47/402 (11%), Positives = 115/402 (28%), Gaps = 60/402 (14%) Query: 17 IDLAHIMYIRNQMQSALDA-----AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKH 71 +D + ++ QM++ALDA A L A + P +T K + Sbjct: 421 VDSSG--SMKGQMRNALDATRKFIAALPADARVRVVDFDTKPRALPGETRDAALKGLAGI 478 Query: 72 LKQGSY--------------IRENAGDIAQKAQINITKDKNNPLQY-----IAESKAQYE 112 G+ + + + + P + ++ Sbjct: 479 KANGATCLNDAVLLGLHMLAGAKRPALLVFTDGFDANFNDTGPGSKATRREVLDAVKTGG 538 Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 +P + + + +G + + V+ ++ + + ++ Sbjct: 539 VPVFTIGFGKGHDVSTLDRIASLSGGRYYPASDPGALDKAFAVVNANLANTFTARYERPT 598 Query: 173 NMTSNKYLLPPPPKKSF--WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + + K S ++D + + + ++ + Sbjct: 599 RGRPSNVPVVTCMVDISGSMDKTPDFSGCNY-------RMDKVKAILHDFLAALPDEVLA 651 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + ++ V Q T + N E+ + +N L T A+ Sbjct: 652 --------QGMTFSDQNVIEQVT--TANTGEMLAAMNDLYADGGTEIAGAVAAVLE---- 697 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 +++++FITD ++ L +R+ G +Y + V Sbjct: 698 ------TQRAIPSTRRYLLFITDAAL--DVEPEDKLFFETTLAKLRDEG--VYCLWVGIG 747 Query: 351 PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQS 391 ++ + S G + D EL +FD + I++ Sbjct: 748 ELDPAPFKRAAEISGGSYVLTEDPAELGRAFDGLVADIRKPV 789 >gi|302551540|ref|ZP_07303882.1| lipoprotein [Streptomyces viridochromogenes DSM 40736] gi|302469158|gb|EFL32251.1| lipoprotein [Streptomyces viridochromogenes DSM 40736] Length = 518 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 33/275 (12%), Positives = 76/275 (27%), Gaps = 27/275 (9%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 A Y+ L L + + + +D +R+ ++ + Sbjct: 91 TASYDYTRRTLADGRLPAPSTVRPEEFVNSFRQDYERPGGDGFSVTVDGARTGDEDWSLV 150 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 + T ++D+ ES + + ++ Sbjct: 151 RVGLATRS-------AERTGERLPAALTFVIDVSGSMSEPGRLDLAQESLSVMTDRLRDD 203 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + + ++ L N +E++ ++ L ++TN + Y Sbjct: 204 DS--------VAIVTFSDEAETVLPMTRLDGNRDEIQDVISDLATQDSTNLGAGVETGYE 255 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 V+ ++D + +T+ E R G+ ++ V Sbjct: 256 TAVEGLREGATNR--------VVLVSDALANTGDTDADTILERIAGER-REHGITLFGVG 306 Query: 347 VSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 V + G L+ + D G V+ E E F Sbjct: 307 VGSD-YGDALMERLADRGDGHTVYVSGPDEAHEVF 340 >gi|261409463|ref|YP_003245704.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261285926|gb|ACX67897.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 421 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 25/206 (12%), Positives = 67/206 (32%), Gaps = 28/206 (13%) Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 S P + + ++ N + + Sbjct: 115 IVLVIDNSGSMNETDPNQDRYTAAKNLINRM---------DRDNRVSVMVFDHATTLLQP 165 Query: 250 NQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 +E+ + ++ L T+ A+ + +++ + V Sbjct: 166 FTRVKNQETKDEIIAEIDGLATNDGGTDISLALEDTMSHIQESRDAGRSA--------MV 217 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS-APPEGQDLLRKCT-DSSGQ 366 I ++DG + + ++ + + + ++ +S P+G LL+ ++ GQ Sbjct: 218 IMLSDG--------FSETDHDRVLAEYKQQQIAVNTIGLSLVNPDGAQLLQTIAAETGGQ 269 Query: 367 FFAVNDSRELLESFDKITDKIQEQSV 392 ++ V + +L F KI D + ++S+ Sbjct: 270 YYDVQHAEDLSFVFQKIYDDVGDRSL 295 >gi|294011439|ref|YP_003544899.1| Flp pilus assembly protein TadG [Sphingobium japonicum UT26S] gi|292674769|dbj|BAI96287.1| Flp pilus assembly protein TadG [Sphingobium japonicum UT26S] Length = 771 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 29/302 (9%), Positives = 88/302 (29%), Gaps = 36/302 (11%) Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY-LQKHNDNNNMTSNKYLLPPPP 185 + + + + + + + + D + + ++ + + +P Sbjct: 470 STIRSDIETVADLSVTWDGCIEERQTFFNTDGNPSDDWLNYPSSPSDAIDMDIDRVPDSD 529 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + W + + P + + + + + + Sbjct: 530 PATRWKPLLPNAVWGPKGTLVGNTW-SGDYTTDPVKAGENTSSGDTGRNLSNNSCVTASR 588 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE---KESSHNTIGST 302 + + + ++ + + L P+ NT + R + + T G Sbjct: 589 KLTNYNGVGGNPSAQDLSNYVGTLVPHNNTYHDIGLLWGARLMSPTGIFASENATTGGGA 648 Query: 303 RLKKFVIFITDGE-------------------------NSGASAYQN-----TLNTLQIC 332 ++++ +IF+TDG + A+ N + +C Sbjct: 649 QIQRHLIFMTDGATATTVNNYASYGLEWWDRRQIAPAGPNDANYDDNLNAVNNARSNALC 708 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDL-LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 ++N + ++ + + L C S F+ ++ L+ F +I D+I Sbjct: 709 TAIKNKNITLWVIYYGSSDTATKTRLTNCATSPSYFYEARNTTLLIGKFREIADRISNLR 768 Query: 392 VR 393 + Sbjct: 769 LT 770 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 47/388 (12%), Positives = 88/388 (22%), Gaps = 58/388 (14%) Query: 11 LFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKK 70 + A D+A I ++ ++QSA DA L+G + S R + F + Sbjct: 36 GLVGGAFDMARIYAVKTRLQSACDAGALAGRRIMGSGRWTDNNGRPNTTALATFDLNFAQ 95 Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 + + A ++P + L ++ Sbjct: 96 N----------------SFGAENRTRSYSESDGTVSGTASADVP---MTLMRVLNVPTKR 136 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH--NDNNNMTSNKYLLPPPPKKS 188 + + G + + VLD S SM ++ Sbjct: 137 VEVTCEGQMRI----PNTDVMFVLDNSGSMNEVIPGDTTGLKKMAGLQLAIRCFYEALAR 192 Query: 189 FWSKNTTKSKYAPAPAPANRKIDVL--IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 + + ++ + P + N N + N T Sbjct: 193 QNITDVSAAQCGTSADPTGDLSSQVQLRFGFVNYDNMVNVGKLLPNNYLADSWTYRSRTA 252 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 + T P SS ++ S Sbjct: 253 TGIQTVYAWTAGTQSA------------TTWGPWSSTPNNFANPSSYSSTYSVVSGGNTS 300 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 DG A + C + + G D L ++S+G Sbjct: 301 TTTLA-DGLAYVKKAAPTSA---AACSALNSYG-------------SYDQLVGISESNGT 343 Query: 367 FFAVNDSRELLESFDKITDKIQEQSVRI 394 V + + I I R Sbjct: 344 SSDVTTPE--VPIYPAIQQNINTVRSRT 369 >gi|290970562|ref|XP_002668176.1| predicted protein [Naegleria gruberi] gi|284081406|gb|EFC35432.1| predicted protein [Naegleria gruberi] Length = 518 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 63/199 (31%), Gaps = 21/199 (10%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA-----IQEKKNLSVRIGTIAYNIG 246 + + + ++ + S L + +++ + V + Sbjct: 331 HFIFVNDKSGSMGGSDARPTSSKYSNDRLGALFESCEKFLEVRDGSSDLVSCIMYD-HSA 389 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 PLS + S ++ T+ AM + + K Sbjct: 390 YNCFTTNPLS---TSLVSTMSSYVAGGGTSFTNAMQSVSSLISSTYP------NHQSYKI 440 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 V+F++DGE+S A T + + + ++++ + + L + GQ Sbjct: 441 VVLFMSDGEDSADEAVSITGQLVS------SHDIILHTIQLGGSSDNTGLRQMAATGRGQ 494 Query: 367 FFAVNDSRELLESFDKITD 385 F NDS L + +I + Sbjct: 495 FKRANDSASLAGIYQEIAN 513 >gi|148676905|gb|EDL08852.1| matrilin 2, isoform CRA_a [Mus musculus] Length = 836 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 114 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 173 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ RN G+ I+++ V Sbjct: 174 FSEAEGARPLREN--VPRIIMIVTDGRPQDS--------VAEVAAKARNTGILIFAIGVG 223 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + L+ F V + + L F Sbjct: 224 --QVDLNTLKAIGSEPHKDHVFLVANFSQIESLTSVFQN 260 >gi|329849363|ref|ZP_08264209.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] gi|328841274|gb|EGF90844.1| von Willebrand factor type A domain protein [Asticcacaulis biprosthecum C19] Length = 590 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 35/357 (9%), Positives = 87/357 (24%), Gaps = 30/357 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 A+ + + + + + + + E + + + +T+ + Sbjct: 90 AATSDAVVVTGNRPALQSSVQMKRSAAPAYEAREAPNTEKYNGESVSSVMRVTETPVSTF 149 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + Y L A S + + Sbjct: 150 SVDVD-TGAYANVRRMLNDGTTPTEAAVRTEELLNYFRYDYPLPQDRSKPFSITTDVAQT 208 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 Q + + P + K+ ++ + L Sbjct: 209 PWNAQTRLMRVGLRAYDVPRSERPAANLVFLV-----DVSGSMNDPDKLPLVKTALSMLS 263 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 + NL ++ VK L L+ +T M Sbjct: 264 D----------NLRPDDKVSIVVYAGAAGMVLAPTHEGKYVKQALECLSAGGSTAGGQGM 313 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 AY VI TDG+ + + + + R +G+ Sbjct: 314 ALAYATAEANFIKGGINR--------VILATDGDFNVG--ISSIGEVEALVKQNRESGVT 363 Query: 342 IYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + ++ + L+ K D +G + ++ + E + + D++ +A + Sbjct: 364 LTALGFGTGNYNEALMEKMADVGNGNYAYIDSAMEARKV---LDDELSSTLFTVAKD 417 >gi|52082207|ref|YP_080998.1| YwmC protein [Bacillus licheniformis ATCC 14580] gi|52787598|ref|YP_093427.1| YwmC [Bacillus licheniformis ATCC 14580] gi|319648080|ref|ZP_08002297.1| YwmC protein [Bacillus sp. BT1B_CT2] gi|52005418|gb|AAU25360.1| YwmC [Bacillus licheniformis ATCC 14580] gi|52350100|gb|AAU42734.1| YwmC [Bacillus licheniformis ATCC 14580] gi|317389715|gb|EFV70525.1| YwmC protein [Bacillus sp. BT1B_CT2] Length = 228 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 19/135 (14%), Positives = 44/135 (32%), Gaps = 16/135 (11%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + ++ LN + P T A+ A L + K V +TDGE Sbjct: 109 TYDEQSFRNSLNGIGPTGWTPIANALQDAKNALDQLDNNG---------KNVVYLLTDGE 159 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC-TDSSGQFFAVNDSR 374 + N +++ +R + + + + L G++F Sbjct: 160 ETCGG------NPVKVATELRKSNAVVNVIGFDYEGDFHGQLTSIAAAGGGEYFQAKTKN 213 Query: 375 ELLESFDKITDKIQE 389 ++ F + ++ + Sbjct: 214 DIKRIFTQEAIELSK 228 >gi|295398785|ref|ZP_06808791.1| von Willebrand factor type A domain protein [Aerococcus viridans ATCC 11563] gi|294972971|gb|EFG48792.1| von Willebrand factor type A domain protein [Aerococcus viridans ATCC 11563] Length = 516 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 29/260 (11%), Positives = 78/260 (30%), Gaps = 46/260 (17%) Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 + + + +VLD S SM +++ N + K ++ S+ + Sbjct: 76 QQAPVDVVLVLDRSGSM--NFVETPNSPTRLDYGKLAAINFAERVLGPNGIPGSRVSVVS 133 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + +A + LS++L V Sbjct: 134 FSGPAYATGVRNNPQRHYGQQNQATTD----------------------LDLSSDLRAVT 171 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +N++ + TNT + + + K VI +TDG + ++ Sbjct: 172 DSINRITAFGGTNTEAGFEQGRSVIEGTTSNQ-----NPNSNKVVIMLTDGLPTASNGNP 226 Query: 324 NTL----------NTLQICEYMRNAGMK-IYSVAV--SAPPEGQDLLRKC---TDSSGQF 367 + + + + ++++ + + L + G + Sbjct: 227 YAETTDINHVHIQRAINAGKNIYQNDIADVFTIGLTTGMNATEKALADNILTQAQNKG-Y 285 Query: 368 FAVNDSRELLESFDKITDKI 387 + + +L F++I+ ++ Sbjct: 286 YPAPSATDLDAIFEEISQRL 305 >gi|77567855|gb|AAI07522.1| Matn3b protein [Danio rerio] Length = 299 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 58/164 (35%), Gaps = 15/164 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + + R+ + Y + + EVK ++++P T T A+ A ++ Sbjct: 100 DIGSDATRVALVNYASTVNIEFHLKKYFSKAEVKQAFSRIDPLSTGTMTGMAIKTAMEQV 159 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E + + K I +TDG ++ R +G++IY+V V Sbjct: 160 FTENAGARPLK--KGIGKVAIIVTDGRPQD--------KVEEVSAAARASGIEIYAVGV- 208 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 L++ F V + + K + + E+ Sbjct: 209 -DRAEVRSLKQMASQPLDDHVFYVETYGVIEKLTSKFRETLCEE 251 >gi|261409634|ref|YP_003245875.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261286097|gb|ACX68068.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 968 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 69/195 (35%), Gaps = 26/195 (13%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 P K+ E+A V+ + +A P + + Sbjct: 86 TYGPNNGEDKMTNAKEAAKGFVDLMDMT----------KHRVAVVDFSSSASSFPFTVDK 135 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + KS +N +N T T A+ A L + + + ++ +TDG + + Sbjct: 136 DAAKSYINTINSGGGTATGNAIDAAVALLADHRTEAQP---------VIVLMTDGAATES 186 Query: 320 SAYQNTLN-TLQICEYMRNAGMKIYSVAVSAPPEG------QDLLRKCTDSSGQFFAVND 372 + + LQ + ++AG+ Y++A+ P E L++ ++ V Sbjct: 187 PKNTDPFDYALQRAQAAKDAGVIFYTIALLNPNEDPITSAPNVLMKNMATTATHHHFVLG 246 Query: 373 SRELLESFDKITDKI 387 S+ L + + I +I Sbjct: 247 SKGLNQIYAAIVKEI 261 >gi|221640506|ref|YP_002526768.1| von Willebrand factor, type A [Rhodobacter sphaeroides KD131] gi|221161287|gb|ACM02267.1| von Willebrand factor, type A [Rhodobacter sphaeroides KD131] Length = 651 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 45/366 (12%), Positives = 104/366 (28%), Gaps = 29/366 (7%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 A + A+ + + + + + + ++ A + Sbjct: 141 RARSAEGAAPQTFAADEAMPMAAPPAPDLALSKQAAEAPARALPQGDSEAFANAPDNPLR 200 Query: 95 KDKNNPLQY-IAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 +P+ + A Y I +L L P + + Sbjct: 201 VTAEDPVSTFSIDVDTASYAILRSSLRAGQLPPREAVRIEEMINYFPYDYPAPENGTPPF 260 Query: 153 VLDVSRSMEDLYLQKH-NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 +S + + + PP + +T+ S PA K+ Sbjct: 261 RPTLSVTRTPWNPETRLVHVALQGRMPAIEDRPPLNLVFLIDTSGSMQDPA------KLP 314 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 +L +S G ++ ++ ++ + Y +N + + S L++L+ Sbjct: 315 LLKQSFGLMLGRLRPED--------QVAIVTYAGSAGEVLAPTAANQRSTILSALDRLDA 366 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 +T + AYR + T V+ TDG+ + + L L Sbjct: 367 GGSTAGEEGLALAYRTASEMAGAGEVTR--------VVLATDGDFNLGISDPEDLARLVA 418 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 E R+ G+ + + ++ + L E+ + D++ Sbjct: 419 HE--RDTGVYLSVLGFGRGNLDDATMQALAQNGNG--QAAYIDSLNEAQKVLVDQLSGAL 474 Query: 392 VRIAPN 397 IA + Sbjct: 475 FPIADD 480 >gi|84386025|ref|ZP_00989055.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] gi|84379341|gb|EAP96194.1| von Willebrand factor type A domain protein [Vibrio splendidus 12B01] Length = 345 Score = 68.0 bits (164), Expect = 3e-09, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 72/205 (35%), Gaps = 37/205 (18%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + ++ + + R+G I + P + ++N + Sbjct: 116 GNKHDRLTIAKQVLREFAAQ---------REHDRLGLILFADSAYVQA--PFTEDINVWQ 164 Query: 264 SRLNKLNPYEN---TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 S L + T A+ + E ++ +I +TDG+++ + Sbjct: 165 SLLEDVELGYAGFKTAFGDAIGLSIAVFEQE----------QSRQRVMILLTDGDDTSSK 214 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG-------QDLLRKCTDSSGQFFAVNDS 373 ++ E G+KIY++A+ P L + + GQ F D Sbjct: 215 MP-----PVKAAEIAAKYGVKIYTIAIGDPSTKGRYKMDLPTLEKVSAATGGQMFHAMDR 269 Query: 374 RELLESFDKITDKIQEQSVRIAPNR 398 ++L +++ I D++++Q + +R Sbjct: 270 KQLDQAYATI-DQLEQQEFELLSHR 293 >gi|281338025|gb|EFB13609.1| hypothetical protein PANDA_007564 [Ailuropoda melanoleuca] Length = 901 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 20/175 (11%), Positives = 56/175 (32%), Gaps = 11/175 (6%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 K+ I + N+N+ +S + + T+ A+ A + Sbjct: 272 DLSPKDQFNLISFSGDAAQWKPLLVPASAENVNQARSYAAGIQAHGGTDINEAVLMAVQL 331 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L + K+ + +I +TDG+ + N + + + ++ + Sbjct: 332 LNSAKQKE---LMPEGTVSLIILLTDGDPTMGE--TNPARIQRNVKEAIDGQYSLFCLGF 386 Query: 348 SAPPEGQDLLRKCTDSSGQ-----FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L K +G + + + +L + ++++ + + P+ Sbjct: 387 GFDV-SYAFLEKLALDNGGLARRIYEDSDSALQLQDFYEEVANPLLTAVTFEYPS 440 >gi|325964121|ref|YP_004242027.1| Flp pilus assembly protein TadG [Arthrobacter phenanthrenivorans Sphe3] gi|323470208|gb|ADX73893.1| Flp pilus assembly protein TadG [Arthrobacter phenanthrenivorans Sphe3] Length = 345 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 20/144 (13%), Positives = 45/144 (31%), Gaps = 13/144 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI++ F+ A+D+ I R Q+Q+ DA+ ++ D T D +T Sbjct: 18 IVAILMVALLGFVAIAVDIGVIYSERAQLQNGADASAIAVAQKCARDATGVDCSTTSALA 77 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S + + + + I + + + +LF Sbjct: 78 SGLANRNALDGMSKVHTIDLDKTTRKVSVTTSAKEVGGADNSV-------------SLFF 124 Query: 121 KGLIPSALTNLSLRSTGIIERSSE 144 + + R++ + Sbjct: 125 ADALGIPTKEVGARASAVWGSPMA 148 >gi|315126124|ref|YP_004068127.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] gi|315014638|gb|ADT67976.1| von Willebrand factor type A [Pseudoalteromonas sp. SM9913] Length = 327 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 68/193 (35%), Gaps = 34/193 (17%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN--KL 269 + + ++ + + G TPL+ ++ V L+ ++ Sbjct: 105 AYNGQYVDRLTMVKAVLSDFIEQRQGDRLGLILFGDTAFLQTPLTRDVKTVSKMLSEAQI 164 Query: 270 NPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T A+ + + ++KES+ + V+ +TDG+N+ + LN Sbjct: 165 GLVGRATAIGDALGLSVKRFASKKESN----------RIVVLLTDGQNTAGN-----LNP 209 Query: 329 LQICEYMRNAGMKIYSVAVSAPPE---------------GQDLLRKCTD-SSGQFFAVND 372 R G+K+Y++ V + + LL+K + + G +F D Sbjct: 210 EDALLLAREEGIKVYTIGVGSDNPRGFSLFNMGSGGSNLDEGLLKKIAEQTGGLYFRAKD 269 Query: 373 SRELLESFDKITD 385 L + + ++ Sbjct: 270 VAGLQQIYAELDK 282 >gi|149188837|ref|ZP_01867127.1| hypothetical protein VSAK1_05790 [Vibrio shilonii AK1] gi|148837257|gb|EDL54204.1| hypothetical protein VSAK1_05790 [Vibrio shilonii AK1] Length = 504 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 69/194 (35%), Gaps = 8/194 (4%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 +D + A +N A E + S+ + + L+++ Sbjct: 302 SYHVEVYDYVDFDLSVAEMFINKFPDARTEYRLDSLELYRGYGSSNENQFYSIDLTSDRT 361 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK---KFVIFITDGENS 317 E+ ++ + +T ++ M ++ + K ++ + K V+ ++DG S Sbjct: 362 EI-DVIDDMWADGSTASFQGMLRGFQHMLAGKPNTSDEDELAEYNDKIKMVLVLSDGVES 420 Query: 318 GASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + L +C+ R G+ I + + Q + C + + + D + Sbjct: 421 PNNGILKGLVDAGMCDKAREEIPGLYIAVIGIDFAASEQSGFQDCVLNPDE--DITDVTD 478 Query: 376 LLESFDKITDKIQE 389 E +KI + IQ+ Sbjct: 479 TEEFIEKIEELIQK 492 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 53/178 (29%), Gaps = 14/178 (7%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A + +A++ + N+++ + + + + + Sbjct: 1 MAFTLIPVLGMTFFAVEGTRYIQETNRLRD-----------AAQAAASAVTIEDQSANAN 49 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + K I+ +++ + A Q D+ +QY E+ + + + Sbjct: 50 EMAKDYIRDYVRDINSETVVATRFYQAPDPENDVDE--FIQYTVEATTNHNSWFASNLIP 107 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 + + + + I I V D S SM + +D T+ K Sbjct: 108 VFGETQDLKGVAVAK-KYPFNLGDKNIDIVFVSDFSGSMSWQWGGNSSDPCTATNCKI 164 >gi|107102622|ref|ZP_01366540.1| hypothetical protein PaerPA_01003686 [Pseudomonas aeruginosa PACS2] Length = 340 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 58/165 (35%), Gaps = 37/165 (22%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + V+ L++ +NT A+ A + L + ++ Sbjct: 150 PLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL----------RQRPAESRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 ITDG N+G + + +KIY++ + A P+ + Sbjct: 200 ITDGANTGGQ-----IAPQIAAQLAAEQQVKIYTIGIGADPQQGGVPGLFGFNPGLDLDE 254 Query: 355 DLLRKCT-DSSGQFFAVNDSRELLESFDKITDK--IQEQSVRIAP 396 LR + G++F S EL + + +Q+ R P Sbjct: 255 PTLRGIAESTGGEYFRARSSAELESISATLDRLEPVTQQTTRARP 299 >gi|198436156|ref|XP_002124087.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1702 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 65/191 (34%), Gaps = 22/191 (11%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + D + N ++S +IG Y+ + L++ Sbjct: 381 STSVLENDFDGIKVWLRNTISSFP-----IGEEYTQIGLATYSDNPRIIFHLNKYHKLDD 435 Query: 262 VKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++ + ++ T T A+ + ++ + K+ V+ +TDG++ Sbjct: 436 IRKAVLEVEHTSGGTATGKAILYLTNNMFTHENGVR-----PNAKRLVVVLTDGKSQDDV 490 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF-FAVNDSRELLES 379 + + +G+ ++++ V G+ LR ++ + +ND L Sbjct: 491 IVPS--------RIAKESGIVMFAIGVGKVVMGE--LRAIASDPDRYVYKINDFSALESI 540 Query: 380 FDKITDKIQEQ 390 +++ I Sbjct: 541 RRELSHSIASL 551 >gi|296473276|gb|DAA15391.1| integrin, alpha D [Bos taurus] Length = 1165 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 60/180 (33%), Gaps = 23/180 (12%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPY 272 +++ + + + + Y+ + + +S ++ + Sbjct: 175 KNFVRAVMDRSKGT-------NTQFSLMQYSNLMKTHFTFNQFWTSRSSQSLVDPIVQLN 227 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T T + REL++ K + +K +I ITDGE Y++ L + Sbjct: 228 GLTFTATGIRTVVRELFHSKNGAR-----KSARKIIIVITDGE-----KYKDPLEYKDVI 277 Query: 333 EYMRNAGMKIYSVAVSAP---PEGQDLLRKCTD--SSGQFFAVNDSRELLESFDKITDKI 387 A + Y++ V ++ L+ S F V+ L ++ +KI Sbjct: 278 PEAEKANIIRYAIGVGDAFQAHAAREELKIIGSVPSEDHVFKVDSFAALSSIQKQLQEKI 337 >gi|156523104|ref|NP_001095966.1| integrin alpha-D [Bos taurus] gi|151556938|gb|AAI49717.1| ITGAD protein [Bos taurus] Length = 1165 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 60/180 (33%), Gaps = 23/180 (12%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPY 272 +++ + + + + Y+ + + +S ++ + Sbjct: 175 KNFVRAVMDRSKGT-------NTQFSLMQYSNLMKTHFTFNQFWTSRSSQSLVDPIVQLN 227 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T T + REL++ K + +K +I ITDGE Y++ L + Sbjct: 228 GLTFTATGIRTVVRELFHSKNGAR-----KSARKIIIVITDGE-----KYKDPLEYKDVI 277 Query: 333 EYMRNAGMKIYSVAVSAP---PEGQDLLRKCTD--SSGQFFAVNDSRELLESFDKITDKI 387 A + Y++ V ++ L+ S F V+ L ++ +KI Sbjct: 278 PEAEKANIIRYAIGVGDAFQAHAAREELKIIGSVPSEDHVFKVDSFAALSSIQKQLQEKI 337 >gi|72162079|ref|YP_289736.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71915811|gb|AAZ55713.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 315 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 18/151 (11%), Positives = 51/151 (33%), Gaps = 24/151 (15%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + V + L T + + + + + E + ++ ++DGE Sbjct: 144 TQDHQAVADSIANLTISSGTAIGEGVFASLQAIRSFDEKAT----DDPPPAAIVLLSDGE 199 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------GQDLLRKCT- 361 N+ + + AG+ + ++A ++ L++ Sbjct: 200 NTSGRPVAAAADEARA------AGVPVSTIAFGTGVSIIEIEGHYVPANIDKETLKELAM 253 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G+F+ + EL + + I + ++V Sbjct: 254 TTGGRFYEAESTGELKDVYADIGSSLGTETV 284 >gi|317505805|ref|ZP_07963650.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] gi|316255887|gb|EFV15112.1| von Willebrand factor type A domain-containing protein [Segniliparus rugosus ATCC BAA-974] Length = 343 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 25/192 (13%), Positives = 63/192 (32%), Gaps = 27/192 (14%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL----NPYENTNTYPAMHHAY 285 + +V++G + + S + K ++++ + T T ++ A Sbjct: 131 DGMAPTVQLGVVTFAGNAQPLVRP--STDHETAKKVIDQMIRPDKLEKQTATGEGIYTAL 188 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 +++ + + ++ ++DG+ + + + + +V Sbjct: 189 QQIET--IAGALGGKNHAPPARIVLVSDGKETVPDDLNAPRGAYAAARTAKEKHVPVCTV 246 Query: 346 AVSAPPEGQDL-------------LRKCTD---SSGQ---FFAVNDSRELLESFDKITDK 386 A L L+K +D S G FF EL + + + + Sbjct: 247 AFGTKAGKITLDNQVDEVPVDLESLKKISDLSNSPGNSCRFFPAESQGELAQIYQSLNED 306 Query: 387 IQEQSVRIAPNR 398 I ++VR +R Sbjct: 307 IGYENVRSESSR 318 >gi|256822867|ref|YP_003146830.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256796406|gb|ACV27062.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 986 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 40/308 (12%), Positives = 94/308 (30%), Gaps = 61/308 (19%) Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 L LI + L + ++ +++ +VLDVS SM Sbjct: 7 LISALLIVAGSVPALLHADDTEVYFGQSQPVNLLLVLDVSGSMAWTTDACRL-------- 58 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 + W + ++D++ E+ ++ + ++ Sbjct: 59 ----------NRWGQPYPSC---YPGNGEKSRLDIMKEALELFLDDLPDNVKVGILTYSA 105 Query: 238 IGTIAYNIGIVGNQCTPLSNN--LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 I + LS+N + + ++ L T T A++ A + ++ Sbjct: 106 GNNIDLLHEVKQ-----LSDNNHKATLLTTIDGLEANGGTLTAGALYEAGSYFRGQYDNL 160 Query: 296 H-NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI----------------------- 331 ++F+TDG+ + S + I Sbjct: 161 PSPITPGCSNASNIVFLTDGQPNSMSYNGYSYRNSIINMTGSSCARSDDGKECSEKLAGF 220 Query: 332 ------CEYMRNAGMKIYSVAVSAPPEG-QDLLRKCTDSS-GQFFAVNDSRELLESFD-K 382 E + + +K +++A + + L D+ GQ + + + L+++F Sbjct: 221 LSTVDQIEDLTPSKVKTHTIAFALEDNNARTFLENVADAGNGQSYTADSTDGLVDAFKSS 280 Query: 383 ITDKIQEQ 390 I I++ Sbjct: 281 IQTDIEQS 288 >gi|218676637|ref|YP_002395456.1| hypothetical protein VS_II0874 [Vibrio splendidus LGP32] gi|218324905|emb|CAV26683.1| Conserved hypothetical protein [Vibrio splendidus LGP32] Length = 355 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 66/150 (44%), Gaps = 15/150 (10%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + + E+ ++ + ++T+ A+ A + +++ + Sbjct: 154 GDAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSAAVQDSSVDAN 213 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDL 356 +K VI +TDG ++G + + + + + G++I+ +A+ P + Sbjct: 214 VKEKVVIVLTDGNDTG-----SFVEPIDAAKVAKAKGVRIHVIAMGDPQTVGEVALDMET 268 Query: 357 LRKCT-DSSGQFFAVNDSRELLESFDKITD 385 +++ +S G+ F + EL +++ +I + Sbjct: 269 IKRVAQESGGEAFEALNRDELTKAYAQIGE 298 >gi|308068881|ref|YP_003870486.1| von Willebrand factor A [Paenibacillus polymyxa E681] gi|305858160|gb|ADM69948.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Paenibacillus polymyxa E681] Length = 600 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 31/175 (17%), Positives = 69/175 (39%), Gaps = 20/175 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCT---PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 + ++G +AY + + + E+K+ ++ LN T+T + A R L Sbjct: 74 STQNDKVGIVAYTDVVQREKALLNITSEADKQELKTFIDGLNRGAYTDTSVGVKEALRIL 133 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGEN-----SGASAYQNTLNTLQICEYMRNAGMKIY 343 + K + H ++ + DG N +G + Q+ + Q +N+G+ IY Sbjct: 134 QDGKTAGHAP--------MIVMLADGNNDFNKTTGRTESQSDQDMAQAVAEAKNSGVPIY 185 Query: 344 SVAVSAPP--EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 ++ ++A L + G+ F + + +L +I +++ P Sbjct: 186 TIGLNADGKLNKNKLADIAQQTGGKSFITSSADDLPNILSEIF--ASNLKLKVVP 238 >gi|49087064|gb|AAT51411.1| PA3073 [synthetic construct] Length = 341 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 51/146 (34%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + V+ L++ +NT A+ A + L + ++ Sbjct: 150 PLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL----------RQRPAESRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 ITDG N+G + + +KIY++ + A P+ + Sbjct: 200 ITDGANTGGQ-----IAPQIAAQLAAEQQVKIYTIGIGADPQQGGVPGLFGFNPGLDLDE 254 Query: 355 DLLRKCT-DSSGQFFAVNDSRELLES 379 LR + G++F S EL Sbjct: 255 PTLRGIAESTGGEYFRARSSAELESI 280 >gi|15598269|ref|NP_251763.1| hypothetical protein PA3073 [Pseudomonas aeruginosa PAO1] gi|9949180|gb|AAG06461.1|AE004731_9 hypothetical protein PA3073 [Pseudomonas aeruginosa PAO1] Length = 340 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 51/146 (34%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + V+ L++ +NT A+ A + L + ++ Sbjct: 150 PLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL----------RQRPAESRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 ITDG N+G + + +KIY++ + A P+ + Sbjct: 200 ITDGANTGGQ-----IAPQIAAQLAAEQQVKIYTIGIGADPQQGGVPGLFGFNPGLDLDE 254 Query: 355 DLLRKCT-DSSGQFFAVNDSRELLES 379 LR + G++F S EL Sbjct: 255 PTLRGIAESTGGEYFRARSSAELESI 280 >gi|254241773|ref|ZP_04935095.1| hypothetical protein PA2G_02484 [Pseudomonas aeruginosa 2192] gi|126195151|gb|EAZ59214.1| hypothetical protein PA2G_02484 [Pseudomonas aeruginosa 2192] Length = 340 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 51/146 (34%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + V+ L++ +NT A+ A + L + ++ Sbjct: 150 PLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL----------RQRPAESRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 ITDG N+G + + +KIY++ + A P+ + Sbjct: 200 ITDGANTGGQ-----IAPQIAAQLAAEQQVKIYTIGIGADPQQGGVPGLFGFNPGLDLDE 254 Query: 355 DLLRKCT-DSSGQFFAVNDSRELLES 379 LR + G++F S EL Sbjct: 255 PTLRGIAESTGGEYFRARSSAELESI 280 >gi|218890727|ref|YP_002439591.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa LESB58] gi|254236045|ref|ZP_04929368.1| hypothetical protein PACG_02002 [Pseudomonas aeruginosa C3719] gi|126167976|gb|EAZ53487.1| hypothetical protein PACG_02002 [Pseudomonas aeruginosa C3719] gi|218770950|emb|CAW26715.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa LESB58] Length = 340 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 51/146 (34%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + V+ L++ +NT A+ A + L + ++ Sbjct: 150 PLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL----------RQRPAESRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 ITDG N+G + + +KIY++ + A P+ + Sbjct: 200 ITDGANTGGQ-----IAPQIAAQLAAEQQVKIYTIGIGADPQQGGVPGLFGFNPGLDLDE 254 Query: 355 DLLRKCT-DSSGQFFAVNDSRELLES 379 LR + G++F S EL Sbjct: 255 PTLRGIAESTGGEYFRARSSAELESI 280 >gi|301767168|ref|XP_002919014.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4-like [Ailuropoda melanoleuca] Length = 849 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 20/175 (11%), Positives = 56/175 (32%), Gaps = 11/175 (6%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 K+ I + N+N+ +S + + T+ A+ A + Sbjct: 305 DLSPKDQFNLISFSGDAAQWKPLLVPASAENVNQARSYAAGIQAHGGTDINEAVLMAVQL 364 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L + K+ + +I +TDG+ + N + + + ++ + Sbjct: 365 LNSAKQKE---LMPEGTVSLIILLTDGDPTMGE--TNPARIQRNVKEAIDGQYSLFCLGF 419 Query: 348 SAPPEGQDLLRKCTDSSGQ-----FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L K +G + + + +L + ++++ + + P+ Sbjct: 420 GFDV-SYAFLEKLALDNGGLARRIYEDSDSALQLQDFYEEVANPLLTAVTFEYPS 473 >gi|326911082|ref|XP_003201891.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Meleagris gallopavo] Length = 951 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 50/138 (36%), Gaps = 11/138 (7%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NN+ + K ++ ++P TN A+ + L + ++ S +IF+TDG Sbjct: 366 NNIRDAKKYIHNMSPTGGTNINSALQTGAKLLNDYIAQNNIDARS---VSLIIFLTDGRP 422 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS----GQFFAVND 372 + + + + + ++++ + LL + + F D Sbjct: 423 TVGETQSSKILSNT--KDAIRDKFCLFTIGIG-NDVDYKLLERMALENCGMVRHFQEDED 479 Query: 373 SRE-LLESFDKITDKIQE 389 + L +D+I + Sbjct: 480 AASHLKGFYDEIGTPLLS 497 >gi|118081959|ref|XP_417299.2| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 [Gallus gallus] Length = 955 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 50/138 (36%), Gaps = 11/138 (7%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NN+ + K ++ ++P TN A+ + L + ++ S +IF+TDG Sbjct: 370 NNIRDAKKYIHNMSPTGGTNINSALQTGAKLLNDYIAQNNIDARS---VSLIIFLTDGRP 426 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS----GQFFAVND 372 + + + + + ++++ + LL + + F D Sbjct: 427 TVGETQSSKILSNT--KDAIRDKFCLFTIGIG-NDVDYKLLERMALENCGMVRHFQEDED 483 Query: 373 SRE-LLESFDKITDKIQE 389 + L +D+I + Sbjct: 484 AASHLKGFYDEIGTPLLS 501 >gi|47211020|emb|CAF94689.1| unnamed protein product [Tetraodon nigroviridis] Length = 2225 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 18/164 (10%), Positives = 58/164 (35%), Gaps = 22/164 (13%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKES 294 ++ + ++ + ++ ++ NT T A+ H ++ E+ Sbjct: 1306 TQVAIVQFSDEPRTEVQLKSYRKKERLLEAISSISYKGGNTKTGRAIQHMKDSVFTEEGG 1365 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + K ++ +TDG + + ++ + ++ G ++++ + G+ Sbjct: 1366 ARTA-----VPKVLVLLTDGRSQDDVS--------KVSKELQKQGFVVFAIGFADADYGE 1412 Query: 355 DLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 L F V+D ++F ++ +++ A Sbjct: 1413 --LVNVASKPSSTHVFFVDDL----DAFKEMEEELVASVCEAAS 1450 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 53/164 (32%), Gaps = 18/164 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + RIG ++ V L NT T A+ Sbjct: 39 DVDLDRTRIGLAQFSGEPRIEWHLNTHTTKEAVMEAARNLPYKGGNTLTGLALTFILENS 98 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ + S + K + +TDG++ + +R+AG++++++ V Sbjct: 99 FSPESGSR-----PGIPKIGVLLTDGKSQDDVIPP--------AQRLRDAGVEVFAIGV- 144 Query: 349 APPEGQDLLRKCT--DSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + LR + V D + + D +T I E+ Sbjct: 145 -KNADEGELRAIASVSEDTHVYNVADFHLMADIVDVLTRTICER 187 >gi|183222779|ref|YP_001840775.1| putative von Willebrand factor, type A [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] gi|189912810|ref|YP_001964365.1| hypothetical protein LBF_3320 [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167777486|gb|ABZ95787.1| Conserved hypothetical protein [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)'] gi|167781201|gb|ABZ99499.1| Hypothetical protein; putative von Willebrand factor, type A [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Paris)'] Length = 550 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 49/154 (31%), Gaps = 10/154 (6%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 +AY+ + Q V ++ + +TN A + L K Sbjct: 81 IVAYSADVQLIQPVTHLTEKVSVTDKIRNIQVATSTNLSGGWLSALKSLNQSK------- 133 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 K V+ +TDG + ++ + I + G+ ++ V + L+ Sbjct: 134 -IPNAYKRVLLLTDGNPTSG--IKDKEALVTIAADHLSMGISTTTIGVGNDFNEEMLVEI 190 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G F+ +++ + F + I + Sbjct: 191 AKAGGGNFYYIDNPENASDIFFEEFGDIGALYAQ 224 >gi|163751139|ref|ZP_02158369.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] gi|161329095|gb|EDQ00167.1| von Willebrand factor type A domain protein [Shewanella benthica KT99] Length = 334 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 46/153 (30%), Gaps = 36/153 (23%) Query: 254 PLSNNLNEVKSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V L ++ T A+ + + + ++ Sbjct: 143 PLTQDRRSVAQFLQEAQIGLVGKQTAIGEAIALGVKRFD----------MVDKSNRILVL 192 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEG 353 +TDG N+ S Q G+KIY++ V A Sbjct: 193 LTDGSNNSGSIS-----PEQAAAIAAKRGVKIYAIGVGADVMERRSIFGTERVNPSMDLD 247 Query: 354 QDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + L + G +F S++L + +I Sbjct: 248 EAQLISLAKTTGGLYFRARSSQDLQLIYQEIDK 280 >gi|78484443|ref|YP_390368.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78362729|gb|ABB40694.1| Hypothetical protein; predicted integral membrane protein with a von Willebrand factor type A domain [Thiomicrospira crunogena XCL-2] Length = 363 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 45/288 (15%), Positives = 94/288 (32%), Gaps = 44/288 (15%) Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH--------NDNNNMTSN 177 S+ + + ++ R + S +L Sbjct: 43 SSDWEQTANAKSLLFRHPLIDQLDFNASAPSSSKGLRWFLNLLRLMLLVGIVTALATPVK 102 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 K LPP P+ + + + + +ID +S ++ ++ + R Sbjct: 103 KVPLPPEPQTKTVRDIVFVVETSVSMVLEDYQIDGEPQSRIKVIQTVLDQFISGLAGN-R 161 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENT--NTYPAMHHAYRELYNEKE 293 G I Y PL+++ + L +L PY T T A+ A ++ + Sbjct: 162 FGFILYADDAY--TLMPLTSDATTARLMLKRLKPYLAGRTDEATGEALGLALQQAEKSTD 219 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S+ N + V+ I+DG S + L + Y + + IY++ V A + Sbjct: 220 STEN--------RIVVLISDG-----STRDSRLPIAEAINYAQGLNIPIYTIGVGANSKD 266 Query: 354 QDL---------------LRKCTD-SSGQFFAVNDSRELLESFDKITD 385 D L++ D + G+++ + ++L + I Sbjct: 267 ADKREFRGLLYEALESSSLKQIADQTQGRYYQIGSGQDLQKVLQAIDQ 314 >gi|301778757|ref|XP_002924796.1| PREDICTED: integrin alpha-M-like [Ailuropoda melanoleuca] Length = 1153 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 60/199 (30%), Gaps = 32/199 (16%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE------- 261 + D I + SI ++ V + S + Sbjct: 147 RQDSDIAFLIDGSGSINPIDFQRMKEFVSTVMDRFKNSKTLFSLMQFSEDFQTHFTFNEF 206 Query: 262 --------VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 + + + + Y T+T + REL++ + K ++ ITD Sbjct: 207 KANPNPRFLVNAI--IQLYGRTHTATGILKVVRELFHSSSGAR-----ENALKILVVITD 259 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---QDLLRKCTDSS--GQFF 368 GE + + L+ + G+ Y + V + ++ L F Sbjct: 260 GE-----KFGDPLDYKDVIPEADREGIIRYVIGVGEAFDNPKHREELNTIASKPARDHVF 314 Query: 369 AVNDSRELLESFDKITDKI 387 VN+ L +++ +KI Sbjct: 315 RVNNFEALKTIQNQLQEKI 333 >gi|296272313|ref|YP_003654944.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] gi|296096487|gb|ADG92437.1| von Willebrand factor type A [Arcobacter nitrofigilis DSM 7299] Length = 301 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 56/159 (35%), Gaps = 17/159 (10%) Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 Q+ V G +PLS + N K + + + Sbjct: 119 QDFIEKRVNDNIGLVVFGTSVLTASPLSFDKNSQKEIIK--------YIDIGIVGEQTAM 170 Query: 289 YNEKESSHN-TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ +S N S +I +TDGE++ + I + + +KIY++ + Sbjct: 171 FDSLATSINILKNSKAKSNIIILLTDGEDNASKIP-----PQIILKLAKKYKIKIYTIGI 225 Query: 348 SAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + +L + ++ + F N +L+E ++ I Sbjct: 226 G--ESNRQMLSTISQETGAKSFLANSKDDLVEVYNTINK 262 >gi|281352695|gb|EFB28279.1| hypothetical protein PANDA_014199 [Ailuropoda melanoleuca] Length = 1110 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 60/199 (30%), Gaps = 32/199 (16%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE------- 261 + D I + SI ++ V + S + Sbjct: 102 RQDSDIAFLIDGSGSINPIDFQRMKEFVSTVMDRFKNSKTLFSLMQFSEDFQTHFTFNEF 161 Query: 262 --------VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 + + + + Y T+T + REL++ + K ++ ITD Sbjct: 162 KANPNPRFLVNAI--IQLYGRTHTATGILKVVRELFHSSSGAR-----ENALKILVVITD 214 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---QDLLRKCTDSS--GQFF 368 GE + + L+ + G+ Y + V + ++ L F Sbjct: 215 GE-----KFGDPLDYKDVIPEADREGIIRYVIGVGEAFDNPKHREELNTIASKPARDHVF 269 Query: 369 AVNDSRELLESFDKITDKI 387 VN+ L +++ +KI Sbjct: 270 RVNNFEALKTIQNQLQEKI 288 >gi|182413803|ref|YP_001818869.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841017|gb|ACB75269.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 792 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 44/385 (11%), Positives = 101/385 (26%), Gaps = 35/385 (9%) Query: 24 YIRNQMQSALDAAVL------SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY 77 R Q + A A+L + +S D + I + H + + Sbjct: 243 QARAQKKDAAMQALLVANEEPAALSSFPGQAPAMDGYIASTTFAGIGTRVRGDHRQAMNT 302 Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 + A Y L L P+ + Sbjct: 303 EAYRFLRESDFLSAREHPLSTFAADVD---TASYANVRRFLREGRLPPADAVRIEELVNY 359 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 R + + V + L+ + K + S + Sbjct: 360 FPYRYAAPGRVRDEGVAAPGEAPFAAALEVAAAPWAAQHRLVRIGLKAKDAAVSGRAAAN 419 Query: 198 KY----APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 K+ ++ ES L+ +Q R+ + Y + Sbjct: 420 LVFLLDVSGSMDQPNKLRLVQESMRLLLGRLQPED--------RVAIVTYAGNSGLALPS 471 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 E+ +++L +TN + AY ++ VI TD Sbjct: 472 TPVARQREILDAIDELRAGGSTNGAMGLQLAYDIAKANFVANGVNR--------VILCTD 523 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVND 372 G+ + + +++ E +G+ + + +L++ D G + ++ Sbjct: 524 GDFNVGV--TSEGELVRLIEEKAKSGVFLTVLGFGMGNLKDAMLQQIADRGNGSYGYIDT 581 Query: 373 SRELLESFDKITDKIQEQSVRIAPN 397 E+ + ++ + +A + Sbjct: 582 R---REAEKLLVQQVSGTLLTVAKD 603 >gi|262197272|ref|YP_003268481.1| hypothetical protein Hoch_4090 [Haliangium ochraceum DSM 14365] gi|262080619|gb|ACY16588.1| Myxococcales GC_trans_RRR domain protein [Haliangium ochraceum DSM 14365] Length = 602 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 24/125 (19%), Positives = 50/125 (40%), Gaps = 10/125 (8%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 +E+ + ++ L P TN Y + + + + + VI ++DG + Sbjct: 225 RSEMHALVDTLQPGGGTNIYEGLERGFEIAKEAR------VNHPDRAQRVILLSDGLATE 278 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELL 377 ++ + + + E GM + +V V A +L+R + G F+ V D + Sbjct: 279 G--ITDSASIIALSEAFIEGGMGLTTVGVGA-SFNVELMRGLAERGAGNFYFVEDPEAVR 335 Query: 378 ESFDK 382 E F + Sbjct: 336 EVFTE 340 >gi|239908812|ref|YP_002955554.1| hypothetical protein DMR_41770 [Desulfovibrio magneticus RS-1] gi|239798679|dbj|BAH77668.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 439 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 58/171 (33%), Gaps = 20/171 (11%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 A +V+ ++ R+ IAY+ I + + + ++ Sbjct: 65 AKRCASFVVDKLKNTD--------RVSLIAYDSSIETRVPSVKVEDKAIFHRAIEGIDDG 116 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TN + ++ + S + +I ++DG+ + + + C Sbjct: 117 GCTNLHGGWLKGAEQISPYIDPSTISR--------IILLSDGQANEGL--TDEAEIFKQC 166 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 + +AG+ + + + + L+ + G + + +L++ F + Sbjct: 167 RELADAGVTTSTYGLGSN-FNETLMIGMAKNGQGNSYYGRTADDLMDPFQE 216 >gi|187251530|ref|YP_001876012.1| von Willebrand factor type A [Elusimicrobium minutum Pei191] gi|186971690|gb|ACC98675.1| Von Willebrand factor type [Elusimicrobium minutum Pei191] Length = 373 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 39/226 (17%), Positives = 75/226 (33%), Gaps = 38/226 (16%) Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + K +LPP S A +I +A N + Sbjct: 131 AKPRDAQKTVLPPTEGVD-IILAIDTSGSMAAQDFDPNRITAAKVAAANFIA-------- 181 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK----LNPYENTNTYPAMHHAYR 286 LS RIG + + + +PL+ + + L + + T A+ + Sbjct: 182 -NRLSDRIGIVVFASDAMLQ--SPLTLDYESLLDFLADVRIGMVRTDGTAIGDAIAVS-- 236 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 S + S K +I +TDGE++ L + G+K+Y++A Sbjct: 237 --------SVHLERSPARSKVIILLTDGESNSGVIS-----PLDAAKTAALYGIKVYTIA 283 Query: 347 -VSAPPEG------QDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 +S DL + + G+++ + EL + + +I Sbjct: 284 TISKNSRDSLDFKPDDLEQIAKLTGGKYYRAYNEAELTKIYAEIDS 329 >gi|113682008|ref|NP_001038479.1| hypothetical protein LOC563353 [Danio rerio] gi|94732542|emb|CAK03688.1| novel collagen protein [Danio rerio] Length = 873 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 56/176 (31%), Gaps = 18/176 (10%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + + ++ + Y+ N E+ + ++ NT T A+ A Sbjct: 41 TKGFDVSSRHTQVAVVQYSDTPRLEIPLGKHQNSQELVEAVGSVSYLGGNTRTGRAIKFA 100 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 ++ N + + + +TDG + + R + +++ Sbjct: 101 TDHVF----GMPNHTSQSPRNRIAVVLTDGRSQD--------DVEDAAMEARAQNIVLFA 148 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFF--AVNDSRELLESFDKITDKIQEQSVRIAPNR 398 V V + L + + V D + +D + K+ E+SV P R Sbjct: 149 VGVGNEITNSE-LVSMANKPASTYVLHVEDYNSIASIWDLMEQKLCEESV--CPLR 201 >gi|313235286|emb|CBY10850.1| unnamed protein product [Oikopleura dioica] Length = 977 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 66/189 (34%), Gaps = 18/189 (9%) Query: 204 APANRKIDVLI-ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 A D + ++ L + I + + + LS++++E+ Sbjct: 585 AIVFDGSDSVKADNFKKLKTWTGEFIDKLGVQEYGAQVALVKYATSIIKVSELSSDVDEL 644 Query: 263 KSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 K +L K+ TNT A+ A + L + + K ++ ITDG+ Sbjct: 645 KEKLMKVPFIQGKTNTGGALERAQQMLAEGRP---------SVPKIILLITDGD------ 689 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + E ++ + + IY++ V ++ L + + D + + Sbjct: 690 ATDKERLDAQIEKLKKSNILIYTIGVG-DLIDRNELNRIATDEDFVYETRDFDSISKIKS 748 Query: 382 KITDKIQEQ 390 + ++ ++ Sbjct: 749 SLLGRVCKK 757 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 29/249 (11%), Positives = 76/249 (30%), Gaps = 31/249 (12%) Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK-IDVL 213 D D ++ + P + + + RK Sbjct: 732 DFVYETRDFDSISKIKSSLLGRVCKKAKPKTSGVCGDISVDLQFIVDSSSSVTRKNFGFA 791 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNP 271 N+ ++ + ++ V++G + Y+ + + L + ++ ++ + Sbjct: 792 KNFVANV-----SSVFDLRSGDVQVGVLTYSTNVHSDSAIGLGAIHSQDDFVEKVQSMKY 846 Query: 272 YEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+T A+ + S N + K +IF+TDG Q+ Sbjct: 847 TGGDTHTGTALRYI----------STNNRWREEVPKILIFVTDG------TPQDRAIVPA 890 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVN--DSRELLESFDKITDK 386 +R+ G++I+++ V + L++ + D + + Sbjct: 891 AARSLRDKGVRIFAIGVG--NAVESELKEIASEPYENHAIFIQGADYSAVQRVRGHLERL 948 Query: 387 IQEQSVRIA 395 + ++I Sbjct: 949 VCNDVLKIT 957 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 16/162 (9%), Positives = 53/162 (32%), Gaps = 17/162 (10%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPA--MHHAYRELYNEK 292 ++ I Y + EV+ + ++ +T T A + ++ + Sbjct: 421 TQVSVIQYTSEPIPEFDLNDFKTAAEVEDGIKRMRLQNGSTRTDKARALDFVSDVIFTKN 480 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + K ++ I+DG+++ L+ + + + ++ + + + Sbjct: 481 FGARSQA-----PKVLVLISDGQDNHPE------RVLEAAKKLHGQDVSVFVIGIGNESQ 529 Query: 353 -GQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQS 391 ++ + N + + + +T +I E Sbjct: 530 MNIKMMNQIATEPITKHIKYANTVEGINKFKNALTGQICEDV 571 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 18/135 (13%), Positives = 47/135 (34%), Gaps = 24/135 (17%) Query: 261 EVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + ++ + +T A+ + YR ++ + + +++ VI +TDG+ + Sbjct: 57 DFAGAIDGMKKLNGDTCIGEALDYFYRNMFTSQAGQR-----SDVEQRVIVMTDGKRN-- 109 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVS--------APPEGQDLLRKCTDSSG--QFFA 369 + E +R +IY++ + + L + F Sbjct: 110 ----CPAEIAKPAELIRAQEAEIYAIGIGHQCGYGENHNCYDRQELHEIASKPADKYVFE 165 Query: 370 VNDSRELLESFDKIT 384 +N+ +L +I Sbjct: 166 INNFDQL--ILKRIG 178 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 24/206 (11%), Positives = 67/206 (32%), Gaps = 32/206 (15%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + ++ D + ++ S + R + Y + + Sbjct: 200 VDSSGSIGPKRFDYMKNWVKSIAASFK-----VGENFARFSVVQY-TKTAKTVVDFQTLD 253 Query: 259 LNEVKSRLNKLNP-------YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ + T T A+ A+ L + K+ V+ + Sbjct: 254 LSSISQKIDSMIYFQGRNGRGGKTFTGNALERAHTLLKESEPGR---------KRIVLLL 304 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDG + + + +R+ + I++V V ++ L + T + + Sbjct: 305 TDGSSDDEYGP--------VAKAIRDDKVDIFAVGVGR--ARKNELVEITGDEQRVWQTR 354 Query: 372 DSRELLESFDKITDKIQEQSVRIAPN 397 + + K+ ++ S + P+ Sbjct: 355 TFNNIGQFNQKLLAEVCSASEDVCPD 380 >gi|320353059|ref|YP_004194398.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121561|gb|ADW17107.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 336 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 24/152 (15%), Positives = 58/152 (38%), Gaps = 14/152 (9%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 N + + + L++N +++ + +N L+ +T T A+ A EL + Sbjct: 101 PNNTTQVGIVKYSSSANMVEMLQDLTSNKSDLIATINGLSASGSTATGTAIQVATAELLS 160 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + + ++DGE + ++ G+ +++V V Sbjct: 161 SRAIAGHAKME-------VVLSDGEYNVG------IDPKIAAAQAHAQGITVHTVGVQLY 207 Query: 351 PEGQDLLRKCTDSSGQFFA-VNDSRELLESFD 381 G +++ + G F VN+ +L+ F Sbjct: 208 GTGYTSMQQTAVAGGGIFTNVNNLNDLVALFS 239 >gi|119612406|gb|EAW92000.1| collagen, type XIV, alpha 1 (undulin), isoform CRA_c [Homo sapiens] Length = 849 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 106 VDGSWSIGDENFNKIISFLYSTVGALNKIGTDGT----QVAMVQFTDDPRTEFKLNAYKT 161 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ + L+ + + + K ++ ITDG + Sbjct: 162 KETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRR-----GIPKVIVVITDGRSQ 216 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L F V+D Sbjct: 217 DDVN--------KISREMQLDGYSIFAIGV--ADADYSELVSIGSKPSARHVFFVDDF-- 264 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 265 --DAFKKIEDELITFVCETAS 283 >gi|119612404|gb|EAW91998.1| collagen, type XIV, alpha 1 (undulin), isoform CRA_a [Homo sapiens] Length = 865 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 106 VDGSWSIGDENFNKIISFLYSTVGALNKIGTDGT----QVAMVQFTDDPRTEFKLNAYKT 161 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ + L+ + + + K ++ ITDG + Sbjct: 162 KETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRR-----GIPKVIVVITDGRSQ 216 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L F V+D Sbjct: 217 DDVN--------KISREMQLDGYSIFAIGV--ADADYSELVSIGSKPSARHVFFVDDF-- 264 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 265 --DAFKKIEDELITFVCETAS 283 >gi|55743096|ref|NP_066933.1| collagen alpha-1(XIV) chain precursor [Homo sapiens] gi|125987815|sp|Q05707|COEA1_HUMAN RecName: Full=Collagen alpha-1(XIV) chain; AltName: Full=Undulin; Flags: Precursor gi|187954653|gb|AAI40894.1| Collagen, type XIV, alpha 1 [Homo sapiens] Length = 1796 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 1037 VDGSWSIGDENFNKIISFLYSTVGALNKIGTDGT----QVAMVQFTDDPRTEFKLNAYKT 1092 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ + L+ + + + K ++ ITDG + Sbjct: 1093 KETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRR-----GIPKVIVVITDGRSQ 1147 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L F V+D Sbjct: 1148 DDVN--------KISREMQLDGYSIFAIGV--ADADYSELVSIGSKPSARHVFFVDDF-- 1195 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1196 --DAFKKIEDELITFVCETAS 1214 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 57/168 (33%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + RIG Y+ + +EV + L NT T A+++ + Sbjct: 186 TAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIF 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + T + K I ITDG++ + +R +G++++++ Sbjct: 246 ENSFK-----PEAGSRTGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V + + + +T + + Sbjct: 293 GV--KNADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 >gi|2065167|emb|CAA72402.1| collagen type XIV [Homo sapiens] Length = 755 Score = 67.6 bits (163), Expect = 3e-09, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 12 VDGSWSIGDENFNKIISFLYSTVGALNKIGTDGT----QVAMVQFTDDPRTEFKLNAYKT 67 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ + L+ + + + K ++ ITDG + Sbjct: 68 KETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRR-----GIPKVIVVITDGRSQ 122 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L F V+D Sbjct: 123 DDVN--------KISREMQLDGYSIFAIGV--ADADYSELVSIGSKPSARHVFFVDDF-- 170 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 171 --DAFKKIEDELITFVCETAS 189 >gi|17231851|ref|NP_488399.1| hypothetical protein alr4359 [Nostoc sp. PCC 7120] gi|17133495|dbj|BAB76058.1| alr4359 [Nostoc sp. PCC 7120] Length = 418 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 72/196 (36%), Gaps = 30/196 (15%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ++++ ++A LV+ ++ R+ +A++ + +N ++K ++++L Sbjct: 59 LEIVKQAAIRLVDRLKTGD--------RLSVVAFDHRAKVLVPNQVIDNPEQIKKQISRL 110 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T + EL G +TDGE + + + L Sbjct: 111 AADGGTAIDEGLRLGIEEL---------AKGKKETISQAFLLTDGE----NEHGDNSRCL 157 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQ 388 + + + + ++ QD+L K D+ G + + ++ F ++ +IQ Sbjct: 158 KFAQLAAGYSLTLNTLGFGDNWN-QDILEKIADAGLGSLSYIQKPEQAVDEFGRLFSRIQ 216 Query: 389 E-------QSVRIAPN 397 + +APN Sbjct: 217 TVGLTNAYLLLSLAPN 232 >gi|332216457|ref|XP_003257368.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Nomascus leucogenys] Length = 1094 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 95/350 (27%), Gaps = 24/350 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + + +D + + L G A Sbjct: 62 LDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDMERGSKAST 121 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L +E ++ + Sbjct: 122 LRLDFIEDPNFKNKVNYSYAA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTL 180 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 181 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 240 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 241 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 296 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + K ++ TDG N Sbjct: 297 GMVAKGTTGYKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYN 350 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 351 WP-------NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 393 >gi|209527391|ref|ZP_03275898.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492182|gb|EDZ92530.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 488 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 25/147 (17%), Positives = 48/147 (32%), Gaps = 20/147 (13%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + N E++ + +L+ TN + A L N + + Sbjct: 95 SSRASVVADFTRNETELQQAIARLSARGGTNLSEGFNLATSVLQNSDRTPN--------- 145 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 ++ TDG + N I + +R +G+ + VAV + L T Sbjct: 146 --ILLFTDGVPN------NPPMAASIAQQIRASGINL--VAVGTGDAQINYLTALTGDPD 195 Query: 366 QFFAVNDSRELLESFDKITDKIQEQSV 392 F + +L +F I Q + Sbjct: 196 LVFYA-NFGDLDRAFRGAEKAIYGQQL 221 >gi|124003889|ref|ZP_01688737.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123990944|gb|EAY30411.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 704 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 57/167 (34%), Gaps = 14/167 (8%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 + +I +AY ++ ++ + L + +T + AY+ Sbjct: 376 TDSRTKIAIVAYAGASGLVLPATSVSHREKILTALENIESGGSTAGGEGIELAYKIAQQA 435 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + N VI TDG+ + + L +Q+ R +G+ + + Sbjct: 436 FIAGGNNR--------VILATDGDFNVGLSSDEEL--MQLISNKRKSGVYLTCLGFGTGN 485 Query: 352 EGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 ++ K T++ G ++ + + E+ + + IA + Sbjct: 486 LNDSMMEKLTNAGNGNYYYI---DGINEAKKVLAKNLTGTLYAIAKD 529 >gi|297671247|ref|XP_002813757.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 2 [Pongo abelii] Length = 1074 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 95/350 (27%), Gaps = 24/350 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + + +D + + L G A Sbjct: 58 LDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSKAST 117 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L +E ++ + Sbjct: 118 LRLDFIEDPNFKNKVNYSYAA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTL 176 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 177 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 236 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 237 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 292 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + K ++ TDG N Sbjct: 293 GMVAKGTTGYKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYN 346 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 347 WP-------NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 389 >gi|297671245|ref|XP_002813756.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like isoform 1 [Pongo abelii] Length = 1081 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 95/350 (27%), Gaps = 24/350 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + + +D + + L G A Sbjct: 58 LDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSKAST 117 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L +E ++ + Sbjct: 118 LRLDFIEDPNFKNKVNYSYAA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTL 176 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 177 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 236 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 237 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 292 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + K ++ TDG N Sbjct: 293 GMVAKGTTGYKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYN 346 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 347 WP-------NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 389 >gi|291290994|ref|NP_001167522.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform c [Homo sapiens] Length = 1150 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 95/350 (27%), Gaps = 24/350 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + + +D + + L G A Sbjct: 127 LDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSKAST 186 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L +E ++ + Sbjct: 187 LRLDFIEDPNFKNKVNYSYAA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTL 245 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 246 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 305 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 306 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 361 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + K ++ TDG N Sbjct: 362 GMVAKGTTGYKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYN 415 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 416 WP-------NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 458 >gi|119585523|gb|EAW65119.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_b [Homo sapiens] Length = 1146 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 95/350 (27%), Gaps = 24/350 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + + +D + + L G A Sbjct: 127 LDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSKAST 186 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L +E ++ + Sbjct: 187 LRLDFIEDPNFKNKVNYSYAA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTL 245 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 246 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 305 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 306 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 361 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + K ++ TDG N Sbjct: 362 GMVAKGTTGYKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYN 415 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 416 WP-------NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 458 >gi|2781441|gb|AAB96914.1| alpha 2 delta calcium channel subunit isoform II [Homo sapiens] Length = 1076 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 95/350 (27%), Gaps = 24/350 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + + +D + + L G A Sbjct: 58 LDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSKAST 117 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L +E ++ + Sbjct: 118 LRLDFIEDPNFKNKVNYSYAA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTL 176 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 177 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 236 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 237 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 292 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + K ++ TDG N Sbjct: 293 GMVAKGTTGYKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYN 346 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 347 WP-------NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 389 >gi|54112394|ref|NP_006021.2| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform b [Homo sapiens] gi|7414316|emb|CAB86192.1| calcium channel, alpha 2/delta subunit 2 [Homo sapiens] gi|119585522|gb|EAW65118.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_a [Homo sapiens] Length = 1143 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 95/350 (27%), Gaps = 24/350 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + + +D + + L G A Sbjct: 127 LDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSKAST 186 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L +E ++ + Sbjct: 187 LRLDFIEDPNFKNKVNYSYAA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTL 245 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 246 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 305 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 306 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 361 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + K ++ TDG N Sbjct: 362 GMVAKGTTGYKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYN 415 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 416 WP-------NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 458 >gi|54112392|ref|NP_001005505.1| voltage-dependent calcium channel subunit alpha-2/delta-2 isoform a [Homo sapiens] gi|2781439|gb|AAB96913.1| alpha 2 delta calcium channel subunit isoform I [Homo sapiens] gi|3043640|dbj|BAA25484.1| KIAA0558 protein [Homo sapiens] gi|3695006|gb|AAC70914.1| putative tumor suppressor gene 26 protein alpha 2 delta calcium channel subunit [Homo sapiens] gi|119585525|gb|EAW65121.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_d [Homo sapiens] gi|156230959|gb|AAI52439.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Homo sapiens] gi|168267416|dbj|BAG09764.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 isoform b [synthetic construct] Length = 1145 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 95/350 (27%), Gaps = 24/350 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + + +D + + L G A Sbjct: 127 LDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSKAST 186 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L +E ++ + Sbjct: 187 LRLDFIEDPNFKNKVNYSYAA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTL 245 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 246 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 305 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 306 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 361 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + K ++ TDG N Sbjct: 362 GMVAKGTTGYKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYN 415 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 416 WP-------NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 458 >gi|74725352|sp|Q9NY47|CA2D2_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-2; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-2; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-2; Flags: Precursor gi|7414318|emb|CAB86193.1| calcium channel, alpha 2/delta subunit 2 [Homo sapiens] Length = 1150 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 95/350 (27%), Gaps = 24/350 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + + +D + + L G A Sbjct: 127 LDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSKAST 186 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L +E ++ + Sbjct: 187 LRLDFIEDPNFKNKVNYSYAA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTL 245 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 246 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 305 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 306 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 361 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + K ++ TDG N Sbjct: 362 GMVAKGTTGYKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYN 415 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 416 WP-------NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 458 >gi|257093736|ref|YP_003167377.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] gi|257046260|gb|ACV35448.1| von Willebrand factor type A [Candidatus Accumulibacter phosphatis clade IIA str. UW-1] Length = 452 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 19/155 (12%), Positives = 48/155 (30%), Gaps = 11/155 (7%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + ++ + + + L++++ +TN + L + Sbjct: 86 LVVFDDRVQTLVPPRPVGDRQALHLALSRVHSGGSTNLHGGWQAGADGLLPAAGQAALAR 145 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 VI ++DG + + +C G+ + + + +DL+ + Sbjct: 146 --------VILLSDGNANVGE-ITDPAGIAALCAQAAERGVSTSTYGLGS-HFNEDLMVE 195 Query: 360 CTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G + + + +L E F D I R Sbjct: 196 MAKRGGGNHYYGDTAADLFEPFAAEFDFISALCAR 230 >gi|152985991|ref|YP_001347440.1| hypothetical protein PSPA7_2067 [Pseudomonas aeruginosa PA7] gi|150961149|gb|ABR83174.1| hypothetical protein PSPA7_2067 [Pseudomonas aeruginosa PA7] Length = 337 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 50/146 (34%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + V+ L++ +NT A+ A + L + ++ Sbjct: 150 PLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLALKRL----------RQRPAESRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 ITDG N+G +KIY++ + A P+ + Sbjct: 200 ITDGANTGGQIS-----PQTAARLAAEERVKIYTIGIGADPQQGGVIGLFGLNPGLDLDE 254 Query: 355 DLLRKCT-DSSGQFFAVNDSRELLES 379 +LR + G++F S EL Sbjct: 255 PVLRGIAETTGGEYFRARSSAELESI 280 >gi|52425826|ref|YP_088963.1| hypothetical protein MS1771 [Mannheimia succiniciproducens MBEL55E] gi|52307878|gb|AAU38378.1| unknown [Mannheimia succiniciproducens MBEL55E] Length = 541 Score = 67.6 bits (163), Expect = 4e-09, Method: Composition-based stats. Identities = 58/500 (11%), Positives = 129/500 (25%), Gaps = 119/500 (23%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAA--VLS------------------- 39 + I+ + +D + ++ + + ++ A L Sbjct: 24 IMGILSFFLIGLVALTVDGSGMLLDKARFSQGIEQAGLALMAENNDFRTTNQKHADVLRQ 83 Query: 40 -GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKA-QINITKDK 97 + + K + + ++ + +Y ++N + Q N + Sbjct: 84 TVTKEELEGFSDTFSAQKYKRNQELVSGLVRHYYYPSTYFKDNLKISDKYDYQCNNLQGP 143 Query: 98 NNPLQYIAESKAQYEIPTENLFLKG----LIPSALTNLSLRSTGIIERSSENLAISICMV 153 N + + + G L + T ++ I + E + I + +V Sbjct: 144 NGEQLKSIACEISGKFERPSWLYLGKNNGLSFAETTTINANKIYIQKNLDEIIPIDLMLV 203 Query: 154 LD-----VSRSMEDLYLQKHNDNNNMTSNKYL-----LPPPPKKSFWSKNTT-------- 195 D S Y D + + S+ Sbjct: 204 ADLSGSMNSSVSGTKYGTAKIDILREVVSAIAKELLEQNNTEEGKVISQYNRIGFTSFAF 263 Query: 196 --------------------------------------KSKYAPAPAPANRKIDVLIES- 216 S+ ++ + Sbjct: 264 GAQQQNNTAQCYLPYEIKPSITIRNNNYYGGYYNTTMQYSELLSYVGSNQQRYSYATLAQ 323 Query: 217 AGNLVNSIQKAIQEKKNLSVR------------IGTIAYNIGIVGNQCTPLSNNLNEVKS 264 + K I+ + + + N I N ++ S Sbjct: 324 YFDAFVDYDKTIESINSFDGKDLSSLMYFSKNSWCLGSANTRINSTYIWAGKNESADLVS 383 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKE--SSHNTIGSTRLKKFVIFITDGENSGASAY 322 R N++ T + + L N ++ + ++ ++ ++DGE+ +A Sbjct: 384 RFNRVPALGATLSSSGLLIGANLLMNTNPDENAQPSKLGANTQRIILVLSDGEDQINNAS 443 Query: 323 -----QNTLNTLQICEYMR------------NAGMKIYSVAVSAPPEG--QDLLRKCTDS 363 +TL +CE ++ +I VA P G + KC Sbjct: 444 SSLNITSTLINQGMCEKIKSKLNSLQDKTYLEQPTRIGFVAFGYGPSGTQKAAWEKCV-- 501 Query: 364 SGQFFAVNDSRELLESFDKI 383 ++ N+ ELLESF KI Sbjct: 502 GKYYYVANNKEELLESFRKI 521 >gi|33592721|ref|NP_880365.1| hypothetical protein BP1639 [Bordetella pertussis Tohama I] gi|33572367|emb|CAE41926.1| putative exported protein [Bordetella pertussis Tohama I] gi|332382136|gb|AEE66983.1| hypothetical protein BPTD_1619 [Bordetella pertussis CS] Length = 336 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 69/197 (35%), Gaps = 36/197 (18%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NP 271 ++ +V + R+G I + G PL+ + ++ L + Sbjct: 121 QAVQAVVGDFIDKRPD-----DRLGLIVFGAGAYPQA--PLTRDHAALRLLLQRTAVGMA 173 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 NT A+ R L + E K +I +TDG ++ + + + Sbjct: 174 GPNTALGDAIGLGIRMLDHAGER----------DKILILLTDGNDTA-----SAVPPARA 218 Query: 332 CEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 E + ++++ + P D LR + G+FF D L E + + Sbjct: 219 AELAAQHRVVVHTIGIGDPAASGEDRVDFDALRDIARIAGGRFFRARDQASLQEVYATL- 277 Query: 385 DKIQE---QSVRIAPNR 398 D+I +++R P R Sbjct: 278 DRITPHEVRTLRHQPKR 294 >gi|162448748|ref|YP_001611115.1| hypothetical protein sce0478 [Sorangium cellulosum 'So ce 56'] gi|161159330|emb|CAN90635.1| hypothetical protein sce0478 [Sorangium cellulosum 'So ce 56'] Length = 521 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 55/156 (35%), Gaps = 12/156 (7%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + + + RI + Y+ ++ + L +TN Y + AY Sbjct: 158 MIDALQPTDRISLVRYSDAAEVVLEQAEGSDREALTEAFEGLTARGSTNLYEGLFTAYAL 217 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + + VIF++DG + L +L G+ + ++ V Sbjct: 218 AEQHLDPAW--------QNRVIFLSDGVATAGLTSPQRLVSLAAG--YAEKGIGLTAIGV 267 Query: 348 SAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 A D +R ++ G F+ + D + + E F + Sbjct: 268 GAE-FDVDAMRGISEVGAGNFYFLEDPKAVEEVFAE 302 >gi|309790845|ref|ZP_07685389.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308227132|gb|EFO80816.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 885 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 48/398 (12%), Positives = 107/398 (26%), Gaps = 54/398 (13%) Query: 17 IDLAHIMYIRNQMQSALDAAVLS--------GCASIVSDRTIKDPTTKKDQTSTIFKKQI 68 ++ + ++ D VL+ G + + + ++ + + Sbjct: 222 VESSVAQRASVRL--IGDQGVLAEEVLDLPAGSTQVPFEVQVTARGMQQLRVVVQGEVDG 279 Query: 69 KKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIP--- 125 + + + I + G + D + + Q I LI Sbjct: 280 RVQNNEAAAIIQAYGPRQILLVAANSADAQPLASALEAANFQTAIRVPAEMPNDLIGLSE 339 Query: 126 --------SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 + L + +E S +L + M+ Y + Sbjct: 340 YAAVIVVNTPARALPAGAMEALEISVRDLGRGLLMIGGEQSFGAGGYRDTPVEAALPVYM 399 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 +S PA +K+D+ E+ + + R Sbjct: 400 DVRDREQRPDLALVFVIDRSGSMAEPAGNVQKLDIAKEALVQAIRMLYGED--------R 451 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 +G + ++ EV + + TN + R L Sbjct: 452 VGIVTFDSQAYTTMPITQGVGEEEVLQAIASVTADGGTNIGAGLSAGQRMLT-------- 503 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 G K +I +TDG G + L + E MR G+ + VA + + L Sbjct: 504 --GVEAKIKHMILLTDGWGEGN-------DQLAVVEAMRAQGITLSVVAAGSDTAEE--L 552 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + + G + + + ++++ +A Sbjct: 553 KTLATAGGGRYYA------AAIMQAVPQILVDETITVA 584 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 15/93 (16%), Positives = 31/93 (33%), Gaps = 19/93 (20%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 R++ L T+ A+ E E K ++ ++DG + Sbjct: 127 RIDALPEVGATDIAAAIQLGVALFPAESE------------KRLVLLSDGAENRG----- 169 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 + + R+ + I V +S P ++L Sbjct: 170 --DAQAAAQLARSRNIPISFVDLSLPGGDAEIL 200 >gi|332285111|ref|YP_004417022.1| hypothetical protein PT7_1858 [Pusillimonas sp. T7-7] gi|330429064|gb|AEC20398.1| hypothetical protein PT7_1858 [Pusillimonas sp. T7-7] Length = 342 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 71/197 (36%), Gaps = 37/197 (18%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + V+ + ++ R+G I + G PL+ + ++ L+ Sbjct: 118 SRWTVVKAVVADFIDKRTD---------DRLGLIVFGTGAFPQA--PLTRDHKSLRLLLD 166 Query: 268 KL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 NT A+ R L + +E K +I +TDG ++G + Sbjct: 167 HTAVGMAGPNTAIGDAIGMGIRMLDSAQER----------DKVLILLTDGNDTG-----S 211 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELL 377 + L+ + ++++ + +P D LR + S GQFF D L Sbjct: 212 AVPPLRAANLAAQHHVTVHTIGIGSPTASGDDQVDFDTLRGISSASGGQFFQAQDGAALH 271 Query: 378 ESFDKITDKIQEQSVRI 394 + + + D+I + V+ Sbjct: 272 DVYATL-DRITPREVKT 287 >gi|227833165|ref|YP_002834872.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] gi|227454181|gb|ACP32934.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] Length = 521 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 41/368 (11%), Positives = 101/368 (27%), Gaps = 36/368 (9%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 +A+ + + K + K + L GS Q + Sbjct: 167 SALSAATTAFADTGRALTEKDIKQSAGKVQKLFGNQTLTSGSSGWLADRFREHPEQADAI 226 Query: 95 KD---------KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 + + L T +++ + Sbjct: 227 FNYESVLYQLKDEGADLEVVIPSDGVISADYPLSSLASSSDKDTEAKVQALAEWLAERPD 286 Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 S + +D S ++ + N + L + NT Sbjct: 287 KLTSFHLRVDNSDLPGTVFELPY--PANEQTVDALEAAFAHELRNPGNTALVLDTSGSME 344 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKK-----NLSVRIGTIAYNIGIVGNQCTPLSNNL- 259 ++D+L S L++ + + + +I I Y+ + + Sbjct: 345 GE-RMDLLKSSLLPLIDGSADGVPDGEGQVAFRNREQIKLIPYSSEPQQPTRARVDKDKP 403 Query: 260 ---NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 E+ R+ +L +T T+ A+ +A+ E+ V+ +TDGE Sbjct: 404 ATTKELADRVERLVADGDTATFEAVLNAFDEVDTSGGDIGT----------VVLMTDGEV 453 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + + + + + ++ + + + + G+ F + + Sbjct: 454 TRGRTFAQFKDAYAQLPEDKKE-IPVFVILYG--EANIQEMEELAQLTGGKTFDALN-GD 509 Query: 376 LLESFDKI 383 L +F++I Sbjct: 510 LAAAFEEI 517 >gi|302670289|ref|YP_003830249.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] gi|302394762|gb|ADL33667.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] Length = 568 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 42/362 (11%), Positives = 94/362 (25%), Gaps = 29/362 (8%) Query: 44 IVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY 103 D + ++ + + TK + N Sbjct: 55 AAQAPVNSDAKSADERGKYSAGTNNNYKYSDNTTDNYYGDKKVINNNESYTKPEENGFNL 114 Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 + + + + + E I+ DL Sbjct: 115 VMNQPLSTFAADVDTASYSNVRRMIEDGYAVGEIDPESVRPEEFINYFNYDLYEPEKGDL 174 Query: 164 Y-LQKHNDNNNMTSNKYLLPPPPKKSFWS------KNTTKSKYAPAPAPANRKIDVLIES 216 + + S+ L+ K N + K+ +L +S Sbjct: 175 FGITTEVSACPWNSDNQLMFVGMKTGEIDMEEAPVSNLVFLIDVSGSMSSRNKLPLLQKS 234 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 LV+S+ I + Y+ +N +K ++KL+ TN Sbjct: 235 FDELVDSLPDEGT--------ISIVTYSGEEKVVLSGEPMSNKKGIKKAIDKLHANGCTN 286 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 M AY N VI TDG+ + + + + + + Sbjct: 287 GQAGMQKAYEIAQKYFIEGGNNR--------VIMATDGDLNVGISDLD--DLEKFITDKK 336 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + G+ + + D ++ D +G + ++ E + + D++ +A Sbjct: 337 DEGVFLSILGFGEGNYKDDKMQTLADCGNGNYSYIDSLEEGKKV---LVDEMSSTLYTVA 393 Query: 396 PN 397 + Sbjct: 394 KD 395 >gi|256261598|gb|ACU65921.1| CR4 receptor subunit alphaX [Ovis aries] Length = 1158 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 59/177 (33%), Gaps = 17/177 (9%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENT 275 +++ ++ + + + S + + ++ + ++ + LN T Sbjct: 167 FNKMLSFVKAVMSQFQRPSSQFSLVQFSDRFQEHFTFKDFATSSDPLNLLNSVWQLGGWT 226 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T A+ L + + K +I ITDGE + ++ ++ Sbjct: 227 FTASAIRFVTDRLLSAAYGAR-----KDASKILIVITDGEKTE------KVDYKEVIPRA 275 Query: 336 RNAGMKIYSVAVSAP---PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 AG+ Y++ V + L + F V + L + ++ +KI Sbjct: 276 EAAGIIRYAIGVGSAFQYRNSLQELIDIASTPSKEHVFQVENFDALRDIQKQLKEKI 332 >gi|156977400|ref|YP_001448306.1| Flp pilus assembly protein TadG [Vibrio harveyi ATCC BAA-1116] gi|156528994|gb|ABU74079.1| hypothetical protein VIBHAR_06187 [Vibrio harveyi ATCC BAA-1116] Length = 515 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 30/260 (11%), Positives = 77/260 (29%), Gaps = 18/260 (6%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +++ F +A++ + ++++ + +AA ++ P + + Sbjct: 22 MGLLLVPIMGFTFWAVEGTRYVQESSRLRDSAEAAAMAVTIE-------DQPGAARALAT 74 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + ++ + + +QY ++ ++ + F+ Sbjct: 75 KYVENYVRDIKSTNLSAQRFYQAEDKGTG------ALEYIQYTVNARTTHDSWFASSFIP 128 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY---LQKHNDNNNMTSNK 178 + + I I V D SRSM D + K D+ ++ Sbjct: 129 SFDKQQELAGRSLAR-KYPAYLGDNNIDIVFVSDFSRSMNDKWGSSWNKKIDDLKTAIDQ 187 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 K + + K + D L+ G + +++ N + Sbjct: 188 ISNNILCKSTRQEYVDGEWKDVCDEPGDDTTSDKLLNRVGFVPFNVRTREIVAGNRANAT 247 Query: 239 GTIAYNIGIVGNQCTPLSNN 258 ++Y G +P S N Sbjct: 248 SQLSYKNG-YKAYLSPYSYN 266 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 32/247 (12%), Positives = 73/247 (29%), Gaps = 17/247 (6%) Query: 157 SRSMEDLYLQKHNDNNNMTSNK-------YLLPPPPKKSFWSKNTTK--SKYAPAPAPAN 207 S + + + Y P + ++K A A Sbjct: 261 SPYSYNDVDWNYWRTYTSSEVNNCAYWQSYCQNPRAENHNYAKRIRDVLEADRYAVADVY 320 Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +D+ + + R+ + LSN L+++ + Sbjct: 321 NYVDLPTSVSTMFTDKSGLKANFYGVSGTRLFNAHGSSYSSQFHNIQLSNKLSDL-DSIK 379 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK---KFVIFITDGENSGASAYQN 324 + T + + + L+ +S + K ++ ++DG+ S + Sbjct: 380 SMWADGGTAAFQGILRGSQVLHEGDPNSSDQEEQQAYNKKIKMLLILSDGQESPDNGILK 439 Query: 325 TLNTLQICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 L +C+ R G+ I + + Q + C + + D L E +K Sbjct: 440 GLVDWGMCDKARQEIPGLYIGVIGIDFRASQQSGFQDCVVDPRE--DIIDVSNLDELIEK 497 Query: 383 ITDKIQE 389 I + I++ Sbjct: 498 IEELIRK 504 >gi|149019069|gb|EDL77710.1| procollagen, type XII, alpha 1, isoform CRA_b [Rattus norvegicus] gi|149019070|gb|EDL77711.1| procollagen, type XII, alpha 1, isoform CRA_b [Rattus norvegicus] Length = 2827 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 53/160 (33%), Gaps = 18/160 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 V+I Y+ + + + L NT T A++ ++ + Sbjct: 1177 RVQIALAQYSGDPRTEWHLNAHRDKKSLLQAVANLPYKGGNTLTGMALNFIRQQSFKT-- 1234 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 R +K + ITDG++ + + +++ G++++++ + Sbjct: 1235 ---QAGMRPRARKIGVLITDGKSQDDVEAPS--------KKLKDEGVELFAIGI--KNAD 1281 Query: 354 QDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ + V D L + D +T + Sbjct: 1282 EVELKMIATDPDDIHAYNVADFESLSKIVDDLTINLCNSV 1321 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 32/167 (19%), Positives = 63/167 (37%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ N + ++ +N TNT AM + +++ Sbjct: 415 SPNRVQISLVQYSRDPHTEFTLKEFNRVEDIIKAINNFPYRGGSTNTGKAMTYVREKIF- 473 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 N + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 474 ----VPNKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 519 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 + L F V D ++F +I+ ++ + +RI Sbjct: 520 DAVRSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 562 >gi|257456195|ref|ZP_05621392.1| BatA protein [Treponema vincentii ATCC 35580] gi|257446281|gb|EEV21327.1| BatA protein [Treponema vincentii ATCC 35580] Length = 332 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 51/168 (30%), Gaps = 41/168 (24%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G P + + + +RL++L + T + A L Sbjct: 138 GSSAAVLIPPTIDRHTFLTRLDQLQVGELGDGTAIGMGLASAVLHLTQYS---------- 187 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------- 352 L +I TDG+N+ + + +++ + Y + + Sbjct: 188 TLPSHIILFTDGDNNTGEIH-----PRAAADIIKHKKIGFYIIGLGKSGYAPVKYIDPIQ 242 Query: 353 ------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 + L+K +G++F+ L + F++ KI Sbjct: 243 KKEISGTLNTVFNETELQKIAGYGNGRYFSAKSPELLTDIFNRFIQKI 290 >gi|268575272|ref|XP_002642615.1| C. briggsae CBR-MUP-4 protein [Caenorhabditis briggsae] gi|187032080|emb|CAP28849.1| CBR-MUP-4 protein [Caenorhabditis briggsae AF16] Length = 2101 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 22/176 (12%), Positives = 56/176 (31%), Gaps = 14/176 (7%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + V + + + R+ + Y+ I ++ ++ Sbjct: 444 GSGSIGSYVFQTEVLRFLAEFTEL-FDIAPQKTRVSVVQYSDQIRHEFGLDNYSDRKSLQ 502 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + + T T A+ H E ++E+ + ++ + I ITDG + Sbjct: 503 NAIRNIEYLTGLTRTGAAIEHVANEAFSERRGARPVG---QVSRVAIVITDGRSQDNVTR 559 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + +++++V V L + + S + F V+ +L Sbjct: 560 PSN--------NAHKQDIQLFAVGV-TNHVLDAELEEISGSKDRTFHVSGFEDLNT 606 >gi|332970076|gb|EGK09074.1| D-amino acid dehydrogenase large subunit [Desmospora sp. 8437] Length = 454 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 31/194 (15%), Positives = 70/194 (36%), Gaps = 22/194 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL-----SNNLNEVK 263 K+D+ E+ N V+ + + + + G+ T + S + Sbjct: 163 KMDLAKEAIENFVSDMPENAKISLRVYGHKGSNRKQDQKESCASTEVVYPHGSYVKGKFG 222 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 LN P T AM A ++L + + ++DG + Sbjct: 223 KALNSFEPTGWTPLAAAMEEARQDLKPYA--------GEDAENIIYVVSDGIETCGG--- 271 Query: 324 NTLNTLQICEYMRNAGMK--IYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 + ++ + + N+ ++ + + GQ L+K + G++ N EL +SF Sbjct: 272 ---DPVKAAKSLYNSDIQAVVNIIGFDVDDAGQQALKKVAEAGGGEYKTANTREELNQSF 328 Query: 381 DKITDKIQEQSVRI 394 D+I+++ ++ Sbjct: 329 GIDWDEIEKEVSKV 342 >gi|282896313|ref|ZP_06304335.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281198809|gb|EFA73688.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 336 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 22/175 (12%), Positives = 56/175 (32%), Gaps = 20/175 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 +K R+ + +N + + +K ++N+L+ T+ + EL Sbjct: 71 DKLRDQDRLSIVVFNHRAEVLLSNQNVVDRDHIKQQINRLSANGGTSIDEGLRLGIEEL- 129 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 G +TDGE + + + L+ + + + + ++ Sbjct: 130 --------AKGRRDTISQAFLLTDGE----NEHGDNNRCLKFAQLAADYNLTVNTLGFGN 177 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE-------QSVRIAPN 397 L + G + + ++ F+ + ++Q +APN Sbjct: 178 NWNQHILEKISDAGLGSLSHIEHPDQAVDKFNSLLMRMQTVGLTNAYLLFSLAPN 232 >gi|149188854|ref|ZP_01867144.1| hypothetical protein VSAK1_05875 [Vibrio shilonii AK1] gi|148837274|gb|EDL54221.1| hypothetical protein VSAK1_05875 [Vibrio shilonii AK1] Length = 505 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 34/295 (11%), Positives = 77/295 (26%), Gaps = 36/295 (12%) Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 + RS+ L+ + S + + P Sbjct: 203 NIRTREVISSQNARSTSQLSYRNDTHANRSTYTYNQVPWDAWRGESWNQVSSCANNPGNC 262 Query: 188 S-----FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 W++ A P A L S + + Sbjct: 263 KPVRNWSWNRRQNCYYNARCPQEALNNQQYAKRINDVLSRSYYYPDDYNYVDFNQTVSTM 322 Query: 243 YNI-----------------------GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 ++ L+N L+++ + ++ + +T + Sbjct: 323 FSDKSGLSSNYYRINGVQLFAGFGDQSSTQFHNIALTNKLSDL-NAISPMWANGSTAAFQ 381 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLK---KFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + + L +S + K ++ ++DG+ S + L +C R Sbjct: 382 GILRGAQILKQGDPNSSDQDKQQAYTKKIKMLLILSDGQESPNNGILRGLVNAGMCNKAR 441 Query: 337 NA--GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 G+ I + ++ Q + C + + D L E KI + I++ Sbjct: 442 QQIPGLYIGVIGINFQASQQSGFQDCVVDPNE--DIIDVSNLDELIKKIEELIRK 494 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 39/379 (10%), Positives = 98/379 (25%), Gaps = 43/379 (11%) Query: 10 FLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIK 69 F +A++ + ++++ A +AA L+ + + + K I+ Sbjct: 1 MGFTFWAVEGTRYVQETSRLRDASEAAALAVTIE-----------DQPTLANNLATKYIE 49 Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 +++ A Q N + +QY +K ++ + F+ Sbjct: 50 NYVRDIKSTALTAQRFHQAENQNAGVLEY--IQYTVNAKTTHDSWFASSFIPSFGDQQDL 107 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ------KHNDNNNMTSNKYLLPP 183 + + I I V D S SM + + +L Sbjct: 108 AGRSLAR-KYPVYLGDNNIDIVFVSDFSGSMNSRWGTNRNRKIDDLKTAINQISDKILCT 166 Query: 184 PPKKSFWSKNTTK----SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 ++ + N ++ + + + + LS R Sbjct: 167 SIRRDQVNGNVEDVCDEQGQDSTANKLLNRVGFVPFNIRTREVISSQNARSTSQLSYRND 226 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSR------------LNKLNPYENTNTYPAMHHAYRE 287 T A NQ + + + N Y Sbjct: 227 THANRSTYTYNQVPWDAWRGESWNQVSSCANNPGNCKPVRNWSWNRRQNCYYNARCPQEA 286 Query: 288 LYNEKESSHNT--IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 L N++ + + + D + ++ + + Y R G+++++ Sbjct: 287 LNNQQYAKRINDVLSRSYYYPDDYNYVDFNQTVSTMFSDKSGLSS--NYYRINGVQLFA- 343 Query: 346 AVSAPPEGQDLLRKCTDSS 364 + ++ Sbjct: 344 GFG--DQSSTQFHNIALTN 360 >gi|167951278|ref|ZP_02538352.1| von Willebrand factor, type A [Endoriftia persephone 'Hot96_1+Hot96_2'] Length = 269 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 31/266 (11%), Positives = 64/266 (24%), Gaps = 23/266 (8%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 A Y L L P + + + + Sbjct: 3 SASYANLRRILNEGRLPPMDAVRVEEMINYFNYEDMAAEQRDTPFGITTEVAANPWNPED 62 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 S P W S +P + + Sbjct: 63 KASAYRHQSLAAQGSPKCPPPIWVFLVDVSGSMHSPDKLPL------------LKRSLRL 110 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + + R+ + Y N ++ L +L+ +TN + AY + Sbjct: 111 LSRSLDADDRVSLVVYAGASGVVLEPTPGNKRATIEQALQQLSAGGSTNGGAGIRLAYAK 170 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 VI TDG+ + + N + + + R AG+ + ++ Sbjct: 171 AREAFIEGGINR--------VILATDGDFNVGTV--NHQALIDLIKQQRQAGIALTTLGF 220 Query: 348 SAPPEGQDLLRKCTDSS-GQFFAVND 372 L+ + D G + ++ Sbjct: 221 GGGNYNDHLMEQLADQGDGNYAYIDS 246 >gi|33601708|ref|NP_889268.1| hypothetical protein BB2732 [Bordetella bronchiseptica RB50] gi|33576145|emb|CAE33224.1| putative exported protein [Bordetella bronchiseptica RB50] Length = 336 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 35/197 (17%), Positives = 70/197 (35%), Gaps = 36/197 (18%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NP 271 ++ +V + R+G I + G PL+ + ++ L + Sbjct: 121 QAVQAVVGDFIDKRPD-----DRLGLIVFGAGAYPQA--PLTRDHAALRLLLQRTAVGMA 173 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 NT A+ R L + +E K +I +TDG ++ + + + Sbjct: 174 GPNTALGDAIGLGIRMLDHARER----------DKILILLTDGNDTA-----SAVPPARA 218 Query: 332 CEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 E + ++++ + P D LR + G+FF D L E + + Sbjct: 219 AELAAQHRVVVHTIGIGDPAASGEDRVDFDALRDIARIAGGRFFRARDQASLQEVYATL- 277 Query: 385 DKIQE---QSVRIAPNR 398 D+I +++R P R Sbjct: 278 DRITPHEVRTLRHQPKR 294 >gi|303241024|ref|ZP_07327534.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302591449|gb|EFL61187.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 569 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 30/259 (11%), Positives = 83/259 (32%), Gaps = 14/259 (5%) Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP 185 S + + ++ +S + + + + + +L Sbjct: 320 SKDLQTKGMNEYFFRPADSSIPLSNAISVQNGVNPSEPQTTLELP--GIDVINGILNDWQ 377 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 N ID + NL N + ++ Sbjct: 378 TNMKKRANVLFVIDTSGSMSGEP-IDNARSAIQNLFNKEAQEKNYTSIDDEDTISLMTFN 436 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 V + T +++E+ ++ L+ NT+ Y A+ A E K+S Sbjct: 437 TDVSDVYTVKGKDISEMSVVIDSLSASGNTHLYDAVDKAITEHQALKQSESEKKID---- 492 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SS 364 ++ ++DG ++ + + L ++ + + + + + +D+L +D + Sbjct: 493 -IIVVLSDGADTNSQIQFSQLESML---KQKEGNLPV-IITIGYGNVDKDVLESISDKTG 547 Query: 365 GQFFAVNDSRELLESFDKI 383 G+++ + + + F++I Sbjct: 548 GKYYE-GNPDTIKKVFEEI 565 >gi|291229807|ref|XP_002734862.1| PREDICTED: polydom-like [Saccoglossus kowalevskii] Length = 1730 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 61/191 (31%), Gaps = 29/191 (15%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP---LSN 257 + + + + +L+ + + V + S Sbjct: 42 RSASVGSANFEAEKGFVESLLGQFSISPASTRVDVVSYSEDVVRHIDYIREPKNKCHFSQ 101 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 ++ V TNT A+ A ++ H K V+ ++DG+++ Sbjct: 102 DIRHVT-----YRNSGKTNTNGALQEARNIFVGSRQDVH---------KVVVLLSDGQSN 147 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + E +R G++I+++A+ +D L S F + RE Sbjct: 148 TGG------DPTTTAEELRQDGVEIFTIAIGL--FNKDELNSIATSDHHTFEYSSFRE-- 197 Query: 378 ESFDKITDKIQ 388 F K+ +I+ Sbjct: 198 --FKKLASRIR 206 >gi|317493250|ref|ZP_07951672.1| von Willebrand factor type A domain-containing protein [Enterobacteriaceae bacterium 9_2_54FAA] gi|316918643|gb|EFV39980.1| von Willebrand factor type A domain-containing protein [Enterobacteriaceae bacterium 9_2_54FAA] Length = 544 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 35/284 (12%), Positives = 83/284 (29%), Gaps = 28/284 (9%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 Y + L L PS L + + S V K Sbjct: 88 TGSYTVMRSLLNRGALPPSDSIRLEEWLNYFHYNYPKPVDNSPFSVATEIAPTPWNAHSK 147 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 ++ PP + + + S K+ ++ S LVN ++ Sbjct: 148 LLRIAIKATDINATALPPANLVFLIDVSGSMSD------EDKLPLVKNSLKLLVNKMRDQ 201 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 +I + Y+ + +++ S +N+L+ +T + AY+ Sbjct: 202 D--------KISIVIYSGETKTVLPPTSGKDKSDILSAINQLSAGGSTAGGSGIDLAYQM 253 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + +I TDG+ + +T + + G+ + ++ Sbjct: 254 AEKGFIKNGINR--------IILATDGDFNVG--ITDTQQLEEKIKKKSKNGINLTTLGF 303 Query: 348 SAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQ 390 L+ D +G + ++ +E + + +++ Sbjct: 304 GQGNYNDSLMMHIADVGNGNYAYIDSMQEAQKV---LVEQLSST 344 >gi|227832539|ref|YP_002834246.1| hypothetical protein cauri_0711 [Corynebacterium aurimucosum ATCC 700975] gi|227453555|gb|ACP32308.1| putative membrane protein [Corynebacterium aurimucosum ATCC 700975] Length = 693 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 71/233 (30%), Gaps = 31/233 (13%) Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 L V + + + N +S+ + P T A +ID Sbjct: 60 LAVVSGLLPVASAEEETNAPSSSSSTMAPTMVVFDSSGSMIT------NDAGGQTRIDAA 113 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV-----GNQCTPLSNNLNEVKSRLNK 268 ++A + A P + N ++ + ++ Sbjct: 114 KDAARTFITEAGDDAPLGLVTYGGNTGEAPEDEAAGCQDITVVTPPEAGNSEKMIAHMDG 173 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L P T ++ A EL E + + +I ++DG + Sbjct: 174 LQPRGFTPIGESLRKAAAELPKEGQ------------RSIILVSDGVAT-----CTPPPV 216 Query: 329 LQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + + ++ G+ I +V + PE Q L+ D + G + +D+ L + Sbjct: 217 CDVAKELKEQGIDLVINTVGFNVEPEAQQELQCIADATGGTYANASDADSLAK 269 >gi|326921803|ref|XP_003207144.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Meleagris gallopavo] Length = 1054 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 63/175 (36%), Gaps = 21/175 (12%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYN 290 + RIG I ++ + +KS ++K+ T T A+ A Sbjct: 722 NHAKTRIGVINFSHKVELVSSLEKYTTKESLKSAVDKMLYLGEGTYTASAIKKAINLFQA 781 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--S 348 + + ++K + +TDG+ ++ ++ + A ++I+ + + Sbjct: 782 ARPA---------VRKVAVVVTDGQAD----ARDEVHLDMVVREAHAANIEIFVIGIVQE 828 Query: 349 APPEGQDLLRK---CTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 P + L++ F+ + D + L DK+ KI + + I + Sbjct: 829 TDPHYDNFLKEMHLIATDPDEEHFYRIEDFKTLSALTDKLITKICDNASEIYSRK 883 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 51/159 (32%), Gaps = 16/159 (10%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPY 272 +L ++I + K V ++ N+ K ++ L Sbjct: 30 KNFVLSLTDNIFQMKPVKSQKYDVKLAGMQFSSTVSVDHPFIAWKNVQNFKEKIRALVYI 89 Query: 273 E-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T +Y A+ +A + E K +TDG + + N+ + I Sbjct: 90 GQGTYSYYAISNATQLFKTEGR--------EGSIKVAFLMTDGVD-----HPNSPSVEGI 136 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV 370 R+ G+ +++ +S ++ LR F V Sbjct: 137 ATAARSLGIHFFTIGLSKKNVKEEKLRLI-SGDSSFKHV 174 >gi|308501643|ref|XP_003113006.1| CRE-MUA-3 protein [Caenorhabditis remanei] gi|308265307|gb|EFP09260.1| CRE-MUA-3 protein [Caenorhabditis remanei] Length = 3860 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 58/185 (31%), Gaps = 13/185 (7%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + V V+ + + R+G I Y+ I + + + Sbjct: 1238 GSGSIGSYVFKNEVLRFVSEFVELFE-IGRSKTRVGLIQYSDQIRHEFDLDQYGDRSSLL 1296 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 +++ T T A+ H +E ++E+ + + I +TDG + Sbjct: 1297 KGISETQYLTGLTRTGAAIQHMVQEGFSERRGARPQQSDI--ARVAIILTDGRSQD---- 1350 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 N + R + +++ V L S ++F V+ ++L Sbjct: 1351 ----NVTGPADSARKLSINTFAIGV-TDHVLASELESIAGSPNRWFYVDKFKDLDTRLRS 1405 Query: 383 ITDKI 387 + K Sbjct: 1406 MIQKA 1410 >gi|229491170|ref|ZP_04384998.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] gi|229321908|gb|EEN87701.1| von Willebrand factor type A domain protein [Rhodococcus erythropolis SK121] Length = 614 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 55/131 (41%), Gaps = 23/131 (17%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 ++N +++ + N+L T T A+ A +L + + +I I+DG+ Sbjct: 108 TDNRDQLNNATNQLTAGGTTPTPDALRAAAGDLPSTG------------DRTIILISDGQ 155 Query: 316 NSGASAYQNTLNTLQICEYMRNA-GM--KIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVN 371 ++ + + ++ G+ ++++V +AP + L + G++F Sbjct: 156 STCG-------DPCAVATELKTQLGIDFRVHAVGFNAPDVAESELSCIANATGGRYFTAT 208 Query: 372 DSRELLESFDK 382 ++ EL ++ Sbjct: 209 NTTELSDAISA 219 >gi|160878421|ref|YP_001557389.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] gi|160427087|gb|ABX40650.1| von Willebrand factor type A [Clostridium phytofermentans ISDg] Length = 551 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 55/164 (33%), Gaps = 14/164 (8%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 RI + Y N ++++ + +L +T + AY+ Sbjct: 228 TEKDRISIVTYAGNDTVVLSGAKGNQKEKIQNAITELEAGGSTFGSKGIETAYQLAMENY 287 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 N VI TDG+ + ++ L + E R +G+ + + Sbjct: 288 IEGGNNR--------VILATDGDLNVGVTSESEL--TNLIEEKRKSGVALSVLGFGTGNI 337 Query: 353 GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + + D G + ++ L+E+ + +++ V +A Sbjct: 338 KDNKMEALADHGNGNYAYIDS---LMEARKVLVEEMGATLVTVA 378 >gi|296394903|ref|YP_003659787.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] gi|296182050|gb|ADG98956.1| von Willebrand factor type A [Segniliparus rotundus DSM 44985] Length = 343 Score = 67.2 bits (162), Expect = 4e-09, Method: Composition-based stats. Identities = 31/236 (13%), Positives = 74/236 (31%), Gaps = 35/236 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ SK A ++D +A V + +V++G + + Sbjct: 95 NQATVILVLDISKSMAATDVKPSRVDAARAAAIKFV--------DGMAPTVQLGVVTFAG 146 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKL----NPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 S + K ++++ + T T ++ A +++ + + Sbjct: 147 SAQPLVRP--STDHETAKKVIDQMIRADKLEKQTATGEGIYTALQQIET--IAGALGGKN 202 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--------- 352 ++ ++DG+ + + + + +VA Sbjct: 203 HTPPARIVLVSDGKETVPDDLNAPRGAYAAARTAKEKHIPVCTVAFGTKSGKITIDNQVD 262 Query: 353 ----GQDLLRKCTD---SSGQ---FFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 D L+K +D S G FF EL + + + + I ++VR +R Sbjct: 263 EVPVDLDSLKKISDLSNSPGNSCRFFPAESQAELAQIYQSLNEDIGYENVRSESSR 318 >gi|301059316|ref|ZP_07200243.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300446545|gb|EFK10383.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 527 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 51/149 (34%), Gaps = 10/149 (6%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 I Y+ + + N+ ++SR+ + P NT + + E+ Sbjct: 184 QKDMFSVIVYDHNVKTIVPAQSARNVEWIESRIRGIGPGGNTALFGGVSQGASEVRKNLS 243 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + + +I ++DG + + L L + + + ++ V Sbjct: 244 NKYVHR--------IILLSDGLANVGPSSPEDLGRLGAA--LIKESISVTTIGVGTDYNE 293 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + R +S G + V SR+L + F Sbjct: 294 DLMARLSQNSDGNTYFVESSRDLPKIFAA 322 >gi|222623880|gb|EEE58012.1| hypothetical protein OsJ_08791 [Oryza sativa Japonica Group] Length = 759 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 20/184 (10%), Positives = 57/184 (30%), Gaps = 25/184 (13%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSR 265 K+ +L + G ++ ++ + R+ IA++ + Sbjct: 332 TKLALLKRAMGFVIQNLGSSD--------RLSVIAFSSSARRLFPLRRMTETGRQQSLQA 383 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + L TN + + + + + +I ++DG+++ + Sbjct: 384 VYSLTSNGGTNIAEGLRKGSKVIED--------RQAKNPVCSIILLSDGQDTYTVSPTAG 435 Query: 326 --LNTLQICEYM--RNAG---MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + C + + G + ++ A + L S G F + + + Sbjct: 436 VHKAAPEYCSLLPYTSNGCQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAIQD 495 Query: 379 SFDK 382 +F + Sbjct: 496 AFAQ 499 >gi|218191772|gb|EEC74199.1| hypothetical protein OsI_09355 [Oryza sativa Indica Group] Length = 723 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 20/184 (10%), Positives = 57/184 (30%), Gaps = 25/184 (13%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSR 265 K+ +L + G ++ ++ + R+ IA++ + Sbjct: 296 TKLALLKRAMGFVIQNLGSSD--------RLSVIAFSSSARRLFPLRRMTETGRQQSLQA 347 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + L TN + + + + + +I ++DG+++ + Sbjct: 348 VYSLTSNGGTNIAEGLRKGSKVIED--------RQAKNPVCSIILLSDGQDTYTVSPTAG 399 Query: 326 --LNTLQICEYM--RNAG---MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + C + + G + ++ A + L S G F + + + Sbjct: 400 VHKAAPEYCSLLPYTSNGCQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAIQD 459 Query: 379 SFDK 382 +F + Sbjct: 460 AFAQ 463 >gi|15451571|gb|AAK98695.1|AC069158_7 Hypothetical protein protein containing a von Willebrand factor type A domain [Oryza sativa Japonica Group] Length = 714 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 20/184 (10%), Positives = 57/184 (30%), Gaps = 25/184 (13%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSR 265 K+ +L + G ++ ++ + R+ IA++ + Sbjct: 287 TKLALLKRAMGFVIQNLGSSD--------RLSVIAFSSSARRLFPLRRMTETGRQQSLQA 338 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + L TN + + + + + +I ++DG+++ + Sbjct: 339 VYSLTSNGGTNIAEGLRKGSKVIED--------RQAKNPVCSIILLSDGQDTYTVSPTAG 390 Query: 326 --LNTLQICEYM--RNAG---MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + C + + G + ++ A + L S G F + + + Sbjct: 391 VHKAAPEYCSLLPYTSNGCQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAIQD 450 Query: 379 SFDK 382 +F + Sbjct: 451 AFAQ 454 >gi|27754463|gb|AAO22679.1| unknown protein [Arabidopsis thaliana] Length = 641 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 17/177 (9%), Positives = 59/177 (33%), Gaps = 19/177 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN--NLNEVKSRL 266 K++++ + ++ ++ + R+ I+++ L + + Sbjct: 219 KMELMKNAMSFVIQNLGETD--------RLSVISFSSMARRLFPLRLMSETGKQAAMQAV 270 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 N L TN + R + ++ ++DG+++ ++ Sbjct: 271 NSLVADGGTNIAEGLKIGARVIE--------GRRWKNPVSGMMLLSDGQDNFTFSHAGVR 322 Query: 327 NTLQICEYMRNA-GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + ++ + I++ + + + + SSG F + + ++F + Sbjct: 323 LRTDYESLLPSSCRIPIHTFGFGSDHDAELMHTISEVSSGTFSFIETETVIQDAFAQ 379 >gi|15223093|ref|NP_172283.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|8778841|gb|AAF79840.1|AC026875_20 T6D22.13 [Arabidopsis thaliana] gi|332190114|gb|AEE28235.1| C3HC4-type RING finger-containing protein [Arabidopsis thaliana] Length = 641 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 17/177 (9%), Positives = 59/177 (33%), Gaps = 19/177 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN--NLNEVKSRL 266 K++++ + ++ ++ + R+ I+++ L + + Sbjct: 219 KMELMKNAMSFVIQNLGETD--------RLSVISFSSMARRLFPLRLMSETGKQAAMQAV 270 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 N L TN + R + ++ ++DG+++ ++ Sbjct: 271 NSLVADGGTNIAEGLKIGARVIE--------GRRWKNPVSGMMLLSDGQDNFTFSHAGVR 322 Query: 327 NTLQICEYMRNA-GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + ++ + I++ + + + + SSG F + + ++F + Sbjct: 323 LRTDYESLLPSSCRIPIHTFGFGSDHDAELMHTISEVSSGTFSFIETETVIQDAFAQ 379 >gi|115449371|ref|NP_001048450.1| Os02g0806700 [Oryza sativa Japonica Group] gi|47497349|dbj|BAD19389.1| zinc finger-like [Oryza sativa Japonica Group] gi|113537981|dbj|BAF10364.1| Os02g0806700 [Oryza sativa Japonica Group] gi|215701428|dbj|BAG92852.1| unnamed protein product [Oryza sativa Japonica Group] Length = 723 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 20/184 (10%), Positives = 57/184 (30%), Gaps = 25/184 (13%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSR 265 K+ +L + G ++ ++ + R+ IA++ + Sbjct: 296 TKLALLKRAMGFVIQNLGSSD--------RLSVIAFSSSARRLFPLRRMTETGRQQSLQA 347 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + L TN + + + + + +I ++DG+++ + Sbjct: 348 VYSLTSNGGTNIAEGLRKGSKVIED--------RQAKNPVCSIILLSDGQDTYTVSPTAG 399 Query: 326 --LNTLQICEYM--RNAG---MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + C + + G + ++ A + L S G F + + + Sbjct: 400 VHKAAPEYCSLLPYTSNGCQQVPVHVFGFGADHDSVSLHSISQTSGGTFSFIETEAAIQD 459 Query: 379 SFDK 382 +F + Sbjct: 460 AFAQ 463 >gi|189501234|ref|YP_001960704.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] gi|189496675|gb|ACE05223.1| von Willebrand factor type A [Chlorobium phaeobacteroides BS1] Length = 331 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 65/186 (34%), Gaps = 36/186 (19%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV-----KSRLNKLNPY 272 G+ +++ +K + PL+ + + + ++ ++ Sbjct: 115 GSRLDAAKKIALRFIRERPQDRFGLVLFRGKSFTLCPLTLDHRLLGMLVRQVSVDAIS-D 173 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 + T A+ L S ++ ++ +TDGE++ + + Sbjct: 174 KGTAIGSAILVGTNRL----------RASVSKERVLLLLTDGEHNSG-----EVGPVTAS 218 Query: 333 EYMRNAGMKIYSVAV------SAPPE--------GQDLLRKCTD-SSGQFFAVNDSRELL 377 E ++ G++IY + V +P + +L + G++F +D L Sbjct: 219 EIAQSEGIRIYVIGVRNEEEAGSPESMDAEREGVDEQVLGTVAGMTGGRYFRASDENSLK 278 Query: 378 ESFDKI 383 ++F +I Sbjct: 279 DAFGEI 284 >gi|294653581|ref|NP_714598.2| von Willebrand factor type A domain-containing protein [Leptospira interrogans serovar Lai str. 56601] gi|293630705|gb|AAN51613.2| BatA [Leptospira interrogans serovar Lai str. 56601] Length = 312 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 22/117 (18%), Positives = 38/117 (32%), Gaps = 20/117 (17%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T A+ + L S K ++ ITDG ++ ++ + + Sbjct: 167 GTAIGDAIILSTYRL----------RNSKARSKVIVLITDGVSNTG-----KIDPVTATD 211 Query: 334 YMRNAGMKIYSVAVSAPPEG-----QDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 G KIYSV + + L ++ G+FF D E+ I Sbjct: 212 LAEQIGAKIYSVGIGKEDGSYEINFEILQELSANTGGRFFRAEDPEEMKAVLSSIDS 268 >gi|56797849|emb|CAF33009.1| matrilin-1 [Danio rerio] Length = 320 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 51/159 (32%), Gaps = 19/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + R+G + Y + + ++K+ P T T A+ A Sbjct: 69 SVGPDATRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVA 128 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E S + K I +TDG N I R AG++I+++ V Sbjct: 129 FSEAEGG---RKSPDISKVAIIVTDGRPQD--------NIRDIAARAREAGIEIFAIGVG 177 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 LR+ V +L + F + Sbjct: 178 R--VDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 >gi|45655623|ref|YP_003432.1| BatA [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] gi|45602594|gb|AAS72069.1| BatA [Leptospira interrogans serovar Copenhageni str. Fiocruz L1-130] Length = 320 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 22/117 (18%), Positives = 38/117 (32%), Gaps = 20/117 (17%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T A+ + L S K ++ ITDG ++ ++ + + Sbjct: 175 GTAIGDAIILSTYRL----------RNSKARSKVIVLITDGVSNTG-----KIDPVTATD 219 Query: 334 YMRNAGMKIYSVAVSAPPEG-----QDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 G KIYSV + + L ++ G+FF D E+ I Sbjct: 220 LAEQIGAKIYSVGIGKEDGSYEINFEILQELSANTGGRFFRAEDPEEMKAVLSSIDS 276 >gi|254414094|ref|ZP_05027862.1| Appr-1-p processing enzyme family protein [Microcoleus chthonoplastes PCC 7420] gi|196179230|gb|EDX74226.1| Appr-1-p processing enzyme family protein [Microcoleus chthonoplastes PCC 7420] Length = 601 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 21/148 (14%), Positives = 53/148 (35%), Gaps = 12/148 (8%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 S R+ Y+ + + L+ N + ++ +++P T + +++ Sbjct: 77 SDRVSVTIYDDIVETLIPSTLATEKNYITRQIERIHPRNMTALHDGWVEGGKQVSQYLNP 136 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 VI ++DG + + + + + G+ ++ V + Sbjct: 137 EGLNR--------VILLSDGLANKGQTNADAIASDVYG--LAQQGVSTTTMGVG-DDYNE 185 Query: 355 DLLRKCTDSS-GQFFAVNDSRELLESFD 381 DLL +S G ++ ++ +L E F Sbjct: 186 DLLEVMANSGDGNYYYIDTPEQLPEIFQ 213 >gi|148655541|ref|YP_001275746.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567651|gb|ABQ89796.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 504 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 32/328 (9%), Positives = 82/328 (25%), Gaps = 38/328 (11%) Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 Q ++ + PS T + ++ + + Sbjct: 20 AACGGAPPPASQVDVQATIDAGVRATLAAQPTEAPSPTTPPPSPTAVPPTATTAPTSPPV 79 Query: 151 CMVLDVSRSMEDLYLQKHNDNN-------NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 + + P + + + N A Sbjct: 80 APTVAPFTPATPTATGADAPTTVPESSSPPTDTTTIFRPAEGEAAQVTTNIQLVFDASGS 139 Query: 204 A----PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG------TIAYNIGIVGNQCT 253 KI + +++++ V Sbjct: 140 MAQRIGGETKIQAARRAMERIIDTLPDNPDLNVGFRVFGHEGDSSEAQKARSCQSTALLV 199 Query: 254 PLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 P+ N ++ + P T A+ A + ++ +I +T Sbjct: 200 PMQGVNKALLRQQAQAWQPTGWTPISLALQRAGEDFQAG----------ENVRNVIIMVT 249 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKCTD-SSGQFFA 369 DGE + + + + + + ++I V P+ LR + S G + Sbjct: 250 DGEETCGG------DPCAVAKALAESQAEVRIDVVGFGTTPDVAKTLRCIAENSGGVYTD 303 Query: 370 VNDSRELLESFDK-ITDKIQEQSVRIAP 396 + L+++ ++ I ++ ++R P Sbjct: 304 AQNGDALVQTLEELIAATLKRSTLRFIP 331 >gi|282900569|ref|ZP_06308511.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] gi|281194369|gb|EFA69324.1| von Willebrand factor, type A [Cylindrospermopsis raciborskii CS-505] Length = 418 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 23/175 (13%), Positives = 57/175 (32%), Gaps = 20/175 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 +K R+ + +N + + +K ++N+L+ T+ + EL Sbjct: 71 DKLRDQDRLSIVVFNHRAEVLLSNQNVVDRDHIKQQINRLSANGGTSIDEGLRLGIEEL- 129 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 G +TDGE + + + L+ + + + + ++ Sbjct: 130 --------AKGRKDTISQAFLLTDGE----NEHGDNNRCLKFAQLAADYNLTVNTLGFGN 177 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE-------QSVRIAPN 397 L + G + + ++ FD + ++Q + +APN Sbjct: 178 NWNQHILEKISDAGLGSLSHIEHPDQAMDKFDSLLTRMQTVGLTNAYLLLSLAPN 232 >gi|269960459|ref|ZP_06174831.1| hypothetical protein VME_12150 [Vibrio harveyi 1DA3] gi|269834536|gb|EEZ88623.1| hypothetical protein VME_12150 [Vibrio harveyi 1DA3] Length = 420 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 52/166 (31%), Gaps = 3/166 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTI-KDPTTKKDQ 59 + ++++ + + IDL H + + ++Q+A+D+A L+G + + + T K Sbjct: 20 LISMVLLILLGMAAFGIDLNHQVLNKTRLQNAVDSAALAGAVVVDENGNVSAAETAAKAT 79 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN-ITKDKNNPLQYIAESKAQYEIPTENL 118 S+I L + ++ + +Y + Sbjct: 80 LSSISASDGNAELVFTDSNTAVTFSTDRATFVSAASFTPPASGEYDIYVRVAVTEIGLTQ 139 Query: 119 FLKGLIPSA-LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 +L + + S + + I M D S Sbjct: 140 YLSDVFGINKNVSASAVAGRSAAIAYTCNISPIAMCADTSSPNTTW 185 >gi|224106794|ref|XP_002314287.1| predicted protein [Populus trichocarpa] gi|222850695|gb|EEE88242.1| predicted protein [Populus trichocarpa] Length = 688 Score = 67.2 bits (162), Expect = 5e-09, Method: Composition-based stats. Identities = 16/185 (8%), Positives = 56/185 (30%), Gaps = 24/185 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ +L + G ++ ++ + + + + + E +N Sbjct: 279 TKLALLKRAMGFVIQNLGPSDRLSVIAFSSTARRHFPLRRMTET------GKLEALQAVN 332 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L TN + ++ + + +I ++DG+++ + + Sbjct: 333 SLVSSGGTNIAEGLRKGFKVVVD--------RKWKNPVCSIILLSDGQDTYTISGTSMTR 384 Query: 328 TLQ-----ICEYMRNAG-----MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + G + +++ + + + S G F + + Sbjct: 385 PQADYKSLLPTSIHRNGSSGFRIPVHAFGFGSDHDAASMHSISEISGGTFSFIEAEGVIQ 444 Query: 378 ESFDK 382 ++F + Sbjct: 445 DAFAQ 449 >gi|310814568|ref|YP_003962532.1| von Willebrand factor, type A [Ketogulonicigenium vulgare Y25] gi|308753303|gb|ADO41232.1| von Willebrand factor, type A [Ketogulonicigenium vulgare Y25] Length = 1160 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 42/401 (10%), Positives = 98/401 (24%), Gaps = 43/401 (10%) Query: 1 MTAIIISVCF-----LFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSD-RTIKDPT 54 +TA + + I + + + ++ + Sbjct: 543 LTAAYLPQVIYAGDQVVARLGITAQRATQATLTLTAGEQQISTMVDVQLGANRVDLPLAM 602 Query: 55 TKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIP 114 + + Q + D +I + + P +A Sbjct: 603 ADTGAVDVVIELQAVGDPAPDNNRLVLPRDTLAAPRIAVVAQDDGPRDAVAGMLIDQGFD 662 Query: 115 TENLFLKGLIPSALT-------------NLSLRSTGIIERSSENLAISICMVLDVSRSME 161 L + + ++L +S + +V+ Sbjct: 663 AIPLTPGRVPVNPDIWDRYDAALLLDLPAIALEMRQSELLASRVQDHGLGLVIAGGPHSF 722 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 ++ LP +S ++DV ++ Sbjct: 723 GPGGYLETPLETLSPLSARLPHEGPGIAMVFVLDRSGSMSQTVGDVTRLDVAKQAVSAAA 782 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 N + + +G + + + + + L L P TN YP + Sbjct: 783 NLLD-------PQTGSLGVVMFGSEAEVALPLGPLPDAAGIAAALGHLQPGGGTNIYPGL 835 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 A++ L S + ++ +TDG + + + +R G+ Sbjct: 836 QLAFQAL----------RASDADARHIVVMTDGM-------SDEADFPGLLAAIRAEGIT 878 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + SVA+ + E G+F D L + Sbjct: 879 VSSVAIGSTSETSIAEDIALLGGGRFHNTRDFGALPSILAQ 919 >gi|198426242|ref|XP_002124410.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 402 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 28/261 (10%), Positives = 66/261 (25%), Gaps = 24/261 (9%) Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 + S ++ + +S + + + P P Sbjct: 147 PEVEIRSGSATCTRGNNVGSTCTFSCQDGLSLHPANFSHSECDGTAWNEPIPCCTRPCPP 206 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI-AYNI 245 + + P ++ +V+ + + + + +L V Sbjct: 207 FARADAVLILDSSSSVRKPNWDRM------IEFVVSMLTQFVVNESSLRVGAFRYNRAVD 260 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 +N+ + + + T T A+ HA L + Sbjct: 261 SDTQILLNGFTNDKTGLVQAIQDIPYRGSGTRTGNAIRHAKDVLLL-----PENGNRPNV 315 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA--VSAPPEGQDLLRKCTD 362 V +TDG + ++ +R G + +A + +D + Sbjct: 316 TDLVFVVTDGRSQD--------AVAEVARELRATGAVTFVIAVIIQGSTIERDQMLDIAG 367 Query: 363 SSGQFFAVN-DSRELLESFDK 382 S + F V +L F Sbjct: 368 SPDRLFEVTGGFDDLDSVFAD 388 >gi|146282738|ref|YP_001172891.1| von Willebrand factor type A domain-containing protein [Pseudomonas stutzeri A1501] gi|145570943|gb|ABP80049.1| von Willebrand factor type A domain protein [Pseudomonas stutzeri A1501] Length = 339 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 29/146 (19%), Positives = 53/146 (36%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ NT A+ A + L +S + ++ Sbjct: 150 PLTFDRRTVRVWLDEARVGIAGSNTAIGDAIGLAVKRLRERPTNS----------RVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 ITDG N+G L+ L ++I+++ + A PE + Sbjct: 200 ITDGANNGG-----ELDPLLAATLAAEESVRIHTIGIGAVPEEGGVLSRFGFNPGLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 LR + + G++F S EL Sbjct: 255 PTLRAIAEQTGGEYFRAASSAELKAI 280 >gi|330995094|ref|ZP_08319011.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576670|gb|EGG58173.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 332 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 48/142 (33%), Gaps = 37/142 (26%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T + +A L S K +I +TDG N+ L E Sbjct: 171 GTAVGMGIANAVSRL----------KDSKAKSKVIILLTDGTNNAGDIS-----PLTAAE 215 Query: 334 YMRNAGMKIYSVAVSAPP--------------------EGQDLLRKCTD-SSGQFFAVND 372 ++ G+++Y++ V L + G+F+ D Sbjct: 216 IAKSFGIRVYTIGVGTNGLAPYPMPVAGGVQYLNVPVEIDTKTLAAIAGKTDGEFYRATD 275 Query: 373 SRELLESFDKITDKIQEQSVRI 394 +++L + + I DK+++ + + Sbjct: 276 NKKLEDVYKDI-DKLEKTKLNV 296 >gi|327271798|ref|XP_003220674.1| PREDICTED: matrilin-4-like [Anolis carolinensis] Length = 592 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 27/267 (10%), Positives = 70/267 (26%), Gaps = 29/267 (10%) Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 L +S V + + + Sbjct: 297 GYTLQADGKSCQASDLCNSIEHGCEFKCVSTSGSYHCVCPEGQQLQADKK-TCNKCKAGH 355 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 + + + +++ + +V+ + + R+G + Y+ Sbjct: 356 IDLVLVIDGSKSVR--------PQNFELVKQFVNQIVDFL-----DVSPHGTRVGLVQYS 402 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 + ++K + ++ + T T A+ H + E E + S Sbjct: 403 SRVRTEFPLNKFTTAADLKKAVQRVQYMEKGTMTGLALKHMLEHSFTEAEGARPL--SQN 460 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 + + + TDG + + + AG+ +++V V + LR+ Sbjct: 461 VPRIGLVFTDGRSQDDISEW--------ARRSKEAGIIMFAVGVGKAV--ESELREIASE 510 Query: 364 S--GQFFAVNDSRELLESFDKITDKIQ 388 F D + + + I Sbjct: 511 PVDKHFSYSADFNTMTNIVENLKINIC 537 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 58/174 (33%), Gaps = 17/174 (9%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTN 276 +++ I R+G I Y+ + E++ +N + P T Sbjct: 67 RFMIDIIHNLDIGPNA--TRVGVIQYSSQVQNVFSLKSFFTRAEMEKAINNIVPLAQGTM 124 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T A+ + + +E + ++ + + +TDG ++ R Sbjct: 125 TGLAIQYVMNVAFTTQEGARPL--HKKIPRVAVIVTDGRPQD--------RVTEVSAQAR 174 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDS--SGQFFAVNDSRELLESFDKITDKIQ 388 AG++IY+V V + LR F V + + + DK+ Sbjct: 175 AAGIEIYAVGVQR--ADMNSLRAMASPALEEHVFLVESFDLIQQFGKQFQDKLC 226 >gi|309791336|ref|ZP_07685859.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226646|gb|EFO80351.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 853 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 13/119 (10%), Positives = 42/119 (35%), Gaps = 14/119 (11%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 ++ +++ ++ L T A+ L + + + +TDG Sbjct: 461 SVAQIQEQIASLPSGGGTRIERALEVGLPALAEQPTKVRHA----------VLLTDGR-- 508 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 S + ++ E R+ + + ++A+ + L + G+++ + ++ Sbjct: 509 --SFMNDNALYQRLVETARSQQITLSTIAIGLDSDTALLKQLAAWGGGRYYYADQPADI 565 >gi|288921527|ref|ZP_06415802.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347095|gb|EFC81397.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 587 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 49/137 (35%), Gaps = 8/137 (5%) Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 N P S +L + + + L T Y A+ AYR + + + Sbjct: 454 TVNDPQPGSADLTAISAAADGLTLGSGTAIYSALEAAYRYVADS---AAAPADGVAPLTS 510 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQ 366 ++ +TDGEN+ + ++ ++ ++V D +R D + G Sbjct: 511 IVLMTDGENNQGTTADAFHSSYLALPDA-ARSVRTFTVVFG--DARVDEMRTIADWTGGA 567 Query: 367 FFAVNDSRELLESFDKI 383 F L E+F +I Sbjct: 568 MFDART-SSLSEAFREI 583 >gi|149918750|ref|ZP_01907237.1| aerotolerance-related membrane protein [Plesiocystis pacifica SIR-1] gi|149820351|gb|EDM79767.1| aerotolerance-related membrane protein [Plesiocystis pacifica SIR-1] Length = 350 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 25/197 (12%), Positives = 63/197 (31%), Gaps = 36/197 (18%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 ++ V + + G + PL+ + +++ Sbjct: 144 GLDRLTVAKQVIDEFIRR-----------RPHDRIALVGFGAHASTIAPLTLDHAVLRNL 192 Query: 266 LNKLNPY--EN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + ++ + T + + L S K ++ +TDG ++ Sbjct: 193 IVQVRLGVVDGQETAIGAGLGVSLNRL----------KESQAATKIIVLLTDGVHNADGM 242 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD-----LLRKCTD-SSGQFFAVNDSRE 375 +T + + G+ IY+V + + L + + G + D++ Sbjct: 243 DPDT-----VAQTAAERGVVIYTVLMGQQTGDRSSVDAGQLERLAGATDGYAYLAEDTQT 297 Query: 376 LLESFDKITDKIQEQSV 392 L SF + DK+++ S+ Sbjct: 298 LETSFQDLLDKLEKSSI 314 >gi|47216852|emb|CAG11659.1| unnamed protein product [Tetraodon nigroviridis] Length = 1042 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 68/190 (35%), Gaps = 26/190 (13%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + LV + + R+G + ++ + ++L+E+K+ + Sbjct: 698 NFELVKDFVNALVERV-----WLSQEAARMGVVLFSHTSLAVASLQPRSSLSELKATVRS 752 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T T A+H A++ + ++K + +TDG+ ++ + Sbjct: 753 MPYLGEGTFTGSAIHRAHQLFQASRPG---------VRKVALVLTDGQAD----PRDVVQ 799 Query: 328 TLQICEYMRNAGMKIYSVAVSAPP-----EGQDLLRKCTDSS--GQFFAVNDSRELLESF 380 + G++++ + V + ++ + G + D L Sbjct: 800 VGVSAAEAQARGIEMFVIGVMNESHPLYPDFSAEMKAIASNPKEGHVHLIQDFGSLHVLE 859 Query: 381 DKITDKIQEQ 390 I ++I EQ Sbjct: 860 KVIVNQICEQ 869 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 15/140 (10%), Positives = 49/140 (35%), Gaps = 19/140 (13%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYE-NTNTYPAMHHAYRE 287 VR+ + + +++ + R+ ++ T + A+ +A + Sbjct: 110 PLAWRVRLRLAILQYSSTVSVEHNFRDWQDVDVFQGRVASMSFIGHGTYSAYAIANATQL 169 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA- 346 E S+ + + +TDG + + + + + + +++++++ Sbjct: 170 FSQE--------TSSNSLRVALLMTDGVD-----HPRSPSAVTAAAEAKQHSIRLFAISL 216 Query: 347 VSAPPEGQ--DLLRKCTDSS 364 + P +G LR + Sbjct: 217 LGLPDQGPAGTKLRSIASTP 236 >gi|308175402|ref|YP_003922107.1| hypothetical protein BAMF_3511 [Bacillus amyloliquefaciens DSM 7] gi|307608266|emb|CBI44637.1| conserved hypothetical protein YwmC [Bacillus amyloliquefaciens DSM 7] gi|328555380|gb|AEB25872.1| hypothetical protein BAMTA208_18610 [Bacillus amyloliquefaciens TA208] gi|328913751|gb|AEB65347.1| hypothetical protein LL3_03821 [Bacillus amyloliquefaciens LL3] Length = 229 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 28/214 (13%), Positives = 58/214 (27%), Gaps = 22/214 (10%) Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 N + S + SKY A R D + + + K + V Sbjct: 37 NNTAILLDASGSMAKRIDGVSKYNMAKDEIVRFADQIKSKSQVRMTVFGSEGNNKNSGKV 96 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + + + + LN + P T A+ A Sbjct: 97 QSCESIRGVYGFQRF------DKQSFLNSLNGIGPTGWTPIAKALEDAKASFN------- 143 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 K V +TDGE + + ++ + +R +K+ + Sbjct: 144 --GVHKLGSKSVYLLTDGEETCGG------DPIKTAKELRKQHIKVNVIGFDFNEGFNGQ 195 Query: 357 LRKCTD-SSGQFFAVNDSRELLESFDKITDKIQE 389 L G+++ + +++ F + E Sbjct: 196 LHAIAGAGGGKYYEAHSQKDMNRIFKLEASSLGE 229 >gi|194334883|ref|YP_002016743.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] gi|194312701|gb|ACF47096.1| von Willebrand factor type A [Prosthecochloris aestuarii DSM 271] Length = 327 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 36/200 (18%), Positives = 72/200 (36%), Gaps = 37/200 (18%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 ++D A + RIG + + L +++ + R Sbjct: 116 GKSRLDAAKTVALQFIE---------NRRRDRIGLVLFKGKSFTQCPLTLDHDVLSMLVR 166 Query: 266 LNKLN--PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++ P T T A+ A L S ++ +I +TDGE++ Sbjct: 167 AASVDAVPESGTATGSAILIAVNRL----------RASESPERVLILLTDGEHNAG---- 212 Query: 324 NTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTD--------SSGQFFAVNDS 373 ++ + G++IY +V+V G+D+L D + G+ F ND+ Sbjct: 213 -EVDPVTAAGIAAGEGVRIYMATVSVPGSRSGEDMLASARDLSGEVSRITGGRSFRANDA 271 Query: 374 RELLESFDKITDKIQEQSVR 393 L +F +I D++++ Sbjct: 272 NSLNRTFSEI-DQLEKSRFT 290 >gi|224052500|ref|XP_002194907.1| PREDICTED: similar to anthrax toxin receptor 1 [Taeniopygia guttata] Length = 537 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 61/187 (32%), Gaps = 18/187 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + + + ++ ++ + +R+ I ++ L+ N ++ Sbjct: 29 YFVLDKSGSVKNHWTEIYSFVESLAEKFISPMLRMSFIVFSSRGT--TIMKLTENREAIR 86 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 L+ L P +T + A ++Y H T G R +I +TDGE Sbjct: 87 RGLDTLKEELPGGDTFMHEGFKRANEQIY------HETYGGVRTASVIIALTDGELQ--- 137 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN-DSRELLES 379 Q R+ G +Y V V + L DS F V L + Sbjct: 138 -DAQFYYAEQEANRARSFGAIVYCVGV--KDFNETQLSTIADSIDHVFPVKGGFYALRGT 194 Query: 380 FDKITDK 386 D I K Sbjct: 195 IDSILKK 201 >gi|91789735|ref|YP_550687.1| von Willebrand factor, type A [Polaromonas sp. JS666] gi|91698960|gb|ABE45789.1| von Willebrand factor, type A [Polaromonas sp. JS666] Length = 346 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 23/245 (9%), Positives = 74/245 (30%), Gaps = 57/245 (23%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 + S A ++ +A + + + V++G +A+ Sbjct: 83 SNQQTIMLAIDVSGSMRATDVQPSRLVAAQTAAKAFLTELPRT--------VKVGLVAFA 134 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN-------------- 290 Q + N ++ S +++ T + + L+ Sbjct: 135 GSAQVAQIP--TVNREDLVSAIDRFQLQRGTAIGNGIVMSLATLFPDAGIDLQSMQSGRE 192 Query: 291 ------------EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 E + + +I +TDG+ + +++L + + Sbjct: 193 RQRGFAIDQEKKEAKEFTPVAPGSYNSAAIILLTDGQRTTG------VDSLDAAKLAADR 246 Query: 339 GMKIYSVAVSAPPE--------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKI 383 G+++Y+V + ++ L+ + ++F + +L + ++ + Sbjct: 247 GVRVYTVGIGTVDGETIGFEGWSMRVRLDEETLKGIARATQAEYFYAGTATDLKKVYETL 306 Query: 384 TDKIQ 388 + ++ Sbjct: 307 SSRLT 311 >gi|83717579|ref|YP_440458.1| hypothetical protein BTH_II2270 [Burkholderia thailandensis E264] gi|167579118|ref|ZP_02371992.1| hypothetical protein BthaT_13315 [Burkholderia thailandensis TXDOH] gi|257141105|ref|ZP_05589367.1| hypothetical protein BthaA_18159 [Burkholderia thailandensis E264] gi|83651404|gb|ABC35468.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 418 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 31/322 (9%), Positives = 77/322 (23%), Gaps = 29/322 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ D+ L+ + + P Sbjct: 26 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADSCALAAARDLTGAINLSVPEAAGITA 85 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + ++ Q + +P + N F+ Sbjct: 86 GHLNYALFEQFPVQLQTNASVTFTDSLSNPFQPKSAITSPSSIKYVKCMTSQTGIVNWFI 145 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMV----------LDVSRSMEDLYLQKHND 170 + L ++ S ++ A + C + + + + Sbjct: 146 QALDMVPGVTVANASVSATAIATIGAAQTTCAIPVFICKAGTQTNPPVAGATYNIGDWLS 205 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR------------KIDVLIESAG 218 + + + N+ S + Sbjct: 206 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGNKAATTNAYN 265 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYN---IGIVGNQCTPLSNNLNEVKSR----LNKLNP 271 ++ + AY+ N + S + +N Sbjct: 266 TRFGIYANPYKDPSYGTPDFTGYAYDATTWPSQSNAYADFVSKRQTFTSYQGDLITGINT 325 Query: 272 YENTNTYPAMHHAYRELYNEKE 293 N A R L E Sbjct: 326 GGTYNPNYYAAGADRRLALAPE 347 >gi|226943994|ref|YP_002799067.1| von Willebrand factor, type A (VWA) domain-containing protein [Azotobacter vinelandii DJ] gi|226718921|gb|ACO78092.1| von Willebrand factor, type A (VWA) domain protein [Azotobacter vinelandii DJ] Length = 335 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 52/164 (31%), Gaps = 45/164 (27%) Query: 254 PLSNNLNEVKS----RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + V++ + ++T A+ + L + +I Sbjct: 150 PLTFDRRTVRTWLEEAAIGI-AGKDTAIGDAIGLGLKRL----------RQRPAQSRVLI 198 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------G 353 +TDG N+ + G++I+++ + A P Sbjct: 199 LVTDGANTAG-----EIAPSVAARLAAAEGVRIHTIGIGADPRQDGPPGLLGLTPGLDLD 253 Query: 354 QDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + LR ++ G +F S EL I+E R+ P Sbjct: 254 EPTLRAIAEETGGSYFRARSSEEL--------RAIEETLARLEP 289 >gi|326789198|ref|YP_004307019.1| hypothetical protein Clole_0061 [Clostridium lentocellum DSM 5427] gi|326539962|gb|ADZ81821.1| Protein of unknown function DUF3520 [Clostridium lentocellum DSM 5427] Length = 670 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 37/336 (11%), Positives = 89/336 (26%), Gaps = 29/336 (8%) Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES-KAQYEIPTENLFLKG 122 + + + + + I + + A Y + + Sbjct: 45 AESATADIEETAPIVFNSESYNSFTENPFIKTTDETFSTFSIDVDIASYSNVRRFITNQE 104 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 L P E + +++ N+++ + Sbjct: 105 LPPVDAIRTEELINYFNYSYPEPTDNIPFSL------SQEMMPCPWNESSQLLLIGLQGK 158 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 N K+ +L +S L ++++++ I + Sbjct: 159 HLQPDEVPPSNLVFLLDVSGSMSDTNKLPLLKKSFNILTSNLKESD--------CISIVV 210 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 Y N+ + + L L +T + AY N Sbjct: 211 YAGASGVVLDGVAGNDESLINEALESLEAGGSTAGAEGIAMAYELAEKHFIKDGNNR--- 267 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 VI TDG+ + + + ++I E R G+ + + + D + D Sbjct: 268 -----VILATDGDFNVG--PNSESDLIRIIEKKREKGIFLSVLGLGMGNYKDDKMESLAD 320 Query: 363 SS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 G + ++ L E+ + +++ IA + Sbjct: 321 HGNGNYAYIDS---LQEAKKVLGEQLTGTLFTIAKD 353 >gi|269926840|ref|YP_003323463.1| von Willebrand factor type A [Thermobaculum terrenum ATCC BAA-798] gi|269790500|gb|ACZ42641.1| von Willebrand factor type A [Thermobaculum terrenum ATCC BAA-798] Length = 918 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 50/141 (35%), Gaps = 20/141 (14%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 +A++ + + + ++ + TN Y + A L K + Sbjct: 460 VVAFDTAPRWVVRPEPVTDKSSIAEKVAGIQGSGGTNIYGGLAEAIDSLIKVKAKN---- 515 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 K VI +TDG ++ + ++ R G+ I +V SA LLR Sbjct: 516 ------KHVILLTDGWSNVGNYD-------ELISKARRHGITISTV--SAAGGSAQLLRS 560 Query: 360 CTD-SSGQFFAVNDSRELLES 379 + G F+ DS ++ + Sbjct: 561 IAEKGGGTFYNTRDSADIPQI 581 >gi|86741605|ref|YP_482005.1| von Willebrand factor, type A [Frankia sp. CcI3] gi|86568467|gb|ABD12276.1| von Willebrand factor, type A [Frankia sp. CcI3] Length = 534 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 59/185 (31%), Gaps = 19/185 (10%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL--------SNNL 259 +I L + L + R N S +L Sbjct: 356 SRIAALQAALRGLTGADDTLSGRFARFRGREKITMITFAGRANDPVDFAVNDPRPGSADL 415 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 V + ++ L + T Y A+ YR E+ + S ++ +TDGEN+ Sbjct: 416 AGVNTFVDGLRLQDGTAIYSALEAGYRAAGAAVEADPGYLTS------IVLMTDGENNSG 469 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC-TDSSGQFFAVNDSRELLE 378 + + ++ Q A ++ +++A LR D+ G F L + Sbjct: 470 ISAADFRSSYQRLPAAARA-VRTFTIAFG--EADPAALRDISADTGGAVFDART-SSLAD 525 Query: 379 SFDKI 383 +F I Sbjct: 526 AFKDI 530 >gi|260797332|ref|XP_002593657.1| hypothetical protein BRAFLDRAFT_131951 [Branchiostoma floridae] gi|229278884|gb|EEN49668.1| hypothetical protein BRAFLDRAFT_131951 [Branchiostoma floridae] Length = 949 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 21/161 (13%), Positives = 53/161 (32%), Gaps = 21/161 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + R+G + Y+ + + + +N + T T A+ A Sbjct: 59 DIGTDLTRVGVVQYSDTPTMEFNLGVHADKGSTIAAVNNIQYQNGGTATGAALEFA--RA 116 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + K +I +TDG+ + + + + G+ +Y++ V Sbjct: 117 NANWRGAP-------VPKVMIVVTDGK--------SGDDVTAAAQALAGEGVAVYAIGVG 161 Query: 349 APPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQ 388 L++ ++ + D L + D+I ++ Sbjct: 162 --NYDLPELQQIANGNNNNVIELQDYNALTAAIDQIAGQVC 200 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 50/151 (33%), Gaps = 20/151 (13%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNT 298 I Y+ + V + ++ + T T A+ + + + Sbjct: 723 VIQYSSTVQEEFSLNAHFTKTAVLNAIDNIVYMGGGTLTGAAI----TYMKDNSQWRPG- 777 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 + K I +TDG++S ++ + G+ ++++ V A Q L Sbjct: 778 -----VAKIAIVVTDGKSSDDVGPPSS--------AAQQTGITMHAIGVGANV-DQTELS 823 Query: 359 KCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + +S V D L ++T + + Sbjct: 824 QIASTSQYVTTVADYDALDAQMAQLTASVCD 854 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 51/157 (32%), Gaps = 18/157 (11%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 N + +IG I Y+ + V + ++ + A + + Sbjct: 362 NCNKQIGVIQYSSTVQEEFSLNAHFTKTAVLNAIDNIVYMGG---GTLTGTAITYMKDNS 418 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + K I +TDG++S A ++ G+ ++++ V A Sbjct: 419 QWRP------NVAKIAIVVTDGKSSDDVAAPSSAAQQA--------GITMHAIGVGANV- 463 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 Q L + +S V D L ++T + Sbjct: 464 DQTELSQIASTSQYVTNVADYDALDAQMAQLTASVCN 500 >gi|262184150|ref|ZP_06043571.1| putative secreted protein [Corynebacterium aurimucosum ATCC 700975] Length = 500 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 41/368 (11%), Positives = 101/368 (27%), Gaps = 36/368 (9%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 +A+ + + K + K + L GS Q + Sbjct: 146 SALSAATTAFADTGRALTEKDIKQSAGKVQKLFGNQTLTSGSSGWLADRFREHPEQADAI 205 Query: 95 KD---------KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 + + L T +++ + Sbjct: 206 FNYESVLYQLKDEGADLEVVIPSDGVISADYPLSSLASSSDKDTEAKVQALAEWLAERPD 265 Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 S + +D S ++ + N + L + NT Sbjct: 266 KLTSFHLRVDNSDLPGTVFELPY--PANEQTVDALEAAFAHELRNPGNTALVLDTSGSME 323 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKK-----NLSVRIGTIAYNIGIVGNQCTPLSNNL- 259 ++D+L S L++ + + + +I I Y+ + + Sbjct: 324 GE-RMDLLKSSLLPLIDGSADGVPDGEGQVAFRNREQIKLIPYSSEPQQPTRARVDKDKP 382 Query: 260 ---NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 E+ R+ +L +T T+ A+ +A+ E+ V+ +TDGE Sbjct: 383 ATTKELADRVERLVADGDTATFEAVLNAFDEVDTSGGDIGT----------VVLMTDGEV 432 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + + + + + ++ + + + + G+ F + + Sbjct: 433 TRGRTFAQFKDAYAQLPEDKKE-IPVFVILYG--EANIQEMEELAQLTGGKTFDALN-GD 488 Query: 376 LLESFDKI 383 L +F++I Sbjct: 489 LAAAFEEI 496 >gi|87200512|ref|YP_497769.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] gi|87136193|gb|ABD26935.1| hypothetical protein Saro_2499 [Novosphingobium aromaticivorans DSM 12444] Length = 631 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 35/309 (11%), Positives = 86/309 (27%), Gaps = 43/309 (13%) Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 T + N+ + S + + + +VS + + ++ + Sbjct: 335 YTTVTSQVFSNFAYKQINVDVSSYKKFQTVTVQNGTNGA---NVSYTWKGCIEERDTEAA 391 Query: 173 NMTSNKYL--LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + S + + P S A AP ++ + Sbjct: 392 SSFSYSTVDGMSPSTALDLDIDRVPDSDPATKWAPMWPELGYYRTA--------SSRSST 443 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + + + + + + + L+ +T M R Sbjct: 444 PVSTLETTSGSQLSAACPYKAQLLQTMSQSAFYAYADALSANGSTYHDLGMLWGLRLSSP 503 Query: 291 EKE----SSHNTIGSTRLKKFVIFITDGENSG-----------------------ASAYQ 323 + + + + +IF+TDG+ + Sbjct: 504 DGPWQAMVNETPENGGEVSRHIIFMTDGQMDTNYKVMSTYGIEWHDRRITDDGVTDQDAR 563 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 +TL +C+ + G +++ +A ++ L C +S F ++ EL +F +I Sbjct: 564 HTLRFRALCDAAKAKGFRVWVIAFASDLNDD--LSYCASASST-FPATNATELNTAFQEI 620 Query: 384 TDKIQEQSV 392 + E V Sbjct: 621 AKNVAELRV 629 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 40/272 (14%), Positives = 79/272 (29%), Gaps = 26/272 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A + V L I +D+ + RN++QSA DA L+G S+ S + Sbjct: 4 LAATCVPVLILLIGSGLDMGRLYKARNRLQSACDAGALAGRRSVSSAGYDDAAKAQA--- 60 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 N D+ T + + E+ NLF Sbjct: 61 ------------AAFFNANFNEDDLGATETNFATSSADGGSLVEGIATTDVEMVLMNLFG 108 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 +P N+ +T I + + + + + S + Sbjct: 109 VISVP---INVECSATMDIGNTDVTMVLDTTGSMSQTLSGTTTKRIDALRTAMKNFYDTV 165 Query: 181 LP---PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA-----GNLVNSIQKAIQEKK 232 + +S S + D L+++ + N++ + I Sbjct: 166 SAATTGSNARVRYSFVPYSSSVNVGQLIYDLDPDYLVDTWAIQSRTPVFNTVTEQILTGY 225 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 + V +Y+ GN + S N + + Sbjct: 226 DTPVTTTASSYSNETTGNDQSYNSTRYNSLSA 257 >gi|332993941|gb|AEF03996.1| von Willebrand factor, type A [Alteromonas sp. SN2] Length = 344 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 32/169 (18%), Positives = 56/169 (33%), Gaps = 40/169 (23%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS----RLNKLNPYENTNTYPAMHHAYRELYNEK 292 R+G I + PL+ + + V + + L E T A+ A + Sbjct: 133 RLGLILFADTAYLQA--PLTYDRDTVSTLLSESVIGLV-GEQTAIGDAIGLAVKRFDE-- 187 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-- 350 +I +TDG+N+ + Q E N +K+Y++ V A Sbjct: 188 --------KEESNNVLILLTDGQNTAG-----FITPEQAKELAVNKKVKVYTIGVGADKM 234 Query: 351 ---------------PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 +D+L + GQ+F D EL + K+ Sbjct: 235 LIQSFFGSRQVNPSQELDEDMLSDLASSTGGQYFRARDVNELEAIYAKL 283 >gi|332206625|ref|XP_003252399.1| PREDICTED: collagen alpha-1(XXVIII) chain [Nomascus leucogenys] Length = 1129 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 57/162 (35%), Gaps = 20/162 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 + RIG I Y+ + ++ ++ K ++ + T T A+ A + + Sbjct: 867 TARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAANDMFEDARP 926 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +KK + ITDG+ ++ ++ + + ++I+ + V + Sbjct: 927 G---------VKKVALVITDGQTD----SRDKEKLTEVVKNASDTNVEIFVIGVVKKNDP 973 Query: 354 Q-----DLLRKCTDSSGQFFAVNDSRELLESFDK-ITDKIQE 389 + + +D L ++ + + KI E Sbjct: 974 NFEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKICE 1015 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 50/156 (32%), Gaps = 16/156 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNK 268 D + +L + I + + V S +L K ++ Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N T +Y A+ +A R L E K + +TDG + + + Sbjct: 124 MNLIGQGTFSYYAISNATRLLKREGRKDGV--------KVALLMTDGID-----HPKNPD 170 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 I E R +G+ ++ +S + LR + Sbjct: 171 VQSISEDARISGISFITIGLST-VVNEAKLRLISGD 205 >gi|154759255|ref|NP_001032852.2| collagen alpha-1(XXVIII) chain precursor [Homo sapiens] gi|167009138|sp|Q2UY09|COSA1_HUMAN RecName: Full=Collagen alpha-1(XXVIII) chain; Flags: Precursor Length = 1125 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 57/162 (35%), Gaps = 20/162 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 + RIG I Y+ + ++ ++ K ++ + T T A+ A + + Sbjct: 834 TARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAANDMFEDARP 893 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +KK + ITDG+ ++ ++ + + ++I+ + V + Sbjct: 894 G---------VKKVALVITDGQTD----SRDKEKLTEVVKNASDTNVEIFVIGVVKKNDP 940 Query: 354 Q-----DLLRKCTDSSGQFFAVNDSRELLESFDK-ITDKIQE 389 + + +D L ++ + + KI E Sbjct: 941 NFEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKICE 982 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 52/156 (33%), Gaps = 16/156 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNK 268 D + +L + I + + V S +L K ++ Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N T +Y A+ +A R L E K V+ +TDG + + + Sbjct: 124 MNLIGQGTFSYYAISNATRLLKREGRKDGV--------KVVLLMTDGID-----HPKNPD 170 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 I E R +G+ ++A+S + LR + Sbjct: 171 VQSISEDARISGISFITIALST-VVNEAKLRLISGD 205 >gi|119613999|gb|EAW93593.1| hCG19532, isoform CRA_a [Homo sapiens] Length = 302 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 57/162 (35%), Gaps = 20/162 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 + RIG I Y+ + ++ ++ K ++ + T T A+ A + + Sbjct: 11 TARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAANDMFEDARP 70 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +KK + ITDG+ ++ ++ + + ++I+ + V + Sbjct: 71 G---------VKKVALVITDGQTD----SRDKEKLTEVVKNASDTNVEIFVIGVVKKNDP 117 Query: 354 Q-----DLLRKCTDSSGQFFAVNDSRELLESFDK-ITDKIQE 389 + + +D L ++ + + KI E Sbjct: 118 NFEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKICE 159 >gi|119614000|gb|EAW93594.1| hCG19532, isoform CRA_b [Homo sapiens] Length = 304 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 57/162 (35%), Gaps = 20/162 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 + RIG I Y+ + ++ ++ K ++ + T T A+ A + + Sbjct: 11 TARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAANDMFEDARP 70 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +KK + ITDG+ ++ ++ + + ++I+ + V + Sbjct: 71 G---------VKKVALVITDGQTD----SRDKEKLTEVVKNASDTNVEIFVIGVVKKNDP 117 Query: 354 Q-----DLLRKCTDSSGQFFAVNDSRELLESFDK-ITDKIQE 389 + + +D L ++ + + KI E Sbjct: 118 NFEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKICE 159 >gi|114612128|ref|XP_518969.2| PREDICTED: collagen alpha-1(XXVIII) chain [Pan troglodytes] Length = 1125 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 57/162 (35%), Gaps = 20/162 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 + RIG I Y+ + ++ ++ K ++ + T T A+ A + + Sbjct: 834 TARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAANDMFEDARP 893 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +KK + ITDG+ ++ ++ + + ++I+ + V + Sbjct: 894 G---------VKKVALVITDGQTD----SRDKEKLTEVVKNASDTNVEIFVIGVVKKNDP 940 Query: 354 Q-----DLLRKCTDSSGQFFAVNDSRELLESFDK-ITDKIQE 389 + + +D L ++ + + KI E Sbjct: 941 NFEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKICE 982 Score = 44.9 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 51/156 (32%), Gaps = 16/156 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNK 268 D + +L + I + + V S +L K ++ Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N T +Y A+ +A R L E K V+ +TDG + + + Sbjct: 124 MNLIGQGTFSYYAISNATRLLKREGRKDGV--------KVVLLMTDGID-----HPKNPD 170 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 I E R +G+ ++ +S + LR + Sbjct: 171 VQSISEDARISGISFITIGLST-VVNEAKLRLISGD 205 >gi|115486675|ref|NP_001068481.1| Os11g0687100 [Oryza sativa Japonica Group] gi|77552567|gb|ABA95364.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113645703|dbj|BAF28844.1| Os11g0687100 [Oryza sativa Japonica Group] Length = 633 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 67/197 (34%), Gaps = 25/197 (12%) Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 S N + +P ++DVL S ++ + R+ +A+N G Sbjct: 77 DVSGSMNDPVAAASPKSNLQGSRLDVLKASMKFVIRKLADGD--------RLSIVAFNDG 128 Query: 247 IVGNQCTPL----SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 V + L + + ++++L T PA+ A + L S Sbjct: 129 PVKEYSSGLLDVSGDGRSIAGKKIDRLQARGGTALMPALEEAVKILDE------RQGSSR 182 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 F++ +TDG+++ + + +++ + A + + LL Sbjct: 183 NHVGFILLLTDGDDTTGFRWTRDAIHGAV------FKYPVHTFGLGASHDPEALLHIAQG 236 Query: 363 SSGQFFAVNDSRELLES 379 S G + V+D L Sbjct: 237 SRGTYSFVDD-DNLANI 252 >gi|83423290|emb|CAI67595.1| collagen, type XXVIII [Homo sapiens] gi|223462744|gb|AAI36893.1| Collagen, type XXVIII, alpha 1 [Homo sapiens] Length = 1125 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 57/162 (35%), Gaps = 20/162 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 + RIG I Y+ + ++ ++ K ++ + T T A+ A + + Sbjct: 834 TARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAANDMFEDARP 893 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +KK + ITDG+ ++ ++ + + ++I+ + V + Sbjct: 894 G---------VKKVALVITDGQTD----SRDKEKLTEVVKNASDTNVEIFVIGVVKKNDP 940 Query: 354 Q-----DLLRKCTDSSGQFFAVNDSRELLESFDK-ITDKIQE 389 + + +D L ++ + + KI E Sbjct: 941 NFEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKICE 982 Score = 44.9 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 51/156 (32%), Gaps = 16/156 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNK 268 D + +L + I + + V S +L K ++ Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N T +Y A+ +A R L E K V+ +TDG + + + Sbjct: 124 MNLIGQGTFSYYAISNATRLLKREGRKDGV--------KVVLLMTDGID-----HPKNPD 170 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 I E R +G+ ++ +S + LR + Sbjct: 171 VQSISEDARISGISFITIGLST-VVNEAKLRLISGD 205 >gi|51095063|gb|EAL24307.1| similar to matrilin 2 precursor [Homo sapiens] Length = 448 Score = 66.9 bits (161), Expect = 5e-09, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 57/162 (35%), Gaps = 20/162 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 + RIG I Y+ + ++ ++ K ++ + T T A+ A + + Sbjct: 157 TARIGIINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAANDMFEDARP 216 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +KK + ITDG+ ++ ++ + + ++I+ + V + Sbjct: 217 G---------VKKVALVITDGQTD----SRDKEKLTEVVKNASDTNVEIFVIGVVKKNDP 263 Query: 354 Q-----DLLRKCTDSSGQFFAVNDSRELLESFDK-ITDKIQE 389 + + +D L ++ + + KI E Sbjct: 264 NFEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKICE 305 >gi|302796876|ref|XP_002980199.1| hypothetical protein SELMODRAFT_444452 [Selaginella moellendorffii] gi|300151815|gb|EFJ18459.1| hypothetical protein SELMODRAFT_444452 [Selaginella moellendorffii] Length = 550 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 15/176 (8%), Positives = 61/176 (34%), Gaps = 18/176 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP--LSNNLNEVKSRL 266 K++++ + ++ ++++ R+ ++++ + + S + Sbjct: 99 KLELVKTAMEFVIRNLRQQD--------RLAIVSFSDEPKVHLGLKRMTHDGRAAALSAV 150 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 KL T P + + L + ++ ++DG ++ + + + Sbjct: 151 EKLRSLGGTEIRPGLKAGFDLLS--------RRKNRNPVSSIMLLSDGMDNAITFKRCKV 202 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + +++ + + + +L + G F V + + +F + Sbjct: 203 LPVDSYLEDCSERVPVHTFGFGSDHDPEAMLSIAEATGGSFCYVQEESTVQHAFAQ 258 >gi|242042269|ref|XP_002468529.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor] gi|241922383|gb|EER95527.1| hypothetical protein SORBIDRAFT_01g047460 [Sorghum bicolor] Length = 698 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 18/180 (10%), Positives = 57/180 (31%), Gaps = 19/180 (10%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 KI +L + ++ ++ N + + + + + ++ Sbjct: 255 TKIALLKNAMSFVIQTL------GPNDRLSVIAFSSTARRLFPLRRMTLAGRQQALQAVS 308 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS-AYQNTL 326 L TN + + + + +I ++DG+++ + +N L Sbjct: 309 SLVASGGTNIADGLKKGAKVIED--------RRLKNPVCSIILLSDGQDTYTLPSDRNLL 360 Query: 327 NTLQICEYMR----NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++I++ + + + SSG F ++ + + F + Sbjct: 361 DYSALVPPSILPGTGHHVQIHTFGFGSDHDSAAMHAIAEISSGTFSFIDAEGSIQDGFAQ 420 >gi|291297006|ref|YP_003508404.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290471965|gb|ADD29384.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 313 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 23/217 (10%), Positives = 71/217 (32%), Gaps = 38/217 (17%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 + S A + + + A N V S+ I ++G +++ Sbjct: 82 DPTATVIVTIDISLSMRAQDIQPTRFEAAKQEAKNFVRSLPDGI--------KVGLVSFA 133 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 + + V ++ L T + + R + ++ + Sbjct: 134 GYATLEAEP--TTDHQRVIDQIELLQMARRTAIGDGLLESLRAIPKDENGKPLGPST--- 188 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------ 352 V+ ++DG + ++ +++ + R+ G+ ++++ + Sbjct: 189 ---VVLLSDGRTNSG------VDPMEVAPFARDMGVVVHTIGLGRRSNPGDPDQYWGGYW 239 Query: 353 ---GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 ++ LR + + GQ++A + L +++ + Sbjct: 240 MQFDEETLRAIAEATGGQYYAAGSAEALRQAYRNLGR 276 >gi|326670350|ref|XP_001332841.4| PREDICTED: anthrax toxin receptor 1-like [Danio rerio] Length = 609 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 36/187 (19%), Positives = 64/187 (34%), Gaps = 18/187 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + + N ++ Q+ + +R+ I ++ G L+ + +++ Sbjct: 43 YFVLDKSGSVQHHWNEIYNFVEHLAQKFISPQLRMSFIVFSD--QGKILMQLTEDREQIR 100 Query: 264 SRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 L +L P +T + + A ++Y + T +I +TDGE Sbjct: 101 KGLKELQDVRPGGDTFMHEGIQRASEQIYYGNTEGYRT------ASVIIALTDGELHENH 154 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND-SRELLES 379 + R+ G +Y V V + L K DS F VND L Sbjct: 155 F----YYAEREANRSRSLGASVYCVGV--KDFNETQLAKIADSKDHVFPVNDGFEALQGV 208 Query: 380 FDKITDK 386 D I K Sbjct: 209 IDSILKK 215 >gi|94732992|emb|CAK03801.1| novel protein similar to vertebrate calcium channel, voltage-dependent, alpha 2/delta subunit 2 (CACNA2D2) [Danio rerio] Length = 1056 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 42/360 (11%), Positives = 103/360 (28%), Gaps = 30/360 (8%) Query: 15 YAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQ 74 A D+ ++ + + AL+ L+ A + K++ K + Sbjct: 48 VATDIERLLAKKRK---ALER--LASEAERLQKEHRWQDGIKEENIEYYNSKAEMDY--- 99 Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 D +++ D + Q A +IPT+ +I + L Sbjct: 100 ----DGEDIDSQMSLKLDFVYDPSFKNQVNYSHTA-VQIPTDIYKGAPVILNELNWTQAL 154 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 IE S ++ ++ + + + P + S Sbjct: 155 ERVFIENSRDDPSLLWQAFGSATGVTRYYPAAPWRAPDKIDLYDVRRRPWYIQGASSPKD 214 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCT 253 + + + + ++ S +++++ + + + + Sbjct: 215 MVILVDVSGSVSGLTLKLIKASVTEMLDTLSDDDYVNVARFNEKAEAVVPCFDHLVQANV 274 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 N K + ++ T+ H A+ +L N+ K ++ TD Sbjct: 275 R---NKKIFKEAVQQMQAKGTTDYKSGFHFAFNQLLNKTNVPRANCN-----KIIMLFTD 326 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVND 372 G A N N +++++ +V L+ S+ G +F + Sbjct: 327 GGEDRAQDIFEQYNWP-------NKTVRVFTFSVGQHNYDVTPLQWIACSNKGYYFEIRS 379 >gi|242042271|ref|XP_002468530.1| hypothetical protein SORBIDRAFT_01g047470 [Sorghum bicolor] gi|241922384|gb|EER95528.1| hypothetical protein SORBIDRAFT_01g047470 [Sorghum bicolor] Length = 686 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 19/226 (8%), Positives = 55/226 (24%), Gaps = 31/226 (13%) Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + K+ ++ ++ G ++ S+ Sbjct: 145 KAPGVIVNEAAAGDRDAPRAPLDLVTVLDVSGSMRW-DKLALVKQAMGFVIGSLGPHD-- 201 Query: 231 KKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 R+ ++++ G + + L TN + A + L Sbjct: 202 ------RLSVVSFSSGARRVTRLLRMSHTGKSLATEAVESLRAGGGTNIAEGLRTAAKVL 255 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM------------R 336 VI ++DG ++ + + E + Sbjct: 256 --------GERRHRNAVSSVILLSDGHDNYSMPRRARGGVPPNYEVLVPPSFVPGTASTG 307 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 I++ + + + G F + + + ++F + Sbjct: 308 EGSAPIHTFGFGNDHDAAAMHVVAEATGGTFSFIENEAVIQDAFAQ 353 >gi|221111402|ref|XP_002161005.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 1100 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 26/260 (10%), Positives = 76/260 (29%), Gaps = 24/260 (9%) Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 + I + +S +IS ++ S +++ Sbjct: 1 MTSGIDKTIIGISPSGAITFISQLYKGSISDKEIVVKSGLFNKELWNENDS----VLADK 56 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 + S + D+L + S+ V Sbjct: 57 KNEVCNEGIVDIGFIMDSS-GSLGKNYKNEKDLLKT-----LASLFSIKPNGSQAGV--- 107 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNEKESSHNT 298 I ++ + ++ + ++++ +T + A +E++ + Sbjct: 108 -ITFSFYTEHSIKLNQFSDQDSFNDAVDRIPLMGHTTRIDKGLRLAQKEMFKVENGGR-- 164 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 + K ++ +TDG + ++ I + +R G+ I ++ + +L++ Sbjct: 165 ---PGVSKLLVLLTDGSQTQGKG---VIDPAIIADEIRKQGVPIIAIGIGKEINKNELIK 218 Query: 359 KCTDSSGQFFAVNDSRELLE 378 ++ +D +L E Sbjct: 219 -IGGGEANTYSADDFEKLKE 237 >gi|312883317|ref|ZP_07743043.1| putative Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] gi|309368933|gb|EFP96459.1| putative Flp pilus assembly protein TadG [Vibrio caribbenthicus ATCC BAA-2122] Length = 432 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 53/444 (11%), Positives = 121/444 (27%), Gaps = 97/444 (21%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + +I+ V + ++ + I+ ++++ A + L+ A + + Sbjct: 19 IFILILPVLICVMALSLQASQILLAQSKITEASEVTSLALSAL--------SEERAQQKL 70 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S+ + +K +L ++ A + Q Q ++ E Sbjct: 71 SSYATRVLKHYLVGTDDVKGQATMQSSTFQF------------------QTDLVGEATHE 112 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + S R + + + + D+S SM + + + + Sbjct: 113 FWFKHKPQADTFKVSGASTSRKHKPQPMDVYFITDLSESM--NRSEPSRLTIVKDAIRQV 170 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRK-----IDVLI-ESAGNLVNSIQKAIQEKKNL 234 + PK S + ++ K I +K+I + Sbjct: 171 VSKLPKGSRAAFIGYNTENVKLTGRYFDKRTGREITSKKPTELQGPNIWAEKSIYDYLTG 230 Query: 235 SVRIGTIAYNIGI--------------------------------VGNQCTPLSNNLNEV 262 I I L+ +L Sbjct: 231 RHPDFVIENLFDIALSKKKKSSIDKLLEGYGPPDLDQKIRDFETKYPFYDISLTTDLGSF 290 Query: 263 KSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 K + N +N NT+++ + A RE + ++ + +TDGE S Sbjct: 291 KETIKSNAINANGNTHSWNGIIAAAREAH------RQPSSVFNPQQVFVLLTDGE---DS 341 Query: 321 AYQNTLNTLQICEYMRNA-------------------GMKIYSVAVSAPPEGQDLLRKCT 361 +CE +RN + + + V P + + +C Sbjct: 342 KKFPKGYYAPLCEKIRNDISDKQNRSQIQNASVEEKTKVTMSVIGVEFNPYKNEGVTEC- 400 Query: 362 DSSGQFFAVNDSRELLESFDKITD 385 F EL++ ++ + Sbjct: 401 FGRENIFEAKREDELVKKILQLFE 424 >gi|115377250|ref|ZP_01464460.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115365726|gb|EAU64751.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 520 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 31/311 (9%), Positives = 79/311 (25%), Gaps = 39/311 (12%) Query: 77 YIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST 136 + I+ ++ + + A Y + L L + Sbjct: 39 FADMYFKHYGVNPTIDTEEENVSTFSVDVD-SASYALARAYLSRNHLPAEEAIRVEEFVN 97 Query: 137 GIIERSSENLA------ISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 + A + + + + L Sbjct: 98 AFRYDYQDPGAEPFGVQVEAFPSPNRQGYHVLHVGLQGQKVSAAERLPAHLV-------- 149 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + +++++ S LV + + + Y Sbjct: 150 -----FTIDVSGSMNMENRLELVKRSLAMLVEKLDS--------RDTLAIVVYGDTARTV 196 Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + + + +N L+P +TN + AY ++ + VI Sbjct: 197 LEPTRIMDRSRILEAINALHPEGSTNVQAGLQVAYAIAASQVREGATSR--------VIL 248 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFA 369 +DG + +++ Q + G+++ +V +L+ + + GQ+ Sbjct: 249 CSDGVANNGITQADSI--FQSVKAYAQQGVRLTTVGFGMGNYNDELMERLSHVGDGQYAY 306 Query: 370 VNDSRELLESF 380 V+ E F Sbjct: 307 VDALPEARRIF 317 >gi|110677910|ref|YP_680917.1| hypothetical protein RD1_0526 [Roseobacter denitrificans OCh 114] gi|109454026|gb|ABG30231.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 327 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 60/159 (37%), Gaps = 27/159 (16%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKE 293 R+ I + +PL+ + + L++ +T A+ + R Sbjct: 140 RMALIVFGSKAYLQ--SPLTEDTGTIVELLDQTEVGMAGPHTAIGDAIGLSIRTFEA--- 194 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 S ++ +I ++DG ++ + L E R AG++I+++AV P Sbjct: 195 -------SEIEQRLLILLSDGADTASRMS-----PLNAAEIARGAGVEIFTIAVGDPDGT 242 Query: 353 -----GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 L+ + +SG +F D L E + +I + Sbjct: 243 GENRVDVAALQDIANRTSGSYFFAADQAALDEIYARIDE 281 >gi|149197908|ref|ZP_01874957.1| BatA [Lentisphaera araneosa HTCC2155] gi|149139129|gb|EDM27533.1| BatA [Lentisphaera araneosa HTCC2155] Length = 341 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 38/291 (13%), Positives = 77/291 (26%), Gaps = 38/291 (13%) Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 I + T + L + +S + Sbjct: 31 INFSSTENMSHAKKSWRIHLLFLLPLTLHLALISLIFTLADPMTEVTKKRQDRQGIAIQV 90 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 S K ++DV + ++ Sbjct: 91 LVDVSSSMDINMKYGEERL-----TRMDVAKIVVEKFIG----GDGDELVGRPDDLIGLI 141 Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY-PAMHHAYRELYNEKESSHNTIGST 302 + PLS + S + + NT + + Sbjct: 142 TFARYADTIAPLSLAHEALISIVQDVTI--NTRPNEDGTAYGDATALAAAQLDLLQGDQD 199 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS-------------- 348 K +I +TDGEN+ + LQ + G+KIY++++ Sbjct: 200 IKSKIIILLTDGENNCGNH-----LPLQAASLAKEWGIKIYTISIQNKPTPERKKTDQGT 254 Query: 349 -----APPEGQDLLRKCTDSSGQFFA-VNDSRELLESFDKITDKIQEQSVR 393 P G +L+K +S+G F +D L + +I +K+++ ++ Sbjct: 255 FFVPPTPSAGDQVLKKMAESTGGVFRLAHDYDSLKSVYKEI-NKLEKSKLK 304 >gi|114576315|ref|XP_515709.2| PREDICTED: matrilin-3 [Pan troglodytes] Length = 486 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 58/176 (32%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + R+ + Y + + +K + ++ P Sbjct: 102 VKTFVSRIIDTL-----DIGPADTRVAVVNYASTVKIEFQLQAYTDKQSLKQAVGRITPL 156 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E + + + S+ + K I +TDG ++ Sbjct: 157 STGTMSGLAIQTAMDEAFTVEAGAR--EPSSNIPKVAIIVTDGRPQD--------QVNEV 206 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + +G+++Y+V V + L+ F V +L F + Sbjct: 207 VARAQASGIELYAVGV--DRADMESLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 260 >gi|163734461|ref|ZP_02141901.1| hypothetical protein RLO149_09454 [Roseobacter litoralis Och 149] gi|161392469|gb|EDQ16798.1| hypothetical protein RLO149_09454 [Roseobacter litoralis Och 149] Length = 327 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 60/159 (37%), Gaps = 27/159 (16%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKE 293 R+ I + +PL+ + + L++ +T A+ + R Sbjct: 140 RMALIVFGSKAYLQ--SPLTEDTGTIVELLDQTEVGMAGPHTAIGDAIGLSIRTFEA--- 194 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 S ++ +I ++DG ++ + L E R AG++I+++AV P Sbjct: 195 -------SEIEQRLLILLSDGADTASRMS-----PLNAAEIARGAGVEIFTIAVGDPDAT 242 Query: 353 -----GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 L+ + +SG +F D L E + +I + Sbjct: 243 GENRVDVAALQDIANRTSGSYFFAADQAALDEIYARIDE 281 >gi|332879552|ref|ZP_08447247.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682518|gb|EGJ55420.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 332 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 48/142 (33%), Gaps = 37/142 (26%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T + +A L S K +I +TDG N+ L E Sbjct: 171 GTAVGMGIANAVSRL----------KDSKAKSKVIILLTDGTNNAGDIS-----PLTAAE 215 Query: 334 YMRNAGMKIYSVAVSAPP--------------------EGQDLLRKCTD-SSGQFFAVND 372 ++ G+++Y++ V L + G+F+ D Sbjct: 216 IAKSFGIRVYTIGVGTNGLAPYPMPVAGGVQYLNVPVEIDTKTLAAIAGKTDGEFYRATD 275 Query: 373 SRELLESFDKITDKIQEQSVRI 394 +++L + + I DK+++ + + Sbjct: 276 NKKLEDVYKDI-DKLEKTKLNV 296 >gi|255566346|ref|XP_002524159.1| protein binding protein, putative [Ricinus communis] gi|223536577|gb|EEF38222.1| protein binding protein, putative [Ricinus communis] Length = 514 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 17/124 (13%), Positives = 42/124 (33%), Gaps = 20/124 (16%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 E+++ +N L TN + L + + S +G ++ ++DGE + Sbjct: 124 ELENLINGLKAEGATNITAGLKTGLNVLNDRRLSGGRVVG-------IMLMSDGEQNAGG 176 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS--SGQFFAVNDSRELLE 378 + +++ +L+ + G F V ++ L + Sbjct: 177 DAAQVPVG----------NVPVHTFGFGINH-EPRVLKAIAQNSVGGTFSDVQNTDNLSK 225 Query: 379 SFDK 382 +F + Sbjct: 226 AFSQ 229 >gi|15779150|gb|AAH14640.1| COL14A1 protein [Homo sapiens] Length = 759 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 57/185 (30%), Gaps = 26/185 (14%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YE 273 + V ++ K + ++ + + + + ++ Sbjct: 16 SFLYSTVGALNKIGTDGT----QVAMVQFTDDPRTEFKLNAYKTKETLLDAIKHISYKGG 71 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 NT T A+ + L+ + + + K ++ ITDG + +I Sbjct: 72 NTKTGKAIKYVRDTLFTAESGTRR-----GIPKVIVVITDGRSQDDVN--------KISR 118 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 M+ G I+++ V L F V+D ++F KI D++ Sbjct: 119 EMQLDGYSIFAIGV--ADADYSELVSIGSKPSARHVFFVDDF----DAFKKIEDELITFV 172 Query: 392 VRIAP 396 A Sbjct: 173 CETAS 177 >gi|297680998|ref|XP_002818254.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Pongo abelii] Length = 1125 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 57/162 (35%), Gaps = 20/162 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 + RIG I Y+ + ++ ++ K ++ + T T A+ A + + Sbjct: 834 TARIGVINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAANDMFEDARP 893 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +KK + ITDG+ ++ ++ + + ++I+ + V + Sbjct: 894 G---------VKKVALVITDGQTD----SRDKEKLTEVVKNASDTNVEIFVIGVVKKNDP 940 Query: 354 Q-----DLLRKCTDSSGQFFAVNDSRELLESFDK-ITDKIQE 389 + + +D L ++ + + KI E Sbjct: 941 NFEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKICE 982 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 50/156 (32%), Gaps = 16/156 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNK 268 D + +L + I + + V S +L K ++ Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N T +Y A+ +A R L E K + +TDG + + + Sbjct: 124 MNLIGQGTFSYYAISNATRLLKREGRKDGV--------KVALLMTDGID-----HPRNPD 170 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 I E R +G+ ++ +S + LR + Sbjct: 171 VQSISEDARISGISFITIGLST-VVNEAKLRLISGD 205 >gi|239833540|ref|ZP_04681868.1| Hypothetical protein OINT_2000308 [Ochrobactrum intermedium LMG 3301] gi|239821603|gb|EEQ93172.1| Hypothetical protein OINT_2000308 [Ochrobactrum intermedium LMG 3301] Length = 637 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 45/312 (14%), Positives = 81/312 (25%), Gaps = 74/312 (23%) Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 V ++N + P N + Sbjct: 326 VESRPGPFALDVTPPSDNNPDTLFVPMFGPAEYYNVDSRGNVISTVLNSWWQDDMSLAYS 385 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-----NLNEVKSRLN 267 +S +L + +K R N TPL++ + V++ + Sbjct: 386 PRQS--DLKKYYLRDSLDKIYRKGRSEGGGPNYSCTTLPLTPLTDVTTEQGMKTVQTAIK 443 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKE-SSHNTIGSTRLKKFVIFITDGENSGASAY---- 322 + P TN AM +R + + K VI +TDG N+ Sbjct: 444 AMVPNGGTNVPEAMAWGWRTIVQGAPFTEARASTERGNDKVVIVLTDGANTYYKYDGLAG 503 Query: 323 ----------------------------------------QNTLNTLQ-------ICEYM 335 NT T +C+ Sbjct: 504 SGPDRAGNLSYYSTHGYTARITKKYSQSRLFQESGVSVSQNNTTYTKALNARFAKLCDNA 563 Query: 336 RNAGMKIYSVAVSAPPEGQ------DLLRKCTDS---------SGQFFAVNDSRELLESF 380 + A + + +VA+ DLLR C+ + + F + EL E+F Sbjct: 564 KAANIIVMTVALDLNEANSTEKAQIDLLRSCSSNSRVRMEGGKPAKLFWNSTGGELSETF 623 Query: 381 DKITDKIQEQSV 392 +I D++ + Sbjct: 624 RQIGDELSNLRL 635 Score = 44.1 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 25/196 (12%), Positives = 55/196 (28%), Gaps = 20/196 (10%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHL 72 A+D A++M +RN +Q++LDAA L+ + + T+ + + Sbjct: 60 GMVAVDTANLMRVRNNVQASLDAAALAVGKRFSTGES-----------HTVVQDYGARIF 108 Query: 73 KQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN-LFLKGLIPSALTNL 131 A I ++ ++ A + + + L Sbjct: 109 YANV-----TALSADAINFQIAFPQDKTTDQQVQATAAFTYKSLFGVVASRLTGDNWDKH 163 Query: 132 S---LRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 S + L + +D +RS + + + + Sbjct: 164 QYTLTASVRLKNTIEVALVLDNSKSMDETRSGSSKKRIDLLKDAASQLVETMASQSALIT 223 Query: 189 FWSKNTTKSKYAPAPA 204 + K S A + Sbjct: 224 YVEKPVQFSLVPFAGS 239 >gi|291087243|ref|ZP_06571866.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291076088|gb|EFE13452.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 2012 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 46/337 (13%), Positives = 101/337 (29%), Gaps = 29/337 (8%) Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 ++ ++ Y + D + + T ++ QY + Sbjct: 1366 YILADGEEPSEENQYKGDEKTDDPSVPEDSQTSSGKPGFPANKKATLQYTYDGGTGRFEY 1425 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 P + +R N + + LDV+ + ++ Sbjct: 1426 PHPVLQIPEPVLPDEYNKRIEPNDDGTYSLTLDVTGIEGNPATVTTKYPVDLVFVI---- 1481 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 K + ++ + D++ ++ ++ + + I Sbjct: 1482 --DKSLSMDYDIDGNEIKWWDDETESRKDIVNDALEEII-------PDLCSQQYDIQIAG 1532 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 Y + S +V S L +T A+ A L S + Sbjct: 1533 YQFSGSSTRVLDWSREEQQVLSGLKIARTSSSTEPSQALADALDMLKT---GSPAHRNQS 1589 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC-- 360 +KK++IF+TDGE + + G IY++ VS+ +L+ Sbjct: 1590 NVKKYLIFMTDGEPTEPEDWSYNAVRNHAVP-----GASIYTIGVSSDA-STNLMEGIRS 1643 Query: 361 -TDSSGQF----FAVNDSRELLESFDKITDKIQEQSV 392 S+G + F ++ + ++F +I D+I S Sbjct: 1644 TALSNGMYAPATFKGTSAQLIRDAFTQIKDEIISTST 1680 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 47/360 (13%), Positives = 102/360 (28%), Gaps = 30/360 (8%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK-----NNPLQYIAESKAQY 111 +Q I + + Q + E + DI KA + IT N E+ + Sbjct: 922 DEQNGKIELRFGEDRQLQSGTVYELSFDIKLKAGVEITAADKIEGDENTDYRENENLSSG 981 Query: 112 EIPTENLFLKGL-IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 + T +I + N + D L Sbjct: 982 QRGVLTNEDAYFSFGEDGTTKVRFPHPVIPAPATNHPEYRKYIKDNG--NGTYTLTLDVK 1039 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKS-KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 ++ + P P + + + S + ++ + +V+ + N + Sbjct: 1040 SDVGSVTTGQKDPTPTAVMFVIDKSGSMDQSFGSGNSDARREVVNSALELFFNQLSDGDY 1099 Query: 230 E----KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 S + +N + + +L +T + A Sbjct: 1100 NIQFGGYKFSDSGERVNFNDQGWETEYWETDTSNALSHLKLTSRETDGSTYPSQTLRSAI 1159 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR--NAGMKIY 343 L N + + ++IF+TDGE S + ++ ++G Y Sbjct: 1160 SALENVELGENGKR-------YLIFLTDGEPGQNSYSFSEKEAENCYSAIKNLDSGTTFY 1212 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQF-------FAVNDSRELLESFDKITDKIQEQSVRIAP 396 ++ V A + + ++ F N + EL +F ++ +I + P Sbjct: 1213 AIQV-ANSDSHGFMESMVSNANSVDGVTAQKFVGNSADELNAAFSQMAAEISGSAGTTVP 1271 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 16/102 (15%), Positives = 34/102 (33%), Gaps = 12/102 (11%) Query: 233 NLSVRIGTIAYNIGIVGNQCTP----LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 + + R + Y+ + +K + L TN ++ A L Sbjct: 655 SPNSRFSIVTYSTDASTELGWTEYGRNGSGQQTIKKAIGDLQANGGTNYEAGLYQAVEVL 714 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 G++ VIF++DG+ + + + N L+ Sbjct: 715 KE--------RGNSSNIPVVIFLSDGKPTYYYSDVDEFNGLE 748 >gi|47229708|emb|CAG06904.1| unnamed protein product [Tetraodon nigroviridis] Length = 990 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 42/362 (11%), Positives = 103/362 (28%), Gaps = 30/362 (8%) Query: 15 YAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQ 74 A D+ +++ + + ALD L+ A + + K+ + K L Sbjct: 52 VASDIENLLAKKRR---ALDR--LANEAERLQREHLWQDGIKELDMAYY---DSKADLDY 103 Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 S E + ++ D N A +IPT+ +I + L Sbjct: 104 YSMDGEGEMENPSHIKLEFVYDPNFKNNVNYSYTA-VQIPTDIYKGAPVILNELNWTQAL 162 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 +E S E+ ++ + + + P + S Sbjct: 163 EKVFMENSQEDPSLLWQAFGSATGVTRYYPATPWKSPDKIDLYDVRRRPWYIQGASSPKD 222 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCT 253 + + + + ++ S +++++ + + + + Sbjct: 223 MVILVDVSGSVSGLTLKLIKASVMEMLDTLSDDDYVNVARFNEKAEAVVPCFKHLVQANV 282 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 N K + ++ T+ H A+ +L N+ K ++ TD Sbjct: 283 R---NKKIFKDAVQQMQAKGTTDYKSGFHFAFNQLLNKTNVPRANCN-----KIIMLFTD 334 Query: 314 GENS--GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAV 370 G Q +++++ +V L+ ++ G +F + Sbjct: 335 GGEDRAQDVFMQYNWPNKT---------VRVFTFSVGQHNYDVTPLQWIACTNKGYYFEI 385 Query: 371 ND 372 Sbjct: 386 RS 387 >gi|85712923|ref|ZP_01043963.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] gi|85693229|gb|EAQ31187.1| Uncharacterized protein containing a von Willebrand factor type A(vWA) domain [Idiomarina baltica OS145] Length = 328 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 58/160 (36%), Gaps = 36/160 (22%) Query: 247 IVGNQCTPLSNNLNEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 TP++ + N V+ L++ L T A+ A + G + Sbjct: 139 DTAFLQTPITYDRNTVQQMLDESVLGLVGERTAIGDAIALAVKRF----------KGKQQ 188 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP------------- 350 + ++ +TDG+N+ + L+ Q E + ++IY +AV A Sbjct: 189 TNRVLVLLTDGQNTAGN-----LSPEQALELAKAYDVRIYPIAVGAEEVVVDSVFGRRKV 243 Query: 351 ----PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 L++ D + G++F + EL + + + Sbjct: 244 NPSRDLDVPLMQNLADETGGEYFRARSTEELERIYQLLDE 283 >gi|194335401|ref|YP_002017195.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] gi|194307878|gb|ACF42578.1| von Willebrand factor type A [Pelodictyon phaeoclathratiforme BU-1] Length = 336 Score = 66.9 bits (161), Expect = 6e-09, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 57/161 (35%), Gaps = 37/161 (22%) Query: 254 PLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + + L++L+P + T A+ A L S L K +I Sbjct: 155 PLTLDHEVLAMLLDRLSPGVIQDDGTAIGTAILIAVNRL----------KASESLHKVLI 204 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV--AVSAPPE--------------G 353 +TDGEN+ + +G++IY + + Sbjct: 205 LVTDGENNAG-----EVGPGTAASIAARSGVRIYVINAGFKVVEDRIDPPEESGRYIQKD 259 Query: 354 QDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ L+ + G +F V D ++ I D+++++ Sbjct: 260 EESLQGIARTTGGGYFRVEDPAAFDQTIRSI-DRLEKKRFT 299 >gi|310818002|ref|YP_003950360.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309391074|gb|ADO68533.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 568 Score = 66.5 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 31/311 (9%), Positives = 79/311 (25%), Gaps = 39/311 (12%) Query: 77 YIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST 136 + I+ ++ + + A Y + L L + Sbjct: 87 FADMYFKHYGVNPTIDTEEENVSTFSVDVD-SASYALARAYLSRNHLPAEEAIRVEEFVN 145 Query: 137 GIIERSSENLA------ISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 + A + + + + L Sbjct: 146 AFRYDYQDPGAEPFGVQVEAFPSPNRQGYHVLHVGLQGQKVSAAERLPAHLV-------- 197 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + +++++ S LV + + + Y Sbjct: 198 -----FTIDVSGSMNMENRLELVKRSLAMLVEKLDS--------RDTLAIVVYGDTARTV 244 Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + + + +N L+P +TN + AY ++ + VI Sbjct: 245 LEPTRIMDRSRILEAINALHPEGSTNVQAGLQVAYAIAASQVREGATSR--------VIL 296 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFA 369 +DG + +++ Q + G+++ +V +L+ + + GQ+ Sbjct: 297 CSDGVANNGITQADSI--FQSVKAYAQQGVRLTTVGFGMGNYNDELMERLSHVGDGQYAY 354 Query: 370 VNDSRELLESF 380 V+ E F Sbjct: 355 VDALPEARRIF 365 >gi|172039857|ref|YP_001799571.1| hypothetical protein cur_0177 [Corynebacterium urealyticum DSM 7109] gi|171851161|emb|CAQ04137.1| hypothetical protein cu0177 [Corynebacterium urealyticum DSM 7109] Length = 675 Score = 66.5 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 33/201 (16%), Positives = 73/201 (36%), Gaps = 25/201 (12%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL-----SNN 258 A ++D E++ N S+ + + + + G + N Sbjct: 90 AGGQTRLDAAKEASKNFSRSVSEESELGFMVYGTKVGNSPEEREAGCKDVTTLLPVGKGN 149 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ ++K+N +T PA+ A +EL NE E + ++ ++DGE++ Sbjct: 150 AGKISGEVDKVNASGHTPMGPALKQAAKELPNEGE------------RSIVLVSDGEDT- 196 Query: 319 ASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + + + G+ I +V P + L+ + G++ D+ Sbjct: 197 ----CAPPPVCDVAKDLHKQGIDLTINTVGFLVDPAARKELQCIAEAGGGEYLDAQDAES 252 Query: 376 LLESFDKITDKIQEQSVRIAP 396 L ES + + + + AP Sbjct: 253 LAESMKVLATRTAQTAESNAP 273 >gi|114621486|ref|XP_001143977.1| PREDICTED: collagen, type XIV, alpha 1 isoform 2 [Pan troglodytes] Length = 1685 Score = 66.5 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 942 VDGSWSIGDENFNKIISFLYSTVGALHKIGTDGT----QVAMVQFTDDPRTEFKLNAYKT 997 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ + L+ + + + K ++ ITDG + Sbjct: 998 KETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRR-----GIPKVIVVITDGRSQ 1052 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L F V+D Sbjct: 1053 DDVN--------KISREMQLDGYSIFAIGV--ADADYSELVSIGSKPSARHVFFVDDF-- 1100 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1101 --DAFKKIEDELITFVCETAS 1119 >gi|114621482|ref|XP_001144037.1| PREDICTED: collagen, type XIV, alpha 1 isoform 3 [Pan troglodytes] Length = 1780 Score = 66.5 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 1037 VDGSWSIGDENFNKIISFLYSTVGALHKIGTDGT----QVAMVQFTDDPRTEFKLNAYKT 1092 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ + L+ + + + K ++ ITDG + Sbjct: 1093 KETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRR-----GIPKVIVVITDGRSQ 1147 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L F V+D Sbjct: 1148 DDVN--------KISREMQLDGYSIFAIGV--ADADYSELVSIGSKPSARHVFFVDDF-- 1195 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1196 --DAFKKIEDELITFVCETAS 1214 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 57/168 (33%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + RIG Y+ + +EV + L NT T A+++ + Sbjct: 186 TAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIF 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + T + K I ITDG++ + +R +G++++++ Sbjct: 246 ENSFK-----PEAGSRTGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V + + + +T + + Sbjct: 293 GV--KNADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 >gi|114621484|ref|XP_001143906.1| PREDICTED: collagen, type XIV, alpha 1 isoform 1 [Pan troglodytes] Length = 1800 Score = 66.5 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 1037 VDGSWSIGDENFNKIISFLYSTVGALHKIGTDGT----QVAMVQFTDDPRTEFKLNAYKT 1092 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ + L+ + + + K ++ ITDG + Sbjct: 1093 KETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRR-----GIPKVIVVITDGRSQ 1147 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L F V+D Sbjct: 1148 DDVN--------KISREMQLDGYSIFAIGV--ADADYSELVSIGSKPSARHVFFVDDF-- 1195 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1196 --DAFKKIEDELITFVCETAS 1214 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 57/168 (33%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + RIG Y+ + +EV + L NT T A+++ + Sbjct: 186 TAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIF 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + T + K I ITDG++ + +R +G++++++ Sbjct: 246 ENSFK-----PEAGSRTGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V + + + +T + + Sbjct: 293 GV--KNADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 >gi|114621480|ref|XP_519927.2| PREDICTED: collagen alpha-1(XIV) chain isoform 4 [Pan troglodytes] Length = 1796 Score = 66.5 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 63/201 (31%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ + + Sbjct: 1037 VDGSWSIGDENFNKIISFLYSTVGALHKIGTDGT----QVAMVQFTDDPRTEFKLNAYKT 1092 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ + L+ + + + K ++ ITDG + Sbjct: 1093 KETLLDAIKHISYKGGNTKTGKAIKYVRDTLFTAESGTRR-----GIPKVIVVITDGRSQ 1147 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 +I M+ G I+++ V L F V+D Sbjct: 1148 DDVN--------KISREMQLDGYSIFAIGV--ADADYSELVSIGSKPSARHVFFVDDF-- 1195 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1196 --DAFKKIEDELITFVCETAS 1214 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 57/168 (33%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + RIG Y+ + +EV + L NT T A+++ + Sbjct: 186 TAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIF 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + T + K I ITDG++ + +R +G++++++ Sbjct: 246 ENSFK-----PEAGSRTGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V + + + +T + + Sbjct: 293 GV--KNADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 >gi|114562255|ref|YP_749768.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] gi|114333548|gb|ABI70930.1| von Willebrand factor, type A [Shewanella frigidimarina NCIMB 400] Length = 612 Score = 66.5 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 37/361 (10%), Positives = 91/361 (25%), Gaps = 32/361 (8%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 G + + + + S+ ++ + + Q + + + Sbjct: 79 GSKELHRSAKVMTSSMAQRIVSSQHVSVSDRNFSLAPTTNDKFESVVQNGNMVAGETPVS 138 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + Y + L + ++ + Sbjct: 139 TFSIDVD-TGSYSTTRRLINQGQLPTKNTVRVEELVNYFSYDYPVPTNSEQPFSVNTELA 197 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + P K S + K+ +L ++ Sbjct: 198 PSPYNADTQL----LRIGLKGFDVAPDKLSASNLVLL-LDVSGSMSSADKLPLLKQAMLM 252 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 L + ++ + Y N+ +K+ L++LN TN Sbjct: 253 LSQQLSAQD--------KVSIVVYAGASGVVLDGVAGNDFTAIKTALSQLNAQGGTNGSQ 304 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 + AY+ VI TDG+ + L + G Sbjct: 305 GIQLAYQLAQKHF--------IENGSNRVILATDGDFNLGMTDHQQLVDFVASRSKK--G 354 Query: 340 MKIYSVAVSAP----PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + + ++ LL + ++ +GQ+ ++ L E+ + D++ + I Sbjct: 355 IGLSTLGFGLGSGSASYNDHLLEQLSNKANGQYAFIDT---LNEARKVLVDQLSATLLTI 411 Query: 395 A 395 A Sbjct: 412 A 412 >gi|168699403|ref|ZP_02731680.1| hypothetical protein GobsU_07777 [Gemmata obscuriglobus UQM 2246] Length = 354 Score = 66.5 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 30/209 (14%), Positives = 66/209 (31%), Gaps = 37/209 (17%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 AP+ +++ + + + V P + N + + Sbjct: 111 GAPSVSRLEAARRALKLFLAGGAAPDGTAFDPRPGDAVGLVAFAAVPETVCPATLNHSVL 170 Query: 263 KSRLNKLNPYEN----TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS- 317 + L P TN ++ A L + S + +I ++DGE++ Sbjct: 171 FKVADALQPKGGADAGTNIGDSLAEAVIRLDAADQKS----------RVLILLSDGEHNI 220 Query: 318 --------GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------------EGQDL 356 TL + + N G+++Y++ P G+ Sbjct: 221 LKEDVRDAQRPGIDRTLKPREAAQLAANLGVRVYTIDAGGDPPLGAPPDAVAQRFAGRKA 280 Query: 357 LRKCTD-SSGQFFAVNDSRELLESFDKIT 384 L+ + + G+ F ELL ++ +I+ Sbjct: 281 LKDVAEMTGGKSFQATSGAELLSAYREIS 309 >gi|16331837|ref|NP_442565.1| hypothetical protein sll0103 [Synechocystis sp. PCC 6803] gi|2496792|sp|Q55874|Y103_SYNY3 RecName: Full=Uncharacterized protein sll0103 gi|1208467|dbj|BAA10635.1| sll0103 [Synechocystis sp. PCC 6803] Length = 420 Score = 66.5 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 21/147 (14%), Positives = 50/147 (34%), Gaps = 22/147 (14%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N + + +L T + +E KE + +TDGE Sbjct: 99 NGAAIAKAIERLKAEGGTAIDEGLKLGIQEAAKGKEDR---------VSHIFLLTDGE-- 147 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 + + + L++ + + ++++ QD+L S+ G + + E Sbjct: 148 --NEHGDNDRCLKLGTVASDYKLTVHTLGFG-DHWNQDVLEAIAASAQGSLSYIENPSEA 204 Query: 377 LESFDKITDKIQE-------QSVRIAP 396 L +F ++ ++ + +AP Sbjct: 205 LHTFRQLFQRMSNVGLTNAHLLLELAP 231 >gi|212634559|ref|YP_002311084.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556043|gb|ACJ28497.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 328 Score = 66.5 bits (160), Expect = 6e-09, Method: Composition-based stats. Identities = 31/197 (15%), Positives = 64/197 (32%), Gaps = 36/197 (18%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + R+ID + + + S R+G IA+ P + + Sbjct: 110 FVDLKGDKTRRIDGVKSLLLDFLAQ---------RASDRVGLIAFGDAAYLQA--PFTED 158 Query: 259 LNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + L +++ T A+ A + + K ++ +TDG Sbjct: 159 KGALSLLLKEMDVRMAGAGTALGDAIGVAVNHFSHSDTDN----------KVLLLLTDGN 208 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFF 368 ++ + L+ Y G+ IY +A+ P ++L++ D + GQ F Sbjct: 209 DTSSEFP-----PLEAARYAAQQGIVIYPIAIGDPANVGEDSLDIEMLQQIADITYGQVF 263 Query: 369 AVNDSRELLESFDKITD 385 D + + I Sbjct: 264 EAQDGEAFTQVYSIIET 280 >gi|307591433|ref|YP_003900232.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986287|gb|ADN18166.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 491 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 34/185 (18%), Positives = 66/185 (35%), Gaps = 19/185 (10%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ + +A + V + G TPL++++N +++ + Sbjct: 67 NKLSEVKTAATSFVQR---------QDLITNRIAVMGFGSGVQLGTPLTSDVNVLQTAIA 117 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L T A+ A +L+N S + I S + ++ TDG + N Sbjct: 118 NLYDGGGTMMDQALTAATDQLHNASASLESAIPSGE-NQHILLFTDGV------AADPYN 170 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 TL + +NA + I VAV+ + L + T F ++ +F I Sbjct: 171 TLVAGQTAQNAQINI--VAVATGDADTNFLSQLTGDPNLVFYA-NTGNFDAAFQAAEKAI 227 Query: 388 QEQSV 392 + + Sbjct: 228 YSKQL 232 >gi|111221591|ref|YP_712385.1| hypothetical protein FRAAL2157 [Frankia alni ACN14a] gi|111149123|emb|CAJ60806.1| conserved hypothetical protein; putative membrane protein [Frankia alni ACN14a] Length = 319 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 25/220 (11%), Positives = 67/220 (30%), Gaps = 31/220 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 +++ S A A +++ + A V+ + + +G +++ Sbjct: 85 ERATIILAIDVSNSMAATDIAPNRLEAAKQGAEAFVDQLP--------PRINLGLVSFAG 136 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P S + V++ + L T + + + + + + S Sbjct: 137 SATV--LVPASTDRESVRAGIRGLQLGPATAIGEGIFASLQAINTAGKRFSDAGQSP-PP 193 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 ++ ++DGE + Q + R A + + ++A Sbjct: 194 AAIVLLSDGETTRGRP------NTQATDAARQAHVPVDTIAYGTSDGTLDVGGQEVPVPV 247 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQS 391 + L + D + G + EL + + I ++ Sbjct: 248 NEQALNEIADQTEGSYHRAATGDELRSVYKGLGSSIGYRT 287 >gi|189347765|ref|YP_001944294.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341912|gb|ACD91315.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 325 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 73/206 (35%), Gaps = 37/206 (17%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 A ++D + A V + + RIG + + +PL+ + Sbjct: 116 AGGRSRLDAVKSVAREFVTR---------HSNDRIGVVVFKGKGY--TLSPLTLDHRVTG 164 Query: 264 SRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++ ++P E T A+ A L S +K +I +DG ++ Sbjct: 165 MLIDNVSPDVIRDEGTAVGTAVLIAVNRL----------RASQSDQKVIILFSDGVSNAG 214 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-----GQDLLRKCTDS-SGQFFAVNDS 373 ++ + + G++IY+ + + LR+ + G++F S Sbjct: 215 -----EIDPVTAASFAAAQGIRIYTAGAGSASSASSALDEGELRRVALTAGGRYFRAGTS 269 Query: 374 RELLESFDKITD-KIQEQSVRIAPNR 398 L E+F+ I + E + + N+ Sbjct: 270 ASLAEAFESIDRLEKSELTSPVTSNK 295 >gi|326529585|dbj|BAK04739.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 742 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 20/195 (10%), Positives = 57/195 (29%), Gaps = 36/195 (18%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSR 265 K+ +L + G ++ ++ + R+ IA++ + + Sbjct: 300 TKLALLKRAMGFVIQNLGSSD--------RLSVIAFSSSARRLFPLRRMTESGRKQSLLA 351 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ-- 323 +N L TN + + + + +I ++DG+++ + Sbjct: 352 VNSLTSNGGTNIAEGLRKGSKVIEE--------RQAKNPVCSIILLSDGQDTYTVSPSTG 403 Query: 324 ----NTLNTLQICEYMRNA------------GMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 T++ + + ++ A + L S G F Sbjct: 404 AHKPYTVSPTAGAQKASAEYCALLPSTNGSQQVPVHVFGFGADHDAVSLHSISQTSGGTF 463 Query: 368 FAVNDSRELLESFDK 382 + + ++F + Sbjct: 464 SFIETEATIQDAFAQ 478 >gi|162454087|ref|YP_001616454.1| hypothetical protein sce5811 [Sorangium cellulosum 'So ce 56'] gi|161164669|emb|CAN95974.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 907 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 23/178 (12%), Positives = 52/178 (29%), Gaps = 18/178 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ID +A LV+ + A + G VG + ++ + L Sbjct: 519 IDSARRAAQALVDRLAPADDFSLTTFSSDAEVVIEDGPVGPR-------RAAIRRAIEGL 571 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 TN + Y + + V+ ++DG + + L L Sbjct: 572 REGGGTNIGAGLSLGYAQASRPGIP-------EDAVRVVLLVSDGRATSGLTHSERLAWL 624 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDK 386 + G++ ++ + L+ G ++ + ++ + DK Sbjct: 625 AL--DAFQRGIQTSALGLG-DDFDGQLMSAIASDGAGGYYYLRHPEQIAPALSTELDK 679 >gi|148234427|ref|NP_001080470.1| matrilin 2 [Xenopus laevis] gi|28175657|gb|AAH45220.1| Matn2-prov protein [Xenopus laevis] Length = 589 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 54/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + + R+G + Y + + ++++ + ++ T T A+ +A Sbjct: 83 DIGPDTTRVGLLQYGSTVKNEFSLKMYKRKSDIERAVKRMMHLATGTMTGLAIQYAMNIA 142 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + + +TDG +I R +G+ I+++ V Sbjct: 143 FSEAEGARPL--NQYVPRIAMIVTDGRPQDP--------VEEISAKARMSGILIFAIGVG 192 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 L+ F V + + L F Sbjct: 193 R--VDMSTLKTIGSEPHSEHVFLVANFSQIETLTSVFQN 229 >gi|125532270|gb|EAY78835.1| hypothetical protein OsI_33940 [Oryza sativa Indica Group] Length = 606 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 63/184 (34%), Gaps = 25/184 (13%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG-IVGNQCTPLSN-NLNEVKSR 265 RK+ ++ ++ G +++++ A R+ ++++ + +S K Sbjct: 158 RKLALVKKAMGFVIDNLGPAD--------RLCVVSFSTEASRRTRLLRMSEVGKATAKRA 209 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG------- 318 + L TN + A R L + VI ++DG++S Sbjct: 210 VESLVDDSATNIGDGLRVAGRVLGD--------RRHKNAVSSVILLSDGKDSYVVPRRGN 261 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 +Y + + R I++ A + + + G F V + + + Sbjct: 262 GMSYMDLVPPSFASSGGRGRLAPIHTFGFGADHDAAAMNTIAESTGGTFSFVENEAAIQD 321 Query: 379 SFDK 382 SF + Sbjct: 322 SFAQ 325 >gi|297191182|ref|ZP_06908580.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] gi|197718543|gb|EDY62451.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] Length = 424 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 69/200 (34%), Gaps = 34/200 (17%) Query: 206 ANRKIDVLIESAGNLVNSIQK---------AIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 ++ ++ +++++ + + R PL Sbjct: 58 GKSRMAAAKQAFNEVLDAVPEEVRLGIRTLGADYPGDDRKRGC----KDTRQLYPVGPL- 112 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + E K+ + L P T PA+ A +L + ++ ITDGE+ Sbjct: 113 -DRTEAKAAVATLAPTGWTPIGPALLGAAEDL-----------EGGDATRRIVLITDGED 160 Query: 317 SGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDS 373 + L+ ++ + G+ I ++ + + + LR + + G + +V + Sbjct: 161 T-----CAPLDPCEVAREIAAKGIHLVIDTLGLVPDAKTRTQLRCIAEATGGTYTSVQHT 215 Query: 374 RELLESFDKITDKIQEQSVR 393 EL + ++ D+ + V Sbjct: 216 DELSDRVSQLVDRAADPVVT 235 >gi|262183593|ref|ZP_06043014.1| hypothetical protein CaurA7_06346 [Corynebacterium aurimucosum ATCC 700975] Length = 604 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 59/191 (30%), Gaps = 25/191 (13%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV-----GN 250 A +ID ++A + A Sbjct: 7 SGSMITNDAGGQTRIDAAKDAARTFITEAGDDAPLGLVTYGGNTGEAPEDEAAGCQDITV 66 Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P + N ++ + ++ L P T ++ A EL E + + +I Sbjct: 67 VTPPEAGNSEKMIAHMDGLQPRGFTPIGESLRKAAAELPKEGQ------------RSIIL 114 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQF 367 ++DG + + + ++ G+ I +V + PE Q L+ D + G + Sbjct: 115 VSDGVAT-----CTPPPVCDVAKELKEQGIDLVINTVGFNVEPEAQQELQCIADATGGTY 169 Query: 368 FAVNDSRELLE 378 +D+ L + Sbjct: 170 ANASDADSLAK 180 >gi|149641369|ref|XP_001505343.1| PREDICTED: similar to matrilin-3, partial [Ornithorhynchus anatinus] Length = 354 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 62/176 (35%), Gaps = 23/176 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 + +++++ + + R+ + Y + ++ +K ++++ P Sbjct: 170 VKTFLSQVIDTL-----DIGETATRVAVVNYASTVKVEFHLQTHSDKESLKQAVSRIAPL 224 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A+ A E++ + + + + K V+ +TDG + Sbjct: 225 ATGTMSGLAIRTAMDEVFTVEAGARA--PAFNIPKVVVIVTDGRPQD--------QVQEA 274 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + +G++IY+V V LR+ F V +L +F K Sbjct: 275 VAQAQASGIEIYAVGVGR--ADMQSLRQLASEPVETHAFYVETYGVIEKLTSTFRK 328 >gi|42407699|dbj|BAD08847.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|42408121|dbj|BAD09261.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] Length = 703 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 15/133 (11%), Positives = 44/133 (33%), Gaps = 16/133 (12%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 KS + L+ TN + A + VI ++DG+++ Sbjct: 324 GKASAKSAVESLHADGCTNILEGLVEAAKVFD--------GRRYRNAVASVILLSDGQDN 375 Query: 318 G-----ASAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 A + ++ + + +G + +++ + + ++ G F Sbjct: 376 YNVNGGWGASNSKNYSVLVPPSFKRSGDRRLPVHTFGFGTDHDASAMHTIAEETGGTFSF 435 Query: 370 VNDSRELLESFDK 382 + + + ++F + Sbjct: 436 IENQAVVQDAFAQ 448 >gi|326678379|ref|XP_002666248.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Danio rerio] Length = 1089 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 42/360 (11%), Positives = 102/360 (28%), Gaps = 31/360 (8%) Query: 15 YAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQ 74 A D+ ++ + + AL+ L+ A + K++ K + Sbjct: 84 VATDIERLLAKKRK---ALER--LASEAERLQKEHRWQDGIKEENIEYYNSKAEMDY--- 135 Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 D +++ D + Q A +IPT+ +I + L Sbjct: 136 ----DGEDIDSQMSLKLDFVYDPSFKNQVNYSHTA-VQIPTDIYKGAPVILNELNWTQAL 190 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 IE S ++ ++ + + + P + S Sbjct: 191 ERVFIENSRDDPSLLWQAFGSATGVTRYYPAAPWRAPDKIDLYDVRRRPWYIQGASSPKD 250 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCT 253 + + + + ++ S +++++ + + + + Sbjct: 251 MVILVDVSGSVSGLTLKLIKASVTEMLDTLSDDDYVNVARFNEKAEAVVPCFDHLVQANV 310 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 N K + ++ T+ H A+ +L N K ++ TD Sbjct: 311 R---NKKIFKEAVQQMQAKGTTDYKSGFHFAFNQLLNTNVPRA------NCNKIIMLFTD 361 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVND 372 G A N N +++++ +V L+ S+ G +F + Sbjct: 362 GGEDRAQDIFEQYNWP-------NKTVRVFTFSVGQHNYDVTPLQWIACSNKGYYFEIRS 414 >gi|297170900|gb|ADI21918.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [uncultured gamma proteobacterium HF0130_26L16] Length = 563 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 70/200 (35%), Gaps = 28/200 (14%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN----NL 259 +I + E+ +L+ K E + G + PL + Sbjct: 44 VEGKPRIVIAKETLSSLIE---KTPAEIRTGITAYGHRRKFDCSDIQEIVPLKSLDPMTK 100 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +V+ R++ L+ T ++ L E+ S ++ I+DG Sbjct: 101 YQVQERISTLSAMGKTPITDSIRQTVDRLKTEEGRST-----------IVLISDGLE--- 146 Query: 320 SAYQNTLNTLQICEYMRNAGMK--IYSVAVSAPPEGQDLLRKC-TDSSGQFFAVNDSREL 376 + + ++++G+ ++ V E + L G+FF ++ +L Sbjct: 147 ---SCGKDPCALTSELKSSGINFVMHVVGFGLTQEQEQKLSCISAAGEGKFFTAGNAADL 203 Query: 377 LESFDKITDKIQEQSVRIAP 396 L++ + + + EQ V + P Sbjct: 204 LDALTVVKESVVEQ-VELTP 222 >gi|284052945|ref|ZP_06383155.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] gi|291569123|dbj|BAI91395.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 489 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 48/147 (32%), Gaps = 20/147 (13%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + + E++ + +L+ + TN + A L N + Sbjct: 96 SSRASVVADFTRDERELQQAIARLSAWGGTNLSEGFNLATSVLQNSDRPGN--------- 146 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 ++ TDGE + I + +R +G+ + VAV + L T Sbjct: 147 --ILLFTDGEPNNRRM------AASIAQQIRASGINL--VAVGTGDAPVNYLTALTGDPD 196 Query: 366 QFFAVNDSRELLESFDKITDKIQEQSV 392 F + +L +F I Q + Sbjct: 197 LVFYA-NFGDLDSAFRGAEKAIYGQQL 222 >gi|284166763|ref|YP_003405042.1| von Willebrand factor A [Haloterrigena turkmenica DSM 5511] gi|284016418|gb|ADB62369.1| von Willebrand factor type A [Haloterrigena turkmenica DSM 5511] Length = 853 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 52/359 (14%), Positives = 109/359 (30%), Gaps = 42/359 (11%) Query: 32 ALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQI 91 +D AV++G A+ D T++ D + + + Sbjct: 496 TIDGAVIAGDAAEGGDDTLEIG---DDVQINFDRSLRSATPLSEEDTDLLFEYSDTRPPV 552 Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 ++T L G + E I Sbjct: 553 DVTFV---------------------LDRSGSMGPHNPTSWSAYEPDYEIDIGEEWEPIP 591 Query: 152 MVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 + + +Q +D+ + + ++ P + ++ P Sbjct: 592 TDEPFRNTHDWKSIQVRDDDGTIRTLEHRDFVHPDDWTEIRVHPYHQFGYIPGSIGIYPH 651 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + V + + I E + R+G + PLS++L K + Sbjct: 652 PGNDPTNQRVEATRNVIDELDPSADRVGVYDFASSGR--ALHPLSDDLESAKESVVG-TA 708 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 Y TN + A + T G+ ++ VI ++DG+N S N ++ Sbjct: 709 YGGTNMAAGLEAALN--------DYATRGTDDRERIVILLSDGKN---SNTANDERMDEL 757 Query: 332 CEYMRNAGMKIYSVAVSA---PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 + + +++V + A +D L + G ++ D ELL+ F++I D+ Sbjct: 758 ADRSDDLDYTLHTVGLDALEHDSIPEDKLEGWATETGGNYYQTADPDELLDLFEEIVDE 816 >gi|125575071|gb|EAZ16355.1| hypothetical protein OsJ_31817 [Oryza sativa Japonica Group] Length = 579 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 63/184 (34%), Gaps = 25/184 (13%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG-IVGNQCTPLSN-NLNEVKSR 265 RK+ ++ ++ G +++++ A R+ ++++ + +S K Sbjct: 158 RKLALVKKAMGFVIDNLGPAD--------RLCVVSFSTEASRRTRLLRMSEVGKATAKRA 209 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG------- 318 + L TN + A R L + VI ++DG++S Sbjct: 210 VESLVDDSATNIGDGLRVAGRVLGD--------RRHKNAVSSVILLSDGKDSYVVPRRGN 261 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 +Y + + R I++ A + + + G F V + + + Sbjct: 262 GMSYMDLVPPSFASSGGRGQLAPIHTFGFGADHDAAAMNTIAESTGGTFSFVENEAAIQD 321 Query: 379 SFDK 382 SF + Sbjct: 322 SFAQ 325 >gi|258652510|ref|YP_003201666.1| hypothetical protein Namu_2300 [Nakamurella multipartita DSM 44233] gi|258555735|gb|ACV78677.1| conserved hypothetical protein [Nakamurella multipartita DSM 44233] Length = 320 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 67/199 (33%), Gaps = 31/199 (15%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++D E+A + V+ + V +G +++ + + K ++ Sbjct: 106 NRLDAAKEAAQSFVDDLTPG--------VNLGIVSFAGIATVLVSP--TTDRTVAKQAID 155 Query: 268 KLNPYENTNTYPAMHHAYRELYN-EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 L E T T A+ + + + K + + ++ +TDG+ + Q+ Sbjct: 156 GLTLDERTATGEAIISSLQTIELFSKTLPPDGTDTGPPPARIVLMTDGKRTVGRTEQDAA 215 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPE-------------GQDLLRKCTD-SSGQFFAVND 372 AG+ + +A + +++ S G F Sbjct: 216 QRAAD------AGVPVSVIAFGTDNGSITVNDEVIPVPLDTEAMQQIAQISGGDFHQAAS 269 Query: 373 SRELLESFDKITDKIQEQS 391 + EL + ++ ++I ++ Sbjct: 270 AEELKSIYAQLGEQIGYET 288 >gi|73538303|ref|YP_298670.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72121640|gb|AAZ63826.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 358 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 19/174 (10%), Positives = 55/174 (31%), Gaps = 48/174 (27%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK----------------------- 292 S++ ++V + + L P T + A L + Sbjct: 154 SHSKDDVATAIEGLKPQGGTALGNGLLIALTTLLPQTTNDAERLMNGGDVAQPGKPGKAA 213 Query: 293 ----ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ + ++ +DGE++ +Q + G+++Y+V V Sbjct: 214 PGELDNGEPVRPGSYASGAIVLFSDGESNSG------PGAVQAAQLAATYGVRVYTVGVG 267 Query: 349 APPE--------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKI 387 + +L++ + ++F + D+ L + + + ++ Sbjct: 268 TTEGVVLSADGWSARVRLDEKVLKQVADTTGAEYFRLEDTAALKKVYRALNTRL 321 >gi|299534564|ref|ZP_07047896.1| hypothetical protein BFZC1_01007 [Lysinibacillus fusiformis ZC1] gi|298729937|gb|EFI70480.1| hypothetical protein BFZC1_01007 [Lysinibacillus fusiformis ZC1] Length = 864 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 55/155 (35%), Gaps = 19/155 (12%) Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 ++ T ++ E + + P T Y ++ AY L + K Sbjct: 449 FDDRPWEIIETGPLSSKEEAVDTILSVTPGGGTEIYSSLAKAYENLADLKLQR------- 501 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 K +I +TDG++ + + + G+ + +VA+ +LL +D Sbjct: 502 ---KHIILLTDGQSQAGNY-------EDLITEGKEDGITLSTVAIGQDA-DANLLEALSD 550 Query: 363 SS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 G+F+ V D + + + T I + P Sbjct: 551 MGSGRFYDVIDEQTIPSILSRETAMISRTYIEDNP 585 >gi|260426945|ref|ZP_05780924.1| von Willebrand factor type A [Citreicella sp. SE45] gi|260421437|gb|EEX14688.1| von Willebrand factor type A [Citreicella sp. SE45] Length = 334 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 65/172 (37%), Gaps = 25/172 (14%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 R+ I + PL+ +L+ + + L++ T M + L + + Sbjct: 140 RMALIVFGSAAYLQA--PLTADLDAILALLDR--------TQVGMAGPHTALGDSIGLAI 189 Query: 297 NTIGSTRLKKFV-IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--- 352 T S+ + + + I ++DG ++ + + + E ++I++V V P Sbjct: 190 RTFESSEIDQRLLILLSDGSDTASRM-----DPVNAAEIAAGRDVEIFTVGVGDPDATGE 244 Query: 353 ---GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIA--PNR 398 D LR D + G +F D L ++ I ++ ++ P R Sbjct: 245 NRVDLDTLRAIADRTGGAYFFAADEAALTAVYESIDALAPRETETLSFRPRR 296 >gi|221110023|ref|XP_002170779.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 671 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 30/236 (12%), Positives = 75/236 (31%), Gaps = 18/236 (7%) Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 +D + L KK T + L + ++K Sbjct: 6 SPDDTAQAGLIRNLKIGHGKKGTLHLETPDCEGFFDVGFILDSSGSLKSQYWKEKDFLKK 65 Query: 227 AIQEK--KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHH 283 N G + ++ + + + +++++ ++ T A+ Sbjct: 66 LANSFGISNKGSHAGVVTFSHYAELSIRLDAFYSSIDFNDAVDRISHMDSFTRIDLALAK 125 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A L++ K + N + + +TDG+ + + I + ++ G++++ Sbjct: 126 ALE-LFDIKNGARN-----DVPNLLFLLTDGKQ------EPEMPLTHISDEIKQKGIQLF 173 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL--ESFDKITDKIQEQSVRIAPN 397 +V + A + L K + F V+D +LL + K+ + Sbjct: 174 AVGIGAGAN-KTELEKIVGNPENVFMVDDFDKLLNGDFLKKVKQGSCSSVLENLKK 228 >gi|297463635|ref|XP_002702824.1| PREDICTED: collagen, type VII, alpha 1 [Bos taurus] Length = 2933 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 56/158 (35%), Gaps = 19/158 (12%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKES 294 VR + Y+ + +V + +L+ NT T A+ H ++ + + Sbjct: 77 VRFAAVQYSDDPRTEFDLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADRVFLPQLA 136 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + K I ITDG++ + ++ G+K+++V + Sbjct: 137 RPG------VPKVCILITDGKSQDMVD--------TAAQRLKGQGVKLFAVGI--KNADP 180 Query: 355 DLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 181 EELKRIASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 68/204 (33%), Gaps = 27/204 (13%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 A + + + + +LV+++ +V++G ++Y+ S++L Sbjct: 1058 VHATRDNAHRSEAVKRALEHLVSALGPLGP----QAVQVGLLSYSHRPSPLLSLNSSHDL 1113 Query: 260 NEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + ++ + N A+ A+R L + ++ + D Sbjct: 1114 GVILQKIRNIPYTDPSGNNLGTAVVTAHRYLLAPDAPGRR----RHVPGVMVLLVDEPLR 1169 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-----QFFAVND 372 G + AG+++ + + + LR+ G FFAV+D Sbjct: 1170 GDIFNS--------IREAQAAGLRV--MMLGLAGADPEQLRRLV--PGMDPSQTFFAVDD 1217 Query: 373 SRELLESFDKITDKIQEQSVRIAP 396 L + + + + ++ P Sbjct: 1218 GLSLERAVSSLAAALCQTALTTQP 1241 >gi|297488708|ref|XP_002697097.1| PREDICTED: collagen, type VII, alpha 1 [Bos taurus] gi|296474920|gb|DAA17035.1| collagen, type VII, alpha 1 [Bos taurus] Length = 2932 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 56/158 (35%), Gaps = 19/158 (12%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKES 294 VR + Y+ + +V + +L+ NT T A+ H ++ + + Sbjct: 77 VRFAAVQYSDDPRTEFDLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADRVFLPQLA 136 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + K I ITDG++ + ++ G+K+++V + Sbjct: 137 RPG------VPKVCILITDGKSQDMVD--------TAAQRLKGQGVKLFAVGI--KNADP 180 Query: 355 DLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 181 EELKRIASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 27/204 (13%), Positives = 68/204 (33%), Gaps = 27/204 (13%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 A + + + + +LV+++ +V++G ++Y+ S++L Sbjct: 1058 VHATRDNAHRSEAVKRALEHLVSALGPLGP----QAVQVGLLSYSHRPSPLLSLNSSHDL 1113 Query: 260 NEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + ++ + N A+ A+R L + ++ + D Sbjct: 1114 GVILQKIRNIPYTDPSGNNLGTAVVTAHRYLLAPDAPGRR----RHVPGVMVLLVDEPLR 1169 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-----QFFAVND 372 G + AG+++ + + + LR+ G FFAV+D Sbjct: 1170 GDIFNS--------IREAQAAGLRV--MMLGLAGADPEQLRRLV--PGMDPSQTFFAVDD 1217 Query: 373 SRELLESFDKITDKIQEQSVRIAP 396 L + + + + ++ P Sbjct: 1218 GLSLERAVSSLAAALCQTALTTQP 1241 >gi|320007358|gb|ADW02208.1| von Willebrand factor type A [Streptomyces flavogriseus ATCC 33331] Length = 428 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 69/198 (34%), Gaps = 26/198 (13%) Query: 204 APANRKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 +I ++ ++++++ + + + PL + Sbjct: 56 MGGESRISAAKQAFNDVLDAVPEEVQLGIRTLGADYPGDDRKVGCKDTKQLYPVGPL--D 113 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 E K+ + L P T PA+ A +L + ++ ITDGE++ Sbjct: 114 RTEAKTAVATLAPTGWTPIGPALLGAADDL-----------DGGDSTRRIVLITDGEDTC 162 Query: 319 ASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 L+ ++ + G+ I ++ + + + L + + G + AV ++ E Sbjct: 163 G-----PLDPCEVARDIAARGIHLVIDTLGLVPDAKIRQQLTCIAEATGGTYTAVQNTDE 217 Query: 376 LLESFDKITDKIQEQSVR 393 L ++ D+ E V Sbjct: 218 LSGRVKQLVDRAAEPVVT 235 >gi|45384490|ref|NP_990665.1| collagen alpha-1(XIV) chain precursor [Gallus gallus] gi|1705533|sp|P32018|COEA1_CHICK RecName: Full=Collagen alpha-1(XIV) chain; AltName: Full=Undulin; Flags: Precursor gi|288873|emb|CAA50064.1| collagen XIV [Gallus gallus] Length = 1888 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 55/168 (32%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G Y+ + V + L NT T A+ + Sbjct: 186 SAFNVGSEKTRVGLAQYSGDPRIEWHLNAYGTKDAVLDAVRNLPYKGGNTLTGLALTYIL 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + K I ITDG++ + +R+AG++++++ Sbjct: 246 ENSFKPEAGAR-----PGVSKIGILITDGKSQDDVIPP--------AKNLRDAGIELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V D + + +T + + Sbjct: 293 GV--KNADINELKEIASEPDSTHVYNVADFNFMNSIVEGLTRTVCSRV 338 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 62/201 (30%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ I ++ Sbjct: 1047 VDGSWSIGDDNFNKIISFLYSTVGALDKIGPDGT----QVAIIQFSDDPRTEFKLNAYKT 1102 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ HA L+ + K ++ ITDG + Sbjct: 1103 KETLLEAIQQIAYKGGNTKTGKAIKHAREVLFTG-----EAGMRKGIPKVLVVITDGRSQ 1157 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 ++ M+ G +++ V L F V+D Sbjct: 1158 DDVN--------KVSREMQLDGFSFFAIGV--ADADYSELVNIGSKPSERHVFFVDDF-- 1205 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1206 --DAFTKIEDELITFVCETAS 1224 >gi|288875|emb|CAA50063.1| collagen XIV [Gallus gallus] Length = 1857 Score = 66.5 bits (160), Expect = 7e-09, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 55/168 (32%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G Y+ + V + L NT T A+ + Sbjct: 186 SAFNVGSEKTRVGLAQYSGDPRIEWHLNAYGTKDAVLDAVRNLPYKGGNTLTGLALTYIL 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + K I ITDG++ + +R+AG++++++ Sbjct: 246 ENSFKPEAGAR-----PGVSKIGILITDGKSQDDVIPP--------AKNLRDAGIELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V D + + +T + + Sbjct: 293 GV--KNADINELKEIASEPDSTHVYNVADFNFMNSIVEGLTRTVCSRV 338 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 62/201 (30%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ I ++ Sbjct: 1047 VDGSWSIGDDNFNKIISFLYSTVGALDKIGPDGT----QVAIIQFSDDPRTEFKLNAYKT 1102 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ HA L+ + K ++ ITDG + Sbjct: 1103 KETLLEAIQQIAYKGGNTKTGKAIKHAREVLFTG-----EAGMRKGIPKVLVVITDGRSQ 1157 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 ++ M+ G +++ V L F V+D Sbjct: 1158 DDVN--------KVSREMQLDGFSFFAIGV--ADADYSELVNIGSKPSERHVFFVDDF-- 1205 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1206 --DAFTKIEDELITFVCETAS 1224 >gi|291527684|emb|CBK93270.1| Mg-chelatase subunit ChlD [Eubacterium rectale M104/1] Length = 1237 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 29/214 (13%), Positives = 65/214 (30%), Gaps = 29/214 (13%) Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 S + N+++ V NL + + + + + + Sbjct: 598 QDVWNTTGFSGVEAILVIDDSGSMVSNDRYNQRLTVAQNLIDNLPENSKVGVVKFTSSTT 657 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 ++ T + S T+ Y A++ ++ Sbjct: 658 KLTTSLTSDKETAKSYLTTSY-----------FRSSGGTSMYTAINSSFSMF-------- 698 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ-- 354 + K +I ++DG T + N G+KIY+V + + Sbjct: 699 -EATDDNILKMMIVLSDGAT------SYTYLHSSVVTTANNNGVKIYTVGLGSSSSSYFT 751 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 L+ + + G F+ +D+ +L + + I KI Sbjct: 752 QYLKPLANNTGGAFYLASDASQLEDIYKDINKKI 785 >gi|264678234|ref|YP_003278141.1| hypothetical protein CtCNB1_2099 [Comamonas testosteroni CNB-2] gi|262208747|gb|ACY32845.1| putative membrane protein [Comamonas testosteroni CNB-2] Length = 408 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 32/316 (10%), Positives = 78/316 (24%), Gaps = 22/316 (6%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+ + F+ A+DL + ++ ++Q+A+D+ L+ + + Sbjct: 18 FALFMLFLLGFMGIALDLGRLFIVKTELQTAMDSCALAAARELNGQSDAITRAQNAGMAA 77 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + A I+ + Q +T + L ++ QY + + L+L Sbjct: 78 GNSNNANLQSANWNGQGKLPATGISFRKQDYVTPTSDGKLARY--AECQYSMSSIKLWLL 135 Query: 122 GLIPS---------ALTNLSLRSTGIIERSSENLAISIC---MVLDVSRSMEDLYLQKHN 169 + + + R+ S I + D S Sbjct: 136 QAMGAFTGDSATWPNTGTVEARAVATRAPSQSACPIPVQLKKAKFDALNSSPGGGKGTWI 195 Query: 170 DNNNMTSNKYLL-PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES--AGNLVNSIQK 226 + S + T + ++ + A + Sbjct: 196 VAMTKSGGGSDFGWSNLDGSTSATETWAELEGKYCSTEFPQLPLDTNGLKASAFEYWNHR 255 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 KK + + +N ++ + T+ + Sbjct: 256 FGIYKKASDTPAQNSSASRPDYTGYIYTDTNWPSKFNAY--DGPAQGGTSN---FITQRQ 310 Query: 287 ELYNEKESSHNTIGST 302 S T Sbjct: 311 SFTPCGPSCPAKGNET 326 >gi|260837294|ref|XP_002613640.1| hypothetical protein BRAFLDRAFT_227016 [Branchiostoma floridae] gi|229299026|gb|EEN69649.1| hypothetical protein BRAFLDRAFT_227016 [Branchiostoma floridae] Length = 216 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 63/177 (35%), Gaps = 19/177 (10%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 N+V + + ++G + Y+ I ++L ++ ++ + Sbjct: 24 KTFLNNIVGQF-----DIGPTATQVGVVQYSWFIRQECALNAHSSLASLQQAISNITVLG 78 Query: 274 -NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T A+ A + + K V+ +TDG + + Sbjct: 79 LGTHTGAALTFARNTALTAANGAR-----PGVPKIVVVMTDGASEDDVTLPS-------- 125 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + +RN G+ ++++VS LL+ S + FA D L ++ ++ E Sbjct: 126 QNLRNDGVITFAISVSWSLPNDRLLQDIAGSPDRIFAATDFDALDGIKVTLSSQLCE 182 >gi|147901111|ref|NP_001079801.1| matrilin 1, cartilage matrix protein [Xenopus laevis] gi|32450626|gb|AAH54272.1| MGC64509 protein [Xenopus laevis] Length = 490 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 61/183 (33%), Gaps = 23/183 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+S+ + +G + Y+ + ++ ++KS + K Sbjct: 283 NFELVKQFINQIVDSM-----DVGERRAHVGLVQYSSSVRQEFPLGRYSSKKDIKSAVKK 337 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ + T T A+ + + + K I TDG + Sbjct: 338 MSYMEKGTMTGQALQYLVDNSFAISSGGRPA-----VPKVGIVFTDGRSQDYIND----- 387 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 + G K+++V V ++ LR F D + + E K+ Sbjct: 388 ---AAARAKELGYKMFAVGVG--NAVEEELRMIASEPQAEHSFYTADFKAMKEIGKKLQM 442 Query: 386 KIQ 388 KI Sbjct: 443 KIC 445 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 53/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + K+ P T T A+ +A Sbjct: 69 DVGANATRVGLVNYASTVKNEFSLKTHKAKPALLQAVKKVQPLSTGTMTGLAIQYAMNNA 128 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E E + S + K I +TDG I R +G++IY++ V Sbjct: 129 FTESEGARIK--SPGINKVAIVVTDGRPQDT--------VKDISARARESGLEIYAIGVG 178 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 ++ LR+ V +L + F + Sbjct: 179 R--VDKNTLRQIASEPLDEHVDYVESYSLIEKLSKKFQE 215 >gi|255587116|ref|XP_002534143.1| protein binding protein, putative [Ricinus communis] gi|223525789|gb|EEF28236.1| protein binding protein, putative [Ricinus communis] Length = 728 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 26/281 (9%), Positives = 82/281 (29%), Gaps = 25/281 (8%) Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 + + F + + + + + I+ E + S D ++ Sbjct: 190 SLDLQPAFSDRSSGNKTPDHNSQKSIEIKTYPEVPSASRSCAYDNF-TVLVHLKAPATVT 248 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 P +S + + + + A K+ +L + G ++ ++ Sbjct: 249 MQNPRINQASLPQLSQSPRAPVDLVTVLDISGSMAGTKLALLKRAMGFVIQNL------G 302 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 N + + + + + +N L + TN + + + + Sbjct: 303 SNDRLSVIAFSSTARRLFPLRRMSDTGRQQALQAVNSLVAHGGTNIAEGLRKGAKVMEDR 362 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL---QICEYMRNAG-------MK 341 +E + +I ++DG+++ + + + G + Sbjct: 363 REKNPVAS--------IILLSDGQDTYTVSSSGANQPQPNYHLLLPLSIHGGDTSGFQIP 414 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 +++ A + + S G F + + ++F + Sbjct: 415 VHAFGFGADHDASSMHSISEVSGGTFSFIETEAVIQDAFAQ 455 >gi|119512060|ref|ZP_01631154.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] gi|119463286|gb|EAW44229.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] Length = 418 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 54/160 (33%), Gaps = 15/160 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 +K R+ +A++ N ++K ++N L T + EL Sbjct: 71 DKLKPGDRLSVVAFDHRATVLVPNQTITNPGQIKKQINSLTADGGTAIDEGLRLGIEEL- 129 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 G +TDGE + + + L+ + + + ++ Sbjct: 130 --------AKGKKETVSQAFLLTDGE----NEHGDNQRCLKFAQLATGYNLTLNTLGFGD 177 Query: 350 PPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQ 388 QD+L K D+ G + + + F+++ ++Q Sbjct: 178 KWN-QDVLEKIADAGLGSLSHIQKPEQAADEFNRLFSRVQ 216 >gi|219847249|ref|YP_002461682.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541508|gb|ACL23246.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 842 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 22/121 (18%), Positives = 47/121 (38%), Gaps = 16/121 (13%) Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+++R+ + TN A+ L E + + +TDG + Sbjct: 461 AELQTRIATMAIGGGTNIERALAVGLPALAAE----------PHSVRHAVLLTDGRSYSN 510 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + + Q+ E R A + + ++A+ DLL + +G+++ V D+ +L Sbjct: 511 NYPRY----QQLVETARAAQITLSTIAIGTDA-DTDLLEQLARWGNGRYYFVPDAADLPR 565 Query: 379 S 379 Sbjct: 566 I 566 >gi|116051069|ref|YP_790101.1| von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa UCBPP-PA14] gi|296388430|ref|ZP_06877905.1| von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa PAb1] gi|313108364|ref|ZP_07794396.1| putative von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa 39016] gi|115586290|gb|ABJ12305.1| putative von Willebrand factor type A domain [Pseudomonas aeruginosa UCBPP-PA14] gi|310880898|gb|EFQ39492.1| putative von Willebrand factor type A domain-containing protein [Pseudomonas aeruginosa 39016] Length = 340 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 28/146 (19%), Positives = 52/146 (35%), Gaps = 35/146 (23%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + V+ L++ +NT A+ A + L + ++ Sbjct: 150 PLTFDRHTVRVWLDEAQIGIAGKNTAIGDAIGLAVKRL----------RQRPAESRVLVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------GQ 354 ITDG N+G + + +KIY++ V A P+ + Sbjct: 200 ITDGANTGGQ-----IAPQIAAQLAAEQQVKIYTIGVGADPQQGGVPGLFGFNPGLDLDE 254 Query: 355 DLLRKCTD-SSGQFFAVNDSRELLES 379 LR + + G++F S EL Sbjct: 255 PTLRGIAEITGGEYFRARSSAELESI 280 >gi|308068884|ref|YP_003870489.1| von Willebrand factor A [Paenibacillus polymyxa E681] gi|305858163|gb|ADM69951.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Paenibacillus polymyxa E681] Length = 432 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 73/177 (41%), Gaps = 23/177 (12%) Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIV-GNQCTPLSNN--LNEVKSRLNKLNP--YENTNT 277 + K++ + + R+ ++++ TP+ + V S+++ + T Sbjct: 140 TAAKSLIGQMDGDKRVAIVSFDSTAQLVQPFTPIRTDAEKQAVYSKIDSMQTIMSGGTEI 199 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ +E+ + G+ VI ++DG + L+T Sbjct: 200 RLALDETIKEIETQ--------GNAEKGSLVIMLSDG--------FSELDTQTALAPYIA 243 Query: 338 AGMKIYSVAVS-APPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + + ++ + A +G LL+ D + G + V +++ L ++F KI +KI ++++ Sbjct: 244 RQIPVNTIGLKLAESDGIALLQNIADLTGGTYSNVANAQGLTQAFGKIYNKIGDRTL 300 >gi|256261600|gb|ACU65922.1| CR4 receptor subunit alphaX [Ovis aries] Length = 1158 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 59/177 (33%), Gaps = 17/177 (9%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENT 275 +++ ++ + + + S + + ++ + ++ + LN T Sbjct: 167 FNKMLSFVKAVMSQFQRPSSQFSLVQFSDRFQEHFTFKDFATSSDPLNLLNSVWQLGGWT 226 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T A+ L + + K +I ITDGE + ++ ++ Sbjct: 227 FTASAIRFVTDRLLSAAYGAR-----KDASKILIVITDGEKTE------KVDYKEVIPRA 275 Query: 336 RNAGMKIYSVAVSAP---PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 AG+ Y++ V + L + F V + L + ++ +KI Sbjct: 276 EAAGVIRYAIGVGSAFQYRNSLQELIDIASTPSKEHVFQVENFDALRDIQKQLKEKI 332 >gi|317127857|ref|YP_004094139.1| von Willebrand factor A [Bacillus cellulosilyticus DSM 2522] gi|315472805|gb|ADU29408.1| von Willebrand factor type A [Bacillus cellulosilyticus DSM 2522] Length = 282 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 23/146 (15%), Positives = 55/146 (37%), Gaps = 23/146 (15%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 +K + ++ TN + + + VI ++DG+ + Sbjct: 104 LKEYIQRIETRGITNLSGGLIQGCQHVL--------KQEVKNYVNRVILLSDGQANAG-- 153 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESF 380 + +++ + ++AG+ I ++ V + ++LL DS G F +N+ + F Sbjct: 154 ITDKEALVKLADDYQSAGLVISTLGV-SEHFDEELLEGVADSGRGNFHFINEVENIPSIF 212 Query: 381 DK--------ITDKIQEQSVRIAPNR 398 ++ I I ++ I P + Sbjct: 213 EQELDGLLNVIGQNI---TLNILPKK 235 >gi|156358436|ref|XP_001624525.1| predicted protein [Nematostella vectensis] gi|156211311|gb|EDO32425.1| predicted protein [Nematostella vectensis] Length = 1323 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 55/184 (29%), Gaps = 16/184 (8%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + ++ LV ++ G IAY+ S Sbjct: 1143 RSTSVGPTNFNIAKTFLKILVERMK-----ISTNGSHFGLIAYSSSASRVISFRFSQKAA 1197 Query: 261 EVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++ +++ + T T A+ A +L+ + +I +T+G S Sbjct: 1198 DINRQIDAIEFTGGKTRTDFALQVAITDLFTNS-----AGDRENVTDVLIVMTNGRTSQG 1252 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 S + + ++ + + +V + +LL V+D L Sbjct: 1253 SLPY-----KDVMKPLKEKKVDVIAVGIGPDVNEAELLEIAEGGLDHVIRVDDYEALATK 1307 Query: 380 FDKI 383 + I Sbjct: 1308 LNSI 1311 >gi|307944861|ref|ZP_07660199.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] gi|307772075|gb|EFO31298.1| von Willebrand factor, type A [Roseibium sp. TrichSKD4] Length = 862 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 23/186 (12%), Positives = 65/186 (34%), Gaps = 24/186 (12%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL-NEVK 263 KI++ ++ + + + G P+ + ++ Sbjct: 55 GGKSKIEIARDAFAEAKTAWD--AGTGQVGLIAYGHRRKGDCRDIETLVPMGSGSGADIS 112 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +R+N + P T ++ A +EL +E++ V+ +DG + Sbjct: 113 TRINSIRPKGKTPLSQSVRLAAQELQYREEAAT-----------VVLFSDGIET------ 155 Query: 324 NTLNTLQICEYMRNAGM--KIYSVAVSAPPE-GQDLLRKCTD-SSGQFFAVNDSRELLES 379 + + E + G+ + + + + L+ + + G +F +D+ L ++ Sbjct: 156 CNADPCLLAEELERDGIDFTAHVIGFGIGSDADRKKLQCIAENTGGTYFDADDAGSLKDA 215 Query: 380 FDKITD 385 ++T+ Sbjct: 216 LGQVTN 221 >gi|225873423|ref|YP_002754882.1| hypothetical protein ACP_1808 [Acidobacterium capsulatum ATCC 51196] gi|225793805|gb|ACO33895.1| hypothetical protein ACP_1808 [Acidobacterium capsulatum ATCC 51196] Length = 339 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 34/155 (21%), Positives = 69/155 (44%), Gaps = 14/155 (9%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 N SNNL+ + S + L+P T Y A++ A R+ S ++ Sbjct: 159 DEAPNFILNWSNNLDTLSSAIQDLHPGGGTALYDAVYSACRDKLLNAASGP-----IYVR 213 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQDLLRKCT- 361 + +I ++DG+++ ++ + ++ C+ + A IY+V+ G D+LRK Sbjct: 214 RAIILVSDGDDNQ--SHAYLTDAIKECQRAQTA---IYAVSTDTDPTPDPGDDILRKMAE 268 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 ++ G+ F L SF+ + D+++ Q + Sbjct: 269 ETGGRAFFPRVITNLPASFNSVEDELRSQYALVYK 303 >gi|254480861|ref|ZP_05094107.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214038656|gb|EEB79317.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 726 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 43/137 (31%), Gaps = 21/137 (15%) Query: 245 IGIVGNQCTPLSNNL--NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 P + N + + L+ Y T ++ A H+ + Sbjct: 322 SCPNTELVLPFTANRQATAINRTADALSAYGYTPIADSLTLA----------GHDLLAID 371 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKC 360 K +I I+DGE + + +R+ G+ + + + Q ++ Sbjct: 372 AQKHMIILISDGEETCGGF------PAAVAANLRSLGIDLQTHVIGFDLDATAQQQMQAI 425 Query: 361 TD-SSGQFFAVNDSREL 376 GQ+F D EL Sbjct: 426 ASAGGGQYFDAADGDEL 442 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 54/180 (30%), Gaps = 21/180 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC---TPLSNNLNEV 262 RKID+ + + I ++ + + + Sbjct: 29 GARKIDLAKNLFQGMSEQFALDPHMSLRFFAGGTSSNKMIDCQSSKIGLGVGAARSAAGI 88 Query: 263 KSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 S ++ +N T A+ A +++ + +I I+DG+ + Sbjct: 89 SSMIDSINAVGQQTPITYALERAQQDM-----------EGWLGPRKIILISDGQET---- 133 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + + E AG+ + ++ + P L +G+F + L E+ Sbjct: 134 --CQQDPESLAEAFSGAGITVDTIGIGPPGNFAQLGMIALAGAGEFHLAENLAALQEAMA 191 >gi|119896366|ref|YP_931579.1| hypothetical protein azo0074 [Azoarcus sp. BH72] gi|119668779|emb|CAL92692.1| conseved hypothetical exported protein [Azoarcus sp. BH72] Length = 563 Score = 66.5 bits (160), Expect = 8e-09, Method: Composition-based stats. Identities = 40/338 (11%), Positives = 86/338 (25%), Gaps = 27/338 (7%) Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 + I + RE I + + + + + Y L Sbjct: 70 AAKIAPPAVAPLPLAEPADRERYQAIERHGIQRVAEAPVSTFSIDVD-TGSYSNLRRMLN 128 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 L P + R S + V ++ Sbjct: 129 AGQLPPRDAVRVEELVNYFPYRYSLPQGDAPFAVDTEIAPTPWNPRSLLLRVGIQAADPA 188 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 PP + + + S +P K+ +L + V ++ R+ Sbjct: 189 KQALPPANLVFLVDVSGSMNSP------DKLPLLQNALKLFVAQLRPQD--------RVA 234 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y G + + + ++ L P T + AYR Sbjct: 235 LVTYASGTRVVLEPTAGDRKAAITAAIDGLVPGGATAGAAGIDLAYRMAEQGFVEHGINR 294 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 ++ TDG+ + TL R +G+ + ++ L+ + Sbjct: 295 --------ILLATDGDFNVGITRFETLKDRVA--ERRKSGIALSTLGFGGGNYNDQLMEQ 344 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 D+ + L E+ + D+ +A + Sbjct: 345 LADAGDGAYR--YIDSLAEAQKVLVDEFTSTLATVASD 380 >gi|254784286|ref|YP_003071714.1| matrixin family protein [Teredinibacter turnerae T7901] gi|237683907|gb|ACR11171.1| matrixin family protein [Teredinibacter turnerae T7901] Length = 877 Score = 66.1 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 31/202 (15%), Positives = 68/202 (33%), Gaps = 15/202 (7%) Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + AP P+ K+D L +A ++ + V+ + N + Sbjct: 433 NLSSAPDPSVSKMDALKYAANVFMD-FLDLDAGHRAGLVQFHEVVVPFSPAFNLQPVNAA 491 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 +L+ ++ +N + TN ++ +L + S + ++ +TDG ++ Sbjct: 492 SLSAAQTAINSMTAGGMTNIIDGVNEGIAQLTTAVDPSDR--------QIMLLLTDGLHN 543 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 T T + + + +YSV +L + G D +L Sbjct: 544 RPVGTSVTDITAPLL----ASEVTLYSVGFGTSTNEAELTPLALSTGGVHLENKDVSDLQ 599 Query: 378 --ESFDKITDKIQEQSVRIAPN 397 + F I + + I P+ Sbjct: 600 LRKHFLSIAASAADSTTLIDPH 621 >gi|198436966|ref|XP_002122845.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H5 [Ciona intestinalis] Length = 1586 Score = 66.1 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 31/362 (8%), Positives = 94/362 (25%), Gaps = 17/362 (4%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 S + + ++ + + + + Sbjct: 790 AARDQSSSVFQTSTNLAPRKRVIFQLTYQEALQRKRGIYQYGVSFRMLQPVSMFSITVSI 849 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + + TE + G +P S + + ++ + Sbjct: 850 SESVPLSTVNALGLETEQTSVPGPVPLQGITTVRNSPVSAVITYTPTSNQQHLISPFGLN 909 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA----PAPAPANRKIDVLIE 215 + + + + + + + KID + + Sbjct: 910 GKFVIEYDVFRDRTTEMV-IDQSYFAHFITSNLPPMSKRVVFLIDVSGSMFGIKIDQVRQ 968 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYEN 274 + +++ + + + G + + N+N + LN + Sbjct: 969 AMNTILHGLAETDFFSVIAFNSSVSRWSPSGTAAVLASGTTANINSAMNFLNTTVVTRGG 1028 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T+ A+ A + + NT F++ +TDG + + + + Sbjct: 1029 TDILQAVEAAIQLFDSAATGGTNTASD-----FMVLLTDGRPTDGTVSSTAI--ISAIRN 1081 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ----FFAVNDSRELLESFDKITDKIQEQ 390 + I ++ + L + +SG F +N ++ +++I+ I Sbjct: 1082 LNRGRFGINTIGFGTLVDMNLLRKIAAQNSGTSIQIFIDLNSYAQISNFYEEISQPILSN 1141 Query: 391 SV 392 + Sbjct: 1142 TT 1143 >gi|322436225|ref|YP_004218437.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321163952|gb|ADW69657.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 304 Score = 66.1 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 33/166 (19%), Positives = 69/166 (41%), Gaps = 20/166 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + + + +N+ +++ LN+L + T Y A++ A + L Sbjct: 109 REQDEFDLMDFSDTVREVVSFTNDKKRIENGLNELRKGDATAVYDAVYLASQRLGETNAG 168 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS-VAVSAPPEG 353 ++ ITDG+N+ + Q E + AG+ +Y+ + V + Sbjct: 169 GGRRR-------VLVLITDGDNTVHGVGYD-----QAVEQAQRAGVMVYALIVVPIEADA 216 Query: 354 -------QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 L++ TD+ G ++ VND R+L + + K++D ++ Q V Sbjct: 217 GRNTGGEHALIQMATDTGGNYYYVNDPRDLAKVYAKVSDDLRTQYV 262 >gi|28899191|ref|NP_798796.1| hypothetical protein VP2417 [Vibrio parahaemolyticus RIMD 2210633] gi|28807415|dbj|BAC60680.1| hypothetical protein [Vibrio parahaemolyticus RIMD 2210633] Length = 431 Score = 66.1 bits (159), Expect = 8e-09, Method: Composition-based stats. Identities = 26/239 (10%), Positives = 65/239 (27%), Gaps = 6/239 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK-DQ 59 + ++++ + + IDL H + + ++Q+A+D A L+G + Sbjct: 29 LISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVDQAEAAVIAT 88 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 S+I + L + +N +Y + + + Sbjct: 89 LSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYVRVAVTDMGISQY 148 Query: 120 LKGLIPS-ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 L + + S + + I M D + ++ED + + + Sbjct: 149 LSAVFGIVKNVSASAVAGRSAAIAYTCNLTPIAMCGDPNGTVEDAWGYRPPGYDPNVDMD 208 Query: 179 YLLPPP----PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 L + + A N ++ ++ N Sbjct: 209 PSLVHELKVGDQNNTDMGPGNFQLLDFGQATGNSGAALVRDALSGAYNGCAAVGNTVTT 267 >gi|260433775|ref|ZP_05787746.1| von Willebrand factor type A [Silicibacter lacuscaerulensis ITI-1157] gi|260417603|gb|EEX10862.1| von Willebrand factor type A [Silicibacter lacuscaerulensis ITI-1157] Length = 327 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 52/141 (36%), Gaps = 21/141 (14%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK-FVIFIT 312 P + +L + L + T M L + T S+ + + +I ++ Sbjct: 155 PFTEDLQSLNGFLEQ--------TAVGMAGPNTALGDAIGLGIRTFESSEVDQRMMIVLS 206 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSG 365 DG ++ + + + G+ IY++ V P D L+ + + G Sbjct: 207 DGADTSSRMT-----PVIAASIAADKGVVIYTIGVGDPDATGEDRVDLDALKDIANKTQG 261 Query: 366 QFFAVNDSRELLESFDKITDK 386 Q+F +D L E + +I + Sbjct: 262 QYFFADDEAALTEVYRQIDAQ 282 >gi|42407700|dbj|BAD08848.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|42408122|dbj|BAD09262.1| zinc finger (C3HC4-type RING finger) protein family-like [Oryza sativa Japonica Group] gi|125602049|gb|EAZ41374.1| hypothetical protein OsJ_25891 [Oryza sativa Japonica Group] Length = 704 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 16/174 (9%), Positives = 56/174 (32%), Gaps = 6/174 (3%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K+ +L + G +++ + + + + +YN V + KS + Sbjct: 247 KLTLLKRAMGFVIDKLGPGD------RLAVVSFSYNAQRVIRLTRMSDDGKASAKSAMES 300 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L TN + A + + + + +G +++ ++ Sbjct: 301 LAAGGGTNILKGLVEAAKVFDGRRYRNAVASVILLSDGQDTYNVNGGWGASNSKNYSVLV 360 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + +++ + + ++ G F + + + ++F + Sbjct: 361 PPSFKRSGDRRLSVHTFGFGTDHDAVAMNAIAEETGGTFSFIENQAVVQDAFAQ 414 >gi|297683547|ref|XP_002819437.1| PREDICTED: collagen alpha-1(XIV) chain-like [Pongo abelii] Length = 1761 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 57/168 (33%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + RIG Y+ + +EV + L NT T A+++ + Sbjct: 186 TAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIF 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + T + K I ITDG++ + +R +G++++++ Sbjct: 246 ENSFK-----PEAGSRTGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V + + + +T + + Sbjct: 293 GV--KNADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 >gi|255566338|ref|XP_002524155.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] gi|223536573|gb|EEF38218.1| Inter-alpha-trypsin inhibitor heavy chain H3 precursor, putative [Ricinus communis] Length = 514 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 46/128 (35%), Gaps = 20/128 (15%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 ++ ++++ +N LN TN + + L + SS +G +I ++DGE Sbjct: 120 DSQKDLENLINGLNADGATNITAGLQTGLKVLNDRSLSSGRVVG-------IILMSDGEQ 172 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS--SGQFFAVNDSR 374 + + +Y+ +L+ ++ G F V ++ Sbjct: 173 NAGGDAAQVPIG----------NVPVYTFGFGINH-EPRVLKAIANNSMGGTFSDVQNTD 221 Query: 375 ELLESFDK 382 L +F + Sbjct: 222 NLSLAFSQ 229 >gi|115374996|ref|ZP_01462267.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|310820519|ref|YP_003952877.1| von willebrand factor type a domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|115368023|gb|EAU66987.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|309393591|gb|ADO71050.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 476 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 55/163 (33%), Gaps = 13/163 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 R+ + Y + + N + + + TN + ++ Sbjct: 127 KDDDRLAIVHYGSDVKSLPGLQATPANRERMIQYIEGIWDEGGTNISAGLLAGQAQVETA 186 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + +I I+DG+ + + + Q+ + +R G+ + S+ V Sbjct: 187 RSDYRVNR--------LILISDGQPTEG--STDEGSLKQVVKDIRTRGITVSSIGVGTD- 235 Query: 352 EGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVR 393 +DL++ + G + + D+ +L F K + Q R Sbjct: 236 FNEDLMQAFAEYGAGSYGFLEDAGKLATLFQKDLQQASTQVAR 278 >gi|307249749|ref|ZP_07531728.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 4 str. M62] gi|306858257|gb|EFM90334.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 4 str. M62] Length = 530 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 57/487 (11%), Positives = 121/487 (24%), Gaps = 123/487 (25%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVS----------------DRTIKDPTTKK 57 +++ A I+ + ++ +L+ AVLS A + + + + Sbjct: 36 FVSLESAGIIQDQARLSDSLEQAVLSLTAENNNGRKDNDYKLSGSSNKENDSFDISSEVG 95 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + S + +K L Q + E I T K + I ++ Sbjct: 96 KRDSQMVTTFVKAFLPQTN--EEKMHLIPTCKTKTDTNKKGHTSSSEVTCTVSGTIEHKS 153 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENL---AISICMVLDVSRSMEDLYLQKH------ 168 F + + + + +N I + +V D+S SM K+ Sbjct: 154 WFPLKVGSVEVIPHEVNVASKSKAFKKNTFNIPIDLMVVADLSGSMNYDLSNKNEIVGSP 213 Query: 169 -------NDNNNMTSNKYLLPPPPKKSFWSKNTTK------------------------- 196 D + + K LL + T Sbjct: 214 MSKLGILQDVLSELAEKTLLSEEANHNNRIYVTPFALGAEISSSNCAIPYSWDMNKNNQE 273 Query: 197 -----SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS---VRIGTIAYNIGIV 248 + + + + + + L + G Sbjct: 274 LENAKNTLSNSKNSQYYRAEFINNLVYKLNTRDTLTNIGGQKDYKLKYSKGAFCLKNMRT 333 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 N+ NN ++ + L T + A + S + K+ + Sbjct: 334 QNKGWYSHNNKSDFIKYVKALKADGATLASSGLLVAANNMIKS--GSRTKELGEQTKRVI 391 Query: 309 IFITDGEN-------------SGASAYQNTLNTL-------------------------- 329 + ++DG + + +N + Sbjct: 392 LVLSDGNDEIVKGEISGIPFLNYTRITENLIYGKQEVFLSQKQKISLSHSTIDTYLTNTQ 451 Query: 330 ------QICEYMR--------NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 +C +R + KI V + C +G +++ ND Sbjct: 452 PKEVTNGMCNVIRDKLDKLNNDKNTKIVFVEFGYASRARKAWEHCV-GNGNYYSANDKES 510 Query: 376 LLESFDK 382 LL SF + Sbjct: 511 LLNSFKQ 517 >gi|326433447|gb|EGD79017.1| NOTCH2 protein [Salpingoeca sp. ATCC 50818] Length = 1763 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 28/234 (11%), Positives = 73/234 (31%), Gaps = 16/234 (6%) Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 L + S+ D L + + + S + + ++ ID L Sbjct: 271 LSYNESIPDCPLGTVWEPDGEGCICEVEGSQCTGSACGTGYYELEAGGCQGATDQNIDTL 330 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PY 272 + + + + +Q ++ + +++ + L+ L P Sbjct: 331 PDC---EIRQFFPHGCADCTCNPLGQKYQPSDSYTVDQLFSMTQDHDDLAATLDSLVFPD 387 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+ + ++ + + + +I +TDG+++ ++ Sbjct: 388 GATHMSAGLEQIRDTIFQLRNGMREY--EQAIPRVLIVLTDGKSNPGF------EPHEVA 439 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKIT 384 E +RN G+ IY++ V L + + D L D+++ Sbjct: 440 EQLRNRGIIIYAIGVG--DYYLPELEAMASEPMDRHVYELADPSSLFTIVDRLS 491 >gi|301604858|ref|XP_002932077.1| PREDICTED: collagen alpha-1(VI) chain-like [Xenopus (Silurana) tropicalis] Length = 1025 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 28/238 (11%), Positives = 72/238 (30%), Gaps = 16/238 (6%) Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 + D + + P F+ + T A P + Sbjct: 32 IKDPNGFPAVGPTITVRPGPGPEPERVTFQDCPVDVFFVLD-TSESVALRVKPFKTLVTQ 90 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + + ++ + + + + ++ A + + L+ ++ ++ + + Sbjct: 91 VKDFTKKFIDKL-TSRYYRCDRNLVWNAGALHYSDEVIMISSLTRDMKTLRDDVETVEYI 149 Query: 273 E-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T+T A+ E+ + K++I +TDG Sbjct: 150 GKGTHTDCAIKRGIEEVL-------IGGSHQKENKYLIVVTDGHPLEGYKEPCG-GLEDA 201 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ--FFAVNDSRELLESFDKITDKI 387 ++ G+K++SVA+S L + F + L + ++I + I Sbjct: 202 ANEAKHLGIKVFSVAISPNHLEPR-LSVIASDASHRRNFTATSAAGLTD--EEIDNTI 256 Score = 44.1 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 34/189 (17%), Positives = 64/189 (33%), Gaps = 27/189 (14%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN- 270 L KA + S R+ + Y+ +N ++ ++ + Sbjct: 847 TSKSFVKLLAQRFLKAK-APPSGSARVSVVQYSGLNQQKVEAQFVSNYTVLEVPVDNMQF 905 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 TN A+ A EL + + K ++ +DG + + L+ Sbjct: 906 INGATNVVSALR-AVTEL-------YREDSLAGVSKKLLVFSDGNTQ------DEKDMLK 951 Query: 331 ICEYMRNAGMKIYSVAVSAP---PEGQDLLRKCTDS------SGQFFAVNDSRELLE--S 379 + R AG++IY +AV + P Q +L + F V D LL+ Sbjct: 952 AVQDARAAGIEIYVLAVGSRLNYPNLQVMLTGSAADITAPFPEERLFRVPDYPSLLQGVR 1011 Query: 380 FDKITDKIQ 388 + I+ +I Sbjct: 1012 YQSISRRIS 1020 >gi|150251390|gb|ABR68007.1| matrilin-like 40 kDa protein [Lehmannia valentiana] Length = 390 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 25/161 (15%), Positives = 50/161 (31%), Gaps = 16/161 (9%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRE 287 N R G + ++ + + +V + L T T A Sbjct: 241 VGNSKARFGALLFSDFVENLFYLNKYTSTADVSKAI--LRAPYHRGTTLTNEAFDFIRT- 297 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 E S G + V+ TDG+++ TL + ++ ++I +V + Sbjct: 298 ---EGVFSTPKGGRSNAPDIVVVFTDGQST------KPALTLAAADNLKRQNVRIVAVGI 348 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 ++ LR+ S F + L K+ + Sbjct: 349 G-NEVSKEELRQVASSRDDVFEASSFENLDYIEQKLAKNVC 388 >gi|167617233|ref|ZP_02385864.1| hypothetical protein BthaB_13083 [Burkholderia thailandensis Bt4] Length = 396 Score = 66.1 bits (159), Expect = 9e-09, Method: Composition-based stats. Identities = 31/322 (9%), Positives = 77/322 (23%), Gaps = 29/322 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ D+ L+ + + P Sbjct: 4 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADSCALAAARDLTGAINLSVPEAAGITA 63 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + ++ Q + +P + N F+ Sbjct: 64 GHLNYALFEQFPVQLQTNASVTFTDSLSNPFQPKSAITSPSSIKYVKCMTSQTGIVNWFI 123 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMV----------LDVSRSMEDLYLQKHND 170 + L ++ S ++ A + C + + + + Sbjct: 124 QALDMVPGVTVANASVSATAIATIGAAQTTCAIPVFICKAGTQTNPPVAGATYNIGDWLS 183 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR------------KIDVLIESAG 218 + + + N+ S + Sbjct: 184 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGNKAATTNAYN 243 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYN---IGIVGNQCTPLSNNLNEVKSR----LNKLNP 271 ++ + AY+ N + S + +N Sbjct: 244 TRFGIYANPYKDPSYGTPDFTGYAYDATTWPSQSNAYADFVSKRQTFTSYQGDLITGINT 303 Query: 272 YENTNTYPAMHHAYRELYNEKE 293 N A R L E Sbjct: 304 GGTYNPNYYAAGADRRLALAPE 325 >gi|149635474|ref|XP_001506111.1| PREDICTED: similar to alpha-1 type VII collagen [Ornithorhynchus anatinus] Length = 2993 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 33/214 (15%), Positives = 69/214 (32%), Gaps = 26/214 (12%) Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAP----APAPANRKIDVLIESAGNLVNSIQKA 227 + + + P + N + + + ++ + LV Sbjct: 12 WAVFFSTGWVVAQPGERVICTNVYAADVVFLVDGSSSIGRNNFRMVRDFLEGLVLPFVNV 71 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYR 286 +++ VR G + Y+ + EV + +LN NT T + H Sbjct: 72 VRDTG---VRFGAVQYSDDPRTEFALGTHASGQEVMRAVRELNYKRGNTRTGAGLRHVAE 128 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 ++ + + + K I ITDG + + Q ++N +K+++V Sbjct: 129 HFFHPQLARPG------VPKVCILITDGRSQD--------DVEQGALKLKNQNVKVFAVG 174 Query: 347 VSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + ++ LRK S F + D L Sbjct: 175 IKNAH--EEELRKVASSPVEEYHFFIPDFAILRT 206 >gi|47223676|emb|CAF99285.1| unnamed protein product [Tetraodon nigroviridis] Length = 628 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 65/198 (32%), Gaps = 18/198 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + + + + + +R+ I ++ L+ + ++++ Sbjct: 47 YFVLDKSGSVQHYWNEIFYFVHHLAHKFISPQMRMSFIVFSTDGR--TLMALTEDRDKIR 104 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + L +L P +T +H A ++Y + R +I +TDGE Sbjct: 105 AGLEELRMVQPGGDTYMDRGLHRASEQIY------YAAGDGYRAASVIIALTDGELREDQ 158 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND-SRELLES 379 + R G +Y V + + L DS F V+D L Sbjct: 159 FDT----AQREAGRARQLGASVYCVGL--KDFNETQLSTIADSKDHVFPVHDGFEALQSV 212 Query: 380 FDKITDKIQEQSVRIAPN 397 D I + + + + P+ Sbjct: 213 IDSILKRSCIEILAVQPS 230 >gi|302796872|ref|XP_002980197.1| hypothetical protein SELMODRAFT_444450 [Selaginella moellendorffii] gi|300151813|gb|EFJ18457.1| hypothetical protein SELMODRAFT_444450 [Selaginella moellendorffii] Length = 542 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 15/176 (8%), Positives = 63/176 (35%), Gaps = 18/176 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNN-LNEVKSRL 266 K++++ + ++ ++++ R+ ++++ + ++ + S + Sbjct: 97 KLELVKTAMEFVIRNLRQQD--------RLAIVSFSDEPKVHLGLKRMTYDGREAALSAV 148 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 KL T P + + L + ++ ++DG ++ + + + Sbjct: 149 EKLRTLGGTEIRPGLKAGFDLLS--------RRRNRNPVSSIMLLSDGMDNAITFKRCKV 200 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + +++ + + + +L + G F V + + +F + Sbjct: 201 LPVDSYLEDCSERVPVHTFGFGSDHDPEAMLSIAEATGGSFCYVQEESTVQHAFAQ 256 >gi|297583258|ref|YP_003699038.1| von Willebrand factor type A [Bacillus selenitireducens MLS10] gi|297141715|gb|ADH98472.1| von Willebrand factor type A [Bacillus selenitireducens MLS10] Length = 978 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 55/141 (39%), Gaps = 18/141 (12%) Query: 255 LSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 +S+N ++ L+ L +P T+ M A + K ++ K+ ++ ITD Sbjct: 114 MSDNRYDLLDALSALPDPSGGTDLSQGMRAANEQFVQTKGAN---------KQIMVLITD 164 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--GQDLLRKCTD-SSGQFFAV 370 G ++ N R G+ I+++ + + + LL+ D + GQ+ V Sbjct: 165 GADTI-----NLAEVYNQVREARMNGITIFTLGLGSLATGLDEALLQDIADQTRGQYRQV 219 Query: 371 NDSRELLESFDKITDKIQEQS 391 ++ + I ++ Sbjct: 220 PNATVIESVLQDIRSSLEGMR 240 >gi|157961563|ref|YP_001501597.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] gi|157846563|gb|ABV87062.1| von Willebrand factor type A [Shewanella pealeana ATCC 700345] Length = 328 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 49/142 (34%), Gaps = 25/142 (17%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P + + + L +++ T A+ A + + K ++ Sbjct: 154 PFTQDKQILSQLLQQMDVRMAGAGTAIGDAIGVAVNHFEQSEVEN----------KVLLL 203 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-S 363 +TDG ++ + L Y G+ IY +A+ P L + D + Sbjct: 204 LTDGNDTSSEFP-----PLDAAHYAGEQGVVIYPIAIGDPKNVGEDSLDIATLERIADLT 258 Query: 364 SGQFFAVNDSRELLESFDKITD 385 G+ F +D + L+E + + Sbjct: 259 QGRVFEADDGQSLIEVYKVLEQ 280 >gi|290999945|ref|XP_002682540.1| predicted protein [Naegleria gruberi] gi|284096167|gb|EFC49796.1| predicted protein [Naegleria gruberi] Length = 502 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 59/181 (32%), Gaps = 25/181 (13%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ + LV + + N + + ++ Sbjct: 96 SKLTACKSAIRELVTNFLT---------YKDTIHLITYSDSPKTVFTEKNKESVNLNDID 146 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 K++ +TN A+H A L+N K + F +DG+ + N Sbjct: 147 KISTEGSTNIASALHSAVDLLHNSNAPGT---------KLIAFFSDGQCNVGETNLNIFG 197 Query: 328 TLQICE-----YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFD 381 + + + ++ + I S V + + L+ + G+++ + D ++F+ Sbjct: 198 SGLLKKLKDYSEGKDDQIHISSYGVGSD-YDELWLQAIARTGKGEYYYLEDETYAKDAFE 256 Query: 382 K 382 + Sbjct: 257 R 257 >gi|156405834|ref|XP_001640936.1| predicted protein [Nematostella vectensis] gi|156228073|gb|EDO48873.1| predicted protein [Nematostella vectensis] Length = 250 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 19/134 (14%), Positives = 47/134 (35%), Gaps = 6/134 (4%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N+N + +N + T A+ A +L++ S + K + TDG+ Sbjct: 95 DNINAKVAAVNYRDWGGLTYIDRALKLANEQLFS---PEGGMRASKDILKVAVVFTDGKQ 151 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSSGQFFAVNDSRE 375 + L + +++ +++Y + + +++ + E Sbjct: 152 TKDKGPFTEL--QIASQPLKDKDVQVYGLGIGDETTIDVQEMQEMANKPENVLTAKTFEE 209 Query: 376 LLESFDKITDKIQE 389 L +IT + E Sbjct: 210 LKNLAAQITQGVCE 223 >gi|107028246|ref|YP_625341.1| hypothetical protein Bcen_5496 [Burkholderia cenocepacia AU 1054] gi|116687157|ref|YP_840404.1| hypothetical protein Bcen2424_6782 [Burkholderia cenocepacia HI2424] gi|105897410|gb|ABF80368.1| conserved hypothetical protein [Burkholderia cenocepacia AU 1054] gi|116652872|gb|ABK13511.1| conserved hypothetical protein [Burkholderia cenocepacia HI2424] Length = 423 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 38/356 (10%), Positives = 94/356 (26%), Gaps = 28/356 (7%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTI----KDPTTK 56 + + ++V F+ A+DL + R+++Q++ DA LS + S ++ D Sbjct: 25 IVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSAISLSVAEADGIAA 84 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 F+K + + + + N N + Sbjct: 85 GHLNFVFFQK-TSVQMSTNANVTFSDSLTNPFLTKNAVTTPANIKYVQCTATLSNIAHWF 143 Query: 117 NLFLKGLIPSA-LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 L L + + ++ I +I + + S + + + + Sbjct: 144 IEVLNVLPGTKLANAAEVSASAIATVGGGQTTCAIPVFVCRGPSDPAYKVGDWISSPSGS 203 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 S+ Y + + + + + ++ + Sbjct: 204 SSTYGPGNFGWAALDGSTNETTLASELSGNTCN--------ITSPPDLATTGMKSASQRA 255 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH-AYRELYNEKES 294 Y G G+ P V P + AY + ++ S Sbjct: 256 WNTRFGIYTNGANGSSGQPDFTGYAYV-------GPNYGPPGTAGIKGDAYTQFVADRAS 308 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLN------TLQICEYMRNAGMKIYS 344 G T+G + ++ + C ++ + K++ Sbjct: 309 FKPYQGDGAPPSGSGIATNGTATASNYSTYGSDRRLALAPEGDCSTLKGSSGKVHV 364 >gi|126660809|ref|ZP_01731904.1| hypothetical protein CY0110_12397 [Cyanothece sp. CCY0110] gi|126617906|gb|EAZ88680.1| hypothetical protein CY0110_12397 [Cyanothece sp. CCY0110] Length = 416 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 49/134 (36%), Gaps = 15/134 (11%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +++ V ++ +L P T M +E+ K+ + +TDGE Sbjct: 98 DDIKTVNQQIQRLEPAGGTCIDEGMKLGIKEVALGKDDR---------VSQIFLLTDGE- 147 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS-SGQFFAVNDSRE 375 + + + L++ + + + ++ QD+L D+ G + + Sbjct: 148 ---NEHGDNERCLKLAQVAAEYNITLNTLGFG-NHWNQDVLESIADAVGGTLCYIEQPEQ 203 Query: 376 LLESFDKITDKIQE 389 + F ++ +IQ Sbjct: 204 AITEFIRLFTRIQS 217 >gi|218680121|ref|ZP_03528018.1| hypothetical protein RetlC8_15005 [Rhizobium etli CIAT 894] Length = 168 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 17/118 (14%), Positives = 40/118 (33%), Gaps = 22/118 (18%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + + + + D +R +MQS LDAA+++ I + K + Sbjct: 30 IVALSLVPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAVKQINNTGDTD---ALKLKV 86 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + F Q++ + + + + + A +PT + Sbjct: 87 TDWFHAQVENSY-------------------TLGEIDIDTTNHNITATASGTVPTTFM 125 >gi|194677117|ref|XP_001253768.2| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Bos taurus] Length = 1186 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 32/350 (9%), Positives = 91/350 (26%), Gaps = 30/350 (8%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + T + +D + + L G A Sbjct: 176 LDRKVQALKRLADAAETFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSKAST 235 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L +E ++ + Sbjct: 236 LRLDFVEDSNFKNKVNYSYTA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTL 294 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 295 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 354 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 355 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 410 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + + T Sbjct: 411 GMVAKGTTGYKAGFEYAFDQLQNXXXXXXXXFTDGGEDRV-----QDVFEKYNWPNRT-- 463 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 +++++ +V L+ ++ G +F + + Sbjct: 464 ------------VRVFTFSVGQHNYDVTPLQWMACTNKGYYFEIPSIGAI 501 >gi|326931809|ref|XP_003212016.1| PREDICTED: matrilin-4-like, partial [Meleagris gallopavo] Length = 465 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 59/164 (35%), Gaps = 15/164 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYR 286 + + R+G I Y+ + E++ +N + P T T A+ +A Sbjct: 63 NLDVGPNATRVGVIQYSSQVQNIFSLKTFFTRAEMERAINSIVPLAQGTMTGLAIQYAMN 122 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + +E + ++ + I +TDG ++ + RNAG++IY+V Sbjct: 123 VAFTVQEGARP--PHKKIPRIAIIVTDGRPQD--------RVSEVAAHARNAGIEIYAVG 172 Query: 347 VSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 + + LR F V + + + DK+ Sbjct: 173 IQR--ADMNSLRAMASPPLEEHVFLVESFELIQQFGKQFQDKLC 214 >gi|118100589|ref|XP_425698.2| PREDICTED: similar to matrilin-4 [Gallus gallus] Length = 564 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 59/164 (35%), Gaps = 15/164 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYR 286 + + R+G I Y+ + E++ +N + P T T A+ +A Sbjct: 63 NLDVGPNATRVGVIQYSSQVQNIFSLKTFFTRAEMERAINSIVPLAQGTMTGLAIQYAMN 122 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + +E + ++ + I +TDG ++ + RNAG++IY+V Sbjct: 123 VAFTVQEGARP--PHKKIPRIAIIVTDGRPQD--------RVSEVAAHARNAGIEIYAVG 172 Query: 347 VSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 + + LR F V + + + DK+ Sbjct: 173 IQR--ADMNSLRAMASPPLEEHVFLVESFELIQQFGKQFQDKLC 214 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 62/172 (36%), Gaps = 20/172 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ + +V+ ++ + +G + Y+ + ++ E+K+ + K Sbjct: 360 NFELVKQFVNRIVDLLEVSPDG-----THVGLVQYSSRVRTEFPLNKYHSAEEIKAAVMK 414 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + T T A+ H ++E E + S + + + TDG + + Sbjct: 415 MEYMEKGTMTGLALKHMVEHSFSELEGARPL--SHNVPRIGLVFTDGRSQDDISEW---- 468 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 + + +G+ +++V V ++ LR F D + Sbjct: 469 ----AQRAKESGIVMFAVGVGKAV--EEELRAIASEPVEQHFSYSADFTTMT 514 >gi|156383644|ref|XP_001632943.1| predicted protein [Nematostella vectensis] gi|156220006|gb|EDO40880.1| predicted protein [Nematostella vectensis] Length = 982 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 25/164 (15%), Positives = 56/164 (34%), Gaps = 13/164 (7%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSN----NLNEVKSRLNKLNPYEN-TNTYPAMH 282 + +G I Y+ + + N +K ++ + N T A+ Sbjct: 112 AYSVSEEATHVGVITYSTEATLDIAFDKYSGVEMNSVNLKKDIDIIPQKNNLTFMDKALE 171 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 A L+ E K+ +F+TDG + + + +++ G+ + Sbjct: 172 LANSVLFTEARGMRPNK-----KQVCLFLTDGIQTFDQGPYTK--PSIVSQKLKDRGIDV 224 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 Y+V V + +LL + ++ + EL +I + Sbjct: 225 YTVGVGDDVDLFELL-SISSGDKYTYSAKNFDELQAKVQEILQE 267 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 32/367 (8%), Positives = 91/367 (24%), Gaps = 37/367 (10%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 AA + A T+ + + E + + Sbjct: 357 AAAIKVAADHAFSMFGGVRQTQPKVFVLFAPRGSTSTAAEIKEAAEKLKKNGIRLMVVGI 416 Query: 95 KDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 + + Y + + N + +A+ + + + + + Sbjct: 417 DNSADTATYSSAVTQPAKRFLMNTKDYDDLNAAVWEI---ADTVCKSAVTPGKCRTPDAG 473 Query: 155 DVSRSMEDLYLQKHND----------NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 D + ++ D + T+ K P + T S+ Sbjct: 474 DTGGAQCEVDKDCGVDQLCCDDGSGKTSCKTAIKNCFVPFEMAIAMDASETVSRQDFVRM 533 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 + + + + V G + ++ + + Sbjct: 534 KSFVRDLMWPNANSENNIHFGLMTFAGTTKKVTEGFRKFRSEQELDELLDKIEKTQDPQR 593 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 R++K A +E ++ + + + + ++ +D + G+ Sbjct: 594 RVDK-----------TFKFASKEFFSMEGGTRHGHE----RYLLVLASDATSPGSGDLNE 638 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 L + R +AV+ + LR + + EL + +I Sbjct: 639 AAKDLDLLNVKR--------IAVATNSDVQSTFLRAVASDDKYVYQAKSTDELGQVSTQI 690 Query: 384 TDKIQEQ 390 + ++ Sbjct: 691 NQILCKE 697 >gi|326916310|ref|XP_003204451.1| PREDICTED: collagen alpha-1(XII) chain-like, partial [Meleagris gallopavo] Length = 2040 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 24/196 (12%), Positives = 60/196 (30%), Gaps = 19/196 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 A + D + + N++ +++ + Y+ ++ + Sbjct: 1250 ASWSIGDDNFNKVVKFVFNTVGAFDL-INPAGIQVSFVQYSDEAKSEFKLNTFDDKAQAL 1308 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 L + NT T A+ ++ + + K ++ +TDG + Sbjct: 1309 GALQNVQYRGGNTRTGKALTFIKEKVLT-----WESGMRRGVPKVLVVVTDGRSQD---- 1359 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESF 380 + ++++G ++ V V + L K F V+D + Sbjct: 1360 ----EVRKAATVIQHSGFSVFVVGV--ADVDYNELAKIASKPSERHVFIVDDFDAFEKIQ 1413 Query: 381 DKITDKIQEQSVRIAP 396 D + + E + P Sbjct: 1414 DNLVTFVCETATSTCP 1429 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 49/166 (29%), Gaps = 20/166 (12%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY-RE 287 + V+IG Y+ + + L NT T A+ Sbjct: 146 DIGPDKVQIGLAQYSGDPRTEWNLNAYRTKEALLEAVTNLPYKGGNTLTGMALDFILKNN 205 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + K + ITDG++ + +R+ G+++Y++ + Sbjct: 206 FKQDAGLRPRAR------KIGVLITDGKSQDDVVTPS--------RRLRDEGVELYAIGI 251 Query: 348 SAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 ++ L++ + V D L + +T + Sbjct: 252 --KNADENELKQIATDPDDIHAYNVADFSFLASIVEDVTTNLCNSV 295 >gi|149042955|gb|EDL96529.1| matrilin 4 (predicted), isoform CRA_a [Rattus norvegicus] Length = 637 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 55/153 (35%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + R+G I Y+ + +N +++ + + P T T A+ +A Sbjct: 80 DVGLNATRVGVIQYSSQVQSVFPLGAFSNREDMERAIRAVVPLAQGTMTGLAIQYAMNVA 139 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + R+ + ++ +TDG ++ R G++IY+V V Sbjct: 140 FSEAEGARP--SEERVPRVLVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 189 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 190 RADVGS--LRAMASPPLDQHVFLVESFDLIQEF 220 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 416 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 470 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + TDG + + Sbjct: 471 VEYMERGTMTGLALRHMVEHSFSEVQGARPR--DLNVPRVGLVFTDGRSQDDISVW---- 524 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V ++ LR+ D + + + Sbjct: 525 ----AARAKEEGIVMYAVGVGKAV--EEELREIASEPSELHVSYSPDFNTMTHLLENLKG 578 Query: 386 KIQ 388 I Sbjct: 579 SIC 581 >gi|119585524|gb|EAW65120.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2, isoform CRA_c [Homo sapiens] Length = 664 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 36/350 (10%), Positives = 95/350 (27%), Gaps = 24/350 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + + +D + + L G A Sbjct: 127 LDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSKAST 186 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L +E ++ + Sbjct: 187 LRLDFIEDPNFKNKVNYSYAA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTL 245 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + + P + S + + + Sbjct: 246 LWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL 305 Query: 209 KIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + ++ S +++++ + + + ++ +V N K + Sbjct: 306 TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKVFKEAVQ 361 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T +A+ +L N + K ++ TDG N Sbjct: 362 GMVAKGTTGYKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQDVFEKYN 415 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 416 WP-------NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 458 >gi|118087212|ref|XP_424219.2| PREDICTED: similar to matrilin 2 [Gallus gallus] Length = 1799 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 55/158 (34%), Gaps = 18/158 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + + R+G I Y + E++ + ++ T T A+ +A Sbjct: 63 DVSPDATRVGLIQYGSTVKHEFSLKTFRRKQEIERAVRRMMHLATGTMTGLAIQYAVNIA 122 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG +I RN+G+ I+++ V Sbjct: 123 FSESEGARPL--NQNVPRIIMIVTDGRPQDPVG--------EIAAKARNSGILIFAIGVG 172 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFD 381 + L+ F V + + L F Sbjct: 173 R--VDMNTLKSIGSEPHEEHIFLVANFSQIETLTSVFQ 208 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 19/187 (10%), Positives = 58/187 (31%), Gaps = 20/187 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +++ + ++++++ + + R+G + Y+ + + ++K Sbjct: 1501 GEDNFEIVKQFVSGILDTLEISPKAA-----RVGLLQYSSEVRTEFTLRRFSTAKDMKKA 1555 Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++++ + T A+ + + E E + S + + I TDG + Sbjct: 1556 VSQMKYMGRGSMTGLALKQMFERSFTETEGARPL--SANIPRISIVFTDGRAQDEVSEWA 1613 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDK 382 ++LL F D + + ++ Sbjct: 1614 ARAKRSGIIIYAIG---------IGKAIEEELLE-IASEPSYKHLFYAEDFTAMEDISEE 1663 Query: 383 ITDKIQE 389 + +I E Sbjct: 1664 LKVQICE 1670 >gi|157818269|ref|NP_001100009.1| matrilin-4 [Rattus norvegicus] gi|149042956|gb|EDL96530.1| matrilin 4 (predicted), isoform CRA_b [Rattus norvegicus] Length = 624 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 55/153 (35%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + R+G I Y+ + +N +++ + + P T T A+ +A Sbjct: 67 DVGLNATRVGVIQYSSQVQSVFPLGAFSNREDMERAIRAVVPLAQGTMTGLAIQYAMNVA 126 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + R+ + ++ +TDG ++ R G++IY+V V Sbjct: 127 FSEAEGARP--SEERVPRVLVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 176 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 177 RADVGS--LRAMASPPLDQHVFLVESFDLIQEF 207 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 403 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 457 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + TDG + + Sbjct: 458 VEYMERGTMTGLALRHMVEHSFSEVQGARPR--DLNVPRVGLVFTDGRSQDDISVW---- 511 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V ++ LR+ D + + + Sbjct: 512 ----AARAKEEGIVMYAVGVGKAV--EEELREIASEPSELHVSYSPDFNTMTHLLENLKG 565 Query: 386 KIQ 388 I Sbjct: 566 SIC 568 >gi|320450208|ref|YP_004202304.1| von Willebrand factor, type A [Thermus scotoductus SA-01] gi|320150377|gb|ADW21755.1| von Willebrand factor, type A [Thermus scotoductus SA-01] Length = 414 Score = 66.1 bits (159), Expect = 1e-08, Method: Composition-based stats. Identities = 20/158 (12%), Positives = 52/158 (32%), Gaps = 13/158 (8%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 R+ + Y+ + + L+ N E+ RL + P +TN + ++ ++ Sbjct: 90 RVAVVIYDHQVEVVVPSTLAENKEEILRRLRPVRPRGSTNLHAGWLEGSTQVAAHLDAKR 149 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 VI ++DG + N + + G+ ++ V + Sbjct: 150 LNR--------VIVLSDGLANTGETNPNVIAEQ--VRGLSQRGVSTSTLGVGLDYNEDLM 199 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + G ++ + +L F ++ + + Sbjct: 200 MAMAEAGQGNYYFIESPDDLPGIF---AQELSGLATTL 234 >gi|281345873|gb|EFB21457.1| hypothetical protein PANDA_003362 [Ailuropoda melanoleuca] Length = 399 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 56/160 (35%), Gaps = 17/160 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + ++IG + ++ ++ ++ + + + T T A++ Sbjct: 253 DISTDEIQIGLLQFSSTPQEEFRLDQYSSKVDIHRAITNVQQMNDGTRTGKALNFTRPFF 312 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + +++++I ITDG + +R+ + I+++ V Sbjct: 313 DSSRGGRP------NVQQYLIVITDGVAQDDVVMP--------AKALRDRNIVIFAIGVG 358 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + LL+ TD + + + L KI K+ Sbjct: 359 -EAKNAQLLQ-ITDDPQKVYYEENFESLQNLEKKILLKVC 396 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 63/191 (32%), Gaps = 19/191 (9%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 + + L +++ VRIG + Y+ N + L Sbjct: 13 WDRQSNFQQVVNFLKSTVSSLNVHP--DGVRIGLVFYSEEPRLEFSLDAFQNPASILEYL 70 Query: 267 NKLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++L T T A+ E++ + +++ + IT+G + Sbjct: 71 DRLTYRRRSGRTKTGAALDFLRNEVF---IEERGSRSKHGVQQMAVVITEGFSQD----- 122 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFD 381 + +R AG+ IY+V E +D L ++ +L + Sbjct: 123 ---QLSKSASLLRRAGVTIYAVGTHLASESKD-LENIASYPPWKHVISLESFLQLSVVGN 178 Query: 382 KITDKIQEQSV 392 KI +++ +++ Sbjct: 179 KIKNQLCPETL 189 >gi|332262934|ref|XP_003280513.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-X-like [Nomascus leucogenys] Length = 1164 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 28/182 (15%), Positives = 64/182 (35%), Gaps = 20/182 (10%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL---SNNLNEVKSRLNKLN 270 + ++N ++ I + + S + + ++ + S+N + + + +L Sbjct: 164 SGNFATMMNFVRAVISQFQRPSTQFSLMQFSNRFETHFTFEKFRRSSNPLSLLASVRQL- 222 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T T A+ +L++ + K +I ITDG ++L+ Sbjct: 223 -GGLTYTATAIQKVVHQLFHASYGARRDAT-----KILIVITDG-----KKEGDSLDYKH 271 Query: 331 ICEYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + AG+ Y++ V + L F V D L + +++ + Sbjct: 272 VIPMADAAGIIRYAIGVGSAFQNTNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKE 331 Query: 386 KI 387 KI Sbjct: 332 KI 333 >gi|332531455|ref|ZP_08407359.1| von Willebrand factor, type A [Hylemonella gracilis ATCC 19624] gi|332039124|gb|EGI75546.1| von Willebrand factor, type A [Hylemonella gracilis ATCC 19624] Length = 346 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 15/174 (8%), Positives = 53/174 (30%), Gaps = 47/174 (27%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE------------------------ 291 + N ++ + +++ T + + ++ E Sbjct: 144 TTNREDLNAAIDRFQLQRGTAIGNGIVLSLAAIFPEVGIDLGQFSYGRPGQPRGPSLDMP 203 Query: 292 --KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + +I +TDG+ + ++ L+ + G+++Y+V V Sbjct: 204 PAGPPPTPVAPGSYGSAAIILLTDGQRTTG------IDPLEAAKLASERGVRVYTVGVGT 257 Query: 350 PPE--------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQ 388 ++ L+ + ++F + L + ++ ++ ++ Sbjct: 258 VEGVTVGFEGWSMHARLDEESLKHIAQQTRAEYFHAASAEALTQVYETLSSRLT 311 >gi|189230272|ref|NP_001121460.1| anthrax toxin receptor 2 [Xenopus (Silurana) tropicalis] gi|183985706|gb|AAI66225.1| LOC100158556 protein [Xenopus (Silurana) tropicalis] Length = 488 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 31/167 (18%), Positives = 57/167 (34%), Gaps = 18/167 (10%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNE 291 S R+ PL+ + E+ L L+ P T + A ++ Sbjct: 73 SPRMRLSFIVFSTQAKIILPLTGDRYEITKGLKDLSSVIPAGETYMHEGFKLANEQI--- 129 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 G +I +TDG+ A Q + T + R G ++Y V V Sbjct: 130 -----VKAGGKSTASVIIALTDGK----LADQIPVLTEKEANIARGRGARVYCVGVL--D 178 Query: 352 EGQDLLRKCTDSSGQFFAVN-DSRELLESFDKITDKIQEQSVRIAPN 397 D L++ + F V ++L + I +K + + + P+ Sbjct: 179 FNFDQLKRIAAAPENVFRVEGGFKDLGLIINSILEKSCTEILYVDPS 225 >gi|153836806|ref|ZP_01989473.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|149749952|gb|EDM60697.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|328474272|gb|EGF45077.1| hypothetical protein VP10329_16235 [Vibrio parahaemolyticus 10329] Length = 418 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 26/239 (10%), Positives = 65/239 (27%), Gaps = 6/239 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK-DQ 59 + ++++ + + IDL H + + ++Q+A+D A L+G + Sbjct: 16 LISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVDQAEAAVIAT 75 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 S+I + L + +N +Y + + + Sbjct: 76 LSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYVRVAVTDMGISQY 135 Query: 120 LKGLIPS-ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 L + + S + + I M D + ++ED + + + Sbjct: 136 LSAVFGIVKNVSASAVAGRSAAIAYTCNLTPIAMCGDPNGTVEDAWGYRPPGYDPNVDMD 195 Query: 179 YLLPPP----PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 L + + A N ++ ++ N Sbjct: 196 PSLVHELKVGDQNNTDMGPGNFQLLDFGQATGNSGAALVRDALSGAYNGCAAVGNTVTT 254 >gi|310641808|ref|YP_003946566.1| von willebrand factor type a [Paenibacillus polymyxa SC2] gi|309246758|gb|ADO56325.1| von Willebrand factor type A [Paenibacillus polymyxa SC2] Length = 600 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 59/146 (40%), Gaps = 17/146 (11%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN- 316 + E+K+ ++ LN T+T + A R L + K + H ++ + DG N Sbjct: 103 DKQELKTFIDGLNRGAYTDTSVGVKEAIRILQDGKTAGHAP--------MIVMLADGNND 154 Query: 317 ----SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP--EGQDLLRKCTDSSGQFFAV 370 +G + Q+ + Q + +G+ IY++ ++A L + G+ F Sbjct: 155 FNKTTGRTESQSAQDMAQAVAEAKKSGVPIYTIGLNADGKLNKNKLADIAQQTGGKSFIT 214 Query: 371 NDSRELLESFDKITDKIQEQSVRIAP 396 + + +L +I +++ P Sbjct: 215 SSADDLPNILSEIF--ASNLKLKVVP 238 >gi|145497681|ref|XP_001434829.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124401957|emb|CAK67432.1| unnamed protein product [Paramecium tetraurelia] Length = 648 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 57/162 (35%), Gaps = 16/162 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCT-PLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + + R+ I ++ L+ +N N K + + TN A+ + Sbjct: 257 DFLSEKDRLCLITFDGSAQRLTPLKTLTQDNKNYFKKAIYSIRASGQTNIAKGTEIAFNQ 316 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + K + T + ++DG++ G A + + E + + I+S Sbjct: 317 IQQRKMKNQVTS--------IFLLSDGQDQG--AAEYIQRQKDVVEDI----VTIHSFGY 362 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + + + + C G F+ + D + L E F ++ Sbjct: 363 GSDHDAALMSKICKVGQGSFYYIEDVKLLDEFFADALGRLSS 404 >gi|119591510|gb|EAW71104.1| collagen, type VI, alpha 3, isoform CRA_b [Homo sapiens] Length = 2210 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 18/172 (10%), Positives = 57/172 (33%), Gaps = 17/172 (9%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + + K++ +N + +N EV S ++ ++ TN Sbjct: 58 VREFLYDVVKSLAVGENDF-HFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTN 116 Query: 277 -TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T + + + + + + + + ++ +TDG + A + + Sbjct: 117 QTGKGLEYIMQ---SHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPS--------AEL 165 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 ++A + ++++ V + L++ F + + L + + Sbjct: 166 KSADVNVFAIGV--EDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVS 215 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 838 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 892 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 893 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 949 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 950 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 999 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1000 ISERVTQLT 1008 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1272 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1328 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1329 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1378 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1379 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1411 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1447 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1506 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1507 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1556 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1557 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1606 >gi|114584075|ref|XP_516178.2| PREDICTED: alpha 3 type VI collagen isoform 5 [Pan troglodytes] Length = 2976 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 18/172 (10%), Positives = 57/172 (33%), Gaps = 17/172 (9%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + + K++ +N + +N EV S ++ ++ TN Sbjct: 58 VREFLYDVVKSLAVGENDF-HFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTN 116 Query: 277 -TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T + + + + + + + + ++ +TDG + A + + Sbjct: 117 QTGKGLEYIMQ---SHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPS--------AEL 165 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 ++A + ++++ V + L++ F + + L + + Sbjct: 166 KSADVNVFAIGV--EDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVS 215 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 838 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNQQDVVN 892 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 893 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 949 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 950 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 999 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1000 ISERVTQLT 1008 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1272 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1328 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1329 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1378 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1379 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1411 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1447 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1506 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1507 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1556 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1557 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1606 >gi|22758317|gb|AAN05521.1| hypothetical protein [Oryza sativa Japonica Group] gi|31432564|gb|AAP54179.1| von Willebrand factor type A domain containing protein [Oryza sativa Japonica Group] Length = 606 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 63/184 (34%), Gaps = 25/184 (13%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG-IVGNQCTPLSN-NLNEVKSR 265 RK+ ++ ++ G +++++ A R+ ++++ + +S K Sbjct: 158 RKLALVKKAMGFVIDNLGPAD--------RLCVVSFSTEASRRTRLLRMSEVGKATAKRA 209 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG------- 318 + L TN + A R L + VI ++DG++S Sbjct: 210 VESLVDDSATNIGDGLRVAGRVLGD--------RRHKNAVSSVILLSDGKDSYVVPRRGN 261 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 +Y + + R I++ A + + + G F V + + + Sbjct: 262 GMSYMDLVPPSFASSGGRGQLAPIHTFGFGADHDAAAMNTIAESTGGTFSFVENEAAIQD 321 Query: 379 SFDK 382 SF + Sbjct: 322 SFAQ 325 >gi|222081474|ref|YP_002540837.1| von Willebrand factor, type A [Agrobacterium radiobacter K84] gi|221726153|gb|ACM29242.1| von Willebrand factor, type A [Agrobacterium radiobacter K84] Length = 329 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 61/180 (33%), Gaps = 25/180 (13%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPY 272 + V++++K + + G P + + V+ ++ Sbjct: 117 NLEARVDAVRKVVGDFVARRPGDRIGLIAFGDAPYPLAPFTMDHELVREIISGTLPGIAG 176 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+ A+ A + T +K +I +TDG ++ + L+ Sbjct: 177 PRTSLGDAVGLAIKMFEKT----------TVPEKVLIVLTDGNDTASKMP-----PLKAA 221 Query: 333 EYMRNAGMKIYSVAVSAPPE------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 E + G+ +++V + P L K + G++F D +L ++D + Sbjct: 222 EIAKRNGVVVHTVGIGDPQATGEDRLDATALEKIAETTGGRYFFGGDQAQLAAAYDVLDK 281 >gi|269961128|ref|ZP_06175496.1| conserved hypothetical protein [Vibrio harveyi 1DA3] gi|269834079|gb|EEZ88170.1| conserved hypothetical protein [Vibrio harveyi 1DA3] Length = 362 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 63/150 (42%), Gaps = 17/150 (11%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + E+ ++ + ++T+ A+ A + ++S + S Sbjct: 159 GDAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSEKSRTDVEESK 218 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDL 356 K I +TDG ++G + + + + + ++I+ +A+ P + Sbjct: 219 E--KVAIVLTDGNDTG-----SFVEPIDAAKVAKAKDVRIHVIAMGDPQTVGETALDMNT 271 Query: 357 LRKCT-DSSGQFFAVNDSRELLESFDKITD 385 +++ +S G+ F + EL +++D+I Sbjct: 272 IKRIAKESGGEAFEALNRDELAKAYDEIGK 301 >gi|260797291|ref|XP_002593637.1| hypothetical protein BRAFLDRAFT_235784 [Branchiostoma floridae] gi|229278863|gb|EEN49648.1| hypothetical protein BRAFLDRAFT_235784 [Branchiostoma floridae] Length = 371 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/293 (12%), Positives = 88/293 (30%), Gaps = 27/293 (9%) Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 E+ G + + + ++ + L+ G+ L L+ ++ Sbjct: 96 ESNGARKNVPHVAVIVTDGRSSDSVDQAALETRQSGIVLYAVGVGNYDLGQLTDIASTNE 155 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS------KN 193 + + ++ DV S+ NDN T K Sbjct: 156 TLGVVD---NFNLLDDVRNSLLSSVCSVLNDNFLTTLTIQDCNSDHISITMPCYTLLEKV 212 Query: 194 TTKSKYAPAPAPANRKIDVL-IESAGNLVNSIQKAIQEKK--NLSVRIGTIAYNIGIVGN 250 T + + + ++K + + S R+G + Y+ + Sbjct: 213 TPPCNNPVDIVFVLDGSGSVGRRNFEKVQAGVKKIVGDFNIALDSTRVGVVQYSSIVRQE 272 Query: 251 QCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 +NL ++S + + T T AM +A + + + + ++ Sbjct: 273 FALDTFSNLQGLESGIQSIPYMAGGTRTGAAMEYAIQNSFTSANGAR-----PDVGHVIV 327 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +TDG + + Q + + AG+ +++V + + L + Sbjct: 328 LVTDGRSYDDVS--------QASQKAKQAGIVVFAVGIG-DGAVESQLNQIAS 371 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/157 (13%), Positives = 54/157 (34%), Gaps = 16/157 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEK 292 + R+ + Y+ + N L ++ + ++ T T A+ R+ + E Sbjct: 38 SNTRVAVMQYSSSVRQEFALDAFNTLEDLLVGIEEIRYMRGGTRTGKALTRLRRQGFLES 97 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + + +TDG +++ + Q R +G+ +Y+V V Sbjct: 98 NGAR-----KNVPHVAVIVTDG--------RSSDSVDQAALETRQSGIVLYAVGVG--NY 142 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 L ++ V++ L + + + + Sbjct: 143 DLGQLTDIASTNETLGVVDNFNLLDDVRNSLLSSVCS 179 >gi|71988405|ref|NP_001022675.1| MUscle Attachment abnormal family member (mua-3) [Caenorhabditis elegans] gi|51587416|emb|CAH19087.1| C. elegans protein K08E5.3b, confirmed by transcript evidence [Caenorhabditis elegans] gi|51591699|emb|CAH19103.1| C. elegans protein K08E5.3b, confirmed by transcript evidence [Caenorhabditis elegans] Length = 3183 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 57/185 (30%), Gaps = 13/185 (7%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + V V + + R+G I Y+ I + + + Sbjct: 653 GSGSIGSYVFKNEVLRFVREFVELFE-IGRSKTRVGLIQYSDQIRHEFDLDQYGDRDSLL 711 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 +++ T T A+ H +E ++E+ + + I +TDG + Sbjct: 712 KGISETQYLTGLTRTGAAIQHMVQEGFSERRGARPQQSDI--ARVAIILTDGRSQD---- 765 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 N + R + +++ V L S ++F V+ ++L Sbjct: 766 ----NVTGPADSARKLSINTFAIGV-TDHVLASELESIAGSPNRWFYVDKFKDLDTRLRS 820 Query: 383 ITDKI 387 + K Sbjct: 821 MIQKA 825 >gi|71988401|ref|NP_001022674.1| MUscle Attachment abnormal family member (mua-3) [Caenorhabditis elegans] gi|22096361|sp|P34576|MUA3_CAEEL RecName: Full=Transmembrane cell adhesion receptor mua-3; AltName: Full=Muscle attachment abnormal protein 3; Flags: Precursor gi|4761646|gb|AAD29428.1|AF139060_1 transmembrane cell adhesion receptor MUA-3 precursor [Caenorhabditis elegans] gi|14530487|emb|CAA83226.2| C. elegans protein K08E5.3a, confirmed by transcript evidence [Caenorhabditis elegans] gi|14530561|emb|CAC42345.1| C. elegans protein K08E5.3a, confirmed by transcript evidence [Caenorhabditis elegans] Length = 3767 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 57/185 (30%), Gaps = 13/185 (7%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + V V + + R+G I Y+ I + + + Sbjct: 1237 GSGSIGSYVFKNEVLRFVREFVELFE-IGRSKTRVGLIQYSDQIRHEFDLDQYGDRDSLL 1295 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 +++ T T A+ H +E ++E+ + + I +TDG + Sbjct: 1296 KGISETQYLTGLTRTGAAIQHMVQEGFSERRGARPQQSDI--ARVAIILTDGRSQD---- 1349 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 N + R + +++ V L S ++F V+ ++L Sbjct: 1350 ----NVTGPADSARKLSINTFAIGV-TDHVLASELESIAGSPNRWFYVDKFKDLDTRLRS 1404 Query: 383 ITDKI 387 + K Sbjct: 1405 MIQKA 1409 >gi|165975965|ref|YP_001651558.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] gi|165876066|gb|ABY69114.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 3 str. JL03] Length = 529 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 56/486 (11%), Positives = 117/486 (24%), Gaps = 122/486 (25%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKD---------------PTTKKD 58 +++ A I+ + ++ +L+ AVLS A S R D + Sbjct: 36 FVSLESAGIIQDQARLSDSLEQAVLSLTAENNSGRKTTDYKLGGSNPSDESFNISSEVGK 95 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + I +K L Q + + IN T K + + ++ Sbjct: 96 RDHAIVTAFVKTFLPQTDEKNMHLTPL--CKTINNTSGKGHTSSSEVTCTVSGTVEHKSW 153 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENL---AISICMVLDVSRSMEDLYLQKHNDNNNM- 174 F + + + +N I + +V D+S SM Sbjct: 154 FPLKVGNLEVIPKQVNVASQSRAIKKNTFNIPIDLMVVADLSGSMNFDLDNNEIKKTGKP 213 Query: 175 ------------TSNKYLLPPPPKKSFWSKNTTKS-----KYAPAPAPA----------- 206 ++K LL ++ T + P Sbjct: 214 SKISILKEVLVELADKTLLSEDANQNNRIYVTPFALGAEINNNNCALPYSWSVESSSRTQ 273 Query: 207 ------------NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG---TIAYNIGIVGNQ 251 + D++ + K N+ Sbjct: 274 NIKNILNKQNSQYNRADLINNLVYKISTKETLDNINGKQNYNVTFSKNAFCLKDMKTSNK 333 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 ++ E + + + T + A + ++ S + K+ ++ + Sbjct: 334 GWYSRSDKLEFTNYVQSIKANGATLASSGVLVAANNMI--RDGSRTEQLKEQTKRVILVL 391 Query: 312 TDGENSGASAYQNT---------------------------------------------- 325 +DG + N Sbjct: 392 SDGNDEIVKGDPNNKVPFLNYTRITENLIYGRQEEFSSEKKRVSFGHSTTIETYLTDTQP 451 Query: 326 -LNTLQICEYM--------RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 T +C + ++ KI V + C +G +++ ND L Sbjct: 452 KKVTDGMCNVIRDKLDTLNKDKNTKIVFVEFGYKSTSKQAWEHCV-GNGNYYSANDKASL 510 Query: 377 LESFDK 382 L SF + Sbjct: 511 LNSFKQ 516 >gi|125532269|gb|EAY78834.1| hypothetical protein OsI_33939 [Oryza sativa Indica Group] gi|125575070|gb|EAZ16354.1| hypothetical protein OsJ_31816 [Oryza sativa Japonica Group] Length = 654 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 14/139 (10%), Positives = 43/139 (30%), Gaps = 21/139 (15%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + R++ L TN A+ A R + + ++ ++DG + Sbjct: 241 SGRQRALQRVSSLVADGGTNIADALRKAARVMED--------RRERNPVCSIVLLSDGRD 292 Query: 317 SGAS--------AYQNTLNTLQICEYM-----RNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 + + + + +++++ A + + S Sbjct: 293 TYTVPVPRGGGGGGDQPDYAVLVPSSLLPGGGSARHVQVHAFGFGADHDSPAMHSIAEMS 352 Query: 364 SGQFFAVNDSRELLESFDK 382 G F ++ + + ++F + Sbjct: 353 GGTFSFIDAAGSIQDAFAQ 371 >gi|115482404|ref|NP_001064795.1| Os10g0464500 [Oryza sativa Japonica Group] gi|110289213|gb|AAP54178.2| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113639404|dbj|BAF26709.1| Os10g0464500 [Oryza sativa Japonica Group] Length = 719 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 14/139 (10%), Positives = 43/139 (30%), Gaps = 21/139 (15%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + R++ L TN A+ A R + + ++ ++DG + Sbjct: 318 SGRQRALQRVSSLVADGGTNIADALRKAARVMED--------RRERNPVCSIVLLSDGRD 369 Query: 317 SGAS--------AYQNTLNTLQICEYM-----RNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 + + + + +++++ A + + S Sbjct: 370 TYTVPVPRGGGGGGDQPDYAVLVPSSLLPGGGSARHVQVHAFGFGADHDSPAMHSIAEMS 429 Query: 364 SGQFFAVNDSRELLESFDK 382 G F ++ + + ++F + Sbjct: 430 GGTFSFIDAAGSIQDAFAQ 448 >gi|22758319|gb|AAN05523.1| unknown protein [Oryza sativa Japonica Group] Length = 731 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 14/139 (10%), Positives = 43/139 (30%), Gaps = 21/139 (15%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + R++ L TN A+ A R + + ++ ++DG + Sbjct: 318 SGRQRALQRVSSLVADGGTNIADALRKAARVMED--------RRERNPVCSIVLLSDGRD 369 Query: 317 SGAS--------AYQNTLNTLQICEYM-----RNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 + + + + +++++ A + + S Sbjct: 370 TYTVPVPRGGGGGGDQPDYAVLVPSSLLPGGGSARHVQVHAFGFGADHDSPAMHSIAEMS 429 Query: 364 SGQFFAVNDSRELLESFDK 382 G F ++ + + ++F + Sbjct: 430 GGTFSFIDAAGSIQDAFAQ 448 >gi|329888194|ref|ZP_08266792.1| von Willebrand factor type A domain protein [Brevundimonas diminuta ATCC 11568] gi|328846750|gb|EGF96312.1| von Willebrand factor type A domain protein [Brevundimonas diminuta ATCC 11568] Length = 655 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 28/280 (10%), Positives = 68/280 (24%), Gaps = 24/280 (8%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 A Y + P + + + + + + + + Sbjct: 211 TAAYANVRRFIDNGRAPPRDAVRVEEMINYFDYGYTRPTSAARPFAVTATTTASPWSEGR 270 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 P+ N T + K+D+ +S +++ ++ Sbjct: 271 RI-----VHVGLQGYELPENQRRPLNLTFLVDVSGSMNSPDKLDLAKQSMNLIIDRLRPQ 325 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 R+ Y G +++ + L T M +AY + Sbjct: 326 D--------RVAVAYYAEGAGTTLAPTAGTQKLKLRCAVASLRASGGTAGATGMTNAYDQ 377 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ TDG+ + + R G+ + Sbjct: 378 AQASF--------GRNKVNRILMFTDGDFNVGV--TDDKRLEDYVADKRRTGIYLSVYGF 427 Query: 348 SAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDK 386 ++ + G V+D +E F + D+ Sbjct: 428 GRGNYQDARMQAIAQAGNGTAAYVDDLKEARRLFGPMFDR 467 >gi|308474504|ref|XP_003099473.1| CRE-MUP-4 protein [Caenorhabditis remanei] gi|308266662|gb|EFP10615.1| CRE-MUP-4 protein [Caenorhabditis remanei] Length = 2161 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 22/176 (12%), Positives = 58/176 (32%), Gaps = 14/176 (7%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + V + + + R+ + Y+ I ++ ++ Sbjct: 444 GSGSIGSYVFQTEVLRFLAEFTEL-FDIAPQKTRVSVVQYSDQIRHEFGLDNYSDRKSLQ 502 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + + T T A+ H E ++E+ + ++ + I ITDG + Sbjct: 503 NAIRNIEYLTGLTRTGAAIEHVANEAFSERRGARPVG---QVSRVAIVITDGRSQDNVTR 559 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + R +++++V V L + + + + F V+ +L Sbjct: 560 PS--------DNARKQEIQLFAVGV-TNHVLDAELEEISGAKDRTFHVSGFEDLNT 606 >gi|254283762|ref|ZP_04958730.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] gi|219679965|gb|EED36314.1| conserved hypothetical protein [gamma proteobacterium NOR51-B] Length = 325 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 64/177 (36%), Gaps = 28/177 (15%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 ++ +V+ R+G I + P + +LN ++ ++ + Sbjct: 123 DAVQRVVDQF-----VANREGDRVGLIVFGAKAYLQ--LPFTRDLNTARALVDLMQ---- 171 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLK-KFVIFITDGENSGASAYQNTLNTLQICE 333 M L + + S+ + + +I +TDG ++ + + + E Sbjct: 172 ----VGMAGPQTALGDSIGLAIRAFESSEVDDRVLILLTDGNDTA-----SKMTPINAAE 222 Query: 334 YMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 + G++IY++ + + L + S GQFF D L + +D+I Sbjct: 223 IAQLNGIEIYTIGIGDAEATGEDRIDFETLASIAERSGGQFFDAQDETALRQVYDRI 279 >gi|124485081|ref|YP_001029697.1| hypothetical protein Mlab_0254 [Methanocorpusculum labreanum Z] gi|124362622|gb|ABN06430.1| von Willebrand factor, type A [Methanocorpusculum labreanum Z] Length = 313 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 36/237 (15%), Positives = 83/237 (35%), Gaps = 47/237 (19%) Query: 181 LPPPPKKSFWSKNTT------KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 P S+N S A + +++ S+ L+ S+ ++ Sbjct: 74 ADPHIPLVSASENVNLVVALDVSASMSASDYSPTRVEAAKGSSEILIRSLSESDTAG--- 130 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 + + G LS++ N V SRL +++ T + A + Sbjct: 131 -----VVIFESGASSAAY--LSSDKNRVVSRLEQVSVKTGKTALGDGLALAVDMVTAIPA 183 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 ++ V+ ++DG ++ + + EY +N+G+ +Y++ V + Sbjct: 184 GTY----------IVVLLSDGVSNSGM-----ITPQEAAEYAKNSGVVVYTIGVGSESPV 228 Query: 353 -------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 ++ LR + + G++F D + L++ + I I + V + Sbjct: 229 EVSSDGVQQYASLDEETLRSIAEITGGEYFRSVDEKTLVQIQNTIQTSIIREPVETS 285 >gi|303252178|ref|ZP_07338346.1| hypothetical protein APP2_1152 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] gi|302648961|gb|EFL79149.1| hypothetical protein APP2_1152 [Actinobacillus pleuropneumoniae serovar 2 str. 4226] Length = 532 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 59/488 (12%), Positives = 124/488 (25%), Gaps = 123/488 (25%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVS----------------DRTIKDPTTKK 57 +++ A I+ + ++ +L+ AVLS A S + + + Sbjct: 36 FVSLESAGIIQDQARLSDSLEQAVLSLTAENNSGRKDNDYKLSGSSNKENDSFDISSEVG 95 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + S + K +K L Q + E IN T K + I ++ Sbjct: 96 KRDSQMATKFVKAFLPQTN--EEKMHLTPVCKTINNTNGKGHTSSSEVTCTVSGTIEHKS 153 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENL---AISICMVLDVSRSMEDLYLQKHNDNNNM 174 F + + + + +N I + +V D+S SM K+ NN Sbjct: 154 WFPLKVGTVEVIPHEVNVASKSKAFKKNTFNIPIDLMVVADLSGSMRYDITNKYETNNET 213 Query: 175 T------------SNKYLLPPPPKKSFWSKNTTKS-----KYAPAPAPA----------- 206 + + K LL + T + + P Sbjct: 214 SKLGILKDVLIELAEKTLLSEDANQHNRIYVTPFALGAEIDASSCALPYHWDKGKDPDTP 273 Query: 207 ---------------NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG---TIAYNIGIV 248 + + + + +K Sbjct: 274 LKIKNILSKKEGNNQQSRAEFIDNFVYKMNTQATLDNIGEKQNYKVTFPKGVFCLKNMKN 333 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 N + ++ + L T + A + S + K+ + Sbjct: 334 KNHGWFSRKDKSDFTGYVRSLRADGATLASSGVLVAANNMIKG--GSRTKELKEQTKRVI 391 Query: 309 IFITDGEN----------------------------------------------SGASAY 322 + ++DG + + + Sbjct: 392 LVLSDGNDEIVKGDPNSKVPFLNYTRITENLIYGRQEEFLSGKKKVSFGNTTIETYLTDT 451 Query: 323 QNTLNTLQICEYMR--------NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 Q T +C +R + KI V + ++ + C + +++ ND Sbjct: 452 QPKEVTNGMCNVIRDKLDKLNNDKNTKIVFVEFGYASKSKEAWKHCVEGDKNYYSANDKA 511 Query: 375 ELLESFDK 382 LL SF + Sbjct: 512 SLLNSFKQ 519 >gi|161529149|ref|YP_001582975.1| von Willebrand factor type A [Nitrosopumilus maritimus SCM1] gi|160340450|gb|ABX13537.1| von Willebrand factor type A [Nitrosopumilus maritimus SCM1] Length = 316 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 26/231 (11%), Positives = 70/231 (30%), Gaps = 47/231 (20%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 S S+ A ++D + NL+ + + + Sbjct: 85 ENGINLSIVLDGSESMAATDYEPTRLDAAKNAINNLILKMGPQHNVG--------VVLFE 136 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 G + L+ + + + ++ + T + + + Sbjct: 137 SGAT--TVSYLTPDKEKSVNAISSIEQGLGATAIGDGLALGVDMASSIPDK--------- 185 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA-------------- 349 K VI ++DG ++ + + EY + ++I+++ + + Sbjct: 186 -KGVVILLSDGVHNSGL-----VTPEEATEYAKINNVQIHTIGLGSIEPVFLRDDIYGEP 239 Query: 350 --PPEGQDLLRKCT-DSSGQFFAVNDSRELLESF----DKITDKIQEQSVR 393 ++ L +SG ++ D + L E F + +I+ ++R Sbjct: 240 QYAELDEETLVIIAQQTSGNYYKSLDEQTLNEIFVNLSSNLAYEIEYSTIR 290 >gi|126732236|ref|ZP_01748037.1| hypothetical protein SSE37_18135 [Sagittula stellata E-37] gi|126707318|gb|EBA06383.1| hypothetical protein SSE37_18135 [Sagittula stellata E-37] Length = 710 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 54/191 (28%), Gaps = 20/191 (10%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 K+ +L +S ++ + + + + Y +++ + L Sbjct: 366 ANKLPLLKQSFRLMLGQLGEED--------MVSIVTYAGSAGRVLEPTKASDRQTILDAL 417 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 ++L +T + AY + VI TDG+ + + Sbjct: 418 DRLEAGGSTAGQAGLQQAYATATEMARDGAVSR--------VILATDGDFNVG--ISDPD 467 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + E R G + + ++ L E+ + D+ Sbjct: 468 DMKDYIETQRGTGTYLSVLGFGRGNLDDATMQALAQHGNG--MAAYIDTLSEAQKVLVDQ 525 Query: 387 IQEQSVRIAPN 397 + V IA + Sbjct: 526 LTGALVPIADD 536 >gi|31789431|gb|AAP58546.1| hypothetical protein [uncultured Acidobacteria bacterium] Length = 327 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 71/192 (36%), Gaps = 34/192 (17%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 A ++D + ++ V RIG + ++ +PL+ + + Sbjct: 112 TAAGRTRMDAVKDAVRTFVR---------GRRDDRIGLVVFSDNAYV--ISPLTFDHQYL 160 Query: 263 KSR---LNK--LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 ++ L T + A L + R + V+ TDGE++ Sbjct: 161 LDYLGFVDGEILLGEGQTAIGDGLALASAVL------ARQAGRDARGHQVVVLFTDGESN 214 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-----GQDLLRK-CTDSSGQFFAVN 371 + +++ ++AG++++ + V E G LLR+ + G++FA + Sbjct: 215 RGR------DPIEVVGEAKSAGIRVHVIGVDLDAEVKTRPGVQLLRRGVVAAGGRYFAAD 268 Query: 372 DSRELLESFDKI 383 R+LL + I Sbjct: 269 SERDLLTASRTI 280 >gi|87306384|ref|ZP_01088531.1| hypothetical protein DSM3645_08632 [Blastopirellula marina DSM 3645] gi|87290563|gb|EAQ82450.1| hypothetical protein DSM3645_08632 [Blastopirellula marina DSM 3645] Length = 1030 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 61/157 (38%), Gaps = 17/157 (10%) Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 + GN + N + ++++++ P + P++ A R + Sbjct: 490 QWLWTSNGNGLIRVGPNRPGMLAKMSRMTPGDMPQFDPSLQMALRAFNQLPP-------N 542 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA-PPEGQDLLRKC 360 K +I I+DG+ + N + + G+K+ +VA+ P L+K Sbjct: 543 EVAVKHMIIISDGDP-------SPANPFTLSAIAKA-GIKVTTVAIGTHGPANSLELKKI 594 Query: 361 TD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + G+++ V + + L + + +I + V+ P Sbjct: 595 ASATGGKYYEVTNPKALPRIYQREARRIAQPLVKDHP 631 >gi|166031603|ref|ZP_02234432.1| hypothetical protein DORFOR_01303 [Dorea formicigenerans ATCC 27755] gi|166028580|gb|EDR47337.1| hypothetical protein DORFOR_01303 [Dorea formicigenerans ATCC 27755] Length = 685 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 31/271 (11%), Positives = 84/271 (30%), Gaps = 37/271 (13%) Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 +N A+ +D S D + L + N Sbjct: 268 YQYIEPKNQAVVSIQQIDASEFPTVKLYMSIKDKTTGNVIENLDDAFFYINKQDANAKYV 327 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG----------------TI 241 K A + + L + + N + ++ Sbjct: 328 KQVVKSANQLNEKEALK--VDMVADVSGSMDGSPLNEAKQVMSDFVGSVQFDAGDLVELT 385 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 +++ G+ Q S++ + + +N L + T+ Y A++ A + + Sbjct: 386 SFSTGVCLEQE--FSDDAATLTNDINNLVTGDMTSLYDALYTAVERVA-----------A 432 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 + VI TDG ++ ++ + + + ++ + + + D+ T Sbjct: 433 QNGARCVIAFTDGNDNYSNCTK-----EDVVNVANRYHVPVFIIGIGSIDYA-DVNDIAT 486 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G ++ V+D + + +++I ++ + Sbjct: 487 QTGGMYYNVSDVTSMDKIYEEIYQMEKQLYL 517 >gi|307247518|ref|ZP_07529563.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 2 str. S1536] gi|306856021|gb|EFM88179.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 2 str. S1536] Length = 510 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 59/488 (12%), Positives = 124/488 (25%), Gaps = 123/488 (25%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVS----------------DRTIKDPTTKK 57 +++ A I+ + ++ +L+ AVLS A S + + + Sbjct: 14 FVSLESAGIIQDQARLSDSLEQAVLSLTAENNSGRKDNDYKLSGSSNKENDSFDISSEVG 73 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + S + K +K L Q + E IN T K + I ++ Sbjct: 74 KRDSQMATKFVKAFLPQTN--EEKMHLTPVCKTINNTNGKGHTSSSEVTCTVSGTIEHKS 131 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENL---AISICMVLDVSRSMEDLYLQKHNDNNNM 174 F + + + + +N I + +V D+S SM K+ NN Sbjct: 132 WFPLKVGTVEVIPHEVNVASKSKAFKKNTFNIPIDLMVVADLSGSMRYDITNKYETNNET 191 Query: 175 T------------SNKYLLPPPPKKSFWSKNTTKS-----KYAPAPAPA----------- 206 + + K LL + T + + P Sbjct: 192 SKLGILKDVLIELAEKTLLSEDANQHNRIYVTPFALGAEIDASSCALPYHWDKGKDPDTP 251 Query: 207 ---------------NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG---TIAYNIGIV 248 + + + + +K Sbjct: 252 LKIKNILSKKEGNNQQSRAEFIDNFVYKMNTQATLDNIGEKQNYKVTFPKGVFCLKNMKN 311 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 N + ++ + L T + A + S + K+ + Sbjct: 312 KNHGWFSRKDKSDFTGYVRSLRADGATLASSGVLVAANNMIKG--GSRTKELKEQTKRVI 369 Query: 309 IFITDGEN----------------------------------------------SGASAY 322 + ++DG + + + Sbjct: 370 LVLSDGNDEIVKGDPNSKVPFLNYTRITENLIYGRQEEFLSGKKKVSFGNTTIETYLTDT 429 Query: 323 QNTLNTLQICEYMR--------NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 Q T +C +R + KI V + ++ + C + +++ ND Sbjct: 430 QPKEVTNGMCNVIRDKLDKLNNDKNTKIVFVEFGYASKSKEAWKHCVEGDKNYYSANDKA 489 Query: 375 ELLESFDK 382 LL SF + Sbjct: 490 SLLNSFKQ 497 >gi|116251678|ref|YP_767516.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115256326|emb|CAK07407.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 329 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 66/187 (35%), Gaps = 36/187 (19%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++D + + + V RIG +A+ P + + V++ + Sbjct: 121 RVDAVKQVVADFVGR---------RPGDRIGLVAFGDAPYP--LAPFTMDHALVETMIAD 169 Query: 269 LNPYEN---TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 P T+ A+ A + T +K +I +TDG ++ + Sbjct: 170 AVPGMAGPRTSLGDALGLAVKMFEKT----------TVPEKVLIVLTDGNDTASRMP--- 216 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLE 378 L+ E ++ G+ +++V + P L+K + + G++F D +L Sbjct: 217 --PLKAAEIAKSKGVVVHAVGIGDPLATGEDKLDTATLQKIAEKTGGRYFFGGDQAQLAS 274 Query: 379 SFDKITD 385 + + Sbjct: 275 IYQVLDQ 281 >gi|328953621|ref|YP_004370955.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] gi|328453945|gb|AEB09774.1| hypothetical protein Desac_1940 [Desulfobacca acetoxidans DSM 11109] Length = 376 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 45/335 (13%), Positives = 100/335 (29%), Gaps = 31/335 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +TA+++ V F AID+ ++ I+ +MQSA+DAAV G + Sbjct: 16 ITALLLPVLIGFTGLAIDIGNLYVIKTRMQSAVDAAVCGGGLKLP-----NQGLAMTTAN 70 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S I + +Y ++ + A +IN + F Sbjct: 71 SFITSNGFDPNDATITYTQDTVNNPAGSPEINCSMTNQ-----------------VPTFF 113 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN---MTSN 177 GL ++++ + GI++ + + D + + + + N S Sbjct: 114 LGLFGYPNISITVSAKGILQTGGAGGPFNYAIFSDQNLPISGNQKITGSVHTNHQLTISG 173 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 + + + + + + A I V + + I S Sbjct: 174 NTKISGAAEGATGVRLSGSNTIGSVQADTTANIHVSGSNTIGSQSGGATQIAMPDFTSQI 233 Query: 238 IGTIAYNIG-IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + +N+L+ + T + L N + Sbjct: 234 EAAAQAQGTVYYSAKILSGTNDLDGNIWVQGDITMSGMTTSGTG-----AILANGNITIS 288 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + ++ +G + + ++ N N +I Sbjct: 289 GNTTIGGNGQVCLYSANGNITVSGSFNNGSNGSEI 323 >gi|170744040|ref|YP_001772695.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168198314|gb|ACA20261.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 654 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 34/290 (11%), Positives = 75/290 (25%), Gaps = 25/290 (8%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 A Y I + L L P A + + S + Sbjct: 194 TASYGIVRDALNRNHLPPPAAVRTEELINYFPYAYPAPASPDAPFRVTASVFPSPWAEGR 253 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 + + P + S AP ++ ++ +S L+ ++ Sbjct: 254 KL-LHIGIRGYAVAPAERPPANLVFLVDTSGSMAAP----NRLPLVKQSLAMLLTTLDA- 307 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 R+ +AY + + + + + L + +T + AY Sbjct: 308 -------RDRVALVAYAGEVGTVLEPTPAGEAGRILAAIETLQAHGSTAGGEGIRQAYAL 360 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + VI TDG+ + ++ L E R G+ + + Sbjct: 361 AARHFDPKAVNR--------VILATDGDFNVGITGRDELTGFVARE--RRKGIFLSVLGF 410 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L++ E+ + ++ + IA + Sbjct: 411 GMGNLNDALMQALAKDGNGV--AAHIDTAQEARKVLVEEATSTLIPIARD 458 >gi|153010351|ref|YP_001371565.1| hypothetical protein Oant_3028 [Ochrobactrum anthropi ATCC 49188] gi|151562239|gb|ABS15736.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 605 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 40/310 (12%), Positives = 83/310 (26%), Gaps = 70/310 (22%) Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 V + ++N + P + N T + Sbjct: 294 VESRPGTYALDVTPPSDNNPDTLFVPMFGPAEYYNTDSRGNVTSTVLNSWWQDDTSLAYS 353 Query: 213 LIESA---GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 +S L +S+ K + ++ + + + +++ + + Sbjct: 354 PRQSDLKKYYLRDSLDKIYRGGRSNDGGPNYSCTSSPLTPLTDVTTEQGMKTIQTAIKAM 413 Query: 270 NPYENTNTYPAMHHAYRELYNEKE-SSHNTIGSTRLKKFVIFITDGENSGASAY------ 322 P TN AM +R + + K VI +TDG N+ Sbjct: 414 VPSGGTNVPEAMAWGWRTIVRGAPFTEARPSTERGNDKVVIVLTDGANTYYKYDGLAGSG 473 Query: 323 --------------------------------------QNTLNTLQI-------CEYMRN 337 N+ T + C+ ++ Sbjct: 474 PDRAANFSYYSAHGYTARITKHYSQARLFQESGVSVSQNNSTYTKAMNARFAKLCDNAKS 533 Query: 338 AGMKIYSVAVSAPPEGQ------DLLRKCTDS---------SGQFFAVNDSRELLESFDK 382 A + + +VA+ DLLR C+ + + F + EL E+F + Sbjct: 534 ANIIVMTVALDLSETNSTEKAQIDLLRSCSSNSRVRTESGRPAKLFWNSTGGELSETFRQ 593 Query: 383 ITDKIQEQSV 392 I D++ + Sbjct: 594 IGDELSNLRI 603 Score = 44.5 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 51/201 (25%), Gaps = 26/201 (12%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHL 72 A+D A++M +RN +Q++LDAA L+ + + + +L Sbjct: 28 GMVAVDSANLMRVRNNVQASLDAAALAVGRRFSTGESQT-------VVQVYGAQVFTANL 80 Query: 73 KQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN-LFLKGLIPSALTNL 131 A ++ K+ ++ A + + + L Sbjct: 81 ---------TALSADAVNFDVAFPKDKTTDQQIQATAGFTYKSLFGVIASRLTGDDWDQN 131 Query: 132 S---LRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN------KYLLP 182 S + L + +D +RS L+ Sbjct: 132 QYTLSSSVRLKNTIEVALVLDNSRSMDETRSGSTKKRIDLLKEAASQLVETMAAQSTLIT 191 Query: 183 PPPKKSFWSKNTTKSKYAPAP 203 +S P Sbjct: 192 HVENPVQFSLVPFAGSVNVGP 212 >gi|223936327|ref|ZP_03628239.1| von Willebrand factor type A [bacterium Ellin514] gi|223894845|gb|EEF61294.1| von Willebrand factor type A [bacterium Ellin514] Length = 338 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 54/180 (30%), Gaps = 45/180 (25%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYEN-TNTYPAMHHAYRELY 289 S RIG + + L + + L +L + N T A+ + L Sbjct: 132 QSDRIGLVVFGTQAYVAVPPTL--DHEFLLKNLERLGIGSINGNQTAIGSALSTSMNRL- 188 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 K +I +TDG+N+ L E R G+KIY++ V Sbjct: 189 ---------RELKSKSKIIILMTDGQNNAGKVP-----PLTAAEAARALGIKIYTIGVGT 234 Query: 350 P-----------------------PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L + ++ +++ + + L + + I Sbjct: 235 KGVARMAVGTDPFSGQKIYQQVPVDIDEGTLTSISKMTNAKYYRADSTATLEKIYADIDR 294 >gi|148726250|emb|CAN88322.1| matrilin 1 [Danio rerio] gi|148726498|emb|CAN88268.1| matrilin 1 [Danio rerio] Length = 277 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 51/159 (32%), Gaps = 19/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + R+G + Y + + ++K+ P T T A+ A Sbjct: 69 SVGPDATRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVA 128 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E S + K I +TDG N I R AG++I+++ V Sbjct: 129 FSEAEGG---RKSPDISKVAIIVTDGRPQD--------NIRDIAARAREAGIEIFAIGVG 177 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 LR+ V +L + F + Sbjct: 178 R--VDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 >gi|332970883|gb|EGK09860.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 448 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 60/194 (30%), Gaps = 22/194 (11%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI----VGNQC 252 S K+++ + +S+ + + + G+ + + Sbjct: 147 SGSMAGQVDGGMKMNLAKAAVERFASSLPENAKVSLWVYGHKGSNSKKDKPVSCKSTEEV 206 Query: 253 TPL-SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 PL + + L++ T +M A EL + + Sbjct: 207 YPLGTYQEEKFSQSLDQFRATGWTPIAASMKAAREELQKN--------SGEDATHMLYIV 258 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMK--IYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 +DG + + + + + + +K + + GQ L+K ++ G + Sbjct: 259 SDGVETCGG------DPVAEAKKLNQSKIKAVVNIIGFDVDDAGQQALQKVAEAGGGEYE 312 Query: 370 -VNDSRELLESFDK 382 V ++L FD+ Sbjct: 313 TVESEQDLRSYFDE 326 >gi|327265755|ref|XP_003217673.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Anolis carolinensis] Length = 1078 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 39/341 (11%), Positives = 98/341 (28%), Gaps = 22/341 (6%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A K++ K ++ E D + ++ T D+ Sbjct: 72 LADAAERFQKAHRWQDNIKEENIEYYDSKADTEY--DDLDGEEIERDTSNSLKLEFTDDE 129 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L +E E+ ++ + + Sbjct: 130 NFKTKVNYSYAA-VQIPTDIYKGSTVILNELNWTEALEDVFVENRKEDPSLLWQVFGSAT 188 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 N + P + S + + + + ++ S Sbjct: 189 GVTRFYPATPWRAPNKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 248 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 ++++++ + + + ++ +V N K ++ + T+ Sbjct: 249 YDMLDTLSDDDYVNVASFNQKAQAVSCFTHLVQANIR----NKKVFKEKVEVMEARGTTD 304 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ +L N S K ++ TDG N Sbjct: 305 YKAGFEFAFEQLQNSNISRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP------- 351 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 352 NKTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 392 >gi|84385834|ref|ZP_00988864.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] gi|84379150|gb|EAP96003.1| hypothetical protein V12B01_12445 [Vibrio splendidus 12B01] Length = 359 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 67/154 (43%), Gaps = 19/154 (12%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + + E+ ++ + ++T+ A+ A + ++ S S+ Sbjct: 154 GDAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSEKQSAAVQDSS 213 Query: 303 ----RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------ 352 +K VI +TDG ++G + + + + + G++I+ +A+ P Sbjct: 214 VDANEKEKVVIVLTDGNDTG-----SFVEPIDAAKVAKAKGVRIHVIAMGDPQTVGEVAL 268 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + +++ +S G+ F + EL +++ +I + Sbjct: 269 DMETIKRVAQESGGEAFEALNRDELTKAYAQIGE 302 >gi|218190303|gb|EEC72730.1| hypothetical protein OsI_06342 [Oryza sativa Indica Group] Length = 585 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 33/184 (17%), Positives = 66/184 (35%), Gaps = 29/184 (15%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL----SNNLNEVK 263 ++DVL E+ ++ + R+ +A+N V T L N + Sbjct: 75 SRLDVLKEAMKFIIRKLDDGD--------RLSIVAFNDRPVKEYSTGLLNISGNGRRIAE 126 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++ L T PA+ A R L S F++ +TDG+++ + Sbjct: 127 KKVDWLEARGGTALMPALEEAIRVLDC------RPGDSRNSVGFILLLTDGDDTSGFRWS 180 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + + + +++ + A + LL +S G + V+D DKI Sbjct: 181 RDVINGAVGK------YPVHTFGLGAAHSSEALLHIAQESRGTYSFVDDEN-----MDKI 229 Query: 384 TDKI 387 + Sbjct: 230 AGAL 233 >gi|47205231|emb|CAG06181.1| unnamed protein product [Tetraodon nigroviridis] Length = 427 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 73/220 (33%), Gaps = 23/220 (10%) Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 N+ + + + P + + + + E +LV+S++ Sbjct: 29 RRNHAIDPSSTVSPASGSSCRNGPIDLVFIVDSSRSVRPTEFEKAKEFLQDLVDSLE--- 85 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRE 287 S R+G + Y + + +K L ++ P T T A+ A + Sbjct: 86 --VGLDSTRVGLVNYASTVRMEFPLKAHFSKPALKGALARVEPLASGTMTGLAIRTAVEK 143 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + + + ST++ + + +TDG ++ R +G++IY+V V Sbjct: 144 AFAAEAGAR--LNSTKVARVAVVVTDGRPQD--------EVERVSAAARESGIEIYAVGV 193 Query: 348 SAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR F V +L F + Sbjct: 194 --DRADRTSLRLMASQPHEDHVFYVETYGVIEKLTSRFRE 231 >gi|126733489|ref|ZP_01749236.1| von Willebrand factor, type A [Roseobacter sp. CCS2] gi|126716355|gb|EBA13219.1| von Willebrand factor, type A [Roseobacter sp. CCS2] Length = 699 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 43/378 (11%), Positives = 93/378 (24%), Gaps = 41/378 (10%) Query: 20 AHIMYIRN-QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYI 78 R Q + +L+AA + + I + I P D + + + Sbjct: 190 MRQQQTRAVQGRQSLNAAP-AAPSEITAAPRIVVPAPSVDDVTIMPAPNTETFANDDPNP 248 Query: 79 RENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGI 138 + + Y + +L L P+ + Sbjct: 249 LKITAEEPVSTFSIDVDTPA------------YAVVRSSLSRGQLPPAQAVRIEELVNYF 296 Query: 139 IERSSENLAISICMVLDVSRSMEDL-YLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 A V+ + + PP + +T+ S Sbjct: 297 PYDYPTPDAGEAPFRPTVTTFQTPWNADTQLVHIALQGQMPEVAARPPLNLVFLIDTSGS 356 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 P K+ +L +S +++ ++ ++ + Y ++ Sbjct: 357 MDDP------TKLPLLKQSFRLMLDQLRPED--------QVAIVEYAGSAGQVLVPTSAS 402 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + L +TN + AY +E VI TDG+ + Sbjct: 403 ERTTILQAIQSLGAGGSTNGQGGLEQAYSVAEAMREDGEVNR--------VILATDGDFN 454 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 N R G + + ++ + L Sbjct: 455 VGL--SNPDALKDFIADKRETGTYLSVLGFGRGNLDDATMQALAQNGNG--TAAYIDTLS 510 Query: 378 ESFDKITDKIQEQSVRIA 395 E+ + D++ IA Sbjct: 511 EAQKVLVDQLSGALFPIA 528 >gi|290985353|ref|XP_002675390.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] gi|284088986|gb|EFC42646.1| von Willebrand factor type A domain-containing protein [Naegleria gruberi] Length = 923 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 22/196 (11%), Positives = 69/196 (35%), Gaps = 22/196 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN--NLNEVKSRL 266 K+D++ + +V+ +++ R+ + ++ + N + + K Sbjct: 707 KLDMVKSTLSFMVDQLKEKD--------RVAIVEFDTQVKTNLDLTKMDIEGKKKAKQVS 758 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + ++P TN A+ + + L + ++ VI TDG + N + Sbjct: 759 SAISPGSCTNLSGALFTSLKLLASRQQ-------EKNEVTSVILFTDGLANRGLISTNEI 811 Query: 327 --NTLQICEYMRN-AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF-AVNDSRELLESFDK 382 N + + + + + + I++ ++L + + + ++ ++F Sbjct: 812 LQNMQDLMDELLSTSNVTIHTFGFGQD-TDANMLTSIAQKGNGLYDYLETADDIPKAFGN 870 Query: 383 ITDKIQEQSVRIAPNR 398 + + + R Sbjct: 871 VIGNLVSVVGQNIKIR 886 >gi|284053937|ref|ZP_06384147.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] Length = 339 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 56/158 (35%), Gaps = 15/158 (9%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 N+ RI +A++ + + +K +++ L T + EL K Sbjct: 1 NVGDRISVVAFDHRAKVLVPNQDIADPDGIKKKIDGLRCSGGTAIDEGLKLGIEELGKGK 60 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + +TDGE + + + L++ + + I S+ Sbjct: 61 QDRISQG---------FLLTDGE----NEHGDNKRCLKLAKLATEYKLTINSLGFGDDWN 107 Query: 353 GQDLLRKCTDSSGQFF-AVNDSRELLESFDKITDKIQE 389 QD+L K D+ G + + + F ++ ++Q Sbjct: 108 -QDILEKIADAGGGALGYIEYPEQAIAEFGRLFTRMQS 144 >gi|73958318|ref|XP_547049.2| PREDICTED: similar to integrin alpha X precursor [Canis familiaris] Length = 1149 Score = 65.7 bits (158), Expect = 1e-08, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 58/174 (33%), Gaps = 17/174 (9%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 N V ++ Q + + S+N + +++L T+T Sbjct: 162 NFVKAVMSQFQRPSTQFSLMQFSNNFRVHFTFEVFTYSSNPLALLDSVSQL--GGLTHTA 219 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ EL++ + + K +I ITDG+ ++L + A Sbjct: 220 TAIRIVTNELFSASKGAR-----KDASKILIVITDGQ-----KKGDSLGYEDVIPMAEAA 269 Query: 339 GMKIYSVAVSAPPEGQ---DLLRKCTDSSGQFF--AVNDSRELLESFDKITDKI 387 G+ Y+V V + L + V + L + +++ +KI Sbjct: 270 GIIRYAVGVGTAFQKMQSWKELNDIASKPSHEYIFKVENFDALRDIQNQLKEKI 323 >gi|221128149|ref|XP_002161198.1| PREDICTED: similar to inter-alpha trypsin inhibitor, heavy chain 3, partial [Hydra magnipapillata] Length = 464 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 4/153 (2%) Query: 233 NLSVRIGTIAYNIGIVGNQ-CTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 N R+ I ++ + + TP++ N + + ++P TN + E+ + Sbjct: 83 NEKDRLCLITFDTSVYLDFKLTPMTPMNKYQTLKIIKDISPGSMTNLCGGLMKGLCEVID 142 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + N + S L G + + T + A IY+ + Sbjct: 143 RADEEKNEVASVLLFTDGFANKGGLTNIYCSSSQTAKYTIGIVGPKTADASIYTFGFGSN 202 Query: 351 PEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 Q +L++ +D+ G ++ + + + E+F + Sbjct: 203 HNAQ-MLKEISDAGSGMYYYIENVDMIAEAFGQ 234 >gi|61806576|ref|NP_001013521.1| hypothetical protein LOC541376 [Danio rerio] gi|60649627|gb|AAH90437.1| Zgc:113325 [Danio rerio] Length = 450 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 41/361 (11%), Positives = 97/361 (26%), Gaps = 25/361 (6%) Query: 18 DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY 77 D+A ++ + + AL+ + + + Sbjct: 52 DIAKLLNRKRK---ALER----LAREAEDIQREHSWQDAIKENDIQYYDAKADSDVMDDD 104 Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 EN D + D N + A +IPT+ I + L Sbjct: 105 DGENLLDNPNSLSLEFVDDPNFKNKVNYSYTA-VQIPTDIYKGSPTILNELNWTQSLERV 163 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 IE S E+ ++ + + K N + P + S Sbjct: 164 FIENSREDPSLRWQVFGSTTGVTRYYPATKWKAPNKIDLYDVRRRPWYIQGASSPKDMVI 223 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + + + + ++ S +++++ + + + + Sbjct: 224 IVDVSGSVSGLTLKLMKTSVIEMLDTLSDDDYVNVARFNEKAYAVVPCFTTLVQANI--- 280 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N K + + T+ A+ +L N+ + K ++ TDG Sbjct: 281 KNKKIFKEAVMNMQAKGTTDYKTGFQFAFDQLLNDTSAPRANCN-----KMIMMFTDGGE 335 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 A N N +++++ +V L+ + G +F + Sbjct: 336 DRAQDIFEKYNWP-------NRTVRVFTFSVGQHNYDVTPLQWIACFNKGYYFEIPSIGA 388 Query: 376 L 376 + Sbjct: 389 I 389 >gi|224370036|ref|YP_002604200.1| hypothetical protein HRM2_29490 [Desulfobacterium autotrophicum HRM2] gi|223692753|gb|ACN16036.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 598 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 29/170 (17%), Positives = 64/170 (37%), Gaps = 47/170 (27%) Query: 254 PLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + + LN L P T+ A+ A E E +K +I Sbjct: 144 PLTLDHSAFNLFLNALEPDYLPVGGTDLGGAIETALNGFEKEVE----------SEKAII 193 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------- 352 ITDGEN+ +++++ + + G+KI+ + V +P Sbjct: 194 LITDGENTTG-------DSIEMAKKAADQGVKIFCIGVGSPEGAPVPDSAGGFKKDRSGK 246 Query: 353 ------GQDLLRKCTDSSGQFF--AVNDSRELLESF-DKITDKIQEQSVR 393 + L+K + + +V +L + +I ++++++++ Sbjct: 247 IIISRVDEPALKKIAAMTQGVYVRSVAGDMDLDRIYDQEILGRMEKKTLK 296 >gi|290543406|ref|NP_001166514.1| cochlin [Cavia porcellus] gi|195970365|gb|ACG60666.1| coagulation factor C [Cavia porcellus] Length = 553 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + + T T A+ R ++ S Sbjct: 408 KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRSIRYMSGGTATGDAISFTVRNVFGPVRDS 467 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ ITDG++ + +AG+ I+SV V+ P Sbjct: 468 PNK-------NFLVIITDGQSYD--------DVRGPAAAAHDAGITIFSVGVAWAPLDD- 511 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L I I Sbjct: 512 -LKDMASKPKESHAFFTREFTGLEPIVSDIIRGIC 545 >gi|156350148|ref|XP_001622163.1| hypothetical protein NEMVEDRAFT_v1g221080 [Nematostella vectensis] gi|156208611|gb|EDO30063.1| predicted protein [Nematostella vectensis] Length = 2040 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 56/166 (33%), Gaps = 23/166 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYN 290 + G ++Y N + + S ++ +N P T+T + + L+ Sbjct: 238 SESGIHTGVVSYGTKAKTNLGFDVHFSQANFNSAIDGINFPGGATDTGNGIRKVMKSLFK 297 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + ++ IT G+++G + E +R +G++++ V V Sbjct: 298 TSKRR-------SIPHILVLITAGKSTGDPSLN--------AEELRASGVRLFCVGVG-G 341 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + L S + L FD ++ + RI Sbjct: 342 AYDRTQLDAIASSPSTTYV------LTAGFDDLSGLVPTLVSRITK 381 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 19/177 (10%), Positives = 47/177 (26%), Gaps = 15/177 (8%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN-KLNPYENTNT 277 + +I +A + + + + + + ++ + P TN Sbjct: 432 SFFKTIYQAFWTPFGSTHVGLVVFGADSTMVFDFDNKLKDKPAIDAAIDASVFPSGQTNL 491 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ S + ++ ++ + + Q +R Sbjct: 492 GAALQQTKDLSLGYLFGS---KHNDGHRRTLVVLA--------GSTAEDDVFQGASDLRM 540 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSV 392 G+ +Y V P L + V +L + +KI+E Sbjct: 541 NGVTVYCVGAG-PNYDSAQLDGIASTPADSNVLTVASYNDLPGISQTLINKIEEGRT 596 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 39/133 (29%), Gaps = 19/133 (14%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN---TYPAMHHAYREL 288 + R+G + + ++ EV S ++ + N+ A+ A L Sbjct: 1683 SSDKTRVGVVVF-NSPSVQFPLDNYSSKEEVYSAIDTITQSSNSGPGTVGEAIAFANTNL 1741 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + K S K +I IT ++ + + + + + V V Sbjct: 1742 FKGKTRSQ-------TPKILIVITSKKSGDDVTSPS--------QDAKRENITMLVVGVG 1786 Query: 349 APPEGQDLLRKCT 361 D+ Sbjct: 1787 PGASKSDMDVIAA 1799 >gi|312080770|ref|XP_003142742.1| hypothetical protein LOAG_07160 [Loa loa] gi|307762094|gb|EFO21328.1| hypothetical protein LOAG_07160 [Loa loa] Length = 634 Score = 65.3 bits (157), Expect = 1e-08, Method: Composition-based stats. Identities = 29/168 (17%), Positives = 60/168 (35%), Gaps = 13/168 (7%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + I Y+ I +++ ++K +N + T T A+ H E Sbjct: 478 DVSPQQTHVAVIQYSDVIRHEIDLNQYSSMEQLKQAINGIEYLTGLTRTGAAIQHVTDEG 537 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E+ + + + +I ITDG + R +++++V V Sbjct: 538 FSERRGARPIGNG--VPRILIVITDGRSQDDVTI--------AVRNARMKQIQLFAVGV- 586 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 L + S+ + F VN +L K+ ++ IAP Sbjct: 587 TNHALDSELEVISGSTKRTFHVNAFEDLNARLRSAIQKVTCPAI-IAP 633 >gi|221127586|ref|XP_002157796.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 449 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 24/131 (18%), Positives = 50/131 (38%), Gaps = 14/131 (10%) Query: 259 LNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 ++ R++K+ P T T A+ A + +++ + + + +I + DG + Sbjct: 94 KQDIFDRIDKIPYPGYRTATDDALRVANKYMFSLSGGAR-----QGVPQVLILVNDGRCT 148 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRE 375 + N ++ G+ I+ V +S + ++LL FF V E Sbjct: 149 V-----CSENVSSASAPLKEKGVSIFCVGISKTVDKKELL-SIASEPAEDHFFYVETIDE 202 Query: 376 LLESFDKITDK 386 L K+ K Sbjct: 203 LPTFISKLHKK 213 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 66/190 (34%), Gaps = 18/190 (9%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + A +V+S + R + Y ++ Sbjct: 277 SSSSVGELAYEEMKKFAHQVVDSFS-----ISQQNARFAALVYGSNASVEFNFVRYDSAL 331 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+K + L+ NT A+ A +L++ + + + DG + Sbjct: 332 EIKQAIQSLSYLKSNTRIDKALEVAKSDLFSLQGKVRSRR-----PMILYVFFDGTVT-- 384 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 ++ + + + +++ G+KI ++ V L+K ++ + F+ +EL Sbjct: 385 ---RSMSDLESVVQPLKDYGVKIIAIGVGPEVNRYQ-LKKISEDNA-IFSGKSFKELAPL 439 Query: 380 FDKITDKIQE 389 I ++ Sbjct: 440 LYSIVEQSCS 449 >gi|301781662|ref|XP_002926252.1| PREDICTED: collagen alpha-3(VI) chain-like [Ailuropoda melanoleuca] Length = 3167 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 18/178 (10%), Positives = 59/178 (33%), Gaps = 17/178 (9%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + + +++ + R + +N EV S ++ ++ +N Sbjct: 58 VREFLYDVIESLAVGDSDF-RFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGSN 116 Query: 277 -TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T + + + E + + + + ++ +TDG + A + + Sbjct: 117 ETGKGLEYVMQNHLTE---AAGSRAGDGVPQVIVVLTDGRSDDGLALPSAG--------L 165 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQS 391 ++A + ++++ V + L++ F + + L + + +Q Sbjct: 166 KSADVNVFAIGV--EDADEGALKEIASEPLNMHVFNLENFTSLHDIVGNLVSCVQSSV 221 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 26/177 (14%), Positives = 59/177 (33%), Gaps = 17/177 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 N + D E + + + S+++G + YN + ++ Sbjct: 1646 GSINFRRDTFQEVLRFVSEIVDTLYEGG--DSIQVGLVQYNSDPTDEFFLKDFSTKQQII 1703 Query: 264 SRLNKLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 +NK+ + + H + N + R+ + IT G+ Sbjct: 1704 DAINKVVYKGGRHANTKVGIEHLRQ---NHFVPEAGSRLDQRVPQIAFVITGGK------ 1754 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1755 --SVEDAQEASLALTQKGVKVFAVGV--KNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 51/188 (27%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ + V Sbjct: 1039 GVRSGFPLLKEFVQRVVESL-----DVGPDRVRVAVVQYSDRTRPEFYLNSYMDQQSVVG 1093 Query: 265 RLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L NT A+ R + S + + +I +T + Sbjct: 1094 AVRRLTLLGGPAPNTGAALDFVLRSILTSSAGSRIA---EGVPQLLIVLTADRSGDDVRG 1150 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 1151 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQA 1200 Query: 383 ITDKIQEQ 390 I+D++ + Sbjct: 1201 ISDRVIQL 1208 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 46/154 (29%), Gaps = 18/154 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 +R+G + Y+ + +V + L N A+ Sbjct: 273 SVGAQQIRVGVVQYSDEPRTVFSLDTYSTKAQVLDAVKALVFTGGELANVGLALDFVVEN 332 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + + + + ++ I+ G +S ++ + L+ ++S + Sbjct: 333 HFT---RAGGSRVEEGVPQVLVLISAGPSSD--EIRDGVVALKQAS--------VFSFGL 379 Query: 348 SAPPEGQDLLRKCTDSSGQFFAV---NDSRELLE 378 A + L+ + F V +L E Sbjct: 380 GAQAASRAELQHIATNDNLVFTVPEFRSFGDLQE 413 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K D + ++ +++ VR+G + ++ + + V L + Sbjct: 1448 KPDDVAHIRDFVIKIVRRLNIGPNK--VRVGVVQFSNEVFPEFYLKTHKSQAAVLDALRR 1505 Query: 269 LNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 L + NT A+ R L+ S + + + ++ G++ + + Sbjct: 1506 LRFRGGSPLNTGRALEFVARNLF---VKSAGSRIEDGVPQHLVLFLGGKSQDDISRFS-- 1560 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + ++G+ S+ V + L+ T+ F V + REL Sbjct: 1561 ------QVISSSGIV--SLGVGDRNIDRTELQTITNDPRLVFTVREFRELPSI 1605 >gi|118575253|ref|YP_874996.1| hypothetical protein CENSYa_0043 [Cenarchaeum symbiosum A] gi|118193774|gb|ABK76692.1| conserved hypothetical protein [Cenarchaeum symbiosum A] Length = 311 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 20/166 (12%), Positives = 55/166 (33%), Gaps = 34/166 (20%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + L+++ E R+ ++ T + E S +I Sbjct: 138 TISYLTSDRAEAAGRIGEIVQGDGATALGDGLALGVEMAAAGPEKST-----------II 186 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------G 353 ++DG ++ + E ++++++ + + Sbjct: 187 LLSDGVHNSGRT-----VPGEALELAIQGNIRVHTIGMGSDEPVRVGDDIFGEPRYAELD 241 Query: 354 QDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 +D LR+ D + G ++ D+ L F+ +++ I ++ +R Sbjct: 242 EDTLREIADRTGGMYYTSVDNPTLDGIFEALSEDIAGTEGHLSASR 287 >gi|71051532|gb|AAH36192.1| COL14A1 protein [Homo sapiens] Length = 534 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 57/168 (33%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + RIG Y+ + +EV + L NT T A+++ + Sbjct: 186 TAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIF 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + T + K I ITDG++ + +R +G++++++ Sbjct: 246 ENSFK-----PEAGSRTGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V + + + +T + + Sbjct: 293 GV--KNADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 >gi|56797853|emb|CAG26904.1| matrilin-1 [Danio rerio] Length = 277 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 29/159 (18%), Positives = 51/159 (32%), Gaps = 19/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + R+G + Y + + ++K+ P T T A+ A Sbjct: 69 SVGPDATRVGVVNYASRVKNEVSLKSHKTKAALVKAVSKIEPLSTGTMTGLAIQFAMNVA 128 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E S + K I +TDG N I R AG++I+++ V Sbjct: 129 FSEAEGG---RKSPDISKVAIIVTDGRPQD--------NIRDIAARAREAGIEIFAIGVG 177 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 LR+ V +L + F + Sbjct: 178 R--VDMTTLRQMASEPLEDHVDYVESYSLIEKLTKKFQE 214 >gi|70730213|ref|YP_259952.1| von Willebrand factor type A domain-containing protein [Pseudomonas fluorescens Pf-5] gi|68344512|gb|AAY92118.1| von Willebrand factor type A domain protein [Pseudomonas fluorescens Pf-5] Length = 332 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 51/157 (32%), Gaps = 27/157 (17%) Query: 254 PLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + + L NT A+ A + L E +K +I Sbjct: 153 PLTLDHASLSLLLEDSGIGMAGPNTAIGDAIGLALKLLEQAHE----------PEKVLIL 202 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCT-DS 363 +TDG ++ + + G+ I+++ + P L + + Sbjct: 203 LTDGNDT-----SSAITPQHAAAMAAARGVVIHTIGIGDPSAEGEAKVDLSALEQIARTT 257 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQS--VRIAPNR 398 G++F D L + + + Q +R P R Sbjct: 258 GGRYFRAEDRSALDQVYATLDRLTPHQVQTLRHQPKR 294 >gi|319761860|ref|YP_004125797.1| von willebrand factor type a [Alicycliphilus denitrificans BC] gi|330826288|ref|YP_004389591.1| von Willebrand factor type A [Alicycliphilus denitrificans K601] gi|317116421|gb|ADU98909.1| von Willebrand factor type A [Alicycliphilus denitrificans BC] gi|329311660|gb|AEB86075.1| von Willebrand factor type A [Alicycliphilus denitrificans K601] Length = 348 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 24/224 (10%), Positives = 72/224 (32%), Gaps = 59/224 (26%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ +A + + VR+G +A+ Q + + ++ ++ Sbjct: 106 DRLTAAQNAAKAFIQDLP--------RHVRVGVVAFAGTAQLAQLP--TQSHEDLLKAID 155 Query: 268 KLNPYENTNTYPAMHHAYRELYN----------------------------EKESSHNTI 299 T T + A L+ K+ Sbjct: 156 SFQLQRGTATGNGIMMALATLFPDAGIDIAALGGRQSMRVRPIDEVGRADPAKKPFTPVA 215 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------- 352 + +I +TDG+ + ++ L+ ++ + G+++Y+V V Sbjct: 216 PGSYRSAAIIMLTDGQRTTG------VDPLEAAQWAADRGVRVYTVGVGTVQGELIGFEG 269 Query: 353 -------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ 388 +D L+ ++ ++F +++L + ++ ++ ++ Sbjct: 270 WSMRVRLDEDTLKAVALRTNAEYFHAATAQDLRKVYETLSSRLT 313 >gi|228472734|ref|ZP_04057492.1| BatB protein [Capnocytophaga gingivalis ATCC 33624] gi|228275785|gb|EEK14551.1| BatB protein [Capnocytophaga gingivalis ATCC 33624] Length = 353 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 28/184 (15%), Positives = 64/184 (34%), Gaps = 50/184 (27%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEK 292 R+ I Y PL+++ + K L +N T A+ A Sbjct: 130 RVAFIPYAAQAYPQ--LPLTSDYSSAKIFLEGINTNMLSSQGTAIGEAIQMAINYFEESS 187 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 ++S K +I ++DGE+ + + ++ G++++++ + Sbjct: 188 QTS----------KILIILSDGEDHQQGVDT-------VIQEAKDKGIRLFTIGLGTAQG 230 Query: 353 --------------------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 Q LL + + G++F +++E+L++ K D Sbjct: 231 ATIPVSENGQIVAKRDNNGQVVITKLNQALLEEIAQEGGGKYFNGANTKEVLDALQKALD 290 Query: 386 KIQE 389 I++ Sbjct: 291 TIEK 294 >gi|308094406|ref|ZP_05889083.2| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308095541|ref|ZP_05906689.2| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308125336|ref|ZP_05774598.2| conserved hypothetical protein [Vibrio parahaemolyticus K5030] gi|308126258|ref|ZP_05908794.2| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308089078|gb|EFO38773.1| conserved hypothetical protein [Vibrio parahaemolyticus Peru-466] gi|308091462|gb|EFO41157.1| conserved hypothetical protein [Vibrio parahaemolyticus AN-5034] gi|308109129|gb|EFO46669.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ4037] gi|308114017|gb|EFO51557.1| conserved hypothetical protein [Vibrio parahaemolyticus K5030] Length = 400 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/236 (11%), Positives = 63/236 (26%), Gaps = 6/236 (2%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK-DQTST 62 +++ + + IDL H + + ++Q+A+D A L+G + S+ Sbjct: 1 MVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVDQAEAAVIATLSS 60 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 I + L + +N +Y + + +L Sbjct: 61 IASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYVRVAVTDMGISQYLSA 120 Query: 123 LIPS-ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + + S + + I M D + ++ED + + + L Sbjct: 121 VFGIVKNVSASAVAGRSAAIAYTCNLTPIAMCGDPNGTVEDAWGYRPPGYDPNVDMDPSL 180 Query: 182 PPP----PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + + A N ++ ++ N Sbjct: 181 VHELKVGDQNNTDMGPGNFQLLDFGQATGNSGAALVRDALSGAYNGCAAVGNTVTT 236 >gi|126334038|ref|XP_001370553.1| PREDICTED: similar to Integrin alpha-X precursor (Leukocyte adhesion glycoprotein p150,95 alpha chain) (Leukocyte adhesion receptor p150,95) (Leu M5) (CD11c antigen) [Monodelphis domestica] Length = 1224 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 58/179 (32%), Gaps = 21/179 (11%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 +++ +K + + + S++ + + +L Sbjct: 228 KNFVKAMISQFEK----PSTQFSLMQFASNFKIHFTFEKFKNSHDPRRLVDEITQL--SG 281 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T + EL+ + + K +I ITDGE Y + L Q+ Sbjct: 282 VTKTASGIKKVINELFQKTRGAR-----QYATKILIVITDGE-----KYDDPLEYSQVIP 331 Query: 334 YMRNAGMKIYSVAVSAP---PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 AG+ Y++ V P + L + F V++ L +++ +KI Sbjct: 332 TAEKAGIIRYAIGVGEAFERPSSRQELEEIASEPSKDHIFWVDNFGALSNIQNQLKEKI 390 >gi|325964113|ref|YP_004242019.1| hypothetical protein Asphe3_27670 [Arthrobacter phenanthrenivorans Sphe3] gi|323470200|gb|ADX73885.1| hypothetical protein Asphe3_27670 [Arthrobacter phenanthrenivorans Sphe3] Length = 352 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 19/198 (9%), Positives = 55/198 (27%), Gaps = 20/198 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+++ A+D+ + + Q+Q+ DA L+ + Sbjct: 29 LVAVLMVALLGCAALAVDVGAMYAEKAQIQNGADATSLAIAEECANGVNCAVAMAA---- 84 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK-AQYEIPTENLF 119 + + + + ++T+ + ++ ++ A +LF Sbjct: 85 -------------PANRLADANANDGATGVFSVTQPSPSTIRVETNAREAGSGDNHFSLF 131 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 ++ + ++ + S + + V R D + Sbjct: 132 FARVLGIDTSQITAAAEASWGPPSSGSTLPWTVSECVFRKYLTPTQLTELDTTGSFTGD- 190 Query: 180 LLPPPPKKSFWSKNTTKS 197 P + +NT Sbjct: 191 -PTPTHILLRYDENTPDY 207 >gi|126658524|ref|ZP_01729672.1| hypothetical protein CY0110_21405 [Cyanothece sp. CCY0110] gi|126620266|gb|EAZ90987.1| hypothetical protein CY0110_21405 [Cyanothece sp. CCY0110] Length = 610 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 18/137 (13%), Positives = 43/137 (31%), Gaps = 10/137 (7%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N E+K+++ K+ TN ++ + ST V+ +TDG Sbjct: 96 TNKQEIKAKIKKIRARGCTNLSGGWLLGCSQVKSHL--------STDKLNRVLLLTDGLA 147 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + E G+ ++ + L+ + G F+ + + Sbjct: 148 NIGERKP-EILLKTAAEKA-QQGIVTTTLGFGSNFNEDLLIGMADAAGGNFYFIQSPDDS 205 Query: 377 LESFDKITDKIQEQSVR 393 + F + + + Sbjct: 206 ADVFHIEMESLLSLVTQ 222 >gi|186681468|ref|YP_001864664.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186463920|gb|ACC79721.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 426 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 47/156 (30%), Gaps = 26/156 (16%) Query: 258 NLNEVKSRLNK-LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + +KS++ L+ T + EL + +TDG Sbjct: 99 DPESIKSQIKSKLSASGGTAIAEGLELGITELMKGTRGA---------VSQAFLLTDGHG 149 Query: 317 SGASAYQ-------NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFF 368 + + L++ + + I + QDLL K D G Sbjct: 150 ESSLRIWKWDIGRDDNKRCLKLAQKAAKLNLTINTFGFG-NSWNQDLLEKIADVGGGTLA 208 Query: 369 AVNDSRELLESFDKITDKIQE-------QSVRIAPN 397 + + +E F ++ +IQ + + PN Sbjct: 209 HIEHPEQAVEQFSRLFGRIQSIGLTNAYLLLSLVPN 244 >gi|32475885|ref|NP_868879.1| hypothetical protein RB9427 [Rhodopirellula baltica SH 1] gi|32446428|emb|CAD76256.1| hypothetical protein containing vWFA domain [Rhodopirellula baltica SH 1] Length = 484 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 17/161 (10%), Positives = 54/161 (33%), Gaps = 12/161 (7%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 ++ + + + Y+ + + + + +K ++ + +T + + E+ Sbjct: 114 DRLSDDDIVSVVLYDSNVTVLVPATKATDRSSIKQKIRGIQAGSSTALFAGVSKGAAEV- 172 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + VI ++DG + L L + + + ++ + + Sbjct: 173 -------RKFLADEQVNRVILLSDGLANVGPKSPQELEGLG--RSLMKEAISVSTLGLGS 223 Query: 350 PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQE 389 +DL+ G + D+ L+ F++ D + Sbjct: 224 G-YNEDLMVALASVGGGNHAFIEDADSLVSVFNQEFDGLLS 263 >gi|239616825|ref|YP_002940147.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] gi|239505656|gb|ACR79143.1| von Willebrand factor type A [Kosmotoga olearia TBF 19.5.1] Length = 730 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 35/292 (11%), Positives = 83/292 (28%), Gaps = 23/292 (7%) Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 + + + + S+ + S + + L V+ Sbjct: 184 NVRIEVVADPNEISDVFSPIHSFNYIKKGDESSWVFSASDYVPRSDVSLALTVTGGTLSS 243 Query: 164 YLQKHNDNN--NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 + + D L+PP + K+ KI+ + ++ Sbjct: 244 SIMNYWDEADRRGYFLLTLVPPREPERIIPKDIVFILDISGSMSGQ-KIEKAKLALLQVL 302 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIV--GNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 + + R I +N + + P S + E + ++ TN + Sbjct: 303 QMLHEGD--------RFSIITFNNEVNNLTERLLPFS-DRTEWYPAVKQIMAGGMTNIHD 353 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A+ L T + K V+F+TDG + T+ + + Sbjct: 354 ALLEGIEVL--------GTQSTDDRYKVVLFLTDGAPTEGITDIGTII-RDSTKLAKVRD 404 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + ++ V + L G+ + ++ E+ E ++ I+ Sbjct: 405 VHLFVFGVGYDVNAELLDELAEKGGGKVKYIVENEEIDEKVLELYRMIETPV 456 >gi|170750695|ref|YP_001756955.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] gi|170657217|gb|ACB26272.1| von Willebrand factor type A [Methylobacterium radiotolerans JCM 2831] Length = 345 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 32/166 (19%), Positives = 57/166 (34%), Gaps = 24/166 (14%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYN--- 290 RIG + + S + V L++ +T + A R L Sbjct: 148 RIGLVIFADQAYVAAAP--SFDTAAVARALDEATIGISGRSTGIGDGLGLALRRLDPRDA 205 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA- 349 E++ + + K VI ++DG N+ + E R G+K+Y++A+ Sbjct: 206 GGEAASGSKPGEKPAKAVILLSDGANNAGQT-----APKDVAELARELGIKVYTIALGPR 260 Query: 350 ---------PPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + LR S G+ F V + +L+ D I Sbjct: 261 DMADADGEQDVVDTETLRDMARASGGEAFRVRTTEDLVRVADAIDR 306 >gi|153833319|ref|ZP_01985986.1| von Willebrand factor, type A [Vibrio harveyi HY01] gi|148870455|gb|EDL69376.1| von Willebrand factor, type A [Vibrio harveyi HY01] Length = 363 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 63/150 (42%), Gaps = 17/150 (11%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + E+ ++ + ++T+ A+ A + ++S + S Sbjct: 159 GDAAFVQTPFTPDQKVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSEKSRTDVEESK 218 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDL 356 K I +TDG ++G + + + + + ++I+ +A+ P + Sbjct: 219 E--KVAIVLTDGNDTG-----SFVEPIDAAKVAKAKDVRIHVIAMGDPQTVGETALDMNT 271 Query: 357 LRKCT-DSSGQFFAVNDSRELLESFDKITD 385 +++ +S G+ F + EL +++D+I Sbjct: 272 IKRIAKESGGEAFEALNRDELAKAYDEIGK 301 >gi|326500988|dbj|BAJ98725.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326527981|dbj|BAJ89042.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 707 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 17/202 (8%), Positives = 61/202 (30%), Gaps = 31/202 (15%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK--SRL 266 K+ +L ++ +++++ R+ ++++ ++ + + + Sbjct: 226 KLALLKQAMRFVIDNLG--------PDDRLSVVSFSSEARRLTRLARMSDAGKALSVNAV 277 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 L TN + A + L V+ ++DG+++ + Sbjct: 278 ESLVARGGTNIAEGLRTAAKVLDE--------RQHRNAVSSVVLLSDGQDTYTMMRRRGP 329 Query: 327 NTLQICE-------------YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 + + + I++ + + + G F + + Sbjct: 330 SGVHAGNYEELVPPSFARTGADGDWSAPIHTFGFGNDHDAAAMHVIAEATGGTFSFIENE 389 Query: 374 RELLESFDKITDKIQEQSVRIA 395 + ++F + + V+ A Sbjct: 390 AVIQDAFAQCIGGLLSVVVQEA 411 >gi|315647020|ref|ZP_07900133.1| D-amino acid dehydrogenase, large subunit [Paenibacillus vortex V453] gi|315277222|gb|EFU40551.1| D-amino acid dehydrogenase, large subunit [Paenibacillus vortex V453] Length = 471 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 35/205 (17%), Positives = 68/205 (33%), Gaps = 25/205 (12%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI-----GIVGNQ 251 S A K+D E+ + + + + GT Sbjct: 172 SGSMKAQINGKSKMDSAKEAIQTFADKLPNNAEVSLRVYGHKGTGDQKDKRVSCDSTEEI 231 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + N++K+ L K+ P T A+ +++ E V + Sbjct: 232 FHGQGDQTNQIKTALQKVEPAGWTPIANALQSVKKDINPE-----------TTDSVVYVV 280 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMK--IYSVAVSAPPEGQDLLRKCTDSSGQFF- 368 +DG + Q+ + + + +K + V EGQ LLR+ S G F Sbjct: 281 SDGIETCGG------KPAQVAKELHQSKVKTIVNIVGFDVDNEGQKLLRQIAASGGGEFM 334 Query: 369 AVNDSRELLESFDKITDKIQEQSVR 393 +VN+ L +K ++++ + R Sbjct: 335 SVNNDEALKNVLNKAYEQLRGEWTR 359 >gi|224046671|ref|XP_002199336.1| PREDICTED: collagen, type XIV, alpha 1 (undulin) [Taeniopygia guttata] Length = 1883 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 55/168 (32%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+G Y+ + V + L NT T A+ + Sbjct: 186 SAFNVGSEKTRVGLAQYSGDPRIEWHLNAYGTKDAVLDAVRNLPYKGGNTLTGLALTYIL 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + K I ITDG++ + +R+AG++++++ Sbjct: 246 ENSFKPEAGAR-----PGVSKIGILITDGKSQDDVIPP--------AKNLRDAGIELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V D + + +T + + Sbjct: 293 GV--KNADINELKEIASEPDSTHVYNVADFNFMHTIVEGLTRTVCSRV 338 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 60/201 (29%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ I + Sbjct: 1043 VDGSWSIGDDNFNKIINFLYSTVGALDKIGPDGT----QVAIIQFTDDPRTEFKLNAYKT 1098 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + + NT T A+ HA L+ + K ++ ITDG + Sbjct: 1099 KETLLEAIQHIAYKGGNTKTGKAIKHAREFLFTG-----EAGMRRGIPKVLVVITDGRSQ 1153 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 ++ M+ G +++ V L F V+D Sbjct: 1154 DDVN--------KVSREMQLDGFTFFAIGV--ADADYSELVNIGSKPSERHVFFVDDF-- 1201 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 1202 --DAFTKIEDELITFVCETAS 1220 >gi|260797293|ref|XP_002593638.1| hypothetical protein BRAFLDRAFT_155309 [Branchiostoma floridae] gi|229278864|gb|EEN49649.1| hypothetical protein BRAFLDRAFT_155309 [Branchiostoma floridae] Length = 388 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 53/172 (30%), Gaps = 22/172 (12%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAY 285 + + ++G I Y+ + S ++++ T T A+ + Sbjct: 36 SGFDISPSGTQVGVIQYSTRTRQEFSMNSFVTKETLSSAIDEVQYMRGGTLTGKAIRYVT 95 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + + + K VI +TDG + + G+ +Y++ Sbjct: 96 KYGFGKSDGAR-----PGVPKVVIVVTDGVSYD--------AVAAPALEAQQKGITVYAI 142 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI------TDKIQEQS 391 V D L + ++ V++ L + + I Sbjct: 143 GV--SGYDADQLEQIASNNNTLAFVDNFNLLDNLRNTLLTGVCDAKNICSTR 192 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 52/183 (28%), Gaps = 19/183 (10%) Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK---AIQEKKNLSVRIGT 240 +S T + + S V K + + ++G Sbjct: 221 NNCGRKFSCFPTLCRNPLDIIFLLDGSGSVGASNFEKVKQFTKKAISGFDISPSGTQVGV 280 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTI 299 I Y+ + S ++++ T T A+ + + + + + + Sbjct: 281 IQYSTRTRQEFSMNSFVTKETLSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKSDGAR--- 337 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + K VI +TDG + + G+ +Y++ V D L + Sbjct: 338 --PGVPKVVIVVTDGVSYD--------AVAAPALEAQQKGITVYAIGV--SGYDADQLEQ 385 Query: 360 CTD 362 Sbjct: 386 IAS 388 >gi|297243668|ref|ZP_06927599.1| hypothetical protein GVAMD_0259 [Gardnerella vaginalis AMD] gi|296888419|gb|EFH27160.1| hypothetical protein GVAMD_0259 [Gardnerella vaginalis AMD] Length = 560 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 25/182 (13%), Positives = 64/182 (35%), Gaps = 27/182 (14%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 ID + + + V ++ + + L + Y +N+ + + Sbjct: 208 SGSMEDSIDTMKRTMSDFVRNLNYKVGDTGELISFDSYLMYMATY--------TNDKDRL 259 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + ++ + PY T Y A++ N VI TDG+++ ++ Sbjct: 260 LTGIDNMTPYGMTALYDALYTGITNASNHPGF-----------NCVIAFTDGQDNESTHT 308 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFD 381 + + + G+ +Y + L+ +++G F+ +N ++ E + Sbjct: 309 ADE-----VISLAKEKGIPVYLIG--TSEADSSTLQNIANETNGYFWDMNSISDMQEVMN 361 Query: 382 KI 383 +I Sbjct: 362 RI 363 >gi|126664966|ref|ZP_01735949.1| hypothetical protein MELB17_17899 [Marinobacter sp. ELB17] gi|126630336|gb|EBA00951.1| hypothetical protein MELB17_17899 [Marinobacter sp. ELB17] Length = 341 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 57/197 (28%), Gaps = 45/197 (22%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK---S 264 ++ + + ++ R+G I + PL+ + V Sbjct: 113 NRLQAVKQVLAEFIDQ---------RQGDRLGLILFGSQAYVQA--PLTFDRTTVNILLQ 161 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 T A+ A + L ++ I +TDG N+ + Sbjct: 162 EAGLGMAGNATAIGDAVGLAVKRL----------RERPLEQRVAIVLTDGANTAGEITPD 211 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPE---------------GQDLLRKCTD-SSGQFF 368 + L +++Y++ + A + + LL + + GQ+F Sbjct: 212 KASELAQASA-----VRLYTIGIGAGADSAITGLLQRNPSRDLDEALLTRMAQQTGGQYF 266 Query: 369 AVNDSRELLESFDKITD 385 + EL + I Sbjct: 267 RARNLAELGGIYTSINQ 283 >gi|194289639|ref|YP_002005546.1| hypothetical protein RALTA_A1531 [Cupriavidus taiwanensis LMG 19424] gi|193223474|emb|CAQ69479.1| conserved hypothetical protein, vWA domain (Von Willebrand factor, type A); putative membrane protein [Cupriavidus taiwanensis LMG 19424] Length = 359 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 18/150 (12%), Positives = 47/150 (31%), Gaps = 25/150 (16%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G P + + V++ + L +T A+ + + S Sbjct: 147 GDAPYPLAPFTLDHRLVQTLIADLLPGMAGPSTALGDAIGLGIKMFEH----------SE 196 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG-------QD 355 +K +I +TDG ++ + + + + ++++ + P Sbjct: 197 APEKVLIVLTDGNDTASRMPPERAGGI-----AKERKVVVHTIGIGDPNASGEEKVDLDV 251 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITD 385 L + + G++F D L + + Sbjct: 252 LQKLAAQTGGRYFFGADQAGLETIYATLDQ 281 >gi|316983245|pdb|3N2N|F Chain F, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983246|pdb|3N2N|A Chain A, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983247|pdb|3N2N|B Chain B, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983248|pdb|3N2N|C Chain C, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983249|pdb|3N2N|D Chain D, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain gi|316983250|pdb|3N2N|E Chain E, The Crystal Structure Of Tumor Endothelial Marker 8 (Tem8) Extracellular Domain Length = 185 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 65/187 (34%), Gaps = 18/187 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K ++ + +++ + + +R+ I ++ L+ + +++ Sbjct: 11 YFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMKLTEDREQIR 68 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 L +L P +T + A ++Y E + T +I +TDGE Sbjct: 69 QGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIALTDGELHEDL 122 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND-SRELLES 379 + + R+ G +Y+V V + L + DS F VND + L Sbjct: 123 FFYSERE----ANRSRDLGAIVYAVGV--KDFNETQLARIADSKDHVFPVNDGFQALQGI 176 Query: 380 FDKITDK 386 I K Sbjct: 177 IHSILKK 183 >gi|284029570|ref|YP_003379501.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283808863|gb|ADB30702.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 654 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/223 (11%), Positives = 71/223 (31%), Gaps = 29/223 (13%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ----- 229 ++ +++ S A + ++D + G++V+ + Q Sbjct: 22 NASTAAAAGELSPVMVVLDSSGSMTARDAGGSGTRMDAAKRAVGSMVDGLPAGAQVGLAI 81 Query: 230 --EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 S V P+ N +K + T A+ A + Sbjct: 82 YGAGTGSSGAEKVAGCKDVRVVQPVGPV--NKPALKRAVTATKASGYTPIGQALRTAAAQ 139 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSV 345 L E + + ++ ++DGE++ ++ + + G+ ++++ Sbjct: 140 LPKEGQ------------RSIVLVSDGEDT-----CAPPQPCEVAKELSKQGVDLHVHTI 182 Query: 346 AVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 + + L + G + +D+ LL ++T++ Sbjct: 183 GFRVDAKARAQLACIAQNTGGTYHDASDADSLLGVLGRVTERA 225 >gi|56675026|gb|AAW19655.1| matrilin-1 [Cervus elaphus] Length = 230 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 57/159 (35%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + ++ E+ + ++ P T T A+H A + Sbjct: 34 DVGPNATRVGLVNYASSVKQEFPLRAHSSKAELLQAVRRIQPLSTGTMTGLAIHFAITKA 93 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E S + K VI +TDG + + R +G++++++ V Sbjct: 94 LSDAEGGRPR--SPDISKVVIVVTDGRPQDS--------VRDVSARARASGIELFAIGVG 143 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L + F + Sbjct: 144 R--VDKATLRQIASEPQDEHVDYVESYRVIEKLSKKFQE 180 >gi|300113557|ref|YP_003760132.1| von Willebrand factor type A [Nitrosococcus watsonii C-113] gi|299539494|gb|ADJ27811.1| von Willebrand factor type A [Nitrosococcus watsonii C-113] Length = 345 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 49/136 (36%), Gaps = 23/136 (16%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T A+ S K +I +TDG ++ + + Sbjct: 172 MAGPRTAFGDAIGLGVNLFSE----------SEAPAKTIIALTDGNDTKSQVPS-----V 216 Query: 330 QICEYMRNAGMKIYSVAVSAPP-------EGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + ++I++VA+ P + Q L ++ G +F D L +D+ Sbjct: 217 EAARVAARREIRIHTVAIGDPTTAGEDKLDQQALREVAAETGGSYFFAADRASLAGIYDQ 276 Query: 383 ITDKIQEQSVRIAPNR 398 + D+I+ + +++ +R Sbjct: 277 L-DEIETRKIKMVSHR 291 >gi|118384937|ref|XP_001025607.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89307374|gb|EAS05362.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 670 Score = 65.3 bits (157), Expect = 2e-08, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 77/220 (35%), Gaps = 20/220 (9%) Query: 172 NNMTSNKYLLPPPPKKSFWS--KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 ++ T++ S S K +S + +DV+ S +VN++ Sbjct: 28 SSRTNSNICCVVDVSGSMSSEAKIINQSSQKSDENYSLSILDVVKHSIKMIVNTL----- 82 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + I T + + ++ + +N ++ L+ T + ++ A L Sbjct: 83 -GSEDYLSIVTFSDSANVLFDLLPMNDSNKTMAIEKIENLSTEGGTELWKGLNSALNILL 141 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 N K + + +TDG+ + + N + Q + I + S+ Sbjct: 142 NNKTP--------NTNQSIFLLTDGQPTDSGIDTNLVKFKQAYPKL---NCTINTFGFSS 190 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESF-DKITDKIQ 388 + + + + +G F + D+ + +F + + + + Sbjct: 191 SSNSELMNKIAMEYNGMFSFIPDASFIATAFANALANTLT 230 >gi|330719552|gb|EGG98147.1| BatA [gamma proteobacterium IMCC2047] Length = 166 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 19/99 (19%), Positives = 39/99 (39%), Gaps = 23/99 (23%) Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------- 350 + +I +TDG N+ + L+ E + +KI+++ V A Sbjct: 13 SRLLILLTDGANTAG-----EIEPLKAAELAQQQQIKIHTIGVGANEMLVPGLFSSRRVN 67 Query: 351 ---PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 +D L+K + GQ+F D+ +L + + + Sbjct: 68 PSADLDEDTLKKIASQTGGQYFRAQDTEQLQQIYALLDQ 106 >gi|327274818|ref|XP_003222173.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Anolis carolinensis] Length = 1097 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 25/197 (12%), Positives = 66/197 (33%), Gaps = 26/197 (13%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + D + +++++ + R+G I ++ + N Sbjct: 721 SSESVGPENFDRIKRFVKTVIDAVT-----VNQATARVGIINFSHKVDVVSTLQQYPNKE 775 Query: 261 EVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +K ++ + T T A+ +A + + ++K I ITDG+ Sbjct: 776 SLKGAIDVMQYLGEGTYTATAISNATNIFKSARPG---------VRKVAIVITDGQADR- 825 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAV---SAPPEG--QDLLRKCTDSSG--QFFAVND 372 ++ + + + ++I+ + V S P + + + ++D Sbjct: 826 ---RDPNTLEVVVKDAHASNIEIFVIGVVQKSDPNFDSFRKEMDLIASDPDIEHVYQIDD 882 Query: 373 SRELLESFDKITDKIQE 389 L +K+ +I E Sbjct: 883 FITLQALENKVFKQICE 899 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/129 (20%), Positives = 52/129 (40%), Gaps = 15/129 (11%) Query: 258 NLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +L + K R+ ++ T +Y A+ +A + E S K V+ + DG + Sbjct: 114 SLPDFKLRVKEMTYIGHGTYSYYAISNATQLFKTEGRKSSV--------KVVVLMADGID 165 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ-FFAVNDSRE 375 + + + I E R G+ ++ +S + LL +S G F +ND Sbjct: 166 -----HPKSPDVQAISEAARTFGISFITIGLSNVADKVKLLSISGNSPGTPVFILNDPNL 220 Query: 376 LLESFDKIT 384 L + +++ Sbjct: 221 LDKIRNQLA 229 >gi|251789655|ref|YP_003004376.1| hypothetical protein Dd1591_2051 [Dickeya zeae Ech1591] gi|247538276|gb|ACT06897.1| conserved hypothetical protein [Dickeya zeae Ech1591] Length = 526 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 53/412 (12%), Positives = 117/412 (28%), Gaps = 52/412 (12%) Query: 27 NQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIA 86 +++ LDAA +S S+ + + + K ++ + S + Sbjct: 98 SKLAQQLDAADISVTEGRRSNARRTFTVSVSFASQPVLLKVAQRQHEIYSTVEVIYRPSE 157 Query: 87 QKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENL 146 + T + + + T+ L AL S + + + + Sbjct: 158 VALVLPNTSTETAAELAVLRRLGK-NFATDFLADYPDRRLALVPYSQSVSVYDDANWSSR 216 Query: 147 AISICMV-----LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 + + +++S + N + M + + + + AP Sbjct: 217 IRNWSLSGALKPVELSSLFRNNDYGIANLASRMMPDLRSKRMCLYRGLNQGDNYFWETAP 276 Query: 202 APAP-ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG--IVGNQCTPLSNN 258 A + + + D+ I + + + T PL+++ Sbjct: 277 AGSFRVHYRHDLPINAPDMPAIEWIGPNPDFGQANGVNDTRYIIGDKGCPTAALLPLTDD 336 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL--------KKFVIF 310 LN + +RL+++ P NTN AM A L S Sbjct: 337 LNAISARLDQMEPQFNTNYAIAMGWAAMSLAPSFRGSDGWGDGEYPLDFNDNGSGNIKAI 396 Query: 311 ITDGENSGASAYQNTLNT------------------------LQICEYMRNAGMKIYSVA 346 + G +G ++ N +C R+ + Y + Sbjct: 397 VMMGNTTGDWFDTDSYNAYVGEAIDGDGQGGDLGKQAATLRFRSLCNSFRSRDLLFYFIG 456 Query: 347 V---SAPPEGQDL--------LRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 V G+ L L+ C + G + DS + +I++ + Sbjct: 457 VRPGDPEDFGRTLFGAVAVPGLQACATNDGGYIRFADSSSFSGAEGQISELL 508 >gi|126304011|ref|XP_001381695.1| PREDICTED: similar to tumor endothelial marker 8 [Monodelphis domestica] Length = 564 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 65/196 (33%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + ++ + + +R+ I ++ G+ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEHLAHKFISPQLRMSFIVFSTR--GSILMR 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFQRASEQIYYENMQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFY----AEREANRSRELGATVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L D I K Sbjct: 203 DGFQALQGIIDSILKK 218 >gi|21323788|dbj|BAB98414.1| Hypothetical membrane protein [Corynebacterium glutamicum ATCC 13032] Length = 634 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 25/224 (11%), Positives = 63/224 (28%), Gaps = 27/224 (12%) Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + + + A +ID +++ L+N I Sbjct: 22 ALAFSLSPVAKAQANETPTMIVLDNSGSMTAQDAGGQTRIDAAKQASTQLINDISDRTDV 81 Query: 231 KKNLSVRIGTIAYNI---GIVGNQCT--PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 G P N + + +N L P T A+ Sbjct: 82 GLTYYGGNTGETEADVEMGCQDVTILGGPSRGNADTLIDTINSLQPRGFTPIGKALTDTA 141 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK--IY 343 EL ++ ++DG + + ++ + + +G+ I Sbjct: 142 AELPEGGN--------------IVLVSDGIAN-----CTPPDVCEVAQELAQSGINLVIN 182 Query: 344 SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 ++ ++ P ++ L G + +D++ L ++ + + Sbjct: 183 TIGLNVDPAAREELECIAGVGGGTYADASDAQSLTDALTRAASR 226 >gi|73962941|ref|XP_547762.2| PREDICTED: similar to coagulation factor C homolog, cochlin precursor [Canis familiaris] Length = 847 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + + T T A+ R ++ S Sbjct: 702 KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPVRDS 761 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 762 PNK-------NFLVIVTDGQSYD--------DVRGPAAAAHDAGITIFSVGVAWAPLDD- 805 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 806 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 839 >gi|47219204|emb|CAG11222.1| unnamed protein product [Tetraodon nigroviridis] Length = 4421 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 37/376 (9%), Positives = 102/376 (27%), Gaps = 29/376 (7%) Query: 27 NQMQSALDAAVLSGCASIVSDRTIKDP--TTKKDQTSTIFKKQIKKHLKQGSYIREN-AG 83 + +Q + DAAV + + R I D + + + ++L+ + + Sbjct: 40 SVVQYSRDAAVHFYLNTYTTKREILDALRGLRHKGGRALNTGEALQYLRNNVFTASAGSR 99 Query: 84 DIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSS 143 Q+ I + + + +F G S + S Sbjct: 100 RTEGVPQVLILLTGGRSSDSVDSPASDLKQLGVLIFAIGSRGSDNREIQRISHSPTSALV 159 Query: 144 ENLAISICMV----LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + V L R + D+ + + P P K+ Sbjct: 160 VPEFTDLPSVQQQLLSSVRDVVDVGPEAPTPAVRDVVDVGPEAPTPAVDTSKKDIVFLLD 219 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + + +V + R+ + Y+ + + Sbjct: 220 GS--DGTRNGFPAMRDFVERVVEKLS-----VGPNKDRVSVVQYSRDAEVHFNLNTYSTR 272 Query: 260 NEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 ++ + L T A+ + ++ S + + +I + G + Sbjct: 273 EDIIDSVRGLRHRGGAPVRTGAALQYVRDNVFTNSSGSRRL---QGVPQMLILLNGGRSY 329 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + ++ G ++ + + + L+K + AV + +L Sbjct: 330 DSVDTP--------ASSLKQQG--VFVIGIGTRNSDRTELQKISFEPSYTLAVTEFTDLP 379 Query: 378 ESFDKITDKIQEQSVR 393 ++++ + ++ Sbjct: 380 SIQEQLSSVMSTVLLK 395 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 36/372 (9%), Positives = 97/372 (26%), Gaps = 36/372 (9%) Query: 27 NQMQSALDAAVLSGCASIVSDRTIKDP--TTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 + +Q + DAAV + + R I D + + + ++L+ + Sbjct: 452 SVVQYSRDAAVHFYLNTYTTKREILDALRGLRHKGGRALNTGEALQYLRNNVFTASAGSR 511 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK-GLIPSALTNLSLRSTGIIERSS 143 + + +S A P L G S + S Sbjct: 512 RTEGVPQVLILLTGGRSSDSVDSPASDLKPLGVLIFAIGSRGSDNREIQRISHSPTSALV 571 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 + V Q+ + + P P K+ Sbjct: 572 VPEFTDLPSV-----------QQQLLSSVRDVVDVGPEVPTPAVDTSKKDIVFLLDGS-- 618 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + +V + R+ + Y+ + + ++ Sbjct: 619 DGTRNGFPAMRDFVERVVEKLS-----VGPNKDRVSVVQYSRDAEVHFNLNTYSTREDII 673 Query: 264 SRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L T A+ + ++ S + + +I + G + + Sbjct: 674 DSVRGLRHRGGAPVRTGAALQYVRDNVFTNSSGSRRL---QGVPQMLILLNGGRSYDSVD 730 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 ++ G ++ + + + L+K + AV + +L + Sbjct: 731 TP--------ASSLKQQG--VFVIGIGTRNSDRTELQKISFEPSYTLAVTEFTDLPSIQE 780 Query: 382 KITDKIQEQSVR 393 +++ + ++ Sbjct: 781 QLSSVMSTVLLK 792 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 35/372 (9%), Positives = 99/372 (26%), Gaps = 36/372 (9%) Query: 27 NQMQSALDAAVLSGCASIVSDRTIKDP--TTKKDQTSTIFKKQIKKHLKQGSYIREN-AG 83 + +Q + DAAV + + R I D + + + ++L+ + + Sbjct: 1814 SVVQYSRDAAVHFYLNTYTTKREILDALRGLRHKGGRALNTGEALQYLRNNVFTASAGSR 1873 Query: 84 DIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSS 143 Q+ I + + + +F G S + S Sbjct: 1874 RTEGVPQVLILLTGGRSSDSVDSPASDLKQLGVLIFAIGSRGSDNREIQRISHSPTSALV 1933 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 + V Q+ + + P P K+ Sbjct: 1934 VPEFTDLPSV-----------QQQLLSSVRDVVDVGPEAPTPAVDTSKKDIVFLLDGS-- 1980 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + +V + R+ + Y+ + + ++ Sbjct: 1981 DGTRNGFPAMRDFVERVVEKLS-----VGPNKDRVSVVQYSRDAEVHFNLNTYSTREDII 2035 Query: 264 SRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L T A+ + ++ S + + +I + G + + Sbjct: 2036 DSVRGLRHRGGAPVRTGAALQYVRDNVFTNSSGSRRL---QGVPQMLILLNGGRSYDSVD 2092 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 ++ G ++ + + + L+K + AV + +L + Sbjct: 2093 TP--------ASSLKQQG--VFVIGIGTRNSDRTELQKISFEPSYTLAVTEFTDLPSIQE 2142 Query: 382 KITDKIQEQSVR 393 +++ + ++ Sbjct: 2143 QLSSVMSTVLLK 2154 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 14/168 (8%), Positives = 50/168 (29%), Gaps = 15/168 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 R+ + Y+ + + ++ + L T A+ + Sbjct: 3225 SVGPNKDRVSVVQYSRDAEVHFNLNTYSTREDIIDSVRGLRHRGGAPVRTGAALQYVRDN 3284 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + +I + G + + ++ G ++ + + Sbjct: 3285 VFTNSSGSRRL---QGVPQMLILLNGGRSYDSVDTP--------ASSLKQQG--VFVIGI 3331 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + L+K + AV + +L ++++ + ++ Sbjct: 3332 GTRNSDRTELQKISFEPSYTLAVTEFTDLPSIQEQLSSVMSTVLLKAT 3379 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 14/166 (8%), Positives = 50/166 (30%), Gaps = 15/166 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 R+ + Y+ + + ++ + L T A+ + Sbjct: 2711 SVGPNKDRVSVVQYSRDAEVHFNLNTYSTREDIIDSVRGLRHRGGAPVRTGAALQYVRDN 2770 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + +I + G + + ++ G ++ + + Sbjct: 2771 VFTNSSGSRRL---QSVPQMLILLNGGRSYDSVDTP--------ASSLKQQG--VFVIGI 2817 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + L+K + AV + +L ++++ + ++ Sbjct: 2818 GTRNSDRTELQKISFEPSYTLAVTEFTDLPSIQEQLSSVMSTVLLK 2863 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 14/166 (8%), Positives = 50/166 (30%), Gaps = 15/166 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 R+ + Y+ + + ++ + L T A+ + Sbjct: 853 SVGPNKDRVSVVQYSRDAEVHFNLNTYSTREDIIDSVRGLRHRGGAPVRTGAALQYVRDN 912 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + +I + G + + ++ G ++ + + Sbjct: 913 VFTNSSGSRRL---QGVPQMLILLNGGRSYDSVDTP--------ASSLKQQG--VFVIGI 959 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + L+K + AV + +L ++++ + ++ Sbjct: 960 GTRNSDRTELQKISFEPSYTLAVTEFTDLPSIQEQLSSVMSTVLLK 1005 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 14/166 (8%), Positives = 50/166 (30%), Gaps = 15/166 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 R+ + Y+ + + ++ + L T A+ + Sbjct: 1066 SVGPNKDRVSVVQYSRDAEVHFNLNTYSTREDIIDSVRGLRHRGGAPVRTGAALQYVRDN 1125 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + +I + G + + ++ G ++ + + Sbjct: 1126 VFTNSSGSRRL---QGVPQMLILLNGGRSYDSVDTP--------ASSLKQQG--VFVIGI 1172 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + L+K + AV + +L ++++ + ++ Sbjct: 1173 GTRNSDRTELQKISFEPSYTLAVTEFTDLPSIQEQLSSVMSTVLLK 1218 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 14/166 (8%), Positives = 50/166 (30%), Gaps = 15/166 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 R+ + Y+ + + ++ + L T A+ + Sbjct: 1279 SVGPNKDRVSVVQYSRDAEVHFNLNTYSTREDIIDSVRGLRHRGGAPVRTGAALQYVRDN 1338 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + +I + G + + ++ G ++ + + Sbjct: 1339 VFTNSSGSRRL---QGVPQMLILLNGGRSYDSVDTP--------ASSLKQQG--VFVIGI 1385 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + L+K + AV + +L ++++ + ++ Sbjct: 1386 GTRNSDRTELQKISFEPSYTLAVTEFTDLPSIQEQLSSVMSTVLLK 1431 Score = 43.7 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 8/77 (10%), Positives = 29/77 (37%), Gaps = 5/77 (6%) Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 +G +Y + ++ G ++ + + + L+K + AV + +L Sbjct: 2239 NGGRSYDSVDTP---ASSLKQQG--VFVIGIGTRNSDRTELQKISFEPSYTLAVTEFTDL 2293 Query: 377 LESFDKITDKIQEQSVR 393 ++++ + ++ Sbjct: 2294 PSIQEQLSSVMSTVLLK 2310 >gi|254255255|ref|ZP_04948571.1| hypothetical protein BDAG_04588 [Burkholderia dolosa AUO158] gi|124900992|gb|EAY71742.1| hypothetical protein BDAG_04588 [Burkholderia dolosa AUO158] Length = 511 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 21/198 (10%), Positives = 61/198 (30%), Gaps = 4/198 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTI----KDPTTK 56 + + ++V F+ A+DL + R+++Q++ DA LS + S ++ + Sbjct: 113 IVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSAISLSVAEANGIAA 172 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 F+++ + L + ++ ++ + N S + Sbjct: 173 GHMNFAFFQQKPVQMLTDSNVTFSDSLTNPFLSKTAVATPANVKYVKCVTSLSNIAHWFI 232 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + + ++ I +I + + S + + + +S Sbjct: 233 EVLNVLPGTKLANAAEVSASAIATVGGGQTTCAIPVFVCRGPSDPPYKVGDWISSPSGSS 292 Query: 177 NKYLLPPPPKKSFWSKNT 194 + Y + Sbjct: 293 STYGPGNFGWAALDGSTN 310 >gi|325287596|ref|YP_004263386.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324323050|gb|ADY30515.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 696 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 21/147 (14%), Positives = 46/147 (31%), Gaps = 11/147 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 ++ + Y + ++ + L KL +T + AY+ + + Sbjct: 375 KVSIVVYAGAAGVVLPPTNGDQKEKIINALQKLEAGGSTAGGQGIKLAYKLAEKNFKKNG 434 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 N VI TDG+ + + ++ E R +G+ + + Sbjct: 435 NNR--------VILATDGDFNVG--ASSDTAMEKLIEKKRASGVFLSVLGFGMGNYKDSK 484 Query: 357 LRKCTDSS-GQFFAVNDSRELLESFDK 382 L D G ++ +E + F Sbjct: 485 LETLADKGNGNHAYIDTMQEAQKVFGD 511 >gi|170734866|ref|YP_001773980.1| hypothetical protein Bcenmc03_6370 [Burkholderia cenocepacia MC0-3] gi|169820904|gb|ACA95485.1| conserved hypothetical protein [Burkholderia cenocepacia MC0-3] Length = 423 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 36/333 (10%), Positives = 88/333 (26%), Gaps = 22/333 (6%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTI----KDPTTK 56 + + ++V F+ A+DL + R+++Q++ DA LS + S ++ D Sbjct: 25 IVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSAISLSVAEADGIAA 84 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 F+K+ + + + + N N + Sbjct: 85 GHLNFVFFQKKS-VQMSTNANVTFSDSLTNPFLTKNAVTTPANIKYVQCTATLSNIAHWF 143 Query: 117 NLFLKGLIPSA-LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 L L + + ++ I +I + + S + + + + Sbjct: 144 IEVLNVLPGTKLANAAEVSASAIATVGGGQTTCAIPVFVCRGPSDPAYKVGDWISSPSGS 203 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 S+ Y + + + + + ++ + Sbjct: 204 SSTYGPGNFGWAALDGSTNETTLASELSGNTCN--------ITSPPDLATTGMKSASQRA 255 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH-AYRELYNEKES 294 Y G G+ P V P + AY + ++ S Sbjct: 256 WNTRFGIYTNGANGSSGQPDFTGYAYV-------GPNYGPPGTAGIKGDAYTQFVADRAS 308 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 G T+G + ++ + Sbjct: 309 FKPYQGDGAPPSGSGIATNGTATASNYSTYGSD 341 >gi|262091909|gb|ACY25458.1| putative von Willebrand factor type A domain-containing protein [uncultured microorganism] Length = 621 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 68/191 (35%), Gaps = 16/191 (8%) Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 N A +ID+ + L S+ R+G I +++ + Sbjct: 261 NVIFVADASGSMAEGNRIDIARAALQALWASL-------VPDLDRVGMIQFSVDPIPASF 313 Query: 253 TPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 P + + +++ +++L PY TN + + + +++ +V+ I Sbjct: 314 VPHTRPDSEFLQASIDRLLPYYGTNVQAGIDLGVQLANDARQA------WPDSDNYVVLI 367 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAV 370 +DG + + E + +++ ++ V LL + +G ++ + Sbjct: 368 SDGVANVDATDP-FAILRSAGEDDESNPIRLITIGVGIGHYNDVLLEQLAQYGNGWYYYI 426 Query: 371 NDSRELLESFD 381 + + E+F Sbjct: 427 DSPEQAWETFT 437 >gi|188994392|ref|YP_001928644.1| putative aerotolerance-related exported protein BatB [Porphyromonas gingivalis ATCC 33277] gi|188594072|dbj|BAG33047.1| putative aerotolerance-related exported protein BatB [Porphyromonas gingivalis ATCC 33277] Length = 339 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 55/165 (33%), Gaps = 36/165 (21%) Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P++ +L+ K L ++P T A+ A + + K +I Sbjct: 143 TQIPITTDLSAAKQFLADISPNMVTAQGTAIGAAIELASKSFS------DNKEIGKTIIV 196 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------------- 350 +TDGEN N ++ + AG+++ + + Sbjct: 197 LTDGENHEG-------NAIEAAQQAHEAGIRVNVIGLGTALGAPIPIEEGYLKDETGNPV 249 Query: 351 --PEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSV 392 + + R + G FF+ + L+ + + DK+ + + Sbjct: 250 VTKFDEKMCRDIASAGEGTFFSGQSASALVRAIESQLDKLPKAVL 294 >gi|34541235|ref|NP_905714.1| batB protein [Porphyromonas gingivalis W83] gi|34397551|gb|AAQ66613.1| batB protein [Porphyromonas gingivalis W83] Length = 339 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 55/165 (33%), Gaps = 36/165 (21%) Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P++ +L+ K L ++P T A+ A + + K +I Sbjct: 143 TQIPITTDLSAAKQFLADISPNMVTAQGTAIGAAIELASKSFS------DNKEIGKTIIV 196 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------------- 350 +TDGEN N ++ + AG+++ + + Sbjct: 197 LTDGENHEG-------NAIEAAQQAHEAGIRVNVIGLGTALGAPIPIEEGYLKDETGNPV 249 Query: 351 --PEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSV 392 + + R + G FF+ + L+ + + DK+ + + Sbjct: 250 VTKFDEKMCRDIASAGEGTFFSGQSASALVRAIESQLDKLPKAVL 294 >gi|62389907|ref|YP_225309.1| hypothetical protein cg1159 [Corynebacterium glutamicum ATCC 13032] gi|41325243|emb|CAF19723.1| putative secreted protein [Corynebacterium glutamicum ATCC 13032] Length = 634 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 25/224 (11%), Positives = 63/224 (28%), Gaps = 27/224 (12%) Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + + + A +ID +++ L+N I Sbjct: 22 ALAFSLSPVAKAQANETPTMIVLDNSGSMTAQDAGGQTRIDAAKQASTQLINDISDRTDV 81 Query: 231 KKNLSVRIGTIAYNI---GIVGNQCT--PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 G P N + + +N L P T A+ Sbjct: 82 GLTYYGGNTGETEADVEMGCQDVTILGGPSRGNADTLIDTINSLQPRGFTPIGKALTDTA 141 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK--IY 343 EL ++ ++DG + + ++ + + +G+ I Sbjct: 142 AELPEGGN--------------IVLVSDGIAN-----CTPPDVCEVAQELAQSGINLVIN 182 Query: 344 SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 ++ ++ P ++ L G + +D++ L ++ + + Sbjct: 183 TIGLNVDPAAREELECIAGVGGGTYADASDAQSLTDALTRAASR 226 >gi|7258382|emb|CAB77598.1| putative protein [Arabidopsis thaliana] Length = 676 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 22/194 (11%), Positives = 66/194 (34%), Gaps = 29/194 (14%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSR 265 K+ +L + G ++ ++ + R+ IA++ ++ Sbjct: 258 TKLALLKRAMGFVIQNLGSSD--------RLSVIAFSSTARRLFPLTRMSDAGRQLALQA 309 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N L TN + + + + E + +I ++DG ++ + + + Sbjct: 310 VNSLVANGGTNIVDGLRKGAKVMEDRLERNSVAS--------IILLSDGRDTYTTNHPDP 361 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK--- 382 + + + + ++S + + + S G F + + ++ + Sbjct: 362 SYKVMLPQ------ISVHSFGFGSDHDASVMHSVSEVSGGTFSFIESESVIQDALAQCIG 415 Query: 383 --ITDKIQEQSVRI 394 ++ +QE V I Sbjct: 416 GLLSVAVQELRVEI 429 >gi|28393354|gb|AAO42101.1| unknown protein [Arabidopsis thaliana] Length = 650 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 22/194 (11%), Positives = 66/194 (34%), Gaps = 29/194 (14%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSR 265 K+ +L + G ++ ++ + R+ IA++ ++ Sbjct: 232 TKLALLKRAMGFVIQNLGSSD--------RLSVIAFSSTARRLFPLTRMSDAGRQLALQA 283 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N L TN + + + + E + +I ++DG ++ + + + Sbjct: 284 VNSLVANGGTNIVDGLRKGAKVMEDRLERNSVAS--------IILLSDGRDTYTTNHPDP 335 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK--- 382 + + + + ++S + + + S G F + + ++ + Sbjct: 336 SYKVMLPQ------ISVHSFGFGSDHDASVMHSVSEVSGGTFSFIESESVIQDALAQCIG 389 Query: 383 --ITDKIQEQSVRI 394 ++ +QE V I Sbjct: 390 GLLSVAVQELRVEI 403 >gi|30694117|ref|NP_191038.2| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645763|gb|AEE79284.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 675 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 22/194 (11%), Positives = 66/194 (34%), Gaps = 29/194 (14%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSR 265 K+ +L + G ++ ++ + R+ IA++ ++ Sbjct: 257 TKLALLKRAMGFVIQNLGSSD--------RLSVIAFSSTARRLFPLTRMSDAGRQLALQA 308 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N L TN + + + + E + +I ++DG ++ + + + Sbjct: 309 VNSLVANGGTNIVDGLRKGAKVMEDRLERNSVAS--------IILLSDGRDTYTTNHPDP 360 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK--- 382 + + + + ++S + + + S G F + + ++ + Sbjct: 361 SYKVMLPQ------ISVHSFGFGSDHDASVMHSVSEVSGGTFSFIESESVIQDALAQCIG 414 Query: 383 --ITDKIQEQSVRI 394 ++ +QE V I Sbjct: 415 GLLSVAVQELRVEI 428 >gi|71896057|ref|NP_001025613.1| matrilin 1, cartilage matrix protein [Xenopus (Silurana) tropicalis] gi|60552391|gb|AAH91071.1| MGC108367 protein [Xenopus (Silurana) tropicalis] Length = 490 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 27/175 (15%), Positives = 58/175 (33%), Gaps = 19/175 (10%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENT 275 +N I +++ + + +G + Y+ + + ++KS + K++ + T Sbjct: 287 VKQFINQIVESM-DVGDHRAHVGLVQYSSSVRQEFPLGRYTSKKDIKSAVKKMSYMEKGT 345 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T A+ + + + K I TDG + Sbjct: 346 MTGQALQYLIDNSFAISSGGRPA-----VPKVGIVFTDGRSQDYIND--------AALRA 392 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 + G K+++V V ++ LR F D + + E K+ KI Sbjct: 393 KELGYKMFAVGVG--NAVEEELRMIASEPVVEHSFYTADFKAMKEIGKKLQMKIC 445 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 28/159 (17%), Positives = 53/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + K+ P T T A+ +A Sbjct: 69 DVGANATRVGLVNYASTVKNEFSLKTHKAKPALLQAVKKVQPLSTGTMTGLAIQYAINIA 128 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E E + S + K I +TDG I R +G++IY++ V Sbjct: 129 FTEPEGARLK--SPGINKVAIIVTDGRPQD--------AVKDISARARESGLEIYAIGVG 178 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 ++ LR+ V +L + F + Sbjct: 179 R--VDKNTLRQIASEPLDEHVDYVESYSLIEKLSKKFQE 215 >gi|194291599|ref|YP_002007506.1| hypothetical protein RALTA_B0833 [Cupriavidus taiwanensis LMG 19424] gi|193225503|emb|CAQ71449.1| conserved hypothetical protein, Von Willebrand factor type A domain (vwa), putative exported protein [Cupriavidus taiwanensis LMG 19424] Length = 356 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 51/169 (30%), Gaps = 46/169 (27%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST---------------- 302 V + + +L P T M A L E + + Sbjct: 157 REAVATAIERLQPQGGTALGNGMLIALTTLLPELTPDAERLMNDDTPPPRKPRALANPPA 216 Query: 303 ---------RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 ++ +DGE++ A G++IY+V V P Sbjct: 217 DTEPVKPGSYTSGAIVLFSDGESNAGPAALRAAQLAA------EHGVRIYTVGVGTPEGV 270 Query: 353 -------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 + +L++ D +S ++F + D+ EL + + ++ Sbjct: 271 VLSVDGWSARVRLDEKVLKEVADATSAEYFRLEDAAELKRVYRALNARL 319 >gi|125540180|gb|EAY86575.1| hypothetical protein OsI_07955 [Oryza sativa Indica Group] Length = 477 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 15/134 (11%), Positives = 40/134 (29%), Gaps = 16/134 (11%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG-- 314 + KS + L TN + A + VI ++DG Sbjct: 171 DGKASAKSAVESLAAGGGTNILKGLVEAAKVFD--------GRRYRNAVASVILLSDGQD 222 Query: 315 --ENSGASAYQNTLNTLQIC----EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 + N+ N + + + + +++ + + ++ G F Sbjct: 223 TYNVNSGWGASNSKNYSVLVPPSFKRSGDRRLPVHTFGFGTDHDAAAMHAIAEETGGTFS 282 Query: 369 AVNDSRELLESFDK 382 + + + ++F + Sbjct: 283 FIENQAVVQDAFAQ 296 >gi|73542340|ref|YP_296860.1| hypothetical protein Reut_A2655 [Ralstonia eutropha JMP134] gi|72119753|gb|AAZ62016.1| putative membrane protein [Ralstonia eutropha JMP134] Length = 412 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 28/280 (10%), Positives = 70/280 (25%), Gaps = 7/280 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + ++V IDL + + ++QSA+D+ L+ + T T Sbjct: 16 IVGLTLAVLLGMAGLVIDLGAMFVAKTELQSAVDSCALAAAQELDGAADALTRATSAGLT 75 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + K + + + + T +Y +K + +L Sbjct: 76 AGNANKVQYQKASASLIDTDVTFSDSLTGAFSSTFTPVANARY---AKCGHLTTGILAYL 132 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ +N +++ +L + Y D + Y Sbjct: 133 IQMVGGPTSNAVAAIGVATRTHAQSTCPIPVGLLPRTGGTAPDYGFAVGD---WVTVLYD 189 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + + + + + + R G Sbjct: 190 GTKTAGPGEMGWYNLDGSTNANETKNEMSVGYCNSKVNDTLRTPGAKVAVDDQWNSRFGI 249 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 N G + + ++ N + Y T+ Sbjct: 250 YK-NNGDMQTMRPDFTGYSYTTQNWPNGKSAYNGTSGGSD 288 >gi|332710564|ref|ZP_08430509.1| hypothetical protein LYNGBM3L_52760 [Lyngbya majuscula 3L] gi|332350619|gb|EGJ30214.1| hypothetical protein LYNGBM3L_52760 [Lyngbya majuscula 3L] Length = 579 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 45/360 (12%), Positives = 106/360 (29%), Gaps = 30/360 (8%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLK-QGSYIRENAGDIAQKAQINITKDKN 98 + + + ++ K + + REN I + A + + Sbjct: 69 ASRRLSHSPNARFKSVPSSSIASKPKPLLTAEQAIPEASNRENYSAIDENAFKRVKHNPL 128 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + + A Y L L P+ + E + + Sbjct: 129 STFAIDVD-TASYSNLRRFLNNGQLPPTNAIRIEELINYFNYDYPEPES-DRPFSITTEI 186 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 S + + K + P + K+ +L ++ Sbjct: 187 SQAPWNPTHQLVHIGIQGEKMAIEDLPPSNLV-----FLLDVSGSMNTPNKLPLLKDAFR 241 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 LVN +++ ++ + Y N +++ + + LN +T Sbjct: 242 MLVNELREED--------QVSIVVYAGAAGVVLPPTPGNEKDKILTAIENLNAGGSTAGG 293 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 + AY+ + S N VI TDG+ + + L +++ E RN Sbjct: 294 AGIKLAYKLAQDNFIKSGNNR--------VILATDGDFNVGVSSDTEL--VKLIEQKRNK 343 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 G+ + + + + K + G + +++ E + + ++I + IA + Sbjct: 344 GVFLTVLGFGSGNLQDSKMEKIANKGNGNYAYIDNELEAKKV---LVNEIGATLLTIAKD 400 >gi|307256565|ref|ZP_07538346.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 10 str. D13039] gi|306864975|gb|EFM96877.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 10 str. D13039] Length = 531 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 64/488 (13%), Positives = 127/488 (26%), Gaps = 124/488 (25%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVS---------------DRTIKDPTTKKD 58 +++ A I+ + ++ +L+ AVLS A S + + + Sbjct: 36 FVSLESAGIIQDQARLSDSLEQAVLSLTAENNSGRKDNDYKLSGSNKENDSFDISSEVGK 95 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + S + K +K L Q + + N I +N T K + I ++ Sbjct: 96 RDSQMVTKFVKAFLPQTNDDKMNL--IPICKTVNNTSGKGHTSSSEVTCTVSGTIEHKSW 153 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENL---AISICMVLDVSRSMEDLYLQKHNDNN--- 172 F + + + +N I + +V D+S SM K+ NN Sbjct: 154 FPLKVGTLEVIPQQVNVASQSRAFKKNTFNIPIDLMVVADLSGSMRYDITNKYETNNETS 213 Query: 173 ------------------------------------NMTSNKYLLPPPPKKSFWSKNTTK 196 P +T Sbjct: 214 KLGILKDVLIELAEKTLLSEDANQHNRIYVTPFALGAEIDASSCALPYHWDKGKDPDTPL 273 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ-----EKKNLSVRIGTIAYNIGIVGNQ 251 E +LV + EK+N V + + + N+ Sbjct: 274 KIKNILSKKEGNNQQSRAEFIEHLVYKMNTKATLDNVGEKQNYKVTFPKGNFCLKNMKNK 333 Query: 252 CTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + + ++ + + L T + A + ++ S + K+ ++ Sbjct: 334 NHGWFSRKDKSDFTNHVRSLRADGATLASSGVLVAANNMI--RDGSRTEQLKEQTKRVIL 391 Query: 310 FITDGENSGASAYQNTLN------------------------------------------ 327 ++DG + N Sbjct: 392 VLSDGNDEIVKGDPNNKVPFLNYTRITENLIYGRQEVFLSQKQKISLSHSTIYTYLTTDT 451 Query: 328 -----TLQICEYMR--------NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 T +CE +R + KI V + C +G +++ ND Sbjct: 452 QPKETTNGMCEVIRNKLDTLNGDKNTKIVFVEFGYKSTSKQAWEHCV-GNGNYYSANDKE 510 Query: 375 ELLESFDK 382 LL SF + Sbjct: 511 SLLNSFKQ 518 >gi|89068023|ref|ZP_01155440.1| type II/IV secretion system protein, TadC subfamily protein [Oceanicola granulosus HTCC2516] gi|89046262|gb|EAR52319.1| type II/IV secretion system protein, TadC subfamily protein [Oceanicola granulosus HTCC2516] Length = 987 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 54/154 (35%), Gaps = 10/154 (6%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 N G N ++ + N + ++ L + T A+ S+ Sbjct: 131 DRAHILNFGDSINVAVGMTADRNRLDQAISGLRAWGATRLNDAV----------FASAGA 180 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 G+ V+ + + A + +++ + +G +Y+V + + L Sbjct: 181 LAGAEGRGAIVLLSEGPDADPSGAPLSVVDSEAALAAVVESGAPVYAVGLGPGADAALLR 240 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 R + G +F V D+ L +F I +++ Q Sbjct: 241 RLAEATGGAYFPVADAAALPATFSDIATRLRHQY 274 >gi|315502365|ref|YP_004081252.1| von willebrand factor type a [Micromonospora sp. L5] gi|315408984|gb|ADU07101.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 319 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 20/161 (12%), Positives = 55/161 (34%), Gaps = 30/161 (18%) Query: 252 CTPLSNNLNEVKSRLNKLNPYE-----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P S + + + +L T A+ + + + ++ Sbjct: 140 LVPPSTDREALHDGIGRL-AEGITGVQGTAIGEAISTSLGAVKSLDATAAK----DPPPA 194 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD----------- 355 +I ++DG N+ ++ ++ + A + +++++ P D Sbjct: 195 RIIILSDGANTSG------MDPMEAADQAVAAKVPVHTISFGTPGGSVDRGGRAIQVPVD 248 Query: 356 --LLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 LR + + G F + + EL + ++ I + ++ R Sbjct: 249 GQTLRAVAEQTGGGFHEASTTAELKDVYEDIGTSVGYRTER 289 >gi|302865820|ref|YP_003834457.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|302568679|gb|ADL44881.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] Length = 319 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 20/161 (12%), Positives = 55/161 (34%), Gaps = 30/161 (18%) Query: 252 CTPLSNNLNEVKSRLNKLNPYE-----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P S + + + +L T A+ + + + ++ Sbjct: 140 LVPPSTDREALHDGIGRL-AEGITGVQGTAIGEAISTSLGAVKSLDATAAK----DPPPA 194 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD----------- 355 +I ++DG N+ ++ ++ + A + +++++ P D Sbjct: 195 RIIILSDGANTSG------MDPMEAADQAVAAKVPVHTISFGTPGGSVDRGGRAIQVPVD 248 Query: 356 --LLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 LR + + G F + + EL + ++ I + ++ R Sbjct: 249 GQTLRAVAEQTGGGFHEASTTAELKDVYEDIGTSVGYRTER 289 >gi|218517234|ref|ZP_03514074.1| hypothetical protein Retl8_28685 [Rhizobium etli 8C-3] Length = 176 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 26/199 (13%), Positives = 55/199 (27%), Gaps = 28/199 (14%) Query: 7 SVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKK 66 + + + D +R +MQS LDAA+++ I + K + + F Sbjct: 1 MPMLVAVGASFDYIRSYNVRQKMQSDLDAALIAAVKQINNTGDTD---ALKLKVTDWFHA 57 Query: 67 QIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPS 126 Q++ + + + + + A +PT + Sbjct: 58 QVENSY-------------------TLGEIDIDTTNHNITATASGTVPTTFM------KI 92 Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 A + S + +++ +V+D S SM Sbjct: 93 ANIDTVPVSVASAVKGPATSYLNVYIVIDTSPSMLLAATTAGQATMYSGIGCQFACHTGD 152 Query: 187 KSFWSKNTTKSKYAPAPAP 205 T + Y + A Sbjct: 153 SHKVGNKTYANNYEYSTAK 171 >gi|297202051|ref|ZP_06919448.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] gi|197714313|gb|EDY58347.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] Length = 518 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 43/358 (12%), Positives = 97/358 (27%), Gaps = 30/358 (8%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 A+ LSG S ++D + + FK Q G A +N Sbjct: 178 ASALSGAQSALTDADVT---KAAPRLKEFFKGQKLTSGSSGWLAAAYDRRGDVDALLNYE 234 Query: 95 KDKNNPLQYIA----ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 + + P +L E A + Sbjct: 235 SVLLGIPGLTVIRPRDGVITADYPLTSLRSTSARTREDVRRVSEDLRTERIQREITARTH 294 Query: 151 CMVLDVSRSMEDLYLQKHNDNNN----MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 + S + LL + T Sbjct: 295 RRPVVASVPPASGLDTTRRRELPFPGTRSVADGLLDSYENELRRPSRTVYVLDTSGSMEG 354 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 ++D L + +L ++ + + G+ ++ + + L+ ++ Sbjct: 355 -DRLDRLKTALADLTGDFREREEVT---LMPFGSQVKSVRTHVVKPSDPRAGLDAIRDDT 410 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + L+ +T Y ++ AY L +++ ++ +TDGEN+ + ++ Sbjct: 411 SALSADGDTAIYTSLEKAYDHLGAGRDAFT----------SIVLMTDGENTAGAKARDFD 460 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 + ++ + + L D + G+ F L +F++I Sbjct: 461 AFYARLGR-KARDTPVFPILFG--DSDRSELAHIADLTGGRLFDARQGS-LDGAFEEI 514 >gi|242074986|ref|XP_002447429.1| hypothetical protein SORBIDRAFT_06g000920 [Sorghum bicolor] gi|241938612|gb|EES11757.1| hypothetical protein SORBIDRAFT_06g000920 [Sorghum bicolor] Length = 519 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 31/242 (12%), Positives = 67/242 (27%), Gaps = 31/242 (12%) Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + +V + + + N L + Sbjct: 15 SGPRPTPIVPGRVQLVSKNNNMAPLEENTQKVLLELTGGDSTSDRSGLDLVAVLDVSGSM 74 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--V 262 KID + + +V + R+ + + + ++ + Sbjct: 75 QGE-KIDKMKTAMKFVVKKLSSID--------RLSIVTFMDTATRICPLRQVTDASQPEL 125 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 ++ LNP NTN + + L + SS +G V+ ++DG+ + Sbjct: 126 LGLIDALNPGGNTNITDGLQTGLKVLADRNLSSGRVVG-------VMLMSDGQQNRGGN- 177 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSRELLESF 380 ++ +Y+ A +L G F VND +L +F Sbjct: 178 ---------AADVKIGNAPVYTFGFGAD-YDPTVLNAVARNSMGGTFSVVNDVDKLSMAF 227 Query: 381 DK 382 + Sbjct: 228 SQ 229 >gi|212635209|ref|YP_002311734.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212556693|gb|ACJ29147.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 360 Score = 64.9 bits (156), Expect = 2e-08, Method: Composition-based stats. Identities = 28/202 (13%), Positives = 68/202 (33%), Gaps = 32/202 (15%) Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 S ++D + R+G I + Sbjct: 123 IANSSGTEISDEYISRLDAVKRVLHEFAEQ---------RQGDRLGLILFGDAAYLQA-- 171 Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P + +L L++ ++T+ A+ A + + +++ S K V+ Sbjct: 172 PFTADLASWLRLLDESRVAMAGQSTHVGDALGLAIKVMSSDEIKSSQK------NKVVLL 225 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRK------CTDS 363 +TDG ++ +S L+ + G++++ +A+ P G+ + + Sbjct: 226 LTDGNDTDSSVP-----PLEAAKIAAKKGIRVHVIAIGDPQTVGEQAMDMEVIEGVAALT 280 Query: 364 SGQFFAVNDSRELLESFDKITD 385 G+ F ++EL + + I+ Sbjct: 281 GGKAFKAISTQELNKVYQTISK 302 >gi|281340555|gb|EFB16139.1| hypothetical protein PANDA_003424 [Ailuropoda melanoleuca] Length = 191 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 69/200 (34%), Gaps = 18/200 (9%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + ++ N ++ +++ N VRI I Y+ ++++ NE Sbjct: 1 NSCFFISRSGSVNNNWMDIYNMVEDVVKKFDNPKVRISFITYSTDGH--TLMKITSDKNE 58 Query: 262 VKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ L KL P T+ + A ++ E + ++ +TDG Sbjct: 59 IRENLAKLQNVVPSGATHMQEGLRKANEQIEQENAGE------KKAPIVILALTDG---- 108 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND-SRELL 377 T E R G +Y + V +D L DS F V++ + L Sbjct: 109 TLLPFPFEETKMEAEESRRLGATVYCIGV--KDYRKDQLLDIADSPDHMFGVDNGFKGLQ 166 Query: 378 ESFDKITDKIQEQSVRIAPN 397 + K ++ P+ Sbjct: 167 NIVGPLASKSCIDVTKVEPS 186 >gi|113866742|ref|YP_725231.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] gi|113525518|emb|CAJ91863.1| flp pilus assembly protein TadG [Ralstonia eutropha H16] Length = 417 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 28/283 (9%), Positives = 85/283 (30%), Gaps = 10/283 (3%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDR----TIKDPTTK 56 + + ++V F+ A+DL + ++++Q+++DA L+ + + Sbjct: 24 IVGLSLAVLIGFVGLALDLGKLYVTKSELQNSVDACALAAARDVTGATPLLVSEAAGLAT 83 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 + +F+ + + + + + + N + ++Y+ + + I Sbjct: 84 GTSNAALFQGK-AVEMFENLNVSYSDTPDSTFYTKNNVPFSLDKVKYVKCTAERKGIAHW 142 Query: 117 NLFLKGLI-PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + + L+ + ++ + + +S A +I + + + + + + Sbjct: 143 FIHMLNLLPGIDIKASTVNAMAVATTTSAQAACAIPVYVCSPPTANPAKTAYNRGDWIKS 202 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI-ESAGNLVNSIQKAIQEKKNL 234 P P W+ + + A A G+ V Sbjct: 203 RVDPSDPYGPGSFGWADLSPPAGGASELADLLAGSGQCDLPVVGSKVGEPGSIASLIAAW 262 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 + R G + + + + N P Sbjct: 263 NTRFGIYTGSYKDPKDGAPDFT-GYAYTTTTWNA--PNGGNAY 302 >gi|332226724|ref|XP_003262542.1| PREDICTED: anthrax toxin receptor 1 isoform 2 [Nomascus leucogenys] Length = 333 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|297266186|ref|XP_002799329.1| PREDICTED: anthrax toxin receptor 1-like [Macaca mulatta] Length = 484 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|307154093|ref|YP_003889477.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306984321|gb|ADN16202.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 240 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 15/136 (11%), Positives = 44/136 (32%), Gaps = 10/136 (7%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + +K+++N++N TN + + V+ +TDG + Sbjct: 97 DQAAIKNQINRINARGCTNLSGGWLTGCDHVKANLSAERLNR--------VLLLTDGLAN 148 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + + G+ ++ L+ G F+ + ++ Sbjct: 149 VGN--SDPKILTKTATEKAEQGIITTTLGFGTYFNEDLLINMANGGKGNFYFIQSPQDAA 206 Query: 378 ESFDKITDKIQEQSVR 393 + F+ + + +V+ Sbjct: 207 QVFEIEIESLVSDAVK 222 >gi|158257658|dbj|BAF84802.1| unnamed protein product [Homo sapiens] Length = 333 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|114577872|ref|XP_001136716.1| PREDICTED: anthrax toxin receptor 1 isoform 5 [Pan troglodytes] Length = 321 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|114577870|ref|XP_001136388.1| PREDICTED: tumor endothelial marker 8 isoform 1 [Pan troglodytes] Length = 326 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|114577874|ref|XP_001136473.1| PREDICTED: tumor endothelial marker 8 isoform 2 [Pan troglodytes] Length = 297 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|114577864|ref|XP_525774.2| PREDICTED: anthrax toxin receptor 1 isoform 6 [Pan troglodytes] Length = 564 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|62898107|dbj|BAD96993.1| tumor endothelial marker 8 isoform 3 precursor variant [Homo sapiens] Length = 317 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|16933551|ref|NP_444262.1| anthrax toxin receptor 1 isoform 2 precursor [Homo sapiens] gi|114577866|ref|XP_001136564.1| PREDICTED: anthrax toxin receptor 1 isoform 3 [Pan troglodytes] gi|332226726|ref|XP_003262543.1| PREDICTED: anthrax toxin receptor 1 isoform 3 [Nomascus leucogenys] gi|16566413|gb|AAL26496.1|AF421380_1 anthrax toxin receptor [Homo sapiens] Length = 368 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|16933553|ref|NP_060623.2| anthrax toxin receptor 1 isoform 3 precursor [Homo sapiens] gi|114577868|ref|XP_001136638.1| PREDICTED: anthrax toxin receptor 1 isoform 4 [Pan troglodytes] gi|15082333|gb|AAH12074.1| Anthrax toxin receptor 1 [Homo sapiens] gi|190690489|gb|ACE87019.1| anthrax toxin receptor 1 protein [synthetic construct] gi|190691863|gb|ACE87706.1| anthrax toxin receptor 1 protein [synthetic construct] gi|312150350|gb|ADQ31687.1| anthrax toxin receptor 1 [synthetic construct] Length = 333 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|62988680|gb|AAY24067.1| unknown [Homo sapiens] Length = 234 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|62870691|gb|AAY18344.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 4 [Homo sapiens] Length = 411 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|62870685|gb|AAY18341.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 1 [Homo sapiens] Length = 387 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|62870689|gb|AAY18343.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 3 [Homo sapiens] Length = 410 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|62870687|gb|AAY18342.1| anthrax toxin receptor/neuroblastoma fusion protein transcript variant 2 [Homo sapiens] Length = 419 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|83745131|ref|NP_001032954.1| collagen alpha-1(XXVIII) chain precursor [Mus musculus] gi|123789585|sp|Q2UY11|COSA1_MOUSE RecName: Full=Collagen alpha-1(XXVIII) chain; Flags: Precursor gi|83423286|emb|CAI67593.1| collagen, type XXVIII [Mus musculus] gi|189442117|gb|AAI67245.1| Collagen, type XXVIII, alpha 1 [synthetic construct] Length = 1141 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 58/162 (35%), Gaps = 20/162 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 + RIG I Y+ + ++ ++ K ++ + T T A+ A + Sbjct: 834 TARIGIINYSHKVEKVASLKQFSSKDDFKLVVDNMQYLGEGTYTATALQAANDMFKEARP 893 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +KK + ITDG+ ++ + + ++ ++I+ + V + Sbjct: 894 G---------VKKVALVITDGQTD----SRDKKKLADVVKDANDSNVEIFVIGVVKKDDP 940 Query: 354 Q-----DLLRKCTDSSGQFFAVNDSRELLESFDK-ITDKIQE 389 + + + +D L ++ + ++ KI E Sbjct: 941 NFEIFHKEMNLIATDAEHVYQFDDFFTLQDTLKQKLSKKICE 982 Score = 43.4 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 33/160 (20%), Positives = 55/160 (34%), Gaps = 18/160 (11%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN--NLNEVKSRLN 267 D + +L I + ++L I A PLS+ +L K R+ Sbjct: 64 FDNQKDFVDSLSEKIFQLTP-GRSLKYDIKLAALQFSSSVQIDPPLSSWKDLRTFKQRVK 122 Query: 268 KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 LN T +Y A+ +A R L E K + +TDG + + + Sbjct: 123 SLNLIGQGTFSYYAISNATRLLKREGRKDGV--------KVALLMTDGID-----HPKSP 169 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 + I E R G+ +V +S + LR + Sbjct: 170 DVQSISEDARILGISFITVGLST-VVNEAKLRLISGDPSN 208 >gi|14149904|ref|NP_115584.1| anthrax toxin receptor 1 isoform 1 precursor [Homo sapiens] gi|332226722|ref|XP_003262541.1| PREDICTED: anthrax toxin receptor 1 isoform 1 [Nomascus leucogenys] gi|17366074|sp|Q9H6X2|ANTR1_HUMAN RecName: Full=Anthrax toxin receptor 1; AltName: Full=Tumor endothelial marker 8; Flags: Precursor gi|14017381|gb|AAK52094.1| tumor endothelial marker 8 precursor [Homo sapiens] Length = 564 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|296209627|ref|XP_002751626.1| PREDICTED: collagen alpha-1(XXVIII) chain [Callithrix jacchus] Length = 1125 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 58/162 (35%), Gaps = 20/162 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 + RIG I Y+ + ++ ++ K ++ + T T A+ A + + Sbjct: 834 TARIGVINYSHKVEKVANLKQFSSKDDFKLAVDNMQYLGEGTYTATALQAANDMFKDARP 893 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +KK + ITDG+ ++ ++ + + + ++I+ + V + Sbjct: 894 G---------VKKVALVITDGQTD----SRDKEKLTEVVKNVSDTNVEIFVIGVVKKNDP 940 Query: 354 Q-----DLLRKCTDSSGQFFAVNDSRELLESFDK-ITDKIQE 389 + + +D L ++ + + KI E Sbjct: 941 NFEIFHKEMNLIATDPEHVYQFDDFFTLQDTLKQKLFQKICE 982 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 55/159 (34%), Gaps = 16/159 (10%) Query: 210 IDVLIESAGNLVNS-IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 D + L + + + +++ + ++ + + +L K ++ Sbjct: 64 FDKQKDFVDGLSDKIFRLTPRRSVEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N T +Y A+ +A + L E K + +TDG + + + Sbjct: 124 MNLIGQGTFSYYAISNATKLLKREGRKGSV--------KVALLMTDGID-----HPKNPD 170 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 I + R +G+ ++ +S + LR + S Sbjct: 171 VQSISDDARISGISFITIGLST-VVNEAKLRLISGDSSN 208 >gi|271963053|ref|YP_003337249.1| hypothetical protein Sros_1513 [Streptosporangium roseum DSM 43021] gi|270506228|gb|ACZ84506.1| hypothetical protein Sros_1513 [Streptosporangium roseum DSM 43021] Length = 605 Score = 64.5 bits (155), Expect = 2e-08, Method: Composition-based stats. Identities = 45/365 (12%), Positives = 103/365 (28%), Gaps = 26/365 (7%) Query: 27 NQMQSALDAAV-----LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIREN 81 + +Q A DA + S D + + K L Sbjct: 261 SNLQRADDAGAGLSYISAVAVEEKSVWDYNQGNPTGDPKTLGKHGKPKVPLVAIYPKEGT 320 Query: 82 AGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIER 141 A + + Q A+ A + P + + A + ++ +I Sbjct: 321 LYSDNPYAVLTAPWVDDAKRQVAADFLAHLQAPEQ---QRRFADFAFRSHEGKAGKLITE 377 Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 ++ L L + + P + + S A Sbjct: 378 ANGLLPAEPRTALSPPAPNVLDKVLSSWADLRK----------PANVLMVIDVSGSMGAG 427 Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 P K+D+ ++A N + + + + + T + Sbjct: 428 VPDTGRSKLDLAKQAAINALPQFGPHDKVGLWMFSTKRDGEKDHLELAPLDTVDAAQRKT 487 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 +++RL+ L P T Y AY+ + + VIF+TDG+N ++ Sbjct: 488 LRTRLDGLTPDGGTGLYDTALAAYQHVRDRHSGEAI--------NAVIFLTDGKNEDNNS 539 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + ++++++A + L + ++ + +S + + F Sbjct: 540 LSLENLLPDLRAESAEESVRMFTIAYGQDADLGVLKQISETTNAAAYDSRESGSIDQVFT 599 Query: 382 KITDK 386 + Sbjct: 600 AVVSN 604 >gi|300782091|ref|YP_003762382.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] gi|299791605|gb|ADJ41980.1| von Willebrand factor type A [Amycolatopsis mediterranei U32] Length = 602 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 43/361 (11%), Positives = 85/361 (23%), Gaps = 28/361 (7%) Query: 27 NQMQSALDA-AVL----SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIREN 81 +Q A D A L + S D K + Sbjct: 260 TNLQKADDRGAALSYISAVTVEESSLIGYNQGNPTNDPAKVGQHAPPKVPIVAIYPADGT 319 Query: 82 AGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIER 141 +N + + + G ST + Sbjct: 320 LNSDHPFVTLNWADPTRKQIAADFLGYLRGPETQQRFAALGFRSFDGKPGPQASTANGVQ 379 Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 ++ V + + N + + S Sbjct: 380 PDAKISFLQPPSPTVLAKLLTTWTDLRKKANVLLVVD---------------VSGSMGDE 424 Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 KID+ ++A + + Q + + SN Sbjct: 425 VKGTGKSKIDLAKQAAIDSLGQFVPRDQVGLWQFATHLDGDKDYQELLPVQPLGSNGKET 484 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + SRL+ L P T Y + AY L + S V+ +TDG N Sbjct: 485 LASRLSGLTPQSGTGLYDSSLAAYEYLKAHLDPSAI--------NAVVVLTDGRNEDPGG 536 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 Q+ ++++++A + L + ++G + + + + F Sbjct: 537 VDLDHLVPQLRPEGNAESVRLFTIAYGGDADQNVLKQIAEATAGSEYDSSKPDSINQVFT 596 Query: 382 K 382 Sbjct: 597 S 597 >gi|295092462|emb|CBK78569.1| von Willebrand factor type A domain. [Clostridium cf. saccharolyticum K10] Length = 2061 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 45/337 (13%), Positives = 100/337 (29%), Gaps = 29/337 (8%) Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 ++ ++ Y + D + + T ++ QY + Sbjct: 1415 YILADGEEPSEENQYKGDEKTDDPSVPEDSQTSSGKPGFPANKKATLQYTYDGGTGRFEY 1474 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 P + +R N + + LDV+ + ++ Sbjct: 1475 PHPVLQIPEPVLPDEYNKRIEPNDDGTYSLTLDVTGIEGNPATVTTKYPVDLVFVI---- 1530 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 K + + + D++ ++ ++ + + I Sbjct: 1531 --DKSLSMDYDIDGDEIKWWEDETESRKDIVNDALDEII-------PDLCSQQYDIQIAG 1581 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 Y + S +V + L N NT A+ A L + + Sbjct: 1582 YQFSGSSTRVLDWSREEQQVLNNLKISNTSYNTEPSQALADALDMLKTGSQ---AHQNQS 1638 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +KK++IF+TDGE + + + G IY++ VS+ DL+ Sbjct: 1639 NVKKYLIFMTDGEPTESEELSYYAISKNPVP-----GASIYTIGVSSDA-STDLMEGIRS 1692 Query: 363 SSGQ-------FFAVNDSRELLESFDKITDKIQEQSV 392 ++ F ++ + ++F +I D+I S Sbjct: 1693 TAEGNGMTAPATFKGTSAQLIKDAFTQIKDEIISTST 1729 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 40/335 (11%), Positives = 91/335 (27%), Gaps = 24/335 (7%) Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 S+ + I D N + + N T Sbjct: 996 SFDIKLKEGTVITPDDKIEGDPNTDYRKNENISSGQRGVPTNADAYFSFGEDGTTKVRFP 1055 Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 +I + N + D L ++ + P P + + + Sbjct: 1056 HPVIPAPATNHPEYKKYIKDNG--NGTYTLTLDVKSDVGSVTTGQKDPTPTAVMFVIDKS 1113 Query: 196 KS-KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE----KKNLSVRIGTIAYNIGIVGN 250 S + ++ + +V+ + N + S + +N + Sbjct: 1114 GSMDQSFGSGNSDARREVVNSALELFFNQLSDGDYNIQFGGYKFSDSGERVNFNDWGWQD 1173 Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + + +L +T + A L N + + ++IF Sbjct: 1174 KYWETDTSNALSHLKLTSWETDGSTYPSQTLRSAISALENVELGENGKR-------YLIF 1226 Query: 311 ITDGENSGASAYQNTLNTLQICEYMR--NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF- 367 +TDGE S + ++ ++G Y++ V A + + ++ Sbjct: 1227 LTDGEPGQNSYSFSKEEAENCYSAIKNLDSGTTFYAIQV-ANSDSHGFMESMVSNANFVD 1285 Query: 368 ------FAVNDSRELLESFDKITDKIQEQSVRIAP 396 F N + EL +F ++ +I + P Sbjct: 1286 GVTAQKFVGNSADELNAAFSQMAAEISGSAGTTVP 1320 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 58/187 (31%), Gaps = 27/187 (14%) Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + I +V+D+S SM+D + +D+ L K + Sbjct: 638 QQGLDIVLVIDLSNSMDDGISEDSSDSRLKVLKDTL--------------GYYKESYNSR 683 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 P ID +L + S T N + +K Sbjct: 684 PGKPTIDEKSGFIDDLFEQSPNSRFSIVTYSTYASTELDWTEYGMN-----GSGQQTIKE 738 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + +L TN ++ A L G++ VIF++DG+ + + + Sbjct: 739 AIGELQANGGTNYEAGLYQAMEVLKE--------RGNSSNIPVVIFLSDGKPTYYYSDVD 790 Query: 325 TLNTLQI 331 N L+I Sbjct: 791 EFNGLEI 797 >gi|291411003|ref|XP_002721794.1| PREDICTED: integrin, alpha X (complement component 3 receptor 4 subunit-like [Oryctolagus cuniculus] Length = 1100 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 55/191 (28%), Gaps = 21/191 (10%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + + +++ + S N + Sbjct: 160 SGSISFSNFATMKNFVKAVMSQF----PRPSTQFSLMQFSNEFQTHFTFNDFVSSTNPLQ 215 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + R+ +L T+T A+ +L++ + K +I ITDG+ Sbjct: 216 LLDRVYQL--MGTTHTATAILRVVDQLFHASTGAR-----KDATKILIVITDGQ-----K 263 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQDLLRKCTDSSG--QFFAVNDSREL 376 + L + AG+ Y++ V L F V + L Sbjct: 264 LDDPLGYEDVIPKAEAAGIIRYAIGVGLAFQVVSSLRELHDIASEPAHEHVFRVENFDAL 323 Query: 377 LESFDKITDKI 387 + ++ +KI Sbjct: 324 RDIQGRLKEKI 334 >gi|90413889|ref|ZP_01221875.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium profundum 3TCK] gi|90325073|gb|EAS41583.1| inter-alpha-trypsin inhibitor domain protein [Photobacterium profundum 3TCK] Length = 714 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 59/147 (40%), Gaps = 14/147 (9%) Query: 248 VGNQCTPLSNNL-NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 Q P++++ ++ L+ T A+ A+ +H+ + STR Sbjct: 382 YSEQLLPVTSSTITRALRFVDGLDADGGTEMAAALKAAFSI------KTHDQLNSTRWLN 435 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 ++FITDG SA + + + + + ++++V + + P + R G Sbjct: 436 QIVFITDGSVGNESALFDLIE-----QQLVDR--RLFTVGIGSAPNSYFMTRAAMKGKGT 488 Query: 367 FFAVNDSRELLESFDKITDKIQEQSVR 393 + + D +E+ + KI + +R Sbjct: 489 YTYIGDVKEVNTKMRLLFSKISQPVMR 515 >gi|254416823|ref|ZP_05030572.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196176369|gb|EDX71384.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 538 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 21/196 (10%), Positives = 55/196 (28%), Gaps = 15/196 (7%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + + + + + + + Y+ L + +K++ Sbjct: 46 VLDRSGSMAGAPLRYAIQAAQNLIDYLTADDFVSVVIYDDTAEVIIPPQLVGDQAALKAK 105 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + K+ TN ++ + V+ +TDG + Sbjct: 106 IGKIRARGCTNLSGGWLLGCSQVQANQSPERINR--------VLLLTDGLANYGIKDPQV 157 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 L T E A + ++ L+ + G F+ + + + F+ + Sbjct: 158 L-TKTALEKA-EADIVTTTLGFGNYFNEDLLINMANAARGNFYFIQSPDDASQVFEIEME 215 Query: 386 KIQE-----QSVRIAP 396 + VR+ P Sbjct: 216 SLVSVVAQNLRVRLQP 231 >gi|118349478|ref|XP_001008020.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89289787|gb|EAR87775.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 642 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 22/125 (17%), Positives = 51/125 (40%), Gaps = 10/125 (8%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N ++K +N ++ +NTN +M A+ L S + ++DG +S Sbjct: 257 NKQDLKRIINNISITQNTNITKSMIKAFNILQ--------FRQSQNKVSSIFLLSDGVDS 308 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 A + Q + ++N I+S + + + + C+ +G F+ + + ++ Sbjct: 309 SAEKQIQNYISSQ--QSLQNKNFAIHSFGYGFDQDAEMINKICSLKNGNFYYIQNMNQVD 366 Query: 378 ESFDK 382 + F Sbjct: 367 QYFAD 371 >gi|86145196|ref|ZP_01063527.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] gi|85836773|gb|EAQ54893.1| hypothetical protein MED222_04745 [Vibrio sp. MED222] Length = 359 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 67/154 (43%), Gaps = 19/154 (12%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + + E+ ++ + ++T+ A+ A + ++ S S+ Sbjct: 154 GDAAFVQTPFTADQDVWLELLNQTDVAMAGQSTHLGDAIGLATKVFEQSEKQSAAVQDSS 213 Query: 303 R----LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------ 352 +K VI +TDG ++G + + + + + G++I+ +A+ P Sbjct: 214 IDANVKEKVVIVLTDGNDTG-----SFVEPIDAAKVAKAKGVRIHVIAMGDPQTVGEVAL 268 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + +++ +S G+ F + EL +++ +I + Sbjct: 269 DMETIKRVAQESGGEAFEALNRDELTKAYAQIGE 302 >gi|194224466|ref|XP_001500626.2| PREDICTED: matrilin 4 [Equus caballus] Length = 542 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 53/153 (34%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 67 DVGPNATRVGVIQYSSQVQSVFPLSAFSRREDMERAIRALVPLAQGTMTGLAIQYAMNVA 126 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 127 FSVAEGARP--PEARVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 176 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V+ + E Sbjct: 177 RADVGS--LRAMASPPLDEHVFLVDSFDLIQEF 207 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 58/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 321 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 375 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 376 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 429 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V ++ LRK D + + + Sbjct: 430 ----AARAKEEGIVMYAVGVGKAV--EEELRKIASEPAELHVSYSPDFGTMTHLLENLKG 483 Query: 386 KIQ 388 I Sbjct: 484 SIC 486 >gi|147902754|ref|NP_001082889.1| calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Danio rerio] gi|94732178|emb|CAK04720.1| novel protein similar to vertebrate calcium channel voltage-dependent alpha 2 delta subunit 2 (CACNA2D2) [Danio rerio] Length = 1052 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 37/325 (11%), Positives = 95/325 (29%), Gaps = 18/325 (5%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 + ++ ++ ++ K+H Q D +++ D + Q A Sbjct: 57 LAKKRKALERLASEAERLQKEHRWQDGIKDGEDIDSQMSLKLDFVYDPSFKNQVNYSHTA 116 Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 +IPT+ +I + L IE S ++ ++ + Sbjct: 117 -VQIPTDIYKGAPVILNELNWTQALERVFIENSRDDPSLLWQAFGSATGVTRYYPAAPWR 175 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 + + P + S + + + + ++ S +++++ Sbjct: 176 APDKIDLYDVRRRPWYIQGASSPKDMVILVDVSGSVSGLTLKLIKASVTEMLDTLSDDDY 235 Query: 230 -EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 + + + + N K + ++ T+ H A+ +L Sbjct: 236 VNVARFNEKAEAVVPCFDHLVQANVR---NKKIFKEAVQQMQAKGTTDYKSGFHFAFNQL 292 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 N+ K ++ TDG A N N +++++ +V Sbjct: 293 LNKTNVPRANCN-----KIIMLFTDGGEDRAQDIFEQYNWP-------NKTVRVFTFSVG 340 Query: 349 APPEGQDLLRKCTDSS-GQFFAVND 372 L+ S+ G +F + Sbjct: 341 QHNYDVTPLQWIACSNKGYYFEIRS 365 >gi|254517645|ref|ZP_05129701.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] gi|226911394|gb|EEH96595.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] Length = 979 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 45/379 (11%), Positives = 95/379 (25%), Gaps = 82/379 (21%) Query: 82 AGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN----LSLRSTG 137 I+ + + + ++ + + P+ Sbjct: 7 FKIISIITSLFVISTLISLKSLNVKATGEIPDKPDFDLEISASPNPAMVGEDITVGGKII 66 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN---MTSNKYLLPPPPKKSFWSKNT 194 + A I +VLD+S SM++ + + W + Sbjct: 67 PKPFETAIPAKEIVLVLDISGSMDEEIENPCTNKRVRYCTRHSSSDPNHEEWFLSWHRWI 126 Query: 195 TKSKYAPAPAP------ANRKIDVLIESAGNLVNSIQKAIQ--------------EKKNL 234 + N+KID L +A + ++ + Sbjct: 127 NDYCVEHNTSGEHNITANNKKIDELKRAANGFIERMKDVPNLKIGIVAYSSIATINPNSK 186 Query: 235 SVRIGTIAYNIGIVGN-------QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 S + + + L +N + + S +N L TN M A Sbjct: 187 SGTKKVKSLDSNSSHDVTNYNSLGANFLQSNDSRLTSVINNLEALGGTNIGEGMRKAVYM 246 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT------------------- 328 L + G K ++ +TDG + S + N Sbjct: 247 LDS---------GDKSASKTIVLMTDGLPTFYSVTGSNKNNYMTIDNTEPKIAGIGTGLD 297 Query: 329 -------LQICEYMRNAGMKIYSVAVSAPPEGQDLL---------RKCTDSSGQF----F 368 + E +++ G +S+ +G D L + F Sbjct: 298 TKSINYSRAVGEIIKSRGYNSFSIGYGLDTDGNDTLLSIHEAMTGVSIKGKPDLYESSGF 357 Query: 369 AVNDSRELLESFDKITDKI 387 + + F++I +I Sbjct: 358 FPTSTNAIQAVFNRIATQI 376 >gi|46143335|ref|ZP_00135441.2| COG4961: Flp pilus assembly protein TadG [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 520 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 61/475 (12%), Positives = 125/475 (26%), Gaps = 109/475 (22%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIF--------- 64 +++ A I+ + ++ +L+ AVLS A S R D Sbjct: 36 FVSLESAGIIQDQARLSDSLEQAVLSLTAENNSGRKDNDYKLSGSSNKENDSFDISSEVG 95 Query: 65 --KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN---PLQYIAESKAQYEIPTENLF 119 Q+ Q + N + N + ++ F Sbjct: 96 KRDNQMVTTFVQAFLPQTNEKAMRLTPTCKTVTTDNKKGHTSSSEVTCTVSGTVEHKSWF 155 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENL---AISICMVLDVSRSMEDLYLQKHNDNNNM-- 174 + + + +N I + +V D+S SM K NN Sbjct: 156 PLKVGNLEVIPQQVDVASKSRAFKKNTFNIPIDLMVVADLSGSMNFDLDNKKIINNAKPS 215 Query: 175 -----------TSNKYLLPPPPKKSF--------------------------WSKNTTKS 197 + K L + +SK+ Sbjct: 216 KIRILKEVLEELAAKSLFNQDSNNNNRIAVAPFALGAQHSNNQCIIPFILKKYSKDRISE 275 Query: 198 KYAPAPAPANRKIDVLIESAG-NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 K + AN I + + + I K I I + + Sbjct: 276 KNIKSYLSANNNISAKDFALSLSYLVDIDKTINSIGGTFSSNSIIFNKNKFCLGRSNKNT 335 Query: 257 ------NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + + S + +L+ +T + A + KE S + + K+ ++ Sbjct: 336 HHWYNRDESSNFFSFIKRLHAEGSTLASSGLITASNIML--KEESRSKSLGEQTKRVILV 393 Query: 311 ITDGEN----------------------------SGASAYQ-------NTLNTLQICEYM 335 ++DG + + + + + +C+ + Sbjct: 394 LSDGNDELRLNDEGTPFTQYSRITENLLLGQEEQTTDTYPYFMSKPPKKLTSNINVCDRI 453 Query: 336 R--------NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 R + KI V + + + C +G +++ ND LL SF + Sbjct: 454 RNKLDEHNEDKNTKIVFVEFGYASKAKQAWQHCV-GNGNYYSANDKASLLNSFKQ 507 >gi|297199802|ref|ZP_06917199.1| lipoprotein [Streptomyces sviceus ATCC 29083] gi|197710264|gb|EDY54298.1| lipoprotein [Streptomyces sviceus ATCC 29083] Length = 506 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 32/298 (10%), Positives = 84/298 (28%), Gaps = 27/298 (9%) Query: 93 ITKDKNNPLQYIAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 +T + + + A Y L + + Sbjct: 67 VTPSPDYLSTFALDVDTASYGYARRTLAEGRRPDPSTIRPEEFVNSFRQDYDRPDGNGFT 126 Query: 152 MVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 + +D +R+ +D + ++ + PP + + + S P ++D Sbjct: 127 VTVDGARTSKDDWSLVRVGLATRSAGENADERPPAALTFVIDISGSMSEPG------RLD 180 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNKLN 270 + S + ++ + + ++ L + + + ++ L Sbjct: 181 LAQRSLDTMTERLRDDDS--------VALVTFSDRARKVLPMTRLGGHRDRIHEAIDGLE 232 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P +TN + Y+ V+ I+D + + + Sbjct: 233 PTYSTNLGAGVETGYKTAVEGLRKGATNR--------VVLISDALANDGETDPD-AILER 283 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF-DKITDKI 387 I R G+ ++ V V + G L+ + D + + F +++ I Sbjct: 284 IDTARREHGITLFGVGVGSD-YGDALMERLADKGDGHTVYVSGDDAEKVFCEQLPQNI 340 >gi|297622708|ref|YP_003704142.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] gi|297163888|gb|ADI13599.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] Length = 329 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 32/253 (12%), Positives = 80/253 (31%), Gaps = 27/253 (10%) Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 + V V+ Q++ +T + + + S P Sbjct: 72 SNPRATVTSVTPLPGLAAAQRYTATATITVDITVQEVINAVLNMDR----SGSMRLNDPE 127 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 ++D + + A+ E S +++ +++ L Sbjct: 128 RLRVDAAKSFLERVTPEDRIAVMEFPGQSSGFRA--------STLLQGFTSDKALLEAAL 179 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +++ NT + L + + G + V+ TDGE G Sbjct: 180 DRVGQRGNTPI-------WDSLLDTLDLHAADEGGQGASRVVLLFTDGEREGGQVAFGEA 232 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + + ++++++ + + + +L ++ G F V + EL E F + + Sbjct: 233 LAAAL-----ESDVRVFTIGLGSDIDTAELQELAAETGGTFANVASAAELEELFQRAFNA 287 Query: 387 IQEQ---SVRIAP 396 I+ ++ I+P Sbjct: 288 IRASGTITLSISP 300 >gi|149727851|ref|XP_001493364.1| PREDICTED: anthrax toxin receptor 1 [Equus caballus] Length = 603 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFQRASEQIYYENSQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|26350291|dbj|BAC38785.1| unnamed protein product [Mus musculus] Length = 810 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 55/159 (34%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + R+G + Y + +EV+ + ++ T T A+ +A Sbjct: 88 DIGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEVERAVKRMRHLSTGTMTGLAIQYALNIA 147 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + + + + ++ +TDG + ++ RN G+ I+++ V Sbjct: 148 FSEAEGARPLREN--VPRIIMIVTDGRPQDS--------VAEVAAKARNTGILIFAIGVG 197 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE---LLESFDK 382 + ++ F V + + L F Sbjct: 198 --QVDLNTVKAIGSEPHKDHVFLVANFSQIESLTSVFQN 234 >gi|91201135|emb|CAJ74194.1| conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis] Length = 333 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 32/186 (17%), Positives = 60/186 (32%), Gaps = 55/186 (29%) Query: 251 QCTPLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 PL+++ + + LN LN P T A++ + K Sbjct: 141 TYCPLTSDYSAFRLFLNDLNVNIIPVGGTAIAEAIYKGIDAF----------GENENNHK 190 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 +I ITDGEN + + L+ + G+ IY+V V Sbjct: 191 AMIIITDGEN-------HETDPLKAASKAKEKGIVIYTVGVGKKEGSYIKIIDEQGKETL 243 Query: 353 -------------GQDLLRKCT-DSSGQFFAVNDSR-ELLESFDKITDKI-----QEQSV 392 + L K ++ G + ++ L + +++ +I + Q V Sbjct: 244 LKDAHGQVVKSRLDEITLNKIALETGGLYTPAYGTKWGLAKIYNESFAQIEGSVYKTQRV 303 Query: 393 RIAPNR 398 + NR Sbjct: 304 KKYVNR 309 >gi|257897779|ref|ZP_05677432.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] gi|257835691|gb|EEV60765.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] Length = 819 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 27/268 (10%), Positives = 76/268 (28%), Gaps = 26/268 (9%) Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + + +V+D S SM D L S Sbjct: 304 NTQKNITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLADSGITDKINMGYVGYSIE 363 Query: 200 APAPAPANRKI---DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + + ++ D + ++ S + G + L+ Sbjct: 364 GYSYSNGAVQMGSFDSVKNQVKSITPSWTNGGTFTQKALRDAGNMLSVPNGHKKVIVLLT 423 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + + ++ +++ ++N Y + ++ + G+T L + D N Sbjct: 424 DGVPTFSYKVQRVHAQSSSNYYG----------TQFSNTQDRPGNTSLISRIYDAPDQNN 473 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTDS---S 364 + T+ ++ G++I+ + + + + +R+ S Sbjct: 474 LSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSSDEKG 533 Query: 365 GQFFA-VNDSRELLESFDKITDKIQEQS 391 ++ + + ++ E K +I Sbjct: 534 DLYYESADHATDISEYLAKKAVQISATV 561 >gi|114047772|ref|YP_738322.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. MR-7] gi|113889214|gb|ABI43265.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. MR-7] Length = 755 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 34/274 (12%), Positives = 81/274 (29%), Gaps = 29/274 (10%) Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK 196 + R N + L S + + + P + + + Sbjct: 295 ALANRVPANRDFVLQWRLKQGTSPVAWVFNQTGKTHTTQDDNASADTGPTGNSSNTDNYS 354 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE-----------------KKNLSVRIG 239 P + L ++++ + ++ I Sbjct: 355 LVMVLPPKVEASEQLNLPRELILVIDTSGSMAGDSIIQAKNALRYALRGLRPQDSFNIIE 414 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + + + NL + +N+L T A++ A +++ + Sbjct: 415 FNSDVSLLSPTPLPATATNLAMARQFVNRLQADGGTEMAQALNAAL-----PRQAFNTAS 469 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 G + + VIF+TDG SA + + + ++++V + + P + R Sbjct: 470 GEDKSLRQVIFMTDGSVGNESALFELIR-----NQIGDN--RLFTVGIGSAPNSHFMQRA 522 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G F + D E+ + ++ KIQ + Sbjct: 523 AELGRGTFTYIGDVDEVEQKISQLLAKIQYPVLT 556 >gi|315649632|ref|ZP_07902717.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315275105|gb|EFU38480.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 421 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 21/206 (10%), Positives = 62/206 (30%), Gaps = 28/206 (13%) Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 S P + + + + + Sbjct: 115 VVLVIDNSGSMKDTDPNQDRYTAAKNLINRMD---RDNRVSVIVFDHATTLLQPFTRVKN 171 Query: 250 NQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 + +E+ + ++ L T+ A+ + +++ + V Sbjct: 172 QEI------KDEIMAEIDGLATTDGGTDISLALEDTMSHIQESQDAGRSA--------MV 217 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS--APPEGQDLLRKCTDSSGQ 366 I ++DG + + ++ + + + ++ +S L ++ GQ Sbjct: 218 IMLSDG--------FSETDHDRVLADYKQQQIAVNTIGLSLVYKDGANLLQTIAAETGGQ 269 Query: 367 FFAVNDSRELLESFDKITDKIQEQSV 392 ++ V ++ +L F KI D + ++S+ Sbjct: 270 YYDVQNAADLSFVFQKIYDDVGDRSL 295 >gi|311070192|ref|YP_003975115.1| hypothetical protein BATR1942_16325 [Bacillus atrophaeus 1942] gi|310870709|gb|ADP34184.1| hypothetical protein BATR1942_16325 [Bacillus atrophaeus 1942] Length = 227 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 17/134 (12%), Positives = 44/134 (32%), Gaps = 16/134 (11%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + ++ LN + P T A++ A + + V +TDGE Sbjct: 109 FDEQSFRNSLNTIGPTGWTPIAKALNEAKSSFDQLDKKG---------ENVVYLLTDGEE 159 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + N ++ + + + + + + L G++F + + Sbjct: 160 TCGG------NPIKTAKELHKHNITVNVIGFDFKEGYKGQLNAIAKVGGGEYFPASSQSD 213 Query: 376 LLESFDKITDKIQE 389 + + F + K+ + Sbjct: 214 IKQIFKAESIKLAK 227 >gi|296223654|ref|XP_002757711.1| PREDICTED: anthrax toxin receptor 1 [Callithrix jacchus] Length = 568 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 39 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 96 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 97 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRT------ASVIIAL 150 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 151 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 204 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 205 DGFQALQGIIHSILKK 220 >gi|158259621|dbj|BAF85769.1| unnamed protein product [Homo sapiens] Length = 1105 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 57/168 (33%), Gaps = 18/168 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + RIG Y+ + +EV + L NT T A+++ + Sbjct: 186 TAFDVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIF 245 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + T + K I ITDG++ + +R +G++++++ Sbjct: 246 ENSFK-----PEAGSRTGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAI 292 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 V + L++ + V + + + +T + + Sbjct: 293 GV--KNADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 338 >gi|326489689|dbj|BAK01825.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 691 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 17/202 (8%), Positives = 61/202 (30%), Gaps = 31/202 (15%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK--SRL 266 K+ +L ++ +++++ R+ ++++ ++ + + + Sbjct: 210 KLALLKQAMRFVIDNLG--------PDDRLSVVSFSSEARRLTRLARMSDAGKALSVNAV 261 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 L TN + A + L V+ ++DG+++ + Sbjct: 262 ESLVARGGTNIAEGLRTAAKVLDE--------RQHRNAVSSVVLLSDGQDTYTMMRRRGP 313 Query: 327 NTLQICE-------------YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 + + + I++ + + + G F + + Sbjct: 314 SGVHAGNYEELVPPSFARTGADGDWSAPIHTFGFGNDHDAAAMHVIAEATGGTFSFIENE 373 Query: 374 RELLESFDKITDKIQEQSVRIA 395 + ++F + + V+ A Sbjct: 374 AVIQDAFAQCIGGLLSVVVQEA 395 >gi|281182610|ref|NP_001162037.1| matrilin-4 [Pongo abelii] gi|134093133|gb|ABO52993.1| matrilin 4 isoform 1 precursor [Pongo abelii] Length = 581 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 360 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 414 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 415 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 468 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 469 ----AARAKEEGIVMYAVGVGKAV--EAELREIASKPAELHVSYAPDFGTMTHLLENLRG 522 Query: 386 KIQ 388 I Sbjct: 523 SIC 525 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRSEDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|113867618|ref|YP_726107.1| von Willebrand factor type A domain-containing protein [Ralstonia eutropha H16] gi|113526394|emb|CAJ92739.1| von Willebrand factor (vWF) type A domain [Ralstonia eutropha H16] Length = 345 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 52/162 (32%), Gaps = 33/162 (20%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G P + + V++ + L +T A+ + + S Sbjct: 147 GDAPYPLAPFTLDHQLVQTLITGLLPGMAGPSTALGDAIGLGIKMFEH----------SE 196 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD------- 355 +K +I +TDG ++ + + + + ++++ + P + Sbjct: 197 APEKVLIVLTDGNDTASRMPPERAGGI-----AKERKVVVHTIGIGDPNASGEEKVDLGV 251 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L R + G++F D L + + RI P+ Sbjct: 252 LQRLAAQTGGRYFFGADQAGLETIYATLD--------RITPH 285 >gi|297286920|ref|XP_001082067.2| PREDICTED: collagen alpha-4(VI) chain-like, partial [Macaca mulatta] Length = 1624 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 40/334 (11%), Positives = 101/334 (30%), Gaps = 30/334 (8%) Query: 65 KKQIKKHLKQGSYIREN-AGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 + ++L Q + E + + Q + + + ++ + + G+ Sbjct: 510 TGKALQYLLQTFFQEEAGSRYLQGIPQYAVVINSGKSKDEVQDAAQRLREKGVKVMSVGV 569 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 L + + + +V DV+ ++ Q+H N + Sbjct: 570 QDFDRRELEGMGSPDLVYDMQREDEVRHIVEDVNVVIQGTGQQQHRITANEEAVG----A 625 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 +++ P +++ L + +VR G + Y Sbjct: 626 CTTAIRADLVFLIEEFSRVRQPNFQQV------VNFLKTIVSSLSIHP--DTVRFGLVFY 677 Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIG 300 + N ++ L+KL T T A+ E++ + Sbjct: 678 SEEPRLEFSLDTFQNPAKILEHLDKLTYRERRGRTKTGAALDFLRNEVF---IQEKGSRS 734 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + +++ + I +G + + + ++R AG+ IY+V L K Sbjct: 735 NHGVQQIAVVIMEGFSQDSVSRP--------ASHLRRAGITIYAVGTQ-NVSESKELEKI 785 Query: 361 TDSSGQFFAV--NDSRELLESFDKITDKIQEQSV 392 ++V +L K+ +++ + V Sbjct: 786 ASYPHWKYSVPLESFLQLSVVRSKLINQLCSEMV 819 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 29/157 (18%), Positives = 55/157 (35%), Gaps = 19/157 (12%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 V+ G I Y+ I ++ E+K ++ + A E N Sbjct: 878 NRVQFGVIQYSDKIQSQFILSQYPSVAELKVAIDNIQ-------QGGGGTATGEALNNMT 930 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 G + +++I ITDG++S + E +R G+ IY++ V Sbjct: 931 QVFADTGRINVARYLIVITDGKSSDP--------VAEAAEGLRENGVIIYAIGV--REAN 980 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 D L++ + F V + L + ++ I Sbjct: 981 IDELKEIAK--DKIFFVYEFDLLKDIQKEVVQDICSS 1015 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 56/162 (34%), Gaps = 17/162 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYREL 288 ++IG + ++ ++ ++ S + + + T T A++ Sbjct: 1055 NIGTDGIQIGLLQFSSIPQEEFRLNQYSSKVDIYSAIFDVQQMRDGTRTGKALNFTLPFF 1114 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + K +++++I ITDG + +R+ + I+++ V Sbjct: 1115 DSSKGGRP------SVQQYLIVITDGVAQDNVIIP--------AKALRDKNIIIFAIGVG 1160 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + LL T+ + + + L +I K+ + Sbjct: 1161 -EAKKSQLLE-ITNDEDKVYHDVNFEALQNLEKEILSKVCDP 1200 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 56/165 (33%), Gaps = 16/165 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + VR+G YN I + V ++ L TNT A+ Sbjct: 266 DISSDHVRVGLAQYNDNIYPAFQLNQHPLKSTVLEQIQNLPYRTGGTNTGSALEFIRTNY 325 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E+ S R+ + VI +TDGE++ ++ + ++ G+ +Y V Sbjct: 326 LTEESGSRAK---DRVPQIVILVTDGESND--------EVQEVADRLKEDGVVVY--VVG 372 Query: 349 APPEGQDLLRKCTDSS-GQF-FAVNDSRELLESFDKITDKIQEQS 391 + L+K +F F + L + I + Sbjct: 373 VNVQDVQELQKIASEPFEKFLFNTENFNILQDFSGSILQTLCSAV 417 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 53/171 (30%), Gaps = 16/171 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYREL 288 + ++R+G Y+ + +V + + A+ L Sbjct: 63 NVSSKTIRVGLAQYSDVPHSEFLLSTYHRKADVLRHIRQFKFKPGGKKMGLALKFI---L 119 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + S + + + I+ G E +R AG+ +Y+ V Sbjct: 120 DHHFQEASGSRASQGVPQIAMVISSGPVEDHVHGP--------AEALRRAGILLYATGV- 170 Query: 349 APPEGQDLLRKCTDSSGQFF--AVNDSRELLESFDKITDKIQEQSVRIAPN 397 LR+ S + F V L K+ ++ + + AP Sbjct: 171 -RDAVWAELREIASSPQENFTSFVPYFSGLSNLAQKLRQELCDMLAKAAPR 220 >gi|221135318|ref|ZP_03561621.1| von Willebrand factor, type A [Glaciecola sp. HTCC2999] Length = 342 Score = 64.5 bits (155), Expect = 3e-08, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 72/211 (34%), Gaps = 39/211 (18%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + V+ + + + ++ + + V PL+ + Sbjct: 103 VDLSGSMQTQDMVVNGNEVDRLVMVKTVLGDFIQRRVGDRIGLILFADTAYLQAPLTFDR 162 Query: 260 NEVKS----RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 V+ + L ++T A+ A + + + K ++ +TDG+ Sbjct: 163 TTVEQLLSETVIGLV-GDSTAIGDAIGLAAKRFSD----------KPNVNKVLVLLTDGQ 211 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----------------PEGQDLLR 358 N+ + + Q + +KIY + V A + LL Sbjct: 212 NTAGNITPD-----QALSLAVDQNIKIYPIGVGADAMTVNSLFGQRQVNPSADLDEGLLT 266 Query: 359 KCT-DSSGQFFAVNDSRELLESFDKITDKIQ 388 + D+ GQ+F D++EL + + ++ D+I+ Sbjct: 267 RLAKDTGGQYFRARDTQELEQIY-RLLDRIE 296 >gi|149067646|gb|EDM17198.1| integrin alpha M [Rattus norvegicus] Length = 1151 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 59/180 (32%), Gaps = 23/180 (12%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PY 272 E +++ QK+ + Y+ + + KS + + Sbjct: 170 KEFVSTVMDQFQKSKTLFS-------LMQYSDEFRTHFTFNDFKRNPDPKSHVRPIRQLN 222 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T T + REL+ + + K ++ ITDGE + + LN + Sbjct: 223 GRTKTASGIRKVVRELFQKINGAR-----DNAAKILVVITDGE-----KFGDPLNYEDVI 272 Query: 333 EYMRNAGMKIYSVAVSAP---PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 AG+ Y + V P+ + L F V++ L +++ +KI Sbjct: 273 PEAEEAGIIRYVIGVGNAFHKPQSRRELDTIASKPAGDHVFQVDNFEALNTIRNQLQEKI 332 >gi|29789036|ref|NP_036843.1| integrin alpha-M [Rattus norvegicus] gi|8917587|gb|AAF81280.1| integrin beta 2 alpha subunit [Rattus norvegicus] Length = 1151 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 30/180 (16%), Positives = 59/180 (32%), Gaps = 23/180 (12%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PY 272 E +++ QK+ + Y+ + + KS + + Sbjct: 170 KEFVSTVMDQFQKSKTLFS-------LMQYSDEFRTHFTFNDFKRNPDPKSHVRPIRQLN 222 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T T + REL+ + + K ++ ITDGE + + LN + Sbjct: 223 GRTKTASGIRKVVRELFQKINGAR-----DNAAKILVVITDGE-----KFGDPLNYEDVI 272 Query: 333 EYMRNAGMKIYSVAVSAP---PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 AG+ Y + V P+ + L F V++ L +++ +KI Sbjct: 273 PEAEEAGIIRYVIGVGNAFHKPQSRRELDTIASKPAGDHVFQVDNFEALNTIRNQLQEKI 332 >gi|328712314|ref|XP_001943110.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 1 [Acyrthosiphon pisum] Length = 884 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 51/357 (14%), Positives = 105/357 (29%), Gaps = 38/357 (10%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 +S ++ + D T + +TI K ++ K + K Q+ T Sbjct: 259 SSNITTLEVPDIKTANEIETTISKNKLAKISYESGNKATITWSPTVKEQLTFT-----EH 313 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + Y++ ++ + LI + L + VLDVS SM Sbjct: 314 GVKGQFIVHYDVDHKSAPNQVLIDDGYF-----VHFFAPTDLKPLRTHVIFVLDVSGSMV 368 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 L + + ++ +S Sbjct: 369 GQKLPQVKEAMGQILSEIHSEDFFTLILFSDFAQVWT------------------INATQ 410 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 + ++ N +G N+ K + L +TN A+ Sbjct: 411 ETSNHWDEKVSNWKTNNNISLDTLGENRFVFPATEQNVQYAKKFIQDLQSESSTNMEDAL 470 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 + A L + + G+ K ++F+TDGE + L ++ Sbjct: 471 NKA--HLIAKLGETRFKDGANTPKPIIVFLTDGEPTTGITEPQEL--IKYVSNTNEEKYP 526 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQF----FAVNDSR-ELLESFDKITDKIQEQSVR 393 IYS+ D L+K + ++ F + +D+ +L + +I+ + Sbjct: 527 IYSLGFGEGA-DIDFLKKLSLNNTGFARVIYEASDASLQLRNFYKEISSPVLSNVTF 582 >gi|46906212|emb|CAA46928.2| collagen type XIV/undulin [Gallus gallus] Length = 1148 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 28/201 (13%), Positives = 62/201 (30%), Gaps = 26/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + +I + V ++ K + ++ I ++ Sbjct: 646 VDGSWSIGDDNFNKIISFLYSTVGALDKIGPDGT----QVAIIQFSDDPRTEFKLNAYKT 701 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ NT T A+ HA L+ + K ++ ITDG + Sbjct: 702 KETLLEAIQQIAYKGGNTKTGKAIKHAREVLFTG-----EAGMRKGIPKVLVVITDGRSQ 756 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 ++ M+ G +++ V L F V+D Sbjct: 757 DDVN--------KVSREMQLDGFSFFAIGV--ADADYSELVNIGSKPSERHVFFVDDF-- 804 Query: 376 LLESFDKITDKIQEQSVRIAP 396 ++F KI D++ A Sbjct: 805 --DAFTKIEDELITFVCETAS 823 >gi|224065787|ref|XP_002190547.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 2 [Taeniopygia guttata] Length = 1068 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 37/343 (10%), Positives = 98/343 (28%), Gaps = 26/343 (7%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A +++ K ++ E + + ++ T D Sbjct: 68 LANAAEKFQKAHHWQDNIREEDIEYYDSKADTEY--DDPDGEEIEREKSNSLKLEFTDDD 125 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L IE E+ ++ + + Sbjct: 126 NFKTKVNYSYAA-VQIPTDIYKGSTVILNELNWTQALEDVFIENRKEDPSLLWQVFGSAT 184 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 N + P + S + + + + ++ S Sbjct: 185 GVTRYYPATPWRAPNKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 244 Query: 218 GNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +++++ + + + ++ +V N K + + T+ Sbjct: 245 YEMLDTLSDDDYVNVASFNEKAKPVSCFKHLVQANIR----NKKVFKEDVQGMVAKGTTD 300 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +A+ +L N + K ++ TDG + E + Sbjct: 301 YKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDG---------GEDRVQDVFEKYK 345 Query: 337 --NAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +++++ +V L+ + G +F + + Sbjct: 346 WPNKTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 388 >gi|19552242|ref|NP_600244.1| hypothetical protein NCgl0978 [Corynebacterium glutamicum ATCC 13032] Length = 594 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 60/191 (31%), Gaps = 27/191 (14%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI---GIVGNQCT--PLSNN 258 A +ID +++ L+N I G P N Sbjct: 15 AGGQTRIDAAKQASTQLINDISDRTDVGLTYYGGNTGETEADVEMGCQDVTILGGPSRGN 74 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + + +N L P T A+ EL ++ ++DG + Sbjct: 75 ADTLIDTINSLQPRGFTPIGKALTDTAAELPEGGN--------------IVLVSDGIAN- 119 Query: 319 ASAYQNTLNTLQICEYMRNAGMK--IYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + ++ + + +G+ I ++ ++ P ++ L G + +D++ Sbjct: 120 ----CTPPDVCEVAQELAQSGINLVINTIGLNVDPAAREELECIAGVGGGTYADASDAQS 175 Query: 376 LLESFDKITDK 386 L ++ + + Sbjct: 176 LTDALTRAASR 186 >gi|301778755|ref|XP_002924795.1| PREDICTED: integrin alpha-X-like [Ailuropoda melanoleuca] Length = 1160 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 62/177 (35%), Gaps = 16/177 (9%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENT 275 ++N ++ + + + + + + ++ + + + L+ + T Sbjct: 168 FTKMLNFVKAVMSQFRRPNTQFSLMQFSDDFRVHFTFKDFTDSSNPLVLLDSVYQLGGYT 227 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 +T A+ +L++ + K +I ITDG+ G LN + Sbjct: 228 HTATAIQMVTNQLFSTSSGAR-----KDASKILIVITDGQKQGDY-----LNYEDVIPMA 277 Query: 336 RNAGMKIYSVAVSAPPEGQ---DLLRKCTDSSGQFF--AVNDSRELLESFDKITDKI 387 AG+ Y+V V + L + V++ L + +++ +KI Sbjct: 278 EAAGIIRYAVGVGLAFRKRHSWKELNDIASKPSNEYIFKVDNFDVLRDIQNQLKEKI 334 >gi|126208028|ref|YP_001053253.1| tight adherence protein G [Actinobacillus pleuropneumoniae L20] gi|126096820|gb|ABN73648.1| tight adherence protein G [Actinobacillus pleuropneumoniae serovar 5b str. L20] Length = 520 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 61/475 (12%), Positives = 125/475 (26%), Gaps = 109/475 (22%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIF--------- 64 +++ A I+ + ++ +L+ AVLS A S R D Sbjct: 36 FVSLESAGIIQDQARLSDSLEQAVLSLTAENNSGRKDNDYKLSGSSNKENDSFDISSEVG 95 Query: 65 --KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN---PLQYIAESKAQYEIPTENLF 119 Q+ Q + N + N + ++ F Sbjct: 96 KRDTQMVTTFVQAFLPQTNEKAMRLTPTCKTVTTDNKKGHTSSSEVTCTVSGTVEHKSWF 155 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENL---AISICMVLDVSRSMEDLYLQKHNDNNNM-- 174 + + + +N I + +V D+S SM K NN Sbjct: 156 PLKVGNLEVIPQQVDVASKSRAFKKNTFNIPIDLMVVADLSGSMNFDLDNKKIINNAKPS 215 Query: 175 -----------TSNKYLLPPPPKKSF--------------------------WSKNTTKS 197 + K L + +SK+ Sbjct: 216 KIRILKEVLEELAAKSLFNQDSNNNNRIAVAPFALGAQHSNNQCIIPFILKKYSKDRISE 275 Query: 198 KYAPAPAPANRKIDVLIESAG-NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 K + AN I + + + I K I I + + Sbjct: 276 KNIKSYLSANNNISAKDFALSLSYLVDIDKTINSIGGTFSSNSIIFNKNKFCLGRSNKNT 335 Query: 257 ------NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + + S + +L+ +T + A + KE S + + K+ ++ Sbjct: 336 HHWYNRDESSNFFSFIKRLHAEGSTLASSGLITASNIML--KEESRSKSLGEQTKRVILV 393 Query: 311 ITDGEN----------------------------SGASAYQ-------NTLNTLQICEYM 335 ++DG + + + + + +C+ + Sbjct: 394 LSDGNDELRLNDEGTPFTQYSRITENLLLGQEEQTTDTYPYFMSKPPKKLTSNINVCDRI 453 Query: 336 R--------NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 R + KI V + + + C +G +++ ND LL SF + Sbjct: 454 RNKLDEHNEDKNTKIVFVEFGYASKAKQAWQHCV-GNGNYYSANDKASLLNSFKQ 507 >gi|319784280|ref|YP_004143756.1| hypothetical protein Mesci_4597 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170168|gb|ADV13706.1| hypothetical protein Mesci_4597 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 643 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 51/189 (26%), Gaps = 24/189 (12%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MT + + + A+D + + + +ALDAA + + DQ Sbjct: 1 MTVVAMVPLMGALAMAVDFTEMSREKQAVSNALDAANFATARRLTE-------GATDDQL 53 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT--ENL 118 +L + + + +T N + + A+ + + Sbjct: 54 RAYALDFFNANLNKINPANT---------TLTVTLPSNTTGGGLLKMTARLDYKPYFYPV 104 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ------KHNDNN 172 F + + S S I + + +VLD S SM Sbjct: 105 FGQLVGKSETDANQRISFNITSEVRLKNTLEVALVLDNSGSMTKTGTGSGQTRIDLLKTA 164 Query: 173 NMTSNKYLL 181 L Sbjct: 165 AKQLVDTLA 173 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 39/323 (12%), Positives = 74/323 (22%), Gaps = 79/323 (24%) Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 V D + S + P P + N Sbjct: 319 YPYNVNDAAPSGGSANTGIGVGDPATMFVPMFAPDEPGNHWRLTQDPDEAAPTTYGAINS 378 Query: 209 KID---VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-----NLN 260 + L N + N N N TPL++ L Sbjct: 379 WWNDDPTSSTGQSRLRNMSKYFQPRPINAPALAAGNGPNYSCSTNPITPLTDVSVTTGLT 438 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE-SSHNTIGSTRLKKFVIFITDG----- 314 +K+ ++ + P TN M +R + + + + K VI +TDG Sbjct: 439 AIKAAIDLMKPDGGTNVPEGMAWGWRVVSSGEPFTQGRPETERGNDKVVIVLTDGANTYY 498 Query: 315 -----------------------------------------------ENSGASAYQNTLN 327 ++G Sbjct: 499 TPSSLSHSDPADSKSTYASFGYLNPGYNGTSVGRLFMGTSSAIGQFDYSNGNYTNALNEQ 558 Query: 328 TLQICEYMRNAGMKIYSVAVS------APPEGQDLLRKC------------TDSSGQFFA 369 +C + A + + +VA+ A + + L+ C + F Sbjct: 559 MATLCNNAKAANIMVMTVALDLSTTKTADQQAIEALKSCSSNSRFRKDPTDASKPAKLFW 618 Query: 370 VNDSRELLESFDKITDKIQEQSV 392 L F +I +++ + Sbjct: 619 NATGASLSNDFKEIGNELSNLRI 641 >gi|326435586|gb|EGD81156.1| hypothetical protein PTSG_11196 [Salpingoeca sp. ATCC 50818] Length = 1445 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 46/377 (12%), Positives = 107/377 (28%), Gaps = 33/377 (8%) Query: 29 MQSALDAAV---LSGCASIVSDRTIKDP-----TTKKDQTSTIFKKQIKKHLKQGSYIRE 80 MQ A A V L+ + S + + + Q S I Q + + + Sbjct: 1 MQIATLARVAILLAVATATTSTYALLPGEWYPDGSTEAQVSGTAVDYIDLGTAQYNALID 60 Query: 81 NAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE 140 + + + ++ L I + Sbjct: 61 FEEFYLNSTGTSSPVTFASSTFRLMTEYLAEKLTALALDTPDTIVIVQGFAPRPTNPTRP 120 Query: 141 RSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSK-NTTKSKY 199 ++ + M + S+ + N + + +P S N + Sbjct: 121 PELTDIGRGLHMRYLNNGSLTNFLSDLANRSIEAGFDFVAIPGQGSYVNVSVPNVNCGRS 180 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQ---EKKNLSVRIGTIAYNIGIVGNQCTP-- 254 A + S + + + + R+ + Y+ + Sbjct: 181 AVDLLFILDGSGSIGSSNFETMRQFTATVTSFFDVSPDTTRVALMVYSSSVTEIFDFSYV 240 Query: 255 LSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 LSN +E+ + + N P T T A+ +A ++ + S + + I I D Sbjct: 241 LSNTRDEIITTIRNTNYPGGGTRTGSALDYARTNMFLTSRGARP--SSAGVPRVAIVIID 298 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVN 371 G++ + A E +RN + I+++ + + L ++ Sbjct: 299 GQSGDSVAQP--------AENLRNENVNIFAIGI--SGADVNELNMIASPPITNNVKFID 348 Query: 372 DSRELLESFDKITDKIQ 388 ++F ++ +I Sbjct: 349 TF----QAFSQLPAEIS 361 >gi|156383823|ref|XP_001633032.1| predicted protein [Nematostella vectensis] gi|156220096|gb|EDO40969.1| predicted protein [Nematostella vectensis] Length = 182 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 65/194 (33%), Gaps = 22/194 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + +L + AI E V + ++ Sbjct: 7 VDSSGSVNDTDFGKFQMFIKDLAEEFKDAISEGDTE-VAAVLFSTIPKTKIEFDLDDYDH 65 Query: 259 LNEVKSRLNKL--NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N++K+ ++ T T A+ E++ + +K ++ +TDG+ Sbjct: 66 INDIKAAVDAFSHQHGGQTRTGEALTFTLEEVFKKAPR-------PSVKNVLVVLTDGKA 118 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSR 374 G N + +R+ G++++++ V P + L+ F V D Sbjct: 119 QG--------NVTGPAQDVRDHGVEVFAIGVG-PHSNEAQLKDIASDPDDKHVFHVTDY- 168 Query: 375 ELLESFDKITDKIQ 388 +L + I D I Sbjct: 169 KLEDITGPILDGIC 182 >gi|296205952|ref|XP_002806985.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain-like [Callithrix jacchus] Length = 3176 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 18/172 (10%), Positives = 58/172 (33%), Gaps = 17/172 (9%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + + K++ +N + +N EV S ++ ++ TN Sbjct: 58 VREFLYDVVKSLAVGENDF-HFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYTGGTN 116 Query: 277 -TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T + + + + + + + + ++ +TDG + A + + Sbjct: 117 QTGKGLEYVMQ---SHLTKAAGSRAGDGVPQVIVVLTDGHSKEGLALPSAG--------L 165 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 ++A + ++++ V + L++ F + + L + + + Sbjct: 166 KSADVNVFAIGV--EGADEGALKEIASEPLNMHMFNLENFTSLHDIVENLVS 215 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 60/189 (31%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N ++ + Sbjct: 1041 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNQQDIVN 1095 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T A Sbjct: 1096 AIRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVVT--------AD 1144 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 ++ + ++ G + + ++ + A+ R+L Sbjct: 1145 RSGDDVRNPSVVLKKGGAV--PIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTIQQV 1202 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1203 ISERVTQLT 1211 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 56/153 (36%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R + Sbjct: 1475 VRVGVVQFSNDVFPEFYLKTYRSQVPVLEAIRRLRLRGGSPLNTGRALEFVARNFF---V 1531 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1532 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1581 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V D R+L ++I + Sbjct: 1582 RTELQTITNDPRLVFTVRDFRDLPNIEERIMNS 1614 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 34/314 (10%), Positives = 87/314 (27%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + + +A A + N+F G+ + L Sbjct: 126 MQSHLTKAAGSRAGDGVPQVIVVLTDGHSKEGLALPSAGLKSADVNVFAIGVEGADEGAL 185 Query: 132 SLRSTGIIERSSEN--LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + N S+ +++ S + + T F Sbjct: 186 KEIASEPLNMHMFNLENFTSLHDIVENLVSCVHSSVSPERAGDTETLKDITAQESADIIF 245 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + LVN ++K +R+G + ++ Sbjct: 246 LIDGSNNTGSVNFAVI-----------LDFLVNLLEKLP--VGTQQIRVGVVQFSDEPRT 292 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 ++ +V S + L N A+ + + + + + Sbjct: 293 MFSLDTYSSKAQVLSAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQV 349 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G +S + + L+ ++S + A + L+ Sbjct: 350 LVLISAGPSSD--EIGDGVVALKQAS--------VFSFGLGAQAASRAELQHVATDDNLV 399 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 400 FTVPEFRSFGDLQE 413 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 63/172 (36%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1650 INFRRDSFQEVLXFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFXRDFSTKRQIIDAIN 1709 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + +T G+ + Sbjct: 1710 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLEQRVPQIAFVVTGGK--------SVE 1759 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + Q+ + +G+K+++V V + +++ + +S F V + +EL E Sbjct: 1760 DAQQVSLALTQSGVKVFAVGV-RNIDSEEVGKT-ASNSATAFRVGNVQELSE 1809 >gi|257889535|ref|ZP_05669188.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|257825895|gb|EEV52521.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] Length = 857 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 27/272 (9%), Positives = 78/272 (28%), Gaps = 26/272 (9%) Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 + + + +V+D S SM D L Sbjct: 7 NARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLADSGITDKINMGYVG 66 Query: 196 KSKYAPAPAPANRKI---DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S + + ++ D + ++ S + G++ Sbjct: 67 YSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGHKKVI 126 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 L++ + ++ +++ ++N Y + ++ + G+T L + Sbjct: 127 VLLTDGVPTFSYKVQRVHAQSSSNYYG----------TQFSNTQDRPGNTSLISRIYDAP 176 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTDS 363 D N + T+ ++ G++I+ + + + + +R+ S Sbjct: 177 DQNNLSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSS 236 Query: 364 ---SGQFFA-VNDSRELLESFDKITDKIQEQS 391 ++ + + ++ E K +I Sbjct: 237 DEKGDLYYESADHATDISEYLAKKAVQISATV 268 >gi|114682165|ref|XP_001154021.1| PREDICTED: matrilin 4 isoform 4 [Pan troglodytes] gi|114682167|ref|XP_001154256.1| PREDICTED: matrilin 4 isoform 7 [Pan troglodytes] gi|114682175|ref|XP_514674.2| PREDICTED: matrilin 4 isoform 9 [Pan troglodytes] gi|114682177|ref|XP_001154315.1| PREDICTED: matrilin-4 isoform 8 [Pan troglodytes] Length = 581 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 360 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 414 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 415 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 468 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 469 ----AARAKEEGIVVYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 522 Query: 386 KIQ 388 I Sbjct: 523 SIC 525 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|114682171|ref|XP_001153893.1| PREDICTED: matrilin 4 isoform 2 [Pan troglodytes] gi|114682181|ref|XP_001153957.1| PREDICTED: matrilin-4 isoform 3 [Pan troglodytes] Length = 499 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 278 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 332 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 333 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 386 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 387 ----AARAKEEGIVVYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 440 Query: 386 KIQ 388 I Sbjct: 441 SIC 443 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|114682173|ref|XP_001153832.1| PREDICTED: matrilin 4 isoform 1 [Pan troglodytes] Length = 488 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 267 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 321 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 322 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 375 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 376 ----AARAKEEGIVVYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 429 Query: 386 KIQ 388 I Sbjct: 430 SIC 432 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 13 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 72 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 73 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 122 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 123 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 153 >gi|114682169|ref|XP_001154082.1| PREDICTED: matrilin 4 isoform 5 [Pan troglodytes] gi|114682179|ref|XP_001154207.1| PREDICTED: matrilin-4 isoform 6 [Pan troglodytes] Length = 540 Score = 64.2 bits (154), Expect = 3e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 319 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 373 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 374 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 427 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 428 ----AARAKEEGIVVYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 481 Query: 386 KIQ 388 I Sbjct: 482 SIC 484 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|134093164|gb|ABO53024.1| matrilin 4 isoform 1 precursor, 3 prime [Chlorocebus aethiops] Length = 243 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 22 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAVEVKQAVLA 76 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 77 MEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 130 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 131 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 184 Query: 386 KIQ 388 I Sbjct: 185 SIC 187 >gi|193786651|dbj|BAG51974.1| unnamed protein product [Homo sapiens] Length = 581 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 360 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 414 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 415 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 468 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 469 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRS 522 Query: 386 KIQ 388 I Sbjct: 523 SIC 525 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|109009638|ref|XP_001105446.1| PREDICTED: epithelial chloride channel protein-like [Macaca mulatta] Length = 829 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 63/165 (38%), Gaps = 26/165 (15%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL--NPYENTNTYPAMHHAYRELYNEKESSHN 297 + ++ + N + L P T+ + ++ + Sbjct: 349 MVTFDSSAEIQNNLTKIIDENTYQKITANLPQKPSGGTSICGGLKAGFQAISQS------ 402 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ-DL 356 + +I +TDGE++ S+ E ++ +G I+++A+ + + + Sbjct: 403 --NQSTSGSEIILLTDGEDNQMSSCF---------EEVKQSGAIIHTIALGPSADRELET 451 Query: 357 LRKCTDSSGQFFAVN-DSRELLESFDKITDK---IQEQSVRIAPN 397 L + G+ F + D L+++F +I+ + I +Q+V++ Sbjct: 452 LSNM--TRGRRFYAHKDINGLIDAFSRISSRSGNISQQAVQLESK 494 >gi|22760140|dbj|BAC11083.1| unnamed protein product [Homo sapiens] Length = 540 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 319 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 373 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 374 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 427 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 428 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRS 481 Query: 386 KIQ 388 I Sbjct: 482 SIC 484 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V + Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGLQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|320101795|ref|YP_004177386.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319749077|gb|ADV60837.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 764 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 26/139 (18%), Positives = 49/139 (35%), Gaps = 19/139 (13%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N E + + P +T+ + + + +E +VIF+TDG + Sbjct: 344 NRAEALRFIEGVRPGGSTDIDQGLRAGLKLIADESRP-----------NYVIFLTDGLPT 392 Query: 318 GASAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 N L+I E R A K++ V + L R ++ G F V Sbjct: 393 SG-----ETNELKIAEAARAANPLKAKLFVFGVGYDVNARLLDRLSGENGGVSFYVKPDD 447 Query: 375 ELLESFDKITDKIQEQSVR 393 L + + ++I ++ Sbjct: 448 NLEVAVSRFYERISTPALT 466 >gi|257127006|ref|YP_003165120.1| von Willebrand factor type A [Leptotrichia buccalis C-1013-b] gi|257050945|gb|ACV40129.1| von Willebrand factor type A [Leptotrichia buccalis C-1013-b] Length = 509 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 56/143 (39%), Gaps = 18/143 (12%) Query: 245 IGIVGNQCTPL-SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 +P+ + + +++KS L + P T+ ++ + +L + + Sbjct: 220 SCSANELISPIETLDKDKLKSSLAPIQPTGWTSIAKSIENGTNDLK--------ALKGEK 271 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKCT 361 + ITDG + N ++ + +N + + + + +L++ Sbjct: 272 TLNILYIITDGIET------CDGNPVETAKKFKNENTDIVLGIIGFNVDAHQNKVLKEIA 325 Query: 362 DSSGQFF-AVNDSRELLESFDKI 383 +++ ++ + ND+ +L E +I Sbjct: 326 NAANGYYSSANDAAKLTEELQRI 348 >gi|116625272|ref|YP_827428.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228434|gb|ABJ87143.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 323 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 72/202 (35%), Gaps = 36/202 (17%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 +K+D ++ + + +N P + NL E Sbjct: 104 CSGSMGQKLDKSRQAVSQFFKLANPEDE--------FFLVQFNDSASL--IQPFTRNLEE 153 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 +++ L T A++ A E+ + +K ++ I+DG ++ + Sbjct: 154 IQNHLAFTQSKGRTALLDAVYLALHEM----------KKAKNPRKALLLISDGGDNSSRY 203 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAV----------SAPPEGQDLLRKCTD-SSGQFFAV 370 + + ++ A ++IY++ + G LL + + + G+ + V Sbjct: 204 TEPEIK-----NLVKEADVQIYAIGIYESAAGRGRTPEESSGPALLTEIAEQTGGRQYQV 258 Query: 371 NDSRELLESFDKITDKIQEQSV 392 ++ EL + KI +++ Q + Sbjct: 259 DNLNELPDVAAKIGVELRNQYI 280 >gi|296119163|ref|ZP_06837733.1| putative secreted protein [Corynebacterium ammoniagenes DSM 20306] gi|295967789|gb|EFG81044.1| putative secreted protein [Corynebacterium ammoniagenes DSM 20306] Length = 585 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 49/129 (37%), Gaps = 20/129 (15%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 S L++++ ++ + T A+ A EL E E + ++ ++DG Sbjct: 29 SGQLDDIRGEVDDIEASGYTPMGSALRQAAEELPAEGE------------RSIVLVSDGI 76 Query: 316 NSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVND 372 ++ ++ E + + G+ I +V E + L D GQ+ +D Sbjct: 77 DT-----CAPPPVCEVAEELHDQGIDLIINTVGFLVDDEARSELECIADAGGGQYLDADD 131 Query: 373 SRELLESFD 381 + L ES Sbjct: 132 AESLAESMK 140 >gi|260836190|ref|XP_002613089.1| hypothetical protein BRAFLDRAFT_89971 [Branchiostoma floridae] gi|229298473|gb|EEN69098.1| hypothetical protein BRAFLDRAFT_89971 [Branchiostoma floridae] Length = 267 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 59/188 (31%), Gaps = 29/188 (15%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 I + + ++G + Y L+E+ + + + Sbjct: 15 ITAPNFEITKSFVQNTTSDFQIGTAHTQVGVVQYEDNPYDEFPLNQYATLDELLTAIRNI 74 Query: 270 NP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T T A+ H E + + K VI +TDG++ + Sbjct: 75 TYRGGGTQTGKAIDHVVDNSLTESHGAR-----PGVPKVVIVVTDGQSWDS--------V 121 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-----------QFFAVNDSRELL 377 + + ++G+ + ++ V + + +L+ S+ ++ VN+ L Sbjct: 122 VAPAQRANHSGIIMVAIGVGSGYDINELME-IASSNDTLGTIEYFLRCKYLKVNN---LT 177 Query: 378 ESFDKITD 385 F I + Sbjct: 178 FLFQDINE 185 >gi|114562801|ref|YP_750314.1| vault protein inter-alpha-trypsin subunit [Shewanella frigidimarina NCIMB 400] gi|114334094|gb|ABI71476.1| Vault protein inter-alpha-trypsin domain protein [Shewanella frigidimarina NCIMB 400] Length = 722 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 30/187 (16%), Positives = 72/187 (38%), Gaps = 20/187 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV---KSRL 266 I ++ + ++ I +N + TPLS N + + Sbjct: 361 ITQAKQALQFALAGLRDIDS--------FNIIEFNSDVTMLSATPLSANSRNIGKANRFI 412 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 L+ T A+ A + + ++ S T + + + VIF+TDG + Sbjct: 413 QSLDADGGTEMRSALQTAL--VDSVQQDSDQTDAHSEMLRQVIFMTDGAVGNEHELYQLI 470 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 N + + ++ ++++V + + P + R T G F + + E+ + +++ +K Sbjct: 471 N-----DQLGDS--RLFTVGIGSAPNSDFMRRAATMGRGTFTYIGNESEVQQKIEQLLNK 523 Query: 387 IQEQSVR 393 I++ + Sbjct: 524 IEQPVLT 530 >gi|327543487|gb|EGF29906.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 464 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 17/161 (10%), Positives = 54/161 (33%), Gaps = 12/161 (7%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 ++ + + + Y+ + + + + +K ++ + +T + + E+ Sbjct: 94 DRLSDDDIVSVVLYDSNVTVLVPATKATDRSSIKQKIRGIQAGSSTALFAGVSKGAAEV- 152 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + VI ++DG + L L + + + ++ + + Sbjct: 153 -------RKFLADEQVNRVILLSDGLANVGPKSPQELEGLG--RSLMKEAISVSTLGLGS 203 Query: 350 PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQE 389 +DL+ G + D+ L+ F++ D + Sbjct: 204 G-YNEDLMVALASVGGGNHAFIEDADSLVAVFNQEFDGLLS 243 >gi|224080732|ref|XP_002192406.1| PREDICTED: anthrax toxin receptor 1 [Taeniopygia guttata] Length = 516 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 66/196 (33%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + + ++ ++ + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYHFVEHLARKFISPQLRMSFIVFSTRGT--ILMR 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENVHGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGATVYCVGV--KDFNETQLARIADSRDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L D I K Sbjct: 203 DGFQALQGIIDSILKK 218 >gi|329664416|ref|NP_001193157.1| anthrax toxin receptor 1 [Bos taurus] gi|297480483|ref|XP_002691486.1| PREDICTED: anthrax toxin receptor 1 [Bos taurus] gi|296482440|gb|DAA24555.1| anthrax toxin receptor 1 [Bos taurus] Length = 564 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTQGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|148656823|ref|YP_001277028.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568933|gb|ABQ91078.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 851 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 21/140 (15%), Positives = 52/140 (37%), Gaps = 21/140 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 +L +V+ R++ L T+ Y A+ L + + + +TDG Sbjct: 456 SLADVQRRISTLPLGGGTDIYNALQEGLPALAQQPGRVRHA----------VLLTDGR-- 503 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSREL 376 S + + E R+ + + ++A+ +LL++ +G++ + ++ Sbjct: 504 --SFTDDRQAYRMLLEEARSQNITLSTIAIGTDA-DINLLQELARWGAGRYHYAAEPNDI 560 Query: 377 LESFDKITDKIQEQSVRIAP 396 ++ + VR P Sbjct: 561 PR-----LTLLESEIVRTEP 575 >gi|297698645|ref|XP_002826428.1| PREDICTED: integrin alpha-X-like, partial [Pongo abelii] Length = 836 Score = 64.2 bits (154), Expect = 4e-08, Method: Composition-based stats. Identities = 27/180 (15%), Positives = 63/180 (35%), Gaps = 16/180 (8%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPY 272 ++ ++N ++ I + + S + + ++ + + + S L+ + Sbjct: 164 SDNFATMMNFVRAVISQFQRPSTQFSLMQFSNKFQTHFTFEKFRSSSNPLSLLDSVHQLR 223 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T A+ + L++ + K +I ITDG + L+ + Sbjct: 224 GLTHTATAIQNVVHRLFHASYGARR-----DAAKILIVITDG-----KKEGDRLDYKDVI 273 Query: 333 EYMRNAGMKIYSVAVSAP---PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 AG+ Y++ V + L F V D L + +++ +KI Sbjct: 274 PMADAAGIIRYAIGVGSAFEYINSWKELNDIASKPSQEHIFKVEDFEALKDIQNQLKEKI 333 >gi|260575971|ref|ZP_05843966.1| von Willebrand factor type A [Rhodobacter sp. SW2] gi|259021897|gb|EEW25198.1| von Willebrand factor type A [Rhodobacter sp. SW2] Length = 670 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 38/366 (10%), Positives = 92/366 (25%), Gaps = 28/366 (7%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 AA T + + + + + + A +N Sbjct: 160 AAAPEPAMESADAMAAPMAGAPAPAFKTAPGGMVGLAMPEMAPMPLENTEEFSSAPVNPV 219 Query: 95 KDKNNPLQYIAES---KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 K A Y + +L L + + + +A Sbjct: 220 KVTAEEPVSTFSIDVDTASYAVVRSSLNAGYLPEAEAVRVEEMVNYFPYAYAAPVAGEAP 279 Query: 152 MVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 V+ + L P + T K+ Sbjct: 280 FRTAVTVMQTPWNPGTRLVRIGLQGRLPALDDRPPLNLVFLIDTSG-----SMEDANKLP 334 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 +L +S ++ ++ ++ +AY + N +E+ + L++L Sbjct: 335 LLKQSLRLMLAELRPED--------QVAIVAYAGSAGEILPPTKAENADEILAALDRLGA 386 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 +T + AY+ + + V+ TDG+ + + + Sbjct: 387 GGSTAGAEGLALAYQVARKMAGAGEVSR--------VLLATDGDFNVG--IDDPEGLTKY 436 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 R+ G+ + + ++ + L E+ + D++ Sbjct: 437 IAKQRDTGVYLSVLGFGRGNLDDATMQALAQNGNG--TAAYIDTLNEARKVLVDQLTGAL 494 Query: 392 VRIAPN 397 IA + Sbjct: 495 FPIADD 500 >gi|134291855|ref|YP_001115624.1| hypothetical protein Bcep1808_6472 [Burkholderia vietnamiensis G4] gi|134135044|gb|ABO59369.1| conserved hypothetical protein [Burkholderia vietnamiensis G4] Length = 423 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 23/212 (10%), Positives = 60/212 (28%), Gaps = 6/212 (2%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTI----KDPTTKK 57 + ++V F+ A+DL + R+++Q++ DA LS + S ++ D Sbjct: 26 VGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSAISLSVAEADGIAAG 85 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 F+K + S + + N + Sbjct: 86 HVNFAFFQK-SAVQMLTDSNVTFSDALTNPFLTKTAVSTPANVKYVKCTATLSGIANWFV 144 Query: 118 LFLKGLIPSA-LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 L + + ++ I ++ +I + + + S + + + +S Sbjct: 145 GVLNAMPGVQVANATQVSASAIATVAAGQTTCAIPVFVCRASSAAPYKVGDWISSPSGSS 204 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + Y + + + Sbjct: 205 STYGPGNFGWAALDGSTNETTIASELSGNTCN 236 >gi|326932668|ref|XP_003212436.1| PREDICTED: anthrax toxin receptor 1-like [Meleagris gallopavo] Length = 499 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + + ++ ++ + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYHFVEHLARKFISPQLRMSFIVFSTRGT--ILMR 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENVHGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRELGATVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D L D I K Sbjct: 203 DGFEALQGIIDSILKK 218 >gi|118101296|ref|XP_425758.2| PREDICTED: similar to tumor endothelial marker 8 [Gallus gallus] Length = 552 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + + ++ ++ + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYHFVEHLARKFISPQLRMSFIVFSTRGT--ILMR 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENVHGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRELGATVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D L D I K Sbjct: 203 DGFEALQGIIDSILKK 218 >gi|118350692|ref|XP_001008625.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89290392|gb|EAR88380.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 648 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 61/182 (33%), Gaps = 15/182 (8%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPYENT 275 + ++ K I ++ G T +++ N + + + ++ T Sbjct: 239 IQLVKETLVKIINLMSSMDRICIVCFNESGDRPLTFTRVTDENKQTLLNLIQQIYAGGGT 298 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N ++HA + + N K ++ T ++ ++DG+++ A + Sbjct: 299 NISEGINHALKAIQNRKFKNNVTS--------ILLLSDGQDTKAYTRVKAYIDKYQIKDA 350 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 I ++ LLR +D +G F + D L +F I + + Sbjct: 351 ----FNIETIGFGEDH-DPKLLRTLSDLRNGTFNFMQDVNYLDTAFINIFAGMISTVAQN 405 Query: 395 AP 396 Sbjct: 406 IK 407 >gi|67922256|ref|ZP_00515770.1| von Willebrand factor, type A [Crocosphaera watsonii WH 8501] gi|67855959|gb|EAM51204.1| von Willebrand factor, type A [Crocosphaera watsonii WH 8501] Length = 416 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 54/156 (34%), Gaps = 15/156 (9%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 R+ +A++ + ++ VK + L T+ M +++ KE Sbjct: 76 DDRLSVVAFDHRAKVIVPNQPVDEIDGVKDAIASLKAEGGTSIDEGMKLGIKQVALGKED 135 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + +TDGE + + + L++ + + + ++ Q Sbjct: 136 R---------VSQIFLLTDGE----NEHGDNERCLKLAQVAGEYNITLNTLGFG-NHWNQ 181 Query: 355 DLLRKCTDS-SGQFFAVNDSRELLESFDKITDKIQE 389 D+L DS G + + L F ++ ++Q Sbjct: 182 DVLESIADSVGGTLCYIEQPEQALTEFSRLFTRMQS 217 >gi|294632153|ref|ZP_06710713.1| LOW QUALITY PROTEIN: secreted protein [Streptomyces sp. e14] gi|292835486|gb|EFF93835.1| LOW QUALITY PROTEIN: secreted protein [Streptomyces sp. e14] Length = 403 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 26/139 (18%), Positives = 55/139 (39%), Gaps = 19/139 (13%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N E K+ + L+P T PA+ A +L + + ++ I+DGE++ Sbjct: 95 NRTEAKTAVATLSPTGWTPIGPALLKAATDLEGGDGT-----------RRIVLISDGEDT 143 Query: 318 GASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSR 374 L+ ++ + G+ I ++ + + +D L D + G + V Sbjct: 144 -----CQPLDPCEVAREIAAKGIGLTIDTLGLVPDAKTRDQLSCIADATGGTYTDVRHKD 198 Query: 375 ELLESFDKITDKIQEQSVR 393 EL + ++ D+ + V Sbjct: 199 ELSDRVGQLVDRAADPVVT 217 >gi|260786070|ref|XP_002588082.1| hypothetical protein BRAFLDRAFT_83083 [Branchiostoma floridae] gi|229273239|gb|EEN44093.1| hypothetical protein BRAFLDRAFT_83083 [Branchiostoma floridae] Length = 528 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 61/173 (35%), Gaps = 21/173 (12%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTN 276 VN + R+G + Y+ N S +N ++ TN Sbjct: 16 QFAVNVVNTFDISPTA--TRVGVVQYSDRNSLVFNLGNKVNKPSTVSAINGISYQSGGTN 73 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T A+ + + + + K +I +TDG++S + + + + Sbjct: 74 TGAALKYVRQ---------YAAWREGNVPKVIIVLTDGKSSDSVSGPS--------RDLV 116 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 AG+++Y++ V GQ LL+ + +N+ L D I+ + Sbjct: 117 AAGVEVYAIGVGKFDHGQ-LLQIANNKQNNVIELNNFNALATKIDMISTNVCS 168 >gi|300716700|ref|YP_003741503.1| von Willebrand factor, type A domain protein [Erwinia billingiae Eb661] gi|299062536|emb|CAX59653.1| von Willebrand factor, type A domain protein [Erwinia billingiae Eb661] Length = 325 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 30/216 (13%), Positives = 70/216 (32%), Gaps = 36/216 (16%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + + S ++ + S V + RIG + + Sbjct: 101 ILDVSGSMEKNDAQDGITRLQAVQRSVRAFVAQRKT---------DRIGLVIFASSAWPF 151 Query: 251 QCTPLSNNLNEVKSRLNKLNPYE---NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P+S + + +R+N+L P T A+ A + L + + + Sbjct: 152 A--PISEDKQALLARINQLAPGMIGQQTAIGDALGVAVKLLDSSLDR--------DASRL 201 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------PPEGQDLLRKCT 361 I +TDG ++ + L+ + + ++++++A LL++ Sbjct: 202 AILLTDGNDTA-----SQLSPALAAQLAASHHVQVHTIAFGDINSSGEDKVDTALLKQIA 256 Query: 362 D-SSGQFFAV-NDSRELLESFDKITDKIQEQSVRIA 395 + G+ + L + +I D + VR Sbjct: 257 QLTGGEALQASTSGKALDSVWQQI-DAMTPSQVRTT 291 >gi|194221223|ref|XP_001915876.1| PREDICTED: inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Equus caballus] Length = 834 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 21/175 (12%), Positives = 55/175 (31%), Gaps = 11/175 (6%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 K+ + + + N+ E ++ + TN A+ A + Sbjct: 301 DLGPKDQFNLVCFNEEATQWKPSLVPASAENMKEARNFAAGIMARGGTNINDAVLLAVQL 360 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L + GS +I +TDG+ + N N + + + ++ + Sbjct: 361 LERANKQELLPAGS---VSLIILLTDGDPTVGE--TNRANIQKNVQEAISGQCSLFCLGF 415 Query: 348 SAPPEGQDLLRKCTDSSGQ-----FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L K +G + + + +L + + ++ + + + P+ Sbjct: 416 GF-HVSYAFLEKLALDNGGLARRIYEDSDSALQLQDFYQEVANPLLMKVAFEYPS 469 >gi|2654431|gb|AAC01506.1| type XII collagen [Homo sapiens] Length = 517 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 61/167 (36%), Gaps = 23/167 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 V+I + Y+ + ++ +N TNT AM + +++ Sbjct: 174 SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNTGKAMTYVREKIF- 232 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K +I ITDG++S A +RN+ ++I++V V Sbjct: 233 ----VPSKGSRSNVPKVMILITDGKSSDAFRDP--------AIKLRNSDVEIFAVGV--K 278 Query: 351 PEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS-VRI 394 L F V D ++F +I+ ++ + +RI Sbjct: 279 DAVDSELEAIASPPAETHVFTVEDF----DAFQRISFELTQSICLRI 321 >gi|163848161|ref|YP_001636205.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526064|ref|YP_002570535.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669450|gb|ABY35816.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222449943|gb|ACM54209.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 905 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 25/138 (18%), Positives = 51/138 (36%), Gaps = 26/138 (18%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 +L ++ L++L+ TN M A + + VI +TDG Sbjct: 475 DLATIEDALSRLSAGGGTNIRSGMALAAETIATA----------DARIRHVILLTDGV-- 522 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + +R G+ + +VA+ +L R G+++ V + L Sbjct: 523 ------SETEYADLVANLRAQGVTVSTVAIGLN-TDPELERVAQIGGGKYYVVRQAEALP 575 Query: 378 ESFDKITDKIQEQSVRIA 395 + + E++VR+A Sbjct: 576 QI-------LLEETVRVA 586 >gi|328712312|ref|XP_003244777.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 isoform 2 [Acyrthosiphon pisum] Length = 919 Score = 63.8 bits (153), Expect = 4e-08, Method: Composition-based stats. Identities = 51/357 (14%), Positives = 105/357 (29%), Gaps = 38/357 (10%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 +S ++ + D T + +TI K ++ K + K Q+ T Sbjct: 259 SSNITTLEVPDIKTANEIETTISKNKLAKISYESGNKATITWSPTVKEQLTFT-----EH 313 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + Y++ ++ + LI + L + VLDVS SM Sbjct: 314 GVKGQFIVHYDVDHKSAPNQVLIDDGYF-----VHFFAPTDLKPLRTHVIFVLDVSGSMV 368 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 L + + ++ +S Sbjct: 369 GQKLPQVKEAMGQILSEIHSEDFFTLILFSDFAQVWT------------------INATQ 410 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 + ++ N +G N+ K + L +TN A+ Sbjct: 411 ETSNHWDEKVSNWKTNNNISLDTLGENRFVFPATEQNVQYAKKFIQDLQSESSTNMEDAL 470 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 + A L + + G+ K ++F+TDGE + L ++ Sbjct: 471 NKA--HLIAKLGETRFKDGANTPKPIIVFLTDGEPTTGITEPQEL--IKYVSNTNEEKYP 526 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQF----FAVNDSR-ELLESFDKITDKIQEQSVR 393 IYS+ D L+K + ++ F + +D+ +L + +I+ + Sbjct: 527 IYSLGFGEGA-DIDFLKKLSLNNTGFARVIYEASDASLQLRNFYKEISSPVLSNVTF 582 >gi|251791982|ref|YP_003006702.1| TadG [Aggregatibacter aphrophilus NJ8700] gi|247533369|gb|ACS96615.1| TadG [Aggregatibacter aphrophilus NJ8700] Length = 592 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 48/152 (31%), Gaps = 15/152 (9%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 S +E+ ++ ++ T + L N + + T + Sbjct: 423 KSTATDIWFSSKKSSELNKVMSGIHAGGWTLASAGVFVGTNLLMNINKDATPDKIKTNTQ 482 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNA------------GMKIYSVAVSAPPEG 353 + ++ ++DG ++ L +C +RN KI VA + Sbjct: 483 RILLVLSDGVDTALPTLTQELLKGGMCNKVRNKLDELQDKNYRILPTKIAFVAFGYEQDS 542 Query: 354 Q--DLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + C G + + + LLE F +I Sbjct: 543 ELRKEWENCV-GPGNYHQAKNEKALLEVFKQI 573 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/225 (11%), Positives = 59/225 (26%), Gaps = 44/225 (19%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAVLSGCASIV--------------------------- 45 I +A+D + I+ R ++ A + A L Sbjct: 40 IAFAVDGSGILLDRARLAQATEQAALLLTTENNQYRADKSNLSNVQVTDEEIKNAKGSFK 99 Query: 46 SDRTIKDPTTKKDQTSTIFKKQIKKHLKQGS-----YIRENAGDIAQKAQINITKDKNNP 100 + + K + + + +K +L+ T + N Sbjct: 100 TAQDRKKGAQALKRNQELVQGMVKLYLRSYDKEQKSSSPITIPKDFVAECRTQTSTRTNG 159 Query: 101 LQYIAESKAQYEIPTENL--FLKGLIPSALTNLSLRSTGIIERSSEN--LAISICMVLDV 156 + ++ + + L + + + S ++ + I + +V D+ Sbjct: 160 ESSSVACLVEGDVKRKFWLPWSYTLTSNNRNTVDINSGKSYAVKEKDILIPIDLMLVNDI 219 Query: 157 SRSM--------EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 S SM + L+P P K+ N Sbjct: 220 STSMFKPPKDDPQGPKKIDSLKTVVKAVANILIPDEPPKNISKYN 264 >gi|149502255|ref|XP_001506498.1| PREDICTED: similar to anthrax toxin receptor, partial [Ornithorhynchus anatinus] Length = 183 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 62/192 (32%), Gaps = 18/192 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + + + ++ ++ + +R+ I ++ L+ + ++ LN L Sbjct: 1 SGSVRHHWTEIYSFVESLAEKFISPMLRMSFIVFSSRGT--TVMKLTEDREAIRRGLNIL 58 Query: 270 N---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 P +T + A ++ E T +I +TDGE Sbjct: 59 RSEVPGGDTFMHEGFIRANEQISFENSGGLRT------ASVIIALTDGELQRDQF----Y 108 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN-DSRELLESFDKITD 385 + + R+ G +Y V V + L DS F V L D I Sbjct: 109 YAEKEADRARSLGAIVYCVGV--KDFNETQLSTIADSIDHVFPVTGGFHALRGVIDSILK 166 Query: 386 KIQEQSVRIAPN 397 K + + + P+ Sbjct: 167 KSCIEILAVEPS 178 >gi|198420236|ref|XP_002121660.1| PREDICTED: similar to collagen type VI alpha 6 [Ciona intestinalis] Length = 1638 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 67/195 (34%), Gaps = 24/195 (12%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + R+ + E G+ V + + +IG + Y+ + ++ + Sbjct: 802 SNSIGPREFETTKEWIGSFVREFE-----IGEYNTKIGVVQYSSRVRSEIDIGDYDSKAD 856 Query: 262 VKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + ++ + NTNT A+ + H + K +I +TDG Sbjct: 857 LLAAISSIEFAAGNTNTGSALEYVRTV---GFSGRHGARNG--VPKVLIVLTDGNAQDG- 910 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD---SSGQFFAVNDSRELL 377 L + G+ +Y++ V P GQ L T +S + V + + Sbjct: 911 -------VLDAASKLHRDGVAVYAIGVGRPNMGQ--LNAVTSEPITSPNIYHVRNFDAIQ 961 Query: 378 ESFDKITDKIQEQSV 392 + ++ + V Sbjct: 962 TIQSGLLRRVCNKVV 976 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 62/179 (34%), Gaps = 23/179 (12%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHA 284 + + R+ + Y+ ++ V ++ + NT T A+ +A Sbjct: 61 TLNFDISRDTTRVAVVQYSSYPRTEFDLDTYSSAVGVLRGIDLIQYMSGNTQTGLALRYA 120 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 +++ K I ++DG + + +R++G+ ++ Sbjct: 121 IESVFSRAR--------EDSAKVAIVLSDGRSQD--------QVNEAATSLRSSGIATFA 164 Query: 345 VAVSAP--PEGQDLLRKC----TDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 V + E + LR+ + F D R + + +++ + EQ+V+ P Sbjct: 165 VGIGDEMSHERMEELRQISVAAAEDQSSVFMAKDFRSIGQLQERLVSAVCEQTVQECPT 223 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 31/296 (10%), Positives = 78/296 (26%), Gaps = 37/296 (12%) Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 S A + P + L + + V D S + Sbjct: 319 SSANGDRPDVPDLAIVMTDGKAQEPDLVVEAANRVHEAGVTVYTVGVADYSLEEIKVIAS 378 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA-------------PAPAPANRKIDVL 213 + N + + + + ++ T ++ + + + L Sbjct: 379 DPDKNYVIEALNFDIIELKRRGLIKSICTDAEQTCPAATAELVFLIDGSTSIGFDNFEKL 438 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY- 272 ++V++ Q R+ + + V + + ++ Sbjct: 439 KRWLKSIVDAFQ-----VGPHYTRVAVVQFTNRPVLEFGLNDHSTTQATLQAIQRIRYRR 493 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 +T+T A+ E++ + K +I +TDG++ + Q Sbjct: 494 GSTSTGRAIEFVMNEVFTHSR--------ENVPKILIALTDGQSQD--------DVTQAT 537 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 AG ++++ LR+ F + + K+ I Sbjct: 538 ASAAEAG--VHTLVFGIGNTRPGQLRQLVSKEDHVFQAAGFDVIQKMQSKLVSLIC 591 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 18/165 (10%), Positives = 47/165 (28%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + + R+ Y I + E+ ++++ T T A+ H Sbjct: 635 DVGKYTTRVALTQYTSTINTEFDFKKFSTKREIDYAIDQMEFAGGATLTAQALVHIRENG 694 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E+ + K ++ I+DG ++ N + +G+ ++++ V Sbjct: 695 FTEESGAR-----PGAPKVLVVISDG--------RSADNIETPARKLHESGVYVFAIGVG 741 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQS 391 L + + I ++ Sbjct: 742 NTWRSA--LEIIGSEPVVTHVQEGASYDAINNFRRDLVRNICRET 784 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/168 (17%), Positives = 58/168 (34%), Gaps = 19/168 (11%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKES 294 VR+G I Y+ N N+ ++ ++ +T T A+ + + + Sbjct: 1022 VRVGMIQYSTRPKTNIAIGQYNDKESLQEAFGQVEWQLGDTYTARALRYVSKSY-----A 1076 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 T + K +I ITDG+ + Q + + G +I+++ V Sbjct: 1077 RATTRENLHATKLLIIITDGQPQDRN------EVKQAVRNLHSEGWRIFAIGVG--QSDI 1128 Query: 355 DLLRKCTDSS--GQFFAVNDSRELLESFD-KITDKIQ-EQSVRIAPNR 398 L + F N+ F +++ I + S PNR Sbjct: 1129 SELGILASNPDSDHVFYANNFNS-TRIFQGRLSRLICGDPSFESKPNR 1175 >gi|255262383|ref|ZP_05341725.1| von Willebrand factor, type A [Thalassiobium sp. R2A62] gi|255104718|gb|EET47392.1| von Willebrand factor, type A [Thalassiobium sp. R2A62] Length = 634 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 46/370 (12%), Positives = 101/370 (27%), Gaps = 30/370 (8%) Query: 30 QSALD--AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 +SA D A + G ++ + ++ Q L + Sbjct: 120 ESAADLSAGGVQGMIAMDTTAAAPQALSRNAQPLGSVAASSMSDLAIVAPQPTTKQYPNA 179 Query: 88 KAQINITKDKNNPLQYIAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENL 146 + + + A Y + +L L PS + Sbjct: 180 EQNPLKIASDEPVSTFSIDVDAASYAVVRSSLSRAQLPPSGAVRIEEMINYFSYDYLATD 239 Query: 147 AISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 A + S + + + PP + +T+ S P Sbjct: 240 AGDVPFRATTSVFETPWNADTQLLHIGIQGTLPDVSEHPPLNLVFLIDTSGSMNQP---- 295 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 K+ +LI + +++ ++ + + I Y +++ + + Sbjct: 296 --DKLPLLISAFRLMLSELRPEDE--------VSIITYAGSAGQVLAPTPASDRATILAA 345 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 LN+L+ +T + AY E VI TDG+ + +T Sbjct: 346 LNRLSAGGSTAGQAGLRQAYAIAAAMSEDGEIAR--------VILATDGDFNVGLNDPDT 397 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 L R++G + + +++ + L E+ + D Sbjct: 398 L--KDYITTRRDSGTYLSVLGFGRGNLNDAVMQSLAQNGNGI--AAYIDTLSEAQKVLVD 453 Query: 386 KIQEQSVRIA 395 ++ IA Sbjct: 454 QLSGALFPIA 463 >gi|291403690|ref|XP_002718170.1| PREDICTED: cochlin [Oryctolagus cuniculus] Length = 551 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + + T T A+ R ++ S Sbjct: 406 KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPVRDS 465 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ ITDG++ + +AG+ I+SV V+ P Sbjct: 466 PNK-------NFLVIITDGQSYD--------DVRGPAAAAHDAGITIFSVGVAWAPLDD- 509 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 510 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 543 >gi|198436258|ref|XP_002122777.1| PREDICTED: similar to HyTSR1 protein [Ciona intestinalis] Length = 4258 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 32/233 (13%), Positives = 73/233 (31%), Gaps = 23/233 (9%) Query: 156 VSRSMEDLYLQKHNDNNNMTSN-KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + D + LPP + ++ + + N +++ Sbjct: 2045 NGDPGDGNCTGAPVDVLACNTIACSDLPPDGVREDCAQLDIAIIIDSSSSIGNANFEIVR 2104 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE- 273 ++ + S RI + +N ++ S + ++ R++++ Sbjct: 2105 SFLRRVMAQLN-----VGLNSTRIAALRFNRDVIPLWSFAQSTSSEDLIERIDRVTYDGS 2159 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T+T A+ +A L+ E E + + +TDG + Sbjct: 2160 GTHTGKALTYAANRLFTEAEG-----DRPDVPDLAVVLTDGRAQDNPGDT--------VQ 2206 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKIT 384 ++N G+K+ ++AV + + F VND L + I Sbjct: 2207 ALKNKGVKVIAIAV-TNRVDINEIYAIASDPDEQNAFFVNDFEGLFSVVENIA 2258 >gi|114586712|ref|XP_001158576.1| PREDICTED: alpha 1 type VII collagen isoform 1 [Pan troglodytes] Length = 2944 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 57/162 (35%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR T+ Y+ + +V + +L+ NT T A+ H ++ Sbjct: 73 SAQGVRFATVQYSDDPRTEFGLDAFGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFL 132 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 133 PQLARPG------VPKVCILITDGKSQD--------LVDTAAQRLKGQGVKLFAVGI--K 176 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 177 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 55/169 (32%), Gaps = 23/169 (13%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKE 293 V++G ++Y+ S++L + R+ + N A+ A+R + Sbjct: 1091 VQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGNNLGTAVVTAHRYMLAPDA 1150 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + ++ + D G + AG+ + V + Sbjct: 1151 PGRR----QHVPGVMVLLVDEPLRGDIFSP--------IREAQAAGLNV--VMLGMAGAD 1196 Query: 354 QDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ G FFAV+D L ++ + + + S P Sbjct: 1197 PEQLRRLA--PGMDSVQTFFAVDDGPSLDQAVSGLATALCQASFTTQPR 1243 >gi|114586714|ref|XP_516439.2| PREDICTED: alpha 1 type VII collagen isoform 2 [Pan troglodytes] Length = 2912 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 57/162 (35%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR T+ Y+ + +V + +L+ NT T A+ H ++ Sbjct: 73 SAQGVRFATVQYSDDPRTEFGLDAFGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFL 132 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 133 PQLARPG------VPKVCILITDGKSQD--------LVDTAAQRLKGQGVKLFAVGI--K 176 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 177 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 55/169 (32%), Gaps = 23/169 (13%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKE 293 V++G ++Y+ S++L + R+ + N A+ A+R + Sbjct: 1091 VQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGNNLGTAVVTAHRYMLAPDA 1150 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + ++ + D G + AG+ + V + Sbjct: 1151 PGRR----QHVPGVMVLLVDEPLRGDIFSP--------IREAQAAGLNV--VMLGMAGAD 1196 Query: 354 QDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ G FFAV+D L ++ + + + S P Sbjct: 1197 PEQLRRLA--PGMDSVQTFFAVDDGPSLDQAVSGLATALCQASFTTQPR 1243 >gi|260890922|ref|ZP_05902185.1| D-amino acid dehydrogenase large subunit [Leptotrichia hofstadii F0254] gi|260859475|gb|EEX73975.1| D-amino acid dehydrogenase large subunit [Leptotrichia hofstadii F0254] Length = 516 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 21/143 (14%), Positives = 57/143 (39%), Gaps = 18/143 (12%) Query: 245 IGIVGNQCTPL-SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 +P+ + N +++KS L + P T+ ++ + +L + + Sbjct: 226 SCSANELISPIETLNKDKLKSSLAPIQPTGWTSIAKSIENGTNDLK--------ALKGEK 277 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKCT 361 + ITDG + N ++ + +N + + + + +L++ Sbjct: 278 TLNILYIITDGIET------CDGNPVETAKKFKNENTDIVLGIIGFNVDAHQNKVLKEIA 331 Query: 362 DSSGQFF-AVNDSRELLESFDKI 383 +++ ++ +VND+ +L E +I Sbjct: 332 NAANGYYSSVNDAAKLTEELQRI 354 >gi|260810222|ref|XP_002599902.1| hypothetical protein BRAFLDRAFT_74022 [Branchiostoma floridae] gi|229285186|gb|EEN55914.1| hypothetical protein BRAFLDRAFT_74022 [Branchiostoma floridae] Length = 1201 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 61/161 (37%), Gaps = 19/161 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y+ N S +N + TNT A+ + + Sbjct: 700 DVSLTATRVGVVQYSDRNTLVFNLGNKVNKPSTVSAINNIVYQSGGTNTGAALQYVRQ-- 757 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + K +I +TDG++S + + + + + AG+++Y++ V Sbjct: 758 -------YAAWRGGNVPKVIIVLTDGKSSDSVSGPS--------QNLVAAGVEVYAIGVG 802 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + GQ LL+ + +N+ L D I+ + Sbjct: 803 SFDHGQ-LLQIANNKQNNVIELNNFNALATKIDMISTNVCS 842 >gi|296475339|gb|DAA17454.1| cochlin precursor [Bos taurus] Length = 550 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 47/155 (30%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + ++ T T A+ R ++ S Sbjct: 405 KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNISYMSGGTATGDAISFTVRNVFGPVRDS 464 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 465 PNK-------NFLVIVTDGQSYD--------DVRGPAAAAHDAGITIFSVGVAWAPLDD- 508 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 509 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 542 >gi|304314705|ref|YP_003849852.1| cobaltochelatase subunit-like protein [Methanothermobacter marburgensis str. Marburg] gi|302588164|gb|ADL58539.1| predicted cobaltochelatase subunit-like protein [Methanothermobacter marburgensis str. Marburg] Length = 663 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 12/136 (8%), Positives = 39/136 (28%), Gaps = 9/136 (6%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 P + + + ++ + T + L + F++ + Sbjct: 529 IIPSTARASSFRDTVDSIRVGGTTPMAQGIKRGLEIL-------REEKRHSEYVPFMVIL 581 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFA 369 +DG + ++ +R + + G + S G ++ Sbjct: 582 SDGMPNVGVERNPKREAVEAAARLREEDIPSAVINFEQGSRGGRDLNMEIALASGGSYYD 641 Query: 370 VNDSRELLESFDKITD 385 ++D + + +I + Sbjct: 642 LHDLEDPSMAVPRIME 657 >gi|115305395|gb|AAI23842.1| COCH protein [Bos taurus] Length = 550 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 47/155 (30%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + ++ T T A+ R ++ S Sbjct: 405 KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNISYMSGGTATGDAISFTVRNVFGPVRDS 464 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 465 PNK-------NFLVIVTDGQSYD--------DVRGPAAAAHDAGITIFSVGVAWAPLDD- 508 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 509 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 542 >gi|118150796|ref|NP_001071310.1| cochlin precursor [Bos taurus] gi|75057908|sp|Q5EA64|COCH_BOVIN RecName: Full=Cochlin; Flags: Precursor gi|59857775|gb|AAX08722.1| coagulation factor C homolog, cochlin precursor [Bos taurus] Length = 550 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 47/155 (30%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + ++ T T A+ R ++ S Sbjct: 405 KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNISYMSGGTATGDAISFTVRNVFGPVRDS 464 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 465 PNK-------NFLVIVTDGQSYD--------DVRGPAAAAHDAGITIFSVGVAWAPLDD- 508 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 509 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 542 >gi|290957869|ref|YP_003489051.1| hypothetical protein SCAB_34031 [Streptomyces scabiei 87.22] gi|260647395|emb|CBG70500.1| putative membrane protein [Streptomyces scabiei 87.22] Length = 534 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 41/355 (11%), Positives = 95/355 (26%), Gaps = 23/355 (6%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFK-KQIKKHLKQGSYIRENAGDIAQKAQINI 93 A+ LSG S ++D + T + + K +Y R D + + Sbjct: 193 ASGLSGAQSALTDADVARATPRLKRFFAGQKLTSGSSGWLATAYDRRGDVDALLNYESVL 252 Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 + + + + P +L G + + ++ Sbjct: 253 KSRPDLTVIRPRDGVVTADYPLSSLASTGTDVRDDVRRLTDALRTPDVQRLITERTLRRP 312 Query: 154 LDVSRSMEDLYLQKHNDNNN----MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK 209 + S + LL T + Sbjct: 313 VVASVPPAAGLDTTRRRELPFPGSRSVAVGLLDAYENDLRRPSRTVYVLDTSGSMEG-DR 371 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 +D L + L + + + L+ +++ KL Sbjct: 372 LDRLKTALTELTGDFRDREEVTLMPFGSDVKSVRTHVVRPADPKA---GLDGIRADTRKL 428 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + T Y ++ AY L + + ++ +TDGEN+ ++ + + Sbjct: 429 SAAGETAIYTSLRRAYEHLGAVDRDTFTS---------IVLMTDGENTEGASPADFDDFY 479 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 + ++ + +D L + + G+ F L +F++I Sbjct: 480 GRLPDA-ARHIPVFPILFG--DSDRDELEHIAEVTGGRLFDATRGS-LDGAFEEI 530 >gi|260813588|ref|XP_002601499.1| hypothetical protein BRAFLDRAFT_248612 [Branchiostoma floridae] gi|229286796|gb|EEN57511.1| hypothetical protein BRAFLDRAFT_248612 [Branchiostoma floridae] Length = 375 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 21/189 (11%), Positives = 60/189 (31%), Gaps = 21/189 (11%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + D + + ++ + ++G I Y+ Sbjct: 16 SGSVGASNFDKVKQFTKKAISGF-----DISPSGTQVGVIQYSTRTRQEFSLNSFLTKET 70 Query: 262 VKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + S ++++ T T A+ + + + + + + + K VI +TDG + Sbjct: 71 LSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKSDGAR-----PGVPKVVIVVTDGVSYD-- 123 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + G+ +Y++ V D L + ++ V++ L Sbjct: 124 ------AVAAPALEAQQKGITVYAIGV--SGYDADQLEQIASNNNTLAFVDNFNLLDNLR 175 Query: 381 DKITDKIQE 389 + + + + Sbjct: 176 NTLLTGVCD 184 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 17/137 (12%), Positives = 44/137 (32%), Gaps = 16/137 (11%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAY 285 + + ++G I Y+ + + ++++ T T A+ + Sbjct: 254 SGFDISPSGTQVGVIQYSTRTRQEFSMNSFLTKETLSAAIDEVQYMRGGTLTGKAIRYVT 313 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + + + K VI +TDG + + G+ +Y++ Sbjct: 314 KYGFGKSDGAR-----PGVPKVVIVVTDGVSYD--------AVAAPALEAQQKGITVYAI 360 Query: 346 AVSAPPEGQDLLRKCTD 362 V D L + Sbjct: 361 GV--SGYDVDQLEQIAS 375 >gi|326923641|ref|XP_003208043.1| PREDICTED: anthrax toxin receptor 1-like [Meleagris gallopavo] Length = 597 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 60/187 (32%), Gaps = 18/187 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + + + ++ ++ + +R+ I ++ L+ N ++ Sbjct: 90 YFILDKSGSVRNHWTEIYSFVESLAEKFISPMLRMSFIVFSSRGT--TIMKLTENREAIR 147 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 L L P +T + A ++Y H T G R +I +TDGE Sbjct: 148 RGLEILQHEVPGGDTFMHEGFKRANEQIY------HETYGGVRTASVIIALTDGELQ--- 198 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN-DSRELLES 379 Q R+ G +Y V V + L DS F V L + Sbjct: 199 -DVQFYYAEQEANRARSFGAIVYCVGV--KDFNETQLSTIADSIDHVFPVTGGFYALRGT 255 Query: 380 FDKITDK 386 D I K Sbjct: 256 IDSILKK 262 >gi|118092778|ref|XP_421647.2| PREDICTED: similar to tumor endothelial marker 8 [Gallus gallus] Length = 555 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 35/187 (18%), Positives = 60/187 (32%), Gaps = 18/187 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + + + ++ ++ + +R+ I ++ L+ N ++ Sbjct: 49 YFILDKSGSVRNHWTEIYSFVESLAEKFISPMLRMSFIVFSSRGT--TIMKLTENREAIR 106 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 L L P +T + A ++Y H T G R +I +TDGE Sbjct: 107 RGLEILQYEVPGGDTFMHEGFKRANEQIY------HETYGGVRTASVIIALTDGELQ--- 157 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN-DSRELLES 379 Q R+ G +Y V V + L DS F V L + Sbjct: 158 -DVQFYYAEQEANRARSFGAIVYCVGV--KDFNETQLSTIADSIDHVFPVTGGFYALRGT 214 Query: 380 FDKITDK 386 D I K Sbjct: 215 IDSILKK 221 >gi|134093078|gb|ABO52938.1| matrilin 4 isoform 1 precursor [Colobus guereza] Length = 581 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 56/183 (30%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 360 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 414 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 415 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 468 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + Sbjct: 469 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLQNLRG 522 Query: 386 KIQ 388 I Sbjct: 523 SIC 525 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDVIQEF 205 >gi|281183022|ref|NP_001162498.1| matrilin-4 [Papio anubis] gi|134093054|gb|ABO52914.1| matrilin 4 isoform 1 precursor [Papio anubis] Length = 581 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 56/183 (30%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 360 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 414 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 415 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 468 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + Sbjct: 469 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLQNLRG 522 Query: 386 KIQ 388 I Sbjct: 523 SIC 525 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|156383825|ref|XP_001633033.1| predicted protein [Nematostella vectensis] gi|156220097|gb|EDO40970.1| predicted protein [Nematostella vectensis] Length = 204 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 59/181 (32%), Gaps = 20/181 (11%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-Y 272 N +K + + A + N L ++N +K+ ++ L Sbjct: 36 DTEFDNFKEFAKKLAESFTISATYTHVAAVYFNTLANFGFNLKYDINVIKTAIDNLPNIG 95 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+ A+ + ++ +K ++ +TDG++ + Sbjct: 96 GGTHIGKALTYTLDNVFKVAPR-------QNVKNVLVVLTDGKSHDSVTLP--------A 140 Query: 333 EYMRNAG--MKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 +RN G +++++V V A L F V ++ + + D+I Sbjct: 141 AAVRNYGPGVEVFAVGVGAGDSFVAQLNVIASDPDEDHVFHVEHFSQIESTTGAVEDEIC 200 Query: 389 E 389 + Sbjct: 201 K 201 >gi|266631670|emb|CBH29316.3| integrin, alpha X (complement component 3 receptor 4 subunit [Equus caballus] Length = 1160 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 67/192 (34%), Gaps = 23/192 (11%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + ++ +++ Q+ + + + + PL Sbjct: 160 SGSIYFKDFAKMLSFVKAVMSQFQRPSTQFSLMQFSNKFLVHFTFKDFMDSSDPLG---- 215 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + +++L T+T A+ +EL++ + K +I ITDG+ G Sbjct: 216 -LLNSVSQLR--GLTHTASAIQVVIKELFSATRGAR-----KDASKILIVITDGQKQGDY 267 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSSGQF--FAVNDSRE 375 + + + AG+ Y+V V + + L F V+D Sbjct: 268 LGYDDVIPMAEA-----AGIIRYAVGVGSAFQSTQAWQELNDIASKPSHEHIFKVDDFDA 322 Query: 376 LLESFDKITDKI 387 L + +++ +KI Sbjct: 323 LRDIQNQLKEKI 334 >gi|166795266|ref|NP_001107649.1| integrin alpha-X [Equus caballus] gi|164507177|gb|ABY59790.1| integrin alpha X [Equus caballus] Length = 1160 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 28/192 (14%), Positives = 67/192 (34%), Gaps = 23/192 (11%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + ++ +++ Q+ + + + + PL Sbjct: 160 SGSIYFKDFAKMLSFVKAVMSQFQRPSTQFSLMQFSNKFLVHFTFKDFMDSSDPLG---- 215 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + +++L T+T A+ +EL++ + K +I ITDG+ G Sbjct: 216 -LLNSVSQLR--GLTHTASAIQVVIKELFSATRGAR-----KDASKILIVITDGQKQGDY 267 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSSGQF--FAVNDSRE 375 + + + AG+ Y+V V + + L F V+D Sbjct: 268 LGYDDVIPMAEA-----AGIIRYAVGVGSAFQSTQAWQELNDIASKPSHEHIFKVDDFDA 322 Query: 376 LLESFDKITDKI 387 L + +++ +KI Sbjct: 323 LRDIQNQLKEKI 334 >gi|301781320|ref|XP_002926070.1| PREDICTED: collagen alpha-1(VII) chain-like [Ailuropoda melanoleuca] Length = 2994 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 56/162 (34%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR + Y+ + +V + +L+ NT T A+ H ++ Sbjct: 73 SAQGVRFAAVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFL 132 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 133 PQLARPG------VPKVCILITDGKSQD--------LVDTAAQRLKGQGVKLFAVGI--K 176 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 177 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 62/189 (32%), Gaps = 25/189 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + + + LV+++ +V++G ++Y+ S + + + Sbjct: 1065 NAHRAEAVKRALERLVSALGPLGP----QAVQVGLLSYSHRPSPLFPLNSSYDPGVILQK 1120 Query: 266 LNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + +P N N A+ A+R L + ++ + D G Sbjct: 1121 IRNIPYVDPSGN-NLGTAVVTAHRHLLAPDAPGRR----QHVPGIMVLLVDEPLRGDIFN 1175 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLES 379 + AG+K+ + + LR+ + FFAV+D L + Sbjct: 1176 P--------IREAQAAGLKVMI--LGQAGADPEQLRRLVPGTDPVQTFFAVDDGPSLERA 1225 Query: 380 FDKITDKIQ 388 + + Sbjct: 1226 VSSLATSLC 1234 >gi|281343115|gb|EFB18699.1| hypothetical protein PANDA_015680 [Ailuropoda melanoleuca] Length = 2904 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 56/162 (34%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR + Y+ + +V + +L+ NT T A+ H ++ Sbjct: 45 SAQGVRFAAVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFL 104 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 105 PQLARPG------VPKVCILITDGKSQD--------LVDTAAQRLKGQGVKLFAVGI--K 148 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 149 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 190 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 62/189 (32%), Gaps = 25/189 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + + + LV+++ +V++G ++Y+ S + + + Sbjct: 1037 NAHRAEAVKRALERLVSALGPLGP----QAVQVGLLSYSHRPSPLFPLNSSYDPGVILQK 1092 Query: 266 LNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + +P N N A+ A+R L + ++ + D G Sbjct: 1093 IRNIPYVDPSGN-NLGTAVVTAHRHLLAPDAPGRR----QHVPGIMVLLVDEPLRGDIFN 1147 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLES 379 + AG+K+ + + LR+ + FFAV+D L + Sbjct: 1148 P--------IREAQAAGLKVMI--LGQAGADPEQLRRLVPGTDPVQTFFAVDDGPSLERA 1197 Query: 380 FDKITDKIQ 388 + + Sbjct: 1198 VSSLATSLC 1206 >gi|226496057|ref|NP_001151334.1| LOC100284967 [Zea mays] gi|195645892|gb|ACG42414.1| protein binding protein [Zea mays] Length = 516 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 72/247 (29%), Gaps = 35/247 (14%) Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 S + +V + + + N L + Sbjct: 12 SPNSGPRPTPIVPGRVQLVSKNNNMAPLEENTQKVLLELTGGDSTSDRSGLDLVAVLDVS 71 Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS----N 257 KI+ + + +V + R+ + + N+ PL + Sbjct: 72 GSMQGE-KIEKMKTAMKFVVKKLSSID--------RLSIVTFLDTA--NRICPLQQVTED 120 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + ++ ++ L P NTN + + L + K SS +G V+ ++DG+ + Sbjct: 121 SQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVVG-------VMLMSDGQQN 173 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSRE 375 ++ + +Y+ A +L G F VND Sbjct: 174 RGEP----------AANVKIGNVPVYTFGFGAD-YDPTVLNAVARNSMGGTFSVVNDVNL 222 Query: 376 LLESFDK 382 L +F + Sbjct: 223 LSMAFSQ 229 >gi|148656885|ref|YP_001277090.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568995|gb|ABQ91140.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 412 Score = 63.8 bits (153), Expect = 5e-08, Method: Composition-based stats. Identities = 20/188 (10%), Positives = 61/188 (32%), Gaps = 23/188 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K+ L ++ ++ ++ + + ++ + + + + ++++ Sbjct: 58 KLAALKDAVKRVIETLTPQDI--------VAIVLFDDTVQTLVPATFATDKATLIAQVDA 109 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + T M EL + ++ +TDG+ + + Sbjct: 110 IEEAGGTAMSGGMAAGIVELRKNHDPGRV--------GAMLLLTDGQT-----WGDEDRC 156 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 + + + G++I ++ + A + LL + + G + D ++ F Sbjct: 157 RALAQELARDGVRITALGLGAEWN-EALLDDIAEATGGISDYIADPAQITTFFQHAVRTA 215 Query: 388 QEQSVRIA 395 Q R A Sbjct: 216 QGTVARDA 223 >gi|148681993|gb|EDL13940.1| mCG48880 [Mus musculus] Length = 249 Score = 63.4 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 58/162 (35%), Gaps = 20/162 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 + RIG I Y+ + ++ ++ K ++ + T T A+ A + Sbjct: 47 TARIGIINYSHKVEKVASLKQFSSKDDFKLVVDNMQYLGEGTYTATALQAANDMFKEARP 106 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +KK + ITDG+ ++ + + ++ ++I+ + V + Sbjct: 107 G---------VKKVALVITDGQTD----SRDKKKLADVVKDANDSNVEIFVIGVVKKDDP 153 Query: 354 Q-----DLLRKCTDSSGQFFAVNDSRELLESFDK-ITDKIQE 389 + + + +D L ++ + ++ KI E Sbjct: 154 NFEIFHKEMNLIATDAEHVYQFDDFFTLQDTLKQKLSKKICE 195 >gi|282897673|ref|ZP_06305672.1| von Willebrand factor, type A Precursor [Raphidiopsis brookii D9] gi|281197352|gb|EFA72249.1| von Willebrand factor, type A Precursor [Raphidiopsis brookii D9] Length = 474 Score = 63.4 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 32/149 (21%), Positives = 62/149 (41%), Gaps = 20/149 (13%) Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 N G PL+N++N + + +++L +T ++ A +L +T Sbjct: 92 NFGATVQTPAPLTNDINTLNNAIDQLLEIGSTPMGEGINTAQDQLQ-----------ATT 140 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 L K +I TDG + N+ +RNAG+K+ +AV+ + L + T Sbjct: 141 LNKNIILFTDGLPDDPNFAYNS------ALSVRNAGIKL--IAVATGGADTNYLTQITGD 192 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQSV 392 F +S + ++F + I +Q + Sbjct: 193 RSLVFYA-NSGQFDQAFSQAEAVIYKQLI 220 >gi|172087820|ref|YP_001816750.1| hypothetical protein VF_A1192 [Vibrio fischeri ES114] gi|171902402|gb|ACB55718.1| conserved hypothetical protein containing von Willebrand factor type A domain [Vibrio fischeri ES114] Length = 350 Score = 63.4 bits (152), Expect = 5e-08, Method: Composition-based stats. Identities = 32/197 (16%), Positives = 69/197 (35%), Gaps = 28/197 (14%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + ++D + + + V K R+G I + TP + + Sbjct: 118 FTSTNGLKISRLDAVKKVLNDFV---------KTRKGDRLGLILFGDAAFVQ--TPFTAD 166 Query: 259 LNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 LN+ ++T+ A+ + + S + S K I +TDG Sbjct: 167 HKVWLDLLNQTHVEMAGKSTHLGDAIGLTIKRFEDSNNSQPLSTTSRE--KVAIILTDGN 224 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFF 368 ++ + + + + G++I+ +A+ P D + D S GQ F Sbjct: 225 DTDSYVP-----PMDAAKVAKVKGVRIHMIAIGDPQTVGEQALDMDTINTIADASGGQAF 279 Query: 369 AVNDSRELLESFDKITD 385 + EL+ ++ +I+ Sbjct: 280 QALNQDELINAYAEISK 296 >gi|302557483|ref|ZP_07309825.1| secreted protein [Streptomyces griseoflavus Tu4000] gi|302475101|gb|EFL38194.1| secreted protein [Streptomyces griseoflavus Tu4000] Length = 417 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 65/196 (33%), Gaps = 26/196 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ ++ ++++ + N PL N Sbjct: 51 GQSRMAAAKQAFNEVLDATPEEVELGIRTLGANYPGDDRKEGCKDTAQLYPVGPL--NRT 108 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 E K+ + L P T PA+ A +L K ++ I+DGE++ Sbjct: 109 EAKTAVATLAPTGWTPIGPALLKAADDL-----------DGGDGSKRIVLISDGEDT--- 154 Query: 321 AYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 L+ ++ + G+ I ++ + + L + + G + +V EL Sbjct: 155 --CAPLDPCEVAREIAARGIGLTIDTLGLVPNAKLSRQLSCIAEATGGTYTSVEHQDELT 212 Query: 378 ESFDKITDKIQEQSVR 393 + +++ D+ E V Sbjct: 213 DRVNELVDRAAEPVVT 228 >gi|13699836|ref|NP_085095.1| matrilin-4 isoform 3 precursor [Homo sapiens] Length = 499 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 278 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 332 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 333 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 386 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 387 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 440 Query: 386 KIQ 388 I Sbjct: 441 SIC 443 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|146327011|gb|AAI41812.1| Matrilin 4 [Homo sapiens] Length = 540 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 319 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 373 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 374 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 427 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 428 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 481 Query: 386 KIQ 388 I Sbjct: 482 SIC 484 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 52/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R +G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARASGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|134093095|gb|ABO52955.1| matrilin 4 isoform 1 precursor [Gorilla gorilla gorilla] Length = 581 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 360 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 414 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 415 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 468 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 469 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 522 Query: 386 KIQ 388 I Sbjct: 523 SIC 525 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|119596272|gb|EAW75866.1| matrilin 4, isoform CRA_a [Homo sapiens] Length = 391 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 170 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 224 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 225 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 278 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 279 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 332 Query: 386 KIQ 388 I Sbjct: 333 SIC 335 >gi|119596273|gb|EAW75867.1| matrilin 4, isoform CRA_b [Homo sapiens] Length = 620 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 399 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 453 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 454 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 507 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 508 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 561 Query: 386 KIQ 388 I Sbjct: 562 SIC 564 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|56417742|emb|CAI21077.1| matrilin 4 [Homo sapiens] Length = 432 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 211 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 265 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 266 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 319 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 320 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 373 Query: 386 KIQ 388 I Sbjct: 374 SIC 376 >gi|3927992|emb|CAA07569.1| matrilin-4 [Homo sapiens] gi|124297520|gb|AAI31764.1| Matrilin 4 [Homo sapiens] gi|153217472|gb|AAI51220.1| Matrilin 4 [Homo sapiens] Length = 581 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 360 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 414 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 415 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 468 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 469 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 522 Query: 386 KIQ 388 I Sbjct: 523 SIC 525 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 52/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R +G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARASGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|13699830|ref|NP_003824.2| matrilin-4 isoform 1 precursor [Homo sapiens] gi|4499937|emb|CAB39280.1| matrilin 4 [Homo sapiens] Length = 581 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 360 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 414 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 415 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 468 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 469 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 522 Query: 386 KIQ 388 I Sbjct: 523 SIC 525 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|13699834|ref|NP_085080.1| matrilin-4 isoform 2 precursor [Homo sapiens] gi|119596274|gb|EAW75868.1| matrilin 4, isoform CRA_c [Homo sapiens] Length = 540 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 319 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 373 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 374 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 427 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 428 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 481 Query: 386 KIQ 388 I Sbjct: 482 SIC 484 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|73920229|sp|O95460|MATN4_HUMAN RecName: Full=Matrilin-4; Flags: Precursor gi|5419632|emb|CAB46380.1| matrilin 4 [Homo sapiens] Length = 622 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 401 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 455 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 456 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 509 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 510 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 563 Query: 386 KIQ 388 I Sbjct: 564 SIC 566 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|22760136|dbj|BAC11081.1| unnamed protein product [Homo sapiens] Length = 488 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 267 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 321 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 322 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 375 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 376 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRG 429 Query: 386 KIQ 388 I Sbjct: 430 SIC 432 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 52/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 13 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 72 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R +G++IY+V V Sbjct: 73 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARASGIEIYAVGVQ 122 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 123 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 153 >gi|310657870|ref|YP_003935591.1| hypothetical protein CLOST_0560 [Clostridium sticklandii DSM 519] gi|308824648|emb|CBH20686.1| exported protein of unknown function [Clostridium sticklandii] Length = 873 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 41/320 (12%), Positives = 100/320 (31%), Gaps = 13/320 (4%) Query: 82 AGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIER 141 + + + N + + + + +LS + + R Sbjct: 295 ITNTVRVYGTGPVGEINKTATVVVNGVLEEVLEVMSEEFIEDPGEIYLDLSEFDSQPMMR 354 Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP-PKKSFWSKNTTKSKYA 200 S + + ++ +S L + PP P +T+ S Sbjct: 355 SFQMMTVNPYENDHLSIDKTATTLDLEERMYQVDFEITGTPPEKPVDVILVIDTSGSMGT 414 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 P + + +A N NSI + + + Y + L+NN Sbjct: 415 RIPGDSKAPLYYAKLAAINFANSIIDENPDSRVGVIEFSGGYYGYASDASTVINLTNNKA 474 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + S +N L + TN AY ++ + + L V ++ G S ++ Sbjct: 475 NLASSINGLTTHNMTNIQAGFRLAYNKISAISSTRDSVKSVVFLTDGVANVSIGNWSSSN 534 Query: 321 AYQNTLNTLQICEYMRNA----GMKIYSVA-VSAPPEG------QDLLRKCTDSS-GQFF 368 + +T+ ++ ++++ A +D L+K +++ Sbjct: 535 PVVHNTHTIAAYTEGQSLYSYINGNLFTIGLFGAISNSSVKSIARDTLQKAVYDDLEKYY 594 Query: 369 AVNDSRELLESFDKITDKIQ 388 + + +L ++ I+ K++ Sbjct: 595 EASSAVDLGPVYETISQKLE 614 >gi|291461066|ref|ZP_06026725.2| D-amino acid dehydrogenase large subunit [Fusobacterium periodonticum ATCC 33693] gi|291379168|gb|EFE86686.1| D-amino acid dehydrogenase large subunit [Fusobacterium periodonticum ATCC 33693] Length = 529 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 54/131 (41%), Gaps = 17/131 (12%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N+ ++ L + P T+ ++ + +L + + + ITDG + Sbjct: 252 NVEGIEKALEPIQPTGWTSIAKSIEYGVEDLK--------ALDGEKTLNILYIITDGIET 303 Query: 318 GASAYQNTLNTLQICEYMR--NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF-AVNDSR 374 N ++I + ++ N + + + + LL++ D++G ++ +VND+ Sbjct: 304 CGG------NPVEIAKQLKGENTNIVLGIIGFNVDANQNRLLKQIADAAGGYYSSVNDAD 357 Query: 375 ELLESFDKITD 385 +L +I + Sbjct: 358 KLTGELYRINE 368 >gi|94498564|ref|ZP_01305119.1| hypothetical protein SKA58_08324 [Sphingomonas sp. SKA58] gi|94422007|gb|EAT07053.1| hypothetical protein SKA58_08324 [Sphingomonas sp. SKA58] Length = 634 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 94/382 (24%), Gaps = 37/382 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A + + ID + +++Q+Q+ +DAA + T P + Q Sbjct: 37 LMAAGLIPVIAALGAGIDAGRLYLVKSQLQAGVDAAA--LAGARAFAVTDGSPAARDKQA 94 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S F S ++ + + Sbjct: 95 SAYFYGNFASDYMGVSNLQ---------------LTPDFKTVGGINVTTITARAIVPMTF 139 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH-----NDNNNMT 175 + + + + + + +VLD + SM+ Sbjct: 140 MRIFGFQPRTMQAVAKA----ELQPRPLEVMVVLDDTGSMKANLSGGRTRMVALKEAAND 195 Query: 176 SNKYLLPPPPKKS-----FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 L + F + T + P A +K++ AG + Sbjct: 196 FVDILHQGASSRRDLAMGFIGYDVTVNVGHLLPGNAVKKVEGFNSFAGASAYGLSVPSHA 255 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 N G + + + L++ + L P E ++ Sbjct: 256 SGNYLAWKGCVMADTTVKDVDKNRLTSETGAW-DLVRSL-PGEGSHPPVEPFFVPPMYVP 313 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS--VAVS 348 +S+ + + I+ T+ N+ ++ + R + Y V Sbjct: 314 NAKSNSGA-DNPSSTYYGIYNTEPGNNLYRIDKSAFGEMGADYLARTELYRQYFYDYYVG 372 Query: 349 APPEGQDLLRKCTDS-SGQFFA 369 G ++ Sbjct: 373 LNNGDATKADDVVRRLDGSYYT 394 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 32/263 (12%), Positives = 76/263 (28%), Gaps = 40/263 (15%) Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 + + L S ++ P V + A VN + Sbjct: 374 NNGDATKADDVVRRLDGSYYTPSAANRAGNDWYIDWKRIPKYYDGAVWKDPASATVNPLG 433 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTP-----LSNNLNEVKSRLNKLNPYENTNTYPA 280 ++ + + + N P S L+ + ++ + P T + Sbjct: 434 GSVDSGSKNTTPMP--SPNWQCPEEAMPPRYGRMKSEYLSHIANQHGAIYPANGTIHHAG 491 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA-----------SAYQNTLNTL 329 + YR L + + + R ++ ++F+TDG N + Y ++ Sbjct: 492 LLWGYRLLVRDDVFTRANPTTERPRRALVFMTDGVNEIGESQNGYLDRTFTWYGRWSDSR 551 Query: 330 QICEYMRNA------------GMK-------IYSVAVSAPPEG-QDLLRKCTDSSGQFFA 369 + ++ IY +A+ A C G+ + Sbjct: 552 IAANQSNSETQMLRRFEKTCANIQREANPPEIYIIALVANSSAIDTAFNNCA--PGRVYR 609 Query: 370 VNDSRELLESFDKITDKIQEQSV 392 + + EL +F + ++ + + Sbjct: 610 TSSTDELRRAFQDVAAELVDLHL 632 >gi|313238340|emb|CBY13422.1| unnamed protein product [Oikopleura dioica] Length = 345 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 17/118 (14%), Positives = 42/118 (35%), Gaps = 16/118 (13%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TNT A+ A + L + + K ++ ITDG+ + Sbjct: 24 GKTNTGGALERAQQMLAEGRP---------SVPKIILLITDGD------ATDKERLDAQI 68 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 E ++ + + IY++ V ++ L + + D + + + ++ ++ Sbjct: 69 EKLKKSNILIYTIGVG-DLIDRNELNRIATDEDFVYETRDFDSISKIKSSLLGRVCKK 125 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 29/249 (11%), Positives = 76/249 (30%), Gaps = 31/249 (12%) Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK-IDVL 213 D D ++ + P + + + RK Sbjct: 100 DFVYETRDFDSISKIKSSLLGRVCKKAKPKTSGVCGDISVDLQFIVDSSSSVTRKNFGFA 159 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNP 271 N+ ++ + ++ V++G + Y+ + + L + ++ ++ + Sbjct: 160 KNFVANV-----SSVFDLRSGDVQVGVLTYSTNVHSDSAIGLGAIHSQDDFVEKVQSMKY 214 Query: 272 YEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+T A+ + S N + K +IF+TDG Q+ Sbjct: 215 TGGDTHTGTALRYI----------STNNRWREEVPKILIFVTDG------TPQDRAIVPA 258 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVN--DSRELLESFDKITDK 386 +R+ G++I+++ V + L++ + D + + Sbjct: 259 AARSLRDKGVRIFAIGVG--NAVESELKEIASEPYENHAIFIQGADYSAVQRVRGHLERL 316 Query: 387 IQEQSVRIA 395 + ++I Sbjct: 317 VCNDVLKIT 325 >gi|311268149|ref|XP_003131915.1| PREDICTED: integrin alpha-E-like, partial [Sus scrofa] Length = 1032 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 50/155 (32%), Gaps = 16/155 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++ R+ + + T T AM H ++ S Sbjct: 246 LVQYGSVIQTELDLQDSQDVAASLDRVQNITQVGSVTKTASAMQHVLDNIFTPSHGSKAK 305 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---APPEGQD 355 K ++ +TDG+ + + LN + + G++ +++ V + Sbjct: 306 AS-----KVMVVLTDGD-----IFDDPLNLTTVINSPKMQGVERFAIGVGGAFNKSNTYN 355 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L K+ +I Sbjct: 356 ELKLIASDPDEDHAFKVTNYMALDGLLSKLQQRII 390 >gi|296118874|ref|ZP_06837447.1| putative von Willebrand factor type A domain protein [Corynebacterium ammoniagenes DSM 20306] gi|295967972|gb|EFG81224.1| putative von Willebrand factor type A domain protein [Corynebacterium ammoniagenes DSM 20306] Length = 674 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 33/181 (18%), Positives = 63/181 (34%), Gaps = 25/181 (13%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-----N 258 ++D ++A LVNS+ + G+ A + G Q + + Sbjct: 73 VDGGTRLDAAKQAANQLVNSLPETAVMGMLAYGASGSNAPDNRERGCQDIDVLAPVERID 132 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 E+KS + L T A+ A EL +E + +I ++DG ++ Sbjct: 133 NEELKSEIGALEAQGYTPMGNALRAAADELGSEG------------DRSIILVSDGIDT- 179 Query: 319 ASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 ++ E + G I++V Q L ++ S G + ++ E Sbjct: 180 ----CAPPPACEVAEELAGDGFDLAIHTVGFKPDEAAQAELECISEASGGTYVEAENAEE 235 Query: 376 L 376 L Sbjct: 236 L 236 >gi|150389538|ref|YP_001319587.1| von Willebrand factor, type A [Alkaliphilus metalliredigens QYMF] gi|149949400|gb|ABR47928.1| von Willebrand factor, type A [Alkaliphilus metalliredigens QYMF] Length = 551 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 36/315 (11%), Positives = 95/315 (30%), Gaps = 32/315 (10%) Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 + D +I +++ + S + + + S Sbjct: 67 LTNLVKDKNVLEEIPVSQLTGDKSSEEVLSVTPESFIAKEAIDLKINQLDTSQFPKISLY 126 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 +++ + I D E+ + + L P + + S Sbjct: 127 FSALNAQGIPILNLTR-DSFEIYEERIINSDTKPVKSDTFLANLQQVPLSVSLILDNSGS 185 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + +A +N + + + ++ I +N + P + Sbjct: 186 MSGNP-------MTQAKSAAKQFLNYVDFSNGD------QVEIIEFNSDVYIR--IPYGS 230 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 ++ + + ++ + T Y A++ S K ++ TDGE + Sbjct: 231 DIKSLNTAIDTMESNSQTALYDALYTGL-----------VRAYSQSGPKCILAFTDGEEN 279 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + + L + I+ + V + + + L + G++F + EL Sbjct: 280 ASIRSVSEVTELSRATS-----IPIFIIGVGSLIDEESLKEIAEQTGGEYFYSPTAVELE 334 Query: 378 ESFDKITDKIQEQSV 392 + + + D+ +EQ V Sbjct: 335 QIYKTVYDQQKEQYV 349 >gi|311274909|ref|XP_003134506.1| PREDICTED: matrilin-4-like [Sus scrofa] Length = 721 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 55/162 (33%), Gaps = 15/162 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 205 DVGPNATRVGVIQYSSQVQSVFPLGAFSRREDMEGAIRALVPLAQGTMTGLAIQYAMNVA 264 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 265 FSVAEGARP--PEARVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 314 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 G LR F V + E + ++ Sbjct: 315 RADVGS--LRAMASPPLDEHVFLVESFDLIQEFGQQFQGRLC 354 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 58/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 500 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 554 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 555 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDVSVW---- 608 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V ++ LR+ D + + + Sbjct: 609 ----AARAKEEGIVMYAVGVGKAV--EEELREIASEPAELHVSYSPDFSTMTHLLENLKG 662 Query: 386 KIQ 388 I Sbjct: 663 SIC 665 >gi|302527162|ref|ZP_07279504.1| von Willebrand factor [Streptomyces sp. AA4] gi|302436057|gb|EFL07873.1| von Willebrand factor [Streptomyces sp. AA4] Length = 326 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 18/150 (12%), Positives = 49/150 (32%), Gaps = 16/150 (10%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + V ++ L ++T T ++ A + + + ++ ++DG+ Sbjct: 147 TTDRAGVTKAIDNLKLAQSTATGEGIYAAMQSIQSFSAVVGGA--DGPPPARIVLMSDGK 204 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP--------------EGQDLLRKCT 361 + + + A M I S++ + + L Sbjct: 205 QTVPEDLYAPRGAYTAAQAAKQAQMPISSISFGTEHGSVDIEGKQQDVRVDDESLREIAR 264 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQS 391 S G+F+ + EL + + ++I + Sbjct: 265 LSGGEFYKAASADELKRVYADLGEQIGYEL 294 >gi|260823577|ref|XP_002606157.1| hypothetical protein BRAFLDRAFT_92024 [Branchiostoma floridae] gi|229291496|gb|EEN62167.1| hypothetical protein BRAFLDRAFT_92024 [Branchiostoma floridae] Length = 550 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 68/229 (29%), Gaps = 20/229 (8%) Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR------KIDVLIESAGNLVNSI 224 K KS T + A A + + ++ L Sbjct: 174 PAQAAFWKNYCNSALDKSPNDYVTAVEEVAGCQATSLDLVFALDGSASVGDNNFQLSKDF 233 Query: 225 QKAIQ---EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYP 279 I R+G + Y+ + + + + ++ E T+T Sbjct: 234 INTIIQSFTIGPDLTRVGVVQYSFSVNLELELKDHLSAAPLIQAVEEIEYDEGLFTHTGE 293 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A+ + + + + + K I +TDGE+ + + + AG Sbjct: 294 ALQYITTHSFAVENGAREVR--KGVPKVSILLTDGESGDYH------DVTVWSQRAQEAG 345 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + ++ V V Q L +G+ + V R+L + + + + Sbjct: 346 ITVFPVGV-TNNVDQQELETMAGHAGKVYRVGWFRDLEKILTDMEESLC 393 >gi|134093103|gb|ABO52963.1| matrilin 4 isoform 1 precursor [Lemur catta] Length = 583 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 67 NVGPNATRVGVIQYSSQVQSVFPLGAFSRPEDMERAIRALVPLAQGTMTGLAIQYAMNVA 126 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 127 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 176 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 177 RADVGS--LRAMASHPLDEHVFLVESFDLIQEF 207 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 61/191 (31%), Gaps = 27/191 (14%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 362 NFELVKRFVNQIVDFL-----DVSPDGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 416 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 417 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 470 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V ++ LR+ D F +T Sbjct: 471 ----AARAKEEGIVMYAVGVGKAV--EEELREIASEPAELHVSYSPD-------FSTMTH 517 Query: 386 KIQEQSVRIAP 396 ++ I P Sbjct: 518 LLENLRGSICP 528 >gi|149437045|ref|XP_001515975.1| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 [Ornithorhynchus anatinus] Length = 949 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 41/281 (14%), Positives = 91/281 (32%), Gaps = 21/281 (7%) Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 +P + + A + + S + + DV+R +Q N Sbjct: 231 VPPPSTVINRNETFARITFNPSVVQQTKISQNGILGDFIIRYDVNREQSIGDIQVLNGYF 290 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 LPP PK + +++ S L ++ L + E Sbjct: 291 VHYFAPKDLPPLPKNVVFVLDSSASMVG----------AKLKQTKEALFTILHDLRPEDN 340 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 V + TP N++ + K ++ ++P TN A+ R L + Sbjct: 341 FNIVGFSSRIKVWKDQLVPVTP--NSIRDGKVYIHHMSPSGGTNINGALQTGIRLLND-- 396 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 +HN I + + ++F+TDG + + + + ++++ + + Sbjct: 397 FVAHNDIDARSV-SLIVFLTDGRPTVGEIQTPKILNNT--KEAARDRVCLFTIGIGDDVD 453 Query: 353 GQDL----LRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + L L C + + + +L +D+I + Sbjct: 454 FKLLEKLSLENCGMTRRFQVEADAAAQLKGFYDEIGTPLLS 494 >gi|119889916|ref|XP_001252289.1| PREDICTED: Epithelial chloride channel protein-like [Bos taurus] gi|297473018|ref|XP_002686328.1| PREDICTED: Epithelial chloride channel protein-like [Bos taurus] gi|296489229|gb|DAA31342.1| Epithelial chloride channel protein-like [Bos taurus] Length = 903 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 55/133 (41%), Gaps = 24/133 (18%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ + ++ + ++S+ +I +TDGE++ + Sbjct: 380 EANGGTSICRGLKAGFQAIIQSQQST--------SGSEIILLTDGEDNEIHSC------- 424 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN-DSRELLESFDKITDK- 386 E ++ +G+ I+++A+ P L +D + G F N D L +F +I+ + Sbjct: 425 --IEEVKQSGVIIHTIALG--PSAAKELETLSDMTGGHRFYANKDINGLTNAFSRISSRS 480 Query: 387 --IQEQSVRIAPN 397 I +Q++++ Sbjct: 481 GNITQQTIQLESK 493 >gi|224004848|ref|XP_002296075.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|209586107|gb|ACI64792.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 868 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 50/142 (35%), Gaps = 16/142 (11%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N + +++L+ TN A+ A + + E + + V +TDG + Sbjct: 196 NKQQALHAIDRLSVKGRTNIASAVSLAAQVVNGVAEPNKV--------RSVFLLTDGNAN 247 Query: 318 GASA-YQNTLNTLQICEYMRNAG----MKIYSVAVSAPPEGQDLLR--KCTDSSGQFFAV 370 + + I + +++ P Q LLR S G F++V Sbjct: 248 TGYTEAIDLVKLTSIFVEANRNPHTPPISLHTFGYGPEP-DQKLLRGMAMATSGGSFYSV 306 Query: 371 NDSRELLESFDKITDKIQEQSV 392 D+ ++ +F I ++ Sbjct: 307 RDNSQVSSAFGDAIGGILSLAL 328 >gi|119899150|ref|YP_934363.1| hypothetical protein azo2860 [Azoarcus sp. BH72] gi|119671563|emb|CAL95476.1| conserved hypothetical membrane protein [Azoarcus sp. BH72] Length = 343 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 20/179 (11%), Positives = 50/179 (27%), Gaps = 43/179 (24%) Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK-------------- 292 + N ++V + ++++ T M + L E Sbjct: 136 ATAALIQAPTRNHDDVLAAIDRVQLQRGTAIGSGMVLSLATLLPEAGIDLRLLAGDGSPP 195 Query: 293 --------ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + + ++ +TDGE + + G+++Y+ Sbjct: 196 ADKPPPGEPTHAPVPPGSHAYGAIVLLTDGERTTGPPLDF------ATRLAADHGVRVYT 249 Query: 345 VAVSAPPE--------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ 388 V V + L+ D + G++F + L + K+ ++ Sbjct: 250 VGVGTAEGGVVGYEGWSMRVRLDEAALKSIADETRGEYFHAQSAEALRTIYRKLGTRLT 308 >gi|167759260|ref|ZP_02431387.1| hypothetical protein CLOSCI_01607 [Clostridium scindens ATCC 35704] gi|167663134|gb|EDS07264.1| hypothetical protein CLOSCI_01607 [Clostridium scindens ATCC 35704] Length = 800 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 34/368 (9%), Positives = 86/368 (23%), Gaps = 27/368 (7%) Query: 48 RTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES 107 + + + + + + +I N Sbjct: 40 ENQSNKIPDTVKAEEVSAETVPGETSGTESSTDGTDEILNTEGSPENGSGNGKTADTGAV 99 Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 + ++ + + Sbjct: 100 S---DAEVTPEKEAATEGITEQDVPEAENVGAGNPVAVHLSRGAASEAAPEHQKYIKKNA 156 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWS-------KNTTKSKYAPAPAPANRKIDVLIESAGNL 220 ND + K + K + K ++DVL + Sbjct: 157 ENDYTLTLNVKGMYDSETTKPMIDVLLIVDKSGSMNWKMDTDKVGKPSRMDVLKQVVTG- 215 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIG--------IVGNQCTPLSNNLNEVKSRLNKLNPY 272 + +I + ++ + Y+ + + V + +N + Sbjct: 216 TGGLTDSIFGNTQIDAQMAVVTYSGSNDFLDQRYDDAEIIQEWTKQKDTVNNAVNNIQAK 275 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIG-STRLKKFVIFITDGEN-SGASAYQNTLNTLQ 330 TN + L +E++ + + + DG S T Sbjct: 276 GGTNCEAGLRTGATALEGSRENAKKFVIFLSDGDATFYYGDDGYTKGPGSGSSPTAREKA 335 Query: 331 ICEYMRNAGMK-IYSVAVSAPPEGQDLLRKCT----DSSGQFFAVNDSRELLESFDKITD 385 I + + G++ Y++ + L S +F+ N++ L ++F +I Sbjct: 336 IAQVQKITGLEGFYTIG-MTSSSSSEFLTNLANNSKASEKRFYPANNTEALEKAFQEIVG 394 Query: 386 KIQEQSVR 393 + E R Sbjct: 395 ETTEFICR 402 >gi|260813586|ref|XP_002601498.1| hypothetical protein BRAFLDRAFT_146514 [Branchiostoma floridae] gi|229286795|gb|EEN57510.1| hypothetical protein BRAFLDRAFT_146514 [Branchiostoma floridae] Length = 384 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 54/164 (32%), Gaps = 16/164 (9%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAY 285 + + ++G I Y+ + S ++++ T T A+ + Sbjct: 36 SGFDISPSGTQVGVIQYSTRTRQEFSMNSFLTKETLSSAIDEVQYMRGGTLTGKAIRYVT 95 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + + + K VI +TDG + + G+ +Y++ Sbjct: 96 KYGFGKSDGAR-----PGVPKVVIVVTDGVSYD--------AVAAPALEAQQKGITVYAI 142 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 V D L + ++ V++ L + + + + Sbjct: 143 GV--SGYDADQLEQIASNNNTLAFVDNFNLLDNLRNTLLTGVCD 184 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 31/307 (10%), Positives = 80/307 (26%), Gaps = 39/307 (12%) Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 ++ G ++ I +A + + ++ G+ L + Sbjct: 97 YGFGKSDGARPGVPKVVIVVTDGVSYDAVAAPALEAQQKGITVYAIGVSGYDADQLEQIA 156 Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS------- 188 ++ + +LD R+ + + + Sbjct: 157 ----SNNNTLAFVDNFNLLDNLRNTLLTGVCDAKSVYSRSCLSICCSHNETNICHRGTDR 212 Query: 189 ---------FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK---AIQEKKNLSV 236 +S +T + + S V K + + Sbjct: 213 EFGINNCGRKFSCFSTLCRNPLDIIFLLDGSGSVGASNFEKVKQFTKKTISGFDISPSGT 272 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 ++G I Y+ + S ++++ T T A+ + + + + + + Sbjct: 273 QVGVIQYSTRTRQEFSMNSFLTKETLSSAIDEVQYMRGGTLTGKAIRYVTKYGFGKSDGA 332 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 + K VI +TDG + + G+ +Y++ V D Sbjct: 333 R-----PGVPKVVIVVTDGVSYD--------AVAAPALEAQQKGITVYAIGV--SGYDAD 377 Query: 356 LLRKCTD 362 L + Sbjct: 378 QLEQIAS 384 >gi|145527514|ref|XP_001449557.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124417145|emb|CAK82160.1| unnamed protein product [Paramecium tetraurelia] Length = 606 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 18/138 (13%), Positives = 45/138 (32%), Gaps = 26/138 (18%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N K + ++ +TN A++++ K + T + ++DG++ Sbjct: 245 NKKYFKGAIAAISAGGSTNIAAGTDIAFQQIQQRKMKNQVTS--------IFLLSDGQD- 295 Query: 318 GASAYQNTLNTLQICEYMRNAG------MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 E ++ + I+S + + + C G F+ + Sbjct: 296 -----------SGAAERIQKQKDRISDVVTIHSFGYGNDHDADLMSKICKVGQGSFYYIE 344 Query: 372 DSRELLESFDKITDKIQE 389 + + L E F ++ Sbjct: 345 NVKLLDEFFADALGRLSS 362 >gi|254453558|ref|ZP_05066995.1| von Willebrand factor, type A [Octadecabacter antarcticus 238] gi|198267964|gb|EDY92234.1| von Willebrand factor, type A [Octadecabacter antarcticus 238] Length = 676 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 43/361 (11%), Positives = 96/361 (26%), Gaps = 32/361 (8%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 AA LS + ++ + + S + ++ + + + IT Sbjct: 175 AAPLSLTRAAPTNDLVAEGMALGQVVSAP-----EPNVAGIGELDTESFANDTPNPLKIT 229 Query: 95 KDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 ++ I A Y + +L L P+ + + Sbjct: 230 TEEPVSTLSIDVDTAAYALIRSSLTAGQLPPADAVRIEEMINYFPYAYPAPDGQPFQPTI 289 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 +V + + Q + P F + K+ +L Sbjct: 290 NVFETPWNADTQLVHIGLQGEMPSIQDRPALNLVFLIDT-------SGSMESADKLPLLR 342 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 +S +++++ + + + Y ++ + + LN LN + Sbjct: 343 QSFRLMLDNLAPEDE--------VAIVTYAGSTSIALQPTQASERTTILAALNALNAGGS 394 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 TN + AY K + VI TDG+ + + E Sbjct: 395 TNGQGGLEQAYALAETMKTDGDVSR--------VILATDGDFNVGL--SDPRGLQAYIED 444 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 R+ G + + ++ + L E+ + D + I Sbjct: 445 KRDTGTYLSVLGFGRGNLQDATMQSLAQNGNG--TAVYIDTLSEAQKVLVDNLTGALFPI 502 Query: 395 A 395 A Sbjct: 503 A 503 >gi|168698099|ref|ZP_02730376.1| von Willebrand factor type A domain protein [Gemmata obscuriglobus UQM 2246] Length = 311 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 26/208 (12%), Positives = 57/208 (27%), Gaps = 44/208 (21%) Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 S P + D +E+ S ++ Sbjct: 89 VDVSGSMNNPFGRATRYDGAMEAVTAFT-SYRQGDAFGLTFFGNEVLHWC---------- 137 Query: 254 PLSNNLNEVKSRLNKLNPY------ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 PL+ +++ + + P T A+ EL E + Sbjct: 138 PLTTDVSAINCATPFMRPGQLPPWFGGTLIAKALRACKAELIKRPEG----------DRM 187 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC------- 360 ++ ITDG++ ++ E ++ G+ +++V + + Sbjct: 188 IVLITDGDSQ----DFANGADAEVAEELKAEGITVFAVVIG---NDRQFQNPIIRNGSVQ 240 Query: 361 ---TDSSGQFFAVNDSRELLESFDKITD 385 + G+ F D L F +I + Sbjct: 241 TVTARTGGESFEAGDPNALATVFKRIDE 268 >gi|307260995|ref|ZP_07542677.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] gi|306869297|gb|EFN01092.1| Flp pilus assembly protein [Actinobacillus pleuropneumoniae serovar 12 str. 1096] Length = 539 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 58/494 (11%), Positives = 129/494 (26%), Gaps = 128/494 (25%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVS---------------DRTIKDPTTKKD 58 +++ A I+ + ++ +L+ AVLS A S + + + Sbjct: 36 FVSLESAGIIQDQARLSDSLEQAVLSLTAENNSGRKDNDYKLSGSNKENDSFDISSEVGK 95 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + S + K +K L Q + + IN K + + ++ Sbjct: 96 RDSQMVTKFVKAFLPQTDEKNMHLTPL--CKTINNNSGKGHTSSSEVTCTVSGTVEHKSW 153 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENL---AISICMVLDVSRSMEDLYLQKHNDNNN-- 173 F + + + + +N I + +V D+S SM+D + + Sbjct: 154 FPLKVGNLEVIPKQVNVASKSKALKKNTFNIPIDLMVVADLSGSMKDGIKGEKLEGGTNS 213 Query: 174 ---------MTSNKYLLPPPPKKSFWSKNTTK---------------------------- 196 L + T Sbjct: 214 KIYILREVLKELADKSLFTQESNEYNRIGITAFAMGAEHPKENKCVLPFVLQDNLHGMSK 273 Query: 197 -----SKYAPAPAPANRKIDVLIESAGNLV--NSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + ++ + + L ++ Q +++ +I Sbjct: 274 IKIKQYLASRYNRTTLKRTEFVDNFVALLDIEKTLNSIGQTNYDITFPKSSICLEGLKNA 333 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK----------------- 292 +Q ++ + R+N L T + A ++ +EK Sbjct: 334 SQFWYTKTENDKFRDRVNSLKANGGTLASSGLLTASNQMLSEKSRSEELNQETKRVILVL 393 Query: 293 ----------------ESSHNTIGSTRLKKFVIF--------------------ITDGEN 316 +S +R+ K +IF T N Sbjct: 394 SDGNDDMSNLNLADLERNSIPFTNFSRITKNLIFGRKEDLSSTPKTNKAYNTRSSTYQYN 453 Query: 317 SGASAYQNTLNTLQICEYM--------RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 + + + +C + ++ K+ V E D + C + G +F Sbjct: 454 TYLTDKTKNIIGKGMCSIIQEKLNALNKDENTKLVFVEFGYRSESADAWKTCVGN-GNYF 512 Query: 369 AVNDSRELLESFDK 382 ++ LL SF + Sbjct: 513 YADNRESLLNSFKQ 526 >gi|115438797|ref|NP_001043678.1| Os01g0640200 [Oryza sativa Japonica Group] gi|20805117|dbj|BAB92788.1| zinc finger (C3HC4-type RING finger)-like protein [Oryza sativa Japonica Group] gi|113533209|dbj|BAF05592.1| Os01g0640200 [Oryza sativa Japonica Group] gi|125527021|gb|EAY75135.1| hypothetical protein OsI_03030 [Oryza sativa Indica Group] gi|125571342|gb|EAZ12857.1| hypothetical protein OsJ_02777 [Oryza sativa Japonica Group] Length = 589 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 26/198 (13%), Positives = 61/198 (30%), Gaps = 33/198 (16%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSR 265 +ID + + ++ + R+ + + + E+K+ Sbjct: 83 DRIDKVKTALQFVIRKLSDLD--------RLCIVTFCTNATRLCPLRFVTAAAQAELKAL 134 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++ L Y +TN + + ++ + V+ ++DG + ++ Sbjct: 135 VDGLKAYGDTNMKGGLETGMSVVDGRSLAAGRAVS-------VMLMSDGYQNHGGDARDV 187 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSRELLESFDKI 383 + +Y+ + A + LL G F V DS L F ++ Sbjct: 188 HL----------KNVPVYTFSFGASHDSN-LLEAIARKSLGGTFNYVADSANLTGPFSQL 236 Query: 384 TD---KIQEQSVRIAPNR 398 I Q + + R Sbjct: 237 LGGLLTIIAQDLELTVTR 254 >gi|294055720|ref|YP_003549378.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293615053|gb|ADE55208.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 326 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 19/100 (19%), Positives = 43/100 (43%), Gaps = 13/100 (13%) Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRK 359 + +I +TDG ++G+ + + + +K++ V V P ++ L Sbjct: 200 RVIIALTDGNDTGSRVP-----PAEAAKIANDNTVKVHVVGVGDPTTTGEDVLDEEALNA 254 Query: 360 CTDS-SGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + G++F N+ EL + + ++ D+I + V R Sbjct: 255 VASTTGGRYFHANNREELEDIYTEL-DRIDSREVETESVR 293 >gi|254802534|sp|Q0PMD2|ANTR1_RAT RecName: Full=Anthrax toxin receptor 1; Flags: Precursor Length = 562 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 63/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + +R+ I ++ Sbjct: 35 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHRFISPQLRMSFIVFSTRGT--TLMK 92 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 93 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRT------ASVIIAL 146 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 147 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 200 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 201 DGFQALQGIIHSILKK 216 >gi|113205504|ref|NP_001037714.1| anthrax toxin receptor 1 [Rattus norvegicus] gi|111052876|gb|ABH03702.1| anthrax toxin receptor [Rattus norvegicus] gi|124297147|gb|AAI31854.1| Anthrax toxin receptor 1 [Rattus norvegicus] Length = 576 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 63/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + +R+ I ++ Sbjct: 49 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHRFISPQLRMSFIVFSTRGT--TLMK 106 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 107 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRT------ASVIIAL 160 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 161 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 214 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 215 DGFQALQGIIHSILKK 230 >gi|297473450|ref|XP_002686618.1| PREDICTED: collagen, type VI, alpha 3-like isoform 3 [Bos taurus] gi|296488813|gb|DAA30926.1| collagen, type VI, alpha 3-like isoform 3 [Bos taurus] Length = 2962 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 64/185 (34%), Gaps = 20/185 (10%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + + +++ N R + +N + EV S ++ ++ +N Sbjct: 57 VREFLYDVIESLAVGDNDF-RFALVQFNGNPHTEFLFNTYRSKQEVLSHVSNMSYIGGSN 115 Query: 277 -TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T + + + E + + S + + ++ +T G + A + + Sbjct: 116 QTGKGLAYVMQNHLTE---AAGSRASDGVPQVIVVLTHGHSEDGLALPS--------AEL 164 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++A + ++++ V + L++ F + + L + + ++ Sbjct: 165 KSADVNVFAIGV--EDADEAALKEIASEPLNMHVFNLENYTSLHDIVGNLVACVRSS--- 219 Query: 394 IAPNR 398 +AP R Sbjct: 220 MAPER 224 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKN--LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN ++ +N Sbjct: 1447 INFRRDSFQEVLRFVSEIVDTVYEGGDSIQVGLVQYNSDPTDEFFLKDFPTKQQIIDAIN 1506 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H R N + R+ + IT G+ + Sbjct: 1507 KVVYKGGRHANTKVGLEHLRR---NHFVPEAGSRLDQRVPQIAFVITGGK--------SV 1555 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1556 EDAQEASMALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1606 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 18/156 (11%), Positives = 50/156 (32%), Gaps = 15/156 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 +R+G + Y+ + +V + L N A+ Sbjct: 272 SIGTQQIRVGVVQYSDEPRTMFSLNSYSTKAQVLDAVKALGFIGGELANVGLALDFVVEN 331 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + + + + ++ I+ G +S ++ + L+ ++S + Sbjct: 332 HFT---RAGGSRAEEGVPQVLVLISAGPSSD--EIRDGVIALKQAS--------VFSFGL 378 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 A + L+ + F V + R L + +++ Sbjct: 379 GAQAASKAELQHIATNDNLVFTVPEFRSLGDVQEQL 414 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 58/169 (34%), Gaps = 15/169 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRE 287 VRIG + ++ + + V + +L + NT A+ R Sbjct: 1266 NVGPNKVRIGVLQFSNDVFPEFQLKTYKSQASVLDAIRRLRFKGGSPLNTGKALEFVARN 1325 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + S + + + ++ G++ + + + +++AG+ S+ V Sbjct: 1326 YF---VKSAGSRIEDGVPQHLVLFLGGKSQDDISRYS--------QVIKSAGI--ASLGV 1372 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + L+ T F V + R+L +++ + V AP Sbjct: 1373 GDRNIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNSFGSSGVTPAP 1421 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 20/161 (12%), Positives = 54/161 (33%), Gaps = 16/161 (9%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + + + + + E + +++ Y + + EV + + K+ + Sbjct: 461 NAIRDFIAKVIQ-RLEIRQDLIQVAVAQYADTVRPEFYFNTYPSKREVINAVRKMKALDG 519 Query: 275 TN--TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 + T A+ L+ E + + + K ++ +T G+ + Q Sbjct: 520 SALYTGSALDFVRNNLFTE---AAGYRAAEGVPKLLVLVTGGK--------SLDAVSQPA 568 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 + ++ +G+ + AV Q L + S F + Sbjct: 569 QELKRSGIL--AFAVGNKVADQAELEEIAFDSSLVFTATEF 607 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 34/290 (11%), Positives = 80/290 (27%), Gaps = 31/290 (10%) Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 +S + + P NL G++P L + + I + L ++ Sbjct: 747 VAGKSSDRVDTPALNLKQSGVVPFILQAKNADPAELELIVPSPAFILVAESLPKIGDLQP 806 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + + P + S+ + P +L E +V Sbjct: 807 QIVNLL-----KSVQNGAPAPVSVEKDVVFLIDGSEGVRSGFP------LLKEFVQRVVE 855 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPA 280 S+ + VR+ + Y+ + V + L NT A Sbjct: 856 SL-----DVGPDRVRVAVVQYSDRTRPEFYLNSYMDQQSVVGAIRGLTLLGGPAPNTGAA 910 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + R + S + + +I +T A ++ + +R G Sbjct: 911 LEFVLRNILVGSAGSRIA---EGVPQLLIVLT--------ADRSGDDVRGPSVVLRRGGA 959 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ + + R+L I++++ + Sbjct: 960 V--PIGIGIGNADITEMQTLSFVPDFAVVIPTFRQLGTIQQVISERVTQL 1007 >gi|297473446|ref|XP_002686616.1| PREDICTED: collagen, type VI, alpha 3-like isoform 1 [Bos taurus] gi|296488811|gb|DAA30924.1| collagen, type VI, alpha 3-like isoform 1 [Bos taurus] Length = 3162 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 64/185 (34%), Gaps = 20/185 (10%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + + +++ N R + +N + EV S ++ ++ +N Sbjct: 57 VREFLYDVIESLAVGDNDF-RFALVQFNGNPHTEFLFNTYRSKQEVLSHVSNMSYIGGSN 115 Query: 277 -TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T + + + E + + S + + ++ +T G + A + + Sbjct: 116 QTGKGLAYVMQNHLTE---AAGSRASDGVPQVIVVLTHGHSEDGLALPS--------AEL 164 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++A + ++++ V + L++ F + + L + + ++ Sbjct: 165 KSADVNVFAIGV--EDADEAALKEIASEPLNMHVFNLENYTSLHDIVGNLVACVRSS--- 219 Query: 394 IAPNR 398 +AP R Sbjct: 220 MAPER 224 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 60/177 (33%), Gaps = 16/177 (9%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NLVNS+ + + +R+G + ++ V +E Sbjct: 646 SSNVGETNFPYVRDFVMNLVNSL-----DVGSDHIRVGLVQFSDTPVTEFSLNTYPTKSE 700 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + L ++ + +Y N + + + + ++ +T G+ Sbjct: 701 LLAHLRQMQLQGGSVLNTGAALSY-VHANHFTEAGGSRIQDHVPQLLLLLTAGQ------ 753 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + LQ + AG+ + V + L + + G + ++D L Sbjct: 754 --SEDSYLQAANALARAGILTFCVG--TSQADRAELEEIAFNPGLVYLMDDFSSLPA 806 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKN--LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN ++ +N Sbjct: 1647 INFRRDSFQEVLRFVSEIVDTVYEGGDSIQVGLVQYNSDPTDEFFLKDFPTKQQIIDAIN 1706 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H R N + R+ + IT G+ + Sbjct: 1707 KVVYKGGRHANTKVGLEHLRR---NHFVPEAGSRLDQRVPQIAFVITGGK--------SV 1755 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1756 EDAQEASMALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1806 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 18/156 (11%), Positives = 50/156 (32%), Gaps = 15/156 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 +R+G + Y+ + +V + L N A+ Sbjct: 272 SIGTQQIRVGVVQYSDEPRTMFSLNSYSTKAQVLDAVKALGFIGGELANVGLALDFVVEN 331 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + + + + ++ I+ G +S ++ + L+ ++S + Sbjct: 332 HFT---RAGGSRAEEGVPQVLVLISAGPSSD--EIRDGVIALKQAS--------VFSFGL 378 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 A + L+ + F V + R L + +++ Sbjct: 379 GAQAASKAELQHIATNDNLVFTVPEFRSLGDVQEQL 414 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 58/169 (34%), Gaps = 15/169 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRE 287 VRIG + ++ + + V + +L + NT A+ R Sbjct: 1466 NVGPNKVRIGVLQFSNDVFPEFQLKTYKSQASVLDAIRRLRFKGGSPLNTGKALEFVARN 1525 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + S + + + ++ G++ + + + +++AG+ S+ V Sbjct: 1526 YF---VKSAGSRIEDGVPQHLVLFLGGKSQDDISRYS--------QVIKSAGI--ASLGV 1572 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + L+ T F V + R+L +++ + V AP Sbjct: 1573 GDRNIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNSFGSSGVTPAP 1621 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 20/161 (12%), Positives = 54/161 (33%), Gaps = 16/161 (9%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + + + + + E + +++ Y + + EV + + K+ + Sbjct: 461 NAIRDFIAKVIQ-RLEIRQDLIQVAVAQYADTVRPEFYFNTYPSKREVINAVRKMKALDG 519 Query: 275 TN--TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 + T A+ L+ E + + + K ++ +T G+ + Q Sbjct: 520 SALYTGSALDFVRNNLFTE---AAGYRAAEGVPKLLVLVTGGK--------SLDAVSQPA 568 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 + ++ +G+ + AV Q L + S F + Sbjct: 569 QELKRSGIL--AFAVGNKVADQAELEEIAFDSSLVFTATEF 607 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 34/290 (11%), Positives = 80/290 (27%), Gaps = 31/290 (10%) Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 +S + + P NL G++P L + + I + L ++ Sbjct: 947 VAGKSSDRVDTPALNLKQSGVVPFILQAKNADPAELELIVPSPAFILVAESLPKIGDLQP 1006 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + + P + S+ + P +L E +V Sbjct: 1007 QIVNLL-----KSVQNGAPAPVSVEKDVVFLIDGSEGVRSGFP------LLKEFVQRVVE 1055 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPA 280 S+ + VR+ + Y+ + V + L NT A Sbjct: 1056 SL-----DVGPDRVRVAVVQYSDRTRPEFYLNSYMDQQSVVGAIRGLTLLGGPAPNTGAA 1110 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + R + S + + +I +T A ++ + +R G Sbjct: 1111 LEFVLRNILVGSAGSRIA---EGVPQLLIVLT--------ADRSGDDVRGPSVVLRRGGA 1159 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ + + R+L I++++ + Sbjct: 1160 V--PIGIGIGNADITEMQTLSFVPDFAVVIPTFRQLGTIQQVISERVTQL 1207 >gi|32189436|ref|NP_473382.1| anthrax toxin receptor 1 precursor [Mus musculus] gi|17366052|sp|Q9CZ52|ANTR1_MOUSE RecName: Full=Anthrax toxin receptor 1; AltName: Full=Tumor endothelial marker 8; Flags: Precursor gi|15987505|gb|AAL11999.1|AF378762_1 tumor endothelial marker 8 precursor [Mus musculus] gi|63100398|gb|AAH94544.1| Anthrax toxin receptor 1 [Mus musculus] Length = 562 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 63/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + +R+ I ++ Sbjct: 35 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHRFISPQLRMSFIVFSTRGT--TLMK 92 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 93 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRT------ASVIIAL 146 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 147 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 200 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 201 DGFQALQGIIHSILKK 216 >gi|116625432|ref|YP_827588.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228594|gb|ABJ87303.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 307 Score = 63.4 bits (152), Expect = 6e-08, Method: Composition-based stats. Identities = 31/278 (11%), Positives = 96/278 (34%), Gaps = 36/278 (12%) Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 + A+T + T + N+ + +DV + + + +K Sbjct: 14 FLGVAVTAQEVTITPRPRVQAVNMPRANLR-MDVQMVQIPVTVTDLRGKPLVDLSKTNFR 72 Query: 183 PPPK------KSFWSKNTTKSKYAP--APAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 +F++ +T S + ++ +S + + + Sbjct: 73 VFEDEVEKPISAFFTADTPLSTGVVFDSSRSMKNRLQDARQSVEQFLRTGSTGDE----- 127 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 I ++ P + + E+ +L + T ++ A + Sbjct: 128 ---YFLIRFSDEAKM--LAPFTADTEEIARQLGSIEAKGWTALNDSIVLAANQ------- 175 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + +K ++ I+DG ++ + ++ +R A +++Y+V++ Q Sbjct: 176 ---SRKARNHRKALLVISDGGDNNSRYT-----VGEMISILREADLRVYAVSIF--ERSQ 225 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 L + C ++ G+ V +L + ++++ +++ + + Sbjct: 226 LLEKICEETGGRALWVRKLGDLPDIMERLSQEMRSEYI 263 >gi|254304217|ref|ZP_04971575.1| von Willebrand factor domain protein [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] gi|148324409|gb|EDK89659.1| von Willebrand factor domain protein [Fusobacterium nucleatum subsp. polymorphum ATCC 10953] Length = 529 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 57/145 (39%), Gaps = 18/145 (12%) Query: 245 IGIVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 P+ + N+ ++ L + P T+ ++ + +L + + Sbjct: 238 SCGANELIYPIGDLNVEGIEKALEPIQPTGWTSIAKSIEYGVEDLK--------ALDGEK 289 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMR--NAGMKIYSVAVSAPPEGQDLLRKCT 361 + ITDG + N ++I + ++ N + + + + LL++ Sbjct: 290 TLNILYIITDGIETCGG------NPVEIAKQLKGENTNIVLGIIGFNVDANQNRLLKQIA 343 Query: 362 DSSGQFF-AVNDSRELLESFDKITD 385 D++G ++ +VND+ +L +I + Sbjct: 344 DAAGGYYSSVNDADKLTGELYRINE 368 >gi|114652511|ref|XP_509886.2| PREDICTED: coagulation factor C homolog, cochlin isoform 7 [Pan troglodytes] Length = 540 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 48/160 (30%), Gaps = 20/160 (12%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYN 290 ++ +I + + + V + + + T T A+ R ++ Sbjct: 390 SDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFG 449 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 S N F++ +TDG++ + +AG+ I+SV V+ Sbjct: 450 PIRESPNK-------NFLVIVTDGQSYD--------DVQGPAAAAHDAGITIFSVGVAWA 494 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 P L+ F + L + I Sbjct: 495 PLDD--LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 532 >gi|73967405|ref|XP_537778.2| PREDICTED: similar to integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Canis familiaris] Length = 1178 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 51/156 (32%), Gaps = 16/156 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++ +++ + N T T AM H ++ S Sbjct: 247 LVQYGEVIQTEFDLRDSQDVMTSLAKVQNITQVGNVTKTASAMQHVLDNIFTPNHGSR-- 304 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD--- 355 K ++ +TDG+ + + LN + + G++ +++ V E Sbjct: 305 ---KNASKVMVVLTDGD-----IFGDPLNLTTVINSPKMQGVERFAIGVGNAFEKNKTYH 356 Query: 356 LLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 L+ F V + L K+ I + Sbjct: 357 ELKLIASDPDDRYAFKVTNYTALDGLLSKLQQTIIQ 392 >gi|254420639|ref|ZP_05034363.1| von Willebrand factor type A domain protein [Brevundimonas sp. BAL3] gi|196186816|gb|EDX81792.1| von Willebrand factor type A domain protein [Brevundimonas sp. BAL3] Length = 613 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 21/328 (6%), Positives = 68/328 (20%), Gaps = 21/328 (6%) Query: 53 PTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES-KAQY 111 + + + + + A Y Sbjct: 108 AAAPGQAPLNAVVVTGSRIMPGAPAPSDTETYPDATPNPVKRTADQPVSTFSIDVDTAAY 167 Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 + P+ + + +++ + + + Sbjct: 168 SNVRRFIDEGRSPPADAVRVEELINAFDYGYARPTSLARPFAITTAVVASPWAPRTERGG 227 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + P+ N T + K+D+ ++ ++ ++ Sbjct: 228 RQIVHIGLQGYELPQGEQRPLNLTFLVDVSGSMRSPDKLDLAKQAMNLAIDRLRPQDT-- 285 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 + Y G + +++ + L T M +AY + Sbjct: 286 ------LSVTYYAEGAGTTLQPTPGDQKLKMRCAVASLRASGGTAGATGMTNAYDQAQAS 339 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 ++ TDG+ + + R G+ + Sbjct: 340 FARDKVNR--------ILMFTDGDFNVGV--TDNKRLEDYVAEKRGTGVYLSVYGFGRGN 389 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLES 379 ++ + +L ++ Sbjct: 390 YQDARMQTIAQAGNGV--AAYVGDLRDA 415 >gi|326675264|ref|XP_002665076.2| PREDICTED: collagen alpha-1(XXVIII) chain-like [Danio rerio] Length = 1046 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 64/190 (33%), Gaps = 26/190 (13%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + +V+ + L++ R+G + Y+ + + Sbjct: 766 SSESVGPENFEVIKDFVNTLIDR-----TSVSPEVTRVGIVLYSNINLLVTNIQDRLTRD 820 Query: 261 EVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 EVK+ + ++ T T + A + ++K + ITDG+ Sbjct: 821 EVKAAVRRMPYIGEGTYTGSGIRKANEMFAFARPG---------VRKVAMVITDGQTD-- 869 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAV-----SAPPEGQDLLRKCTD--SSGQFFAVND 372 +++T+ + ++++ V + + L+ + F+V D Sbjct: 870 --HRDTVKLEDAVREAHLDNITMFAIGVVNQSDPIYDDFKQELKSIASPPTEEHMFSVED 927 Query: 373 SRELLESFDK 382 R L +F++ Sbjct: 928 FRMLHGAFEE 937 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 29/179 (16%), Positives = 63/179 (35%), Gaps = 20/179 (11%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 E + + + +L R+ I Y+ + NQ +L+ +L + Sbjct: 65 KEFVRSFSRRLMEMQVSGWHLRTRLALIYYSSSVHINQHFNDWQDLDVFLDQLEDASYIG 124 Query: 274 -NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T + A+ +A + S + + + +TDG S + + + + Sbjct: 125 QGTYSTYAISNATQLF--------IRETSGQSVRVSLLMTDG-----SDHPRNPDIMTVV 171 Query: 333 EYMRNAGMKIYSVAVSAP--PEGQDLLRKCTDSSG-QFFAVNDSRELLESFDKITDKIQ 388 ++ +KI+++ +S LR S Q+F R L E ++ +I Sbjct: 172 AEAKSHNIKIFAIGLSMRAMDSNSAKLRAVASSPAQQYFHSLTDRGLEE---RLLQQIC 227 >gi|238011090|gb|ACR36580.1| unknown [Zea mays] Length = 516 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 72/247 (29%), Gaps = 35/247 (14%) Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 S + +V + + + N L + Sbjct: 12 SPNSGPRPTPIVPGRVQLVSKNNNMAPLEENTQKVLLELTGGDSTSDRSGLDLVAVLDVS 71 Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL----SN 257 KI+ + + +V + R+ + + N+ PL + Sbjct: 72 GSMQGE-KIEKMKTAMKFVVKKLSSID--------RLSIVTFLDTA--NRICPLRQVTED 120 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + ++ ++ L P NTN + + L + K SS +G V+ ++DG+ + Sbjct: 121 SQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVVG-------VMLMSDGQQN 173 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSRE 375 ++ + +Y+ A +L G F VND Sbjct: 174 RGEP----------AANVKIGNVPVYTFGFGAD-YDPTVLNAVARNSMGGTFSVVNDVNL 222 Query: 376 LLESFDK 382 L +F + Sbjct: 223 LSMAFSQ 229 >gi|167399327|ref|ZP_02304851.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167051831|gb|EDR63239.1| conserved hypothetical protein [Yersinia pestis biovar Antiqua str. UG05-0454] Length = 253 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 60/176 (34%), Gaps = 28/176 (15%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTI---AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + +I + + + + + I + + + N + + + T Sbjct: 69 DYKETINSIDRNGETIDIPMDDILDPFFCLKETNAKSLNFDPNSKGDINEILNMKAEGGT 128 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY------------Q 323 + + L + ++ K +I ++DG+++ Sbjct: 129 LASSGILVGNKMLTESQNNN----------KLMIILSDGDDNTQKMSSPHDQKAGIINIT 178 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ--DLLRKCTDSSGQFFAVNDSRELL 377 L T +C+ +++ G+K+ + + P+ D + C + G F+ ++ EL Sbjct: 179 QKLITEGMCQKIKDNGIKMVFIGIGYVPDNNIIDWEKDCVGT-GNFYLAKNAHELE 233 >gi|187919333|ref|YP_001888364.1| von Willebrand factor type A [Burkholderia phytofirmans PsJN] gi|187717771|gb|ACD18994.1| von Willebrand factor type A [Burkholderia phytofirmans PsJN] Length = 451 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 45/432 (10%), Positives = 112/432 (25%), Gaps = 84/432 (19%) Query: 17 IDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGS 76 +D I+ ++ +A D AV++ ++ QT Q + Sbjct: 38 VDSGFGYMIKARLDAATDGAVIAAGEAVTRGNN---------QTEQTNNAQQAATAFFAA 88 Query: 77 YIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST 136 + A + ++A + ++ + N+S S Sbjct: 89 NYPAGFLGSSVSAGTPSIVFNAGTVTIGMTAQASV-----PVTFSKVLGFNVLNVSSSSQ 143 Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK--------KS 188 I + + + V+D ++S+ + +N + + + Sbjct: 144 AIRKT------LDMVFVIDNTKSLNTSGVPAAVRSNAVAFLNNFDVTNDRVALMHFAYGT 197 Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLI--------ESAGNLVNSIQKAIQEKKNLSVRIGT 240 + A I+ E+ N N + I + +L V Sbjct: 198 VVDVPFKGNTRGFDRATMTTDINKYTFDGSTNSPEALWNARNQLNTVITQPSSLRV---I 254 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-----------------------TNT 277 + ++ G + + + N ++ + L + N Sbjct: 255 VFFSDGAPNSFSSFFTTNQSKCNNISTTLASGDGPSGSASGLFSINALSQKLASPCYQND 314 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A + YN + T + + + + R Sbjct: 315 TSGFVTAMPKWYNAHNVNEQTFPIWPVTSPRVVSNGNATYVNVNRVSRNLLEAMAAAARA 374 Query: 338 AGMKIYSVAVS---------APPEGQDLLRKCTDSS-------------GQFFAVNDSRE 375 G ++++ GQD+L+ +++ G + + Sbjct: 375 EGTYVFTLGYGNELTKPAGPDNELGQDVLKCMANTADSLSRCYNPKQPVGVYCYAATPAD 434 Query: 376 LLESFDKITDKI 387 L F ++ +I Sbjct: 435 LKPCFSQLASQI 446 >gi|119585300|gb|EAW64896.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_d [Homo sapiens] Length = 2978 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 57/162 (35%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR T+ Y+ + +V + +L+ NT T A+ H ++ Sbjct: 73 SAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFL 132 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 133 PQLARPG------VPKVCILITDGKSQD--------LVDTAAQRLKGQGVKLFAVGI--K 176 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 177 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 55/169 (32%), Gaps = 23/169 (13%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKE 293 V++G ++Y+ S++L + R+ + N A+ A+R + Sbjct: 1091 VQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGNNLGTAVVTAHRYMLAPDA 1150 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + ++ + D G + +G+ + V + Sbjct: 1151 PGRR----QHVPGVMVLLVDEPLRGDIFSP--------IREAQASGLNV--VMLGMAGAD 1196 Query: 354 QDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ G FFAV+D L ++ + + + S P Sbjct: 1197 PEQLRRLA--PGMDSVQTFFAVDDGPSLDQAVSGLATALCQASFTTQPR 1243 >gi|119585298|gb|EAW64894.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_b [Homo sapiens] Length = 2609 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 57/162 (35%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR T+ Y+ + +V + +L+ NT T A+ H ++ Sbjct: 73 SAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFL 132 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 133 PQLARPG------VPKVCILITDGKSQD--------LVDTAAQRLKGQGVKLFAVGI--K 176 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 177 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 55/169 (32%), Gaps = 23/169 (13%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKE 293 V++G ++Y+ S++L + R+ + N A+ A+R + Sbjct: 1091 VQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGNNLGTAVVTAHRYMLAPDA 1150 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + ++ + D G + +G+ + V + Sbjct: 1151 PGRR----QHVPGVMVLLVDEPLRGDIFSP--------IREAQASGLNV--VMLGMAGAD 1196 Query: 354 QDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ G FFAV+D L ++ + + + S P Sbjct: 1197 PEQLRRLA--PGMDSVQTFFAVDDGPSLDQAVSGLATALCQASFTTQPR 1243 >gi|119585297|gb|EAW64893.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_a [Homo sapiens] Length = 2944 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 57/162 (35%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR T+ Y+ + +V + +L+ NT T A+ H ++ Sbjct: 73 SAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFL 132 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 133 PQLARPG------VPKVCILITDGKSQD--------LVDTAAQRLKGQGVKLFAVGI--K 176 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 177 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 55/169 (32%), Gaps = 23/169 (13%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKE 293 V++G ++Y+ S++L + R+ + N A+ A+R + Sbjct: 1091 VQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGNNLGTAVVTAHRYMLAPDA 1150 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + ++ + D G + +G+ + V + Sbjct: 1151 PGRR----QHVPGVMVLLVDEPLRGDIFSP--------IREAQASGLNV--VMLGMAGAD 1196 Query: 354 QDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ G FFAV+D L ++ + + + S P Sbjct: 1197 PEQLRRLA--PGMDSVQTFFAVDDGPSLDQAVSGLATALCQASFTTQPR 1243 >gi|119585301|gb|EAW64897.1| collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), isoform CRA_e [Homo sapiens] Length = 2597 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 57/162 (35%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR T+ Y+ + +V + +L+ NT T A+ H ++ Sbjct: 73 SAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFL 132 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 133 PQLARPG------VPKVCILITDGKSQD--------LVDTAAQRLKGQGVKLFAVGI--K 176 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 177 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 55/169 (32%), Gaps = 23/169 (13%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKE 293 V++G ++Y+ S++L + R+ + N A+ A+R + Sbjct: 1091 VQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGNNLGTAVVTAHRYMLAPDA 1150 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + ++ + D G + +G+ + V + Sbjct: 1151 PGRR----QHVPGVMVLLVDEPLRGDIFSP--------IREAQASGLNV--VMLGMAGAD 1196 Query: 354 QDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ G FFAV+D L ++ + + + S P Sbjct: 1197 PEQLRRLA--PGMDSVQTFFAVDDGPSLDQAVSGLATALCQASFTTQPR 1243 >gi|495866|gb|AAA58965.1| collagen type VII [Homo sapiens] Length = 2912 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 57/162 (35%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR T+ Y+ + +V + +L+ NT T A+ H ++ Sbjct: 73 SAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFL 132 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 133 PQLARPG------VPKVCILITDGKSQD--------LVDTAAQRLKGQGVKLFAVGI--K 176 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 177 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 55/169 (32%), Gaps = 23/169 (13%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKE 293 V++G ++Y+ S++L + R+ + N A+ A+R + Sbjct: 1091 VQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGNNLGTAVVTAHRYMLAPDA 1150 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + ++ + D G + +G+ + V + Sbjct: 1151 PGRR----QHVPGVMVLLVDEPLRGDIFSP--------IREAQASGLNV--VMLGMAGAD 1196 Query: 354 QDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ G FFAV+D L ++ + + + S P Sbjct: 1197 PEQLRRLA--PGMDSVQTFFAVDDGPSLDQAVSGLATALCQASFTTQPR 1243 >gi|4502961|ref|NP_000085.1| collagen alpha-1(VII) chain precursor [Homo sapiens] gi|1345650|sp|Q02388|CO7A1_HUMAN RecName: Full=Collagen alpha-1(VII) chain; AltName: Full=Long-chain collagen; Short=LC collagen; Flags: Precursor gi|987125|gb|AAA75438.1| alpha-1 type VII collagen [Homo sapiens] Length = 2944 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 57/162 (35%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR T+ Y+ + +V + +L+ NT T A+ H ++ Sbjct: 73 SAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFL 132 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 133 PQLARPG------VPKVCILITDGKSQD--------LVDTAAQRLKGQGVKLFAVGI--K 176 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 177 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 55/169 (32%), Gaps = 23/169 (13%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKE 293 V++G ++Y+ S++L + R+ + N A+ A+R + Sbjct: 1091 VQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGNNLGTAVVTAHRYMLAPDA 1150 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + ++ + D G + +G+ + V + Sbjct: 1151 PGRR----QHVPGVMVLLVDEPLRGDIFSP--------IREAQASGLNV--VMLGMAGAD 1196 Query: 354 QDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ G FFAV+D L ++ + + + S P Sbjct: 1197 PEQLRRLA--PGMDSVQTFFAVDDGPSLDQAVSGLATALCQASFTTQPR 1243 >gi|50950211|ref|NP_001002980.1| collagen alpha-1(VII) chain [Canis lupus familiaris] gi|33149359|gb|AAO64414.1| type VII collagen [Canis lupus familiaris] Length = 2936 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 56/162 (34%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR + Y+ + +V + +L+ NT T A+ H ++ Sbjct: 73 SAQGVRFAAVQYSDDPRTEFGLGALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFL 132 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 133 PQLARPG------VPKVCILITDGKSQD--------LVDTAAQRLKGQGVKLFAVGI--K 176 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 177 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 66/199 (33%), Gaps = 29/199 (14%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + + LV+++ +V++G ++Y+ S N + + + Sbjct: 1065 NAHRAEAAKRALERLVSALGPLGP----QAVQVGLLSYSHRPSPLFPLNSSYNPDVILQK 1120 Query: 266 LNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 ++ + +P N N A+ A+R L + ++ + D G Sbjct: 1121 IHSIPYVDPSGN-NLGTAVVTAHRHLLAPDAPGRR----QHIPGIMVLLVDEPLRGDIFN 1175 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-----QFFAVNDSRELL 377 + AG+K+ + + LR+ G FFAV+D L Sbjct: 1176 P--------IREAQAAGLKVMI--LGQAGADPEQLRRLV--PGMDPVQTFFAVDDGSSLD 1223 Query: 378 ESFDKITDKIQEQSVRIAP 396 + + + + I P Sbjct: 1224 RAVSGLATSLCQIGSTIQP 1242 >gi|3850207|gb|AAC72024.1| alpha-1 type VII collagen non-collagenous domain [Canis lupus familiaris] Length = 1253 Score = 63.4 bits (152), Expect = 7e-08, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 56/162 (34%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR + Y+ + +V + +L+ NT T A+ H ++ Sbjct: 73 SAQGVRFAAVQYSDDPRTEFGLGALGSGGDVIRAIRELSYKGGNTRTGAAILHVADHVFL 132 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 133 PQLARPG------VPKVCILITDGKSQD--------LVDTAAQRLKGQGVKLFAVGI--K 176 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 177 NADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 67/199 (33%), Gaps = 29/199 (14%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + + + LV+++ +V++G ++Y+ S N + + + Sbjct: 1065 NAHRAEAVKRALERLVSALGPLGP----QAVQVGLLSYSHRPSPLFPLNSSYNPDVILQK 1120 Query: 266 LNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 ++ + +P N N A+ A+R L + ++ + D G Sbjct: 1121 IHSIPYVDPSGN-NLGTAVVTAHRHLLAPDAPGRR----QHIPGIMVLLVDESLRGDIFN 1175 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-----QFFAVNDSRELL 377 + AG+K+ + + LR+ G FFAV+D L Sbjct: 1176 P--------IREAQAAGLKVMI--LGQAGADPEQLRRLV--PGMDPVQTFFAVDDGSSLD 1223 Query: 378 ESFDKITDKIQEQSVRIAP 396 + + + + I P Sbjct: 1224 RAVSGLATSLCQIGSTIQP 1242 >gi|327260894|ref|XP_003215268.1| PREDICTED: collagen alpha-3(VI) chain-like [Anolis carolinensis] Length = 3053 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 22/184 (11%), Positives = 54/184 (29%), Gaps = 20/184 (10%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 L +V+S+ + VR+ + Y+ ++ V + + ++ Sbjct: 1047 LKSFVQRVVDSL-----DVGPGKVRVAVVQYSNDANTEFNLNEYSDKASVITAVQRMTAM 1101 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 + N + + +FVI +T A ++ + + Sbjct: 1102 GGYAVNTGAA-LNYLISNVFTREAGSRVQEGVPQFVILLT--------AERSRDDVRRPA 1152 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 ++ G + + L+ + V+ EL +I I E+ Sbjct: 1153 LELKTRGAV--PLGIGFGNADITQLQTISFVPEFAVFVSGVSEL----GRIQQLIAERVT 1206 Query: 393 RIAP 396 R+ Sbjct: 1207 RLTK 1210 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 38/400 (9%), Positives = 112/400 (28%), Gaps = 38/400 (9%) Query: 15 YAIDLAHIMYI-----RNQMQSALDA----------AVLSGCASIVSDRTIKDPTTKKDQ 59 + ID + ++ +Q +D+ AV+ ++ + + + K Sbjct: 1032 FLIDGSDATRSSFPELKSFVQRVVDSLDVGPGKVRVAVVQYSNDANTEFNLNEYSDKASV 1091 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 + + + + N + ++ + Q++ A+ Sbjct: 1092 ITAVQRMTAMGGYAVNTGAALNYLISNVFTREAGSRVQEGVPQFVILLTAERSRDDVRRP 1151 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 L L + + ++ + VS E +Q+ K Sbjct: 1152 ALELKTRGAVPLGIGFGNADITQLQTISFVPEFAVFVSGVSELGRIQQLIAERVTRLTKA 1211 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI--ESAGNLVNSIQKAIQEKKNLSVR 237 + + + + S + + + + R Sbjct: 1212 EIEALTPEITVPLPRPGDGKKDVVFLIDDSQYAVPEFNSVREFIERLVS-NLNVGSDNTR 1270 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESS 295 I I ++ + EV+ + +L P N A+ + + ++ S Sbjct: 1271 IAVIQFSEDPRVAFLLNAHSTKEEVQDAVRRLKPKGGRQVNLGSALEYVSKNIFTRPSGS 1330 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 G+ +F+I + ++ + + ++ G+ ++A ++ Sbjct: 1331 RIEEGA---PQFLILL--------FSHPSDDDVEDPAIQVKQVGVAPLTIA---KNVDRE 1376 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 L++ S F V+ ++L + ++ + Sbjct: 1377 LMQTIALSPQYVFQVSTYQDLPT----LEQQLISPVTTLT 1412 Score = 55.3 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 31/252 (12%), Positives = 80/252 (31%), Gaps = 25/252 (9%) Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + + +A+S V VS + L++ + T + + + Sbjct: 1370 AKNVDRELMQTIALSPQYVFQVSTYQDLPTLEQQLISPVTTLTTQQIQGLIADTSSPTDI 1429 Query: 195 TKSK-----YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + + ++ + + ++ VRIG + ++ + Sbjct: 1430 DSEAKDIVFLIDSSDNVGADFAHIRDFIIRIIQQL-----DVRSRKVRIGVVQFSNNVFP 1484 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 N V + ++ P T N A+ + + N S + + + Sbjct: 1485 EFFLKTHPTKNAVLQAIRRMRPRGGTPLNVGKALDY---VVKNHFIKSAGSRREDGVPQH 1541 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ + G + + + + ++G+K S+ V A L++ T+ Sbjct: 1542 LVLLLGGRSQDDVGRPSNV--------ILSSGIK--SLGVGAKNADSAELQRITNDQRTA 1591 Query: 368 FAVNDSRELLES 379 F V + EL Sbjct: 1592 FIVREFAELPTI 1603 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 22/171 (12%), Positives = 59/171 (34%), Gaps = 20/171 (11%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + + N++ + + S+R+G + Y+ +V + L Sbjct: 253 FATVRDFVANVIERLS-----VGSESIRVGVVTYSDQSRTAFFLNSHTRKADVLEAVKAL 307 Query: 270 NPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + N A+ + +N S + + + ++ I S+ +++ + Sbjct: 308 SFPGGEEANIGEALEFVVQNHFN---RSGGSRIEEHVPQVLVLI--------SSSESSDD 356 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + M+ AG ++S ++ L++ F + D+R L + Sbjct: 357 IREGVLAMKQAG--VFSFSIGVKNADNVELQQIATDGSFVFTILDTRNLGD 405 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 24/174 (13%), Positives = 58/174 (33%), Gaps = 19/174 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKN--LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ ++ +V+ + I S+++G + YN + + ++ + Sbjct: 1642 INFGRDNFQEVVDFVYGIIDAIYEEGDSIKVGLVQYNSDVSDEFFLKDFTDKEQILEAVK 1701 Query: 268 KLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++ T NT A+ H + + ++ + IT G Sbjct: 1702 RIAYKGGRTANTGTAIKHIKAK---HFVKEAGSRVDQKVPQIAFIITGGRP--------E 1750 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + G+K+++V V + L + + F ++EL E Sbjct: 1751 DDGQTAALALAQQGVKVFAVGVRNIDLGDIAKLSSDSTTG---FRAATAQELSE 1801 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 17/169 (10%), Positives = 56/169 (33%), Gaps = 15/169 (8%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHA 284 + + R+ ++ + + E+ R+ K+ N A+ A Sbjct: 863 SDLNLGPDATRVAVAQFSDNVQVEFNFQDIPSKQEILQRVKKMRIKGGRSLNIGAALETA 922 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 R+++ + + +F++ + G +++ + Q ++ AG+ + Sbjct: 923 MRDVF---VRQAGSRIEEGVPQFLVLLAAG--------RSSDDVDQPANALKQAGVATFV 971 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + L + + + + E +I + ++ +R Sbjct: 972 I--KSRTADPVELERIVFAPQFILNADSLSRIGEIQPEIVNLLKTIEIR 1018 >gi|340082|gb|AAA36794.1| undulin 1 [Homo sapiens] Length = 843 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 57/165 (34%), Gaps = 18/165 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + RIG Y+ + +EV + L NT T A+++ + Sbjct: 2 DVGSEKTRIGLAQYSGDPRIEWHLNAFSTKDEVIEAVRNLPYKGGNTLTGLALNYIFENS 61 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + T + K I ITDG++ + +R +G++++++ V Sbjct: 62 FK-----PEAGSRTGVSKIGILITDGKSQDDIIPPS--------RNLRESGVELFAIGV- 107 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L++ + V + + + +T + + Sbjct: 108 -KNADVNELQEIASEPDSTHVYNVAEFDLMHTVVESLTRTLCSRV 151 >gi|291386646|ref|XP_002709870.1| PREDICTED: anthrax toxin receptor 1 [Oryctolagus cuniculus] Length = 564 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYFFVEQLAHKFISPQLRMSFIVFSTRGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|126178503|ref|YP_001046468.1| von Willebrand factor, type A [Methanoculleus marisnigri JR1] gi|125861297|gb|ABN56486.1| von Willebrand factor, type A [Methanoculleus marisnigri JR1] Length = 1002 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 27/268 (10%), Positives = 73/268 (27%), Gaps = 30/268 (11%) Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 + I + + D S SM Y + + + + + Sbjct: 496 QPDPIDVVLCTDRSGSMLKDYPDRMVKAMDASRIFNVQIDHGRDRL--------GLVSFG 547 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 I ++ G + + + G Y LS++ + + Sbjct: 548 GQGQTNIIQYSQNVGGWLGRDSGTWDDLSYRTANYGNNRYYTEYAT-LDLGLSDDQSAIN 606 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD---------- 313 + + + P T ++ A EL + S + D Sbjct: 607 TTIAGMIPDSGTPMRYGIYKAITELKDNGRSDAVKAVVVLGDGEYNYYGDPLARGTGRTT 666 Query: 314 -----GENSGASAYQNTLNTLQ-ICEYMRNAGMKIYSVAVSAP----PEGQDLLRKCTD- 362 + + + + Q + Y N +++Y ++ S + + Sbjct: 667 KFDWSNTQNDYTYFSDLGAAEQRMSTYANNHNVRLYMISFSDDIVNGSHTWNTMETLASE 726 Query: 363 SSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G + +L + + +I +++ + Sbjct: 727 TGGTHYHAPSDSDLAQVYTEIAGELKTE 754 >gi|313238339|emb|CBY13421.1| unnamed protein product [Oikopleura dioica] Length = 501 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 56/185 (30%), Gaps = 25/185 (13%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI--GIVGNQCTPLSNNLNEVKSRLN 267 D++ + + V+ + + R+ + + PLS VKS L Sbjct: 53 FDLMKKWIADFVDKF-----DVSQTATRVSVVQFTDNVSDRNGFGFPLSGQAARVKSDLA 107 Query: 268 KLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 L NT+ A+ + + + K +I +TDG + Sbjct: 108 NLRYMTGNTHAGNALRYVKNNVLSRAR--------KSASKVLIVLTDGVSQD-------- 151 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + + + ++SV + + L K F + L D + +K Sbjct: 152 EIEKAASDLIGDKVLVFSVGIG-NSVDANELEKIAGLPEYVFKTANYNALTGITDTLYNK 210 Query: 387 IQEQS 391 + Sbjct: 211 LCSSL 215 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 20/140 (14%), Positives = 49/140 (35%), Gaps = 24/140 (17%) Query: 256 SNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 S + V ++ + +T A+ + YR ++ + + +++ VI +TDG Sbjct: 292 STSKEAVIGAIDGMKKLNGDTCIGEALDYFYRNMFTSQAGQR-----SDVEQRVIVMTDG 346 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS--------APPEGQDLLRKCTDSSG- 365 + + + E +R +IY++ + + L + Sbjct: 347 KRN------CPAEIAKPAELIRAQEAEIYAIGIGHQCGYGENHNCYDRQELHEIASKPAD 400 Query: 366 -QFFAVNDSRELLESFDKIT 384 F +N+ +L +I Sbjct: 401 KYVFEINNFDQL--ILKRIG 418 >gi|303251581|ref|ZP_07337755.1| hypothetical protein APP6_0784 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|307252106|ref|ZP_07534005.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|302649579|gb|EFL79761.1| hypothetical protein APP6_0784 [Actinobacillus pleuropneumoniae serovar 6 str. Femo] gi|306860406|gb|EFM92420.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 6 str. Femo] Length = 538 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 58/493 (11%), Positives = 120/493 (24%), Gaps = 127/493 (25%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVS----------------DRTIKDPTTKK 57 +++ A I+ + ++ +L+ AVLS A + + + + Sbjct: 36 FVSLESAGIIQDQARLSDSLEQAVLSLTAENNNGRKDNDYKLSGSSNKENDSFDISSEVG 95 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + S + +K L Q + + N I +N T K + I ++ Sbjct: 96 KRDSQMVTTFVKAFLPQTNDDKMNL--IPICKTVNNTSGKGHTSSSEVTCTVSGTIEHKS 153 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENL---AISICMVLDVSRSMEDLYLQKHNDNNN- 173 F + + + + +N I + +V D+S SM+D + + Sbjct: 154 WFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIPIDLMVVADLSGSMKDGIKGEKLNGGTN 213 Query: 174 ----------MTSNKYLLPPPPKKSFWSKNT---------TKSKYAPAPAPANRKIDVLI 214 L + K P + + Sbjct: 214 SKIYILREVLKELADKSLFTQEANEYNRIGITAFAMGAEHPKENKCVLPFVLQNNLHEMS 273 Query: 215 ES------------------AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 +S N V + I I S Sbjct: 274 KSKIKQYLTSSHKSLRRTEFVDNFVALLDTEATLNSIGKPNYDIIFPKSSICLEGLKKAS 333 Query: 257 N------NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN------------- 297 + ++R++ L T + A ++ +EK S Sbjct: 334 QFWYTKEEKEKFRNRVDSLKANGGTLASSGLLTASNQMLSEKSRSEELNQETKRVILVLS 393 Query: 298 -------------------------------------TIGSTRLKKFVIFITDGENSGAS 320 + ST + + + Sbjct: 394 DGNDDMSNLNLADLERNSIPFTNFSRITKNLILGKKEDLSSTATSNRAYYNRHSTFNYNT 453 Query: 321 AYQNTLNT---LQICEYM--------RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 N +C + ++ K+ V E D + C + G +F Sbjct: 454 YLTNKTKDISRKGMCSIIQEKLNTLNKDKNTKLVFVEFGYRSESADAWKTCVGN-GNYFY 512 Query: 370 VNDSRELLESFDK 382 ++ LL SF + Sbjct: 513 ADNRESLLNSFKQ 525 >gi|332223236|ref|XP_003260773.1| PREDICTED: cochlin [Nomascus leucogenys] Length = 550 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 48/160 (30%), Gaps = 20/160 (12%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYN 290 ++ +I + + + V + + + T T A+ R ++ Sbjct: 400 SDIGAKIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFG 459 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 S N F++ +TDG++ + +AG+ I+SV V+ Sbjct: 460 PIRESPNK-------NFLVIVTDGQSYD--------DVQGPAAAAHDAGITIFSVGVAWA 504 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 P L+ F + L + I Sbjct: 505 PLDD--LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 542 >gi|332215868|ref|XP_003257064.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(VII) chain-like [Nomascus leucogenys] Length = 2944 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 27/163 (16%), Positives = 57/163 (34%), Gaps = 19/163 (11%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELY 289 VR T+ Y+ + +V + +L+ NT T A+ H ++ Sbjct: 72 VSAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNTRTGAAILHVADRVF 131 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 132 LPQLARPG------IPKVCILITDGKSQD--------LVDTAAQRLKGQGVKLFAVGI-- 175 Query: 350 PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 176 KNADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 43.7 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 29/206 (14%), Positives = 63/206 (30%), Gaps = 24/206 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 P E+ ++ + A+ +V++G ++Y+ S++ Sbjct: 1055 VVFLPHATQDNAHRA-EATRRVLERLVSALGPLGPQAVQVGLLSYSHRPSPLFPLNGSHD 1113 Query: 259 LNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 L + R+ + N A+ A+R + + ++ + D Sbjct: 1114 LGIILQRIRDMPYMDPSGNNLGTAVVTAHRYVLAPDAPGRR----QHVPGVMVLLVDEPL 1169 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-----QFFAVN 371 G + AG+ + V + + LR+ G FFAV+ Sbjct: 1170 RGDIFSP--------IREAQAAGLNV--VMLGMAGADPEQLRRLA--PGMDSVQTFFAVD 1217 Query: 372 DSRELLESFDKITDKIQEQSVRIAPN 397 D L + + + + S P Sbjct: 1218 DGPSLDRAVSGLAAALCQASFTTQPR 1243 >gi|296200542|ref|XP_002747689.1| PREDICTED: matrilin-4 [Callithrix jacchus] Length = 770 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 400 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 454 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 455 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 508 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + + Sbjct: 509 ----AARAKEEGIVMYAVGVGKAV--EAELREIASEPAEQHMSYAPDFGTMTHLLENLKG 562 Query: 386 KIQ 388 I Sbjct: 563 SIC 565 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 50/153 (32%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + ++ + L P T T A+ +A Sbjct: 64 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMVRAIRDLVPLAQGTMTGLAIQYAMNVA 123 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 124 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 173 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 174 RADVGS--LRAMASPPLEEHVFLVESFDLIQEF 204 >gi|225030986|gb|ACN79500.1| inter-alpha-trypsin inhibitor heavy chain H4 precursor [Nilaparvata lugens] Length = 315 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 44/274 (16%), Positives = 85/274 (31%), Gaps = 31/274 (11%) Query: 129 TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 L + VLD+S SM +++ D + Sbjct: 40 IEDGHFVHFFAPAELPPLRKQVVFVLDISGSMFGEKIKQLKDAMLKILSDLNPQDHFSIV 99 Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 +S N ++ A +KI L E NL N + + +N Sbjct: 100 LFSDNAYV--WSKAKTAVMKKI--LDEGFYNLDNETLAILDDHRN--------------- 140 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 +N+ K + + P +TN + + + KE+ T + + Sbjct: 141 -EILQATPDNVKTAKEFVELIKPTTSTNIIDGLRKGLKLVKEGKETLDTTK--EPSQPIM 197 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNA-GMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 F+TDGE + + + IYS+A L+K + ++ F Sbjct: 198 FFLTDGEPNVDLTDPVEIVNET--SSLNEQLKTPIYSLAFGQGA-DITFLKKLSKANHGF 254 Query: 368 ----FAVNDSR-ELLESFDKITDKIQEQSVRIAP 396 + +D+ +L + +I+ + I P Sbjct: 255 ARNIYEGSDATLQLNNFYKEISSPLLANVTFIYP 288 >gi|302870768|ref|YP_003839404.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] gi|302573627|gb|ADL41418.1| von Willebrand factor type A [Caldicellulosiruptor obsidiansis OB47] Length = 900 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 61/189 (32%), Gaps = 27/189 (14%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 K+++ ++ ++ ++ + AY + +V Sbjct: 425 GISKLEIAKSASAKMIEHLESSDGVGVIAFDHNYYWAYEFSKLVR--------KKDVIES 476 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++ + T + E+ S K ++ +TDG Sbjct: 477 ISSIEVGGGTAI----------IPPLSEAVKTLKKSKAKSKLIVLLTDGMGEQGGY---- 522 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 + +KI ++ V +L +SG+F+ V++ EL++ F K T Sbjct: 523 ---EIPANEAKRNNIKITTIGVG-KFVNLPVLSWIASFTSGRFYLVSNPYELVDVFLKET 578 Query: 385 DKIQEQSVR 393 I+ + ++ Sbjct: 579 KIIKGKYMK 587 >gi|327272010|ref|XP_003220779.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Anolis carolinensis] Length = 919 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 67/186 (36%), Gaps = 13/186 (6%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSRLNKLNPYENT 275 V +++ + + ++ + ++N K + ++P T Sbjct: 306 RQTVEAMKTILDDLRSNDQFSVLDFNHNVRCWRDSLVQASNAQTEAAKKYIEGIHPNGGT 365 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N A+ A L ST ++ ++DG+ + T+ + + + Sbjct: 366 NINDALLRAIFILKEASNMGMLDPSST---SMIVLVSDGDPTVGELKLPTIQ-KNVKKNI 421 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAV----NDSRELLESFDKITDKIQEQ 390 ++ + ++ + + D L++ +++G V S ++ + F++++ + ++ Sbjct: 422 QDD-ISLFCLGIGFDV-DYDFLKRLAQENNGMAHRVFGNQETSSQMRKFFNQVSTPLLKK 479 Query: 391 SVRIAP 396 P Sbjct: 480 LEFNYP 485 >gi|301753369|ref|XP_002912539.1| PREDICTED: anthrax toxin receptor 2-like [Ailuropoda melanoleuca] Length = 611 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 38/248 (15%), Positives = 83/248 (33%), Gaps = 27/248 (10%) Query: 161 EDLYLQKHNDNNNMTSNKYLLPP-------PPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 + + +L+P P ++ + A K + Sbjct: 119 AGWWRGCLWPALARSPGSWLVPGLWLLVLSCPGGPVSAQEQPSCRGAFDLYFVLDKSGSV 178 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---N 270 + + N +Q+ + + +R+ I ++ PL+ + +++ L L + Sbjct: 179 ANNWIEIYNFVQQLTERFVSPQMRLSFIVFSSQAT--IILPLTGDRSKISKGLEDLKRVS 236 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T + + A ++ G + +I +TDG+ G + Sbjct: 237 PVGETYIHEGLKLANEQIQKA--------GGFKASSIIIALTDGKLDG----LVPSYAEK 284 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN-DSRELLESFDKITDKIQE 389 + R+ G ++Y V V Q L + DS Q F V + L + I D+ Sbjct: 285 EAKISRSFGARVYCVGVL--DFEQAQLERIADSKDQVFPVKGGFQALKGIINSILDRSCT 342 Query: 390 QSVRIAPN 397 + + + P+ Sbjct: 343 EILELRPS 350 >gi|296214738|ref|XP_002807270.1| PREDICTED: LOW QUALITY PROTEIN: cochlin-like [Callithrix jacchus] Length = 594 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + N V + + + T T A+ R ++ S Sbjct: 449 KIAAVQFTYDQRMEFSFTDYNTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPIRES 508 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 509 PNK-------NFLVIVTDGQSYD--------DVQGPAAAAHDAGITIFSVGVAWAPLDD- 552 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 553 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 586 >gi|238790016|ref|ZP_04633794.1| von Willebrand factor type A domain protein [Yersinia frederiksenii ATCC 33641] gi|238721829|gb|EEQ13491.1| von Willebrand factor type A domain protein [Yersinia frederiksenii ATCC 33641] Length = 448 Score = 63.0 bits (151), Expect = 7e-08, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 52/156 (33%), Gaps = 13/156 (8%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYRELYNEKESSHNT 298 +AY+ NN + +++ + ++P T + + ++ Sbjct: 111 VVAYDNNAEVIIPATKVNNKPALIAKIQQHIHPMGMTALFAGVSKGIGQVDKNLNPEQVN 170 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 +I ++DG+ + + L + G+ I ++ + +DL+ Sbjct: 171 R--------IILLSDGQANTGPTSISEL--SDLARMAAKKGIAITTIGLG-EDYNEDLMT 219 Query: 359 KCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G V +S +L +F K + + Sbjct: 220 AIAGYSDGNHSFVANSADLESAFTKEFQDVMSVVAQ 255 >gi|242097078|ref|XP_002439029.1| hypothetical protein SORBIDRAFT_10g030210 [Sorghum bicolor] gi|241917252|gb|EER90396.1| hypothetical protein SORBIDRAFT_10g030210 [Sorghum bicolor] Length = 607 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 31/199 (15%), Positives = 66/199 (33%), Gaps = 32/199 (16%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT-PLSNN 258 ++D L + ++ + R+ + +N G +S + Sbjct: 113 VSGSMRDFGRLDKLKSAMRFIIKKLAPMD--------RLSVVTFNGGATRECPLRAMSED 164 Query: 259 LNEVKSRL-NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 V + + + L TN + + L + + T G VI ++DGE + Sbjct: 165 AVPVLTDIVDGLVARGGTNIEAGLKMGLQVLDGRRYTGARTAG-------VILMSDGEQN 217 Query: 318 GASAYQNTLNTLQICEYMRN-AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 +RN +Y+++ + +LL+K G + V DS + Sbjct: 218 SGD-----------ATRVRNPQNYPVYTLSFGSNA-DMNLLQKLAGGGGTYNPVLDSGGM 265 Query: 377 L--ESFDKITDKIQEQSVR 393 + F ++ + VR Sbjct: 266 SMLDVFSQLMAGLLTVVVR 284 >gi|42526759|ref|NP_971857.1| batA protein, putative [Treponema denticola ATCC 35405] gi|41817074|gb|AAS11768.1| batA protein, putative [Treponema denticola ATCC 35405] Length = 332 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 22/168 (13%), Positives = 54/168 (32%), Gaps = 41/168 (24%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 P + + SRL+ L+ + T + + + K +S Sbjct: 138 SSSAALILPPTIDHKVFLSRLDSLSIGELGDGTAIGMGLAVSSAYMTRTKLNS------- 190 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------- 352 +++ +TDGEN+ +N + + N + Y + + + Sbjct: 191 ---SYIVLLTDGENNTG-----EINPKTAAKVLVNKNIGFYVIGIGSSGYTTLEYTDRKT 242 Query: 353 ------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 + L+K +G++ + + L F+ I+ ++ Sbjct: 243 GKTYSGSIFSKFDELELKKIAQYGNGKYASASSPEILENIFNTISKQV 290 >gi|239814531|ref|YP_002943441.1| hypothetical protein Vapar_1524 [Variovorax paradoxus S110] gi|239801108|gb|ACS18175.1| conserved hypothetical protein [Variovorax paradoxus S110] Length = 409 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 38/320 (11%), Positives = 78/320 (24%), Gaps = 10/320 (3%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+++ F+ A+D + ++ ++Q+ALD+ LS + T +T+ Sbjct: 18 VALVLLFLLGFMGIALDFGRLFIVKTELQTALDSCALSAAQELDGAGDALTRATSAGKTA 77 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 K + G E + + T +Y + I L Sbjct: 78 ADLNKINFQGEATGLAQTEVVFSDSLIGTYSHTFTPIANARYAKCLHTKSGIAPWILHAL 137 Query: 122 G-LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ S + + + + + V + Sbjct: 138 SAFSGNSTYGASKSVAAVAVATRAPSQTNCLVPVGVCQRTATPGWGFARGEWIEGVTNSN 197 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA----GNLVNSIQKAIQEKKNLSV 236 + W S + + V K + Sbjct: 198 DDVESGQFRW---VDFSGSGGGAREIKDLLTSSGQCGLPGMATNVGKAGKTNGAVAAWNT 254 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 R G Y + P + NP ++ P A R + E + Sbjct: 255 RFGI--YQGSLSAATAIPDQTGYAWYADSVAATNPGRYDDSGPNGFAAKRNAFAPYEGDN 312 Query: 297 NTIGSTRLKKFVIFITDGEN 316 + LK F + Sbjct: 313 KNPDTKNLKTQGNFSSTDYT 332 >gi|274320027|ref|NP_001162099.1| matrilin-4 [Macaca mulatta] gi|134093113|gb|ABO52973.1| matrilin 4 isoform 1 precursor [Macaca mulatta] Length = 581 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 56/183 (30%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 360 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 414 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 415 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 468 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + Sbjct: 469 ----AARAKEEGIAMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLRNLRS 522 Query: 386 KIQ 388 I Sbjct: 523 SIC 525 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|109094740|ref|XP_001104627.1| PREDICTED: matrilin-4-like, partial [Macaca mulatta] Length = 222 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 56/183 (30%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 22 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 76 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 77 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 130 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V + LR+ D + + Sbjct: 131 ----AARAKEEGIAMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLRNLRS 184 Query: 386 KIQ 388 I Sbjct: 185 SIC 187 >gi|268579981|ref|XP_002644973.1| Hypothetical protein CBG10938 [Caenorhabditis briggsae] Length = 548 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 39/388 (10%), Positives = 99/388 (25%), Gaps = 41/388 (10%) Query: 13 ITYAIDLA-HIMYIRNQMQSALDAAVLSGCASIVS-----DRTIKDPTTKKDQTSTIFKK 66 + + + + ++ D A L + T + T Sbjct: 176 VGAIVYSSERKQRQKIKLGEHKDMASLMKAVDNLPFFSGITATGEALKFAATHTEGRRDN 235 Query: 67 QIKKH--LKQGSYIRENAGDIAQKAQINITKDKNNPLQ-YIAESKAQYEIPTENLFLKGL 123 + L G ++ + + + + ++ + G Sbjct: 236 LTLNYVVLTDGYSYDIIESGARLLREVPNSVVYAVTIGEIYLRKELELITGNKSNVMIGS 295 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL----QKHNDNNNMTSNKY 179 + +++ R+ + + ++ + D + + N+N Sbjct: 296 MSYGTVVKRIKNCEARARAQQLRDENPVELVHPGEFLSDAFSHRQSVQTNENIKKDEPAK 355 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRI 238 P + L ++ + K Q + R+ Sbjct: 356 DSVTEPTDKL-----PVNDCQYDVGIIFDSSGSLEKNFQTQLQIANKLQQMPIRPNLTRV 410 Query: 239 GTIAYNIGIVGNQCTPLSNNLNE--VKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESS 295 + + N + ++ + K T T A+ K + Sbjct: 411 AIVQFAGKSKTRVLADFVQNKTKDQLEKIIEKSPFYSGTTFTNQALKRMALLFEASKRDN 470 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--- 352 ++ TDG + +T + E ++ G+ +Y+V +S Sbjct: 471 CKMK--------LLVFTDGY--------SAEDTAEGIEALKRQGITVYTVGISTDKNAGL 514 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESF 380 L+ S +F +D LL+ F Sbjct: 515 NVSELKGMATSPSHYFDSSDFDNLLKHF 542 >gi|332208765|ref|XP_003253479.1| PREDICTED: LOW QUALITY PROTEIN: matrilin-4-like [Nomascus leucogenys] Length = 448 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 65 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYAMNVA 124 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 125 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 174 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 175 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|291232650|ref|XP_002736273.1| PREDICTED: mind bomb 2-like [Saccoglossus kowalevskii] Length = 847 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 30/173 (17%), Positives = 54/173 (31%), Gaps = 12/173 (6%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 L+ Q A Q +V + G L++N K+ ++ L Sbjct: 464 KACTEFLLGIQQTATQTGLRENVAVV----EFGSKTRIVRNLTDNYRLTKNAIDSLQAGG 519 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T + + A +E+ G + +I +TDG Q + Sbjct: 520 TTPMFEGLMEAMKEVIQNGGVLTLPGGKKMTPR-IILMTDGRPDD--KDQVLKAAMSFGP 576 Query: 334 YMRNAG----MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + G + I V P ++LL + F V D +L + F + Sbjct: 577 LWKAVGLPFPIPIACVGCG-PDVDKELLAVIAKVTNGMFVVGDISQLGDFFRR 628 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 28/177 (15%), Positives = 56/177 (31%), Gaps = 11/177 (6%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 I L + + +Q +R G L++N VK ++ L Sbjct: 6 IAELKRAGNEFLIGVQLTA---NQTGLRENVAVVEFGRNTRIVQSLTDNYVSVKRAIDSL 62 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 P T + + A +E+ + G + I +TDG G + + + Sbjct: 63 VPGGTTPMFEGLMEAMKEIISNGGVLTLKGGKKMTPRV-ILMTDGY--GDDKDKVAKSAM 119 Query: 330 QICEYMRNAG----MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + G + I V P ++LL + + + + +L F + Sbjct: 120 SFGPLWKAVGLPHPIPIACVGCG-PNVDKELLAIIAEITNGMYVTGEMGQLSGFFRR 175 >gi|119494080|ref|ZP_01624623.1| hypothetical protein L8106_01082 [Lyngbya sp. PCC 8106] gi|119452182|gb|EAW33385.1| hypothetical protein L8106_01082 [Lyngbya sp. PCC 8106] Length = 608 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 14/162 (8%), Positives = 49/162 (30%), Gaps = 15/162 (9%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y+ + + ++ ++ TN + + Sbjct: 79 VVIYDDQPETILTPQTVEDKAAICKQIGRIRAGGCTNLSGGWLMGCDCVKS--------R 130 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 ++ V+ +TDG+ + + + + G+ ++ + L+ Sbjct: 131 QTSDRLNRVLLLTDGQANMG--ITDPKVITKTAQNQAETGIITTTLGFGSYFNEDLLISM 188 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQ-----EQSVRIAP 396 + G F+ + ++ + F + + +V++ P Sbjct: 189 ADAAGGNFYFIQSPDDVAQVFRIELESLTAVVAQNLTVKLQP 230 >gi|7022738|dbj|BAA91707.1| unnamed protein product [Homo sapiens] Length = 218 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 30/145 (20%), Positives = 50/145 (34%), Gaps = 16/145 (11%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G L+ + +++ L +L P +T + A ++Y E + T Sbjct: 7 STRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT---- 62 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +I +TDGE + + R+ G +Y V V + L + D Sbjct: 63 --ASVIIALTDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIAD 114 Query: 363 SSGQFFAVND-SRELLESFDKITDK 386 S F VND + L I K Sbjct: 115 SKDHVFPVNDGFQALQGIIHSILKK 139 >gi|326670654|ref|XP_002663415.2| PREDICTED: collagen alpha-3(VI) chain [Danio rerio] Length = 3218 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 20/197 (10%), Positives = 59/197 (29%), Gaps = 20/197 (10%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + +V + + RI + Y+ N E+ Sbjct: 213 SDGTRNGFPAMKDFVQRMVEKL-----DVAENRDRISVVQYSREPGANFYLNTYTTKEEI 267 Query: 263 KSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + L T A+ + ++ S + + ++ ++ G Sbjct: 268 VDAVRGLRHKGGRPLYTGEALQYVRDNVFTASSGSRRL---EGVPQILVLLSGG------ 318 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 ++ + ++ G+ + + + L++ + +V+D EL Sbjct: 319 --RSFDSVNAAASSLKELGVL--TFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQ 374 Query: 381 DKITDKIQEQSVRIAPN 397 +++ +Q S+ + P Sbjct: 375 EQLLASVQVTSIPVTPT 391 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 32/337 (9%), Positives = 99/337 (29%), Gaps = 33/337 (9%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + ++++ + + + + + A L Sbjct: 284 TGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSVNAAAS-----------SLK 332 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 + + S G R + ++ L VS E +Q+ + ++ + P Sbjct: 333 ELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTS-IPVTPT 391 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK--NLSVRIGTIA 242 +T + D S + + +Q+ +++ R+ + Sbjct: 392 SPTVTAEYSTPRKDVVFL----LDGSDGTRSSFPAMRDFVQRVVEKFNIEANRDRVSVVQ 447 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIG 300 Y+ + ++ R+ L T A+ + ++ S Sbjct: 448 YSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTGAALQYVRDNVFTASSGSRRL-- 505 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + +I ++ G ++ + ++ G+ ++ + + L++ Sbjct: 506 -EGVPQILILLSGG--------RSFDSVDAAASSLKELGVL--TLGIGSRGSDSRELQRI 554 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + +V D EL +++ +Q ++ P Sbjct: 555 SYEANYALSVADFSELPNVQEQLLASVQTVAMSGTPT 591 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 37/319 (11%), Positives = 95/319 (29%), Gaps = 16/319 (5%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 I + Q + Q++ + + L S + + + Sbjct: 1488 ADLAEAIKYVIRNELQASAGVRLAQASQHLVVLTGGRSTSDVSTYGSILKGSRVNCIGIG 1547 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + R +A S VL V + N S ++ PP + + Sbjct: 1548 AENADSRQLIQIATSSDDVLQVPSFPNLPNI--QNKFIARLSGSIVVEPPIEIDETTPGL 1605 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT--IAYNIGIVGNQC 252 ++K A + I++ + ++ I I R+ Y + Sbjct: 1606 PQAKAADIVFLVDGSINLGRNNFKEVMEFILNLIDLFYTERDRLQIGLAHYATDVTDVFY 1665 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 NN +++ + + + + + + ++ +T Sbjct: 1666 LNTYNNKDDIINAITRAEYKGGREIRTGSA-IRHVQKTHFVKEKGSRKDEGIPQILMVVT 1724 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 G ++ ++ ++ +G+++Y+V V + +D L + + Sbjct: 1725 GG--------RSRDDSKSAALGLKASGVRVYAVGVG---DIEDELNNLGSEATTVARAST 1773 Query: 373 SRELLESFDKITDKIQEQS 391 +EL E ++I D + + Sbjct: 1774 FQELSELNEQILDTLDQDV 1792 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 22/226 (9%), Positives = 61/226 (26%), Gaps = 18/226 (7%) Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 P + K+ + + + +V+ + Sbjct: 1384 TKVSTMTRDEISTPAVPRDPLNLGRKDIIFLIDGSDSV-GQSGVAHIRDFILKVVDQL-- 1440 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + + VR+ + Y +N V S + +L + A + Sbjct: 1441 ---DVRPDQVRVALVQYGERPKTEFSLNSHDNKQSVISAIKRLRHMGGR--GADLAEAIK 1495 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + + + + + + ++ +T G ++ + ++ G ++ + Sbjct: 1496 YVIRNELQASAGVRLAQASQHLVVLTGGRSTSDVSTYGSIL----------KGSRVNCIG 1545 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + A L + SS V L +K ++ V Sbjct: 1546 IGAENADSRQLIQIATSSDDVLQVPSFPNLPNIQNKFIARLSGSIV 1591 >gi|311695164|gb|ADP98037.1| von Willebrand factor type A domain protein [marine bacterium HP15] Length = 342 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 25/192 (13%), Positives = 63/192 (32%), Gaps = 43/192 (22%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NP 271 ++ +++ + R+G I + PL+ +L V++ + + Sbjct: 117 QAVKRVLDDFIS-----RRQGDRLGLILFGTEPYVQA--PLTFDLETVRTLMREAGLGMA 169 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T A+ A + L + ++ V+ +TDG N+ + + Sbjct: 170 GRATAIGDAVGLATKRL----------RNRPQDQRVVVLLTDGANTAGEITPDKATEIAA 219 Query: 332 CEYMRNAGMKIYSVAVSAP-----------------PEGQDLLRKCTD-SSGQFFAVNDS 373 +++Y++ + A ++LL + + G++F Sbjct: 220 AAS-----IRLYTIGIGAESMVQRGLLGSRRVNPSRDLDENLLTRMAQQTGGEYFRARSL 274 Query: 374 RELLESFDKITD 385 EL ++ I Sbjct: 275 PELELIYESIDR 286 >gi|167647386|ref|YP_001685049.1| von Willebrand factor type A [Caulobacter sp. K31] gi|167349816|gb|ABZ72551.1| von Willebrand factor type A [Caulobacter sp. K31] Length = 592 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 29/343 (8%), Positives = 75/343 (21%), Gaps = 30/343 (8%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 + + + E A + ++ + + A Y Sbjct: 101 AAYSFAAPSPVVAPNFAPPIRDTEKYPGAAANPVKRVAEEPVSTFSIDVD-TAAYANVRR 159 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 L P + + A V+ + + + Sbjct: 160 FLNEGAAPPHDALRVEELINYFDYGYARPTAQEPPFKPTVTVVPSPWSQDRQLMHIGVQG 219 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 P + T ++ + ++ L++ + Sbjct: 220 YATPRAGQPPLNLVFLIDTSG-----SMSGPDRLPLAKKALNVLIDQL----------RP 264 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS--RLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + ++ +++K L L +T + AY + Sbjct: 265 QDRVSMVAYAGSAGAVLSPTDGKSKLKMRCALTALRSGGSTAGGQGLELAYALARQNLDP 324 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 VI +TDG+ + + R +G+ + Sbjct: 325 KAVNR--------VILMTDGDFNVG--IADPTRLKDFVADQRKSGVYLSVYGFGRGNYND 374 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 +++ + L E+ + D IA + Sbjct: 375 TMMQALAQNGNG--TAAYVDGLQEARKLLRDDFDSALFPIADD 415 >gi|487830|gb|AAA59180.1| eukocyte adhesion glycoprotein precursor [Homo sapiens] Length = 1163 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 56/174 (32%), Gaps = 17/174 (9%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 N V ++ Q + + ++N + + +++L T T Sbjct: 172 NFVRAVISQFQRPSTQFSLMQFSNKFQTHFTFEEFRRTSNPLSLLASVHQLQ--GFTYTA 229 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ + L++ + K +I ITDG ++L+ + A Sbjct: 230 TAIQNVVHRLFHASYGARRDAT-----KILIVITDG-----KKEGDSLDYKDVIPMADAA 279 Query: 339 GMKIYSVAVSA---PPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 G+ Y++ V L F V D L + +++ +KI Sbjct: 280 GIIRYAIGVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKEKI 333 >gi|73992503|ref|XP_543015.2| PREDICTED: similar to Matrilin-4 precursor [Canis familiaris] Length = 624 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 52/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + R+G I Y+ + + +++ + L P T T A+ +A Sbjct: 67 DVGPNATRVGVIQYSSQVQSVFPLGAFSRREDMERAIRALVPLAQGTMTGLAIQYAMNVA 126 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 127 FSVAEGARP--PEARVPRIAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 176 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 177 RADVGS--LRAMASPPLDEHVFLVESFNLIQEF 207 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 58/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 403 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 457 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 458 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 511 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V ++ LR+ D + + + Sbjct: 512 ----ARRAKEEGIVMYAVGVGKAV--EEELRQIASEPAELHVSYSPDFGTMTHLLENLRG 565 Query: 386 KIQ 388 I Sbjct: 566 SIC 568 >gi|156742544|ref|YP_001432673.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233872|gb|ABU58655.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 562 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 43/344 (12%), Positives = 98/344 (28%), Gaps = 39/344 (11%) Query: 57 KDQTSTIFKKQI-KKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT 115 DQ + L + I D + + + + P Sbjct: 241 ADQMFNRGPGYLSAAVLYENLVIEAYNRDRYPSVSLPVVAIYPKEGTFWT------DHPY 294 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 L + ++ +++R + LA+ + + N + Sbjct: 295 AILNAPWVTDEQREAANIFLRYLLDRPQQELALRYGYRPSNTDVAVGAPITPENGVDPQQ 354 Query: 176 SNKYLLPPPPKKSFWSKNTTKSK----------YAPAPAPANRKIDVLIESAGNLVNSIQ 225 L P P ++ + +++ + V +Q Sbjct: 355 PQTLLEVPRPDVLSAIRSIWEQNKKRVDVMAVLDVSGSMEDEGRLEQAKAALRIFVEQLQ 414 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 + T+ +P+ + EV +R+ L P T + AY Sbjct: 415 DDDGFGLTIFSDQATV-------LTPISPIGSRRTEVLNRIAGLTPRGGTRLLDTVVEAY 467 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN-AGMKIYS 344 +EL + V+ +TDG ++ + ++ + L + R +K+++ Sbjct: 468 QELTATPPGQRI--------RAVVVLTDGLDN--RSQRSAEDVLDLLRQDREGYSIKVFT 517 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKITD 385 +A LL++ ++G V E + + IT Sbjct: 518 IAFG-GDADVHLLKEIASATGAKSYVGKPGERGAIERIYQDITT 560 >gi|297565996|ref|YP_003684968.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296850445|gb|ADH63460.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 717 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 55/172 (31%), Gaps = 15/172 (8%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 + S++ + + + + + V E +S L + T Sbjct: 326 AVAGSLELIRSARDQDYIGVVAFSSSARWVFRPRPMTPQGRREAESLLLSVRAGGGTEIG 385 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A A + L G K V+ +TDG TL TLQ + Sbjct: 386 EAYAEALQAL----------RGLKTEDKQVLVLTDGLVQDP-----TLPTLQAAHQAQAN 430 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++ +VA+ + + L G F+ V ++L F + + ++ Sbjct: 431 KIRTNAVALGSDADRAFLRELAKQGGGTFYDVPSPKDLPRFFLEEAQRAFQR 482 >gi|148704833|gb|EDL36780.1| coagulation factor C homolog (Limulus polyphemus), isoform CRA_a [Mus musculus] Length = 608 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 26/155 (16%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + N V + L + T T A+ R ++ S Sbjct: 463 KIAAVQFTYDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAIAFTVRNVFGPIRDS 522 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 523 PNK-------NFLVIVTDGQSYD--------DVRGPAAAAHDAGITIFSVGVAWAPLDD- 566 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 LR F + L + I Sbjct: 567 -LRDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 600 >gi|121606137|ref|YP_983466.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120595106|gb|ABM38545.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 354 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 26/237 (10%), Positives = 74/237 (31%), Gaps = 61/237 (25%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 S A ++ +A + + +VR+G +A+ Q +S Sbjct: 99 SGSMRATDVLPNRLVASQNAAKAFLADLP--------RNVRVGVVAFAGTAAVVQPPTVS 150 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN-------------------------- 290 ++ + ++K T + + EL+ Sbjct: 151 --REDLTAAIDKFQLQRGTAIGNGIIVSLAELFPEAGIDLESMENNRERKHGLSLDQAGK 208 Query: 291 ----EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 K++ + +I +TDG+ + +++L + + G+++Y+V Sbjct: 209 DDGNGKKAFTPVAPGSYTSAAIILLTDGQRTTG------IDSLDAAKVAADRGIRVYTVG 262 Query: 347 VSAPPE--------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQ 388 V ++ L+ + ++F + +L + + ++ ++ Sbjct: 263 VGTVEGETIGFEGWSMRVKLDEETLKGIARATQAEYFYAGTATDLKKVYQTLSSRLT 319 >gi|194217500|ref|XP_001502733.2| PREDICTED: integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Equus caballus] Length = 1163 Score = 63.0 bits (151), Expect = 8e-08, Method: Composition-based stats. Identities = 26/155 (16%), Positives = 51/155 (32%), Gaps = 16/155 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++ +R+ + N T T AM H ++ SS Sbjct: 230 LVQYGEVIQTEFDLQDSQDVMASLARVQNITQVGNVTKTASAMQHVLDHIFTPSHSSRKK 289 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQD 355 K ++ +TDG+ + + LN + + G++ +++ V + Sbjct: 290 AS-----KIMVVLTDGD-----IFGDPLNLTTVINSPKMQGVERFAIGVGEAFQKAKTDK 339 Query: 356 LLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L K+ I Sbjct: 340 ELKLIASDPDETHAFKVTNYAALDGLLSKLQQNII 374 >gi|332845803|ref|XP_510951.3| PREDICTED: integrin alpha-X [Pan troglodytes] Length = 1209 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 57/174 (32%), Gaps = 17/174 (9%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 N V ++ Q + + S+N + + +++L T T Sbjct: 172 NFVRAVISQFQRPSTQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQ--GFTYTA 229 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ + +L++ + K +I ITDG ++L+ + A Sbjct: 230 TAIQNVVHQLFHASYGARR-----DAAKILIVITDG-----KKEGDSLDYKDVIPMADAA 279 Query: 339 GMKIYSVAVSA---PPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 G+ Y++ V L F V D L + +++ +KI Sbjct: 280 GIIRYAIGVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKEKI 333 >gi|255526265|ref|ZP_05393182.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296186259|ref|ZP_06854663.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] gi|255510045|gb|EET86368.1| von Willebrand factor type A [Clostridium carboxidivorans P7] gi|296049060|gb|EFG88490.1| von Willebrand factor type A domain protein [Clostridium carboxidivorans P7] Length = 580 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 20/217 (9%), Positives = 77/217 (35%), Gaps = 15/217 (6%) Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 L+ + + + + + ++ E+ ++ Q + ++ Sbjct: 24 LVNAESSNTSSNLDVVFVLDSSGSMKESDPEEIRTEAIKMFLDMSQVQGNKFGLVAYSDN 83 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + + +++ +K+ + + T+T + A + + + +H Sbjct: 84 VVREHNLDT----INSNDDKERIKNMALNIPLGQKTDTGAGILEAVNLMNSGHDKNHKP- 138 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTL-NTLQICEYMRNAGMKIYSVAVSAPPE-GQDLL 357 +I ++DG+N +++L + ++ G +Y++ ++ + L Sbjct: 139 -------VIILLSDGKNDPQRKTEDSLKDLKSSISTCKDKGYPVYTIGLNYDGTVDKTQL 191 Query: 358 RKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + ++ G+ + + + +L + I + V+ Sbjct: 192 EEMSNETKGKNYITSTAADLPKILTDIYADNSKLKVQ 228 >gi|194221347|ref|XP_001494879.2| PREDICTED: similar to alpha 1 type VII collagen [Equus caballus] Length = 3065 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 56/162 (34%), Gaps = 19/162 (11%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR + Y+ + +V + +L+ NT T A+ H ++ Sbjct: 73 SAQGVRFAAVQYSDDPRTEFGLDALGSGGDVIHAIRELSYKGGNTRTGAAILHVADHVFL 132 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + K I ITDG++ + ++ G+K+++V + Sbjct: 133 PQLARPG------VPKVCILITDGKSQD--------LVDTAAQRLKGQGVKLFAVGI--K 176 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 177 NADHEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 218 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 67/199 (33%), Gaps = 21/199 (10%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 A + + + + LV+++ +V+IG ++Y+ S++ Sbjct: 1061 ATRDNAHRAEAVKRALERLVSALGPLGP----QAVQIGLLSYSHRPSPLFPLNSSHDPGV 1116 Query: 262 VKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + ++ + N A+ A+R L + ++ + D G Sbjct: 1117 ILQKIRSIPYMDPSGNNLGIAVVTAHRHLMAPDAPGRR----QHVPGVMVLLVDEPLRGD 1172 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 + AG+K+ + + + + L R + FFAVND L Sbjct: 1173 IFNP--------VREAQAAGLKVMMLGL-VGADPEQLRRLVPSTDPVQNFFAVNDGSSLD 1223 Query: 378 ESFDKITDKIQEQSVRIAP 396 ++ + + + ++ P Sbjct: 1224 QAVSSLATALCQTALATQP 1242 >gi|192360615|ref|YP_001982630.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686780|gb|ACE84458.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 318 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 52/136 (38%), Gaps = 21/136 (15%) Query: 254 PLSNNLNEVKSRLNKLNPYEN---TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ ++ V+ L + T A+ + + L K+ +I Sbjct: 149 PLTFDVKTVQEMLIEAESGYAGEATAIGDAIALSIKRL----------REQPNAKRVIIL 198 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP---EGQDLLRKCTDSSGQF 367 +TDGEN+ L + A KIY++A S + + + + G+F Sbjct: 199 LTDGENTAG-----ELGIATATDLAVKANTKIYTIAFSPYDREVDSHSMQQIAEQTGGEF 253 Query: 368 FAVNDSRELLESFDKI 383 F ++R+L E ++ Sbjct: 254 FRARNTRDLEEIHRQL 269 >gi|307595413|ref|YP_003901730.1| von Willebrand factor type A [Vulcanisaeta distributa DSM 14429] gi|307550614|gb|ADN50679.1| von Willebrand factor type A [Vulcanisaeta distributa DSM 14429] Length = 495 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 36/299 (12%), Positives = 86/299 (28%), Gaps = 43/299 (14%) Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 E + + G+ + + S DV + Sbjct: 228 SRDGERMFEALRNVVRNAMSGMGQVKVVRFTDIDKYPTYVVSVREYKIGDNYFDV-DLQK 286 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP----APANRKIDVLIESA 217 + + + + + + KI++ ++ Sbjct: 287 TAMNLSRKSMMHKLFTNKDIVVKEYANVKTIDIVLCLDVSGSMRELSNGMPKIEIAKDAV 346 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN------- 270 + + K N + + + ++ L++V+ ++N Sbjct: 347 AQYIQFLSKT-----NDRLAMVLFNFRADVL--------WGLHQVRRYWQQMNYMLKYVY 393 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 TN A+ + L + +S K VI +TDG +S ++ Sbjct: 394 AGGGTNLANALERSREVLTRSRSNS----------KHVICVTDGRTVNSSMC------IK 437 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 +R +G I ++A+ + + L+R G F ++ +L ++ I DK+ Sbjct: 438 EAVRLRRSGTTISTIAIGENSDDELLMRLSKIGGGLFIKISSIHDLGKAL--IMDKLHS 494 >gi|126303381|ref|XP_001379571.1| PREDICTED: similar to matrilin-4 [Monodelphis domestica] Length = 623 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 54/153 (35%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + R+G I Y+ + + +++ ++ + P T T A+ +A Sbjct: 67 DIGPNATRVGVIQYSSQVQSVFPLGAFSRREDMERAIHAIVPLAQGTMTGLAIQYAMNVA 126 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ RN G++IY+V V Sbjct: 127 FSVAEGARP--SQARVPRVAVIVTDGRPQD--------RVTEVAAQARNRGIEIYAVGVQ 176 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + + Sbjct: 177 RADVGS--LRAMASPPLDEHVFLVESFDLIQQF 207 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 63/191 (32%), Gaps = 27/191 (14%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + Y+ + +EVK + Sbjct: 403 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQYSSRVRTEFPLGRYGTADEVKQAVLA 457 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + T T A+ H ++E + + + + + + TDG + + Sbjct: 458 VEYMEKGTMTGLALRHLVEHSFSEAQGARPR--AQNVPRVGLVFTDGRSQDDISVW---- 511 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 + G+ +Y+V V ++ LR+ D F +T Sbjct: 512 ----AARAKEEGIIMYAVGVGKAV--EEELREIASDPPEQHVSYSPD-------FSTMTH 558 Query: 386 KIQEQSVRIAP 396 ++ V I P Sbjct: 559 MLENLKVNICP 569 >gi|90020471|ref|YP_526298.1| arginine biosynthesis bifunctional glutamate N-acetyltransferase/amino-acid acetyltransferase [Saccharophagus degradans 2-40] gi|89950071|gb|ABD80086.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 708 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 38/379 (10%), Positives = 97/379 (25%), Gaps = 37/379 (9%) Query: 30 QSALDAAVL----SGCASIVSDRTIKDPTTKKDQTSTIFKK-------QIKKHLKQGSYI 78 SA ++A L + VS +D DQ++ + + + Sbjct: 177 SSAAESAKLSKKPAASQRQVSAIRAQDIGALPDQSNAVALQRIAGMPVDGDTIVAPAPQG 236 Query: 79 RENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGI 138 + + + + ++ + + + A Y L L Sbjct: 237 NDKFEHVEENSVKSVAEAPVSTFSIDVD-TASYSFVRRQLNSGYLPEKDAIRAEELINYF 295 Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK 198 + +++ K + + P+ + Sbjct: 296 DYNYPLPSDSTAPFKPNITVIDSPWAKGKKLVHIGLKGYDIAPDQKPRTNLV-----FLL 350 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + K+ ++ +S L++++ N + + Y + + Sbjct: 351 DVSGSMNSQDKLPLVKQSMEMLLSTL--------NPDDTVAIVVYAGAAGTVLEPTPAKD 402 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ S + +L +T + AY + VI TDG+ + Sbjct: 403 KQKILSAMQRLQAGGSTAGGAGIALAYDLAEANFDKKAVNR--------VILATDGDFNV 454 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 N E R G+ + + L++ + + E Sbjct: 455 G--STNNETLQGFVERKREKGIFLSVLGFGQGNYNDHLMQTLAQNGNGV--AAYIDTVSE 510 Query: 379 SFDKITDKIQEQSVRIAPN 397 + + + IA + Sbjct: 511 AQKVLVQEASSSLFPIAKD 529 >gi|69244153|ref|ZP_00602689.1| von Willebrand factor, type A:Cna B-type [Enterococcus faecium DO] gi|293560613|ref|ZP_06677101.1| von Willebrand factor type A domain protein [Enterococcus faecium E1162] gi|294621687|ref|ZP_06700851.1| von Willebrand factor type A domain protein [Enterococcus faecium U0317] gi|314940439|ref|ZP_07847593.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a04] gi|314943384|ref|ZP_07850154.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133C] gi|314953415|ref|ZP_07856334.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133A] gi|314993087|ref|ZP_07858476.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133B] gi|314997388|ref|ZP_07862342.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a01] gi|68196610|gb|EAN11036.1| von Willebrand factor, type A:Cna B-type [Enterococcus faecium DO] gi|209491032|gb|ACI49667.1| putative pilus tip protein [Enterococcus faecium] gi|291598696|gb|EFF29749.1| von Willebrand factor type A domain protein [Enterococcus faecium U0317] gi|291605436|gb|EFF34882.1| von Willebrand factor type A domain protein [Enterococcus faecium E1162] gi|313588546|gb|EFR67391.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a01] gi|313592412|gb|EFR71257.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133B] gi|313594552|gb|EFR73397.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133A] gi|313597919|gb|EFR76764.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133C] gi|313640359|gb|EFS04940.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0133a04] Length = 1129 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 27/272 (9%), Positives = 78/272 (28%), Gaps = 26/272 (9%) Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 + + + +V+D S SM D L Sbjct: 279 NARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLADSGITDKINMGYVG 338 Query: 196 KSKYAPAPAPANRKI---DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S + + ++ D + ++ S + G++ Sbjct: 339 YSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGHKKVI 398 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 L++ + ++ +++ ++N Y + ++ + G+T L + Sbjct: 399 VLLTDGVPTFSYKVQRVHAQSSSNYYG----------TQFSNTQDRPGNTSLISRIYDAP 448 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTDS 363 D N + T+ ++ G++I+ + + + + +R+ S Sbjct: 449 DQNNLSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSS 508 Query: 364 ---SGQFFA-VNDSRELLESFDKITDKIQEQS 391 ++ + + ++ E K +I Sbjct: 509 DEKGDLYYESADHATDISEYLAKKAVQISATV 540 >gi|309790583|ref|ZP_07685138.1| von Willebrand factor, type A [Oscillochloris trichoides DG6] gi|308227385|gb|EFO81058.1| von Willebrand factor, type A [Oscillochloris trichoides DG6] Length = 430 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 70/198 (35%), Gaps = 37/198 (18%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 KID ++A V ++ ++ I +N +V + P +++ +++ + Sbjct: 109 NKIDGARDAAQAFVGMVRSDD--------QVALIGFNDQVVVLE--PFTDDQAILEAAIR 158 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN-------SGAS 320 +L T Y ++ L ++ + ++ +TDG++ + Sbjct: 159 RLRADGGTALYDSIVEGVDLLRDQPGR-----------RALLVLTDGQDCRDLDSCPDDA 207 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAV------SAPPEGQDLLRKCTD-SSGQFFAVNDS 373 +TL N + + + + + +L++ + G + D+ Sbjct: 208 GSSHTLAEAIAYANAANQPVTL--IGLGQRGSSGDDGIDERVLQRIATETRGSYAYSPDA 265 Query: 374 RELLESFDKITDKIQEQS 391 L + + +I +Q + Sbjct: 266 AALTDLYREIAGGLQSEY 283 >gi|226315298|ref|YP_002775194.1| hypothetical protein BBR47_57130 [Brevibacillus brevis NBRC 100599] gi|226098248|dbj|BAH46690.1| hypothetical membrane protein [Brevibacillus brevis NBRC 100599] Length = 424 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 22/172 (12%), Positives = 55/172 (31%), Gaps = 30/172 (17%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL----NPYENTNTYPAMHHAYREL 288 + + I + ++ Q ++ + + KL T A+ +L Sbjct: 147 DSDMNIAVVTFHDQTNVLQPLTELSSQSVKDEVVKKLLQFPRTDGGTRIDLALQAGLDQL 206 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV- 347 + V+ ++DG + L+ + + +++V + Sbjct: 207 QA----------NQMANSTVVLMSDGY--------SDLDVPAALAPYKQNQVIVHTVGMS 248 Query: 348 SAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKI------TDKIQEQSV 392 +G LL+K ++ G +F V + ++ F +I I + Sbjct: 249 QIDADGTALLQKIAAETGGSYFNVEHADQMTGIFGQIYDMSRTDRNIVSERT 300 >gi|219363061|ref|NP_001136879.1| hypothetical protein LOC100217033 [Zea mays] gi|194697454|gb|ACF82811.1| unknown [Zea mays] Length = 459 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 20/174 (11%), Positives = 56/174 (32%), Gaps = 8/174 (4%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K+ +L + ++ ++ + + + + + Q S ++ +N Sbjct: 5 KLALLKRAMRFVIENL-----DPSDRLSVVAFSSSAWRLFPLQRMTASGQQQSLQ-AVNS 58 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L TN + A R + + + + S L + + SA Sbjct: 59 LAADGGTNIAEGLRKAARVVEDRQARNPVC--SIMLLSDGVDSHNLPPRDGSAPDYGPLV 116 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + I++ + + + SSG F ++ + + ++F + Sbjct: 117 PRSILPGSEHHVPIHAFGFGMDHDSRAMHAVAQMSSGTFSFIDMAGSIQDAFAQ 170 >gi|311252475|ref|XP_003125114.1| PREDICTED: anthrax toxin receptor 1-like [Sus scrofa] Length = 564 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 64/196 (32%), Gaps = 18/196 (9%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 K ++ + +++ + + +R+ I ++ Sbjct: 37 PACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMSFIVFSTQGT--TLMK 94 Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 95 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENRQGYRT------ASVIIAL 148 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 149 TDGELHEELFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 202 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 203 DGFQALQGIIHSILKK 218 >gi|293569888|ref|ZP_06680975.1| putative pilus subunit protein PilB [Enterococcus faecium E1071] gi|291587636|gb|EFF19513.1| putative pilus subunit protein PilB [Enterococcus faecium E1071] Length = 1277 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 27/272 (9%), Positives = 77/272 (28%), Gaps = 26/272 (9%) Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 + + + +V+D S SM D L Sbjct: 279 NARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLADSGITDKINMGYVG 338 Query: 196 KSKYAPAPAPANRKI---DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S + + ++ D + ++ S + G + Sbjct: 339 YSSEGYSYSNGAVQMGSFDSVKNQVKSITPSWTNGGTFTQKALRDAGNMLSVPNGHKKVI 398 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 L++ + ++ +++ ++N Y + ++ + G+T L + Sbjct: 399 VLLTDGVPTFSYKVQRVHAQSSSNYYG----------TQFSNTQDRPGNTSLISRIYDAP 448 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTDS 363 D N + T+ ++ G++I+ + + + + +R+ S Sbjct: 449 DQNNLSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQSDSAAGLSKAEVESRMRQMVSS 508 Query: 364 ---SGQFFA-VNDSRELLESFDKITDKIQEQS 391 ++ + + ++ E K +I Sbjct: 509 DEKGDLYYESADHATDISEYLAKKAVQISATV 540 >gi|257880953|ref|ZP_05660606.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257892525|ref|ZP_05672178.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] gi|257816611|gb|EEV43939.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257828904|gb|EEV55511.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] Length = 1107 Score = 63.0 bits (151), Expect = 9e-08, Method: Composition-based stats. Identities = 27/272 (9%), Positives = 78/272 (28%), Gaps = 26/272 (9%) Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 + + + +V+D S SM D L Sbjct: 257 NARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLADSGITDKINMGYVG 316 Query: 196 KSKYAPAPAPANRKI---DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S + + ++ D + ++ S + G++ Sbjct: 317 YSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGHKKVI 376 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 L++ + ++ +++ ++N Y + ++ + G+T L + Sbjct: 377 VLLTDGVPTFSYKVQRVHAQSSSNYYG----------TQFSNTQDRPGNTSLISRIYDAP 426 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTDS 363 D N + T+ ++ G++I+ + + + + +R+ S Sbjct: 427 DQNNLSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSS 486 Query: 364 ---SGQFFA-VNDSRELLESFDKITDKIQEQS 391 ++ + + ++ E K +I Sbjct: 487 DEKGDLYYESADHATDISEYLAKKAVQISATV 518 >gi|149634622|ref|XP_001513644.1| PREDICTED: similar to AMACO [Ornithorhynchus anatinus] Length = 801 Score = 62.6 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 58/166 (34%), Gaps = 23/166 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G + +++ E K ++ K+ T T A+ + + Sbjct: 96 DINPDRVRVGAVQFSVASWLEFPLDSCLTRQEAKDKIKKIVFRGGSTETGLALKYILWKG 155 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ + + +I ITDG++ G + +++ G+ +++V V Sbjct: 156 FPGGRNA-------SVPQILILITDGKSQGNVTVP--------AQQLKDRGITVFAVGVR 200 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSV 392 P + L D ++++ + + +V Sbjct: 201 FPRWEE--LHLLASEPNEQHVLFAEDVD---DAYNGLMSTLTSSTV 241 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 59/167 (35%), Gaps = 16/167 (9%) Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT-NTYPAMHH 283 + +IG + + I + V ++++ + A+ H Sbjct: 572 SSLRFDINRDVTQIGLVVFGRQIRTVFALDTHPTGSGVLEAVSQMPFVGGVGSAGTALLH 631 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 Y E+ ++ + + K V+ ITDG + + + +R+ G+ ++ Sbjct: 632 VYDEVMTVQKGAR-----PGVSKAVVLITDG--------TGIEDAVVPAQKLRSNGVSVF 678 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + V P ++ L + S + ++L D D+I E+ Sbjct: 679 VIRVG--PFQKEALLRIAGSPSYLVQASSYKDLKSYEDSFIDRICEE 723 >gi|126334042|ref|XP_001370610.1| PREDICTED: similar to Integrin, alpha M (complement component 3 receptor 3 subunit) [Monodelphis domestica] Length = 1154 Score = 62.6 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 57/170 (33%), Gaps = 16/170 (9%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMH 282 ++ + + + + + Y+ + + N + +N++ T T A+ Sbjct: 173 VRTIMGQFRGTNTLFSLMQYSNDFRIHFTFNIFKNNPDPGILVNRIEQLGGLTFTATAIQ 232 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 +E++ + K +I ITDG + L + AG+ Sbjct: 233 KVVKEIFQSWNGAR-----KDAVKILIVITDG-----RKESDRLEYEDVIPLAEKAGIIR 282 Query: 343 YSVAVS---APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 Y++ V P LR F VN+ L +++ +KI Sbjct: 283 YAIGVGNTFTTPSALQELRTIASQPSQEHVFQVNNFAALRNIQNQLQEKI 332 >gi|119606782|gb|EAW86376.1| inter-alpha (globulin) inhibitor H2, isoform CRA_a [Homo sapiens] gi|119606783|gb|EAW86377.1| inter-alpha (globulin) inhibitor H2, isoform CRA_a [Homo sapiens] Length = 947 Score = 62.6 bits (150), Expect = 9e-08, Method: Composition-based stats. Identities = 26/223 (11%), Positives = 71/223 (31%), Gaps = 23/223 (10%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 L P PK + + + S + + ++ + +L ++ + N ++R Sbjct: 304 LDPIPKNILFVIDVSGSMWG---VKMKQTVEAMKTILDDLRAEDHFSVIDF-NQNIRTWR 359 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + + K + K+ P TN A+ A L Sbjct: 360 NDLISATKTQ--------VADAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPN 411 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 S +I ++DG+ + + + + ++S+ + D L++ Sbjct: 412 S---VSLIILVSDGDPTVGKCELKLSKIQKNVKENIQDNISLFSLGMGFDV-DYDFLKRL 467 Query: 361 TDSSGQ------FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + + S +L + +++++ + P+ Sbjct: 468 -SNENHGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYPH 509 >gi|218460583|ref|ZP_03500674.1| hypothetical protein RetlK5_14236 [Rhizobium etli Kim 5] Length = 309 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 39/298 (13%), Positives = 86/298 (28%), Gaps = 32/298 (10%) Query: 31 SALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQ 90 S LDAA+++ I + T K + S F Q+ G Sbjct: 1 SDLDAALIAAVKQIDNVEDAD---TLKKKVSDWFHAQVDNSYSLGE-------------- 43 Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 + + + + A +PT + A S + +++ Sbjct: 44 -----IEIDTANHNITATASGTVPTTFM------KIANIESVDVSVASAVKGPATSYLNV 92 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP-APANRK 209 +V+D S SM K + Y + + Sbjct: 93 YIVIDTSPSMLLAATTSGQATMYAGIGCQFACHTGDAHTIGKKKYANNYEYSSEKKIKLR 152 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 DV ++ +++ I ++ + + +++G + + LS + + Sbjct: 153 ADVAGDAVREVLDMIDES--DANHERIKVGLYSLGDTLTEVLTPTLSTDTARTRLADASY 210 Query: 270 NPYENTN-TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +T+ + L + + + S K V+ +TDG S + + Sbjct: 211 GLTSSTSKAATYFDVSLATLKQKVGAGGDGSTSGSPLKLVLLLTDGVQSQREWVTDGV 268 >gi|301766292|ref|XP_002918563.1| PREDICTED: cochlin-like [Ailuropoda melanoleuca] Length = 550 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + + T T A+ R ++ S Sbjct: 405 KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPVRDS 464 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 465 PNK-------NFLVIVTDGQSYD--------DVRGPAAAAHDAGITIFSVGVAWAPLDD- 508 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 509 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 542 >gi|282864727|ref|ZP_06273782.1| von Willebrand factor type A [Streptomyces sp. ACTE] gi|282560666|gb|EFB66213.1| von Willebrand factor type A [Streptomyces sp. ACTE] Length = 424 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 69/196 (35%), Gaps = 26/196 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ ++ ++++++ + N + PL + Sbjct: 58 GQSRMSAAKQAFNDVLDAVPEEVQLGIRTLGANYPGDDRKVGCKDTKQLYPVGPL--DRT 115 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 E K+ + L P T PA+ A +L + ++ ITDGE++ Sbjct: 116 EAKTAVATLAPTGWTPIGPALLGAADDL-----------DGGDATRRIVLITDGEDTCG- 163 Query: 321 AYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 L+ ++ + G+ I ++ + + + L + + G + AV + EL Sbjct: 164 ----PLDPCEVARDIAARGIHLVIDTLGLVPNAKIRQQLTCIAEATGGTYTAVQHADELS 219 Query: 378 ESFDKITDKIQEQSVR 393 ++ D+ E ++ Sbjct: 220 GRVKQLVDRAAEPTIT 235 >gi|149036652|gb|EDL91270.1| anthrax toxin receptor 1 [Rattus norvegicus] Length = 457 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 30/145 (20%), Positives = 50/145 (34%), Gaps = 16/145 (11%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G L+ + +++ L +L P +T + A ++Y E + T Sbjct: 7 STRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRT---- 62 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +I +TDGE + + R+ G +Y V V + L + D Sbjct: 63 --ASVIIALTDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIAD 114 Query: 363 SSGQFFAVND-SRELLESFDKITDK 386 S F VND + L I K Sbjct: 115 SKDHVFPVNDGFQALQGIIHSILKK 139 >gi|148666786|gb|EDK99202.1| anthrax toxin receptor 1 [Mus musculus] Length = 533 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 30/145 (20%), Positives = 50/145 (34%), Gaps = 16/145 (11%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G L+ + +++ L +L P +T + A ++Y E + T Sbjct: 82 STRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRT---- 137 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +I +TDGE + + R+ G +Y V V + L + D Sbjct: 138 --ASVIIALTDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIAD 189 Query: 363 SSGQFFAVND-SRELLESFDKITDK 386 S F VND + L I K Sbjct: 190 SKDHVFPVNDGFQALQGIIHSILKK 214 >gi|54124354|gb|AAV29939.1| anthrax toxin receptor [Rattus norvegicus] Length = 245 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 30/145 (20%), Positives = 50/145 (34%), Gaps = 16/145 (11%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G L+ + +++ L +L P +T + A ++Y E + T Sbjct: 25 STRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRT---- 80 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +I +TDGE + + R+ G +Y V V + L + D Sbjct: 81 --ASVIIALTDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIAD 132 Query: 363 SSGQFFAVND-SRELLESFDKITDK 386 S F VND + L I K Sbjct: 133 SKDHVFPVNDGFQALQGIIHSILKK 157 >gi|73970245|ref|XP_855427.1| PREDICTED: similar to tumor endothelial marker 8 isoform 1 precursor [Canis familiaris] Length = 555 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 30/145 (20%), Positives = 50/145 (34%), Gaps = 16/145 (11%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G L+ + +++ L +L P +T + A ++Y E + T Sbjct: 77 STRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRT---- 132 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +I +TDGE + + R+ G +Y V V + L + D Sbjct: 133 --ASVIIALTDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIAD 184 Query: 363 SSGQFFAVND-SRELLESFDKITDK 386 S F VND + L I K Sbjct: 185 SKDHVFPVNDGFQALQGIIHSILKK 209 >gi|154149676|ref|YP_001403294.1| von Willebrand factor, type A [Candidatus Methanoregula boonei 6A8] gi|153998228|gb|ABS54651.1| von Willebrand factor, type A [Methanoregula boonei 6A8] Length = 1081 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 38/389 (9%), Positives = 91/389 (23%), Gaps = 39/389 (10%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP 100 ++ +K + + +++T D N Sbjct: 449 ATDSGYVTATATWSSISQPLQLSWKNYPYLSELTTISSQNVNVNGKFYVTVSLTADGWNL 508 Query: 101 LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI----SICMVLDV 156 A+ ++ L + + + + Sbjct: 509 TGTPADVVIVSDLAAGIGGATRLTHTKAAEVGFIKNATDNTYVALASFGSAPNAGSTPYD 568 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 S ++L+ + + +N + + P + N + Sbjct: 569 SSDTQNLWNLQLSKSNTTYTYRPFNPYGNVWDYNLVNPANWNSISSSTAYCFNSSSQKNP 628 Query: 217 AGN---LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + L + + ++ T + N N + + +N Y Sbjct: 629 SSQSLGLRACVNATSPYGYTYLNPWSDSKIDADLMNAGPTYKTTNQNALVNTVNAYTAYG 688 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV------------IFITDGENSGASA 321 T+ ++ A +EL ++ SHN + +D + Sbjct: 689 GTDYAAGINAALQELQSKGNPSHNQTIIIMGDGVNMMAPIAPGSFESYWPSDWNPRNGTG 748 Query: 322 YQNTLN-------------TLQICEYMRNAGMKIYSVAVSAPP------EGQDLLRKCTD 362 N L +N G+ IY + P ++ Sbjct: 749 ISEGPNLWYLDESDVGKAAALNASTTAKNLGITIYGIQFPTPDNYGHNINDTAFFQQMVS 808 Query: 363 SS-GQFFAVNDSRELLESFDKITDKIQEQ 390 S ++ D + F +I +IQ Sbjct: 809 SPTSTWYYAPDPTTMTGIFQQIEGQIQNT 837 >gi|116695554|ref|YP_841130.1| hypothetical protein H16_B1615 [Ralstonia eutropha H16] gi|113530053|emb|CAJ96400.1| conserved hypothetical protein [Ralstonia eutropha H16] Length = 352 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 52/169 (30%), Gaps = 42/169 (24%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS-------------- 301 S + + + +++L P T + A L + + + Sbjct: 154 SRSKDAAATAIDRLKPQGGTALGNGLLIALTTLLPQTAGDAERLMNGDTTPLQKPDASHS 213 Query: 302 -------TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-- 352 + ++ +DGE++ A G+++Y+V V Sbjct: 214 GEAVTPGSYPSGAIVLFSDGESNTGPAATQAAQLAAA------HGVRVYTVGVGTTDGVV 267 Query: 353 ------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQ 388 + +L++ + ++F + D+ +L + + ++ Sbjct: 268 LSVDGWSARVRLDEKVLKEVANATGAEYFPLADAAQLKRVYRALNMRLT 316 >gi|298705150|emb|CBJ28593.1| glycin-rich protein [Ectocarpus siliculosus] Length = 1238 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/212 (16%), Positives = 62/212 (29%), Gaps = 28/212 (13%) Query: 182 PPPPKKSFWSKN----TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK---AIQEKKNL 234 P S +S TT + A + + L K A ++NL Sbjct: 58 CPCEAGSQYSFTMEGQTTVTANTVNVAVIIDASVSVGTADWELSKEFAKNTVASFAQQNL 117 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKE 293 G+ ++ +L + + ++ T+ + L Sbjct: 118 FTNGGSASFAQFSSDASEGGTFYSLEDFNAFVDADAKYSGGTDIIDGIAKGRELLSA--- 174 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S F+I TDG + + R G +Y+V V + P Sbjct: 175 -------SPTTTSFMIVTTDGVAP---------DPQDEADAARAEGTILYAVGVGSGPSQ 218 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 ++LL F V++ EL + D I Sbjct: 219 ENLL-AIGGDEANVFDVDNFEELDLALDDIVS 249 >gi|297694868|ref|XP_002824689.1| PREDICTED: LOW QUALITY PROTEIN: cochlin-like [Pongo abelii] Length = 482 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 22/155 (14%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + + T T A+ R ++ S Sbjct: 337 KIAAVQFTYDQRTEFSFTDYSTKENVLAAIRNIRYMSGGTATGDAISFTVRNVFGPIRES 396 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ ++SV V+ P Sbjct: 397 PNK-------NFLVIVTDGQSYD--------DVQGPAAAAHDAGITVFSVGVAWAPLDD- 440 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 441 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 474 >gi|223974345|gb|ACN31360.1| unknown [Zea mays] Length = 516 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 34/247 (13%), Positives = 72/247 (29%), Gaps = 35/247 (14%) Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 S + +V + + + N L + Sbjct: 12 SPNSGPRPTPIVPGRVQLVSKNNNMAPLEENTQKVLLELTGGDSTSDRSGLDLVAVLDVS 71 Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL----SN 257 KI+ + + +V + R+ + + N+ PL + Sbjct: 72 GSMQGE-KIEKMKTAMKFVVKKLSSID--------RLSIVTFLDTA--NRICPLRQVTED 120 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + ++ ++ L P NTN + + L + K SS +G V+ ++DG+ + Sbjct: 121 SQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVVG-------VMLMSDGQQN 173 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSRE 375 ++ + +Y+ A +L G F VND Sbjct: 174 RGEP----------AANVKIGNVPVYTFGFGAD-YDPTVLNAVARNSMGGTFSVVNDVNL 222 Query: 376 LLESFDK 382 L +F + Sbjct: 223 LSMAFSQ 229 >gi|119890597|ref|XP_001256059.1| PREDICTED: alpha 3 type VI collagen, partial [Bos taurus] Length = 1632 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 21/185 (11%), Positives = 64/185 (34%), Gaps = 20/185 (10%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + + +++ N R + +N + EV S ++ ++ +N Sbjct: 57 VREFLYDVIESLAVGDNDF-RFALVQFNGNPHTEFLFNTYRSKQEVLSHVSNMSYIGGSN 115 Query: 277 -TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T + + + E + + S + + ++ +T G + A + + Sbjct: 116 QTGKGLAYVMQNHLTE---AAGSRASDGVPQVIVVLTHGHSEDGLALPS--------AEL 164 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++A + ++++ V + L++ F + + L + + ++ Sbjct: 165 KSADVNVFAIGV--EDADEAALKEIASEPLNMHVFNLENYTSLHDIVGNLVACVRSS--- 219 Query: 394 IAPNR 398 +AP R Sbjct: 220 MAPER 224 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 60/177 (33%), Gaps = 16/177 (9%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NLVNS+ + + +R+G + ++ V +E Sbjct: 646 SSNVGETNFPYVRDFVMNLVNSL-----DVGSDHIRVGLVQFSDTPVTEFSLNTYPTKSE 700 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + L ++ + +Y N + + + + ++ +T G+ Sbjct: 701 LLAHLRQMQLQGGSVLNTGAALSY-VHANHFTEAGGSRIQDHVPQLLLLLTAGQ------ 753 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + LQ + AG+ + V + L + + G + ++D L Sbjct: 754 --SEDSYLQAANALARAGILTFCVG--TSQADRAELEEIAFNPGLVYLMDDFSSLPA 806 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 18/156 (11%), Positives = 50/156 (32%), Gaps = 15/156 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 +R+G + Y+ + +V + L N A+ Sbjct: 272 SIGTQQIRVGVVQYSDEPRTMFSLNSYSTKAQVLDAVKALGFIGGELANVGLALDFVVEN 331 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + + + + ++ I+ G +S ++ + L+ ++S + Sbjct: 332 HFT---RAGGSRAEEGVPQVLVLISAGPSSD--EIRDGVIALKQAS--------VFSFGL 378 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 A + L+ + F V + R L + +++ Sbjct: 379 GAQAASKAELQHIATNDNLVFTVPEFRSLGDVQEQL 414 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 58/169 (34%), Gaps = 15/169 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRE 287 VRIG + ++ + + V + +L + NT A+ R Sbjct: 1466 NVGPNKVRIGVLQFSNDVFPEFQLKTYKSQASVLDAIRRLRFKGGSPLNTGKALEFVARN 1525 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + S + + + ++ G++ + + + +++AG+ S+ V Sbjct: 1526 YF---VKSAGSRIEDGVPQHLVLFLGGKSQDDISRYS--------QVIKSAGI--ASLGV 1572 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + L+ T F V + R+L +++ + V AP Sbjct: 1573 GDRNIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNSFGSSGVTPAP 1621 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 20/161 (12%), Positives = 54/161 (33%), Gaps = 16/161 (9%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + + + + + E + +++ Y + + EV + + K+ + Sbjct: 461 NAIRDFIAKVIQ-RLEIRQDLIQVAVAQYADTVRPEFYFNTYPSKREVINAVRKMKALDG 519 Query: 275 TN--TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 + T A+ L+ E + + + K ++ +T G+ + Q Sbjct: 520 SALYTGSALDFVRNNLFTE---AAGYRAAEGVPKLLVLVTGGK--------SLDAVSQPA 568 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 + ++ +G+ + AV Q L + S F + Sbjct: 569 QELKRSGIL--AFAVGNKVADQAELEEIAFDSSLVFTATEF 607 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 31/290 (10%), Positives = 76/290 (26%), Gaps = 31/290 (10%) Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 +S + + P NL G++P L + + I + L ++ Sbjct: 947 VAGKSSDRVDTPALNLKQSGVVPFILQAKNADPAELELIVPSPAFILVAESLPKIGDLQP 1006 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + + P + S+ + P +L E +V Sbjct: 1007 QIVNLL-----KSVQNGAPAPVSVEKDVVFLIDGSEGVRSGFP------LLKEFVQRVVE 1055 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPA 280 S+ + VR+ + Y+ + V + L NT A Sbjct: 1056 SL-----DVGPDRVRVAVVQYSDRTRPEFYLNSYMDQQSVVGAIRGLTLLGGPAPNTGAA 1110 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + R + S + + +I +T + + + Sbjct: 1111 LEFVLRNILVGSAGSRIA---EGVPQLLIVLTADRSGDDVRGPSVVLRRGGAVP------ 1161 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ + + R+L I++++ + Sbjct: 1162 ----IGIGIGNADITEMQTLSFVPDFAVVIPTFRQLGTIQQVISERVTQL 1207 >gi|114652507|ref|XP_001170996.1| PREDICTED: coagulation factor C homolog, cochlin isoform 3 [Pan troglodytes] Length = 534 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + + T T A+ R ++ S Sbjct: 389 KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPIRES 448 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 449 PNK-------NFLVIVTDGQSYD--------DVQGPAAAAHDAGITIFSVGVAWAPLDD- 492 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 493 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 526 >gi|233142080|gb|ACQ91095.1| complement factor B-like protein [Ruditapes decussatus] Length = 697 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 63/195 (32%), Gaps = 9/195 (4%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQ-KAIQEKKNLSVRIGTIAYNIGIVGNQCTP--L 255 + + +R ++ + LV+ R + ++ Sbjct: 201 VDVSSSIGDRSMESAKKFMKLLVDIFGVSNETSGGKNGTRFALLTFSNEADIVFNLNDGT 260 Query: 256 SNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + + EVK R++++ TN A+ ++ + + + V +TD Sbjct: 261 ARSKEEVKRRIDEIQNTGGGTNFRAALLKVVGGIFFNVIKKESQRLNHAT-RAVFLLTDA 319 Query: 315 ENSGASAYQNTLNTLQICEYMRNAG-MKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVN 371 E + Q ++N G +I+ + V + L + + F ++ Sbjct: 320 EETSTLEKDRLPRIRQAANDLKNEGHFEIFCIGVGQNI-DETTLAEIASTPHIEHVFTLS 378 Query: 372 DSRELLESFDKITDK 386 +L + D I +K Sbjct: 379 KFDDLEKVGDIIAEK 393 >gi|194207263|ref|XP_001489838.2| PREDICTED: coagulation factor C homolog, cochlin (Limulus polyphemus) [Equus caballus] Length = 549 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + + T T A+ R ++ S Sbjct: 404 KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPMRDS 463 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 464 PNK-------NFLVIVTDGQSYD--------DVRGPAAAAHDAGITIFSVGVAWAPLDD- 507 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 508 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 541 >gi|109083237|ref|XP_001114797.1| PREDICTED: cochlin-like isoform 4 [Macaca mulatta] Length = 550 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 47/155 (30%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + +V + + + T T A+ R ++ S Sbjct: 405 KIAAVQFTYDQRTEFSFTDYSTKEDVLAVIRNIRYMSGGTATGDAISFTVRNVFGPIRES 464 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 465 PNK-------NFLVIVTDGQSYD--------DVQGPAAAAHDAGITIFSVGVAWAPLDD- 508 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 509 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 542 >gi|225028889|ref|ZP_03718081.1| hypothetical protein EUBHAL_03177 [Eubacterium hallii DSM 3353] gi|224953773|gb|EEG34982.1| hypothetical protein EUBHAL_03177 [Eubacterium hallii DSM 3353] Length = 1070 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 48/370 (12%), Positives = 106/370 (28%), Gaps = 49/370 (13%) Query: 43 SIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQ 102 + + + KKD+T K L ++ + +T + + Sbjct: 279 QMDTLQMQHLKEEKKDRTVETVAKAADHKLSVSKSKVTAKFEVKSFSDFILTWNIDATPA 338 Query: 103 Y---IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 ++ A E + L +L+L G + + + +LD S S Sbjct: 339 DPLETGDNAASIEKQINHEKYATLRDDGTYDLTLTVAGKKGTETNKAKLDVIYILDKSGS 398 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 M++ + + T +K A + + ++ S Sbjct: 399 MKEDFGGTSKRIAA----------------SNAITALTKSLKQNANIDARFSMVTFSGNK 442 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 + + + + + + G + P SN TN Sbjct: 443 TTGMWGQGDTKTWDDAEVAVSWTTDAGTIERGSKPTSN---------------GGTNYQA 487 Query: 280 AMHHAYRELYNEKESSHNTIG-STRLKKFVIFITDGEN--SGASAYQNTLNTLQICEYMR 336 + A L +++ + + + + DG G + + L++C Sbjct: 488 GIRTAKELLTSKRAGAMTAVIFISDGDPTFYYNPDGYTRGDGNNDGNGGADNLKVCLDAA 547 Query: 337 NAGM------KIYSVAVSAPPEGQDLLRKCTDSS---GQFFAVNDSRELLESFDKITDKI 387 + Y+V V + +L C+ S + F ++ EL ++F I I Sbjct: 548 KNEIANLGVNYFYTVGVGKANDYVNLSDLCSASGVSGAKNFDGTNTDELTKAFSTIESDI 607 Query: 388 QEQSVRIAPN 397 + N Sbjct: 608 ---LTFLCSN 614 >gi|148704834|gb|EDL36781.1| coagulation factor C homolog (Limulus polyphemus), isoform CRA_b [Mus musculus] Length = 574 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/155 (16%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + N V + L + T T A+ R ++ S Sbjct: 429 KIAAVQFTYDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAIAFTVRNVFGPIRDS 488 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 489 PNK-------NFLVIVTDGQSYD--------DVRGPAAAAHDAGITIFSVGVAWAPLDD- 532 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 LR F + L + I Sbjct: 533 -LRDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 566 >gi|283807143|pdb|3K6S|A Chain A, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807145|pdb|3K6S|C Chain C, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807147|pdb|3K6S|E Chain E, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807149|pdb|3K6S|G Chain G, Structure Of Integrin Alphaxbeta2 Ectodomain gi|283807156|pdb|3K71|A Chain A, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807158|pdb|3K71|C Chain C, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807160|pdb|3K71|E Chain E, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807162|pdb|3K71|G Chain G, Structure Of Integrin Alphax Beta2 Ectodomain gi|283807164|pdb|3K72|A Chain A, Structure Of Integrin Alphax Beta2 gi|283807166|pdb|3K72|C Chain C, Structure Of Integrin Alphax Beta2 Length = 1095 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 56/174 (32%), Gaps = 17/174 (9%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 N V ++ Q + + S+N + + +++L T T Sbjct: 153 NFVRAVISQFQRPSTQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQ--GFTYTA 210 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ + L++ + K +I ITDG ++L+ + A Sbjct: 211 TAIQNVVHRLFHASYGARR-----DAAKILIVITDG-----KKEGDSLDYKDVIPMADAA 260 Query: 339 GMKIYSVAVSA---PPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 G+ Y++ V L F V D L + +++ +KI Sbjct: 261 GIIRYAIGVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKEKI 314 >gi|240137370|ref|YP_002961839.1| hypothetical protein MexAM1_META1p0632 [Methylobacterium extorquens AM1] gi|240007336|gb|ACS38562.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain; putative membrane protein [Methylobacterium extorquens AM1] Length = 339 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 55/165 (33%), Gaps = 23/165 (13%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEK- 292 RIG + + + + V L + + +T + A + L + Sbjct: 143 RIGLVEFADQAYVAAAP--TFDTATVARTLEEATIGLVGRSTGIGDGLGLALKRLAPAQV 200 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA--- 349 ++ K V+ ++DG N+ + ++ G+++Y++A+ Sbjct: 201 AAADGAGPPPSRDKVVVLLSDGANNAGQT-----APKDVAALAKDLGVRVYTIALGPIDM 255 Query: 350 --------PPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + LR S G+ F V + +L + I + Sbjct: 256 ADNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENVANAIDE 300 >gi|34452173|ref|NP_000878.2| integrin alpha-X precursor [Homo sapiens] gi|146345441|sp|P20702|ITAX_HUMAN RecName: Full=Integrin alpha-X; AltName: Full=CD11 antigen-like family member C; AltName: Full=Leu M5; AltName: Full=Leukocyte adhesion glycoprotein p150,95 alpha chain; AltName: Full=Leukocyte adhesion receptor p150,95; AltName: CD_antigen=CD11c; Flags: Precursor Length = 1163 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 56/174 (32%), Gaps = 17/174 (9%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 N V ++ Q + + S+N + + +++L T T Sbjct: 172 NFVRAVISQFQRPSTQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQ--GFTYTA 229 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ + L++ + K +I ITDG ++L+ + A Sbjct: 230 TAIQNVVHRLFHASYGARR-----DAAKILIVITDG-----KKEGDSLDYKDVIPMADAA 279 Query: 339 GMKIYSVAVSA---PPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 G+ Y++ V L F V D L + +++ +KI Sbjct: 280 GIIRYAIGVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKEKI 333 >gi|23398603|gb|AAH38237.1| ITGAX protein [Homo sapiens] gi|123983342|gb|ABM83412.1| integrin, alpha X (complement component 3 receptor 4 subunit) [synthetic construct] gi|123998043|gb|ABM86623.1| integrin, alpha X (complement component 3 receptor 4 subunit) [synthetic construct] Length = 1169 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 56/174 (32%), Gaps = 17/174 (9%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 N V ++ Q + + S+N + + +++L T T Sbjct: 172 NFVRAVISQFQRPSTQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQ--GFTYTA 229 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ + L++ + K +I ITDG ++L+ + A Sbjct: 230 TAIQNVVHRLFHASYGARR-----DAAKILIVITDG-----KKEGDSLDYKDVIPMADAA 279 Query: 339 GMKIYSVAVSA---PPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 G+ Y++ V L F V D L + +++ +KI Sbjct: 280 GIIRYAIGVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKEKI 333 >gi|60654537|gb|AAX29959.1| integrin alpha X [synthetic construct] Length = 1170 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 56/174 (32%), Gaps = 17/174 (9%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 N V ++ Q + + S+N + + +++L T T Sbjct: 172 NFVRAVISQFQRPSTQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQ--GFTYTA 229 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ + L++ + K +I ITDG ++L+ + A Sbjct: 230 TAIQNVVHRLFHASYGARR-----DAAKILIVITDG-----KKEGDSLDYKDVIPMADAA 279 Query: 339 GMKIYSVAVSA---PPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 G+ Y++ V L F V D L + +++ +KI Sbjct: 280 GIIRYAIGVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKEKI 333 >gi|239983463|ref|ZP_04705987.1| hypothetical protein SalbJ_28780 [Streptomyces albus J1074] Length = 423 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 69/203 (33%), Gaps = 33/203 (16%) Query: 206 ANRKIDVLIESAGNLVNSIQK---------AIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 ++ ++ ++++++ + R P+ Sbjct: 53 GQSRMSAAKQAFNDVIDAVPEEVELGIRTLGADYPGEDKARGC----KDTRQLYPVGPI- 107 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + E K+ + L+P T PA+ A +L G + ++ I+DGE+ Sbjct: 108 -DRTEAKTAVATLSPTGWTPIGPALLGAADDLD----------GDEGGSRRIVLISDGED 156 Query: 317 SGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDS 373 + L+ ++ + G+ I ++ + + + L + G + AV Sbjct: 157 TCG-----PLDPCEVAREIAAKGVDLVIDTLGLVPNAKIRQQLSCIAGATGGTYTAVQHK 211 Query: 374 RELLESFDKITDKIQEQSVRIAP 396 +L + ++ D+ + V P Sbjct: 212 EDLSDKVKQLVDRAADPVVTPVP 234 >gi|134093121|gb|ABO52981.1| matrilin 4 isoform 1 precursor [Callithrix jacchus] Length = 580 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 50/153 (32%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + R+G I Y+ + + ++ + L P T T A+ +A Sbjct: 64 NVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMVRAIRDLVPLAQGTMTGLAIQYAMNVA 123 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 124 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 173 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 174 RADVGS--LRAMASPPLEEHVFLVESFDLIQEF 204 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 56/183 (30%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 359 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 413 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 414 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 467 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V E R+ D + + + Sbjct: 468 ----AARAKEEGIVMYAVGVGKAVEAD--WREIASEPAEQHMSYAPDFGTMTHLLENLKG 521 Query: 386 KIQ 388 I Sbjct: 522 SIC 524 >gi|116623283|ref|YP_825439.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116226445|gb|ABJ85154.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 299 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 35/188 (18%), Positives = 75/188 (39%), Gaps = 30/188 (15%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-------- 266 ++ + + V + + I + PL+N+L ++ L Sbjct: 92 DAERGASYRFIETVLRPNKDQVFLMQFDFRIFMRQ----PLTNSLRQLSDSLPYVDTPTF 147 Query: 267 NKLNP--YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 N+L T Y A+ A +E+ + K +I +TDGE+ Y + Sbjct: 148 NQLRAQSGGGTLLYDAVVTASQEVMLNRTGR----------KALILLTDGED-----YGS 192 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKI 383 + E + A IYS+ + +G+ L++ + ++ G FF V+ +++ + F I Sbjct: 193 DASVGDAIEAAQRADTLIYSILFADQGDGRRPLQRMSKETGGSFFEVSKKQDIDQIFTAI 252 Query: 384 TDKIQEQS 391 ++++ Q Sbjct: 253 QEELRSQY 260 >gi|291455286|ref|ZP_06594676.1| von Willebrand factor [Streptomyces albus J1074] gi|291358235|gb|EFE85137.1| von Willebrand factor [Streptomyces albus J1074] Length = 422 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 69/203 (33%), Gaps = 33/203 (16%) Query: 206 ANRKIDVLIESAGNLVNSIQK---------AIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 ++ ++ ++++++ + R P+ Sbjct: 52 GQSRMSAAKQAFNDVIDAVPEEVELGIRTLGADYPGEDKARGC----KDTRQLYPVGPI- 106 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + E K+ + L+P T PA+ A +L G + ++ I+DGE+ Sbjct: 107 -DRTEAKTAVATLSPTGWTPIGPALLGAADDLD----------GDEGGSRRIVLISDGED 155 Query: 317 SGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDS 373 + L+ ++ + G+ I ++ + + + L + G + AV Sbjct: 156 TCG-----PLDPCEVAREIAAKGVDLVIDTLGLVPNAKIRQQLSCIAGATGGTYTAVQHK 210 Query: 374 RELLESFDKITDKIQEQSVRIAP 396 +L + ++ D+ + V P Sbjct: 211 EDLSDKVKQLVDRAADPVVTPVP 233 >gi|290995572|ref|XP_002680357.1| predicted protein [Naegleria gruberi] gi|284093977|gb|EFC47613.1| predicted protein [Naegleria gruberi] Length = 269 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 20/197 (10%), Positives = 60/197 (30%), Gaps = 21/197 (10%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 KI+++ + + + ++ + + + + + ++ K ++ Sbjct: 64 SKIEMVKSTLAFMFDQLKPTDRIALVEFDSNISTSLQFTNMNE------SGRSKAKQVVS 117 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT-- 325 + TN A+ R + + ++ TDG + N Sbjct: 118 NIRAGSCTNLSGALFEGLRLI--------GQRTNANEVTSLLLFTDGLANEGITNTNEIV 169 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK-IT 384 + + +++ + L +G ++ + + ++ ++F I Sbjct: 170 KKMTTMIHEEIRTNLTVFTFGFGTDTDANMLTSISQAGNGLYYFLQTTDDIPKAFGNVIG 229 Query: 385 DKIQ----EQSVRIAPN 397 I V+I PN Sbjct: 230 GLISVVGQNIKVKIEPN 246 >gi|258616219|ref|ZP_05713989.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium DO] Length = 1095 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/272 (9%), Positives = 78/272 (28%), Gaps = 26/272 (9%) Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 + + + +V+D S SM D L Sbjct: 279 NARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLADSGITDKINMGYVG 338 Query: 196 KSKYAPAPAPANRKI---DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S + + ++ D + ++ S + G++ Sbjct: 339 YSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGHKKVI 398 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 L++ + ++ +++ ++N Y + ++ + G+T L + Sbjct: 399 VLLTDGVPTFSYKVQRVHAQSSSNYYG----------TQFSNTQDRPGNTSLISRIYDAP 448 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTDS 363 D N + T+ ++ G++I+ + + + + +R+ S Sbjct: 449 DQNNLSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSS 508 Query: 364 ---SGQFFA-VNDSRELLESFDKITDKIQEQS 391 ++ + + ++ E K +I Sbjct: 509 DEKGDLYYESADHATDISEYLAKKAVQISATV 540 >gi|94969085|ref|YP_591133.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551135|gb|ABF41059.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 349 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 61/166 (36%), Gaps = 24/166 (14%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + Y + P +++L + ++++ T Y + A + L Sbjct: 155 RPQDRLSVYAFSETVEEIVPFTSDLRRIDRGISEIIAGSATAMYDTIFLASKAL------ 208 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV------- 347 +K ++ ITDG ++ +S Q + +YS+ V Sbjct: 209 -----MKHDGRKVMVLITDGGDTFSSTSY-----EQAARAATQSETLLYSIIVVPVANSA 258 Query: 348 SAPPEGQ-DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 G+ L++ D+ G+ + D L +F +I+D+++ Q + Sbjct: 259 GRDTGGEHALIQISQDTGGKHYYATDMGSLDVAFKQISDELRTQYL 304 >gi|197117534|ref|YP_002137961.1| hypothetical protein Gbem_1146 [Geobacter bemidjiensis Bem] gi|197086894|gb|ACH38165.1| conserved hypothetical protein [Geobacter bemidjiensis Bem] Length = 383 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 33/347 (9%), Positives = 87/347 (25%), Gaps = 23/347 (6%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 I++ V + AID+ ++ +Q + + A L+G S+ + + + Sbjct: 8 IMLVVFLVLTGLAIDIGYMYVSDEDLQHSAEMAALTGAESLKQRLLFQAQQSPEKLAQVS 67 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 + A + +N + + Sbjct: 68 A----DPLQSAARSTAVDNATGKHSASALVALMNDNGNALSENNDITVGFWNMSSRSYTP 123 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN---NMTSNKYL 180 + + + +R+ E SS L V +S + ++ Sbjct: 124 GGTPVNAMQVRARRTAESSSVGLGSLGTFVAKISGTASFGSTPVAVAALVPGTRSNIAIC 183 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + ++ + AP + + + + L S+ + +S + Sbjct: 184 AAACEPSCSYPDVCNIAERKMSHAPWDPRREN-SSANRYLYTSLLHPVTITNTMSDLVCQ 242 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT------YPAMHHAYRELYNEKES 294 + G + + + L + N ++ + + + Sbjct: 243 EMPVQEVCGLPIFTAAMKTDAILRDLKAMMYDPNVDSSNKEYDNNGKLAGWWVVVPATDC 302 Query: 295 SHNTIGSTRLKKFVIFIT---------DGENSGASAYQNTLNTLQIC 332 + G T + V+ + GE C Sbjct: 303 AGFQAGETFEQHTVVKYSLVRISRICAGGEPGCGKTSARADQPSAAC 349 >gi|99031847|pdb|2B2X|A Chain A, Vla1 Rdeltah I-Domain Complexed With A Quadruple Mutant Of The Aqc2 Fab gi|99031850|pdb|2B2X|B Chain B, Vla1 Rdeltah I-Domain Complexed With A Quadruple Mutant Of The Aqc2 Fab Length = 223 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 32/206 (15%), Positives = 70/206 (33%), Gaps = 27/206 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I L + +++ K ++G + Y + ++ EV Sbjct: 31 GSNSIYPWESVIAFLNDLLKRMDIGPKQ--TQVGIVQYGENVTHEFNLNKYSSTEEVLVA 88 Query: 266 LNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 NK+ T T + A +E + E + +KK ++ +TDGE Sbjct: 89 ANKIVQRGGRQTMTALGIDTARKEAFTEARGARR-----GVKKVMVIVTDGE------SH 137 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL---------RKCTD--SSGQFFAVND 372 + Q+ + + ++ +S+A+ + L + + FF V+D Sbjct: 138 DNYRLKQVIQDCEDENIQRFSIAI-LGHYNRGNLSTEKFVEEIKSIASEPTEKHFFNVSD 196 Query: 373 SRELLESFDKITDKIQEQSVRIAPNR 398 L+ + ++I P+R Sbjct: 197 ELALVTIVKALGERIFALEALERPHR 222 >gi|117920853|ref|YP_870045.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. ANA-3] gi|117613185|gb|ABK48639.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. ANA-3] Length = 751 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 33/274 (12%), Positives = 80/274 (29%), Gaps = 29/274 (10%) Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK 196 + R N + L S + + + P + + Sbjct: 295 ALANRVPANRDFVLQWRLKQGTSPVGWVFNQAGKTHVSQDDNASADTGPTGKSSNTDNYS 354 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE-----------------KKNLSVRIG 239 P + L ++++ + ++ I Sbjct: 355 LVMVLPPKVEASEQLNLPRELILVIDTSGSMAGDSIIQAKNALRYALRGLRPQDSFNIIE 414 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + + ++NL + +N+L T A++ A +++ + Sbjct: 415 FNSDVSLLSPTPLPATASNLAMARQFVNRLQADGGTEMAQALNAAL-----PRQAFNAAS 469 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + + VIF+TDG SA + + + ++++V + + P + R Sbjct: 470 AEDKSLRQVIFMTDGSVGNESALFELIR-----NQIGDN--RLFTVGIGSAPNSHFMQRA 522 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G F + D E+ + ++ KIQ + Sbjct: 523 AELGRGTFTYIGDVDEVEQKISQLLAKIQYPVLT 556 >gi|238915987|ref|YP_002929504.1| hypothetical protein EUBELI_00011 [Eubacterium eligens ATCC 27750] gi|238871347|gb|ACR71057.1| Hypothetical protein EUBELI_00011 [Eubacterium eligens ATCC 27750] Length = 745 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 44/404 (10%), Positives = 112/404 (27%), Gaps = 64/404 (15%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQI--------KKHLKQGSYIRENAGDIAQK 88 L+ + + + D + K ++K G+ ++ + + Sbjct: 36 ALAAENVLSDNIILVTENEVSDTAKDGYSKVTFKIGDHTKDVYIKNGNTVKLKSSNKDYG 95 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI-----PSALTNLSLRSTGIIERSS 143 ++ + Y K + + L + S Sbjct: 96 VIAGGIENGEAITENTVIYAKSYSYQGTLSLEKSIQSETPDENGYYTLQFSAKSQNLPSI 155 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK---YA 200 ++ ++ +VLD S SM + + + + P +K+ WS + Sbjct: 156 KSENQNVILVLDRSFSMACSVDEDVDSDA--------MAPTYEKTRWSVTINAVEKFLNE 207 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 P + K+ V+ ++ ++K + + + N+ S+ Sbjct: 208 FLPEGTSNKVSVISYCGSARTEITNESSKDK----IMSKLNSIYNRNMYNEDYKNSSKRY 263 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEK--------ESSHNTIGSTRLKKFVIFIT 312 V L TN + + + N + + + Sbjct: 264 NVTQIGRGL--GSATNIQQGLKQVSEIAGDTTGASVILFTDGGANRYDNGEIGGYYYIKN 321 Query: 313 DGENSGASAYQNTLNTLQIC----EYMRNAGMKIYSVAVSAPPEGQDLLRKCT------- 361 + E + + ++ +N E ++ AG IY++ + + L K + Sbjct: 322 NNETNRYNKPRDEVNGSYYAGKAGEELKAAGADIYTIVLMSKESDITDLVKVSLGNKSLT 381 Query: 362 ------------DSSGQ---FFAVNDSRELLESFDKITDKIQEQ 390 G F+ ++ +L F +I ++ Sbjct: 382 YEKSWEKTYFTFSDGGYAKEFYTAANAEQLNNRFKQIMTEMTSL 425 >gi|224049043|ref|XP_002191793.1| PREDICTED: similar to capillary morphogenesis protein 2 [Taeniopygia guttata] Length = 554 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 32/198 (16%), Positives = 71/198 (35%), Gaps = 20/198 (10%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + + + + + + + + +R+ I ++ PL+ + ++K Sbjct: 121 YFVLDKSGSVATNWREIFDFVNQLTERFVSPKMRLSFIVFSTQAHV--IMPLTGDREKIK 178 Query: 264 SRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 L L P T + + A ++ G++R +I +TDG+ G Sbjct: 179 KGLKDLEEVKPAGETYIHEGLKQANEQI--------AKQGASRFSSIIIALTDGKLDG-- 228 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN-DSRELLES 379 Q L + + R G ++Y V V + L + D Q F V + L Sbjct: 229 --QIPLYAEKEAKKSRELGARVYCVGVQ--DFEPEQLERIADVKEQVFPVTGGFQALKGI 284 Query: 380 FDKITDKIQEQSVRIAPN 397 + + + + + + P+ Sbjct: 285 INSVLKQSCTEILYLEPS 302 >gi|4758022|ref|NP_004077.1| cochlin precursor [Homo sapiens] gi|205277471|ref|NP_001128530.1| cochlin precursor [Homo sapiens] gi|114652503|ref|XP_001171057.1| PREDICTED: cochlin isoform 6 [Pan troglodytes] gi|7387582|sp|O43405|COCH_HUMAN RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2801413|gb|AAC39545.1| Coch-5B2 gene product [Homo sapiens] gi|37182918|gb|AAQ89259.1| COCH [Homo sapiens] gi|58802453|gb|AAW82432.1| coagulation factor C homolog, cochlin (Limulus polyphemus) [Homo sapiens] gi|119586367|gb|EAW65963.1| coagulation factor C homolog, cochlin (Limulus polyphemus), isoform CRA_a [Homo sapiens] gi|119586369|gb|EAW65965.1| coagulation factor C homolog, cochlin (Limulus polyphemus), isoform CRA_a [Homo sapiens] gi|158258885|dbj|BAF85413.1| unnamed protein product [Homo sapiens] Length = 550 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + + T T A+ R ++ S Sbjct: 405 KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPIRES 464 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 465 PNK-------NFLVIVTDGQSYD--------DVQGPAAAAHDAGITIFSVGVAWAPLDD- 508 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 509 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 542 >gi|299139026|ref|ZP_07032203.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298599180|gb|EFI55341.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 318 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 67/164 (40%), Gaps = 20/164 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + + ++ ++++ ++ S L +++ + T Y A++ A + L S+ Sbjct: 124 QDQIDLMDFADDVDELVSFTSDVQKIDSGLGRIHHGDATALYDAVYLASQRLGETPTSAG 183 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV-------SA 349 ++ ITDGEN+ + E + AG IY++ + Sbjct: 184 QRR-------VLVLITDGENTTHHGSYD-----AALEQAQRAGAMIYALIIVPVSADAGR 231 Query: 350 PPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 G+ L + D+ G+++ V D +L +F ++D ++ Q Sbjct: 232 NTGGEHALIQLARDTGGKYYYVEDKHDLAPAFQHVSDDLRTQYT 275 >gi|114652501|ref|XP_001171019.1| PREDICTED: coagulation factor C homolog, cochlin isoform 4 [Pan troglodytes] Length = 569 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + + T T A+ R ++ S Sbjct: 424 KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPIRES 483 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 484 PNK-------NFLVIVTDGQSYD--------DVQGPAAAAHDAGITIFSVGVAWAPLDD- 527 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 528 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 561 >gi|34527040|dbj|BAC85316.1| unnamed protein product [Homo sapiens] Length = 401 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + + T T A+ R ++ S Sbjct: 256 KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPIRES 315 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 316 PNK-------NFLVIVTDGQSYD--------DVQGPAAAAHDAGITIFSVGVAWAPLDD- 359 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 360 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 393 >gi|224967060|ref|NP_038620.2| matrilin-4 precursor [Mus musculus] Length = 624 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 54/153 (35%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + R+G I Y+ + + +++ + + P T T A+ +A Sbjct: 67 DVGLNATRVGVIQYSSQVQSVFPLGAFSRREDMERAIRAVVPLAQGTMTGLAIQYAMNVA 126 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + R+ + ++ +TDG ++ R G++IY+V V Sbjct: 127 FSEAEGARP--SEERVPRVLVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 176 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 177 RADVGS--LRTMASPPLDQHVFLVESFDLIQEF 207 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 403 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 457 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + TDG + + Sbjct: 458 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--DLNVPRVGLVFTDGRSQDDISVW---- 511 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V ++ LR+ D + + + Sbjct: 512 ----AARAKEEGIVMYAVGVGKAV--EEELREIASEPSELHVSYSPDFSTMTHLLENLKG 565 Query: 386 KIQ 388 I Sbjct: 566 SIC 568 >gi|148656915|ref|YP_001277120.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569025|gb|ABQ91170.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 561 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 61/193 (31%), Gaps = 28/193 (14%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 +++ + + +Q + + V +P+ Sbjct: 388 VSGSMADEARLEQAKTALRIFIEQLQDDDGFGLTIFS-------DSATVLTPVSPIGPKR 440 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+ +R+ L P T AY+E+ + V+ +TDG ++ Sbjct: 441 AEILNRIAGLTPRGGTRLLDTTVEAYQEMSATPPGQRI--------RAVVVLTDGLDNK- 491 Query: 320 SAYQNTLNTLQICEYMRN----AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + N + +R +K+++VA +LL++ +++G V E Sbjct: 492 ----SQRNAQDVLNLLRQDREGYSIKVFTVAFG-GDADVNLLKEIAEATGAKSYVGKPGE 546 Query: 376 ---LLESFDKITD 385 + + I Sbjct: 547 RGSIERVYQDIAT 559 >gi|14548116|sp|O89029|MATN4_MOUSE RecName: Full=Matrilin-4; Short=MAT-4; Flags: Precursor gi|3766288|emb|CAA06889.1| matrilin-4 precursor [Mus musculus] gi|22477196|gb|AAH36558.1| Matrilin 4 [Mus musculus] gi|148674433|gb|EDL06380.1| matrilin 4, isoform CRA_a [Mus musculus] gi|148674434|gb|EDL06381.1| matrilin 4, isoform CRA_a [Mus musculus] Length = 624 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 54/153 (35%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + R+G I Y+ + + +++ + + P T T A+ +A Sbjct: 67 DVGLNATRVGVIQYSSQVQSVFPLGAFSRREDMERAIRAVVPLAQGTMTGLAIQYAMNVA 126 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + R+ + ++ +TDG ++ R G++IY+V V Sbjct: 127 FSEAEGARP--SEERVPRVLVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 176 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 177 RADVGS--LRTMASPPLDQHVFLVESFDLIQEF 207 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 403 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 457 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + TDG + + Sbjct: 458 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--DLNVPRVGLVFTDGRSQDDISVW---- 511 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V ++ LR+ D + + + Sbjct: 512 ----AARAKEEGIVMYAVGVGKAV--EEELREIASEPSELHVSYSPDFSTMTHLLENLKG 565 Query: 386 KIQ 388 I Sbjct: 566 SIC 568 >gi|124127041|gb|ABM92272.1| CD11b [Ovis canadensis] Length = 1152 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 25/120 (20%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL++ + N K +I ITDGE Y + L + Sbjct: 223 GRTHTATGIRKVVRELFHSSNGARNHA-----LKIMIVITDGE-----KYLDPLEYRDVI 272 Query: 333 EYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + L F VN+ L +++ +KI Sbjct: 273 PEADRKGIIRYVIGVGDAFNSKKSRKELDTIASKPPADHVFQVNNFEALKTIQNQLQEKI 332 >gi|114652499|ref|XP_001171038.1| PREDICTED: coagulation factor C homolog, cochlin isoform 5 [Pan troglodytes] Length = 594 Score = 62.6 bits (150), Expect = 1e-07, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + + T T A+ R ++ S Sbjct: 449 KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPIRES 508 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 509 PNK-------NFLVIVTDGQSYD--------DVQGPAAAAHDAGITIFSVGVAWAPLDD- 552 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 553 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 586 >gi|320106407|ref|YP_004181997.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924928|gb|ADV82003.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 305 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 31/166 (18%), Positives = 68/166 (40%), Gaps = 20/166 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + + ++ P +N+ + + + L+ + T+ Y A++ A + L K Sbjct: 109 REQDEMDLISFSDTVDEIVPFTNDAGRMNAGIGNLHKGDATSLYDAIYLASQRLTEAKRD 168 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV------- 347 + K ++ +TDG N+ + Q E AG IY + + Sbjct: 169 ATRR-------KILVIVTDGGNT-----TKGMRYQQAVEAAERAGAAIYPIIMVPIEADA 216 Query: 348 SAPPEGQ-DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 G+ L++ D+ G++F V D +L ++F ++D ++ Q + Sbjct: 217 GRNTGGEHALIQMAQDTGGKYFYVLDKHDLDKAFAHLSDDLRTQYL 262 >gi|328880283|emb|CCA53522.1| putative secreted protein [Streptomyces venezuelae ATCC 10712] Length = 424 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 69/196 (35%), Gaps = 26/196 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ ++ +++++ + + + PL + Sbjct: 57 GKSRMSAAKQAFNEVLDAVPEEVRLGIRTLGADYPGQDRKQGCKDTRQLYPVGPL--DRT 114 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 E K+ + L P T PA+ A ++L + ++ ITDGE++ Sbjct: 115 EAKTAVASLAPTGWTPIGPALLGAAKDL-----------EGGDATRRIVLITDGEDT--- 160 Query: 321 AYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 L+ Q+ + G+ I ++ + + + L + + G + +V+ + +L Sbjct: 161 --CAPLDPCQVAREIAAKGIHLVIDTLGLVPDAKTRQQLTCIAEATGGTYTSVHRTEDLS 218 Query: 378 ESFDKITDKIQEQSVR 393 ++ D+ + V Sbjct: 219 RRVRQLVDRAADPVVT 234 >gi|291405306|ref|XP_002719067.1| PREDICTED: integrin, alpha E [Oryctolagus cuniculus] Length = 1187 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 55/157 (35%), Gaps = 16/157 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + Y I S N+ +++ + N T T A+ H +++ ++ S Sbjct: 259 LVQYGRVIQTEFNLQDSQNMTASLAKVQNITQVRNVTRTASAIQHVLDDIFTQRHGSRKK 318 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQD 355 K ++ +TDG+ +++ LN + + G++ +++ V + + Sbjct: 319 AS-----KVIVVLTDGDT-----FEDPLNLTVVINSPKMQGIERFAIGVGDAFKKHQTEQ 368 Query: 356 LLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQ 390 L+ F V + L ++ I Sbjct: 369 ELKLIASDPDETHAFKVTNYSALDGLLSRLQQSIIRT 405 >gi|218528855|ref|YP_002419671.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|254559548|ref|YP_003066643.1| hypothetical protein METDI1003 [Methylobacterium extorquens DM4] gi|218521158|gb|ACK81743.1| von Willebrand factor type A [Methylobacterium chloromethanicum CM4] gi|254266826|emb|CAX22625.1| Conserved hypothetical protein containing a von Willebrand factor type A (vWA) domain; putative membrane protein [Methylobacterium extorquens DM4] Length = 339 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 55/165 (33%), Gaps = 23/165 (13%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEK- 292 RIG + + + + V L + + +T + A + L + Sbjct: 143 RIGLVEFADQAYVAAAP--TFDTATVARTLEEATIGLVGRSTGIGDGLGLALKRLAPAQV 200 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA--- 349 ++ K V+ ++DG N+ + ++ G+++Y++A+ Sbjct: 201 AAADGEGPPPARDKVVVLLSDGANNAGQT-----APKDVAALAKDLGVRVYTIALGPIDM 255 Query: 350 --------PPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + LR S G+ F V + +L + I + Sbjct: 256 ADNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENVANAIDE 300 >gi|301764008|ref|XP_002917404.1| PREDICTED: matrilin-4-like [Ailuropoda melanoleuca] Length = 594 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + R+G I Y+ + +++ + L P T T A+ +A Sbjct: 67 DVGPNATRVGVIQYSSQVQSVFPLGAFARREDMEHAIRALVPLAQGTMTGLAIQYAMNVA 126 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 127 FSVAEGARP--PEARVPRIAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 176 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 177 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 207 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 58/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 362 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 416 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 417 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 470 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V ++ LR+ D + D + Sbjct: 471 ----AARAKEEGIVMYAVGVGKAV--EEELRQIASEPAELHVSYSPDFGTMTHLLDNLRG 524 Query: 386 KIQ 388 I Sbjct: 525 SIC 527 >gi|291436333|ref|ZP_06575723.1| von Willebrand factor [Streptomyces ghanaensis ATCC 14672] gi|291339228|gb|EFE66184.1| von Willebrand factor [Streptomyces ghanaensis ATCC 14672] Length = 424 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 66/196 (33%), Gaps = 26/196 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ ++ ++++ K N + PL + Sbjct: 59 GQSRMAAAKQAFNEVLDATPKEVELGIRTLGANYAGDDRKEGCKDTAQLYPVGPL--DRT 116 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 E K+ + L P T P++ A +L K ++ I+DGE++ Sbjct: 117 EAKAAVATLTPTGWTPIGPSLLKAADDLEGGNG-----------SKRIVLISDGEDT--- 162 Query: 321 AYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 L+ ++ + G+ I ++ + + L + + G + +V EL Sbjct: 163 --CAPLDPCEVAREIAAKGIGLTIDTLGLVPNAKLSRQLSCIAEATGGTYASVEHQDELT 220 Query: 378 ESFDKITDKIQEQSVR 393 + +++ D+ E V Sbjct: 221 DRVNELVDRAAEPVVT 236 >gi|281345581|gb|EFB21165.1| hypothetical protein PANDA_005644 [Ailuropoda melanoleuca] Length = 581 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 51/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + R+G I Y+ + +++ + L P T T A+ +A Sbjct: 43 DVGPNATRVGVIQYSSQVQSVFPLGAFARREDMEHAIRALVPLAQGTMTGLAIQYAMNVA 102 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 103 FSVAEGARP--PEARVPRIAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 152 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 153 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 183 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 58/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 379 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 433 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 434 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW---- 487 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V ++ LR+ D + D + Sbjct: 488 ----AARAKEEGIVMYAVGVGKAV--EEELRQIASEPAELHVSYSPDFGTMTHLLDNLRG 541 Query: 386 KIQ 388 I Sbjct: 542 SIC 544 >gi|3766289|emb|CAA06890.1| matrilin-4 precursor, alternate splice product [Mus musculus] Length = 434 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 57/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 213 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 267 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + TDG + + Sbjct: 268 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--DLNVPRVGLVFTDGRSQDDISVW---- 321 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V ++ LR+ D + + + Sbjct: 322 ----AARAKEEGIVMYAVGVGKAV--EEELREIASEPSELHVSYSPDFSTMTHLLENLKG 375 Query: 386 KIQ 388 I Sbjct: 376 SIC 378 >gi|160858159|emb|CAP19999.1| collagen type VI alpha 5 [Homo sapiens] Length = 527 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 24/111 (21%), Positives = 45/111 (40%), Gaps = 12/111 (10%) Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A+ HA E H + +K+ +I ITDGE+ +T +RN G Sbjct: 3 ALKHANALFTEE----HGSRIKQNVKQMLIVITDGESHDHDQLNDT------ALELRNKG 52 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + I++V V Q L + V++ +L + F + +++ + Sbjct: 53 ITIFAVGVGK--ANQKELEGMAGNKNNTIYVDNFDKLKDVFTLVQERMCTE 101 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 66/186 (35%), Gaps = 20/186 (10%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N + +L+++ + ++ ++IG + S + K+++ Sbjct: 125 NSDFVTMTTFLSDLIDNF-----DIQSQRMKIGMAQFGSNYQSIIELKNSLTKTQWKTQI 179 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 ++ +P + A +++ N + + + ++ IT G+ Sbjct: 180 QNVSKSGG---FPRIDFALKKVSNMFNLHAGGRRNAGVPQTLVVITSGDP--------RY 228 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL--ESFDKIT 384 + + +++ G+ + + + + LL T +S + D +L + +I Sbjct: 229 DVADAVKTLKDLGICVLVLGIG-DVYKEHLL-PITGNSEKIITFQDFDKLKNVDVKKRII 286 Query: 385 DKIQEQ 390 +I + Sbjct: 287 REICQS 292 >gi|329896848|ref|ZP_08271743.1| BatA [gamma proteobacterium IMCC3088] gi|328921553|gb|EGG28934.1| BatA [gamma proteobacterium IMCC3088] Length = 328 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 25/142 (17%), Positives = 53/142 (37%), Gaps = 25/142 (17%) Query: 254 PLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P +++ + L++ +T A+ + S + +I Sbjct: 155 PFTDDRETWLALLDESIVNMAGPSTALGDAIGLSIAHF----------RESKTKNRVLIV 204 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCT-DS 363 +TDG ++G + + L + + G+ IY+VAV P ++L + Sbjct: 205 LTDGNDTG-----SKVPPLDAAQVAKAEGVTIYTVAVGDPETVGEEALDLEVLDSIAQTT 259 Query: 364 SGQFFAVNDSRELLESFDKITD 385 G F D + L E++ +I + Sbjct: 260 GGVSFNAADLKALQETYQRIDE 281 >gi|301780868|ref|XP_002925836.1| PREDICTED: integrin alpha-D-like, partial [Ailuropoda melanoleuca] Length = 926 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 56/154 (36%), Gaps = 16/154 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 I Y+ + + + + +S ++ + T T + +EL++ K + Sbjct: 3 LIQYSNHLKIHFTFTQFKSSSSPQSLVDPIVQLNGLTFTATGIRTVVQELFHSKNGAR-- 60 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP---EGQD 355 +K +I ITDG+ Y++ L + AG+ Y++ V + Sbjct: 61 ---KTARKILIVITDGQ-----KYRDPLEYSDVIPQAERAGIVRYAIGVGDAFQKLTARQ 112 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 L F V++ L +++ +KI Sbjct: 113 ELNTIGSKPSQDHVFRVDNFAALSNIQEQLQEKI 146 >gi|149624864|ref|XP_001517479.1| PREDICTED: similar to Cartilage matrix protein precursor (Matrilin-1), partial [Ornithorhynchus anatinus] Length = 249 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 54/159 (33%), Gaps = 18/159 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + R+G + Y + + + + +L P T T A+ A Sbjct: 58 DVGPNATRVGVVNYASAVKHEFPLKAHRSKASLLQAVRRLEPLSTGTMTGLAIQFAISRA 117 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++E E + S L K I +TDG + + R AG++++++ V Sbjct: 118 FSEVEGARPL--SPALSKVAIVVTDGRPQD--------DVKDVSARAREAGIELFAIGVG 167 Query: 349 APPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 + LR+ V +L + F + Sbjct: 168 R--VDKTTLRRIASEPLAEHVDYVESYSVIEKLAKKFQE 204 >gi|126336622|ref|XP_001380249.1| PREDICTED: similar to Inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Monodelphis domestica] Length = 923 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 51/154 (33%), Gaps = 11/154 (7%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 S N+ + K + ++ TN A+ A + L + GS V Sbjct: 323 PALLQASSQNVEQAKKFTSLISAMGATNINDAVLLAVKMLDDSNRKEKLPPGS---VSMV 379 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF- 367 I +TDG+ + N + + +Y + L K +G Sbjct: 380 ILLTDGDATDGE--TNPKKIQENVKAAIGGSYHLYCLGFGFDVN-YAFLEKLALENGGVA 436 Query: 368 ---FAVNDSR-ELLESFDKITDKIQEQSVRIAPN 397 + +DS +L + + ++ + + + P+ Sbjct: 437 RRIYEDSDSDLQLQDFYQEVANPLLTKVEFQYPD 470 >gi|84387243|ref|ZP_00990264.1| hypothetical protein V12B01_22476 [Vibrio splendidus 12B01] gi|84377890|gb|EAP94752.1| hypothetical protein V12B01_22476 [Vibrio splendidus 12B01] Length = 421 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 18/162 (11%), Positives = 47/162 (29%), Gaps = 2/162 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M + V A+D+ H++ + ++Q+A+D+A L+ + + + + Sbjct: 19 MITAALLVFLAVSALAVDINHMVVNKTRLQNAVDSATLAAATILDNSKDKDAVDAEVGTA 78 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + + T + + + + F Sbjct: 79 LNAMAASTGNQ--EIDFSTASISIDYSNDPKDFTGTATFDSTDDVYVRVRVDALEMDEFF 136 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 L + G + N + I + + S D Sbjct: 137 IQLFGLEKVVSASAVAGPSSGLAYNNVVPIGVCIGDGTSDND 178 >gi|113971723|ref|YP_735516.1| putative outer membrane adhesin like protein [Shewanella sp. MR-4] gi|113886407|gb|ABI40459.1| putative outer membrane adhesin like protein [Shewanella sp. MR-4] Length = 1215 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 30/344 (8%), Positives = 88/344 (25%), Gaps = 20/344 (5%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L ++ + + L + Q ++ Sbjct: 166 LGAGYTLAHEWGHYAYGVYDEYKGNAVSGAANATLTTDVATDSIMSNQWQARNGDMKWLN 225 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 ++ I + + G + ++ R + + L + Sbjct: 226 HSTANNIGDVNRTAQGRV-----YGKSAWEVLTQDVKDDPKSGRKTAQPTRTRYTTLANN 280 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 ++ + L + + + +P N I Sbjct: 281 APDASNPVKTELPAAQSSCRDQLKFVWVEGDIDMQIVMDRSGSMYGSPINNAIQAAKTLV 340 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 + + V P + +K+ ++ + +T Sbjct: 341 DATAEGSTAMGLVSFSSRSSVK----QDFAVQQIPKPDTGIKQALKAAIDNIYASGSTAL 396 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 + A L + ++ ++ V + DG+++ + N + +N Sbjct: 397 FDGSSLALDNLITYQTAA-----ASGAPGVVFVLADGDDN-----SSIKNESTVITAYQN 446 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV-NDSRELLESF 380 A + I+S + L+ + G++F+ E++++F Sbjct: 447 ANVPIFSFGYGSASPTGPLVTMANATGGKYFSSPTTLSEIIDAF 490 >gi|114046070|ref|YP_736620.1| putative outer membrane adhesin like protein [Shewanella sp. MR-7] gi|113887512|gb|ABI41563.1| putative outer membrane adhesin like protein [Shewanella sp. MR-7] Length = 1215 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 30/344 (8%), Positives = 88/344 (25%), Gaps = 20/344 (5%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L ++ + + L + Q ++ Sbjct: 166 LGAGYTLAHEWGHYAYGVYDEYKGNAVSGAANATLTTDVATDSIMSNQWQARNGDMKWLN 225 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 ++ I + + G + ++ R + + L + Sbjct: 226 HSTANNIGDVNRTAQGRV-----YGKSAWEVLTQDVKDDPKSGRKTAQPTRTRYTTLANN 280 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 ++ + L + + + +P N I Sbjct: 281 APDASNPVKTELPAAQSSCRDQLKFVWVEGDIDMQIVMDRSGSMYGSPINNAIQAAKTLV 340 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 + + V P + +K+ ++ + +T Sbjct: 341 DATAEGSTAMGLVSFSSRSSVK----QDFAVQQIPKPDTGIKQALKAAIDNIYASGSTAL 396 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 + A L + ++ ++ V + DG+++ + N + +N Sbjct: 397 FDGSSLALDNLITYQTAA-----ASGAPGVVFVLADGDDN-----SSIKNEATVITAYQN 446 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV-NDSRELLESF 380 A + I+S + L+ + G++F+ E++++F Sbjct: 447 ANVPIFSFGYGSASPTGPLVTMANATGGKYFSSPTTLSEIIDAF 490 >gi|218200417|gb|EEC82844.1| hypothetical protein OsI_27660 [Oryza sativa Indica Group] Length = 423 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 15/134 (11%), Positives = 45/134 (33%), Gaps = 16/134 (11%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + KS + L+ TN + A + VI ++DG++ Sbjct: 43 DGKASAKSAVESLHADGCTNILKGLVEAAKVFD--------GRRYRNAVASVILLSDGQD 94 Query: 317 SG-----ASAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 + A + ++ + + +G + +++ + + ++ G F Sbjct: 95 NYNVNGGWGASNSKNYSVLVPPSFKRSGDRRLPVHTFGFGTDHDAAAMHAIAEETGGTFS 154 Query: 369 AVNDSRELLESFDK 382 + + + ++F + Sbjct: 155 FIENQAVVQDAFAQ 168 >gi|189524674|ref|XP_684145.3| PREDICTED: anthrax toxin receptor 1-like [Danio rerio] Length = 607 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 66/198 (33%), Gaps = 18/198 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + + + + + +R+ I ++ L+ + ++++ Sbjct: 41 YFVLDKSGSVQHHWNEIYYFVDHLAHKFISPQLRMSFIVFSTEGR--ILMELTEDRDQIR 98 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + L +L P +T + A +++Y + T R +I +TDGE Sbjct: 99 AGLEELQRVLPGGDTFMHKGFQKASQQIY------YGTGDGYRTASVIIALTDGELRENE 152 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND-SRELLES 379 + R G +Y V V + L DS F VND L Sbjct: 153 FDL----AAREAGRSRQLGASVYCVGV--KDFNETQLATIADSKDHVFPVNDGFEALQGV 206 Query: 380 FDKITDKIQEQSVRIAPN 397 D I + + + + P+ Sbjct: 207 IDSILKRSCIEILAVEPS 224 >gi|126653689|ref|ZP_01725608.1| BatA [Bacillus sp. B14905] gi|126589726|gb|EAZ83861.1| BatA [Bacillus sp. B14905] Length = 973 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 42/324 (12%), Positives = 96/324 (29%), Gaps = 37/324 (11%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNP---LQYIAESKAQYEIPTENLFLKGLIPSAL 128 LK+ + + ++ +T + Y K +Y+ + L Sbjct: 573 LKESADAASRPSGKDRYTKVMVTLVRPGGEPVTDYQGTVKIKYDGVEKTASFITNTSDPL 632 Query: 129 TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 N T + S S V+ + S P K+ Sbjct: 633 NNTGSPGTAVAYFDSIIYGKSKVEATLVNPIDPRYATILKGLKDKTVSKDIFTNPYFSKN 692 Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 S T + + ++ +K I+ L + + Sbjct: 693 SCSLATEVAYVVDYSSSM--------KAVDPTNYRGKKMIEFINQLKAKNNIVI----ET 740 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + T L + + + T+ + + A + T+ Sbjct: 741 NTKATILGEGTTDAVLKKDLYKASKEKGATDIFAGIDIALTKFS----------NDTKTA 790 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP--EGQDLLRKCTDS 363 K ++ ++DG+ + ++ + G+KIY+V++ L++ T++ Sbjct: 791 KAIVVVSDGK-------TSKSKMTKVINEAKKQGVKIYTVSMGKKSQINDATLMQVSTET 843 Query: 364 SGQFFAVNDSRELLESFDKITDKI 387 G ++ D+ +L + F K+ D I Sbjct: 844 GGAYYYALDNLQLHQVFQKLIDAI 867 >gi|326675078|ref|XP_692457.5| PREDICTED: collagen alpha-6(VI) chain [Danio rerio] Length = 1605 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 32/268 (11%), Positives = 75/268 (27%), Gaps = 27/268 (10%) Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVS-RSMEDLYLQKHNDNNNMTSNKYLLPP 183 P+ + S I + + M D + Y + + ++ Sbjct: 132 PAEELRIQGVSVYAIGVKDASQDELLKMTADETKDFYVTNYDALNVLKREIVTDICSQEA 191 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 K + + + + SV +G + + Sbjct: 192 CKNKVADIMFLIDGSSSIYGPDFTSMKTFITKVVNGTI---------IGEDSVHVGVVQF 242 Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 + + NE++ ++ + +T T A+ + Sbjct: 243 SNNPQEQFPLNRYFDQNELEEAIDGIEQLTGDTYTGKALSFISKYFDASNGGRP------ 296 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 + +F++ ITDGE A A + +R+ G+ I+S+ V L + + Sbjct: 297 DVPQFLVVITDGEAHDAVAVP--------AKAIRDKGVTIFSIGV--ASVNTTQLWEISG 346 Query: 363 SSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + + D L + K+ Sbjct: 347 TQDKVYVQRDFDALHSIDKNLQFKLCSS 374 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 52/158 (32%), Gaps = 16/158 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEK 292 VR+G + + N ++ + + P + A+ + K Sbjct: 47 NRVRVGVVKVDRNPTLQFSLTEHKNRASFEAAVRGISQPVGGSEKGKALKYVASLFNQAK 106 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 S +++ +I ITD + E +R G+ +Y++ V Sbjct: 107 ASRPAK-----VQEILIVITDKTSQDDVGDP--------AEELRIQGVSVYAIGV-KDAS 152 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +LL+ D + F+ V + L +I I Q Sbjct: 153 QDELLKMTADETKDFY-VTNYDALNVLKREIVTDICSQ 189 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 32/294 (10%), Positives = 71/294 (24%), Gaps = 36/294 (12%) Query: 106 ESKAQYEIPTENLFLKGLIPSALTN----LSLRSTGIIERSSENLAISICMVLDVSRSME 161 + F + + AI V S + Sbjct: 275 TYTGKALSFISKYFDASNGGRPDVPQFLVVITDGEAHDAVAVPAKAIRDKGVTIFSIGVA 334 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP----APAPANRKIDVLIESA 217 + + + + Y+ KN + P+ + L++ Sbjct: 335 SVNTTQLWEISGTQDKVYVQRDFDALHSIDKNLQFKLCSSHPGGCPSTQLADVIFLVQCT 394 Query: 218 GN-LVNSIQKA---------IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + +K + + +R G I Y+ N ++ + Sbjct: 395 RQIRLQDFEKIKSFLISVVNSTQIGDNLIRFGVIVYSD-TPSQFSLNQYNTRRQIAEAIT 453 Query: 268 KLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L T A+ ++ + + + ITDG+ ++ Sbjct: 454 SLKSPAVSGYTA--RALAYSLTYFTEAN----GGRQKRGVPQMLFMITDGD------ARD 501 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 N + + +Y + L T S + F VN+ L + Sbjct: 502 RENLRARADEFAAKQINVY--GIGVARAQDSELEIITKSKNKIFHVNNYDNLQD 553 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/322 (10%), Positives = 92/322 (28%), Gaps = 26/322 (8%) Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA 127 IK L + ++ + I + + ++ Q +L + Sbjct: 405 IKSFLISVVNSTQIGDNLIRFGVIVYSDTPSQFSLNQYNTRRQIAEAITSLKSPAVSGYT 464 Query: 128 L----TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 +L+ + R + + M+ D + + ++ N Y + Sbjct: 465 ARALAYSLTYFTEANGGRQKRGVPQMLFMITDGDARDRENLRARADEFAAKQINVYGIGV 524 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 + + TKSK + L + ++ + K + + Sbjct: 525 ARAQDSELEIITKSKNKIFHVNNYDNLQDLQMNVSGVLCNATK----PVCQNEVADLVFL 580 Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-----NTNTYPAM---HHAYRELYNEKESS 295 G + + + ++KL + ++ E + Sbjct: 581 IDGSESISEESWITVIAFLLNVVDKLRIGPELFRVGIAQFSSVYQKEFYMNEYKDADGED 640 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 + + + + ++ ITDG ++ + + +RN G+ ++++ + Sbjct: 641 KGSRKQSGVPQNLVLITDGV--------SSDRVNEAADQLRNLGINVFAIGIG--AVSLQ 690 Query: 356 LLRKCTDSSGQFFAVNDSRELL 377 L S + F V + L Sbjct: 691 QLSYIAGSPDRLFKVQNFNYLN 712 >gi|188580059|ref|YP_001923504.1| von Willebrand factor type A [Methylobacterium populi BJ001] gi|179343557|gb|ACB78969.1| von Willebrand factor type A [Methylobacterium populi BJ001] Length = 339 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 51/164 (31%), Gaps = 21/164 (12%) Query: 237 RIGTIAYNIGIVGNQCTPL--SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK-E 293 RIG + + + ++ L +T + A + L + Sbjct: 143 RIGLVEFADQAYVAAAPTFDTAAVARTLEEATIGLV-GRSTGIGDGLGLALKRLAPAQLA 201 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---- 349 + K V+ ++DG N+ + ++ G+++Y++A+ Sbjct: 202 DAEGGGPPPSRDKVVVLLSDGANNAGQT-----APKDVAALAKDLGVRVYTIALGPIDMA 256 Query: 350 -------PPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + LR S G+ F V + +L I + Sbjct: 257 DNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLESVAAAIDE 300 >gi|260818212|ref|XP_002604277.1| hypothetical protein BRAFLDRAFT_88566 [Branchiostoma floridae] gi|229289603|gb|EEN60288.1| hypothetical protein BRAFLDRAFT_88566 [Branchiostoma floridae] Length = 1119 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 31/302 (10%), Positives = 86/302 (28%), Gaps = 27/302 (8%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 T A I + + G+ L +++ + Sbjct: 649 PSTVATTMPTRVATTTQTMAAVCPIAPDYVIFNGICYKDFDELKTYEEARQTCAADGGLL 708 Query: 149 SICM-VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN 207 ++ + + N N+ P Sbjct: 709 AMPRDSATNAFIHTLGGNEIRWIGLNDLINEAFPTVEPCDESVDLFFVLDGSDSVS---L 765 Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 D++ E +V+ ++ + R+G + Y+ G + + + +N Sbjct: 766 ADFDIVKEFVVAVVSGFTISLTD-----TRVGVLQYSDGSTLECNLGDHPDWSSFVNSMN 820 Query: 268 KL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + T+T A+ A L + + + +I +TDG+ + Sbjct: 821 TMARQGGGTSTGAALEFA--RLIAAWRPAPV------VPRIMIVLTDGD--------SED 864 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + + + + + ++++ V + +LL+ ++ + F + D + ++I Sbjct: 865 SVVTPAQALATEQVTVFAIGVGS-FNRSELLQITNNNQDRVFELADFNAIANIMNRIIQA 923 Query: 387 IQ 388 Sbjct: 924 AC 925 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 27/300 (9%), Positives = 88/300 (29%), Gaps = 37/300 (12%) Query: 87 QKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENL 146 +I I + + +F G+ + L + Sbjct: 849 VVPRIMIVLTDGDSEDSVVTPAQALATEQVTVFAIGVGSFNRSEL-----------LQIT 897 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 + V +++ + ++ P + + + Sbjct: 898 NNNQDRVFELADFNAIANIMNRIIQAA---CINIVFPTVEPCDVTTDLFFVLDGSGSVGL 954 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 + + + LV++ + + + R+G + Y+ +L+ + + Sbjct: 955 YN-FNTVKQFVVTLVSAFTIGLNDVND--TRVGVLQYSSSNTLGCNLGDHPDLSSFVNAM 1011 Query: 267 NKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 N + + T T A+ A + + + + ++ +TDG + + Sbjct: 1012 NAMRYHYGPSTQTGAALQAAGQIA--AWRPAP-------VPRIMVVVTDGMAHDSVVAPS 1062 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 + + + ++++ V + LL+ ++ + F + D + ++ + I Sbjct: 1063 --------QGLAADQVNVFAIGVGNYVRSE-LLQIANNNQARVFELADFNAIRDNINDIA 1113 >gi|128485808|ref|NP_001076062.1| integrin alpha-M [Ovis aries] gi|124127039|gb|ABM92271.1| CD11b [Ovis aries] Length = 1152 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 25/120 (20%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL++ + N K +I ITDGE Y + L + Sbjct: 223 GRTHTATGIRKVVRELFHSSSGARNHA-----LKIMIVITDGE-----KYLDPLEYRDVI 272 Query: 333 EYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + L F VN+ L +++ +KI Sbjct: 273 PEADRKGIIRYVIGVGDAFNSKKSRKELDTIASKPPADHVFQVNNFEALKTIQNQLQEKI 332 >gi|73981989|ref|XP_852918.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Canis familiaris] Length = 1147 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 29/334 (8%), Positives = 92/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I + + Sbjct: 160 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNESEPGSQRIKPVFIDDANFGRQISYQH 219 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + E+ ++ + + + Sbjct: 220 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPW 279 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 280 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 339 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 340 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 395 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG +Y ++ ++++ Sbjct: 396 AFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVF 441 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 442 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 475 >gi|291528739|emb|CBK94325.1| von Willebrand factor type A domain [Eubacterium rectale M104/1] Length = 410 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 23/113 (20%), Positives = 44/113 (38%), Gaps = 17/113 (15%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T Y ++ A E N ST + +I +TDG N+ Q+ ++ + Sbjct: 2 TALYSGINSATTEFKNY---------STDASRIMIVVTDGYNN-----QSGASSATVINN 47 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDK 386 + IY V V +L+ + + G ++ +N +L F+ I + Sbjct: 48 AIEENVIIYCVGVG--SVNSTVLKNISESTGGCYYYINQFSQLNGIFENIISE 98 >gi|314948817|ref|ZP_07852188.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0082] gi|313644760|gb|EFS09340.1| von Willebrand factor type A domain protein [Enterococcus faecium TX0082] Length = 1129 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 27/272 (9%), Positives = 77/272 (28%), Gaps = 26/272 (9%) Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 + + + +V+D S SM D L Sbjct: 279 NARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLADSGITDKINMGYVG 338 Query: 196 KSKYAPAPAPANRKI---DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S + + ++ D + ++ S + G+ Sbjct: 339 YSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSRLSVPNGHKKVI 398 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 L++ + ++ +++ ++N Y + ++ + G+T L + Sbjct: 399 VLLTDGVPTFSYKVQRVHAQSSSNYYG----------TQFSNTQDRPGNTSLISRIYDAP 448 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTDS 363 D N + T+ ++ G++I+ + + + + +R+ S Sbjct: 449 DQNNLSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSS 508 Query: 364 ---SGQFFA-VNDSRELLESFDKITDKIQEQS 391 ++ + + ++ E K +I Sbjct: 509 DEKGDLYYESADHATDISEYLAKKAVQISATV 540 >gi|254519993|ref|ZP_05132049.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] gi|226913742|gb|EEH98943.1| von Willebrand factor [Clostridium sp. 7_2_43FAA] Length = 960 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 66/197 (33%), Gaps = 23/197 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + KS A K+ + E+A + ++++ + I IA++ Sbjct: 404 PAISINLIIDKSGSMSAEGGGVSKLTLAKEAAMKALENLREVDE--------ISVIAFDD 455 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + +K ++ + T+ YPA+ Y + S+ Sbjct: 456 TYDEVVPLQKVGDKEAIKELISGIQIRGGTSIYPALEQGYNM----------QMQSSAKI 505 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 K I +TDG++ N + + + + + +VAV L + + G Sbjct: 506 KHTILLTDGQDGYGL-----DNYATLLQNFIDNNITLSTVAVGEGANAGLLNQLASIGKG 560 Query: 366 QFFAVNDSRELLESFDK 382 + + + ++ F K Sbjct: 561 RSYYTDIYTDIPRIFAK 577 >gi|327278404|ref|XP_003223952.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-X-like [Anolis carolinensis] Length = 1162 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 31/158 (19%), Positives = 61/158 (38%), Gaps = 19/158 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 R I ++ + + S + ++ ++++ T T A+ REL+ ++ Sbjct: 181 DTRFALIQFSDRYLEHFNFN-SEDPEQLVLHIHQV--GGWTETATAIRRVVRELFTSQKG 237 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---APP 351 S N + +I ITDG + + L Q+ + AG+ Y++ V + P Sbjct: 238 SRNGAT-----RILIVITDGVKT------DRLQYSQVIPEAKQAGIIRYAIGVGDAFSSP 286 Query: 352 EGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 + + L F V + L D++ DKI Sbjct: 287 DAKRELDDIASEPKAEHIFTVYNFNALRGIQDQLKDKI 324 >gi|163816348|ref|ZP_02207714.1| hypothetical protein COPEUT_02536 [Coprococcus eutactus ATCC 27759] gi|158448345|gb|EDP25340.1| hypothetical protein COPEUT_02536 [Coprococcus eutactus ATCC 27759] Length = 612 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 34/338 (10%), Positives = 87/338 (25%), Gaps = 27/338 (7%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + +Y + + ++ + + A Y + Sbjct: 103 TAMVTDTSNSMYSEVAYDTREYDSMTENGFVSTVDRPLSTFAADRD-TASYSNVRSYIES 161 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 L P + + + S + +++ Sbjct: 162 GSLPPDGAVRIEEMLNYFTYDYRKKPEDGEKFSIYTEYSDCPWNKDTKLMMVGINTDEID 221 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + T + ++ LV + E + + R+ Sbjct: 222 FGDKKPSNLVFLIDTSGS-------------MYDDNKLPLVQQSFAMLAENLDENDRVSI 268 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + Y + + L+ + TN A+ AY + N Sbjct: 269 VTYAGEDTVVLSGTPGSEQYTISEALSNMTAEGCTNGGDAIITAYELAEKNFINGGNNR- 327 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 VI TDG+ + ++ L L I E + + + + + L Sbjct: 328 -------VILATDGDLNVGLTSESDLVDL-ITEEKKENNIFLSVLGFGTDNLKDNKLEAL 379 Query: 361 TDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 D+ G + ++ + E + + D++ +A + Sbjct: 380 ADNGDGSYAFIDSAYEAKKV---LVDEMGGTLNTVAKD 414 >gi|153954292|ref|YP_001395057.1| hypothetical protein CKL_1667 [Clostridium kluyveri DSM 555] gi|219854893|ref|YP_002472015.1| hypothetical protein CKR_1550 [Clostridium kluyveri NBRC 12016] gi|146347173|gb|EDK33709.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555] gi|219568617|dbj|BAH06601.1| hypothetical protein [Clostridium kluyveri NBRC 12016] Length = 580 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 60/148 (40%), Gaps = 17/148 (11%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 S + + +KS + ++ + T+T + A + + + H + + +I + Sbjct: 92 SINSSEDKDRIKSMASDISLGQRTDTGRGLLEAVKLMESG----HKSGNNP----VIILL 143 Query: 312 TDGENS----GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKC-TDSSG 365 +DG+N + + N LQIC ++ G +Y++ ++ + L + + G Sbjct: 144 SDGKNDPERSQDESLNDLKNALQIC---KSKGYPVYTIGLNYNGTVDKTQLGDISSSTGG 200 Query: 366 QFFAVNDSRELLESFDKITDKIQEQSVR 393 + + N + +L I + V+ Sbjct: 201 KDYITNTASDLPAILTDIYADNSKLKVQ 228 >gi|198435216|ref|XP_002126368.1| PREDICTED: similar to integrin alpha Hr1 precursor-like [Ciona intestinalis] Length = 1274 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 28/235 (11%), Positives = 63/235 (26%), Gaps = 24/235 (10%) Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 ++ P + + + + ++ + Sbjct: 143 KKPMTSSTATMFKTPCVPGCPKIILIADIMFVLDDSSSVDD-TAFRSALNWIIQVVSYFS 201 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQC-TPL-SNNLNEVKSRLNKL----NPYENTNTYPAMH 282 + +R+G ++ + L +K ++ +L + T A+ Sbjct: 202 SYIDSGDLRVGVYGFSNDDHRSGIRIGLRKWTSATLKKQIGELLNVKSTGAGTYISHAIK 261 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 + + K +I +TDG S + R+ G+ + Sbjct: 262 ETVKVFEANGR--------KGISKEIILLTDGGASDWW------LLKGEADTARDKGIVL 307 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 SV V LL + + F D L E + + I ++I N Sbjct: 308 VSVGVGTSVNNDQLL-AIAGNKSRVFQATDYNTLDEVVNNVVSTIDA--IKITVN 359 >gi|253701051|ref|YP_003022240.1| von Willebrand factor A [Geobacter sp. M21] gi|251775901|gb|ACT18482.1| von Willebrand factor type A [Geobacter sp. M21] Length = 331 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 58/180 (32%), Gaps = 43/180 (23%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRE 287 RIG +A+ PL+++ ++ +++L+ + T A+ Sbjct: 126 GGRKDDRIGLVAFAGRPYPAA--PLTSDHQWLQGVVDRLDTGAVEDGTALGDAILSGVNR 183 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L + +I ITDG N+ + Q + G++++++ + Sbjct: 184 L----------RRRPAESRALILITDGRNNAGAEPQLAAQAAKA------LGIRVHAIGI 227 Query: 348 SAPPE---------------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L+ + + G++F D+ L F +I Sbjct: 228 GSRGSAVIPVPSPLGGTIYRRLDAELDAATLKGVAELTGGRYFEAGDATVLSRVFAEIDR 287 >gi|124008260|ref|ZP_01692956.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123986209|gb|EAY26038.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 552 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 41/308 (13%), Positives = 94/308 (30%), Gaps = 29/308 (9%) Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 +++ + + A Y +++ L ++ L + + Sbjct: 96 LSVKTAPLSTFSIDVD-NASYSRARKSINNGQLPSTSSVRLEEFINYFNYQYKQPEGQHP 154 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 V + + + K++ + K L K S S AP K+ Sbjct: 155 FSV--NTEVAKCPWNPKNHLVHIGLQGKRLDSRKLKLSNLVFLIDVSGSMSAP----DKL 208 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 +L ++ LVN++ + R+ + Y + ++ L+KL Sbjct: 209 PLLRKAFKMLVNNLGEED--------RVAIVVYAGNAGLVLPATQGTDKQKIMEALDKLQ 260 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 +T + AY+ N +I TDG+ + + Sbjct: 261 SGGSTAGGAGIKLAYKIAKQNFIKEGNNR--------IILATDGDFNLG--ASSDQAMQN 310 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQE 389 + E R G+ I + + + D G ++ +++ L E++ ++ Sbjct: 311 LIEEKRKEGVFITVLGLGMGNYRDSKMEIIADKGNGNYYYLDN---LNEAYKVFGKDLKG 367 Query: 390 QSVRIAPN 397 IA + Sbjct: 368 TLFTIAKD 375 >gi|257887439|ref|ZP_05667092.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,141,733] gi|257823493|gb|EEV50425.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,141,733] Length = 1107 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 24/272 (8%), Positives = 71/272 (26%), Gaps = 26/272 (9%) Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 + + + +V+D S SM + L Sbjct: 257 NARGNVQKDITPLDLVLVVDWSGSMNNNDRIGEVKIGVDRFVDTLADSGITDKINMGYVG 316 Query: 196 KSKYAPAPAPANRKI---DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S + ++ D + ++ S + G + Sbjct: 317 YSSEGHNYSNGTVQMGSFDSVKNQVKSITPSWTNGGTFTQKGLRDAGDMLSVPNGHKKVI 376 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 L++ + ++ ++ + + Y + ++ + G+T + Sbjct: 377 VLLTDGVPTFSYKVQRVRAQSSNDYYG----------TQFSNTQDQPGNTSRIARSYYAP 426 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT----------- 361 D N + T+ ++ G++I+ + + + L K Sbjct: 427 DQNNQSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQSDSAAGLSKAEVESRMRKMVSA 486 Query: 362 -DSSGQFFA-VNDSRELLESFDKITDKIQEQS 391 + ++ + + ++ E K +I Sbjct: 487 DEKGDLYYESADHATDISEYLAKKAVQISATV 518 >gi|163788218|ref|ZP_02182664.1| hypothetical protein FBALC1_07553 [Flavobacteriales bacterium ALC-1] gi|159876538|gb|EDP70596.1| hypothetical protein FBALC1_07553 [Flavobacteriales bacterium ALC-1] Length = 688 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 43/292 (14%), Positives = 84/292 (28%), Gaps = 32/292 (10%) Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH 168 A Y + +I S + + D S+ Sbjct: 253 ASYSNVRRMINNGQVIASDAVKIEEMINYFNYNYPQPTD-------DHPFSINTEVTDTP 305 Query: 169 NDNNNMTSNKYLLPPPP--KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 N + L K + N T ++ K+ +L + LVN +++ Sbjct: 306 WH-NKTQLVRIGLQGKSYADKDLPASNLTFLIDVSGSMSSHNKLPLLKSAFKLLVNQLRE 364 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 ++ + Y NN ++ S LN L +T + AY+ Sbjct: 365 KD--------KVSIVVYAGAAGVVLEPTSGNNKEKIISALNNLQSGGSTAGGAGIKLAYK 416 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + N VI TDG+ + ++ N + + E R +G+ + + Sbjct: 417 LAEKNFKKKGNNR--------VILATDGDFNVGASSDN--DMKTLIEEKRKSGVFLSVLG 466 Query: 347 VSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L D G +++ +E + F + IA + Sbjct: 467 FGYGNYKDSKLETLADKGNGNHAYIDNMQEAQKVF---GKEFGGTLFTIAKD 515 >gi|149915102|ref|ZP_01903630.1| hypothetical protein RAZWK3B_14733 [Roseobacter sp. AzwK-3b] gi|149810823|gb|EDM70662.1| hypothetical protein RAZWK3B_14733 [Roseobacter sp. AzwK-3b] Length = 444 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 19/164 (11%), Positives = 56/164 (34%), Gaps = 15/164 (9%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 S R+ +A++ + ++ L++++ T + + +E+ Sbjct: 77 SDRLAIVAFDNATEVMFSGGPRGDGQAARAALSRIHARGMTALHDGWLLGVEQSIAMREA 136 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 V ++DG + + + C M G+ + + + Sbjct: 137 GTPAR--------VFLLSDGVANVGLTDASAIAAD--CTRMAEHGITTSTCGLGMG-FNE 185 Query: 355 DLLRKCTDSS-GQFFAVNDSRELLESFDK---ITDKIQEQSVRI 394 DL+ + + G + + +L + F++ + I + +R+ Sbjct: 186 DLMAEMARAGRGNAYYGETAEDLQDPFEQEFDLLRNICARGLRL 229 >gi|296327481|ref|ZP_06870027.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] gi|296155307|gb|EFG96078.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] Length = 530 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 54/131 (41%), Gaps = 17/131 (12%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N+ ++ L + P T+ ++ + +L + + + ITDG + Sbjct: 252 NVEGIEKALEPIQPTGWTSIAKSIEYGVEDLK--------ALDGEKTLNILYIITDGIET 303 Query: 318 GASAYQNTLNTLQICEYMR--NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF-AVNDSR 374 N ++I + ++ N + + + + LL++ D++G ++ +VND+ Sbjct: 304 CGG------NPVEIAKQLKGENTNIVLGIIGFNVDANQNRLLKQIADAAGGYYSSVNDAN 357 Query: 375 ELLESFDKITD 385 +L +I + Sbjct: 358 KLTGELYRINE 368 >gi|386831|gb|AAA51620.1|AAA51620 integrin alpha subunit precursor [Homo sapiens] Length = 1163 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 56/174 (32%), Gaps = 17/174 (9%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 N V ++ Q + + + ++N + + +++L T T Sbjct: 172 NFVRAVISQFQRPSTQFSLMQFSNKFQTHLTFEEFRRTSNPLSLLASVHQLQ--GFTYTA 229 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ + L++ + K +I ITDG +TL+ + A Sbjct: 230 TAIQNVVHRLFHASYGARRDAT-----KILIVITDG-----KKEGDTLDYKDVIPMADAA 279 Query: 339 GMKIYSVAVSA---PPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 G+ Y++ V L F V D L + ++ +KI Sbjct: 280 GIIRYAIGVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQTQLREKI 333 >gi|304441881|gb|ADM34176.1| martilin [Aplysia californica] Length = 348 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 48/143 (33%), Gaps = 18/143 (12%) Query: 258 NLNEVKSRLNKL--NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 +E+ + + + T+T + + + + + ++ + ITDG Sbjct: 43 GKDELIAAIGNIPHRLGRYTSTGEGIEYMATAQLASQFTR------SWAERVGLVITDGN 96 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 Q + T + R +G+ ++++ V L + + F V+ E Sbjct: 97 ------SQESAKTKEAARQARESGITMFAIGVG--NVKDQELVNIAGDASRVFKVDSYDE 148 Query: 376 LLESFDKITDK--IQEQSVRIAP 396 L + + I++ P Sbjct: 149 LENIKQTLAHQTCIRQLKTTTPP 171 >gi|218782851|ref|YP_002434169.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218764235|gb|ACL06701.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 684 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 26/221 (11%), Positives = 71/221 (32%), Gaps = 24/221 (10%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 L S S KI + E +VN + + Sbjct: 26 SLANAASPGSNLILIFDASGSMWGQIEGKAKITIAKEVMEGIVNDL---PDDINVGLTAY 82 Query: 239 GTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 G PL + ++ +NP T ++ + + ++ + Sbjct: 83 GHRRKGDCDDLETLIPLGPIDKQAFIEKIKAINPKGKTPMLRSIRLTAEAIKHLEDETT- 141 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK--IYSVAVSAPPEGQD 355 ++ ++DG+ + ++ G+ ++ V E ++ Sbjct: 142 ----------ILLVSDGKET------CDPEPCAFVAELKALGINFVMHVVGFDVGGETEE 185 Query: 356 LLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 L+ + G++F +++ +L E+ + +K E++++++ Sbjct: 186 ELKCMAAAGDGEYFPASNADKLKEALATVIEKTVEKNLKVS 226 >gi|120609754|ref|YP_969432.1| von Willebrand factor, type A [Acidovorax citrulli AAC00-1] gi|120588218|gb|ABM31658.1| von Willebrand factor, type A [Acidovorax citrulli AAC00-1] Length = 355 Score = 62.2 bits (149), Expect = 1e-07, Method: Composition-based stats. Identities = 25/242 (10%), Positives = 74/242 (30%), Gaps = 66/242 (27%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 S A ++ ++A + + VR+G +A+ Q + Sbjct: 95 SGSMRAADVHPDRLTAAQDAAKAFIADLP--------RHVRVGIVAFAGSAQLAQLP--T 144 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE-----------------------KE 293 N ++ ++ T T + + L+ + Sbjct: 145 QNHEDLFRAIDSFQLQRGTATGNGILLSLATLFPDTGIDVSALGGRQAMPRPQSMDEIGR 204 Query: 294 SSHNTIGSTRLKK------------FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 H + +I +TDG+ + ++ ++ ++ + G++ Sbjct: 205 PPHRGGNGKGADRPAPVAPGSYTSAAIIMLTDGQRTTG------VDPMEAAQWAADRGVR 258 Query: 342 IYSVAVSAPPE--------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 +Y+V V +D L+ ++ ++F + +L + ++ ++ + Sbjct: 259 VYTVGVGTVAGETIGFEGWSMRVRLDEDTLKAVAQRTNAEYFHAATAADLKKVYETLSSR 318 Query: 387 IQ 388 + Sbjct: 319 LT 320 >gi|188990634|ref|YP_001902644.1| putative secreted protein [Xanthomonas campestris pv. campestris str. B100] gi|167732394|emb|CAP50588.1| putative secreted protein [Xanthomonas campestris pv. campestris] Length = 597 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 35/332 (10%), Positives = 77/332 (23%), Gaps = 37/332 (11%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES-KAQYEIPTENLFLKGLIPSALTN 130 + + + + + Y L L P Sbjct: 113 MPPSPPTENRETYQTLSDNPIVQAAEQPVSTFSIDVDTGSYSNVRRFLNAGTLPPVDAVR 172 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK----HNDNNNMTSNKYLLPPPPK 186 + + V + + LPP Sbjct: 173 VEELINYFRYDDPAPTDGTPFAV--RTELAPTPWNTDTLLLRIGVAGRDVPTAALPPANL 230 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 A K+ +L S LV ++K RI + Y Sbjct: 231 VFLVD--------VSGSMGAPDKLPLLQSSLKLLVRQLRKQD--------RITLVTYAGS 274 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 + ++ L T + AY+ Sbjct: 275 TAVVLPPTSGAQQTRIVEAIDSLQSGGGTAGASGIELAYKAAQQAYLRGGINR------- 327 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-G 365 ++ TDG+ + + L + R +G+ + ++ L+ + D+ G Sbjct: 328 -ILLATDGDFNVGVTDFDQL--KGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLADAGDG 384 Query: 366 QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + ++ + E + +T ++ IA + Sbjct: 385 AYAYIDSALEARKV---LTHELGSTLATIARD 413 >gi|332970145|gb|EGK09139.1| D-amino-acid dehydrogenase [Desmospora sp. 8437] Length = 485 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 31/210 (14%), Positives = 67/210 (31%), Gaps = 22/210 (10%) Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 L P ++ S K+D+ ++ +++ + + G Sbjct: 167 LKVPESQEVNIMILLDSSGSMADKVKGGVKMDLAKKAVKEFASNMPEGANVSLVVYGHKG 226 Query: 240 ----TIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + PL+ + + L+K P T +M A +L + Sbjct: 227 SNAGADKKVSCESIEEIYPLAAYDGKTFQGSLDKFQPTGWTPLAGSMKLAQEKLASH--- 283 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK--IYSVAVSAPPE 352 + ++ V ++DG + + ++ E + + MK + + E Sbjct: 284 -----TGSNVQNIVYVVSDGVET------CDGDPVKAAEELNESNMKAVVNIIGFDVDNE 332 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFD 381 GQ L++ D G + V L E F+ Sbjct: 333 GQKALKEVADAGGGTYKTVGSKVGLQEYFE 362 >gi|156120138|ref|NP_001095285.1| complement C2 [Sus scrofa] gi|148724911|emb|CAN87699.1| complement component 2 [Sus scrofa] Length = 752 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 45/286 (15%), Positives = 95/286 (33%), Gaps = 17/286 (5%) Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 L S S DV+ ++ + N + K Sbjct: 183 LTGSMERECQDNGVWSGTEPICRQPYSYDFPEDVTPALGTSFSHLLGATNPTQTKKTENV 242 Query: 183 P-PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 + + + + V SA LV+ I N+SV I T Sbjct: 243 GRKIQIQRSGHLNLYLLLDASQSVSEEDFGVFKRSASLLVDRIFSFEI---NISVAIITF 299 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSH 296 A I+ + S ++ EV L +N TN Y A++ Y + N+ ++ Sbjct: 300 ASKPKIIMSVLKDKSRDVTEVVHSLENINYKDHENGTGTNIYEALNSVYIMMNNQMQNLG 359 Query: 297 NTIGSTRLKKF-VIFITDGENSGASAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPE 352 + + + +I +TDG+++ + + ++ ++ ++ + IY++ V Sbjct: 360 MNTMAWQEIRHAIILLTDGKSNMGGSPKPAVDNIKEILNIKEKRKDYLDIYAIGVGNVEV 419 Query: 353 GQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDKIQEQSVRIA 395 L + F + D+ L + F+ + D + + + I Sbjct: 420 DWRELNELGSKKDGERHAFILKDTEALSQVFEHMLD-VSQLTDTIC 464 >gi|38455778|gb|AAR20890.1| complement C2 [Sus scrofa] Length = 734 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 45/286 (15%), Positives = 95/286 (33%), Gaps = 17/286 (5%) Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 L S S DV+ ++ + N + K Sbjct: 183 LTGSMERECQDNGVWSGTEPICRQPYSYDFPEDVTPALGTSFSHLLGATNPTQTKKTENV 242 Query: 183 P-PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 + + + + V SA LV+ I N+SV I T Sbjct: 243 GRKIQIQRSGHLNLYLLLDASQSVSEEDFGVFKRSASLLVDRIFSFEI---NISVAIITF 299 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSH 296 A I+ + S ++ EV L +N TN Y A++ Y + N+ ++ Sbjct: 300 ASKPKIIMSVLKDKSRDVTEVVHSLENINYKDHENGTGTNIYEALNSVYIMMNNQMQNLG 359 Query: 297 NTIGSTRLKKF-VIFITDGENSGASAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPE 352 + + + +I +TDG+++ + + ++ ++ ++ + IY++ V Sbjct: 360 MNTMAWQEIRHAIILLTDGKSNMGGSPKPAVDNIKEILNIKEKRKDYLDIYAIGVGNVEV 419 Query: 353 GQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDKIQEQSVRIA 395 L + F + D+ L + F+ + D + + + I Sbjct: 420 DWRELNELGSKKDGERHAFILKDTEALSQVFEHMLD-VSQLTDTIC 464 >gi|281337632|gb|EFB13216.1| hypothetical protein PANDA_007041 [Ailuropoda melanoleuca] Length = 524 Score = 62.2 bits (149), Expect = 2e-07, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + + V + + + T T A+ R ++ S Sbjct: 379 KIAAVQFTYDQRTEFSFTDYSTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPVRDS 438 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 439 PNK-------NFLVIVTDGQSYD--------DVRGPAAAAHDAGITIFSVGVAWAPLDD- 482 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 483 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 516 >gi|332262981|ref|XP_003280535.1| PREDICTED: integrin alpha-M-like [Nomascus leucogenys] Length = 997 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 63/191 (32%), Gaps = 21/191 (10%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + E ++ ++K+ K S+ + + + N + Sbjct: 158 SGSINPNDFQKMKEFVSTVMEQLKKS---KTLFSLMQYSEEFWTHFTFKEFQDNPNPRSL 214 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 VK + + T+T + REL+N + + K ++ ITDGE Sbjct: 215 VKPI---MQLFGRTHTATGIRKVVRELFNITQGAR-----KNAFKILVVITDGE-----K 261 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSREL 376 + + L + G+ Y + V + L F VN+ L Sbjct: 262 FGDPLGYEDVIPEADREGVIRYVIGVGDAFHSMNSRQELNTIASKPPRDHVFQVNNFEAL 321 Query: 377 LESFDKITDKI 387 +++ +KI Sbjct: 322 KTIQNQLREKI 332 >gi|163850298|ref|YP_001638341.1| von Willebrand factor type A [Methylobacterium extorquens PA1] gi|163661903|gb|ABY29270.1| von Willebrand factor type A [Methylobacterium extorquens PA1] Length = 339 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 55/165 (33%), Gaps = 23/165 (13%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYE-NTNTYPAMHHAYRELYNEK- 292 RIG + + + + V L + + +T + A + L + Sbjct: 143 RIGLVEFADQAYVAAAP--TFDTATVARTLEEATIGLVGRSTGIGDGLGLALKRLAPAQV 200 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA--- 349 ++ K V+ ++DG N+ + ++ G+++Y++A+ Sbjct: 201 AAADGEGPPPARDKVVVLLSDGANNAGQT-----APKDVAALAKDLGVRVYTIALGPIDM 255 Query: 350 --------PPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITD 385 + LR S G+ F V + +L + I + Sbjct: 256 ADNPNNEQDVVDVETLRAMAETSGGRAFRVKTTDDLENVANAIDE 300 >gi|239928001|ref|ZP_04684954.1| secreted protein [Streptomyces ghanaensis ATCC 14672] Length = 417 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/196 (14%), Positives = 66/196 (33%), Gaps = 26/196 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ ++ ++++ K N + PL + Sbjct: 52 GQSRMAAAKQAFNEVLDATPKEVELGIRTLGANYAGDDRKEGCKDTAQLYPVGPL--DRT 109 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 E K+ + L P T P++ A +L K ++ I+DGE++ Sbjct: 110 EAKAAVATLTPTGWTPIGPSLLKAADDLEGGNG-----------SKRIVLISDGEDT--- 155 Query: 321 AYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 L+ ++ + G+ I ++ + + L + + G + +V EL Sbjct: 156 --CAPLDPCEVAREIAAKGIGLTIDTLGLVPNAKLSRQLSCIAEATGGTYASVEHQDELT 213 Query: 378 ESFDKITDKIQEQSVR 393 + +++ D+ E V Sbjct: 214 DRVNELVDRAAEPVVT 229 >gi|221135528|ref|XP_002156134.1| PREDICTED: similar to collagen, partial [Hydra magnipapillata] Length = 194 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 64/186 (34%), Gaps = 16/186 (8%) Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK--KNLSVRIGTIAYNIGIVGNQ 251 T + L + ++K N G + ++ + Sbjct: 6 TPDCEGFFDVGFILDSSGSLKSQYWKEKDFLKKLANSFGISNKGSHAGVVTFSHYAELSI 65 Query: 252 CTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + + +++++ ++ T A+ A L++ K + N + + Sbjct: 66 RLDAFYSSIDFNDAVDRISHMDSFTRIDLALAKALE-LFDIKNGARN-----DVPNLLFL 119 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV 370 +TDG+ + + I + ++ G+++++V + A + L K + F V Sbjct: 120 LTDGKQ------EPEMPLTHISDEIKQKGIQLFAVGIGAGAN-KTELEKIVGNPENVFMV 172 Query: 371 NDSREL 376 +D +L Sbjct: 173 DDFDKL 178 >gi|6680956|ref|NP_031754.1| cochlin precursor [Mus musculus] gi|311771523|ref|NP_001185764.1| cochlin precursor [Mus musculus] gi|12644458|sp|Q62507|COCH_MOUSE RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2801415|gb|AAC39949.1| Coch-5B2 gene product [Mus musculus] gi|26324626|dbj|BAC26067.1| unnamed protein product [Mus musculus] gi|28277390|gb|AAH45137.1| Coagulation factor C homolog (Limulus polyphemus) [Mus musculus] gi|74178965|dbj|BAE42713.1| unnamed protein product [Mus musculus] gi|74209551|dbj|BAE23310.1| unnamed protein product [Mus musculus] Length = 552 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 26/155 (16%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + N V + L + T T A+ R ++ S Sbjct: 407 KIAAVQFTYDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAIAFTVRNVFGPIRDS 466 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 467 PNK-------NFLVIVTDGQSYD--------DVRGPAAAAHDAGITIFSVGVAWAPLDD- 510 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 LR F + L + I Sbjct: 511 -LRDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 544 >gi|19703911|ref|NP_603473.1| D-amino acid dehydrogenase large subunit [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] gi|19714079|gb|AAL94772.1| hypothetical protein FN0576 [Fusobacterium nucleatum subsp. nucleatum ATCC 25586] Length = 369 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 57/145 (39%), Gaps = 18/145 (12%) Query: 245 IGIVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 P+ + N+ ++ L + P T+ ++ + +L + + Sbjct: 77 SCGANELIYPIGDLNVEGIEKALEPIQPTGWTSIAKSIEYGVEDLK--------ALDGEK 128 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMR--NAGMKIYSVAVSAPPEGQDLLRKCT 361 + ITDG + N ++I + ++ N + + + + LL++ Sbjct: 129 TLNILYIITDGIETCGG------NPVEIAKQLKGENTNIVLGIIGFNVDANQNRLLKQIA 182 Query: 362 DSSGQFF-AVNDSRELLESFDKITD 385 D++G ++ +VND+ +L +I + Sbjct: 183 DAAGGYYSSVNDANKLTGELYRINE 207 >gi|301607027|ref|XP_002933125.1| PREDICTED: collagen alpha-1(VII) chain-like [Xenopus (Silurana) tropicalis] Length = 2671 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 41/132 (31%), Gaps = 19/132 (14%) Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 N E+ + L NT T + +A + Sbjct: 64 SATRTEFTFTTHRNGTELVQAIRNLGYKGGNTRTGTGLRYAADNFFGPTIIRP------N 117 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 + K I ITDG++ + +++ G+K+++V + L + + Sbjct: 118 VPKVAILITDGKSQDDVDPPT--------QRLKSQGIKMFAVGI--KNADSRELTRVAST 167 Query: 364 S--GQFFAVNDS 373 FF VND Sbjct: 168 PTEDFFFYVNDF 179 >gi|226326038|ref|ZP_03801556.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] gi|225205580|gb|EEG87934.1| hypothetical protein COPCOM_03856 [Coprococcus comes ATCC 27758] Length = 275 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 33/226 (14%), Positives = 73/226 (32%), Gaps = 22/226 (9%) Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 N Y + + K + AN + L ++A N + ++ S Sbjct: 44 NWYYIDDSKEGLGEKLTDNKERLYYTTNDANDRFYYLKQAATNFTTQLAQSSP----NSE 99 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + + + +N + T+ + AY+ L N+ Sbjct: 100 IALVTFNKTATEQFDFKNVGKDSAYITETINAMETSGGTHQNEGLDRAYKILNND----- 154 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--- 353 ++ LK++V+ +TDG +G + Q +++ K+ +V V Sbjct: 155 --QNTSNLKRYVVLLTDGCPNGVTYDQ----ITTSINKIKSTNTKLITVGVGLDETNTGL 208 Query: 354 ---QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 +D L+ + ND+ L F +I + + ++ Sbjct: 209 KAAKDYLQANA-DDNMAYNANDASHLNTIFTQILGQTTNSNTPLSV 253 >gi|326922361|ref|XP_003207417.1| PREDICTED: collagen alpha-3(VI) chain-like [Meleagris gallopavo] Length = 3135 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 64/174 (36%), Gaps = 16/174 (9%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT- 275 + V ++ + +R+G + ++ +++ RL +L P + Sbjct: 663 VRDFVVTLVN-NLDVGTDKIRVGLVQFSDTPKTEFSLYSYQTKSDIIQRLGQLRPKGGSV 721 Query: 276 -NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 NT A++ L N + + + ++ + ++ +T G ++ LQ+ Sbjct: 722 LNTGSALNF---VLSNHFTEAGGSRINEQVPQVLVLVTAGRSADPF--------LQVSND 770 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + AG+ ++V V + L + + + V+D L ++ I Sbjct: 771 LARAGVLTFAVGV--RNADKAELEQIAFNPRMVYFVDDFSGLTALPQELNKPIT 822 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 62/185 (33%), Gaps = 20/185 (10%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENT 275 + + KA+ N R + ++ + +V S + + + Sbjct: 57 VREFLYDVVKALDVGGNDF-RFALVQFSGNPHTEFQLNTYPSNQDVLSHIANMPYMGGGS 115 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T + + + N + + S + + +I +TDG++ A +++ + Sbjct: 116 KTGKGLEY---LIENHLTKAAGSRASEGVPQVIIVLTDGQSQDDVALPSSV--------L 164 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++A + + +AV + L++ F + + L + D + Sbjct: 165 KSAHVNM--IAVGVQDAVEGELKEIASRPFDTHLFNIENFTALHGI---VGDLVASVRTS 219 Query: 394 IAPNR 398 + P + Sbjct: 220 MTPEK 224 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 23/197 (11%), Positives = 61/197 (30%), Gaps = 24/197 (12%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 R +L +V S+ + VR+ + Y+ I + ++ Sbjct: 1043 SDGVRRGFPLLKTFVERVVESL-----DIGRDKVRVAIVQYSNVIQPEFLLDAYEDKADL 1097 Query: 263 KSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 S + L + NT A+ + + ++ S + +F+I +T + Sbjct: 1098 VSAIQALTIMGGSPLNTGAALDYLIKNVFTVSSGSRIA---EGVPQFLILLTADRSQDDV 1154 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + + ++ +G + + + L+ + +V D +L Sbjct: 1155 RRPSVV--------LKTSGTVPFGIGIG--NADLTELQTISFLPDFAISVPDFSQLDSV- 1203 Query: 381 DKITDKIQEQSVRIAPN 397 + + +R+ Sbjct: 1204 ---QQAVSNRVIRLTKK 1217 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/322 (10%), Positives = 90/322 (27%), Gaps = 36/322 (11%) Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 Q+ I +A + + N+ G+ + L ++ + Sbjct: 138 SEGVPQVIIVLTDGQSQDDVALPSSVLKSAHVNMIAVGVQDAVEGELKEIASRPFDTHLF 197 Query: 145 NLAISIC---MVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 N+ +V D+ S+ + + + + Sbjct: 198 NIENFTALHGIVGDLVASVRTSMTPEKAGAKGLVKDITAQESADLIFLIDGSNNIGSVNF 257 Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 ++ + + ++ ++++ + IG + Y+ + + Sbjct: 258 -------------QAIRDFLVNLIESLRVGAQQ-IHIGVVQYSDQPRTEFALNSYSTKAD 303 Query: 262 VKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 V + L+ NT A+ + L+ + + + + + ++ I+ GE Sbjct: 304 VLDAVKALSFRGGKEANTGAALEYVVENLFTQ---AGGSRIEEAVPQILVLISGGE---- 356 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + L + I+S ++ L++ F D R L Sbjct: 357 -SSDDIREGLLAVKQA-----SIFSFSIGVLNADSAELQQIATDGSFAFTALDIRNLAAL 410 Query: 380 FD----KITDKIQEQSVRIAPN 397 + I Q + AP Sbjct: 411 RELLLPNIVGVAQRLILLEAPT 432 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 40/373 (10%), Positives = 99/373 (26%), Gaps = 49/373 (13%) Query: 22 IMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY-IRE 80 + + SA+ A + G + + + + +L + + + Sbjct: 1090 AYEDKADLVSAIQALTIMGGSPLNTGAALD-------------------YLIKNVFTVSS 1130 Query: 81 NAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE 140 + Q I + + + F G+ + LT L S Sbjct: 1131 GSRIAEGVPQFLILLTADRSQDDVRRPSVVLKTSGTVPFGIGIGNADLTELQTISFLPDF 1190 Query: 141 RSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA 200 S + V + +K ++ P K S+ Sbjct: 1191 AISVPDFSQLDSVQQAVSNRVIRLTKKEIESLAPDLVFTSPSPAGAKRDIVFLVDGSR-- 1248 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 A D++ NL R+ + ++ + + Sbjct: 1249 YAAQEFYLIRDLIERIVNNLDVGFDTT---------RVSVVQFSEHPHVEFLLNAHSTKD 1299 Query: 261 EVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 EV+ + +L P N A+ + ++ S + +F++ ++ ++ Sbjct: 1300 EVQGAVRRLRPRGGQQVNMGEALEFVAKTIFTRPSGSRIE---EGVPQFLVILSSRKSDD 1356 Query: 319 A-SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + + + M + + + + S F V+ +EL Sbjct: 1357 DLEFPSVQVKQVGVAPLVIAKNM------------DPEEMVQISLSPDYVFQVSSFQELP 1404 Query: 378 ESFDKITDKIQEQ 390 K+ I+ Sbjct: 1405 SLEQKLLAPIETL 1417 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 55/170 (32%), Gaps = 15/170 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRE 287 + VRIG + ++ + + N V + +L N A+ + Sbjct: 1471 DVGPNKVRIGVVQFSNNVFPEFFLKTHKSKNAVLQAIRRLRLRGGYPVNAGKALDY---V 1527 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + N S + + + ++ I ++ + + + + V Sbjct: 1528 VKNYFIKSAGSRIEDGVPQHLVVILGDQSQDDVNRPANVISSTSIQPL----------GV 1577 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 A ++ L+ T+ G+ V D L ++ + ++E V + Sbjct: 1578 GARNVDRNQLQVITNDPGRVLVVQDFTGLPTLERRVQNILEELPVPTTES 1627 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 62/173 (35%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I++ ++ ++ + + S+++G YN + + ++ +N Sbjct: 1649 INLGRDNFQEVLQFVYSIVDAIYEDGDSIQVGLAQYNSDVTDEFFLKDYSTKPQILDAIN 1708 Query: 268 KLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ NT A+ H + + R+ + IT G+ ++ Sbjct: 1709 KVIYKGGRVANTGAAIRHLQAK---HFVKEAGSRIDQRVPQIAFIITGGK--------SS 1757 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + G+K+++V V + + K S F V+ ++EL E Sbjct: 1758 DDGQGASMEVAQKGVKVFAVGV--RNIDLEEVSKLASESATSFRVSTAQELSE 1808 >gi|308511201|ref|XP_003117783.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] gi|308238429|gb|EFO82381.1| hypothetical protein CRE_00574 [Caenorhabditis remanei] Length = 566 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 53/168 (31%), Gaps = 20/168 (11%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENT 275 +++ V I A + + E+K+ + + + T Sbjct: 409 LKFATTLVEQMPISPNATRVAIIQFAGKTKLRVLADFAQKKSAAELKTIIGRSHFFSGTT 468 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T A+ K + V+ TDG + +T + E + Sbjct: 469 FTNGALKTMADLFQKSKRADAKLK--------VVLFTDGY--------SAEDTSEGAEAL 512 Query: 336 RNAGMKIYSVAVSAPPE---GQDLLRKCTDSSGQFFAVNDSRELLESF 380 ++ G+ +Y+V +S L S FF +D EL ++F Sbjct: 513 KSQGVVVYTVGISTEKSTGLNMKELHGMATSPNHFFNASDFVELSKNF 560 >gi|317133828|ref|YP_004089739.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315450290|gb|ADU23853.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 1061 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 33/235 (14%), Positives = 77/235 (32%), Gaps = 35/235 (14%) Query: 148 ISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN 207 S MV+D + + ++ ++ S KS Sbjct: 534 FSKYMVVDKTEWFNNWRKIYNSYAAIFSAIPSYTAICVDCSGSMSTNDKS---------F 584 Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + + ++ N+ +++ V ++ ++NN + + Sbjct: 585 KDDNGVLTCYRNIAVQ-----NYVESMFVFDNASIITFESSASEECEMTNNKRTLSGKA- 638 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 TN A+ A EL KK +I ++DG+ + + + Sbjct: 639 SFYNRGGTNANSAIDIAIDEL-----------NHVYGKKNIILLSDGDVNVS------DD 681 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 ++ C +N G++I++VA+ + Q L + D+ G + L + ++ Sbjct: 682 NIKYC---KNNGIRIHTVALGSGANSQLLKQYANDTGGTPLTATTAEGLTKIYES 733 >gi|145295537|ref|YP_001138358.1| hypothetical protein cgR_1465 [Corynebacterium glutamicum R] gi|140845457|dbj|BAF54456.1| hypothetical protein [Corynebacterium glutamicum R] Length = 354 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 32/218 (14%), Positives = 72/218 (33%), Gaps = 23/218 (10%) Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 + T L+ + TT +I +L ++ +L++ Sbjct: 149 SKTVTDALIDAYTNQFRVPGETTFVLDVSGSMLGQ-RITLLKDTMSDLISGGATTDLANV 207 Query: 233 N--LSVRIGTIAYNIGIVGNQCTPL----SNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + ++ I ++ G L S + +++ R+ L T Y A+ AY Sbjct: 208 SLRGREKVSIIPFSFGPHEVISETLGAVGSPSRIDLQQRVEALQADGGTGIYDAVLAAY- 266 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL-NTLQICEYMRNAGMKIYSV 345 ++ +TDGE + Y L + +R+ + ++ + Sbjct: 267 ----------AESAGGDYIPSIVLMTDGELTAGRTYDQFLTEWNALPSNIRS--IPVFVI 314 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 D+ + + G+ F + +L E+F +I Sbjct: 315 LYG-EANVADMEQLAATTGGKTFDAIN-GDLDEAFKEI 350 >gi|194227183|ref|XP_001916967.1| PREDICTED: similar to inter-alpha globulin inhibitor H2 polypeptide [Equus caballus] Length = 946 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 62/170 (36%), Gaps = 11/170 (6%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + + + + + + + K + K+ P TN A+ A L Sbjct: 344 DQFSVVDFNHNVRTWRNDLVSATTTQVADAKKYIEKIQPSGGTNINEALLRAIFILNEAN 403 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 S +I ++DG+ + + + + + +++ + ++S+ + Sbjct: 404 NLGLLDPNS---VSLIILVSDGDPTVGELKLSKIQ-KNVKQNIQD-NISLFSLGIGFDV- 457 Query: 353 GQDLLRKCT-DSSGQFFAV----NDSRELLESFDKITDKIQEQSVRIAPN 397 D L++ + ++ G + + S +L + +++++ + P+ Sbjct: 458 DYDFLKRLSNENRGIAHRIYGNQDTSSQLKKFYNQVSTPLLRNIQFNYPH 507 >gi|120407060|ref|NP_766396.2| anthrax toxin receptor-like precursor [Mus musculus] Length = 641 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 69/196 (35%), Gaps = 18/196 (9%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 K + ++ ++ + + +++ N ++RI I Y+ PL+++ E+ Sbjct: 80 VLDKSGSVADNWIHIYSFAEGLVKKFTNPNLRISIITYSTEA--EVILPLTSDSKEINKS 137 Query: 266 L---NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 L + P T+ + A ++ + +I +TDG Sbjct: 138 LLVLKNIVPQGLTHMQKGLRKANEQIRKSTLGGRI------VNSVIIALTDGLLLLKPY- 190 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV-NDSRELLESFD 381 L+T++ + R G +Y+V V + L + F V L D Sbjct: 191 ---LDTMEEAKKARRMGAIVYTVGVFM--YSKQQLVNIAGDPDRCFGVDEGFSALEGVVD 245 Query: 382 KITDKIQEQSVRIAPN 397 +T K + + + P Sbjct: 246 PLTSKSCTEILSVQPT 261 >gi|91082539|ref|XP_973726.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Tribolium castaneum] Length = 842 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 40/337 (11%), Positives = 93/337 (27%), Gaps = 36/337 (10%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES---KAQYEI 113 + + K +KQ + + + + QY++ Sbjct: 218 NEISKNDDKTASLAEIKQNNSTSATVKFNPNIERQKQLATGLGTKEENGLAGQFVVQYDV 277 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 + L E L + VLD S SM+ +++ + N Sbjct: 278 ERDP-----KGGEVLLKDGYFVHFFAPSEVEALPKQVIFVLDTSGSMDGNRIKQLKEAMN 332 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 ++ +S D + + + + Sbjct: 333 SILSELKKEDVFNIVEFSSIVKVWNVDKVQVDYEVGEDPW---------PLYDSPEAPQK 383 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 ++ N + K + KLN Y T+ A+ + + KE Sbjct: 384 NKT--------NQVLPPAYKATDENKEKAKKVVEKLNAYGGTDIKSALEVGLKLVKKNKE 435 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + + ++F+TDGE + + + I+S++ Sbjct: 436 N-----KEDAHQPIIVFLTDGEPTMGETNTEKITSAISEMNSGETRAPIFSLSFG-DGAD 489 Query: 354 QDLLRKCTDSS----GQFFAVNDSR-ELLESFDKITD 385 ++ L+K + + + D+ +L E + +I+ Sbjct: 490 REFLQKISLKNLGFARHIYEAADASLQLQEFYKQISS 526 >gi|316956996|gb|EFV46978.1| putative von Willebrand factor type A domain protein [Trichinella spiralis] Length = 328 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/185 (15%), Positives = 61/185 (32%), Gaps = 13/185 (7%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + V + N + R+ I Y+ I +N+ EV Sbjct: 44 GSGSIGSAVFKNEILRFLREFINLFTIGSNHT-RLAIIQYSDQIRHELDFKEANSKAEVD 102 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 LN++ T T A+ ++ +++ + T + + I ITDG + ++ Sbjct: 103 EALNRVEYLTGLTKTGDALTDMFKIGFSKSRGARPI--ETGVHRVAIVITDGRSQDIVSF 160 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + +++V V + L + S + F V + +L Sbjct: 161 S--------ANEAKKSNVLMFAVGV-TDHVSEAELVEIAGSKDRVFLVKEFTDLNVRLRS 211 Query: 383 ITDKI 387 + K Sbjct: 212 LIQKA 216 >gi|149188995|ref|ZP_01867284.1| hypothetical protein VSAK1_21554 [Vibrio shilonii AK1] gi|148837181|gb|EDL54129.1| hypothetical protein VSAK1_21554 [Vibrio shilonii AK1] Length = 266 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 57/159 (35%), Gaps = 27/159 (16%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKE 293 R+G IA+ P++ + + + L +++ T A+ A + + Sbjct: 74 RVGLIAFGDDAYLQA--PVTEDFDTLSLLLEQMDVRMAGAGTALGDAIGVAVNHFEHSES 131 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 ++ K ++ +TDG+++ + + + G+ IY +A+ Sbjct: 132 NN----------KVLLLLTDGKDTTSQFP-----PVDAAHFAGERGVTIYPIAIGDATNV 176 Query: 353 -----GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 ++L + + GQ F D L + + + Sbjct: 177 GEEAIDLEMLARIASYTGGQVFEALDGDALAAVYQTLNE 215 >gi|260828797|ref|XP_002609349.1| hypothetical protein BRAFLDRAFT_99028 [Branchiostoma floridae] gi|229294705|gb|EEN65359.1| hypothetical protein BRAFLDRAFT_99028 [Branchiostoma floridae] Length = 421 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 22/189 (11%), Positives = 66/189 (34%), Gaps = 13/189 (6%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + +V++ + R+G + ++ ++ Sbjct: 39 SGSISADDFVSAKSFISRVVDAF-----DIAADFTRVGVVQFSSFFTEEFPLDRYSDKAS 93 Query: 262 VKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +K + + T +++ + E + + S + + + +TDG ++ Sbjct: 94 LKQAIGNIPQRGGGTLLGQVINYLVNTSFTEAKGARPL--SDGIPRIAVLMTDG----SA 147 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 T + +R +G+ +S+ V P +D L + + F V + + Sbjct: 148 HDNPTTVLAPAIDALRASGIIAFSIGVG-PSVNRDQLEAVAGDTDRVFLVGAYSVIDDIR 206 Query: 381 DKITDKIQE 389 D + ++++E Sbjct: 207 DLLVERVRE 215 >gi|162419860|ref|YP_001607152.1| von Willebrand factor type A domain-containing protein [Yersinia pestis Angola] gi|162352675|gb|ABX86623.1| von Willebrand factor type A domain protein [Yersinia pestis Angola] Length = 472 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 55/163 (33%), Gaps = 13/163 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYRELYNE 291 N++ + +AY+ + + + + + ++P T + + ++ Sbjct: 128 NITDTLSVVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKH 187 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 +I I+DG+ + + L + G+ I ++ + Sbjct: 188 LNREQVNR--------IILISDGQANTGPTSISEL--SDLARMAAKKGIAITTIGLGQD- 236 Query: 352 EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +DL+ S G V +S +L ++F K + + Sbjct: 237 YNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQ 279 >gi|320105612|ref|YP_004181202.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924133|gb|ADV81208.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 335 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 24/151 (15%), Positives = 69/151 (45%), Gaps = 15/151 (9%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + +N+++ + ++KL P T + A++ R + S ++ Sbjct: 155 DVKTDLAQDYTNSVDLLNQAIHKLRPGGGTAFFDALYTTCR------DQMLTLKESNTVR 208 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---APPEGQDLLRKCTD 362 + +I ++DG ++ + A +N + +++C+ +Y+++ + ++LR+ D Sbjct: 209 RALIVVSDGHDNQSRAQEN--DAIKMCQRAE---TIVYTISTNISPTKDAADEVLRRIAD 263 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G+ F N ++ F I ++++ Q + Sbjct: 264 ATGGRVFFPNRIEDVANGFHSIEEELRSQYL 294 >gi|281346820|gb|EFB22404.1| hypothetical protein PANDA_000280 [Ailuropoda melanoleuca] Length = 482 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/167 (17%), Positives = 59/167 (35%), Gaps = 18/167 (10%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNE 291 S ++ PL+ + +++ L L +P T + + A ++ Sbjct: 68 SPQMRLSFIVFSSQATIILPLTGDRSKISKGLEDLKRVSPVGETYIHEGLKLANEQIQKA 127 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 G + +I +TDG+ G + + R+ G ++Y V V Sbjct: 128 --------GGFKASSIIIALTDGKLDG----LVPSYAEKEAKISRSFGARVYCVGVL--D 173 Query: 352 EGQDLLRKCTDSSGQFFAVN-DSRELLESFDKITDKIQEQSVRIAPN 397 Q L + DS Q F V + L + I D+ + + + P+ Sbjct: 174 FEQAQLERIADSKDQVFPVKGGFQALKGIINSILDRSCTEILELRPS 220 >gi|198427770|ref|XP_002125315.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 835 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/235 (11%), Positives = 64/235 (27%), Gaps = 24/235 (10%) Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + ND+ P P + + + + Sbjct: 613 PADMTQNTCRNDSQWNLPKPCCARPCPPFALMDAVF---ILDSSSSIGTANWVTMKTFVR 669 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNT 277 N++ S A + R N+++ ++ + + T T Sbjct: 670 NVLGSFVLAPDAARFSVFRYNRH--VDNTTQILLNEFENDIDLFLNKFDDIPYDGSGTWT 727 Query: 278 YPAMHHAY-RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ HA L + V+ ITDG + + ++ + +R Sbjct: 728 GQALRHAKDTILLPGNGNRPGVKD------VVLIITDGRSQD--------DVREVSQQLR 773 Query: 337 NAGMKIYSVAV--SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF-DKITDKIQ 388 G+ Y++ + + L S L + F ++++ +I Sbjct: 774 AQGVLTYALGIVPPIGSLDETQLLDIAGSQSNLLIATSFSTLDQQFSNQLSSQIC 828 >gi|295093271|emb|CBK82362.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Coprococcus sp. ART55/1] Length = 612 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 44/381 (11%), Positives = 98/381 (25%), Gaps = 53/381 (13%) Query: 18 DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY 77 D+ + A D A ++G ++V+D + T T Sbjct: 86 DVMYAE-------EACDVATVAGDTAMVTDTSNSMYTEIAYDTREY-------------- 124 Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 D + T D+ A Y + L P + Sbjct: 125 ------DSVAENGFVSTADRPLSTFAADRDTASYSNVRSYIESGCLPPDGAVRIEEMLNY 178 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 + S + +++ + T Sbjct: 179 FTYDYRRKPEDGEKFSIYTEYSDCPWNKATKLMMVGINTDEIDFGDKKPSNLVFLIDTSG 238 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + K+ ++ +S L ++ + ++ + Y + Sbjct: 239 -----SMYEDNKLPLVQQSFAMLAENLDEND--------KVSIVTYAGEDTVVLSGTSGS 285 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + L+ + TN A+ AY N VI TDG+ + Sbjct: 286 EQYTINEALSSMTAEGCTNGGDAIITAYELAEKNFIEGGNNR--------VILATDGDLN 337 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 ++ L L I E + + + + + L D+ G + ++ + E Sbjct: 338 VGLTSESDLVDL-ITEEKKENNIFLSVLGFGTDNLKDNKLEALADNGDGSYAFIDSAYEA 396 Query: 377 LESFDKITDKIQEQSVRIAPN 397 + + D++ +A + Sbjct: 397 KKV---LVDEMGGTLNTVAKD 414 >gi|292490950|ref|YP_003526389.1| von Willebrand factor A [Nitrosococcus halophilus Nc4] gi|291579545|gb|ADE14002.1| Forkhead-associated protein [Nitrosococcus halophilus Nc4] Length = 510 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 50/136 (36%), Gaps = 15/136 (11%) Query: 259 LNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + + L+KL N+ M A EL N +K +I +T+G Sbjct: 105 RDALVTNLDKLTYTGRFRNSAAGMERALYELKNFGRPEA--------EKAIILLTNGPIE 156 Query: 318 GASAYQNTLNT----LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVND 372 ++ + + AG+K++ +A LL+ + G ++ Sbjct: 157 TGDEKRDRDFSRWMQEYLAHEAAEAGIKVFGIAF-TEAADFHLLQILAHTTGGTYYRAPQ 215 Query: 373 SRELLESFDKITDKIQ 388 + +L +F++I I Sbjct: 216 AVDLQSAFNRIRKVIT 231 >gi|74224199|dbj|BAE33710.1| unnamed protein product [Mus musculus] Length = 552 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 26/155 (16%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + N V + L + T T A+ R ++ S Sbjct: 407 KIAAVQFTYDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAIAFTVRNVFGPIRDS 466 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 467 PNK-------NFLVIVTDGQSYD--------DVRGPAAAAHDAGITIFSVGVAWAPLDD- 510 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 LR F + L + I Sbjct: 511 -LRDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 544 >gi|291409921|ref|XP_002721255.1| PREDICTED: matrilin 4 [Oryctolagus cuniculus] Length = 346 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 52/153 (33%), Gaps = 15/153 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + R+G I Y+ + + +++ L L P T T A+ +A Sbjct: 67 DVGPNATRVGVIQYSSQVQSVFPLGAFSRREDMERALRTLVPLAQGTMTGLAIQYAMNVA 126 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + R+ + + +TDG ++ R G++IY+V V Sbjct: 127 FSVAEGARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 176 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 G LR F V + E Sbjct: 177 RADVGS--LRAMASPPLDEHVFLVESFDLIQEF 207 >gi|325963511|ref|YP_004241417.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] gi|323469598|gb|ADX73283.1| von Willebrand factor type A-like protein [Arthrobacter phenanthrenivorans Sphe3] Length = 622 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 57/182 (31%), Gaps = 9/182 (4%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 P K+ ++ ++ + ++G V + P N ++ + Sbjct: 449 PGLTKLQRAKDAVLKALDHFTAEDEIGLAAFSQVGD-GPLTPGVVSPVAPFKTNKEDLIA 507 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 +LN+L + T + A+ + ++ ++DG+N Sbjct: 508 KLNELKAVDATPLFEAV--------SRFAGDQAKEYKDNFINAIVLLSDGKNDTTHPGDL 559 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 + Q+ + + ++++A + L S ++ D L E ++ Sbjct: 560 GGLSEQLGHQNHSTPVLVFTLAYGPDADVPTLREIARASGAHYYDATDPNRLEEVLGELV 619 Query: 385 DK 386 Sbjct: 620 TS 621 >gi|21114118|gb|AAM42187.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|66572853|gb|AAY48263.1| conserved hypothetical protein [Xanthomonas campestris pv. campestris str. 8004] Length = 618 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 34/328 (10%), Positives = 80/328 (24%), Gaps = 29/328 (8%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES-KAQYEIPTENLFLKGLIPSALTN 130 + + + + + Y L L P Sbjct: 134 MPPSPPTENRETYQTLSDNPIVQAAEQPVSTFSIDVDTGSYSNVRRFLNAGTLPPVDAVR 193 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 + + V + + + + + + Sbjct: 194 VEELINYFRYDDPAPTDGTPFAV--RTELAPTPWNTDTLLLRIGVAGREVPTAALPAANL 251 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S AP K+ +L S LV ++K RI + Y Sbjct: 252 VFLVDVSGSMGAP----DKLPLLQSSLKLLVRQLRKQD--------RITLVTYAGSTAVV 299 Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + ++ L T + AY+ ++ Sbjct: 300 LPPTSGAQQTRIVEAIDSLQSGGGTAGASGIELAYKAAQQAYLRGGINR--------ILL 351 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFA 369 TDG+ + + L + R +G+ + ++ L+ + D+ G + Sbjct: 352 ATDGDFNVGVTDFDQL--KGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLADAGDGAYAY 409 Query: 370 VNDSRELLESFDKITDKIQEQSVRIAPN 397 ++ + E + +T ++ IA + Sbjct: 410 IDSALEARKV---LTHELGSTLATIARD 434 >gi|291411005|ref|XP_002721795.1| PREDICTED: integrin alpha M [Oryctolagus cuniculus] Length = 1155 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 25/135 (18%), Positives = 46/135 (34%), Gaps = 17/135 (12%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N + + +L T+T + EL++ + +K ++ ITDGE Sbjct: 210 NPRALVKPIRQL--LGRTHTATGILKVVTELFHSSSGARA-----NARKVLVVITDGE-- 260 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD---LLRKCTDSSG--QFFAVND 372 + +TL + G+ Y V V + L F VN+ Sbjct: 261 ---KFGDTLEYEDVIPRAEREGVIRYVVGVGDAFNSEQSRQELNTIASKPSREHVFRVNN 317 Query: 373 SRELLESFDKITDKI 387 L +++ +KI Sbjct: 318 FEALNTIRNQLQEKI 332 >gi|260834995|ref|XP_002612495.1| hypothetical protein BRAFLDRAFT_120990 [Branchiostoma floridae] gi|229297872|gb|EEN68504.1| hypothetical protein BRAFLDRAFT_120990 [Branchiostoma floridae] Length = 443 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/140 (19%), Positives = 45/140 (32%), Gaps = 17/140 (12%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 PLS N V ++ L T + + A +E+ G + I + Sbjct: 59 IQPLSGNYGTVMQSVDNLKAGGTTPMFEGLMEAMKEILQRGGVLTLPGGRKMTPRV-ILM 117 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAG---------MKIYSVAVSAPPEGQDLLRKCTD 362 TDG + N L+ AG + I V +DLL+ Sbjct: 118 TDGYPD------DKENVLKAALSFGPAGWQAVGLPHPIPIACVGCG-DDVDKDLLQAIAK 170 Query: 363 SSGQFFAVNDSRELLESFDK 382 + + + D +L E F + Sbjct: 171 LTNGMYILGDVSQLSEFFRR 190 >gi|12850100|dbj|BAB28591.1| unnamed protein product [Mus musculus] Length = 431 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 29/136 (21%), Positives = 49/136 (36%), Gaps = 16/136 (11%) Query: 255 LSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ L +L P +T + A ++Y E + T +I + Sbjct: 3 LTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYYENSQGYRT------ASVIIAL 56 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDGE + + R+ G +Y V V + L + DS F VN Sbjct: 57 TDGELHEDLFFYSERE----ANRSRDLGAIVYCVGV--KDFNETQLARIADSKDHVFPVN 110 Query: 372 D-SRELLESFDKITDK 386 D + L I K Sbjct: 111 DGFQALQGIIHSILKK 126 >gi|297482040|ref|XP_002692521.1| PREDICTED: matrilin 4 [Bos taurus] gi|296480952|gb|DAA23067.1| matrilin 4 [Bos taurus] Length = 584 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 58/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 362 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 416 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 417 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDNISVW---- 470 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V ++ LR+ D + + + Sbjct: 471 ----AARAKEEGIVMYAVGVGKAV--EEELREIASEPAELHVSYSPDFSTMTHLLENLKG 524 Query: 386 KIQ 388 I Sbjct: 525 SIC 527 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 25/164 (15%), Positives = 54/164 (32%), Gaps = 15/164 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + R+G I Y+ + + +++ + + P T T A+ +A Sbjct: 67 DVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRAVVPLAQGTMTGLAIQYAMNVA 126 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + + + + +TDG ++ R G++IY+V V Sbjct: 127 FSVAEGARP--PEAHVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 176 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 G LR F V + E + ++ Sbjct: 177 RADVGS--LRAMASPPLNEHVFLVESFDLIQEFGRQFQGRLCTL 218 >gi|300796915|ref|NP_001178240.1| matrilin-4 [Bos taurus] Length = 584 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 58/183 (31%), Gaps = 20/183 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +++ +V+ + + R+G + ++ + EVK + Sbjct: 362 NFELVKRFVNQIVDFL-----DVSPEGTRVGLVQFSSRVRTEFPLGRYGTAAEVKQAVLA 416 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + E T T A+ H ++E + + + + + + TDG + + Sbjct: 417 VEYMERGTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDNISVW---- 470 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + G+ +Y+V V ++ LR+ D + + + Sbjct: 471 ----AARAKEEGIVMYAVGVGKAV--EEELREIASEPAELHVSYSPDFSTMTHLLENLKG 524 Query: 386 KIQ 388 I Sbjct: 525 SIC 527 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 25/164 (15%), Positives = 54/164 (32%), Gaps = 15/164 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + + R+G I Y+ + + +++ + + P T T A+ +A Sbjct: 67 DVGPNATRVGVIQYSSQVQSVFPLRAFSRREDMERAIRAVVPLAQGTMTGLAIQYAMNVA 126 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ E + + + + +TDG ++ R G++IY+V V Sbjct: 127 FSVAEGARP--PEAHVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQ 176 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 G LR F V + E + ++ Sbjct: 177 RADVGS--LRAMASPPLNEHVFLVESFDLIQEFGRQFQGRLCTL 218 >gi|125527010|gb|EAY75124.1| hypothetical protein OsI_03018 [Oryza sativa Indica Group] Length = 589 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 59/198 (29%), Gaps = 33/198 (16%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSR 265 ID + ++ + R+ + ++ + E+K+ Sbjct: 83 DGIDKAKTALQFVIRKLSDLD--------RLCIVTFSTNATRLCPLRFVTAAAQAELKAL 134 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++ L TN + + + + ++ + V+ ++DG + ++ Sbjct: 135 VDGLKADGMTNMKAGLETSMSVVDGRRLAAGRAVS-------VMLMSDGYQNDGGDARDV 187 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSRELLESFDKI 383 + +Y+ A + LL G F V DS L F ++ Sbjct: 188 HL----------KNVPVYTFGFGASHDSN-LLEAIARKSLGGTFNYVADSANLTGPFSQL 236 Query: 384 TD---KIQEQSVRIAPNR 398 I Q + + R Sbjct: 237 LGGLLTIIAQDLELTVTR 254 >gi|126722583|ref|NP_001075745.1| voltage-dependent calcium channel subunit alpha-2/delta-1 preproprotein [Oryctolagus cuniculus] gi|116409|sp|P13806|CA2D1_RABIT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|164763|gb|AAA81562.1| dihydropryridine-sensitive calcium channel alpha-2 subunit [Oryctolagus cuniculus] Length = 1106 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 29/334 (8%), Positives = 93/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I + + + Sbjct: 106 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNDSEPGSQRIKPVFIDDANFRRQVSYQH 165 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + E+ ++ + + + Sbjct: 166 AAVHIPTDIYEGSTIVLNELNWTSALDDVFKKNREEDPSLLWQVFGSATGLARYYPASPW 225 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 226 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 285 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 286 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 341 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG +Y ++ ++++ Sbjct: 342 AFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVF 387 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 388 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 421 >gi|312197712|ref|YP_004017773.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311229048|gb|ADP81903.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 372 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 14/181 (7%), Positives = 44/181 (24%), Gaps = 50/181 (27%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE-------LYNEKESSHNTIGSTRL 304 P + + ++ L L T + + + + Sbjct: 140 LVPPTTDSQKLLDALQNLTTSRGTAIGQGILTSIDAIADADPSVAPTGSAVSGNGTGPYA 199 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------ 352 ++ +TDG N+ ++ + +++Y++ Sbjct: 200 ADVIVVLTDGANTQG------VDPQTAAKQAAARRLRVYTIGFGTTTPAPMVCGSSQVGG 253 Query: 353 ------------------------GQDLLRK-CTDSSGQFFAVNDSRELLESFDKITDKI 387 + LR + G ++ ++ +L ++ + I Sbjct: 254 FGGFGGFGGFGGGGRLGDRSPLVIDEQALRDVAATTGGTYYRAQNAGQLQDALGTLPRNI 313 Query: 388 Q 388 Sbjct: 314 T 314 >gi|148685681|gb|EDL17628.1| integrin alpha M, isoform CRA_c [Mus musculus] Length = 1037 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NP 271 E ++ +K+ I + +P +S ++ + Sbjct: 171 KEFVSTVMEQFKKSKTLFSLMQYSDEFRIHFTFNDFKRNPSP--------RSHVSPIKQL 222 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T + REL+++ + K ++ ITDGE + + L+ + Sbjct: 223 NGRTKTASGIRKVVRELFHKTNGAR-----ENAAKILVVITDGE-----KFGDPLDYKDV 272 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 AG+ Y + V + L F V++ L +++ +K Sbjct: 273 IPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQNQLQEK 332 Query: 387 I 387 I Sbjct: 333 I 333 >gi|148685680|gb|EDL17627.1| integrin alpha M, isoform CRA_b [Mus musculus] Length = 1168 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NP 271 E ++ +K+ I + +P +S ++ + Sbjct: 171 KEFVSTVMEQFKKSKTLFSLMQYSDEFRIHFTFNDFKRNPSP--------RSHVSPIKQL 222 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T + REL+++ + K ++ ITDGE + + L+ + Sbjct: 223 NGRTKTASGIRKVVRELFHKTNGAR-----ENAAKILVVITDGE-----KFGDPLDYKDV 272 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 AG+ Y + V + L F V++ L +++ +K Sbjct: 273 IPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQNQLQEK 332 Query: 387 I 387 I Sbjct: 333 I 333 >gi|132626289|ref|NP_001076429.1| integrin alpha-M isoform 1 precursor [Mus musculus] Length = 1154 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NP 271 E ++ +K+ I + +P +S ++ + Sbjct: 170 KEFVSTVMEQFKKSKTLFSLMQYSDEFRIHFTFNDFKRNPSP--------RSHVSPIKQL 221 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T + REL+++ + K ++ ITDGE + + L+ + Sbjct: 222 NGRTKTASGIRKVVRELFHKTNGAR-----ENAAKILVVITDGE-----KFGDPLDYKDV 271 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 AG+ Y + V + L F V++ L +++ +K Sbjct: 272 IPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQNQLQEK 331 Query: 387 I 387 I Sbjct: 332 I 332 >gi|132626295|ref|NP_032427.2| integrin alpha-M isoform 2 precursor [Mus musculus] gi|148685679|gb|EDL17626.1| integrin alpha M, isoform CRA_a [Mus musculus] gi|162318464|gb|AAI56095.1| Integrin alpha M [synthetic construct] gi|162319590|gb|AAI56992.1| Integrin alpha M [synthetic construct] Length = 1153 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NP 271 E ++ +K+ I + +P +S ++ + Sbjct: 170 KEFVSTVMEQFKKSKTLFSLMQYSDEFRIHFTFNDFKRNPSP--------RSHVSPIKQL 221 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T + REL+++ + K ++ ITDGE + + L+ + Sbjct: 222 NGRTKTASGIRKVVRELFHKTNGAR-----ENAAKILVVITDGE-----KFGDPLDYKDV 271 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 AG+ Y + V + L F V++ L +++ +K Sbjct: 272 IPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQNQLQEK 331 Query: 387 I 387 I Sbjct: 332 I 332 >gi|74222716|dbj|BAE42227.1| unnamed protein product [Mus musculus] Length = 1064 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NP 271 E ++ +K+ I + +P +S ++ + Sbjct: 80 KEFVSTVMEQFKKSKTLFSLMQYSDEFRIHFTFNDFKRNPSP--------RSHVSPIKQL 131 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T + REL+++ + K ++ ITDGE + + L+ + Sbjct: 132 NGRTKTASGIRKVVRELFHKTNGAR-----ENAAKILVVITDGE-----KFGDPLDYKDV 181 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 AG+ Y + V + L F V++ L +++ +K Sbjct: 182 IPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQNQLQEK 241 Query: 387 I 387 I Sbjct: 242 I 242 >gi|74212839|dbj|BAE33379.1| unnamed protein product [Mus musculus] Length = 967 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NP 271 E ++ +K+ I + +P +S ++ + Sbjct: 170 KEFVSTVMEQFKKSKTLFSLMQYSDEFRIHFTFNDFKRNPSP--------RSHVSPIKQL 221 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T + REL+++ + K ++ ITDGE + + L+ + Sbjct: 222 NGRTKTASGIRKVVRELFHKTNGAR-----ENAAKILVVITDGE-----KFGDPLDYKDV 271 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 AG+ Y + V + L F V++ L +++ +K Sbjct: 272 IPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQNQLQEK 331 Query: 387 I 387 I Sbjct: 332 I 332 >gi|74213162|dbj|BAE41718.1| unnamed protein product [Mus musculus] Length = 1153 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NP 271 E ++ +K+ I + +P +S ++ + Sbjct: 170 KEFVSTVMEQFKKSKTLFSLMQYSDEFRIHFTFNDFKRNPSP--------RSHVSPIKQL 221 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T + REL+++ + K ++ ITDGE + + L+ + Sbjct: 222 NGRTKTASGIRKVVRELFHKTNGAR-----ENAAKILVVITDGE-----KFGDPLDYKDV 271 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 AG+ Y + V + L F V++ L +++ +K Sbjct: 272 IPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQNQLQEK 331 Query: 387 I 387 I Sbjct: 332 I 332 >gi|74212905|dbj|BAE33399.1| unnamed protein product [Mus musculus] Length = 1232 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NP 271 E ++ +K+ I + +P +S ++ + Sbjct: 170 KEFVSTVMEQFKKSKTLFSLMQYSDEFRIHFTFNDFKRNPSP--------RSHVSPIKQL 221 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T + REL+++ + K ++ ITDGE + + L+ + Sbjct: 222 NGRTKTASGIRKVVRELFHKTNGAR-----ENAAKILVVITDGE-----KFGDPLDYKDV 271 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 AG+ Y + V + L F V++ L +++ +K Sbjct: 272 IPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQNQLQEK 331 Query: 387 I 387 I Sbjct: 332 I 332 >gi|74191759|dbj|BAE32836.1| unnamed protein product [Mus musculus] Length = 1166 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NP 271 E ++ +K+ I + +P +S ++ + Sbjct: 170 KEFVSTVMEQFKKSKTLFSLMQYSDEFRIHFTFNDFKRNPSP--------RSHVSPIKQL 221 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T + REL+++ + K ++ ITDGE + + L+ + Sbjct: 222 NGRTKTASGIRKVVRELFHKTNGAR-----ENAAKILVVITDGE-----KFGDPLDYKDV 271 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 AG+ Y + V + L F V++ L +++ +K Sbjct: 272 IPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQNQLQEK 331 Query: 387 I 387 I Sbjct: 332 I 332 >gi|74178358|dbj|BAE32446.1| unnamed protein product [Mus musculus] Length = 1153 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NP 271 E ++ +K+ I + +P +S ++ + Sbjct: 170 KEFVSTVMEQFKKSKTLFSLMQYSDEFRIHFTFNDFKRNPSP--------RSHVSPIKQL 221 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T + REL+++ + K ++ ITDGE + + L+ + Sbjct: 222 NGRTKTASGIRKVVRELFHKTNGAR-----ENAAKILVVITDGE-----KFGDPLDYKDV 271 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 AG+ Y + V + L F V++ L +++ +K Sbjct: 272 IPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQNQLQEK 331 Query: 387 I 387 I Sbjct: 332 I 332 >gi|124956|sp|P05555|ITAM_MOUSE RecName: Full=Integrin alpha-M; AltName: Full=CD11 antigen-like family member B; AltName: Full=CR-3 alpha chain; AltName: Full=Cell surface glycoprotein MAC-1 subunit alpha; AltName: Full=Leukocyte adhesion receptor MO1; AltName: CD_antigen=CD11b; Flags: Precursor gi|52983|emb|CAA30479.1| unnamed protein product [Mus musculus] Length = 1153 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NP 271 E ++ +K+ I + +P +S ++ + Sbjct: 170 KEFVSTVMEQFKKSKTLFSLMQYSDEFRIHFTFNDFKRNPSP--------RSHVSPIKQL 221 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T + REL+++ + K ++ ITDGE + + L+ + Sbjct: 222 NGRTKTASGIRKVVRELFHKTNGAR-----ENAAKILVVITDGE-----KFGDPLDYKDV 271 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 AG+ Y + V + L F V++ L +++ +K Sbjct: 272 IPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQNQLQEK 331 Query: 387 I 387 I Sbjct: 332 I 332 >gi|26333265|dbj|BAC30350.1| unnamed protein product [Mus musculus] Length = 1036 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 57/181 (31%), Gaps = 25/181 (13%) Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NP 271 E ++ +K+ I + +P +S ++ + Sbjct: 170 KEFVSTVMEQFKKSKTLFSLMQYSDEFRIHFTFNDFKRNPSP--------RSHVSPIKQL 221 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T + REL+++ + K ++ ITDGE + + L+ + Sbjct: 222 NGRTKTASGIRKVVRELFHKTNGAR-----ENAAKILVVITDGE-----KFGDPLDYKDV 271 Query: 332 CEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 AG+ Y + V + L F V++ L +++ +K Sbjct: 272 IPEADRAGVIRYVIGVGNAFNKPQSRRELDTIASKPAGEHVFQVDNFEALNTIQNQLQEK 331 Query: 387 I 387 I Sbjct: 332 I 332 >gi|297624820|ref|YP_003706254.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] gi|297166000|gb|ADI15711.1| von Willebrand factor type A [Truepera radiovictrix DSM 17093] Length = 802 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 45/143 (31%), Gaps = 13/143 (9%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 E+ + + + P T PA A L + K VI +TDG+ Sbjct: 429 QGKREMTAAILNVAPQGGTIFEPAYREALDVLMAQ----------EAAVKHVIVLTDGKF 478 Query: 317 SGASAYQNTLNTLQ---ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 + + + + R +G+ ++A+ + Q L G+++ D Sbjct: 479 ADGTGPFSRGPAPDFGRLAALGRRSGITTSTIAIGDGADPQQLTTIARAGGGRYYEALDV 538 Query: 374 RELLESFDKITDKIQEQSVRIAP 396 L F +R P Sbjct: 539 STLPRIFTTEALSATRSLLREGP 561 >gi|296331311|ref|ZP_06873783.1| hypothetical protein BSU6633_09416 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305676297|ref|YP_003867969.1| hypothetical protein BSUW23_18125 [Bacillus subtilis subsp. spizizenii str. W23] gi|296151426|gb|EFG92303.1| hypothetical protein BSU6633_09416 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305414541|gb|ADM39660.1| putative exported protein [Bacillus subtilis subsp. spizizenii str. W23] Length = 227 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 19/128 (14%), Positives = 41/128 (32%), Gaps = 16/128 (12%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 S N + LN + P T A++ A +K V +TDGE Sbjct: 108 SFNEQSFLNSLNAIGPTGWTPIAKALNEAKSSFDQLDTKG---------EKVVYLLTDGE 158 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSR 374 + N ++ + + + + + + L G++F + Sbjct: 159 ETCGG------NPIKTAKELHKDNITVNVIGFDYKEGYKGQLNAIAKVGGGEYFPAYTQK 212 Query: 375 ELLESFDK 382 ++ + F + Sbjct: 213 DVEKIFTQ 220 >gi|294997271|ref|NP_001171103.1| integrin alpha-D isoform 2 [Mus musculus] Length = 1169 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 58/179 (32%), Gaps = 21/179 (11%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + L+ + ++ I + LS + + +L Sbjct: 177 KDFVKALMGQLASTSTSFS--LMQYSNILKTHFTFTEFKSSLS--PQSLVDAIVQLQ--G 230 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T + +EL++ K + KK +I ITDG+ +++ L + Sbjct: 231 LTYTASGIQKVVKELFHSKNGAR-----KSAKKILIVITDGQ-----KFRDPLEYRHVIP 280 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKC-----TDSSGQFFAVNDSRELLESFDKITDKI 387 AG+ Y++ V L++ S F V + L +I +KI Sbjct: 281 EAEKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSIQRQIQEKI 339 >gi|148685685|gb|EDL17632.1| mCG133512, isoform CRA_a [Mus musculus] Length = 1164 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 58/179 (32%), Gaps = 21/179 (11%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + L+ + ++ I + LS + + +L Sbjct: 177 KDFVKALMGQLASTSTSFS--LMQYSNILKTHFTFTEFKSSLS--PQSLVDAIVQLQ--G 230 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T + +EL++ K + KK +I ITDG+ +++ L + Sbjct: 231 LTYTASGIQKVVKELFHSKNGAR-----KSAKKILIVITDGQ-----KFRDPLEYRHVIP 280 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKC-----TDSSGQFFAVNDSRELLESFDKITDKI 387 AG+ Y++ V L++ S F V + L +I +KI Sbjct: 281 EAEKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSIQRQIQEKI 339 >gi|148685686|gb|EDL17633.1| mCG133512, isoform CRA_b [Mus musculus] Length = 1168 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 58/179 (32%), Gaps = 21/179 (11%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + L+ + ++ I + LS + + +L Sbjct: 177 KDFVKALMGQLASTSTSFS--LMQYSNILKTHFTFTEFKSSLS--PQSLVDAIVQLQ--G 230 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T + +EL++ K + KK +I ITDG+ +++ L + Sbjct: 231 LTYTASGIQKVVKELFHSKNGAR-----KSAKKILIVITDGQ-----KFRDPLEYRHVIP 280 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKC-----TDSSGQFFAVNDSRELLESFDKITDKI 387 AG+ Y++ V L++ S F V + L +I +KI Sbjct: 281 EAEKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSIQRQIQEKI 339 >gi|147898761|ref|NP_001080437.1| collagen alpha-1(VI) chain precursor [Xenopus laevis] gi|82210072|sp|Q801S8|CO6A1_XENLA RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|28703819|gb|AAH47255.1| Col6a1 protein [Xenopus laevis] Length = 1045 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 63/196 (32%), Gaps = 15/196 (7%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 T A P + + E ++ + + + + ++ A + Sbjct: 72 TSESVALRVKPFKTLVTQVKEFTKKFIDKL-TSRYYRCDRNLVWNAGALHYSDEVILINS 130 Query: 255 LSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 L+ ++ ++ + + T+T A+ E+ + K++I +TD Sbjct: 131 LTRDMKTLRDNVETVEYIGKGTHTDCAIKRGIEEVL-------IGGSHQKENKYLIVVTD 183 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ--FFAVN 371 G ++ G+K++SVA+S L + F Sbjct: 184 GHPLEGYKEPCG-GLEDAANEAKHLGIKVFSVAISPNHLEPR-LSVIASDASHRRNFTAT 241 Query: 372 DSRELLESFDKITDKI 387 + L + D+I + I Sbjct: 242 SAVGLTD--DEIDNTI 255 Score = 40.7 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 33/188 (17%), Positives = 63/188 (33%), Gaps = 27/188 (14%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN- 270 L KA + S R+ + Y+ N ++ ++ + Sbjct: 867 TSKSFVKLLAERFLKAKP-PPSGSARVSVVQYSGQNQQIVEAQFLTNYTVLEVPVDNMQF 925 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 TN A+ A EL + + K ++ +DG L+ Sbjct: 926 INGATNVVSALR-AVTEL-------YREDSLAGVNKKLLVFSDGNTQE------EKGLLK 971 Query: 331 ICEYMRNAGMKIYSVAVSAP---PEGQDLLRKCTDS------SGQFFAVNDSRELLE--S 379 + + ++AG++IY +AV + P Q +L + F V D LL+ Sbjct: 972 VVQDAQSAGIEIYVLAVGSRLNYPNLQVMLTGSAADIAGPFPEERLFRVPDYTSLLQGVR 1031 Query: 380 FDKITDKI 387 + I+ +I Sbjct: 1032 YQSISRRI 1039 >gi|151555227|gb|AAI48416.1| Integrin, alpha D [synthetic construct] Length = 1164 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 58/179 (32%), Gaps = 21/179 (11%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + L+ + ++ I + LS + + +L Sbjct: 172 KDFVKALMGQLASTSTSFS--LMQYSNILKTHFTFTEFKSSLS--PQSLVDAIVQLQ--G 225 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T + +EL++ K + KK +I ITDG+ +++ L + Sbjct: 226 LTYTASGIQKVVKELFHSKNGAR-----KSAKKILIVITDGQ-----KFRDPLEYRHVIP 275 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKC-----TDSSGQFFAVNDSRELLESFDKITDKI 387 AG+ Y++ V L++ S F V + L +I +KI Sbjct: 276 EAEKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSIQRQIQEKI 334 >gi|88911344|sp|Q3V0T4|ITAD_MOUSE RecName: Full=Integrin alpha-D; AltName: CD_antigen=CD11d; Flags: Precursor gi|74215609|dbj|BAE21419.1| unnamed protein product [Mus musculus] Length = 1168 Score = 61.8 bits (148), Expect = 2e-07, Method: Composition-based stats. Identities = 30/179 (16%), Positives = 58/179 (32%), Gaps = 21/179 (11%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + L+ + ++ I + LS + + +L Sbjct: 179 KDFVKALMGQLASTSTSFS--LMQYSNILKTHFTFTEFKSSLS--PQSLVDAIVQLQ--G 232 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T + +EL++ K + KK +I ITDG+ +++ L + Sbjct: 233 LTYTASGIQKVVKELFHSKNGAR-----KSAKKILIVITDGQ-----KFRDPLEYRHVIP 282 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKC-----TDSSGQFFAVNDSRELLESFDKITDKI 387 AG+ Y++ V L++ S F V + L +I +KI Sbjct: 283 EAEKAGIIRYAIGVGDAFREPTALQELNTIGSAPSQDHVFKVGNFVALRSIQRQIQEKI 341 >gi|149921110|ref|ZP_01909568.1| von Willebrand factor type A domain protein [Plesiocystis pacifica SIR-1] gi|149817997|gb|EDM77456.1| von Willebrand factor type A domain protein [Plesiocystis pacifica SIR-1] Length = 532 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 36/351 (10%), Positives = 91/351 (25%), Gaps = 40/351 (11%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 I + T+ + + + +A D + ++ P ++K Sbjct: 3 ITTLGALAALSMTLTSTAVWQITAPADEVVADASDADASDTDPVAAEEPGPTADGGDTKV 62 Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 + + P L + + R + + + + S Sbjct: 63 EADAP--PLDQSTFLAGKTLMVEGRVGHGRMLADARGETFLYVDVRNELSAAGSNTLDAR 120 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 N + P + K + I + Sbjct: 121 GNFRDAVARTSAEPLNLAIVID------------HSGSMKGQRERNALDAAAGMISRLRD 168 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN------PYENTNTYPAMHH 283 + ++YN + + ++ L P NT + Sbjct: 169 G-----DTVSVVSYNTKAHTIVPVT-TLDARNRDRVISDLRVGVASRPSGNTCVSCGVEA 222 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + L + ++ ++DGE + ++ ++ RN G+ I Sbjct: 223 GLQTLQGRRPG----------IDRMLLLSDGEANRGV--RDEPGIRRLAREARNRGVSIS 270 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S+ V + L+ +++G+ + L FD+ D + + + Sbjct: 271 SIGVDVD-YNEVLMSAIAREANGRHYFSETGSNLDAIFDQELDSLIQAIAK 320 >gi|159896929|ref|YP_001543176.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159889968|gb|ABX03048.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 579 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 32/216 (14%), Positives = 74/216 (34%), Gaps = 26/216 (12%) Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 P +T+ S + +D +A + ++ + N +V Sbjct: 350 VNLSRVQDPLNIMLVIDTSGSMGPSKEGLTDGGLDAAKIAALDFIDHL------PSNANV 403 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + G + L+N++ V+ +++L P T Y A+ +Y +L K + Sbjct: 404 GLIHF----GTLVTVDHSLTNDIGAVRQSISELKPEGQTAIYDALAISYTQLRRAKGQT- 458 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 F++ I+DG ++ + I A + Y + +++P L Sbjct: 459 ----------FIVLISDGADTASKGDNYD----SIVAKATKANIPTYIIGLTSPEFDGQL 504 Query: 357 LRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L D+ + +L + ++ ++ Q Sbjct: 505 LEDLQRDTKAMIYQTPSKEQLGGFYTEVAQEVSGQY 540 >gi|326505554|dbj|BAJ95448.1| predicted protein [Hordeum vulgare subsp. vulgare] gi|326515132|dbj|BAK03479.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 707 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 14/133 (10%), Positives = 39/133 (29%), Gaps = 15/133 (11%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N + +N L TN + + + + +I ++DG++ Sbjct: 310 NGRMQAIQAVNSLVDGGGTNISDGLKKGAKVIEH--------RRLKNPVCSIILLSDGQD 361 Query: 317 SGASAYQNTLNTLQICEYM-------RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 + + + + ++I++ + + SSG F Sbjct: 362 TYSVPTFDDELQTNHSALVPPSILPGTGNHVQIHTFGFGMDHDSAAMHAIAETSSGTFSF 421 Query: 370 VNDSRELLESFDK 382 ++ + F + Sbjct: 422 IDAEGSIQNGFAQ 434 >gi|301780866|ref|XP_002925835.1| PREDICTED: integrin alpha-D-like [Ailuropoda melanoleuca] Length = 1040 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 56/154 (36%), Gaps = 16/154 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 I Y+ + + + + +S ++ + T T + +EL++ K + Sbjct: 203 LIQYSNHLKIHFTFTQFKSSSSPQSLVDPIVQLNGLTFTATGIRTVVQELFHSKNGAR-- 260 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP---EGQD 355 +K +I ITDG+ Y++ L + AG+ Y++ V + Sbjct: 261 ---KTARKILIVITDGQ-----KYKDPLEYSDVIPQAERAGIVRYAIGVGDAFQKLTARQ 312 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 L F V++ L +++ +KI Sbjct: 313 ELNTIGSKPSQDHVFRVDNFAALSNIQEQLQEKI 346 >gi|302868694|ref|YP_003837331.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315504835|ref|YP_004083722.1| von willebrand factor type a [Micromonospora sp. L5] gi|302571553|gb|ADL47755.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315411454|gb|ADU09571.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 316 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 20/156 (12%), Positives = 49/156 (31%), Gaps = 24/156 (15%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 N P + + V + ++ L E T T A+ + S + ++ Sbjct: 139 NVLVPPTKDRAAVTTAIDGLVLAEATATGEAVFTCLEAI----RSVPADGAAGIPPARIV 194 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------GQDL 356 ++DG + + + Q A + + ++A + Sbjct: 195 LLSDGYRTSGRSVEEAAAAAQA------ANVPVSTIAFGTDSGQVDIGGQLQRVPVDRTA 248 Query: 357 LRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + G F+ + EL + + + I ++ Sbjct: 249 LSQLAETTQGFFYEAASASELKQVYQDMGSSIGYRT 284 >gi|149200157|ref|ZP_01877181.1| hypothetical protein LNTAR_03324 [Lentisphaera araneosa HTCC2155] gi|149136798|gb|EDM25227.1| hypothetical protein LNTAR_03324 [Lentisphaera araneosa HTCC2155] Length = 348 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 33/253 (13%), Positives = 80/253 (31%), Gaps = 59/253 (23%) Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP--------APAPANRKIDVLIES 216 + N + S + S S + + + ++ +S Sbjct: 75 TGEENSYSYKDSVDIVFSLDISGSMSSYDQPEDLAVNRRVIAEAINNKELHPRLHYAKKS 134 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-- 274 + ++ + R+G + + P +N+ +++RL +++ Sbjct: 135 IADFIDKRKS---------DRLGLVVFGAEAYS--VCPPTNDHEYLQNRLKEISTEYLGD 183 Query: 275 ----TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 TN A+ L S KK +I +TDG ++ + L Sbjct: 184 YNRQTNITAAISGGLARL----------RKSKAPKKIIILVTDGSHT----ANSNLTPRM 229 Query: 331 ICEYMRNAGMKIYSVAVSAP-------------------PEGQDLLRKCTD-SSGQFFAV 370 + + IY++ V ++LL++ + + G +F+V Sbjct: 230 AAKAAAKSDAVIYTIGVGNEVAWNVENFFGSSRLNASNSDFDEELLKEIAEKTGGLYFSV 289 Query: 371 NDSRELLESFDKI 383 ++ ++ + KI Sbjct: 290 REAEQMKDVLKKI 302 >gi|257884610|ref|ZP_05664263.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,501] gi|257820448|gb|EEV47596.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,501] Length = 1107 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 26/272 (9%), Positives = 76/272 (27%), Gaps = 26/272 (9%) Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 + + + +V+D S SM D L Sbjct: 257 NARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLADSGITDKINMGYVG 316 Query: 196 KSKYAPAPAPANRKI---DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S + + ++ D + ++ S + G + Sbjct: 317 YSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGNMLSVPNGHKKVI 376 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 L++ + ++ +++ ++N Y + ++ + G+T + Sbjct: 377 VLLTDGVPTFSYKVQRVHAQSSSNYYG----------TQFSNTQDQPGNTSRISNSYYAP 426 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTDS 363 D N + T+ ++ G++I+ + + + + +R+ S Sbjct: 427 DQNNLSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSS 486 Query: 364 ---SGQFFA-VNDSRELLESFDKITDKIQEQS 391 ++ + + ++ E K +I Sbjct: 487 DEKGDLYYESADHATDISEYLAKKAVQISATV 518 >gi|187927679|ref|YP_001898166.1| hypothetical protein Rpic_0583 [Ralstonia pickettii 12J] gi|187724569|gb|ACD25734.1| conserved hypothetical protein [Ralstonia pickettii 12J] Length = 414 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 30/305 (9%), Positives = 79/305 (25%), Gaps = 29/305 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M ++++V IDL+ + + ++QSA+D+ LS + T T Sbjct: 16 MVGLMLAVLLGMAGLVIDLSGLFVAKTELQSAVDSCALSAAQELDGASDALTRATNAGVT 75 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + + + A + ++ Y+ ++ I + Sbjct: 76 AGNANRVVYQASSASLANADVTFSTALNGTYSAAGVASSSSSYVKCKHSKSGITARLIKF 135 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM-------------------- 160 + + ++ + + I + ++ + Sbjct: 136 --VGGANSYAVAALAVATRVHAQSTCPIPVALIPKTGGTAPNYGFQVGEWVSMLYSGGST 193 Query: 161 ---EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE-- 215 E + N + + + T N + + Sbjct: 194 TPGEMGWYNLDGSTNANETKTEMDQGYCNSKIGDQLGTPGAKVSVDDNWNARFGIYKNKS 253 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 +A L N + + AY+ +N N+ + N + T Sbjct: 254 NASQLQPDFTGYSYTSTNWTNTVPQNAYSGTPASGSDKSAANFKNKRLAYANYDDT--GT 311 Query: 276 NTYPA 280 + Sbjct: 312 SVSGG 316 >gi|81897704|sp|Q8BVM2|ANTRL_MOUSE RecName: Full=Anthrax toxin receptor-like; Flags: Precursor gi|26346064|dbj|BAC36683.1| unnamed protein product [Mus musculus] Length = 641 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 69/196 (35%), Gaps = 18/196 (9%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 K + ++ ++ + + +++ N ++RI I Y+ PL+++ E+ Sbjct: 80 VLDKSGSVADNWIHIYSFAEGLVKKFTNPNLRISIITYSTEA--EVILPLTSDSKEINKS 137 Query: 266 L---NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 L + P T+ + A ++ + +I +TDG Sbjct: 138 LLVLKSIVPQGLTHMQKGLRKANEQIRKSTLGGRI------VNSVIIALTDGLLLLKPY- 190 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV-NDSRELLESFD 381 L+T++ + R G +Y+V V + L + F V L D Sbjct: 191 ---LDTMEEAKKARRMGAIVYTVGVFM--YSKQQLVNIAGDPDRCFGVDEGFSALEGVVD 245 Query: 382 KITDKIQEQSVRIAPN 397 +T K + + + P Sbjct: 246 PLTSKSCTEILSVQPT 261 >gi|157151714|ref|NP_001096685.1| collagen alpha-3(VI) chain [Canis lupus familiaris] gi|70571939|tpe|CAI77244.1| TPA: collagen, type VI, alpha 3 [Canis lupus familiaris] Length = 3169 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 18/178 (10%), Positives = 57/178 (32%), Gaps = 17/178 (9%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + + K++ + R + +N EV S ++ ++ +N Sbjct: 58 VREFLYDVIKSLAVGDSDF-RFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGSN 116 Query: 277 -TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T + + + E + + + + ++ +T G + A + + Sbjct: 117 ETGKGLEYVMQHHLTE---AAGSRAGDGVPQVIVVLTHGHSDDRLALPS--------AEL 165 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 ++A ++++ V + L++ F + + L + + +Q Sbjct: 166 KSADANVFAIGV--EDADEGALKEIASEPPNMHVFNLENFTSLHDIVGNLVSCVQSSV 221 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKN--LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1648 INFRRDSFQEVLRFVSEIVDTLYEGGDSIQVGLVQYNSDPTDEFFLKDFSTKQQIIDAIN 1707 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H N + R+ + IT G+ + Sbjct: 1708 KVVYKGGRHANTKVGIEHLR---LNHFVPEAGSRLDQRVPQIAFVITGGK--------SV 1756 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1757 EDAQEASLALTQKGVKVFAVGV--KNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 57/188 (30%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ + V S Sbjct: 1039 GVRSGFPLLKEFVQRVVESL-----DVGPDRVRVAVVQYSDRTRPEFYLNSYMDQQSVVS 1093 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S + + +I +T A Sbjct: 1094 AIRRLTILGGPTPNTGAALDFVLRNILISSAGSRIA---EGVPQLLIVLT--------AD 1142 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 ++ + ++ G + + ++ + A+ R+L Sbjct: 1143 RSGDDVRGPSVVLKRGGAV--PIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTIQQV 1200 Query: 383 ITDKIQEQ 390 ++D++ + Sbjct: 1201 VSDRVIQL 1208 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 46/154 (29%), Gaps = 18/154 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 +R+G + Y+ + +V + L N A+ Sbjct: 273 SVGAQQIRVGVVQYSDEPRTVFSLDTYSTKAQVLDAVKALAFTGGELANVGLALDFVVEN 332 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + + + + ++ I+ G +S ++ + L+ ++S + Sbjct: 333 HFT---RAGGSRVEEGVPQVLVLISAGPSSD--EIRDGVVALKQAS--------VFSFGL 379 Query: 348 SAPPEGQDLLRKCTDSSGQFFAV---NDSRELLE 378 A + L+ + F V +L E Sbjct: 380 GAQAASRAELQHIATNDNLVFTVPEFRSFGDLQE 413 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 60/164 (36%), Gaps = 17/164 (10%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT-- 275 ++ +++ VRIG + ++ + + V + L +L + Sbjct: 1457 DFVIKIVRRLNIGPNK--VRIGAVQFSNEVFPVFYLKTHKSQASVINALRQLRHRGGSPL 1514 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 NT A+ R L+ S + + + ++ G++ ++ + + + Sbjct: 1515 NTGKALEFVARNLF---VKSAGSRIEDGVPQHLVLFLGGKSQDDTSRFS--------QVI 1563 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 ++G+ S+ V + L+ T++ F V + REL Sbjct: 1564 SSSGIV--SLGVGNRNIDRAELQTITNNPRLVFTVREFRELPNI 1605 >gi|326504154|dbj|BAK02863.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 651 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 17/202 (8%), Positives = 59/202 (29%), Gaps = 30/202 (14%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK--SR 265 K+ +L ++ +++ + R+ ++++ ++ + Sbjct: 184 SKLALLKQAMRFVIDILG--------PDDRLSVVSFSSRARRVTRLTRMSDAGKALCVRA 235 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG------- 318 + L TN + A + L + + V+ ++DG+++ Sbjct: 236 VESLTARTGTNIAEGLRTAAKVLDERRHRNGV--------SCVVLLSDGQDNYTPMRQAF 287 Query: 319 --ASAYQNTLNTLQICEY---MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 L + +++ + + + G F + + Sbjct: 288 GRGLPNYAALLPPSFARTGTGAGDRATPVHTFGFGNDHDATAMHAVSEATGGTFSFIENE 347 Query: 374 RELLESFDKITDKIQEQSVRIA 395 + ++F + + V+ A Sbjct: 348 AVIQDAFAQCVGGLLSVVVQEA 369 >gi|295688686|ref|YP_003592379.1| von Willebrand factor type A [Caulobacter segnis ATCC 21756] gi|295430589|gb|ADG09761.1| von Willebrand factor type A [Caulobacter segnis ATCC 21756] Length = 583 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 29/322 (9%), Positives = 73/322 (22%), Gaps = 30/322 (9%) Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 E A + ++ + + A Y L + P + Sbjct: 111 DTERYPGAASNPVKRVAEEPVSTFSIDVD-TAAYANTRRFLNDGAVPPRDAIRVEELVNY 169 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 + V+ + + + + S Sbjct: 170 FDYGYPRPTSAQTPFRATVAIAPSP-WSSQRQILHIGLQGYAAPRSEAPPLNLVFLVDTS 228 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 P ++ + ++ L++ + + ++ Sbjct: 229 GSMMGP----DRLPLAQKALNVLIDQL----------RPQDRVAMVAYAGSAGAVLAPTD 274 Query: 258 NLNEVKS--RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 +++K L L +T + AY + VI ITDG+ Sbjct: 275 GRSKLKMRCALGALQAGGSTAGGRGLELAYGLAKQNFDKKAVNR--------VILITDGD 326 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + + R +G+ + +++ + Sbjct: 327 FNVG--IADPSRLKDFVADQRKSGVYLSVYGFGRGNYNDTMMQALAQNGNGV--AAYVDT 382 Query: 376 LLESFDKITDKIQEQSVRIAPN 397 L E+ + D + IA + Sbjct: 383 LNEARKLLRDDFESSLFPIADD 404 >gi|6090615|gb|AAF03259.1| dihydropyridine receptor alpha 2 subunit [Homo sapiens] Length = 1110 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 31/334 (9%), Positives = 94/334 (28%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNDSEPGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + E+ ++ + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG A N N ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDGGEERAQEIFNKYN--------KDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|15678583|ref|NP_275698.1| magnesium chelatase subunit [Methanothermobacter thermautotrophicus str. Delta H] gi|2621631|gb|AAB85061.1| magnesium chelatase subunit [Methanothermobacter thermautotrophicus str. Delta H] Length = 182 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 16/158 (10%), Positives = 46/158 (29%), Gaps = 10/158 (6%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + + RI + + P + + + + + + T + L Sbjct: 27 DAQRHRDRISVVGFRGRDA-RVIIPSTAHASSFRDAVESIRVGGTTPMAQGIQRGLEIL- 84 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 F++ ++DG + + ++ +R + + Sbjct: 85 ------REEKRHGEYVPFMVILSDGMPNVGTGRDPKREAVEAASRLREEEIPSTVINFER 138 Query: 350 PPEG--QDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 G + S G ++ ++D R+ + KI + Sbjct: 139 GSRGGRDLNMEIALASGGSYYDLHDLRDPSGAVAKIME 176 >gi|282892468|ref|ZP_06300802.1| hypothetical protein pah_c260o013 [Parachlamydia acanthamoebae str. Hall's coccus] gi|281497750|gb|EFB40114.1| hypothetical protein pah_c260o013 [Parachlamydia acanthamoebae str. Hall's coccus] Length = 373 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 65/170 (38%), Gaps = 20/170 (11%) Query: 248 VGNQCTPLSNNLNEVKSRLNKL-----NPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 +PL+ + + +LNK + T+ A+ + + K + ++ Sbjct: 167 TAQVLSPLTLDHQAIIDQLNKFSIVKHQDEDGTSIGYAIFKTANLIASTKHFAEELKEAS 226 Query: 303 RL---KKFVIFITDG----ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 ++ +TDG ++ +Y + G+++Y + V ++ Sbjct: 227 PYTIKNSIMLIVTDGFQDPNPLDREDQYRSIELEDAAKYAKEQGVRVYIINVEPRIASEE 286 Query: 356 L------LRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 ++K T+ + G+F+ ++ EL + I DK+++ + + R Sbjct: 287 FGSERRVMQKVTEITGGKFYLLDHIEELKNIYADI-DKLEKSILPVLSKR 335 >gi|154687789|ref|YP_001422950.1| YwmC [Bacillus amyloliquefaciens FZB42] gi|154353640|gb|ABS75719.1| YwmC [Bacillus amyloliquefaciens FZB42] Length = 228 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 29/214 (13%), Positives = 61/214 (28%), Gaps = 22/214 (10%) Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 N + S + SKY A R D + + + K + V Sbjct: 36 NNTAILLDASGSMAKRIDGVSKYNMAKEEIVRFADQIKSKSQVRMTVFGSEGNNKNSGKV 95 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + + + + LN + P T A+ A + Sbjct: 96 QSCESIRGVYGFQRF------DRQSFLNSLNGIGPTGWTPIAKALEDA---------KAS 140 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 T K V +TDGE + + ++ + +R +K+ + Sbjct: 141 FTGLHKLGSKSVFLLTDGEETCGG------DPVKTAKELRKQHIKVNVIGFDFKEGFNGQ 194 Query: 357 LRKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 L + G+++ + +++ F + E Sbjct: 195 LHEIAKAGGGKYYEAHSQKDMNRIFTMAASSLAE 228 >gi|325969627|ref|YP_004245819.1| von Willebrand factor type A [Vulcanisaeta moutnovskia 768-28] gi|323708830|gb|ADY02317.1| von Willebrand factor type A [Vulcanisaeta moutnovskia 768-28] Length = 498 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 36/299 (12%), Positives = 84/299 (28%), Gaps = 43/299 (14%) Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 E + + + + + S DV + Sbjct: 231 SKDGERMFEALRNVVRNAMSSMGQVKIVRFTDIDKYPTYVVSVREYKVGDNYFDV-DLQK 289 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP----APANRKIDVLIESA 217 + + + + + + KI++ ++ Sbjct: 290 TAMNLSRKTMMHKLFTNKDIVVKEYANVKTIDIVLCLDVSGSMRELSSGMPKIEIAKDAV 349 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN------- 270 + + K N + + + ++ L++V+ ++N Sbjct: 350 SQYIQFLSKT-----NDRLAMVLFNFRADVL--------WGLHQVRRYWQQMNYMLKYVY 396 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 TN A+ + L K +S K VI +TDG +S ++ Sbjct: 397 AGGGTNLANALERSREVLTRSKSNS----------KHVICVTDGRTVNSSMC------VK 440 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 +R G I ++A+ + + L+R G F ++ +L ++ I DK+ Sbjct: 441 EAVRLRRNGTTISTIAIGENSDDELLMRLSKIGGGLFIKISSIHDLGKAL--IMDKLHS 497 >gi|77747911|ref|NP_638263.2| hypothetical protein XCC2915 [Xanthomonas campestris pv. campestris str. ATCC 33913] gi|77761138|ref|YP_242283.2| hypothetical protein XC_1194 [Xanthomonas campestris pv. campestris str. 8004] Length = 597 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 34/328 (10%), Positives = 80/328 (24%), Gaps = 29/328 (8%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES-KAQYEIPTENLFLKGLIPSALTN 130 + + + + + Y L L P Sbjct: 113 MPPSPPTENRETYQTLSDNPIVQAAEQPVSTFSIDVDTGSYSNVRRFLNAGTLPPVDAVR 172 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 + + V + + + + + + Sbjct: 173 VEELINYFRYDDPAPTDGTPFAV--RTELAPTPWNTDTLLLRIGVAGREVPTAALPAANL 230 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S AP K+ +L S LV ++K RI + Y Sbjct: 231 VFLVDVSGSMGAP----DKLPLLQSSLKLLVRQLRKQD--------RITLVTYAGSTAVV 278 Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + ++ L T + AY+ ++ Sbjct: 279 LPPTSGAQQTRIVEAIDSLQSGGGTAGASGIELAYKAAQQAYLRGGINR--------ILL 330 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFA 369 TDG+ + + L + R +G+ + ++ L+ + D+ G + Sbjct: 331 ATDGDFNVGVTDFDQL--KGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLADAGDGAYAY 388 Query: 370 VNDSRELLESFDKITDKIQEQSVRIAPN 397 ++ + E + +T ++ IA + Sbjct: 389 IDSALEARKV---LTHELGSTLATIARD 413 >gi|270007560|gb|EFA04008.1| hypothetical protein TcasGA2_TC014157 [Tribolium castaneum] Length = 805 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/337 (11%), Positives = 93/337 (27%), Gaps = 36/337 (10%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES---KAQYEI 113 + + K +KQ + + + + QY++ Sbjct: 159 NEISKNDDKTASLAEIKQNNSTSATVKFNPNIERQKQLATGLGTKEENGLAGQFVVQYDV 218 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 + L E L + VLD S SM+ +++ + N Sbjct: 219 ERDP-----KGGEVLLKDGYFVHFFAPSEVEALPKQVIFVLDTSGSMDGNRIKQLKEAMN 273 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 ++ +S D + + + + Sbjct: 274 SILSELKKEDVFNIVEFSSIVKVWNVDKVQVDYEVGEDPW---------PLYDSPEAPQK 324 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 ++ N + K + KLN Y T+ A+ + + KE Sbjct: 325 NKT--------NQVLPPAYKATDENKEKAKKVVEKLNAYGGTDIKSALEVGLKLVKKNKE 376 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + + ++F+TDGE + + + I+S++ Sbjct: 377 N-----KEDAHQPIIVFLTDGEPTMGETNTEKITSAISEMNSGETRAPIFSLSFG-DGAD 430 Query: 354 QDLLRKCTDSS----GQFFAVNDSR-ELLESFDKITD 385 ++ L+K + + + D+ +L E + +I+ Sbjct: 431 REFLQKISLKNLGFARHIYEAADASLQLQEFYKQISS 467 >gi|326530406|dbj|BAJ97629.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 657 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 17/202 (8%), Positives = 59/202 (29%), Gaps = 30/202 (14%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK--SR 265 K+ +L ++ +++ + R+ ++++ ++ + Sbjct: 190 SKLALLKQAMRFVIDILG--------PDDRLSVVSFSSRARRVTRLTRMSDAGKALCVRA 241 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG------- 318 + L TN + A + L + + V+ ++DG+++ Sbjct: 242 VESLTARTGTNIAEGLRTAAKVLDERRHRNGV--------SCVVLLSDGQDNYTPMRQAF 293 Query: 319 --ASAYQNTLNTLQICEY---MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 L + +++ + + + G F + + Sbjct: 294 GRGLPNYAALLPPSFARTGTGAGDRATPVHTFGFGNDHDATAMHAVSEATGGTFSFIENE 353 Query: 374 RELLESFDKITDKIQEQSVRIA 395 + ++F + + V+ A Sbjct: 354 AVIQDAFAQCVGGLLSVVVQEA 375 >gi|13476808|ref|NP_108377.1| hypothetical protein mll8241 [Mesorhizobium loti MAFF303099] gi|14027569|dbj|BAB53838.1| mll8241 [Mesorhizobium loti MAFF303099] Length = 678 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 40/323 (12%), Positives = 73/323 (22%), Gaps = 79/323 (24%) Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 V D S S + P P + N Sbjct: 354 YPYNVNDASPSGGSANTGIGVGDPATMFVPMFAPDEPGNHWKLTQDPDEAAPVTYGAVNS 413 Query: 209 KIDVLIESAGN---LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-----NLN 260 + S L N + + N N TPL++ Sbjct: 414 WWNDDPTSGTGQSRLRNMAKYFQPRPIDAPALPAGNGPNYSCTTNPITPLTDVSVADGAT 473 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI-GSTRLKKFVIFITDG----- 314 +K+ ++ + P TN M +R + + + + K VI +TDG Sbjct: 474 SIKAAIDLMQPNGGTNVPEGMAWGWRVVSSGEPFTQGRRETEKGNDKVVIVLTDGANTYY 533 Query: 315 -----------------------------------------------ENSGASAYQNTLN 327 ++G Sbjct: 534 TPSSLGYSDPANSKSTYASYGYLNPGYNGTSVGRMFMGTSSAIGQLDYSNGNYTNALNEQ 593 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTDS------------SGQFFA 369 +C + A + + +VA+ D L+ C+ + F Sbjct: 594 MATLCNNAKAANIMVMTVALDLSTTKASDKLAIDALKSCSSDSRFRKDPTDPSKPAKLFW 653 Query: 370 VNDSRELLESFDKITDKIQEQSV 392 L F +I +++ V Sbjct: 654 NATGASLSNDFKEIGNELSNLRV 676 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 52/189 (27%), Gaps = 24/189 (12%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MT + + + A+D + + + +ALDAA + + DQ Sbjct: 36 MTVVAMVPLMGGLAIAVDFTEMNREKQMVTNALDAANFATARRLTE-------GATDDQL 88 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL-F 119 +L A +N+T N + AQ Sbjct: 89 KAYALDFFNANLNDID---------PASATLNVTLPSNTSGGGLLTMTAQLAYKPYFYPA 139 Query: 120 LKGLIPSALTNLSLRSTGIIERS-SENLAISICMVLDVSRSMEDLYLQ------KHNDNN 172 L+ + T+ + + + + + +VLD S SM L Sbjct: 140 FAQLVGKSATDANQKINFSVTSQVRLKNTLEVALVLDNSGSMTTLGTGSGQKRIDLLKTA 199 Query: 173 NMTSNKYLL 181 + L Sbjct: 200 SKQLVDTLA 208 >gi|319792023|ref|YP_004153663.1| von willebrand factor type a [Variovorax paradoxus EPS] gi|315594486|gb|ADU35552.1| von Willebrand factor type A [Variovorax paradoxus EPS] Length = 345 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 27/244 (11%), Positives = 79/244 (32%), Gaps = 56/244 (22%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 + S A ++ E+A + + + + V++G +A+ Sbjct: 83 SNQQTIILAMDVSGSMRAADVLPNRLVAAQEAAKSFIKDLPRT--------VKVGIVAFA 134 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK------------ 292 Q + N +++ + ++ T T A+ + L+ + Sbjct: 135 GSAQVAQLP--TTNHDDLVTAIDSFQLQRATATGNAIVVSLATLFPDAGIDVEQFSAPSR 192 Query: 293 -------------ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 + + +I +TDG+ + ++ L + + G Sbjct: 193 QRGTPIDQTEKKLKDFTPVAPGSFTSAAIIMLTDGQRTTG------VDPLDAAKAAADRG 246 Query: 340 MKIYSVAVSAPPE--------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 ++IY+V V ++ L+ + ++ ++F + +L + ++ ++ Sbjct: 247 VRIYTVGVGTVDGETIGFEGWSMRVRLDEETLKAVANKTNAEYFYAGTANDLKKVYETLS 306 Query: 385 DKIQ 388 K+ Sbjct: 307 SKLT 310 >gi|307718398|ref|YP_003873930.1| hypothetical protein STHERM_c06990 [Spirochaeta thermophila DSM 6192] gi|306532123|gb|ADN01657.1| hypothetical protein STHERM_c06990 [Spirochaeta thermophila DSM 6192] Length = 458 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 63/188 (33%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 P +I + + + + + R + P+ + + V Sbjct: 113 PDRMRITHAKRAIREFLPLLSGRDRVGLAVFNRTYRV----------IQPIVGDPSLVLE 162 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 +L+ + P+ AY ELY E + G ++ ++ ++DGEN ++ Sbjct: 163 KLDAIER-------PSREQAYTELYRSMEEALTDFGEEGRRRVLVVLSDGENFPVDPSES 215 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 + G+ Y + L+ + G+ F ++ EL + I Sbjct: 216 PSTPGTAIDLAHRYGITCYVIHFGTE--KDRLIGDLASETGGRVFDARNALELASVYTAI 273 Query: 384 TDKIQEQS 391 +++ ++ Sbjct: 274 QEQVLQEY 281 >gi|149051203|gb|EDM03376.1| coagulation factor C homolog (Limulus polyphemus) (predicted), isoform CRA_a [Rattus norvegicus] gi|169642483|gb|AAI60874.1| Coch protein [Rattus norvegicus] Length = 552 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 46/155 (29%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + N V + L + T T A+ R ++ S Sbjct: 407 KIAAVQFTYDQRTEFSFTDYNTKENVLAVLANIRYMSGGTATGDAISFTVRNVFGPIRDS 466 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ + +AG+ I+SV V+ P Sbjct: 467 PNK-------NFLVIVTDGQSYD--------DVRGPAAAAHDAGITIFSVGVAWAPLDD- 510 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I Sbjct: 511 -LKDMASKPKESHAFFTREFTGLEPIVSDVIRGIC 544 >gi|125602048|gb|EAZ41373.1| hypothetical protein OsJ_25890 [Oryza sativa Japonica Group] Length = 757 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 15/133 (11%), Positives = 44/133 (33%), Gaps = 16/133 (12%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 KS + L+ TN + A + VI ++DG+++ Sbjct: 378 GKASAKSAVESLHADGCTNILEGLVEAAKVFD--------GRRYRNAVASVILLSDGQDN 429 Query: 318 G-----ASAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 A + ++ + + +G + +++ + + ++ G F Sbjct: 430 YNVNGGWGASNSKNYSVLVPPSFKRSGDRRLPVHTFGFGTDHDASAMHTIAEETGGTFSF 489 Query: 370 VNDSRELLESFDK 382 + + + ++F + Sbjct: 490 IENQAVVQDAFAQ 502 >gi|22125371|ref|NP_668794.1| hypothetical protein y1474 [Yersinia pestis KIM 10] gi|45442407|ref|NP_993946.1| hypothetical protein YP_2631 [Yersinia pestis biovar Microtus str. 91001] gi|149365130|ref|ZP_01887165.1| putative membrane protein [Yersinia pestis CA88-4125] gi|218930054|ref|YP_002347929.1| hypothetical protein YPO3007 [Yersinia pestis CO92] gi|21958254|gb|AAM85045.1|AE013750_5 hypothetical [Yersinia pestis KIM 10] gi|45437272|gb|AAS62823.1| putative membrane protein [Yersinia pestis biovar Microtus str. 91001] gi|115348665|emb|CAL21610.1| putative membrane protein [Yersinia pestis CO92] gi|149291543|gb|EDM41617.1| putative membrane protein [Yersinia pestis CA88-4125] Length = 509 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 55/163 (33%), Gaps = 13/163 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYRELYNE 291 N++ + +AY+ + + + + + ++P T + + ++ Sbjct: 165 NITDTLSVVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKH 224 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 +I I+DG+ + + L + G+ I ++ + Sbjct: 225 LNREQVNR--------IILISDGQANTGPTSISEL--SDLARMAAKKGIAITTIGLGQD- 273 Query: 352 EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +DL+ S G V +S +L ++F K + + Sbjct: 274 YNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQ 316 >gi|108808190|ref|YP_652106.1| hypothetical protein YPA_2196 [Yersinia pestis Antiqua] gi|108811539|ref|YP_647306.1| hypothetical protein YPN_1376 [Yersinia pestis Nepal516] gi|145599390|ref|YP_001163466.1| hypothetical protein YPDSF_2114 [Yersinia pestis Pestoides F] gi|165926883|ref|ZP_02222715.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165935923|ref|ZP_02224493.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|166011266|ref|ZP_02232164.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166212622|ref|ZP_02238657.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. B42003004] gi|167398885|ref|ZP_02304409.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167422701|ref|ZP_02314454.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167424397|ref|ZP_02316150.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|167467552|ref|ZP_02332256.1| hypothetical protein YpesF_06584 [Yersinia pestis FV-1] gi|170023658|ref|YP_001720163.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186896129|ref|YP_001873241.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|270489996|ref|ZP_06207070.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294504758|ref|YP_003568820.1| membrane protein [Yersinia pestis Z176003] gi|108775187|gb|ABG17706.1| membrane protein [Yersinia pestis Nepal516] gi|108780103|gb|ABG14161.1| putative membrane protein [Yersinia pestis Antiqua] gi|145211086|gb|ABP40493.1| membrane protein [Yersinia pestis Pestoides F] gi|165916068|gb|EDR34675.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. IP275] gi|165921234|gb|EDR38458.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. F1991016] gi|165989944|gb|EDR42245.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. E1979001] gi|166206553|gb|EDR51033.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. B42003004] gi|166958408|gb|EDR55429.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Orientalis str. MG05-1020] gi|167051389|gb|EDR62797.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Antiqua str. UG05-0454] gi|167056279|gb|EDR66048.1| von Willebrand factor type A domain protein [Yersinia pestis biovar Mediaevalis str. K1973002] gi|169750192|gb|ACA67710.1| von Willebrand factor type A [Yersinia pseudotuberculosis YPIII] gi|186699155|gb|ACC89784.1| von Willebrand factor type A [Yersinia pseudotuberculosis PB1/+] gi|262362820|gb|ACY59541.1| membrane protein [Yersinia pestis D106004] gi|262366744|gb|ACY63301.1| membrane protein [Yersinia pestis D182038] gi|270338500|gb|EFA49277.1| von Willebrand factor type A domain protein [Yersinia pestis KIM D27] gi|294355217|gb|ADE65558.1| membrane protein [Yersinia pestis Z176003] Length = 472 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 55/163 (33%), Gaps = 13/163 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYRELYNE 291 N++ + +AY+ + + + + + ++P T + + ++ Sbjct: 128 NITDTLSVVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKH 187 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 +I I+DG+ + + L + G+ I ++ + Sbjct: 188 LNREQVNR--------IILISDGQANTGPTSISEL--SDLARMAAKKGIAITTIGLGQD- 236 Query: 352 EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +DL+ S G V +S +L ++F K + + Sbjct: 237 YNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQ 279 >gi|323320820|gb|ADX36428.1| complement factor B [Apostichopus japonicus] Length = 913 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 34/202 (16%), Positives = 68/202 (33%), Gaps = 20/202 (9%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + ++ + A LV I + S+R+G + +N T ++ + Sbjct: 467 SSSVGEDNFNMAKKFAKELVKEIGVTD---RPNSLRVGALVFNSEAEIGFHTVAFDSTAD 523 Query: 262 VKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 V ++ + TN A+ L N + + K V ITDG+ + Sbjct: 524 VLDAIDSMEYKEGGTNI----AKAFEVLSNVMIPQTAKLNREKSFKTVFLITDGDATEGG 579 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + + +R+ + I+ + +S + L F + D L E Sbjct: 580 ------DAQEDARAVRDQDVTIHCIGISENATRRT-LSGMASEPLSEHLFFLKDYSTLEE 632 Query: 379 SFDKI--TDKIQEQSVRIAPNR 398 F +I I ++P R Sbjct: 633 -FIQIVTNQTIDYSECGVSPRR 653 >gi|260834336|ref|XP_002612167.1| hypothetical protein BRAFLDRAFT_88906 [Branchiostoma floridae] gi|229297541|gb|EEN68176.1| hypothetical protein BRAFLDRAFT_88906 [Branchiostoma floridae] Length = 954 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 65/189 (34%), Gaps = 27/189 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + + + ++VNS + + R+G + Y+ + Sbjct: 189 SNSVGTANFEKVKQFVVDVVNSF-----DVSPTATRVGVVQYSNKNTLMFNLGDKVDKPS 243 Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + +N + T T A+ + ++ + K + +TDGE+ + Sbjct: 244 TVNAINSIQYQGGGTYTGYALKYVRQK---------AAWRGGNVPKVAVVLTDGESYDSV 294 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + + + + G+++++V V G L+ + +ND +L Sbjct: 295 SV--------AAQNLLSDGVEVFAVGV----AGFKLIAIANSNETNVIELNDFNDLTTKI 342 Query: 381 DKITDKIQE 389 +I K+ Sbjct: 343 GEIAKKVCS 351 >gi|282900974|ref|ZP_06308907.1| hypothetical protein CRC_02390 [Cylindrospermopsis raciborskii CS-505] gi|281194065|gb|EFA69029.1| hypothetical protein CRC_02390 [Cylindrospermopsis raciborskii CS-505] Length = 487 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 33/149 (22%), Positives = 61/149 (40%), Gaps = 20/149 (13%) Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 N G PL+N++N + + +N+L +T + A +L +T Sbjct: 92 NFGSEVATPAPLTNDINILNNAINQLLENGSTPMGEGIDTAQGQLQ-----------ATT 140 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 L K +I TDG + N+ +RNAG+K+ +AV+ + L + T Sbjct: 141 LNKNIILFTDGIPDDPNFAYNS------ALSVRNAGIKL--IAVATGGADTNYLTQITGD 192 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQSV 392 F +S + ++F + I +Q + Sbjct: 193 RSLVFYA-NSGQFDQAFSQAEAVIYKQLI 220 >gi|262527579|sp|P54289|CA2D1_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor Length = 1103 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 31/334 (9%), Positives = 94/334 (28%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNDSEPGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + E+ ++ + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG A N N ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDGGEERAQEIFNKYN--------KDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|149410435|ref|XP_001512838.1| PREDICTED: similar to Coch-5B2 gene product [Ornithorhynchus anatinus] Length = 692 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 19/160 (11%), Positives = 48/160 (30%), Gaps = 20/160 (12%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYN 290 ++ ++ + + V + + ++ T T A+ R ++ Sbjct: 542 SDIGTKVAAVQFTYDQRTEFSFTDYTTKENVLAVIRQIRYMSGGTATGDAVAFTVRNVFG 601 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 S N F++ +TDG++ + AG+ ++S+ ++ Sbjct: 602 PLRDSPNK-------NFLVVLTDGQSYD--------DVRGPAAAAHKAGITVFSIGMAWA 646 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 P L+ F + L + + I Sbjct: 647 PLDD--LKDMASEPKESHTFFTREFAGLEQIVTDVIRGIC 684 >gi|291544120|emb|CBL17229.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Ruminococcus sp. 18P13] Length = 1117 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 42/382 (10%), Positives = 107/382 (28%), Gaps = 53/382 (13%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP 100 +V D + + I I+++ + + A + +K + N Sbjct: 392 SNPLVRDTDGDGISDGEAVAQGIDPTGIEENPAVRTQSVQTALESTEKPLFDNVSVTMNV 451 Query: 101 LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM 160 + + + I E+ ++ + + + S + ++ D Sbjct: 452 GGNLNQHLSIRNIEKEDTLSANVVGALSAPIEITSDASFTSAKITFTYDEALLGDTPAEN 511 Query: 161 EDLYLQKH-NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + N + ++ P ++ + D E+ Sbjct: 512 LAVMWYDRENRRYVILDKDTVVDPNAHTVSYTTTHFSTYLVVDREVWY---DCWRENIDY 568 Query: 220 LVNSIQKAIQEKKN----------------LSVRIGTIAYNIGIVGNQ---CTPLSNN-- 258 + E + + + ++ S+N Sbjct: 569 RSGDAGTSQLEPYDIGLCVDVSGSMYGDRLEKAKTALNTFIDAMLPQDNACMVSFSDNAY 628 Query: 259 --------LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 ++SR N+L TNT + L ++ + K +I Sbjct: 629 LVAGYGASKEVMRSRTNQLRDLYGTNTDVGLSKTISILADQGR--------SDASKMIIM 680 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFA 369 I DG+ + + + AG+ +Y++ V + LL+K D + G+++ Sbjct: 681 ICDGDVNY---------IQGTVDAAKAAGIAVYTINVVSGDND--LLQKIADETGGEYYY 729 Query: 370 VNDSRELLESFDKITDKIQEQS 391 + E++ + I + Sbjct: 730 AATTEEVVSQVEAIRGETVSAV 751 >gi|301623011|ref|XP_002940816.1| PREDICTED: integrin alpha-M-like [Xenopus (Silurana) tropicalis] Length = 907 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 25/137 (18%), Positives = 47/137 (34%), Gaps = 17/137 (12%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + N + +N+L T T A+ +L+ + +K +I ITDGE Sbjct: 290 TKDHNSLVRNINQLK--GATWTATAIQKVLSQLFIPSRGAR-----DGSQKLLIVITDGE 342 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---APPEGQDLLRKCTDSSG--QFFAV 370 Y ++L+ G+ +++ V + L + V Sbjct: 343 -----KYGDSLDYSIPIAEAERKGIVRFAIGVGRAFSEDTAYSELITIASQPSDKYVYRV 397 Query: 371 NDSRELLESFDKITDKI 387 D L + + DKI Sbjct: 398 GDFSALSKFRKDLQDKI 414 >gi|326328639|ref|ZP_08194979.1| LigA [Nocardioidaceae bacterium Broad-1] gi|325953600|gb|EGD45600.1| LigA [Nocardioidaceae bacterium Broad-1] Length = 871 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 59/192 (30%), Gaps = 21/192 (10%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 AP KID + +A + + + + + + Sbjct: 654 LIDNSGSMNDEVAPGAAKIDRVQSAANAAIGLLAPKDE--------LAVWTFGSSVHKTA 705 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNT-YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P+ N +++V++ + + T A+ A+ L + + K V+ Sbjct: 706 LAPMGNRISQVRAEIGAIEAGGTTTQLPAAVQAAHDALAQTNDPDNPKT------KAVVL 759 Query: 311 ITDGENSGASAYQNTLNTLQICEYM------RNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 +TDG + + + + + ++IY++ + L + S Sbjct: 760 LTDGATNLTPDGADEEENKAANDALVADIRGSESHVRIYTIPYGNSADKCLLEKVAAASG 819 Query: 365 GQFFAVNDSREL 376 +++ L Sbjct: 820 ARYYGAGARESL 831 >gi|269125771|ref|YP_003299141.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268310729|gb|ACY97103.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 601 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 19/125 (15%), Positives = 42/125 (33%), Gaps = 11/125 (8%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + ++ RL L T Y AY ++ + V+ +TDG+N Sbjct: 482 RSLLRERLLGLTLSGGTGLYNTTAAAYEKMTGSRRGDAI--------NAVVVMTDGKNER 533 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + L R ++++++ + L R + G + +D + Sbjct: 534 PGGLDLDGLIAKLGAR--REESVRVFTIGYGEDADQNVLRRIAEAADGAAYDSSDPNTIG 591 Query: 378 ESFDK 382 + F + Sbjct: 592 DIFTE 596 >gi|254447588|ref|ZP_05061054.1| conserved hypothetical protein [gamma proteobacterium HTCC5015] gi|198262931|gb|EDY87210.1| conserved hypothetical protein [gamma proteobacterium HTCC5015] Length = 1197 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 36/286 (12%), Positives = 74/286 (25%), Gaps = 64/286 (22%) Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA-PANRKIDVLIESAGNL 220 D + + WS ++ + Sbjct: 205 DNSTGGDLPSAA----YTSSEDFEDRLNWSSEDYSARVVTGNYLNWQENHNDEKTVTRLK 260 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL-SNNLNEVKSRLNKLNPYENTNTYP 279 +V G + + L +++ ++ ++ L T Sbjct: 261 AVQNVLHDVIGGMDNVNFGLMDFYGKGRVIYPVSLVNDDRVQLLKTVDDLEANGGTPLQE 320 Query: 280 AM------------HHAYRELYNEKESSHNTIG----STRLKKFVIFITDGENSGASAYQ 323 A+ ++ R + + ES T + V+ +TDGE + Sbjct: 321 ALHESGRYLMGDTYYYGERGVSSAFESDGKTYDSPILNECQANAVVLLTDGEPTDYGTDN 380 Query: 324 ---------NTLNTLQICEYMRNAG-----------------------------MKIYSV 345 + N + C R+ G + ++V Sbjct: 381 VIHDMIKPVHEKNGTRRCTSYRDNGNDYTRYCLPEIAQFMRDFDLSSGLDGDQTAQTFTV 440 Query: 346 AVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 + Q+LL+ G +F NDS EL ++ I KI+ + Sbjct: 441 GFKS---DQELLQDTASKGQGNYFQANDSVELTQALKDIVRKIKSE 483 >gi|118081930|ref|XP_414992.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 1794 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 41/386 (10%), Positives = 90/386 (23%), Gaps = 47/386 (12%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKH------LKQGSYIRENAGDIAQKA 89 A L+ + I + + ++ T + +++ + + + A+ Sbjct: 288 ASLADSSPIHTIQDLRITNTAVNSLKLSWRRLPGATHYKISWVPFSGGLETSQLVGAESN 347 Query: 90 QINITKDKNNPLQYIAESKAQYEIPTENLFLK----GLIPSALTNLSLRSTGIIERSSEN 145 I+ K + I S L L L + + + Sbjct: 348 SFTISNLKESTTYTIGVSAVIGGQEGNPTLLTAKTLDLPKVTEFTLQEAAETSVLLTWAA 407 Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP-APA 204 + +L S + + L F + + P Sbjct: 408 VPGVSTYILTWRLSSASDLSLEVLSAALRSYRVRGLSSEETYLFSIRPVFGDREGPETTL 467 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKN------------------LSVRIGTIAYNIG 246 G LV+ N ++ Y+ Sbjct: 468 IGQTVCGKFKADIGFLVDESSSIGWSNFNKVKDFLFRIISYFPKIGPEGTQVAVAQYSEE 527 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + N + +L+ NT T + +EL+ Sbjct: 528 PRAAFHFNQHQDRNGALKAVKELHYAGGNTKTGRGIAFMLKELFQPSRG-----MRPEFP 582 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC--TDS 363 ++ +TDG + G++I +AV L + Sbjct: 583 HVLMLVTDGRSQDDVLPP--------ARAAHALGIRI--IAVGVSGADPAELNDILLQQN 632 Query: 364 SGQFFAVNDSRELLESFDKITDKIQE 389 F V+ E + ++ + I Sbjct: 633 LQNVFYVSTFDEFPQILRELIEVICS 658 >gi|54112390|ref|NP_000713.2| voltage-dependent calcium channel subunit alpha-2/delta-1 [Homo sapiens] gi|109658756|gb|AAI17469.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] gi|109659118|gb|AAI17471.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] gi|119597396|gb|EAW76990.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Homo sapiens] Length = 1091 Score = 61.5 bits (147), Expect = 2e-07, Method: Composition-based stats. Identities = 31/334 (9%), Positives = 94/334 (28%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNDSEPGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + E+ ++ + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG A N N ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDGGEERAQEIFNKYN--------KDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|24373750|ref|NP_717793.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella oneidensis MR-1] gi|24348130|gb|AAN55237.1|AE015661_7 inter-alpha-trypsin inhibitor domain protein [Shewanella oneidensis MR-1] Length = 760 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 52/149 (34%), Gaps = 18/149 (12%) Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + NL + +N+L T A+ A K+ + + Sbjct: 428 SPVPLPATAENLAIARQFVNRLQADGGTEMSLALEAAL-----PKQRPSRAASENNVLQQ 482 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGM---KIYSVAVSAPPEGQDLLRKCTDSS 364 VIF+TDG ++ + + + ++++V + + P + R Sbjct: 483 VIFMTDG----------SVGNEEALFELIRHQIGDNRLFTVGIGSAPNSHFMQRAAELGR 532 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQSVR 393 G F + D E+ + +++ KIQ + Sbjct: 533 GTFTYIGDVDEVEQKINQLLTKIQYPVLT 561 >gi|326789709|ref|YP_004307530.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540473|gb|ADZ82332.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 593 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 61/170 (35%), Gaps = 19/170 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPL---SNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYN 290 ++G + Y + L ++ +K+ ++ + E T+ + A + L Sbjct: 69 DNQVGVVTYGSDVSQTYPMSLVKNQSDKENIKNFVDGITRDLEYTDITSGLKEAVKMLN- 127 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT-----LNTLQICEYMRNAGMKIYSV 345 ++ ++ TDG N+ T + I ++ G IY++ Sbjct: 128 -------QRNASGNSPLIVVFTDGNNAIGGVANRTPADIDKDLAAIISQAQSEGYPIYTI 180 Query: 346 AVSAPPE-GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ + + L K + D+ + FA D EL + +I V+ Sbjct: 181 GLNDNGKLNEAYLEKISVDTKAKAFATKDPAELPDILTEIFAAHSNLKVQ 230 >gi|194209576|ref|XP_001915257.1| PREDICTED: similar to voltage-dependent calcium channel alpha-2 delta subunit [Equus caballus] Length = 1016 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 29/334 (8%), Positives = 92/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I + + Sbjct: 78 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNDSEPGSQRIKPVFIDDTNFGRQISYQH 137 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + E+ ++ + + + Sbjct: 138 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPW 197 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 198 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 257 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 258 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 313 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG +Y ++ ++++ Sbjct: 314 AFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVF 359 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 360 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 393 >gi|149632103|ref|XP_001514474.1| PREDICTED: similar to collagen type VI alpha 4 [Ornithorhynchus anatinus] Length = 1844 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 33/187 (17%), Positives = 62/187 (33%), Gaps = 24/187 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKN---LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 + + ES + K + N +VRIG + ++ ++ Sbjct: 628 DGSESIKESNFEKMKEFMKLMVNMSNIGPENVRIGVLQFSSSPREEFMLNKYTTKEDLSR 687 Query: 265 RLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++ + T T A+ + +++I ITDGE + Sbjct: 688 AISDIKQIKAGTQTGQALTFTLPYFDTSRWGRPTE------PQYLIVITDGEAQDS---- 737 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF--D 381 + +R+ G+ I+++ V LL T + Q F ND L F Sbjct: 738 ----VKGPAKALRDKGISIFAIGV-LEANKTQLLE-ITGTEDQVFYENDFDSL--IFLKK 789 Query: 382 KITDKIQ 388 KI+ K+ Sbjct: 790 KISFKLC 796 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 48/158 (30%), Gaps = 22/158 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYN 290 N VR G + Y+ ++ ++K + + A+++ + + Sbjct: 475 GNNRVRFGVVQYSDSPHLEFEVGQYHSTVKLKEAIRGIKQLRGRDRIGEALNYMNQRFMD 534 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 K +I IT G + + +R G+ IY++ V Sbjct: 535 NDRV-----------KILILITAGNFQD--------EVAESAQELRQRGIVIYAIGVKTD 575 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 Q L + VND L D++ I Sbjct: 576 --NQLKLISIAGTEENVLCVNDFDTLKHIKDEVVQDIC 611 Score = 43.4 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 49/148 (33%), Gaps = 15/148 (10%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT-N 276 L +++ + N VRIG + Y+ + S L KL Sbjct: 256 NILRETVKSLVIGPDN--VRIGLVLYSDEPRLVFSLETFQTKESIMSHLAKLPYRGGKPK 313 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T A+ ++ + + R+++ + IT+G + + +R Sbjct: 314 TGAALKFLRENIFTQDGNRRYEK---RVQRMAVVITEG--------FSEDRVSKPASQLR 362 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 AG+ IY++ + + L K Sbjct: 363 RAGVTIYALGIQRG-LERGNLEKMASYP 389 >gi|325473816|gb|EGC77004.1| BatA protein [Treponema denticola F0402] Length = 282 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 55/168 (32%), Gaps = 41/168 (24%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 P + + SRL+ L+ + T + + + K +S Sbjct: 88 SSSAALILPPTIDHKVFLSRLDSLSIGELGDGTAIGMGLAVSSAYMTRTKLNS------- 140 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------- 352 +++ +TDGEN+ +N E + N + Y + + + Sbjct: 141 ---SYIVLLTDGENNTG-----EINPKTAAEVLVNKNIGFYVIGIGSSGYTTLEYTDRKT 192 Query: 353 ------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 + L+K +G++ + + L + F+ I+ ++ Sbjct: 193 GKTYSGSIFSKFDEVELKKIAQYGNGKYASASSPEILEDIFNTISKQV 240 >gi|325279871|ref|YP_004252413.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] gi|324311680|gb|ADY32233.1| von Willebrand factor type A [Odoribacter splanchnicus DSM 20712] Length = 341 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 58/174 (33%), Gaps = 51/174 (29%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + + K L+ ++ + T A+ A R E E+S K + Sbjct: 144 LPITTDYSSAKLFLSNISTDIVPVQGTAIGSAIDLAARSFTPETETS----------KAI 193 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 I ITDGEN + + + G+ I+++ + Sbjct: 194 IVITDGENHQD-------DAVAAAKQAHEKGIVIHTIGMGLEQGAPIPEKGKPGQFMQDA 246 Query: 353 ---------GQDLLRKCTDSS-GQFFAVNDSR----ELLESFDKITDKIQEQSV 392 + L+ + G + +++ LL+ +++ + E+ V Sbjct: 247 QGNVVISKLDEQTLQDIAKAGEGLYIRASNTEVGLNRLLDEVNRMEKSLLEERV 300 >gi|301764709|ref|XP_002917776.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Ailuropoda melanoleuca] Length = 1091 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 29/334 (8%), Positives = 92/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I + + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNESEPGSQRIKPVFIDDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + E+ ++ + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG +Y ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|260559816|ref|ZP_05831995.1| von Willebrand factor [Enterococcus faecium C68] gi|260074040|gb|EEW62363.1| von Willebrand factor [Enterococcus faecium C68] Length = 857 Score = 61.5 bits (147), Expect = 3e-07, Method: Composition-based stats. Identities = 27/272 (9%), Positives = 78/272 (28%), Gaps = 26/272 (9%) Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 + + + +V+D S SM D L Sbjct: 7 NARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLADSGITDKINMGYVG 66 Query: 196 KSKYAPAPAPANRKI---DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S + + ++ D + ++ S + G++ Sbjct: 67 YSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGHKKVI 126 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 L++ + ++ +++ ++N Y + ++ + G+T L + Sbjct: 127 VLLTDCVPTFSYKVQRVHAQSSSNYYG----------TQFSNTQDRPGNTSLISRIYDAP 176 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTDS 363 D N + T+ ++ G++I+ + + + + +R+ S Sbjct: 177 DQNNLSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSS 236 Query: 364 ---SGQFFA-VNDSRELLESFDKITDKIQEQS 391 ++ + + ++ E K +I Sbjct: 237 DEKGDLYYESADHATDISEYLAKKAVQISATV 268 >gi|326315855|ref|YP_004233527.1| von Willebrand factor type A [Acidovorax avenae subsp. avenae ATCC 19860] gi|323372691|gb|ADX44960.1| von Willebrand factor type A [Acidovorax avenae subsp. avenae ATCC 19860] Length = 355 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 24/242 (9%), Positives = 73/242 (30%), Gaps = 66/242 (27%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 S A ++ ++A + + VR+G +A+ Q + Sbjct: 95 SGSMRAADVHPDRLTAAQDAAKAFIAELP--------RHVRVGIVAFAGSAQLAQLP--T 144 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN-------------------------- 290 N ++ ++ T T + + L+ Sbjct: 145 QNHEDLFKAIDSFQLQRGTATGNGILLSLATLFPDTGIDVSALGGRQAMPRPQSMDEIGR 204 Query: 291 ---------EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 + + +I +TDG+ + ++ ++ ++ + G++ Sbjct: 205 PPHRGSNGRGADRPAPVAPGSYSSAAIIMLTDGQRTTG------VDPMEAAQWAADRGVR 258 Query: 342 IYSVAVSAPPE--------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 +Y+V V +D L+ ++ ++F + +L + ++ ++ + Sbjct: 259 VYTVGVGTVAGETIGFEGWSMRVRLDEDTLKAVAQRTNAEYFHAATAADLKKVYETLSSR 318 Query: 387 IQ 388 + Sbjct: 319 LT 320 >gi|134093165|gb|ABO53025.1| matrilin 4 isoform 1 precursor, 5 prime [Chlorocebus aethiops] Length = 214 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 26/147 (17%), Positives = 49/147 (33%), Gaps = 15/147 (10%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKES 294 R+G I Y+ + + +++ + L P T T A+ + ++ E Sbjct: 71 TRVGVIQYSSQVQSVFPLRAFSRREDMERAIRDLVPLAQGTMTGLAIQYVMNVAFSVAEG 130 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + R+ + + +TDG ++ R G++IY+V V G Sbjct: 131 ARP--PEERVPRVAVIVTDGRPQD--------RVAEVAAQARARGIEIYAVGVQRADVGS 180 Query: 355 DLLRKCTDSS--GQFFAVNDSRELLES 379 LR F V + E Sbjct: 181 --LRAMASPPLDEHVFLVESFDLIQEF 205 >gi|156147106|gb|ABU53697.1| CnPolydom [Hydractinia symbiolongicarpus] Length = 551 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 53/167 (31%), Gaps = 10/167 (5%) Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 + ++ + + RI N +++ ++K TN Sbjct: 33 VASNATRIAIGTFSSDHRINFNYILNPSYANTKCKFNDDFKKIKFD------GFMTNIKG 86 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS--GASAYQNTLNTLQICEYMRN 337 ++ AY + R K VI +TDG + G + ++ Sbjct: 87 SLQDAYNVFRSLDSDPVTHSRRPRSNKVVILLTDGVGNMVGNRVDSAGADGAPEALRLKQ 146 Query: 338 AG-MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 G +++Y+V V Q++L+K F D +L I Sbjct: 147 TGYVELYTVGV-THATDQNMLKKIATDPSLFLYSKDFTDLGNLAANI 192 >gi|159036783|ref|YP_001536036.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157915618|gb|ABV97045.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 319 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 25/226 (11%), Positives = 68/226 (30%), Gaps = 38/226 (16%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 +++ S A ++ E+A V+ + + Sbjct: 84 ERATVMVAVDVSTSMLAGDVEPDRLTAAKEAARRFVDGLPDEFNVGLVAFAGSAAV---- 139 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN--PYE--NTNTYPAMHHAYRELYNEKESSHNTIGS 301 P + + +++L T A++ + + + Sbjct: 140 ------LVPPDTDREALDEGIDRLVEGATGVQGTAIGEAINTSLGAVK----ALDGEAAK 189 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------------ 349 ++ ++DG N+ ++ ++ + ++++A Sbjct: 190 DPPPARIVLLSDGANTSG------MDPMEAATDAVAMDVPVHTIAFGTASGYVDRGGRPI 243 Query: 350 --PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 P +GQ L ++ GQF + ++EL +D I + ++ R Sbjct: 244 QVPVDGQTLDEVARETGGQFHEADSAKELRAVYDDIGSSVGYRTKR 289 >gi|51597046|ref|YP_071237.1| hypothetical protein YPTB2727 [Yersinia pseudotuberculosis IP 32953] gi|51590328|emb|CAH21965.1| putative membrane protein [Yersinia pseudotuberculosis IP 32953] Length = 472 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 54/163 (33%), Gaps = 13/163 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYRELYNE 291 N + + +AY+ + + + + + ++P T + + ++ Sbjct: 128 NTTDTLSVVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKH 187 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 +I I+DG+ + + L + G+ I ++ + Sbjct: 188 LNREQVNR--------IILISDGQANTGPTSISEL--SDLARMAAKKGIAITTIGLGQD- 236 Query: 352 EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +DL+ S G V +S +L ++F K + + Sbjct: 237 YNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQ 279 >gi|156358451|ref|XP_001624532.1| predicted protein [Nematostella vectensis] gi|156211319|gb|EDO32432.1| predicted protein [Nematostella vectensis] Length = 180 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 69/193 (35%), Gaps = 27/193 (13%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + N + E + S + ++ T+ Y+ N+ Sbjct: 12 VDGSGSINNARFGRFREFVKKMAESFP-----VSATNTQVATVVYSEEPELIFNFGKYND 66 Query: 259 LNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 +NE+K+ ++ + + T+T A+ E++ +K +I +TDG S Sbjct: 67 INEIKTAVDNMPYHGKTTHTGKALKFTLEEVF--------KKARKNVKNVLIALTDGHAS 118 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRE 375 + + +R+ G+++++V V +P L + F V D E Sbjct: 119 D--------LVKKPAQAVRDYGIEVFAVGVGSP--DIAELEEIATDPDKDHVFNV-DFDE 167 Query: 376 LLESFDKITDKIQ 388 + + ++I Sbjct: 168 VSNITGGLENQIC 180 >gi|326927692|ref|XP_003210025.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Meleagris gallopavo] Length = 1108 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 34/305 (11%), Positives = 87/305 (28%), Gaps = 20/305 (6%) Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSL 133 E + + ++ T D N + A +IPT+ +I + L Sbjct: 139 DDPDGEEIEREKSNSLKLEFTDDANFKTKVNYSYAA-VQIPTDIYKGSTVILNELNWTQA 197 Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 IE E+ ++ + + N + P + S Sbjct: 198 LEDVFIENRKEDPSLLWQVFGSATGVTRYYPATPWRAPNKIDLYDVRRRPWYIQGASSPK 257 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQC 252 + + + + ++ S +++++ + + + ++ +V Sbjct: 258 DMVIIVDVSGSVSGLTLKLMKTSVHEMLDTLSDDDYVNVASFNEKAKPVSCFKHLVQANI 317 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 N K + + T+ +A+ +L N + K ++ T Sbjct: 318 R----NKKVFKEDVQGMVAKGTTDYKAGFEYAFDQLQNSNITRA------NCNKMIMMFT 367 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVN 371 DG N N +++++ +V L+ + G +F + Sbjct: 368 DGGEDRVQDVFEKYNWP-------NKTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIP 420 Query: 372 DSREL 376 + Sbjct: 421 SIGAI 425 >gi|73541336|ref|YP_295856.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72118749|gb|AAZ61012.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 354 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 19/186 (10%), Positives = 51/186 (27%), Gaps = 57/186 (30%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE------------------ 293 P ++N + +++ T T + + L+ + Sbjct: 140 VLPPTDNRLRMLDAIDRFELQNGTATGSGLIQSLAVLFPDDGIDLEGILFGGESLAPGTG 199 Query: 294 ------------------SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + VI ++DG + + L Sbjct: 200 GRSLTEAAAADAVRKRDLEQPGAAPGSYRHGAVILLSDGRRTTG------PDPLDAARMA 253 Query: 336 RNAGMKIYSVAVSAPPE--------------GQDLLRKCTD-SSGQFFAVNDSRELLESF 380 G+++Y+V A + + LR+ + G++F + +L + Sbjct: 254 AQRGLRVYTVGFGAVQDAGTEGSSLSYEMQVDEPTLRQIATLTDGEYFQAGSAADLTRVY 313 Query: 381 DKITDK 386 +++ + Sbjct: 314 RQLSGR 319 >gi|292627943|ref|XP_695559.4| PREDICTED: integrin alpha-11 [Danio rerio] Length = 1104 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 40/252 (15%), Positives = 75/252 (29%), Gaps = 31/252 (12%) Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA-----PAPANRK 209 D S ++ ++ S + + Sbjct: 102 DNSFVACGPLWSYECGSSYYSTGICSRVDSNFNFTHSIAPAYQRCETYMDIVIVLDGSNS 161 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 I E L+N +QK +++G + Y +V ++EV + + Sbjct: 162 IYPWYEVQDFLINVLQKFYIGPGQ--IQVGVVQYGERVVNEFRLDDFRTVDEVVAAAKNI 219 Query: 270 NPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T T ++ A + + G KK +I ITDGE ++ + Sbjct: 220 DQRGGEETRTALGINVARTQAFKHG-------GRPDAKKVMIVITDGE------SHDSPD 266 Query: 328 TLQICEYMRNAGMKIYSVAV-SA----PPEGQDLLRKC---TDSSG-QFFAVNDSRELLE 378 E + +Y +AV + LR+ FF+V D L + Sbjct: 267 LKAAVEESEKDNITLYGIAVLGYYNRRGINPEAFLREIKFIATDPDEHFFSVTDESALKD 326 Query: 379 SFDKITDKIQEQ 390 D + +KI Sbjct: 327 IVDALGEKIFSL 338 >gi|260578579|ref|ZP_05846489.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] gi|258603294|gb|EEW16561.1| conserved hypothetical protein [Corynebacterium jeikeium ATCC 43734] Length = 646 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 20/129 (15%), Positives = 52/129 (40%), Gaps = 20/129 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N+ ++ + + K+ +T PA+ A EL + E + ++ ++DGE++ Sbjct: 115 NVTKIPAEVGKVEASGHTPMGPALRQAAEELPKDGE------------RSIVLVSDGEDT 162 Query: 318 GASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSR 374 ++ + ++ G+ I +V + + L + G++ D+ Sbjct: 163 -----CAPPPVCEVAKDLKKEGIDLTINTVGFLVDSKARKELECIAEAGGGEYMDAKDTV 217 Query: 375 ELLESFDKI 383 L +S ++ Sbjct: 218 SLADSMKRL 226 >gi|168700938|ref|ZP_02733215.1| BatA [Gemmata obscuriglobus UQM 2246] Length = 317 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/198 (15%), Positives = 61/198 (30%), Gaps = 32/198 (16%) Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 S AP + D +++ ++ +K Sbjct: 103 VDVSGSMLAPFGDGNRYDASMKAIDTFLD-FRKGDAFGLTFFGDAFVHW----------V 151 Query: 254 PLSNNLNEVKSRLNKLNPY------ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 PL+ ++ ++ + P T A++ EL K Sbjct: 152 PLTTDVTAIRCSPPFMRPETVPPPFGGTAIAKALNGCKTEL----------RRRDEGDKM 201 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ ITDG +Y T N +I + G+ ++ + V +++ C + G+ Sbjct: 202 IVLITDG-----FSYDLTGNDEEIARTLSAEGVAVFCIIVGGFEPQAEIVNICRLTGGEA 256 Query: 368 FAVNDSRELLESFDKITD 385 F +D L F KI Sbjct: 257 FRADDPDALPAVFKKIDT 274 >gi|160837835|ref|NP_001104272.1| integrin, alpha D [Canis lupus familiaris] Length = 1168 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/144 (19%), Positives = 51/144 (35%), Gaps = 17/144 (11%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 S N + + +L+ T T + EL++ K + KK + Sbjct: 209 TFTQFQSSWNPLSLVDPIVQLD--GLTYTATGIRKVVEELFHSKNGAR-----KSAKKIL 261 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQDLLRKCTDSS- 364 I ITDG+ Y++ L + AG+ Y++ V P + L Sbjct: 262 IVITDGQ-----KYKDPLEYSDVIPQAERAGIIRYAIGVGDAFWKPSAKQELDNIGSEPA 316 Query: 365 -GQFFAVNDSRELLESFDKITDKI 387 F V++ L +++ +KI Sbjct: 317 QDHVFRVDNFAALSSIQEQLQEKI 340 >gi|315923825|ref|ZP_07920054.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC 23263] gi|315622858|gb|EFV02810.1| conserved hypothetical protein [Pseudoramibacter alactolyticus ATCC 23263] Length = 969 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 33/343 (9%), Positives = 85/343 (24%), Gaps = 31/343 (9%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 L + + + + T + + A S ++ + Sbjct: 30 LAAENTATPPKFEKKLEPNGDGTYKLSLSVTGTASSTSESSKADVVIVFDISNSMDEETN 89 Query: 132 SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS 191 + S + + +T++ + Sbjct: 90 TYVEYATGRYGSVSSDAPTGSSTRRRLYRRSTNNWGYYQYTEITNDTTSGTVYYLGDNY- 148 Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 +Y ++DV + +++ + A SVRI ++++ Sbjct: 149 ---QYHEYTGKRYSQKTRLDVAKSATNTMIDQL-LANNATNPGSVRISLVSFDTFASDAT 204 Query: 252 CTPLSNNLNEVKSRLNKLNP---------YENTNTYPAMHHA------------YRELYN 290 S+ + S +N TN A+ A + + Sbjct: 205 AWSTSS--ENLHSIVNGYKTPQSSHLGGHRGGTNWEDALQKADGTQPRADAQKHVIFVSD 262 Query: 291 EKESSHNTIGSTRLK-KFVIFITDGENSGASAY--QNTLNTLQICEYMRNAGMKIYSVAV 347 + + + ++ G++ ++ N + + + G Y+V Sbjct: 263 GNPTFRISSINGNPDDQYNDVHGHGDDDYYHSHPNYNYDAAKDDAKKIVDGGAAFYTVGT 322 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L +S ++ +D L +F I I Sbjct: 323 FGDAARMQNLATEAGASDNYYKADDEAALKAAFKNIVASITHS 365 >gi|297288988|ref|XP_002803438.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Macaca mulatta] Length = 997 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/334 (9%), Positives = 94/334 (28%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNDSEPGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + + EN ++ + + + Sbjct: 164 AAVHIPTDIYEGCTIVLNELNWTSALNEVFKKNREENPSLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG A N ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDGGEERAQEIFTKYN--------KDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|212636466|ref|YP_002312991.1| type IV pilin biogenesis protein [Shewanella piezotolerans WP3] gi|212557950|gb|ACJ30404.1| Type IV pilin biogenesis protein, putative [Shewanella piezotolerans WP3] Length = 1175 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/272 (11%), Positives = 73/272 (26%), Gaps = 50/272 (18%) Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 +N + + +++ + + ++++ + V + Sbjct: 192 DIENAIDAAFLTKFGTGKSVTLYTERYVNWYHGYKSSSWQTRMNIAKRVMEDTVVTTPSV 251 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + IV + + + ++N LN NT ++ AYR Sbjct: 252 DFGLAIFNYNTSKRTDGGRIVSGINRLSTGDKISLVDKINDLNAETNTPLCETLYEAYRY 311 Query: 288 LY-----------------------NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ- 323 N S + +V++ITDG + SA Sbjct: 312 FSGSAVHFAKEAGSMTPTRDKSIESNGNYVSPFKSSQCSNRSYVVYITDGSPTQDSAANT 371 Query: 324 --------------NTLNTLQICEYMRNAGM----------KIYSVAVSAPPEGQD--LL 357 + + M + Y++ S + L Sbjct: 372 KVSKMSGYKSSERVDGSYLPALASIMHRQDVNPSLAGEQFVTTYTIGFSDGAADAEPILK 431 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + S G +F ++ +L + ++ +I E Sbjct: 432 KTAELSGGTYFPAKNATQLQSALQQVFSQILE 463 >gi|285808587|gb|ADC36107.1| putative chloride channel [uncultured bacterium 126] Length = 869 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 26/124 (20%), Positives = 52/124 (41%), Gaps = 19/124 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + E+ + ++ T +PA+ +AY L N + + K VI ++DG+ Sbjct: 491 SRPELHDAIGRIKASGPTAIFPALRNAYDALANVRVRA----------KHVILLSDGQ-- 538 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSREL 376 + + + M A + + +VA+ P LLR G+ + V D++++ Sbjct: 539 -----SDPEDFEGLVRKMSAAHITVSTVALG-PDADAALLRNLASWGGGRSYVVQDAQQI 592 Query: 377 LESF 380 E F Sbjct: 593 PEIF 596 >gi|320106177|ref|YP_004181767.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924698|gb|ADV81773.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 370 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 31/215 (14%), Positives = 73/215 (33%), Gaps = 33/215 (15%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 S + K ++ + + N + +N Sbjct: 138 TFSTDDAPVSIGIVFDLSGSMMSKFGRARKALSEFMRT--------SNPQDEFFVVGFND 189 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 ++N+++V +R+ L P T A + +L + + Sbjct: 190 R--PAVIVDYTSNVDDVDARMVMLRPERRTALIDAAYLGLNKL----------KDAKYER 237 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLL 357 K ++ I+DG ++ + ++ L +R + +IYS+ + L+ Sbjct: 238 KALLIISDGGDNRSRYVESELR-----RAVRESDTQIYSIGIFDVYAATPEEKSGPTLLM 292 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 C + G+ F V D+ EL + +I+ +++ + V Sbjct: 293 DICEMTGGRMFRVTDADELGDIAARISAELRNEYV 327 >gi|293605449|ref|ZP_06687831.1| aerotolerance protein BatA [Achromobacter piechaudii ATCC 43553] gi|292816177|gb|EFF75276.1| aerotolerance protein BatA [Achromobacter piechaudii ATCC 43553] Length = 343 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 38/217 (17%), Positives = 73/217 (33%), Gaps = 42/217 (19%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + A + D + + + R+G I + G PL Sbjct: 106 SVDFNDAQGQPLSRWDAVKAVVADFIAQ---------RADDRLGLIVFGTGAYPQA--PL 154 Query: 256 SNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + + +K L++ NT A+ R L +E K +I +T Sbjct: 155 TRDHASLKLLLDEAAVGMAGPNTAVGDAIGLGIRMLDAAEEQ----------DKVLILLT 204 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTDS-SG 365 DG ++G++ L + ++++ + P D+LR+ + G Sbjct: 205 DGNDTGSAVPPARAAALAA-----QHHVTVHTIGIGDPAATGEDRVDFDILREVARTAGG 259 Query: 366 QFFAVNDSRELLESFDKIT----DKIQEQSVRIAPNR 398 QFF D L E + + +I +++R P R Sbjct: 260 QFFPARDLATLREVYATLDRITPREI--KTLRHQPKR 294 >gi|68535223|ref|YP_249928.1| hypothetical protein jk0158 [Corynebacterium jeikeium K411] gi|68262822|emb|CAI36310.1| hypothetical protein jk0158 [Corynebacterium jeikeium K411] Length = 646 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 20/129 (15%), Positives = 52/129 (40%), Gaps = 20/129 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N+ ++ + + K+ +T PA+ A EL + E + ++ ++DGE++ Sbjct: 115 NVTKIPAEVGKVEASGHTPMGPALRQAAEELPKDGE------------RSIVLVSDGEDT 162 Query: 318 GASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSR 374 ++ + ++ G+ I +V + + L + G++ D+ Sbjct: 163 -----CAPPPVCEVAKDLKKEGIDLTINTVGFLVDSKARKELECIAEAGGGEYMDAKDTV 217 Query: 375 ELLESFDKI 383 L +S ++ Sbjct: 218 SLADSMKRL 226 >gi|293396639|ref|ZP_06640915.1| aerotolerance protein BatA [Serratia odorifera DSM 4582] gi|291420903|gb|EFE94156.1| aerotolerance protein BatA [Serratia odorifera DSM 4582] Length = 325 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 63/204 (30%), Gaps = 35/204 (17%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + + S ++ + S V S RIG + + Sbjct: 101 ILDVSGSMAKNDVPGGITRLQAVKNSVSKFVA---------ARQSDRIGLVIFANQAWPF 151 Query: 251 QCTPLSNNLNEVKSRLNKLNPYEN---TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P+S + +++R+ +L+P T A+ A + L + + K Sbjct: 152 A--PVSEDKQALQTRITQLSPGMVGEQTAIGDALGVAVKLLDSSA--------NQDASKL 201 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV------SAPPEGQDLLRKCT 361 I +TDG ++ + L ++++++A + L++ Sbjct: 202 AILLTDGNDTASQLAPPLAAQLAA-----AHHVQVHTIAFGDSNSAGSDHVDLTQLQEIA 256 Query: 362 --DSSGQFFAVNDSRELLESFDKI 383 + A N L + +I Sbjct: 257 RITGGKSWTAANSGASLDSVWQEI 280 >gi|197336748|ref|YP_002158568.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197314000|gb|ACH63449.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 350 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 29/177 (16%), Positives = 65/177 (36%), Gaps = 17/177 (9%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENT 275 + +++++K + + G TP + + LN+ ++T Sbjct: 127 SRLDAVKKVLNDFAKTRKGDRLGLILFGDAAFVQTPFTADHEVWLDLLNQTRVEMAGKST 186 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A+ + S + S KK I +TDG ++ + + + Sbjct: 187 HLGDAIGLTIKRFEENDNSQPLSTTSR--KKVAIILTDGNDTDSYVP-----PMDAAKVA 239 Query: 336 RNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + G++I+ +A+ P D + D S GQ F + EL+ ++ +I+ Sbjct: 240 KVKGIRIHMIAIGDPQTVGEQALDMDTINTIADASGGQAFQALNQDELINAYAEISK 296 >gi|159897645|ref|YP_001543892.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159890684|gb|ABX03764.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 562 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 24/178 (13%), Positives = 65/178 (36%), Gaps = 17/178 (9%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ + G+ ++ + + + +P+ ++ +R++ Sbjct: 400 NRLREAKTALGDFIDIFADQDNVQVTIFST-------NATELSDLSPIGPKRADLHTRID 452 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L T Y + Y ++ + E + ++ +TDGE++ +S LN Sbjct: 453 GLVADGETRLYSTIGEVYTDIQQQTEVQRI--------RALVVLTDGEDTASSLSLEQLN 504 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 QI + +KI+++A + + L R + + + D + + + +I Sbjct: 505 -EQIRQDESGTSIKIFTIAYGSDANQEVLQRIAEITGAKSYT-GDPATIRQVYHEIAT 560 >gi|332217052|ref|XP_003257667.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Nomascus leucogenys] Length = 946 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/223 (12%), Positives = 77/223 (34%), Gaps = 25/223 (11%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 L P PK + + + S + + ++ + +L + ++ + N +VR Sbjct: 304 LDPIPKNILFVIDVSGSMWG---VKMKQTVEAMKTILDDLRAEDRFSVIDF-NQNVRTWR 359 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + + K + K+ P TN A+ A L Sbjct: 360 NDLISATKTQ--------VADAKRYIEKIQPSGGTNINEALLRAIFILNEASNLGLLDPN 411 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 S +I ++DG+ + + + + E +++ + ++S+ + D L++ Sbjct: 412 S---VSLIILVSDGDPTVGELKLSKIQ-KNVKENIQD-NISLFSLGMGFDV-DYDFLKRL 465 Query: 361 TDSSGQ------FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + + S +L + +++++ + P+ Sbjct: 466 -SNENHGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYPH 507 >gi|217966673|ref|YP_002352179.1| von Willebrand factor A [Dictyoglomus turgidum DSM 6724] gi|217335772|gb|ACK41565.1| von Willebrand factor type A [Dictyoglomus turgidum DSM 6724] Length = 888 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/161 (17%), Positives = 55/161 (34%), Gaps = 19/161 (11%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 L + + + G IA++ N E S +++++P T Y Sbjct: 414 ELAKESAQLVLDLLEDKDYFGLIAFDHSYQWIVPLQPLTNKEEAASLISRISPGGGTALY 473 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 P + A L S K +I ITDG+ G Y + + Sbjct: 474 PPLKSAGESLLKVPIKS----------KHIIAITDGQTEGGDFYN-------LVRNLAKY 516 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 + + ++ + LL+ + +G+F+ + R L + Sbjct: 517 KITVSTIGIGEDAN-IPLLKDIANWGNGRFYHTWNIRNLPQ 556 >gi|160892883|ref|ZP_02073672.1| hypothetical protein CLOL250_00414 [Clostridium sp. L2-50] gi|156865442|gb|EDO58873.1| hypothetical protein CLOL250_00414 [Clostridium sp. L2-50] Length = 596 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 54/166 (32%), Gaps = 14/166 (8%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + + RI + Y + + L+ L +TN + AY + Sbjct: 248 DENDRISIVTYAGSDTVVLNGVAGSEAYTICEALDSLEASGSTNGSAGLITAYEIAEQQF 307 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 N VI TDG+ + + + + + +++G+ + + + Sbjct: 308 IKDGNNR--------VILATDGDLNVGL--TSESDLVGLITEEKDSGIFLSVLGFGSDNL 357 Query: 353 GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + L D G + ++ E + + D++ +A + Sbjct: 358 KDNKLEALADHGNGNYSYLDSVYEAKKV---LVDEMGGTLYTVAKD 400 >gi|288921031|ref|ZP_06415322.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347549|gb|EFC81835.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 401 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 16/193 (8%), Positives = 48/193 (24%), Gaps = 62/193 (32%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE-------LYNEKESSHNTIGSTRL 304 P +++ +++ + L L T + + + G Sbjct: 140 LVPPTDDTDKLLAALKSLTTSRGTAIGQGILTSIDAIAEVDPSVPPTGADVPGGTGGEYA 199 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------- 350 ++ +TDG N+ ++ +++Y++ Sbjct: 200 ADVIVVLTDGANTVG------VDPRTAAGEAATRRLRVYTIGFGTTTPAPMVCDSSQIGD 253 Query: 351 ----------------------------------PEGQDLLRKCTD-SSGQFFAVNDSRE 375 +D L++ + G ++ ++ E Sbjct: 254 DAFGFGGGFGGGGGGGFGGGGGGFGGRNGERDPRAIDEDALKQVASTTGGTYYRAQNANE 313 Query: 376 LLESFDKITDKIQ 388 L ++ + I Sbjct: 314 LQDALTDLPRTIT 326 >gi|301606537|ref|XP_002932887.1| PREDICTED: cochlin-like [Xenopus (Silurana) tropicalis] Length = 789 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 56/165 (33%), Gaps = 20/165 (12%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELY 289 ++ ++G I + +N ++V + L + T T A++ A R L+ Sbjct: 638 VSDVGTKVGAIQFTYDQRLEFGLNDHSNKDDVLNALRSIRYMSGGTATGDAINFAVRNLF 697 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + HN F+I +TDG++ + +G+ I+SV V+ Sbjct: 698 QPTKDGHNK-------NFLIIVTDGQSYD--------DVRGPASSAHISGVTIFSVGVAW 742 Query: 350 PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSV 392 P L+ F + L + + I + Sbjct: 743 APLED--LKDMASEPKNSHTFFTREFPGLEQITPDLIRGICRDYL 785 >gi|315185579|gb|EFU19348.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 459 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 25/188 (13%), Positives = 64/188 (34%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 P +I + + + + + + R + P+ ++ V Sbjct: 113 PDRMRITHAKRAIREFLPLLSERDRVGLAVFNRTYRM----------IQPIVDDPALVLE 162 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 +L+ + P+ AY ELY E + + ++ ++ ++DGEN ++ Sbjct: 163 KLDAIER-------PSREQAYTELYRSMEEALTSFEEEGRRRVLVVLSDGENFPVDPEKS 215 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 + G+ Y + L+ + G+ F ++ EL + I Sbjct: 216 PATPGTAVDLAHRYGITCYVIHFGTE--KDRLIGDLASETGGRVFDARNALELASVYTAI 273 Query: 384 TDKIQEQS 391 +++ ++ Sbjct: 274 QEQVLQEY 281 >gi|309792307|ref|ZP_07686777.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225622|gb|EFO79380.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 423 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 17/181 (9%), Positives = 56/181 (30%), Gaps = 21/181 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++A +V+ + + + +N + ++K ++ Sbjct: 61 RLQQVKDAANRIVDMLTDDDY--------LSLVTFNDRAEVVIPAQRPQHRPDLKRMISG 112 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + T M A +E + ++ +TDG Y + Sbjct: 113 IEAAGGTEMATGMALALQETQRPLMGRGVSR--------ILLLTDG-----RTYGDEGRC 159 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 ++I + G+ + ++ + L + + + + E+ + F ++ Sbjct: 160 VEIARRAQGRGVGLTALGIGGEWNEDLLETMSARENSRTQYITSAAEITQIFTDEVKRMH 219 Query: 389 E 389 Sbjct: 220 S 220 >gi|326326039|ref|YP_004250848.1| hypothetical protein VIBNI_0107 [Vibrio nigripulchritudo] gi|323669090|emb|CBJ93137.1| Protein of unknown function (exported) [Vibrio nigripulchritudo] Length = 1081 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 22/137 (16%), Positives = 49/137 (35%), Gaps = 18/137 (13%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 L++N +KS L+ ++ T+ + A E N + +S Sbjct: 356 DSTAQLLQSLTDNKAVIKSALDLIDASGGTDIGDGVSKALEEFANARSAS---------D 406 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 V+ +TDG S + + +++ + + + + + G Sbjct: 407 WAVVLLTDGSGSYNH---------ALTTELVQKNIRVLGITMGSGANQSLIRGISDSTYG 457 Query: 366 QFFAVNDSRELLESFDK 382 + VN + EL+E F++ Sbjct: 458 IYQHVNTADELIEVFER 474 >gi|281342668|gb|EFB18252.1| hypothetical protein PANDA_020017 [Ailuropoda melanoleuca] Length = 885 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 28/228 (12%), Positives = 73/228 (32%), Gaps = 23/228 (10%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 L P PK + + + S + + ++ + +L Q ++ + N + Sbjct: 236 FAPENLDPIPKNILFVIDVSGSMWGI---KMKQTVEAMKTILDDLRAEDQFSVIDF-NHN 291 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 VR + K + K+ P TN A+ A L Sbjct: 292 VRTWRNDLVSATKTQIV--------DAKKYIEKIQPSGGTNINEALLRAIFILNEANNLG 343 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 S +I ++DG+ + + + + + ++S+ + D Sbjct: 344 MLDPES---VSLIILVSDGDPTVGNCELKLSKIQKNVKQNIRDNIALFSLGIGFDV-DYD 399 Query: 356 LLRKCTDSSGQ------FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L++ + + + + S +L + +++++ + P+ Sbjct: 400 FLKRL-SNENRGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYPH 446 >gi|47212423|emb|CAF93579.1| unnamed protein product [Tetraodon nigroviridis] Length = 688 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 52/165 (31%), Gaps = 20/165 (12%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTN 276 +++ + + S R+G + Y+ + +L+ + + ++ P T Sbjct: 28 RFMIDILNT--LDIGLNSTRVGVVQYSSQVRSEFSLRSHASLDSMVKAIQEMVPLAQGTM 85 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T A+ + + E + V+ +TDG ++ R Sbjct: 86 TGLAIRYTMNVAFTAAEGDRPK-----VPNVVVIVTDGRPQD--------RVAEVAAEAR 132 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 GM+IY+V V LR F V + + Sbjct: 133 ERGMEIYAVGV--ARADMTSLRAMASPPFEDHVFLVESFDLIHQF 175 >gi|21224547|ref|NP_630326.1| secreted protein [Streptomyces coelicolor A3(2)] gi|3559963|emb|CAA20601.1| putative secreted protein [Streptomyces coelicolor A3(2)] Length = 421 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 68/194 (35%), Gaps = 26/194 (13%) Query: 208 RKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 ++ ++ ++++ + + PL + E Sbjct: 60 TRMAAAKQAFNEVLDATPEEVRLGIRTLGADYPGDDRKTGCKDTAQLYPVGPL--DRTEA 117 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 K+ + L+P T PA+ A +L K ++ I+DGE++ Sbjct: 118 KTAVATLSPTGWTPIGPALLKAADDL-----------DGGDGSKRIVLISDGEDT----- 161 Query: 323 QNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLES 379 L+ ++ + G+ I ++ + + + L + + G + +V + EL + Sbjct: 162 CAPLDPCEVAREIAAKGIGLTIDTLGLVPNTKMRQQLSCIAEATGGTYTSVEHTDELTDK 221 Query: 380 FDKITDKIQEQSVR 393 +++ D+ + V Sbjct: 222 VNQLVDRAADPVVT 235 >gi|153950207|ref|YP_001400285.1| von Willebrand factor type A domain-containing protein [Yersinia pseudotuberculosis IP 31758] gi|152961702|gb|ABS49163.1| von Willebrand factor type A domain protein [Yersinia pseudotuberculosis IP 31758] Length = 460 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 54/163 (33%), Gaps = 13/163 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYRELYNE 291 N + + +AY+ + + + + + ++P T + + ++ Sbjct: 116 NTTDTLSVVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKH 175 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 +I I+DG+ + + L + G+ I ++ + Sbjct: 176 LNREQVNR--------IILISDGQANTGPTSISEL--SDLARMAAKKGIAITTIGLGQD- 224 Query: 352 EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +DL+ S G V +S +L ++F K + + Sbjct: 225 YNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQ 267 >gi|17229274|ref|NP_485822.1| hypothetical protein all1782 [Nostoc sp. PCC 7120] gi|17130872|dbj|BAB73481.1| all1782 [Nostoc sp. PCC 7120] Length = 615 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 17/157 (10%), Positives = 45/157 (28%), Gaps = 10/157 (6%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y+ + + +K + ++ TN + ++ + Sbjct: 78 VVVYDDAVDTVVPPQPVTDKPALKKSIRQVRAGGITNLSGGWLKGCEYVKHQLDPQKINR 137 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 V+ +TDG + Q+ G+ ++ + L+ Sbjct: 138 --------VLLLTDGHANMG--IQDPKILTATSTQKAEEGITTTTLGFAQGFNEDLLIGM 187 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 ++G F+ + E E F D ++ + Sbjct: 188 ARAANGNFYFIQSIDEAAEVFSIELDSLRSVVGQNLK 224 >gi|45382993|ref|NP_990865.1| collagen alpha-3(VI) chain precursor [Gallus gallus] gi|1345652|sp|P15989|CO6A3_CHICK RecName: Full=Collagen alpha-3(VI) chain; Flags: Precursor gi|211622|gb|AAA03201.1| alpha-3 collagen type VI [Gallus gallus] Length = 3137 Score = 61.1 bits (146), Expect = 3e-07, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 62/161 (38%), Gaps = 15/161 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRE 287 + +R+G + ++ +++ RL +L P + NT A++ Sbjct: 675 DVGTDKIRVGLVQFSDTPKTEFSLYSYQTKSDIIQRLGQLRPKGGSVLNTGSALNF---V 731 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L N + + + ++ + ++ +T G ++ + LQ+ + AG+ ++V V Sbjct: 732 LSNHFTEAGGSRINEQVPQVLVLVTAG--------RSAVPFLQVSNDLARAGVLTFAVGV 783 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + L + + + ++D +L ++ I Sbjct: 784 --RNADKAELEQIAFNPKMVYFMDDFSDLTTLPQELKKPIT 822 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 23/197 (11%), Positives = 61/197 (30%), Gaps = 24/197 (12%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 R +L +V S+ + VR+ + Y+ I + ++ Sbjct: 1043 SDGVRRGFPLLKTFVERVVESL-----DIGRDKVRVAIVQYSNAIQPEFLLDAYEDKADL 1097 Query: 263 KSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 S + L + NT A+ + + ++ S + +F+I +T + Sbjct: 1098 VSAIQALTIMGGSPLNTGAALDYLIKNVFTVSSGSRIA---EGVPQFLILLTADRSQDDV 1154 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + + ++ +G + + + L+ + +V D +L Sbjct: 1155 RRPSVV--------LKTSGTVPFGIGIG--NADLTELQTISFLPDFAISVPDFSQLDSV- 1203 Query: 381 DKITDKIQEQSVRIAPN 397 + + +R+ Sbjct: 1204 ---QQAVSNRVIRLTKK 1217 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 20/177 (11%), Positives = 59/177 (33%), Gaps = 17/177 (9%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENT 275 + + KA+ N R + ++ + +V S + + + Sbjct: 57 VREFLYDVVKALDVGGNDF-RFALVQFSGNPHTEFQLNTYPSNQDVLSHIANMPYMGGGS 115 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T + + + N + + S + + +I +TDG++ A +++ + Sbjct: 116 KTGKGLEY---LIENHLTKAAGSRASEGVPQVIIVLTDGQSQDDVALPSSV--------L 164 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 ++A + + +AV + L++ F + + L + ++ Sbjct: 165 KSAHVNM--IAVGVQDAVEGELKEIASRPFDTHLFNLENFTALHGIVGDLVASVRTS 219 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 25/199 (12%), Positives = 65/199 (32%), Gaps = 21/199 (10%) Query: 206 ANRKIDVLI-ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 + I + ++ + + ++ ++++ + IG + Y+ + +V Sbjct: 248 GSDNIGSVNFQAIRDFLVNLIESLRVGAQQ-IHIGVVQYSDQPRTEFALNSYSTKADVLD 306 Query: 265 RLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + L+ NT A+ + L+ + + + + + ++ I+ GE + Sbjct: 307 AVKALSFRGGKEANTGAALEYVVENLFTQ---AGGSRIEEAVPQILVLISGGE-----SS 358 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD- 381 + L + I+S ++ L++ F D R L + Sbjct: 359 DDIREGLLAVKQA-----SIFSFSIGVLNADSAELQQIATDGSFAFTALDIRNLAALREL 413 Query: 382 ---KITDKIQEQSVRIAPN 397 I Q + AP Sbjct: 414 LLPNIVGVAQRLILLEAPT 432 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 55/169 (32%), Gaps = 15/169 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRE 287 + VRIG + ++ + + N V + +L N A+ + Sbjct: 1472 DVGPNKVRIGVVQFSNNVFPEFYLRTHKSKNAVLQAIRRLRLRGGYPVNAGKALDY---V 1528 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + N S + + + ++ I ++ + + + + V Sbjct: 1529 VKNYFIKSAGSRIEDGVPQHLVVILGDQSQDDVNRPANVISSTSIQPL----------GV 1578 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 A ++ L+ T+ G+ V D L K+ + ++E +V Sbjct: 1579 GARNVDRNQLQVITNDPGRVLVVQDFTGLPTLERKVQNILEELTVPTTE 1627 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 42/373 (11%), Positives = 103/373 (27%), Gaps = 47/373 (12%) Query: 22 IMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY-IRE 80 + + SA+ A + G + + + + +L + + + Sbjct: 1090 AYEDKADLVSAIQALTIMGGSPLNTGAALD-------------------YLIKNVFTVSS 1130 Query: 81 NAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE 140 + Q I + + + F G+ + LT L S Sbjct: 1131 GSRIAEGVPQFLILLTADRSQDDVRRPSVVLKTSGTVPFGIGIGNADLTELQTISFLPDF 1190 Query: 141 RSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA 200 S + V + +K ++ P K S+ Sbjct: 1191 AISVPDFSQLDSVQQAVSNRVIRLTKKEIESLAPDLVFTSPSPVGVKRDVVFLVDGSR-- 1248 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 A D++ NL RI + ++ + + Sbjct: 1249 YAAQEFYLIRDLIERIVNNLDVGFDTT---------RISVVQFSEHPHVEFLLNAHSTKD 1299 Query: 261 EVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 EV+ + +L P N A+ + ++ S + +F++ ++ ++ Sbjct: 1300 EVQGAVRRLRPRGGQQVNVGEALEFVAKTIFTRPSGSRIE---EGVPQFLVILSSRKSDD 1356 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + ++ G+ +A + + + + S F V+ +EL Sbjct: 1357 DLEFPSV--------QVKQVGVAPMVIA---KNMDPEEMVQISLSPDYVFQVSSFQELPS 1405 Query: 379 SFDKITDKIQEQS 391 K+ I+ + Sbjct: 1406 LEQKLLAPIETLT 1418 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 63/173 (36%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I++ ++ ++ + + S+++G YN + ++ E+ +N Sbjct: 1650 INLGRDNFQEVLQFVYSIVDAIYEDGDSIQVGLAQYNSDVTDEFFLKDYSSKPEILDAIN 1709 Query: 268 KLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ NT A+ H + + R+ + IT G+ ++ Sbjct: 1710 KVIYKGGRVANTGAAIKHLQAK---HFVKEAGSRIDQRVPQIAFIITGGK--------SS 1758 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + G+K+++V V + + K S F V+ ++EL E Sbjct: 1759 DDGQGASMEVAQKGVKVFAVGV--RNIDLEEVSKLASESATSFRVSTAQELSE 1809 >gi|51467747|ref|NP_001003823.1| cochlin [Danio rerio] gi|26788036|emb|CAD58748.1| novel protein similar to human coagulation factor C homolog (cochlin, COCH) [Danio rerio] Length = 553 Score = 60.7 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 43/317 (13%), Positives = 88/317 (27%), Gaps = 33/317 (10%) Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 ++ + G ++ + P + E+ N+F + + SL S Sbjct: 259 NFFSPDFGVRRGYPRVIVVFVDGWPSDNVEEAAILARESGINIFFVSVAKPSPEEASLVS 318 Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKH-NDNNNMTSNKYLLPPPPKKSFWSKNT 194 R + + +M + K + S + Sbjct: 319 DQDFMRKAVCKDNEFF-----TFTMPSWFSTNKFVKPLAQKLCSIDQMLCSKTCYNSVDL 373 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + LV+ + + ++ RIG I + Sbjct: 374 GFLIDGSSSVG----DGNFRLVLDLLVSIARS--FDISDIGSRIGAIQFTYDQRMEFNFN 427 Query: 255 LSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 + L K+ T T A++ A R L+ + S+ +KF+I ITD Sbjct: 428 DHVLKDNALRALQKIPYMSGGTATGDAINFAVRSLFKPR--------SSSNRKFLIIITD 479 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVN 371 G+ + + + + G+ +Y+V V + L+ F Sbjct: 480 GQ------SYDDVRVPAMA--AQREGITVYAVGV--AWAPMEDLKAMASEPKESHVFFTR 529 Query: 372 DSRELLESFDKITDKIQ 388 + L + I I Sbjct: 530 EFTGLGQFQQPIVRGIC 546 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 59/199 (29%), Gaps = 28/199 (14%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + R+ ++ L + + +G + + Sbjct: 177 SSYNIGQRRFNLQKNFVSKLA-----TMLKVGTQGPHVGVVQTSETPRTEFYLTNYTTAK 231 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +V + ++ NTNT A+ H R ++ + + ++ DG S Sbjct: 232 DVTFAIKEIPYIGGNTNTGKAILHTVRNFFS-----PDFGVRRGYPRVIVVFVDGWPSD- 285 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------EGQDLLRKCTDSSGQFFAVND 372 N + R +G+ I+ V+V+ P QD +RK +FF Sbjct: 286 -------NVEEAAILARESGINIFFVSVAKPSPEEASLVSDQDFMRKAVCKDNEFFTFTM 338 Query: 373 SR--ELLESFDKITDKIQE 389 + + K+ Sbjct: 339 PSWFSTNKFVKPLAQKLCS 357 >gi|115361035|ref|YP_778172.1| hypothetical protein Bamb_6294 [Burkholderia ambifaria AMMD] gi|115286363|gb|ABI91838.1| conserved hypothetical protein [Burkholderia ambifaria AMMD] Length = 423 Score = 60.7 bits (145), Expect = 3e-07, Method: Composition-based stats. Identities = 22/212 (10%), Positives = 64/212 (30%), Gaps = 4/212 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTT---KK 57 + + +++ F+ A+DL + R+++Q++ D+ LS + S +++ Sbjct: 25 IVGLALAMMIGFVGLALDLGKLYVTRSELQNSADSCALSAARDLTSAISLQVAEADGIAA 84 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + F +Q ++ S + + N ++ Sbjct: 85 GHANYAFFQQNAVQMQTDSNVTFSDSLTNPFLTKTAVATPANVKYVKCTAQLSNIAHWFI 144 Query: 118 LFLKGLIPSALTNLS-LRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 L + + N S + ++ I + +I + + S + + + +S Sbjct: 145 EVLNTIPGVQVANASQVAASAIATVGAGQTTCAIPVFVCKPASSAPYKVGDWISSPSGSS 204 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 Y + + + Sbjct: 205 TTYGPGNFGWAALDGSTNEPTIASELSGNTCN 236 >gi|145547190|ref|XP_001459277.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124427101|emb|CAK91880.1| unnamed protein product [Paramecium tetraurelia] Length = 603 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 61/182 (33%), Gaps = 16/182 (8%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + RKI+++ +S L+ + + I IV + N Sbjct: 126 VDVSGSMIGRKINLVKDSLRYLMKIL------GPEDRICIIVFTTVAHIVTSFIRNTQEN 179 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 +K + +L +TN M+ A L N + ++DG++ Sbjct: 180 KPLLKKAILELKGLASTNISDGMNKALWMLKN--------RKYKNPVSCIFLLSDGQDDY 231 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 A Q + LQ+ + I++ + + + G F+ +++ + + Sbjct: 232 KGAEQRVFDQLQLLKI--EEKFVIHTFGYGQDHDAYVMNQIAKYREGNFYYIDNINKASD 289 Query: 379 SF 380 F Sbjct: 290 YF 291 >gi|47523446|ref|NP_999348.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Sus scrofa] gi|3341749|gb|AAC36289.1| voltage-dependent calcium channel alpha-2 delta subunit precursor [Sus scrofa] Length = 1091 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/334 (8%), Positives = 92/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I + + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNDSEPGSQRIKPVFIDDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + E+ ++ + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG +Y ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|302391027|ref|YP_003826847.1| hypothetical protein Acear_0232 [Acetohalobium arabaticum DSM 5501] gi|302203104|gb|ADL11782.1| Protein of unknown function DUF2134, membrane [Acetohalobium arabaticum DSM 5501] Length = 307 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 21/214 (9%), Positives = 57/214 (26%), Gaps = 6/214 (2%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A++++V F+ ID+ + R ++ + LDAA L+G + D + + Sbjct: 15 VALMMTVFISFLALVIDIGSLYLERIRLVNTLDAAALAGVQDLPDDSQQAETVALDYASR 74 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + + +G + Q + ++ + Sbjct: 75 NGLDNNVTVEITDDDHQIGLSGSKQVGMNFAVIF---GIDQVEVAASSKARVGHVTAVTG 131 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + ++ + + ++ +N +N KY Sbjct: 132 AVPFGVVSQNFVYGDKYYLKYGAGGDEVASGRNGNFGALALGGTGANNYEDN---IKYGY 188 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + + P ++D Sbjct: 189 DSSLEVGQDVTTEPGNMAGPTTRGVEYRMDQSRT 222 >gi|291514852|emb|CBK64062.1| Mg-chelatase subunit ChlD [Alistipes shahii WAL 8301] Length = 341 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 56/165 (33%), Gaps = 43/165 (26%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 P++++ ++ +++P + T A+ A + E SH Sbjct: 140 PKVQLPITSDYRMARAFARRIDPSLVSVQGTAIGKALEQALLAFSGDTEQSHG------- 192 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------ 352 + +I ITDGEN + + + E G+KI+++ + P Sbjct: 193 -RVIILITDGENHDD-------DAIAVAERAAQMGVKIFTIGIGTPEGAPIQIGGEFIKD 244 Query: 353 ----------GQDLLRKCTD-SSGQFFAVNDSR-ELLESFDKITD 385 +++L + D + G + + L E I + Sbjct: 245 EAGEMVVSKLNEEMLARIADITGGAYVRSSKQSIGLDEIVKAINE 289 >gi|145297003|ref|YP_001139824.1| hypothetical protein cgR_2902 [Corynebacterium glutamicum R] gi|140846923|dbj|BAF55922.1| hypothetical protein [Corynebacterium glutamicum R] Length = 230 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 21/195 (10%), Positives = 55/195 (28%), Gaps = 21/195 (10%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKA-IQEKKNLSVRIGTIAYNIGIVGNQCTP 254 A + D + L + + N + P Sbjct: 7 SGSMVTPDAGGQSRSDAANQFIDELAGTFDLGLVTYGGNTGETPEDYEAGCQDITVVRGP 66 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + ++K ++ L P T ++ A EL + ++ ++DG Sbjct: 67 TNGQAEQLKQHIDGLQPRGYTPIGESLRKAVAELPEGGSGT------------ILLVSDG 114 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVN 371 + ++ + + G+ I +V + + +L + G + + Sbjct: 115 IAT-----CTPPPVCEVAAELADQGVDLVINTVGFTLDESARAVLECIAQAGNGTYADAS 169 Query: 372 DSRELLESFDKITDK 386 D+ L+ + + Sbjct: 170 DADSLVAELKQAATR 184 >gi|47227632|emb|CAG09629.1| unnamed protein product [Tetraodon nigroviridis] Length = 457 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 55/167 (32%), Gaps = 19/167 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNE 291 S R+ PL+ + ++K L KL P T + + A ++ + Sbjct: 81 SPRMRVSFIVFSAQAKVLLPLTGDSYKIKEGLRKLYDVKPAGETFMHVGIKEASVQIRAQ 140 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 ++ +TDG+ T++ R G ++Y V + Sbjct: 141 PSP---------TSSIILALTDGKLEVYVHDL----TVKEANEARKYGARVYCVGI--KD 185 Query: 352 EGQDLLRKCTDSSGQFFAVND-SRELLESFDKITDKIQEQSVRIAPN 397 + L D+ Q F V D L + I + + + + P+ Sbjct: 186 FDEQQLANIADTKDQVFPVKDGFHALKGIVNSILKRSCTEILSVEPS 232 >gi|170748502|ref|YP_001754762.1| hypothetical protein Mrad2831_2084 [Methylobacterium radiotolerans JCM 2831] gi|170655024|gb|ACB24079.1| conserved hypothetical protein [Methylobacterium radiotolerans JCM 2831] Length = 463 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 30/284 (10%), Positives = 70/284 (24%), Gaps = 13/284 (4%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 + V ID A + R+Q+Q+A DA VL+ + + Sbjct: 20 FGLSAVVLLGLTGGGIDYARLAARRSQLQNAADAGVLAAGNYLKLAVATSAAAKSIVVDT 79 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + + + ++ T + T + K Sbjct: 80 VHAQAAPRPESPYALRVEVADDKTSVATTVDETVKLAFGGFVGVPAVKVSVRSTVRVVGK 139 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + + +E+S+E A + + + D+ + Sbjct: 140 MRLCLLTLDPLAAGAFELEKSAEVTAQDCSLYSNSQSPRGMVGK----DSAYARAQTICT 195 Query: 182 PPPPKKSFWSKNTTKS-KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + + P P + + + + +S NL + T Sbjct: 196 AGGFDGARANFAPPPQTGCPPIQDPLKDRAAPPVGACTAIPSSANSKRDTGGNLVDQSAT 255 Query: 241 IAYNIGI-----VGNQCTPLSNNLNEVKS---RLNKLNPYENTN 276 + N L +K +++ T+ Sbjct: 256 LEPGTYCGGLRITKNASVTLRAGTYVMKDGPLIVDQSAAMSGTD 299 >gi|32394600|gb|AAM93998.1| proximal thread matrix protein 1 [Griffithsia japonica] Length = 218 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 52/160 (32%), Gaps = 24/160 (15%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YEN 274 + K ++ S + +L+ + +N ++P Sbjct: 50 NIREFTVDAAKEFDDRTKDSYFSAV---GFASGVKLIQAPTQSLSTFNTAVNTVSPLNGG 106 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 TN + + Y++L + +I +TDG + C + Sbjct: 107 TNIFRGLRGCYQQL----------KTKPMTDRVLILVTDGF---------GGQPINYCNF 147 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 +++ G+ + +V + Q+ L+ C S + V D+ Sbjct: 148 IKSKGILLVTVGIGT-SINQNFLKNCATSEEFYINVKDTG 186 >gi|329744564|ref|NP_001192916.1| voltage-dependent calcium channel subunit alpha-2/delta-1 [Bos taurus] Length = 1091 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 30/334 (8%), Positives = 92/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I + + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNDSEPGSQRIKPVFIDDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + E+ ++ + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG A N ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDGGEERAQEIFTKYN--------KDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|322711218|gb|EFZ02792.1| U-box domain-containing protein [Metarhizium anisopliae ARSEF 23] Length = 734 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 66/203 (32%), Gaps = 17/203 (8%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + P + + S AP P + L +L I E N R+G Sbjct: 46 IDHVPCDIVLVLDVSTSMEDNAPVPGETERTGLT--VLDLTKHAALTIIETLNDRDRLGI 103 Query: 241 IAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 +++ Q + N +E + ++ L+P +TN + + + E+ + Sbjct: 104 VSFATNSTIVQTLTHMDISNKDEARRKIKALDPNGSTNLWHGIRDGIQIFEQSAENGNI- 162 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 + ++ +TDG + + + + I++ LL+ Sbjct: 163 -------RAMMVLTDGMPNH---MCPQQGYIPKLKTLPRLPAAIHTFGFGYGLRS-GLLK 211 Query: 359 KCTDSS-GQFFAVNDSRELLESF 380 + G + + D+ + F Sbjct: 212 SLAEYGHGNYAFIPDAGMIGTVF 234 >gi|260797295|ref|XP_002593639.1| hypothetical protein BRAFLDRAFT_235786 [Branchiostoma floridae] gi|229278865|gb|EEN49650.1| hypothetical protein BRAFLDRAFT_235786 [Branchiostoma floridae] Length = 373 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 19/153 (12%), Positives = 50/153 (32%), Gaps = 16/153 (10%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAY 285 + + ++G I Y+ + + ++++ T T A+ + Sbjct: 36 SGFDISPSGTQVGVIQYSTRTRQEFSMNSFVTKETLSAAIDEVQYMRGGTLTGKAIRYVT 95 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + + + K VI +TDG + + G+ +Y++ Sbjct: 96 KYGFGKSDGAR-----PGVPKVVIVVTDGVSYD--------AVAAPALEAQQKGITVYAI 142 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 V D L + ++ V++ L Sbjct: 143 GV--SGYDADQLEQIASNNNTLAFVDNFNLLDN 173 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 18/137 (13%), Positives = 44/137 (32%), Gaps = 16/137 (11%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAY 285 + + ++G I Y+ + S ++++ T T A+ + Sbjct: 252 SGFDISPSGTQVGVIQYSTRTRQEFSMNSFLTKETLSSAIDEVQYMRGGTLTGKAIRYVT 311 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + + + K VI +TDG + + G+ +Y++ Sbjct: 312 KYGFGKSDGAR-----PGVPKVVIVVTDGVSYD--------AVAAPALEAQQKGITVYAI 358 Query: 346 AVSAPPEGQDLLRKCTD 362 V D L + Sbjct: 359 GV--SGYDADQLEQIAS 373 >gi|332206577|ref|XP_003252372.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1 [Nomascus leucogenys] Length = 1107 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 30/334 (8%), Positives = 93/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I ++ + Sbjct: 108 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNDSEPGSQRIKPVFIEDANFGRQISYQH 167 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + E+ ++ + + + Sbjct: 168 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPW 227 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 228 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 287 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 288 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 343 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG A N ++ ++++ Sbjct: 344 AFEQLLNYNVSRA------NCNKIIMLFTDGGEERAQEIFTKYN--------KDKKVRVF 389 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 390 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 423 >gi|220913390|ref|YP_002488699.1| hypothetical protein Achl_2645 [Arthrobacter chlorophenolicus A6] gi|219860268|gb|ACL40610.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6] Length = 319 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 21/153 (13%), Positives = 48/153 (31%), Gaps = 13/153 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI++ F+ A+D+ I R Q+QS DA+ ++ D D +T Sbjct: 17 IVAILLVTLLGFVAIAVDVGAIYSERAQLQSGADASAIALAQKCARDTANADCSTTSTLA 76 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 ++ + + I+ + + + + +LF Sbjct: 77 GSLANQNSLDGMSNVYSIQLDKTARTVSVTTSAKETGSPDNSV-------------SLFF 123 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 I + +++ S+ I + Sbjct: 124 AKAIGIPSKEVGAKASATWGNPSKGPVILPLAI 156 >gi|118443040|ref|YP_877686.1| von Willebrand factor type A domain-containing protein [Clostridium novyi NT] gi|118133496|gb|ABK60540.1| von Willebrand factor type A domain protein [Clostridium novyi NT] Length = 708 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 25/241 (10%), Positives = 57/241 (23%), Gaps = 16/241 (6%) Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL---------IESA 217 K K + + +I+ + Sbjct: 185 DELQKAAKNFVNKFETKSNTKIGLVSYGNKGEVVHSLTNELDRINSSIDYGLSVYGATNI 244 Query: 218 GNLVNSIQKAIQEKKNLS----VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 G+ + + N + I N + + + + R P Sbjct: 245 GDGIRKANGLLNNGSNADKYIVLMTDGIPTAATCYSNVKSERTEHYSNFYRRNIYDTPVG 304 Query: 274 NTNTYPAMHHA-YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 + + + I I G+N + N + Sbjct: 305 AILDGYGRELEKFNLITDYNYKFEYNPKDYIYSDENIIINRGDNDYGNIALN--YAKESL 362 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G+ + V S L + +G + EL + +++I D+I+ V Sbjct: 363 KRASENGVNNFVVGFSNGISRDKLTQIADAGNGYYREAMHGDELEDVYNRIADEIKNPVV 422 Query: 393 R 393 + Sbjct: 423 K 423 >gi|161086896|ref|NP_001104313.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform a [Mus musculus] gi|46576352|sp|O08532|CA2D1_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|1905817|gb|AAB50138.1| voltage-gated calcium channel alpha2/delta subunit, alpha2a isoform [Mus musculus] gi|148671294|gb|EDL03241.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_a [Mus musculus] Length = 1103 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/334 (8%), Positives = 91/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPERNESEPGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S E+ + + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKRNRDEDPTLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG +Y ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|311251228|ref|XP_003124501.1| PREDICTED: integrin alpha-D-like [Sus scrofa] Length = 230 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 63/191 (32%), Gaps = 27/191 (14%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPY 272 +++ S+ + Y+ + + +S ++ + Sbjct: 35 KNFVRAVMDQFMDT-------SILFSLMQYSNLLKTHFTFSQFQTHRSPQSLVDPIVQLK 87 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T T + +EL++ K + + KK ++ ITDG+ Y++ L + Sbjct: 88 GLTYTATGIQTVVKELFHSKNGAR-----SSAKKILLVITDGQ-----KYKDPLEYEDVI 137 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDKITDKI 387 A + Y++ V + L S F V++ L K+ +KI Sbjct: 138 PQAEKANVIRYAIGVGDAFQEHSAKQELSIIGSLPSKDHVFKVDNFAALSSIQKKLQEKI 197 Query: 388 ----QEQSVRI 394 + VR+ Sbjct: 198 FAVEGRERVRL 208 >gi|198422516|ref|XP_002123195.1| PREDICTED: similar to EGF-like domain-containing protein [Ciona intestinalis] Length = 2053 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 58/180 (32%), Gaps = 20/180 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + +L S + + VR + YN + +N Sbjct: 1532 VDSSSSVKYNNFQKMKSFTSSLFQSFKL-----GDDRVRAALVRYNRRVDQRFGFADTNT 1586 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 EV+ ++ + T T A+ H + + +K VI ITDG + Sbjct: 1587 KEEVQDGVDAMPYRGGGTLTGQALQHVIDNTLKTENGAR-----DDAQKIVITITDGRSQ 1641 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + L +R AG+ + + + A + + + + F+V +L Sbjct: 1642 D--------DVLTPSRALRAAGVITFGIGIGARVL-ETQINEIGGDPSRTFSVETFDQLS 1692 >gi|170015985|ref|NP_001116166.1| complement component 2 [Xenopus laevis] gi|169642443|gb|AAI60743.1| LOC734198 protein [Xenopus laevis] Length = 662 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 75/196 (38%), Gaps = 14/196 (7%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + D+ + LV+ + + + T+ I + ++ S+N + Sbjct: 170 ASQSVGQANFDIYKACSEYLVDELALFDMTIQFGIISYATVPKVIIPIYDEE---SDNND 226 Query: 261 EVKSRL-NKLNPYE-----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 V + + N L + TNT A+ Y + ++KE+ N + +I +TDG Sbjct: 227 HVLTLIRNGLKYSDHKDKTGTNTKAALEEIYSMMSSQKETYKNESVWNSIHHIIILLTDG 286 Query: 315 ENSGASAYQNTLNTLQICEYMR---NAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFA 369 + + +T+ ++ ++ + +Y+ + + DL + G F Sbjct: 287 KANLGGRPAHTIKRIEDFLDIKHKREDYLDVYTFGIGPEVDMADLSEMASKKDGETHVFR 346 Query: 370 VNDSRELLESFDKITD 385 + + E+ F KI D Sbjct: 347 MESANEMKTVFQKIVD 362 >gi|169829413|ref|YP_001699571.1| BatA [Lysinibacillus sphaericus C3-41] gi|168993901|gb|ACA41441.1| BatA [Lysinibacillus sphaericus C3-41] Length = 973 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 40/324 (12%), Positives = 96/324 (29%), Gaps = 37/324 (11%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNP---LQYIAESKAQYEIPTENLFLKGLIPSAL 128 LK+ + + ++ +T + Y K +++ + L Sbjct: 573 LKESADAASRPSGKDRYTKVMVTLVRPGGEPVTDYQGTVKIKFDGVEKTASFITNTSDPL 632 Query: 129 TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 N T + S S V+ + + P K+ Sbjct: 633 NNTGSPGTAVAYFDSIIYGKSKVEATLVNPIDPRYATILKGLKDKTVTKDIFTNPNFSKN 692 Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 S T + + ++ +K I+ L + + Sbjct: 693 SCSLATEMAYVVDYSSSM--------KAVDPTNYRGKKMIEFINQLKAKNNIVI----ET 740 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + T L + + + T+ + + A + T+ Sbjct: 741 NTKATILGEGTTDAVLKKDLYKASKDKGATDIFAGIDIALTKFS----------NDTKTA 790 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP--EGQDLLRKCTDS 363 K ++ ++DG+ + ++ + G+KIY+V++ L++ T++ Sbjct: 791 KAIVVVSDGK-------TSKSKMIKAINDAKKQGVKIYTVSMGKKSQINDATLMQLSTET 843 Query: 364 SGQFFAVNDSRELLESFDKITDKI 387 G ++ D+ +L + F K+ D I Sbjct: 844 GGAYYHALDNLQLHQVFQKLIDAI 867 >gi|126272975|ref|XP_001371818.1| PREDICTED: similar to anthrax toxin receptor [Monodelphis domestica] Length = 858 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 34/198 (17%), Positives = 63/198 (31%), Gaps = 18/198 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + + + ++ ++ + +R+ I ++ L+ + ++ Sbjct: 348 CVILDKSGSVKHHWIEIYSFVESLAEKFISPMLRMSFIVFSSKGT--TIMKLTEDREAIR 405 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 L L P +T + A ++Y H G R +I +TDGE Sbjct: 406 QGLEVLRYEVPGGDTFMHKGFERANEQIY------HENYGGLRTASVIIALTDGELQKEQ 459 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN-DSRELLES 379 Y + R G +Y V V + L DS F V L Sbjct: 460 FYF----AEKEVNRARTFGAIVYCVGV--KDFNETQLSTIADSIDHVFPVTGGFHALRGV 513 Query: 380 FDKITDKIQEQSVRIAPN 397 D I K + + + P+ Sbjct: 514 IDSILKKSCIEILAVEPS 531 >gi|82619290|gb|ABB85337.1| complement C2 [Xenopus laevis] Length = 753 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 33/196 (16%), Positives = 75/196 (38%), Gaps = 14/196 (7%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + D+ + LV+ + + + T+ I + ++ S+N + Sbjct: 261 ASQSVGQANFDIYKACSEYLVDELALFDMTIQFGIISYATVPKVIIPIYDEE---SDNND 317 Query: 261 EVKSRL-NKLNPYE-----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 V + + N L + TNT A+ Y + ++KE+ N + +I +TDG Sbjct: 318 HVLTLIRNGLKYSDHKDKTGTNTKAALEEIYSMMSSQKETYKNESVWNSIHHIIILLTDG 377 Query: 315 ENSGASAYQNTLNTLQICEYMR---NAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFA 369 + + +T+ ++ ++ + +Y+ + + DL + G F Sbjct: 378 KANLGGRPAHTIKRIEDFLDIKHKREDYLDVYTFGIGPEVDMADLSEMASKKDGETHVFR 437 Query: 370 VNDSRELLESFDKITD 385 + + E+ F KI D Sbjct: 438 MESANEMKTVFQKIVD 453 >gi|19552620|ref|NP_600622.1| hypothetical protein NCgl1349 [Corynebacterium glutamicum ATCC 13032] gi|62390288|ref|YP_225690.1| Mg-chelatase subunit [Corynebacterium glutamicum ATCC 13032] gi|21324171|dbj|BAB98796.1| Hypothetical protein [Corynebacterium glutamicum ATCC 13032] gi|41325625|emb|CAF21414.1| secreted Mg-chelatase subunit [Corynebacterium glutamicum ATCC 13032] Length = 525 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 40/367 (10%), Positives = 99/367 (26%), Gaps = 33/367 (8%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 +A+++ + + + + GS + Q + N Sbjct: 170 SALVAMATAYADTGQALTTNDIPAIAEPMSTSLSGQTITSGSSGWLKDTFLEQPDRANAI 229 Query: 95 KDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 + + L + L+ ++ G L Sbjct: 230 INYESVLHTMISEDGADITVVVPADGVVSADYPLSTITGSDQGEHVAELAGWFAEHPDAL 289 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN------- 207 + + + P K + + P Sbjct: 290 TDTYRRPTTANATLPAELSSQTIIEAPFPGSKTVTDALIDAYTNQFRVPGETTFVLDVSG 349 Query: 208 ----RKIDVLIESAGNLVNSIQKAIQEKKN--LSVRIGTIAYNIGIVGNQCTPL----SN 257 ++I +L ++ +L++ + ++ I ++ G L S Sbjct: 350 SMLGQRITLLKDTMSDLISGGATTDLANVSLRDREKVSIIPFSFGPHEVISETLGAVGSP 409 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + +++ R+ L T Y A+ AY ++ +TDGE + Sbjct: 410 SRTDLQQRVEALQADGGTGIYDAVLAAY-----------AESAGGDYIPSIVLMTDGELT 458 Query: 318 GASAYQNTL-NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 Y L + +R+ + ++ + D+ + + G+ F + +L Sbjct: 459 AGRTYDQFLTEWNALPSNIRS--IPVFVILYG-EANVADMEQLAATTGGETFDAIN-GDL 514 Query: 377 LESFDKI 383 E+F +I Sbjct: 515 DEAFKEI 521 >gi|3182932|sp|Q28902|COCA1_RABIT RecName: Full=Collagen alpha-1(XII) chain gi|13195730|gb|AAB34889.2| type XII collagen [Oryctolagus cuniculus] Length = 639 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 18/136 (13%), Positives = 46/136 (33%), Gaps = 18/136 (13%) Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + L NT T A++ ++ R +K + ITDG++ Sbjct: 2 RKSLLQAVANLPYKGGNTLTGMALNFIRQQ-----NFKTQAGMRPRARKIGVLITDGKSQ 56 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 + + +++ G++++++ + + L+ + V D Sbjct: 57 DDVEAPS--------KKLKDEGVELFAIGI--KNADEVELKMIATDPDDTHAYNVADFDS 106 Query: 376 LLESFDKITDKIQEQS 391 L + D +T + Sbjct: 107 LSKIVDDLTINLCNSV 122 >gi|311030436|ref|ZP_07708526.1| hypothetical protein Bm3-1_07816 [Bacillus sp. m3-13] Length = 921 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 55/141 (39%), Gaps = 19/141 (13%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N +EV + T+ +PA++ AY++L +K +I +TDG+++ Sbjct: 463 NKDEVIETIRSTALGGGTDIFPALNQAYQQLNEM----------DLKRKHIILLTDGQSN 512 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 +I E + + +VA+ LL + + G+F+ V ++ + Sbjct: 513 DGPY-------EEIIEEGLTNNVTLSTVAIG-GDADTSLLEELAEIGTGRFYEVYEASAV 564 Query: 377 LESFDKITDKIQEQSVRIAPN 397 + T + + P+ Sbjct: 565 PSILSRETALTTKTYIEDNPH 585 >gi|224824215|ref|ZP_03697323.1| conserved hypothetical protein [Lutiella nitroferrum 2002] gi|224603634|gb|EEG09809.1| conserved hypothetical protein [Lutiella nitroferrum 2002] Length = 421 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 41/328 (12%), Positives = 95/328 (28%), Gaps = 19/328 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + I V F+ DL + + ++ A DA L+ A + D + ++ + Sbjct: 20 IVGLCIVVLIGFLGLVADLGRLFITKTELSDASDACSLAAAAELKGD---AESLSRAESA 76 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYE-----IPT 115 ++ K + + D+ +N T N + + +Y Sbjct: 77 GITVGQRNKVDFQANNVTIVPNQDVTFSDHLNGTYYTKNAVAPANIANMKYAKCTLPRTG 136 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + ++ + +++ ++ + S N AI + + D+ Sbjct: 137 IMPWFMQVMGAGAQSVTSQAVASLSPSQTNCAIPVGLC--SQTPPASCPDGSAPDSYGFC 194 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 K+ ++ + Y+P A+ L + +N Q S Sbjct: 195 VGKWYGSRFTAGGGFTGSFNLIDYSPPSGGASELSGQLTGAGQCNLNVTNPVGQTGMQQS 254 Query: 236 VRIGTIA----YNIGIVGNQCTPLSNNLN-EVKSRLNKLNPYENTNTY----PAMHHAYR 286 + Y + P S ++ N Y+ T+ A A+ Sbjct: 255 IANAWNTRFGLYQGSYNVSNAPPDFTGYAYTAISWPSQYNAYDGTDGTRPNFQASRTAHT 314 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDG 314 + + + GS +G Sbjct: 315 VYQGDAAAGLSLTGSGSSATSTQLRDNG 342 >gi|190336734|gb|AAI62194.1| Coagulation factor C homolog, cochlin (Limulus polyphemus) [Danio rerio] gi|190339304|gb|AAI62181.1| Coagulation factor C homolog, cochlin (Limulus polyphemus) [Danio rerio] Length = 553 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 43/317 (13%), Positives = 86/317 (27%), Gaps = 33/317 (10%) Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 ++ + G ++ + P + E+ N+F + + SL S Sbjct: 259 NFFSPDFGVRRGYPRVIVVFVDGWPSDNVEEAAILARESGINIFFVSVAKPSPEEASLVS 318 Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKH-NDNNNMTSNKYLLPPPPKKSFWSKNT 194 R + + +M + K + S + Sbjct: 319 DQDFMRKAVCKDNEFF-----TFTMPSWFSTNKFVKPLAQKLCSIDQMLCSKTCYNSVDL 373 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + LV+ + + ++ RIG I + Sbjct: 374 GFLIDGSSSVG----DGNFRLVLDLLVSIARS--FDISDIGSRIGAIQFTYDQRMEFNFN 427 Query: 255 LSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 + L K+ T T A++ A R L+ + S+ +KF+I ITD Sbjct: 428 DHVLKDNALRALQKIPYMSGGTATGDAINFAVRSLFKPR--------SSSNRKFLIIITD 479 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVN 371 G++ + G+ +Y+V V + L+ F Sbjct: 480 GQSYDDVRVP--------AMAAQREGITVYAVGV--AWAPMEDLKAMASEPKESHVFFTR 529 Query: 372 DSRELLESFDKITDKIQ 388 + L + I I Sbjct: 530 EFTGLGQFQQPIVRGIC 546 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/199 (14%), Positives = 59/199 (29%), Gaps = 28/199 (14%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + R+ ++ L + + +G + + Sbjct: 177 SSYNIGQRRFNLQKNFVSKLA-----TMLKVGTQGPHVGVVQTSETPRTEFYLTNYTTAK 231 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +V + ++ NTNT A+ H R ++ + + ++ DG S Sbjct: 232 DVTFAIKEIPYIGGNTNTGKAILHTVRNFFS-----PDFGVRRGYPRVIVVFVDGWPSD- 285 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------EGQDLLRKCTDSSGQFFAVND 372 N + R +G+ I+ V+V+ P QD +RK +FF Sbjct: 286 -------NVEEAAILARESGINIFFVSVAKPSPEEASLVSDQDFMRKAVCKDNEFFTFTM 338 Query: 373 SR--ELLESFDKITDKIQE 389 + + K+ Sbjct: 339 PSWFSTNKFVKPLAQKLCS 357 >gi|169234588|ref|NP_001038425.2| voltage-dependent calcium channel subunit alpha-2/delta-1 [Danio rerio] gi|169154233|emb|CAH68946.2| novel protein similar to vertebrate calcium channel, voltage-dependent, alpha 2/delta subunit 1 (CACNA2D1) [Danio rerio] Length = 1069 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/308 (9%), Positives = 87/308 (28%), Gaps = 20/308 (6%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 + + ++ T+D + A + IPT+ +I + L Sbjct: 128 YYNAKDKSDNQTEPRKNRLILDFTEDPAFKRPVSYNTTAVH-IPTDIYEGSTIILNELNW 186 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSR-SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 + + ++ ++ + + + + N + P + Sbjct: 187 TAALDDVFRKNKEDDPSLHWQVFGSATGLARYFPASPWVDSKNKIDLYDVRRRPWYIQGA 246 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 S + + + + ++ S ++ ++ + Sbjct: 247 ASPKDMLILVDASGSVSGLTLKLIRTSVSEMLETLSDDDYVNIVSFNNSAKSVACFENLV 306 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 N +K + K+ T+ A+ +L + S K ++ Sbjct: 307 QANVR---NKKTLKEAVQKITANGTTDYKIGFKEAFNQLASMNVSRA------NCNKIIM 357 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFF 368 TDG AS + N+ + ++I++ +V + ++ + G ++ Sbjct: 358 LFTDGGEDKASEIFDEYNS--------DKRVRIFTFSVGQHNYDKAPIQYMACHNKGYYY 409 Query: 369 AVNDSREL 376 + + Sbjct: 410 EIPSIGAI 417 >gi|326666584|ref|XP_687953.4| PREDICTED: collagen alpha-1(VII) chain [Danio rerio] Length = 2001 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 27/296 (9%), Positives = 75/296 (25%), Gaps = 36/296 (12%) Query: 115 TENLFLKGLIPSALT-NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 K ++ + + + ++++ + + +R+ + Sbjct: 867 PVTTITKRILGEPRLVPVQSSTPAPVSANTKSNDMHLQTTTRNTRTPVVKAPVTASATGQ 926 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSK-------------YAPAPAPANRKIDVLIESAGNL 220 S + +S S T + + L + Sbjct: 927 TASALIRMTTVNDQSILSAETPPPGPVCGRVKADIVFLVDESWSIGTNNFGKLKDFLFRT 986 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYP 279 V + +I + Y+ + N V L ++ T T Sbjct: 987 VTYFPSIGPKGT----QIAVVHYSDQPRIEFNFNTHKDRNSVLRALREVRYGGGNTKTGR 1042 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 + + RE++ E ++ +TDG + G Sbjct: 1043 GISYVLREMFQESLG-----MRQEAPHVLVLLTDGRAQDDVEPPS--------RIAHALG 1089 Query: 340 MKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + + + + +R + F +D +L + I ++++ Sbjct: 1090 VSVLVIGI--AHADMEEVRTIASPTTYKNIFYASDFDDLPSIEREFIGSICSEALQ 1143 >gi|94732541|emb|CAK05117.1| novel protein similar to vertebrate collagen family [Danio rerio] Length = 1721 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 27/296 (9%), Positives = 75/296 (25%), Gaps = 36/296 (12%) Query: 115 TENLFLKGLIPSALT-NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 K ++ + + + ++++ + + +R+ + Sbjct: 756 PVTTITKRILGEPRLVPVQSSTPAPVSANTKSNDMHLQTTTRNTRTPVVKAPVTASATGQ 815 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSK-------------YAPAPAPANRKIDVLIESAGNL 220 S + +S S T + + L + Sbjct: 816 TASALIRMTTVNDQSILSAETPPPGPVCGRVKADIVFLVDESWSIGTNNFGKLKDFLFRT 875 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYP 279 V + +I + Y+ + N V L ++ T T Sbjct: 876 VTYFPSIGPKGT----QIAVVHYSDQPRIEFNFNTHKDRNSVLRALREVRYGGGNTKTGR 931 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 + + RE++ E ++ +TDG + G Sbjct: 932 GISYVLREMFQESLG-----MRQEAPHVLVLLTDGRAQDDVEPPS--------RIAHALG 978 Query: 340 MKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + + + + +R + F +D +L + I ++++ Sbjct: 979 VSVLVIGI--AHADMEEVRTIASPTTYKNIFYASDFDDLPSIEREFIGSICSEALQ 1032 >gi|260297|gb|AAB24261.1| type VI collagen alpha 3 chain [Homo sapiens] Length = 205 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 18/172 (10%), Positives = 57/172 (33%), Gaps = 17/172 (9%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + + K++ +N + +N EV S ++ ++ TN Sbjct: 27 VREFLYDVVKSLAVGENDF-HFALVQFNGNPHTEFLLNTYRTKQEVLSHISNMSYIGGTN 85 Query: 277 -TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T + + + + ++ + + ++ +TDG + A + + Sbjct: 86 QTGKGLEYIMAKHLT---KAAGSLAGDGVPQVIVVLTDGHSKDGLALPS--------AEL 134 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 ++A + ++++ V + L++ F + + L + + Sbjct: 135 KSADVNVFAIGV--EDADEGALKEIASEPLNMHMFNLENFTSLHDIVGNLVS 184 >gi|157962337|ref|YP_001502371.1| cell wall anchor domain-containing protein [Shewanella pealeana ATCC 700345] gi|157847337|gb|ABV87836.1| LPXTG-motif cell wall anchor domain [Shewanella pealeana ATCC 700345] Length = 789 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 61/167 (36%), Gaps = 11/167 (6%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + + + G + NL + ++ +N+L T A++ A Sbjct: 432 PTDKFNIVQFNSDVDKWSGMAMSATPYNLAQAQNYINRLEANGGTEMSIAINAALNIETV 491 Query: 291 EKESSHNTIGS----TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + + + + + L + V+FITDG S S + + ++ +++++ Sbjct: 492 TDKETGTELDNNDLGSNLLRQVLFITDGAVSNESMLFELIEAQ-----LGDS--RLFTIG 544 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + P + R G + + E+ + + KI++ V Sbjct: 545 IGSAPNAHFMQRAAQLGRGTYTYIGKLDEVNQKVVSLLKKIEKPQVT 591 >gi|260808373|ref|XP_002598982.1| hypothetical protein BRAFLDRAFT_150456 [Branchiostoma floridae] gi|229284257|gb|EEN54994.1| hypothetical protein BRAFLDRAFT_150456 [Branchiostoma floridae] Length = 132 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 22/133 (16%), Positives = 48/133 (36%), Gaps = 11/133 (8%) Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N + + +N ++ T T A+ A ++ L + I +TDG Sbjct: 10 TNKASLATAINNVSYQSGGTQTGRALDAARTQMD--WRQPPVPNVCFSLLQAAIVVTDGM 67 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + N Q + +R+ + Y V + +L +G F + + + Sbjct: 68 --------SGDNVQQPAKALRDNDISAYGVGIGPAINANELNEIAGGDAGHVFYIPNYDK 119 Query: 376 LLESFDKITDKIQ 388 L + +KI++ + Sbjct: 120 LEKEMEKISNSVC 132 >gi|198434986|ref|XP_002126110.1| PREDICTED: similar to RIKEN cDNA E330026B02 [Ciona intestinalis] Length = 1715 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 55/161 (34%), Gaps = 12/161 (7%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + R+G + Y N ++V ++ + + A + L Sbjct: 279 TVGSQFTRVGMMQYGDEPHTEFDLNTFQNGSQVFEAISNVTQIGGESGPYAA--ILQVLR 336 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + + + + + +IF+TDG ++ + I +R +G +Y++ V Sbjct: 337 RSLTAQYGSREN--VSQIIIFVTDG-----GVVDDSEESQTILNELRFSGALVYTIGVGR 389 Query: 350 PPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQ 388 + LR + EL + +I D+I Sbjct: 390 -MVSRPQLRMIASRPASHHVTTIASYSELSATKSQIIDRIC 429 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 47/136 (34%), Gaps = 16/136 (11%) Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 +N +++ + L NT T A+ ++ + + K + ITDG Sbjct: 1023 DNREDLRESIMSLEHCTGNTFTAKAIKFVTNYAFSTERGGR-----PNVLKVALLITDGR 1077 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDS 373 ++ E G+KI+ + V L + +S V + Sbjct: 1078 AQDYRNIRSE------AEQAHTEGIKIFGIGVG--EAQLSELEDMSSLPTSQHTMFVQNY 1129 Query: 374 RELLESFDKITDKIQE 389 +++ + ++ +I + Sbjct: 1130 QDIEKLRSRLVHRICQ 1145 Score = 39.9 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 46/168 (27%), Gaps = 34/168 (20%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-------PYENTNTYPAMHHAYRE 287 R+ I Y+ E+ +N + + H Sbjct: 490 YTRVSIIVYSGTCSLEFSFEEYLRKGELLEYINNMTSFVQDEPANAGFALNFTLKHILNS 549 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L + + V+ +TD +T + +I R + ++S+ + Sbjct: 550 LRVDAQP------------VVVHLTD--------SGSTDDVTRIGNEFRALKIPVFSIGI 589 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + Q L SS VN L + + E RI+ Sbjct: 590 GSGI-DQVELSSIATSSQNVMIVNSDITL------LPSQASELVTRIS 630 >gi|86147474|ref|ZP_01065786.1| hypothetical protein MED222_21509 [Vibrio sp. MED222] gi|85834767|gb|EAQ52913.1| hypothetical protein MED222_21509 [Vibrio sp. MED222] Length = 460 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 45/386 (11%), Positives = 102/386 (26%), Gaps = 89/386 (23%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M+ I + L + ++D ++++ L S + +D Sbjct: 19 MSIIALPFILLVVGLSVDAGRAYIVKSK---------LFAAVDAASIAAARAVANGEDAG 69 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +K ++ Y + A D + + AQ Sbjct: 70 RAAAQKYFSANIPADFYSATPSLGAVNFA-----YDSFGNISIDISATAQV-----PTIF 119 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 LI N + + I + + +V+D + S+ + Sbjct: 120 LPLIGLDTFNPGVSAQSIRR------PVDLVLVIDNTTSLRLGSIGDVTQ---------- 163 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + +I ++ + G V A + S++ Sbjct: 164 ----------DVIDRSKSFVENFHEGFDRISLVKFAFGAEVPVGFNATRGHSRSSIKSEI 213 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 ++N G N TN M+ A EL + ++ Sbjct: 214 DSFNFGSTSNAQ---------------------YTNASEGMYRALNELRTVTDPANL--- 249 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR-- 358 K ++F TDG + + + +R++ + + L R Sbjct: 250 -----KVIVFFTDGAPNTFATTFDFEGGGTHTGAIRSSD--------GSSGRPRGLWRHD 296 Query: 359 KCTDS-SGQFFAVNDSRELLESFDKI 383 + G + + R++ + +I Sbjct: 297 TIATTLPGGY----EGRDIDDYISEI 318 >gi|326918656|ref|XP_003205604.1| PREDICTED: anthrax toxin receptor 2-like, partial [Meleagris gallopavo] Length = 480 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 33/198 (16%), Positives = 75/198 (37%), Gaps = 20/198 (10%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + ++ + + + + + + +R+ I ++ PL+ + ++K Sbjct: 51 YFVLDKSGSVAQNWHEIFDFVNQLTERFVSPKMRLSFIVFSSQA--QVIMPLTGDREKIK 108 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 L L+ P +T + + A ++ G++R +I +TDG+ G Sbjct: 109 EGLKNLSEVKPAGDTYIHEGLKQANLQIE--------KQGASRFSSIIIALTDGKLDG-- 158 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN-DSRELLES 379 Q L + + R G ++Y V V Q L + D+ Q F V + L Sbjct: 159 --QIPLYAEKEAKTSRQLGARVYCVGVL--DFVQAQLERIADTKEQVFPVTGGFQALKGI 214 Query: 380 FDKITDKIQEQSVRIAPN 397 + + + + + + P+ Sbjct: 215 INSVLKQSCTEILYLEPS 232 >gi|326426687|gb|EGD72257.1| hypothetical protein PTSG_00277 [Salpingoeca sp. ATCC 50818] Length = 2847 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 56/167 (33%), Gaps = 15/167 (8%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTN 276 ++ + A+ VRI Y P + + ++ ++ + T Sbjct: 846 RTFLSDLTDALFAFPGNDVRISIAEY-STTYTQVLVPYTADETTAQNTISNVVQSGGATA 904 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T A+ A ++ + ++ +TDG S Q ++ + Sbjct: 905 TGTALGLAADDIGANARP--------DAARVLVLLTDGATSDG--DQQNIDPS--VSDLN 952 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + G+ I ++ + + +LL+ + F +L + D+I Sbjct: 953 SIGVSITAIGIGDNADETELLQ-IAGDPTRVFNNIAFVDLGDFIDEI 998 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 58/165 (35%), Gaps = 17/165 (10%) Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 + E V T ++ ++ LN + S +N ++ T T A+ A Sbjct: 1280 TQIDLEGTGSRVAAMTFCATQTLLTTFTGVTTDALNALDSAVNTVSC--GTATGAALDFA 1337 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + N+ + ++ VI +TDG + S++ + + ++++ Sbjct: 1338 IANILNDN-------SNPGARRVVIVVTDGASQEDSSF-----VTAASDRLHAEVDEVFA 1385 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + V P L+ S+ F D L + +T ++ Sbjct: 1386 IGVG-PASNLAELQTIATSNDNAFIETDFDALRD--RDLTQQLCS 1427 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 49/136 (36%), Gaps = 10/136 (7%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 V S + + + L T T A+ A R++++ + ++ + + Sbjct: 502 SDVQEVSLQFSYDKPTIVDTIRNLRHIQRGTATGAALELARRDIFSSSDPNYR---NLAV 558 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 + ITDG++ + + +R G+ + ++ + D L S+ Sbjct: 559 PAIAVVITDGQSQ-----EEPAFVAEQALRLRELGVTVLALGIG-DGTDPDELLAIAGSA 612 Query: 365 GQFFAVNDSRELLESF 380 + V +L++ F Sbjct: 613 DRVEQVEQFGDLVQEF 628 >gi|1399245|gb|AAB03226.1| integrin alpha-M [Rattus norvegicus] Length = 205 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 57/170 (33%), Gaps = 16/170 (9%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMH 282 + +++ + + Y+ + + KS + + T T + Sbjct: 1 VSTVMEQFQKSKTLFSLMQYSDEFRTHFTFNXFKRNPDPKSHVRPIRQLNGRTKTASGIR 60 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 REL+ + + K ++ ITDGE + + LN + AG+ Sbjct: 61 KVVRELFQKINGAR-----DNAAKILVVITDGE-----KFGDPLNYEDVIPEAEEAGIIR 110 Query: 343 YSVAVSAP---PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 Y + V P+ + L F V++ L +++ +KI Sbjct: 111 YVIGVXNAFHKPQSRRELDTIASKPAGDHVFQVDNFEALNTIRNQLQEKI 160 >gi|159900699|ref|YP_001546946.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893738|gb|ABX06818.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 828 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 21/119 (17%), Positives = 49/119 (41%), Gaps = 18/119 (15%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 L+E+++ + L+ T+ Y A+ L + T + + +TDG + G Sbjct: 450 LSEIQNNIAGLSAGGGTDIYAALEVGMGGLAQQ----------TGKVRHAVLLTDGRSGG 499 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSREL 376 S+Y++ + +R G+ + ++A+ LL +G++ + +L Sbjct: 500 ESSYESLIAP------LRAQGITLSTIAIG-GDADTVLLESLAKLGAGRYHFASRPDDL 551 >gi|161086904|ref|NP_001104317.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 3 [Rattus norvegicus] gi|17864880|gb|AAL47093.1|AF400662_1 L-type calcium channel alpha2/delta subunit [Rattus norvegicus] Length = 1079 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/334 (8%), Positives = 92/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E + +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPERNESESGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S E+ + + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKRNRDEDPTLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG +Y ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|311254858|ref|XP_001927013.2| PREDICTED: calcium-activated chloride channel regulator 4 [Sus scrofa] Length = 910 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 52/135 (38%), Gaps = 26/135 (19%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ + A+ + + S ++ +TDGE++ T Sbjct: 360 TASGGTSICSGIRRAFEVV--------RKLYSHTDGSEIVLLTDGEDN----------TA 401 Query: 330 QIC-EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKIT- 384 C + ++ +G I+ +A+ P + ++ T + G F D E L+++F + Sbjct: 402 GACVDEVKQSGAIIHFIALG-PSADKAVIEMSTATGGVHFYATDEAENNGLIDAFGALAS 460 Query: 385 --DKIQEQSVRIAPN 397 I +QS+++ Sbjct: 461 GNTDISQQSLQLESK 475 >gi|307245403|ref|ZP_07527491.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 1 str. 4074] gi|306853744|gb|EFM85961.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 1 str. 4074] Length = 538 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 46/370 (12%), Positives = 101/370 (27%), Gaps = 67/370 (18%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVS----------------DRTIKDPTTKK 57 +++ A I+ + ++ +L+ AVLS A + + + + Sbjct: 36 FVSLESAGIIQDQARLSDSLEQAVLSLTAENNNGRKDNDYKLSGSSNKENDSFDISSEVG 95 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + S + +K L Q + + N I +N T K + I ++ Sbjct: 96 KRDSQMVTTFVKAFLPQTNDDKMNL--IPICKTVNNTSGKGHTSSSEVTCTVSGTIEHKS 153 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENL---AISICMVLDVSRSMEDLYLQKHNDNNN- 173 F + + + + +N I + +V D+S SM+D + Sbjct: 154 WFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIPIDLMVVADLSGSMKDGIKGEKLKGGTN 213 Query: 174 ----------MTSNKYLLPPPPKKSFWSKNT---------TKSKYAPAPAPANRKIDVLI 214 L + K P + + Sbjct: 214 SKIYILREVLKELADKSLFTQEANEYNRIGITAFAMGAEHPKENKCVLPFVLQNNLHEMS 273 Query: 215 ES------------------AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 +S N V + I I S Sbjct: 274 KSKIKQYLTSSHKSLRRTEFVDNFVALLDTEATLNSIGKPNYDIIFPKSSICLEGLKKAS 333 Query: 257 N------NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + ++R++ L T + A ++ +EK S + K+ ++ Sbjct: 334 QFWYTKEEKEKFRNRVDSLKANGGTLASSGLLTASNQMLSEKSRSEEL--NQETKRVILV 391 Query: 311 ITDGENSGAS 320 ++DG + ++ Sbjct: 392 LSDGNDDMSN 401 Score = 43.0 bits (99), Expect = 0.081, Method: Composition-based stats. Identities = 14/97 (14%), Positives = 35/97 (36%), Gaps = 1/97 (1%) Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 +L + S+ + ++T+ + ++ ++ ++ K+ V Sbjct: 430 EDLSSTTTSNKAYYNRHSTFNYNTYLTNKTKDISRKGMCSIIQEKLNTLNKDKNTKLVFV 489 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 E D + C + G +F ++ LL SF + Sbjct: 490 EFGYRSESADAWKTCVGN-GNYFYADNRESLLNSFKQ 525 >gi|325678986|ref|ZP_08158584.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324109490|gb|EGC03708.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 782 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 21/198 (10%), Positives = 64/198 (32%), Gaps = 18/198 (9%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + P +++ +K + RIG + + +++ Sbjct: 292 NSGSMYPKELCPTSSENDVDFKRLDFTQSLIDKFDDDFRIGISKF--TGTYTKMCDFTDD 349 Query: 259 LNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 E++ LN++ + T A+ E + + ++ ++DG Sbjct: 350 RTELRKVLNRIRTEDEIFDGTYNQTALKKCINEFSAAGDG--------KYVNIIVMLSDG 401 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 E+ + + + + +V + + L + G++++ +D+ Sbjct: 402 ESDE----VDAETIESLSNLANEKSVIVLTVGLGREIDRAWLQEVAYSTGGKYYSASDAT 457 Query: 375 ELLESFDKITDKIQEQSV 392 L + + +I + V Sbjct: 458 SLDDVYKQIVTTLNYDIV 475 >gi|223973011|gb|ACN30693.1| unknown [Zea mays] Length = 481 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 46/128 (35%), Gaps = 20/128 (15%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 ++ ++ ++ L P NTN + + L + K SS +G V+ ++DG+ Sbjct: 85 DSQPQLLKLIDALQPGGNTNISDGLQTGLKVLADRKLSSGRVVG-------VMLMSDGQQ 137 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSR 374 + ++ + +Y+ A +L G F VND Sbjct: 138 NRGEP----------AANVKIGNVPVYTFGFGAD-YDPTVLNAVARNSMGGTFSVVNDVN 186 Query: 375 ELLESFDK 382 L +F + Sbjct: 187 LLSMAFSQ 194 >gi|218961689|ref|YP_001741464.1| hypothetical protein; putative membrane protein [Candidatus Cloacamonas acidaminovorans] gi|167730346|emb|CAO81258.1| hypothetical protein; putative membrane protein [Candidatus Cloacamonas acidaminovorans] Length = 331 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 58/162 (35%), Gaps = 40/162 (24%) Query: 254 PLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+++ V+ LN LN T+ A+ A + K ++ Sbjct: 144 PLTDDYEAVRIVLNGLNSNTVEIPGTDIGSALRLAENAFPEGSK-----------SKTLV 192 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------G 353 I+DGE+ S L+ ++ G+++Y++ V +P Sbjct: 193 LISDGEDLQHS-------ALREARILKTKGIRVYTMGVGSPEGTIIRHPETGEEVKSKLD 245 Query: 354 QDLLRKCTD-SSGQFFAVN-DSRELLESFDKITDKIQEQSVR 393 + L++ + G+++ V E+ +I + + + Sbjct: 246 EATLQEIARITEGEYYRVTPGGEEIQLILKRIYESESTRRGK 287 >gi|145552898|ref|XP_001462124.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124429962|emb|CAK94751.1| unnamed protein product [Paramecium tetraurelia] Length = 533 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 15/129 (11%), Positives = 38/129 (29%), Gaps = 20/129 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N+ + + ++ L T+ M A L + + ++DG Sbjct: 181 NVQKFRVAISSLQARGGTDIGNGMKMALSILKH--------RKYKNPVSAIFLLSDGV-- 230 Query: 318 GASAYQNTLNTLQICEYMRNAGM----KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 + ++ + + + I + + + GQF+ V + Sbjct: 231 ------DEGAEERVRDDLIQYNIRDSFTIKTFGFGRDCCPKIMSEIAHYKEGQFYFVPNL 284 Query: 374 RELLESFDK 382 + E F + Sbjct: 285 TNIDECFAE 293 >gi|161086898|ref|NP_001104314.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform b [Mus musculus] gi|1905819|gb|AAB50139.1| voltage-gated calcium channel alpha2/delta subunit, alpha2b isoform [Mus musculus] Length = 1091 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/334 (8%), Positives = 91/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPERNESEPGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S E+ + + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKRNRDEDPTLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG +Y ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|301785912|ref|XP_002928373.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-E-like [Ailuropoda melanoleuca] Length = 1188 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 51/156 (32%), Gaps = 16/156 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNT 298 + Y I S + +R+ + T T AM H ++ S Sbjct: 253 LVQYGEVIQTEFDLRDSQDAMASLARVQNITQVGKVTKTASAMQHVLDNIFTPSHGSR-- 310 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---APPEGQD 355 K ++ +TDG+ +++ LN + + G++ +++ V + + Sbjct: 311 ---KNASKVMVVLTDGD-----IFEDPLNLTTVISSPKMQGVERFAIGVGKAFENNKTYN 362 Query: 356 LLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 L+ F V + L K+ I + Sbjct: 363 ELKLIASDPDDRYAFKVTNYTALDGLLSKLQQNIIQ 398 >gi|281346139|gb|EFB21723.1| hypothetical protein PANDA_018300 [Ailuropoda melanoleuca] Length = 1151 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 51/156 (32%), Gaps = 16/156 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNT 298 + Y I S + +R+ + T T AM H ++ S Sbjct: 219 LVQYGEVIQTEFDLRDSQDAMASLARVQNITQVGKVTKTASAMQHVLDNIFTPSHGSR-- 276 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---APPEGQD 355 K ++ +TDG+ +++ LN + + G++ +++ V + + Sbjct: 277 ---KNASKVMVVLTDGD-----IFEDPLNLTTVISSPKMQGVERFAIGVGKAFENNKTYN 328 Query: 356 LLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 L+ F V + L K+ I + Sbjct: 329 ELKLIASDPDDRYAFKVTNYTALDGLLSKLQQNIIQ 364 >gi|149410251|ref|XP_001508722.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 950 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 20/167 (11%), Positives = 50/167 (29%), Gaps = 11/167 (6%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 ++ + + + N+ K + + TN A+ A Sbjct: 281 DLNPEDQFNLVVFNSMISQWQPSLLKATQENVGSAKKFVLDIRASGGTNINEAVLAAVHL 340 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L + S +I +TDGE + N N Q + + ++ + Sbjct: 341 LDESNQRELLPENS---VSMIILLTDGEPTVGE--TNPENIQQNIQRSLDGKYALFCLGF 395 Query: 348 SAPPEGQDLLRKCTDSSG----QFFAVNDSR-ELLESFDKITDKIQE 389 L K + + + +D+ +L + + ++ + + Sbjct: 396 GFDV-SYSFLEKMALDNSGLARRIYEDSDAALQLQDFYQEVATPLLK 441 >gi|161086900|ref|NP_001104315.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform c [Mus musculus] gi|1905821|gb|AAB50140.1| voltage-gated calcium channel alpha2/delta subunit, alpha2c isoform [Mus musculus] gi|109732367|gb|AAI15872.1| Cacna2d1 protein [Mus musculus] gi|148671297|gb|EDL03244.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_d [Mus musculus] Length = 1086 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/334 (8%), Positives = 91/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPERNESEPGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S E+ + + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKRNRDEDPTLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG +Y ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|31542335|ref|NP_037051.2| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 1 [Rattus norvegicus] gi|11055592|gb|AAG28164.1|AF286488_1 voltage-gated calcium channel alpha2/delta-1 subunit [Rattus norvegicus] Length = 1091 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/334 (8%), Positives = 92/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E + +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPERNESESGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S E+ + + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKRNRDEDPTLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG +Y ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|197295155|ref|YP_002153696.1| hypothetical protein BCAS0306 [Burkholderia cenocepacia J2315] gi|195944634|emb|CAR57238.1| putative membrane protein [Burkholderia cenocepacia J2315] Length = 423 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 22/213 (10%), Positives = 60/213 (28%), Gaps = 6/213 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTI----KDPTTK 56 + + ++V F+ A+DL + R+++Q++ DA LS + S ++ D Sbjct: 25 IVGLSLAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSAISLSVAEADGIAA 84 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 F+K + + + + + + + Sbjct: 85 GHLNFVFFQK-TSVQMSTNANVTFSDSLTNPFLTKSAVTKPASIKYVQCTATLSNIAHWF 143 Query: 117 NLFLKGLIPSA-LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 L L + + ++ I +I + + S + + + + Sbjct: 144 IEVLNVLPGTKLANAAEVSASAIATVGGGQTTCAIPVFVCRGPSDPAYKVGDWISSPSGS 203 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 S+ Y + + + Sbjct: 204 SSTYGPGNFGWAALDGSTNETTLASELSGNTCN 236 >gi|156409369|ref|XP_001642142.1| predicted protein [Nematostella vectensis] gi|156229283|gb|EDO50079.1| predicted protein [Nematostella vectensis] Length = 182 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 60/191 (31%), Gaps = 24/191 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + LV S S R+ I Y+ + Sbjct: 14 ASSSVGKVNFERVKGFIRRLVESFH-----ISRTSTRVAAIVYSSRPRVAFDFNRYTSAR 68 Query: 261 EVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + +L T+T A+ A L+ + +K ++ ITDG+ Sbjct: 69 RAAHAVKRLRFLRGGTSTGRALRLASSRLF--------RRYGRKRRKVLMLITDGK---- 116 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 ++ + L+ + ++ G++I++V V ++ L Q + L Sbjct: 117 ----SSDDVLKPSKALKRKGVQIFAVGVGM-SVSRNELILIASHPSQVYQA-SFTSLSAI 170 Query: 380 FDKITDKIQEQ 390 + K E Sbjct: 171 VKSLARKTCES 181 >gi|161086906|ref|NP_001104318.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 2 [Rattus norvegicus] gi|27450704|gb|AAO14652.1|AF486276_1 calcium channel alpha-2 delta-1 subunit isoform e [Rattus norvegicus] Length = 1084 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/334 (8%), Positives = 92/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E + +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPERNESESGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S E+ + + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKRNRDEDPTLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG +Y ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|313238855|emb|CBY13854.1| unnamed protein product [Oikopleura dioica] Length = 977 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 31/338 (9%), Positives = 85/338 (25%), Gaps = 39/338 (11%) Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 S + R+ + I I ++ + + Sbjct: 33 DVSQTATRVSVVQFTDNVSDRQVFRYYLSADFLLIVLTDGVSQDEIEKAASDLIGDKVLV 92 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ-KHNDNNNMTSN 177 F G+ L + + + ++ + N Sbjct: 93 FSVGIGN--------SVDANELEKIAGLPEYVFKTANYNALTGITDTLYNKLCSSLKSDN 144 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 +++ + K + G+ + + ++ + Sbjct: 145 CISSVKQDLSFIVDSSSSITISDYQKLKTWMKSIIEKLEIGDNASRVSILQFSGQSARPQ 204 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSH 296 I S + V ++ + +T A+ + YR ++ + Sbjct: 205 GRWI------NPVLTFDRSTSKEAVIGAIDGMKKLNGDTCIGEALDYFYRNMFTSQAGQR 258 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS-------- 348 + +++ VI +TDG+ + + E +R +IY++ + Sbjct: 259 -----SDVEQRVIVMTDGKRN------CPAEIAKPAELIRAQEAEIYAIGIGHQCGYGEN 307 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKIT 384 + L + F +N+ +L +I Sbjct: 308 HNCYDRQELHEIASKPADKYVFEINNFDQL--ILKRIG 343 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/198 (11%), Positives = 63/198 (31%), Gaps = 32/198 (16%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + ++ D + ++ S + R + Y + + Sbjct: 365 VDSSGSIGPKRFDYMKNWVKSIAASFK-----VGENFARFSVVQY-TKTAKTVVDFQTLD 418 Query: 259 LNEVKSRLNKLNP-------YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L+ + +++ + T T A+ A+ L + K+ V+ + Sbjct: 419 LSSISQKIDSMIYFQGRNGRGGKTFTGNALERAHTLLKESEPGR---------KRIVLLL 469 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDG + + + +R+ + I++V V +D L + T + + Sbjct: 470 TDGSSDDEYGS--------VAKAIRDDKVDIFAVGVGR--ARKDELVEITADEQRVWQTR 519 Query: 372 DSRELLESFDKITDKIQE 389 + + K+ ++ Sbjct: 520 TFNNIGQFNQKLLAEVCS 537 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 12/105 (11%), Positives = 28/105 (26%), Gaps = 9/105 (8%) Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + + +I +TDG + + + + ++SV Sbjct: 45 QFTDNVSDRQVFRYYLSADFLLIVLTDGVSQD--------EIEKAASDLIGDKVLVFSVG 96 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + + L K F + L D + +K+ Sbjct: 97 IG-NSVDANELEKIAGLPEYVFKTANYNALTGITDTLYNKLCSSL 140 >gi|300853770|ref|YP_003778754.1| hypothetical protein CLJU_c05700 [Clostridium ljungdahlii DSM 13528] gi|300433885|gb|ADK13652.1| hypothetical protein CLJU_c05700 [Clostridium ljungdahlii DSM 13528] Length = 484 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 14/130 (10%), Positives = 50/130 (38%), Gaps = 11/130 (8%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 ++ ++ + + + T+ + + L + +S++ +I ++DG+N Sbjct: 97 SDKERIEEMASNIPLGQKTDIGRGLLEGAKVLDSGHDSNNRP--------LIILLSDGKN 148 Query: 317 SGASAYQ-NTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTD-SSGQFFAVNDS 373 + + + ++ G +Y++ ++ + L + + G+ + N + Sbjct: 149 DSQRSASESLKDLNSAISTCKSKGYPVYTIGLNYDGTVDKAQLTQIASETKGKNYITNKA 208 Query: 374 RELLESFDKI 383 +L + I Sbjct: 209 SDLTDILKDI 218 >gi|302632554|ref|NP_001181863.1| integrin alpha-M [Pan troglodytes] Length = 1153 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 223 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 272 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 273 PEADREGVIRYVIGVGDAFHSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|296209823|ref|XP_002807089.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Callithrix jacchus] Length = 1094 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 30/334 (8%), Positives = 93/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNDSEPGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + E+ ++ + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG A N ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDGGEERAQEIFTKYN--------KDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|157838288|pdb|1BHQ|1 Chain 1, Mac-1 I Domain Cadmium Complex gi|157838289|pdb|1BHQ|2 Chain 2, Mac-1 I Domain Cadmium Complex Length = 189 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 75 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 124 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 125 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 184 >gi|119572528|gb|EAW52143.1| integrin, alpha M (complement component 3 receptor 3 subunit), isoform CRA_a [Homo sapiens] Length = 1153 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 223 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 272 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 273 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|119572529|gb|EAW52144.1| integrin, alpha M (complement component 3 receptor 3 subunit), isoform CRA_b [Homo sapiens] Length = 1152 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 223 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 272 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 273 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|64654539|gb|AAH96347.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] Length = 1152 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 223 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 272 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 273 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|64654595|gb|AAH96346.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] Length = 1152 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 223 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 272 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 273 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|33340728|gb|AAQ14925.1| Mac-1 alpha subunit [Pan troglodytes] Length = 1144 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 214 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 263 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 264 PEADREGVIRYVIGVGDAFHSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 323 >gi|253722212|pdb|1IDN|1 Chain 1, Mac-1 I Domain Metal Free gi|253722213|pdb|1IDN|2 Chain 2, Mac-1 I Domain Metal Free gi|313507148|pdb|1BHO|1 Chain 1, Mac-1 I Domain Magnesium Complex gi|313507149|pdb|1BHO|2 Chain 2, Mac-1 I Domain Magnesium Complex Length = 190 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 76 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 125 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 126 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 185 >gi|224831239|ref|NP_001139280.1| integrin alpha-M isoform 1 precursor [Homo sapiens] gi|307148|gb|AAA59544.1| glycoprotein Mac-1 [Homo sapiens] Length = 1153 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 223 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 272 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 273 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|386975|gb|AAA59903.1| neutrophil adherence receptor alpha-M subunit [Homo sapiens] Length = 1145 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 215 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 264 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 265 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 324 >gi|307114|gb|AAA59491.1| leukocyte adhesion glycoprotein precursor [Homo sapiens] Length = 1152 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 223 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 272 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 273 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|88501734|ref|NP_000623.2| integrin alpha-M isoform 2 precursor [Homo sapiens] gi|1708572|sp|P11215|ITAM_HUMAN RecName: Full=Integrin alpha-M; AltName: Full=CD11 antigen-like family member B; AltName: Full=CR-3 alpha chain; AltName: Full=Cell surface glycoprotein MAC-1 subunit alpha; AltName: Full=Leukocyte adhesion receptor MO1; AltName: Full=Neutrophil adherence receptor; AltName: CD_antigen=CD11b; Flags: Precursor gi|263049|gb|AAB24821.1| leukocyte integrin alpha chain [Homo sapiens] gi|64653358|gb|AAH96348.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] gi|68563402|gb|AAH99660.1| Integrin, alpha M (complement component 3 receptor 3 subunit) [Homo sapiens] gi|168275740|dbj|BAG10590.1| integrin alpha-M precursor [synthetic construct] Length = 1152 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 223 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 272 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 273 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 332 >gi|31615654|pdb|1NA5|A Chain A, Integrin Alpha M I Domain Length = 197 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 80 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 129 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 130 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 189 >gi|31615583|pdb|1MF7|A Chain A, Integrin Alpha M I Domain Length = 194 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 80 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 129 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 130 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 189 >gi|157831557|pdb|1JLM|A Chain A, I-Domain From Integrin Cr3, Mn2+ Bound Length = 192 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 81 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 130 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ +KI Sbjct: 131 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREKI 190 >gi|170700850|ref|ZP_02891839.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] gi|170134258|gb|EDT02597.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10] Length = 423 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 22/201 (10%), Positives = 62/201 (30%), Gaps = 6/201 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + +++ F+ A+DL + R+++Q++ D+ LS + S +++ Sbjct: 25 IVGLALAMMIGFVGLALDLGKLYVTRSELQNSADSCALSAARDLTSAISLQVAEADGIAA 84 Query: 61 STIFKKQIKKHLKQGSYIRE-NAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT---- 115 +++ Q D + T N + AQ Sbjct: 85 GHANFAFFQQNAVQMQTDSNVTFSDSLTNPFLTKTAVANPANVKYVKCTAQLSNIAHWFI 144 Query: 116 -ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 + G+ + + ++ + + AI + + S + + + + + Sbjct: 145 EVLNTIPGVQVANASQVAASAIATVGAGQTTCAIPVFVCKPASSAPYKVGDWISSPSGSS 204 Query: 175 TSNKYLLPPPPKKSFWSKNTT 195 T+ + T Sbjct: 205 TTYGPGNFGWAALDGSTNEPT 225 >gi|56675030|gb|AAW19657.1| matrilin-3 [Cervus elaphus] Length = 146 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 22/149 (14%), Positives = 48/149 (32%), Gaps = 18/149 (12%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNT 298 + Y + ++ +K + ++ P T + A+ A E + + + Sbjct: 3 VVNYASTVKIEFHLQTHSDKQSLKRAVARITPLSTGTMSGLAIQTAMDEAFTVEAGARGP 62 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 + K I +TDG ++ R +G+++Y+V V + L+ Sbjct: 63 SSNI--PKVAIIVTDGRPQD--------QVNEVAARARASGIELYAVGV--DRADMESLK 110 Query: 359 KCTDSS--GQFFAVND---SRELLESFDK 382 F V +L F + Sbjct: 111 MMASEPLDEHVFYVETYGVIEKLSSRFQE 139 >gi|74002027|ref|XP_544943.2| PREDICTED: similar to anthrax toxin receptor 2 [Canis familiaris] Length = 646 Score = 60.7 bits (145), Expect = 4e-07, Method: Composition-based stats. Identities = 29/167 (17%), Positives = 58/167 (34%), Gaps = 18/167 (10%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNE 291 S ++ PL+ + +++ L L +P T + + A ++ Sbjct: 233 SPQMRLSFIVFSSQATIILPLTGDRSKISKGLEDLKNVSPVGETYIHEGLKLANEQIQKA 292 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 G + +I +TDG+ G + + R+ G ++Y V V Sbjct: 293 --------GGLKTSSIIIALTDGKLDG----LVPSYAEKEAKISRSFGARVYCVGVL--D 338 Query: 352 EGQDLLRKCTDSSGQFFAVN-DSRELLESFDKITDKIQEQSVRIAPN 397 Q L + DS Q F V + L + I + + + + P+ Sbjct: 339 FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEILELRPS 385 >gi|114614242|ref|XP_001160235.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 1 isoform 1 [Pan troglodytes] Length = 1110 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 30/334 (8%), Positives = 93/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNDSEPGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + E+ ++ + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG A N ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDGGEERAQEIFTKYN--------KDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|73976417|ref|XP_539432.2| PREDICTED: similar to Matrilin-2 precursor [Canis familiaris] Length = 524 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 55/156 (35%), Gaps = 21/156 (13%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 + RIG I Y+ + +N ++ K ++ + T T A+H A + Sbjct: 215 TARIGIINYSHKVEKVAHLTQFSNKDDFKLAVDNMQYLGEGTYTATALHEANHMFEAARP 274 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +KK + ITDG+ ++ N ++ + + ++I+ + V+ + Sbjct: 275 G---------VKKVALVITDGQTD----SRDEKNLTEVVKRASDINVEIFVIGVAKKNDP 321 Query: 354 Q-----DLLRKCTDSSG--QFFAVNDSRELLESFDK 382 + + +D L ++ + Sbjct: 322 NFEMFHKEMNLIATDPDSEHVYQFDDFITLQDTLKQ 357 >gi|171921010|gb|ACB59193.1| TadG [Actinobacillus suis ATCC 33415] Length = 554 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 45/392 (11%), Positives = 105/392 (26%), Gaps = 82/392 (20%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 ++ + +++ A I+ + ++ +L+ AVLS A S R D Sbjct: 25 GLLALPIVALMFVSLESAGIIQDKARLSDSLEQAVLSLSAENNSGRKSNDYKLSNTDAEN 84 Query: 63 ---------------IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES 107 I K + +L Q + + + Sbjct: 85 GHFNPNSKIGERDLEISKSFVTTYLPQTD--PNKIKLQPVCTTTDKKNRQGHTASTETIC 142 Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENL---AISICMVLDVSRSMEDLY 164 I ++ F + + + + + +N I + + D+S SM Sbjct: 143 TVAGTIEHKSWFPLKVGSTEVIPTEVNIASNSKAIKKNTISIPIDLMVAADLSGSMRYDL 202 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP--APAPANRKIDVLIES------ 216 ++ + + L + S S + +S A +P + Sbjct: 203 ENRYEPKDGTSKIDILKAVLTELSSNSLFSQESNDNNRIAVSPFALGAEYSTTECTLPFA 262 Query: 217 --AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC--------------TPLSNNLN 260 N + K++ +V+ Y + ++N L+ Sbjct: 263 LKNNNRTINYTKSLGIPTTENVQDIIKNYLTKSGSSNSQLSRAIFTQSLVTQIDVTNTLS 322 Query: 261 EV------------------------------------KSRLNKLNPYENTNTYPAMHHA 284 + + +N L +T + A Sbjct: 323 SIGSLDKVGLKFPKNAYCLGDKNRNQHQWFTREEQDKFSTFVNSLEAIGSTFAGSGLLAA 382 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 ++ KE+S K+ ++ ++DG + Sbjct: 383 ADKML--KETSRTQKLGEETKRVLLVLSDGND 412 Score = 43.4 bits (100), Expect = 0.062, Method: Composition-based stats. Identities = 39/364 (10%), Positives = 93/364 (25%), Gaps = 29/364 (7%) Query: 38 LSGCASIVSD--RTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 L A + +++ KD TS I +K L + S + + +I ++ Sbjct: 188 LMVAADLSGSMRYDLENRYEPKDGTSKI--DILKAVLTELSSNSLFSQESNDNNRIAVSP 245 Query: 96 DKNNPLQYIAESK--AQYEIPTENLFLKGLIPSALTNLSLRSTGII-------ERSSENL 146 E + + + T Sbjct: 246 FALGAEYSTTECTLPFALKNNNRTINYTKSLGIPTTENVQDIIKNYLTKSGSSNSQLSRA 305 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 + +V + + + + ++ T + + + Sbjct: 306 IFTQSLVTQIDVTNTLSSIGSLDKVGLKFPKNAYCLGDKNRNQHQWFTREEQDKFSTFVN 365 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 + + + L+ + K ++E LS+ +E+++ Sbjct: 366 SLEAIGSTFAGSGLLAAADKMLKETS-------RTQKLGEETKRVLLVLSDGNDELRADD 418 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + + + Y+E S + I+ + + Sbjct: 419 TGVPFTNYSRLTEDLILGYQEEIFTSPSEQKSFHDITYYGRRIYSGKSDIILGNRRTPLS 478 Query: 327 NTLQICEYMRNA--------GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 LQ+C +R+ I V + D + C G +++ D LL Sbjct: 479 KDLQMCNIIRDKLNKLNDDKNTSIVFVEFGYKSKSADAWKHCVG-DGNYYSAKDKESLLN 537 Query: 379 SFDK 382 SF + Sbjct: 538 SFKQ 541 >gi|229838599|ref|ZP_04458758.1| putative membrane protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229895667|ref|ZP_04510838.1| putative membrane protein [Yersinia pestis Pestoides A] gi|229899165|ref|ZP_04514308.1| putative membrane protein [Yersinia pestis biovar Orientalis str. India 195] gi|229901807|ref|ZP_04516929.1| putative membrane protein [Yersinia pestis Nepal516] gi|229681736|gb|EEO77830.1| putative membrane protein [Yersinia pestis Nepal516] gi|229687567|gb|EEO79640.1| putative membrane protein [Yersinia pestis biovar Orientalis str. India 195] gi|229694965|gb|EEO85012.1| putative membrane protein [Yersinia pestis biovar Orientalis str. PEXU2] gi|229701473|gb|EEO89501.1| putative membrane protein [Yersinia pestis Pestoides A] Length = 437 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 55/163 (33%), Gaps = 13/163 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYRELYNE 291 N++ + +AY+ + + + + + ++P T + + ++ Sbjct: 93 NITDTLSVVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKH 152 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 +I I+DG+ + + L + G+ I ++ + Sbjct: 153 LNREQVNR--------IILISDGQANTGPTSISEL--SDLARMAAKKGIAITTIGLGQD- 201 Query: 352 EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +DL+ S G V +S +L ++F K + + Sbjct: 202 YNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQ 244 >gi|161086902|ref|NP_001104316.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform d [Mus musculus] gi|1905823|gb|AAB50141.1| voltage-gated calcium channel alpha2/delta subunit, alpha2d isoform [Mus musculus] gi|148671298|gb|EDL03245.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_e [Mus musculus] Length = 1079 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 29/334 (8%), Positives = 91/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPERNESEPGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S E+ + + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKRNRDEDPTLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG +Y ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|6753234|ref|NP_033914.1| voltage-dependent calcium channel subunit alpha-2/delta-1 isoform e [Mus musculus] gi|1905825|gb|AAB50142.1| voltage-gated calcium channel alpha2/delta subunit, alpha2e isoform [Mus musculus] gi|148671296|gb|EDL03243.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_c [Mus musculus] Length = 1084 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 29/334 (8%), Positives = 91/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPERNESEPGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S E+ + + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKRNRDEDPTLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG +Y ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|320014437|gb|ADV98008.1| putative membrane protein [Yersinia pestis biovar Medievalis str. Harbin 35] Length = 437 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 55/163 (33%), Gaps = 13/163 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYRELYNE 291 N++ + +AY+ + + + + + ++P T + + ++ Sbjct: 93 NITDTLSVVAYDNHAEVIIPATKVTDKPALIASIQQHIHPRGMTALFAGVSMGIGQVDKH 152 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 +I I+DG+ + + L + G+ I ++ + Sbjct: 153 LNREQVNR--------IILISDGQANTGPTSISEL--SDLARMAAKKGIAITTIGLGQD- 201 Query: 352 EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +DL+ S G V +S +L ++F K + + Sbjct: 202 YNEDLMTAIAGYSDGNHTFVANSADLEKAFTKEFQDVMSVVAQ 244 >gi|306820467|ref|ZP_07454103.1| D-amino acid dehydrogenase large subunit [Eubacterium yurii subsp. margaretiae ATCC 43715] gi|304551542|gb|EFM39497.1| D-amino acid dehydrogenase large subunit [Eubacterium yurii subsp. margaretiae ATCC 43715] Length = 538 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 24/224 (10%), Positives = 66/224 (29%), Gaps = 22/224 (9%) Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 + K+ S +++ S ++ + K + Sbjct: 173 KKDMKDPFTGEKVNENKQVNVEIVLDASGSMAKQINGQSMMNIAKNSITEVLKHLPKNAK 232 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPL-----SNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 + G + L + N + + L+ + T+ ++ + Sbjct: 233 VGLRVFGHKGNNTDSGKTESCSANELIHPIETLNTSAISKALSSVEATGWTSIADSIKNG 292 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG--MKI 342 +L + +TDG + + ++ + ++N+G + + Sbjct: 293 GEDLS--------KFKEEGAVNILYIVTDGIETCGG------DPIEAAQTLKNSGTNVVL 338 Query: 343 YSVAVSAPPEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + + +L+K ++ G + ND+ L KIT+ Sbjct: 339 GIIGFNVNATQDAVLKKIAEAGGGHYAIANDAGTLTSELYKITE 382 >gi|301625572|ref|XP_002941978.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like, partial [Xenopus (Silurana) tropicalis] Length = 476 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 21/134 (15%), Positives = 53/134 (39%), Gaps = 11/134 (8%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N+ + K ++K++ T+ A+ A + L N + ST ++F++DGE Sbjct: 48 DNIIKAKQFVSKISARGGTDINKALLAAVKMLKNTSRNKLLPKIST---SIILFLSDGEP 104 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG----QFFAVND 372 + N + + +Y + + L K +G + + +D Sbjct: 105 TSGVTNHNEIINN--VKKANERQTTLYCLGFG-NDVDFNFLEKMALENGGLARRIYEDSD 161 Query: 373 SR-ELLESFDKITD 385 + +L ++++ + Sbjct: 162 AALQLQGFYNEVAN 175 >gi|301611663|ref|XP_002935353.1| PREDICTED: LOW QUALITY PROTEIN: anthrax toxin receptor 1-like [Xenopus (Silurana) tropicalis] Length = 565 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 28/144 (19%), Positives = 50/144 (34%), Gaps = 16/144 (11%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G+ L+ + +++ L +L P +T + + A ++Y+E + T Sbjct: 84 STRGSTLMRLTEDREQIRQGLEELRKVLPGGDTYMHEGIERASEQIYHESIKGYRT---- 139 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +I +TDGE + R G ++Y V V + L + D Sbjct: 140 --ASVIIALTDGELHEDLF----YYAEREANRSRELGAQVYCVGV--KDFNETQLARIAD 191 Query: 363 SSGQFFAVN-DSRELLESFDKITD 385 S F VN L + I Sbjct: 192 SKDHVFPVNGGFEALQDIIGSILK 215 >gi|195941051|ref|ZP_03086433.1| von Willebrand factor, type A [Escherichia coli O157:H7 str. EC4024] Length = 325 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 71/204 (34%), Gaps = 35/204 (17%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + + S A ++ + +S V + + RIG + + Sbjct: 101 ILDVSGSMEKNDVAGGLTRLQAVQQSVKKFVAARKS---------DRIGLVIFANSAWPF 151 Query: 251 QCTPLSNNLNEVKSRLNKLNPYEN---TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P+S + +++R+++L P T A+ + L + + K Sbjct: 152 A--PVSEDKQALETRISQLTPGMAGQQTAIGDALGVTVKLLDSTGD--------KEASKL 201 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS-APPEGQD-----LLRKCT 361 I +TDG ++ + L + + ++++++A G D LL+ Sbjct: 202 AILLTDGNDTA-----SQLTPRLAAQLAVSHHVQLHTIAFGDVNSSGDDKVDLNLLQDLA 256 Query: 362 -DSSGQFFAVNDS-RELLESFDKI 383 + G+ + +S L + +I Sbjct: 257 RMTGGRSWTAENSGASLDAVWKEI 280 >gi|113970537|ref|YP_734330.1| vault protein inter-alpha-trypsin subunit [Shewanella sp. MR-4] gi|113885221|gb|ABI39273.1| Vault protein inter-alpha-trypsin domain protein [Shewanella sp. MR-4] Length = 759 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 28/163 (17%), Positives = 60/163 (36%), Gaps = 12/163 (7%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 ++ I + + + NL + +N+L T A++ A Sbjct: 410 PQDSFNIIEFNSDVSLLSSTPLPATATNLAMARQFVNRLQADGGTEMAQALNSAL----- 464 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 +++ + G + + VIF+TDG SA + + + ++++V + + Sbjct: 465 PRQAFNTASGEDKSLRQVIFMTDGSVGNESALFELIR-----NQIGDN--RLFTVGIGSA 517 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 P + R G F + D E+ + K+ KIQ + Sbjct: 518 PNSHFMQRAAELGRGTFTYIGDVDEVEQKISKLLAKIQYPVLT 560 >gi|330829742|ref|YP_004392694.1| von Willebrand factor type A domain-containing protein [Aeromonas veronii B565] gi|328804878|gb|AEB50077.1| von Willebrand factor type A domain protein [Aeromonas veronii B565] Length = 330 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 55/152 (36%), Gaps = 41/152 (26%) Query: 252 CTPLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 +PL+ + + + ++L+ T A+ A + + ++ + Sbjct: 135 LSPLTQEIPALLTLSDELDFDLVGRTTALGEAILLARQHGDPGRPTA------------L 182 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 + +TDG N+ + + LQ + G++IY++ V A P+ Sbjct: 183 LLVTDGRNTAGN-----ADPLQEAKLAAAQGIRIYTLGVGADPDTFIQPYDEAGSGQADP 237 Query: 353 ----GQDLLRKCTDSS-GQFFAVNDSRELLES 379 + LL++ + G++F +L Sbjct: 238 SSELDEPLLKELAQTGQGRYFRARTQSDLDTI 269 >gi|163848654|ref|YP_001636698.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526590|ref|YP_002571061.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669943|gb|ABY36309.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450469|gb|ACM54735.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 947 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 56/397 (14%), Positives = 109/397 (27%), Gaps = 77/397 (19%) Query: 17 IDLAHIMYIRNQMQSALD-------AAVL----------SGCASIVSDRTIKDPTTKKDQ 59 ++ + R Q+++A D AA L + R + T Sbjct: 261 VEGSGFRRYRVQVEAASDGRVQNNEAAALIRVQGPPRILLVAQTAADARPLMTALTATGI 320 Query: 60 TSTIFKKQIKK-HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + + + L S + + + P + I E+ Sbjct: 321 IAELVSPEAAPRTLADLSTYDALVLVNTPARMLPVGLMQAIPGYVRDLGRGLLMIGGEDS 380 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 F G +L + ++I V+D S SM Sbjct: 381 FGVGGYGRTAVEEALPVYMDVRNRELRPDLAIVFVIDKSGSM------------------ 422 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 + T+ S+ RKID+ ++ + Sbjct: 423 -DACHCADPDRGAPITSSSE---------RKIDIAKDAIVQAAALLGPQDTVG------- 465 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 + ++ + +V ++ + P TN + A L Sbjct: 466 -VVTFDGAASATFPATRGATVEQVMDAVSGVEPRGPTNIRAGLLRAEEMLQQV------- 517 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 K +I +TDG SG + L + +R G+ + VA + L + Sbjct: 518 ---DARIKHMILLTDGWGSGG-------DQLDLAARLREQGITLTVVAAGSGSAAY-LKQ 566 Query: 359 KCTDSSGQFFAVNDSRELLESF-----DKITDKIQEQ 390 + G+++ D E+ + F I + I EQ Sbjct: 567 LAAEGGGRYYPAADMAEVPQIFVQETITAIGNYIVEQ 603 >gi|73958316|ref|XP_848776.1| PREDICTED: similar to integrin, alpha D precursor [Canis familiaris] Length = 1166 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 28/137 (20%), Positives = 50/137 (36%), Gaps = 17/137 (12%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 S N + + +L T T + EL++ K + KK +I ITDG+ Sbjct: 218 SWNPLSLVDPIVQLK--GLTYTATGIRKVVEELFHSKNGAR-----KSAKKILIVITDGQ 270 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQDLLRKCTDSS--GQFFAV 370 Y++ L + AG+ Y++ V P + L F V Sbjct: 271 -----KYKDPLEYSDVIPQAERAGIIRYAIGVGDAFWKPSAKQELDNIGSEPAQDHVFRV 325 Query: 371 NDSRELLESFDKITDKI 387 ++ L +++ +KI Sbjct: 326 DNFAALSSIQEQLQEKI 342 >gi|307254358|ref|ZP_07536196.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|307258816|ref|ZP_07540548.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 11 str. 56153] gi|306862657|gb|EFM94613.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 9 str. CVJ13261] gi|306867167|gb|EFM99023.1| Tight adherence protein G [Actinobacillus pleuropneumoniae serovar 11 str. 56153] Length = 538 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 46/370 (12%), Positives = 101/370 (27%), Gaps = 67/370 (18%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVS----------------DRTIKDPTTKK 57 +++ A I+ + ++ +L+ AVLS A + + + + Sbjct: 36 FVSLESAGIIQDQARLSDSLEQAVLSLTAENNNGRKDNDYKLSGSSNKENDSFDISSEVG 95 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + S + +K L Q + + N I +N T K + I ++ Sbjct: 96 KRDSQMVTTFVKAFLPQTNDDKMNL--IPICKTVNNTSGKGHTSSSEVTCTVSGTIKHKS 153 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENL---AISICMVLDVSRSMEDLYLQKHNDNNN- 173 F + + + + +N I + +V D+S SM+D + Sbjct: 154 WFPLKVGTVEVIPQQVDVASKSKAFKKNTFNIPIDLMVVADLSGSMKDGIKGEKLKGGTN 213 Query: 174 ----------MTSNKYLLPPPPKKSFWSKNT---------TKSKYAPAPAPANRKIDVLI 214 L + K P + + Sbjct: 214 SKIYILREVLKELADKSLFTQEANEYNRIGITAFAMGAEHPKENKCVLPFVLQNNLHEMS 273 Query: 215 ES------------------AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 +S N V + I I S Sbjct: 274 KSKIKQYLTSSHKSLRRTEFVDNFVALLDTEATLNSIGKPNYDIIFPKSSICLEGLKKAS 333 Query: 257 N------NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + ++R++ L T + A ++ +EK S + K+ ++ Sbjct: 334 QFWYTKEEKEKFRNRVDSLKANGGTLASSGLLTASNQMLSEKSRSEEL--NQETKRVILV 391 Query: 311 ITDGENSGAS 320 ++DG + ++ Sbjct: 392 LSDGNDDMSN 401 Score = 43.0 bits (99), Expect = 0.081, Method: Composition-based stats. Identities = 14/97 (14%), Positives = 35/97 (36%), Gaps = 1/97 (1%) Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 +L + S+ + ++T+ + ++ ++ ++ K+ V Sbjct: 430 EDLSSTTTSNKAYYNRHSTFNYNTYLTNKTKDISRKGMCSIIQEKLNTLNKDKNTKLVFV 489 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 E D + C + G +F ++ LL SF + Sbjct: 490 EFGYRSESADAWKTCVGN-GNYFYADNRESLLNSFKQ 525 >gi|46127789|ref|XP_388448.1| hypothetical protein FG08272.1 [Gibberella zeae PH-1] Length = 774 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 40/277 (14%), Positives = 83/277 (29%), Gaps = 25/277 (9%) Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 T LF + S T + + + + L Sbjct: 14 TNTPQTVPLFRSLVSGSKSTK-EEPIKSEDAQEPIAIISDKDATIRLEPVPSRNGLLIKI 72 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 + S P + + S PAP P + + S +L + I Sbjct: 73 ETPKEPSVNIPHVPC--DIVLVIDVSGSMGQPAPVPGEDQ-ESAGLSVLDLTKHAARTII 129 Query: 230 EKKNLSVRIGTIAYNIGIVG-NQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 E N + R+ + + P++ +N + + P + TN + M A ++ Sbjct: 130 ESMNENDRLSIVTFASKAKVLQPLLPMNQDNKTRAIKNVKSMEPRDATNLWQGMLEAIKQ 189 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG---MKIYS 344 + S+ ++ +TDG + + + +RN G I++ Sbjct: 190 FNTD--------ESSPNVPAIMILTDGMPNHMNP------AVGFVPKIRNMGPLPASIHT 235 Query: 345 VAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 DLL+ + +G + + D+ + F Sbjct: 236 FGFGY-SLKSDLLKSIAEIGNGNYAFIPDAGMIGTVF 271 >gi|197102272|ref|NP_001124862.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Pongo abelii] gi|55726163|emb|CAH89855.1| hypothetical protein [Pongo abelii] Length = 1079 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 30/334 (8%), Positives = 93/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNDSEPGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + E+ ++ + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG A N ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDGGEERAQEIFTKYN--------KDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|326504464|dbj|BAJ91064.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 720 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 18/138 (13%), Positives = 42/138 (30%), Gaps = 21/138 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N + ++ L + TN + L + + + ++DG+ + Sbjct: 125 NKARLNHLVDGLQVIDPTNIRDGLEAGLSVLAGRRITGGRVAS-------IFLLSDGDEN 177 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSRE 375 A + + + +Y+ +L + G F V+D Sbjct: 178 RGHATTVDV-----------SDVPVYTFGFGTD-YDPKVLDEIARRSKGGTFNFVDDEEN 225 Query: 376 LLESFDKITDKIQEQSVR 393 + E F +I + V+ Sbjct: 226 MTEPFSQILGGLLSIVVQ 243 >gi|225850253|ref|YP_002730487.1| putative von Willebrand factor type A domain protein [Persephonella marina EX-H1] gi|225645927|gb|ACO04113.1| putative von Willebrand factor type A domain protein [Persephonella marina EX-H1] Length = 304 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 26/143 (18%), Positives = 52/143 (36%), Gaps = 22/143 (15%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 + PL+++ + ++ + P T Y + A ++ + Sbjct: 131 PFRLMPLTSDRGALLRVISIIRPAMVDVGGTAMYDGLVEALNMFMKDRRN---------- 180 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 K +I +TDG + + + ++ G KIY++ VS+ L R + Sbjct: 181 -KIIILLTDG-----GDINSKYTLEDVVRFNQDIGAKIYTIGVSSGMNFYVLERLSEATG 234 Query: 365 GQFFAVND--SRELLESFDKITD 385 G+ F V + L FD+I Sbjct: 235 GKAFFVTKDYQKALRSVFDEINR 257 >gi|114614244|ref|XP_001160279.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1 isoform 2 [Pan troglodytes] gi|114614246|ref|XP_519175.2| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 1 isoform 3 [Pan troglodytes] Length = 1091 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 30/334 (8%), Positives = 93/334 (27%), Gaps = 25/334 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPEKNDSEPGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S + E+ ++ + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNREEDPSLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + ++ +V N +K +N + T+ Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSF 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ +L N S K ++ TDG A N ++ ++++ Sbjct: 340 AFEQLLNYNVSRA------NCNKIIMLFTDGGEERAQEIFTKYN--------KDKKVRVF 385 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 + +V + ++ ++ G ++ + + Sbjct: 386 TFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|78060312|ref|YP_366887.1| hypothetical protein Bcep18194_C7199 [Burkholderia sp. 383] gi|77964862|gb|ABB06243.1| hypothetical protein Bcep18194_C7199 [Burkholderia sp. 383] Length = 423 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 20/213 (9%), Positives = 60/213 (28%), Gaps = 6/213 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTI----KDPTTK 56 + + ++V F+ A+DL + R+++Q++ DA LS + S ++ D Sbjct: 25 IVGLSLAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSAISLSVAEADGIAA 84 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 F+ + + + + + + ++ + Sbjct: 85 GHLNFVFFQNKS-VQMSTNANVTFSDSLTDPFLTRSAVTTPSSIKYVQCTATLSNIAHWF 143 Query: 117 NLFLKGLIPSA-LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 L L + + ++ I +I + + + + + + + Sbjct: 144 IEVLNVLPGTKLANAAEVSASAIATVGGGQTTCAIPVFVCRATGSPSYNVGDWITSLSGS 203 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 S Y + + + Sbjct: 204 STTYGPGNFGWAALDGSTNEPTIASELSGNTCN 236 >gi|73538307|ref|YP_298674.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] gi|72121644|gb|AAZ63830.1| von Willebrand factor, type A [Ralstonia eutropha JMP134] Length = 353 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 22/229 (9%), Positives = 70/229 (30%), Gaps = 66/229 (28%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +I ++A +L+ + + VR+G +++ ++N ++ ++ Sbjct: 106 TRISAAQQAARDLIVGLPAS--------VRLGIVSFAGTATV--VLRPTSNRQDMLDAID 155 Query: 268 KLNPYENTNTYPAMHHAYRELYNE------------------------------------ 291 + T T + A L+ + Sbjct: 156 RFQLQRGTATGSGLIQALAVLFPDDGIDLEAILFADEPVFSTRRAVPLDEAAAADAVRKR 215 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 ++++ + + VI ++DG + + + G+++Y+V Sbjct: 216 EQATQSAQPGSYRHGAVILLSDGRRTVG------PDPVDAARMAAQRGVRVYTVGFGTLG 269 Query: 352 E-------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 + LR + ++F + +L + + +++ + Sbjct: 270 GNAPETSLSYYMQLDEPALRAVATITGAEYFQAGSAADLSQVYRQLSAR 318 >gi|297620568|ref|YP_003708705.1| hypothetical protein wcw_0325 [Waddlia chondrophila WSU 86-1044] gi|297375869|gb|ADI37699.1| putative membrane protein [Waddlia chondrophila WSU 86-1044] Length = 374 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 59/153 (38%), Gaps = 21/153 (13%) Query: 254 PLSNNLNEVKSRLNKLNPY-----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRL---- 304 PL+ + + +L+KL + T A++ + + + G+ + Sbjct: 166 PLTLDHQAIIDQLSKLQYTTDLEQDGTAIGYAIYKTANLIAATRHYAEELEGAGKPAYTI 225 Query: 305 -KKFVIFITDG----ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD---- 355 +I +TDG + L Y + G+K+Y + V ++ Sbjct: 226 KNSIMILVTDGLQAPNPLDQGKEFRNVELLDAAVYAKKLGVKVYIINVEPRIASEEFSAH 285 Query: 356 --LLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 L++K T+ + G+F+ V++S L + +I Sbjct: 286 RLLMKKITELTGGRFYMVDNSLNLSSIYSEIDQ 318 >gi|302865239|ref|YP_003833876.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|302568098|gb|ADL44300.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] Length = 429 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 69/197 (35%), Gaps = 30/197 (15%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-------N 258 +I V ++ +V+++ ++ L +R+ Y C + Sbjct: 59 GRSRISVAQQAFNEVVDAL----PDETQLGIRVLGATYRGKDKKQGCLDTQQIVPVGPVD 114 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + K+ + L P T A+ A ++L + ++ ITDGE++ Sbjct: 115 RTQAKAAVAGLRPTGFTPVGLALRSAAQDLGTGS-----------TARRIVLITDGEDT- 162 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + ++ + G ++ ++ ++ + + L + G + A + E Sbjct: 163 ----CAPPDPCEVARELAAQGTRLVVDTLGLAPDEKVRKQLLCIAGATGGTYTAAQSADE 218 Query: 376 LLESFDKITDKIQEQSV 392 L ++ D+ ++ Sbjct: 219 LTGRIKQLVDRARDTYT 235 >gi|326434435|gb|EGD80005.1| hypothetical protein PTSG_10281 [Salpingoeca sp. ATCC 50818] Length = 736 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 39/357 (10%), Positives = 93/357 (26%), Gaps = 51/357 (14%) Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYI--AESKAQYEIPTENLFLKGL 123 Q L G + + + + QY + + ++ K + Sbjct: 21 GQGAHGLAPGESHPSDTTEAEVC-GVAVDYIDTGTAQYNDLVVYETSGNVAFQDETFKRM 79 Query: 124 IPS--------ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + + +A+ +LDV R++ Y + Sbjct: 80 TDYMAQLLLSLDVVSTETIVVVDGFTPRPEVAVRAPNLLDVGRALRLRYQADGSAATASR 139 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV-------------- 221 + + K+ +A L Sbjct: 140 LATLAQACVDAGFDFVSVPNDIGSDNYVHVSVPKMTCGNAAADLLFILDGSGSVGSSNFQ 199 Query: 222 -----NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP--LSNNLNEVKSRLNKLN-PYE 273 + + RI + Y L++ +E+ ++ +N PY Sbjct: 200 TMLSFTRTVATFFDVSADTTRIAVMVYASYNYLIFDFNYILTHTKDELLDAISAINYPYG 259 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T A+ +A ++ S + + + ITDG + + + Sbjct: 260 GTRTGGALDYARTVMFTA--DRGVRPSSEGIPRVAMVITDG--------ASADDVAAPAQ 309 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 +R+ G+ +Y++ + ++ L + ++ E F + I Sbjct: 310 QLRDEGVTLYAIGI--AGANENELNEIASPPITSNVVFISTFSE----FGTLAAAIS 360 >gi|256784255|ref|ZP_05522686.1| secreted protein [Streptomyces lividans TK24] gi|289768140|ref|ZP_06527518.1| secreted protein [Streptomyces lividans TK24] gi|289698339|gb|EFD65768.1| secreted protein [Streptomyces lividans TK24] Length = 421 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 68/194 (35%), Gaps = 26/194 (13%) Query: 208 RKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 ++ ++ ++++ + + PL + E Sbjct: 60 TRMAAAKQAFNEVLDATPEEVQLGIRTLGADYPGDDRKTGCKDTAQLYPVGPL--DRTEA 117 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 K+ + L+P T PA+ A +L K ++ I+DGE++ Sbjct: 118 KTAVATLSPTGWTPIGPALLKAADDL-----------DGGDGSKRIVLISDGEDT----- 161 Query: 323 QNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLES 379 L+ ++ + G+ I ++ + + + L + + G + +V + EL + Sbjct: 162 CAPLDPCEVAREIAAKGIGLTIDTLGLVPNTKMRRQLSCIAEATGGTYTSVEHTDELTDK 221 Query: 380 FDKITDKIQEQSVR 393 +++ D+ + V Sbjct: 222 VNQLVDRAADPVVT 235 >gi|123233471|emb|CAM28080.1| collagen, type XII, alpha 1 [Homo sapiens] Length = 637 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 45/134 (33%), Gaps = 18/134 (13%) Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + + L NT T A++ ++ R +K + ITDG++ Sbjct: 2 SLLQAVANLPYKGGNTLTGMALNFIRQQ-----NFRTQAGMRPRARKIGVLITDGKSQDD 56 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELL 377 + + +++ G++++++ + + L+ + V D L Sbjct: 57 VEAPS--------KKLKDEGVELFAIGI--KNADEVELKMIATDPDDTHAYNVADFESLS 106 Query: 378 ESFDKITDKIQEQS 391 D +T + Sbjct: 107 RIVDDLTINLCNSV 120 >gi|156742365|ref|YP_001432494.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233693|gb|ABU58476.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 412 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 18/174 (10%), Positives = 60/174 (34%), Gaps = 23/174 (13%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K+ L E+ +++++ + + ++ + + + + +K++++ Sbjct: 58 KLAALKEATRRVIDTLTPQDI--------VSIVLFDDTVQTLVPATFATDRDALKAQVDA 109 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + T M EL + + + +TDG+ + + Sbjct: 110 IEEAGGTAMSGGMAAGIVELRKHHDPGRVSAM--------LLLTDGQT-----WGDEDRC 156 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFD 381 + + + +++ ++ + A + LL D + G + D ++ F Sbjct: 157 RALAQELARDHVRVTALGLGAEWN-EKLLDDIADATGGLSDYIADPSQITTFFQ 209 >gi|126282074|ref|XP_001368568.1| PREDICTED: similar to Coch-5B2 gene product [Monodelphis domestica] Length = 549 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 21/155 (13%), Positives = 42/155 (27%), Gaps = 20/155 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESS 295 +I + + V + + + T T A+ R ++ Sbjct: 404 KIAAVQFTYDQRPEFSFTDYTTKENVLAVIRNIRYMSGGTATGDAISFTVRNVFGPIRDG 463 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 N F++ +TDG++ G+ +YS+ V+ P Sbjct: 464 PNK-------NFLVIVTDGQSYDDVRAPAAAAHKA--------GITVYSIGVAWAPLDD- 507 Query: 356 LLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQ 388 L+ F + L + I I Sbjct: 508 -LKDMASEPKETHAFFTREFSGLEQIATDIIRGIC 541 >gi|283779907|ref|YP_003370662.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283438360|gb|ADB16802.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 1040 Score = 60.3 bits (144), Expect = 5e-07, Method: Composition-based stats. Identities = 23/100 (23%), Positives = 39/100 (39%), Gaps = 10/100 (10%) Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA-PPEGQDLL 357 + K +I I+DG+ S S I + AG+KI +VAV P G L Sbjct: 550 KPNPASVKHMIIISDGDPSPPSG--------TILNQYKQAGIKITTVAVGTHGPAGSTPL 601 Query: 358 RKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + + G+++ + + L F ++ V P Sbjct: 602 QNIANATGGKYYVATNPKALPRIFQIEARRVARPLVYEDP 641 >gi|118399120|ref|XP_001031886.1| hypothetical protein TTHERM_00721540 [Tetrahymena thermophila] gi|89286221|gb|EAR84223.1| hypothetical protein TTHERM_00721540 [Tetrahymena thermophila SB210] Length = 994 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 60/164 (36%), Gaps = 14/164 (8%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 K +I + +N I + L + L +V+ ++ +N T + Sbjct: 78 KTQPHSKISLMTFNTSIDHVENLHLKS-LKQVEQFISNINANGGTIFHITFDKLRDI--- 133 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY-MRN--AGMKIYSVAV 347 + + ++++TDG+ N ++ + ++ ++++++ + Sbjct: 134 -----CQKFTNQNEELVIVYLTDGQVQSGQDSTNLKDSFIFLQQVLKKFVNNVEVHALGM 188 Query: 348 SAPPEGQDLLRKCT--DSSGQFFAVNDSRELLESFDKITDKIQE 389 + L + + + + + +S E+ +F I D I + Sbjct: 189 GTSHDPVILDKIISLQTTQSTYQFIKESSEIEGAFKNIVDIIGQ 232 >gi|194288834|ref|YP_002004741.1| flp pilus assembly protein [Cupriavidus taiwanensis LMG 19424] gi|193222669|emb|CAQ68672.1| putative flp pilus assembly protein [Cupriavidus taiwanensis LMG 19424] Length = 418 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 22/260 (8%), Positives = 78/260 (30%), Gaps = 4/260 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDR----TIKDPTTK 56 + + ++V F+ A+DL + ++++Q+++DA L+ + + T Sbjct: 24 IVGLSLAVLIGFVGLALDLGKLYVTKSELQNSVDACALAAARDVTGATPLLVSEAAGLTT 83 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 + + +F+ + + + + + D + + + ++ Sbjct: 84 GTRNAALFQGKAVEMFENLNVSYSDTPDNTFYTKDKVPYSLDKIKYVKCTAERTGIAQWF 143 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 L L + ++ + + +S A +I + + ++ + + + + Sbjct: 144 IQVLNTLPGMNIQPSTVNAMAVATTTSAQTACAIPVYICTPQTANPVRTAYNRGDWIKSK 203 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 ++ P + + + G+ V + Sbjct: 204 DEKDPYGPGSFGWADLTPPGGGASELADLLAGSGQCDLSVVGSKVGQPGSISSLIPAWNT 263 Query: 237 RIGTIAYNIGIVGNQCTPLS 256 R G + + + Sbjct: 264 RFGIYTGSYKGPQDGTPDFT 283 >gi|308050057|ref|YP_003913623.1| hypothetical protein Fbal_2347 [Ferrimonas balearica DSM 9799] gi|307632247|gb|ADN76549.1| conserved hypothetical protein [Ferrimonas balearica DSM 9799] Length = 457 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 59/191 (30%), Gaps = 8/191 (4%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTK--KD 58 M I + A+D H++ + ++Q+A+DAA LSG +I + + Sbjct: 21 MITIAMFAILAMGALALDGGHLLLNKARLQNAVDAAALSGAVAIQKEYDYLRARQEGLVT 80 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA------ESKAQYE 112 TS + + + + S N QI + + + Sbjct: 81 FTSALGAQDFAELNDRVSLSVLNFASDEVSPQITVEFSERPDPFVPVLTPGAQYVRVTVS 140 Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 N F ++ ++ G + + + M++ ED + N Sbjct: 141 DVPLNNFFAQVMGVDKRVSAVAVAGPSTSTPQCSTDLLPMMVCAEDLGEDNFGYPLNKMM 200 Query: 173 NMTSNKYLLPP 183 M + P Sbjct: 201 AMKISSQQNTP 211 >gi|109088171|ref|XP_001107718.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Macaca mulatta] Length = 946 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 28/223 (12%), Positives = 78/223 (34%), Gaps = 25/223 (11%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 L P PK + + + S + + ++ + +L ++ + N +VR Sbjct: 304 LDPIPKNILFVIDVSGSMWG---VKMKQTVEAMKTILDDLRAEDHFSVIDF-NQNVRTWR 359 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 +++ K + K+ P TN A+ A L Sbjct: 360 NDLISATKTQ--------VSDAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPN 411 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 S +I ++DG+ + + + + E +++ + ++S+ + D L++ Sbjct: 412 S---VSLIILVSDGDPTVGELKLSKIQ-KNVKENIQD-NISLFSLGMGFDV-DYDFLKRL 465 Query: 361 TDSSGQ------FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + + + S +L + +++++ + P+ Sbjct: 466 -SNENRGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYPH 507 >gi|329940639|ref|ZP_08289920.1| putative secreted protein [Streptomyces griseoaurantiacus M045] gi|329300700|gb|EGG44597.1| putative secreted protein [Streptomyces griseoaurantiacus M045] Length = 421 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 25/194 (12%), Positives = 66/194 (34%), Gaps = 26/194 (13%) Query: 208 RKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 ++ ++ ++++ + + PL + E Sbjct: 60 SRMAAAKQAFNEVLDATPEEVRLGIRTLGADYPGDNRKEGCKDTAQLYPVGPL--DRTEA 117 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 K+ + L+P T PA+ A +L K ++ I+DGE++ Sbjct: 118 KTAVATLSPTGWTPIGPALLKAADDL-----------DGGNGSKRIVLISDGEDT----- 161 Query: 323 QNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLES 379 L+ ++ + G+ I ++ + + L + + G + +V +L + Sbjct: 162 CAPLDPCEVAREIAAKGIGLTIDTLGLVPDVKLNRQLSCIAEATGGTYTSVEHRDQLTDK 221 Query: 380 FDKITDKIQEQSVR 393 +++ D+ + V Sbjct: 222 VNQLVDRAADPVVT 235 >gi|320010752|gb|ADW05602.1| Protein of unknown function DUF3520 [Streptomyces flavogriseus ATCC 33331] Length = 528 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 30/274 (10%), Positives = 76/274 (27%), Gaps = 25/274 (9%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 A Y L L + + + +D +RS + Sbjct: 93 TAGYGYARRTLGDGQLPAADTVRPEEFVNSFRQGYERPKGNGFAVSVDGARSDAAGWSLV 152 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 ++ PP +F ++D+ S G L + ++ Sbjct: 153 RVGLATRAASNTGERPPAALTFVVDI-------SGSMAEPGRLDLAKTSLGILADELRDD 205 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + + ++ L N +++ + ++ P ++TN + Y Sbjct: 206 DS--------VSLVTFSEEAETRLPMTRLRGNRTKLRDAIEEMEPADSTNVAAGVERGYE 257 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 E V+ ++D + + ++ + + G+ ++ V Sbjct: 258 EAVEGHRKGATNR--------VVLLSDALANTGETEAD-AILERVGDARQEYGITLFGVG 308 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 V + + + R G + D + + F Sbjct: 309 VGSDYGDELMERLTNKGDGNTTYIADEAQARKVF 342 >gi|219847682|ref|YP_002462115.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541941|gb|ACL23679.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 418 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 20/156 (12%), Positives = 46/156 (29%), Gaps = 13/156 (8%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + ++ + + + + + + + T M A EL + Sbjct: 82 IVIFDDTVQTLIPATPVGDRSALLAAVETITEAGGTAMSLGMQAAQTELQKHLGPDRISR 141 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + +TDG+ G + + AG++I ++ + Q L Sbjct: 142 M--------LLLTDGQTWGDEPIC-----RDLARTLGQAGVRITALGLGTEWNEQLLDDI 188 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 S G + D ++ F + + Q A Sbjct: 189 AAASDGYSDYIADPAQIETFFQQAVKEAQAVVATDA 224 >gi|218961690|ref|YP_001741465.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] gi|167730347|emb|CAO81259.1| BatA protein (fragment) [Candidatus Cloacamonas acidaminovorans] Length = 270 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 65/205 (31%), Gaps = 42/205 (20%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--- 272 + N +++ ++ PL+ + + + L+KL Sbjct: 43 APKNRLSAAVSVAKDFVKRRPNDRFGLVAFSEYALTQVPLTFDHLAMLNSLDKLKVNEEA 102 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T + A L ST K +I ITDG ++ ++ L Sbjct: 103 SATAIGMGLAKAVARL----------KNSTAKSKVIILITDGVSNTG-----EIDPLTAA 147 Query: 333 EYMRNAGMKIYSVAVSAP---------------------PEGQDLLRKCTDS--SGQFFA 369 + G+K+Y + V + + L K ++ +G+ Sbjct: 148 GMAKELGIKVYPIGVGSKGLVPFPYSDPIFGTRYINTYIDLDMETLNKIAETTGTGKAAL 207 Query: 370 VNDSRELLESFDKITDKIQEQSVRI 394 D++ L + ++I D++++ Sbjct: 208 ATDAKGLADIMNEI-DRLEKTLFTT 231 >gi|153840568|ref|ZP_01993235.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] gi|149745769|gb|EDM56899.1| conserved hypothetical protein [Vibrio parahaemolyticus AQ3810] Length = 187 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 53/166 (31%), Gaps = 2/166 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK-DQ 59 + ++++ + + IDL H + + ++Q+A+D A L+G + Sbjct: 16 LISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVDQAEAAVIAT 75 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 S+I + L + +N +Y + + + Sbjct: 76 LSSIASESGNTELSFTDGNTSVTFSHDMQTFVNAASFTPPTGEYDIYVRVAVTDMGISQY 135 Query: 120 LKGLIPS-ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY 164 L + + S + + I M D + ++ED + Sbjct: 136 LSAVFGIVKNVSASAVAGRSAAIAYTCNLTPIAMCGDPNGTVEDAW 181 >gi|55958063|emb|CAI12958.1| inter-alpha (globulin) inhibitor H2 [Homo sapiens] Length = 935 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 27/223 (12%), Positives = 76/223 (34%), Gaps = 25/223 (11%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 L P PK + + + S + + ++ + +L ++ + N ++R Sbjct: 293 LDPIPKNILFVIDVSGSMWG---VKMKQTVEAMKTILDDLRAEDHFSVIDF-NQNIRTWR 348 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + + K + K+ P TN A+ A L Sbjct: 349 NDLISATKTQ--------VADAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPN 400 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 S +I ++DG+ + + + + E +++ + ++S+ + D L++ Sbjct: 401 S---VSLIILVSDGDPTVGELKLSKIQ-KNVKENIQD-NISLFSLGMGFDV-DYDFLKRL 454 Query: 361 TDSSGQ------FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + + S +L + +++++ + P+ Sbjct: 455 -SNENHGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYPH 496 >gi|70778918|ref|NP_002207.2| inter-alpha-trypsin inhibitor heavy chain H2 [Homo sapiens] gi|229462889|sp|P19823|ITIH2_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; AltName: Full=Inter-alpha-trypsin inhibitor complex component II; AltName: Full=Serum-derived hyaluronan-associated protein; Short=SHAP; Flags: Precursor gi|55958062|emb|CAI12957.1| inter-alpha (globulin) inhibitor H2 [Homo sapiens] Length = 946 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 27/223 (12%), Positives = 76/223 (34%), Gaps = 25/223 (11%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 L P PK + + + S + + ++ + +L ++ + N ++R Sbjct: 304 LDPIPKNILFVIDVSGSMWG---VKMKQTVEAMKTILDDLRAEDHFSVIDF-NQNIRTWR 359 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + + K + K+ P TN A+ A L Sbjct: 360 NDLISATKTQ--------VADAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPN 411 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 S +I ++DG+ + + + + E +++ + ++S+ + D L++ Sbjct: 412 S---VSLIILVSDGDPTVGELKLSKIQ-KNVKENIQD-NISLFSLGMGFDV-DYDFLKRL 465 Query: 361 TDSSGQ------FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + + S +L + +++++ + P+ Sbjct: 466 -SNENHGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYPH 507 >gi|119606784|gb|EAW86378.1| inter-alpha (globulin) inhibitor H2, isoform CRA_b [Homo sapiens] gi|124376332|gb|AAI32686.1| Inter-alpha (globulin) inhibitor H2 [Homo sapiens] gi|158256194|dbj|BAF84068.1| unnamed protein product [Homo sapiens] Length = 946 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 27/223 (12%), Positives = 76/223 (34%), Gaps = 25/223 (11%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 L P PK + + + S + + ++ + +L ++ + N ++R Sbjct: 304 LDPIPKNILFVIDVSGSMWG---VKMKQTVEAMKTILDDLRAEDHFSVIDF-NQNIRTWR 359 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + + K + K+ P TN A+ A L Sbjct: 360 NDLISATKTQ--------VADAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPN 411 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 S +I ++DG+ + + + + E +++ + ++S+ + D L++ Sbjct: 412 S---VSLIILVSDGDPTVGELKLSKIQ-KNVKENIQD-NISLFSLGMGFDV-DYDFLKRL 465 Query: 361 TDSSGQ------FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + + S +L + +++++ + P+ Sbjct: 466 -SNENHGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYPH 507 >gi|156371145|ref|XP_001628626.1| predicted protein [Nematostella vectensis] gi|156215607|gb|EDO36563.1| predicted protein [Nematostella vectensis] Length = 484 Score = 60.3 bits (144), Expect = 6e-07, Method: Composition-based stats. Identities = 28/178 (15%), Positives = 59/178 (33%), Gaps = 16/178 (8%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 L+ S + +G I ++ +V ++ + Sbjct: 290 AKRFVKALIGSFK-----VSQKGTHVGIIRFSTRAKVMFTFTEHFTHEDVNYAIDDIEYT 344 Query: 273 -ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T A+ A EL++ + S + K + +TDG + + Sbjct: 345 EGGTKTELALRLARTELFS---KQGGSRTSPLIFKLFVLMTDGR------SEYFHAVARQ 395 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + ++ +G+ + +V + Q L S V R+L+ ++I DK+ E Sbjct: 396 AKMLKRSGVHVMAVGIG-KYTNQRELEVIASSKSDVIGVVSFRDLMIRMNEIKDKLCE 452 >gi|268325023|emb|CBH38611.1| hypothetical protein, containing PKD domain [uncultured archaeon] Length = 1152 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 28/198 (14%), Positives = 67/198 (33%), Gaps = 10/198 (5%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQE--KKNLSVRIGTIAYNI---GIVGNQCT 253 I + SA +VN I+ I + + R + N Sbjct: 314 VIDTTGSMGDDIANVKASASTIVNEIEAIIPDYQVAVVDYRDFPVDPYGGDGDYPFNDVL 373 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 P S + + S + L + +++ A + G + K +I + D Sbjct: 374 PFSTDKAAIISAIQGLTLGWGGDWEESVYSALMHSIDAGSLG-GWRGEDQALKAIILMGD 432 Query: 314 GENSGAS-AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 L ++ I + + + IY++ + P L + + G+ F + Sbjct: 433 APPHDPEPFTGYILTSVAIAAELAD-PVHIYTIQIGGPVGKFAEL--ASQTGGEVFTAEN 489 Query: 373 SRELLESFDKITDKIQEQ 390 + E++++ + ++I ++ Sbjct: 490 AEEVVDAILEAIEEITKR 507 >gi|254450361|ref|ZP_05063798.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|254450938|ref|ZP_05064375.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198264767|gb|EDY89037.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198265344|gb|EDY89614.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 75 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 20/74 (27%), Positives = 31/74 (41%), Gaps = 1/74 (1%) Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + IC R G+ IY+VA AP GQ L+ C S F V + ++ + Sbjct: 2 NGTEADARLSDICAAARAQGVVIYTVAFEAPSGGQSALQDCASSPSHHFDV-NGTDISSA 60 Query: 380 FDKITDKIQEQSVR 393 F I I+ + Sbjct: 61 FSAIASDIRALKLT 74 >gi|315501784|ref|YP_004080671.1| von willebrand factor type a [Micromonospora sp. L5] gi|315408403|gb|ADU06520.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 430 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 69/197 (35%), Gaps = 30/197 (15%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-------N 258 +I V ++ +V+++ ++ L +R+ Y C + Sbjct: 59 GRSRISVAQQAFNEVVDAL----PDETQLGIRVLGATYRGKDKKQGCLDTQQIVPVGPVD 114 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + K+ + L P T A+ A ++L + ++ ITDGE++ Sbjct: 115 RTQAKAAVAGLRPTGFTPVGLALRSAAQDLGTGS-----------TARRIVLITDGEDT- 162 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + ++ + G ++ ++ ++ + + L + G + A + E Sbjct: 163 ----CAPPDPCEVARELAAQGTRLVVDTLGLAPDEKVRKQLLCIAGATGGTYTAAQSADE 218 Query: 376 LLESFDKITDKIQEQSV 392 L ++ D+ ++ Sbjct: 219 LTGRIKQLVDRARDTYT 235 >gi|160874992|ref|YP_001554308.1| cell wall anchor domain-containing protein [Shewanella baltica OS195] gi|160860514|gb|ABX49048.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS195] gi|315267224|gb|ADT94077.1| Vault protein inter-alpha-trypsin domain-containing protein [Shewanella baltica OS678] Length = 771 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 37/259 (14%), Positives = 77/259 (29%), Gaps = 22/259 (8%) Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK---SKYA 200 + + V + + + + ++K S K S Sbjct: 329 QQGTSPMAWVFNQQGKTHKP-DGDNLSQDTLETSKANGVNEDNYSLVMVLPPKVEKSTQP 387 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKA------IQEKKNLSVRIGTIAYNIGIVGNQCTP 254 P ID AG+ + + A + ++ I + Sbjct: 388 SLPRELILVIDTSGSMAGDSIVQAKNALLYALKGLKPEDSFNIIEFNSSLSQFSATSLPA 447 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 S+NL+ + +++L T A+ A K + + + + VIF+TDG Sbjct: 448 TSSNLSRARQFVSRLQADGGTEMALALDAAL-----PKSLGSASPDAVQPLRQVIFMTDG 502 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 A + + ++++V + + P + R G F + Sbjct: 503 SVGNEQALFDLIRYQIGES-------RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGKVD 555 Query: 375 ELLESFDKITDKIQEQSVR 393 E+ E + KIQ + Sbjct: 556 EVDEKISALLSKIQYPVLT 574 >gi|311264542|ref|XP_003130217.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Sus scrofa] Length = 998 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 54/156 (34%), Gaps = 21/156 (13%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 + RIG I Y+ + ++ + +K ++ + T T A+ HA ++ Sbjct: 688 TARIGIINYSHKVEEVAHLTQFSSKDALKRAVDNMQYLGEGTYTATAL-HAANRMFEASR 746 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +KK + ITDG+ ++ N + + + ++I+ + V + Sbjct: 747 PE--------VKKVALVITDGQTDT----RDEKNLTDVVKNASDINVEIFVIGVVKKNDP 794 Query: 354 Q-----DLLRKCTDSSG--QFFAVNDSRELLESFDK 382 + + +D L ++ + Sbjct: 795 NFEMFHKEMNLIATDPDSEHVYQFDDFITLQDTLKQ 830 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 57/158 (36%), Gaps = 15/158 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 D + +L + + + +++ + ++ + + +L+ K R+ Sbjct: 65 FDKQKDFVDSLSDKLFQLTPVGSLKYDIKLAALQFSSSVQIDPPFSSWKDLHTFKQRVKS 124 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N T +Y A+ +A R L E K + +TDG + + + Sbjct: 125 MNFIGQGTFSYYAIANATRLLKREGR--------KDSVKVALLMTDGID-----HPKNPD 171 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 I E RNAG+ ++ +S L +S G Sbjct: 172 VQSISEDARNAGIIFITIGLSTVVNETKLHLISGNSPG 209 >gi|317056370|ref|YP_004104837.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315448639|gb|ADU22203.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 554 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 36/360 (10%), Positives = 94/360 (26%), Gaps = 30/360 (8%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + D + ++ + E A+ + +++ + Sbjct: 38 MIVGDNREQNISNNGQQNLDDADGLEWHYSEEAISDMDINTEEYNYYAENSYLSVAEHPL 97 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + ++ + I + + P+A+ + + I + ++S Sbjct: 98 STFSTDVDTASFTNIRRMIENNQNIDPNAVRTEEFINYFKYNYEYPDGDDKIAITTELSD 157 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + + L N K+ +++++ Sbjct: 158 CP--------WNADAKLMQIGLQAKDIDVQDIDSNIVFLIDVSGSMADENKLPLVVQAFA 209 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 L ++ + RI + Y N + S L L +T Sbjct: 210 MLAENLGEND--------RISIVTYAGRDTIELEGESGANYETIASTLAGLTAGGSTAGA 261 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 ++ AY N VI TDG+ + + + E R+ Sbjct: 262 AGINTAYELAEKYFIKGGNNR--------VILATDGDLNVGL--SSEEELKALIEEKRDK 311 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 G+ + + + L D+ G + ++ E + ++ +A + Sbjct: 312 GVFLSVLGFGMGNYKDNRLEALADNGNGNYAYIDSVDEAERV---LVTEMNGTMFTVAKD 368 >gi|322418525|ref|YP_004197748.1| hypothetical protein GM18_0996 [Geobacter sp. M18] gi|320124912|gb|ADW12472.1| hypothetical protein GM18_0996 [Geobacter sp. M18] Length = 389 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 27/272 (9%), Positives = 74/272 (27%), Gaps = 9/272 (3%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 I++ V + AID+ ++ +Q + + A L+G +I + T + Sbjct: 15 IMLVVFLVVTGLAIDIGYMYVSEEDLQHSAEMAALTGAQTIKQRYLYQAQTDPAR----L 70 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 + A + NN ++ + Sbjct: 71 PAISSDPVQAPARNAAVDLVTGKHDAAALVGLLNNNGNALTGDNDITVGFWNMSSRSYTP 130 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN---NMTSNKYL 180 + + + +R+ E SS L + +S + + Sbjct: 131 GGTPVNAMQVRTRRTAESSSVGLGTVGTFIAKISGTENFGSTPVATAALIPGTRANIAIC 190 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + + + + + P + K L ++ L S+ + +S + Sbjct: 191 AEACQSSCTFPQICSIPERRMSHLPWDTKGGAL--ASRYLYTSLLHPVTITNAMSDLVCQ 248 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + G ++ + + + + + Sbjct: 249 EMPVQEVCGQPIFTAASGSDAILNDIKAMMYD 280 >gi|317057468|ref|YP_004105935.1| von Willebrand factor type A [Ruminococcus albus 7] gi|315449737|gb|ADU23301.1| von Willebrand factor type A [Ruminococcus albus 7] Length = 782 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 18/199 (9%), Positives = 65/199 (32%), Gaps = 20/199 (10%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + P +++ +K + RIG + + +++ Sbjct: 292 NSGSMYPKELCPTSSENDVDFKRLDFTQSLIDKFDSDFRIGISKF--TGTYTRMCGFTDD 349 Query: 259 LNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + + ++ + T+ A+ E + + ++ ++DG Sbjct: 350 RTALSDVIKRIRTEDEIFDGTHNQTALKRCIEEFTATGDG--------KYVNIIVMLSDG 401 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDS 373 E+ N + + + + +V + + L++ + G++++ +++ Sbjct: 402 ESDE----TNAESIKNLARLANEKSVIVLTVGLGREI-DRAWLQEMAYSTGGKYYSASEA 456 Query: 374 RELLESFDKITDKIQEQSV 392 L + + +I + V Sbjct: 457 NALDDVYKQIVTTLNYDIV 475 >gi|257878265|ref|ZP_05657918.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] gi|257812493|gb|EEV41251.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] Length = 1107 Score = 59.9 bits (143), Expect = 6e-07, Method: Composition-based stats. Identities = 27/272 (9%), Positives = 78/272 (28%), Gaps = 26/272 (9%) Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 + + + +V+D S SM D L Sbjct: 257 NARGNVQKDITPLDLVLVVDWSGSMNDNNRIGEVKIGVDRFVDTLADSGITDKINMGYVG 316 Query: 196 KSKYAPAPAPANRKI---DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S + + ++ D + ++ S + G++ Sbjct: 317 YSSEGYSYSNGAVQMGSFDSVKNQVKSITPSRTNGGTFTQKALRDAGSMLSVPNGHKKVI 376 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 L++ + ++ +++ ++N Y + ++ + G+T L + Sbjct: 377 VLLTDCVPTFSYKVQRVHAQSSSNYYG----------TQFSNTQDRPGNTSLISRIYDAP 426 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTDS 363 D N + T+ ++ G++I+ + + + + +R+ S Sbjct: 427 DQNNLSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSS 486 Query: 364 ---SGQFFA-VNDSRELLESFDKITDKIQEQS 391 ++ + + ++ E K +I Sbjct: 487 DEKGDLYYESADHATDISEYLAKKAVQISATV 518 >gi|156324784|ref|XP_001618484.1| hypothetical protein NEMVEDRAFT_v1g2784 [Nematostella vectensis] gi|156199073|gb|EDO26384.1| predicted protein [Nematostella vectensis] Length = 410 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 53/160 (33%), Gaps = 21/160 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYN 290 + +G + Y+ ++ + +N + P T T A+ A +L+ Sbjct: 269 SSERTHVGLVLYSFFTQLMFNFDKYSDSASIVKAINTTDYPKGGTRTGEALKMAKSQLFG 328 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 S + K +I +TDG +++ + +++ G+ I++V V Sbjct: 329 ASMRS--------VPKVLIVLTDG--------RSSDKVEAPSKALKDEGVVIFAVGVG-D 371 Query: 351 PEGQDLLRKCTDS--SGQFFAVNDSRELLESFDKITDKIQ 388 L S F +EL D I K Sbjct: 372 QIDPSELNVMASDSKSDHVFKAK-FKELDRLVDLIKRKAC 410 >gi|84498072|ref|ZP_00996869.1| hypothetical protein JNB_18333 [Janibacter sp. HTCC2649] gi|84381572|gb|EAP97455.1| hypothetical protein JNB_18333 [Janibacter sp. HTCC2649] Length = 656 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 60/191 (31%), Gaps = 26/191 (13%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT----IAYNIGIVGNQCTPL-SNNL 259 KI+ + +V ++ Q + P+ + + Sbjct: 55 SGLTKIEAAKRALTGVVGALPDTAQVGLRVYGAKVDGKGKPTPAACADTQLVHPIATLDK 114 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++ S + + T ++ A ++L K+ ++ ++DGE Sbjct: 115 PKLTSTIAAIKALGETPIAHSLTEALKDLGTSG------------KRNIVLVSDGEE--- 159 Query: 320 SAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 + + AG+ +I +V + + L+ + G ++ D+ L Sbjct: 160 ---SCVPDPCPAITKLTAAGVDLQIDTVGFGVNTKARAQLQCIAAAGKGTYYDAKDASAL 216 Query: 377 LESFDKITDKI 387 S K++ + Sbjct: 217 TTSLSKLSQRA 227 >gi|294102191|ref|YP_003554049.1| hypothetical protein Amico_1203 [Aminobacterium colombiense DSM 12261] gi|293617171|gb|ADE57325.1| hypothetical protein Amico_1203 [Aminobacterium colombiense DSM 12261] Length = 329 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 16/116 (13%), Positives = 41/116 (35%), Gaps = 5/116 (4%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKD--PTTKKDQ 59 A + V ++D ++ R ++Q+A+DA L+G + + + + Sbjct: 20 VAASMVVLLGAGALSLDYGRLVVARWRLQTAVDAGSLAGAWELGNKSASQALREASAAQV 79 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD---KNNPLQYIAESKAQYE 112 ++ + + + + + I +T N +AE+ A+ Sbjct: 80 AGSVASDNKSEGAYAVDFPDADTCHVEGQETIAMTFARILGVNESTVLAEAAARLS 135 >gi|219683166|ref|YP_002469549.1| FctX [Bifidobacterium animalis subsp. lactis AD011] gi|219620816|gb|ACL28973.1| FctX [Bifidobacterium animalis subsp. lactis AD011] Length = 879 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 36/397 (9%), Positives = 106/397 (26%), Gaps = 60/397 (15%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + T Q +T + + + G + D++ Sbjct: 42 ASASVAAFADDRQPAATADPQAATASAGNVDAPQHTKRISKNDDGTYTLSMDVTGKSDES 101 Query: 99 NPLQYI-AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 Q + + ++ L G + L + ++ + +S Sbjct: 102 TEQQVVPLDIALVLDVSGSMNELSGKLVYNEVELLSMNPISTYYVEKDGSYQAVRCSAIS 161 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 Q + + Y P + + ++ + ++D L ++ Sbjct: 162 -WGRCTTWQDQDSAGQKYTVTYNWIGGP----SASVSPDVQFYKSKQSEETRLDALKDAV 216 Query: 218 GNLVNSIQKAI----------------------QEKKNLSVRIGTIAYNIGIVGNQCTPL 255 ++ ++ + N + YN + Sbjct: 217 TYFLDQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDTYNEDGYNYNYSQTVHSLAWT 276 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 +L + ++ +N L T + HA ++L + + + +K +F +DG Sbjct: 277 PEDLQKEQAAVNSLKAGGATRADFGLQHAVKQLNSGRPGA---------QKLTVFYSDGS 327 Query: 316 NS--GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP------PEGQDLLRKCTDS---- 363 + + N ++ ++N ++ S+ + + + Sbjct: 328 PTSSDGFEAKIANNAIKAAAQLKNDHSQVISIGAMPGADPSGTDNANKFMNYVSSNYPKA 387 Query: 364 ----------SGQFFAVNDS-RELLESFDKITDKIQE 389 G ++ + +L F +I + Sbjct: 388 QSMSEPHDRVEGTYYYAVSARTDLQTIFKEIISIVTS 424 >gi|156743215|ref|YP_001433344.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156234543|gb|ABU59326.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 419 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 66/201 (32%), Gaps = 15/201 (7%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 T+ + + + + V E + + + ++ Sbjct: 38 TQVRAPVNVCFVIDRSGSMKGEKIDRVRRATIRAIEMLDAQDVVSVVIFDHRTEVLIPAT 97 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 E+ R+N++ T PA+ RE+ G + + + +I +TDG Sbjct: 98 PVAKPAELADRVNRVRDSGGTRIAPAIEAGLREID---------KGPSHMVRRLILLTDG 148 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 + ++ + L+ E + I ++ V L+ S G ++ Sbjct: 149 QT------ESESDCLRRAEDAGRRNVPITALGVGKDWNEDLLIEMANRSGGTADYIDRPE 202 Query: 375 ELLESFDKITDKIQEQSVRIA 395 ++++ F + Q +V+ A Sbjct: 203 KIVDYFQNTIQRAQATTVQNA 223 >gi|284029341|ref|YP_003379272.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283808634|gb|ADB30473.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 315 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 65/209 (31%), Gaps = 34/209 (16%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 S A + + V E+A V ++ + R T+ P S Sbjct: 95 SNSMAATDVSPDRFTVAKEAATEFVRNLPEQFNVGLVSFARTATV----------VAPPS 144 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N + +L ++T A+ + + + + + ++ ++DG N Sbjct: 145 TNHQAAVDAIEQLTLTDSTAIGEAVLTSLQAVRSLDAQAA----EDPPPARIVLLSDGGN 200 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD- 362 + AG+ + ++A P P + LR D Sbjct: 201 TSGRPIDEG------ARAATEAGVPVSTIAYGTPEGTIDLEGRSIPVPADTESLRGLADA 254 Query: 363 SSGQFFAVNDSRELLESFDKITDKIQEQS 391 +SG F+A EL + + + I + Sbjct: 255 TSGSFYAAESDEELRDVYSDLQSSIGWTT 283 >gi|220941746|emb|CAX15447.1| novel protein similar to vertebrate collagen, type VI, alpha 3 (COL6A3) [Danio rerio] Length = 2026 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 37/319 (11%), Positives = 95/319 (29%), Gaps = 16/319 (5%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 I + Q + Q++ + + L S + + + Sbjct: 358 ADLAEAIKYVIRNELQASAGVRLAQASQHLVVLTGGRSTSDVSTYGSILKGSRVNCIGIG 417 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + R +A S VL V + N S ++ PP + + Sbjct: 418 AENADSRQLIQIATSSDDVLQVPSFPNLPNI--QNKFIARLSGSIVVEPPIEIDETTPGL 475 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT--IAYNIGIVGNQC 252 ++K A + I++ + ++ I I R+ Y + Sbjct: 476 PQAKAADIVFLVDGSINLGRNNFKEVMEFILNLIDLFYTERDRLQIGLAHYATDVTDVFY 535 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 NN +++ + + + + + + ++ +T Sbjct: 536 LNTYNNKDDIINAITRAEYKGGREIRTGSA-IRHVQKTHFVKEKGSRKDEGIPQILMVVT 594 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 G ++ ++ ++ +G+++Y+V V + +D L + + Sbjct: 595 GG--------RSRDDSKSAALGLKASGVRVYAVGVG---DIEDELNNLGSEATTVARAST 643 Query: 373 SRELLESFDKITDKIQEQS 391 +EL E ++I D + + Sbjct: 644 FQELSELNEQILDTLDQDV 662 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 22/226 (9%), Positives = 61/226 (26%), Gaps = 18/226 (7%) Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 P + K+ + + + +V+ + Sbjct: 254 TKVSTMTRDEISTPAVPRDPLNLGRKDIIFLIDGSDSV-GQSGVAHIRDFILKVVDQL-- 310 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + + VR+ + Y +N V S + +L + A + Sbjct: 311 ---DVRPDQVRVALVQYGERPKTEFSLNSHDNKQSVISAIKRLRHMGGR--GADLAEAIK 365 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + + + + + + ++ +T G ++ + ++ G ++ + Sbjct: 366 YVIRNELQASAGVRLAQASQHLVVLTGGRSTSDVSTYGSIL----------KGSRVNCIG 415 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + A L + SS V L +K ++ V Sbjct: 416 IGAENADSRQLIQIATSSDDVLQVPSFPNLPNIQNKFIARLSGSIV 461 >gi|260463263|ref|ZP_05811464.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030853|gb|EEW32128.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 661 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 40/323 (12%), Positives = 72/323 (22%), Gaps = 79/323 (24%) Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 V D S + P P + N Sbjct: 337 YPYNVNDAPPSGGSANTGIGVGDPATMFVPMFAPDEPGNHWKLTQDPDEAAPVTYGAVNS 396 Query: 209 KIDVLIESAGN---LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-----NLN 260 + S L N + N N N TPL++ Sbjct: 397 WWNDDPSSTSGQSRLRNMAKYFQPRPINAPALPTGNGPNYSCTTNAITPLTDVSVTDGAT 456 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE-SSHNTIGSTRLKKFVIFITDG----- 314 +K+ ++ + P TN M +R + + + + K VI +TDG Sbjct: 457 AIKAAIDLMQPNGGTNVPEGMAWGWRVVSSGEPFTQGRLETEKGNDKVVIVLTDGANTYY 516 Query: 315 -----------------------------------------------ENSGASAYQNTLN 327 ++G Sbjct: 517 TPSSLSYSDPADSKSTYASYGYLNPGYNGTSVGRMFMGTSTAIGQFDYSNGNYTNALNEQ 576 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCT------------DSSGQFFA 369 +C + A + + +VA+ D L+ C+ + F Sbjct: 577 MATLCNNAKAANIMVMTVALDLSTTKASDKLAIDALKSCSSESRFRKDPTDPSKPAKLFW 636 Query: 370 VNDSRELLESFDKITDKIQEQSV 392 L F +I +++ V Sbjct: 637 NATGASLSNDFKEIGNELSNLRV 659 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 27/187 (14%), Positives = 49/187 (26%), Gaps = 20/187 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTA+ + + AID + + + +ALDAA + + DQ Sbjct: 19 MTAVAMIPLMGGLALAIDFTEMNREKQMVTNALDAANFATARRLTE-------GATDDQL 71 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +L + ++ Y+ F Sbjct: 72 RAYALDFFNANLNDLNPANATLNLTLPSNTA-------GGGLLKMTARLNYKPYFYPAFA 124 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ------KHNDNNNM 174 + + SA S + + + + +VLD S SM L + Sbjct: 125 QLVGKSATDANQSISFDVTSQVRLKNTLEVALVLDNSGSMTTLGTGSGQKRIDLLKTASK 184 Query: 175 TSNKYLL 181 L Sbjct: 185 QLVDTLA 191 >gi|47228042|emb|CAF97671.1| unnamed protein product [Tetraodon nigroviridis] Length = 1071 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/191 (15%), Positives = 68/191 (35%), Gaps = 16/191 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL-SVRIGTIAYNIGI-VGNQCTPLSNNLNEVKSRLN 267 ID + N ++ ++ + + + G + Y+ + + + + + +K+ ++ Sbjct: 27 IDQIKTFTTNFIDELKNIRHQCDRILTWNSGALHYSDEVILVGELMDMQTQRSTLKTSIS 86 Query: 268 KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + T T A+ EL H K+++ +TDG Sbjct: 87 GIEYIGKGTYTDCAIKRGLSELLIGGSHYHEN-------KYIVVVTDGHPLTGYKEPCG- 138 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ--FFAVNDSRELLESFDKIT 384 + R G+K+++VA+ +P + L F D ++ + D I Sbjct: 139 GVQEAANEARQHGVKVFAVAI-SPDQEDTRLSLIATDHNYRQNFTAADDSKITK-MDTI- 195 Query: 385 DKIQEQSVRIA 395 I VR++ Sbjct: 196 HTIINMIVRVS 206 >gi|304411390|ref|ZP_07393004.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS183] gi|307305288|ref|ZP_07585036.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica BA175] gi|304350245|gb|EFM14649.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS183] gi|306911591|gb|EFN42016.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica BA175] Length = 771 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 40/259 (15%), Positives = 79/259 (30%), Gaps = 22/259 (8%) Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK---SKYA 200 + + V + + + + ++K S K S Sbjct: 329 QQGTSPMAWVFNQQGKTHKP-DGDNLSQDTLETSKANGVNEDNYSLVMVLPPKVEKSTQP 387 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKA---IQEKKNLSVRIGTIAYNIGIVGNQCTPL-- 255 P ID AG+ + + A + I +N + TPL Sbjct: 388 SLPRELILVIDTSGSMAGDSIVQAKNALLYALKGLKPEDSFNIIEFNSSLSQFSATPLPA 447 Query: 256 -SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 S+NL+ + +++L T A+ A K + + + + VIF+TDG Sbjct: 448 TSSNLSRARQFVSRLQADGGTEMALALDAAL-----PKSLGSVSPDAVQPLRQVIFMTDG 502 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 A + + ++++V + + P + R G F + Sbjct: 503 SVGNEQALFDLIRYQIGES-------RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGKVD 555 Query: 375 ELLESFDKITDKIQEQSVR 393 E+ + KIQ + Sbjct: 556 EVDAKISALLSKIQYPVLT 574 >gi|148657485|ref|YP_001277690.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569595|gb|ABQ91740.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 459 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 10/130 (7%), Positives = 47/130 (36%), Gaps = 13/130 (10%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 +++++ L+++ T + +++++ V+ ++DG+ + Sbjct: 151 RDKMENLLDEIRASGMTALDGGLAQGIDLGQKKRQATT----------LVLLLSDGQANV 200 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + R +G+ + ++ V ++ G+F+ + + ++ Sbjct: 201 GETDLEKIGLR--AQKARQSGLIVSTLGVGLDYNEALMVEIANQGGGRFYHIQEGSQIPA 258 Query: 379 S-FDKITDKI 387 + ++ Sbjct: 259 ALMQELGSAA 268 >gi|47522678|ref|NP_999068.1| inter-alpha-trypsin inhibitor heavy chain H2 precursor [Sus scrofa] gi|3024050|sp|O02668|ITIH2_PIG RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|1915954|emb|CAA72308.1| inter-alpha-inhibitor heavy-chain H2 [Sus scrofa] Length = 935 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 18/156 (11%), Positives = 59/156 (37%), Gaps = 11/156 (7%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + + + K+ + K+ P TN A+ A L S Sbjct: 346 TWRNDLVSATKTQVADAKTYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNS---V 402 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS- 364 +I ++DG+ + + + + + +++ + ++S+ + D L++ ++ + Sbjct: 403 SLIILVSDGDPTVGELQLSKIQ-KNVKQNIQD-NVSLFSLGIGFDV-DYDFLKRLSNDNR 459 Query: 365 GQFFAV----NDSRELLESFDKITDKIQEQSVRIAP 396 G + + + +L + +++++ + P Sbjct: 460 GMAQRIYGNQDTASQLKKFYNQVSTPLLRNVQFNYP 495 >gi|293570439|ref|ZP_06681494.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291609385|gb|EFF38652.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1042 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/272 (8%), Positives = 70/272 (25%), Gaps = 26/272 (9%) Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 + + + +V+D S SM D L Sbjct: 279 NARGNVQKDITPLDLVLVVDWSGSMNDNDRIGEVKIGVDRFVDTLSDSGITDKINMGYVG 338 Query: 196 KSKYAPAPAPANRKI---DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S + ++ D + ++ S + G + Sbjct: 339 YSSDGYNYSNGTVQMGSFDSVKNQVKSITPSWTNGGTFTQKGLRDAGDMLSVPNGHKKVI 398 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 L++ + ++ +++ + + ++ + G+T + Sbjct: 399 VLLTDGVPTFSYKVQRVHA----------QPSNDYYGTQFSNTQDQPGNTSRIARSYYAP 448 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT----------- 361 D N + T+ ++ G++I+ + + + L K Sbjct: 449 DQNNQSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSA 508 Query: 362 -DSSGQFFA-VNDSRELLESFDKITDKIQEQS 391 + ++ + + ++ E K +I Sbjct: 509 DEKGDLYYESADHATDISEYLAKKAVQISATV 540 >gi|319786316|ref|YP_004145791.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] gi|317464828|gb|ADV26560.1| von Willebrand factor type A [Pseudoxanthomonas suwonensis 11-1] Length = 340 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 34/200 (17%), Positives = 73/200 (36%), Gaps = 36/200 (18%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--EN-T 275 + + + + I + G TPL+ +L+ V+ +L T Sbjct: 121 DRLTAAKAVIADFLQRRSGDRVGLLVFGQRAYMLTPLTLDLSAVREQLRDTVAGLAGRET 180 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 A+ A + L + E ++ ++ +TDG N+ L L+ E Sbjct: 181 ALGDAIGLAVKRLRTQPEG----------QRVLVLLTDGVNTTGV-----LQPLKAAELA 225 Query: 336 RNAGMKIYSVAVSAPPE----------------GQDLLRKCTD-SSGQFFAVNDSRELLE 378 +++Y++A + LRK + + G+FF +D+ +L Sbjct: 226 AAEQVRVYTIAFGGDGGGFSLFGVQVPVQGDEVDEATLRKVAEITGGRFFRAHDANQLAG 285 Query: 379 SFDKITDKIQEQSVRIAPNR 398 + ++ ++++ V AP R Sbjct: 286 IYAEL-ERLEPVGVETAPVR 304 >gi|311254860|ref|XP_003125977.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Sus scrofa] Length = 874 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 52/135 (38%), Gaps = 26/135 (19%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ + A+ + + S ++ +TDGE++ T Sbjct: 324 TASGGTSICSGIRRAFEVV--------RKLYSHTDGSEIVLLTDGEDN----------TA 365 Query: 330 QIC-EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKIT- 384 C + ++ +G I+ +A+ P + ++ T + G F D E L+++F + Sbjct: 366 GACVDEVKQSGAIIHFIALG-PSADKAVIEMSTATGGVHFYATDEAENNGLIDAFGALAS 424 Query: 385 --DKIQEQSVRIAPN 397 I +QS+++ Sbjct: 425 GNTDISQQSLQLESK 439 >gi|156370019|ref|XP_001628270.1| predicted protein [Nematostella vectensis] gi|156215242|gb|EDO36207.1| predicted protein [Nematostella vectensis] Length = 1552 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/129 (18%), Positives = 49/129 (37%), Gaps = 9/129 (6%) Query: 258 NLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N +K +++L T A+ + +L+ EK KK + +TDG Sbjct: 12 NNVNIKRDIDELRLERGLTFIDKALKISAEKLFTEKNGMRLNR-----KKVALVLTDGIQ 66 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + L + + M++ G+++YS+ + + + +L+ F EL Sbjct: 67 TKDKGPFTPL--QKASQPMKDKGVEVYSLGIGSDIDVSELITF-ASGEKYVFNAKSFDEL 123 Query: 377 LESFDKITD 385 + IT Sbjct: 124 QLQVENITQ 132 >gi|153874442|ref|ZP_02002664.1| von Willebrand factor, type A [Beggiatoa sp. PS] gi|152069095|gb|EDN67337.1| von Willebrand factor, type A [Beggiatoa sp. PS] Length = 478 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 29/142 (20%), Positives = 56/142 (39%), Gaps = 18/142 (12%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 L+++L ++ + KL T + A +L +S ++ ++ TD Sbjct: 100 NLTSDLMNLEQPIQKLRAVGGTPMDRGLQSAMNQLSAGSDSE---------QRSILLFTD 150 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 G+ N TL + ++NA ++I VA++ LL + T + F Sbjct: 151 GKPD------NQRTTLNASQLVKNANIQI--VAIATDDADIGLLTQVTGDAALVFP-TSV 201 Query: 374 RELLESFDKITDKIQEQSVRIA 395 ++F K I EQ++ A Sbjct: 202 GNFDQAFQKAEQAIYEQNLVTA 223 >gi|172065275|ref|YP_001815987.1| hypothetical protein BamMC406_5998 [Burkholderia ambifaria MC40-6] gi|171997517|gb|ACB68434.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 423 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 17/113 (15%), Positives = 38/113 (33%), Gaps = 1/113 (0%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + ++V F+ A+DL + R+++Q++ DA LS + S +++ Sbjct: 25 IVGLALAVMIGFVGLALDLGKLYVTRSELQNSADACALSAARDLTSAISLQVAEADGIAA 84 Query: 61 STIFKKQIKKHLKQGSYIRE-NAGDIAQKAQINITKDKNNPLQYIAESKAQYE 112 +++ Q D + T + AQ Sbjct: 85 GHANFAFFQQNAVQMQTDSNVTFSDSLTNPFLTKTAVATPANVKYVKCTAQLS 137 >gi|296481520|gb|DAA23635.1| inter-alpha globulin inhibitor H2 polypeptide [Bos taurus] Length = 946 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 27/226 (11%), Positives = 79/226 (34%), Gaps = 23/226 (10%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + P PK + + + S + + ++ + +L ++ + N + Sbjct: 299 FAPENMDPIPKNILFVIDVSGSMWGI---KMKQTVEAMKTILDDLRTEDHFSVVDF-NHN 354 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 VR + + K+ + K+ P TN A+ A L Sbjct: 355 VRTWRNDLVSATKTQ--------VADAKNYIEKIQPSGGTNINEALLRAIFILNEANNLG 406 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 S +I ++DG+ + + + + + +R+ + ++S+ + D Sbjct: 407 MLDPNS---VSLIILVSDGDPTVGELKLSKIQ-KNVKQNIRD-NISLFSLGIGFDV-DYD 460 Query: 356 LLRKCTDSS----GQFFAVNDSR-ELLESFDKITDKIQEQSVRIAP 396 L++ ++ + + + D+ +L + +++++ + P Sbjct: 461 FLKRLSNDNRGIAQRIYGNQDTSVQLKKFYNQVSTPLLRNVQFNYP 506 >gi|148238273|ref|NP_001091485.1| inter-alpha-trypsin inhibitor heavy chain H2 [Bos taurus] gi|146186952|gb|AAI40657.1| ITIH2 protein [Bos taurus] Length = 946 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 27/226 (11%), Positives = 79/226 (34%), Gaps = 23/226 (10%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + P PK + + + S + + ++ + +L ++ + N + Sbjct: 299 FAPENMDPIPKNILFVIDVSGSMWGI---KMKQTVEAMKTILDDLRTEDHFSVVDF-NHN 354 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 VR + + K+ + K+ P TN A+ A L Sbjct: 355 VRTWRNDLVSATKTQ--------VADAKNYIEKIQPSGGTNINEALLRAIFILNEANNLG 406 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 S +I ++DG+ + + + + + +R+ + ++S+ + D Sbjct: 407 MLDPNS---VSLIILVSDGDPTVGELKLSKIQ-KNVKQNIRD-NISLFSLGIGFDV-DYD 460 Query: 356 LLRKCTDSS----GQFFAVNDSR-ELLESFDKITDKIQEQSVRIAP 396 L++ ++ + + + D+ +L + +++++ + P Sbjct: 461 FLKRLSNDNRGIAQRIYGNQDTSVQLKKFYNQVSTPLLRNVQFNYP 506 >gi|90403616|ref|NP_001035046.1| integrin alpha M [Bos taurus] gi|74229861|gb|AAX46797.1| integrin alpha M [Bos taurus] Length = 1152 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 24/120 (20%), Positives = 40/120 (33%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL++ + N K +I ITDGE Y + L Sbjct: 223 GRTHTATGIRKVVRELFHSSSGARNHA-----IKIMIVITDGE-----KYLDPLEYSDAI 272 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 + Y + V + + L F VN+ L +++ +KI Sbjct: 273 PEADREKIIRYVIGVGDAFRGRKSRQELDTIASKPPADHVFQVNNFEALKTIQNQLQEKI 332 >gi|149437043|ref|XP_001515962.1| PREDICTED: hypothetical protein [Ornithorhynchus anatinus] Length = 948 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 30/226 (13%), Positives = 80/226 (35%), Gaps = 23/226 (10%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 L P PK + + + S + + ++ + +L Q ++ + + Sbjct: 302 FAPENLDPIPKNILFVIDVSGSMWG---VKMKQTVEAMKTILDDLRAEDQFSVIDFNHN- 357 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 ++ +V ++ K + K+ P TN A+ A L Sbjct: 358 ----VRSWKDNLVPATDLMTTD----AKKYIEKIQPNGGTNINEALLRAIFILREASNLG 409 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 S +I ++DG+ + T+ + + MR+ + ++S+ + D Sbjct: 410 MLDPNS---VSLIILVSDGDPTVGELKP-TVIQKNVKKNMRD-NISLFSLGIGFDV-DYD 463 Query: 356 LLRKCT-DSSGQFFAV----NDSRELLESFDKITDKIQEQSVRIAP 396 L + + ++ G + + S +L + +++++ + + P Sbjct: 464 FLERLSRENHGMAQRIYGNQDTSSQLKQFYNQVSTPLLQNVQFNYP 509 >gi|46362531|gb|AAH68979.1| Slc35c2 protein [Danio rerio] Length = 1816 Score = 59.9 bits (143), Expect = 7e-07, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 54/156 (34%), Gaps = 18/156 (11%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKES 294 V+I Y+ + + + NT T A+ HA E+ Sbjct: 613 VQIALSQYSGDPRTEWHLNNFTSKEPLLEAVRNFRYKGGNTFTGQALIHALENNLKEEVG 672 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + +F++ +TDG++ + + ++NAG++I +A+ + Sbjct: 673 AR-----PNTPQFLLLLTDGKSQD--------DAIAAANRLKNAGVEI--IAIGVKNADE 717 Query: 355 DLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 LR+ + VND L + K+ + Sbjct: 718 AELRQVASEPLELNVYKVNDFPLLSKLVGKLARILC 753 >gi|332298719|ref|YP_004440641.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] gi|332181822|gb|AEE17510.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] Length = 333 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 47/155 (30%), Gaps = 41/155 (26%) Query: 254 PLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P + + +RLN L + + + A L + KK ++ Sbjct: 147 PPTLDREAFFARLNSLQAGELGDGSAIGMGVSTAAYHLISSA----------APKKSIVL 196 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------------ 352 ITDGEN+ S + + G+ +Y + V Sbjct: 197 ITDGENNAGSVH-----PGTAAQLAFENGITLYVLGVGTRGSVPLEYVDPATGKTYSGYL 251 Query: 353 ----GQDLLRKCTDS-SGQFFAVNDSRELLESFDK 382 + L++ + G++F V EL + Sbjct: 252 DSRFDESPLQEIALTAGGRYFGVESMGELTAAVSA 286 >gi|220925466|ref|YP_002500768.1| hypothetical protein Mnod_5624 [Methylobacterium nodulans ORS 2060] gi|219950073|gb|ACL60465.1| hypothetical protein Mnod_5624 [Methylobacterium nodulans ORS 2060] Length = 359 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 30/278 (10%), Positives = 69/278 (24%), Gaps = 29/278 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A + F A+D A+ +R+ +QS DA L D T +++ Sbjct: 19 IFAAAMIPLFGLAGAALDYANARRVRDVLQSISDATALLVA--------DADTPTVAERS 70 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + Q+ L S + + +K + Sbjct: 71 FKLAENQLISRLGDRSGSGGYTIKGE--------WLDGSSYKLTISTKINTILIHLLSGK 122 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT----- 175 + ++ R E L++ D +R + Sbjct: 123 SKQFEISAVTVANRIPPRYETKPPTLSLLSPEAADYNRIYMYCFSSDPKRQAETDGGRRG 182 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES-------AGNLVNSIQKAI 228 P ++ AP+ R + + ++ + Sbjct: 183 ITPIADNATPPSNYGDYAPPTCGDNEAPSYMLRNVRDARTNPTKWDAKYQSVYEYYTDVV 242 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 + + Y N P++ + N + + Sbjct: 243 IDTGTRRQTMNMKGYK-VYSSNYKEPINMDKNPILETI 279 >gi|156742135|ref|YP_001432264.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233463|gb|ABU58246.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 425 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 68/188 (36%), Gaps = 26/188 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + ++A +V+ + + + +N + ++K+ + ++ Sbjct: 65 VKDAAARIVDQLGQDDY--------FSLVVFNDRADVVIPAQRAIKKADLKAAIAQIEAA 116 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T M A +E+ + + +I +TDG Y + ++I Sbjct: 117 GGTEMAQGMALALQEVQRPFLTRGISR--------IILLTDG-----RTYGDESRCVEIA 163 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKC-TDSSGQFFAVNDSRELLESFDKITDK---IQ 388 ++ G+ + ++ + +DLL + + + ++++++ F + I Sbjct: 164 RRGQSRGIGLTALGIGTEWN-EDLLETMTASENSRAQYIATAQDVVKVFADEVKRLHAIF 222 Query: 389 EQSVRIAP 396 Q V+++ Sbjct: 223 AQQVQLSV 230 >gi|13928960|ref|NP_113879.1| integrin alpha-D precursor [Rattus norvegicus] gi|48428189|sp|Q9QYE7|ITAD_RAT RecName: Full=Integrin alpha-D; AltName: CD_antigen=CD11d; Flags: Precursor gi|6648592|gb|AAF21241.1|AF021334_1 alpha D integrin [Rattus norvegicus] Length = 1161 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 63/192 (32%), Gaps = 23/192 (11%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + R + + L+ S + Y+ + + N+ + Sbjct: 160 SGSINQRDFAQMKDFVKALMGEFAST-------STLFSLMQYSNILKTHFTFTEFKNILD 212 Query: 262 VKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +S ++ + T T + EL++ K S KK ++ ITDG+ Sbjct: 213 PQSLVDPIVQLQGLTYTATGIRTVMEELFHSKNGSR-----KSAKKILLVITDGQ----- 262 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRE 375 Y++ L + AG+ Y++ V P L + F V + Sbjct: 263 KYRDPLEYSDVIPAADKAGIIRYAIGVGDAFQEPTALKELNTIGSAPPQDHVFKVGNFAA 322 Query: 376 LLESFDKITDKI 387 L ++ +KI Sbjct: 323 LRSIQRQLQEKI 334 >gi|156409367|ref|XP_001642141.1| predicted protein [Nematostella vectensis] gi|156229282|gb|EDO50078.1| predicted protein [Nematostella vectensis] Length = 193 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 28/148 (18%), Positives = 48/148 (32%), Gaps = 16/148 (10%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELY 289 RIG I Y+ NL V + ++ T T AM +A R+L+ Sbjct: 36 VSPRRARIGLIVYSSRSYLVGGFRRYRNLRSVLQAIKRIRYIRGGTYTGKAMKYALRKLF 95 Query: 290 NEKESSHNTI------GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + + H+ K ++ ITDG + ++ G+ I+ Sbjct: 96 SRRAGYHHARVRLFRSSRKGAAKILVMITDGISQD--------RVTTPALRLKKMGVVIF 147 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 SV V + L++ F Sbjct: 148 SVGVGKRYRLKQLMQ-IASRPRLVFTAP 174 >gi|149176499|ref|ZP_01855112.1| hypothetical protein PM8797T_29982 [Planctomyces maris DSM 8797] gi|148844612|gb|EDL58962.1| hypothetical protein PM8797T_29982 [Planctomyces maris DSM 8797] Length = 598 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 51/371 (13%), Positives = 107/371 (28%), Gaps = 43/371 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQM------------QSALDA----AVLSGCASI 44 M A + F+ + ID+A I + +M Q DA A G + Sbjct: 27 MAAPFLVATMGFMAFGIDIAVITMTKTRMRNAVEAAALAAAQQITDAVQTTADGIGGSDN 86 Query: 45 VSDRTIKDPTTKKDQTSTIFKKQIK---KHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 VS + D + +K + ++ + + ++ +N Sbjct: 87 VSGDVQDANSIAIDTARAVAEKVARLNGVYIDPETDVEFGKRYQDSGGTFHMVWGENAKP 146 Query: 102 QYIAESKAQYEIPTEN-------LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 + + A+ + TE LF G + ++ + IE A I +VL Sbjct: 147 YNVVKVTARKDNATEGQPDSRLQLFFAGFMSEKTAAVTTSAIAFIE------ARDIVLVL 200 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLP-PPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 D S SM + + + ++ T S PA Sbjct: 201 DYSGSMSYDSEFDAMSSYRLGKSAVEANLDDIWETLVDSGATYSDSGKLKFPA------- 253 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 G + + + I + + GN P LN P Sbjct: 254 -TGYGRINSEVGTYISSTNDDYIYRALDLDEEDSSGNLKYPFPQEGKNYYGNLNG-EPSG 311 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 NTN ++ + +++ + + ++ + + + + Sbjct: 312 NTNKNLWKNYIKWVRSDGTVNNYGYRKKYGYRTLMGYLIEQRKLNSQSEDLWRAPIYPFN 371 Query: 334 YMRNAGMKIYS 344 M+ G+ +++ Sbjct: 372 AMKE-GVTLFT 381 >gi|197118196|ref|YP_002138623.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197087556|gb|ACH38827.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 331 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 56/174 (32%), Gaps = 43/174 (24%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE---NTNTYPAMHHAYRELYNEKE 293 RIG +A+ PL+++ ++ + +L+ T A+ L Sbjct: 132 RIGLVAFAGRPYPAA--PLTSDHQWLQGIVERLDTNSVEDGTALGDAILAGVNRL----- 184 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 + +I ITDG N+ + Q + G++++++ + + Sbjct: 185 -----RQRPAEGRALILITDGRNNAGAEPQLAAQAAKA------LGIRVHAIGIGSRGSA 233 Query: 353 --------------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 L+ + + G++F D+ L F +I Sbjct: 234 VIPVPSPLGGTIYRRLDADLDAATLKGVAEITGGRYFEAGDATVLSRVFAEIDR 287 >gi|194675927|ref|XP_001788181.1| PREDICTED: integrin, alpha E [Bos taurus] gi|297486662|ref|XP_002695836.1| PREDICTED: integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Bos taurus] gi|296476814|gb|DAA18929.1| integrin, alpha E [Bos taurus] Length = 1163 Score = 59.9 bits (143), Expect = 8e-07, Method: Composition-based stats. Identities = 27/158 (17%), Positives = 53/158 (33%), Gaps = 16/158 (10%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESS 295 R + Y I S ++ R+ ++ T T AM H ++ + S Sbjct: 222 RFAVVQYGEVIQTELDLLDSQDVRASLDRVKNISQVGKITKTASAMQHVLDNIFTPNQGS 281 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 K ++ +TDGE +Q+ LN + + G++ +++ V Sbjct: 282 RAKAS-----KVMVVLTDGE-----IFQDPLNLTTVINSPKMHGVERFAIGVGEAFNKSK 331 Query: 356 ---LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L +K+ + I Sbjct: 332 AYHELKLIASDPDEDHAFKVTNYMALDGLLNKLQESII 369 >gi|260881326|ref|ZP_05404133.2| putative von Willebrand factor type A domain protein [Mitsuokella multacida DSM 20544] gi|260849116|gb|EEX69123.1| putative von Willebrand factor type A domain protein [Mitsuokella multacida DSM 20544] Length = 428 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 21/221 (9%), Positives = 55/221 (24%), Gaps = 16/221 (7%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +TA ++ F A+D R+Q+Q+ + + +Q Sbjct: 15 LTAFLLPFIIAFTGMAVDFGSAYVRRSQLQN-----------AADAAALAGAYHLDDNQA 63 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + K +K +L + + + + + + L Sbjct: 64 DDVVLKYLKTNLDPHFTSYSYQTGDDFPDKFETLNYHTDKQKDELDVTLRSSVEASFL-- 121 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY- 179 L + + + + E+L ++ + + + + Sbjct: 122 -KLFDIDTIPVYATAKAKVSKEKESLPTDDMFNYAITVANKSYETGNPSIFMVSSGMNIK 180 Query: 180 -LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + FW+ A K ++ N Sbjct: 181 GNILTNGSILFWNDRVNTLDGKIYSAVPLNKQVWSNKAWDN 221 >gi|222616155|gb|EEE52287.1| hypothetical protein OsJ_34277 [Oryza sativa Japonica Group] Length = 367 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 32/184 (17%), Positives = 67/184 (36%), Gaps = 29/184 (15%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL----SNNLNEVK 263 ++DVL ++ ++ ++ R+ +A+N V T L N + Sbjct: 75 SRLDVLKDAMKFIIRKLEDGD--------RLSIVAFNDRPVKEYSTGLLDISGNGRRIAE 126 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++ L T PA+ A R L S F++ +TDG+++ + Sbjct: 127 KKVDWLEGRGGTALMPALEEAIRVLDC------RPGDSRNRVGFILLLTDGDDTSGFRWS 180 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + + + +++ + A + LL +S G + V+D DKI Sbjct: 181 RDVINGAVGK------YPVHTFGLGAAHSSEALLYIAQESRGTYSFVDDEN-----MDKI 229 Query: 384 TDKI 387 + Sbjct: 230 AGAL 233 >gi|126731955|ref|ZP_01747758.1| Von Willebrand domain containing protein [Sagittula stellata E-37] gi|126707487|gb|EBA06550.1| Von Willebrand domain containing protein [Sagittula stellata E-37] Length = 318 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 23/157 (14%), Positives = 57/157 (36%), Gaps = 31/157 (19%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKE 293 R+G + + P ++++ V ++ L + T + A R L Sbjct: 132 RVGLVVFGDRAYVAA--PQTHDVASVARLIDGLQIGVSGKATAIADGLGLAIRRL----- 184 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---- 349 + ++ ++DG+++ ++ + + R+ GM++Y++A+ Sbjct: 185 -----RERDAKSRVILLLSDGQDTTGM-----VDPVAAAQTARDLGMRVYTIALGPADLS 234 Query: 350 ------PPEGQDLLRKCT-DSSGQFFAVNDSRELLES 379 D LR+ + G+ F V + +L Sbjct: 235 DDPGARDAVDADTLRRIAQAAGGETFRVRTTDDLQAV 271 >gi|75906479|ref|YP_320775.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75700204|gb|ABA19880.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 615 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 41/142 (28%), Gaps = 10/142 (7%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y+ + + +K + ++ TN + ++ + Sbjct: 78 VVVYDDAVDTVVSPQPVTDKPALKKSIRQVRAGGITNLSGGWLKGCEYVKHQLDPQKINR 137 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 V+ +TDG + L + G+ ++ + L+ Sbjct: 138 --------VLLLTDGHANMGIQDPKILTATSAQKA--EEGITTTTLGFAQGFNEDLLIGM 187 Query: 360 CTDSSGQFFAVNDSRELLESFD 381 ++G F+ + E E F Sbjct: 188 ARAANGNFYFIQSIDEAAEVFS 209 >gi|327272012|ref|XP_003220780.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Anolis carolinensis] Length = 955 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 49/138 (35%), Gaps = 11/138 (7%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NN+ + K ++ ++P TN A+ + + L + + S +IF+TDG Sbjct: 366 NNIRDAKVYIHNMSPSGGTNINGALQISTKILNDYIAQNDIEARS---VSLIIFLTDGRP 422 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVND--- 372 + + + ++++ + LL + + G V + Sbjct: 423 TFGEIEPAKIINNT--KEAIRNKFCLFTIGIG-NDVDYKLLERLALENCGMMRRVREEED 479 Query: 373 -SRELLESFDKITDKIQE 389 + +L + +I + Sbjct: 480 AAEQLKGFYYEIDTPLLS 497 >gi|206901991|ref|YP_002251775.1| von Willebrand factor type A domain protein [Dictyoglomus thermophilum H-6-12] gi|206741094|gb|ACI20152.1| von Willebrand factor type A domain protein [Dictyoglomus thermophilum H-6-12] Length = 890 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 28/140 (20%), Positives = 52/140 (37%), Gaps = 19/140 (13%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 IA++ N E S ++K++P T YP + A L S Sbjct: 435 LIAFDHSYQWIVPLQPLTNKEETASLISKISPGGGTALYPPLKSAGEALIKAPIKS---- 490 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 K +I ITDG+ G Y + +Y+ + + ++ + LL+ Sbjct: 491 ------KHIIAITDGQTEGGDFYN-------LVKYLAKYKITVSTIGIGEDAN-IPLLKD 536 Query: 360 CTD-SSGQFFAVNDSRELLE 378 + +G+F+ + R L + Sbjct: 537 IANWGNGRFYHTWNIRNLPQ 556 >gi|326670658|ref|XP_003199261.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1823 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 54/170 (31%), Gaps = 15/170 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 + RI + Y+ N E+ + L T A+ + Sbjct: 235 DVAENRDRISVVQYSREPEANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTGEALQYVRDN 294 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + ++ ++ G ++ + ++ G+ + + Sbjct: 295 VFTASSGSRRL---EGVPQILVLLSGG--------RSFDSVDAAASSLKELGVL--TFGI 341 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + L++ + +V+D EL +++ +Q S+ + P Sbjct: 342 GSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPT 391 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 54/170 (31%), Gaps = 15/170 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 + RI + Y+ N E+ + L T A+ + Sbjct: 1034 DVAENRDRISVVQYSREPEANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTGEALQYVRDN 1093 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + ++ ++ G ++ + ++ G+ + + Sbjct: 1094 VFTASSGSRRL---EGVPQILVLLSGG--------RSFDSVDAAASSLKELGVL--TFGI 1140 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + L++ + +V+D EL +++ +Q S+ + P Sbjct: 1141 GSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPT 1190 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 32/337 (9%), Positives = 100/337 (29%), Gaps = 33/337 (9%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + ++++ + + + + ++ A L Sbjct: 284 TGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSVDAAAS-----------SLK 332 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 + + S G R + ++ L VS E +Q+ + ++ + P Sbjct: 333 ELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTS-IPVTPT 391 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK--NLSVRIGTIA 242 +T + D S + + +Q+ +++ R+ + Sbjct: 392 SPTVTAEYSTPRKDVVFL----LDGSDGTRSSFPAMRDFVQRVVEKFNIEANRDRVSVVQ 447 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIG 300 Y+ + ++ R+ L T A+ + ++ S Sbjct: 448 YSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTGAALQYVRDNVFTASSGSRRL-- 505 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + +I ++ G ++ + ++ G+ ++ + + L++ Sbjct: 506 -EGVPQILILLSGG--------RSFDSVDAAASSLKELGVL--TLGIGSRGSDSRELQRI 554 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + +V D EL +++ +Q ++ P Sbjct: 555 SYEANYALSVADFSELPNVQEQLLASVQTVAMSGTPT 591 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 32/337 (9%), Positives = 100/337 (29%), Gaps = 33/337 (9%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + ++++ + + + + ++ A L Sbjct: 1083 TGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSVDAAAS-----------SLK 1131 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 + + S G R + ++ L VS E +Q+ + ++ + P Sbjct: 1132 ELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTS-IPVTPT 1190 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK--NLSVRIGTIA 242 +T + D S + + +Q+ +++ R+ + Sbjct: 1191 SPTVTAEYSTPRKDVVFL----LDGSDGTRSSFPAMRDFVQRVVEKFNIEANRDRVSVVQ 1246 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIG 300 Y+ + ++ R+ L T A+ + ++ S Sbjct: 1247 YSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTGAALQYVRDNVFTASSGSRRL-- 1304 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + +I ++ G ++ + ++ G+ ++ + + L++ Sbjct: 1305 -EGVPQILILLSGG--------RSFDSVDAAASSLKELGVL--TLGIGSRGSDSRDLQRI 1353 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + +V D EL +++ +Q ++ P Sbjct: 1354 SYEANYALSVADFSELPNVQEQLLASVQTVAMSGTPT 1390 >gi|291486255|dbj|BAI87330.1| hypothetical protein BSNT_05611 [Bacillus subtilis subsp. natto BEST195] Length = 227 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 43/128 (33%), Gaps = 16/128 (12%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 S N + LN + P T A++ A +K V +TDGE Sbjct: 108 SFNEQSFLNSLNTIGPTGWTPIAKALNEAKSSFDQLDAKG---------EKVVYLLTDGE 158 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSR 374 + N ++ + ++ + + + + L G++F V + Sbjct: 159 ETCGG------NPIKTAKELQKENITVNVIGFDYKEGYKGQLNAIAKVGGGEYFPVYTQK 212 Query: 375 ELLESFDK 382 ++ + F + Sbjct: 213 DVEKIFTQ 220 >gi|198429401|ref|XP_002121222.1| PREDICTED: similar to integrin alpha 9 [Ciona intestinalis] Length = 1242 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 31/193 (16%), Positives = 61/193 (31%), Gaps = 22/193 (11%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY---NIGIVGNQCTPLSNNLNE 261 + +D +S + I V +G I + N + + + Sbjct: 158 SGSVDVDEYRDSLNWMKQVISSFRSYIDKGDVHVGVIGFSRLNNIDTKVRIRLQAWSYTS 217 Query: 262 VKSRLNKL----NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + S++N + + T A++ E + + K +I +TDG Sbjct: 218 LTSQINNMVNVRSLNGLTYIGYAINLTITEFDDHGR--------ESVPKEMILLTDG--- 266 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 A N E R G+ SV V + + LL ++ + F + L Sbjct: 267 ---AATKPENVKPAAERARANGIVTVSVGVGSRVDETQLL-TIAGNASRVFKATNYDNLD 322 Query: 378 ESFDKITDKIQEQ 390 + + IQ+ Sbjct: 323 SVVEGVKSTIQDT 335 >gi|289177626|gb|ADC84872.1| Collagen adhesion protein [Bifidobacterium animalis subsp. lactis BB-12] Length = 905 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 36/397 (9%), Positives = 106/397 (26%), Gaps = 60/397 (15%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + T Q +T + + + G + D++ Sbjct: 68 ASASVAAFADDRQPAATADPQAATASAGNVDAPQHTKRISKNDDGTYTLSMDVTGKSDES 127 Query: 99 NPLQYI-AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 Q + + ++ L G + L + ++ + +S Sbjct: 128 TEQQVVPLDIALVLDVSGSMNELSGKLVYNEVELLSMNPISTYYVEKDGSYQAVRCSAIS 187 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 Q + + Y P + + ++ + ++D L ++ Sbjct: 188 -WGRCTTWQDQDSAGQKYTVTYNWIGGP----SASVSPDVQFYKSKQSEETRLDALKDAV 242 Query: 218 GNLVNSIQKAI----------------------QEKKNLSVRIGTIAYNIGIVGNQCTPL 255 ++ ++ + N + YN + Sbjct: 243 TYFLDQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDTYNEDGYNYNYSQTVHSLAWT 302 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 +L + ++ +N L T + HA ++L + + + +K +F +DG Sbjct: 303 PEDLQKEQAAVNSLKAGGATRADFGLQHAVKQLNSGRPGA---------QKLTVFYSDGS 353 Query: 316 NS--GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP------PEGQDLLRKCTDS---- 363 + + N ++ ++N ++ S+ + + + Sbjct: 354 PTSSDGFEAKIANNAIKAAAQLKNDHSQVISIGAMPGADPSGTDNANKFMNYVSSNYPKA 413 Query: 364 ----------SGQFFAVNDS-RELLESFDKITDKIQE 389 G ++ + +L F +I + Sbjct: 414 QSMSEPHDRVEGTYYYAVSARTDLQTIFKEIISIVTS 450 >gi|260837292|ref|XP_002613639.1| hypothetical protein BRAFLDRAFT_226979 [Branchiostoma floridae] gi|229299025|gb|EEN69648.1| hypothetical protein BRAFLDRAFT_226979 [Branchiostoma floridae] Length = 240 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 25/192 (13%), Positives = 67/192 (34%), Gaps = 20/192 (10%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + + ++V + + ++G + Y+ ++ Sbjct: 9 VDGSSSIPADEFEKVKTFLNSIVGHF-----DIGPTATQVGVVQYSSSPQQEFALNAHSS 63 Query: 259 LNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 L ++ + + TNT A+ A + L K V+ +TDG Sbjct: 64 LVSLQQAITNIIIIGRGTNTGSALTFARDVALTAANGAR-----PGLPKIVVTMTDG--- 115 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 ++ + L + +RN G+ +++ V++ + + S + F +D L Sbjct: 116 -----ASSEDVLTPSQNLRNDGVITFAIGVTSRASDWQV-EEIAGSLDRVFTASDFDALD 169 Query: 378 ESFDKITDKIQE 389 ++ ++ E Sbjct: 170 NIKVTLSSQLCE 181 >gi|186683831|ref|YP_001867027.1| von Willebrand factor A [Nostoc punctiforme PCC 73102] gi|186466283|gb|ACC82084.1| von Willebrand factor, type A [Nostoc punctiforme PCC 73102] Length = 615 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 18/142 (12%), Positives = 38/142 (26%), Gaps = 10/142 (7%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y+ + N +K + K+ TN + + Sbjct: 78 VVVYDDEVDSVVPPQAVTNKATLKDSIRKVRAGGITNLSGGWLKGCEHVKTRLDPQKINR 137 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 V+ +TDG + Q+ G+ ++ + L+ Sbjct: 138 --------VLLLTDGHANMG--IQDPKVLTATSGQKAEEGITTTTLGFAQGFNEDLLIGM 187 Query: 360 CTDSSGQFFAVNDSRELLESFD 381 + G F+ + E E F Sbjct: 188 ARAARGNFYFIQSIDEATEVFS 209 >gi|327310820|ref|YP_004337717.1| hypothetical protein TUZN_0922 [Thermoproteus uzoniensis 768-20] gi|326947299|gb|AEA12405.1| hypothetical protein TUZN_0922 [Thermoproteus uzoniensis 768-20] Length = 509 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 45/132 (34%), Gaps = 5/132 (3%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKES---SHNTIGSTRLKKFVIFITDGENSGASA 321 ++ K+ P T Y A+ +L K + K ++ ITDGE Sbjct: 105 KIEKIKPGTYTLLYQALLQVIDDLRGIKRGLPLMPRRAVPENIPKRIVVITDGEP--WPY 162 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 Y + + G+ I ++ + + L S G ++ ++ R++ + Sbjct: 163 YTEERWYEHLGKAAARYGITISAIGIGDDYNEKILYALANSSGGAWYHISQIRDISQVLA 222 Query: 382 KITDKIQEQSVR 393 + + R Sbjct: 223 NELRRAKTVVAR 234 >gi|320537259|ref|ZP_08037219.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] gi|320145887|gb|EFW37543.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] Length = 332 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 54/196 (27%), Gaps = 41/196 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENT 275 + + + I + + P + + +RLN L E T Sbjct: 111 TRIQAAKDIITDFVQTYPADAFGLTALASTAALVIPPTIQHEQFFARLNSLQIGELGEGT 170 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + A +T + +I +TDGE++ + N E + Sbjct: 171 ALGMGLAVAAAHF----------AKNTVKTQSIILLTDGESNTGEIHPNL-----AAELI 215 Query: 336 RNAGMKIYSVAVSAPPE----------------------GQDLLRKCT-DSSGQFFAVND 372 ++ + Y + + + LR+ +G + + Sbjct: 216 KSKKIGFYIIGIGKDGYANLEYVDPSTGEKREGTLQTIFNERELRELAHRGNGIYVSAKS 275 Query: 373 SRELLESFDKITDKIQ 388 L E F I+ I Sbjct: 276 FASLQEIFKNISQNIS 291 >gi|198430141|ref|XP_002124276.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 606 Score = 59.5 bits (142), Expect = 8e-07, Method: Composition-based stats. Identities = 37/320 (11%), Positives = 84/320 (26%), Gaps = 36/320 (11%) Query: 81 NAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE 140 G+ D + E + L+ + + + Sbjct: 298 TNGNEYNSVCTFACADGYGLSGSENVTCTDVEEWSGPFPTCELLTCPTFTIRGGNAVCSD 357 Query: 141 RSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP--PPPKKSFWSKNTTKSK 198 ++ N + N ++ P P Sbjct: 358 GNNVNSDCLATCDDPAHVIYPPSFETVTCQANTTWTDDAPCCVLPCPPHVPTDLVIILDA 417 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL--S 256 + R+I + ++Q I+ + S + YN L + Sbjct: 418 SSSVNEDNWRRIKSI---------TVQLIIKFMSSESTQFAVFRYNNRPDVATQILLQNT 468 Query: 257 NNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N+ + + + + T T A+ HA + + + V+ +TDG+ Sbjct: 469 NDAAALLNAIENIPYGGVGTRTGIALGHAADVILHTDNGNRPKAAD-----IVLIMTDGQ 523 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG-----QDLLRKCTDSSGQFFA- 369 + NT + + N G + +V + G ++LL + F Sbjct: 524 VNV------NDNTQVPADALHNMGATVMAVGIEPEKFGGRFQLEELLDITQQNRNHIFLP 577 Query: 370 -----VNDSRELLESFDKIT 384 V+ E+ + + + Sbjct: 578 DGSTVVSTVDEIKSAVENLA 597 >gi|3236370|gb|AAC23667.1| type VI collagen alpha 3 subunit [Mus musculus] Length = 2657 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L + +V S+ + VR+ + Y+ + V S Sbjct: 429 GVRSGFPLLKDFVQRVVESL-----DVGPDRVRVALVQYSDRTRPEFYLNSHMDQQGVIS 483 Query: 265 RLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S + + +I +T + Sbjct: 484 AIRRLTLLGGTTPNTGAALEFVLRNILTSSTGSRIA---EGVPQLLIVLTAEPSGDDVRG 540 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ REL Sbjct: 541 PSVV--------LKQGGAV--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQRV 590 Query: 383 ITDKIQEQ 390 I++++ + Sbjct: 591 ISERVIQL 598 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 58/173 (33%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + S+R+G + YN + ++ +N Sbjct: 1038 INFRRDSFQEVLRFASVIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1097 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N + R+ + IT G+ + Sbjct: 1098 KVVYKGGRHANTRVGIEH---LLRNHFVPEAGSRLDVRVPQIAFVITGGK--------SV 1146 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1147 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1197 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 60/181 (33%), Gaps = 20/181 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + A I + + +V + VRIG + ++ + + + Sbjct: 833 SSDAVKPDGIAHIRDFVSRIVRRLN-----IGPSKVRIGVVQFSNDVFPEFYLKTHKSQS 887 Query: 261 EVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V + +L + NT A+ R L+ S + + + ++ G++ Sbjct: 888 SVLEAIRRLRFKGGSPLNTGRALEFVARNLF---VKSAGSRIEDGVPQHLVLFLGGKSQD 944 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 A + + ++G+ S+ + + L+ T+ F V + REL Sbjct: 945 DVARH--------AQVISSSGIV--SLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPN 994 Query: 379 S 379 Sbjct: 995 I 995 >gi|260823774|ref|XP_002606843.1| hypothetical protein BRAFLDRAFT_103549 [Branchiostoma floridae] gi|229292188|gb|EEN62853.1| hypothetical protein BRAFLDRAFT_103549 [Branchiostoma floridae] Length = 1317 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 17/125 (13%), Positives = 43/125 (34%), Gaps = 24/125 (19%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T T AM A L + + ++ K V +TDG ++G + L Sbjct: 155 AGGGTYTKGAMIKAQEVLRHARPNAT---------KAVFLMTDGYSNGG-------DPLP 198 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQ 388 ++ ++I++ + + L+ + ++ E F+ + + Sbjct: 199 EARKLKQNDVQIFTFGIRSG--NVKELQNMATDPAEEHSYFLDSFAE----FEALARRAL 252 Query: 389 EQSVR 393 + ++ Sbjct: 253 HEDLQ 257 >gi|156382097|ref|XP_001632391.1| predicted protein [Nematostella vectensis] gi|156219446|gb|EDO40328.1| predicted protein [Nematostella vectensis] Length = 286 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 55/145 (37%), Gaps = 12/145 (8%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSN----NLNEVKSRLNKLNPY-ENTNTYPAMH 282 + + + +G I ++ V + N +K +++L T A+ Sbjct: 126 MGDISDQGTHVGIITFSTDPVIDIPFDKYKGVKMNAVNIKRDIDELRRKKGYTFIDKALT 185 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 A + L+ + +K + ++DG + T+ ++ G+++ Sbjct: 186 LADKSLFTQ-----EAGMREDSQKVAVLMSDGIQTKDRGPFTP--TIIAANRLKMKGVQV 238 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQF 367 Y+V + A + +L+ ++ +G F Sbjct: 239 YTVGIGASIDVLELMWIASEDTGLF 263 >gi|89053332|ref|YP_508783.1| von Willebrand factor, type A [Jannaschia sp. CCS1] gi|88862881|gb|ABD53758.1| von Willebrand factor type A [Jannaschia sp. CCS1] Length = 686 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 20/158 (12%), Positives = 46/158 (29%), Gaps = 12/158 (7%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y +++ + + L L +TN + AYR + Sbjct: 367 IVTYAGSAGVALEPTAASDTATINAALTTLQAGGSTNGVGGLEEAYRLA--------GEM 418 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 V+ TDG+ + + R+ G+ + + D ++ Sbjct: 419 MVDGEVSRVLLATDGDFNVGL--SDAGALEDYIAEQRDTGIYLSVLGFGRGNLQDDTMQA 476 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + L E+ + D++ IA + Sbjct: 477 LAQNGNG--TASYIDTLHEAQRVLVDQLAGALYPIADD 512 >gi|324514578|gb|ADY45916.1| Collagen alpha-5(VI) chain [Ascaris suum] Length = 432 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 20/156 (12%), Positives = 50/156 (32%), Gaps = 17/156 (10%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + V T + + +N ++ + ++ T A+ R Sbjct: 276 PRFTRVAFITFSSVGKSRTHFNLNRYDNAQQIIEAIRRVESTGGT---TAVGEGIRIATQ 332 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 ++E + KK ++ TDG ++ + ++ + + AG +YS+ Sbjct: 333 QQEKRMGGRPISIAKKAMLIFTDGWSNKG------PDPEEMSKEAKAAGFVLYSIGYEGN 386 Query: 351 PEGQDL--------LRKCTDSSGQFFAVNDSRELLE 378 + D+ ++ + EL+E Sbjct: 387 GRPDAEFAGLNQYTMDAIADTMHHVYSERNFSELVE 422 >gi|113931618|ref|NP_001039260.1| inter-alpha (globulin) inhibitor H2 [Xenopus (Silurana) tropicalis] gi|72679293|gb|AAI00666.1| hypothetical protein MGC107982 [Xenopus (Silurana) tropicalis] Length = 942 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 52/171 (30%), Gaps = 11/171 (6%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + I S + + + P TN A+ A L Sbjct: 337 PDDQFSIIDFNHNIRCWKDELVYASSVEKQDASKYVQSIQPNGGTNINEALLRAIFILKE 396 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 S ++ ++DG+ + + + + +R+ ++S+ + Sbjct: 397 ASNKGMLEPNS---VSLIVLVSDGDPTVGEIKLSKIQ-KNVRTNIRDD-FALHSLGIGFD 451 Query: 351 PEGQDLLRKCTDSS-GQFFAV----NDSRELLESFDKITDKIQEQSVRIAP 396 D L + + G + + + +L E + K++ + + + P Sbjct: 452 V-DYDFLERLAQENHGMAQRIYGKQDTAAQLKEFYKKVSTPLLKNIIVNYP 501 >gi|86606733|ref|YP_475496.1| hypothetical protein CYA_2093 [Synechococcus sp. JA-3-3Ab] gi|86555275|gb|ABD00233.1| hypothetical protein CYA_2093 [Synechococcus sp. JA-3-3Ab] Length = 431 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 61/189 (32%), Gaps = 18/189 (9%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT-------PLSNNLNEVKSRLNK 268 + L++ + + + A Q +++ ++ + + Sbjct: 209 AVFGLLDQFRATPLDLGAVLRFDNRGAGFGATALGQPLRTAQLLQDFTSDKQLLRRGVLR 268 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 NP NT Y A A R L + + + ++ TDG ++ S + Sbjct: 269 ANPGGNTALYDATVEAGRFLSDFRPTERFNR-------RLVVFTDGIDNE-STRSINQAS 320 Query: 329 LQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT-D 385 ++ R G + +Y V + +L R + G F S L F + Sbjct: 321 QELTTLARERGQKLTVYVVGLGVDLNLLELQRLAAATEGTFVLARFSEGLEAPFANLFPA 380 Query: 386 KIQEQSVRI 394 I E +++ Sbjct: 381 AIGEHRLQV 389 >gi|313126713|ref|YP_004036983.1| mg-chelatase subunit chld [Halogeometricum borinquense DSM 11551] gi|312293078|gb|ADQ67538.1| Mg-chelatase subunit ChlD [Halogeometricum borinquense DSM 11551] Length = 785 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 47/135 (34%), Gaps = 20/135 (14%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + PL N + R+ +L T+ + A + L ++ Sbjct: 418 TYSVAERQPLGPNREALADRIRRLQAGGATDIAGGLRGAGKMLGDD-------------P 464 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 VI I+DG + ++ + +R+ G +I ++ P ++L S G Sbjct: 465 GTVILISDG-------HDRVEESISYAKQLRSEGKRIIAIGAGKNPNEKNLRTIARASGG 517 Query: 366 QFFAVNDSRELLESF 380 +F ++ L F Sbjct: 518 SYFRATETNRLNILF 532 >gi|149067641|gb|EDM17193.1| rCG39970 [Rattus norvegicus] Length = 1163 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 31/192 (16%), Positives = 63/192 (32%), Gaps = 23/192 (11%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + R + + L+ S + Y+ + + N+ + Sbjct: 160 SGSINQRDFAQMKDFVKALMGEFAST-------STLFSLMQYSNILKTHFTFTEFKNILD 212 Query: 262 VKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +S ++ + T T + EL++ K S KK ++ ITDG+ Sbjct: 213 PQSLVDPIVQLQGLTYTATGIRTVVEELFHSKNGSR-----KSAKKILLVITDGQ----- 262 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRE 375 Y++ L + AG+ Y++ V P L + F V + Sbjct: 263 KYRDPLEYSDVIPAADKAGIIRYAIGVGDAFQEPTALKELNTIGSAPPQDHVFKVGNFAA 322 Query: 376 LLESFDKITDKI 387 L ++ +KI Sbjct: 323 LRSIQRQLQEKI 334 >gi|301788660|ref|XP_002929747.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Ailuropoda melanoleuca] Length = 946 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 29/228 (12%), Positives = 77/228 (33%), Gaps = 25/228 (10%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 L P PK + + + S + + ++ + +L Q ++ + N + Sbjct: 299 FAPENLDPIPKNILFVIDVSGSMWGI---KMKQTVEAMKTILDDLRAEDQFSVIDF-NHN 354 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 VR + K + K+ P TN A+ A L Sbjct: 355 VRTWRNDLVSATKTQIV--------DAKKYIEKIQPSGGTNINEALLRAIFILNEANNLG 406 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 S +I ++DG+ + + + + + +R+ + ++S+ + D Sbjct: 407 MLDPES---VSLIILVSDGDPTVGELKLSKIQ-KNVKQNIRD-NIALFSLGIGFDV-DYD 460 Query: 356 LLRKCTDSSGQ------FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L++ + + + + S +L + +++++ + P+ Sbjct: 461 FLKRL-SNENRGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYPH 507 >gi|47217883|emb|CAG05005.1| unnamed protein product [Tetraodon nigroviridis] Length = 647 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 53/163 (32%), Gaps = 19/163 (11%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKES 294 R+G + Y + + EV+ + + T T A+ +A + E + Sbjct: 91 TRVGLLQYGSVVQPEFSLSTFSTKAEVEQAVRNMKHLATGTMTGLAIQYAAETSFTEADG 150 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + + + +TDG ++ R AG++I+++ V Sbjct: 151 ARPA--HLHIPRIAVVVTDGRPQD--------RVEEVAAQARQAGIQIFAIGVGR--VDM 198 Query: 355 DLLRKCTDSS--GQFFAVNDSRE---LLESFD-KITDKIQEQS 391 L+ V + L+ F K+ ++ E Sbjct: 199 KTLKTIGSEPHSEHVHLVASFSQMETLVSVFQSKLCREMCELL 241 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 18/186 (9%), Positives = 60/186 (32%), Gaps = 42/186 (22%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + +G + Y+ + + ++ ++++ + T A+ ++ Sbjct: 438 DVSPTGAHVGLLQYSTNVRTEFTLSQHTSAQGIRQAVSRMQYMGRGSMTGSALRRMFQSS 497 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM-------- 340 ++ +E + + + + TDG + + + + +N+G+ Sbjct: 498 FSAEEGAR-----PNVPRVSVVFTDGRSQD--------DASEWAKKAKNSGIPGSFSYFG 544 Query: 341 ----------------KIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDK 382 IY+V V + LR+ + + +++ E +K Sbjct: 545 GTGRFLSCSFLLVLGVTIYAVGVGKAI--EQELREIASEPEEKHLYYAQEFKDVGEITEK 602 Query: 383 ITDKIQ 388 + ++ Sbjct: 603 LKSRMC 608 >gi|326670660|ref|XP_003199262.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1404 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 54/170 (31%), Gaps = 15/170 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 + RI + Y+ N E+ + L T A+ + Sbjct: 41 DVAENRDRISVVQYSREPEANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTGEALQYVRDN 100 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + ++ ++ G ++ + ++ G+ + + Sbjct: 101 VFTASSGSRRL---EGVPQILVLLSGG--------RSFDSVDAAASSLKELGVL--TFGI 147 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + L++ + +V+D EL +++ +Q S+ + P Sbjct: 148 GSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPT 197 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 54/170 (31%), Gaps = 15/170 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 + RI + Y+ N E+ + L T A+ + Sbjct: 840 DVAENRDRISVVQYSREPEANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTGEALQYVRDN 899 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + ++ ++ G ++ + ++ G+ + + Sbjct: 900 VFTASSGSRRL---EGVPQILVLLSGG--------RSFDSVDAAASSLKELGVL--TFGI 946 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + L++ + +V+D EL +++ +Q S+ + P Sbjct: 947 GSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPT 996 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 32/337 (9%), Positives = 100/337 (29%), Gaps = 33/337 (9%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + ++++ + + + + ++ A L Sbjct: 90 TGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSVDAAAS-----------SLK 138 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 + + S G R + ++ L VS E +Q+ + ++ + P Sbjct: 139 ELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTS-IPVTPT 197 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK--NLSVRIGTIA 242 +T + D S + + +Q+ +++ R+ + Sbjct: 198 SPTVTAEYSTPRKDVVFL----LDGSDGTRSSFPAMRDFVQRVVEKFNIEANRDRVSVVQ 253 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIG 300 Y+ + ++ R+ L T A+ + ++ S Sbjct: 254 YSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTGAALQYVRDNVFTASSGSRRL-- 311 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + +I ++ G ++ + ++ G+ ++ + + L++ Sbjct: 312 -EGVPQILILLSGG--------RSFDSVDAAASSLKELGVL--TLGIGSRGSDSRELQRI 360 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + +V D EL +++ +Q ++ P Sbjct: 361 SYEANYALSVADFSELPNVQEQLLASVQTVAMSGTPT 397 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 32/337 (9%), Positives = 100/337 (29%), Gaps = 33/337 (9%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + ++++ + + + + ++ A L Sbjct: 889 TGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSVDAAAS-----------SLK 937 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 + + S G R + ++ L VS E +Q+ + ++ + P Sbjct: 938 ELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTS-IPVTPT 996 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK--NLSVRIGTIA 242 +T + D S + + +Q+ +++ R+ + Sbjct: 997 SPTVTAEYSTPRKDVVFL----LDGSDGTRSSFPAMRDFVQRVVEKFNIEANRDRVSVVQ 1052 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIG 300 Y+ + ++ R+ L T A+ + ++ S Sbjct: 1053 YSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTGAALQYVRDNVFTASSGSRRL-- 1110 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + +I ++ G ++ + ++ G+ ++ + + L++ Sbjct: 1111 -EGVPQILILLSGG--------RSFDSVDAAASSLKELGVL--TLGIGSRGSDSRELQRI 1159 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + +V D EL +++ +Q ++ P Sbjct: 1160 SYEANYALSVADFSELPNVQEQLLASVQTVAMSGTPT 1196 >gi|297473612|ref|XP_002686715.1| PREDICTED: collagen, type XXVIII-like, partial [Bos taurus] gi|296488704|gb|DAA30817.1| collagen, type XXVIII-like [Bos taurus] Length = 327 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 64/190 (33%), Gaps = 26/190 (13%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + ++ L + + + R+G I Y+ + ++ + Sbjct: 18 SSESVGLKNFQIIKNFVKTLTDRVAL-----DLATTRVGIINYSHKVEEVAHLTQFSSKD 72 Query: 261 EVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++K ++ + T T A+H A R K +KK + ITDG+ Sbjct: 73 DLKRAVDNMQYLGEGTYTATALHAANRMFEAAKPG---------VKKVALVITDGQTDT- 122 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ-----DLLRKCTDSSG--QFFAVND 372 ++ N ++ + + ++I+ + V + + + +D Sbjct: 123 ---RDEKNLTEVVKNASDINVEIFVIGVVKRNDPNFHIFHQEMNLIATDPDSEHVYLFDD 179 Query: 373 SRELLESFDK 382 L ++ + Sbjct: 180 FITLEDTLKQ 189 >gi|194666191|ref|XP_604080.4| PREDICTED: collagen, type XXVIII-like [Bos taurus] Length = 1147 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 64/190 (33%), Gaps = 26/190 (13%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + ++ L + + + R+G I Y+ + ++ + Sbjct: 804 SSESVGLKNFQIIKNFVKTLTDRVAL-----DLATTRVGIINYSHKVEEVAHLTQFSSKD 858 Query: 261 EVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++K ++ + T T A+H A R K +KK + ITDG+ Sbjct: 859 DLKRAVDNMQYLGEGTYTATALHAANRMFEAAKPG---------VKKVALVITDGQTDT- 908 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ-----DLLRKCTDSSG--QFFAVND 372 ++ N ++ + + ++I+ + V + + + +D Sbjct: 909 ---RDEKNLTEVVKNASDINVEIFVIGVVKRNDPNFHIFHQEMNLIATDPDSEHVYLFDD 965 Query: 373 SRELLESFDK 382 L ++ + Sbjct: 966 FITLEDTLKQ 975 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 57/156 (36%), Gaps = 16/156 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 D + G+L + I + +++ + ++ + + +L K R+ Sbjct: 63 FDKQKDFVGSLSDKIFQLTPVGSLKYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQRVKS 122 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N T +Y A+ +A R L E K + +TDG + + + Sbjct: 123 MNFIGQGTFSYYAIANATRLLKREGRKDGM--------KVAVLMTDGID-----HPKNPD 169 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 I E R AG+ ++ +SA + LR + Sbjct: 170 VQSISEDARTAGILFITIGLSA-VVNETKLRLISGD 204 >gi|83312059|ref|YP_422323.1| hypothetical protein amb2960 [Magnetospirillum magneticum AMB-1] gi|82946900|dbj|BAE51764.1| hypothetical protein [Magnetospirillum magneticum AMB-1] Length = 1171 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 52/142 (36%), Gaps = 16/142 (11%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 L+ + +S L ++P T+ A++ A L + + + K V+ +D Sbjct: 117 TLAATPAQARSALQAISPGGATSIAAALNQAVELLAHGRPG---------MDKVVVLCSD 167 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 G++ A I +++ +V L D +F + + Sbjct: 168 GQDDIAEIADALARLKAIPS------VRVLAVGFGDEVIHATFLAMVADRQD-YFHLTRA 220 Query: 374 RELLESFDKITDKIQEQSVRIA 395 R++ + F ++ ++ + +A Sbjct: 221 RDMDDVFQRLAKEVNGPTGLLA 242 >gi|167526012|ref|XP_001747340.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774175|gb|EDQ87807.1| predicted protein [Monosiga brevicollis MX1] Length = 1632 Score = 59.5 bits (142), Expect = 9e-07, Method: Composition-based stats. Identities = 36/262 (13%), Positives = 81/262 (30%), Gaps = 25/262 (9%) Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 A T + + I +VL RS D N ++ + Sbjct: 271 GRAPTTFNEGRSAFITTQEGTPEAMQILVLAAIRSKFDWISYVDNTTLRVSVIREGCKSA 330 Query: 185 PKKSFW---SKNTTKSKYAPAPAPANRKI-DVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 F S + + Y P KI D + + + + R+G Sbjct: 331 IDLIFVLDGSGSIDRESYGGTPGNFQYKILDFVKQVVSYF---------DISANATRVGV 381 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTI 299 I ++ V N + +++ ++ +N P +T A+ + + E Sbjct: 382 ITFSSSAVINFNLNSFYDKSDMLDAIDNINYPASSTRISLALASVRQNMLKEYNGMRPE- 440 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 S + K V+ +TDG+ + +++ G+ ++S+ + + D L Sbjct: 441 -SEGVPKVVVVLTDGQ------ASSGYEPAYEAALLKDMGVNMFSIGIGS-SIDTDQLED 492 Query: 360 CTDSS--GQFFAVNDSRELLES 379 + + + + + Sbjct: 493 MATAPLASHMHLLKNFDAIPDI 514 >gi|264679151|ref|YP_003279058.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni CNB-2] gi|262209664|gb|ACY33762.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni CNB-2] Length = 1405 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 38/316 (12%), Positives = 85/316 (26%), Gaps = 15/316 (4%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ +I + + T++ K + + +A IT Sbjct: 552 LAAPGAISVSEALASGSGTFTITASAGLKLLTLDGPGNADASLTLDRLADLVNNPITLT- 610 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 + + + + + + D+ Sbjct: 611 --TSKGTLTLTGYDATTGKVSYTYQTSGQQAHTGDDTNVQDHFQITVEDKFGGKATGDLG 668 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + D + + + S + + ++DVL S Sbjct: 669 VLITDTAPSLKPIAESSALSSHGTNIMLTLDTSGSMAWSSGVNNSNGWSLSRLDVLKSSV 728 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 L++ +A VR+ + +N +S L E K+ +N L TN Sbjct: 729 NGLLDKYGEA------GDVRVLILEFNSSATQKGSGWMS--LAEAKTFVNGLYADGGTNY 780 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ A N + ++ F TDGE + ++ T +++ + Sbjct: 781 QDALTKAMAAWNNSGTG---KLEGNNVQNISYFFTDGEPDSNRSVSSSQQT-TWEKFLAD 836 Query: 338 AGMKIYSVAVSAPPEG 353 + Y + + G Sbjct: 837 NHINSYGIGLGTGATG 852 >gi|12583699|dbj|BAB21479.1| integrin alpha Hr1 precursor [Halocynthia roretzi] Length = 1332 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 45/345 (13%), Positives = 102/345 (29%), Gaps = 36/345 (10%) Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQ-INITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + + I+ + + + AE+K Q + + GL Sbjct: 70 SYYGNSFSLIRQGDNAISVVVGAPKAIDQNQISLHRTKATAETKPQGFLQRCPINFSGLP 129 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 ++ S + ++ + V+ S N K Sbjct: 130 GLPCKYVNPPSIEPGDNIGLTASVQPEGDI-VNCSPTRELKCSSMKYNPGFCYKSTDYGG 188 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ--KAIQEKKNLSVRIGTIA 242 + S+NT + ++ + + ++ A + VR+G + Sbjct: 189 NWRKEGSRNTECPSSGVDVLFVLDGSGSVGKNFDKVKDWVKNITAKLDIGKEIVRVGVVQ 248 Query: 243 YNIGIVGNQCTPLSNNLNEVK-----------SRLNKLNPYENT-NTYPAMHHAYRELYN 290 Y+ + G E+ + ++++ T T A+ R+ + Sbjct: 249 YSHYVEGKSINKQKYITTEISIGEFKLLDNFENAVDRIQLQGYTTYTGRALQKVIRDFDD 308 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + K+ ++ +TDG+ ++ L +RN G+ ++V V Sbjct: 309 AYIGN---------KQVLLLLTDGQ------AKDNKLILPNANRLRNKGIATFAVGVG-- 351 Query: 351 PEGQDLLRKCTD---SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 L+ S+ + F V D EL + +IQ + Sbjct: 352 EYDISELKLIASGTDSTDRVFTVTDFGELDSIVKSLQTEIQSFVL 396 >gi|239814248|ref|YP_002943158.1| von Willebrand factor type A [Variovorax paradoxus S110] gi|239800825|gb|ACS17892.1| von Willebrand factor type A [Variovorax paradoxus S110] Length = 345 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 19/173 (10%), Positives = 58/173 (33%), Gaps = 46/173 (26%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN------------------------- 290 + N +++ + ++ T T A+ + L+ Sbjct: 144 TTNHDDLITAIDSFQLQRATATGNAIVVSLATLFPDAGIDVSQFSAPSRQRGTPIDQAEK 203 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + VI +TDG+ + ++ L + + G++IY+V V Sbjct: 204 QAKEFTPVAPGSYTSAAVIMLTDGQRTTG------VDPLDAAKAAADRGVRIYTVGVGTV 257 Query: 351 PE--------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ 388 ++ L+ + + ++F + +L + ++ ++ ++ Sbjct: 258 DGETIGFEGWSMRVRLDEETLKAVANKTQAEYFYAGTAADLKKVYETLSSRLT 310 >gi|309361123|emb|CAP30209.2| hypothetical protein CBG_10938 [Caenorhabditis briggsae AF16] Length = 579 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 20/113 (17%), Positives = 37/113 (32%), Gaps = 19/113 (16%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T T A+ K + ++ TDG + +T + Sbjct: 477 YSGTTFTNQALKRMALLFEASKRDNCKMK--------LLVFTDGY--------SAEDTAE 520 Query: 331 ICEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSGQFFAVNDSRELLESF 380 E ++ G+ +Y+V +S L+ S +F +D LL+ F Sbjct: 521 GIEALKRQGITVYTVGISTDKNAGLNVSELKGMATSPSHYFDSSDFDNLLKHF 573 >gi|260818477|ref|XP_002604409.1| hypothetical protein BRAFLDRAFT_220331 [Branchiostoma floridae] gi|229289736|gb|EEN60420.1| hypothetical protein BRAFLDRAFT_220331 [Branchiostoma floridae] Length = 192 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 57/187 (30%), Gaps = 22/187 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAI---QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + ++ + + K + + S R+G + Y+ + Sbjct: 22 VLDGSGSVTDANFDKMKQFAKNVVNAFDISASSTRVGVVQYSDSNTLEFNLGDHADKPST 81 Query: 263 KSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + ++ + T T A+ A + + K +I +TDG+++ + Sbjct: 82 LAAIDSIVYQGGGTRTGSALEFA---------RVNAAWRGESVPKVMIVVTDGKSADSVT 132 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + + G+ +Y++ V LL + + D L + Sbjct: 133 SS--------ANNLASQGVDVYAIGVG-NYRSTQLLEIAAGNQNNVIELTDFNALSAEIE 183 Query: 382 KITDKIQ 388 +I + Sbjct: 184 QIAQAVC 190 >gi|326670662|ref|XP_003199263.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1024 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 54/170 (31%), Gaps = 15/170 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 + RI + Y+ N E+ + L T A+ + Sbjct: 235 DVAENRDRISVVQYSREPGANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTGEALQYVRDN 294 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + ++ ++ G ++ + ++ G+ + + Sbjct: 295 VFTASSGSRRL---EGVPQILVLLSGG--------RSFDSVNAAASSLKELGVL--TFGI 341 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + L++ + +V+D EL +++ +Q S+ + P Sbjct: 342 GSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPT 391 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/328 (9%), Positives = 96/328 (29%), Gaps = 33/328 (10%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + ++++ + + + + + A L Sbjct: 284 TGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSVNAAAS-----------SLK 332 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 + + S G R + ++ L VS E +Q+ + ++ + P Sbjct: 333 ELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTS-IPVTPT 391 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK--NLSVRIGTIA 242 +T + D S + + +Q+ +++ R+ + Sbjct: 392 SPTVTAEYSTPRKDVVFL----LDGSDGTRSSFLAMRDFVQRVVEKFNIEANRDRVSVVQ 447 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIG 300 Y+ + ++ R+ L T A+ + ++ S Sbjct: 448 YSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTGAALQYVRDNIFTASSGSRRL-- 505 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + +I ++ G ++ + ++ G+ ++ + + L++ Sbjct: 506 -EGVPQILILLSGG--------RSFDSVDAAASSLKELGVL--TLGIGSRGSDSRELQRI 554 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQ 388 + + +V D EL +++ +Q Sbjct: 555 SYEANYALSVADFSELPNVQEQLLASVQ 582 >gi|301064778|ref|ZP_07205158.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300441153|gb|EFK05538.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 625 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 52/162 (32%), Gaps = 51/162 (31%) Query: 254 PLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + + ++ L L+ P T+ A++ A + K ++ Sbjct: 145 PLTLDYDAFRNSLEALDTNIIPQGGTDIASAIYEA-----------EAAFNNDANHKILV 193 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------- 352 ++DGE+ L + + + IY+V V P Sbjct: 194 LVSDGEDLEG-------EALSAAQAAKERDLTIYTVGVGTPSGELIPLVQGGKDGAFVKD 246 Query: 353 ----------GQDLLRKCTD-SSGQFFAV-NDSRELLESFDK 382 + +L+K + + G++ + + L + + Sbjct: 247 EKGQPVKSRLDETMLQKIAEATGGRYEPLGQQAEGLEAIYRE 288 >gi|297526263|ref|YP_003668287.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297255179|gb|ADI31388.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 416 Score = 59.5 bits (142), Expect = 1e-06, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 57/212 (26%), Gaps = 10/212 (4%) Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 S K + A + + + Sbjct: 10 MGKDSIIEGREDTIPFVLSIKGVYSAHPPIAFLIVIDTSYSMDGEKIFRAKQAALGLLDI 69 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 +G + + EV+ + L TN Y + Sbjct: 70 LRDKDYVGVYGFAGKFYKVLEPVPATKRGEVERAIISLKLGSGTNIYDTLKKLVE----- 124 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 E+ +IFITDGE + N L++ + +R AG + V Sbjct: 125 -ETKKVLQNGALSLVRIIFITDGEPTVGKK--NPKKILEMAKKLREAGASALIIGVGTE- 180 Query: 352 EGQDLLRKCTDS-SGQFFAVNDSRELLESFDK 382 + LL + + +G+F ++D L + + Sbjct: 181 YNEKLLSRMAMALNGEFEHISDPASLEKLISE 212 >gi|328784200|ref|XP_003250409.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Apis mellifera] Length = 2258 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/136 (16%), Positives = 50/136 (36%), Gaps = 25/136 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + +L ++ T T A+ AYR L + ++ KK V ITDG ++G Sbjct: 172 LNKQLTNISYTGGGTYTRGALLEAYRILEKARSNA---------KKAVFLITDGFSNGG- 221 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQF-FAVNDSRELLE 378 + ++ AG I++ + + L S + + ++ E Sbjct: 222 ------DPRPAANLLKGAGATIFTFGIRTGNVDE--LHDIASSPKDTYSYFLDSFTE--- 270 Query: 379 SFDKITDKIQEQSVRI 394 F+ + + + ++ Sbjct: 271 -FEALIRRALHRDLKT 285 >gi|297567411|ref|YP_003686383.1| hypothetical protein Mesil_3037 [Meiothermus silvanus DSM 9946] gi|296851860|gb|ADH64875.1| conserved hypothetical protein [Meiothermus silvanus DSM 9946] Length = 351 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/203 (13%), Positives = 65/203 (32%), Gaps = 42/203 (20%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + +IG I ++ + + ++ L+ + P +NT+ A+ R L KE Sbjct: 125 TTQIGLITFSDSASVVVAP--TTDRAVLQEALDNVKPVQNTSLPSAIVTGVRLLPGRKEV 182 Query: 295 SHNTI-----------------------GSTRLKKFVIFITDGENSGASAYQ--NTLNTL 329 ++ I+DG + S + N Sbjct: 183 QPPKELQPQNPQNPQPQNPLVQPDTPPIPREFPPGSLLVISDGATNVNSNPRLPNQTALE 242 Query: 330 QICEYMRNAGMKIYSVAVSAPPEG---------------QDLLRKCTDSSGQFFAVNDSR 374 ++ ++ G+KIY+ AV + L + + G++ Sbjct: 243 AAAKFAQDNGVKIYAFAVGKEGGAVVRIEGQDYFVPFEPRSLQQLAERTGGKYVYPPTEE 302 Query: 375 ELLESFDKITDKIQEQSVRIAPN 397 L + ++ I+ ++ ++ + Sbjct: 303 ALRAVYRELGTVIRWEATKLEVS 325 >gi|221130232|ref|XP_002156394.1| PREDICTED: similar to tyrosine kinase receptor [Hydra magnipapillata] Length = 1746 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 17/116 (14%), Positives = 40/116 (34%), Gaps = 11/116 (9%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T A+ + + + + + +TDG+ + + + Sbjct: 1219 NGWTRIDLALRKSLEMFEEINGAR------KNVPRLLFLLTDGKQETNEGGA--EDPVNV 1270 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL--LESFDKITD 385 + +R+ G++I +V + + L SS + F + EL + KI + Sbjct: 1271 AQLLRDRGVEIIAVGIGKGVN-RLELNNIAGSSDKVFLAENFDELIKQDFLKKIKE 1325 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 18/215 (8%), Positives = 57/215 (26%), Gaps = 26/215 (12%) Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA----------PAPAPANRK 209 + NN + S + + T + Sbjct: 1540 NSTMLTASTLINNKLVSIDQNPVAFDMWNKVVLSQTFNGTVHLEDPSCEGFFDVGFILDS 1599 Query: 210 IDVLIESAGNLVNSIQK--AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 L + V+ +++ + G + ++ + + ++ Sbjct: 1600 SGSLESNYSQEVDFLKQLASSFGISKQGSHAGVVTFSSEAKLSIQLDKYFTDADFNKAVD 1659 Query: 268 KLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + T A+ A +K S + K + +TDG Q Sbjct: 1660 DIPYMGGGTRIDLALEKAIELFDTKKGSRNEA------PKLLFLLTDGV-------QEPK 1706 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 + + ++ ++++++ + + +L++ Sbjct: 1707 MEIPVPNEIKQKIIQLFAIGIGSNVNKDELIKIVV 1741 >gi|326670656|ref|XP_003199260.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 1401 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 54/170 (31%), Gaps = 15/170 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 + RI + Y+ N E+ + L T A+ + Sbjct: 840 DVAENRDRISVVQYSREPEANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTGEALQYVRDN 899 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + ++ ++ G ++ + ++ G+ + + Sbjct: 900 VFTASSGSRRL---EGVPQILVLLSGG--------RSFDSVDAAASSLKELGVL--TFGI 946 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + L++ + +V+D EL +++ +Q S+ + P Sbjct: 947 GSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPT 996 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 53/170 (31%), Gaps = 15/170 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 + RI + Y+ N E+ + L T A+ Sbjct: 41 DVAENRDRISVVQYSREPEANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTGEALQFVRDN 100 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + ++ ++ G ++ + ++ G+ + + Sbjct: 101 VFTASSGSRRL---EGVPQILVLLSGG--------RSFDSVDAAASSLKELGVL--TFGI 147 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + L++ + +V+D EL +++ +Q S+ + P Sbjct: 148 GSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPT 197 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 32/337 (9%), Positives = 100/337 (29%), Gaps = 33/337 (9%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + ++++ + + + + ++ A L Sbjct: 889 TGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSVDAAAS-----------SLK 937 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 + + S G R + ++ L VS E +Q+ + ++ + P Sbjct: 938 ELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTS-IPVTPT 996 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK--NLSVRIGTIA 242 +T + D S + + +Q+ +++ R+ + Sbjct: 997 SPTVTAEYSTPRKDVVFL----LDGSDGTRSSFPAMRDFVQRVVEKFNIEANRDRVSVVQ 1052 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIG 300 Y+ + ++ R+ L T A+ + ++ S Sbjct: 1053 YSRDAEVHFYLNSYTKKEDILDRVKGLRHKGGRPLYTGAALQYVRDNVFTASSGSRRL-- 1110 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + +I ++ G ++ + ++ G+ ++ + + L++ Sbjct: 1111 -EGVPQILILLSGG--------RSFDSVDAAASSLKELGVL--TLGIGSRGSDSRELQRI 1159 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + +V D EL +++ +Q ++ P Sbjct: 1160 SYEANYALSVADFSELPNVQEQLLASVQTVAMSGTPT 1196 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 31/290 (10%), Positives = 90/290 (31%), Gaps = 22/290 (7%) Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 + + L + + S G R + ++ L VS E +Q+ Sbjct: 126 SFDSVDAAASSLKELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLA 185 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + ++ + P +T + D S + + +Q+ +++ Sbjct: 186 SVQVTS-IPVTPTSSTVTAEYSTPRKDVVFL----LDGSDGTRSSFPAMRDFVQRVVEKF 240 Query: 232 K--NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 R+ + Y+ + ++ R+ L T A+ + Sbjct: 241 NIEANRDRVSVVQYSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTGAALQYVRDN 300 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + +I ++ G ++ + ++ G+ ++ + Sbjct: 301 VFTASSGSRRL---EGVPQILILLSGG--------RSFDSVDAAASSLKELGVL--TLGI 347 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + L++ + + +V D EL +++ +Q ++ P Sbjct: 348 GSRGSDSRELQRISYEANYALSVADFSELPNVQEQLLASVQTVAMSGTPT 397 >gi|319649593|ref|ZP_08003749.1| hypothetical protein HMPREF1013_00353 [Bacillus sp. 2_A_57_CT2] gi|317398755|gb|EFV79437.1| hypothetical protein HMPREF1013_00353 [Bacillus sp. 2_A_57_CT2] Length = 461 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 21/116 (18%), Positives = 41/116 (35%), Gaps = 18/116 (15%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + P T + ++ AY EL +K +I +TDG+ + Sbjct: 6 SITPGGGTEIFTSLEQAYSELEE----------LQLQRKHIILLTDGQ------SATNGD 49 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 + E + + + +VA+ + LL G+F+ V DS + + Sbjct: 50 YELLIEGGKEKNITLSTVALGQDA-DRGLLEDLAGMGLGRFYDVTDSSIIPSILSR 104 >gi|312196063|ref|YP_004016124.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311227399|gb|ADP80254.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 560 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 26/137 (18%), Positives = 47/137 (34%), Gaps = 9/137 (6%) Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + TP S +L + L NT Y A+ AY ++ + + S Sbjct: 428 TVSDPTPGSADLKAISDYGAALRAGGNTAIYSALDAAYTTAAAGMKADPSALTS------ 481 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ +TDGEN+ G++ ++V + L + T + G Sbjct: 482 IVLMTDGENNRGL-DSAGFLARYNTRPPDVRGVRTFAVDFG-DADRAALTQIATSTGGAV 539 Query: 368 FAVNDSR-ELLESFDKI 383 F L + F +I Sbjct: 540 FDATAPGVSLSDVFREI 556 >gi|297562484|ref|YP_003681458.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296846932|gb|ADH68952.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 505 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/243 (11%), Positives = 71/243 (29%), Gaps = 24/243 (9%) Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 + +S + + + ++ P +++ T + + + ++D Sbjct: 1 MHLSALSDFDAVPRDTEDAVSVLVDITAPEREEETERPPATLQVVLDRSGSMGGGRLDGA 60 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + + +LV + + G +++N + V+ ++ L+ Sbjct: 61 VRALLSLVERLAPSDN--------FGLVSFNDQARVEVPCGPLEDKARVRRLISGLHASG 112 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T+ + +E T ++ I+DG + + L Q+ Sbjct: 113 GTDLSSGLLRGVQEARRAGADRGGT---------LLLISDGHANQGVTDHDLLR--QVAA 161 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSV 392 G+ S+ ++LL D G D I + + Sbjct: 162 DAYAHGVTTTSLGYGLG-YDEELLGAVADGGAGSALFAEDPDTAGGL---IAREAEYLLA 217 Query: 393 RIA 395 + A Sbjct: 218 KTA 220 >gi|260800525|ref|XP_002595179.1| hypothetical protein BRAFLDRAFT_241028 [Branchiostoma floridae] gi|229280423|gb|EEN51191.1| hypothetical protein BRAFLDRAFT_241028 [Branchiostoma floridae] Length = 284 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 54/166 (32%), Gaps = 15/166 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + +VR+ + Y + +L E + ++ + T T A+ A Sbjct: 32 DIGENAVRVSIVQYAAQVRTEFFLDQYYDLQEAQDAVDGIEYMGGFTLTGKAIDFATNLH 91 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ ++ + + K + ITDG + MR AG+ +V V Sbjct: 92 FDLRKGARA-----DVTKIAVVITDGRSYDDVNRP--------ARRMRQAGIVTIAVGVG 138 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 +D L +++ L + + + + +R Sbjct: 139 -NNLDRDQLTAIAGDPKTLLSLDGFDRLQDLTTSLPTMLCDGKIRT 183 >gi|297203405|ref|ZP_06920802.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] gi|197711494|gb|EDY55528.1| VWA domain-containing protein [Streptomyces sviceus ATCC 29083] Length = 421 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/192 (11%), Positives = 69/192 (35%), Gaps = 22/192 (11%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNLNEVKS 264 ++ + ++++ + +Q + + + E K+ Sbjct: 60 TRMAAAKRAFNEVLDATPEEVQLGIRTLGANYPGDNQKTGCKDTAQLYPVSTLDRTEAKT 119 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++ L+P T PA+ + + + K ++ I+DGE++ Sbjct: 120 QVATLSPTGWTPIGPALLKSAGDFTDSA-----------SSKRIVLISDGEDT-----CA 163 Query: 325 TLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFD 381 L+ ++ + G+ I ++ + + + L + + G + +V + EL + + Sbjct: 164 PLDPCEVAREIGAKGIGLTIDTLGLVPNTKMRKQLSCIAEATGGTYTSVEHADELTDKVN 223 Query: 382 KITDKIQEQSVR 393 ++ D+ + V Sbjct: 224 QLVDRAADPVVT 235 >gi|162447313|ref|YP_001620445.1| surface-anchored VWFA domain-containing protein [Acholeplasma laidlawii PG-8A] gi|161985420|gb|ABX81069.1| surface-anchored VWFA domain protein [Acholeplasma laidlawii PG-8A] Length = 486 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/321 (11%), Positives = 84/321 (26%), Gaps = 30/321 (9%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKNNPLQYIAES-KAQYEIPTENLFL 120 Q D + I I NN + A Y + Sbjct: 21 SGNYNGGYLQDENYNYIFNDDEHQEIIENPFIDVSVNNKSNISLSANTASYSFIRSQINS 80 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + + ++ D + + +Q +N L Sbjct: 81 GRAVDRNAVRIEEMVNFFNYNYNQPET-------DKTFGFKSELIQTPWNNETHLLLIGL 133 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 N A K+ + ++ L+ ++ + I Sbjct: 134 ETKQVDLGDIPSNIVILLDVSGSMSATNKLSLAKKAMELLIEQMK--------PNDVISL 185 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + Y+ G +++ + S++ L +T + AY+ N Sbjct: 186 VTYSSGEKVVFKGKSIDDMAYMTSQIRLLKASGSTAGKKGLDMAYKVAEEYFIEGGNNR- 244 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 +I TDG+ + + + L ++ R +G+ + + L + Sbjct: 245 -------IILATDGDFNVGISSTDML--IEYISEKRESGIYFSAYGFGYGNFKDEKLERV 295 Query: 361 T-DSSGQFFAVNDSRELLESF 380 +G + ++D ++F Sbjct: 296 AKAGNGTYHYIDDIISARKAF 316 >gi|149046637|gb|EDL99462.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_d [Rattus norvegicus] Length = 939 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/333 (8%), Positives = 87/333 (26%), Gaps = 23/333 (6%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E + +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPERNESESGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S E+ + + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKRNRDEDPTLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 + N +K +N + T+ A Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR---NKKVLKDAVNNITAKGITDYKKGFSFA 340 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + +L N S K ++ TDG +Y ++ +++++ Sbjct: 341 FEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVFT 386 Query: 345 VAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 +V + ++ ++ G ++ + + Sbjct: 387 FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|289547502|ref|NP_001166098.1| integrin, alpha 11a [Danio rerio] Length = 1190 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 67/197 (34%), Gaps = 27/197 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I E L+N ++K +++G + Y +V ++ +V Sbjct: 175 GSNSIYPWNEVQDFLINILRKFYVGPGQ--IQVGVLQYGEKVVSEFQLNDFRSVEDVVKA 232 Query: 266 LNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 K+ TNT ++ A E + + K +I ITDGE Sbjct: 233 ARKIGQRGGEETNTALGINVARSEAFKQGGRRGAK-------KVMIVITDGE------SH 279 Query: 324 NTLNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDS 373 ++ + Q+ E G+ Y++AV + L + FF V D Sbjct: 280 DSADLQQVIEESEKDGITRYAIAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDE 339 Query: 374 RELLESFDKITDKIQEQ 390 L + D + ++I Sbjct: 340 AALKDIVDALGERIFSL 356 >gi|55793317|gb|AAV65699.1| integrin alpha 11 subunit [Danio rerio] Length = 1168 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 67/197 (34%), Gaps = 27/197 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I E L+N ++K +++G + Y +V ++ +V Sbjct: 153 GSNSIYPWNEVQDFLINILRKFYVGPGQ--IQVGVLQYGEKVVSEFQLNDFRSVEDVVKA 210 Query: 266 LNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 K+ TNT ++ A E + + K +I ITDGE Sbjct: 211 ARKIGQRGGEETNTALGINVARSEAFKQGGRRGAK-------KVMIVITDGE------SH 257 Query: 324 NTLNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDS 373 ++ + Q+ E G+ Y++AV + L + FF V D Sbjct: 258 DSADLQQVIEESEKDGITRYAIAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDE 317 Query: 374 RELLESFDKITDKIQEQ 390 L + D + ++I Sbjct: 318 AALKDIVDALGERIFSL 334 >gi|229490509|ref|ZP_04384348.1| putative von Willebrand factor, type A [Rhodococcus erythropolis SK121] gi|229322588|gb|EEN88370.1| putative von Willebrand factor, type A [Rhodococcus erythropolis SK121] Length = 684 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 73/207 (35%), Gaps = 23/207 (11%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLV-NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 A K+ +S +V ++ + G+ + Sbjct: 52 SGSMNDNDANGKNKLTGAKQSLSRIVGDTASSSTPLGLWTYPTAGSNCDPGSFLAGADGG 111 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + + + + ++++ L T T PA+ + L ++ V+ I+DG Sbjct: 112 VRKDTDTLMAQVSGLKADGGTPTGPALRASVDSLKANGITTAT----------VVLISDG 161 Query: 315 ENSGASAYQNTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 E++ A +T + + G + + ++ +G+ L + G++ + Sbjct: 162 ESNCGQAPCDT------AKQIVAEGFDVTVEALGFQLSGQGRTELECIASTTGGRYSDIA 215 Query: 372 DSRELLESFDKITDKIQEQSVRI-APN 397 D E+ + ++ I E ++ I AP+ Sbjct: 216 DVDEMQKRLKELM--IPELALTISAPD 240 >gi|170739508|ref|YP_001768163.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168193782|gb|ACA15729.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 342 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 21/112 (18%), Positives = 40/112 (35%), Gaps = 13/112 (11%) Query: 280 AMHHAYRELYNEKESSHNTIG-STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 M L + ST K +I +TDG ++ + + ++ Sbjct: 173 GMAGPRTALGDAIGLGIALFDRSTVKAKTIIALTDGNDTASQVP-----PTEAAGVAKDK 227 Query: 339 GMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 G+ I++VA+ P + L+ + G FF D EL + ++ Sbjct: 228 GIVIHTVAIGDPSTVGEDKLDETALKDVASATGGGFFRALDRDELARIYGRL 279 >gi|262195558|ref|YP_003266767.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262078905|gb|ACY14874.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 775 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 22/135 (16%), Positives = 56/135 (41%), Gaps = 14/135 (10%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ +++L+ T+ A+ A + + S + + ++F+TDG+ Sbjct: 326 RSQAVEYIDRLSDGGGTDLAGALAEAL----DAQHPSESEADTGSRPHVILFLTDGQ--- 378 Query: 319 ASAYQNTLNTLQICEYMRNAG-MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 ++ TLQ+ +AG +++++ V E L R ++ G+F + E+ Sbjct: 379 ----SDSQATLQVAR--GDAGDARVFTIGVGDGVEKPLLARLASEKRGRFTFIASPSEIE 432 Query: 378 ESFDKITDKIQEQSV 392 ++ +I + Sbjct: 433 RKVSRLYSEIAAPVL 447 >gi|162454786|ref|YP_001617153.1| hypothetical protein sce6504 [Sorangium cellulosum 'So ce 56'] gi|161165368|emb|CAN96673.1| hypothetical protein sce6504 [Sorangium cellulosum 'So ce 56'] Length = 381 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 67/205 (32%), Gaps = 42/205 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP----YEN 274 +++ + +Q+ + G +P + + + + ++++ Sbjct: 159 TRLDTAKLVLQDFISRRRTDRLGVVVFGKAAYVLSPPTLDYHLLTQMVSQMTLNVIDGSA 218 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T A+ A L S K VI +TDG+++ + + + Sbjct: 219 TAIGDALGTAVARL----------RRSDAQSKVVILLTDGDSNAGAI--SPEYATHLATS 266 Query: 335 MRNAGMKIYSVAVSAPPE----------------------GQDLLRKCTD-SSGQFFAVN 371 + G K+Y++ + E LL++ + G + Sbjct: 267 L---GAKVYTIQIGTDDEVEVEDGIDLFGQPRYVRHRFPVNPALLQEIAQKTGGASYVAT 323 Query: 372 DSRELLESFDKITDKIQEQSVRIAP 396 D++ L +S + D++++ + Sbjct: 324 DAKALADSMHDVLDRLEKTRFEASS 348 >gi|126465452|ref|YP_001040561.1| von Willebrand factor, type A [Staphylothermus marinus F1] gi|126014275|gb|ABN69653.1| von Willebrand factor, type A [Staphylothermus marinus F1] Length = 416 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 65/196 (33%), Gaps = 25/196 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 KI ++A L++ ++ + + N NEV+ + Sbjct: 55 KIFRAKQAALRLLDILRDKDYVG--------VYGFAGKFYKVLEPVPATNRNEVEKAIIG 106 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L TN Y + E+ +IFITDGE + Sbjct: 107 LKLGSGTNIYDTLKKLVE------ETKKVLESGAISLVRIIFITDGEPTTGQK--KPEKI 158 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS-SGQFFAVNDSRELLESF------- 380 L++ + +R AG + V + LL + +G+F V+D L + Sbjct: 159 LEMAKKLREAGASALIIGVGTE-YNEKLLSRMAMVLNGEFEHVSDPASLEKLISEYAKST 217 Query: 381 DKITDKIQEQSVRIAP 396 +I+ K R++P Sbjct: 218 QEISAKNVAVLFRLSP 233 >gi|157105665|ref|XP_001648969.1| hypothetical protein AaeL_AAEL014547 [Aedes aegypti] gi|108868963|gb|EAT33188.1| hypothetical protein AaeL_AAEL014547 [Aedes aegypti] Length = 541 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 30/198 (15%), Positives = 67/198 (33%), Gaps = 33/198 (16%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + + A +I + + + S S ++ +Q + Sbjct: 142 DASSSVGKANFYSEIKFVKKLLSDFNVSYNYTRVAVITFSSQMKIFR-----HIDQISTS 196 Query: 256 SNNLNE---VKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + ++ + ++ K+ T TY A+ A N + S KK + I Sbjct: 197 VEDNDKCLLLNYQIPKIEFSGGGTYTYGALKEAEEIFQNARADS---------KKIIFLI 247 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFA 369 TDG ++G + + + E ++ + IYS+ + + + L + S G F Sbjct: 248 TDGFSNG-------RDPIPLAESLKRKNVVIYSIGIQSGNYAE--LYNMSSSPGDSHSFL 298 Query: 370 VNDSRELLESFDKITDKI 387 ++ F+ + K Sbjct: 299 LDSFDH----FETLARKA 312 >gi|47191295|emb|CAF87304.1| unnamed protein product [Tetraodon nigroviridis] Length = 202 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 18/193 (9%), Positives = 60/193 (31%), Gaps = 17/193 (8%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + I + + ++V + + + VR+ + Y+ + NN Sbjct: 13 VDGSDTTGETGIAYIRDFIISVVQQL-----DVQPDRVRVAVVQYSDNVQREFALNSHNN 67 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V S + +L ++ A++ + + + + + ++ IT G++ Sbjct: 68 KQAVISAVKRLRLMGGRSSDLG--EAFKYVTENELKPSSGSRPSDASQHLVVITGGQSPQ 125 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 ++ ++++ ++ + V L + + V L Sbjct: 126 DASLYGPW--------LKSS--RVSCIGVGVGGTNTRQLTQIATTPEDVLQVPTFLSLPA 175 Query: 379 SFDKITDKIQEQS 391 ++ ++ Sbjct: 176 IRERFLTRLSGTI 188 >gi|239617869|ref|YP_002941191.1| PEGA domain protein [Kosmotoga olearia TBF 19.5.1] gi|239506700|gb|ACR80187.1| PEGA domain protein [Kosmotoga olearia TBF 19.5.1] Length = 1706 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 20/142 (14%), Positives = 44/142 (30%), Gaps = 13/142 (9%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + ++K L + T Y + L Sbjct: 336 DTEIEVLKNFTRDREQLKRALAIIKARGATPLYDTVAKGIELLSERSGP----------- 384 Query: 306 KFVIFITDGE--NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 +F+I +TDG N G +A + ++ R + I+++ + + L + Sbjct: 385 RFLILVTDGVDANYGDTAPGSEKTLSEVIRLARENNVVIFAIGLGTRIDEFSLGTLARST 444 Query: 364 SGQFFAVNDSRELLESFDKITD 385 G F L +F+ + + Sbjct: 445 GGMFLKSPTIDNLKTAFNSLLE 466 >gi|68535931|ref|YP_250636.1| putative secreted protein [Corynebacterium jeikeium K411] gi|68263530|emb|CAI37018.1| putative secreted protein [Corynebacterium jeikeium K411] Length = 550 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 62/184 (33%), Gaps = 21/184 (11%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI-------VGNQCTPLSNNLN 260 ++ L L++ K + R N + Sbjct: 376 NRLGDLKSILNKLIDGTAGEGNNPKGFARRETITLMPFSSKVADGYTQENYDPDNAQQKQ 435 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +++ +N L P T Y A+ AY + ++ +TDG ++ + Sbjct: 436 DLRGYVNGLQPRGETAIYDAVLRAYDRVGEGG----------GSLNSIVLMTDGASNSGT 485 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLES 379 + ++ + + N + ++ + ++ +R + + G+ F V +L ++ Sbjct: 486 SRKDFITKMTRMMDETNHKIPVFVILYG--EASEEEMRFLANFTGGKVFNVR-GGDLAKA 542 Query: 380 FDKI 383 F++I Sbjct: 543 FEEI 546 >gi|238060066|ref|ZP_04604775.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237881877|gb|EEP70705.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 316 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 20/152 (13%), Positives = 46/152 (30%), Gaps = 24/152 (15%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 N P + + + V + ++ L E T T A+ + S + ++ Sbjct: 139 NVLVPPTKDRDAVTTAIDGLVLAEATATGEAVFTCLEAI----RSVPADGAAGIPPARIV 194 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP--------------EGQD 355 ++DG + + + Q A + + ++A + Sbjct: 195 LLSDGFRTSGRSVEEAAAAAQA------ANVPVSTIAFGTDAGQVDIGGQLQRVPVDRMA 248 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 L + G F+ EL + + + I Sbjct: 249 LAELAETTEGYFYEAASVSELKQVYQDMGSSI 280 >gi|159040640|ref|YP_001539892.1| von Willebrand factor type A [Caldivirga maquilingensis IC-167] gi|157919475|gb|ABW00902.1| von Willebrand factor type A [Caldivirga maquilingensis IC-167] Length = 474 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 32/138 (23%), Positives = 52/138 (37%), Gaps = 25/138 (18%) Query: 257 NNLNEVKSRLNKLN-------PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + VK +N++ P TN A+ A L K +I Sbjct: 353 WGPHSVKKYINEMEEMSRYIYPGGGTNIANALEKARIILSKSN----------YPNKHII 402 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 ITDG AS+ ++ +R G+ + +VAV + L+R +G F Sbjct: 403 CITDGRTVNASSC------IKEAVKLRRMGVTLSTVAVGDNSDFDLLMRLSKIGNGLFIK 456 Query: 370 VNDSRELLESFDKITDKI 387 +ND L ++ I DK+ Sbjct: 457 INDISNLDKAL--IMDKL 472 >gi|156402981|ref|XP_001639868.1| predicted protein [Nematostella vectensis] gi|156226999|gb|EDO47805.1| predicted protein [Nematostella vectensis] Length = 240 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 58/162 (35%), Gaps = 19/162 (11%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNE 291 + R+G Y+ +L E + KL P E T T A+ L+++ Sbjct: 75 SSQTRVGVGLYSTFASVPIPFGKYTSLQETVEGIKKLRYPGEGTRTGRALKLMKTHLFSQ 134 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + K +I +TDG + + + +R +G+++++V + Sbjct: 135 SRP--------KAHKVLIVLTDG--------TSVDDVKAPAKALRESGVEVFAVGIG-EH 177 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L+ +G R+L+ K+ D + + + Sbjct: 178 YRPRELKDIATDTGHVLTA-GFRDLMSVERKLRDAVCKSVRK 218 >gi|260577971|ref|ZP_05845896.1| secreted Mg-chelatase subunit [Corynebacterium jeikeium ATCC 43734] gi|258603897|gb|EEW17149.1| secreted Mg-chelatase subunit [Corynebacterium jeikeium ATCC 43734] Length = 551 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/141 (16%), Positives = 52/141 (36%), Gaps = 15/141 (10%) Query: 245 IGIVGNQCTPLSNNLNE-VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 G P S + ++ +N L P T Y A+ AY + Sbjct: 420 DGYTQEHYDPDSAEQSRGLRDYVNGLQPRGETAIYDAVLRAYDRVGEGG----------G 469 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD- 362 ++ +TDGE++ + Q + ++ + + ++ + +D + D Sbjct: 470 SLNSIVLMTDGESNSGTNRQEFITKMKRKMAETDRKIPVFVILYG--EASEDEMNFLADF 527 Query: 363 SSGQFFAVNDSRELLESFDKI 383 + G+ F +L ++F++I Sbjct: 528 TGGKVFNARS-GDLSKAFEEI 547 >gi|255570578|ref|XP_002526246.1| protein binding protein, putative [Ricinus communis] gi|223534440|gb|EEF36143.1| protein binding protein, putative [Ricinus communis] Length = 513 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 17/128 (13%), Positives = 44/128 (34%), Gaps = 20/128 (15%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N+ E ++ +N+L TN + A + L + + +G ++ ++DGE+ Sbjct: 120 NSQKEFENLINRLKADGWTNITAGLETALKVLNDRSFNGGRVVG-------IMLMSDGEH 172 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSR 374 + + +++ +L+ G V ++ Sbjct: 173 NTDGDPAEVPLG----------NVPVHTFGFGRN-YEPRVLKAVAHKSIGGTLSDVQNTN 221 Query: 375 ELLESFDK 382 L ++F + Sbjct: 222 NLGKAFSQ 229 >gi|149046634|gb|EDL99459.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_a [Rattus norvegicus] Length = 927 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/333 (8%), Positives = 87/333 (26%), Gaps = 23/333 (6%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E + +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPERNESESGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S E+ + + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKRNRDEDPTLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 + N +K +N + T+ A Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR---NKKVLKDAVNNITAKGITDYKKGFSFA 340 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + +L N S K ++ TDG +Y ++ +++++ Sbjct: 341 FEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVFT 386 Query: 345 VAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 +V + ++ ++ G ++ + + Sbjct: 387 FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|116622066|ref|YP_824222.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225228|gb|ABJ83937.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 309 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 58/147 (39%), Gaps = 19/147 (12%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + P +N ++ ++ P T Y A+ A + KK ++ Sbjct: 128 STEIPFTNRPEDLTYAISHSPPTGKTALYDAVWKAREWVARGSRD----------KKVLV 177 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD---LLRKCT-DSSG 365 ++DG ++ ++ + I E + ++++++ + P + +LR+ + G Sbjct: 178 VVSDGGDNASTHTLSE-----ILEAANKSNIQVFTIGIFDPDDPDKNPGVLRQLARATGG 232 Query: 366 QFFAVNDSRELLESFDKITDKIQEQSV 392 + F ++ E++ + I I+ Q Sbjct: 233 EAFVPDELSEVVAICESIAKDIRSQYT 259 >gi|1705853|sp|P54290|CA2D1_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-1; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-1; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-1; Flags: Precursor gi|203955|gb|AAA41088.1| dihydropyridine-sesitive L-type calcium channel alpha-2 subunit [Rattus norvegicus] Length = 1091 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 29/333 (8%), Positives = 91/333 (27%), Gaps = 24/333 (7%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E + +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPERNESESGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD---VSRSMEDLYL 165 A IPT+ ++ + L S E+ + + ++ Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKRNRDEDPTLLWQVFAADRLARYYPASPWV 223 Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 N + P + S + + + + ++ S ++ ++ Sbjct: 224 DNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLS 283 Query: 226 KAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 + + ++ +V N +K +N + T+ A Sbjct: 284 DDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFTFA 339 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + +L N S K ++ TDG +Y ++ +++++ Sbjct: 340 FEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVFT 385 Query: 345 VAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 +V + ++ ++ G ++ + + Sbjct: 386 FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 418 >gi|302555134|ref|ZP_07307476.1| secreted protein [Streptomyces viridochromogenes DSM 40736] gi|302472752|gb|EFL35845.1| secreted protein [Streptomyces viridochromogenes DSM 40736] Length = 415 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 26/196 (13%), Positives = 65/196 (33%), Gaps = 26/196 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ ++ ++++ + + PL + Sbjct: 49 GQSRMAAAKQAFNEVLDATPEEVELGIRTLGADYPGDDRKTGCKDTAQLYPVGPL--DRT 106 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 E K+ + L P T PA+ A +L K ++ I+DGE++ Sbjct: 107 EAKTAVATLTPTGWTPIGPALLKAAGDL-----------DGGNGSKRIVLISDGEDT--- 152 Query: 321 AYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 L+ ++ + G+ I ++ + + L + + G + +V EL Sbjct: 153 --CAPLDPCEVAREIAAKGIGLTIDTLGLVPTAKLSRQLSCIAEATGGTYTSVEHQDELT 210 Query: 378 ESFDKITDKIQEQSVR 393 + +++ D+ + V Sbjct: 211 DRVNQLVDRAADPVVT 226 >gi|288941617|ref|YP_003443857.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288896989|gb|ADC62825.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 341 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/160 (16%), Positives = 58/160 (36%), Gaps = 36/160 (22%) Query: 253 TPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TPL+ + V + L T A+ A + L + E ++ +I Sbjct: 150 TPLTFDGATVAAMLRDSVVGLAGRETAIGDAIGLAVKRLREQPEG----------QRVLI 199 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS-----------------APPE 352 +TDG+N+ L+ L+ E AG+++Y++ + A Sbjct: 200 LLTDGDNTAG-----ALDPLEAAELAAQAGVRVYTIGIGGGELGVRSLFGMRLLRQASDF 254 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + + G+ F + ++L +D++ + Sbjct: 255 DPATLERIAEITGGRAFTADSRQQLEAVYDELDRLEPSER 294 >gi|309792347|ref|ZP_07686816.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225613|gb|EFO79372.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 845 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 39/252 (15%), Positives = 78/252 (30%), Gaps = 21/252 (8%) Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 P +Q P L L + + I +VLD S Sbjct: 359 GPPTRQYNVCSQMPPPPAETNLLDLAGNVSF-----YPEYQIIPTSQKPIQYVVVLDASG 413 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 SM + + N++ + P + + R+I V ++ Sbjct: 414 SMSANFDGQCNNSGGVKQCANGPSGFPDVQVSNTGYDYWWTTES----QRRIYVAKKALE 469 Query: 219 NLV---NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL------ 269 LV N S ++ + +N G+ +Q +NN +K+ + L Sbjct: 470 RLVTLSNMPGNPGYTNTRPSDQMAVVWFNDGVSSSQTQAFTNNPTTLKNYITTLNNVNGN 529 Query: 270 -NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG--ASAYQNTL 326 TN ++ A N ++ + K+ V+F+TDG ++ ++ + Sbjct: 530 YRSAGGTNGAGGLYRASLLYQNAPKTVSFNGTNVEYKRVVLFVTDGVSNYFLNTSASDLK 589 Query: 327 NTLQICEYMRNA 338 L + + Sbjct: 590 GPLSSYDTFKKN 601 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 23/258 (8%), Positives = 71/258 (27%), Gaps = 8/258 (3%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++I + + ++D+ + Q ++ +AA L+G ++ ++ + TT Sbjct: 24 IIALMILILVAMVGLSVDVGNTFSKERQAVASANAASLAGMSAYMARSSSTLDTTIYQAI 83 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQI----NITKDKNNPLQYIAESKAQYEIPTE 116 + + + E + + A Sbjct: 84 TASLQSNGLVVGDGTNNTVEVTANYLDSQGNLLAGHPVVGSGGTAPNGAAYIRVQLSGMV 143 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 N ++ ++ + + + + + + +D + ++ N T Sbjct: 144 NTSFARVVGRDDLPINADAHAGLCQVNSGVY---PIAVDNAYIGNGVFNNIGV-TNPSTE 199 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 K L ++ + ++ S L ++AG+ Q + Sbjct: 200 YKVLSNGMVQRRVYVRDGDDSPGQFGWLRWKEDKGELGQAAGSAGELAQSLTGDGNLDWG 259 Query: 237 RIGTIAYNIGIVGNQCTP 254 + + Sbjct: 260 FDEAPWPSNETAPSDYPN 277 >gi|149419345|ref|XP_001517573.1| PREDICTED: similar to anthrax toxin receptor, partial [Ornithorhynchus anatinus] Length = 139 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 30/141 (21%), Positives = 51/141 (36%), Gaps = 16/141 (11%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G+ L+ + +++ L +L P +T + A ++Y+E + T Sbjct: 11 STRGSTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQIYHENWQGYRT---- 66 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +I +TDGE + + Q R+ G +Y V V + L + D Sbjct: 67 --ASVIIALTDGELHENLFF----HAEQEANRSRDFGATVYCVGV--KDFNETQLARIAD 118 Query: 363 SSGQFFAVND-SRELLESFDK 382 S F VND L D Sbjct: 119 SKDHVFPVNDGFEALQGIIDS 139 >gi|148655977|ref|YP_001276182.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568087|gb|ABQ90232.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 420 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 63/201 (31%), Gaps = 15/201 (7%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 T+ + + + + V E + + + ++ Sbjct: 38 TQVRAPVNVCFVIDRSGSMKGEKIDRVRRATIRAIEMLDAQDVVSVVIFDHRTEVLIPAT 97 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 E+ R+N++ T PA+ RE+ + + +I +TDG Sbjct: 98 PVTRPAELIDRINRVRDSGGTRIAPAIEAGLREIEKGPPQ---------MVRRLILLTDG 148 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 + +N + L+ + I ++ V L+ S G ++ Sbjct: 149 QT------ENESDCLRRATDAGQRNVPITALGVGKDWNEDLLIEMANRSGGTADYIDRPE 202 Query: 375 ELLESFDKITDKIQEQSVRIA 395 +++E F + Q +V+ A Sbjct: 203 KIVEYFQSTIQRAQATAVQNA 223 >gi|156409373|ref|XP_001642144.1| predicted protein [Nematostella vectensis] gi|156229285|gb|EDO50081.1| predicted protein [Nematostella vectensis] Length = 203 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/156 (17%), Positives = 57/156 (36%), Gaps = 20/156 (12%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKES 294 RIG + ++ + +V + L ++ P T T A+ + Y L+ + Sbjct: 46 TRIGAVLFSSRPYLMFNFQKYRTVRQVLAALQRIRYPRGGTKTGRALRYTYSRLFRSRSR 105 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 K+ +I +TDG++ + +++N G++++++ V + Sbjct: 106 VR--------KQALIVLTDGKSQDSVGQP--------AAFIKNQGVELFAIGVGRNYRRR 149 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 DL + S G F L I K+ Sbjct: 150 DLNQIA--SRGNVFTAK-FENLGRIIGAIKSKVCRP 182 >gi|313235273|emb|CBY10837.1| unnamed protein product [Oikopleura dioica] Length = 696 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 36/321 (11%), Positives = 88/321 (27%), Gaps = 26/321 (8%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ +Y N G I A I + + + I + + + Sbjct: 60 FERYAYAESNYGLIQHHAVQQIWTIDEDQFNTTKTAALKNSISNKFNVDWDRLKYEELTI 119 Query: 132 SLRST-GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 S + + I +D ++ D ++ + + Sbjct: 120 PTNSILALFLFIDQLNEFPIPYSIDEQDALYTRITHNTVDGFKDKVDELDQNTQNECTSK 179 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + + + + D++ L++ Q + RI Y+ + Sbjct: 180 ALDIVFV-VDESGSVGPDNFDLVK---QFLIDYAQDSNIAANA--TRIAIRTYSTYSDLD 233 Query: 251 QCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + + +N L TNT A+ + + N + K ++ Sbjct: 234 FSLN-DFKTSNIIFEINNLVHESGGTNTADAITNGLNDFGN---------DRSESVKIMV 283 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS-SGQFF 368 ITDG+++ + ++ I S A+ L+ + G Sbjct: 284 TITDGQSNYDR-------VKAAADLLKADPRNIQSFAIGIDGANMAELQAIATTDPGHIE 336 Query: 369 AVNDSRELLESFDKITDKIQE 389 + + + + K+ E Sbjct: 337 MLKNWSDFGPIKKNLQSKVCE 357 >gi|241767791|ref|ZP_04765389.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] gi|241361168|gb|EER57806.1| von Willebrand factor type A [Acidovorax delafieldii 2AN] Length = 277 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/222 (10%), Positives = 70/222 (31%), Gaps = 58/222 (26%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +I +A + + VR+G +A+ Q + + ++ ++ Sbjct: 36 DRITAAQNAAKAFIAELP--------RHVRVGIVAFAGSAQLAQLP--TQSHEDLAKAID 85 Query: 268 KLNPYENTNTYPAMHHAYRELYNEK---------------------------ESSHNTIG 300 T T + A L+ + + Sbjct: 86 SFQLQRGTATGNGIMLALATLFPDAGIDIAALGGRQAMHPRPLDEVTRQDPAKPFTPVAP 145 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------- 352 + +I +TDG+ + ++ L+ ++ + G+++Y+V V Sbjct: 146 GSYTSAAIIMLTDGQRTTG------VDPLEAAQWAADRGVRVYTVGVGTVQGETIGFEGW 199 Query: 353 ------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 ++ L+ + ++F + +L + ++ ++ ++ Sbjct: 200 SMRVRLDEETLKAVAGRTHAEYFHAATAADLKKVYETLSSRL 241 >gi|84685162|ref|ZP_01013061.1| hypothetical protein 1099457000257_RB2654_09854 [Maritimibacter alkaliphilus HTCC2654] gi|84666894|gb|EAQ13365.1| hypothetical protein RB2654_09854 [Rhodobacterales bacterium HTCC2654] Length = 496 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 14/156 (8%), Positives = 42/156 (26%), Gaps = 4/156 (2%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRT-IKDPTTKKDQT 60 A+++++ + + D +++MQS D L+ + + +Q Sbjct: 21 FALMLAILSGIVALSYDFGRAAATQSEMQSFADNVALAAAGELDGGADALTRAQAAAEQL 80 Query: 61 STIFKKQIK---KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + + + + A +I+ T + P Sbjct: 81 IADSQTYGEGPGALGAEDFVLTFYAVRPDASGEIDATTTPEAAKYVSVRVADRAVTPIFG 140 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 L + + + + ++ M+ Sbjct: 141 AVYAALSGNDAGRDRAGAHAVAGFTRYACNVTPLMI 176 >gi|159900723|ref|YP_001546970.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893762|gb|ABX06842.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 421 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 50/138 (36%), Gaps = 14/138 (10%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N+ +++ + K+ T PA+ A E+ + +I +TDG+ Sbjct: 101 NVAALQAEVGKIKDAGGTKIAPALEAALNEI--------RRSQNANTISRIILLTDGQTE 152 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 G + L++ E + A + + ++ V L+ S G ++ ++ Sbjct: 153 G------ERDCLRLAEEIGKASVPLTALGVGDDWNEDLLIEMANRSGGVAEYFSNPNDIA 206 Query: 378 ESFDKITDKIQEQSVRIA 395 F + Q V+ + Sbjct: 207 SFFQGAVQQAQSAVVQNS 224 >gi|302336993|ref|YP_003802199.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301634178|gb|ADK79605.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 333 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 52/205 (25%), Gaps = 52/205 (25%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 + + D + V G TP + + S Sbjct: 109 GGDTRFDAAKHAIRTFVE-----------GREHDPLGLVIFGDEAALVTPPTLDYTSFLS 157 Query: 265 RLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 R++ + + M A L K ++ ++DGEN+ Sbjct: 158 RMDAVRVMKLGRGSALGLGMAVATVHLEKSSAER----------KVMVIVSDGENNAG-- 205 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------------GQDLLRK 359 + + G++IY+V V +LL+ Sbjct: 206 ---EITPESAARVAASLGIRIYAVGVGGEGSVATEFTDPETGKSYRGTYEGKIDMELLKA 262 Query: 360 C-TDSSGQFFAVNDSRELLESFDKI 383 + GQ F L + F +I Sbjct: 263 VTESTRGQAFLAGSPGALSQVFREI 287 >gi|226874935|ref|NP_034712.2| inter-alpha-trypsin inhibitor heavy chain H2 [Mus musculus] gi|148676057|gb|EDL08004.1| inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 950 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/227 (11%), Positives = 74/227 (32%), Gaps = 25/227 (11%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 L P PK + + + S + + ++ + +L Q ++ + N + Sbjct: 303 FAPENLDPIPKNILFVIDVSGSMWGI---KMKQTVEAMKTILDDLRTDDQFSVVDF-NHN 358 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 VR + + K + K+ P TN A+ A L Sbjct: 359 VRTWRNDLVSATKTQ--------IADAKRYIEKIQPSGGTNINEALLRAIFILNEASNMG 410 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 + +I ++DG+ + + + + + + ++S+ + D Sbjct: 411 LL---NPDSVSLIILVSDGDPTVGELKLSKI--QKNVKQSIQDNISLFSLGIGFDV-DYD 464 Query: 356 LLRKCTDSSGQ------FFAVNDSRELLESFDKITDKIQEQSVRIAP 396 L++ + + + + S +L + +++++ + P Sbjct: 465 FLKRL-SNENRGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYP 510 >gi|126334857|ref|XP_001374633.1| PREDICTED: similar to leukocyte immune-type receptor TS32.15 L1.1a [Monodelphis domestica] Length = 3609 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/198 (11%), Positives = 60/198 (30%), Gaps = 27/198 (13%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + A ++ + + + S + + I ++ Sbjct: 116 DESSSVGHANFLNELKFVKKLLSDFPVVPSATRVAIVTFSSKNNVVPRVDYISSSRAHQ- 174 Query: 256 SNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + + + + + T T A A + L + +E+S K + ITDG Sbjct: 175 -HKCSLLNREIPNITYRGGGTYTKGAFQQAAQILRHSRENST---------KVIFLITDG 224 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND 372 ++G + I +R+ G++I++ L + + ++ Sbjct: 225 YSNGG-------DPRPIAASLRDFGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHS 275 Query: 373 SRELLESFDKITDKIQEQ 390 E F+ + + + Sbjct: 276 FEE----FEALARRALHE 289 >gi|21707832|gb|AAH34341.1| Inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 946 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/227 (11%), Positives = 74/227 (32%), Gaps = 25/227 (11%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 L P PK + + + S + + ++ + +L Q ++ + N + Sbjct: 299 FAPENLDPIPKNILFVIDVSGSMWGI---KMKQTVEAMKTILDDLRTDDQFSVVDF-NHN 354 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 VR + + K + K+ P TN A+ A L Sbjct: 355 VRTWRNDLVSATKTQ--------IADAKRYIEKIQPSGGTNINEALLRAIFILNEASNMG 406 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 + +I ++DG+ + + + + + + ++S+ + D Sbjct: 407 LL---NPDSVSLIILVSDGDPTVGELKLSKI--QKNVKQSIQDNISLFSLGIGFDV-DYD 460 Query: 356 LLRKCTDSSGQ------FFAVNDSRELLESFDKITDKIQEQSVRIAP 396 L++ + + + + S +L + +++++ + P Sbjct: 461 FLKRL-SNENRGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYP 506 >gi|74227570|dbj|BAE21837.1| unnamed protein product [Mus musculus] Length = 950 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/227 (11%), Positives = 74/227 (32%), Gaps = 25/227 (11%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 L P PK + + + S + + ++ + +L Q ++ + N + Sbjct: 303 FAPENLDPIPKNILFVIDVSGSMWGI---KMKQTVEAMKTILDDLRTDDQFSVVDF-NHN 358 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 VR + + K + K+ P TN A+ A L Sbjct: 359 VRTWRNDLVSATKTQ--------IADAKRYIEKIQPSGGTNINEALLRAIFILNEASNMG 410 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 + +I ++DG+ + + + + + + ++S+ + D Sbjct: 411 LL---NPDSVSLIILVSDGDPTVGELKLSKI--QKNVKQSIQDNISLFSLGIGFDV-DYD 464 Query: 356 LLRKCTDSSGQ------FFAVNDSRELLESFDKITDKIQEQSVRIAP 396 L++ + + + + S +L + +++++ + P Sbjct: 465 FLKRL-SNENRGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYP 510 >gi|3024068|sp|Q61703|ITIH2_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|695634|emb|CAA49842.1| inter-alpha-inhibitor H2 chain [Mus musculus] gi|122889675|emb|CAM13914.1| inter-alpha trypsin inhibitor, heavy chain 2 [Mus musculus] Length = 946 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/227 (11%), Positives = 74/227 (32%), Gaps = 25/227 (11%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 L P PK + + + S + + ++ + +L Q ++ + N + Sbjct: 299 FAPENLDPIPKNILFVIDVSGSMWGI---KMKQTVEAMKTILDDLRTDDQFSVVDF-NHN 354 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 VR + + K + K+ P TN A+ A L Sbjct: 355 VRTWRNDLVSATKTQ--------IADAKRYIEKIQPSGGTNINEALLRAIFILNEASNMG 406 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 + +I ++DG+ + + + + + + ++S+ + D Sbjct: 407 LL---NPDSVSLIILVSDGDPTVGELKLSKI--QKNVKQSIQDNISLFSLGIGFDV-DYD 460 Query: 356 LLRKCTDSSGQ------FFAVNDSRELLESFDKITDKIQEQSVRIAP 396 L++ + + + + S +L + +++++ + P Sbjct: 461 FLKRL-SNENRGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYP 506 >gi|149046636|gb|EDL99461.1| calcium channel, voltage-dependent, alpha2/delta subunit 1, isoform CRA_c [Rattus norvegicus] Length = 920 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 28/333 (8%), Positives = 87/333 (26%), Gaps = 23/333 (6%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK- 108 + ++ + + E + +I ++ + Sbjct: 104 AEKVQAAHQWREDFASNEVVYYNAKDDLDPERNESESGSQRIKPVFIEDANFGRQISYQH 163 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLY 164 A IPT+ ++ + L S E+ + + + + Sbjct: 164 AAVHIPTDIYEGSTIVLNELNWTSALDEVFKRNRDEDPTLLWQVFGSATGLARYYPASPW 223 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + N + P + S + + + + ++ S ++ ++ Sbjct: 224 VDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETL 283 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 + N +K +N + T+ A Sbjct: 284 SDDDFVNVASFNSNAQDVSCFQHLVQANVR---NKKVLKDAVNNITAKGITDYKKGFSFA 340 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + +L N S K ++ TDG +Y ++ +++++ Sbjct: 341 FEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKDKKVRVFT 386 Query: 345 VAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 +V + ++ ++ G ++ + + Sbjct: 387 FSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 419 >gi|149188658|ref|ZP_01866950.1| hypothetical protein VSAK1_16267 [Vibrio shilonii AK1] gi|148837568|gb|EDL54513.1| hypothetical protein VSAK1_16267 [Vibrio shilonii AK1] Length = 346 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 67/197 (34%), Gaps = 33/197 (16%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 ++ + A ++D + V + R+G I + TP + + Sbjct: 115 FSSSDGQAVSRLDAVKSVLHEFVAT---------REGDRLGLILFGDAAYLQ--TPFTAD 163 Query: 259 LNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + L++ ++T+ A+ A + S +K V+ +TDG Sbjct: 164 HDVWLALLDQTEVAMAGQSTHLGDAIGLAIKVFEQS-------ESSKDKEKVVVVLTDGN 216 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFF 368 ++G + + G++I+ +A+ P + S GQ F Sbjct: 217 DTG-----SFVEPKDAAIVAAAKGVRIHVIAMGDPATIGEQALDMATIDNIASQSGGQAF 271 Query: 369 AVNDSRELLESFDKITD 385 D L +++ I + Sbjct: 272 QALDQEALQQAYRTIGE 288 >gi|123718334|emb|CAJ77150.1| procollagen type VI alpha 4 [Mus musculus] Length = 762 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 61/184 (33%), Gaps = 17/184 (9%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-E 273 S N + + ++Q ++R+G Y+ + + +V + L Sbjct: 49 HSVRNFLYILANSLQ-VGRDNIRVGLAQYSDTPTSEFLLSVYHRKGDVLKHIRGLQFKPG 107 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 A+ L + + S + + + ++ G + + E Sbjct: 108 GNRMGQALQFI---LEHHFREGAGSRASQGVPQVAVVVSSGLT--------EDHIREPAE 156 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQS 391 +R AG+ +Y++ V Q LR+ + S F V + L K+ ++ Sbjct: 157 ALRRAGILVYAIGV--KDASQAELREISSSPKDNFTFFVPNFPGLPGLAQKLRPELCSTL 214 Query: 392 VRIA 395 + A Sbjct: 215 GKAA 218 >gi|16080727|ref|NP_391555.1| hypothetical protein BSU36740 [Bacillus subtilis subsp. subtilis str. 168] gi|221311634|ref|ZP_03593481.1| hypothetical protein Bsubs1_19866 [Bacillus subtilis subsp. subtilis str. 168] gi|221315962|ref|ZP_03597767.1| hypothetical protein BsubsN3_19787 [Bacillus subtilis subsp. subtilis str. NCIB 3610] gi|221320873|ref|ZP_03602167.1| hypothetical protein BsubsJ_19730 [Bacillus subtilis subsp. subtilis str. JH642] gi|221325158|ref|ZP_03606452.1| hypothetical protein BsubsS_19896 [Bacillus subtilis subsp. subtilis str. SMY] gi|321313224|ref|YP_004205511.1| hypothetical protein BSn5_09320 [Bacillus subtilis BSn5] gi|8928525|sp|P70960|YWMC_BACSU RecName: Full=Uncharacterized protein ywmC; Flags: Precursor gi|1648853|emb|CAB03680.1| unknown [Bacillus subtilis subsp. subtilis str. 168] gi|2636199|emb|CAB15691.1| putative exported protein [Bacillus subtilis subsp. subtilis str. 168] gi|320019498|gb|ADV94484.1| hypothetical protein BSn5_09320 [Bacillus subtilis BSn5] Length = 227 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 19/128 (14%), Positives = 42/128 (32%), Gaps = 16/128 (12%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 S N + LN + P T A++ A +K V +TDGE Sbjct: 108 SFNEQSFLNSLNTIGPTGWTPIAKALNEAKSSFDQLDAKG---------EKVVYLLTDGE 158 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSR 374 + N ++ + ++ + + + + L G++F + Sbjct: 159 ETCGG------NPIKTAKELQKDNITVNVIGFDYKEGYKGQLNAIAKVGGGEYFPAYTQK 212 Query: 375 ELLESFDK 382 ++ + F + Sbjct: 213 DVEKIFTQ 220 >gi|332308254|ref|YP_004436105.1| LPXTG-motif cell wall anchor domain protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332175583|gb|AEE24837.1| LPXTG-motif cell wall anchor domain protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 777 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 23/141 (16%), Positives = 55/141 (39%), Gaps = 9/141 (6%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 +NN+ ++ + L+ T PA+ A E + GS + V+FIT Sbjct: 443 PATANNIQRARNWVASLSANGGTEMAPALSMALH--KTNLEQQNINEGSPVQLRQVVFIT 500 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DG S I + ++ +++++ + + P + + G F + D Sbjct: 501 DG-----SVSNEDALMSLIENQLADS--RLFTIGIGSAPNSYFMTQAAQAGRGTFTYIGD 553 Query: 373 SRELLESFDKITDKIQEQSVR 393 ++ + ++ +K+ ++ Sbjct: 554 INQVQQKMTELFNKLTRPVMQ 574 >gi|198425808|ref|XP_002121992.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 431 Score = 59.2 bits (141), Expect = 1e-06, Method: Composition-based stats. Identities = 26/242 (10%), Positives = 66/242 (27%), Gaps = 18/242 (7%) Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 + + + ++ F ++ Sbjct: 142 SGYVTYGVCYSSNDKGRTWSTDTKQADSKCPVEDLDLIF--LLDGSGSVTVPDPLNFDRV 199 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 +++ + + A S T + E + ++ + Sbjct: 200 KQWVKNVTDRFDISTFANVGVIQYSHYYETRTVQPYMKVEIGLGQYKTQAEFQMAVDSIQ 259 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T HA E +S +K ++ +TDG+++ + T Sbjct: 260 FQGFTTFT---AHALNRTVEEFMNSTR-YSDPTTRKVIVLLTDGQSNDREFLEETSAY-- 313 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD----SSGQFFAVNDSRELLESFDKITDK 386 R G+ I++V V ++ L+ T ++ + F ++ +L + D + Sbjct: 314 ----ARGLGITIFAVGV--EGYSEEELQIITSGELGNNERVFGLDTFSDLNKVVDSLQLS 367 Query: 387 IQ 388 I Sbjct: 368 IT 369 >gi|326920703|ref|XP_003206608.1| PREDICTED: cochlin-like, partial [Meleagris gallopavo] Length = 760 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 51/161 (31%), Gaps = 20/161 (12%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYN 290 ++ +I T+ + +V S + + T T A+ R ++ Sbjct: 610 SDIGSKIATVQFTYDQRTEFSFTDYTTKEKVLSAIRNIRYMSGGTATGDAISFTTRNVFG 669 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + N F++ +TDG++ + + AG+ ++SV V+ Sbjct: 670 PVKDGANK-------NFLVILTDGQSYD--------DVRGPAVAAQKAGITVFSVGVAWA 714 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 P L+ F + L + + I + Sbjct: 715 PLDD--LKDMASEPRESHTFFTREFTGLEQMVPDVIRGICK 753 >gi|88707026|ref|ZP_01104723.1| von Willebrand factor type A domain protein [Congregibacter litoralis KT71] gi|88698754|gb|EAQ95876.1| von Willebrand factor type A domain protein [Congregibacter litoralis KT71] Length = 330 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 49/142 (34%), Gaps = 25/142 (17%) Query: 254 PLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P +++ ++ L + ++T A+ A S + +I Sbjct: 155 PFTDDRETWQTLLEESEVAMAGQSTALGDAIGLAISIFQA----------SDTTNRVLIV 204 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCT-DS 363 +TDG ++G+ + IY+VAV P + L + Sbjct: 205 LTDGNDTGSRVP-----PRDAATIAAANDVTIYTVAVGDPATIGEEALDLETLNAVAETT 259 Query: 364 SGQFFAVNDSRELLESFDKITD 385 G F D++ L +++D+I Sbjct: 260 GGASFQALDTQALEKAYDEINR 281 >gi|297303947|ref|XP_002808579.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Macaca mulatta] Length = 1313 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 30/214 (14%), Positives = 76/214 (35%), Gaps = 20/214 (9%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 LPP K + + + S + K++ ++ +++ ++ Sbjct: 272 FAPRGLPPMEKNVVFVIDVSGSMFG-------TKMEQTKKAMNVILSDLRANDYFNIISF 324 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 + G + N++ K L+++ T+ A+ A L + + Sbjct: 325 SDTINVWKAGGSIQATI----QNVHSAKDYLHRMEADGWTDINSALLAAASVLNHSNQ-E 379 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 S +IF+TDGE + + + L + ++++A Sbjct: 380 PGRGPSVGRIPLIIFLTDGEPTAGVTTPSVI--LSNVRQAVGHRVSLFTLAFG-DDADFT 436 Query: 356 LLRKCT-DSSG---QFFAVNDSR-ELLESFDKIT 384 LLR+ + ++ G + + D+ +L +++I+ Sbjct: 437 LLRRLSLENRGIARRIYEDTDAALQLEGLYEEIS 470 >gi|163758683|ref|ZP_02165770.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] gi|162283973|gb|EDQ34257.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] Length = 587 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 61/203 (30%), Gaps = 18/203 (8%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 P K S KI + + +L+++ A+ + + Sbjct: 27 AVPSAKVGKVMIVLDGSNSMWGQVDGEAKITIAKDVMTDLISNWDDAV-DLGLMVYGHRR 85 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 I P N + ++ ++P T + A + G Sbjct: 86 KGDCSDIEVV-ALPGKVNRPALIDKVQSISPRGKTPISKTLLLAATSV-----------G 133 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 K V+ ++DG + + L I G ++ + E L+ Sbjct: 134 YFSGKSSVVLVSDGLETCDADPCAQAKALGIINP----GFDVHVIGFDVTEEEFKSLQCI 189 Query: 361 TD-SSGQFFAVNDSRELLESFDK 382 + G+FF N++ EL ++ + Sbjct: 190 ATETGGKFFRANNAEELKDALRR 212 >gi|326670664|ref|XP_003199264.1| PREDICTED: collagen alpha-3(VI) chain-like [Danio rerio] Length = 2265 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 53/170 (31%), Gaps = 15/170 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 + RI + Y+ N E+ + L T A+ Sbjct: 879 DVAENRDRISVVQYSREPEANFYLNTYTTKEEIVDAVRGLRHKGGRPLYTGEALQFVRDN 938 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + ++ ++ G ++ + ++ G+ + + Sbjct: 939 VFTASSGSRRL---EGVPQILVLLSGG--------RSFDSVDAAASSLKELGVL--TFGI 985 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + L++ + +V+D EL +++ +Q S+ + P Sbjct: 986 GSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPT 1035 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 54/170 (31%), Gaps = 15/170 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 + RI + Y+ N E+ + L T A+ + Sbjct: 1694 DVAENRDRISVVQYSREPEANFYLNTYTTNEEIVDAVRGLRHKGGRPLYTGEALQYVRDN 1753 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + ++ ++ G ++ + ++ G+ + + Sbjct: 1754 VFTASSGSRRL---EGVPQILVLLSGG--------RSFDSVDAAASSLKELGVL--TFGI 1800 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + L++ + +V+D EL +++ +Q S+ + P Sbjct: 1801 GSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTSIPVTPT 1850 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/328 (9%), Positives = 97/328 (29%), Gaps = 33/328 (10%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + ++++ + + + + ++ A L Sbjct: 1743 TGEALQYVRDNVFTASSGSRRLEGVPQILVLLSGGRSFDSVDAAAS-----------SLK 1791 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 + + S G R + ++ L VS E +Q+ + ++ + P Sbjct: 1792 ELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTS-IPVTPT 1850 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK--NLSVRIGTIA 242 +T + D S + + +Q+ +++ R+ + Sbjct: 1851 SPTVTAEYSTPRKDVVFL----LDGSDGTRSSFPAMRDFVQRVVEKFNIEANRDRVSVVQ 1906 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIG 300 Y+ + ++ R+ L T A+ + ++ S Sbjct: 1907 YSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTGAALQYVRDNVFTASSGSRRL-- 1964 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + +I ++ G ++ + ++ G+ ++ + + L++ Sbjct: 1965 -EGVPQILILLSGG--------RSFDSVDAAASSLKELGVL--TLGIGSRGSDSRELQRI 2013 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQ 388 + + +V D EL +++ +Q Sbjct: 2014 SYEANYALSVADFSELPNVQEQLLASVQ 2041 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 54/186 (29%), Gaps = 16/186 (8%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + +V + A RI + Y+ N E+ Sbjct: 58 SDGTRNGFPAMKDFVQKMVEKLAAA-----ENRDRISVVQYSREPEANFYLNTYTTKEEI 112 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + L Y Y N +S + + + ++ ++ G+ Sbjct: 113 LDAVRGLRHKGGRPLYTGAALQY-VKDNVFTASSGSRRLEGVPQILVLLSGGK------- 164 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G+ + + + L++ + +V+D EL ++ Sbjct: 165 -SFDSVDAAASSLKELGVL--TFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQ 221 Query: 383 ITDKIQ 388 + +Q Sbjct: 222 LLASVQ 227 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/281 (10%), Positives = 87/281 (30%), Gaps = 22/281 (7%) Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 + + L + + S G R + ++ L VS E +Q+ Sbjct: 964 SFDSVDAAASSLKELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLA 1023 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + ++ + P +T + D S + + +Q+ +++ Sbjct: 1024 SVQVTS-IPVTPTSPTVTAEYSTPRKDVVFL----LDGSDGTRSSFPAMRDFVQRVVEKF 1078 Query: 232 K--NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 R+ + Y+ + ++ R+ L T A+ + Sbjct: 1079 NIEANRDRVSVVQYSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTGAALQYVRDN 1138 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ S + + +I ++ G ++ + ++ G+ ++ + Sbjct: 1139 VFTASSGSRRL---EGVPQILILLSGG--------RSFDSVDAAASSLKELGVL--TLGI 1185 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + L++ + + +V D EL +++ +Q Sbjct: 1186 GSRGSDSRELQRISYEANYALSVADFSELPNVQEQLLASVQ 1226 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 32/328 (9%), Positives = 96/328 (29%), Gaps = 33/328 (10%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 +++K + + + + ++ A L Sbjct: 129 TGAALQYVKDNVFTASSGSRRLEGVPQILVLLSGGKSFDSVDAAAS-----------SLK 177 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 + + S G R + ++ L VS E +Q+ + ++ + P Sbjct: 178 ELGVLTFGIGSRGSDSRELQRISYEPSYALSVSDFSELPNVQEQLLASVQVTS-IPVTPT 236 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK--NLSVRIGTIA 242 +T + D S + + +Q+ +++ R+ + Sbjct: 237 STTVTAEYSTPRKDVVFL----LDGSDGTRSSFPAMRDFVQRVVEKFNIEANRDRVSVVQ 292 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIG 300 Y+ + ++ R+ L T A+ + ++ S Sbjct: 293 YSRDAEVHFYLNSYTKKEDILDRVTGLRHKGGRPLYTGAALQYVRDNVFTASSGSRRL-- 350 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + +I ++ G ++ + ++ G+ ++ + + L++ Sbjct: 351 -EGVPQILILLSGG--------RSFDSVDAAASSLKELGVL--TLGIGSRGSDSRELQRI 399 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQ 388 + + +V D EL +++ +Q Sbjct: 400 SYEANYALSVADFSELPNVQEQLLASVQ 427 >gi|320105085|ref|YP_004180676.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319752367|gb|ADV64127.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 323 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 30/209 (14%), Positives = 69/209 (33%), Gaps = 31/209 (14%) Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 S +P + D+ +++ ++S ++ Sbjct: 108 VDVSGSMMSPFGDGTRYDMSMKAIDKFLDS-RRGDAFGLTFFGNNYLHW----------V 156 Query: 254 PLSNNLNEVKSRLNKLNP------YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 PL+++++ +K + P T A+ R L +E + Sbjct: 157 PLTSDVSAIKCAPPFMKPEVAPLWMSGTEIGKALLGCRRTLVERQEG----------DRA 206 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 +I I+DG S N ++ ++ G+ +Y++ + ++ C+ + G Sbjct: 207 IILISDGA----SFDLGGGNDEEVARLLKRDGIVVYAIHIDETEIPDPIVTICSITGGDA 262 Query: 368 FAVNDSRELLESFDKITDKIQEQSVRIAP 396 FA +D L F +I + + P Sbjct: 263 FAPDDPSALEAIFKRIDQMTPTRLEKTRP 291 >gi|17231852|ref|NP_488400.1| hypothetical protein alr4360 [Nostoc sp. PCC 7120] gi|17133496|dbj|BAB76059.1| alr4360 [Nostoc sp. PCC 7120] Length = 427 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 54/176 (30%), Gaps = 26/176 (14%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHHAYRELYNEKESS 295 RI +A+ + N +K+++ KL T + EL + Sbjct: 78 RISVVAFAGSATVIIPNQIVENPESIKTQIRKKLQASGGTVIAEGLQQGITELMKGTRGA 137 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQ-------NTLNTLQICEYMRNAGMKIYSVAVS 348 +TDG + ++ L+ + + I ++ Sbjct: 138 ---------VSQAFLLTDGHGEDSLKIWKWEIGPDDSRRCLEFAKKAAKINLTINTLGFG 188 Query: 349 APPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQE-------QSVRIAP 396 QDLL D G + + + F+++ ++Q ++ +AP Sbjct: 189 NNWN-QDLLETIADAGGGTLAHIERPEQAVHHFNRLFTRVQSVGLTNAYLTLSLAP 243 >gi|256618671|ref|ZP_05475517.1| von Willebrand factor [Enterococcus faecalis ATCC 4200] gi|256598198|gb|EEU17374.1| von Willebrand factor [Enterococcus faecalis ATCC 4200] Length = 1154 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 202 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 259 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 260 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 315 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 316 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 375 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 376 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 435 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 436 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 485 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 486 TFIATIGEAMALKQRGIEIHGLGIQLQSDSRANLSKQQVEDKMREMVSADENGDLYYESA 545 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 546 DYAPDISDYLAKKAVQISGTVV 567 >gi|118093660|ref|XP_001235190.1| PREDICTED: similar to Bardet-Biedl syndrome 5 [Gallus gallus] Length = 1556 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 27/197 (13%), Positives = 66/197 (33%), Gaps = 25/197 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + +++ + LV+ + + RIG + Y++ + Sbjct: 752 SSESVGPENFEIIKDFVTALVDRVT-----VGRNATRIGLVLYSLEVQLEFGLNKHTTQQ 806 Query: 261 EVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +VK + K+ T T A+ A +E + + ++K I +TDG+ Sbjct: 807 DVKRAIRKMQYMGEGTYTGTAIRKATQEGFLGARTG--------VRKVAIVLTDGQADKR 858 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD-----LLRKCTDSSG--QFFAVND 372 A + + A +++Y++ + + L + ++D Sbjct: 859 EA----VKLDIVVREAHAANIEMYAIGIVNTSDPTQAEFVHELNLIASDPDREHMYLIDD 914 Query: 373 SRELLESFDKITDKIQE 389 L K+ ++ E Sbjct: 915 FNTLPALESKLVNQFCE 931 >gi|52138687|ref|NP_001004392.1| collagen alpha-1(XX) chain [Gallus gallus] gi|14280020|gb|AAK58847.1| collagen type XX alpha 1 precursor [Gallus gallus] Length = 1472 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 55/167 (32%), Gaps = 21/167 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + ++ E L++ A +R+G Y+ Sbjct: 246 VDGSWSIGRSNFKLIKEFLSALISPFNIA-----QDKIRVGLSQYSSDPRTEWDLSAYAT 300 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 ++V + L NT T A+ H + + +K VI +TDG++ Sbjct: 301 RDQVLEAVRNLRYKGGNTFTGLALTHVLEQ-----NLKPDAGARLEAEKLVILLTDGKSQ 355 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 + + ++N G++I+++ V + L++ Sbjct: 356 D--------DANLAAQTLKNMGIEIFAIGV--KNADEAELKQVASEP 392 >gi|331694297|ref|YP_004330536.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326948986|gb|AEA22683.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 332 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 35/198 (17%), Positives = 71/198 (35%), Gaps = 42/198 (21%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 S A A +++ +A V A VRIG +A+ + Q + Sbjct: 96 SGSMAATDIAPTRLEAAKAAARGFVQRQPAA--------VRIGIVAFGATGLVTQQP--T 145 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYREL---------YNEKESSHNTIGSTRLKK- 306 ++ V + +++L+P T + A + + + Sbjct: 146 SDRASVVAAIDRLSPQGGTALGGGLQTALGAIVGKPVVVPGSDPGGGPEPSGPDLGYHGS 205 Query: 307 -FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------------- 352 V+ +TDGEN+ + LQ+ + AG+K+Y + + +P Sbjct: 206 AAVVLLTDGENTA------QPDPLQVADIASTAGVKVYPIGLGSPAGTVLQIDGFQIATR 259 Query: 353 -GQDLLRKCTD-SSGQFF 368 + LL++ D + G++F Sbjct: 260 LDEPLLQQIADRTDGRYF 277 >gi|217973614|ref|YP_002358365.1| LPXTG-motif cell wall anchor domain-containing protein [Shewanella baltica OS223] gi|217498749|gb|ACK46942.1| LPXTG-motif cell wall anchor domain protein [Shewanella baltica OS223] Length = 772 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 53/163 (32%), Gaps = 12/163 (7%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 ++ I + S+NL+ + +++L T A+ A Sbjct: 425 PEDSFNIIEFNSSLSQFSATSLPATSSNLSRARQFVSRLQADGGTEMALALDAAL----- 479 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 K + + + + VIF+TDG A + + ++++V + + Sbjct: 480 PKSLGSVSPDAVQPLRQVIFMTDGSVGNEQALFDLIRYQIGES-------RLFTVGIGSA 532 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 P + R G F + E+ + KIQ + Sbjct: 533 PNSHFMQRAAELGRGTFTYIGKVDEVDAKISALLSKIQYPVLT 575 >gi|157273368|gb|ABV27267.1| von Willebrand factor type A [Candidatus Chloracidobacterium thermophilum] Length = 324 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 28/253 (11%), Positives = 74/253 (29%), Gaps = 38/253 (15%) Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK-KSFWSKNTTKSKYAPAPAPANRKID 211 VLDV+ +D N + + S + + Sbjct: 54 VLDVAVFDQDNRFVGDLRKENFRVYDEQVEQQIEYFSRDEAPVSLGFVVDTSGSMRPRRA 113 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 +IE+ L + + + + + + + +++ ++ + Sbjct: 114 KVIEAVKFLARAAKPGDE--------FFLVDFKNKAELAEE--FTPRPADIEEAVDNIVW 163 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T A+ + E ++ K ++ +DG++ + + Q+ Sbjct: 164 GGGTALLDAIQLSAEYADKEGKNRR---------KAIVVFSDGDDRDSYYDR-----RQL 209 Query: 332 CEYMRNAGMKIYSVAV------------SAPPEGQDLLRKCT-DSSGQFFAVNDSRELLE 378 + ++ +++Y V S L++ ++ G+ F EL E Sbjct: 210 IKLLQEYQVQVYIVGFPDDDDDGGLFGRSTRKRAVQLIKDIANETGGRAFFPKSVDELPE 269 Query: 379 SFDKITDKIQEQS 391 I ++ Q Sbjct: 270 IVRTINADLRTQY 282 >gi|118090156|ref|XP_420539.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 606 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 73/204 (35%), Gaps = 24/204 (11%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEK------KNLSVRIGTIAYNIGIVGNQCTPLSN 257 K + ++ + + + + + +S ++ + PL+ Sbjct: 157 YFVLDKSGSVAQNWHEIFDFVNQLDGSGFVRLTERFVSPKMRLSFIVFSSQAHVIMPLTG 216 Query: 258 NLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + ++K L L+ P +T + + A ++ G++R +I +TDG Sbjct: 217 DREKIKEGLKNLSEVKPAGDTYIHEGLKQANMQIE--------KQGASRFSSIIIALTDG 268 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN-DS 373 + G Q L + + R G ++Y V V Q L + D+ Q F V Sbjct: 269 KLDG----QIPLYAEKEAKISRELGARVYCVGVL--DFVQAQLERIADTKEQVFPVTGGF 322 Query: 374 RELLESFDKITDKIQEQSVRIAPN 397 + L + + + + + + P+ Sbjct: 323 QALKGIINSVLKQSCTEILYLEPS 346 >gi|167518794|ref|XP_001743737.1| hypothetical protein [Monosiga brevicollis MX1] gi|163777699|gb|EDQ91315.1| predicted protein [Monosiga brevicollis MX1] Length = 874 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 21/157 (13%), Positives = 48/157 (30%), Gaps = 16/157 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAY 285 + ++R+ + ++ + + +V++ + N T T A+ Sbjct: 302 DVGLAAIRVAGMMFHAEALPQFDFDDYTSAAQVQNAVANFNYPVNENWGTATGNALDSIR 361 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 L V FITDG + ++ + +R G ++ ++ Sbjct: 362 TNLLQASAGYRGGEV------VVYFITDGVSQ-----ESPSVVESAAQALRATGAQVMAI 410 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + L S+ V D L E+ Sbjct: 411 GI-TDQIDETQLEVIAGSADNVITVADFANLNEAVRD 446 >gi|52548946|gb|AAU82795.1| conserved hypothetical protein [uncultured archaeon GZfos1C11] Length = 438 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 62/167 (37%), Gaps = 11/167 (6%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + R+G + +N G + L + N+ ++K + +++ T M A E Sbjct: 230 DHLEDDDRLGLVLFNTGAELAEPVSLVGAKNMQKLKGDVLEISATGGTRLSAGMQMA-TE 288 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 LY+E + + + +IF+TD + + +L + N + + + Sbjct: 289 LYDEF----LEVNQSEYENRIIFLTDAMPNSGQTSEESLLGMIEANA--NKNVYTTFIGI 342 Query: 348 SAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +L+ T +++V+ + + E D + + V Sbjct: 343 GVD-FNTELVEYITKIRGANYYSVHSATQFKERMDDEFEYMVTPLVF 388 >gi|291398577|ref|XP_002715569.1| PREDICTED: Epithelial chloride channel protein-like [Oryctolagus cuniculus] Length = 958 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 19/145 (13%), Positives = 50/145 (34%), Gaps = 23/145 (15%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKESSHN 297 + + + + + L T+ + ++ + +++ Sbjct: 403 MVTFESTAKIQNNLTKITDDDTYQKITANLPQVAGGGTSICSGLKAGFQAITYSNQNT-- 460 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 ++ +TDGE++G + E ++ +G I+++A+ P L Sbjct: 461 ------SGSEIVLLTDGEDNGIHSCF---------EEVKQSGAIIHTIALG--PSAAKEL 503 Query: 358 RKCTD-SSGQFFAVN-DSRELLESF 380 + + G F N D L+++F Sbjct: 504 EILSSMTGGYRFYANKDINGLIDAF 528 >gi|260463262|ref|ZP_05811463.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259030852|gb|EEW32127.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 644 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 50/170 (29%), Gaps = 17/170 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MT + + + ID ++ R +ALDAA ++ IV + Sbjct: 16 MTVLAMVPLMGALAIGIDYTEMVRERQNALNALDAAGIATAQQIV-------AGATDAEA 68 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 K + +L + +T NN + L Sbjct: 69 IAYAKNFFEANLAHIDPANT---------TLAVTLPNNNTGGGTLKLCGTLTYKPYFLPT 119 Query: 121 KGLI-PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 ++ N + + + + +VLD S SM++L + Sbjct: 120 AKILAGGTSGNATTMAFNTCSEVRLKNTLEVSLVLDNSGSMKELGKGSNK 169 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 17/172 (9%), Positives = 38/172 (22%), Gaps = 1/172 (0%) Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 V P + + + + + Sbjct: 320 VESRPYPYNINDTPAATGTPATLFVPMFAPDETDLTDSNSRPANNNWRTDVTSNSSSAIR 379 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + + + T I + + VK+ ++ + Sbjct: 380 QRFMPKYFADPGSTTVTPSYGMDAGPNTSCSTTPIKPLTDVSTTAGASAVKTAIDAMAAD 439 Query: 273 ENTNTYPAMHHAYRELYNEKE-SSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 TN M +R L + + K +I +TDG N+ + Sbjct: 440 GATNVPEGMAWGWRTLSSTAPFTEGRPETERGNDKVLIVLTDGANTYYTPDS 491 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 29/316 (9%), Positives = 70/316 (22%), Gaps = 41/316 (12%) Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 T + T+L+ ++ + S R M + + Sbjct: 334 AATGTPATLFVPMFAPDETDLTDSNSRPANNNWRTDVTSNSSSAIRQRFMPKYF---ADP 390 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + + Y + P S + A + + + + E Sbjct: 391 GSTTVTPSYGMDAGPNTSCSTTPIKPLTDVSTTAG----ASAVKTAIDAMAADGATNVPE 446 Query: 231 KKNLSVRIGTIAYNI--------GIVGNQCTPLSNNLNEVKS--RLNKLNPYENTNTYPA 280 R + L++ N + + Y A Sbjct: 447 GMAWGWRTLSSTAPFTEGRPETERGNDKVLIVLTDGANTYYTPDSVIAQTYSGTNYNYGA 506 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITD------GENSGASAYQNTLNTLQICEY 334 A + + + T ++ + +C Sbjct: 507 NDLAGNKAIYSALGYVTPYSNGYSYGRMFLGTSSSVIKSDYSNANYTKAMNEHFTTLCNN 566 Query: 335 MRNAGMKIYSVAVSAPPEG------QDLLRKCTDSSGQFFAVNDSRE------------L 376 + A + + ++A+ D L+ C+ S D + L Sbjct: 567 AKAANVMVMTIALDLDATNTAEKTQMDALKACSSDSRFSKDPTDPSKPMKLFWNSTGATL 626 Query: 377 LESFDKITDKIQEQSV 392 F I +++ + Sbjct: 627 SNDFKAIGNELSNLRI 642 >gi|225352478|ref|ZP_03743501.1| hypothetical protein BIFPSEUDO_04100 [Bifidobacterium pseudocatenulatum DSM 20438] gi|225156985|gb|EEG70354.1| hypothetical protein BIFPSEUDO_04100 [Bifidobacterium pseudocatenulatum DSM 20438] Length = 810 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 43/342 (12%), Positives = 90/342 (26%), Gaps = 31/342 (9%) Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIP 125 +++ + + I G I +D NP + + G Sbjct: 15 SFVQRMVAALAAIAMLGGLGYVTTSSAIAEDDQNPSGDT-STLQPANSKSIAKIEGGDGD 73 Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP 185 +L+ + + I +VLD S SM++ + N LL Sbjct: 74 QYALHLTASGDSSSSTVTTAVPADIVLVLDKSGSMKNSNRDTNAKNAATALASKLLTAAN 133 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + + A + + + + N Sbjct: 134 AAL-----PAEQQVQMAVVTFSDRA--------RTTSQFTTSPGAIGTAVSAWPNGGTNW 180 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENT----NTYPAMHH-----AYRELYNEKESSH 296 LS+ + V+ + L+ T + + + Sbjct: 181 EDALKTANDLSSGRSGVQKHIVFLSDGNPTFRITSYSGCFKWSGFGQGWESHPEYGTQAE 240 Query: 297 NTIGSTRLKKFVIFITDGENSGASA------YQNTLNTLQICEYMRNAG-MKIYSVAVSA 349 + L + + TD + S + N G +Y V VSA Sbjct: 241 CEANNKWLDQSYQWKTDPDGSDGNIHGEGDDDSYGYNYAAALSEANERGNAALYVVKVSA 300 Query: 350 PPEG-QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 DL ++ +G+ + + L ++F++I + I Sbjct: 301 DANKMSDLAKEANAVTGKEYDGTSAENLTKAFEQIYNTITTT 342 >gi|89068992|ref|ZP_01156373.1| Putative membrane protein with von Willebrand (VWA) domain [Oceanicola granulosus HTCC2516] gi|89045361|gb|EAR51426.1| Putative membrane protein with von Willebrand (VWA) domain [Oceanicola granulosus HTCC2516] Length = 669 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 20/158 (12%), Positives = 46/158 (29%), Gaps = 12/158 (7%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 +AY + + + L +L +TN + AY E + Sbjct: 354 IVAYAGSAGEVLAPTPAGERATILAALERLAAGGSTNGAGGLEQAYATAEAMTEDGEVSR 413 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 ++ TDG+ + + + L R++G + + ++ Sbjct: 414 --------ILLATDGDFNVGLSDPSAL--EDFIADKRDSGTYLSVLGFGRGNLDDATMQA 463 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + L E+ + D++ IA + Sbjct: 464 LAQNGNG--TAAYIDTLHEAQKVLVDQLTGALFPIADD 499 >gi|193216292|ref|YP_001997491.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] gi|193089769|gb|ACF15044.1| von Willebrand factor type A [Chloroherpeton thalassium ATCC 35110] Length = 346 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 60/171 (35%), Gaps = 45/171 (26%) Query: 254 PLSNNLNEVKSRLN----KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 P++++ + +K ++ P + TN A+ + R L +E + + K ++ Sbjct: 144 PITSDKSALKLFMDIVSTDAIPTQGTNFSSAIRESIRALERIEEGAEAEEKNRVRNKVIL 203 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS--------------------- 348 +DGE+ A ++ E + ++IY+V V Sbjct: 204 IFSDGEDHEAGID-------EVLEEAASKNIRIYTVGVGSAEPTPIPVLNKDGKRVDFKR 256 Query: 349 -------APPEGQDLLRKCTD-SSGQFFA-VNDSRELLESFDKITDKIQEQ 390 + LLRK + + G ++ + F+ I D I + Sbjct: 257 DSQGSVVTTHLQEALLRKIAEQTKGNYYRIAPQGSD----FELIADDINKL 303 >gi|29828547|ref|NP_823181.1| hypothetical protein SAV_2005 [Streptomyces avermitilis MA-4680] gi|29605651|dbj|BAC69716.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 420 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 25/192 (13%), Positives = 68/192 (35%), Gaps = 22/192 (11%) Query: 208 RKIDVLIESAGNLVN-SIQKAIQEKKNLSVRIGTIAYNIGI--VGNQCTPLSNNLNEVKS 264 ++ ++ +++ + ++ + L G + + E K+ Sbjct: 59 SRMAAAKQAFNEVLDATPEEVRLGIRTLGANYPGDDRKTGCKDTAQLYPVSTLDRTEAKT 118 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + L+P T PA+ A +L ++ ITDGE++ Sbjct: 119 AVATLSPTGWTPIGPALLKAADDL-----------DGGTGSHRIVLITDGEDT-----CA 162 Query: 325 TLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFD 381 L+ ++ + G+ I ++ + + L + + G + +V +L + + Sbjct: 163 PLDPCEVAREIAAKGVGLTIDTLGLVPNSKLSKQLSCIAEATGGTYTSVEHKEDLTDKVN 222 Query: 382 KITDKIQEQSVR 393 ++ D+ ++ V Sbjct: 223 QLVDRAADKVVT 234 >gi|194386850|dbj|BAG59791.1| unnamed protein product [Homo sapiens] Length = 543 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 19/158 (12%), Positives = 56/158 (35%), Gaps = 13/158 (8%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + + + K + K+ P TN A+ A L S Sbjct: 119 TWRNDLISATKTQVADAKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNS---V 175 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 +I ++DG+ + + + + E +++ + ++S+ + D L++ + Sbjct: 176 SLIILVSDGDPTVGELKLSKIQ-KNVKENIQD-NISLFSLGMGFDV-DYDFLKRL-SNEN 231 Query: 366 Q------FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + S +L + +++++ + P+ Sbjct: 232 HGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYPH 269 >gi|296206127|ref|XP_002750076.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2 [Callithrix jacchus] Length = 946 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 20/157 (12%), Positives = 56/157 (35%), Gaps = 13/157 (8%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + + + K + K+ P TN A+ A L S Sbjct: 357 TWRNDLISATKTQVADAKRYIEKIQPSGGTNINEALLRAIFILNEANNMGLLDPNS---V 413 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 +I ++DG+ + + + + E +R+ + ++S+ + D L++ + Sbjct: 414 SLIILVSDGDPTVGELKLSKIQ-KNVKENIRD-NISLFSLGMGFDV-DYDFLKRL-SNEN 469 Query: 366 Q------FFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + + + S +L + +++++ + P Sbjct: 470 RGIAQRIYGNQDTSSQLRKFYNQVSTPLLRNVQFNYP 506 >gi|198436415|ref|XP_002121394.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 904 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 36/287 (12%), Positives = 71/287 (24%), Gaps = 32/287 (11%) Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV---SRSMEDLYLQKHN 169 + + LI R S+ + D S + N Sbjct: 634 VWDKPTPTCQLIKCPELRPFTRGNYTCTNSNNARSRCSYTCGDDYILRPSTSQTIRCQSN 693 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 N P S + + I+ + + G Sbjct: 694 KNWTHAPPCCARQCPASPKIDIVMVLDSSSSVTEPGWRKMINFVKTALGFY-------EM 746 Query: 230 EKKNLSVRIGTIAYNIGIVG--NQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYR 286 + SV + I + + ++ R+ +L T T A+ +A Sbjct: 747 GPNSTSVSVFRYNAEIDEANKISFQYTQTYGKEQLLRRIGRLPYNGQGTRTGQALSYALH 806 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 L NE V+ +TDG++ E +R G+ Y++A Sbjct: 807 ILTNEI-------NRPDAVDVVLVLTDGKSQD--------AVKAPAEALRRNGVLTYAIA 851 Query: 347 VSAPPE--GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + + L + F + + +I+ Sbjct: 852 IQPERGVLNMNQLNDIAGTPHNLFLLRSG--FSSFTEDFAREIRNSV 896 >gi|126174069|ref|YP_001050218.1| vault protein inter-alpha-trypsin subunit [Shewanella baltica OS155] gi|125997274|gb|ABN61349.1| Vault protein inter-alpha-trypsin domain protein [Shewanella baltica OS155] Length = 771 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 40/259 (15%), Positives = 79/259 (30%), Gaps = 22/259 (8%) Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK---SKYA 200 + + V + + + + ++K S K S Sbjct: 329 QQGTSPMAWVFNQQGKTHKP-DGDNLSQDTLETSKANGVNEDNYSLVMVLPPKVEKSTQP 387 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKA---IQEKKNLSVRIGTIAYNIGIVGNQCTPL-- 255 P ID AG+ + + A + I +N + TPL Sbjct: 388 SLPRELILVIDTSGSMAGDSIVQAKNALLYALKGLKPEDSFNIIEFNSSLSQFSATPLPA 447 Query: 256 -SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 S+NL+ + +++L T A+ A + + + + VIF+TDG Sbjct: 448 TSSNLSRARQFVSRLQADGGTEMALALDAALPKSLGSAP-----SDAVQPLRQVIFMTDG 502 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 A + + ++++V + + P + R G F + Sbjct: 503 SVGNEQALFDLIRYQIGES-------RLFTVGIGSAPNSHFMQRAAELGRGTFTYIGKVD 555 Query: 375 ELLESFDKITDKIQEQSVR 393 E+ E + KIQ + Sbjct: 556 EVGEKISALLSKIQYPLLT 574 >gi|304394408|ref|ZP_07376331.1| von Willebrand factor, type A [Ahrensia sp. R2A130] gi|303293848|gb|EFL88225.1| von Willebrand factor, type A [Ahrensia sp. R2A130] Length = 689 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 26/217 (11%), Positives = 66/217 (30%), Gaps = 22/217 (10%) Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA--PANRKIDVLIESAGNLVNSIQKAI 228 + +S + KI++ + +++ + Sbjct: 10 AAASLTLTQPANAQQTQSSRNVMVVFDGSGSMWGQIEGRAKIEIARDVLSSVLGETTSNM 69 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 + P ++ + E+ +R N + P T A+ A L Sbjct: 70 TIGMIAYGHRKKGQCSDIETVVAPGPAASTVPEMIARANAIKPKGKTPLSDAVRKAAESL 129 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVA 346 + + V+ +TDG + + + + +G+ + V Sbjct: 130 RYTENEAT-----------VVLVTDGIET------CNADPCALATELEESGVDFTTHVVG 172 Query: 347 VS-APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + EG+ + ++ G+F + +D+ EL + D+ Sbjct: 173 FGLSKDEGRQVACLAANTGGKFISADDADELKAALDE 209 >gi|73949158|ref|XP_535195.2| PREDICTED: similar to inter-alpha globulin inhibitor H2 polypeptide [Canis familiaris] Length = 946 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 29/228 (12%), Positives = 79/228 (34%), Gaps = 25/228 (10%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 L P PK + + + S + + ++ + +L Q ++ + N + Sbjct: 299 FAPENLDPIPKNILFVIDVSGSMWGI---KMKQTVEAMKTILDDLRAEDQFSVIDF-NHN 354 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 VR + + K + K+ P TN A+ A L Sbjct: 355 VRTWRNDLVSATRTQ--------VTDAKKYIEKIQPSGGTNINEALLRAIFILNEANNLG 406 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 S +I ++DG+ + + + + +++R+ + ++S+ + D Sbjct: 407 LLDPES---VSLIILVSDGDPTVGELKLSKIQ-KNVKQHIRD-NISLFSLGIGFDV-DYD 460 Query: 356 LLRKCTDSSGQ------FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 L++ + + + + S +L + +++++ + P+ Sbjct: 461 FLKRL-SNENRGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYPH 507 >gi|307719357|ref|YP_003874889.1| batA protein [Spirochaeta thermophila DSM 6192] gi|306533082|gb|ADN02616.1| putative batA protein [Spirochaeta thermophila DSM 6192] Length = 332 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 44/152 (28%), Gaps = 38/152 (25%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T + + L + V+ +TDG+N+ Sbjct: 160 AVRLFSLGDGTALGMGVGTSLLHLSRVN----------ASFRAVVILTDGKNTTG----- 204 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAP----------------------PEGQDLLRKCTD 362 + E R G+ +++V V + ++ LR+ + Sbjct: 205 EILPETAAEMARELGIPLFTVGVGSDRPVSLDVIDPSTGTRYAGVLEEGYDEETLRRIAE 264 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S GQFF+ L F I R Sbjct: 265 ISGGQFFSGYTPTSLHRIFQYIGATATADVRR 296 >gi|258405287|ref|YP_003198029.1| hypothetical protein Dret_1163 [Desulfohalobium retbaense DSM 5692] gi|257797514|gb|ACV68451.1| Protein of unknown function DUF2134, membrane [Desulfohalobium retbaense DSM 5692] Length = 323 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 20/129 (15%), Positives = 40/129 (31%), Gaps = 4/129 (3%) Query: 1 MTAIIISV-CFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQ 59 +TA+ + AID+ +N++Q+ LDAA L+G + + + K+ Sbjct: 21 ITALFVLFSLLATAGIAIDIGRQATAKNELQNTLDAAALAGAIELGQNGPANVKSEAKEA 80 Query: 60 TSTIFKKQIKKHLKQGS---YIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 L ++ ++ NN + A T Sbjct: 81 AENNSIDNNGLILGDNDIKVGNWTEPNFFSKTPYNSVKIMVNNHSINSFFASALNFQQTV 140 Query: 117 NLFLKGLIP 125 + +I Sbjct: 141 SAEATAVIG 149 >gi|198422181|ref|XP_002120553.1| PREDICTED: similar to predicted protein [Ciona intestinalis] Length = 1038 Score = 58.8 bits (140), Expect = 1e-06, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 57/182 (31%), Gaps = 8/182 (4%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + N + L + A+ ++ V + + Sbjct: 193 IVIVIDKSGSMGVTNMNLAKEAAKSVVNTLNPQDRFAVMAFSSIFVPFQSTVASDQCFAT 252 Query: 251 QCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE---KESSHNTIGSTRLKK 306 S N +V+ ++ ++ TN PA+ A+ E + + I + + + Sbjct: 253 TFADASPQNKKKVEDFVDTISSGGGTNYAPALQKAFSFFQQEPSVSDFNIKKIDPSEIDR 312 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 ++F++DG + L+ N + I + + G +LR + G Sbjct: 313 VILFMSDGIPNDP--GSTILSAQIRANEQLNNSVIILTYGLGNADFG--VLRNMATNKGD 368 Query: 367 FF 368 + Sbjct: 369 VY 370 >gi|295394688|ref|ZP_06804906.1| von Willebrand factor [Brevibacterium mcbrellneri ATCC 49030] gi|294972458|gb|EFG48315.1| von Willebrand factor [Brevibacterium mcbrellneri ATCC 49030] Length = 538 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 40/333 (12%), Positives = 84/333 (25%), Gaps = 33/333 (9%) Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 K Q K D + + I + P L Sbjct: 226 AEAFAKDQSKADGIFNYESVLKGMDFGGQPPVLIAPSDG---------VVTADYPLTLLD 276 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 N E VS E + + + + Sbjct: 277 GADDTTKENFNKVTEYLLSDEVQQRITD-ETHRRTKVSNPDEFPTTFETPYPATLDTVQK 335 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE---KKNLSV 236 LL +++ L + G L + K E Sbjct: 336 LLEAWVANGRKPATMFFQIDTSGSMRGE-RLEQLKTALGILSGTSAKNDTERFLAIQPRE 394 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLN------EVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 ++ + ++ + L++N + ++ +++ L T Y + Sbjct: 395 KLKLVEFSHEVKSTDGYRLTDNGSADKVRKDLDTKIQTLTAEGGTAIYSTLQTTLESAKK 454 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 EK T V+ TDG N +++ + + +++V+ Sbjct: 455 EKSDDKITS--------VVVFTDGMNEHGISFRAFKDWYS--NNQDVQDIPVFAVSFG-N 503 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + +L + + G+ F N +L +F I Sbjct: 504 ADSDELQELVSLTGGRVFDGN--ADLTAAFKDI 534 >gi|260834334|ref|XP_002612166.1| hypothetical protein BRAFLDRAFT_88905 [Branchiostoma floridae] gi|229297540|gb|EEN68175.1| hypothetical protein BRAFLDRAFT_88905 [Branchiostoma floridae] Length = 815 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 45/368 (12%), Positives = 107/368 (29%), Gaps = 31/368 (8%) Query: 32 ALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQ-GSYIRENAGDIAQKAQ 90 A DAA G +T K+ K L++ + K Sbjct: 137 AEDAA---GETGDTDGQTKDTTGESKEAAGETPMDTPKTLLEELRRQPETPSLKETPKTM 193 Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 + K + + E + + T L ++ L S L + + + + Sbjct: 194 MQRLKKPMGIPKSLMEKTLKVLMWTLKLLMRTLKTSMEKTLRVLMWTLKPLLEKTPETLM 253 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN--- 207 + + + + +N S + AP + Sbjct: 254 ETLKPLMEKTLKTLIGTLTPLMKALKITNSVRGAGAHGLSVQNPRGSGSSTCEAPVDLFF 313 Query: 208 --RKIDVLIESAGNLVNSIQKAI---QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + V + + + R+G + Y+ N Sbjct: 314 LLDGSGSVKAANFAKVKQFAVDMVNSFDVSPAATRVGVLQYSNRNTLVFNLGNKVNKPTT 373 Query: 263 KSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 S +N ++ T T A+ + + + K +I +TDG++ + + Sbjct: 374 VSAINSISYQGGGTRTGAALQYI---------RGNAAWRRGNVPKVLIVLTDGKSEDSVS 424 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + + + + +++Y++ V + + ++LL+ + +ND L D Sbjct: 425 GPS--------QNLVSDRVEVYAIGV-SNFDHEELLQIVNNKQSNVIELNDFNALATKID 475 Query: 382 KITDKIQE 389 +I + Sbjct: 476 EIAQDVCS 483 >gi|253701737|ref|YP_003022926.1| hypothetical protein GM21_3141 [Geobacter sp. M21] gi|251776587|gb|ACT19168.1| conserved hypothetical protein [Geobacter sp. M21] Length = 383 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 26/279 (9%), Positives = 75/279 (26%), Gaps = 8/279 (2%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 I++ V + AID+ ++ +Q + + A L+G S+ ++ + + Sbjct: 8 IMLVVFLVLTGLAIDIGYMYVSDEDLQHSAEMAALTGAESLKKRLLLQAQHSPGKLAQVL 67 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 + + A + +N + + Sbjct: 68 A----DPLQSAARSVAVDTATGKHSASALVALMNDNGNALTENNDITVGFWNMSSRSYTP 123 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN---NMTSNKYL 180 + + + +R+ E SS L V +S + ++ Sbjct: 124 GGTPVNAMQVRARRTAESSSVGLGSLGTFVAKISGTASFGSTPVAVAALVPGTRSNIAIC 183 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + + + + P + + + + L S+ + +S + Sbjct: 184 AAACEPSCGYPDVCSIPERRMSHLPWDPQREN-SSANRYLYTSLLHPVTITNTMSDLVCQ 242 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 + G + + + L + N ++ Sbjct: 243 EMPVQEVCGLPIFTAAMKTDAILRDLKAMMYDPNVDSSN 281 >gi|3982895|gb|AAC83698.1| complement factor Bf-2 [Oncorhynchus mykiss gairdneri] Length = 749 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 32/190 (16%), Positives = 67/190 (35%), Gaps = 13/190 (6%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + + + L+ + + + + + + + + L +V + L+ Sbjct: 253 DQFNKARNAVKKLITKVSSFAVSPNYEILFLASDVFEVVNILDFLGEKRKTLEDVLADLD 312 Query: 268 KLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 N + TN A + + N + +IF TDG + Sbjct: 313 NFNYGDRQNVGTNLNLAFKTILERM--AIQKQRNETLFKEVHHVLIFFTDGAFNMGGRPD 370 Query: 324 NTLNTLQIC-----EYMRNAGMKIYSVAVSAPPEGQDL--LRKCTDSSGQFFAVNDSREL 376 +T+ ++ + R+ + IY V +D+ L D+ +F + D EL Sbjct: 371 DTVAKVREMVYMNQKEERDKYLDIYVFGVGREIFDEDIQPLVTKRDNEDHYFKLKDGTEL 430 Query: 377 LESFDKITDK 386 E+FDKI D+ Sbjct: 431 EETFDKIIDE 440 >gi|24375866|ref|NP_719909.1| von Willebrand factor type A domain-containing protein [Shewanella oneidensis MR-1] gi|24350833|gb|AAN57353.1|AE015872_4 von Willebrand factor type A domain protein [Shewanella oneidensis MR-1] Length = 451 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 55/156 (35%), Gaps = 13/156 (8%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYRELYNEKESSHNT 298 IAY+ N NE+ +N + P +T + + E+ + + Sbjct: 110 VIAYSDNAYLIIPATKVKNKNEMIKIINDTIKPGGSTALFAGVSKGITEVNKFIKKNQVN 169 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 +I ++DG+ + + L L + G+ + ++ + +DL+ Sbjct: 170 R--------IILLSDGQANIGPSTTKELADLG--QVAGKQGIAVTTIGLG-NGYNEDLMT 218 Query: 359 KCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G V +S +L +F + + + + Sbjct: 219 ALAGFSDGNHAYVENSADLETAFVREFNDVMSVVAQ 254 >gi|260813584|ref|XP_002601497.1| hypothetical protein BRAFLDRAFT_134626 [Branchiostoma floridae] gi|229286794|gb|EEN57509.1| hypothetical protein BRAFLDRAFT_134626 [Branchiostoma floridae] Length = 260 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 54/161 (33%), Gaps = 16/161 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYREL 288 + + ++G I Y+ + + + +N T T A+ + + Sbjct: 34 DIGPTATQVGIIQYSTRPRQEFSMNSFQTKESLSTAIENVNYMAGGTLTGRAIRYVTKYG 93 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + E + + + K VI +TDG ++ + Q + G+ +Y++ V Sbjct: 94 FGESDGAR-----PGIPKIVILVTDGV--------SSDDIEQPALEAQQKGISLYAIGV- 139 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 D L + ++ + L + + I + Sbjct: 140 -SGYDMDQLERIASNNRTLAVAENFNLLDSLRNTLLTGICD 179 >gi|189526999|ref|XP_691588.2| PREDICTED: anthrax toxin receptor 1 [Danio rerio] Length = 554 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 31/187 (16%), Positives = 60/187 (32%), Gaps = 21/187 (11%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + + + ++ ++ + +R+ I ++ L+ N +++ Sbjct: 44 YFVLDKSGSIQNHWIEIYSFVEHLAEKFTSPMLRMSFIVFSTRGT--TIMRLTENRDDIT 101 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 LN L P +T + A + G+ +I +TDGE + Sbjct: 102 RGLNTLKREIPGGDTYMNLGLEEA---------NVQIYHGNYGAASVIIALTDGELNDHQ 152 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV-NDSRELLES 379 Q + R+ G +Y V V + L D+ F V + L Sbjct: 153 FVT----AQQEAQRARSMGAIVYCVGV--KDFNETQLATIADTIEHVFPVIGGFQALEGM 206 Query: 380 FDKITDK 386 D I K Sbjct: 207 IDSIIKK 213 >gi|311747444|ref|ZP_07721229.1| putative BatB protein [Algoriphagus sp. PR1] gi|126574803|gb|EAZ79174.1| putative BatB protein [Algoriphagus sp. PR1] Length = 321 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 29/175 (16%), Positives = 62/175 (35%), Gaps = 46/175 (26%) Query: 254 PLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + + ++ ++ LN P T+ + A N+ K VI Sbjct: 133 PLTFDQSVLQLYIDGLNTGLVPNFGTDLNAPLRIALDRFQND-------ESQEVKSKSVI 185 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------ 351 I+DGEN G I ++N G+K++++ + Sbjct: 186 LISDGENFGD-------ELENIGSELKNLGVKVFALGIGTESGSTIPRGNGIVMDPQTGE 238 Query: 352 ------EGQDLLRKCTDSSGQFFAVND----SRELLESFDKITDKIQEQSVRIAP 396 + + L + ++ GQ+F ++D +L++ +++ I A Sbjct: 239 PAQTVLDKRPLQQIAAETDGQYFEISDEVQEVADLIKRLERLEGGITGSRTVEAS 293 >gi|114321541|ref|YP_743224.1| von Willebrand factor, type A [Alkalilimnicola ehrlichii MLHE-1] gi|114227935|gb|ABI57734.1| von Willebrand factor, type A [Alkalilimnicola ehrlichii MLHE-1] Length = 972 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 32/255 (12%), Positives = 76/255 (29%), Gaps = 21/255 (8%) Query: 129 TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 + + + + V+ + D ++ D + L Sbjct: 269 VQPTTDDPQEGNETVQPDRTRYTALEAVAPTAADNWVVTQLDQMDHGCRDELEIVWMDDD 328 Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 + + + AP + LV+ ++ VR A + + Sbjct: 329 LEISLIVDTSGSMSGAPIIN----ARTAGRTLVDVVEPGRTAMGV--VRFSASASVVHPM 382 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P + +++K ++ L T + + EL + T + Sbjct: 383 IAIPDPGTAEKDQLKDAIDSLPASGLTAMFDGLILGLDELQDYS-----AANDTDAGQVA 437 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD-LLRKCTDSSGQF 367 ++DG ++ + Q + ++A + I +A +LR+ D++G Sbjct: 438 FLLSDGGDN-----SSAATEPQTVQAYQDANVPI--IAFGYGSFAPTGVLRRLADNTGGE 490 Query: 368 FAV--NDSRELLESF 380 F E+ E+F Sbjct: 491 FFASPTTLAEIQEAF 505 >gi|183602734|ref|ZP_02964097.1| hypothetical protein BIFLAC_00845 [Bifidobacterium animalis subsp. lactis HN019] gi|183217972|gb|EDT88620.1| hypothetical protein BIFLAC_00845 [Bifidobacterium animalis subsp. lactis HN019] Length = 839 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/397 (9%), Positives = 106/397 (26%), Gaps = 60/397 (15%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + T Q +T + + + G + D++ Sbjct: 2 ASASVAAFADDRQPAATADPQAATASAGNVDAPQHTKRISKNDDGTYTLSMDVTGKSDES 61 Query: 99 NPLQYI-AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 Q + + ++ L G + L + ++ + +S Sbjct: 62 TEQQVVPLDIALVLDVSGSMNELSGKLVYNEVELLSMNPISTYYVEKDGSYQAVRCSAIS 121 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 Q + + Y P + + ++ + ++D L ++ Sbjct: 122 -WGRCTTWQDQDSAGQKYTVTYNWIGGP----SASVSPDVQFYKSKQSEETRLDALKDAV 176 Query: 218 GNLVNSIQKAI----------------------QEKKNLSVRIGTIAYNIGIVGNQCTPL 255 ++ ++ + N + YN + Sbjct: 177 TYFLDQVEDQNQRINDPGKKVQVALIKYAGKNSDKIGNDTYNEDGYNYNYSQTVHSLAWT 236 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 +L + ++ +N L T + HA ++L + + + +K +F +DG Sbjct: 237 PEDLQKEQAAVNSLKAGGATRADFGLQHAVKQLNSGRPGA---------QKLTVFYSDGS 287 Query: 316 NS--GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP------PEGQDLLRKCTDS---- 363 + + N ++ ++N ++ S+ + + + Sbjct: 288 PTSSDGFEAKIANNAIKAAAQLKNDHSQVISIGAMPGADPSGTDNANKFMNYVSSNYPKA 347 Query: 364 ----------SGQFFAVNDS-RELLESFDKITDKIQE 389 G ++ + +L F +I + Sbjct: 348 QSMSEPHDRVEGTYYYAVSARTDLQTIFKEIISIVTS 384 >gi|197117451|ref|YP_002137878.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] gi|197086811|gb|ACH38082.1| VWFA superfamily protein [Geobacter bemidjiensis Bem] Length = 357 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 18/141 (12%), Positives = 47/141 (33%), Gaps = 11/141 (7%) Query: 17 IDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGS 76 +D ++ ++ +A+DAA ++ D K++ S + + Sbjct: 20 VDAGRAYGVKAKLHAAVDAASYEAAKALAQGEDEDD---MKEKASEAAFDYFRANFPSDY 76 Query: 77 YIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST 136 + + +G +++ K + + A+ G++ +L RS Sbjct: 77 FGAQCSGPEL---ELSERKSEQKMRALSVSATARL-----PNIFAGILGWKSIDLPARSR 128 Query: 137 GIIERSSENLAISICMVLDVS 157 + + + L + L S Sbjct: 129 AVRKDADVVLVLESSDALRDS 149 >gi|299535615|ref|ZP_07048936.1| BatA [Lysinibacillus fusiformis ZC1] gi|298728815|gb|EFI69369.1| BatA [Lysinibacillus fusiformis ZC1] Length = 972 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 41/330 (12%), Positives = 96/330 (29%), Gaps = 30/330 (9%) Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP--LQYIAESKAQYEIPTENLF 119 FKK+++ + +G + Y K +Y+ + Sbjct: 563 EAFKKRVEALKESADAASRPSGKDRYTKVMVTLVRPGGEPITDYQGTVKIKYDGVEKTAS 622 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 L N T + S S V+ + + Sbjct: 623 FITNTSDPLNNTGNPGTAVAYFDSVIYGKSKVEARLVNPIDPRYATSLKGLKDKTVTKDI 682 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 P K+ S T + + ++ +K I+ L + Sbjct: 683 FTNPYFSKNSCSLATEIAYVVDYSSSM--------KAVDPTNYRGKKMIELINQLKAKNN 734 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + + N+ + K T+ + + A + Sbjct: 735 IVIETNTKATVLGEGTTENVLK-KDLYKASKEKGATDIFAGIDIALTKFS---------- 783 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP--EGQDLL 357 ++ K ++ ++DG+ + ++ + G+K+Y+V++ L+ Sbjct: 784 NDSKTSKAIVVVSDGK-------TSKSKMTKVINEAKKQGVKVYTVSMGKKSQVNDATLM 836 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + +++SG +F D+ +L + F K+ D I Sbjct: 837 QLSSETSGAYFHAIDNMQLHQVFQKLIDTI 866 >gi|284036687|ref|YP_003386617.1| von Willebrand factor A [Spirosoma linguale DSM 74] gi|283815980|gb|ADB37818.1| von Willebrand factor type A [Spirosoma linguale DSM 74] Length = 316 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 42/280 (15%), Positives = 83/280 (29%), Gaps = 34/280 (12%) Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 L + L + + + V+D + + + N +N Sbjct: 12 LPVASLNAKPSAAATSSLSTGSTGVEKLTFSVDVFVVDKNGRLITGLKPSNFKIVNTVTN 71 Query: 178 KYLLPPPPKKSFWSKNTTKS--------KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 Y L + SK S P N +I+ NL Sbjct: 72 FYELIDVSTSNLISKPGGYSAMLLLDQTGSISTTDPYNLRIEASKIFLNNL--------- 122 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN--KLNPYENTNTYPAMHHAYRE 287 + + + + V + +N ++K L+ LN T Y + + Sbjct: 123 -GTDDYTGLTSFTSSYTSVVKLHSGFTNKTEQMKKSLDTLALNVSGGTPLYTSTIQSVT- 180 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + K VI TDGEN+ + I + +++V + Sbjct: 181 --------YTAQKGPTANKAVIVFTDGENNVTTNTLEDATAKAI-----QQKIPLFTVGL 227 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 S L + ++ G FF D+ +L+ +F + + + Sbjct: 228 STDVNVNVLAQMANETGGAFFYAKDAGQLISTFGTLGNLL 267 >gi|146304257|ref|YP_001191573.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] gi|145702507|gb|ABP95649.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] Length = 383 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 22/122 (18%), Positives = 46/122 (37%), Gaps = 19/122 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + ++ + ++ L+ T + A+ A+ + +VI +TDG + Sbjct: 92 DPEDLTAEISSLSAGGQTAFFTALLTAFNL-----------HNKHGIPSYVILLTDGNPT 140 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSREL 376 + N G++ S + + +L+ D S G F+ VND+ E+ Sbjct: 141 ------DDTNVETYKRIAIPNGVQTISFGLG-DDYNETILKSLADRSGGVFYHVNDAMEI 193 Query: 377 LE 378 E Sbjct: 194 PE 195 >gi|94499792|ref|ZP_01306328.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65] gi|94427993|gb|EAT12967.1| hypothetical protein RED65_14762 [Oceanobacter sp. RED65] Length = 731 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/158 (15%), Positives = 62/158 (39%), Gaps = 19/158 (12%) Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 Y + G N+ E+ +L+ + P T+ + A L ++ S+ Sbjct: 394 DYASNLTGGFLPATQKNIAEIIRKLDLVLPNGGTHLMDGVRFALSGLDADRTSA------ 447 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 + +TDG + Q + + ++ +++++ + + LL+ T Sbjct: 448 ------IWLVTDGVTNVGETKQ-----RKFVDLLKQKDIRVFTF-IMGNGANRPLLKAIT 495 Query: 362 -DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 S+G V++S +++ +K K+ +++R R Sbjct: 496 KASNGFAINVSNSDDIIGQLEKAASKVTHEALRDIKVR 533 >gi|55962354|emb|CAI11851.1| novel protein (zgc:56119) [Danio rerio] gi|56207241|emb|CAI21014.1| novel protein (zgc:56119) [Danio rerio] Length = 946 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 24/172 (13%), Positives = 56/172 (32%), Gaps = 15/172 (8%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + I + S ++E K + + P TN A+ A + Sbjct: 334 DLSIDDYFSIIDFNHNVRCWSEDLVQASSIQVDEAKKYIQNIKPNGGTNINEALLRAIQM 393 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L SH+ + R +I ++DG+ + +T+ + M+ ++S+ + Sbjct: 394 LIKA---SHHGLIDPRSVSMIILVSDGDPTVGEIKLSTIQ-KNVKLRMKEE-FSLFSLGI 448 Query: 348 SAPPEGQDLLRKCTDSSGQ-----FFAVNDSRELLESFDKITDKIQEQSVRI 394 D L + + + N + +L + ++ +R Sbjct: 449 GFDV-DFDFLERIAMDNRGIAQRIYANQNAAEQLKTFYS----QVSSPLLRT 495 >gi|220933243|ref|YP_002512142.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] gi|219994553|gb|ACL71155.1| von Willebrand factor type A [Thioalkalivibrio sp. HL-EbGR7] Length = 325 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 31/145 (21%), Positives = 53/145 (36%), Gaps = 23/145 (15%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 PL+ + V S+L +L ++T + A R+L Sbjct: 153 ATLVPLTGDRELVASQLARLRAGMLGDDTAIGDGIALALRQLQASGAERRPA-------- 204 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS-----APPEGQDLLRKCT 361 +I +DGE++ L + R AG+ +Y+V ++ AP EG+ L Sbjct: 205 -LILFSDGESNAGL-----LRPSEALALARAAGVALYTVEITGGQALAPVEGEPSLADMA 258 Query: 362 -DSSGQFFAVNDSRELLESFDKITD 385 + G+ F V S +L I Sbjct: 259 ETTGGRHFHVTRSADLEAVIATIDR 283 >gi|1353722|gb|AAB01767.1| unknown [Naegleria fowleri] Length = 357 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 14/144 (9%), Positives = 45/144 (31%), Gaps = 12/144 (8%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + K ++ TN + R + + ++ TDG + Sbjct: 1 GKQKAKQVAKNIHAGTCTNLSGGLFEGLRLIK--------QRTTCNEITSILLFTDGLAN 52 Query: 318 GASAYQNTLNTL---QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 + + + I E +R + ++ + + L +G ++ +N+ Sbjct: 53 EGITNTSEIVSKMNTTIHEEIRKQ-ITCFTFGFGSDTDANMLTSIAQAGNGLYYFLNNVD 111 Query: 375 ELLESFDKITDKIQEQSVRIAPNR 398 ++ ++F + + + + Sbjct: 112 DIPKAFGNVIGGLVSVVAQNIKVK 135 >gi|328882566|emb|CCA55805.1| putative exported protein [Streptomyces venezuelae ATCC 10712] Length = 543 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 37/274 (13%), Positives = 77/274 (28%), Gaps = 25/274 (9%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 A Y L L A + LD +R+ D + Sbjct: 95 TASYGYARRTLAEGRLPDPATVRPEEFVNSFRPDYPRPADNGFSVTLDGARAGSDGWSLV 154 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 +++ PP +F ++D++ ES G L + ++ Sbjct: 155 RVGLATRAADRTGERPPAALTFVVDI-------SGSMAEPGRLDLVKESLGLLADELRDD 207 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 I + ++ + V+ +N L +TN + Y Sbjct: 208 DS--------IALVTFSDEAETRLPMTRVGEARGRVREVVNSLATTSSTNVEAGVRTGYD 259 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + V+ ++D + + + +I E + G+ ++ V Sbjct: 260 VAVDGHRKDATNR--------VVLLSDALANTGAT-EAGAILERIEEERKAYGITLFGVG 310 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 V + + R GQ V+ S + + F Sbjct: 311 VGSDYGDAFMERLADRGDGQTTYVSTSAQARKVF 344 >gi|260825786|ref|XP_002607847.1| hypothetical protein BRAFLDRAFT_199461 [Branchiostoma floridae] gi|229293196|gb|EEN63857.1| hypothetical protein BRAFLDRAFT_199461 [Branchiostoma floridae] Length = 187 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 62/188 (32%), Gaps = 24/188 (12%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + D + E N+VN+ + S R+G + Y+ + Sbjct: 21 SGSVTYANFDKVKEFTENVVNAF-----DISASSTRVGVVQYSTSNTLEFNLGDHADKPS 75 Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + ++ ++ T T A+ A L + K +I +TDG++ + Sbjct: 76 TLAAIDSISYQGGGTRTGSALEFA--RLNAAWRGG-------SVPKVMIVVTDGKSGDSV 126 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 A + + G+ +Y++ V + LL + + D L Sbjct: 127 ASS--------ANDLASQGVDVYAIGVG-NYDATQLLEIAAGNQNNVIELTDFNALSAEI 177 Query: 381 DKITDKIQ 388 ++I + Sbjct: 178 NQIAQTVC 185 >gi|218781309|ref|YP_002432627.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218762693|gb|ACL05159.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 336 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 23/174 (13%), Positives = 56/174 (32%), Gaps = 50/174 (28%) Query: 254 PLSNNLNEVKSRLNKLNPY------ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 PL+ + ++ L++L + T+ A+ + K Sbjct: 144 PLTLDYQAIQMFLDQLTVDLLPLRFQGTDLGAAIEMGMTAFDPKS----------STDKV 193 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--------------- 352 ++ ITDGE++ L+ E + G++I+ + + P Sbjct: 194 ILLITDGEDN-------EEAGLKAAEKASDEGIRIFVLGIGDPAGGPVPSLDGSGFEKDA 246 Query: 353 ---------GQDLLRKCT-DSSGQFFAVNDSR-ELLES-FDKITDKIQEQSVRI 394 + L+ ++ G + +L + F+ I K + + ++ Sbjct: 247 GGKIILSKPDESTLQAIANETGGDYIRSEAGDFDLDQLYFNGIKKKTEAEILKT 300 >gi|296159241|ref|ZP_06842067.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] gi|295890500|gb|EFG70292.1| von Willebrand factor type A [Burkholderia sp. Ch1-1] Length = 345 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 56/160 (35%), Gaps = 28/160 (17%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKE 293 RIG + + PL+ + + V+ L+++ T A+ + + + Sbjct: 140 RIGLVVFGDAAYPQA--PLTLDHDSVRILLDQMQIGMAGPRTAIGDAIGLTVKLMADSHA 197 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 K +I +TDG ++ ++ + E + + ++++ + P Sbjct: 198 QE----------KVLILLTDGNDTSSAIP-----PERAAEIAKQHKLVVHTIGIGDPGTT 242 Query: 353 -----GQDLLRKCTD-SSGQFFAVNDSR-ELLESFDKITD 385 + L + + G+ F + +L E + + Sbjct: 243 GEDRVDLEALARIASITGGRAFRALGQQRDLAEVYATLDK 282 >gi|242091866|ref|XP_002436423.1| hypothetical protein SORBIDRAFT_10g002210 [Sorghum bicolor] gi|241914646|gb|EER87790.1| hypothetical protein SORBIDRAFT_10g002210 [Sorghum bicolor] Length = 636 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 34/284 (11%), Positives = 79/284 (27%), Gaps = 31/284 (10%) Query: 105 AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL-----DVSRS 159 + P + + ++ + I + L D Sbjct: 4 FQVTTMILQPRLLTWFFSTLLLTQLMMASTAAAESTVKVSTTPIFPQIPLGQARKDFQVL 63 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + + + P P ++D+L +A Sbjct: 64 LRVEAPTAAVRPEARVPIDVVAV----LDVSGSMNDPAAVPPERRPTTSRLDLLKTAAKF 119 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL----SNNLNEVKSRLNKLNPYENT 275 +V ++ R+ +A+N V + L ++ + +++L T Sbjct: 120 MVAKLEDGD--------RLSIVAFNDRPVKELSSGLLYMSADGRRKAMKSVDQLEARGGT 171 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 PA A + L F++ +TDGE++ + Sbjct: 172 ALVPAFEEAVKVLD------GRVGDGRNRLGFIVLLTDGEDTSGFTLSERRREVIRGALG 225 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 R ++++ + + + LL +S G + V+D + E Sbjct: 226 R---YPVHTLGLGRAHDPEVLLYLAQESHGTYSFVDD-DNIGEV 265 >gi|326669364|ref|XP_695742.5| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Danio rerio] Length = 3651 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 45/132 (34%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + T T A A + L +E++ K + ITDG ++G Sbjct: 147 FSKEIPSITYRGGGTYTRGAFQRAAQILRQSRENAT---------KVIFLITDGYSNGG- 196 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLE 378 + + +R G++I+++ + L + F V++ E Sbjct: 197 ------DPRPVAAALRERGVEIFTLGI--WQGNIRELHEMASQPKDQHCFFVHNFAE--- 245 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 246 -FEALARRALHE 256 >gi|317419404|emb|CBN81441.1| von Willebrand factor A domain-containing protein 2 [Dicentrarchus labrax] Length = 761 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 24/308 (7%), Positives = 80/308 (25%), Gaps = 33/308 (10%) Query: 84 DIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSS 143 D + + + + E L ++ Sbjct: 403 GFVSAPVFADVTDDLPRVVVLLTATPSSDEVVEPSKYAR---DREIFLIAVGPDRLKGQL 459 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 N+ + + + + + + + ++ + + Sbjct: 460 NNITGNPQRTI--TYTSQFSAKIPELKAKICSVDTQGCLGQAVDLVFALD-------ASG 510 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + + + L + ++ +AY + + + Sbjct: 511 SVSPDNFATMRDFVRGLSVQF-----DINRDVAQMALVAYGRRATTVFNLDTHDTGSAIL 565 Query: 264 SRLNKLNPYENTN-TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + N T A+ H + ++ + + + K V+ +TDG Sbjct: 566 KAVGDANYMGGVASTGTALLHVHSDILTVAKGARPGVN-----KAVVVVTDGSGGD---- 616 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + +R+ G+ ++ + + + + LL+ S +V +L D Sbjct: 617 ----DAAVPAQKLRDNGVSLFVIGIG-DIQKERLLQ-IAGSEEHMISVLSYEDLKYFEDV 670 Query: 383 ITDKIQEQ 390 + + + Sbjct: 671 LVQMLCSE 678 >gi|302382107|ref|YP_003817930.1| hypothetical protein Bresu_0994 [Brevundimonas subvibrioides ATCC 15264] gi|302192735|gb|ADL00307.1| Protein of unknown function DUF3520 [Brevundimonas subvibrioides ATCC 15264] Length = 625 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 25/324 (7%), Positives = 69/324 (21%), Gaps = 21/324 (6%) Query: 56 KKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT 115 T + E D + + + + A Y Sbjct: 122 TNAAGQTTVDGVVVPGRPGTRVDTERYPDATPNPVRRVADEPVSTFSIDVD-TAAYANVR 180 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + P + + + + + + Sbjct: 181 RFISEGQTPPRDAVRVEEMINYFDYGYARPGRADEPFAVSTAVAASPWSANAGAGGRQIV 240 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 P N T + K+ + ++ +++ ++ Sbjct: 241 HIGLQGYELPAGERRPLNLTFMVDVSGSMQSPDKLGLAQQTMNLIIDRLRPED------- 293 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 R+ Y + + +++ + LN +T M +AY + Sbjct: 294 -RVAVTYYASDVGTAVGPTPGSEKLKLRCAVAALNAGGSTAGAQGMVNAYEQAEAAFSPD 352 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 ++ TDG+ + + R G+ + Sbjct: 353 KVNR--------ILMFTDGDFNVGV--TDDRRLEDYVADKRGTGIYLSVYGFGRGNYQDA 402 Query: 356 LLRKCTDSSGQFFAVNDSRELLES 379 ++ + +L E+ Sbjct: 403 RMQTIAQAGNGV--AAYVDDLDEA 424 >gi|28948724|pdb|1N3Y|A Chain A, Crystal Structure Of The Alpha-X Beta2 Integrin I Domain Length = 198 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 28/174 (16%), Positives = 56/174 (32%), Gaps = 17/174 (9%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 N V ++ Q + + S+N + + +++L T T Sbjct: 32 NFVRAVISQFQRPSTQFSLMQFSNKFQTHFTFEEFRRSSNPLSLLASVHQLQ--GFTYTA 89 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ + L++ + K +I ITDG ++L+ + A Sbjct: 90 TAIQNVVHRLFHASYGARR-----DAAKILIVITDG-----KKEGDSLDYKDVIPMADAA 139 Query: 339 GMKIYSVAVSA---PPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 G+ Y++ V L F V D L + +++ +KI Sbjct: 140 GIIRYAIGVGLAFQNRNSWKELNDIASKPSQEHIFKVEDFDALKDIQNQLKEKI 193 >gi|257052324|ref|YP_003130157.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] gi|256691087|gb|ACV11424.1| von Willebrand factor type A [Halorhabdus utahensis DSM 12940] Length = 592 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 24/238 (10%), Positives = 66/238 (27%), Gaps = 18/238 (7%) Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 S D + + + + + + +++ + K+ V ++ Sbjct: 176 NSTLDTSTFERKRLDVVIVLDISGSMGSQFDQYYYDRFGNRHTVEEGDSRSKMAVAKDAL 235 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSRLN-KLNPYEN 274 L + + R+G + +N + + +++ ++ + + Sbjct: 236 VALTEQL--------HPDDRVGVVLFNNEPTVAKPLRDVETTDMDAIRGHIREDIEAGGG 287 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 TN M A L +S + + I ITD + L Sbjct: 288 TNIADGMAEAADMLGEYADSDPTEAETRQ-----IVITDAMPNTGQTDDQALQDRLAG-- 340 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 G+ V V + + + +V+ + + + + + V Sbjct: 341 YAEDGIHTSFVGVGVDFNPELVDEITAVRGANYRSVHSAEDFETYLGEEFEYMVTPLV 398 >gi|224054051|ref|XP_002190865.1| PREDICTED: collagen, type VI, alpha 1 [Taeniopygia guttata] Length = 1023 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 36/226 (15%), Positives = 76/226 (33%), Gaps = 22/226 (9%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + + P F+ + T A P + + + ++ + + + + Sbjct: 28 IITQVSSAEDCPVDLFFVLD-TSESVALRVKPFGDLVTQVKDFTNQFIDKLTQ-RYYRCD 85 Query: 234 LSVRIGTIAYNIGIVGNQCTPLS---NNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELY 289 ++ A + L+ + +E+K+R++ +N T T A+ EL Sbjct: 86 RNLVWNAGALHYSDEVVLIKSLTPMPSGQSELKNRVSAINYIGKGTYTDCAIKRGIEELL 145 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 SH+ K++I +TDG + G+K++SVA+S Sbjct: 146 --IGGSHHKEN-----KYLIVVTDGHPLEGYKEPCG-GLDDAANEAKLLGIKVFSVAISP 197 Query: 350 PPEGQDLLRKCTDSSGQ--FFAVNDSR-----ELLESFDKITDKIQ 388 Q L F + ++ E+ + I D I+ Sbjct: 198 NHLDQR-LNIIATDHAYRRNFTATSLKPTREIDVEETINTIIDMIK 242 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 53/158 (33%), Gaps = 14/158 (8%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + ++ + + L +A + + SVRI + Y+ N Sbjct: 833 VDSSTSVGSKNFETTKKFVKQLSGRFLEAS-KPTDESVRISVVQYSGRNQQKVEAQFQYN 891 Query: 259 LNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + ++ + + T+ A+ ++ KK V+ +DG + Sbjct: 892 YTVIAKAIDNMEFMNDATDVNSALQFITELYRRSARAAA--------KKRVLVFSDGHSQ 943 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 G + + + AG++IY +AV + + Sbjct: 944 G----ITARAIERAVQDAQKAGIEIYVLAVGSQANEPN 977 >gi|198435588|ref|XP_002122129.1| PREDICTED: similar to von Willebrand factor precursor (vWF) [Ciona intestinalis] Length = 3684 Score = 58.8 bits (140), Expect = 2e-06, Method: Composition-based stats. Identities = 42/313 (13%), Positives = 85/313 (27%), Gaps = 20/313 (6%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 + P + + + + + + + T S+ S A Sbjct: 184 PSVPGVTTTTVPFLVQQQQQFPLFLVSTTTTIPSVPGATTTFSSVPGVTTTTIPSVPGAT 243 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA-PAPAPAN 207 +I V V+ + +T+ L P + + Sbjct: 244 TIPSVPGVTTTTVPFLPGL------ITTPPLLPQPHTTLGLTGCTVDLVLVVDSSYSIGI 297 Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRL 266 L + GNL+ + N R+ + Y+ NL+ + + Sbjct: 298 DGFITLKDILGNLIKKF-----DVPNDETRVSLVQYSKRSQVEWLLSTYPGNLDGMLHTI 352 Query: 267 NKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + T TY A+ + + + +S + ++ ITDG + Sbjct: 353 AGMQMLQGVTYTYHALKLVLQTVIDGSDSGRR----PDVPFVIVLITDGRAKDEDIREEV 408 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 LN +Q+ N + ++ V L + V D E D++ Sbjct: 409 LNKIQVVHDKWNETFHLITIGVG--SVDPHQLEGIASHDDYVYTVEDMNEASSLVDEVAA 466 Query: 386 KIQEQSVRIAPNR 398 I R R Sbjct: 467 SICRPLARETYPR 479 >gi|257415703|ref|ZP_05592697.1| von Willebrand factor [Enterococcus faecalis AR01/DG] gi|257157531|gb|EEU87491.1| von Willebrand factor [Enterococcus faecalis ARO1/DG] Length = 1154 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 98/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M+ A A L+ T T + + H + G Sbjct: 202 AEARMEPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 261 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 262 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 315 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 316 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 375 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 376 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 435 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 436 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 485 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 486 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 545 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 546 DYAPDISDYLAKKAVQISGTVV 567 >gi|296473248|gb|DAA15363.1| integrin alpha M [Bos taurus] Length = 1152 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 24/120 (20%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL++ + N K +I ITDGE Y + L + Sbjct: 223 GRTHTATGIRKVVRELFHSSSGARNHA-----IKIMIVITDGE-----KYLDPLEYSDVI 272 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 + Y + V + + L F VN+ L +++ +KI Sbjct: 273 PEADRKKIIRYVIGVGDAFRSRKSRQELDTIASKPPADHVFQVNNFEALKTIQNQLQEKI 332 >gi|292619294|ref|XP_692164.4| PREDICTED: integrin alpha-M-like [Danio rerio] Length = 806 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 44/151 (29%), Gaps = 17/151 (11%) Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGST 302 N + L + +S++ + + T T A+ L+ + + Sbjct: 185 NDCDIHYNFNDLKLDDGTWESKVANIPYHEGGTFTASAIQKLVNYLFTPNGGTRPSA--- 241 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQDLLRK 359 KK ++ ITDGE+ + + +++ V ++ L Sbjct: 242 --KKILVVITDGESHDRN------LLKDAASQAEKNSIVRFAIGVGKAFDYYNAREELNT 293 Query: 360 CTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 F V D L K+ I Sbjct: 294 IASDPDTDYVFKVTDFNALKNILQKLEGNII 324 >gi|130498817|ref|NP_001076116.1| inter-alpha-trypsin inhibitor heavy chain2 [Oryctolagus cuniculus] gi|11041696|dbj|BAB17301.1| inter-alpha-trypsin inhibitor heavy chain2 [Oryctolagus cuniculus] Length = 946 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 18/158 (11%), Positives = 57/158 (36%), Gaps = 13/158 (8%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + + + K + K+ P TN A+ A L S Sbjct: 357 TWRNDLVSATKTQIADAKRYIEKIQPNGGTNINEALLRAIFILNEANNMGLLDPNS---V 413 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 +I ++DG+ + + + + + +++ + ++S+ + D L++ + Sbjct: 414 SLIILVSDGDPTVGELKLSKIQ-KNVKQNIQD-NVSLFSLGIGFDV-DYDFLKRL-SNEN 469 Query: 366 Q------FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + + S +L + +++++ + P+ Sbjct: 470 RGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYPH 507 >gi|116622495|ref|YP_824651.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225657|gb|ABJ84366.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 313 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 64/167 (38%), Gaps = 21/167 (12%) Query: 232 KNLSVRIGTIAYNIGIVGNQC--TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 N ++ + +N TP + + E+++ ++ + +T Y A+ A + Sbjct: 106 SNTRDQMFVVHFNERARLGLPERTPFTGKIKELETAISSFDVGGSTALYDAILLAQSHI- 164 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 G ++ ++ ITDG ++ + + + + AG+ IY++ + Sbjct: 165 ---------RGGVYGRRILLVITDGGDN-----SSKATLEEAVDAVAKAGVVIYAIGIYD 210 Query: 350 PPE---GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 P + +L + + G+ F ++ ++I ++ Q Sbjct: 211 PNDKDQNPKVLAHLAEVTGGEAFFPTALSDITRICEEIAADVRRQYT 257 >gi|194043710|ref|XP_001928122.1| PREDICTED: collagen alpha-3(VI) chain isoform 1 [Sus scrofa] Length = 3178 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 19/165 (11%), Positives = 56/165 (33%), Gaps = 17/165 (10%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + + K++ +N R + +N + +V S ++ L+ + +N Sbjct: 57 VREFLYDVIKSLAVGENDF-RFALVQFNGNPHTEFLLNTYRSKQDVLSHVSNLSYFGGSN 115 Query: 277 -TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T + + + E S + + ++ +TD + + + Sbjct: 116 QTGKGLEYVMQNHLTEAAGSRAR---DGVPQVIVVLTDRPSKDGLVLPS--------AEL 164 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 ++A + +++V V + L++ F + + L + Sbjct: 165 KSADVNVFAVGV--EDADEGALKEIASEPLNMHVFNLENFTSLHD 207 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 60/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKN--LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1647 INFRRDSFQEVLRFVSEIVDTVYEGGDSIQVGLVQYNSDPTDEFFLKDFSTKQQIIDAIN 1706 Query: 268 KLNPYENTNTYPAMHHAYREL-YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N+ + R+ + IT G+ + Sbjct: 1707 KVVYKGGRHANT--KVGLEHLRLNQFVPEAGSRLEQRVPQIAFVITGGK--------SVE 1756 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1757 DAQEASLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1806 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/290 (12%), Positives = 81/290 (27%), Gaps = 31/290 (10%) Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 S + P NL G++P L + + I L ++ Sbjct: 947 VAGRSSDSVDRPALNLKQSGVVPFILQAKNADPGELELIVPSPAFILAAESLPKIGDLQP 1006 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + + P + S+ + P +L E +V Sbjct: 1007 HIVNLL-----KSVQNGAPTPVSGEKDVVFLIDGSEGVRSGFP------LLKEFVQRVVE 1055 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPA 280 S+ + VR+ + Y+ + V + + +L NT A Sbjct: 1056 SL-----DVGPDRVRVAVVQYSDRTRPEFYLNSYMDQQSVVNAVRQLTLLGGPIPNTGAA 1110 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + R++ E S + + +I +T A ++ + ++ G Sbjct: 1111 LDFVLRDILTESAGSRIA---EGIPQLLIVLT--------ADRSGDDVRGPSVVLKRGGA 1159 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ + A+ R+L I++++ Sbjct: 1160 V--PIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQVISERVTRL 1207 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 16/154 (10%), Positives = 43/154 (27%), Gaps = 18/154 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 +R+G + Y+ +V + L N A+ Sbjct: 272 SVGAQQIRVGVVQYSDEPRTMFSLNSFATKAQVLDAVKALGFLGGELANVGLALDFVVDH 331 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + + + + ++ I+ A ++ ++ A ++S + Sbjct: 332 HFT---RAGGSRVEEGVPQVLVLIS--------AAPSSDKIRDAVLALKQA--SVFSFGL 378 Query: 348 SAPPEGQDLLRKCTDSSGQFFAV---NDSRELLE 378 A + L+ + F V +L + Sbjct: 379 GAQAASKAELQHIATNDNFVFTVPEFRSFGDLQD 412 Score = 43.7 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 21/148 (14%), Positives = 50/148 (33%), Gaps = 15/148 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNE 291 VR+G + ++ + + V + +L + NT A+ R + Sbjct: 1470 NKVRVGVVQFSNDVFPEFYLKTYRSQASVLDAIRRLRFKGGSPLNTGKALEFVARNFF-- 1527 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 S + + + ++ G++ + + + M ++G+ + V Sbjct: 1528 -VKSAGSRIEDGVPQHLVLFLGGKSQDDVSRYS--------QVMGSSGIVR--LGVGDRN 1576 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLES 379 + L+ T+ F V + REL Sbjct: 1577 IDRTELQTITNDPRLVFTVREFRELPNI 1604 >gi|147898495|ref|NP_001088330.1| inter-alpha (globulin) inhibitor H2 [Xenopus laevis] gi|71051796|gb|AAH98981.1| LOC495168 protein [Xenopus laevis] Length = 935 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 62/185 (33%), Gaps = 13/185 (7%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNI--GIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 V++++ +++ + + S + + ++ P TN Sbjct: 323 QTVDAMKSILEDLNSDDQFGIIDFNHNIRCWKDELVYASSVEKGDASKYVQRIQPNGGTN 382 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ A L S ++ ++DG+ + + + ++ Sbjct: 383 INDALLRAIFILKEASNKGLLEQNS---VSLIVLVSDGDPTVGELKLPKIQ-KNVRTNIQ 438 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAV----NDSRELLESFDKITDKIQEQS 391 + + ++S+ + D L + + G + + + +L E ++K++ + + Sbjct: 439 DD-IALHSLGIGFDV-DYDFLERLAQENHGMAQRIYGNQDTAAQLKEFYNKVSTPLLKNI 496 Query: 392 VRIAP 396 V P Sbjct: 497 VVNYP 501 >gi|219847012|ref|YP_002461445.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541271|gb|ACL23009.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 847 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 52/329 (15%), Positives = 104/329 (31%), Gaps = 43/329 (13%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQK 88 +++ALDA + +G ++ I + + G ++ G K Sbjct: 306 VRNALDAHIAAGTRMVLPIYDIAVGQGSNAAFRVV---------RFGLFVLTAYGQERGK 356 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 ++ + + P EN + L S +E + Sbjct: 357 PYLDFIF--LGDPNRQGTACSATPPPPENTSVVRLTGSVELWPE-----YQIVVNERRPV 409 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 ++LDVS SM ++ + N +T P P S + + P R Sbjct: 410 QYVVILDVSGSMNANFIGQGIVNGRVTQCTNGPPGSPPAQ--SCGQPQYAWNPVQ---ER 464 Query: 209 KIDVLIESAGNLVNSIQ---KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +I V ++ L+ + + + + + P +N NE+ Sbjct: 465 RIYVAKKALELLIRQTNMPGNPGYDPTQPIDSMALVWFTHNVPSTNVVPFKSNPNELIQA 524 Query: 266 LNKLNPYENT--------NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG--- 314 +N Y+ N ++ A + L N +++ ++ +IF+TDG Sbjct: 525 VNSAGAYQGDPYKTSGGTNGTGGLYRASQLLANAPRTTNQLGKEWIYRRAIIFVTDGVTN 584 Query: 315 --------ENSGASAYQNTLNTLQICEYM 335 +G S+ Q T T +C Sbjct: 585 TFFNANNSNVNGGSSNQTTYPTGHVCRKA 613 >gi|261415941|ref|YP_003249624.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261372397|gb|ACX75142.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302326143|gb|ADL25344.1| von Willebrand factor type A domain protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 228 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 16/89 (17%), Positives = 37/89 (41%), Gaps = 2/89 (2%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 T AM+ A L +++ + G + +++ +TDG+ +G S + Sbjct: 89 NFFANGGTPMGEAMNMALD-LLEKRKGEYKASGVDYYQPWIVLMTDGKPNGDS-SEYARA 146 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + CE ++N + I+ + + + L Sbjct: 147 VQRTCEMIKNRKLTIFPIGIGEDADMNAL 175 >gi|327260888|ref|XP_003215265.1| PREDICTED: collagen alpha-1(VI) chain-like [Anolis carolinensis] Length = 1026 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 66/190 (34%), Gaps = 20/190 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN---NLNEVKSRL 266 +D + + ++ + + + + ++ A + + L++ + +K ++ Sbjct: 72 VDNIKQFTTQFIDKLNE-RYYRCDRNLMWNAGALHYSDEVQLISGLTSMRTGRSGLKDQV 130 Query: 267 NKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +K+ T T A+ EL SH+ K++I +TDG Sbjct: 131 SKVVSIGKGTYTDCAIKRGIEELL--IGGSHHKEN-----KYMIVVTDGHPLEGYKEPCG 183 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ--FFAVNDSR----ELLES 379 ++ G+K++SVA+S + L F ++ + Sbjct: 184 -GLEDAANEAKHQGIKVFSVAISPNHL-ESRLSVIATDQAYRRNFTATGPSLRARDIDNT 241 Query: 380 FDKITDKIQE 389 D I D I+ Sbjct: 242 IDTIIDMIKS 251 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 70/220 (31%), Gaps = 25/220 (11%) Query: 182 PPPPKKSFWSKNTTKSKYA--PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 P T + + +R + + L A + +VR+ Sbjct: 817 CPDYTCPISFSGPTDITLVVDSSTSVGSRNFNTTKKFVKRLAERFLSAA-KPTEDAVRVS 875 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y+ P N + ++K+ N ++ A + + S + Sbjct: 876 VVQYSGRTQQKLEVPFEQNYTVIADSVDKMQFI---NDATDVNAALNYVTSLFRRSSRS- 931 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 KK ++ +DG + G + + + R AG++IY + V ++ Sbjct: 932 ---GAKKRMLIFSDGNSQGITQS----AIERAVQEARQAGIEIYVLVVGTQANEPNVRVL 984 Query: 360 CTDSSGQF---------FAVNDSRELLE--SFDKITDKIQ 388 T + ++ F V D LL + ++ KI Sbjct: 985 VTGKTAEYDVAFGERHLFRVPDYESLLRGVFYQTVSRKIS 1024 >gi|326668779|ref|XP_002662551.2| PREDICTED: collagen alpha-1(VII) chain-like [Danio rerio] Length = 2698 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 50/153 (32%), Gaps = 18/153 (11%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTN 276 + ++ + +R G + Y+ N E+ + + +N NT Sbjct: 55 SFMAGIVKPFAKAVGPNGIRFGAVQYSDTARVEFTFTAYLNGTELITAVENINYKGGNTR 114 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T + + ++ + K I ITDG++ N L+ + +R Sbjct: 115 TGAGLKYIADNFFSPA-------SIRDVPKISILITDGKSQD--------NVLEPSQKLR 159 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 G+KI++V + L+ F Sbjct: 160 GLGVKIFAVGI--KSADPAELKLIASPPQSEFT 190 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 40/364 (10%), Positives = 97/364 (26%), Gaps = 28/364 (7%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 VL+ + TT + +TS + + + + A +N + Sbjct: 663 VLALVQNREGPPVSVRVTTGEPETSRLIGSDATSYTLD-GLRPGLSYTVRFSALLNGVET 721 Query: 97 KNNPLQYIAESK--AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 + ++ ++ T++ L+ + + E+ + L Sbjct: 722 EATSIRVSTDALPPVTGLSVTDSTENSVLLGWSPVP-GATGYILRWTEVEDRGTTQSETL 780 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK----YAPAPAPANRKI 210 S + + + T ++ + T + + Sbjct: 781 PSSATSYRVMGLRLGRRYRFTVQSTFQNQVGPETSVEERTVCVGGRLDVVFLVPASRDRS 840 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-- 268 ++ L +S ++G + Y+ +N + + Sbjct: 841 GLVGPVLSLLASSGGS-FTSIGPRDSQMGVVMYSEDPKVRFLLNRHSNSETLLQDILSTP 899 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 N N AM +A + L + + + V+ I D +++ N Sbjct: 900 FNNRPGNNIGQAMTYARQFLLSAPAGRR-----SGVPGVVVIIADEKSTD--------NL 946 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 Q +R G+ + +V V LR D+ +L + D Sbjct: 947 SQPAAAVRADGVTVLAVGVGR--ADPSELRLAVTDGSTQNLLYAQDANQLYGLHPDLADL 1004 Query: 387 IQEQ 390 + Sbjct: 1005 LCGL 1008 >gi|327263661|ref|XP_003216636.1| PREDICTED: cochlin-like [Anolis carolinensis] Length = 527 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/345 (11%), Positives = 86/345 (24%), Gaps = 37/345 (10%) Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + T K + + G + A K + N ++ I Sbjct: 197 KNFTTAKDVLFAIKEIGFKGGNSNTGKALKQTVQKFFSPENGVRKGIPRVIVVFIDGWPS 256 Query: 119 FLK-------GLIPSALTNLSLRSTGIIERSSEN--LAISICMVLDVSRSMEDLYLQKHN 169 + +S+ E I + L+ ++ Sbjct: 257 DDMEHAGILAREFGINVFIVSIAKPAPEELGMVQDLGFIEKAVCLNNGFFSYNIPSWFGT 316 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 + L + + A + +S L+ + Sbjct: 317 TKYVKPLVQKLCAHEHMLCSKTCYN-----SVNVAFLIDGSSSVGDSNFRLMLEFISNVA 371 Query: 230 E---KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAY 285 + ++ +I + + V + L + T T A+ H Sbjct: 372 KSFEITDIGAKIAAVQFTYDQRTEFSFTDYITKENVLAALRGIRYMSGGTATGEAISHTT 431 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 R ++ N F++ +TDG++ + + AG+ IYS+ Sbjct: 432 RNVFGPVRDGGNK-------NFLVILTDGQSYD--------DVRGPAVAAQQAGITIYSI 476 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQ 388 ++ P L+ + F + L + + I Sbjct: 477 GIAWAPLDD--LKDMASAPKETHTFFTREFTGLEQIVPDVVRGIC 519 >gi|84498078|ref|ZP_00996875.1| putative secreted protein [Janibacter sp. HTCC2649] gi|84381578|gb|EAP97461.1| putative secreted protein [Janibacter sp. HTCC2649] Length = 659 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/191 (13%), Positives = 65/191 (34%), Gaps = 26/191 (13%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT----IAYNIGIVGNQCTPL-SNNL 259 KI+ ++ +V ++ Q + TP+ + + Sbjct: 54 SGLTKIEAAKKALTGVVGALPDTAQVGLRVYGATVDGKGKPTPAACADTQLITPIGTIDK 113 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + S ++ +N T ++ A ++L K+ ++ ++DGE Sbjct: 114 PALTSAISAINALGETPIAHSLTEALKDLGPTG------------KRNIVLVSDGEE--- 158 Query: 320 SAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 T + + + G+ +I +V + + L+ D+ G ++ D+ L Sbjct: 159 ---SCTPDPCPAVKKLTAGGVDLQIDTVGFGVNAKARSQLQCIADAGKGTYYDAKDAPAL 215 Query: 377 LESFDKITDKI 387 S K++ + Sbjct: 216 AASLSKLSQRA 226 >gi|113476849|ref|YP_722910.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] gi|110167897|gb|ABG52437.1| von Willebrand factor, type A [Trichodesmium erythraeum IMS101] Length = 441 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 32/245 (13%), Positives = 82/245 (33%), Gaps = 26/245 (10%) Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 LDV+ + +L L P S +TT + +V Sbjct: 2 LDVTITPHREFLAADTPG-QKLFVMLKLRPNAIVSASRPSTTFTFVIDTSGSMYDDSEVG 60 Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP- 271 +V ++++ + + + + RI + ++ + + +++ ++KL Sbjct: 61 RPKIDIVVEALERLVTDIQADPRDRIALVQFDDSASVLLPLTAATDTVTLQNAISKLRSF 120 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + + L S + + TDG+ + ++ ++ Sbjct: 121 SGGTRMALGIEKSLNLL----------KDSVLSSRRTLIFTDGQT------IDEIDCREL 164 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFF----AVNDSRELLESFDKITDK 386 AG+ I ++ V +DLL +D + G+ F +++ + ++ Sbjct: 165 AVQFAQAGIPITALGVG--DYNEDLLVYLSDHTGGRVFNVVEQASNTGTTDIAISELPQT 222 Query: 387 IQEQS 391 I ++ Sbjct: 223 IFQEV 227 >gi|320334211|ref|YP_004170922.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] gi|319755500|gb|ADV67257.1| von Willebrand factor type A [Deinococcus maricopensis DSM 21211] Length = 609 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 22/211 (10%), Positives = 53/211 (25%), Gaps = 19/211 (9%) Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 P + N + L +A + Sbjct: 16 TPAPTDLLLRFREDVPNATRRPLNVALVIDRSGSMAGSPLRYALKAAADFV--------- 66 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 ++ + + + Y+ + + +K L + TN + Sbjct: 67 DRLTETDVLSIVVYDDDVDTLLDAQPVRDKAAIKDLLKGVRAGGITNLSGGWLRGCELVA 126 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + V+ +TDG+ + L + AG+ ++ + Sbjct: 127 GARRADAVNR--------VLLLTDGQANHGVTDTGVLIKTAASKA--EAGVSTTTLGFGS 176 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 E L+ S G F+ + + + F Sbjct: 177 SFEEDLLIGMARASGGNFYFIQSMDDAADVF 207 >gi|170589747|ref|XP_001899635.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158593848|gb|EDP32443.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 634 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 27/150 (18%), Positives = 55/150 (36%), Gaps = 16/150 (10%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELY 289 VRI I Y + +L ++ + + +N T T A+ A EL+ Sbjct: 102 TDKDDVRIAMIQYAETPIVEFSFGTYRDLPDITNHIMTINLHSGGTRTGKALLAAKGELF 161 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN-AGMKIYSVAVS 348 +E++ + K ++ TDG + + ++ + +R +KIY V V Sbjct: 162 SEEKGAR-----KNASKIIVLFTDG--------LSVDDPIKHAQQLREIEKIKIYVVYVG 208 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + + F N+ L + Sbjct: 209 SDGF-EQEMNRIAGGRSNVFGSNEFTRLKK 237 >gi|297473448|ref|XP_002686617.1| PREDICTED: collagen, type VI, alpha 3-like isoform 2 [Bos taurus] gi|296488812|gb|DAA30925.1| collagen, type VI, alpha 3-like isoform 2 [Bos taurus] Length = 2956 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 60/177 (33%), Gaps = 16/177 (9%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NLVNS+ + + +R+G + ++ V +E Sbjct: 440 SSNVGETNFPYVRDFVMNLVNSL-----DVGSDHIRVGLVQFSDTPVTEFSLNTYPTKSE 494 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + L ++ + +Y N + + + + ++ +T G+ Sbjct: 495 LLAHLRQMQLQGGSVLNTGAALSY-VHANHFTEAGGSRIQDHVPQLLLLLTAGQ------ 547 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + LQ + AG+ + V + L + + G + ++D L Sbjct: 548 --SEDSYLQAANALARAGILTFCVG--TSQADRAELEEIAFNPGLVYLMDDFSSLPA 600 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKN--LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN ++ +N Sbjct: 1441 INFRRDSFQEVLRFVSEIVDTVYEGGDSIQVGLVQYNSDPTDEFFLKDFPTKQQIIDAIN 1500 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H R N + R+ + IT G+ + Sbjct: 1501 KVVYKGGRHANTKVGLEHLRR---NHFVPEAGSRLDQRVPQIAFVITGGK--------SV 1549 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1550 EDAQEASMALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1600 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 18/156 (11%), Positives = 50/156 (32%), Gaps = 15/156 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 +R+G + Y+ + +V + L N A+ Sbjct: 66 SIGTQQIRVGVVQYSDEPRTMFSLNSYSTKAQVLDAVKALGFIGGELANVGLALDFVVEN 125 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + + + + ++ I+ G +S ++ + L+ ++S + Sbjct: 126 HFT---RAGGSRAEEGVPQVLVLISAGPSSD--EIRDGVIALKQAS--------VFSFGL 172 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 A + L+ + F V + R L + +++ Sbjct: 173 GAQAASKAELQHIATNDNLVFTVPEFRSLGDVQEQL 208 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 58/169 (34%), Gaps = 15/169 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRE 287 VRIG + ++ + + V + +L + NT A+ R Sbjct: 1260 NVGPNKVRIGVLQFSNDVFPEFQLKTYKSQASVLDAIRRLRFKGGSPLNTGKALEFVARN 1319 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + S + + + ++ G++ + + + +++AG+ S+ V Sbjct: 1320 YF---VKSAGSRIEDGVPQHLVLFLGGKSQDDISRYS--------QVIKSAGI--ASLGV 1366 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + L+ T F V + R+L +++ + V AP Sbjct: 1367 GDRNIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNSFGSSGVTPAP 1415 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 20/161 (12%), Positives = 54/161 (33%), Gaps = 16/161 (9%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + + + + + E + +++ Y + + EV + + K+ + Sbjct: 255 NAIRDFIAKVIQ-RLEIRQDLIQVAVAQYADTVRPEFYFNTYPSKREVINAVRKMKALDG 313 Query: 275 TN--TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 + T A+ L+ E + + + K ++ +T G+ + Q Sbjct: 314 SALYTGSALDFVRNNLFTE---AAGYRAAEGVPKLLVLVTGGK--------SLDAVSQPA 362 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 + ++ +G+ + AV Q L + S F + Sbjct: 363 QELKRSGIL--AFAVGNKVADQAELEEIAFDSSLVFTATEF 401 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 34/290 (11%), Positives = 80/290 (27%), Gaps = 31/290 (10%) Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 +S + + P NL G++P L + + I + L ++ Sbjct: 741 VAGKSSDRVDTPALNLKQSGVVPFILQAKNADPAELELIVPSPAFILVAESLPKIGDLQP 800 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + + P + S+ + P +L E +V Sbjct: 801 QIVNLL-----KSVQNGAPAPVSVEKDVVFLIDGSEGVRSGFP------LLKEFVQRVVE 849 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPA 280 S+ + VR+ + Y+ + V + L NT A Sbjct: 850 SL-----DVGPDRVRVAVVQYSDRTRPEFYLNSYMDQQSVVGAIRGLTLLGGPAPNTGAA 904 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + R + S + + +I +T A ++ + +R G Sbjct: 905 LEFVLRNILVGSAGSRIA---EGVPQLLIVLT--------ADRSGDDVRGPSVVLRRGGA 953 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ + + R+L I++++ + Sbjct: 954 V--PIGIGIGNADITEMQTLSFVPDFAVVIPTFRQLGTIQQVISERVTQL 1001 >gi|147905660|ref|NP_001090738.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Xenopus (Silurana) tropicalis] gi|120537294|gb|AAI29013.1| LOC100036724 protein [Xenopus (Silurana) tropicalis] Length = 1076 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 29/344 (8%), Positives = 92/344 (26%), Gaps = 25/344 (7%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP 100 ++V + + Q F + + + + A I + Sbjct: 94 SKALVHLAKKAEEIQESHQWRDQFGTNSIVYYNAKDDQMDPEKNDSDSASQKIKPVLQDD 153 Query: 101 LQYIAESKAQYEIPTENLFL---KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 + ++ Q+ + ++ + L + + E+ + + + Sbjct: 154 PVFRRQTSYQHSAVHIPTDIYEGSTIVLNELNWTAALDEVFKKNREEDETLLWQVFGSAT 213 Query: 158 R----SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 ++ K N + P + S + + + + ++ Sbjct: 214 GLARYYPASPWVDKSRTANKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLI 273 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 S ++ ++ + N ++K +N + Sbjct: 274 RTSVSEMLETLSDDDFVNVAAFNNNAHDVSCFNHLVQANVR---NKKKLKEAVNNITAKG 330 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T+ A+ +L N S K ++ TDG A N Sbjct: 331 TTDYKTGFKFAFDQLLNHNVSRA------NCNKIIMLFTDGGEDKAKETFEAYN------ 378 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 ++ +++++ +V + ++ + G ++ + + Sbjct: 379 --KDKTVRVFTFSVGQHNYDKGPIQWMACQNKGFYYEIPSIGAI 420 >gi|257089500|ref|ZP_05583861.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis CH188] gi|256998312|gb|EEU84832.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis CH188] Length = 1154 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 97/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 202 AEARMAPATLRANLALPLIAPRYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWDGQ 261 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 262 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 315 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 316 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 375 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 376 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 435 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 436 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 485 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 486 TFIATIGEAMALKQRGIEIHGLGIQLQSDSRANLSKQQIEDKMREMVSADENGDLYYESA 545 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 546 DYAPDISDYLAKKAVQISGTVV 567 >gi|50949741|emb|CAH10363.1| hypothetical protein [Homo sapiens] Length = 460 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 113 RFSIIGFSNRIKVRKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 172 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 173 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 227 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 228 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 261 >gi|91783676|ref|YP_558882.1| hypothetical protein Bxe_A2138 [Burkholderia xenovorans LB400] gi|91687630|gb|ABE30830.1| Conserved hypothetical protein containing von Willebrand factor type A domain [Burkholderia xenovorans LB400] Length = 337 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 55/160 (34%), Gaps = 28/160 (17%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKE 293 RIG + + PL+ + + V+ L+++ T A+ + + + Sbjct: 132 RIGLVVFGDAAYPQA--PLTLDHDSVRILLDQMQIGMAGPRTAIGDAIGLTVKLMADSHA 189 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 K +I +TDG ++ ++ + E + + +++V + P Sbjct: 190 QE----------KVLILLTDGNDTSSAIP-----PERAAEIAKQHKLVVHTVGIGDPGTT 234 Query: 353 -----GQDLLRKCTD-SSGQFFAVNDSR-ELLESFDKITD 385 + L + + G+ F +L E + + Sbjct: 235 GEDRVDLEALARIASITGGRAFRALGQEKDLTEVYATLDK 274 >gi|301620566|ref|XP_002939640.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Xenopus (Silurana) tropicalis] Length = 1179 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/145 (17%), Positives = 58/145 (40%), Gaps = 13/145 (8%) Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 G + N K +NK+ T+ A+ A ++N+ ST+ Sbjct: 344 PGQSIPATAQNKKSAKDYVNKIEADGWTDINAAL-MAAASIFNQTSHKPEKETSTKKIPL 402 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKCT-DSS 364 +IF+TDGE + + + + G + ++ +A +L+R+ + ++ Sbjct: 403 IIFLTDGEATSGVLATS----RILSNAQKAMGGTISLFCLAFGEDA-DYNLMRRLSLENR 457 Query: 365 G---QFFAVNDSR-ELLESFDKITD 385 G + + +D+ +L +D+I Sbjct: 458 GIARRIYEYSDATLQLKGFYDEIAS 482 >gi|260814261|ref|XP_002601834.1| hypothetical protein BRAFLDRAFT_215239 [Branchiostoma floridae] gi|229287136|gb|EEN57846.1| hypothetical protein BRAFLDRAFT_215239 [Branchiostoma floridae] Length = 863 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 61/174 (35%), Gaps = 10/174 (5%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN-KLNPYENT 275 +N+I K +++ +V + + + N+ ++ + +N T Sbjct: 255 TKQAMNTILKDLRDHDRFNVMPFSYSSTMWRPNEMVLATRENIESARTYVRRSINAGGGT 314 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N A+ A L + S R +IF+TDG S N + + Sbjct: 315 NINQAIIDAADLLRRVTDDQP---NSPRSASLIIFLTDGLPSVGE--SKPRNIMVNVKNA 369 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSR-ELLESFDKITD 385 + ++ + + L + ++ G + + +D+ +L +D++ Sbjct: 370 IREQVSLFCLGFGKDVDFPFLEKMALENRGLARRIYEDSDAALQLKGFYDEVAT 423 >gi|149277251|ref|ZP_01883393.1| hypothetical protein PBAL39_10186 [Pedobacter sp. BAL39] gi|149232128|gb|EDM37505.1| hypothetical protein PBAL39_10186 [Pedobacter sp. BAL39] Length = 629 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/308 (12%), Positives = 85/308 (27%), Gaps = 31/308 (10%) Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 + KD + ++ A Y + GL + + Sbjct: 157 DPLKDPLSTFSIDVDA-ASYSNVRRMINNGGLPEKDAVRIEEMINYFDYDYPQPAGDDPV 215 Query: 152 MVLDVSRSMEDLYLQKHNDN-NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 ++ + K + P F + K+ Sbjct: 216 NIITEIAAAPWNKKHKLVQIGLQGKTISTAKLPSSNLVFLID-------VSGSMNDSNKL 268 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 +L+ S L + ++K R+ + Y + + +K LNKL+ Sbjct: 269 PLLVSSFKLLTDQLRKTD--------RVAIVVYAGNSGLVLPSTSGDQKTTIKDALNKLS 320 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 +T + AY N VI TDG+ + + + + Sbjct: 321 AGGSTAGGAGIRLAYEVAAKNYIKGGNNR--------VILATDGDFNVG--ASSDEDMEK 370 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQE 389 + E R +G+ + + + D G + +++ + E+ + ++ Sbjct: 371 LIEEKRKSGVFLTVLGFGMGNLKDSKMEVLADKGNGNYAYIDN---INEARKVLVNEFGG 427 Query: 390 QSVRIAPN 397 IA + Sbjct: 428 TLFTIAKD 435 >gi|145595544|ref|YP_001159841.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145304881|gb|ABP55463.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 316 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 46/152 (30%), Gaps = 24/152 (15%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 N P + + V + +N L E+T T A+ + S + ++ Sbjct: 139 NVLVPPTKDRQAVLAAINGLALAESTATGEAVFTCLEAI----RSVPADGAAGIPPARIV 194 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------GQDL 356 ++DG + + + Q A + + ++A + Sbjct: 195 LLSDGYRTSGRSVEQAAAAAQA------ANVAVSTIAFGTDGGQVDIGGQRQRVPVDRLA 248 Query: 357 LRKCT-DSSGQFFAVNDSRELLESFDKITDKI 387 L + G F+ EL + + + I Sbjct: 249 LADLAETTDGYFYEAASVSELKQVYQDMGSSI 280 >gi|260813598|ref|XP_002601504.1| hypothetical protein BRAFLDRAFT_185472 [Branchiostoma floridae] gi|229286801|gb|EEN57516.1| hypothetical protein BRAFLDRAFT_185472 [Branchiostoma floridae] Length = 400 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 56/154 (36%), Gaps = 16/154 (10%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKES 294 ++G I Y+ I ++ + ++ + T T A+ +A + ++ + Sbjct: 262 TQVGVIQYSSKIRQEFSMNSFQTVSGLLGAIDAMEYMQGGTLTGRAIRYASKYGFSVFDG 321 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + K ++ +TDG +S A + G+ +Y++ V Sbjct: 322 ARR-----GVPKVLVVVTDGVSSDEVAIP--------ALEAQRQGIFVYAIGV--SNYDA 366 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + L+K ++ V++ L + + + Sbjct: 367 EQLQKIASTNESSAMVDNFNLLDSVRNTLLTSVC 400 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 23/190 (12%), Positives = 56/190 (29%), Gaps = 21/190 (11%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI---AYNIGIVGNQCTPLSNNLN 260 D + ++ + V + KA+ + V + Y+ +L Sbjct: 3 VFVLDGSDHVGQANFDRVKAWVKAVVSGFTIGVNTTMVGVLQYSSQPRVEFSLGAFRDLQ 62 Query: 261 EVKSRLNKLNPY-ENTNTYPAMHHAYRE-LYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ ++ + N A+H L + + K VI ++ G Sbjct: 63 DLLQAIDGVPYMAGGANVGQALHTVRTSPLLLGNGTRPDAR------KVVILVSGG---- 112 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 ++ + +Q ++ G +Y A L S+ +V + L + Sbjct: 113 ----PSSDDVIQPALQLQQTGAVVY--AAGVDRHDVSELGNIASSAQTAASVGNFAALDD 166 Query: 379 SFDKITDKIQ 388 + + Sbjct: 167 LRTSLLSSVC 176 >gi|312903619|ref|ZP_07762795.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0635] gi|310632972|gb|EFQ16255.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0635] gi|315577195|gb|EFU89386.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0630] Length = 1103 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 151 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 208 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 209 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDSRANLSKQQIEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|309789848|ref|ZP_07684427.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308228152|gb|EFO81801.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 420 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 26/188 (13%), Positives = 65/188 (34%), Gaps = 23/188 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 KID + ++ V+ + + ++ + +K R++ Sbjct: 59 DKIDRVRQAISLAVDRLDAQDIA--------SLVIFDHRNEVLIPAAPVTDRRMIKDRVS 110 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ T PA+ RE+ ++ + + ++ +TDG+ +N Sbjct: 111 RIRDAGGTKIAPAVEKGLREIEKDRSGAI---------RRLVLLTDGQT------ENEDE 155 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 L+ + G+ I ++ V L+ S G + + E+ E F + Sbjct: 156 CLRRADDAGRIGVPITALGVGQDWNEDLLIEMANRSGGTADYIARADEITEYFQNTVQRA 215 Query: 388 QEQSVRIA 395 Q +++ + Sbjct: 216 QNSAIQNS 223 >gi|94970371|ref|YP_592419.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94552421|gb|ABF42345.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 356 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/284 (13%), Positives = 102/284 (35%), Gaps = 21/284 (7%) Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 L L + A + + + + + + + + ++ + + Sbjct: 44 LPLPAIPKDAPSAPAPKPPTPQNDAKADTPSDDQAITKIVVGVNEVNVIFTVTDKRNRFV 103 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 K L P K K + ++ +LI+S+ ++ + + + + Sbjct: 104 KDLSQPDFKFVDDGKPVASIRDFRKETNLPLRVGLLIDSSNSIRDRFKFEQESAIEFLNQ 163 Query: 238 IGTIAYN------IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 I ++ +++ + + + L P T Y A+++A R Sbjct: 164 IIRPKFDKAFVIGFDTTAEVTQDFTDDTDLLGKGVRMLRPGGGTAMYDAIYYACR----- 218 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + G+T ++K +I ++DGE++ + + + E + A + IY+++ + Sbjct: 219 -DKLLKENGNTAMRKAMILLSDGEDNQSRVTR-----EEAVEMAQRAEVIIYAISTNTSG 272 Query: 352 ----EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + L R + G+ F ++ +F +I D+++ Q Sbjct: 273 LKLRGDKVLERFAEATGGRAFFPFKISDVANAFSEIQDELRSQY 316 >gi|33321021|gb|AAQ06268.1| unknown [Sorghum bicolor] Length = 610 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 66/196 (33%), Gaps = 22/196 (11%) Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 + P P ++D+L +A +V ++ R+ +A+N Sbjct: 80 DVSGSMNDPAAVPPERRPTTSRLDLLKTAAKFMVAKLEDGD--------RLSIVAFNDRP 131 Query: 248 VGNQCTPL----SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 V + L ++ + +++L T PA A + L Sbjct: 132 VKELSSGLLYMSADGRRKAMKSVDQLEARGGTALVPAFEEAVKVLD------GRVGDGRN 185 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 F++ +TDGE++ + R ++++ + + + LL +S Sbjct: 186 RLGFIVLLTDGEDTSGFTLSERRREVIRGALGR---YPVHTLGLGRAHDPEVLLYLAQES 242 Query: 364 SGQFFAVNDSRELLES 379 G + V+D + E Sbjct: 243 HGTYSFVDD-DNIGEV 257 >gi|316973220|gb|EFV56840.1| putative transmembrane cell adhesion receptor mua-3 [Trichinella spiralis] Length = 3249 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 61/195 (31%), Gaps = 21/195 (10%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + V + N + R+ I Y+ I +N+ EV Sbjct: 896 GSGSIGSAVFKNEILRFLREFINLFTIGSNHT-RLAIIQYSDQIRHELDFKEANSKAEVD 954 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNE----------KESSHNTIGSTRLKKFVIFIT 312 LN++ T T A+ ++ ++ ES T + + I IT Sbjct: 955 EALNRVEYLTGLTKTGDALTDMFKIGFSSTFATKFFTILSESRGARPIETGVHRVAIVIT 1014 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DG + ++ + + + +++V V + L + S + F V + Sbjct: 1015 DGRSQDIVSFS--------ANEAKKSNVLMFAVGV-TDHVSEAELVEIAGSKDRVFLVKE 1065 Query: 373 SRELLESFDKITDKI 387 +L + K Sbjct: 1066 FTDLNVRLRSLIQKA 1080 >gi|260841562|ref|XP_002613981.1| hypothetical protein BRAFLDRAFT_118457 [Branchiostoma floridae] gi|229299371|gb|EEN69990.1| hypothetical protein BRAFLDRAFT_118457 [Branchiostoma floridae] Length = 2122 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 18/115 (15%), Positives = 40/115 (34%), Gaps = 16/115 (13%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 TNT A A L +K+ VI +TDG + + ++ Sbjct: 125 TNTEDAFRLAQELLRPPS----AFKNERPVKQVVILLTDGRPTRGG------DPVKRANN 174 Query: 335 MRN-AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 +++ +I+S+ + + L C + + + + F + +I+ Sbjct: 175 LKSVYNAEIFSIGIG-GNLNKQQLEDCATDAQHLYLSPNFVD----FKDLAKRIR 224 >gi|54038464|gb|AAH84380.1| LOC495168 protein [Xenopus laevis] Length = 554 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 62/185 (33%), Gaps = 13/185 (7%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNI--GIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 V++++ +++ + + S + + ++ P TN Sbjct: 323 QTVDAMKSILEDLNSDDQFGIIDFNHNIRCWKDELVYASSVEKGDASKYVQRIQPNGGTN 382 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A+ A L S ++ ++DG+ + + + ++ Sbjct: 383 INDALLRAIFILKEASNKGLLEQNS---VSLIVLVSDGDPTVGELKLPKIQ-KNVRTNIQ 438 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAV----NDSRELLESFDKITDKIQEQS 391 + + ++S+ + D L + + G + + + +L E ++K++ + + Sbjct: 439 DD-IALHSLGIGFDV-DYDFLERLAQENHGMAQRIYGNQDTAAQLKEFYNKVSTPLLKNI 496 Query: 392 VRIAP 396 V P Sbjct: 497 VVNYP 501 >gi|220913381|ref|YP_002488690.1| hypothetical protein Achl_2636 [Arthrobacter chlorophenolicus A6] gi|219860259|gb|ACL40601.1| conserved hypothetical protein [Arthrobacter chlorophenolicus A6] Length = 310 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 15/97 (15%), Positives = 30/97 (30%), Gaps = 6/97 (6%) Query: 6 ISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFK 65 + V A+D I R Q+Q+ DA+ L+G ++ Sbjct: 1 MVVLIGAGALAVDTGQIYAERAQLQNGADASALAGADLCSANGGCTQAAATS------VA 54 Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQ 102 + S + D++ Q+ +T + Sbjct: 55 DALANSNSNDSKSTVQSVDLSVPGQVTVTTSTKDGTT 91 >gi|242051338|ref|XP_002463413.1| hypothetical protein SORBIDRAFT_02g043390 [Sorghum bicolor] gi|241926790|gb|EER99934.1| hypothetical protein SORBIDRAFT_02g043390 [Sorghum bicolor] Length = 491 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 16/132 (12%), Positives = 36/132 (27%), Gaps = 15/132 (11%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 K + L TN + + L +E + I ++DG+++ Sbjct: 133 GKASAKFAVGALCAVRGTNIGQGLRVGAQVLAGRRERNAVAGM--------ILLSDGQDT 184 Query: 318 GASAYQ-NTLNTLQICE------YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV 370 T + I++ + + + G F V Sbjct: 185 SGCWTTVRPDGTKTYANLVPPSTSFSSRPAPIHTFGFGTDHDAAAMHAIAEATGGTFSFV 244 Query: 371 NDSRELLESFDK 382 + + +SF + Sbjct: 245 GNEAAIQDSFAR 256 >gi|118094354|ref|XP_422360.2| PREDICTED: similar to calcium-activated chloride channel [Gallus gallus] Length = 928 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 36/239 (15%), Positives = 77/239 (32%), Gaps = 35/239 (14%) Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 N + + + + N +I L +A + I + Sbjct: 282 RNSSVVNSLVPPFETTFELLQTQDRAVSLVLDVSGSMNTNNRITNLRTAAEVFLIQIIEI 341 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL--NPYENTNTYPAMHHAY 285 R+G + + ++ + + L T + Sbjct: 342 G-------SRVGIVTFESSAYEKSPLLQITSVATRQRLVQNLPTTAGGGTKICAGIEKGL 394 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC-EYMRNAGMKIYS 344 + N T ++ +TDGE+S T+ +C E ++ +G I++ Sbjct: 395 EIITNAI--------GTTYGSEIVLLTDGEDS----------TMSLCREKVKESGAIIHT 436 Query: 345 VAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITD---KIQEQSVRIAPN 397 +A+ P ++L + G V+ +L+E+F +IT I EQS+++ Sbjct: 437 IALG-PSAAKELEEFSNITGGLQLYAVDVDVPSKLVEAFSEITTGSGDISEQSIQLESK 494 >gi|296206125|ref|XP_002750075.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Callithrix jacchus] Length = 940 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 57/154 (37%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + V+ ++ ++P T+ A+ A R L Sbjct: 329 RFSIIGFSNRIKVWKDHLISVTPDSVRDGKVYIHHMSPTGGTDINEALQTAIRLLNKY-- 386 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 ++ R +IF+TDG+ + + + + I+++ + + Sbjct: 387 -VAHSDSGDRSVSLIIFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 443 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 444 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 477 >gi|220924567|ref|YP_002499869.1| aminoacyl-tRNA synthetase class I [Methylobacterium nodulans ORS 2060] gi|219949174|gb|ACL59566.1| aminoacyl-tRNA synthetase class I [Methylobacterium nodulans ORS 2060] Length = 407 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 21/208 (10%), Positives = 58/208 (27%), Gaps = 8/208 (3%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + I++ V +D A++ + ++Q+A D+A L + + D T + Sbjct: 19 IVGIVLPVLLGTAAVVLDGANLHLSQLRLQNAADSAALGAV------QVLPDSATAVSRG 72 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEI-PTENLF 119 ++ + + + + T P ++ + ++ Sbjct: 73 VSLVGQNLPPSYGTAAAATDVVVGTYDPGAKAFTAGGAQPNAVKVTARRSAALGNAIPVY 132 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 + S + A + + V+ ++ N M + Y Sbjct: 133 FAWIFGYRSLEAKAESVAVAAGGGNPAAC-LYALDPVNPGIDARGSVTVNATCGMQLSSY 191 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPAN 207 + ++ + A N Sbjct: 192 ISTSGNAGNYNGQICQSVGDTGATGNFN 219 >gi|198434614|ref|XP_002123557.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1105 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 61/180 (33%), Gaps = 23/180 (12%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NT 275 V + K + ++ V + + +N++E++ +L ++ T Sbjct: 874 INFTVALLDKFVISPDDMRVGVARFNRHFDRDSEILIGNYSNISELRQKLRRMPYRGRGT 933 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T A+ H + + ++ ITDG S L+ + Sbjct: 934 LTGNALWHMNNHSLHAPGNRPGVPD------VIVVITDGLASD--------EVLRAANAL 979 Query: 336 RNAGMKIYSVAV--SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + +K+Y V + L+ S ++ + D+ ++++ D++ + + Sbjct: 980 KEQDVKMYVVGLINRMNRMNLAQLQDI-SSGTEYLQIIDNG-----YERLADELSDTLTQ 1033 >gi|167534461|ref|XP_001748906.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772586|gb|EDQ86236.1| predicted protein [Monosiga brevicollis MX1] Length = 2847 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 28/175 (16%), Positives = 51/175 (29%), Gaps = 17/175 (9%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 S L + A E + T ++ + + S L+ + + N L Sbjct: 1261 QTSIQFLRALVNGADIESSGSRIAAITFCSEPTLLTDYVSTTSEALDALNTASNTLTC-- 1318 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T A+ + + ++ ++ VI ITDGE+ + E Sbjct: 1319 GTATGAALDFVRENILTD-------RSNSGARRVVIVITDGESQEDFSVVQNAGARLQAE 1371 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 +Y++ V + LR S F L I + Sbjct: 1372 VD-----DVYAIGVGSG-TDLAELRVIASSDDNTFQEASFDNLDNI--DILQLLC 1418 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/175 (19%), Positives = 66/175 (37%), Gaps = 20/175 (11%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPY 272 +LV+++ + VR+G Y P S++ + + S L+ + Sbjct: 840 RSFVRDLVSNLMSGDND-----VRVGVAEY-SSTYTQIVFPFSSSQSAIDSSLSSMIQTA 893 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T T ++ A ++ + SS + +I +TDGE S + N Sbjct: 894 GATATGTSLGEAADDIGSTARSSAA--------RVLILMTDGETSDG----DEQNIDPSV 941 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + +R G+ I ++ V +LL+ SS F +L ++I +I Sbjct: 942 DALRALGVSITAIGVGNSASESELLQ-IAGSSDHVFNNIAFVDLSSFINQIIGQI 995 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 53/134 (39%), Gaps = 10/134 (7%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 T S + + + S +N L + T T A+ A ++++++ + S + Sbjct: 495 SSEPQIETGFSYDESYLISVINSLPHLKLGTATGEALRMARQDIFSDND---ALFRSFSV 551 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 F I ITDG ++ ++ + +R G++++++ V + L + Sbjct: 552 PAFAIVITDG-----NSLEDASYVAEQARRLREHGVQVFALGVGSQIT-VSQLVDIAGDN 605 Query: 365 GQFFAVNDSRELLE 378 + F V D L Sbjct: 606 ARVFGVADFGVLNA 619 >gi|149922245|ref|ZP_01910682.1| hypothetical protein PPSIR1_07355 [Plesiocystis pacifica SIR-1] gi|149816878|gb|EDM76364.1| hypothetical protein PPSIR1_07355 [Plesiocystis pacifica SIR-1] Length = 370 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 59/145 (40%), Gaps = 19/145 (13%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + P+ + + + ++ +T Y ++ + L G T + Sbjct: 184 NTTVEERAPMGSGMKATSDAIQAVDAVGSTALYTSLIRSIDALEGS--------GKTGYR 235 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL-----LRKC 360 + ++ +TDG+++ +S + + A ++IY V++ + + L +++ Sbjct: 236 RAILVLTDGKDTASSHG-----VATVINRAKAAKVRIYVVSLGGAGDQKGLGYVGPMQRL 290 Query: 361 TD-SSGQFFAVNDSRELLESFDKIT 384 T + G F V+ + +L+ FD I Sbjct: 291 TTETGGVFTHVDRADDLVARFDAIA 315 >gi|119572524|gb|EAW52139.1| integrin, alpha D [Homo sapiens] Length = 366 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/320 (11%), Positives = 89/320 (27%), Gaps = 30/320 (9%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 G + + G N + + A L ++ + L+L Sbjct: 36 GQSVVQFGGSRLVVGAPLEVVAANQTGRLYDCAAATGMCQPIPLHIRPEAVNMSLGLTLA 95 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYL-QKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 ++ R + + S S L + + P + Sbjct: 96 ASTNGSRLLACGPTLHRVCGENSYSKGSCLLLGSRWEIIQTVPDATPECPHQEMDIVFLI 155 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 + + + + ++ + + Y+ + + Sbjct: 156 ------DGSGSIDQNDFNQMKGFVQAVMGQFEGTDT-------LFALMQYSNLLKIHFTF 202 Query: 254 PLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 +S ++ + T T + +L++ K + KK +I IT Sbjct: 203 TQFRTSPSQQSLVDPIVQLKGLTFTATGILTVVTQLFHHKNGAR-----KSAKKILIVIT 257 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---APPEGQDLLRKCTDSS--GQF 367 DG+ Y++ L + AG+ Y++ V P + L + + Sbjct: 258 DGQ-----KYKDPLEYSDVIPQAEKAGIIRYAIGVGHAFQGPTARQELNTISSAPPQDHV 312 Query: 368 FAVNDSRELLESFDKITDKI 387 F V++ L ++ +KI Sbjct: 313 FKVDNFAALGSIQKQLQEKI 332 >gi|62087470|dbj|BAD92182.1| PREDICTED: integrin, alpha D variant [Homo sapiens] Length = 1177 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/320 (11%), Positives = 89/320 (27%), Gaps = 30/320 (9%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 G + + G N + + A L ++ + L+L Sbjct: 52 GQSVVQFGGSRLVVGAPLEVVAANQTGRLYDCAAATGMCQPIPLHIRPEAVNMSLGLTLA 111 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYL-QKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 ++ R + + S S L + + P + Sbjct: 112 ASTNGSRLLACGPTLHRVCGENSYSKGSCLLLGSRWEIIQTVPDATPECPHQEMDIVFLI 171 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 + + + + ++ + + Y+ + + Sbjct: 172 ------DGSGSIDQNDFNQMKGFVQAVMGQFEGTDT-------LFALMQYSNLLKIHFTF 218 Query: 254 PLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 +S ++ + T T + +L++ K + KK +I IT Sbjct: 219 TQFRTSPSQQSLVDPIVQLKGLTFTATGILTVVTQLFHHKNGAR-----KSAKKILIVIT 273 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---APPEGQDLLRKCTDSS--GQF 367 DG+ Y++ L + AG+ Y++ V P + L + + Sbjct: 274 DGQ-----KYKDPLEYSDVIPQAEKAGIIRYAIGVGHAFQGPTARQELNTISSAPPQDHV 328 Query: 368 FAVNDSRELLESFDKITDKI 387 F V++ L ++ +KI Sbjct: 329 FKVDNFAALGSIQKQLQEKI 348 >gi|1167550|gb|AAB38547.1| leukointegrin alpha d chain [Homo sapiens] Length = 1162 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/320 (11%), Positives = 89/320 (27%), Gaps = 30/320 (9%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 G + + G N + + A L ++ + L+L Sbjct: 36 GQSVVQFGGSRLVVGAPLEVVAANQTGRLYDCAAATGMCQPIPLHIRPEAVNMSLGLTLA 95 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYL-QKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 ++ R + + S S L + + P + Sbjct: 96 ASTNGSRLLACGPTLHRVCGENSYSKGSCLLLGSRWEIIQTVPDATPECPHQEMDIVFLI 155 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 + + + + ++ + + Y+ + + Sbjct: 156 ------DGSGSIDQNDFNQMKGFVQAVMGQFEGTDT-------LFALMQYSNLLKIHFTF 202 Query: 254 PLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 +S ++ + T T + +L++ K + KK +I IT Sbjct: 203 TQFRTSPSQQSLVDPIVQLKGLTFTATGILTVVTQLFHHKNGAR-----KSAKKILIVIT 257 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---APPEGQDLLRKCTDSS--GQF 367 DG+ Y++ L + AG+ Y++ V P + L + + Sbjct: 258 DGQ-----KYKDPLEYSDVIPQAEKAGIIRYAIGVGHAFQGPTARQELNTISSAPPQDHV 312 Query: 368 FAVNDSRELLESFDKITDKI 387 F V++ L ++ +KI Sbjct: 313 FKVDNFAALGSIQKQLQEKI 332 >gi|62548866|ref|NP_005344.2| integrin alpha-D precursor [Homo sapiens] gi|296434544|sp|Q13349|ITAD_HUMAN RecName: Full=Integrin alpha-D; AltName: Full=ADB2; AltName: Full=CD11 antigen-like family member D; AltName: Full=Leukointegrin alpha D; AltName: CD_antigen=CD11d; Flags: Precursor gi|162317970|gb|AAI56096.1| Integrin, alpha D [synthetic construct] gi|168275856|dbj|BAG10648.1| integrin alpha-D precursor [synthetic construct] Length = 1161 Score = 58.4 bits (139), Expect = 2e-06, Method: Composition-based stats. Identities = 38/320 (11%), Positives = 89/320 (27%), Gaps = 30/320 (9%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 G + + G N + + A L ++ + L+L Sbjct: 36 GQSVVQFGGSRLVVGAPLEVVAANQTGRLYDCAAATGMCQPIPLHIRPEAVNMSLGLTLA 95 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYL-QKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 ++ R + + S S L + + P + Sbjct: 96 ASTNGSRLLACGPTLHRVCGENSYSKGSCLLLGSRWEIIQTVPDATPECPHQEMDIVFLI 155 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 + + + + ++ + + Y+ + + Sbjct: 156 ------DGSGSIDQNDFNQMKGFVQAVMGQFEGTDT-------LFALMQYSNLLKIHFTF 202 Query: 254 PLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 +S ++ + T T + +L++ K + KK +I IT Sbjct: 203 TQFRTSPSQQSLVDPIVQLKGLTFTATGILTVVTQLFHHKNGAR-----KSAKKILIVIT 257 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---APPEGQDLLRKCTDSS--GQF 367 DG+ Y++ L + AG+ Y++ V P + L + + Sbjct: 258 DGQ-----KYKDPLEYSDVIPQAEKAGIIRYAIGVGHAFQGPTARQELNTISSAPPQDHV 312 Query: 368 FAVNDSRELLESFDKITDKI 387 F V++ L ++ +KI Sbjct: 313 FKVDNFAALGSIQKQLQEKI 332 >gi|291401974|ref|XP_002717657.1| PREDICTED: inter-alpha trypsin inhibitor heavy chain precursor 5 [Oryctolagus cuniculus] Length = 940 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 58/154 (37%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S N ++ ++ ++P T+ A+ A R L N Sbjct: 330 RFNIIGFSNRIKVWKDNLISVTPNSIRDGKIYIHHMSPTGGTDINGALQTAIRLLNNYVA 389 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + R ++F+TDG+ + + + + + I+++ + + Sbjct: 390 HNDIE---DRSVSLIVFLTDGKPTVGETHTLKILNNT--KEAAQGRVCIFTIGIGNDVDF 444 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + N +L+ +D+I Sbjct: 445 KLLEKLSLENCGLTRRVHEEENAGAQLIGFYDEI 478 >gi|284052943|ref|ZP_06383153.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] gi|291569121|dbj|BAI91393.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 463 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 26/215 (12%), Positives = 66/215 (30%), Gaps = 34/215 (15%) Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQ 251 ++ ++ S+++ + + + RI + ++ Sbjct: 62 TIPTGNSYFLDGKQYTQVTGGKTKIDQVIESLERLVSSGQADSRDRIALVRFDDSASVLL 121 Query: 252 CTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 S + +K+ + +L T M A L + + Sbjct: 122 PLTASTDTASLKNAIGQLRNFSGGTRMALGMEEALNIL----------KNCDLSSRRTLI 171 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFA 369 TDG+ + + + AG+ I ++ V +DLL +D + G+ F Sbjct: 172 FTDGQT------FDESDCRDLATQFAEAGIPITALGVG--EYNEDLLLYLSDRTGGRVFN 223 Query: 370 V------NDSREL------LESFDKITDKIQEQSV 392 V + ++ F+++ + Q + + Sbjct: 224 VVETQTHTGTTDIPISELPNTIFEEV-QQAQSEVI 257 >gi|221132796|ref|XP_002166108.1| PREDICTED: similar to fibrillar collagen [Hydra magnipapillata] Length = 2213 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 17/90 (18%), Positives = 36/90 (40%), Gaps = 1/90 (1%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 S R + + +TDG + S+ + ++ +++ G++IYS+ + L Sbjct: 81 NSGRKRCCLAVLTDGRQTRGSSAPDAVDLHVASRPLKDIGVQIYSLGIGRD-YDIGELLD 139 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQE 389 F +D EL+ IT+ + Sbjct: 140 IASDDASVFRSSDVDELVSIVASITETTCK 169 >gi|291087628|ref|ZP_06346959.2| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] gi|291074491|gb|EFE11855.1| putative von Willebrand factor type A domain protein [Clostridium sp. M62/1] Length = 473 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 62/182 (34%), Gaps = 23/182 (12%) Query: 210 IDVLIESAGNLVNSIQ--KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ + + +++ A E R+G ++++ N LS + + Sbjct: 62 LEAAKKGIKAFIETLERESAQPEGYAGEKRVGLVSFSDTATVNSM--LSPVVEQAARAAE 119 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L +N A+ A + L + K + ITDG+ S Sbjct: 120 GLTAGGKSNQAEAIRAAVKLLDMKTPGE----------KMLFLITDGQTPFRS------Q 163 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSS--GQFFAVNDSRELLESFDKIT 384 T R AG+ +Y + ++AP ++ LR + + E +F+++ Sbjct: 164 TDSAAAEARQAGVTVYCIGIAAPDGVNREALRSWASGPSDSHIIEIRELGEAQTAFERLM 223 Query: 385 DK 386 Sbjct: 224 KN 225 >gi|332879903|ref|ZP_08447588.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682114|gb|EGJ55026.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 345 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 60/180 (33%), Gaps = 43/180 (23%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 R+G +AY L+ + + K L +N ++ A+ A R N + Sbjct: 130 RVGIVAYAASAYPQ--LALTTDHSAAKMFLQSMNTNMLSSQGTAIQEAIRMATNYFD--- 184 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---- 352 + + + I+DGE+ + + +I + G+ IY++ V Sbjct: 185 ---DKSTTSRLLFIISDGED-------HEMGATEIAAEAQEKGIHIYTIGVGTEKGSPIP 234 Query: 353 ----------------------GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++LL++ ++G + + E+ +I ++ Sbjct: 235 MRELGEQSYKRDRNGEVVITRLNKELLQQIAINAGGQYL--NGDNTQEAVSQIEKILEST 292 >gi|271968449|ref|YP_003342645.1| von Willebrand factor type A domain-containing protein [Streptosporangium roseum DSM 43021] gi|270511624|gb|ACZ89902.1| von Willebrand factor type A domain protein [Streptosporangium roseum DSM 43021] Length = 490 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 36/291 (12%), Positives = 82/291 (28%), Gaps = 30/291 (10%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 A Y L L + E D D Sbjct: 72 TASYGYAKRILQEGRLPEPGQIRPEEFVNSFRQDYKEPGD-------DGFTVHMDGARMP 124 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 N + P+ N T ++D++ E+ LV+ + Sbjct: 125 ENGTALIRVGLQTRKAEPEARR-PANLTFVVDVSGSMGEPGRLDLVREALHKLVDQLGPG 183 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 ++ +A++ + +++ + +++L ++TN + Y E Sbjct: 184 D--------QVSIVAFSTQARLVLSMTPATGRDQLHAAIDRLGVEDSTNLETGLTAGYAE 235 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + VI ++DG + + + E + + V V Sbjct: 236 AARAFRPAATNR--------VILLSDGLANTGDTTWQGILDR-VAESA-GRQITLLCVGV 285 Query: 348 SAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESF-DKITDKIQEQSVRIAP 396 G L+ + D+ G V+ + + + F +++ + + R A Sbjct: 286 GRD-YGDQLMEQLADNGDGAAVYVSSADDARKVFVEQLATNL-DLRARDAK 334 >gi|224092755|ref|XP_002190101.1| PREDICTED: inter-alpha (globulin) inhibitor H2 [Taeniopygia guttata] Length = 948 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 18/156 (11%), Positives = 56/156 (35%), Gaps = 11/156 (7%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 N + + + + K + ++P TN A+ A L K S Sbjct: 359 CWRDNLVSATPSQVEDAKKYIQTIHPNGGTNINEALLRATFILNEAKSLGMLDPNS---V 415 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS- 364 ++ ++DG+ + T+ + + +++ ++ + + D L++ + Sbjct: 416 SMIVLVSDGDPTVGELKLTTIQ-KNVKQSIKDE-FSLFCLGIGFDV-DYDFLQRIATDNR 472 Query: 365 GQFFAV----NDSRELLESFDKITDKIQEQSVRIAP 396 G + S ++ +++++ + ++ P Sbjct: 473 GMAHRIFGNQETSLQMKNFYNQVSTPLLKKIQFNYP 508 >gi|152993598|ref|YP_001359319.1| hypothetical protein SUN_2020 [Sulfurovum sp. NBC37-1] gi|151425459|dbj|BAF72962.1| hypothetical protein [Sulfurovum sp. NBC37-1] Length = 940 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 27/212 (12%), Positives = 64/212 (30%), Gaps = 24/212 (11%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 S KI++ ++ N+V + G Sbjct: 18 QVAAEDNPRAVIIFDASGSMWGQINGVTKIEIARDALKNVVREWN---PNVELGLTVYGH 74 Query: 241 IAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + P+ + V + K+ P T ++ A EL +E + Sbjct: 75 RSKGDCNDIEVVIPIGKVDKKRVIDTVMKIKPKGKTPISRSLRKAAGELKYTEEKAT--- 131 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI--YSVAVSAPPEGQDLL 357 +I I+DG+ + + + ++ G+ + V + + L Sbjct: 132 --------IILISDGKET------CDPDPCATAKELKKEGIDFVAHVVGFNVDKKTDKQL 177 Query: 358 RKCT-DSSGQFFAVNDSRELLESFDKITDKIQ 388 + G++F+ ++ L ++ I K++ Sbjct: 178 ECIANATGGEYFSAKNAAALNKAMKTIVKKVE 209 >gi|304310230|ref|YP_003809828.1| hypothetical protein HDN1F_05810 [gamma proteobacterium HdN1] gi|301795963|emb|CBL44164.1| hypothetical protein HDN1F_05810 [gamma proteobacterium HdN1] Length = 371 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 31/288 (10%), Positives = 91/288 (31%), Gaps = 20/288 (6%) Query: 106 ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL 165 + + F ++ +A L + + ++ + + + + +++ Sbjct: 20 DVRVSARSLFSRAFGGRMLGAASAALVASAFMVSAPLAQAAVSPEGLRIVGNANPVQVFV 79 Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL--VNS 223 N N+ + + + L S + + L + Sbjct: 80 PTQN-NSGLDISDFQLREDGSSQTLVSAEPNSSLPFVTVFVMDYSPSVRNNETALKRMEE 138 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 K + I +N + L++N + + + ++ + P Sbjct: 139 AAKGFVSLMQPKDKAAVIKFNDRVEVMGA-GLTSNHDTLNAAIDSIPP----------QR 187 Query: 284 AYRELYNEKESSHNTIG-STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 Y +LY+ + + +L +I ++DG++ + L + + AG + Sbjct: 188 GYSKLYDAISKAIEVSNCNPKLVCSIIVLSDGDD-----VGSALPLADLHNQLYQAGTAV 242 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + L T+S G ++ DS + +++I ++ + Sbjct: 243 FPIGYGDNINVNKLQELATNSGGAYYTSEDSSQFSAVYERIWARLSNE 290 >gi|160896215|ref|YP_001561797.1| von Willebrand factor type A [Delftia acidovorans SPH-1] gi|160361799|gb|ABX33412.1| von Willebrand factor type A [Delftia acidovorans SPH-1] Length = 536 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 45/132 (34%), Gaps = 9/132 (6%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 L +V + + L T Y A+ A ++ E + S ++ +TD Sbjct: 408 DLQAASAQVLAYADSLVADGGTAIYDALTLAQQQARQELRADPERFVS------IVLLTD 461 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 G N+ + ++ ++++ + G+ + + G+ F ++ Sbjct: 462 GANTAGRDWAAFEREQRMARDGGAPLVRVFPIIFGEAQSGE-MQALAALTGGRAFDARNT 520 Query: 374 --RELLESFDKI 383 L F +I Sbjct: 521 GKSGLPLVFKEI 532 >gi|326798073|ref|YP_004315892.1| von Willebrand factor type A [Sphingobacterium sp. 21] gi|326548837|gb|ADZ77222.1| von Willebrand factor type A [Sphingobacterium sp. 21] Length = 622 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 59/182 (32%), Gaps = 19/182 (10%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ ++ S LV+ ++K R+ + Y +N Sbjct: 258 VSGSMDGPNRLPLVKSSLKMLVDQLRKED--------RVAIVTYAGTARIKLAPVWANEK 309 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +K+ +++L+ +T + AY + N +I +DG+ + Sbjct: 310 MRIKNAIDELDAGGSTAGGAGLKMAYDLAREHFKKDGNNR--------IILASDGDFNVG 361 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLE 378 + + + E R +G+ + + + + G + +++ E + Sbjct: 362 --PSSNEDMETLIEKERQSGVSLSVLGFGMVNLKDSKMELLANKGHGNYAYIDNLMEAKK 419 Query: 379 SF 380 + Sbjct: 420 AM 421 >gi|297606054|ref|NP_001057930.2| Os06g0578100 [Oryza sativa Japonica Group] gi|255677166|dbj|BAF19844.2| Os06g0578100 [Oryza sativa Japonica Group] Length = 622 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 39/272 (14%), Positives = 93/272 (34%), Gaps = 25/272 (9%) Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 ++ L L LI S L ++ S ++ S+ + +I L ++ Sbjct: 3 QLHNLMLLLPCLIFSTLLHIEAMSVAPVKVSTTPIFPTIPRAQTNKDFQVLLRVEAPPAA 62 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + + S + +P ++DVL S ++ + Sbjct: 63 DLNGHVPIDVVAVLDVSGSMNDP--VAASPESNLQATRLDVLKASMKFIIRKLDDGD--- 117 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPL----SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 R+ +A+N G V + L + + ++++L + A+ +E Sbjct: 118 -----RLSIVAFNDGPVKEYSSGLLDVSGDGRSIAGKKIDRLQARGG--SGSALMPELQE 170 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + S F++ +TDG+++ + + + + +++ A+ Sbjct: 171 AVKILDERQG--NSRNRVGFILLLTDGDDTTGFRWSRDVIHGAVGK------YPVHTFAL 222 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 A + + LL +S G + V+D L + Sbjct: 223 GAAHDPEALLHIAQESRGTYSFVDD-GNLDKI 253 >gi|224078385|ref|XP_002194338.1| PREDICTED: collagen, type XX, alpha 1 [Taeniopygia guttata] Length = 1505 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 56/167 (33%), Gaps = 21/167 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + ++ E NL++ A +R+G Y+ + Sbjct: 266 VDGSWSIGRNNFKLIKEFLSNLISPFSIA-----EDKIRVGLSQYSSDPRTEWELSAYST 320 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 +V + L NT T A+ H + + +K VI +TDG++ Sbjct: 321 REQVLEAVRNLRYKGGNTFTGLALTHVLEQ-----NLKPDAGARLEAEKLVILLTDGKSQ 375 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 + + ++N G++I+++ V + LR+ Sbjct: 376 D--------DANLAAQTLKNLGIEIFAIGV--KNADEAELRQVASEP 412 >gi|149636044|ref|XP_001506552.1| PREDICTED: similar to collagen type XX alpha 1 [Ornithorhynchus anatinus] Length = 1500 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 54/168 (32%), Gaps = 23/168 (13%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + ++ E +L++ A + IG Y+ + Sbjct: 241 VDGSWSIGRSNFRLVREFLASLISPFNIARDKIS-----IGLSQYSGDPRTEWDLNKFAS 295 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAY-RELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 ++V + L NT T A+ H + L E K VI +TDG++ Sbjct: 296 KDKVLEAVRNLRYKGGNTFTGLALTHVLEQNLKLEAGPRPEA------DKIVILLTDGKS 349 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 + +++ G+ I+++ V + LR+ Sbjct: 350 QD--------EANAAAQALKDLGISIFAIGV--KNADEAELRQVASHP 387 >gi|148655604|ref|YP_001275809.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567714|gb|ABQ89859.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 425 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 15/130 (11%), Positives = 51/130 (39%), Gaps = 15/130 (11%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 +++K+ + ++ T + A +E+ + + +I +TDG Sbjct: 103 KSDLKAAIAQIEAAGGTEMAQGLALALQEVQRPFLTRGISR--------LILLTDG---- 150 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC-TDSSGQFFAVNDSRELL 377 Y + ++I ++ G+ + ++ + +DLL + + + +++++ Sbjct: 151 -RTYGDESRCVEIARRGQSRGIGLTALGIGTEWN-EDLLETMTASENSRAQYIATAQDVV 208 Query: 378 ESFDKITDKI 387 + F ++ Sbjct: 209 KVFADEVKRL 218 >gi|313212817|emb|CBY36735.1| unnamed protein product [Oikopleura dioica] Length = 696 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 40/353 (11%), Positives = 91/353 (25%), Gaps = 31/353 (8%) Query: 43 SIVSDRTIKDPTTKKDQTSTIFKKQIKK-HLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 + D + +T K + ++ ++ N G I A I + Sbjct: 30 ATGEDVARATFAKVWKEIATWDVKYVDGGFFERYAFAESNYGLIQHHAVQQIWTIDEDQF 89 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST-GIIERSSENLAISICMVLDVSRSM 160 + I + + + S + + I +D ++ Sbjct: 90 NTTKTVALKNSISNKFNMDWDRLKYEELTIPTNSILALFLVIDQFNEFPIPYSIDEQDAL 149 Query: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL 220 D ++ + + + + + + +++ + Sbjct: 150 YTRITHNTVDGFKDKVDELDQNTQNECTTKALDIVFV-VDESGSVGPDNFELVKLFLIDY 208 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYP 279 A RI Y+ + + S +N L TNT Sbjct: 209 AQDSNIAADA-----TRIAIRTYSTNSDLDFSLN-DFKTRNIISEINNLVYASGGTNTAD 262 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A+ + N + K ++ ITDG++S + ++ Sbjct: 263 AITKGLNDFGN---------DRSESVKIMVTITDGQSSYDH-------VKAAADLLKADP 306 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITDKIQEQS 391 I S A+ L+ + G + D + F I +Q + Sbjct: 307 RNIQSFAIGIDGANMAELQAIATTDPGHIEMLKDWTD----FGPIKQNLQSKV 355 >gi|94969533|ref|YP_591581.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551583|gb|ABF41507.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 362 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 41/267 (15%), Positives = 99/267 (37%), Gaps = 35/267 (13%) Query: 136 TGIIERSSENLAISICMVL-DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + + + + +VL V+ + D L + N +P + Sbjct: 78 PADVNTHTRPMRVDVNIVLVPVTVTDPDNRLVTGLEKENFEVLDQNIPQQIRHFSSEDAP 137 Query: 195 TKSKYAPA-PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 + KID E+ + N +A+N Sbjct: 138 VSIGVIFDMSGSMSNKIDKSREAIVEFFKTA--------NPDDEFFVVAFNDK--PEVLQ 187 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 +N + +++ +L L P + T+ A++ ++ + +K ++ I+D Sbjct: 188 DFTNRIEDIQEKLTILQPKDRTSLLDAIYLGMNKM----------RQAKYERKALLIISD 237 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV-------SAPPEGQDLLRKCTD-SSG 365 G ++ + +N + ++ +R A ++IY++ + + G LL + +D + G Sbjct: 238 GGDNHSRYTENEIKSM-----VREADVQIYAIGIYDLAPTTTEEMAGPALLGEISDWTGG 292 Query: 366 QFFAVNDSRELLESFDKITDKIQEQSV 392 + F +++ EL + KI +++ Q V Sbjct: 293 RMFPIDNVNELADVATKIGVELRNQYV 319 >gi|194209663|ref|XP_001495019.2| PREDICTED: similar to collagen, type XXVIII [Equus caballus] Length = 1127 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 63/190 (33%), Gaps = 26/190 (13%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + ++ L + + + RIG I Y+ + ++ + Sbjct: 805 SSESVGPENFQIIKNFVKTLTDQVAL-----DLDTARIGIINYSHKVEMVAHLTQFSSKD 859 Query: 261 EVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + K ++ + T T A+ HA ++ +KK + ITDG+ Sbjct: 860 DFKLAVDNMQYLGEGTYTATAL-HAANHMFEAARPG--------VKKVALVITDGQTDT- 909 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ-----DLLRKCTDSSG--QFFAVND 372 ++ N ++ + +A ++I+ + V + + + +D Sbjct: 910 ---RDEKNLTEVVKNASDASVEIFVIGVVKRNDPNFEMFHKEMNLIATDPDSEHVYQFDD 966 Query: 373 SRELLESFDK 382 L ++ + Sbjct: 967 FITLQDTLKQ 976 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 29/150 (19%), Positives = 53/150 (35%), Gaps = 17/150 (11%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN--NLNEVKSRLN 267 D + +L + + + ++L I A +P S+ +L K R+ Sbjct: 64 FDKQKDFVDSLSDKVFQLTP-VRSLKYDIKLAALQFSSSVQIDSPFSSWKDLQTFKQRVK 122 Query: 268 KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +N T +Y A+ +A R L E K + +TDG + + Sbjct: 123 SMNLIGQGTFSYYAISNATRLLQREGRKDGV--------KVALLMTDGID-----HPKNP 169 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + I E R AG+ ++ +S L Sbjct: 170 DVQSISEDARTAGILFITIGLSTVVNEAKL 199 >gi|326922791|ref|XP_003207628.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Meleagris gallopavo] Length = 1224 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 67/197 (34%), Gaps = 25/197 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + +++ + LV+ + + RIG + Y++ + Sbjct: 780 SSESVGPENFEIIKDFVTALVDRVT-----VGRNATRIGLVLYSLEVQLEFGLNKHTTQQ 834 Query: 261 EVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +VK + K+ T T A+ A +E + + ++K + +TDG+ Sbjct: 835 DVKRAIRKMQYMGEGTYTGTAIRKATQEGFLGARTG--------VRKVAVVLTDGQADKR 886 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD-----LLRKCTDSSG--QFFAVND 372 A + + + A +++Y++ + + L + ++D Sbjct: 887 EAVKLDV----VVREAHAANIEMYAIGIVNTSDPTQAEFVHELNLIASDPDREHMYLIDD 942 Query: 373 SRELLESFDKITDKIQE 389 L K+ ++ E Sbjct: 943 FNTLPALESKLVNQFCE 959 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 56/167 (33%), Gaps = 19/167 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + LS R+ + Y+ + Q KS + + T T A+ Sbjct: 61 SGRTLSWRMALLQYSSTVSTEQTFHDWKGPEAFKSHIAPITYIGHGTYTTYAI------- 113 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-- 346 + T G+ K + TDG + + + +N G+ ++++ Sbjct: 114 -TNLTQLYMTEGTPDSLKLAVLFTDGVD-----HPRNPDIFAATADAKNQGIVLFTMGMT 167 Query: 347 -VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 V+ LR F V++ +E + +K+ ++ Q+ Sbjct: 168 RVAEEVSNAAKLRLLASVPASRF-VSNLQE-KGTVEKVLTEMCSQTT 212 >gi|313159754|gb|EFR59111.1| von Willebrand factor type A domain protein [Alistipes sp. HGB5] Length = 340 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 35/245 (14%), Positives = 73/245 (29%), Gaps = 54/245 (22%) Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 L + S A +++ + L + + + Sbjct: 71 LAAARPQFGSKLREEKTQGVEMMLAVDVSNSMLAEDFEPNRLERTKYAINKLFDGLHQ-- 128 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHA 284 R+G I + P++++ K+ +++P + T A+ A Sbjct: 129 -------DRVGLIVFAGEPKVQ--LPITSDYRMAKAFAKRIDPSLVPVQGTAIGKALSQA 179 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 E E +H + VI ITDGEN A + ++ G++IY+ Sbjct: 180 LMSFSGETEENH--------SRVVILITDGENHEDDALAAARHAAEM-------GIRIYT 224 Query: 345 VAVSAPPE----------------------GQDLLRKCTD-SSGQFFAVNDSR-ELLESF 380 + + P +++L + D + G + + L E Sbjct: 225 IGIGTPEGAPIQIGGEFIKDEKGDMVVSKLNEEMLAQIADITGGAYVRSSKQSIGLDEIV 284 Query: 381 DKITD 385 I + Sbjct: 285 KSINE 289 >gi|110667707|ref|YP_657518.1| hypothetical protein HQ1753A [Haloquadratum walsbyi DSM 16790] gi|109625454|emb|CAJ51881.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790] Length = 799 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 23/126 (18%), Positives = 42/126 (33%), Gaps = 20/126 (15%) Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 L+ N + + R+ +L TN + A L + VI I+DG Sbjct: 440 LTENRDTTRQRIRQLRAGGGTNIANGLRGAEEML-------------DGQRGTVILISDG 486 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 + + E + G+++ +V L + S G +F N++ Sbjct: 487 VD-------ARSRATVVAESLGRRGVRVITVGAGQRVNEPLLEQIADISGGTYFQANETD 539 Query: 375 ELLESF 380 L F Sbjct: 540 RLRILF 545 >gi|158337332|ref|YP_001518507.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158307573|gb|ABW29190.1| von Willebrand factor, type A domain protein [Acaryochloris marina MBIC11017] Length = 686 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 38/325 (11%), Positives = 88/325 (27%), Gaps = 31/325 (9%) Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSL 133 G++ E+ I + + + + A Y + L P L Sbjct: 201 PGTFNTEDYKRINENPFFLPQRTPLSTFSIDVD-TASYSNVRRFIRQGQLPPKDAVRLEE 259 Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 + V + N L + N Sbjct: 260 LINYFDYGYASPKGDQPFSVSTEVATAPWN--------NQHKLVHIGLKGKELEKEQPSN 311 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 K+ ++ +S LV+ ++ R+ + Y + Sbjct: 312 LVFLIDVSGSMKRPNKLALVKKSLCLLVHQLKPED--------RVSLVVYAGRAGIVLPS 363 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 + + +++L +T + AY + N VI TD Sbjct: 364 TPGTQKATIMNAIDRLEAGGSTAGAAGIKMAYDMAERHFLKNGNNR--------VILATD 415 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVND 372 G+ + + L ++ E R+ G+ + + + + + G + ++ Sbjct: 416 GDFNVGQSSDAEL--ERLIEQKRDRGVFLTVLGYGTGNYKDNKMELLANKGNGNYAYIDT 473 Query: 373 SRELLESFDKITDKIQEQSVRIAPN 397 LLE+ + + ++ IA + Sbjct: 474 ---LLEAQKVLVNDLRGTLFTIAKD 495 >gi|75907531|ref|YP_321827.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701256|gb|ABA20932.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 427 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 53/176 (30%), Gaps = 26/176 (14%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHHAYRELYNEKESS 295 RI +A++ + + +K+++ KL T + EL + Sbjct: 78 RISVVAFSGSATVIIPNQIVEDPESIKTQIRKKLQASGGTVIAEGLQQGITELMKGTRGA 137 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQ-------NTLNTLQICEYMRNAGMKIYSVAVS 348 +TDG + ++ + + + I ++ Sbjct: 138 ---------VSQAFLLTDGHGEDSLKIWKWEIGPDDSRRCQEFAKKAAKINLTINTLGFG 188 Query: 349 APPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQE-------QSVRIAP 396 QDLL D G + + + F+++ ++Q + +AP Sbjct: 189 NNWN-QDLLETIADAGGGTLAHIERPEQAVHHFNRLFARVQSVGLTNAYLILSLAP 243 >gi|84498071|ref|ZP_00996868.1| hypothetical protein JNB_18328 [Janibacter sp. HTCC2649] gi|84381571|gb|EAP97454.1| hypothetical protein JNB_18328 [Janibacter sp. HTCC2649] Length = 651 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 24/191 (12%), Positives = 65/191 (34%), Gaps = 26/191 (13%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT----IAYNIGIVGNQCTPLSN-NL 259 KI+ ++ +V ++ Q + P++ + Sbjct: 57 SGLTKIEAAKKALTGVVGALPDTAQVGLRVYGATVDGKGKPTPAACADTQLIHPIAALDK 116 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++ + + + T ++ A ++L K+ ++ ++DGE Sbjct: 117 TKLTTTIAAIKALGETPIAHSLTEALKDLGTSG------------KRNIVLVSDGEE--- 161 Query: 320 SAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 + I + + AG+ +I +V + + L+ D+ G ++ D+ L Sbjct: 162 ---SCVPDPCPIVKKLTAAGVDLQIDTVGFGVNAKARTQLQCIADAGKGTYYDAKDAGAL 218 Query: 377 LESFDKITDKI 387 S +K++ + Sbjct: 219 ATSLNKLSQRA 229 >gi|84498180|ref|ZP_00996977.1| putative membrane protein [Janibacter sp. HTCC2649] gi|84381680|gb|EAP97563.1| putative membrane protein [Janibacter sp. HTCC2649] Length = 654 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 58/187 (31%), Gaps = 26/187 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + + + + S K VRIG +++ + V++ Sbjct: 100 GRSGMATVRTAVKDFLASAPK--------DVRIGVVSFGNTAGPEIAP--TTARAAVQAV 149 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++ L NT + + A R L + + ++ ++DG+N+ Sbjct: 150 VDDLRADGNTALFSGVTQAVRMLGSTG------------DRSIVLLSDGKNTVGDRAS-- 195 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKIT 384 + + + +++ V + + L G D+ + +F Sbjct: 196 -GLAAAGKALTASQVRVEVVRFTTGENDPEALAAFAKAGGGSVVQATDAEGVRTAFQTAA 254 Query: 385 DKIQEQS 391 ++ Q Sbjct: 255 KVLESQV 261 >gi|260800505|ref|XP_002595170.1| hypothetical protein BRAFLDRAFT_240981 [Branchiostoma floridae] gi|229280413|gb|EEN51181.1| hypothetical protein BRAFLDRAFT_240981 [Branchiostoma floridae] Length = 183 Score = 58.0 bits (138), Expect = 2e-06, Method: Composition-based stats. Identities = 21/156 (13%), Positives = 59/156 (37%), Gaps = 15/156 (9%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 S +I Y+ +V ++ E+ ++ + T T A++ + ++ Sbjct: 42 SSAQISLFQYSNFVVQEFALDTYGSIGEINQAVDAVMYQGGGTATGLALYEMRQYGFSFA 101 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 ++ I +TDG +T++ + AG+ +Y V + + + Sbjct: 102 NGGR-----PGTRRVAILLTDGM------SSDTVDKHAMA--AWQAGISLYVVGIGSNVD 148 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 +LL + F++ + +L + ++ +++ Sbjct: 149 MNELL-AIGGTPDNVFSLGNFGQLQDLGSRLPNRLC 183 >gi|297154321|gb|ADI04033.1| hypothetical protein SBI_00912 [Streptomyces bingchenggensis BCW-1] Length = 423 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 27/201 (13%), Positives = 68/201 (33%), Gaps = 30/201 (14%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN------ 257 ++ ++ +++ A+ ++ L +R Y C Sbjct: 56 VDGMSRMAAAKQAFNEVID----AVPDEVRLGIRTLGATYPGQDRKTGCLDSKQLYPVGR 111 Query: 258 -NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + E K+ + L P T A+ A ++L + ++ ITDGE+ Sbjct: 112 VDRTEAKTAVATLRPTGWTPIGLALRGASKDLA-----------GGDATRRIVLITDGED 160 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDS 373 S + + + G + ++ ++ + +D L + + G + AV + Sbjct: 161 SCGQ-----PDPCDVARELAAQGTHLVVDTLGLTLDSKVRDQLSCIAEATGGTYTAVQHT 215 Query: 374 RELLESFDKITDKIQEQSVRI 394 +L ++ + + V+ Sbjct: 216 DQLSTRIKQLVRRAADTPVQT 236 >gi|257086444|ref|ZP_05580805.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis D6] gi|256994474|gb|EEU81776.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis D6] Length = 1154 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 202 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 259 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 260 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 315 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 316 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 375 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 376 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 435 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 436 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 485 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 486 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 545 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 546 DYAPDISDYLAKKAVQISGTVV 567 >gi|94309590|ref|YP_582800.1| hypothetical protein Rmet_0645 [Cupriavidus metallidurans CH34] gi|93353442|gb|ABF07531.1| conserved hypothetical protein [Cupriavidus metallidurans CH34] Length = 434 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 33/308 (10%), Positives = 84/308 (27%), Gaps = 7/308 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + ++I V FI A+DL + ++++Q+ D+ L+ + + Sbjct: 25 IVGLMIVVLVGFIGLALDLGKLYVSKSELQNRADSCALAAARDLTGATPLTVSEAAGLTA 84 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKA------QINITKDKNNPLQYIAESKAQYEIP 114 + + +L+Q I + N N ++Y+ ++ I Sbjct: 85 AARNLVLFQGNLEQQPNITSAESVTYSDSLANPFLDKNSVTYALNTIKYVKCDVSRGNIA 144 Query: 115 TENLFLKG-LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 + + + ++ + + +S +I + + + + Sbjct: 145 NWFAQVLNAIPGIHIGANTVGAFAVATTTSAQTTCAIPVYICRPDTATPAVPGGYTIGQW 204 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 +T+ S + ++ + +N+ K + Sbjct: 205 LTAKIGTSGGGVYGGGNFGWADLSGGKGNTPALDTQLQGAGQCNLPALNTQIGTTGNKAS 264 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 LS + T + + + N Y T+T L K Sbjct: 265 LSDAWNSRFGITTNKTTGVTDFTGFAYTSTTWTAQNNAYNGTSTDSKGASQPNFLAARKT 324 Query: 294 SSHNTIGS 301 S Sbjct: 325 YQPYQSDS 332 >gi|332832625|ref|XP_520182.3| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Pan troglodytes] Length = 3571 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 47/132 (35%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + ++ T T A A + L + +E+S K + ITDG ++G Sbjct: 153 LLQEIPAISYRGGGTYTKGAFQQAAQILLHARENST---------KVIFLITDGYSNGG- 202 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R++G++I++ L + + ++ E Sbjct: 203 ------DPRPIAASLRDSGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 251 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 252 -FEALARRALHE 262 >gi|311070191|ref|YP_003975114.1| hypothetical protein BATR1942_16320 [Bacillus atrophaeus 1942] gi|310870708|gb|ADP34183.1| hypothetical protein BATR1942_16320 [Bacillus atrophaeus 1942] Length = 226 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 29/213 (13%), Positives = 66/213 (30%), Gaps = 23/213 (10%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN----SIQKAIQEKKNLSVRIGTI 241 + + S RKIDV +S + ++ + + Sbjct: 29 PDTNVAVLFDGSGSMVQKTGGERKIDVAKKSVKSFAELLPEDTNLMLRVFGHEGNNKLSG 88 Query: 242 AYNIGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 L + + L+++NP T A+ E Sbjct: 89 KALSCSTTETIYGLHPYEGSLFNNALSQINPTGWTPIAKALSDTREEFQ---------KV 139 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLR 358 + K V ITDGE + + + E +R A + + + + +G ++ Sbjct: 140 NAEGKNVVYLITDGEETCGG------DPAKEIEKLREANVDTIVNIIGFNFDVKGNQKMK 193 Query: 359 KCT-DSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G++ + ++ E ++++ K E+ Sbjct: 194 QAAVAGGGEYISAKNAEEFEQAWENEAQKFTEE 226 >gi|153000354|ref|YP_001366035.1| vault protein inter-alpha-trypsin subunit [Shewanella baltica OS185] gi|151364972|gb|ABS07972.1| Vault protein inter-alpha-trypsin domain protein [Shewanella baltica OS185] Length = 772 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 24/144 (16%), Positives = 49/144 (34%), Gaps = 12/144 (8%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 S+NL+ + +++L T A+ A K + + + + VI Sbjct: 444 TPLPATSSNLSRARQFVSRLQADGGTEMALALDAAL-----PKSLGSVSPDAVQPLRQVI 498 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 F+TDG A + + ++++V + + P + R G F Sbjct: 499 FMTDGSVGNEQALFDLIRYQIGES-------RLFTVGIGSAPNSHFMQRAAELGRGTFTY 551 Query: 370 VNDSRELLESFDKITDKIQEQSVR 393 + E+ + KIQ + Sbjct: 552 IGKVDEVDAKISALLSKIQYPVLT 575 >gi|198433657|ref|XP_002122417.1| PREDICTED: similar to PK-120 [Ciona intestinalis] Length = 864 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 33/335 (9%), Positives = 86/335 (25%), Gaps = 38/335 (11%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 + G + + ++ +T P + + Sbjct: 183 YADPGQVVGDFKIEVEITEPTGLTTVTATPPSPKLRRTINPDTFDTGNVEIRRSETQAYV 242 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 + +S + DV+R + + LP PKK + Sbjct: 243 SYRPTREQQRNIRRRSDLSFLVNYDVTREELGGEILIKDGYFVHFFAPTNLPVIPKKVVF 302 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + + S KI E+ +++ + + Q + + +V Sbjct: 303 VIDVSGSMSG-------HKIVQTKEALRTILDDLNEIDQFNIITFSSTTNVWHPNEMVDV 355 Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 T N+ K + + TN A + L + + +I Sbjct: 356 NPT----NIRNAKKHVRSMYARGGTNFNAAALDGIQLLETISSNRT---NTLEEASMMIL 408 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMK--------IYSVAVSAPPEGQDLLRKCTD 362 +TDG+ + ++ I+ + + L + Sbjct: 409 LTDGQPTVGVTG----------NEAIRRNIRERVNGRYSIFCLGFGQ-HLDHEFLDQIAS 457 Query: 363 SSGQ-----FFAVNDSRELLESFDKITDKIQEQSV 392 + + + + +L + +D++ + + Sbjct: 458 ENKGLSRKIYNDADAALQLKDFYDEVASPLLAHVI 492 >gi|257078107|ref|ZP_05572468.1| von Willebrand factor [Enterococcus faecalis JH1] gi|256986137|gb|EEU73439.1| von Willebrand factor [Enterococcus faecalis JH1] Length = 1154 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 202 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 259 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 260 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 315 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 316 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 375 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 376 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 435 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 436 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 485 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 486 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 545 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 546 DYAPDISDYLAKKAVQISGTVV 567 >gi|198436264|ref|XP_002122997.1| PREDICTED: similar to HyTSR1 protein [Ciona intestinalis] Length = 1993 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 29/191 (15%), Positives = 53/191 (27%), Gaps = 22/191 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + +++ LV S + RIG + YN + N Sbjct: 1813 VDSSSSIGDDNFELMRNFILELVRSFN-----VSRDTTRIGYVRYNNAVDERFQLNTFNT 1867 Query: 259 LNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 EV+ + + T T A+ +A + + I ITDG Sbjct: 1868 SEEVQEAIRAVPYRGSGTLTGQALSYA-----SRTSVRAPAGRRPGVPGVAIVITDGRAQ 1922 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 ++ + VA+ + L G F + D L Sbjct: 1923 D--------AVDAPARELQR---LMQVVAIGIRGAVPEQLNAIASQQGYVFNIEDFNRLD 1971 Query: 378 ESFDKITDKIQ 388 E I+ + Sbjct: 1972 EVLGSISSTVC 1982 >gi|288556553|ref|YP_003428488.1| hypothetical protein BpOF4_17775 [Bacillus pseudofirmus OF4] gi|288547713|gb|ADC51596.1| hypothetical protein BpOF4_17775 [Bacillus pseudofirmus OF4] Length = 459 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 42/350 (12%), Positives = 98/350 (28%), Gaps = 29/350 (8%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 A +++ + + T+ D+ I + + + + I + + + + Sbjct: 9 VAACMLAACSTESSETQTDEVQDIAVENVANEESESLEEGSSEKSIQETNDFSFNQPEAP 68 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + E Y + L + + I R ++ +S Sbjct: 69 YVPESMEDVLTYPVGEFALVDPSESEEFQ---DVLAELPILREEATSEEIRTYIIHLSTL 125 Query: 160 MEDLYLQKHNDNNNMTSNKYLLP-------PPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 Y N +L P P K+ S K+ + Sbjct: 126 TSPAYPDPRLLANRWEQLSFLAPNEEGELGPEEKELNVEILLDASGSMRDEVDGVDKMTL 185 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNP 271 E+ V + + +G ++ L + + + ++ + Sbjct: 186 AREAIQGFVEELPDQANVALRVYGHVGESPEKSCEGIDRVYDLQPYDESSFQDAIDGVMA 245 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T A+ + N + + V+ ++DG ++ + +Q Sbjct: 246 NGWTPLAKAIEVTSDDYRNASKDAT---------NMVLVVSDGMDTCGG------DPVQA 290 Query: 332 CEYMRNAGMK--IYSVAVSAPPEGQDLLRKCTDSSGQFF-AVNDSRELLE 378 + + + I + P Q LR+ +SG F VND +L Sbjct: 291 VKDLSELDVTPLISIIGFDVPANEQQQLREMAQASGSAFATVNDQAQLSA 340 >gi|323137991|ref|ZP_08073065.1| hypothetical protein Met49242DRAFT_2453 [Methylocystis sp. ATCC 49242] gi|322396710|gb|EFX99237.1| hypothetical protein Met49242DRAFT_2453 [Methylocystis sp. ATCC 49242] Length = 296 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 52/195 (26%), Gaps = 27/195 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A L I A+D + + R+ +Q ALDA VLS + + + Sbjct: 30 IFAFATLPMVLLIGGAVDYSRAIGARSNLQQALDAGVLSAAVKGGNPDSGQ--------- 80 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + ++ G + +T + Sbjct: 81 ---LARYLNSNMSPGGAATNVTLTRSVATGGAVTFVGDADFSVA-------------TNF 124 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + L S + +D+++ N + S + + Sbjct: 125 LKMAGLGAIKLHSHSEATLPAQIVTATFKPTSA--QGAYSKDIFIWTKNAAGTVVSRQTV 182 Query: 181 LPPPPKKSFWSKNTT 195 L S SK TT Sbjct: 183 LTYRYNSSNGSKVTT 197 >gi|281349285|gb|EFB24869.1| hypothetical protein PANDA_021744 [Ailuropoda melanoleuca] Length = 493 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 34/194 (17%), Positives = 65/194 (33%), Gaps = 23/194 (11%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV-- 262 + ++ ++ + +++ NL +R+ I Y G+ L+++ N++ Sbjct: 58 GFWHRSASANDNWKDIYIFVGDVVKKFPNLKMRVSFITY--STQGHTLMELTSDRNKIHN 115 Query: 263 -KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 SRL + P TN + A ++ E + +I +T G + Sbjct: 116 SLSRLKNIKPTGATNMHEGFKKANEQIEQENAGGN------NAASLIIALTTGPLT---- 165 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + T E R G K+Y V V +D L Q + V + F Sbjct: 166 PKALQETKSEAEKAREMGAKVYCVGV--KDYRKDQLDAIVGRKDQMYGVGNG------FK 217 Query: 382 KITDKIQEQSVRIA 395 + D V Sbjct: 218 SLQDMANLLVVNSC 231 >gi|158335198|ref|YP_001516370.1| von Willebrand factor type A domain-containing protein [Acaryochloris marina MBIC11017] gi|158305439|gb|ABW27056.1| von Willebrand factor type A domain protein, putative [Acaryochloris marina MBIC11017] Length = 573 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 21/133 (15%), Positives = 48/133 (36%), Gaps = 20/133 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 ++ + L NT Y ++ A L + VI +TDGE+ Sbjct: 450 GQSKGLEFVVALKADGNTKLYDSILAAQTWLTQNLRPNAI--------NAVIVLTDGED- 500 Query: 318 GASAYQNTLNTLQICEYMRNAG------MKIYSVAVS-APPEGQDLLRKCTDSSGQFFAV 370 + Q+ ++ +G + I++V A D+L++ +++G ++ Sbjct: 501 ----SGSGQQLPQLLSALKKSGFEGEQRIAIFTVGYGNAGDFAPDVLKQIAEANGGYYRQ 556 Query: 371 NDSRELLESFDKI 383 D + + + Sbjct: 557 GDPASIAQLMADL 569 >gi|118356063|ref|XP_001011290.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89293057|gb|EAR91045.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 520 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 20/148 (13%), Positives = 53/148 (35%), Gaps = 23/148 (15%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N+ +++ +++L T M A+ + +++ + ++DG++ Sbjct: 154 NSKKKIQKTVDELQAGGGTQIGFGMQKAFDIIKE--------RTNSKNLASIFLLSDGQD 205 Query: 317 SGASAYQNTLNTLQICEYMRNAGMK----IYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 + + M + ++ I + L + G F + D Sbjct: 206 NCGFSQTQHF--------MNQSKIEYPFCIDCFGFGDDHDSLTLSKINQLQQGTFNFIRD 257 Query: 373 SRELLESFDKITDKIQE---QSVRIAPN 397 ++ ++F I I+ Q+V+I+ N Sbjct: 258 ISQIDDAFTIILAGIKTFVAQNVKISVN 285 >gi|116201805|ref|XP_001226714.1| hypothetical protein CHGG_08787 [Chaetomium globosum CBS 148.51] gi|88177305|gb|EAQ84773.1| hypothetical protein CHGG_08787 [Chaetomium globosum CBS 148.51] Length = 777 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 54/172 (31%), Gaps = 21/172 (12%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ID++ +A +V ++ R+G + + P N + + + Sbjct: 108 IDLVKHAARTIVATLDS--------RDRLGIVTFTNRSKVG--IPPYENKAKTLENIESM 157 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 P+ +TN + + GST ++ +TDG + + Sbjct: 158 EPFSSTNMWHGIRDGLSLFSEA------EGGSTGRVPALLVLTDGMPN---YMCPPKGYV 208 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 + M I++ LL+ + G + + D+ L F Sbjct: 209 PMLRSMEPLPATIHTFGFGYELRS-GLLKSIAEVGGGNYSFIPDAGMLGTVF 259 >gi|332283431|ref|YP_004415342.1| hypothetical protein PT7_0178 [Pusillimonas sp. T7-7] gi|330427384|gb|AEC18718.1| hypothetical protein PT7_0178 [Pusillimonas sp. T7-7] Length = 585 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 49/136 (36%), Gaps = 20/136 (14%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + +N + P T ++ A L ++ V+ I+DG Sbjct: 93 DRDAFIKTVNAITPKGKTPISASLKQAADVLQYRDHNAT-----------VVLISDGLE- 140 Query: 318 GASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSR 374 + + ++ G+ K + V EG + L ++ G F +++ Sbjct: 141 -----SCHGDPCAVAAELKEKGVDFKAHVVGFDLDQEGNEALSCIAKNTGGIFVPASNAD 195 Query: 375 ELLESFDKITDKIQEQ 390 EL ++ ++ K+ ++ Sbjct: 196 ELQDALQQVQAKVVQK 211 >gi|219852403|ref|YP_002466835.1| hypothetical protein Mpal_1806 [Methanosphaerula palustris E1-9c] gi|219546662|gb|ACL17112.1| conserved hypothetical protein [Methanosphaerula palustris E1-9c] Length = 316 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 25/228 (10%), Positives = 61/228 (26%), Gaps = 43/228 (18%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 L S A +++ ESA L+ S+ Sbjct: 81 LDQTRDGVSVVLAIDDSGSMAANDYQPTRLEAAKESASVLIKSLDPKDYAG--------V 132 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + + G LS + + V + + T + A + Sbjct: 133 VIFESGATTAAY--LSPDKDRVMEKTAAIEQKNGQTALGDGLALAVDMADS--------- 181 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------- 352 KK V+ ++DG + +++++V + + Sbjct: 182 -IPNQKKVVVLLSDGVGNAGVISPEDATAFAA-----QNKVQVFTVGLGSKSPVLLGTDP 235 Query: 353 ---------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L+ + + G ++ D + L + + + +I + Sbjct: 236 TGTPQYATLDEAALQSIAEKTGGTYYTSVDEQTLHQIYAGLNKEIVRE 283 >gi|157831431|pdb|1IDO|A Chain A, I-Domain From Integrin Cr3, Mg2+ Bound Length = 189 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 40/119 (33%), Gaps = 15/119 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 81 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 130 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDK 386 G+ Y + V + + L F VN+ L +++ +K Sbjct: 131 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREK 189 >gi|119478003|ref|ZP_01618103.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2143] gi|119448916|gb|EAW30158.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2143] Length = 341 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 52/142 (36%), Gaps = 25/142 (17%) Query: 254 PLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P + + + + L + +T+ A+ + + + + +I Sbjct: 165 PFTADKDTWLTLLQETEIAMAGASTSIGDAIGLSISTFEHSDTDN----------RVLIV 214 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-S 363 +TDG ++G+ + +KIY++A+ P D L++ +D + Sbjct: 215 LTDGNDTGSRVP-----PVDAARVANARDVKIYTIAIGDPETIGEDAMDVDTLKQVSDIT 269 Query: 364 SGQFFAVNDSRELLESFDKITD 385 G +F D + L ++ I Sbjct: 270 GGAYFEALDRQALERAYLDIES 291 >gi|22219356|pdb|1M1U|A Chain A, An Isoleucine-Based Allosteric Switch Controls Affinity And Shape Shifting In Integrin Cd11b A-Domain Length = 195 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 22/119 (18%), Positives = 40/119 (33%), Gaps = 15/119 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 87 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 136 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDK 386 G+ Y + V + + L F VN+ L +++ +K Sbjct: 137 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLREK 195 >gi|116625802|ref|YP_827958.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228964|gb|ABJ87673.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 326 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 62/172 (36%), Gaps = 27/172 (15%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 N + ++ + E+ R + +T A+ A E+ Sbjct: 127 NANPADEFFLVQFSDRARLVAGMTKDSEEISRRAASMRIGGSTALLDAVAMAMEEM---- 182 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP- 351 + L+K ++ I+DG+++ + N L +R + IY++ ++ Sbjct: 183 ------KSAHYLRKVMVIISDGDDNSSRCPVNDLK-----RIVREGDVTIYAIGITDDNV 231 Query: 352 ----------EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 G LL + + G+ F V+ ++L E KI+ ++Q V Sbjct: 232 PLAYPQRDRLTGAALLNEIATQTGGRLFEVHKLKQLPEIAAKISGWTRKQYV 283 >gi|326334019|ref|ZP_08200248.1| BatA protein [Nocardioidaceae bacterium Broad-1] gi|325948168|gb|EGD40279.1| BatA protein [Nocardioidaceae bacterium Broad-1] Length = 336 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 62/209 (29%), Gaps = 47/209 (22%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +++ +A + ++ + +G ++++ + + V ++ Sbjct: 106 NRLEAAKAAAISFIDQLPA--------RYNVGLVSFSSSARVVTSP--TTDHALVVRSID 155 Query: 268 KL-NPYENTNTYPA----MHHAYRELYNEKESSHNTIGSTRL-------------KKFVI 309 L P T A + + L S + ++ Sbjct: 156 GLGPPDGGTAIGEAVYSSIDDLQQILEEAAASGPSQEPEESEGSEKSEESEEERSPAHLV 215 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG------------QDLL 357 ++DG NS + + AG+ ++A +D L Sbjct: 216 LLSDGGNSAGRSPVAAAEAARE------AGLPTSTIAYGTEGSSATLPGGQSVEVREDTL 269 Query: 358 RKCT-DSSGQFFAVNDSRELLESFDKITD 385 R + G+F+ + + EL E +D I Sbjct: 270 RNLADTTGGRFYRASSADELREVYDDIGT 298 >gi|54290564|dbj|BAD61973.1| zinc finger-like [Oryza sativa Japonica Group] gi|54291279|dbj|BAD62048.1| zinc finger-like [Oryza sativa Japonica Group] Length = 598 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 66/177 (37%), Gaps = 23/177 (12%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL----SNNLNEV 262 ++DVL S ++ + R+ +A+N G V + L + + Sbjct: 72 ATRLDVLKASMKFIIRKLDDGD--------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIA 123 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 ++++L + A+ +E + S F++ +TDG+++ + Sbjct: 124 GKKIDRLQARGG--SGSALMPELQEAVKILDERQG--NSRNRVGFILLLTDGDDTTGFRW 179 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + + +++ A+ A + + LL +S G + V+D L + Sbjct: 180 SRDVIHGAVGK------YPVHTFALGAAHDPEALLHIAQESRGTYSFVDD-GNLDKI 229 >gi|297699669|ref|XP_002826908.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-E-like [Pongo abelii] Length = 1273 Score = 58.0 bits (138), Expect = 3e-06, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 49/156 (31%), Gaps = 16/156 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++ +R+ + + T T AM H ++ S Sbjct: 335 LVQYGGVIQTEFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRK 394 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQD 355 K ++ +TDG +++ L+ + + G++ +++ V + Sbjct: 395 AS-----KVMVVLTDG-----GIFEDPLDLTTVINSPKMQGVERFAIGVGEEFKSARTER 444 Query: 356 LLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 L F V + L K+ I Sbjct: 445 ELNLIASDPDETHAFKVTNYMALDGLLSKLRHNIIS 480 >gi|255973237|ref|ZP_05423823.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T1] gi|257418922|ref|ZP_05595916.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T11] gi|255964255|gb|EET96731.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T1] gi|257160750|gb|EEU90710.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T11] Length = 1154 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 202 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 259 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 260 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 315 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 316 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 375 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 376 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 435 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 436 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 485 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 486 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 545 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 546 DYAPDISDYLAKKAVQISGTVV 567 >gi|195539501|ref|NP_001124213.1| inter-alpha (globulin) inhibitor H2 [Gallus gallus] gi|190576833|gb|ACE79193.1| inter-alpha inhibitor heavy chain 2 precursor [Gallus gallus] Length = 948 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 16/149 (10%), Positives = 52/149 (34%), Gaps = 7/149 (4%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 N + + + K + ++P TN A+ A L + S Sbjct: 359 CWRDNLVSATPAQVEDAKKYIQTIHPNGGTNINEALLRATFILNEAQNLGMLDPNS---V 415 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS- 364 ++ ++DG+ + T+ + + +++ ++ + + D L++ + Sbjct: 416 SMIVLVSDGDPTVGELKLTTIQ-KNVKQSIKDE-YSLFCLGIGFDV-DYDFLQRIATDNR 472 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQSVR 393 G + ++E + +++ ++ Sbjct: 473 GMAQRIFGNQETSAQMKRFYNQVSTPLLK 501 >gi|325914146|ref|ZP_08176499.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas vesicatoria ATCC 35937] gi|325539649|gb|EGD11292.1| hypothetical protein containing a von Willebrand factor type A (vWA) domain [Xanthomonas vesicatoria ATCC 35937] Length = 525 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 34/321 (10%), Positives = 74/321 (23%), Gaps = 34/321 (10%) Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAES-KAQYEIPTENLFLKGLIPSALTNLSLRST 136 + N + + Y L L P + Sbjct: 48 AENRETYQTLSDNPVVQAADNPVSTFSIDVDTGSYSNVRRYLTSGSLPPVDAVRVEEMIN 107 Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK 196 V E ++D+ + + Sbjct: 108 YFRYDDPAPRDGQPFAV-----RTELAPTPWNHDSVLLRIGITGRAVAASAMPAANLVFL 162 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 A K+ +L S L ++ RI + Y Sbjct: 163 VD-VSGSMGAPDKLPLLQSSLKLLTRQLRAQD--------RITLVTYAGNTAVVLPPTPG 213 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N + ++ L T + AY+ ++ TDG+ Sbjct: 214 NQQARIVEAIDSLQSGGGTAGASGIELAYKAAQQSYLRDGINR--------ILLATDGDF 265 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS-- 373 + +TL + R +G+ + ++ L+ + D+ G + ++ Sbjct: 266 NVGVTDFDTL--KGMVAEKRRSGVALSTLGFGTGNYNDTLMEQLADAGDGAYAYIDSPLE 323 Query: 374 ------RELLESFDKITDKIQ 388 EL + + I ++ Sbjct: 324 ARKVLTHELGATLETIARDVK 344 >gi|149923516|ref|ZP_01911918.1| hypothetical protein PPSIR1_08092 [Plesiocystis pacifica SIR-1] gi|149815646|gb|EDM75176.1| hypothetical protein PPSIR1_08092 [Plesiocystis pacifica SIR-1] Length = 716 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 25/130 (19%), Positives = 40/130 (30%), Gaps = 15/130 (11%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK-----KFVIFITDGENSGASA 321 N T T+ + S+ N + F I ITDG+ +G S Sbjct: 440 NGFCSGSGTYTHLGLQLVKDYQQAYSGSTMNNDMAPYPTADETLYFNILITDGQYNGYST 499 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQDLLRKCT----DSSGQFFAVNDSR 374 E M N+G+ Y + Q L+ S ++ N+ Sbjct: 500 ---NAQVQGELEEMYNSGITTYVIGFGDGVDTAAAQAQLQNMAQWGSGSQNNYYDANNQT 556 Query: 375 ELLESFDKIT 384 EL ++ I Sbjct: 557 ELEQALTTIF 566 >gi|145594605|ref|YP_001158902.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303942|gb|ABP54524.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 436 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 68/195 (34%), Gaps = 30/195 (15%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-------N 258 +I V ++ +V+++ ++ L +R+ Y C + Sbjct: 65 GRSRISVAQQAFNEVVDAL----PDETELGIRVLGATYPGDDKEQGCQDTQQIVPVGPVD 120 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + K+ + L P T A+ A +L + ++ ITDGE++ Sbjct: 121 RVQAKAAVATLRPTGYTPVGLALRSAAEDLGTGS-----------TARRIVLITDGEDT- 168 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + ++ + G K+ ++ ++ + + L + G + A + E Sbjct: 169 ----CAPPDPCEVARELAAQGTKLVVDTLGLAPDEKVRQQLLCIAGATGGTYTAAQSADE 224 Query: 376 LLESFDKITDKIQEQ 390 L ++ D+ ++ Sbjct: 225 LTGRIKQLVDRARDT 239 >gi|33985|emb|CAA30160.1| trypsin inhibitor [Homo sapiens] Length = 946 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 52/142 (36%), Gaps = 13/142 (9%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 K + K+ P TN A+ A L S +I ++DG+ + Sbjct: 373 AKRYIEKIQPSGGTNINEALLRAIFILNEANNLGLLDPNS---VSLIILVSDGDPTVGEL 429 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ------FFAVNDSRE 375 + + + E +++ + ++S+ + D L++ + + + S + Sbjct: 430 KLSKIQ-KNVKENIQD-NISLFSLGMGFDV-DYDFLKRL-SNENHGIAQRIYGNQDTSSQ 485 Query: 376 LLESFDKITDKIQEQSVRIAPN 397 L + +++++ + P+ Sbjct: 486 LKKFYNQVSTPLLRNVQFNYPH 507 >gi|218198427|gb|EEC80854.1| hypothetical protein OsI_23472 [Oryza sativa Indica Group] Length = 604 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 66/177 (37%), Gaps = 23/177 (12%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL----SNNLNEV 262 ++DVL S ++ + R+ +A+N G V + L + + Sbjct: 72 ATRLDVLKASMKFIIRKLDDGD--------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIA 123 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 ++++L + A+ +E + S F++ +TDG+++ + Sbjct: 124 GKKIDRLQARGG--SGSALMLELQEAVKILDERQG--NSRNRVGFILLLTDGDDTTGFRW 179 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + + +++ A+ A + + LL +S G + V+D L + Sbjct: 180 SRDVIHGAVGK------YPVHTFALGAAHDPEALLHIAQESRGTYSFVDD-GNLDKI 229 >gi|1616950|gb|AAB16869.1| CD11b [Sus scrofa] Length = 920 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 21/119 (17%), Positives = 42/119 (35%), Gaps = 15/119 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL++ K + K ++ ITDGE + + L + Sbjct: 90 GRTHTATGIRKVVRELFHSKSGAR-----ENALKILVVITDGE-----KFGDPLGYEDVI 139 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDK 386 G+ Y + V + ++ L F VN+ + +++ +K Sbjct: 140 PEADRKGVIRYVIGVGDAFNSWKSREELNTIASKPCGDHVFQVNNFEAVKTIQNQLQEK 198 >gi|229188238|ref|ZP_04315314.1| hypothetical protein bcere0004_57400 [Bacillus cereus BGSC 6E1] gi|228595258|gb|EEK53002.1| hypothetical protein bcere0004_57400 [Bacillus cereus BGSC 6E1] Length = 425 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 62/202 (30%), Gaps = 22/202 (10%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI----GIVGNQC 252 S K++ ++ N ++ I + G+ N Sbjct: 134 SGSMAGKVNGEVKMEAAKKAIYNYLDKIPDNANVMLRVYGHKGSNNENDKSLSCGSSEVM 193 Query: 253 TPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 PL N + + L+K P T A+ + V + Sbjct: 194 YPLQPYNKEQFNAALSKFGPKGWTPLASAIESVNDDFKE--------YTGEENLNVVYIV 245 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKCTDSS-GQFF 368 +DGE + + + + + + + + Q L+ ++ G + Sbjct: 246 SDGEETCGG------DPVNAAKNLNQSNTHAVVNIIGFDVKNSEQQQLKYTAEAGKGNYA 299 Query: 369 AVNDSRELLESFDKITDKIQEQ 390 V+ + EL ++ +K +K+ ++ Sbjct: 300 TVSSADELHQTLNKEYEKLYKE 321 >gi|167623667|ref|YP_001673961.1| cell wall anchor domain-containing protein [Shewanella halifaxensis HAW-EB4] gi|167353689|gb|ABZ76302.1| LPXTG-motif cell wall anchor domain [Shewanella halifaxensis HAW-EB4] Length = 850 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 47/395 (11%), Positives = 100/395 (25%), Gaps = 41/395 (10%) Query: 19 LAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQ---IKKHLKQG 75 AH ++ + AL++ L + ++ I ++ Sbjct: 285 YAHSAVVKAE-DEALESEALE-SRERQNRVSMTVTFDAAMPIENIVSPYHGISINMVENA 342 Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 + +T + A +Q L + N + Sbjct: 343 AAQVSLDNYAVANRDFVLTWQPVQGSEPTAAVFSQQGKTHAELASQVTAGDTSFN-QGGA 401 Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF--WSKN 193 + S+ S V D S+ M Sbjct: 402 KSKLSPQSQPEPQSQLQVQD-SKQQTLSKKALEKYALVMLMPPQGSDDESSSIARELVLV 460 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 S A K L + L + + + R + Sbjct: 461 IDTSGSMSGDAIIQAKS-ALKYALAGLRPQDSFNVLQFNSTVERWSRHVMPATAI----- 514 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN---------------EKESSHNT 298 NL ++ +N L T A+ A +L N + + Sbjct: 515 ----NLGRAQNYINGLQADGGTEMSLALDAALTKLDNDRGHNSKPVHDDDRYQSSNETLE 570 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 + + V+FITDG + QI + + +++++ + + P + R Sbjct: 571 QSAATPLRQVLFITDGAV-----ANESRLFEQIKNQLGES--RLFTIGIGSAPNAHFMQR 623 Query: 359 KCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G + + E+ + + +KI++ V Sbjct: 624 AAEVGRGTYTYIGKLDEVNQKVVSLLEKIEKPQVT 658 >gi|167525755|ref|XP_001747212.1| hypothetical protein [Monosiga brevicollis MX1] gi|163774507|gb|EDQ88136.1| predicted protein [Monosiga brevicollis MX1] Length = 471 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 17/157 (10%), Positives = 45/157 (28%), Gaps = 19/157 (12%) Query: 233 NLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + R + ++ + + + + + KL TN + + Sbjct: 90 KDTDRFALVTFDSDVKTVFDLRPMTTAHKEACLADVQKLRAGSCTNLSGGLFRGVELMQQ 149 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR-----NAGMKIYSV 345 + ++ +TDG + ++ +C +R IY+ Sbjct: 150 RGAT-------KGAVSSILLMTDGIANEGVRDKDD-----MCRALRGLMGPAPDYTIYTF 197 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 L + +G ++ + + + ESF Sbjct: 198 GYGKDHNENMLRQLSETGNGMYYFIESNDIIPESFGD 234 >gi|266620637|ref|ZP_06113572.1| putative von Willebrand factor type A domain protein [Clostridium hathewayi DSM 13479] gi|288867752|gb|EFD00051.1| putative von Willebrand factor type A domain protein [Clostridium hathewayi DSM 13479] Length = 2963 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 41/358 (11%), Positives = 106/358 (29%), Gaps = 19/358 (5%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ + KD + + + + + + + I I+ Sbjct: 426 LALTIHHNAFYNDKDISE-DEVVRGLKEGDVINPADYAWNKDFLTYAGSSQDSIVISSQT 484 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + + A+ + E + S ++ + + Sbjct: 485 ENRVDLYYNAIAEEKSEDEVTITGAIPRSYFRSVKADDNTTPGKVFPTKTAEWVDEANGI 544 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + + + WS T +K K V+ + Sbjct: 545 GKINFTIYGNPIRRGSDVILVIDSSGSMEGEKWSTAKTAAK-GFIDNLYQNKDGVVSDDR 603 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYENT 275 +V+ A S + + I T + + +KS + +++ T Sbjct: 604 IAIVDFDSSAKAYPGTNSGSETFLKVDDKITIKNKTYSAKDY--LKSYVLDSQMKDTGGT 661 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY--QNTLNTLQICE 333 + A+ A + N ++SS +++F++DGE +G + + + Sbjct: 662 DYNKALQTAQSVINNRRDSSRPA--------YIVFMSDGEPNGYWDWLTYRYYDGQKYAT 713 Query: 334 YMRNAGMKIYSVAVSAPPEG-QDLLRKCTDSSGQFF--AVNDSRELLESFDKITDKIQ 388 +++ G+ IYS+ ++ + + + + +L+ +D I I+ Sbjct: 714 ELKSDGVTIYSLGLNIGSTNFNKFIVPLASDPTSTYAKNIVKTSDLVGIYDAIASSIK 771 >gi|193213113|ref|YP_001999066.1| hypothetical protein Cpar_1468 [Chlorobaculum parvum NCIB 8327] gi|193086590|gb|ACF11866.1| conserved hypothetical protein [Chlorobaculum parvum NCIB 8327] Length = 352 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 28/194 (14%), Positives = 50/194 (25%), Gaps = 8/194 (4%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSAL------DAAVLSGCASIVSDRTIKDPT 54 M AI + V F A+DL + + Q+QSA A L + D + Sbjct: 20 MFAIFLVVLLGFAALALDLGRMNLTKVQLQSAADAAALGGAGSLVNSSLSTYDWDAAEQK 79 Query: 55 TKKDQTSTIFKK-QIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEI 113 I QI++ + Y + G ++ + Sbjct: 80 GLVLAQHNIVNGEQIQQATIEAGYWNSSDGFRHHGTSGVPVTGDVPAVRATVALTSTQNN 139 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 LF + +N+ + I + + + L + Sbjct: 140 GPLKLFFAPFLGINESNIPASAIAAIYPPAGGVGM-FPFTLGKDVFNNFWDSETQTPKTE 198 Query: 174 MTSNKYLLPPPPKK 187 L PK Sbjct: 199 DPFMVNLETYYPKG 212 >gi|109112823|ref|XP_001117651.1| PREDICTED: integrin alpha-E-like [Macaca mulatta] Length = 956 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 49/156 (31%), Gaps = 16/156 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++ +++ + + T T AM H ++ S Sbjct: 407 LVQYGGVIQTEFDLRDSQDVMASLAKVQNITQVGSVTKTASAMQHVLDNIFTSSHGSRRK 466 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQD 355 K ++ +TDG +++ L+ + + G++ +++ V + Sbjct: 467 AS-----KVMVVLTDG-----GIFEDPLDLTTVINSPKMHGVERFAIGVGEEFKSARTER 516 Query: 356 LLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 L F V + L K+ I Sbjct: 517 ELNLIASDPDETHAFKVTNYMALDGLLSKLRYNIIS 552 >gi|301784617|ref|XP_002927724.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Ailuropoda melanoleuca] Length = 898 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 54/132 (40%), Gaps = 11/132 (8%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N+ + K ++ ++P T+ A+ A + L + + R +IF+TDG+ Sbjct: 310 DNVRDGKVYIHHMSPTGGTDINGALQRAIKLLNDYVAHNDIE---DRSVSLIIFLTDGKP 366 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL----LRKCTDSSGQFFAVND 372 + + + + + I+++ + + L L C + + +D Sbjct: 367 TVGETHTLKILNNT--KEAARGQICIFTIGIGNDVDFMLLEKLSLENCGLTR-RVLEEDD 423 Query: 373 S-RELLESFDKI 383 + +L+ +D+I Sbjct: 424 AGAQLIGFYDEI 435 >gi|281346829|gb|EFB22413.1| hypothetical protein PANDA_017530 [Ailuropoda melanoleuca] Length = 895 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 54/132 (40%), Gaps = 11/132 (8%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N+ + K ++ ++P T+ A+ A + L + + R +IF+TDG+ Sbjct: 308 DNVRDGKVYIHHMSPTGGTDINGALQRAIKLLNDYVAHNDIE---DRSVSLIIFLTDGKP 364 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL----LRKCTDSSGQFFAVND 372 + + + + + I+++ + + L L C + + +D Sbjct: 365 TVGETHTLKILNNT--KEAARGQICIFTIGIGNDVDFMLLEKLSLENCGLTR-RVLEEDD 421 Query: 373 S-RELLESFDKI 383 + +L+ +D+I Sbjct: 422 AGAQLIGFYDEI 433 >gi|257085650|ref|ZP_05580011.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis Fly1] gi|256993680|gb|EEU80982.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis Fly1] Length = 1154 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 97/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWRPTGNQNVLNHQGNKDGGAQWDGQ 261 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 262 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 315 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 316 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 375 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 376 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 435 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 436 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNISKRINS 485 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 486 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 545 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 546 DYAPDISDYLAKKAVQISGTVV 567 >gi|156382663|ref|XP_001632672.1| predicted protein [Nematostella vectensis] gi|156219731|gb|EDO40609.1| predicted protein [Nematostella vectensis] Length = 1235 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 26/197 (13%), Positives = 66/197 (33%), Gaps = 23/197 (11%) Query: 207 NRKIDVLIESAGNLVNSI---QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS-NNLNEV 262 ++DV + +++++ K N V + + + P N+N + Sbjct: 306 RTRLDVAKAALSTILSTLLPQDKVGVVLFNSKVTLAGSSGVDECYSTRLAPAGRFNVNYL 365 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 K +N+ P T A A+ L + K + F++F+TDG Sbjct: 366 KDFINRSRPGGGTQYQNAFKAAFTLLKSAKSGDGGGE-----QSFLLFLTDG----GPKD 416 Query: 323 QNTLNTLQICEYMRN-----AGMKIYSVAVSAPPEGQDLLRKCTDS-SGQFFAVNDS--- 373 I + + + I ++ + +D L + + + ++ V++ Sbjct: 417 DALEVERLIAQNKKEMEESRERVTIMTIGLGKDEHMKDFLGRLSKNVGSKYSQVDNEAHM 476 Query: 374 -RELLESFDKITDKIQE 389 + + + + + Sbjct: 477 YSAIHDYYSHLQAMATK 493 >gi|327412874|emb|CAX67882.1| putative Von Willebrand factor, type A [Salmonella bongori] Length = 325 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 33/204 (16%), Positives = 66/204 (32%), Gaps = 35/204 (17%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + + S ++ + ES V S RIG + + Sbjct: 101 ILDVSGSMAKNDVQGGATRLQAVQESVRKFVA---------ARQSDRIGLVIFASQAWPF 151 Query: 251 QCTPLSNNLNEVKSRLNKLNPYEN---TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P+S + +++R+N+L+P T A+ + L N +T K Sbjct: 152 A--PVSEDKQALQTRINQLSPGMVGQQTAIGDALGVGVKLLDNT--------TNTEASKL 201 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA-PPEGQD-----LLRKCT 361 I +TDG ++ + L ++++++A G D L++ Sbjct: 202 AILLTDGNDTASQLAPALAAQLAA-----AHHVQVHTIAFGDINSTGDDKVDLPLMQNIA 256 Query: 362 --DSSGQFFAVNDSRELLESFDKI 383 + A N L + +I Sbjct: 257 QITGGQSWTAANSGAALDSVWKEI 280 >gi|296282333|ref|ZP_06860331.1| von Willebrand factor type A domain-containing protein [Citromicrobium bathyomarinum JL354] Length = 571 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 37/367 (10%), Positives = 94/367 (25%), Gaps = 36/367 (9%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKK----QIKKHLKQGSYIRENAGDIAQKAQIN 92 + ++I+ + + + RE Sbjct: 62 AFAAPSAIMVTGSRISREEADATAQPQTTSPDLRYVPSIVIPTVPDRERYDGKDVSEVAV 121 Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + + + Y L + P + Sbjct: 122 TLEQPVSTFSVDVD-TGAYSNARRMLTDGQMPPKGAVRTEEFVNYFRYDYPRPTSAQ--- 177 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS-FWSKNTTKSKYAPAPAPANRKID 211 D ++ + D + L K+ + N + K+ Sbjct: 178 --DAPFTVNMDVARTPWDADTRLVRIGLAGYEAPKAERPAANLVFLLDVSGSMSSADKLP 235 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 ++ + LV + + +++ E+ + L++L Sbjct: 236 LVKTAMKTLVGQLT----------PKDRVSIVVYAGAAGLVLEPTSDSREIMAALDQLQA 285 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 +T + AY+ K VI TDG+ + + + L L+ Sbjct: 286 GGSTAGGAGLELAYKVAEASKVDGI---------NRVILATDGDFNVGLSDNDKL--LEY 334 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 E R G+ + + + L+ + D G + ++ + +E+ + +++ Sbjct: 335 VEDKRKNGIAMSVLGFGRGNINEALMEQIADKGNGNYGYIDSA---IEARKVLGEQLGAT 391 Query: 391 SVRIAPN 397 IA + Sbjct: 392 LYTIAKD 398 >gi|116625274|ref|YP_827430.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228436|gb|ABJ87145.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 320 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 59/170 (34%), Gaps = 24/170 (14%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTP--LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 N + + +N + +N++ E++ L +++ T A+ + L Sbjct: 119 SNRDDEVFIVNFNDTAYLDNPKDKDFTNDIGELEQALKRIDARGGTAMRDAIQMSIDHLK 178 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 KK ++ ITDG ++ + +N +I + + + IY V + Sbjct: 179 KGHRD----------KKVLVVITDGNDN-----SSVINMERIMKNAHQSDVLIYGVGLLT 223 Query: 350 P------PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + L + + G+ F D E+ ++ I+ Q Sbjct: 224 EEEHREAARAKRALNDLAEATGGKTFFPKDLEEVDAIASQVAHDIRSQYT 273 >gi|325860337|ref|ZP_08173459.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] gi|325482216|gb|EGC85227.1| von Willebrand factor type A domain protein [Prevotella denticola CRIS 18C-A] Length = 331 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 59/165 (35%), Gaps = 45/165 (27%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + K L+ +NP + T+ A+ + ++++ K + Sbjct: 145 LPITGDYVSAKMFLDNINPSLIGTQGTDIGKALQLSINSFTP----------NSKVGKAI 194 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 I ITDGE++ + + RN G+K++ + + + Sbjct: 195 ILITDGEDNEG-------GAEAMAKQARNKGIKVFILGIGSKEGSTIPMPDGTELKDSNG 247 Query: 353 -------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQE 389 +D+ R+ + G + V++S K K+Q+ Sbjct: 248 EPVKTHLNEDMCRRIATAGQGVYVHVDNSNVADTVLGKELGKLQK 292 >gi|326674126|ref|XP_003200076.1| PREDICTED: integrin alpha-E-like [Danio rerio] Length = 940 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 20/165 (12%), Positives = 53/165 (32%), Gaps = 16/165 (9%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELY 289 + + Y I + + ++ ++ N T T A++H +++ Sbjct: 208 TTMSNCNFAIVQYGSSIRTELSLLDNEDGARSLQKVKQIKQIYNLTKTASAINHVLTDIF 267 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 +K +I ++DG+ G + + + G+ YS+ V Sbjct: 268 -----IPENGSKNNSEKIIIVLSDGKILGDPMTLDEVLNKT-----QMKGVTRYSIGVGD 317 Query: 350 ----PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + + G++++V+ L + + I Sbjct: 318 GILKNKDAIKEMTQIA-DPGKYYSVSSYGALNDILSSLERGIIGT 361 >gi|327278400|ref|XP_003223950.1| PREDICTED: integrin alpha-M-like [Anolis carolinensis] Length = 1160 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 26/135 (19%), Positives = 46/135 (34%), Gaps = 17/135 (12%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N +E+ + +L T T + RE + K +I ITDGE Sbjct: 214 NPDELLRPVTQLR--GATLTATYIQRVVRE-----QFVTEKGSRPGASKVLIVITDGE-- 264 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVS---APPEGQDLLRKCTDSS--GQFFAVND 372 + L + AG+ +++ V + + L F V++ Sbjct: 265 ---KSGDPLQYSDVIPEAERAGIIRFAIGVGKAFSGGTAKQELISIASQPEDDHVFPVDN 321 Query: 373 SRELLESFDKITDKI 387 L + +K+ DKI Sbjct: 322 FDALKDIQNKLQDKI 336 >gi|296474257|gb|DAA16372.1| complement component 2 precursor [Bos taurus] Length = 750 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 40/244 (16%), Positives = 89/244 (36%), Gaps = 16/244 (6%) Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPP-PPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 DV+ ++ + N + K + + + + ++ Sbjct: 215 DVAPALGTSFSHLLATTNPIQQKKKQNLGRKIQIQRSGHLNLYLLLDASQSVSKDDFEIF 274 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-- 271 +SA +V+ I + V I T A I+ + S ++ EV++ L +N Sbjct: 275 KDSASRMVDRIFSFEIKVS---VAIITFASKPKIIMSVLEDRSRDVTEVENSLRNINYKD 331 Query: 272 ---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF-VIFITDGENSGASAYQNTLN 327 TN Y A+H Y + N+ H G+ + + +I +TDG+++ + + ++ Sbjct: 332 HENGTGTNIYEALHAVYIMMNNQMNRPHMNPGAWQEIRHAIILLTDGKSNMGGSPKVAVD 391 Query: 328 TLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFD 381 ++ + + IY++ V + L F + D + L + F+ Sbjct: 392 NIKEVLNINQKRKDYLDIYAIGVGSLHVDWKELNNLGSKKDGERHAFILKDVQALSQVFE 451 Query: 382 KITD 385 + D Sbjct: 452 HMLD 455 >gi|111120280|gb|ABH06325.1| complement component 2 precursor [Bos taurus] Length = 787 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 40/244 (16%), Positives = 89/244 (36%), Gaps = 16/244 (6%) Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPP-PPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 DV+ ++ + N + K + + + + ++ Sbjct: 215 DVAPALGTSFSHLLATTNPIQQKKKQNLGRKIQIQRSGHLNLYLLLDASQSVSKDDFEIF 274 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-- 271 +SA +V+ I + V I T A I+ + S ++ EV++ L +N Sbjct: 275 KDSASRMVDRIFSFEIKVS---VAIITFASKPKIIMSVLEDRSRDVTEVENSLRNINYKD 331 Query: 272 ---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF-VIFITDGENSGASAYQNTLN 327 TN Y A+H Y + N+ H G+ + + +I +TDG+++ + + ++ Sbjct: 332 HENGTGTNIYEALHAVYIMMNNQMNRPHMNPGAWQEIRHAIILLTDGKSNMGGSPKVAVD 391 Query: 328 TLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFD 381 ++ + + IY++ V + L F + D + L + F+ Sbjct: 392 NIKEVLNINQKRKDYLDIYAIGVGSLHVDWKELNNLGSKKDGERHAFILKDVQALSQVFE 451 Query: 382 KITD 385 + D Sbjct: 452 HMLD 455 >gi|77735935|ref|NP_001029664.1| complement C2 precursor [Bos taurus] gi|115311857|sp|Q3SYW2|CO2_BOVIN RecName: Full=Complement C2; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement C2b fragment; Contains: RecName: Full=Complement C2a fragment; Flags: Precursor gi|74267667|gb|AAI03358.1| Complement component 2 [Bos taurus] Length = 750 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 40/244 (16%), Positives = 89/244 (36%), Gaps = 16/244 (6%) Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPP-PPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 DV+ ++ + N + K + + + + ++ Sbjct: 215 DVAPALGTSFSHLLATTNPIQQKKKQNLGRKIQIQRSGHLNLYLLLDASQSVSKDDFEIF 274 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-- 271 +SA +V+ I + V I T A I+ + S ++ EV++ L +N Sbjct: 275 KDSASRMVDRIFSFEIKVS---VAIITFASKPKIIMSVLEDRSRDVTEVENSLRNINYKD 331 Query: 272 ---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF-VIFITDGENSGASAYQNTLN 327 TN Y A+H Y + N+ H G+ + + +I +TDG+++ + + ++ Sbjct: 332 HENGTGTNIYEALHAVYIMMNNQMNRPHMNPGAWQEIRHAIILLTDGKSNMGGSPKVAVD 391 Query: 328 TLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFD 381 ++ + + IY++ V + L F + D + L + F+ Sbjct: 392 NIKEVLNINQKRKDYLDIYAIGVGSLHVDWKELNNLGSKKDGERHAFILKDVQALSQVFE 451 Query: 382 KITD 385 + D Sbjct: 452 HMLD 455 >gi|3024062|sp|P97279|ITIH2_MESAU RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H2; Short=ITI heavy chain H2; Short=ITI-HC2; Short=Inter-alpha-inhibitor heavy chain 2; Flags: Precursor gi|1694690|dbj|BAA13939.1| inter-alpha-trypsin inhibitor heavy chain 2 [Mesocricetus auratus] Length = 946 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 27/226 (11%), Positives = 79/226 (34%), Gaps = 23/226 (10%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 L P PK + + + S + + ++ + +L Q ++ + N + Sbjct: 299 FAPENLDPIPKNILFVIDVSGSMWGI---KMKQTVEAMKTILDDLRTEDQFSVVDF-NHN 354 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 VR + + K + K+ P TN A+ A L Sbjct: 355 VRTWRNDLVSATKTQ--------ITDAKRYIEKIQPSGGTNINEALLRAIFILNEASNLG 406 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 + ++ ++DG+ + + + + + +++ + ++S+ + D Sbjct: 407 ML---NPDSVSLIVLVSDGDPTVGELKLSKIQ-KNVKQNIQD-NISLFSLGIGFDV-DYD 460 Query: 356 LLRKCT-DSSGQFFAV----NDSRELLESFDKITDKIQEQSVRIAP 396 L++ + ++ G + + S +L + +++++ + P Sbjct: 461 FLKRLSNENRGIAQRIYGNRDTSSQLKKFYNQVSTPLLRNVQFNYP 506 >gi|300869833|ref|YP_003784704.1| hypothetical protein BP951000_0196 [Brachyspira pilosicoli 95/1000] gi|300687532|gb|ADK30203.1| putative membrane protein containing von Willebrand factor (vWA) type A domain, BatB [Brachyspira pilosicoli 95/1000] Length = 338 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 29/164 (17%), Positives = 52/164 (31%), Gaps = 39/164 (23%) Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 +P + ++ LN+LN T A+ A Sbjct: 139 TSFVASPFTQDMETFSYILNELNTKSVTLQGTRIADALVTAKNTFNVNIPG--------- 189 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP------------- 350 KK +I ITDGE+ I + +++ + +Y+V V + Sbjct: 190 -KKSIILITDGEDHAGYFDN-------ILKELKDNDISVYTVGVGSELGATIRSDIGYSE 241 Query: 351 -----PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 L+ DS+G ++D+ L FD I + + Sbjct: 242 NTVISKRDDKTLKLIADSTGGKSYISDNISLKSIFDDIKNNMDS 285 >gi|260430630|ref|ZP_05784603.1| von Willebrand factor, type A [Citreicella sp. SE45] gi|260418659|gb|EEX11916.1| von Willebrand factor, type A [Citreicella sp. SE45] Length = 318 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 56/157 (35%), Gaps = 31/157 (19%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKE 293 R+G + + ++++ V + L + T + A R L Sbjct: 132 RVGLVVFGDRAYVAAAP--THDVGAVAQVIGTLQIGVSGKATAIADGLGLAIRRL----- 184 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---- 349 G + +I ++DG+++ ++ + + + GM++Y++A+ Sbjct: 185 -----RGREAESRVIILLSDGQDTTG-----AVDPVAAAQAAQELGMRVYTIALGPADLA 234 Query: 350 ------PPEGQDLLRKCTD-SSGQFFAVNDSRELLES 379 + LR+ + + G+ F V + +L Sbjct: 235 SSPDARDAVDSETLRRIAEVAGGETFRVRSTEDLEAV 271 >gi|221117277|ref|XP_002154725.1| PREDICTED: similar to HyTSR1 protein, partial [Hydra magnipapillata] Length = 3382 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 49/151 (32%), Gaps = 12/151 (7%) Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 + G + N E ++ L + T+ A A ++ Sbjct: 2789 QFFDGAMNNWTLSDKQKQEEFLRKMIILESRGFSYTWSAAITARETIFTT-----QKGMR 2843 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 + +K+ I TDG +G Q + ++ G+++ +V + +P D L Sbjct: 2844 SNVKRVAILFTDGVYNGKHDTQKEWQYV------KDQGIQVIAVGIGSPIN-TDNLELWA 2896 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + F+ +E I D + + Sbjct: 2897 SNKNSVFSATTYQEADAFISTIGDNVCKSLT 2927 Score = 54.5 bits (129), Expect = 2e-05, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 50/159 (31%), Gaps = 15/159 (9%) Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 + G + N E ++ L + T+ A A ++ Sbjct: 3048 QFFDGAMNNWTLSDKQKQEEFLRKMIILESRGFSYTWSAAITARETIFTT-----QKGMR 3102 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 + +K+ I TDG +G Q + ++ G+++ ++ + +P L Sbjct: 3103 SNVKRVAILFTDGVYNGKHDTQKEWQYV------KDQGIQVVAIGIGSPIN-ISNLELWA 3155 Query: 362 DSSGQFFAVNDSRELLESFDK-ITDKIQEQSVR--IAPN 397 S F E I ++ V+ I PN Sbjct: 3156 SSKNFVFNATTYEEADAFISSVILNEPSSTVVKTIIEPN 3194 >gi|221067364|ref|ZP_03543469.1| conserved hypothetical protein [Comamonas testosteroni KF-1] gi|220712387|gb|EED67755.1| conserved hypothetical protein [Comamonas testosteroni KF-1] Length = 419 Score = 57.6 bits (137), Expect = 3e-06, Method: Composition-based stats. Identities = 36/287 (12%), Positives = 80/287 (27%), Gaps = 21/287 (7%) Query: 11 LFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKK 70 F+ A+D + +R ++Q+ALD+ L+ + + + Sbjct: 43 GFMGIALDFGRLFIVRTELQTALDSCALAAARELNKQPD--------AISRAVSAGAAAG 94 Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 +L + N Q +I+ Y++ + A + ++ + I L Sbjct: 95 NLNGVNLQSANWSGQGQITAADISFRDA---SYLSTTSATAAVYSQCTHTQSNIGMWLLK 151 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN--MTSNKYLLPPPPKKS 188 +G + V + + + T Y + Sbjct: 152 ALGAFSGNSASFPATGNVGTYAVATRASAQSACPIPVALQAKTPGATRPDYGYTVGDWLT 211 Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS-IQKAIQEKKNLSVRIGTIAYNIGI 247 +K+ S + A + + S + + + + Y GI Sbjct: 212 LLAKSNGGSTFGWANLDGSNSASETADELNGHCGSKLGDTLGTPGVQASIVEIWNYRFGI 271 Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 N P S + + +T+T+ A AY + Sbjct: 272 YKNSSGP-SVGRPDYTGYI------YDTSTWSAGRSAYNGSSGSTPN 311 >gi|223939755|ref|ZP_03631626.1| Vault protein inter-alpha-trypsin domain protein [bacterium Ellin514] gi|223891531|gb|EEF58021.1| Vault protein inter-alpha-trypsin domain protein [bacterium Ellin514] Length = 806 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 20/135 (14%), Positives = 39/135 (28%), Gaps = 13/135 (9%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N + + L T A+ A E V+F+TDG + Sbjct: 372 NREKAGDFIKNLKAMGGTAIDEALKKALSLESKEGRPF-----------VVVFLTDGLPT 420 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + ++ + + E + +I+ + L R ++ V +L Sbjct: 421 VGTTDEDQI-LKGMQERNKEKR-RIFCFGIGTDVNTHLLDRIAEETRAFSQYVLPEEDLE 478 Query: 378 ESFDKITDKIQEQSV 392 KI E + Sbjct: 479 VKVSSFFSKINEPVL 493 >gi|217974748|ref|YP_002359499.1| outer membrane adhesin-like protein [Shewanella baltica OS223] gi|217499883|gb|ACK48076.1| outer membrane adhesin like proteiin [Shewanella baltica OS223] Length = 1215 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 35/347 (10%), Positives = 102/347 (29%), Gaps = 26/347 (7%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 + L + Q ++ ++ I + + Sbjct: 185 DEYKGNAVSGAANATLTTDVATDSIMSNQWQARNGDMKWLNHSTASNIGDVNRTAQGRV- 243 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 G + ++ R + + L + + ++K + Sbjct: 244 ----YGKSAWEVLAQDVKDDPKSGRKTAQPTRTRYTTLANNAPDANNPVKKELPAAQFSC 299 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 L + + + +P ID ++A LV++ + +S Sbjct: 300 RDQLDFVWVEGDIDMQIVMDRSGSMFGSP----IDNAKQAAKILVDATAEGSTAMGLVSF 355 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + + P + +K ++ + +T + A L ++ Sbjct: 356 SGRSSVKQDFAMQKMPKPDNGVKQALKGAIDNIYANGSTALFDGSQLALDNLS-----AY 410 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 ++ V + DG+++ + +++ + +NA + I+S + L Sbjct: 411 QASAASGAPGVVFVLADGDDNNSIKSESS-----VITAYQNANVPIFSFGYGSASPTGPL 465 Query: 357 LRKCTDSSGQFFAV-NDSRELLESF---DKIT---DKIQEQSVRIAP 396 + + G++F+ E++++F + I + ++ IA Sbjct: 466 VTMANATGGKYFSSPTTLAEIIDAFLQANAIATDNQNLVSSTLNIAS 512 >gi|319956804|ref|YP_004168067.1| von willebrand factor type a [Nitratifractor salsuginis DSM 16511] gi|319419208|gb|ADV46318.1| von Willebrand factor type A [Nitratifractor salsuginis DSM 16511] Length = 398 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 60/201 (29%), Gaps = 23/201 (11%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 S RKID+ + +L+ I + N PL Sbjct: 30 SGSMWGQVDGERKIDIARRAMRDLLRDWNPQIPLGLTVYGHRRKGDCNDIETPIPVGPL- 88 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + + + + P T A+ + ++ + +I I+DG+ Sbjct: 89 -DRQRMIRAVEGIRPKGKTPIARALKRVAAQFRGSEDPAT-----------IILISDGKE 136 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKI--YSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDS 373 + +R +G++ + + L + G +F+ ++ Sbjct: 137 ------SCDADPCATARELRKSGIRFVAHVIGFHVDRTTDRQLACIARATGGSYFSARNA 190 Query: 374 RELLESFDKITDKIQEQSVRI 394 L + ++ KI + +I Sbjct: 191 AALNRAITQVAKKIT-RVTKI 210 >gi|209527393|ref|ZP_03275900.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492184|gb|EDZ92532.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 463 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 26/215 (12%), Positives = 65/215 (30%), Gaps = 34/215 (15%) Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQ 251 ++ ++ S++ + + + RI + ++ Sbjct: 62 TIPTGNSYFLDGKQYTQVTGGKTKIDQVIESLEGLVSSGQADSRDRIALVRFDDSASVLL 121 Query: 252 CTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 S + +K+ + +L T M A L + + Sbjct: 122 PLTASTDTASLKNAIGQLRNFSGGTRMALGMEEALNIL----------KNCDLSSRRTLI 171 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFA 369 TDG+ + + + AG+ I ++ V +DLL +D + G+ F Sbjct: 172 FTDGQT------FDESDCRDLATQFAEAGIPITALGVG--EYNEDLLLYLSDRTGGRVFN 223 Query: 370 V------NDSREL------LESFDKITDKIQEQSV 392 V + ++ F+++ + Q + + Sbjct: 224 VVETQTHTGTTDIPISELPNTIFEEV-QQAQSEVI 257 >gi|160876887|ref|YP_001556203.1| outer membrane adhesin-like protein [Shewanella baltica OS195] gi|160862409|gb|ABX50943.1| outer membrane adhesin like proteiin [Shewanella baltica OS195] gi|315269091|gb|ADT95944.1| outer membrane adhesin like proteiin [Shewanella baltica OS678] Length = 1215 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 35/347 (10%), Positives = 102/347 (29%), Gaps = 26/347 (7%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 + L + Q ++ ++ I + + Sbjct: 185 DEYKGNAVSGAANATLTTDVATDSIMSNQWQARNGDMKWLNHSTASNIGDVNRTAQGRV- 243 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 G + ++ R + + L + + ++K + Sbjct: 244 ----YGKSAWEVLAQDVKDDPKSGRKTAQPTRTRYTTLANNAPDANNPVKKELPAAQFSC 299 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 L + + + +P ID ++A LV++ + +S Sbjct: 300 RDQLDFVWVEGDIDMQIVMDRSGSMFGSP----IDNAKQAAKILVDATAEGSTAMGLVSF 355 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + + P + +K ++ + +T + A L ++ Sbjct: 356 SGRSSVKQDFAMQKMPKPDNGVKQALKGAIDNIYANGSTALFDGSQLALDNLS-----AY 410 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 ++ V + DG+++ + +++ + +NA + I+S + L Sbjct: 411 QASAASGAPGVVFVLADGDDNNSIKSESS-----VITAYQNANVPIFSFGYGSASPTGPL 465 Query: 357 LRKCTDSSGQFFAV-NDSRELLESF---DKIT---DKIQEQSVRIAP 396 + + G++F+ E++++F + I + ++ IA Sbjct: 466 VTMANATGGKYFSSPTTLAEIIDAFLQANAIATDNQNLVSSTLNIAS 512 >gi|332222722|ref|XP_003260519.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Nomascus leucogenys] Length = 3535 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 47/132 (35%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + ++ T T A A + L + +E+S K V ITDG ++G Sbjct: 153 LLQEIPAISYRGGGTYTKGAFQQAAQILLHARENST---------KVVFLITDGYSNGG- 202 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R++G++I++ L + + ++ E Sbjct: 203 ------DPRPIAASLRDSGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 251 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 252 -FEALARRALHE 262 >gi|297685094|ref|XP_002820135.1| PREDICTED: LOW QUALITY PROTEIN: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Pongo abelii] Length = 3553 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 47/132 (35%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + ++ T T A A + L + +E+S K V ITDG ++G Sbjct: 138 LLQEIPAISYRGGGTYTKGAFQQAAQILLHARENST---------KVVFLITDGYSNGG- 187 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R++G++I++ L + + ++ E Sbjct: 188 ------DPRPIAASLRDSGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 236 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 237 -FEALARRALHE 247 >gi|193783708|dbj|BAG53619.1| unnamed protein product [Homo sapiens] Length = 868 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 47/132 (35%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + ++ T T A A + L + +E+S K V ITDG ++G Sbjct: 153 LLQEIPAISYRGGGTYTKGAFQQAAQILLHARENST---------KVVFLITDGYSNGG- 202 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R++G++I++ L + + ++ E Sbjct: 203 ------DPRPIAASLRDSGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 251 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 252 -FEALARRALHE 262 >gi|55662683|emb|CAH74138.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55665761|emb|CAH73557.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] Length = 845 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 47/132 (35%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + ++ T T A A + L + +E+S K V ITDG ++G Sbjct: 130 LLQEIPAISYRGGGTYTKGAFQQAAQILLHARENST---------KVVFLITDGYSNGG- 179 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R++G++I++ L + + ++ E Sbjct: 180 ------DPRPIAASLRDSGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 228 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 229 -FEALARRALHE 239 >gi|55662684|emb|CAH74139.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55665762|emb|CAH73558.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] gi|55957947|emb|CAI14068.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Homo sapiens] Length = 3548 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 47/132 (35%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + ++ T T A A + L + +E+S K V ITDG ++G Sbjct: 130 LLQEIPAISYRGGGTYTKGAFQQAAQILLHARENST---------KVVFLITDGYSNGG- 179 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R++G++I++ L + + ++ E Sbjct: 180 ------DPRPIAASLRDSGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 228 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 229 -FEALARRALHE 239 >gi|148886654|ref|NP_699197.3| sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Homo sapiens] gi|296452942|sp|Q4LDE5|SVEP1_HUMAN RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; AltName: Full=CCP module-containing protein 22; AltName: Full=Polydom; AltName: Full=Selectin-like osteoblast-derived protein; Short=SEL-OB; AltName: Full=Serologically defined breast cancer antigen NY-BR-38; Flags: Precursor Length = 3571 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 47/132 (35%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + ++ T T A A + L + +E+S K V ITDG ++G Sbjct: 153 LLQEIPAISYRGGGTYTKGAFQQAAQILLHARENST---------KVVFLITDGYSNGG- 202 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R++G++I++ L + + ++ E Sbjct: 203 ------DPRPIAASLRDSGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 251 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 252 -FEALARRALHE 262 >gi|119579467|gb|EAW59063.1| hCG1794476, isoform CRA_b [Homo sapiens] Length = 1196 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 47/132 (35%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + ++ T T A A + L + +E+S K V ITDG ++G Sbjct: 153 LLQEIPAISYRGGGTYTKGAFQQAAQILLHARENST---------KVVFLITDGYSNGG- 202 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R++G++I++ L + + ++ E Sbjct: 203 ------DPRPIAASLRDSGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 251 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 252 -FEALARRALHE 262 >gi|112180424|gb|AAH30816.1| SVEP1 protein [Homo sapiens] Length = 868 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 47/132 (35%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + ++ T T A A + L + +E+S K V ITDG ++G Sbjct: 153 LLQEIPAISYRGGGTYTKGAFQQAAQILLHARENST---------KVVFLITDGYSNGG- 202 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R++G++I++ L + + ++ E Sbjct: 203 ------DPRPIAASLRDSGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 251 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 252 -FEALARRALHE 262 >gi|37222213|gb|AAQ89957.1| selectin-like protein [Homo sapiens] gi|68655017|emb|CAF04067.1| SEL-OB protein [Homo sapiens] Length = 3574 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 47/132 (35%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + ++ T T A A + L + +E+S K V ITDG ++G Sbjct: 153 LLQEIPAISYRGGGTYTKGAFQQAAQILLHARENST---------KVVFLITDGYSNGG- 202 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R++G++I++ L + + ++ E Sbjct: 203 ------DPRPIAASLRDSGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 251 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 252 -FEALARRALHE 262 >gi|327271908|ref|XP_003220729.1| PREDICTED: collagen alpha-1(XX) chain-like [Anolis carolinensis] Length = 1480 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 64/195 (32%), Gaps = 21/195 (10%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + ++ E +LV A+ +R+G Y+ Sbjct: 252 VDGSWSIGRNNFGLIREFLASLVAPFNVAM-----DKIRVGLTQYSSDPRTEWDLNTYAT 306 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 +EV L L NT+ + A + + S K +I +TDG++ Sbjct: 307 RDEVLEALRSLRYKGG-NTFTGL--ALTHVLEHNLKADTGARSEAP-KLIILLTDGKSQD 362 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSREL 376 + + ++N G++I++V V + LR+ + V D L Sbjct: 363 DANPP--------AQVLKNMGIQIFAVGV--KNADETELRQVASDPLELNVYNVLDFPLL 412 Query: 377 LESFDKITDKIQEQS 391 ++T + Q Sbjct: 413 SSLVGRLTRVLCAQI 427 >gi|256965511|ref|ZP_05569682.1| von Willebrand factor [Enterococcus faecalis HIP11704] gi|256956007|gb|EEU72639.1| von Willebrand factor [Enterococcus faecalis HIP11704] Length = 1154 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 97/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 261 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 262 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 315 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 316 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 375 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 376 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 435 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 436 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 485 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 486 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 545 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 546 DYAPDISDYLAKKAVQISGTVV 567 >gi|73949160|ref|XP_544264.2| PREDICTED: similar to inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1 [Canis familiaris] Length = 893 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 54/132 (40%), Gaps = 11/132 (8%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N+ + K ++ ++P T+ A+ A + L + + R ++F+TDG+ Sbjct: 310 DNVRDGKIYIHHMSPTGGTDINGALQRAIKLLNDYVAHNDIE---DRSVSLIVFLTDGKP 366 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL----LRKCTDSSGQFFAVND 372 + + + + + I+++ + + + L L C + + D Sbjct: 367 TVGETHTLKILNNT--KEAARGQVCIFTIGIGDDVDFKLLEKLSLENCGLTR-RVLEEED 423 Query: 373 S-RELLESFDKI 383 + +L+ +D+I Sbjct: 424 AGSQLIGFYDEI 435 >gi|313239054|emb|CBY14036.1| unnamed protein product [Oikopleura dioica] Length = 442 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 28/221 (12%), Positives = 59/221 (26%), Gaps = 15/221 (6%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 Y + I + N I E + S+R Sbjct: 150 YEPCNQDACDAGCSGPRDVLFVAHYTTYMGSTFADISA--FFENIISTINVEPSDSSIRF 207 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 +N + N+++E K + P Y A + + + Sbjct: 208 AFSFFNHAYIEFFAFDWLNSIDEYKWAFSSFPPASGNANYIG--RALKGAADTMTPAFGK 265 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 V+ +T+ +T ++ + ++ ++ V + GQD L Sbjct: 266 GRRIDTVGTVVLLTN--------AASTDEVNEMADQLKEKVDRVIVVGLGY-AFGQDELA 316 Query: 359 KCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 S + +S +L I D+I + + Sbjct: 317 GIASSPTKENLYTAEESSDLAGLVRTIADEICSTELSNPSD 357 >gi|296190554|ref|XP_002806559.1| PREDICTED: LOW QUALITY PROTEIN: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Callithrix jacchus] Length = 3582 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 32/281 (11%), Positives = 78/281 (27%), Gaps = 36/281 (12%) Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 L + A S R A L + + + Sbjct: 10 WGLALVSGWATFQQMSPSRNFSFRLFPETAPGAPGSLPAPPAPGEEAAGSRVERLGQAFR 69 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 + + + + + N + ++L + + V Sbjct: 70 RRVRLLRELSERL-ELVFLVDDSSSVGEVNFRSELL-----FVRKLLSDFPVVPTATRVA 123 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNE-----VKSRLNKLNP-YENTNTYPAMHHAYRELYNE 291 I T + +V ++ + + + ++ T T A A + L + Sbjct: 124 IVTFSSKNYVVPRVDYISTSRARQHKCALLLQEIPAISYRGGGTYTKGAFQQAAQILLHA 183 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 +E+S K + ITDG ++G + I +R++G++I++ Sbjct: 184 RENST---------KVLFLITDGYSNGG-------DPRPIAASLRDSGVEIFT--FGIWQ 225 Query: 352 EGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 L + + ++ E F+ + + + Sbjct: 226 GNIRELNDMASTPKEEHCYLLHSFEE----FEALARRALHE 262 >gi|253582503|ref|ZP_04859725.1| conserved hypothetical protein [Fusobacterium varium ATCC 27725] gi|251835648|gb|EES64187.1| conserved hypothetical protein [Fusobacterium varium ATCC 27725] Length = 376 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 58/151 (38%), Gaps = 8/151 (5%) Query: 224 IQKAIQEKKNLSVRIGTIAYNI--GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 + + +Q K+ V+IG +AY + + T LS NL+E+ S L + + Sbjct: 67 VNEVMQTHKDSKVKIGLVAYRDRGDVYVTKVTQLSENLDEIYSVLMGYKAQGGGDDPEDV 126 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 A E + S L + + + D Y ++ +T + ++ G+ Sbjct: 127 RKALHESLEVIQWSTPREN---LSQIIFLVGDAPPHDD--YNDSPDTSDTAKKAKSRGII 181 Query: 342 IYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 I ++ P+ + G++F ++ Sbjct: 182 INTIQCGDMPKTDYYWKAIAQFGGGEYFHIS 212 >gi|162312016|gb|ABX84114.1| hedgling [Nematostella vectensis] Length = 3480 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 21/125 (16%), Positives = 47/125 (37%), Gaps = 15/125 (12%) Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 ++ +K+ +N + T T A+ A R ++ + ++ + K + +TDG Sbjct: 238 SSKAALKNAVNAIYYRSGWTYTADALDLAGRNIF---QVANGMRPDKGIPKIAVLLTDGY 294 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDS 373 ++G N L +R AG+ ++ V + + L F + + Sbjct: 295 SNGN-------NPLGPANDLRAAGVNVFCVGIG--NYYERELNDIATDPDKDHVFKLENF 345 Query: 374 RELLE 378 +L Sbjct: 346 NDLNS 350 >gi|261855659|ref|YP_003262942.1| von Willebrand factor A [Halothiobacillus neapolitanus c2] gi|261836128|gb|ACX95895.1| von Willebrand factor type A [Halothiobacillus neapolitanus c2] Length = 625 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 13/92 (14%), Positives = 39/92 (42%), Gaps = 5/92 (5%) Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQ-----ICEYMRNAGMKIYSVAVSAPPEGQD 355 + + ++ ++DG + + + + +R ++++++A+S + + Sbjct: 131 NDGTDRNILLLSDGMVDISPEKAINVRAQEELLQKLVPQLRAEHIRVHTIALSKDADSKL 190 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 L + D+ G F + + L +F KI + Sbjct: 191 LSQIAADTGGIFVEADSADALQRAFLKIFEAA 222 >gi|257082947|ref|ZP_05577308.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis E1Sol] gi|256990977|gb|EEU78279.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis E1Sol] Length = 1148 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 97/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 196 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 255 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 256 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 309 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 310 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 369 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 370 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 429 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 430 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 479 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 480 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 539 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 540 DYAPDISDYLAKKAVQISGTVV 561 >gi|255976231|ref|ZP_05426817.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T2] gi|255969103|gb|EET99725.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T2] Length = 1154 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 97/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 261 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 262 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 315 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 316 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 375 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 376 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 435 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 436 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 485 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 486 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 545 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 546 DYAPDISDYLAKKAVQISGTVV 567 >gi|256762099|ref|ZP_05502679.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T3] gi|256683350|gb|EEU23045.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T3] Length = 1154 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 97/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWRPTGNQNVLNHQGNKDGGAQWDGQ 261 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 262 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYASETTTPGLFDVYLNVRG----NVQKE 315 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 316 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 375 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 376 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 435 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 436 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 485 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 486 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 545 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 546 DYAPDISDYLAKKAVQISGTVV 567 >gi|332817190|ref|XP_003309914.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-2-like [Pan troglodytes] Length = 1241 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 36/356 (10%), Positives = 90/356 (25%), Gaps = 36/356 (10%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTST----IFKKQIKKHLKQGSYIRENAGDIAQK 88 LD V + + + +D + + L G A Sbjct: 175 LDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSKANT 234 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +++ +D N + A +IPT+ +I + L +E ++ + Sbjct: 235 LRLDFIEDPNFKNKVNYSYAA-VQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPTL 293 Query: 149 SICMVLDVSRSMEDL------YLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA 202 + + +Y+ K + Sbjct: 294 LWQVFGSATGVTRYYPGRYQPLSPWALPXQRHLLGRYIQGASSPKDMVIIVDVSGSVSGL 353 Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 S +++++ + + + ++ +V N Sbjct: 354 TLKLM------KTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVR----NKKV 403 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 K + + T +A+ +L N + K ++ TDG Sbjct: 404 FKEAVQGMVAKGTTGYKAGFEYAFDQLQNSNITRA------NCNKMIMMFTDGGEDRVQD 457 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N N +++++ +V L+ + G +F + + Sbjct: 458 VFEKYNWP-------NRTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 506 >gi|262198293|ref|YP_003269502.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262081640|gb|ACY17609.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 419 Score = 57.6 bits (137), Expect = 4e-06, Method: Composition-based stats. Identities = 15/124 (12%), Positives = 40/124 (32%), Gaps = 10/124 (8%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ L L TN M + + + ++ +TDG+ S Sbjct: 95 RQTLEQALAGLRTGVGTNLAAGMKKGAEAVRSGFVRGALSR--------LVLLTDGQPSL 146 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + + G+ I ++ + + + L G F + + ++ Sbjct: 147 G--ITDNDRLCALAQKEADRGVTITTMGLGQGFDDELLADLAHSGRGGFHYLASAADIPG 204 Query: 379 SFDK 382 +F + Sbjct: 205 AFGR 208 >gi|313676404|ref|YP_004054400.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312943102|gb|ADR22292.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 618 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 40/367 (10%), Positives = 94/367 (25%), Gaps = 30/367 (8%) Query: 33 LDAAVLS-GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQI 91 D A LS + R I + + + E + + Sbjct: 100 ADVAQLSEVVVTGTPSRNIFSKRKVASYQADQSYFPVDAEAPYHNSNTEEYQGLDENTFQ 159 Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 T++ + ++ A Y + P + + Sbjct: 160 EATQNPLSTFSIDVDA-ASYSNMRRFINSGQNPPKDAVRIEEMINYFNYDYKQPSGQDPF 218 Query: 152 MVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 + + + + +KH + K + S A K+ Sbjct: 219 SI--NTEVSQAPWNKKHQLVHIGLQGKVIPTENLPASNLV----FLLDVSGSMFAQNKLP 272 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 +L LV+ +++ ++ + Y N +++ L L Sbjct: 273 LLKSGLKMLVDQLREED--------KVSIVVYAGAAGCVLPPTSGNEKDKIIEALQNLQA 324 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 +T + AY+ N +I TDG+ + + + Sbjct: 325 GGSTAGGAGIELAYKIAKENFIKEGNNR--------IILATDGDFNVG--ASSNEAMEDL 374 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 E R G+ + + + D G + +++ E + + ++ Sbjct: 375 IEKKRKEGVFLTVLGFGMGNYKDSKMEILADKGNGNYAYIDNILEAKKV---LVNEFGGT 431 Query: 391 SVRIAPN 397 IA + Sbjct: 432 LFTIAKD 438 >gi|295132199|ref|YP_003582875.1| von Willebrand factor (vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] gi|294980214|gb|ADF50679.1| von Willebrand factor (vWA) type A domain-containing protein [Zunongwangia profunda SM-A87] Length = 347 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 55/170 (32%), Gaps = 49/170 (28%) Query: 253 TPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + K L LN T A+ A ++ +++ + + Sbjct: 144 LPITTDYAAAKMFLQALNTDMISSQGTAIGDAIDLATTYYDDDNQTN----------RVL 193 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 I+DGE+ N I + G++IY++ V Sbjct: 194 FIISDGEDHEG-------NVESIADEAAEKGIRIYTIGVGTEKGGPIPIKRNGVVQNYKK 246 Query: 353 -----------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ ++G++ N + ++ E+ + I++ Sbjct: 247 DQNGETVITKLDPNTLQQIASEANGEYIEGNVTAKVAETVQDLLKNIEKT 296 >gi|271966806|ref|YP_003341002.1| von Willebrand factor, type A [Streptosporangium roseum DSM 43021] gi|270509981|gb|ACZ88259.1| von Willebrand factor, type A [Streptosporangium roseum DSM 43021] Length = 315 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 24/199 (12%), Positives = 64/199 (32%), Gaps = 34/199 (17%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +I E+A V + + R ++ + + V + L Sbjct: 106 NRITAAKEAAQKFVEDLPERFNVGVVAFARSASV----------VVSPTTDHQAVSASLG 155 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L T A+ ++ + + + + + ++ ++DG+N+ + Sbjct: 156 NLTTRAGTAIGEAVFNSLDAVRSFDQQA----VTDPPPAAIVLLSDGDNTSGRS------ 205 Query: 328 TLQICEYMRNAGMKIYSVAVSA--------------PPEGQDLLRKCTDSSGQFFAVNDS 373 + + +A + I ++A P L +SG+ + Sbjct: 206 VAEAVDAAMSARVPISTIAYGTQEGTVSIDGRDVNVPVNKATLQTLSEGTSGRAYEAESG 265 Query: 374 RELLESFDKITDKIQEQSV 392 +L E +++I + ++V Sbjct: 266 SQLREVYEQIGTSLGYRTV 284 >gi|149922129|ref|ZP_01910569.1| hypothetical protein PPSIR1_23374 [Plesiocystis pacifica SIR-1] gi|149817066|gb|EDM76548.1| hypothetical protein PPSIR1_23374 [Plesiocystis pacifica SIR-1] Length = 546 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 23/159 (14%), Positives = 56/159 (35%), Gaps = 14/159 (8%) Query: 247 IVGNQCTPLSN-------NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + TPLSN + ++ +L + T PA+ + L + T Sbjct: 172 SYDDTVTPLSNLQRVDDDGIEVLRRQLLDIQVGGTTALGPALFMGLQRLAAPEPFGPQTR 231 Query: 300 GSTRLKKF--VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 R + VI ++DG + + + E+ G+ + ++ + + Sbjct: 232 TEARHDRLRHVILLSDGIANVGETRPEVIGGR-VAEHFGG-GVSVSTLGMGLDYNEDLMT 289 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 R + G++ + D+ + + D++ + +A Sbjct: 290 RIADEGGGRYHFIEDAESIPAM---LGDELAGLTATVAS 325 >gi|330469087|ref|YP_004406830.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328812058|gb|AEB46230.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 316 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 20/152 (13%), Positives = 45/152 (29%), Gaps = 24/152 (15%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 N P + + V + ++ L E T T A+ + S + ++ Sbjct: 139 NVLVPPTKDRPAVTNAIDGLVLAEATATGEAVFTCLEAI----RSVPADGAAGIPPARIV 194 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------GQDL 356 ++DG + + + Q A + + ++A + Sbjct: 195 LLSDGFRTAGRSVEEAAAAAQA------ANVPVSTIAFGTDAGHVAIGGQLQRVPVDRMA 248 Query: 357 LRKCT-DSSGQFFAVNDSRELLESFDKITDKI 387 L + G F+ EL + + + I Sbjct: 249 LAALAETTEGYFYEAASVSELKQVYQDMGSSI 280 >gi|187466178|emb|CAQ51888.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 50/155 (32%), Gaps = 16/155 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++N +++ + E T T AM H ++ S Sbjct: 236 LVQYGAVIQTEFDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKK 295 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQD 355 K ++ +TDG+ + + LN + + G+ +++ V Sbjct: 296 A-----LKVMVVLTDGD-----IFGDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYR 345 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L K+ +I Sbjct: 346 ELKLIASDPKEAHTFKVTNYSALDGLLSKLQQRIV 380 >gi|148680762|gb|EDL12709.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 50/155 (32%), Gaps = 16/155 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++N +++ + E T T AM H ++ S Sbjct: 236 LVQYGAVIQTEFDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKK 295 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQD 355 K ++ +TDG+ + + LN + + G+ +++ V Sbjct: 296 A-----LKVMVVLTDGD-----IFGDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYR 345 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L K+ +I Sbjct: 346 ELKLIASDPKEAHTFKVTNYSALDGLLSKLQQRIV 380 >gi|88702489|gb|ABD49099.1| integrin alpha E [Mus musculus] Length = 1167 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 50/155 (32%), Gaps = 16/155 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++N +++ + E T T AM H ++ S Sbjct: 236 LVQYGAVIQTEFDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKK 295 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQD 355 K ++ +TDG+ + + LN + + G+ +++ V Sbjct: 296 A-----LKVMVVLTDGD-----IFGDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYR 345 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L K+ +I Sbjct: 346 ELKLIASDPKEAHTFKVTNYSALDGLLSKLQQRIV 380 >gi|159900724|ref|YP_001546971.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893763|gb|ABX06843.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 415 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 67/186 (36%), Gaps = 23/186 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 KI L E+ +V +++ + + ++ + L+++L +++ + Sbjct: 59 DKIQHLREAVREIVANLRPIDA--------VSIVLFDDTLEVLVPARLADDLPALQNAIE 110 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T + EL + + V+ +TDG+ + + Sbjct: 111 SIGEQGGTAMSLGLQAGLAELQKFQAADRVGR--------VLLLTDGQT-----WGDEDT 157 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 + + + + G+ I ++ + + LL S+G+ + D ++ + F + Sbjct: 158 CRDLAKQIGDLGVSITALGLGTEWN-EALLDDLATASNGESDYIADPSQISKYFQQTLQS 216 Query: 387 IQEQSV 392 Q +V Sbjct: 217 AQTTTV 222 >gi|25742632|ref|NP_113956.1| integrin, alpha E [Rattus norvegicus] gi|3236342|gb|AAC23662.1| integrin alpha E1 [Rattus norvegicus] gi|187466179|emb|CAQ51889.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 50/155 (32%), Gaps = 16/155 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++N +++ + E T T AM H ++ S Sbjct: 236 LVQYGAVIQTEFDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKK 295 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQD 355 K ++ +TDG+ + + LN + + G+ +++ V Sbjct: 296 A-----LKVMVVLTDGD-----IFGDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYR 345 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L K+ +I Sbjct: 346 ELKLIASDPKEAHTFKVTNYSALDGLLSKLQQRIV 380 >gi|2497428|sp|Q60677|ITAE_MOUSE RecName: Full=Integrin alpha-E; AltName: Full=Integrin alpha M290; AltName: CD_antigen=CD103; Contains: RecName: Full=Integrin alpha-E light chain; Contains: RecName: Full=Integrin alpha-E heavy chain; Flags: Precursor gi|535477|gb|AAC52142.1| alpha M290 integrin [Mus musculus] Length = 1167 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 50/155 (32%), Gaps = 16/155 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++N +++ + E T T AM H ++ S Sbjct: 236 LVQYGAVIQTEFDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKK 295 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQD 355 K ++ +TDG+ + + LN + + G+ +++ V Sbjct: 296 A-----LKVMVVLTDGD-----IFGDPLNLTTVINSPKMQGVVRFAIGVGDRFKNNNTYR 345 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L K+ +I Sbjct: 346 ELKLIASDPKEAHTFKVTNYSALDGLLSKLQQRIV 380 >gi|74150598|dbj|BAE32321.1| unnamed protein product [Mus musculus] Length = 1166 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 50/155 (32%), Gaps = 16/155 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++N +++ + E T T AM H ++ S Sbjct: 235 LVQYGAVIQTEFDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKK 294 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQD 355 K ++ +TDG+ + + LN + + G+ +++ V Sbjct: 295 A-----LKVMVVLTDGD-----IFGDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYR 344 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L K+ +I Sbjct: 345 ELKLIASDPKEAHTFKVTNYSALDGLLSKLQQRIV 379 >gi|31615649|pdb|1N9Z|A Chain A, Integrin Alpha M I Domain Mutant Length = 192 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 22/120 (18%), Positives = 40/120 (33%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+T + REL+N + K ++ ITDGE + + L + Sbjct: 80 GRTHTATGIRKVVRELFNITNGAR-----KNAFKILVVITDGE-----KFGDPLGYEDVI 129 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 G+ Y + V + + L F VN+ L +++ + I Sbjct: 130 PEADREGVIRYVIGVGDAFRSEKSRQELNTIASKPPRDHVFQVNNFEALKTIQNQLRECI 189 >gi|290956351|ref|YP_003487533.1| hypothetical protein SCAB_18411 [Streptomyces scabiei 87.22] gi|260645877|emb|CBG68968.1| putative secreted protein [Streptomyces scabiei 87.22] Length = 422 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 68/195 (34%), Gaps = 26/195 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ ++ ++++ K + PL + Sbjct: 57 GQSRMSAAKQAFNEVLDATPKEVQLGIRTLGADYPGDDRKTGCKDTAQLYPVGPL--DRT 114 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 E K+ + L P T PA+ A +L + + K ++ I+DGE++ Sbjct: 115 EAKTAVATLAPTGWTPIGPALLKAADDLEGGEGT-----------KRIVLISDGEDT--- 160 Query: 321 AYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 L+ ++ + G+ I ++ + + + L + + G + ++ EL Sbjct: 161 --CAPLDPCEVAREIAAKGIGLTIDTLGLVPNAKLRVQLSCIAEATGGTYTSIEHRDELT 218 Query: 378 ESFDKITDKIQEQSV 392 + +++ D+ + V Sbjct: 219 DRVNQLVDRAADPVV 233 >gi|291245103|ref|XP_002742431.1| PREDICTED: complement component factor B/C2-like [Saccoglossus kowalevskii] Length = 782 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 55/161 (34%), Gaps = 28/161 (17%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNL-NEVKSRLNKLN----PYENTNTYPAMHHAYRELYN 290 R+ I++ L +++ N + + +L T A + E+ Sbjct: 333 TRVAAISFASVATLEF--NLGDDIVNTTEKAIEQLQVIQASGGGTAMKEAFEIMFSEVVP 390 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG---MKIYSVAV 347 KK + ITDG+++ S + + +R+ +I+++ + Sbjct: 391 RLRPE--------SKKAMFIITDGKSNSGS-------PISYAQRLRDRNEHAFEIFAIGI 435 Query: 348 SAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDK 386 ++ L+K F + +L D I++K Sbjct: 436 G-NGVDRNELKKIASEPFTSHVFLIRQYEDLTTLTDIISEK 475 >gi|170742540|ref|YP_001771195.1| hypothetical protein M446_4419 [Methylobacterium sp. 4-46] gi|168196814|gb|ACA18761.1| hypothetical protein M446_4419 [Methylobacterium sp. 4-46] Length = 303 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 37/275 (13%), Positives = 73/275 (26%), Gaps = 32/275 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M A ++ L I IDL + R ++Q+ALD AV++ + D Sbjct: 27 MFAGMMPAVLLAIGCGIDLQRALAYRGKVQAALDGAVMAVVGNTSFDAD----------- 75 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 ++ K + A +T +N AE+ A + Sbjct: 76 --FGRQAFKTSASFAYALDGAAPGSDPLTITRLTFTQNPDGTVTAEATATMRTTVVS--- 130 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 +I +LS RST + + I+ ++ +D + Sbjct: 131 --IIGVRSLDLSFRSTAKGTTTLKISTITFKVLSAQGAFDKD--------------IYFF 174 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + Y K K + + + Sbjct: 175 TRDGNGNLLSESLVLQYDYNYPAGGTATKSYNPPPPQTFTQPVSNYKTYGFKMVVYKDPS 234 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + SN+ ++ + +P +T Sbjct: 235 YRGLHVNPSVYYSDSSNSSQWIRIQGKCYDPGGST 269 >gi|45384390|ref|NP_990268.1| cochlin precursor [Gallus gallus] gi|7387581|sp|O42163|COCH_CHICK RecName: Full=Cochlin; AltName: Full=COCH-5B2; Flags: Precursor gi|2293562|gb|AAC62253.1| Coch-5B2 [Gallus gallus] Length = 547 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 51/161 (31%), Gaps = 20/161 (12%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYN 290 ++ +I T+ + +V S + + T T A+ R ++ Sbjct: 397 SDIGSKIATVQFTYDQRTEFSFTDYTTKEKVLSAIRNIRYMSGGTATGDAISFTTRNVFG 456 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + N F++ +TDG++ + + AG+ ++SV V+ Sbjct: 457 PVKDGANK-------NFLVILTDGQSYD--------DVRGPAVAAQKAGITVFSVGVAWA 501 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 P L+ F + L + + I + Sbjct: 502 PLDD--LKDMASEPRESHTFFTREFTGLEQMVPDVIRGICK 540 >gi|163754946|ref|ZP_02162067.1| hypothetical protein KAOT1_02992 [Kordia algicida OT-1] gi|161325013|gb|EDP96341.1| hypothetical protein KAOT1_02992 [Kordia algicida OT-1] Length = 593 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 33/192 (17%), Positives = 58/192 (30%), Gaps = 19/192 (9%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 N T A K+ +L + +VN ++ ++ + Y Sbjct: 234 SNFTFLIDVSGSMFAQNKLPLLKSAFTLMVNKMRPED--------KVAIVVYAGAAGMVL 285 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 NN ++ LN L +T + AY+ + N VI Sbjct: 286 EPTSGNNKQKILDALNNLRAGGSTAGGAGIELAYKTATENFIKNGNNR--------VIMA 337 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAV 370 TDG+ + + + E R G+ + + D L D G V Sbjct: 338 TDGDFNVG--ATSETAMETLIEQKRETGVYLSVLGFGMGNYQDDRLELLADKGNGNHAYV 395 Query: 371 NDSRELLESFDK 382 + +E + F K Sbjct: 396 DTMQEAHKIFGK 407 >gi|256962322|ref|ZP_05566493.1| von Willebrand factor [Enterococcus faecalis Merz96] gi|256952818|gb|EEU69450.1| von Willebrand factor [Enterococcus faecalis Merz96] Length = 1154 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 97/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 202 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 261 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 262 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 315 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 316 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 375 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 376 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 435 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 436 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNISKRINS 485 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 486 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 545 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 546 DYAPDISDYLAKKAVQISGTVV 567 >gi|302346570|ref|YP_003814868.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] gi|302151211|gb|ADK97472.1| von Willebrand factor type A domain protein [Prevotella melaninogenica ATCC 25845] Length = 331 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 61/165 (36%), Gaps = 45/165 (27%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++++ K L+ +NP + T+ A+ + ++++ K + Sbjct: 145 LPITSDYVSAKMFLDNINPSLIGTQGTDIGKALQLSMNSFTP----------NSKVGKAI 194 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 I ITDGE++ ++ + ++ G++++ + V + Sbjct: 195 ILITDGEDNEG-------GAEEMAKQAQSKGIRVFILGVGSTEGATIPMPDGSELKTSNG 247 Query: 353 -------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQE 389 +++ ++ + G + V +S + +K+Q+ Sbjct: 248 EVVKTRLNEEMCKQIATAGHGVYLHVTNSSMADAVLGRELNKLQK 292 >gi|218438801|ref|YP_002377130.1| von Willebrand factor A [Cyanothece sp. PCC 7424] gi|218171529|gb|ACK70262.1| von Willebrand factor type A [Cyanothece sp. PCC 7424] Length = 573 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 17/125 (13%), Positives = 43/125 (34%), Gaps = 16/125 (12%) Query: 240 TIAYNIGIVGNQCTPLSNNLNE--VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 + Y L + L + ++ L T Y M +L +K+++ + Sbjct: 437 LVTYGDRAAEVVPLGLFDELQHKRFLAAIDNLRADGATAMYDGMMIGLSKLMEQKKNNPD 496 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 +++ +TDG+ + + ++ +G+++Y +A Q+ L Sbjct: 497 GRF------YLLLLTDGQANMGVTFDEVKEVIEY------SGVRVYPIAYG--DVNQEEL 542 Query: 358 RKCTD 362 Sbjct: 543 EAIAS 547 >gi|4809045|gb|AAD30063.1| Itgae protein [Mus sp.] Length = 895 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 50/155 (32%), Gaps = 16/155 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++N +++ + E T T AM H ++ S Sbjct: 223 LVQYGAVIQTEFDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKK 282 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQD 355 K ++ +TDG+ + + LN + + G+ +++ V Sbjct: 283 A-----LKVMVVLTDGD-----IFGDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYR 332 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L K+ +I Sbjct: 333 ELKLIASDPKEAHTFKVTNYSALDGLLSKLQQRIV 367 >gi|149053317|gb|EDM05134.1| rCG33209 [Rattus norvegicus] Length = 1169 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 51/155 (32%), Gaps = 16/155 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++N +++ + E T T AM H ++ S Sbjct: 238 LVQYGGVIQTEFDLLDSRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKK 297 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQD 355 K ++ +TDG+ +++ LN + + G+ +++ V Sbjct: 298 A-----LKVMVVLTDGD-----IFRDPLNLTTVISSSKMQGVVRFAIGVGNAFENNNTYR 347 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L K+ +I Sbjct: 348 ELKLIASDPKAAHTFKVTNYSALDGLLSKLQQRII 382 >gi|3236344|gb|AAC23663.1| integrin alpha E2 [Rattus norvegicus] Length = 1167 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 51/155 (32%), Gaps = 16/155 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++N +++ + E T T AM H ++ S Sbjct: 236 LVQYGGVIQTEFDLLDSRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKK 295 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQD 355 K ++ +TDG+ +++ LN + + G+ +++ V Sbjct: 296 A-----LKVMVVLTDGD-----IFRDPLNLTTVISSSKMQGVVRFAIGVGNAFENNNTYR 345 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L K+ +I Sbjct: 346 ELKLIASDPKAAHTFKVTNYSALDGLLSKLQQRII 380 >gi|313239055|emb|CBY14037.1| unnamed protein product [Oikopleura dioica] Length = 1055 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 28/221 (12%), Positives = 59/221 (26%), Gaps = 15/221 (6%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 Y + I + N I E + S+R Sbjct: 763 YEPCNQDACDAGCSGPRDVLFVAHYTTYMGSTFADISA--FFENIISTINVEPSDSSIRF 820 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 +N + N+++E K + P Y A + + + Sbjct: 821 AFSFFNHAYIEFFAFDWLNSIDEYKWAFSSFPPASGNANYIG--RALKGAADTMTPAFGK 878 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 V+ +T+ +T ++ + ++ ++ V + GQD L Sbjct: 879 GRRIDTVGTVVLLTN--------AASTDEVNEMADQLKEKVDRVIVVGLGY-AFGQDELA 929 Query: 359 KCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 S + +S +L I D+I + + Sbjct: 930 GIASSPTKENLYTAEESSDLAGLVKTIADEICATELSNPSD 970 >gi|118348690|ref|XP_001007820.1| U-box domain containing protein [Tetrahymena thermophila] gi|89289587|gb|EAR87575.1| U-box domain containing protein [Tetrahymena thermophila SB210] Length = 790 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 31/231 (13%), Positives = 73/231 (31%), Gaps = 24/231 (10%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 ++ S + K+ + +D++ S K I + Sbjct: 133 RSACDICCVIDVSGSMSDEAKIKNSKGDIESNGLTILDLVKHSV--------KTIINNLD 184 Query: 234 LSVRIGTIAYNIGIVGNQCTPLS----NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 R+ +A++ + T L+ N N L KL P ++TN + ++ A + Sbjct: 185 ERDRLSLVAFHTNAY--KITDLTPMNENGRNHAIKELEKLIPLDSTNIWDGIYQALEVVK 242 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA---GMKIYSVA 346 ++ S ++ TDG+ + L + + + I + Sbjct: 243 AGQQQSIQKGEQRVAFSQILLFTDGQPNV----IPPRGHLPMLKKYKEENDVNCSISTFG 298 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + L + + G F + D + + F + + +A + Sbjct: 299 FGYNLDSELLDQLAIEGRGSFAFIPDGQFVGTVF---VNALSNLMTTLAVD 346 >gi|28210485|ref|NP_781429.1| membrane-associated protein [Clostridium tetani E88] gi|28202922|gb|AAO35366.1| membrane-associated protein [Clostridium tetani E88] Length = 842 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 25/148 (16%), Positives = 52/148 (35%), Gaps = 18/148 (12%) Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 ++ I + N ++ + KL P T P + + L + K Sbjct: 456 FSDTIDWVVPFQKAENKEKLIKEVGKLKPKGGTLIIPGLIEGVKTLSSAKTKV------- 508 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 K +I +TDG+ + E M+ M + +V + + + L Sbjct: 509 ---KHMILLTDGQAEKNGFD-------KYLENMKKNNMTLSTVGLGEDSDREVLTHLSDF 558 Query: 363 SSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ + ND + + F K +I ++ Sbjct: 559 TGGRKYFSNDFKSVPIIFAK-ETRISQK 585 >gi|331007462|ref|ZP_08330636.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [gamma proteobacterium IMCC1989] gi|330418739|gb|EGG93231.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [gamma proteobacterium IMCC1989] Length = 699 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 64/179 (35%), Gaps = 35/179 (19%) Query: 237 RIGTIAYNIGIVGNQCTPLS---NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R + +N G V L+ N+ V +L+ TN Y + + ++ Sbjct: 331 RFKVVLFNNGSVDLTSGFLTVSQANVANVLQQLDNYKVGGGTNLYAGLQKGLTGIDADRP 390 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + +I +TDG + + + + +++++ + Sbjct: 391 AG------------IILVTDGVANVGQT-----EKKTFLKLLNDHDVRLFTF-IMGNSAN 432 Query: 354 QDLLRKCTD-SSGQFFAVNDSREL------------LESFDKITDKIQEQSVR-IAPNR 398 + LL++ D S+G +V+++ ++ ++F I I V+ I P + Sbjct: 433 RPLLKEMADVSNGFAMSVSNADDIVGHIMLATSKLTHQAFRDIDLDIDGVKVKNITPEK 491 >gi|326430405|gb|EGD75975.1| hypothetical protein PTSG_00683 [Salpingoeca sp. ATCC 50818] Length = 762 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 54/181 (29%), Gaps = 23/181 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + E +LV+ + + A+ T +++ Sbjct: 508 NSGSVGPDNFALAQEFVMDLVDQMTISSTAIN-------VGAFLFNSQVQMLTAFTDDKT 560 Query: 261 EVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +++ ++ + T T A++ A + V ITDG + Sbjct: 561 AIQAAISGYSYPPSNTAGTATGAALNFAVDTMLQSGAGYRGG------SVLVYVITDGRS 614 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 S++ + G ++ SV + + LR S F + D +L Sbjct: 615 QEDSSF-----VASAAANLHATGAEVVSVGI-TSSVDETQLRTIATSDSNVFVLEDFSDL 668 Query: 377 L 377 Sbjct: 669 N 669 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 56/180 (31%), Gaps = 17/180 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 A + +D + + +S++ + + V ++ + S + + + Sbjct: 189 ASGSVGVDNFDIAKNFVADSVELMDVDPDVIRVAGM--MFHANPLPQFDFDFSFDRDVIA 246 Query: 264 SRLNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + T T A+++ + L + V FITDG + A Sbjct: 247 DAVRSFVYPTDRNWGTATGAALNYIRKYLL-----VPSAGNRDPADTIVYFITDGNSQEA 301 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 ++ + + G ++ ++ + Q L S V D +L E Sbjct: 302 LSF-----VQDAADNIHATGARVVAIGI-TDAIDQSQLEIIASSPDDVIIVEDFADLDEV 355 >gi|325268974|ref|ZP_08135595.1| aerotolerance protein BatB [Prevotella multiformis DSM 16608] gi|324988595|gb|EGC20557.1| aerotolerance protein BatB [Prevotella multiformis DSM 16608] Length = 330 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 59/165 (35%), Gaps = 45/165 (27%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++++ K L+ +NP + T+ A+ A +++ K + Sbjct: 145 LPITSDYVSAKMFLDNINPSLIGTQGTDIGKALQLAANSFTP----------NSKAGKAI 194 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------------------- 349 I ITDGE++ + + R+ G+K++ + + + Sbjct: 195 ILITDGEDNEG-------GAEAMAKQARSKGIKVFILGIGSRQGAVIPMPDGSELKTSDG 247 Query: 350 ----PPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQE 389 +++ R+ + G + V++S + K+Q+ Sbjct: 248 EPVKTHLNEEMCRQIAAAGQGVYVHVDNSNVADAVLGRELGKLQK 292 >gi|313239872|emb|CBY14719.1| unnamed protein product [Oikopleura dioica] Length = 982 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 59/201 (29%), Gaps = 33/201 (16%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL------ 266 + + G LV++ R+G + Y+ ++ + Sbjct: 22 VRQWIGKLVDTFDIEEDGGGT---RVGVVIYSDAPRMEISLGNGLGKTDLIKAVLVIYLN 78 Query: 267 -------NKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 L NT T ++ +A ++E + + +I +TDG Sbjct: 79 LIDFLFAQSLMYERGNTLTGESIRYASEVAFSETSGARALSEGIN--RIMIVLTDGRAQD 136 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSREL 376 N + G+ IY+V V +D L + F+V++ + Sbjct: 137 --------NVAGPAVIAQEDGIVIYAVGVG--HAIKDELDEIASKPTHRHKFSVSEYGAI 186 Query: 377 LESFDKITDKIQEQSVRIAPN 397 + I I PN Sbjct: 187 ESIRSNLRRTIC--IHSICPN 205 >gi|313212957|emb|CBY36854.1| unnamed protein product [Oikopleura dioica] Length = 1117 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 59/201 (29%), Gaps = 33/201 (16%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL------ 266 + + G LV++ R+G + Y+ ++ + Sbjct: 22 VRQWIGKLVDTFDIEEDGGGT---RVGVVIYSDAPRMEISLGNGLGKTDLIKAVLVIYLN 78 Query: 267 -------NKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 L NT T ++ +A ++E + + +I +TDG Sbjct: 79 LIDFLFAQSLMYERGNTLTGESIRYASEVAFSETSGARALSEGIN--RIMIVLTDGRAQD 136 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSREL 376 N + G+ IY+V V +D L + F+V++ + Sbjct: 137 --------NVAGPAVIAQEDGIVIYAVGVG--HAIKDELDEIASKPTHRHKFSVSEYGAI 186 Query: 377 LESFDKITDKIQEQSVRIAPN 397 + I I PN Sbjct: 187 ESIRSNLRRTIC--IHSICPN 205 >gi|270006429|gb|EFA02877.1| hypothetical protein TcasGA2_TC008029 [Tribolium castaneum] Length = 1868 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 52/133 (39%), Gaps = 26/133 (19%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 L+++ S++ TNT A A + S +K + ITDG ++G Sbjct: 147 LSKLLSKIE--YTGGGTNTLKAFEVAKEIFTQSRNDS---------EKVLFLITDGFSNG 195 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA--VNDSREL 376 + + + ++ +KI+++ ++ L + + G+ ++ ++ E Sbjct: 196 G-------DPIPLAAELKKDQVKIFTIGIANGNY--KELYELASTPGEIYSYLLDSFEE- 245 Query: 377 LESFDKITDKIQE 389 F+ + ++ Sbjct: 246 ---FESLARHLKS 255 >gi|219849077|ref|YP_002463510.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219543336|gb|ACL25074.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 546 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 53/132 (40%), Gaps = 12/132 (9%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 PLS+N ++ + + T + A+ + L + + ++ ++D Sbjct: 425 PLSDNRIALQDAVQAMRASGRTALFDALVLGKQVLEQLPPA------DDDRIRAIVLLSD 478 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 G ++ + A + + TL +G+ I+ VA + +L + S V D+ Sbjct: 479 GADNSSQASLDQIRTL-----FDESGISIFPVAYG-NDADRQVLDAIAEFSRTIVVVGDT 532 Query: 374 RELLESFDKITD 385 ++ + F+ ++ Sbjct: 533 GDIAQIFENLSR 544 >gi|297157667|gb|ADI07379.1| hypothetical protein SBI_04258 [Streptomyces bingchenggensis BCW-1] Length = 528 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 23/182 (12%), Positives = 63/182 (34%), Gaps = 15/182 (8%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN-----LNEV 262 ++D L + L + A E+ + + + + + + + L + Sbjct: 352 DRLDRLKAALTQLAGADGAATGERFRDREEVTLMPFGSEVKAVRTHTVPEDDPGKALAAI 411 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 ++ L T + ++ AYR L + + ++ ++ +TDGEN+ + Sbjct: 412 RADAKALTADGETAIFSSLRAAYRHLAERASALGDDRFTS-----IVLMTDGENTAGDSA 466 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFD 381 + + + ++ + + L + G+ F L ++F+ Sbjct: 467 DDFESFYRRLPGA-QRTTPVFPILFG--DSDRGELENIASLTGGRLFDAT-KGSLDQAFE 522 Query: 382 KI 383 +I Sbjct: 523 EI 524 >gi|229816811|ref|ZP_04447093.1| hypothetical protein BIFANG_02059 [Bifidobacterium angulatum DSM 20098] gi|229785827|gb|EEP21941.1| hypothetical protein BIFANG_02059 [Bifidobacterium angulatum DSM 20098] Length = 1185 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 45/357 (12%), Positives = 90/357 (25%), Gaps = 22/357 (6%) Query: 43 SIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQ 102 ++ + + ++ +G+ A + T K + + Sbjct: 386 QFARVQSNDFTDDTTYKVRELDSSGYTVSANGSPMTQQGSGNNAYAETGSFTVGKTSHVT 445 Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 + + + G NL+ + I +V+D S SM + Sbjct: 446 IVNSNVKPSNNKSIVKTDGGDGDQYTLNLTASGDSTSSTVTTATPADIVLVMDKSGSMNE 505 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + K LL SK + + A + + + + N+ Sbjct: 506 NNRDANAQKAAKDLAKKLLTGTN-----SKLPPEQQVQMAVVTFSTEASLKQKFTTNVSE 560 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYP 279 + N Q + VK + L+ P T +Y Sbjct: 561 INNAVRGNPDGGT--------NWEAALKQANDMQGGRRGVKKHIIFLSDGNPTYRTTSYS 612 Query: 280 A----MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + + +DG S+ N Sbjct: 613 GCYSYSLWGWTAHPEYTTPESCRAQRYTWGENPDGGSDGAYGAGSSDDYGFNYAAALAEA 672 Query: 336 RNAG-MKIYSVAVSAPPEG-QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 G +Y V S DL + +G+ F ++ L ++FD+I I Sbjct: 673 NRRGDAALYVVKTSTDANKMADLAEQANAVNGKEFDGTNAANLTKAFDQIYSSITSS 729 >gi|194227185|ref|XP_001916970.1| PREDICTED: inter-alpha (globulin) inhibitor H5 [Equus caballus] Length = 905 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 25/124 (20%), Positives = 52/124 (41%), Gaps = 11/124 (8%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++ ++P T+ A+ A R L N + R +IF+TDG+ + N Sbjct: 318 YIHHMSPTGGTDINGALQRAIRLLNNYVAHNDIE---DRSVSLIIFLTDGKPTVGE--TN 372 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL----LRKCTDSSGQFFAVNDS-RELLES 379 TL L + + I+++ + + + L L C + + +D+ +L+ Sbjct: 373 TLKILNNTKEATRGQICIFTIGIGDDVDFKLLEKLSLENCGLTR-RVHDEDDAGSQLIGF 431 Query: 380 FDKI 383 +D+I Sbjct: 432 YDEI 435 >gi|149922178|ref|ZP_01910616.1| flagellar biosynthesis protein P [Plesiocystis pacifica SIR-1] gi|149816918|gb|EDM76403.1| flagellar biosynthesis protein P [Plesiocystis pacifica SIR-1] Length = 689 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 24/167 (14%), Positives = 47/167 (28%), Gaps = 16/167 (9%) Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM----HHA 284 + T + P ++++ + N T T+ + + Sbjct: 378 TDPWGGPPIAWTYSDGQQDPPGFDIPTTSHMPQCAG--NTFCSGSGTYTHLGLQLIKDNQ 435 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + + + F I ITDG+ +G S E M N G+ Y Sbjct: 436 VQYQADGLMDGAEFPTNDETIYFNILITDGQYNGYST---NAQVQGELEEMYNDGITTYV 492 Query: 345 VAVSAPPE---GQDLLRKCT----DSSGQFFAVNDSRELLESFDKIT 384 + + L+ S ++ N+ EL + I Sbjct: 493 IGFGDGVDTPAAMAQLQNMAQWGSGDSENYYDANNQAELEAALTSIF 539 >gi|326434685|gb|EGD80255.1| hypothetical protein PTSG_10931 [Salpingoeca sp. ATCC 50818] Length = 706 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 39/363 (10%), Positives = 85/363 (23%), Gaps = 26/363 (7%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A+L+ AS +S T + S + I + + I K Sbjct: 12 ALLATMASALSPGQTSPSNTDEADVSGVAVDYIDIGSAAYEALNNFDTFVGTSHGIVFDK 71 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + + + Q + + S + L + Sbjct: 72 ESHKLMTQYMAQLLQALDNAIGEDIAVVRGFQSRPQVPTSAPSLLDVGRALRLRYKADGS 131 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP-------APANR 208 S L + N + + Sbjct: 132 ASSDARLSTLAQACINAGFDFVAVPADTTTDNYVYVSTPKMDCGDAVADLLFILDGSGSV 191 Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 + V + + S + ++ + + Sbjct: 192 GSGNFQTMLNFAQEVVSFFDVAPDKTRVAAMVYDSSNYRKFDFDYIQSVSKQQLINYFDT 251 Query: 269 L-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 P T T A+ A ++ + + S + + I ITDG+ + + Sbjct: 252 FAYPDGGTETGSALSFALSSMFVTSRGARDL--SEGVPRVAIVITDGK--------SGDD 301 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITD 385 + +R+AG+ +Y+V + L + ++ E F + Sbjct: 302 VSAPAQALRDAGVTLYAVGI--SGADVSELNQIASPPVEDNVVFIDTFSE----FSALAS 355 Query: 386 KIQ 388 KI Sbjct: 356 KIS 358 >gi|307298147|ref|ZP_07577951.1| von Willebrand factor type A [Thermotogales bacterium mesG1.Ag.4.2] gi|306916233|gb|EFN46616.1| von Willebrand factor type A [Thermotogales bacterium mesG1.Ag.4.2] Length = 704 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 18/136 (13%), Positives = 44/136 (32%), Gaps = 10/136 (7%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + E ++ ++ TN Y A+ + K ++F+TDGE Sbjct: 323 SEKAEWIEKVRRIQADGMTNIYGALQTSIDMFSKYDTGRF---------KALLFLTDGEP 373 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + R + ++S V + L R +++G+ + + + Sbjct: 374 TEGITDI-GRIISDATPEARARNVHLFSFGVGTGVVAELLDRLVQENAGRVSYIIEGESI 432 Query: 377 LESFDKITDKIQEQSV 392 + I+ ++ Sbjct: 433 EAKVTDLYRSIETPAL 448 >gi|294780961|ref|ZP_06746313.1| von Willebrand factor type A domain protein [Enterococcus faecalis PC1.1] gi|307268799|ref|ZP_07550167.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4248] gi|307286933|ref|ZP_07567011.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|294451907|gb|EFG20357.1| von Willebrand factor type A domain protein [Enterococcus faecalis PC1.1] gi|306501991|gb|EFM71279.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0109] gi|306514927|gb|EFM83474.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4248] gi|315031654|gb|EFT43586.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0017] gi|315034905|gb|EFT46837.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0027] gi|315165584|gb|EFU09601.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1302] Length = 1103 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 151 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 208 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 209 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|119358220|ref|YP_912864.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] gi|119355569|gb|ABL66440.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] Length = 344 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 36/168 (21%), Positives = 67/168 (39%), Gaps = 41/168 (24%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 PL+ + + + L+ ++ E T A+ A L S +K Sbjct: 156 TPCPLTLDHLTLGTVLDNISSEVIQEEGTAIGTAILIAVNRL----------RASESRQK 205 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 +I +TDG+N+ ++ L + G+KIY++A +A Sbjct: 206 AIILLTDGQNNAG-----DIDPLTAAGFALQDGIKIYTIAATAQDARPFVRSAESLPAGG 260 Query: 353 ------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +D+L + + G+ F V D L E+FD I D++++ S+R Sbjct: 261 ALSGLPAEDVLVSISRLTQGRSFRVGDKAGLAETFDDI-DRLEKSSLR 307 >gi|315028044|gb|EFT39976.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2137] Length = 1103 Score = 57.2 bits (136), Expect = 4e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 151 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 208 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 209 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|315148427|gb|EFT92443.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4244] Length = 1103 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 151 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 208 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 209 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|223670962|dbj|BAH22728.1| complement factor B precursor [Nematostella vectensis] Length = 858 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 20/124 (16%), Positives = 52/124 (41%), Gaps = 16/124 (12%) Query: 261 EVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 V +++ K+ T + A+ R++ +K + FITDG ++ Sbjct: 459 SVIAKIGKIKRSGGGTASRLALDTTIRQVVP--------FTREGSQKALFFITDGHSNIG 510 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF-FAVNDSRELLE 378 + + + +++ G +IY++ V ++L+ ++ ++ +V ++LL Sbjct: 511 GS------PRKAAKILKDKGFQIYAIGVGKKVRRRELMEIASEPEDEYVISVRKYKQLLS 564 Query: 379 SFDK 382 + K Sbjct: 565 AVKK 568 >gi|217978613|ref|YP_002362760.1| von Willebrand factor type A [Methylocella silvestris BL2] gi|217503989|gb|ACK51398.1| von Willebrand factor type A [Methylocella silvestris BL2] Length = 325 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 54/151 (35%), Gaps = 29/151 (19%) Query: 254 PLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PLS +++ V L ++ +T + A + L S + ++ Sbjct: 155 PLSFDVDAVSRTLAEIPLGLVGHSTAIGEGLGLALKRLTE----------SKAPSRVIVL 204 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA----------PPEGQDLLRKC 360 ++DG N T + + E N G+KIY++ + P L++ Sbjct: 205 LSDGANDAG-----TTDPTGVAELANNLGVKIYTIGLGVVDTQTFNGLGDPVDFLALQRL 259 Query: 361 TD-SSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G+ F V + +L + I + + Sbjct: 260 AEIGGGEAFRVRTTEDLAYASAAIERLVAGE 290 >gi|156390865|ref|XP_001635490.1| predicted protein [Nematostella vectensis] gi|156222584|gb|EDO43427.1| predicted protein [Nematostella vectensis] Length = 851 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 20/124 (16%), Positives = 52/124 (41%), Gaps = 16/124 (12%) Query: 261 EVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 V +++ K+ T + A+ R++ +K + FITDG ++ Sbjct: 452 SVIAKIGKIKRSGGGTASRLALDTTIRQVVP--------FTREGSQKALFFITDGHSNIG 503 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF-FAVNDSRELLE 378 + + + +++ G +IY++ V ++L+ ++ ++ +V ++LL Sbjct: 504 GS------PRKAAKILKDKGFQIYAIGVGKKVRRRELMEIASEPEDEYVISVRKYKQLLS 557 Query: 379 SFDK 382 + K Sbjct: 558 AVKK 561 >gi|315160966|gb|EFU04983.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0645] Length = 1103 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 151 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 208 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 209 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|148657647|ref|YP_001277852.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569757|gb|ABQ91902.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 966 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 32/185 (17%), Positives = 59/185 (31%), Gaps = 27/185 (14%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 S P K+D+ E+ S+ + ++G + ++ Sbjct: 419 GSMAEPVAGGRRNKLDLAKEAVYQ--ASLGLTPID------QVGLVVFDDTANWVLQLQP 470 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 ++ E++ L TN P + A L + K V+ +TDG Sbjct: 471 LPSMVEIERALGSFGIGGGTNIRPGIEQAALALAST----------DAKIKHVLLLTDGI 520 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 N + MR +G+ I +VAV +L+ G+ + V E Sbjct: 521 A--------ESNYSDLIAQMRASGITISTVAVGLDAN-PNLVDVANAGGGRSYRVTSIDE 571 Query: 376 LLESF 380 + F Sbjct: 572 VPRIF 576 >gi|315150339|gb|EFT94355.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0012] Length = 1103 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 97/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 211 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSSITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|300860089|ref|ZP_07106177.1| von Willebrand factor type A domain protein [Enterococcus faecalis TUSoD Ef11] gi|295112663|emb|CBL31300.1| Cna protein B-type domain./von Willebrand factor type A domain. [Enterococcus sp. 7L76] gi|300850907|gb|EFK78656.1| von Willebrand factor type A domain protein [Enterococcus faecalis TUSoD Ef11] gi|315144603|gb|EFT88619.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2141] Length = 1103 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 151 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 208 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 209 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|260800509|ref|XP_002595172.1| hypothetical protein BRAFLDRAFT_241020 [Branchiostoma floridae] gi|229280415|gb|EEN51183.1| hypothetical protein BRAFLDRAFT_241020 [Branchiostoma floridae] Length = 299 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 22/188 (11%), Positives = 62/188 (32%), Gaps = 20/188 (10%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + + +++ + + RIG + Y+ + + Sbjct: 28 GSVGATNFERMKTFVQKMISDFELGPEA-----TRIGVVVYSNRASLEISLDAFEDQESL 82 Query: 263 KSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + + P T T A+ + ++ + + ++K + +TDG + Sbjct: 83 QDAVAGIAYPGGYTLTGAAIDYTTTFAFSTRNGAR-----EGVRKVAVILTDGVSYD--- 134 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + + + MR A + Y+V + + +D L F ++D L Sbjct: 135 -----DPAEPAQSMRKAAIITYAVGIGSN-LDRDQLDVIAGVPDNLFVLDDFSMLDNLRT 188 Query: 382 KITDKIQE 389 + ++ + Sbjct: 189 TLPTQVCD 196 >gi|156349150|ref|XP_001621939.1| predicted protein [Nematostella vectensis] gi|156208297|gb|EDO29839.1| predicted protein [Nematostella vectensis] Length = 147 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 21/158 (13%), Positives = 55/158 (34%), Gaps = 19/158 (12%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYN 290 S RI + ++LN K ++ + T A+ +++N Sbjct: 8 SPSSTRISVATFENFPKMEFSFQKYSDLNSAKFAVDAIQISNGGTRIGEALKLVKTDMFN 67 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + ++ +TDG +++ + + +R+ G+ I ++ + + Sbjct: 68 TAR--------SNVPRMLLVMTDG--------RSSDDVVAPSRALRDIGVTILTLGLGSD 111 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 D L+ F + +L + +K+ K Sbjct: 112 -YDLDQLKMIASGVDLVFE-SPFDQLPQMVEKVQQKAC 147 >gi|327542166|gb|EGF28659.1| von Willebrand factor type A [Rhodopirellula baltica WH47] Length = 1014 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 48/379 (12%), Positives = 109/379 (28%), Gaps = 39/379 (10%) Query: 29 MQSALDA-AVLSGCASIVSDRTIKDPTTKKDQTS--TIFKKQIKKHL---KQGSYIRENA 82 +Q +D A + A ++D D T+ + + T + + + L + E Sbjct: 277 LQHKIDQPAAYTYEAEFIADEDEDDSLTQNNSATGYTYVRGKGRVLLIHGPEDIGDFELL 336 Query: 83 GDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERS 142 + + I +T L ++ Q + + ++ S IE Sbjct: 337 IATLRDSNIEVTPMPTTQLFGSL-AELQPYDAVILAGVARVSGDTTQTITSFSDDQIEML 395 Query: 143 SENLAISICMVLDVSRS---MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 N +L + + + + K+ + Sbjct: 396 VRNTQQLGAGLLMIGGPDSLGAGGWTGTELEKA-------MPVDFQIKNTKVQGVGALAL 448 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + + + + + + A + G G + N Sbjct: 449 IMHASEMAQGNYWQKQISIAAIEQLGSADKAGVVHWTMNGDKWLWGG--SKGMLEVGPNR 506 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + + + ++ P + PAM A L K +I I+DG+ Sbjct: 507 RAMLAAVGRMTPGDMPEFDPAMRMAVTGLVRT----------DASVKHLIIISDGDPGPP 556 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSA-PPEGQDLLRKCT-DSSGQFFAVNDSRELL 377 S + + ++ + I +VAV + L+ + G+++AV R L Sbjct: 557 SNS--------VIQAFKDNSITISTVAVESHGLSDSRRLQDIARATGGKYYAVKSGRALP 608 Query: 378 ESFDKITDKIQEQSVRIAP 396 F + ++ + P Sbjct: 609 GIFQREARRVTRPLIYEPP 627 >gi|269126610|ref|YP_003299980.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268311568|gb|ACY97942.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 315 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 19/146 (13%), Positives = 49/146 (33%), Gaps = 24/146 (16%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 S + S ++ L + T A+ + + + S G ++ ++DG+ Sbjct: 144 SGDRAAAISSIDTLTLAKRTAIGEAVFTSLQAI----RSFDAQAGQDPPPAHIVLLSDGD 199 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--------------PEGQDLLRKCT 361 N+ + + + R A + + ++A P L Sbjct: 200 NTTGRSVP------EAIDAARAADVPVSTIAFGTPYGTVEIDGETTPVEVNKVTLAGLAQ 253 Query: 362 DSSGQFFAVNDSRELLESFDKITDKI 387 ++G+ + D+ +L + + I + Sbjct: 254 GTNGKAYEAADNDQLSQVYANIGTSL 279 >gi|332885553|gb|EGK05799.1| hypothetical protein HMPREF9456_02063 [Dysgonomonas mossii DSM 22836] Length = 580 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 34/285 (11%), Positives = 81/285 (28%), Gaps = 26/285 (9%) Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 D + ++ A Y + L + +E + Sbjct: 123 DPLSTFSVDVDA-ASYSNMRRFINRGSLPNKDAVRIEELINYFSYNYTEPTGNDPIKI-- 179 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 S + + N + + + N ++D++ Sbjct: 180 -STEVGNCPWNSQNRLVKIGLKARSMAGENLPA---SNFVFLIDVSGSMYGATRLDLVKS 235 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 S L N++++ R+ + Y + N ++K L+ LN +T Sbjct: 236 SLKLLTNNLREKD--------RVAIVVYAGSAGEVLPSTSGANKQKIKEALDNLNAGGST 287 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + AY+ N +I TDG+ + + + L L + E Sbjct: 288 TGGAGIQLAYKIAKQNFIKGGNNR--------IILCTDGDFNVGVSSNDGL--LALIEQE 337 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLES 379 R +G+ + + ++ + G +++ +E + Sbjct: 338 RKSGVFLSILGYGMGNYKDSKMQTLAQAGNGNHAYIDNLQEANKV 382 >gi|327313514|ref|YP_004328951.1| von Willebrand factor type A domain-containing protein [Prevotella denticola F0289] gi|326944388|gb|AEA20273.1| von Willebrand factor type A domain protein [Prevotella denticola F0289] Length = 331 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 26/165 (15%), Positives = 59/165 (35%), Gaps = 45/165 (27%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + K L+ +NP + T+ A+ + ++++ K + Sbjct: 145 LPITGDYVSAKMFLDNINPSLIGTQGTDIGKALQLSINSFTP----------NSKVGKAI 194 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 I ITDGE++ + + RN G+K++ + + + Sbjct: 195 ILITDGEDNEG-------GAEAMAKQARNKGIKVFILGIGSKEGSTIPMPDGTELKDSNG 247 Query: 353 -------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQE 389 +D+ R+ + G + V++S K K+Q+ Sbjct: 248 EPVKTHLNEDMCRRIAAAGQGVYVHVDNSNVADAVLGKELGKLQK 292 >gi|156382099|ref|XP_001632392.1| predicted protein [Nematostella vectensis] gi|156219447|gb|EDO40329.1| predicted protein [Nematostella vectensis] Length = 298 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 45/128 (35%), Gaps = 9/128 (7%) Query: 258 NLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N +K +++L T A+ A + L+ + +K + ++DG Sbjct: 172 NAVNIKRDIDELRRKKGYTFIDKALTLADKSLFTQ-----EAGMREDSQKVAVLMSDGIQ 226 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + ++ G+++Y+V + A + +L+ F+ +L Sbjct: 227 TKDRGPFTPTD--IAANPLKMKGVQVYTVGIGADVDVFELM-AVASGITSMFSDRYLDDL 283 Query: 377 LESFDKIT 384 I+ Sbjct: 284 KAELAYIS 291 >gi|118096903|ref|XP_425157.2| PREDICTED: similar to type VII collagen [Gallus gallus] Length = 1645 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 21/122 (17%), Positives = 46/122 (37%), Gaps = 19/122 (15%) Query: 258 NLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N ++ + +L+ NT T + + + + + K I ITDG++ Sbjct: 17 NGTGIQKAIQQLSYKGGNTRTGAGLRYISDNFFGPTQLRPG------VPKICILITDGKS 70 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSR 374 + Q +++ G+K+++V + + L + + FF V D + Sbjct: 71 QD--------DAEQPALRLKSLGIKVFAVGI--KNADRQELIRVASLPTDSFFFYVGDFK 120 Query: 375 EL 376 L Sbjct: 121 LL 122 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 60/185 (32%), Gaps = 17/185 (9%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 R ++ L+++ A+ ++G Y+ + S+ L V + Sbjct: 872 GTRDSSSDADTVRTLLSNTVTAMGRLGPDGTQVGLATYSYRSLPWLLLNRSSELPAVLEQ 931 Query: 266 LNKLNPY--ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + + A+ A L + + + ++ + DG Sbjct: 932 IRTMRYEEPSGNAIGAAITFARTYLLSPGAGRRPS-----VPAVLVVLADG--------P 978 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + + + + AG+++ +AV Q+ LR+ F + L E ++ Sbjct: 979 SGDDAITAARDAKAAGVRV--LAVGLEGADQEQLRRMVSGEDPRFVFRNRGSLSELEGEL 1036 Query: 384 TDKIQ 388 TD + Sbjct: 1037 TDDLC 1041 >gi|213961715|ref|ZP_03389981.1| protein containing von Willebrand factor [Capnocytophaga sputigena Capno] gi|213955504|gb|EEB66820.1| protein containing von Willebrand factor [Capnocytophaga sputigena Capno] Length = 550 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 49/148 (33%), Gaps = 11/148 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + R+ + Y G + +++ L+ L +T+ + AY+E Sbjct: 223 KPTDRVAIVTYASGTKVALSSTPVKERQKIEKVLDNLYASGSTSGSSGIQLAYKEAQKNF 282 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + N +I TDG+ + N + E R +G+ + + Sbjct: 283 IKNGNNR--------IILATDGDFNVG--ISNPRELEKFIEKQRESGIYMSVLGFGMGNY 332 Query: 353 GQDLLRKCTDSS-GQFFAVNDSRELLES 379 D+ D G + ++D E + Sbjct: 333 RDDMAETIADKGNGNYAYIDDLTEAKKV 360 >gi|299136327|ref|ZP_07029511.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298602451|gb|EFI58605.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 376 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 36/358 (10%), Positives = 106/358 (29%), Gaps = 49/358 (13%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + + P T Q + + + ++ Sbjct: 25 AAISQTTTPQPAAPPVTPSTAPQQPATT--PATTPQSASPTQPNSTSPTSPTGSTAAPQS 82 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + S + T + + L G NL +LD R Sbjct: 83 STPTLQPRSDQDAQDGTTFTLHRSVNEVDLI---FTVMGKDGHFVSNLQQQNFGLLDDGR 139 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + + N + L + ++ + ++A Sbjct: 140 PPQHVLRFAQQTNLPLRVGIMLDT--------------------SSSIRQRFEFEQQAAT 179 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 + + + + N +N ++ + + + +L P T + Sbjct: 180 DFLLQVLHPAD---------RAFVEGFDVQINIAQDFTNRIDMLDTGIRRLRPGGGTALF 230 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 +++ K+ ++K ++ ++DG++ + +++C+ Sbjct: 231 DSLYRTC------KDQMLTLQQDAEVRKAIVLVSDGDDDYSRVL--ETEAIKMCQRAE-- 280 Query: 339 GMKIYSVAVSAP---PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 +Y+++ + +G D+L++ +D + GQ F ++ F I ++++ Q + Sbjct: 281 -TIVYTISTNVGPSRDKGDDVLQQISDATGGQSFYPQRIDDVAIGFRNIEEELRSQYL 337 >gi|282858825|ref|ZP_06267970.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] gi|282588394|gb|EFB93554.1| von Willebrand factor type A domain protein [Prevotella bivia JCVIHMP010] Length = 340 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 55/165 (33%), Gaps = 45/165 (27%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++++ K L+ +NP + T+ A++ A + K + Sbjct: 145 LPITSDFISAKMFLDNINPSLIGTQGTDIGQAINLAMHSFSPTSK----------SGKAI 194 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL----------- 357 + ITDGE++ ++ + AG+++Y + V + + L Sbjct: 195 VVITDGEDNEGR-------AEEMASKAQKAGIQVYILGVGSTSGAEIPLGNGEMLKDNSG 247 Query: 358 -------------RKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + G + V+++ + K+Q+ Sbjct: 248 NVVRTHLNENMCKKIAAAGKGVYIHVDNNNDAQNILKGELSKLQK 292 >gi|109090611|ref|XP_001091779.1| PREDICTED: von Willebrand factor A domain-containing protein 2 isoform 3 [Macaca mulatta] Length = 781 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 21/168 (12%), Positives = 57/168 (33%), Gaps = 22/168 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G ++ + EVK+++ ++ T+T A+ + Sbjct: 82 DISTERVRVGAFQFSSTPHLEFPLDSFSTQQEVKAKIKRMIFKGGHTDTGLALKYLLNRG 141 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ + + ++ +TDG++ G A + ++ G+ +++V V Sbjct: 142 FPGGRNA-------SVPQILVIVTDGKSQGHVALP--------AKQLKEKGVTVFAVGVR 186 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLE-SFDKITDK-IQEQSV 392 P + L + F +++ I + Sbjct: 187 FPRWEE--LHALASEPREQHVLLAEQVEDATNGLFSTLSNSAICSSTT 232 >gi|109090613|ref|XP_001091550.1| PREDICTED: von Willebrand factor A domain-containing protein 2 isoform 1 [Macaca mulatta] Length = 725 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 21/168 (12%), Positives = 57/168 (33%), Gaps = 22/168 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G ++ + EVK+++ ++ T+T A+ + Sbjct: 82 DISTERVRVGAFQFSSTPHLEFPLDSFSTQQEVKAKIKRMIFKGGHTDTGLALKYLLNRG 141 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ + + ++ +TDG++ G A + ++ G+ +++V V Sbjct: 142 FPGGRNA-------SVPQILVIVTDGKSQGHVALP--------AKQLKEKGVTVFAVGVR 186 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLE-SFDKITDK-IQEQSV 392 P + L + F +++ I + Sbjct: 187 FPRWEE--LHALASEPREQHVLLAEQVEDATNGLFSTLSNSAICSSTT 232 >gi|298491707|ref|YP_003721884.1| von Willebrand factor type A ['Nostoc azollae' 0708] gi|298233625|gb|ADI64761.1| von Willebrand factor type A ['Nostoc azollae' 0708] Length = 426 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 58/189 (30%), Gaps = 27/189 (14%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NK 268 I+ +I++ L+ +Q I +A+ + + +K +L + Sbjct: 59 INTVIQAVEQLLAQLQPGD--------HISIVAFAGTSEVIIPNQIVQDAESIKCQLHKR 110 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ----- 323 L T + EL G+ +TDG Sbjct: 111 LKAGGGTIIAEGLSLGITELL---------KGTKGAVSQAFLLTDGHGDRGLKIWKWEMG 161 Query: 324 --NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 + L++ + + + + QDLL K D G + ++ ++ F Sbjct: 162 PNDKKRCLELAQKATRVSLTLNTFGFGNDWN-QDLLEKIADAGGGTLAYIERPQQAVDQF 220 Query: 381 DKITDKIQE 389 ++ +IQ Sbjct: 221 SRLLKRIQS 229 >gi|327534722|gb|AEA93556.1| von Willebrand factor type A domain protein [Enterococcus faecalis OG1RF] Length = 1103 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 96/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ +Y + L++R E Sbjct: 211 T--SWNGDPTNRTNSYTEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNISKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|288919483|ref|ZP_06413814.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288349086|gb|EFC83332.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 533 Score = 57.2 bits (136), Expect = 5e-06, Method: Composition-based stats. Identities = 33/185 (17%), Positives = 60/185 (32%), Gaps = 19/185 (10%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL--------SNNL 259 +I L + L + + R N+ + S L Sbjct: 355 DRISDLRSALIGLTGADSSLTARFTSFRAREKITLVPFDSGVNRISDFAVTDPSPDSPEL 414 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E++ + N +T Y A+ AY + + S V+ +TDGEN+ Sbjct: 415 KELRRAVEGFNAGGDTAIYSALRAAYDRAAADLARDGSYYTS------VVLLTDGENTTG 468 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 ++ + L + A + ++V D LR+ D + G F + L + Sbjct: 469 ASADDFLAHYRSLSPAARA-VPTFTVLFG--DADPDALRQIADVTGGTVFDAGSTS-LPD 524 Query: 379 SFDKI 383 F I Sbjct: 525 VFKDI 529 >gi|227552873|ref|ZP_03982922.1| pilus subunit protein [Enterococcus faecalis HH22] gi|229550416|ref|ZP_04439141.1| pilus subunit protein [Enterococcus faecalis ATCC 29200] gi|257422992|ref|ZP_05599982.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis X98] gi|312952910|ref|ZP_07771770.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0102] gi|227178004|gb|EEI58976.1| pilus subunit protein [Enterococcus faecalis HH22] gi|229304444|gb|EEN70440.1| pilus subunit protein [Enterococcus faecalis ATCC 29200] gi|257164816|gb|EEU94776.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis X98] gi|310629158|gb|EFQ12441.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0102] gi|315154335|gb|EFT98351.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0031] gi|315156569|gb|EFU00586.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0043] gi|315573683|gb|EFU85874.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309B] gi|315582702|gb|EFU94893.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0309A] gi|323480330|gb|ADX79769.1| Endocarditis and Biofilm-Associated Pilus subunitA [Enterococcus faecalis 62] Length = 1103 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 151 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 208 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 209 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|333028697|ref|ZP_08456761.1| hypothetical protein STTU_6202 [Streptomyces sp. Tu6071] gi|332748549|gb|EGJ78990.1| hypothetical protein STTU_6202 [Streptomyces sp. Tu6071] Length = 418 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 51/139 (36%), Gaps = 20/139 (14%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + E K+ + L+P T PA+ A +L K ++ ITDGE Sbjct: 107 DRTEAKAAVATLSPTGWTPIGPALLGAADDL-----------RGGEASKRIVLITDGE-- 153 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSR 374 + ++ + G+ + ++ + + +D L + + G + V ++ Sbjct: 154 ----DTCHRDPCEVAREIAAKGVHLVVDTLGLVPDAKTRDQLSCIAEATGGTYTTVRHTK 209 Query: 375 ELLESFDKITDKIQEQSVR 393 +L ++ + + V Sbjct: 210 DLSGRVKQLVHRAADPVVT 228 >gi|326911070|ref|XP_003201885.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like [Meleagris gallopavo] Length = 948 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 16/149 (10%), Positives = 52/149 (34%), Gaps = 7/149 (4%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 N + + + K + ++P TN A+ A L + S Sbjct: 359 CWRDNLVSATPAQVEDAKKYIQTIHPNGGTNINEALLRATFILNEAQNLGMLDPNS---V 415 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS- 364 ++ ++DG+ + T+ + + +++ ++ + + D L++ + Sbjct: 416 SMIVLVSDGDPTVGELKLTTIQ-KNVKQSIKDE-FSLFCLGIGFDV-DYDFLQRIATDNR 472 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQSVR 393 G + ++E + +++ ++ Sbjct: 473 GMAQRIFGNQETSAQMKRFYNQVSTPLLK 501 >gi|313204752|ref|YP_004043409.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312444068|gb|ADQ80424.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 626 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 36/338 (10%), Positives = 95/338 (28%), Gaps = 26/338 (7%) Query: 43 SIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQ 102 S V ++ D K + +K+ E G + ++ + + Sbjct: 119 STVGAISVADVADVKVSNYNQPSSFMPVMIKRVQPDAEEYGSYKENRFLSAKEQALSTFS 178 Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 ++ A Y + + P + + + + Sbjct: 179 LDVDA-ASYGNMRRMINMGQKPPKDAIRVEELINYFSYDYPKPTGKDPVSINTETSICPW 237 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + P + + + S P K++++ S L N Sbjct: 238 DATHRLVKIGVKAREIPSENLPASNFVFLLDVSGSMDVP------NKLELVKSSIKLLTN 291 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMH 282 +++K R+ + Y + + ++ ++ L+ +T + Sbjct: 292 NLRKTD--------RVAIVVYAGAAGVVLESTEGTDKQKIMEAVDGLHAGGSTAGGAGIQ 343 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 AY+ + N ++ TDG+ + + N L + E R G+ + Sbjct: 344 LAYKIAEKNFIENGNNR--------IVLCTDGDFNVGVSSNNEL--ESLIESKRKTGVYL 393 Query: 343 YSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLES 379 + + L+ + G +++ +E + Sbjct: 394 TVLGYGMGNYKDNKLQILAEKGNGNHAYIDNIQEANKV 431 >gi|325677636|ref|ZP_08157288.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] gi|324110604|gb|EGC04768.1| von Willebrand factor type A domain protein [Ruminococcus albus 8] Length = 549 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 40/398 (10%), Positives = 104/398 (26%), Gaps = 37/398 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ ++ +A + A D+ + + +++ +K + ++ T Sbjct: 8 IFAMAAAIMMTVTMFASCGS-----------ASDSTYKAENDTNITNNDVKGGKSPENIT 56 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + I E + N + + + Sbjct: 57 ADAETDGGEFNYYTEEAIPEINTEEYNHYAENSFQSVAEHPLSTFSTDVD---TASFTNV 113 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + +I + + + D + ++ + L Sbjct: 114 RRMIENRQDIFADAVRTEEFINYFKYDYPQPDNDDKIGITTE-LSDCPWNDESKLMLVGL 172 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 N K+ ++ ++ L ++ + RI Sbjct: 173 QAKDIDVQDIDSNIVFLIDVSGSMGDANKLPLVAQAFAMLAENLG--------QNDRISI 224 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + Y N +++ L L +T ++ AY N Sbjct: 225 VTYAGRDTVELEGESGANYSKIAETLAGLTAGGSTAGAAGINTAYELAQKYFIEGGNNR- 283 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 VI TDG+ + + Q L + E R+ G+ + + L Sbjct: 284 -------VILATDGDLNVGLSSQEEL--TALIEEKRDMGIFLSVLGFGMGNYKDSRLEAL 334 Query: 361 TDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 D+ G + ++ E + ++ +A + Sbjct: 335 ADNGNGNYAYIDSIDEAERV---LVTEMNGTMFTVAKD 369 >gi|156403935|ref|XP_001640163.1| predicted protein [Nematostella vectensis] gi|156227296|gb|EDO48100.1| predicted protein [Nematostella vectensis] Length = 247 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 22/171 (12%), Positives = 59/171 (34%), Gaps = 13/171 (7%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELY- 289 SV++G + ++ N + + ++K+ + + A+ ++L+ Sbjct: 66 SKDSVQVGVVPFSHHYALEFGLTNYTNHKALDAAIDKIQFEGSFSMLSGALKVVQQKLFM 125 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + + + V+ + DG N S + +++ G +++ V + Sbjct: 126 PQLVEVKSKAKKDKPLQAVVIVGDGGNLSGSDALYESSL-----ALKDTGKRLFVVGLGR 180 Query: 350 PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + +R FF + L ++ + I + I P R Sbjct: 181 LEY-EASMRMLASEPPKTHFFNAGTGKNLKNFVKRLANSICKD---ICPER 227 >gi|148976671|ref|ZP_01813358.1| von Willebrand factor type A domain protein [Vibrionales bacterium SWAT-3] gi|145964022|gb|EDK29280.1| von Willebrand factor type A domain protein [Vibrionales bacterium SWAT-3] Length = 303 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 40/339 (11%), Positives = 91/339 (26%), Gaps = 74/339 (21%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M+ I + L + +ID ++++ L S + +D Sbjct: 19 MSVIALPFILLVVGLSIDAGRAYIVKSK---------LFAAVDAASIAAARAVANGEDAG 69 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 +K ++ Y +N D + + AQ Sbjct: 70 RAAAQKYFAANIPADFYSATP-----NLGDVNFAYDSFGNISIDISATAQV-----PTVF 119 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 LI N + + I + + +V+D + S+ + Sbjct: 120 LPLIGLDTFNPGVSAQSIRR------PVDLVLVIDNTTSLRLGSIGDVTQ---------- 163 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + +I ++ + G+ V A + +++ Sbjct: 164 ----------DVIDRSKSFIENFHEGFDRISLVKFAFGSEVPVGFNATRGHSRSTIKSEI 213 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 ++N G N TN M+ A+ EL + ++ Sbjct: 214 DSFNFGSTSNAQ---------------------YTNASEGMYRAFNELRTVTDPANL--- 249 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 K ++F TDG + ++ + +R++ Sbjct: 250 -----KVIVFFTDGAPNTFASTFDFEGGGTHTGAIRSSD 283 >gi|90021002|ref|YP_526829.1| hypothetical protein Sde_1355 [Saccharophagus degradans 2-40] gi|89950602|gb|ABD80617.1| von Willebrand factor, type A [Saccharophagus degradans 2-40] Length = 787 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 + N ++ + L+ + + + + + +V Q Sbjct: 49 DVRLVIDVSGSMKRNDPANLRQPAVDLLMQLLPEGSKAGVWTFGKWVNMLVPHQVVDEQW 108 Query: 253 TPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L +++ +++N TN A+ A +L + K +I + Sbjct: 109 RSLG------RAKASEINSVGLYTNIGEALEKAAYDLDAASD---------EYAKHIILL 153 Query: 312 TDGENSGASAYQNTLN-----TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSG 365 TDG ++ ++ AG I++VA+ + +LL+K + + G Sbjct: 154 TDGMVDIDKQPDKNTQEWRRIVDEVLPKLKAAGYTIHTVAL-SDNADNNLLKKLSLQTDG 212 Query: 366 QFFAVNDSRELLESF 380 + + +L++ F Sbjct: 213 IASVAHTADDLMKIF 227 >gi|331085807|ref|ZP_08334890.1| hypothetical protein HMPREF0987_01193 [Lachnospiraceae bacterium 9_1_43BFAA] gi|330406730|gb|EGG86235.1| hypothetical protein HMPREF0987_01193 [Lachnospiraceae bacterium 9_1_43BFAA] Length = 1321 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 47/361 (13%), Positives = 104/361 (28%), Gaps = 46/361 (12%) Query: 34 DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI 93 DAA + + K+ + + T K + + + ++ G + Q N Sbjct: 376 DAAYRGPAGELNRILSEKEDKEQLKEIDTYAKLTVDHIVPIVDHSQDTDGQKEETFQENK 435 Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 + + + QY+ ++ Q + + Sbjct: 436 EERQEDFTQYVTDTVNQKAASV------SISGIDTKEFETVRAVVSLEEGIADTEEKFRG 489 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 + L S + KI+ L Sbjct: 490 NVEILDCGVEIPDYKVKKLEYDTVNIALCCDNSGSMEGE----------------KIENL 533 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 ++ V + V IG + + G++ C P ++ +++ + Sbjct: 534 KKAVSTFVGKLA--------DEVNIGIVPFGSGVLEGVCEP-GSSREKLEQSVESFRSDS 584 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 TN Y + + L EK++ + ++DG++ + + +I Sbjct: 585 GTNIYSGVEYTLSMLAKEKDA----------LNIAVIMSDGQD----SIPSEEQLQKITS 630 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 N + +YS+ + A E + L +G + V+DS L + I I + Sbjct: 631 ACENGNILLYSMGLGADVESEVLSTYSDAGNGAYVFVSDSNSLYSFYQYIYQ-ISKNRYE 689 Query: 394 I 394 I Sbjct: 690 I 690 >gi|315158311|gb|EFU02328.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0312] Length = 1103 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 151 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 208 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 209 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|327283159|ref|XP_003226309.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Anolis carolinensis] Length = 1076 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 29/197 (14%), Positives = 70/197 (35%), Gaps = 25/197 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + +++ + LV+ + + RIG + Y++ + Sbjct: 811 SSESVGPENFEIIKDFVTALVDRVT-----VGRNATRIGLVLYSLEVRLEFGLNRYTTQQ 865 Query: 261 EVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +VK + K+ T T A+ A +E + + ++K I +TDG+ Sbjct: 866 DVKQAIRKMLYMGEGTYTGTAIRKATQEGFFGARTG--------VRKVAIVLTDGQADKR 917 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAV--SAPPEGQDLLRK---CTDSSG--QFFAVND 372 A + A +++Y++ + ++ P + LR+ + ++D Sbjct: 918 EA----VKLDIAVREAHAANIEMYAIGIVNTSDPTQVEFLRELNLIASDPDREHMYLIDD 973 Query: 373 SRELLESFDKITDKIQE 389 L K+ ++ E Sbjct: 974 FNTLPALESKLVNQFCE 990 >gi|260800517|ref|XP_002595175.1| hypothetical protein BRAFLDRAFT_240946 [Branchiostoma floridae] gi|229280419|gb|EEN51187.1| hypothetical protein BRAFLDRAFT_240946 [Branchiostoma floridae] Length = 286 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 49/153 (32%), Gaps = 15/153 (9%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAY 285 + + +VR+ + Y +L E + ++ + T T A+ A Sbjct: 29 SYLDIGENAVRVSIVQYAAQARTEFFLDQYYDLQEAQDAVDDIEYMGGYTLTGKAIDFAT 88 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 ++ ++ + + K + ITDG + MR AG+ +V Sbjct: 89 NLHFDLRKGARA-----DVTKIAVVITDGRSYDDVNRP--------ARRMRQAGIVTIAV 135 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 V +D L +++ L + Sbjct: 136 GVG-NNLDRDQLTAIAGDPKTLLSLDGFDRLQD 167 >gi|163848202|ref|YP_001636246.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526110|ref|YP_002570581.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669491|gb|ABY35857.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222449989|gb|ACM54255.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 415 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 21/187 (11%), Positives = 56/187 (29%), Gaps = 21/187 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K+D L + ++ +Q + ++ + + + + + ++ Sbjct: 59 KLDNLKAATRRVIEQLQPTDIA--------AIVIFDDTVQTLIPATPVGDRSALLAAVDS 110 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 ++ T M EL S + + +TDG+ + + Sbjct: 111 ISEAGGTAMSLGMQAGQVELQKHLGSDRLSRM--------LLLTDGQT-----WGDEPLC 157 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + + AG++I + + Q L S+G + ++ F + + Q Sbjct: 158 RDLARSLGQAGVQIVAFGLGTEWNEQLLDDIAAASNGYSDYIAAPEQIGTFFQQAVHEAQ 217 Query: 389 EQSVRIA 395 Sbjct: 218 ATVATET 224 >gi|307180277|gb|EFN68310.1| Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Camponotus floridanus] Length = 2214 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 22/202 (10%), Positives = 59/202 (29%), Gaps = 27/202 (13%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + + +++ + + + R Y ++ P Sbjct: 103 DASDSIGSKNFRSELNFVTKLLSDFTVDETTTRVAVVTFGGRGNV--YRNIDQISRHGPN 160 Query: 256 SNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + + + + T T A+ A L +E + K V ITDG Sbjct: 161 DHKCYLLNKQFGNITYSGGGTYTRGALLEALTILEKSREKA---------NKVVFLITDG 211 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVND 372 ++G + + ++N G +++ + + L G + ++ Sbjct: 212 FSNGG-------DPRPAADLLKNTGATVFTFGIRTG--NVEELHDIASPPGYTHSYLLDS 262 Query: 373 SRELLESFDKITDKIQEQSVRI 394 E F+ + + + ++ Sbjct: 263 FAE----FEALARRALHRDLKT 280 >gi|311265878|ref|XP_003130868.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Sus scrofa] Length = 944 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 50/132 (37%), Gaps = 11/132 (8%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NN+ + K ++ ++P T+ A+ L++ R ++F+TDG+ Sbjct: 353 NNVRDGKVYIHHMSPSGGTDINGALQTGIALLHDSVARHDLE---DRSVSLIVFLTDGKP 409 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL----LRKCTDSSGQFFAVND 372 + + + + I++V + + + L L C + D Sbjct: 410 TVGETHTPKILNNT--REAARGRVCIFTVGIGDDVDFRLLEKLSLENCGFTR-HVHEDED 466 Query: 373 S-RELLESFDKI 383 + +L+ +D+I Sbjct: 467 AGAQLIGFYDEI 478 >gi|198426622|ref|XP_002122673.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 721 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 62/200 (31%), Gaps = 27/200 (13%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---N 257 + + + + + +++ + R+ YN + LS N Sbjct: 528 SSSSVKRPNWNTMKQFVRSIITTFN-----FGENEARMAVFRYNRQVDTRNQILLSDHIN 582 Query: 258 NLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N +KL T T A+ HA + ++ +K ++ ITDG + Sbjct: 583 NKTTFLEAYDKLPYNGFGTFTGRALRHAKNVIL-----ANRNGNRPNVKDVILTITDGRS 637 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSGQFF-AVND 372 N I +R G+ + + + QD L + AV Sbjct: 638 QD--------NVATISTELREMGVTTFVIGIQPGNGAGLDQDQLLAMGGTPENTILAVGG 689 Query: 373 SRELLESF-DKITDKIQEQS 391 L +F ++++ I Sbjct: 690 FGGLDATFLNRLSASICTNR 709 >gi|298705152|emb|CBJ28595.1| von Willebrand A domain containing protein [Ectocarpus siliculosus] Length = 382 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 61/220 (27%), Gaps = 28/220 (12%) Query: 174 MTSNKYLLPPPPKKSFWSKN----TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK--- 226 P S +S TT + A + + L K Sbjct: 69 ADPTWAANCPCEAGSQYSFTMEGQTTVTANTVNVAVIIDASGSVGTADWELSKEFAKNTV 128 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAY 285 A ++NL G+ ++ +L + + ++ T+ + Sbjct: 129 ASFAEQNLFTNGGSASFAQFSSDASEGGTFYSLEDFNAFVDADTEYSGGTDIIAGIAKGR 188 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 L S F+I TDG + + R G +Y+V Sbjct: 189 ELLSA----------SPTTTSFMIVTTDGAAP---------DPQDEADAARAEGTILYAV 229 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 V + P + LL F +++ EL + D I Sbjct: 230 GVGSGPSEEILL-AIGGDEANVFDIDNFDELDVALDDIVS 268 >gi|94969532|ref|YP_591580.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94551582|gb|ABF41506.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 430 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 47/370 (12%), Positives = 98/370 (26%), Gaps = 37/370 (10%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQI----N 92 VL G + + + + K + + G Q + Sbjct: 24 VLLGALAFGQNTSQPQNQNSQPPAGQQGKSDGGLIMPIDDGSQPAQGQQQQNSAQPQPGQ 83 Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 K N + I +++ +P + ++ + Sbjct: 84 QGKTDNGLVMPIENGQSEAPVPKSPNQPGDTVNVPASSSRGNGQNPDSEVGGVYTFKKQV 143 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW----SKNTTKSKYAPAPAPANR 208 + Q+ + TS P++ Sbjct: 144 EEVRLHATVVDDRQRLITTLDKTSFTVYENGEPQQITSFRHEDIPVALGVVIDNSGSMRD 203 Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K + + NLV + N + + +N +Q ++++ ++K L K Sbjct: 204 KRPAVNAATINLV--------KASNPEDEVFVVNFNDDYYLDQDY--TDSVAKLKEALEK 253 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T Y A+ ++H KK + +TDGE+ + Sbjct: 254 YETRGGTALYDAVL---------ASNAHLMKAPKLEKKVLFIVTDGEDDASL----NTLE 300 Query: 329 LQICEYMRNAGMKIYSVAV-----SAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDK 382 I + + G IY++ + Q LR+ + G F E+ + Sbjct: 301 QTIRKVQQENGPTIYTIGILDETGGHKRRAQRALREMAESTGGVAFFPQSLDEVSRITQQ 360 Query: 383 ITDKIQEQSV 392 I I+ Q Sbjct: 361 IAHDIRNQYT 370 >gi|198421591|ref|XP_002123589.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 401 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 34/384 (8%), Positives = 101/384 (26%), Gaps = 41/384 (10%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + I + ++ + + Sbjct: 28 ALRYLQSPVVRAVSNSASSTNPGQIPSGTSPSYFGYDFHLGKTGNQQLRFTVGAPKSIDP 87 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIP-SALTNLSLRSTGIIERSSENLAISICMVLDVS 157 + A T +L + + + + + +DVS Sbjct: 88 SQPSLQTTPSASAAPATGDLLECPVSNLFSSNPPPRPACNSRNPPGAQRGDAFGLSVDVS 147 Query: 158 R----SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK-------------YA 200 S Q++ +++ S Y + S W+ +K Sbjct: 148 PNGRLSACSPTKQQNCPPDSIYSPGYCYNSMNRGSTWAPGPETNKIRCPIIDLDMLFVLD 207 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA------YNIGIVGNQCTP 254 + + +++ + NS + + ++ I Sbjct: 208 GSGSVGKDNFEIVKNWTIKVANSFDISDGYTQVGVIQYSHYWATEPLDKQSYIKTEVPLG 267 Query: 255 LSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 N E + + ++ + T T A++ + + K +I +TD Sbjct: 268 KYRNKQEFSAAVRNISLHEYTTYTAHALNKTVFDFQQSSR-----WNRPKTSKVLILLTD 322 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT---DSSGQFFAV 370 G ++ ++ N + R+ + ++V V + L++ ++ + + Sbjct: 323 GLSTDKQLLPSSANYV------RSLNITTFAVGVG--EANEKELQEIANGQGTNERVYYT 374 Query: 371 NDSRELLESFDKITDKIQEQSVRI 394 ++ L + ++ I + + Sbjct: 375 SNFAGLNKIVSQLRSAILNFVLEV 398 >gi|219848163|ref|YP_002462596.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219542422|gb|ACL24160.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 914 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 23/135 (17%), Positives = 48/135 (35%), Gaps = 26/135 (19%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++ L++L TN + A + S + VI +TDG Sbjct: 477 TIEDALSRLVAGGGTNIRSGIALAAETIAT----------SQARIRHVILLTDGV----- 521 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + + +R G+ + ++A+ L R G+++ V +L + Sbjct: 522 ---SETEYADLVADLRAQGITVSAIAIGLD-TDPALERVAQIGGGKYYLVQRVPDLPQV- 576 Query: 381 DKITDKIQEQSVRIA 395 + E++VR+A Sbjct: 577 ------VLEETVRVA 585 >gi|315172113|gb|EFU16130.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1342] Length = 1103 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 97/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 211 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|307274607|ref|ZP_07555787.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2134] gi|306508759|gb|EFM77849.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX2134] Length = 1103 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 151 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 208 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 209 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|114707049|ref|ZP_01439948.1| Von Willebrand domain containing protein [Fulvimarina pelagi HTCC2506] gi|114537599|gb|EAU40724.1| Von Willebrand domain containing protein [Fulvimarina pelagi HTCC2506] Length = 317 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 21/146 (14%), Positives = 49/146 (33%), Gaps = 29/146 (19%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ ++ V +++ +T + A + L S + VI Sbjct: 148 PLTFDVAAVSRIVDEATIGISGRSTAIAGGLGLALKRL----------KRSDAQSRVVIL 197 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA----------PPEGQDLLRKC 360 ++DG ++ + + G+ ++++A+ LR Sbjct: 198 LSDGSDTSGN-----VLPRDSARLAEQLGVTVHTIALGPEDMETAPQTRDAVDTATLRDI 252 Query: 361 TD-SSGQFFAVNDSRELLESFDKITD 385 + S G+ F V ++ +L +I Sbjct: 253 AELSGGRTFRVRNTDDLRAVTAEIDR 278 >gi|32477499|ref|NP_870493.1| hypothetical protein RB12409 [Rhodopirellula baltica SH 1] gi|32448053|emb|CAD77570.1| conserved hypothetical protein [Rhodopirellula baltica SH 1] Length = 1032 Score = 56.8 bits (135), Expect = 5e-06, Method: Composition-based stats. Identities = 48/379 (12%), Positives = 109/379 (28%), Gaps = 39/379 (10%) Query: 29 MQSALDA-AVLSGCASIVSDRTIKDPTTKKDQTS--TIFKKQIKKHL---KQGSYIRENA 82 +Q +D A + A ++D D T+ + + T + + + L + E Sbjct: 295 LQHKIDQPAAYTYEAEFIADEDEDDSLTQNNSATGYTYVRGKGRVLLIHGPEDIGDFELL 354 Query: 83 GDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERS 142 + + I +T L ++ Q + + ++ S IE Sbjct: 355 IATLRDSNIEVTPMPTTQLFGSL-AELQPYDAVILAGVARVSGDTTQTITSFSDDQIEML 413 Query: 143 SENLAISICMVLDVSRS---MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 N +L + + + + K+ + Sbjct: 414 VRNTQQLGAGLLMIGGPDSLGAGGWTGTELEKA-------MPVDFQIKNTKVQGVGALAL 466 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + + + + + + A + G G + N Sbjct: 467 IMHASEMAQGNYWQKQISIAAIEQLGSADKAGVVHWTMNGDKWLWGG--SKGMLEVGPNR 524 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + + + ++ P + PAM A L K +I I+DG+ Sbjct: 525 RAMLAAVGRMTPGDMPEFDPAMRMAVTGLVRT----------DASVKHLIIISDGDPGPP 574 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSA-PPEGQDLLRKCT-DSSGQFFAVNDSRELL 377 S + + ++ + I +VAV + L+ + G+++AV R L Sbjct: 575 SNS--------VIQAFKDNSITISTVAVESHGLSDSRRLQDIARATGGKYYAVKSGRALP 626 Query: 378 ESFDKITDKIQEQSVRIAP 396 F + ++ + P Sbjct: 627 GIFQREARRVTRPLIYEPP 645 >gi|86143680|ref|ZP_01062056.1| aerotolerance-related membrane protein [Leeuwenhoekiella blandensis MED217] gi|85829723|gb|EAQ48185.1| aerotolerance-related membrane protein [Leeuwenhoekiella blandensis MED217] Length = 349 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 33/243 (13%), Positives = 76/243 (31%), Gaps = 52/243 (21%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 L + SK A A +ID + ++N++ Sbjct: 79 IGTKLETVKREGVDVVFAVDVSKSMDAEDIAPSRIDKAKQLVTQIINNLGS--------- 129 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 R+G IAY P++ + + K LN +N ++ A+ A Sbjct: 130 DRVGIIAYAGSAYPQ--LPITTDYSSAKMFLNAMNTDMLSSQGTAIRDAIELAKTYYN-- 185 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ- 354 + + ++ I+DGE+ I E G++I+++ V + + Sbjct: 186 ----DEEQTNRVLVIISDGEDHAGEVAS-------IAESATEQGIRIFTIGVGSEAGDRI 234 Query: 355 ---------------------------DLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 L ++ G++ N ++E+++ +++ Sbjct: 235 PIKRNGVVQSYKKDQNGETVITKLDPATLQEIAAEADGEYINGNSTQEVVDKMASALNQM 294 Query: 388 QEQ 390 ++ Sbjct: 295 DKK 297 >gi|301619004|ref|XP_002938896.1| PREDICTED: von Willebrand factor A domain-containing protein 1-like [Xenopus (Silurana) tropicalis] Length = 496 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 73/198 (36%), Gaps = 26/198 (13%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + E GNL+ Q+ V+ + + V + Sbjct: 43 SSGSVSYYEFAKVREFIGNLLRPFTFGPQD-----VQASIVHISTNPVLEFPFNQYGSSQ 97 Query: 261 EVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E++ + + +TNT A+ + L++E+ S + K ++++TDG ++ Sbjct: 98 EIQRAIQNIKQRMGDTNTGKALSYIKENLFDERSGSRAE-----VPKVMVWVTDGLSTDD 152 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG----QFFAVNDSRE 375 + + +++ G+ ++ V+ L + F V+D Sbjct: 153 ISQP--------MQLLKDMGVTVFIVSTGRGNY--LELSAAASTPSDTHLHFVDVDDLHI 202 Query: 376 L-LESFDKITDKIQEQSV 392 + E D I + I+ + + Sbjct: 203 ITKELRDSIIELIRARRL 220 >gi|229116005|ref|ZP_04245400.1| hypothetical protein bcere0017_22960 [Bacillus cereus Rock1-3] gi|228667499|gb|EEL22946.1| hypothetical protein bcere0017_22960 [Bacillus cereus Rock1-3] Length = 452 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 26/216 (12%), Positives = 63/216 (29%), Gaps = 22/216 (10%) Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 P K S K++ ++ N ++ I + G+ Sbjct: 147 PKEKSLNVEILLDASGSMAGKVNGQVKMEAAKKAIYNYLDKIPNNANVMLRVYGHKGSNN 206 Query: 243 YNI----GIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 N PL + + L+ P T A+ + Sbjct: 207 ENDKSLSCGSSEVMYPLQPYKKEQFNAALSNFGPKGWTPLASAIESVNNDFKE------- 259 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQD 355 V ++DGE + + + + + + + + Q Sbjct: 260 -YTGEENLNVVYIVSDGEETCGG------DPVNAAKNLNQSSTHAVVNIIGFDVKNSEQQ 312 Query: 356 LLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 L+ ++ G + V+++ EL ++ +K +K+ ++ Sbjct: 313 QLKNTAEAGKGNYATVSNAEELYQTLNKEYEKLYKE 348 >gi|126662670|ref|ZP_01733669.1| hypothetical protein FBBAL38_04925 [Flavobacteria bacterium BAL38] gi|126626049|gb|EAZ96738.1| hypothetical protein FBBAL38_04925 [Flavobacteria bacterium BAL38] Length = 347 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 22/196 (11%), Positives = 65/196 (33%), Gaps = 30/196 (15%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +++ + ++N + + R+G + Y P++ + + K L Sbjct: 110 NRLEKTKQLVSQIINQLG---------NDRVGIVGYAGSAYP--ILPMTTDYSIAKMYLQ 158 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N ++ A + A + + + K +I ++DGE+ G Sbjct: 159 SMNTNMVSSQGTAFNDAIKLAVDYFDV-------KDTSKLIILVSDGEDHG-------EG 204 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL-----LESFDK 382 + + + G++I ++ V L+ + + + + + Sbjct: 205 ASEAIDLAKEKGVRILTIGVGTEKGALIPLKDNKGTISSYKKDQNGENVITKLYPDVLKN 264 Query: 383 ITDKIQEQSVRIAPNR 398 I K + + + + + Sbjct: 265 IATKTKSKYIFGSSTK 280 >gi|198426873|ref|XP_002129255.1| PREDICTED: similar to Collagen alpha-1(XIV) chain [Ciona intestinalis] Length = 725 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 41/357 (11%), Positives = 109/357 (30%), Gaps = 25/357 (7%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN 92 +D +++G + F + L + +Y N G + +I Sbjct: 30 VDQTLVAGSSGKPVVTAATQKLEAVPGFLNDFNLMTRIALVESNYGENNKGSLGGIWEIT 89 Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + +Q + T ++ TNL+ RS+ I I Sbjct: 90 TS------IQATLRGSLSSSVQTIVNEAGCILGQENTNLTNTDFSTPIRSAFAARIFIQQ 143 Query: 153 VLDVSRSM------EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 + S + + N T L+ + + Sbjct: 144 SVGSSAIPITLTQQATWWSTVYRPGANATKFIELVSQVENVTVGCATHKLDLWFVIDGSG 203 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 + +S L + ++ VR+G Y+ + N + +++ + Sbjct: 204 SVGFSNFQDSLRFLASLTKRFTIGP--DDVRVGFSVYSSTSTIHSHFNQHMNNSALEAEI 261 Query: 267 NKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + T+T A++ + E+ + S + + ++ +TDG++ + + Sbjct: 262 LGTSYTGGGTSTGRAINDVLNNGFVERNGARPA--SEGVPRILVVMTDGQSGDSVKTPS- 318 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + ++ AG+ ++ V + + + ++ ++ ++ LL + Sbjct: 319 -------DNVKAAGITVFGVGIGSGIDIAEVNEIASNPDSRYAYELTGFNLLNVLSQ 368 >gi|15965798|ref|NP_386151.1| putative signal peptide protein [Sinorhizobium meliloti 1021] gi|307311332|ref|ZP_07590975.1| putative signal peptide protein [Sinorhizobium meliloti BL225C] gi|307318865|ref|ZP_07598297.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|15075067|emb|CAC46624.1| Hypothetical signal peptide protein [Sinorhizobium meliloti 1021] gi|306895586|gb|EFN26340.1| conserved hypothetical protein [Sinorhizobium meliloti AK83] gi|306899633|gb|EFN30261.1| putative signal peptide protein [Sinorhizobium meliloti BL225C] Length = 444 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 24/122 (19%), Positives = 39/122 (31%), Gaps = 7/122 (5%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 AI AID A + ++QSALDAA L + T D + Sbjct: 26 AIAFVPIIGAAALAIDFAGAYFEAEKIQSALDAAALGSVRAYGEGATEDDAYDAAQKF-- 83 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 + + Q + T++ N + A++ + LFL+ Sbjct: 84 FWSNYALPQESVVDALAVATEPSTQALSVKFTRNVNED-----TATAEFVLDHNPLFLER 138 Query: 123 LI 124 L Sbjct: 139 LP 140 >gi|14042009|dbj|BAB55070.1| unnamed protein product [Homo sapiens] Length = 942 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 55/154 (35%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I + I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 330 RFSIIGFPNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 389 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 390 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 444 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 445 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|163849427|ref|YP_001637471.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222527431|ref|YP_002571902.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163670716|gb|ABY37082.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222451310|gb|ACM55576.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 851 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 36/300 (12%), Positives = 83/300 (27%), Gaps = 34/300 (11%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 A L S + + Q S + ++ L ++ G + ++ Sbjct: 306 RAALDAHISNGTRMILPIYDVAVGQGSNAAYRVVRFGL----FVMTAYGRERNRPYMDFI 361 Query: 95 KDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 + + P E+ + L S + + ++L Sbjct: 362 F--LGDPDRQGTACSATPPPPEDTSIVRLTGSVELWPEYQIVANDR-----PPVQYVVIL 414 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLP--PPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 D + SM + + N P P P + + R+I V Sbjct: 415 DATGSMSMNFNGQGKLNGQTRQCITGPPGWPFPDDPSCDGSPNFAWTPR----EERRIYV 470 Query: 213 LIESAGNLVNSIQ---KAIQEKKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSRLN 267 E+ L+ + ++ + +N + ++ S++ + + Sbjct: 471 AKEAVRLLIKQTNMPGNPGYDPTRPIDQMAIVWFNQDVPESRIMLGGFSSDPAALDRAV- 529 Query: 268 KLNPY----------ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 L+ TNT A + + L + ++ +I +TDG + Sbjct: 530 -LDAGKVNNDPYLSQGGTNTAGAFYRVGQLLGRAPTGTTQLGREWTYRRAIILVTDGLAN 588 >gi|312899566|ref|ZP_07758892.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0470] gi|311293245|gb|EFQ71801.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0470] Length = 1103 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 97/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 211 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|159038794|ref|YP_001538047.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157917629|gb|ABV99056.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 316 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 20/152 (13%), Positives = 44/152 (28%), Gaps = 24/152 (15%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 N P + + V + ++ L E+T T A+ + S + ++ Sbjct: 139 NVLVPPTKDRQAVVTAVDGLVLAESTATGEAVFTCLEAI----RSVPADGAAGIPPARIV 194 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD-------------- 355 ++DG + + + Q + ++A D Sbjct: 195 LLSDGYRTSGRSVEEAAAAAQAANVAVS------TIAFGTDGGQVDIGGQLQRVPVDRFA 248 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 L + G F+ EL + + + I Sbjct: 249 LAELAATTEGHFYEAASVNELKQVYQDMGSSI 280 >gi|73960093|ref|XP_855328.1| PREDICTED: similar to calcium activated chloride channel 4 [Canis familiaris] Length = 938 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 26/203 (12%), Positives = 70/203 (34%), Gaps = 29/203 (14%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 +++ + ++A + + +Q V + AY + + S N Sbjct: 313 SGSMNGFNRLNRMNQAAKHFL--LQTIENGSWVGMVHFDSTAYIKSNLIQIIS--SKERN 368 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + L T+ + A++ + + ++ +TDGE++ Sbjct: 369 NLLESLPT-TANGGTSICAGIKSAFQVIGEIYPQIDGSE--------IVLLTDGEDNT-- 417 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LL 377 ++ +G I+ +A+ P Q ++ T + G F +D + L+ Sbjct: 418 -------AKNCIGEVKQSGAIIHLIALG-PSADQAVIEMSTITGGNHFFASDEAQNNGLI 469 Query: 378 ESFDKIT---DKIQEQSVRIAPN 397 ++F + + +Q +++ Sbjct: 470 DAFGALASGNTDLSQQPLQLESK 492 >gi|147906540|ref|NP_001087052.1| von Willebrand factor A domain-containing protein 2 precursor [Xenopus laevis] gi|82235697|sp|Q6DCQ6|VWA2_XENLA RecName: Full=von Willebrand factor A domain-containing protein 2; AltName: Full=A domain-containing protein similar to matrilin and collagen; Short=AMACO; Flags: Precursor gi|50415038|gb|AAH77945.1| MGC80919 protein [Xenopus laevis] Length = 790 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 33/304 (10%), Positives = 84/304 (27%), Gaps = 31/304 (10%) Query: 87 QKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENL 146 + +D L + + T L ++ S Sbjct: 435 STPVFSDVRDDLPRLVVLLTGSKSQDSVTGPATYARDQEVFLIGVTSDSNKGEMAEIVGN 494 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 ++ ++ S + + + P + + + A Sbjct: 495 PLN---LVTYSNPQQLFNQLPQLQKRICSIDVQGCQAQPLDLAFVLD-------ASTAVG 544 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 K + L + + +IG + Y+ ++ + + + Sbjct: 545 QEKFNRLKNFVTMVSLQF-----DINRDVTQIGLVTYSSRPETVFGLDTHDSGSSLLQGI 599 Query: 267 NKLNPYENTN-TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + + T A+ Y ++ ++ + + K V+ ITDG Sbjct: 600 GRASYMGGSASTGSALLRVYNDVMTVQKGARPGVN-----KAVVVITDGRG--------A 646 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + + +R+ G+ +Y + + + LLR S +V L D + Sbjct: 647 EDAAVPAQKLRDNGIMVYVIGIG-NIQRNSLLR-LAGSEKFLISVPSYESLGHYEDSVVQ 704 Query: 386 KIQE 389 ++ E Sbjct: 705 RVCE 708 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 19/166 (11%), Positives = 56/166 (33%), Gaps = 23/166 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + + +R+G + Y+ S + +K ++ + T T A+ + + Sbjct: 81 DIGSDLIRVGAMQYSGAPQVEFRLDSSFSKAAIKEKIKSIVFKGGPTETGLALKYIVWKG 140 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + K +I ++DG++ G N ++ +++++V V Sbjct: 141 FPGGRPA-------SVPKILIIVSDGKSQG--------NIKLPAAQIKGEDIEVFTVGVK 185 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSV 392 P + L + ++ + + + S+ Sbjct: 186 FPRWEE--LHALSSEPQEAHVLFAEHVD---DAVNGLATSLTNSSL 226 >gi|227518343|ref|ZP_03948392.1| pilus subunit protein [Enterococcus faecalis TX0104] gi|227074216|gb|EEI12179.1| pilus subunit protein [Enterococcus faecalis TX0104] Length = 1103 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 98/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGTQWDGQ 210 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 211 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S + Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDSYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|315168359|gb|EFU12376.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1341] Length = 1103 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 97/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 211 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|307271265|ref|ZP_07552544.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0855] gi|306512014|gb|EFM81005.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0855] Length = 1103 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 97/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 211 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|227818462|ref|YP_002822433.1| hypothetical protein NGR_b02140 [Sinorhizobium fredii NGR234] gi|227337461|gb|ACP21680.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 440 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 55/171 (32%), Gaps = 2/171 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTA+++ V FL + A+++A+ ++MQ ALDAA + S + + + Sbjct: 22 MTALLVPVLFLSGSVALNIANATREASKMQDALDAAAIKAVRSYGEGESEN--AVRTEAN 79 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 F G + ++ Y A+ + + Sbjct: 80 RLFFANFQTPSATDGYNSASPESPAVEFTFSETGQETRASASYAAQYNPVFWGLQPFVIS 139 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 + + + LT+ + +S + +S +D S Sbjct: 140 RRSVAARLTDREACILALHPTASRAIEVSGSAAVDTSNCTITSNSDDAQSI 190 >gi|156402479|ref|XP_001639618.1| predicted protein [Nematostella vectensis] gi|156226747|gb|EDO47555.1| predicted protein [Nematostella vectensis] Length = 412 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 40/363 (11%), Positives = 89/363 (24%), Gaps = 32/363 (8%) Query: 34 DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI 93 DAA ++ + + T + T+ K + + R + ++ Sbjct: 69 DAASITAAVNATKYPGM---GTYAGKGLTLAKDDLYSAPVRSGVPRILIVMTDGISSDDV 125 Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 E A + + S + + + I Sbjct: 126 AGPAKALRDMGVEIFALGIGKNYDQGQLDSMGSDPKPDHVVTADFDKLDPVIQTIKDKAC 185 Query: 154 --LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 +D ++ + + + K + Sbjct: 186 NGIDKEQTESHSASPVDDGALKTKNQNRFRSSSVTVCKAKVDVGFLLDGSGSVEFYAKGN 245 Query: 212 V--LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 V S G + ++ + + + +N Sbjct: 246 FQRCKNFINKFVKSF-----MVSKDDSHFGLVLFSSDSNVEFKFDDHYDAASITAAVNAT 300 Query: 270 N-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 N P T + A +LY+ S + + +I +TDG +S A Sbjct: 301 NYPGMGTYAGKGLTLAKDDLYSAPVRS-------GVPRILIVMTDGISSDDVAGP----- 348 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS--SGQFFAVNDSRELLESFDKITDK 386 + +R+ G++I+++ + Q L D +L I DK Sbjct: 349 ---AKALRDMGVEIFALGIGKN-YDQGQLDSMGSDPKPDHVVTA-DFDKLDPVIQTIKDK 403 Query: 387 IQE 389 + Sbjct: 404 ACK 406 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 51/180 (28%), Gaps = 25/180 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-P 271 V S G + ++ + + + +N P Sbjct: 29 CKNFINKFVKSF-----MVSKDDSHFGLVLFSSDSNVEFKFDDHYDAASITAAVNATKYP 83 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A +LY+ S + + +I +TDG +S A Sbjct: 84 GMGTYAGKGLTLAKDDLYSAPVRS-------GVPRILIVMTDGISSDDVAGP-------- 128 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS--SGQFFAVNDSRELLESFDKITDKIQE 389 + +R+ G++I+++ + Q L D +L I DK Sbjct: 129 AKALRDMGVEIFALGIGKN-YDQGQLDSMGSDPKPDHVVTA-DFDKLDPVIQTIKDKACN 186 >gi|305665950|ref|YP_003862237.1| aerotolerance-related membrane protein [Maribacter sp. HTCC2170] gi|88710725|gb|EAR02957.1| aerotolerance-related membrane protein [Maribacter sp. HTCC2170] Length = 349 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 20/166 (12%), Positives = 54/166 (32%), Gaps = 40/166 (24%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 P++ + K L +N T+ A++ A + + + + I+ Sbjct: 144 LPITTDYGAAKMFLQNMNTNMLTSQGTAINEAIELATTYYD------DEEQTNRVLFIIS 197 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------------- 352 DGE + TL+ + G++I+++ V Sbjct: 198 DGE------DHSEGTTLKAVDDAIEEGIQIFTIGVGKSKGAPIPIKRNGVVESLKKDRQG 251 Query: 353 -------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 + +L+ G++ +++ + +E +I ++ + Sbjct: 252 EVVITKLNETILQDIASEGEGEYIDGSNTSDAVEQIKEILLQMDKT 297 >gi|307278884|ref|ZP_07559945.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] gi|306504433|gb|EFM73642.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0860] Length = 1103 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 97/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 211 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|325959993|ref|YP_004291459.1| Magnesium chelatase [Methanobacterium sp. AL-21] gi|325331425|gb|ADZ10487.1| Magnesium chelatase [Methanobacterium sp. AL-21] Length = 711 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 19/164 (11%), Positives = 56/164 (34%), Gaps = 15/164 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + ++ + + + I++ ++ + + P + K ++ L Sbjct: 537 MYGEKKAVR-VKDILNNLIEDAARNGDKVSVVGFKGKDAL-IIIPTTRRAVSFKEQIENL 594 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T M + L EK ++ +TDG + A + + + Sbjct: 595 KIGGTTPLASGMKRGFEILKKEKFR-------DEYVPMMLILTDGMPNVAISKSPVDDAI 647 Query: 330 QICEYMRNAGMKIYSVAVSAPP---EGQDL-LRKCTDSSGQFFA 369 I ++ +I+++ ++ G+D+ + S G+++ Sbjct: 648 DIAGSLKEN--EIHTIIINFEQAVKYGRDMNMELAVASGGRYYD 689 >gi|49225581|ref|NP_990438.1| collagen alpha-1(VI) chain precursor [Gallus gallus] gi|115314|sp|P20785|CO6A1_CHICK RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|62875|emb|CAA41062.1| collagen alpha 1 type VI [Gallus gallus] gi|63302|emb|CAA45788.1| collagen type VI alpha 1 subunit [Gallus gallus] gi|211354|gb|AAB59954.1| alpha-1 type VI collagen precursor [Gallus gallus] Length = 1019 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 64/211 (30%), Gaps = 25/211 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 T A P + + + ++ + + + + + A + Sbjct: 44 TSESVALRVKPFGDLVAQVKDFTNRFIDKLTE-RYFRCDRFLAWNAGALHYSDSVVIIKD 102 Query: 255 LS---NNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 L+ + E+K+ ++ +N T+T A+ L SH K++I Sbjct: 103 LTAMPSGRAELKNSVSAINYIGKGTHTDCAIKQGIERLLLG--GSHLKEN-----KYLIV 155 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ--FF 368 +TDG ++ G+K++SVA+ +P L F Sbjct: 156 VTDGHPLEGYKEPCG-GLDDAANEAKHLGIKVFSVAI-SPHHLDQRLNIIATDHAYRRNF 213 Query: 369 AVND---------SRELLESFDKITDKIQEQ 390 + + I D +++ Sbjct: 214 TATSLKPTRDLDVEETINNIIEMIKDNMEQS 244 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 64/201 (31%), Gaps = 25/201 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + ++ D L +A + SVR+ + Y+ P N Sbjct: 829 VDSSTSVGSKNFDTTKNFVKRLAERFLEAS-KPAEDSVRVSVVQYSGRNQQKVEVPFQRN 887 Query: 259 LNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + ++ + T+ A+ + S V+ +DG + Sbjct: 888 YTVIAKAVDNMEFMNEATDVNAALQYIMGLYQRSSRSGAKKK--------VLVFSDGNSQ 939 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF---------F 368 G + + ++ AG+++Y +AV + ++ T S + F Sbjct: 940 G----ITARAIERTVQEVQQAGIEVYVLAVGSQVNEPNVRVLVTGKSTNYDVAYGERHLF 995 Query: 369 AVNDSRELLE--SFDKITDKI 387 V D LL + ++ KI Sbjct: 996 RVPDYTSLLRGVFYQTVSRKI 1016 >gi|326927638|ref|XP_003209998.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Meleagris gallopavo] Length = 881 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 21/133 (15%), Positives = 47/133 (35%), Gaps = 11/133 (8%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 NL+E + + ++ TN Y M L E + S +I +TDG+ + Sbjct: 342 NLDEARKFVRGIDTKGLTNLYGGMMKGIDMLNAAHEGNLVPKRS---ASIIIMLTDGQPN 398 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ-----FFAVND 372 NT + + +Y++ + L K + + + Sbjct: 399 VG--ISNTQDIQTHVKKAIEGKYTLYNLGFGYGV-DYNFLEKMALENKGLARRIYPDSDS 455 Query: 373 SRELLESFDKITD 385 + +L +D++++ Sbjct: 456 ALQLQGFYDEVSN 468 >gi|163852924|ref|YP_001640967.1| hypothetical protein Mext_3511 [Methylobacterium extorquens PA1] gi|163664529|gb|ABY31896.1| hypothetical protein Mext_3511 [Methylobacterium extorquens PA1] Length = 473 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 16/94 (17%), Positives = 34/94 (36%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ S + AID A R +QSA+DA VL+G ++ + + Sbjct: 34 IFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGGNALKLVVSSSESIVGLTTQ 93 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 + + + + + + +A+ Sbjct: 94 TIQAEAKAGADAPVSIQVTVASDKTSVEARAEQV 127 >gi|119358219|ref|YP_912863.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] gi|119355568|gb|ABL66439.1| von Willebrand factor, type A [Chlorobium phaeobacteroides DSM 266] Length = 337 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 22/167 (13%), Positives = 57/167 (34%), Gaps = 41/167 (24%) Query: 254 PLSNNLNEVKSRLN----KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 P++ + ++ L +L + T A++ A R + K ++ Sbjct: 144 PMTTDHAAFEALLGMVSTELVSDQGTAFDSALNLAMRLFERTEPPGDVKEVQGE--KVIV 201 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP------------------- 350 ++DGEN + + + ++ +G+ ++++ + P Sbjct: 202 LLSDGENHSGNF-------RAVADALKQSGVSVFTIVLGKPLPAAIPLGQSSGVKKDAAG 254 Query: 351 ------PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + + R DS G FF ++ +D++ ++I Sbjct: 255 KIVKTRSSPETMRRLAGDSGGTFFDASEDD---AVYDRVAERISTLV 298 >gi|256820365|ref|YP_003141644.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] gi|256581948|gb|ACU93083.1| von Willebrand factor type A [Capnocytophaga ochracea DSM 7271] Length = 347 Score = 56.8 bits (135), Expect = 6e-06, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 65/176 (36%), Gaps = 42/176 (23%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 R+G +AY L+ + + K L +N ++ A+ A R N + Sbjct: 130 RVGIVAYAASAYPQ--LALTTDHSAAKMFLQDMNTDMLSSQGTAIQEAIRMASNYFD--- 184 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---- 352 +T + + +TDGE+ + + +I + G+ IY++ + Sbjct: 185 ---ENTPTARLLFILTDGED-------HEMGATEIATEAQEKGVHIYTIGIGTEKGAPIP 234 Query: 353 ----------------------GQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 ++LL++ + G++ +++++++ +KI D Sbjct: 235 IKEGGGQTYKRDRNGEVVITKLNRELLQQIAINAGGEYLDGDNTQKVVSQINKILD 290 >gi|226290246|gb|EEH45730.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18] Length = 757 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 31/208 (14%), Positives = 64/208 (30%), Gaps = 15/208 (7%) Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANR---KIDVLIESAGNLVNSIQKAIQEKKNLSV 236 + P + + S AP P K + S +L + I E N + Sbjct: 68 DICHVPCDIVLCIDVSGSMQLSAPLPTTNESGKREETGLSVLDLTKHAARTIIETLNEND 127 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNE--VKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 R+G + ++ ++ N+ + L P +TN + + L Sbjct: 128 RLGVVTFSNDAEVAYKISHMDDTNKKAALEAVEALQPLASTNLWHGLKLGLNVLGEVDLR 187 Query: 295 SHNTIGSTRLKKFVIFITDGENS-GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 N + + +TDG+ + I E ++ I++ Sbjct: 188 PRNV-------QALYVLTDGQPNHMCPTQGYVPKLRPILERQKDRLPLIHTFGFGYDIRS 240 Query: 354 QDLLRKCTD-SSGQFFAVNDSRELLESF 380 LL+ + G + + D+ + F Sbjct: 241 -GLLQSIAEVGGGTYSFIPDAGMIGTVF 267 >gi|332833576|ref|XP_003312497.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Pan troglodytes] Length = 728 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 116 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 175 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 176 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 230 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 231 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 264 >gi|332828898|gb|EGK01581.1| hypothetical protein HMPREF9455_02113 [Dysgonomonas gadei ATCC BAA-286] Length = 609 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 34/284 (11%), Positives = 78/284 (27%), Gaps = 26/284 (9%) Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + ++ A Y + L + +E + Sbjct: 153 PLSTFSVDVDA-ASYSNMRRFINRGELPYKDAVRIEELINYFSYDYAEPAGNDPVRITTE 211 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 S + ++ K L S + ++D++ S Sbjct: 212 VGSC--PWNSQNRLVKVGLKAKSLASDNLPASNFVFLID----VSGSMSGPTRLDLVKSS 265 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 LVN+++K R+ + Y + N ++K LN L+ +T Sbjct: 266 LKLLVNNLRKKD--------RVAIVVYASSTGEVLPSTSGENKQKIKEALNNLSAGGSTA 317 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + AY+ N +I TDG+ + + ++ E R Sbjct: 318 GGAGIQLAYKIAKQNFIKGGNNR--------IILCTDGDFNVGV--SSNEGLQRLIENER 367 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLES 379 G+ + + ++ + G +++ +E + Sbjct: 368 KTGVFLSILGYGMGNYKDSKMQTLAQAGNGNHAYIDNLQEANKV 411 >gi|312434033|ref|NP_116206.4| inter-alpha-trypsin inhibitor heavy chain H5 isoform 2 [Homo sapiens] Length = 728 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 116 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 175 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 176 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 230 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 231 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 264 >gi|310703621|ref|NP_085046.5| inter-alpha-trypsin inhibitor heavy chain H5 isoform 1 precursor [Homo sapiens] Length = 942 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 389 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 390 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 444 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 445 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|242097658|emb|CAY86115.1| truncated collagen type VI alpha 4 precursor [Homo sapiens] Length = 385 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/171 (13%), Positives = 56/171 (32%), Gaps = 16/171 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYREL 288 + ++R+G Y+ + +V + + A+ L Sbjct: 62 NVSSETIRVGLAKYSDVPHSEFLLSTYHRKGDVLRHIRQFQFKPGGKKMGLALKFI---L 118 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + S + + + I+ G + +R AG+ +Y++ V Sbjct: 119 DHHFQEASGSRASQEVPQIAVVISSGPVEDHVHGP--------AKALRKAGILLYAIGV- 169 Query: 349 APPEGQDLLRKCTDSSGQFF--AVNDSRELLESFDKITDKIQEQSVRIAPN 397 LR+ S + F V + L K+ ++ + + AP+ Sbjct: 170 -RDAVWAELREIASSPQENFTSFVPNFSGLSNLAQKLRQELCDTLAKAAPH 219 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 21/114 (18%), Positives = 42/114 (36%), Gaps = 12/114 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + VR+G YN I + + ++ L TNT A+ Sbjct: 265 DISSDRVRVGLAQYNDNIYPAFQLNQHPLKSMILEQIQNLPYRTGGTNTGSALEFIRTNY 324 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 E+ S R+ + VI +TDGE++ ++ + ++ G+ + Sbjct: 325 LTEESGSRAK---DRVPQIVILVTDGESND--------EVQEVADRLKEDGVVV 367 >gi|255530103|ref|YP_003090475.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255343087|gb|ACU02413.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 613 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 37/302 (12%), Positives = 83/302 (27%), Gaps = 29/302 (9%) Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + ++ A Y + G+ P + + ++ Sbjct: 153 PLSTFAVDVDA-ASYSNVRRFINNGGMPPKDAVRIEEMINYFDYEYPQPKGNDPVNIVTE 211 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 K K + S S P K+ +LI S Sbjct: 212 IADAPWNANHKLVQI--GLQGKKIPTDNLPASNLVFLIDVSGSMNQP----NKLPLLIAS 265 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 L ++ ++ + Y + N ++K LNKL+ +T Sbjct: 266 FKLLTEQLRPED--------KVAIVVYAGNSGLVLPSTPGNEKTKIKEALNKLSAGGSTA 317 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + AY+ + N +I TDG+ + + + + E R Sbjct: 318 GGAGIQLAYQVATDNFIKGGNNR--------IILATDGDFNVG--ASSDKDMESLIEEKR 367 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 +G+ + + + D G + +++ + E+ + ++ IA Sbjct: 368 KSGVFLTVLGYGMGNMKDSKMETLADKGNGNYAYIDN---ISEARKVLINEFGGTLFTIA 424 Query: 396 PN 397 + Sbjct: 425 KD 426 >gi|187609608|sp|Q86UX2|ITIH5_HUMAN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor Length = 942 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 389 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 390 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 444 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 445 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|119606788|gb|EAW86382.1| inter-alpha (globulin) inhibitor H5, isoform CRA_d [Homo sapiens] Length = 735 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 389 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 390 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 444 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 445 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|119606787|gb|EAW86381.1| inter-alpha (globulin) inhibitor H5, isoform CRA_c [Homo sapiens] Length = 748 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 136 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 195 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 196 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 250 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 251 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 284 >gi|119606785|gb|EAW86379.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Homo sapiens] gi|168275576|dbj|BAG10508.1| inter-alpha trypsin inhibitor heavy chain precursor 5 isoform 1 [synthetic construct] Length = 942 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 389 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 390 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 444 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 445 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|55958059|emb|CAI12954.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958529|emb|CAI16361.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] Length = 577 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 205 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 264 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 265 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 319 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 320 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 353 >gi|55958058|emb|CAI12953.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958528|emb|CAI16360.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] Length = 742 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 116 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 175 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 176 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 230 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 231 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 264 >gi|37181977|gb|AAQ88792.1| LLLL311 [Homo sapiens] Length = 694 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 389 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 390 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 444 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 445 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|30314037|gb|AAO49812.1| inter-alpha trypsin inhibitor heavy chain precursor 5 [Homo sapiens] Length = 942 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 389 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 390 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 444 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 445 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|55958060|emb|CAI12955.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|55958531|emb|CAI16363.1| inter-alpha (globulin) inhibitor H5 [Homo sapiens] gi|189442558|gb|AAI67770.1| Inter-alpha (globulin) inhibitor H5 [synthetic construct] Length = 956 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 389 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 390 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 444 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 445 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|49355778|ref|NP_001001851.1| inter-alpha-trypsin inhibitor heavy chain H5 isoform 3 precursor [Homo sapiens] gi|119606789|gb|EAW86383.1| inter-alpha (globulin) inhibitor H5, isoform CRA_e [Homo sapiens] Length = 702 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 330 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 389 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 390 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 444 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 445 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 478 >gi|18916771|dbj|BAB85539.1| KIAA1953 protein [Homo sapiens] Length = 824 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 212 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 271 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 272 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 326 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 327 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 360 >gi|14042797|dbj|BAB55397.1| unnamed protein product [Homo sapiens] Length = 397 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 25 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 84 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 85 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 139 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 140 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEI 173 >gi|14042827|dbj|BAB55409.1| unnamed protein product [Homo sapiens] Length = 397 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 56/154 (36%), Gaps = 12/154 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 25 RFSIIGFSNRIKVWKDHLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 84 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R ++F+TDG+ + + + + I+++ + + Sbjct: 85 HSGI---GDRSVSLIVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 139 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + L L C + + +L+ +D+I Sbjct: 140 RLLEKPSLENCGLTRRVHEEEDAGSQLIGFYDEI 173 >gi|293383944|ref|ZP_06629845.1| von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|293387429|ref|ZP_06631983.1| von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|312908057|ref|ZP_07767039.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|312910783|ref|ZP_07769620.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] gi|291078704|gb|EFE16068.1| von Willebrand factor type A domain protein [Enterococcus faecalis R712] gi|291083151|gb|EFE20114.1| von Willebrand factor type A domain protein [Enterococcus faecalis S613] gi|310625945|gb|EFQ09228.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 512] gi|311288927|gb|EFQ67483.1| von Willebrand factor type A domain protein [Enterococcus faecalis DAPTO 516] Length = 1103 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 97/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGAQWDGQ 210 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 211 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNISKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|332560892|ref|ZP_08415210.1| von Willebrand factor, type A [Rhodobacter sphaeroides WS8N] gi|332274690|gb|EGJ20006.1| von Willebrand factor, type A [Rhodobacter sphaeroides WS8N] Length = 341 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 54/156 (34%), Gaps = 31/156 (19%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYE-NTNTYPAMHHAYRELYNEKE 293 RIG + + PL+ +L V + + +T + A + + Sbjct: 146 RIGLVLFANRAYVAA--PLTFDLAAVGRAIEEASIGITGRSTAIADGLGLALKRVTESGA 203 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-- 351 +S + ++ ++DG+++ ++ Q+ G++I+++A+ Sbjct: 204 AS----------RVIVLLSDGQDNAHQ-----IDARQVAGLAARHGVRIHTIALGPDDLE 248 Query: 352 --------EGQDLLRKCTD-SSGQFFAVNDSRELLE 378 LR + S G+ + V +L Sbjct: 249 TRPAARDAVDTATLRAIAEASGGRSYRVRGMEDLRA 284 >gi|325673437|ref|ZP_08153128.1| type II secretion system protein [Rhodococcus equi ATCC 33707] gi|325555458|gb|EGD25129.1| type II secretion system protein [Rhodococcus equi ATCC 33707] Length = 623 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 22/148 (14%), Positives = 58/148 (39%), Gaps = 16/148 (10%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + L+ + ++ R++ L N+ ++ A L + ++ Sbjct: 128 SSTPQVLSELTTDSEDLLRRIDGLKAGGNSAIADSVVTAAEMLERGEAAN---------- 177 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSS 364 ++ +TDG ++ ++ + L ++ + + +Y+V +S P LL++ +S Sbjct: 178 NILLLLTDGADTSSAHSMSELPSV-----LSRSRASLYAVQMSTPETNSALLQQVARESR 232 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQSV 392 GQ+ + D+ L + + V Sbjct: 233 GQYASAGDTAALGAIYQSAARALGNLYV 260 >gi|255557532|ref|XP_002519796.1| protein binding protein, putative [Ricinus communis] gi|223541035|gb|EEF42592.1| protein binding protein, putative [Ricinus communis] Length = 477 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 39/128 (30%), Gaps = 20/128 (15%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N+ ++++ +N L+ Y N + + L + + + + D Sbjct: 120 NSQKDLENLINGLHAYGAANITAGLQTGLKVLNDRRFTGGRVATIMLVSSSEQNNGDD-- 177 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSR 374 + + + +++ A +L+ G F V + Sbjct: 178 ---------------ADQILVGNVPVHTFGFGAYH-EPGVLKAIAHNSIGGTFSDVQNMD 221 Query: 375 ELLESFDK 382 L ++F + Sbjct: 222 NLNKAFSQ 229 >gi|312139258|ref|YP_004006594.1| type ii secretion system integral membrane subunit [Rhodococcus equi 103S] gi|311888597|emb|CBH47909.1| putative type II secretion system integral membrane subunit [Rhodococcus equi 103S] Length = 622 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 22/148 (14%), Positives = 58/148 (39%), Gaps = 16/148 (10%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + L+ + ++ R++ L N+ ++ A L + ++ Sbjct: 127 SSTPQVLSELTTDSEDLLRRIDGLKAGGNSAIADSVVTAAEMLERGEAAN---------- 176 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSS 364 ++ +TDG ++ ++ + L ++ + + +Y+V +S P LL++ +S Sbjct: 177 NILLLLTDGADTSSAHSMSELPSV-----LSRSRASLYAVQMSTPETNSALLQQVARESR 231 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQSV 392 GQ+ + D+ L + + V Sbjct: 232 GQYASAGDTAALGAIYQSAARALGNLYV 259 >gi|229495775|ref|ZP_04389503.1| BatB protein [Porphyromonas endodontalis ATCC 35406] gi|229317349|gb|EEN83254.1| BatB protein [Porphyromonas endodontalis ATCC 35406] Length = 338 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 22/101 (21%), Positives = 41/101 (40%), Gaps = 13/101 (12%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 P++ +L K ++ NP +N A+ A H + K +I T Sbjct: 144 LPITTDLAMAKKMVDDANPDMLSNQGTAIASAIDLSLGSFSDRH------DVGKAIILFT 197 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 DGEN + L+ + ++ G+K+Y++AV + Sbjct: 198 DGENHEG-------DALEAAKKAKSQGVKVYTIAVGSEEGA 231 >gi|145299122|ref|YP_001141963.1| von Willebrand factor type A domain-containing protein [Aeromonas salmonicida subsp. salmonicida A449] gi|142851894|gb|ABO90215.1| von Willebrand factor type A domain protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 331 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 29/207 (14%), Positives = 65/207 (31%), Gaps = 47/207 (22%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP--- 271 + ++ + + RI + + +PL+ + + +L+ Sbjct: 110 SAVRQQISRLIETRPG-----DRIALVVFADHAYL--LSPLTQETKALLTLTRELDFDLV 162 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T A+ A + ++ ++ ++ +TDG N+ S + L Sbjct: 163 GRTTALGEAILLARQHADPKRPTA------------LLLVTDGRNTAGS-----ADPLSE 205 Query: 332 CEYMRNAGMKIYSVAVSAPPE-----------------GQDLLRKCTDSS-GQFFAVNDS 373 +GM +Y++ V A P+ + LL++ G++F Sbjct: 206 ARRAAASGMTLYTLGVGADPDTFVEALQPAQSDPSAELDEALLQQLAKVGQGRYFRARTQ 265 Query: 374 RELLESFDKITDK--IQEQSVRIAPNR 398 +L + R PNR Sbjct: 266 GDLDAINQTLDTLEPAATARTRYLPNR 292 >gi|229113208|ref|ZP_04242703.1| hypothetical protein bcere0018_54170 [Bacillus cereus Rock1-15] gi|228670234|gb|EEL25582.1| hypothetical protein bcere0018_54170 [Bacillus cereus Rock1-15] Length = 425 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 62/202 (30%), Gaps = 22/202 (10%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI----GIVGNQC 252 S K++ ++ N ++ I + G+ N Sbjct: 134 SGSMAGKVNGEVKMEAAKKAIYNYLDKIPDNANVMLRVYGHKGSNNENDKSLSCGSSEVM 193 Query: 253 TPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 PL N + + L+K P T A+ + V + Sbjct: 194 YPLQPYNKEQFNAALSKFGPKGWTPLASAIESVNDDFKE--------YTGEENLNVVYIV 245 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCTDSS-GQFF 368 +DGE + + + + + + + + Q L+ ++ G + Sbjct: 246 SDGEETCGG------DPVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLKNTAEAGKGNYA 299 Query: 369 AVNDSRELLESFDKITDKIQEQ 390 V+ + EL ++ +K +K+ ++ Sbjct: 300 TVSSADELHQTLNKEYEKLYKE 321 >gi|149641794|ref|XP_001509032.1| PREDICTED: similar to integrin alpha E2 [Ornithorhynchus anatinus] Length = 1679 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 27/153 (17%), Positives = 52/153 (33%), Gaps = 14/153 (9%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + Y I S + N ++ + N T T A++H +E++N ++ S Sbjct: 200 VVQYGSVIQTEFDLLASRDANSSLQKVKNIKQVGNVTKTASAIYHVLKEIFNVEKGSRRQ 259 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-PEGQDLL 357 K +I +TDG+ +Q+ N + G++ +++ V + L Sbjct: 260 -----NSKIIIVLTDGDV-----FQDPKNLATVMNLPEMKGIERFAIGVGNEFSASKKTL 309 Query: 358 RKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 F V L + KI Sbjct: 310 ELIASEPHHSHKFRVEKFSGLDGLLRGLEQKII 342 >gi|73960091|ref|XP_537088.2| PREDICTED: similar to chloride channel calcium activated 4 [Canis familiaris] Length = 905 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 22/148 (14%), Positives = 55/148 (37%), Gaps = 25/148 (16%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKESS 295 G + + + N + L L T+ + ++ + + +++ Sbjct: 347 TGMVTFESSATIQNYLTEITDHNAYEKILANLPQAAGGGTSICSGLRAGFQAIIHSNQNT 406 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC-EYMRNAGMKIYSVAVSAPPEGQ 354 ++ +TDGE+ + + +C E ++ +G I+++A+ P Sbjct: 407 --------SGSEIVLLTDGED----------DNISLCFEEVKKSGSVIHTIALG--PSAA 446 Query: 355 DLLRKCTD-SSGQFFAVN-DSRELLESF 380 L ++ + G F N D L+++F Sbjct: 447 KELEILSNMTGGHRFYANKDINGLIDAF 474 >gi|330465656|ref|YP_004403399.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328808627|gb|AEB42799.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 410 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 30/199 (15%), Positives = 70/199 (35%), Gaps = 30/199 (15%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-------N 258 +I V ++ G +V+++ ++ L +R+ Y C N Sbjct: 35 GRSRISVAQQAFGEVVDAL----PDETQLGIRVLGATYRGEDKQQGCLDTQQIVPVGPVN 90 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 K+ + L P T A+ A ++L G + ++ ITDGE++ Sbjct: 91 RERAKAAVATLRPTGFTPVGLALREAAKDL-----------GGGTTARRIVLITDGEDT- 138 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPE-GQDLLRKCTDSSGQFFAVNDSRE 375 + Q+ + G + ++ ++ + + LL + G + A + + Sbjct: 139 ----CAPPDPCQVARELAAQGTTLVVDTLGLAPDEKVRRQLLCIAAATGGTYTAATSAED 194 Query: 376 LLESFDKITDKIQEQSVRI 394 L ++ D+ ++ Sbjct: 195 LTGRLKQLVDRARDTYATT 213 >gi|156383259|ref|XP_001632752.1| predicted protein [Nematostella vectensis] gi|156219812|gb|EDO40689.1| predicted protein [Nematostella vectensis] Length = 157 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 17/136 (12%), Positives = 50/136 (36%), Gaps = 17/136 (12%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + IG + + S + + + ++ + P T T A+ ++L Sbjct: 38 DISASGTHIGIVTFATDPTVELEFDQSFDNTSIATIIDNIRNPDALTFTGKALETVKKDL 97 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + + + +I +TDG ++ + + ++ +G+ +Y+V V Sbjct: 98 FEKSQRA-------NVHRMLIVLTDG--------RSWDAVQEPAKQLKESGVTLYAVGVG 142 Query: 349 APPEGQDLLRKCTDSS 364 + L+ + Sbjct: 143 QD-YDLEQLKDIASNP 157 >gi|114046077|ref|YP_736627.1| von Willebrand factor, type A [Shewanella sp. MR-7] gi|113887519|gb|ABI41570.1| von Willebrand factor, type A [Shewanella sp. MR-7] Length = 335 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 54/150 (36%), Gaps = 23/150 (15%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + + ++T+ A+ + + S Sbjct: 149 GDAAFIQTPFTADQQVWLSLLEEAQTGMAGQSTHLGDAIGLGIKVFEQNPQPSE------ 202 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-PEGQDLL---- 357 ++ +I +TDG ++G + + + + G+KIY++A+ P G+ + Sbjct: 203 --QQVMIVLTDGNDTG-----SFVEPVDAAKIAAARGIKIYTIAMGDPTHVGEQPMDMEV 255 Query: 358 --RKCTDSSGQFFAVNDSRELLESFDKITD 385 R + + F D EL +++ I Sbjct: 256 VQRVSQLTQARAFIAIDQAELDKAYQLIDK 285 >gi|242078369|ref|XP_002443953.1| hypothetical protein SORBIDRAFT_07g005010 [Sorghum bicolor] gi|241940303|gb|EES13448.1| hypothetical protein SORBIDRAFT_07g005010 [Sorghum bicolor] Length = 567 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 24/186 (12%), Positives = 55/186 (29%), Gaps = 28/186 (15%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN--LNEVKSRL 266 K+ +L ++ +++ + A R+ + ++ ++ K + Sbjct: 96 KLALLKQAMCFVIDQLGPAD--------RLSVVTFSNDASRLTRLARMSDAGKASAKIAV 147 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 L TN +H A L + +I ++DG ++ Sbjct: 148 ESLAVQGFTNIKQGIHVAAEVL--------AGRREKNVVAGMILLSDGHDNCGGTSVRPD 199 Query: 327 NTLQICEYM-----RNAG-----MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 T + AG I++ + + + G F V D + Sbjct: 200 GTKSYVNLVPPSLTVAAGSSRPAAPIHTFGFGTSHDAGAMHAVAEATGGTFSFVGDEAAI 259 Query: 377 LESFDK 382 +SF + Sbjct: 260 QDSFAR 265 >gi|307324435|ref|ZP_07603643.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] gi|306890166|gb|EFN21144.1| von Willebrand factor type A [Streptomyces violaceusniger Tu 4113] Length = 543 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 25/217 (11%), Positives = 65/217 (29%), Gaps = 20/217 (9%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 LL T ++ L + G L S ++ Sbjct: 335 AVADGLLASYENDLRRPSRTVYVLDTSGSMNG-DRLAQLKRALGQLAGSDVSPTGDRFRD 393 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNL-----NEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + + + + G + + +++ L +T + ++ AY L Sbjct: 394 REEVTLMPFGSRVKGVRTHTVPGERPAPVLAAIRADAEALTADGDTAIFSSLQAAYDHLA 453 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR--NAGMKIYSVAV 347 + + + ++ ++ +TDGEN+ + + +R ++ + Sbjct: 454 QRRSALGDDRFTS-----IVLMTDGENTTGATASDFDAYY---RRLRGPERTAPVFPIVF 505 Query: 348 SAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 + L+ + G+ F L +F++I Sbjct: 506 G--DSDRSQLQSIATLTGGRLFDAT-KGSLDGAFEEI 539 >gi|254562692|ref|YP_003069787.1| hypothetical protein METDI4317 [Methylobacterium extorquens DM4] gi|254269970|emb|CAX25948.1| hypothetical protein METDI4317 [Methylobacterium extorquens DM4] Length = 473 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 16/94 (17%), Positives = 34/94 (36%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ S + AID A R +QSA+DA VL+G ++ + + Sbjct: 34 IFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGGNALKLVVSSSESIVGLTTQ 93 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 + + + + + + +A+ Sbjct: 94 TIQAEAKAGADAPVSIQVTVASDKTSVEARAEQV 127 >gi|254428069|ref|ZP_05041776.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] gi|196194238|gb|EDX89197.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] Length = 657 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 59/160 (36%), Gaps = 21/160 (13%) Query: 237 RIGTIAYNIGIVGNQCTPLS---NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R ++ TP++ N+ + + ++ L T+ + Sbjct: 335 RFNITDFDSQHTLLFETPVTVSDNSRQQAQDFVDGLQASGGTH-----------MLPALS 383 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 ++ + S + VIFITDG S + + + A ++++V + + P Sbjct: 384 ATLSQPASDGYLRQVIFITDGAVGNESGIF-----RALHQQLGEA--RLFTVGIGSAPNS 436 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + R G F +ND ++ + D + +++ +R Sbjct: 437 HFMTRAAQFGRGSFTYINDQNQVQQGMDTLFRRLESPLMR 476 >gi|218531748|ref|YP_002422564.1| hypothetical protein Mchl_3818 [Methylobacterium chloromethanicum CM4] gi|218524051|gb|ACK84636.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 473 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 16/94 (17%), Positives = 34/94 (36%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ S + AID A R +QSA+DA VL+G ++ + + Sbjct: 34 IFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGGNALKLVVSSSESIVGLTTQ 93 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 + + + + + + +A+ Sbjct: 94 TIQAEAKAGADAPVSIQVTVASDKTSVEARAEQV 127 >gi|332846919|ref|XP_003315346.1| PREDICTED: integrin alpha-E [Pan troglodytes] Length = 1241 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 48/156 (30%), Gaps = 16/156 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++ +R+ + + T T AM H ++ S Sbjct: 305 LVQYGGVIQTEFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRK 364 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQD 355 K ++ +TDG +++ LN + + G++ +++ V Sbjct: 365 AS-----KVMVVLTDG-----GIFEDPLNLTTVINSPKMQGVERFAIGVGEEFKSARTAR 414 Query: 356 LLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 L F V + L K+ I Sbjct: 415 ELNLIASDPDETHAFKVTNYMALDGLLSKLRYNIIS 450 >gi|301609304|ref|XP_002934186.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 934 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 31/272 (11%), Positives = 82/272 (30%), Gaps = 30/272 (11%) Query: 132 SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS 191 S+ + ++E + M S +P F Sbjct: 243 SVSEFCDSKHNTEAPTLQNRMCESRSTWDVISNSTDIKSTVPRADFNIPVPSFSLLQFSE 302 Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 + + ++ +I +++ V I + + + + Sbjct: 303 RVVSLVLDVSGSMSSSNRIGRQLQAVELFVVQIIENGAHVG----IVKFSSSASVVSSLV 358 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 +++KS + + TN + + + ++ + Sbjct: 359 KINTQAQRDQLKSLIPR-TAGGGTNICAGIRAGIAL--------NKNFDGSSYGTEIVLL 409 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAV 370 TDGE++ ++ T ++G I+ +A+ P L + + G F Sbjct: 410 TDGEDNLDTSLCFKDIT--------DSGAIIHVIALG--PNAAKELETIANMTGGLRFNA 459 Query: 371 ND---SRELLESFDKI---TDKIQEQSVRIAP 396 D + EL+++F + +I +Q++++ Sbjct: 460 LDKVEANELIDAFSGLHSGNGEIIQQAIQLES 491 >gi|148728188|ref|NP_002199.3| integrin alpha-E precursor [Homo sapiens] gi|226694184|sp|P38570|ITAE_HUMAN RecName: Full=Integrin alpha-E; AltName: Full=HML-1 antigen; AltName: Full=Integrin alpha-IEL; AltName: Full=Mucosal lymphocyte 1 antigen; AltName: CD_antigen=CD103; Contains: RecName: Full=Integrin alpha-E light chain; Contains: RecName: Full=Integrin alpha-E heavy chain; Flags: Precursor Length = 1179 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 48/156 (30%), Gaps = 16/156 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++ +R+ + + T T AM H ++ S Sbjct: 243 LVQYGGVIQTEFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRK 302 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQD 355 K ++ +TDG +++ LN + + G++ +++ V Sbjct: 303 AS-----KVMVVLTDG-----GIFEDPLNLTTVINSPKMQGVERFAIGVGEEFKSARTAR 352 Query: 356 LLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 L F V + L K+ I Sbjct: 353 ELNLIASDPDETHAFKVTNYMALDGLLSKLRYNIIS 388 >gi|119610886|gb|EAW90480.1| integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Homo sapiens] Length = 1196 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 48/156 (30%), Gaps = 16/156 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++ +R+ + + T T AM H ++ S Sbjct: 260 LVQYGGVIQTEFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRK 319 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQD 355 K ++ +TDG +++ LN + + G++ +++ V Sbjct: 320 AS-----KVMVVLTDG-----GIFEDPLNLTTVINSPKMQGVERFAIGVGEEFKSARTAR 369 Query: 356 LLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 L F V + L K+ I Sbjct: 370 ELNLIASDPDETHAFKVTNYMALDGLLSKLRYNIIS 405 >gi|4406708|gb|AAB59359.2| integrin alpha E precursor [Homo sapiens] gi|109659254|gb|AAI17208.1| Integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Homo sapiens] gi|109730475|gb|AAI13437.1| Integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1; alpha polypeptide) [Homo sapiens] gi|313883908|gb|ADR83440.1| integrin, alpha E (antigen CD103, human mucosal lymphocyte antigen 1 [synthetic construct] Length = 1179 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 48/156 (30%), Gaps = 16/156 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++ +R+ + + T T AM H ++ S Sbjct: 243 LVQYGGVIQTEFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRK 302 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQD 355 K ++ +TDG +++ LN + + G++ +++ V Sbjct: 303 AS-----KVMVVLTDG-----GIFEDPLNLTTVINSPKMQGVERFAIGVGEEFKSARTAR 352 Query: 356 LLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 L F V + L K+ I Sbjct: 353 ELNLIASDPDETHAFKVTNYMALDGLLSKLRYNIIS 388 >gi|7239181|gb|AAF43107.1| HUMINAE [Homo sapiens] Length = 1127 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 48/156 (30%), Gaps = 16/156 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++ +R+ + + T T AM H ++ S Sbjct: 191 LVQYGGVIQTEFDLRDSQDVMASLARVQNITQVGSVTKTASAMQHVLDSIFTSSHGSRRK 250 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQD 355 K ++ +TDG +++ LN + + G++ +++ V Sbjct: 251 AS-----KVMVVLTDG-----GIFEDPLNLTTVINSPKMQGVERFAIGVGEEFKSARTAR 300 Query: 356 LLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 L F V + L K+ I Sbjct: 301 ELNLIASDPDETHAFKVTNYMALDGLLSKLRYNIIS 336 >gi|52548788|gb|AAU82637.1| cell surface protein [uncultured archaeon GZfos18H11] Length = 1359 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 62/167 (37%), Gaps = 11/167 (6%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + R+G + +N G + L + N+ ++K + +++ T M A E Sbjct: 1004 DHLEDDDRLGLVLFNTGAELAEPVSLVGAKNMQKLKGDVLEISATGGTRLSAGMQMA-TE 1062 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 LY+E + + + +IF+TD + + +L + N + + + Sbjct: 1063 LYDEF----LEVNQSEYENRIIFLTDAMPNSGQTSEESLLGMIEANA--NKNVYTTFIGI 1116 Query: 348 SAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +L+ T +++V+ + + E D + + V Sbjct: 1117 GVD-FNTELVEYITKIRGANYYSVHSATQFKERMDDEFEYMVTPLVF 1162 >gi|260892924|ref|YP_003239021.1| Protein of unknown function DUF2134, membrane [Ammonifex degensii KC4] gi|260865065|gb|ACX52171.1| Protein of unknown function DUF2134, membrane [Ammonifex degensii KC4] Length = 298 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 19/161 (11%), Positives = 43/161 (26%), Gaps = 28/161 (17%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A ++ +D ++ R ++ +A+DAA L+G + D + T Sbjct: 13 LVAAALTFLLGLAALVVDGGGLLLARERLVNAVDAAALAGVQFLPGDPSGAVQTALDYA- 71 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 A + P +A +P FL Sbjct: 72 ------------------------RLNGADPAQVTAEVEPDGRTLAVRADRSVPF---FL 104 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 ++ + ++ + + + D S Sbjct: 105 ARVLGLEKGEVKAQAKARVGAPEAVFGVVPLGIPDQSLVFG 145 >gi|239945536|ref|ZP_04697473.1| hypothetical protein SrosN15_31402 [Streptomyces roseosporus NRRL 15998] gi|239992004|ref|ZP_04712668.1| hypothetical protein SrosN1_32194 [Streptomyces roseosporus NRRL 11379] Length = 416 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 67/196 (34%), Gaps = 26/196 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ ++ ++++++ + + + PL + Sbjct: 48 GQSRMAAAKQAFNDVLDAVPEQVQLGIRTLGADYPGEDRKVGCKDTRQLYPVGPL--DRT 105 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 E K+ + L P T PA+ A +L + + ++ ITDGE++ Sbjct: 106 EAKTAVATLAPTGFTPIGPALLGAADDLEGGEG-----------SRRIVLITDGEDTCG- 153 Query: 321 AYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 L+ ++ + G + ++ + + + L + + G + AV EL Sbjct: 154 ----PLDPCEVAREIAARGTHLVVDTLGLVPNAKIRQQLTCIAEATGGTYTAVQHKEELS 209 Query: 378 ESFDKITDKIQEQSVR 393 ++ D+ E V Sbjct: 210 GRVKQLVDRAAEPVVT 225 >gi|297190882|ref|ZP_06908280.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] gi|197722677|gb|EDY66585.1| von Willebrand factor [Streptomyces pristinaespiralis ATCC 25486] Length = 518 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 28/180 (15%), Positives = 63/180 (35%), Gaps = 15/180 (8%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN----LNEVKS 264 +I L E+ L + G + ++ + L+ ++ Sbjct: 344 RITALRETIDGLAGGDDSPSGKFVRFYRGETLTVMRFGGRVLEERNITYDGPRDLDRLRG 403 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + +T + ++ HAYR + + S ++ +TDGEN+ Sbjct: 404 VVASDDFAGSTAIWSSLDHAYRAVARDLVDRPERRVS------IVLMTDGENNAGMDVDA 457 Query: 325 TLNTLQ-ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + + E R ++ Y++ + ++L R + G+ D LL +F +I Sbjct: 458 FVRAHARLPEDARR--VRTYTIRYG-EADTRELDRGARATGGRMVDATDRS-LLSAFKEI 513 >gi|118384116|ref|XP_001025211.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89306978|gb|EAS04966.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 631 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 20/155 (12%), Positives = 53/155 (34%), Gaps = 22/155 (14%) Query: 233 NLSVRIGTIAYNI--GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 N + RI I+++ I+ N +E+K + + +TN M + N Sbjct: 175 NENDRICLISFDSVEKILTPFLRNNLENKSELKKAIKNIVGRGSTNIEAGMEAGLWMIKN 234 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK----IYSVA 346 + ++DG++ + +++ ++ + + Sbjct: 235 --------RKEKNPITCMFLLSDGQDDSPQVDLRVQ------KLIQSYDIQDTFIVNTYG 280 Query: 347 VSAPPEGQDLLRKCTDS-SGQFFAVNDSRELLESF 380 A +R ++ G ++ + D +++ E F Sbjct: 281 YGADH-DATQMRNIAETHKGGYYYIEDVKKVSEWF 314 >gi|291382819|ref|XP_002708118.1| PREDICTED: polydom [Oryctolagus cuniculus] Length = 3569 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 45/132 (34%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + + T T A A + L + +E+S K + ITDG ++G Sbjct: 153 LSREIPAITYRGGGTYTKGAFQQAAQILRHSRENST---------KVIFLITDGYSNGG- 202 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R+ G++I++ L + + ++ E Sbjct: 203 ------DPRPIAASLRDFGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 251 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 252 -FEALARRALHE 262 >gi|124783268|ref|NP_073725.2| sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 precursor [Mus musculus] gi|171769535|sp|A2AVA0|SVEP1_MOUSE RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; AltName: Full=Polydom; Flags: Precursor gi|123210319|emb|CAM21214.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Mus musculus] gi|123229801|emb|CAM23597.1| sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Mus musculus] Length = 3567 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 45/132 (34%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + + T T A A + L + +E+S K + ITDG ++G Sbjct: 154 LSREIPAITYRGGGTYTKGAFQQAAQILRHSRENST---------KVIFLITDGYSNGG- 203 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R+ G++I++ L + + ++ E Sbjct: 204 ------DPRPIAASLRDFGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 252 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 253 -FEALARRALHE 263 >gi|118096699|ref|XP_414253.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Gallus gallus] Length = 886 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 47/133 (35%), Gaps = 11/133 (8%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 NL+E + + ++ TN Y + L E + S +I +TDG+ + Sbjct: 342 NLDEARKFVRSIDTEGMTNLYGGIMKGIDMLNAAHEGNLVPKRS---ASIIIMLTDGQPN 398 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ-----FFAVND 372 NT + + +Y++ + L K + + + Sbjct: 399 VG--ISNTQDIQTHVKKAIEGKYTLYNLGFGYGV-DYNFLEKMALENKGLARRIYPDSDS 455 Query: 373 SRELLESFDKITD 385 + +L +D++++ Sbjct: 456 ALQLQGFYDEVSN 468 >gi|26330612|dbj|BAC29036.1| unnamed protein product [Mus musculus] Length = 440 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 45/132 (34%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + + T T A A + L + +E+S K + ITDG ++G Sbjct: 154 LSREIPAITYRGGGTYTKGAFQQAAQILRHSRENST---------KVIFLITDGYSNGG- 203 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R+ G++I++ L + + ++ E Sbjct: 204 ------DPRPIAASLRDFGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 252 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 253 -FEALARRALHE 263 >gi|26342893|dbj|BAC35103.1| unnamed protein product [Mus musculus] Length = 848 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 45/132 (34%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + + T T A A + L + +E+S K + ITDG ++G Sbjct: 154 LSREIPAITYRGGGTYTKGAFQQAAQILRHSRENST---------KVIFLITDGYSNGG- 203 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R+ G++I++ L + + ++ E Sbjct: 204 ------DPRPIAASLRDFGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 252 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 253 -FEALARRALHE 263 >gi|11177164|gb|AAG32160.1|AF206329_1 polydom protein [Mus musculus] Length = 3567 Score = 56.5 bits (134), Expect = 7e-06, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 45/132 (34%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + + T T A A + L + +E+S K + ITDG ++G Sbjct: 154 LSREIPAITYRGGGTYTKGAFQQAAQILRHSRENST---------KVIFLITDGYSNGG- 203 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R+ G++I++ L + + ++ E Sbjct: 204 ------DPRPIAASLRDFGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 252 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 253 -FEALARRALHE 263 >gi|126464748|ref|YP_001045861.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] gi|126106559|gb|ABN79089.1| von Willebrand factor, type A [Rhodobacter sphaeroides ATCC 17029] Length = 328 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 54/156 (34%), Gaps = 31/156 (19%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYE-NTNTYPAMHHAYRELYNEKE 293 RIG + + PL+ +L V + + +T + A + + Sbjct: 133 RIGLVLFANRAYVAA--PLTFDLAAVGRAIEEASIGITGRSTAIADGLGLALKRVTESAA 190 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-- 351 +S + ++ ++DG+++ ++ Q+ G++I+++A+ Sbjct: 191 AS----------RVIVLLSDGQDNAHQ-----IDARQVAGLAARHGVRIHTIALGPDDLE 235 Query: 352 --------EGQDLLRKCTD-SSGQFFAVNDSRELLE 378 LR + S G+ + V +L Sbjct: 236 TRPAARDAVDTATLRAIAEASGGRSYRVRGMEDLRA 271 >gi|156741667|ref|YP_001431796.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156232995|gb|ABU57778.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 826 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 38/317 (11%), Positives = 83/317 (26%), Gaps = 77/317 (24%) Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + N + ++LDVS SM + + N P Sbjct: 391 TPRYQTYPDLNRPVQFLLILDVSGSMSWTFDGRGVQNGQTVF---CTNPSQGCVSVQTAW 447 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN---------- 244 ++ R+I + + V I + Q + + ++ Sbjct: 448 PNAQ--------ERRIYTAKQVLRSFVAQIDQDRQSGLRPYDTVRLVTFSGRLGSFVNSS 499 Query: 245 ----------IGIVGNQCTPLSNNLNEVKSRLNK--------LNPYENTNTYPAMHHAYR 286 + +N+ +++ +N T + A A + Sbjct: 500 GAVGDNNRALNDLTEVLPAGWTNDRATLEAAINSAGMVDGDPYMTAGATPSAVAFARASQ 559 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSG---------------------------- 318 N E + + ++ VIF+TDG + Sbjct: 560 VFANAPERAP---NGMKYRRVVIFVTDGVANVLRNGMQNNYGEGCQLGAENVGCQMGDPL 616 Query: 319 -ASAYQNTLNTLQICEYMRNAGMK-----IYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 + + + + ++ A ++ +Y VA+ + L + Sbjct: 617 PDGSLRPLNAMVAEAQALKEAYIRPSDGSVYVVAL-SGTFEATGLNLVASQPDYVKRADR 675 Query: 373 SRELLESFDKITDKIQE 389 S EL + FD I + Sbjct: 676 SEELQQIFDDIQVSAIQ 692 >gi|326670666|ref|XP_003199265.1| PREDICTED: collagen alpha-3(VI) chain [Danio rerio] Length = 1455 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 34/339 (10%), Positives = 84/339 (24%), Gaps = 33/339 (9%) Query: 73 KQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS 132 ++ +N + + + + Sbjct: 676 SPNFFLDTYQTKDEVLRAVNGLTLAGGRGLNTGAALTFMKNTVLSTARGSRAAQNVPQFL 735 Query: 133 LRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSK 192 + T R S V + + ++ N + S + Sbjct: 736 IVLTAGRSRDSVREPAVALKTEGVVPFGVGVKNADPKEIEAISHNPSFAFNVKEFSQLNT 795 Query: 193 N----TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA----------------IQEKK 232 + + + L++ + + Sbjct: 796 VQEKLKNYVNFQEQDLTKFLERGINKRDIVFLLDGSDDSRNTLLTIREFIRRMVLDLDID 855 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE- 291 VR+ + Y+ + + N+ V +N L N A + + + Sbjct: 856 QDIVRVAVVQYSEDPLIHFLLNTYNSKKAVLYAINGLTAKGGRNINTGA--ALQYVRDHV 913 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 ++ + + + +I +T G ++ E ++N G+ S+A+ Sbjct: 914 FTTASGSRHHLGVPQVLIVMTGG--------RSIDQVADPAEDLKNFGVL--SIAIGIKN 963 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L+ S F + S ELL I I+ + Sbjct: 964 ALESELQTIAFSPRFIFNLPVSGELLHIQPDILSFIKSK 1002 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 21/160 (13%), Positives = 51/160 (31%), Gaps = 15/160 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAY 285 + VR+ + ++ N +EV +N L NT A+ Sbjct: 656 PLDIGINKVRVSVVQHSDRPSPNFFLDTYQTKDEVLRAVNGLTLAGGRGLNTGAALTFMK 715 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + S + +F+I +T G ++ + + ++ G+ + Sbjct: 716 NTVLSTARGSRAAQN---VPQFLIVLTAG--------RSRDSVREPAVALKTEGVVPF-- 762 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 V + + + F V + +L +K+ + Sbjct: 763 GVGVKNADPKEIEAISHNPSFAFNVKEFSQLNTVQEKLKN 802 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 45/154 (29%), Gaps = 14/154 (9%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELY 289 ++R+ + Y +N V S + L P N A+ L Sbjct: 53 VGRDAIRVALVLYGADPEIKFYLNSYDNRQSVLSAIRGLKYPGGEANLGAALQEVADSLL 112 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 E + + ++ I+ GE + + Q ++ + I+ ++A Sbjct: 113 GEDAGG---RAEEGVPQALVVISAGE--------SADDVSQSGRALKQDSVYIF--GIAA 159 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 L + D R + D+I Sbjct: 160 GDSATAQLEAIATDKSFVLSAPDVRTVSSMGDQI 193 >gi|315223476|ref|ZP_07865333.1| aerotolerance-related exported protein BatB [Capnocytophaga ochracea F0287] gi|314946649|gb|EFS98640.1| aerotolerance-related exported protein BatB [Capnocytophaga ochracea F0287] Length = 347 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 65/176 (36%), Gaps = 42/176 (23%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 R+G +AY L+ + + K L +N ++ A+ A R N + Sbjct: 130 RVGIVAYAASAYPQ--LALTTDHSAAKMFLQGMNTDMLSSQGTAIQEAIRMASNYFD--- 184 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---- 352 +T + + +TDGE+ + + +I + G+ IY++ + Sbjct: 185 ---ENTPTARLLFILTDGED-------HEMGATEIATEAQEKGVHIYTIGIGTEKGAPIP 234 Query: 353 ----------------------GQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 ++LL++ + G++ +++++++ +KI D Sbjct: 235 IKDGGEQTYKRDRNGEVVITKLNRELLQQIAINAGGEYLDGDNTQKVVSQINKILD 290 >gi|194218991|ref|XP_001915421.1| PREDICTED: similar to integrin, alpha D [Equus caballus] Length = 1160 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 23/120 (19%), Positives = 41/120 (34%), Gaps = 15/120 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T T + EL++ K + + KK +I ITDG+ + Sbjct: 223 GLTFTATGILAVVNELFHSKNGARRSA-----KKILIVITDGQKYKDPW-----EYRDVI 272 Query: 333 EYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 AG+ Y++ V P + L + F V++ L ++ +KI Sbjct: 273 PQAERAGIIRYAIGVGDAFQEPIARQELNTIGSAPSQDHVFKVDNFAALSSIQKQLQEKI 332 >gi|90411204|ref|ZP_01219217.1| hypothetical protein P3TCK_06547 [Photobacterium profundum 3TCK] gi|90328050|gb|EAS44371.1| hypothetical protein P3TCK_06547 [Photobacterium profundum 3TCK] Length = 436 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 14/125 (11%), Positives = 36/125 (28%), Gaps = 3/125 (2%) Query: 1 MTA-IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTK--K 57 + A + + V A+D+ +++ + ++Q+ +D+A LS +I + + Sbjct: 20 IFATLAMVVLIGAGALALDVGNLVLSKGKLQNIVDSAALSAAKAIDLGGDQAEAIVAGNE 79 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + + + I D T I + + Sbjct: 80 AINNNLTLDGFGSMTIDNTDIHYEFSDSLPFDSSTNTATSPYVRVRIEDVDVADYLVAIF 139 Query: 118 LFLKG 122 Sbjct: 140 NIDMS 144 >gi|193786519|dbj|BAG51302.1| unnamed protein product [Homo sapiens] Length = 164 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 16/117 (13%), Positives = 36/117 (30%), Gaps = 14/117 (11%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T A+ H ++E + + + + + + TDG + + Sbjct: 4 GTMTGLALRHMVEHSFSEAQGARPR--ALNVPRVGLVFTDGRSQDDISVW--------AA 53 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQ 388 + G+ +Y+V V + LR+ D + + + I Sbjct: 54 RAKEEGIVMYAVGVGKAV--EAELREIASEPAELHVSYAPDFGTMTHLLENLRSSIC 108 >gi|115623666|ref|XP_789748.2| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] gi|115960627|ref|XP_001186460.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] Length = 846 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 66/183 (36%), Gaps = 19/183 (10%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 RK+D + +++ ++ + L + + +V +N+ K+ +N Sbjct: 352 RKMDQTKRAFTTILDDVRPIDRINIVLFESNVRVWRSNQMVEAT----GDNIAAAKNHVN 407 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ TN Y + +A L +I +TDG+ + S + Sbjct: 408 DISAGGGTNLYDGLTNAVDLLMEHGNGEAMP--------LIIMLTDGQPTSGSVTSTSEI 459 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG----QFFAVNDSR-ELLESFDK 382 +I + + ++SV L K + S+ + + + + ++ +D+ Sbjct: 460 IKRITNLIDGR-LSLFSVGFG-NGVDFSFLEKLSLSNQALARKVYEDSSASLQMKGFYDE 517 Query: 383 ITD 385 + + Sbjct: 518 VAN 520 >gi|298711243|emb|CBJ26488.1| similar to collagen, partial [Ectocarpus siliculosus] Length = 356 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 34/220 (15%), Positives = 64/220 (29%), Gaps = 28/220 (12%) Query: 174 MTSNKYLLPPPPKKSFWSKN----TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK--- 226 P S +S TT + A + + L K Sbjct: 43 ADPTWAANCPCEAGSQYSFTMEGQTTVTANTVNVAVIIDASGSVGTADWELSKEFAKNTV 102 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN-KLNPYENTNTYPAMHHAY 285 A ++NL G+ ++ +L + + ++ + T+ + Sbjct: 103 ASFAEQNLFTNGGSASFAQFASDASEGGTFYSLEDFNAFVDADVKHSGGTDIIDGIAKGR 162 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 L ++ F+I TDG + + R G +Y+V Sbjct: 163 ELLSASPAAT----------SFMIVTTDGAAP---------DPQDEADAARAEGTILYAV 203 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 V + P ++LL F V++ EL + D I Sbjct: 204 GVGSGPSQENLL-AIGGDEANVFDVDNFEELDLALDDIVS 242 >gi|332291973|ref|YP_004430582.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332170059|gb|AEE19314.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 344 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 33/247 (13%), Positives = 81/247 (32%), Gaps = 60/247 (24%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 L + SK A A +I+ + ++N++ Sbjct: 78 IGTKLETVKREGVDVVFAIDVSKSMLAEDIAPNRIEKSKQLVTQIINNLGS--------- 128 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNE 291 RIG IAY P++ + + K L+++N T A+ A +E Sbjct: 129 DRIGIIAYAGSAYPQ--LPITTDYSSAKLFLSQMNTDMLSSQGTAIGEAIELAKTYYNDE 186 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 ++++ + + I+DGE+ + I E + G++I+++ V Sbjct: 187 EQTN----------RVLFIISDGEDHVGESSN-------IAEQANDEGIRIFTIGVGKSE 229 Query: 352 E---------------------------GQDLLRKCTDS-SGQFFAVNDSRELLESFDKI 383 L++ +G++ +++ ++E+ + Sbjct: 230 GGPIPLKRNGIVQSYKKDQNGETVITRLDDTTLKEIASGANGEYIDGSNTATVVETVQNL 289 Query: 384 TDKIQEQ 390 + + ++ Sbjct: 290 LNGMDKK 296 >gi|240140254|ref|YP_002964732.1| hypothetical protein MexAM1_META1p3745 [Methylobacterium extorquens AM1] gi|240010229|gb|ACS41455.1| hypothetical protein MexAM1_META1p3745 [Methylobacterium extorquens AM1] Length = 441 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 16/94 (17%), Positives = 34/94 (36%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ S + AID A R +QSA+DA VL+G ++ + + Sbjct: 2 IFALAGSTLIGLVGGAIDYARFASARTNLQSAVDAGVLAGGNALKLVVSSSESIVGLTTQ 61 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 + + + + + + +A+ Sbjct: 62 TIQAEAKAGADAPVSIQVTVASDKTSVEARAEQV 95 >gi|260062899|ref|YP_003195979.1| hypothetical protein RB2501_14954 [Robiginitalea biformata HTCC2501] gi|88784467|gb|EAR15637.1| hypothetical protein RB2501_14954 [Robiginitalea biformata HTCC2501] Length = 378 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 22/185 (11%), Positives = 78/185 (42%), Gaps = 15/185 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + L ++ + ++++ A+ S ++ ++ V ++ PL+++ E+ + + + Sbjct: 156 LTELKSASASFIDNVMPAVPA---ESFQMAIYWFDGEDVLHELNPLTSSREELIAAVESI 212 Query: 270 NPYENTNTYPAMHHAYRELYNEKES--SHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + + + ++ A + + + +T ++ TDG + + ++ Sbjct: 213 DSDFSNDPSTDLYGAVIKSTDLATDLLRDSEQNNTIGAASIVLFTDGTDQASRYSESQAL 272 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF--AVNDSRELLESFDKITD 385 N+ + +++ + A + + +L + G+ F + EL +F++++ Sbjct: 273 DKV---EKANSNISFFTIGLGAEIDSE-VLEEI----GKTFSVFAGNKEELEVTFNQLSQ 324 Query: 386 KIQEQ 390 K+ E+ Sbjct: 325 KVSER 329 >gi|296506525|ref|YP_003667759.1| hypothetical protein BMB171_P0145 [Bacillus thuringiensis BMB171] gi|296327112|gb|ADH10039.1| hypothetical protein BMB171_P0145 [Bacillus thuringiensis BMB171] Length = 452 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 26/202 (12%), Positives = 62/202 (30%), Gaps = 22/202 (10%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI----GIVGNQC 252 S K++ ++ N ++ I + G+ N Sbjct: 161 SGSMAGKVNGEVKMEAAKKAIYNYLDKIPDNANVMLRVYGHKGSNNENDKSLSCGSSEVM 220 Query: 253 TPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 PL N + + L+K P T A+ + V + Sbjct: 221 YPLQPYNKEQFNAALSKFGPKGWTPLASAIESINDDFKE--------YTGEENLNVVYIV 272 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCTDSS-GQFF 368 +DGE + + + + + + + + Q L+ ++ G + Sbjct: 273 SDGEETCGG------DPVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLKNTAEAGKGNYA 326 Query: 369 AVNDSRELLESFDKITDKIQEQ 390 V+ + EL ++ +K +K+ ++ Sbjct: 327 TVSSADELHQTLNKEYEKLYKE 348 >gi|224285863|gb|ACN40645.1| unknown [Picea sitchensis] Length = 829 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 11/134 (8%), Positives = 43/134 (32%), Gaps = 16/134 (11%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + + +L TN + + L + ++ + ++ ++DG++ Sbjct: 416 DGQRAANRVVERLLCTGGTNIAEGLRKGAKVLEDRRQRNPVAS--------IMLLSDGQD 467 Query: 317 SGASAYQ-----NTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 + + + + + + G + +++ + + S G F Sbjct: 468 TYSLSSRGVVLFPSDEQRRSARQSTRYGHVQIPVHAFGFGVDHDAATMHAISEVSGGTFS 527 Query: 369 AVNDSRELLESFDK 382 + + ++F + Sbjct: 528 FIQAESLVQDAFAQ 541 >gi|47214246|emb|CAG12465.1| unnamed protein product [Tetraodon nigroviridis] Length = 467 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 21/114 (18%), Positives = 41/114 (35%), Gaps = 17/114 (14%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T A+ A + + + + + ST++ + + +TDG ++ Sbjct: 106 GTMTGLAIRTAVEKAFAAEAGAR--LNSTKVARVAVVVTDGRPQD--------EVERVSA 155 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND---SRELLESFDK 382 R +G++IY+V V + LR F V +L F + Sbjct: 156 AARESGIEIYAVGV--DRADRTSLRLMASQPHEDHVFYVETYGVIEKLTSRFRE 207 >gi|114569600|ref|YP_756280.1| von Willebrand factor, type A [Maricaulis maris MCS10] gi|114340062|gb|ABI65342.1| von Willebrand factor, type A [Maricaulis maris MCS10] Length = 555 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 32/358 (8%), Positives = 91/358 (25%), Gaps = 30/358 (8%) Query: 44 IVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY 103 + T+ + + + + ++ + Sbjct: 43 MSEMVTVTGSRVRSSAYRNDAIAGVANFTMPDQLVVDRERYEDVDPNPVMSTADEPVSTF 102 Query: 104 IAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 + A Y + +L L P+ + + V+ + Sbjct: 103 SIDVDTASYSLVRNSLEAGRLPPTDAVRIEEMVNYFDYDYALPPGPDEPFATHVTVTPTP 162 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 ++ Y + P N + K+ + +++ LV+ Sbjct: 163 WNADTQL--MHIGIQGYEIIPDE---RPRANLVFLIDVSGSMNSPDKLPLAVQAMHLLVD 217 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMH 282 + + + Y + N E+ L+ L+ +T + Sbjct: 218 ELHPDDT--------VALVVYASASGVVLPPTEARNAREIHRALDSLSAGGSTAGGAGLA 269 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 AY + V+ +TDG+ + R++G+ + Sbjct: 270 LAYDLAEQNFDEDAVNR--------VMLLTDGDFNVGV--TQDERLEDFVARKRDSGIYL 319 Query: 343 YSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRE-----LLESFDKITDKIQEQSVRI 394 + +++ + G ++ +E + ESF + + +++ Sbjct: 320 SVMGFGRGNYNDQMMQTIAQAGNGTAAYIDSRQEARRMLVEESFSSLFTIANDVKIQV 377 >gi|313238562|emb|CBY13611.1| unnamed protein product [Oikopleura dioica] Length = 445 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 34/302 (11%), Positives = 69/302 (22%), Gaps = 26/302 (8%) Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + + + + L P+ + R Sbjct: 122 WKTTTEVNGEGEATTTADFNIVTDLPPATEAPETTEGPTCEWTIWTEWDKCSETCGGGQR 181 Query: 159 SMEDLYLQKHNDNN----NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + N K P + K L Sbjct: 182 NRYRNPTGDINAAGCEGFAEDVEYCNTQDCETKQCKDNYVDVCFLLPVHNATDNKDVRL- 240 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN--EVKSRL---NKL 269 N V I + ++ Y+ S + + ++K+ L ++ Sbjct: 241 --MRNFVRETHNYIGNFGSEDLQFCVYQYSESAANVFSLSESADFDSLDLKTALEEGIEI 298 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 N A + E +N + ++ +I +TD N+ Sbjct: 299 PEDRGANIGAAFKTIHDEGFNAING---WRKNDQIPSVLIVLTDNLNTVDFYDDLQYVHN 355 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDKITDKI 387 + ++ +V + E L S F V D EL D++ I Sbjct: 356 KA--------YRVVAVGIGENVENSS-LSSIASLPSDENVFTVRDFTELGNVVDEVGYDI 406 Query: 388 QE 389 + Sbjct: 407 CQ 408 >gi|296225414|ref|XP_002758468.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H4 [Callithrix jacchus] Length = 904 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 58/178 (32%), Gaps = 14/178 (7%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 ++ I + + + N+N+ +S ++ TN A+ A + Sbjct: 303 DLSPRDQFNLITFSSEATQWSPSLVPASAENVNKARSFAAAIHALGGTNINDAVLMAVQL 362 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS--- 344 L TR +I +TDG+ + + T E +R A YS Sbjct: 363 LDRSNREERLP---TRSVSLIILLTDGDPTVGEGPASNSKTRCTGENVREAVSGQYSSLP 419 Query: 345 -VAVSAPPEGQDLLRKCTDSSGQ----FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + L D+ G + + + +L + + ++ + + PN Sbjct: 420 GLGFDVSYAPEKL---ALDTGGLARRIYEDSDSAMQLQDFYQEVANPLLTAVTFEYPN 474 >gi|254786708|ref|YP_003074137.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237687355|gb|ACR14619.1| von Willebrand factor, type A [Teredinibacter turnerae T7901] Length = 767 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 23/194 (11%), Positives = 63/194 (32%), Gaps = 21/194 (10%) Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 + N ++ + LV + + + + + V + Sbjct: 50 DVRLVIDVSGSMKRNDPNNLRQPAVDLLVQLLPEGSRAGVWTFGKWVNMLVPHRDVTDPW 109 Query: 253 TPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 +++ +++N TN A+ A + + K +I + Sbjct: 110 ------RATAQAKASEINSVGLFTNIGEALEKAT---FEGADGGAEFR------KSIILL 154 Query: 312 TDGENSGASAYQNTLN-----TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 TDG + + ++ ++ AG+ ++++A+SA + L + + G Sbjct: 155 TDGMVDIDKSPEQNKREWRRIADEVIPRLKEAGVTVHTIALSANADTNLLNKISLATGGM 214 Query: 367 FFAVNDSRELLESF 380 + + +L+ F Sbjct: 215 AEVAHSADDLMRIF 228 >gi|256821839|ref|YP_003145802.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] gi|256795378|gb|ACV26034.1| von Willebrand factor type A [Kangiella koreensis DSM 16069] Length = 958 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 35/230 (15%), Positives = 66/230 (28%), Gaps = 36/230 (15%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 P P + + KID +SA V ++ + R+ Sbjct: 366 TTPVPVPDQDIMLVIDRSGSMSGDAGTGQSKIDEAKDSASLFVQLVEASAG------HRM 419 Query: 239 GTIAYNIGIVGNQ-CTPLSNNLNEVK--------SRLNKLNPYENTNTYPAMHHAYRELY 289 G ++++ ++ L+ + L P T+ + A EL Sbjct: 420 GLVSFSTSASIDEGIGNLNPGKKNQLIGPAPYSGGAVGGLIPDGWTSIGDGIDKAQSELT 479 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 K ++ +TDG + + N + +I+++ + Sbjct: 480 GGANP-----------KTILLLTDGLQNTPPMIETATNDIG--------DTRIHAIGLGT 520 Query: 350 PPE-GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 LL T + G + D EL + F I E I P Sbjct: 521 EANLNGGLLSDLTQSTGGAYTRAGDGLELKKFFALAFGDIFEDGTLIDPT 570 >gi|292655414|ref|YP_003535311.1| von Willebrand factor type A domain-containing protein [Haloferax volcanii DS2] gi|291372503|gb|ADE04730.1| von Willebrand factor type A domain protein [Haloferax volcanii DS2] Length = 818 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 49/145 (33%), Gaps = 21/145 (14%) Query: 237 RIGTIAYNIGIVGNQCT-PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 R+G + +N PL N + +L T+ + A ++L + + + Sbjct: 439 RVGIVGFNYRAYDVAPLRPLGPNRESAADLIRRLESGGATDIAVGLDGAAQQLGDRRGT- 497 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 +I I+DG + + + + + G+ + ++ P + Sbjct: 498 ------------IILISDG-------HDRFQDAATLADQLGRDGVSVITIGTGPNPNERT 538 Query: 356 LLRKCTDSSGQFFAVNDSRELLESF 380 L S G + +++ L F Sbjct: 539 LRAIARASGGNYLRADETDRLRILF 563 >gi|257870337|ref|ZP_05649990.1| von Willebrand factor type A domain-containing protein [Enterococcus gallinarum EG2] gi|257804501|gb|EEV33323.1| von Willebrand factor type A domain-containing protein [Enterococcus gallinarum EG2] Length = 1169 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 26/266 (9%), Positives = 73/266 (27%), Gaps = 24/266 (9%) Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 E + + +V+D S SM D L + S Sbjct: 329 NVQKEIAPLDLVLVVDWSGSMNDNNRIGEVQKGVDRFVDTLAESGITDNIHMGYVGYSSD 388 Query: 200 APAPAPANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 D + + + S + G + L++ Sbjct: 389 GYKNDSVAMGPFDSVKNAIKTITPSSTTGGTFTQKALRDAGNMLATPNGHKKVIVLLTDG 448 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + +++K+ N + Y + + + GST + D N+ Sbjct: 449 VPTFSYQVSKVQTETNGSYYG----------TQFTNRQDQPGSTSRISNSYYAPDQRNTN 498 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQ 366 + T+ ++ G++I+ + + + ++ +R+ + Sbjct: 499 KLINSTFIATIGEAMALKQRGIEIHGLGIQLQSDTNAGLSKQEVENKMRQMVSADENGDL 558 Query: 367 FFA-VNDSRELLESFDKITDKIQEQS 391 ++ + + ++ + + +I Sbjct: 559 YYESADHAPDISDYLARKAVQISGTV 584 >gi|213965586|ref|ZP_03393780.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] gi|213951745|gb|EEB63133.1| von Willebrand factor type A domain protein [Corynebacterium amycolatum SK46] Length = 330 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 33/222 (14%), Positives = 70/222 (31%), Gaps = 25/222 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ S A A +I E+ V ++ + IG + ++ Sbjct: 89 NRATVMLVVDVSLSMSATDVAPDRITAAKEAGQEFVENLP--------DDLNIGLVTFSG 140 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + N + V L + T T A+ A N+ S G Sbjct: 141 RARTAVSP--TTNHDTVNRALQAAELDQATATGDAIAAALDA-INQFSDSVQGGGEGAPP 197 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PE 352 ++ ++DG+ + + AG+ + +++ P Sbjct: 198 ATIVLLSDGKQTVPQELDDPRGAYTAAAEAAKAGVPVNTISFGTAQGAITVQGELIPVPN 257 Query: 353 GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 D LR+ + G+FF+ +L +++ + D I + R Sbjct: 258 DDDSLREIARRTKGEFFSAGSLEQLRDAYGSLEDDIGYELKR 299 >gi|288802179|ref|ZP_06407619.1| BatB protein [Prevotella melaninogenica D18] gi|288335146|gb|EFC73581.1| BatB protein [Prevotella melaninogenica D18] Length = 331 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 61/165 (36%), Gaps = 45/165 (27%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++++ K L+ +NP + T+ A+ + ++++ K + Sbjct: 145 LPITSDYVSAKMFLDNINPSLIGTQGTDIGKALQLSMNSFTP----------NSKVGKAI 194 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 I ITDGE++ ++ + ++ G++++ + V + Sbjct: 195 ILITDGEDNEG-------GAEEMAKQAQSKGIRVFILGVGSTEGATIPMPDGSELKTSNS 247 Query: 353 -------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQE 389 +++ ++ + G + V +S + +K+Q+ Sbjct: 248 EVVKTRLNEEMCKQIATAGHGVYLHVTNSSMADAVLGRELNKLQK 292 >gi|260814492|ref|XP_002601949.1| hypothetical protein BRAFLDRAFT_86433 [Branchiostoma floridae] gi|229287252|gb|EEN57961.1| hypothetical protein BRAFLDRAFT_86433 [Branchiostoma floridae] Length = 1774 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 28/189 (14%), Positives = 66/189 (34%), Gaps = 24/189 (12%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + D + + +V++ + + R+G + Y+ + Sbjct: 1148 SGSVSVSDFDTVKQFVVAVVSAFTIGLAD-----TRVGVLQYSDRNTLGCNLGDHPDEAS 1202 Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 S +N + T+T AM A + + + K +I +TDG++S + Sbjct: 1203 FVSSINTMTRQGGGTSTGAAMEFARQNA--AWRPAP-------VPKIMIVLTDGKSSDSV 1253 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + ++++ V + + LL + Q F + D L +S Sbjct: 1254 VAAAHALAAD--------QVTVFAIGVGSFDHSE-LLEITNNKPSQVFELADFNVLAQSI 1304 Query: 381 DKITDKIQE 389 ++I D + Sbjct: 1305 NRIVDGVCN 1313 >gi|207028763|ref|NP_001124799.1| inter-alpha-trypsin inhibitor heavy chain H5 [Pongo abelii] Length = 942 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 19/123 (15%), Positives = 45/123 (36%), Gaps = 9/123 (7%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++ ++P T+ A+ A R L S R ++F+TDG+ + + Sbjct: 361 YIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI---GDRSVSLIVFLTDGKPTVGETHTL 417 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL----LRKCTDSSGQFFAVNDSRELLESF 380 + + I+++ + + + L L C + + +L+ + Sbjct: 418 KILNNT--REAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFY 475 Query: 381 DKI 383 D+I Sbjct: 476 DEI 478 >gi|56403909|emb|CAI29739.1| hypothetical protein [Pongo abelii] Length = 694 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 19/123 (15%), Positives = 45/123 (36%), Gaps = 9/123 (7%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++ ++P T+ A+ A R L S R ++F+TDG+ + + Sbjct: 113 YIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI---GDRSVSLIVFLTDGKPTVGETHTL 169 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL----LRKCTDSSGQFFAVNDSRELLESF 380 + + I+++ + + + L L C + + +L+ + Sbjct: 170 KILNNT--REAARGQVCIFTIGIGNDVDFRLLEKLSLENCGLTRRVHEEEDAGSQLIGFY 227 Query: 381 DKI 383 D+I Sbjct: 228 DEI 230 >gi|163846842|ref|YP_001634886.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222524662|ref|YP_002569133.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163668131|gb|ABY34497.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222448541|gb|ACM52807.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 545 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 52/132 (39%), Gaps = 12/132 (9%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 PLS N +++ + ++ T + A+ A + L + + ++ ++D Sbjct: 424 PLSENRIDLQIAVQEMRASGRTALFDALDLARQTLEALPPAE------DDRIRAIVLLSD 477 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 G ++ + + +G+ I+ VA + + +L + S V DS Sbjct: 478 GADNASRLTLEEVR-----RQFDESGITIFPVAYGSDA-DRQVLDAIAEFSRTIVVVGDS 531 Query: 374 RELLESFDKITD 385 ++ + F+ ++ Sbjct: 532 GDIAQIFENLSR 543 >gi|332520546|ref|ZP_08397008.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332043899|gb|EGI80094.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 698 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 45/365 (12%), Positives = 100/365 (27%), Gaps = 41/365 (11%) Query: 34 DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI 93 DA + + ++ I T K + T K + + + + + + +I Sbjct: 202 DAGATAIYGNRGANGVILVSTKKGNFTMPDVHKIANESYTKIHENKFKLTNTSALSTFSI 261 Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 D KA Y + IP+ + S+ + Sbjct: 262 DVD-----------KASYSNVRRMINNAQHIPADAVKIEEMINYFNYNYSQPKDEHPFAI 310 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 + + + N K L ++ + + ++ +L Sbjct: 311 --HTEVAQTPWN------NQTKLVKIGLQGKTYENKELPAANLTFLIDVSGSMSHELKLL 362 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + LV+ ++ ++ + Y + ++ LNKL Sbjct: 363 KSAFKLLVDQLRDKD--------KVSIVVYAGAAGVVLEPTSGKDKKKILKALNKLQSGG 414 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 +T ++ AY+ + N VI TDG+ + + + Sbjct: 415 STAGGAGINLAYKLAEENFNKNGNNR--------VILATDGDFNVG--ASSNQAMEDLII 464 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSV 392 R +G+ + + D L D G ++ +E F + Sbjct: 465 EKRKSGVFLSVLGFGYGNYKDDKLETLADKGNGNHAYIDTMQEAKLIF---GKEFGGTLF 521 Query: 393 RIAPN 397 IA + Sbjct: 522 TIAKD 526 >gi|295840348|ref|ZP_06827281.1| secreted protein [Streptomyces sp. SPB74] gi|295827934|gb|EFG65721.1| secreted protein [Streptomyces sp. SPB74] Length = 418 Score = 56.5 bits (134), Expect = 8e-06, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 51/139 (36%), Gaps = 20/139 (14%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + E K+ + L+P T PA+ A +L K ++ ITDGE Sbjct: 107 DRTEAKAAVATLSPTGWTPIGPALLGAAEDL-----------RGGEAAKRIVLITDGE-- 153 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSR 374 + ++ + G+ + ++ + + +D L + + G + V ++ Sbjct: 154 ----DTCRRDPCEVAREIAAKGVHLVVDTLGLVPDAKTRDQLSCIAEATGGTYTTVRHTK 209 Query: 375 ELLESFDKITDKIQEQSVR 393 +L ++ + + V Sbjct: 210 DLSGRVKQLVHRAADPVVT 228 >gi|332669282|ref|YP_004452290.1| von Willebrand factor type A [Cellulomonas fimi ATCC 484] gi|332338320|gb|AEE44903.1| von Willebrand factor type A [Cellulomonas fimi ATCC 484] Length = 538 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 26/201 (12%), Positives = 52/201 (25%), Gaps = 19/201 (9%) Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 + K + N K+ +L +S LV + + + Sbjct: 171 VQATDVKPTTRGNNVVFLLDVSGSMDEPNKLPLLADSFALLVEQLDEDDT--------VS 222 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y + + E+ L +L +T + AY N Sbjct: 223 IVTYAGSDQVLADSVPGDRRGEIVDILRELRAGGSTGGARGLETAYELAAKNFVEGGNNR 282 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 VI TDG+ + ++ E G+ I + + Sbjct: 283 --------VILATDGDFNVG--PSTPEQLTELIEEHARTGVYISVLGFGMGNLKDSTMEA 332 Query: 360 CTDSS-GQFFAVNDSRELLES 379 D G + ++ E + Sbjct: 333 IADHGNGNYAYIDTLDEARKV 353 >gi|332217050|ref|XP_003257666.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like [Nomascus leucogenys] Length = 941 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 24/160 (15%), Positives = 57/160 (35%), Gaps = 12/160 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ I + +S + ++ ++ ++P T+ A+ A R L Sbjct: 330 RFSIIGFSNRIKVWKDYLISVTPDSIRDGKVYIHHMSPTGGTDINGALQRAIRLLNKYVA 389 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S R V+F+TDG+ + + + + I+++ + + Sbjct: 390 HSDI---GDRSVSLVVFLTDGKPTVGETHTLKILNNT--REAARGQVCIFTIGIGNDVDF 444 Query: 354 QDL----LRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + L L C + + +L+ +D+I + Sbjct: 445 RLLEKLSLENCGLTRRVHEEEDAGSQLIGFYDEIXTPLLS 484 >gi|326922309|ref|XP_003207392.1| PREDICTED: collagen alpha-1(VI) chain-like [Meleagris gallopavo] Length = 998 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 64/211 (30%), Gaps = 25/211 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 T A P + + + ++ + + + + + A + Sbjct: 44 TSESVALRVKPFGDLVAQVKDFTNRFIDKLTE-RYFRCDRFLAWNAGALHYSDSVVIIKD 102 Query: 255 LS---NNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 L+ + E+K+ ++ +N T+T A+ L SH K++I Sbjct: 103 LTAMPSGRAELKNSVSAINYIGKGTHTDCAIKQGIERLLVG--GSHLKEN-----KYLIV 155 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ--FF 368 +TDG ++ G+K++SVA+ +P L F Sbjct: 156 VTDGHPLEGYKEPCG-GLDDAANEAKHLGIKVFSVAI-SPHHLDQRLNIIATDHAYRRNF 213 Query: 369 AVND---------SRELLESFDKITDKIQEQ 390 + + I D +++ Sbjct: 214 TATSLKPTRDLDVEETINNIIEMIKDNMEQS 244 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 31/200 (15%), Positives = 67/200 (33%), Gaps = 23/200 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + ++ D L +A + SVR+ + Y+ P N Sbjct: 808 VDSSTSVGSKNFDTTKSFVKRLAERFLEAS-KPAEDSVRVSVVQYSGRNQQKVEVPFQRN 866 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + ++ + N + A + + + S KK V+ +DG + G Sbjct: 867 YTVIAKAVDNMEFM---NEATDVSAALQYVTGLYQRSSRA----GAKKKVLVFSDGNSQG 919 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF---------FA 369 + + ++ AG+++Y +AV + ++ T + + F Sbjct: 920 ----ITARAIERTVQEVQQAGIEVYVLAVGSQVNEPNIRVLVTGKTANYDVVYGERHLFR 975 Query: 370 VNDSRELLE--SFDKITDKI 387 V D LL + ++ KI Sbjct: 976 VPDYTSLLRGVFYQTVSRKI 995 >gi|327189769|gb|EGE56913.1| hypothetical protein RHECNPAF_550036 [Rhizobium etli CNPAF512] Length = 533 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 23/195 (11%), Positives = 51/195 (26%), Gaps = 20/195 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +T I + + F ID+ + +Q+A+DA L+G + D T + Sbjct: 20 LTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGAREL--DGRDDAITRARTAI 77 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I ++ + I++ + N + Sbjct: 78 EKIANSAA---------FSGGGTGMSLGSHISVVYNAGNDAGSTVTV---LFLKNIPAND 125 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 IPS++ ++ A+ + V + + + Y Sbjct: 126 DTAIPSSMQTTVASEASYAWVIAKPQAMQTIFPIPVG------FTRDTINIAADAVAVYH 179 Query: 181 LPPPPKKSFWSKNTT 195 + N Sbjct: 180 ASACDVTPIFICNPY 194 >gi|301768895|ref|XP_002919864.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Ailuropoda melanoleuca] Length = 1127 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 53/154 (34%), Gaps = 21/154 (13%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESS 295 R+G I Y+ + + ++ K ++++ T T A+H A + Sbjct: 836 RVGIINYSHKVEKVAHLTQFSTKDDFKLAVDRMQYLGEGTYTASALHEANHMFEAARPG- 894 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ- 354 +KK + ITDG+ ++ N ++ + + ++I+ + V + Sbjct: 895 --------VKKVALVITDGQTDT----RDEKNLTEVVKKASDINVEIFVIGVVKKNDPNF 942 Query: 355 ----DLLRKCTDSSG--QFFAVNDSRELLESFDK 382 + + +D L ++ + Sbjct: 943 EVFHKEMNLIATDPDSEHVYQFDDFITLQDTLKQ 976 Score = 43.0 bits (99), Expect = 0.096, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 55/156 (35%), Gaps = 16/156 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 D + +L + + + + +++ + ++ + + +L K R+ Sbjct: 64 FDKQKDFVDSLSDRVFQLTPVRSLKYDIKLAALQFSSSVQIDPSFSSWKDLQTFKQRVKS 123 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N T +Y A+ +A L E K + +TDG + + + Sbjct: 124 MNFIGQGTFSYYAISNATGLLKREGRKDGV--------KVALLMTDGID-----HPKNPD 170 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 I E R AG+ ++ +S + LR + Sbjct: 171 VQSISEDARTAGILFITIGLST-VVNEAKLRLISGD 205 >gi|257389158|ref|YP_003178931.1| von Willebrand factor A [Halomicrobium mukohataei DSM 12286] gi|257171465|gb|ACV49224.1| von Willebrand factor type A [Halomicrobium mukohataei DSM 12286] Length = 393 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 16/132 (12%), Positives = 45/132 (34%), Gaps = 11/132 (8%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + +L T+ Y + A L + K ++ ++DG+++ Sbjct: 96 DRQTAMDHVEELTAGGGTDMYNGLKAAKETLSSSATG-------PDTVKRLLLLSDGKDN 148 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + E + +AG++I S + + T G + + ++ Sbjct: 149 E----RTPDEFEGLAEAIDDAGIRIQSAGIGTDYNEATIRTLGTAGRGTWTHLEAPGDIE 204 Query: 378 ESFDKITDKIQE 389 + F + ++ Sbjct: 205 DFFGEAVEQAGS 216 >gi|307291074|ref|ZP_07570959.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0411] gi|306497728|gb|EFM67260.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX0411] Length = 1103 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 96/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGSQWDGQ 210 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 211 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++++ T R + + + GST D N Sbjct: 385 YTVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|11414924|dbj|BAB18554.1| voltage dependent calcium channel alpha2a/delta subunit [Rana catesbeiana] Length = 1102 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 23/220 (10%), Positives = 66/220 (30%), Gaps = 20/220 (9%) Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 ++ K N + P + S + + + + ++ S Sbjct: 219 YPASPWVDKSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVT 278 Query: 219 NLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 ++ ++ + ++ +V N +K +N + T+ Sbjct: 279 EMLETLSDDDFVNVAAFNSNAHDVSCFHHLVQANVR----NKKVLKEAVNNITAKGTTDY 334 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ +L N S K ++ TDG A+ N +N Sbjct: 335 KQGFKFAFDQLRNTNVSRA------NCNKIIMLFTDGGEDKATETFKLYN--------KN 380 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 +++++ +V + ++ ++ G ++ + + Sbjct: 381 KTVRVFTFSVGQHNYDKGPIQWMACENKGYYYEIPSIGAI 420 >gi|145593798|ref|YP_001158095.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145303135|gb|ABP53717.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 319 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 20/160 (12%), Positives = 51/160 (31%), Gaps = 28/160 (17%) Query: 252 CTPLSNNLNEVKSRLNKLN--PYE--NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P + + +++L T A++ + + + Sbjct: 140 LVPPDTDREALDEGIDRLVEGATGVQGTAIGEAINTSLGAVK----ALDGEAAKDPPPAR 195 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA--------------PPEG 353 ++ ++DG N+ ++ ++ + ++++A P +G Sbjct: 196 IVLLSDGANTSG------MDPMEAAADAVEMEVPVHTIAFGTASGYVDRGGRPIQVPVDG 249 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 Q L ++ GQF EL +D I + ++ R Sbjct: 250 QTLDAVARETGGQFHEAVSVEELRAVYDDIGSSVGYRTKR 289 >gi|148253705|ref|YP_001238290.1| hypothetical protein BBta_2204 [Bradyrhizobium sp. BTAi1] gi|146405878|gb|ABQ34384.1| hypothetical protein BBta_2204 [Bradyrhizobium sp. BTAi1] Length = 409 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 17/117 (14%), Positives = 35/117 (29%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI + + A+D A ++ Q+Q ++DAAVL+G + Sbjct: 28 IFAIASIPILVSVGAAVDFAKSSDVKAQLQKSIDAAVLAGVVRPNDQQISTAAAVFSGAY 87 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 F + + + L A + A+ ++ Sbjct: 88 RGRFDTAATASFASNTDGSLTGTATTSVKTSFLNVMGTSALGVTASATAKAGAQAQS 144 >gi|313226592|emb|CBY21738.1| unnamed protein product [Oikopleura dioica] Length = 766 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 27/297 (9%), Positives = 79/297 (26%), Gaps = 28/297 (9%) Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST-GIIERSSENLAISICMVLD 155 + + + + + S + I +D Sbjct: 157 DEAQFNTTKTVALKSSVSGKFNVDWDKLEYDELTIPTYSMLALFLYLEHLNEFPIPYSID 216 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 ++ + + D ++ + + + + + ++ + Sbjct: 217 AQAAIYESITHHNVDGFKDGVDELDQITQDQCRTNALDIVFV-VDESGSIGTPNFQLIKD 275 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-EN 274 + + A RI ++ + + + + N Sbjct: 276 FLEHFASDSTIAADA-----TRIAIRPFSTSNYLYFSLN-DFKTKNIINEIKNMPYNSGN 329 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 TNT A+ A + ++ K ++ ITDG ++ + L+T + Sbjct: 330 TNTADALDAALTDYGTDRP---------ESVKVMVTITDGASN------SFLSTSAAADR 374 Query: 335 MRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 ++N ++ +++ V L + F +N + + K+ E Sbjct: 375 VKNDLRNIQSFAIGV--SGANMAELEAIAITDKHVFMLNGWADFEPIKSNLLQKVCE 429 >gi|307565332|ref|ZP_07627825.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] gi|307346001|gb|EFN91345.1| von Willebrand factor type A domain protein [Prevotella amnii CRIS 21A-A] Length = 566 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 59/165 (35%), Gaps = 45/165 (27%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++++ K LN +NP + T+ A++ A ++ K + Sbjct: 145 LPITSDFISAKMFLNDINPSLIGTQGTDIGKAINLAMHSFSPTSKAG----------KAI 194 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 I ITDGE++ + + + AG IY + + + Sbjct: 195 IIITDGEDNEG-------GAEAMAKKAQEAGFHIYILGIGSTSGAEIPIGNGEKLKDKRG 247 Query: 353 -------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQE 389 +D+ + D+ G + V+++ + + K K+Q+ Sbjct: 248 NIVVSHLNEDMCKGIADAGKGIYIHVDNNSDAQQILKKQLSKLQK 292 >gi|229546230|ref|ZP_04434955.1| pilus subunit protein [Enterococcus faecalis TX1322] gi|229308754|gb|EEN74741.1| pilus subunit protein [Enterococcus faecalis TX1322] Length = 1103 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 96/382 (25%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGSQWDGQ 210 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 211 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++++ T R + + + GST D N Sbjct: 385 YTVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|313235671|emb|CBY11123.1| unnamed protein product [Oikopleura dioica] Length = 362 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 57/183 (31%), Gaps = 17/183 (9%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ + ++A + ++ V +AY + S++ Sbjct: 118 ISTSMGIGKRWVNVKKAATQYLEAVPHDAHVGI---VLFHRVAYVKVPLKQIENADSSDF 174 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +K+RL K+ T+ A+ + L S +I ++DG+ S Sbjct: 175 --LKNRLEKVELGVGTSIASALKVSMNVLQTSGP-----KDSRDSGGNIILLSDGQESHD 227 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF-AVNDSRE-LL 377 + + + + + + + ++A + +F + L Sbjct: 228 PRIDDE-----LIQELIDNKVTVNTIAFGTDASQKLEEVSSRTKGSSYFSDPENPNSILQ 282 Query: 378 ESF 380 ++F Sbjct: 283 DAF 285 >gi|302517652|ref|ZP_07269994.1| von Willebrand factor [Streptomyces sp. SPB78] gi|302426547|gb|EFK98362.1| von Willebrand factor [Streptomyces sp. SPB78] Length = 418 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 51/139 (36%), Gaps = 20/139 (14%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + E K+ + L+P T PA+ A +L K ++ ITDGE Sbjct: 107 DRTEAKAAVATLSPTGWTPIGPALLGAADDL-----------RGGEASKRIVLITDGE-- 153 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSR 374 + ++ + G+ + ++ + + +D L + + G + V ++ Sbjct: 154 ----DTCHRDPCEVAREIAAKGVHLVVDTLGLVPDAKTRDQLSCIAEATGGTYTTVRHTK 209 Query: 375 ELLESFDKITDKIQEQSVR 393 +L ++ + + V Sbjct: 210 DLSGRVKQLVHRAADPVVT 228 >gi|194225621|ref|XP_001916184.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Equus caballus] Length = 3570 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 45/132 (34%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + + T T A A + L + +E+S K + ITDG ++G Sbjct: 153 LSQEIPAITYRGGGTYTKGAFQQAAQILRHSRENST---------KVIFLITDGYSNGG- 202 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + + +R+ G++I++ L + + ++ E Sbjct: 203 ------DPRPVAASLRDFGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 251 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 252 -FEALARRALHE 262 >gi|187956243|gb|AAI50691.1| Integrin alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 49/155 (31%), Gaps = 16/155 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++N +++ + E T T AM H ++ S Sbjct: 236 LVQYGAVIQTEFDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKK 295 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQD 355 K ++ +TDG+ + + LN + + G+ +++ V Sbjct: 296 A-----LKVMVVLTDGD-----IFGDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYR 345 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L K+ I Sbjct: 346 ELKLIASDPKEAHTFKVTNYSALDGLLSKLQQHIV 380 >gi|182676519|sp|P0C6B8|SVEP1_RAT RecName: Full=Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1; Flags: Precursor Length = 3564 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 45/132 (34%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + + T T A A + L + +E+S K + ITDG ++G Sbjct: 154 LSREIPAITYRGGGTYTMGAFQQAAQILRHSRENST---------KVIFLITDGYSNGG- 203 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R+ G++I++ L + + ++ E Sbjct: 204 ------DPRPIAASLRDFGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 252 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 253 -FEALARRALHE 263 >gi|226823202|ref|NP_032425.2| integrin alpha-E isoform 1 [Mus musculus] gi|56206400|emb|CAI24788.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1167 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 49/155 (31%), Gaps = 16/155 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++N +++ + E T T AM H ++ S Sbjct: 236 LVQYGAVIQTEFDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKK 295 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQD 355 K ++ +TDG+ + + LN + + G+ +++ V Sbjct: 296 A-----LKVMVVLTDGD-----IFGDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYR 345 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L K+ I Sbjct: 346 ELKLIASDPKEAHTFKVTNYSALDGLLSKLQQHIV 380 >gi|156364713|ref|XP_001626490.1| predicted protein [Nematostella vectensis] gi|156213368|gb|EDO34390.1| predicted protein [Nematostella vectensis] Length = 1043 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 22/126 (17%), Positives = 41/126 (32%), Gaps = 10/126 (7%) Query: 258 NLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N VK + K + A+ A ++L+N KK + +T G Sbjct: 156 NAASVKRGILGAKQSRGPGLRIDKALKAADKDLFN-----RRFGMREDQKKVCLLVTSGA 210 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + L T+ + G+ IY+V V LR + + + E Sbjct: 211 QTKDQGPSTQLGTVTA--LLSARGVDIYAVGVG-DGVDSSELRNIASTEDFIYTASSFEE 267 Query: 376 LLESFD 381 + + + Sbjct: 268 INKVLE 273 Score = 41.4 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 25/181 (13%), Positives = 59/181 (32%), Gaps = 12/181 (6%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K D + +++ S A N ++ + G + L+EV + L + Sbjct: 491 KYDFVKRFVKDVIMSYADAENSA-NFAIGQYAKYFQTGTKRFRNFRSMEELDEVINSLRQ 549 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 ++ N + A E + K L +F+I + A+ ++ Sbjct: 550 MSSSAERNVGAGLRGAANEFFQVKNG-----MRQGLPRFLIVLA-----SANPSASSEAI 599 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + ++ +V LR F V++ ++L + ++T + Sbjct: 600 ESAVVNLDKENVRRIAVGF-TEDATPGFLRMLASDPSLMFRVDEPKKLDKVMMELTPMLC 658 Query: 389 E 389 + Sbjct: 659 Q 659 >gi|153002167|ref|YP_001367848.1| putative outer membrane adhesin-like protein [Shewanella baltica OS185] gi|151366785|gb|ABS09785.1| putative outer membrane adhesin like proteiin [Shewanella baltica OS185] Length = 1215 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 32/325 (9%), Positives = 95/325 (29%), Gaps = 20/325 (6%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 + L + Q ++ ++ I + + Sbjct: 185 DEYKGNAVSGAANATLTTDVATDSIMSNQWQARNGDMKWLNHSTASNIGDVNRTAQGRV- 243 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 G + ++ R + + L + + ++K + Sbjct: 244 ----YGKSAWEVLAQDVKDDPKSGRKTAQPTRTRYTTLANNAPDANSPVKKELPAAQFSC 299 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 L + + + +P ID ++A LV++ + +S Sbjct: 300 RDQLDFVWVEGDIDMQIVMDRSGSMFGSP----IDNAKQAAKILVDATAEGSTAMGLVSF 355 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + + P + +K ++ + +T + A L ++ Sbjct: 356 SGRSSVKQDFAMQKMPKPDNGVKQALKGAIDNIYANGSTALFDGSQLALDNLS-----AY 410 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 ++ V + DG+++ + +++ + +NA + I+S + L Sbjct: 411 QASAASGAPGVVFVLADGDDNNSIKSESS-----VITAYQNANVPIFSFGYGSASPTGPL 465 Query: 357 LRKCTDSSGQFFAV-NDSRELLESF 380 + + G++F+ E++++F Sbjct: 466 VTMANATGGKYFSSPTTLAEIIDAF 490 >gi|293347389|ref|XP_002726583.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Rattus norvegicus] Length = 3578 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 45/132 (34%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + + T T A A + L + +E+S K + ITDG ++G Sbjct: 154 LSREIPAITYRGGGTYTMGAFQQAAQILRHSRENST---------KVIFLITDGYSNGG- 203 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R+ G++I++ L + + ++ E Sbjct: 204 ------DPRPIAASLRDFGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 252 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 253 -FEALARRALHE 263 >gi|109474969|ref|XP_001065678.1| PREDICTED: polydom [Rattus norvegicus] Length = 3583 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 45/132 (34%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + + T T A A + L + +E+S K + ITDG ++G Sbjct: 154 LSREIPAITYRGGGTYTMGAFQQAAQILRHSRENST---------KVIFLITDGYSNGG- 203 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R+ G++I++ L + + ++ E Sbjct: 204 ------DPRPIAASLRDFGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 252 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 253 -FEALARRALHE 263 >gi|75812635|ref|YP_320253.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75705391|gb|ABA25064.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 592 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 25/143 (17%), Positives = 50/143 (34%), Gaps = 20/143 (13%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TPL++N + ++S + L+ T + A EL L + ++ Sbjct: 98 QTATPLTDNADTLESAIASLSEAGGTPMAQGLDAAIGELQATF-----------LSRNIL 146 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 TDG + + R+ + + +AV+ + L + T F Sbjct: 147 LFTDGVPD------SQALASLSAQSARSQRINL--IAVATGDADTNYLAQLTADPSLVFY 198 Query: 370 VNDSRELLESFDKITDKIQEQSV 392 +S + ++F I +Q V Sbjct: 199 A-NSGQFDQAFRNAEAAIYKQLV 220 >gi|27370456|ref|NP_766532.1| integrin alpha-E isoform 2 [Mus musculus] gi|26334103|dbj|BAC30769.1| unnamed protein product [Mus musculus] gi|56206399|emb|CAI24787.1| integrin, alpha E, epithelial-associated [Mus musculus] Length = 1038 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 49/155 (31%), Gaps = 16/155 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y I S ++N +++ + E T T AM H ++ S Sbjct: 236 LVQYGAVIQTEFDLQESRDINASLAKVQSIVQVKEVTKTASAMQHVLDNIFIPSRGSRKK 295 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQD 355 K ++ +TDG+ + + LN + + G+ +++ V Sbjct: 296 A-----LKVMVVLTDGD-----IFGDPLNLTTVINSPKMQGVVRFAIGVGDAFKNNNTYR 345 Query: 356 LLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L+ F V + L K+ I Sbjct: 346 ELKLIASDPKEAHTFKVTNYSALDGLLSKLQQHIV 380 >gi|318058741|ref|ZP_07977464.1| hypothetical protein SSA3_12405 [Streptomyces sp. SA3_actG] Length = 418 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 22/139 (15%), Positives = 51/139 (36%), Gaps = 20/139 (14%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + E K+ + L+P T PA+ A +L K ++ ITDGE Sbjct: 107 DRTEAKAAVATLSPTGWTPIGPALLGAADDL-----------RGGEASKRIVLITDGE-- 153 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSR 374 + ++ + G+ + ++ + + +D L + + G + V ++ Sbjct: 154 ----DTCHRDPCEVAREIAAKGVHLVVDTLGLVPDAKTRDQLSCIAEATGGTYTTVRHTK 209 Query: 375 ELLESFDKITDKIQEQSVR 393 +L ++ + + V Sbjct: 210 DLSGRVKQLVHRAADPVVT 228 >gi|226310168|ref|YP_002770062.1| hypothetical protein BBR47_05810 [Brevibacillus brevis NBRC 100599] gi|226093116|dbj|BAH41558.1| hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 437 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 25/265 (9%), Positives = 75/265 (28%), Gaps = 33/265 (12%) Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + +L + + L + + + S + ++ Sbjct: 86 NDAYLDLYWNQLLTLFAEDYLSPQMVVDRWRMASYGSPDIEDARFQFRENFNVEIILDAS 145 Query: 200 APAPAPA--NRKIDVLIESAGNLVN--------SIQKAIQEKKNLSVRIGTIAYNIGIVG 249 K+ + E+ S++ + N Sbjct: 146 GSMAGKIGDKTKMQLAKEAIQEFAEALPEDARISLRVYGHKGSNADEHKQL----SCGSS 201 Query: 250 NQCTPL-SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 PL + + ++ L+ P T+ ++ A +L + + + Sbjct: 202 EMVYPLQAYDAKRLEQALDMFEPTGWTSIAHSLRLAQEDL--------AGFEADKNTNVI 253 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCT-DSSG 365 ++DG + N + + + + + + + + EGQ L++ S G Sbjct: 254 YLVSDGIET------CDGNPVAVAKELSQSKIMPLLNVIGFDVNAEGQKQLKEIAHASEG 307 Query: 366 QFFAVNDSRELLESFDKITDKIQEQ 390 + V + + + ++ +I ++ Sbjct: 308 LYANVTNREQFKQELER-AKEIAQK 331 >gi|212715236|ref|ZP_03323364.1| hypothetical protein BIFCAT_00127 [Bifidobacterium catenulatum DSM 16992] gi|212661917|gb|EEB22492.1| hypothetical protein BIFCAT_00127 [Bifidobacterium catenulatum DSM 16992] Length = 1192 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 45/365 (12%), Positives = 99/365 (27%), Gaps = 32/365 (8%) Query: 43 SIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQ 102 ++ + + ++ G+ A T + + Sbjct: 386 QFARVQSDNFTDDTRYTVRELNSSGYTVSANGSPMTQQGGGNDAYAETGPFTVGTTSHVT 445 Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 + + + G L+ + I +V+D S SM+ Sbjct: 446 IVNSNVKPSNNKSIVKTGGGDGDQYTLYLTASGDSTSSTVTTTTPADIVLVMDKSGSMKG 505 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + N + K L S ++ + A + K + +++ Sbjct: 506 ELDNNAKEAANALAKKLL------TDKNSTLPSEQQVQMAVVTFSTKATIEQNFTTDVLK 559 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYP 279 + + N Q LS + VK + L +P T++Y Sbjct: 560 INNAVEGDPDGGT--------NWEAALKQANILS-GRSNVKKHIIFLSDGDPTFRTSSYG 610 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFV------------IFITDGENSGASAYQNTLN 327 +++ Y + + T + K+ + DG + S+ N Sbjct: 611 GSCYSHWGPYYTPQPKYTTKEACTAAKYDWLDENPDDKYDGVPGYDGVHGSGSSDGYGYN 670 Query: 328 TLQICEYMRNAG-MKIYSVAVSAPPEG-QDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + G +Y V S + D + G+ F + L ++F++I Sbjct: 671 YQAALAEAKGRGDAALYVVKTSTEAKKMADFAEQAGAVDGKEFDGTNPENLTKAFNQIYS 730 Query: 386 KIQEQ 390 I Sbjct: 731 SITSS 735 >gi|260810969|ref|XP_002600195.1| hypothetical protein BRAFLDRAFT_66700 [Branchiostoma floridae] gi|229285481|gb|EEN56207.1| hypothetical protein BRAFLDRAFT_66700 [Branchiostoma floridae] Length = 323 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 23/161 (14%), Positives = 59/161 (36%), Gaps = 26/161 (16%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL----------NKLNPYEN--TNTYP 279 + ++ + ++ + E+K + N +N T+T+ Sbjct: 153 YSPYHQVALMTFHSTPTKQFDFNDHGSYAELKEAILAVPYEVLMTNFVNKQGGPRTDTHE 212 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A+ +A ++ + + V+ +TDG+ + T+Q E +RN+G Sbjct: 213 ALDYARTTMFTSRTGLR-----PGSLREVLLLTDGQPNED------DLTVQAAERLRNSG 261 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLE 378 + ++++ + A + L + F +N +L + Sbjct: 262 ITVFALGI-ADGVDNEHLEQLVSDPEYKHIFHLNTFEDLAD 301 >gi|194211467|ref|XP_001916467.1| PREDICTED: similar to collagen, type VI, alpha 3 [Equus caballus] Length = 3165 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 32/314 (10%), Positives = 88/314 (28%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + +A A+ + N+F G+ + L Sbjct: 125 MQTHLTQAAGSRASDGVPQVIVVLTDGHSEDGLALPTAELKSADVNVFAIGVEDADEGAL 184 Query: 132 SLRSTGIIE--RSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + + S+ ++ S + + T F Sbjct: 185 KEIASEPLNMHVFNLENFTSLHDIVGNLVSCVHSSVTPGRAGDTGTLKDITAQDSADIIF 244 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + V+ + NL+ + +R+G + Y+ Sbjct: 245 LIDGSNNTGSV--------NFAVIRDFLVNLLERLS-----IGTQQIRVGVVQYSDEPRT 291 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + +V + L N A+ + + + + + Sbjct: 292 MFSLDTYSTKAQVLDAVKALGFTGGELANVGLALDFVVENHFT---RAGGSRVEEGVPQV 348 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G +S ++ + L+ ++S + A + L++ + Sbjct: 349 LVLISAGPSSD--EIRDGVVALKQAS--------VFSFGLGAQAASKAELQQIATNDNMV 398 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 399 FTVPEFRSFGDLQE 412 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/301 (11%), Positives = 86/301 (28%), Gaps = 35/301 (11%) Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 S + + P NL G++P + + I L ++ Sbjct: 947 VAGRSSDRVDTPALNLKQSGVVPFIFQAKNADPAELELIVPSPAFILTAESLPKIGDLQP 1006 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + + P + S+ + P +L E +V Sbjct: 1007 QIVNLL-----KSVQNGAPTPVSGEKDVVFLIDGSEGVRSGFP------LLKEFVQRVVE 1055 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-T-NTYPA 280 S+ + VR+ + Y+ + V + +L T NT A Sbjct: 1056 SL-----DVGPDRVRVAVVQYSDRTRPEFYLNSYMDQQSVVGAIRRLTLLGGPTPNTGAA 1110 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 ++ R + S + + + +I +T + + + ++ G Sbjct: 1111 LNFVLRNIL---IRSAGSRIEEGVPQLLIVLTAERSGDDVRGPSVV--------LKREGA 1159 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE----QSVRIAP 396 + + ++ + A+ R+L I++++ + + R+ P Sbjct: 1160 V--PIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQVISERVTQLSREELSRLKP 1217 Query: 397 N 397 + Sbjct: 1218 D 1218 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 59/173 (34%), Gaps = 18/173 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKN--LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + E+ +N Sbjct: 1647 INFRRDSFQEVLRFVSEIVDTLYEGGDSIQVGLVQYNSDPTDEFFLKDFSTKQEIIDAIN 1706 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H N + R+ + IT G + Sbjct: 1707 KVVYKGGRHANTKVGIEHLR---LNHFVPEAGSRLDQRVPQIAFVITGG---------KS 1754 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + Q + G+K+++V V + + K +S F V + +EL E Sbjct: 1755 VEDAQEASALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1805 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 56/152 (36%), Gaps = 15/152 (9%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNE 291 VRIG + ++ + + V + +L + NT A+ H R L+ Sbjct: 1470 NKVRIGVVQFSNEVFPEFYLKTYKSRTAVLDAIRRLRFKGGSPLNTGKALEHVARNLF-- 1527 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 S + + + ++ G++ + + + + ++G+ S+ V Sbjct: 1528 -VKSAGSRIEDGVPQHLVLFLGGKSQDDISRYS--------QVISSSGIV--SLGVGDRN 1576 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + L+ T+ F V + REL +KI Sbjct: 1577 IDRTELQTITNDPRLVFTVREFRELPNIEEKI 1608 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 16/172 (9%), Positives = 51/172 (29%), Gaps = 20/172 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + A + + + ++ ++ +++ Y + + Sbjct: 449 VDGSSALGQNNFNAIRDFIARVIQRLE-----IGQDLIQVAVAQYADTVRPEFYFNSYPS 503 Query: 259 LNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 EV + + K+ E T A+ L+ + + + K ++ IT G++ Sbjct: 504 KREVVNAVRKMKSLEGPALYTGSALDFVRNNLFTS---AAGYRAAEGVPKILVLITGGKS 560 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 + + ++ G+ +++ + L + F Sbjct: 561 LDGISQP--------AQELKRNGIMAFAIG--NKAADKAELEEIAFDPTLVF 602 >gi|219804724|ref|NP_001137337.1| collagen alpha-1(VI) chain [Bos taurus] gi|296490826|gb|DAA32939.1| collagen, type VI, alpha 1 [Bos taurus] Length = 1027 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 34/229 (14%), Positives = 77/229 (33%), Gaps = 19/229 (8%) Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 ++ + S P F+ + T A P +D + ++++ Sbjct: 16 WASAQDDPVASRAIAFQDCPVDLFFVLD-TSESVALRLKPYGALVDKVKSFTKRFIDNLN 74 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNEVKSRLNKLNPYE-NTNTYPAM 281 + + ++ A + L+ + +E+KS ++ + + T T A+ Sbjct: 75 D-RYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKSSVDAVKYFGKGTYTDCAI 133 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 EL SH K+++ +TDG ++ G+K Sbjct: 134 KKGLEELLVG--GSHLKEN-----KYLVVVTDGHPLEGYKEPCG-GLEDAVNEAKHLGIK 185 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQF---FAVNDSRELLESFDKITDKI 387 ++SVA+ P + L + + F D + ++ + I+ I Sbjct: 186 VFSVAI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGQSRDAEEVISQTI 232 Score = 44.1 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 41/330 (12%), Positives = 101/330 (30%), Gaps = 27/330 (8%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 NQMQ +D + R +D + + Q + R Sbjct: 666 SHNQMQEHVD-------LRDPNIRNAQDLKEAIKKLQWMGGGTFTGEALQYTRSRLLPPT 718 Query: 85 IAQKAQINITKDKNNPLQYI--AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERS 142 + + IT +++ + +I ++ +K + A + L + Sbjct: 719 PNNRIALVITDGRSDTQRDTTPLSVLCGPDIQVVSVGIKDVFGLAAGSDQLNVISCQGLA 778 Query: 143 SENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA 202 + + + + ++D + N + +K +F S + Sbjct: 779 PQGRPGISLVKENYAELLDDGF--LKNITAQICIDKKCPDYSCPITFSSPADITILLDGS 836 Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNL 259 + + D+ A L A + VR+ + Y+ N Sbjct: 837 ASVGSHNFDITKRFAKRLAERFLTASRTDPGQDVRVAVVQYSGTGQQRPERAALQFLQNY 896 Query: 260 NEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + + ++ ++ + T+ A+ + R ++ KK ++ +DG + G Sbjct: 897 TVLANTVDSMDFFNDATDVMDALGYVTRFYREASSNAA--------KKRLLLFSDGNSQG 948 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + AG++I++V V Sbjct: 949 ----ATPAAIEKAVQEAQRAGVEIFAVVVG 974 >gi|297287373|ref|XP_001118050.2| PREDICTED: hypothetical protein LOC721855 [Macaca mulatta] Length = 2077 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 35/223 (15%), Positives = 77/223 (34%), Gaps = 19/223 (8%) Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + L P P F+ + T A P +D + +++++ + Sbjct: 1089 SAKAPLLRLRPDCPVDLFFVLD-TSESVALRLKPYGALVDKVKSFTKRFIDNLRD-RYYR 1146 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLS---NNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRE 287 + ++ A + L+ + + +KSR++ + + T T A+ + Sbjct: 1147 CDRNLVWNAGALHYSDDVEIIQGLTRMPGDRDTLKSRVDAIKYFGKGTYTDCAIKKGLEQ 1206 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L SH K++I +TDG ++ G+K++SVA+ Sbjct: 1207 LLVG--GSHLKEN-----KYLIVVTDGHPLEGYKEPCG-GLEDAVNEAKHLGVKVFSVAI 1258 Query: 348 SAPPEGQDLLRKCTDSSGQF---FAVNDSRELLESFDKITDKI 387 P + L + + F D + ++ + I+ I Sbjct: 1259 -TPDHLEPRLSIIA-TDHTYRRNFTAADWGQSRDAEEVISQTI 1299 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 62/222 (27%), Gaps = 25/222 (11%) Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 N + +K +F S + + + D A L A Sbjct: 1852 KNVTAQICIDKKCPDYTCPITFSSPADITILLDSSASVGSHNFDTTKRFAKRLAERFLMA 1911 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHH 283 + VR+ + Y+ N + S ++ ++ + T+ A+ + Sbjct: 1912 GRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGY 1971 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 R + KK ++ +DG + G + + + AG++I+ Sbjct: 1972 VTRFYREASSGAA--------KKRLLLFSDGNSQG----ATPAAIEKAVQEAQRAGIEIF 2019 Query: 344 SVAVSAPPE---------GQDLLRKCTDSSGQFFAVNDSREL 376 V V G+ F V + L Sbjct: 2020 VVVVGRQVNEPHIRVLVTGKTAEYDVAYGERHLFRVPSYQAL 2061 >gi|315186712|gb|EFU20470.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 332 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 44/152 (28%), Gaps = 38/152 (25%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + + T + + L + V+ +TDG+N+ Sbjct: 160 AVRLFSLGDGTALGMGVGISLLHLSRVN----------ASFRAVVILTDGKNTTG----- 204 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAP----------------------PEGQDLLRKCTD 362 + E ++ + +++V V + ++ LR+ + Sbjct: 205 EILPETAAEMAKDLDIPVFTVGVGSDLPVSLDVIDPSTGTRYAGVLEEGYDEETLRRMAE 264 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S GQFF+ L F I R Sbjct: 265 MSGGQFFSGYTPTSLHRIFQYIGATATADVRR 296 >gi|126314598|ref|XP_001371349.1| PREDICTED: similar to SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily c, member 2 [Monodelphis domestica] Length = 3274 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 17/145 (11%), Positives = 48/145 (33%), Gaps = 15/145 (10%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKE 293 +++G + Y+ G + +V + L TN A+ + + Sbjct: 277 IQVGVVQYSNGPRTAFSLNSYSTKADVLDAVKALRVIGGEETNVGAALDFVVENHFTQ-- 334 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + + + + ++ I+ +++ ++ + L+ I+S + A Sbjct: 335 -AGGSRVEEGVPQVLVLISGSQSTDD--IRDGVVALKQAS--------IFSFGLGAQGAD 383 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLE 378 + L+ F + R L + Sbjct: 384 RAELQHIATDENFVFTTPEFRSLGD 408 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 35/311 (11%), Positives = 83/311 (26%), Gaps = 27/311 (8%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 T + ++ + Q+ + + I + + + F Sbjct: 317 ETNVGAALDFVVENHFTQAGGSRVEEGVPQVLVLISGSQSTDDIRDGVVALKQASIFSFG 376 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 G + L +T + S + D+ + + + + Sbjct: 377 LGAQGADRAELQHIATDENFVFTTPEFRS---LGDLREYLLPYIVGVAQRTIVLQPPTIV 433 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 A N D L + L E +++ Sbjct: 434 TQVIEVNKRDIVFLIDGTSTRGSANFNAIRDFLYKVVQKL---------EIGQDLIQVAV 484 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNT 298 Y + + +V S + +L P T T A+ + +N S Sbjct: 485 AQYADTVKLGFHFKDFQSRRDVLSAIRRLRPINGTVLYTGAALDYVRNNFFN---GSVGY 541 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 S + K ++ IT G++ +Y + ++ +G+ +++ +P + L Sbjct: 542 RASEGVPKLLVLITGGKSLDDISYP--------AQELKRSGIMAFAIG--SPMADRTELE 591 Query: 359 KCTDSSGQFFA 369 + F Sbjct: 592 EVAFDPSLVFE 602 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 22/164 (13%), Positives = 62/164 (37%), Gaps = 16/164 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRE 287 + + R+ I ++ ++ ++V+ + +L P N A+ + + Sbjct: 1261 DVGFDTTRVAVIQFSEDPRVEFLLNAHSSKDDVQVAVRRLRPKGGRQINVGSALDYVSKN 1320 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 ++ + + +F++ I+ G +++ + +++ G+ ++VA Sbjct: 1321 IF---RRPLGSRLEEGVPQFLVLISSG--------RSSDDVEDAAAHIKETGVAPFTVA- 1368 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + L + + S F+V+ REL K+ I + Sbjct: 1369 --RNVDPEELVRISLSPEYVFSVSTFRELPSLEQKLLTPITTLT 1410 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 16/179 (8%), Positives = 50/179 (27%), Gaps = 16/179 (8%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + + +++ + R + +N EV S + + + Sbjct: 55 VREFLYDVIESLAVAGSDF-RFALVQFNGNPHTEFLLNTYRTNQEVLSHIANMTYLGGDS 113 Query: 277 -TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T + + + + + + + ++ +TDG + A + Sbjct: 114 KTGRGLRYVIQNHLT---PAAGSRARDGVPQVIVVLTDGRSQDDVAGPSAELKSAAA--- 167 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSV 392 + VA+ + L++ F + + L + + ++ V Sbjct: 168 ------VDVVAIGVQDAEEGELKEMATEPLDLHVFNLENFTALHDIVGNLVACVRSSMV 220 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 59/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKN--LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ ++ ++ + + S+++G + YN + E+ +N Sbjct: 1645 INFKRDNFQEVLRFVSGIVDTIYEGGDSIQVGLVQYNSDPTDEFFLKDFSTKEEILDAIN 1704 Query: 268 KLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ NT + H N + R+ + IT G + Sbjct: 1705 KVVYKGGRQANTLVGLEHLR---KNHFVPEAGSRIDQRIPQIAFVITGG--------SSV 1753 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + +S F V++ +EL E Sbjct: 1754 EDVEEATRALSQKGVKVFAVGV--RNVDLREVSRIASNSAIAFRVSNVQELSE 1804 >gi|291448990|ref|ZP_06588380.1| secreted protein [Streptomyces roseosporus NRRL 15998] gi|291351937|gb|EFE78841.1| secreted protein [Streptomyces roseosporus NRRL 15998] Length = 420 Score = 56.1 bits (133), Expect = 9e-06, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 67/196 (34%), Gaps = 26/196 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ ++ ++++++ + + + PL + Sbjct: 52 GQSRMAAAKQAFNDVLDAVPEQVQLGIRTLGADYPGEDRKVGCKDTRQLYPVGPL--DRT 109 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 E K+ + L P T PA+ A +L + + ++ ITDGE++ Sbjct: 110 EAKTAVATLAPTGFTPIGPALLGAADDLEGGEG-----------SRRIVLITDGEDTCG- 157 Query: 321 AYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 L+ ++ + G + ++ + + + L + + G + AV EL Sbjct: 158 ----PLDPCEVAREIAARGTHLVVDTLGLVPNAKIRQQLTCIAEATGGTYTAVQHKEELS 213 Query: 378 ESFDKITDKIQEQSVR 393 ++ D+ E V Sbjct: 214 GRVKQLVDRAAEPVVT 229 >gi|330830099|ref|YP_004393051.1| RTX toxin-like protein [Aeromonas veronii B565] gi|328805235|gb|AEB50434.1| RTX toxin-like protein [Aeromonas veronii B565] Length = 1553 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 36/367 (9%), Positives = 95/367 (25%), Gaps = 30/367 (8%) Query: 27 NQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIA 86 + +D V + ++ + + +T + I Sbjct: 994 TNLTQVIDGNVTANDSAGADGFATPKLVSVEYNGTTYTFNNSNTSFSIALGTNKGTLYIE 1053 Query: 87 QKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENL 146 T E + + + L + + + Sbjct: 1054 NDGDYRFTPPSGGAEGAPVEVTYKIKDGDGDTSSAKLTIINPVLVVGSNANDTGSGASTA 1113 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 +D L + +L + + + Sbjct: 1114 -------IDDHVRPNPLLAPDVDGAIVGGVGADVLIGDVGGVTSGSYNLTFMIDMSGSIS 1166 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 + ++ ++ NL+ Q +++ + +++ + + Sbjct: 1167 GTEFQLMKDAINNLLAKFSGISQ------LQVEIGTFADNS---NVVGTYSSVTAAQQAV 1217 Query: 267 NKL-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + L TN A+ + + + H K+V F+TDGE + S +T Sbjct: 1218 SNLTRSGGGTNYQAALTTLNTMMTVDPVADH---------KYVYFLTDGEPTVGSWTNST 1268 Query: 326 --LNTLQICEYMRNAGMKIYSVAVSAPPEGQ--DLLRKCTDSSGQFFAVNDSRELLESFD 381 N + + G+ I +V + P + L ++ + AV++ +L Sbjct: 1269 QIANGMAALNALTAPGVVINAVGIGVPSGASFGNNLNAIDNTPDNYLAVDNFDDLSSGLG 1328 Query: 382 KITDKIQ 388 + + Sbjct: 1329 SLFTAVS 1335 >gi|325286052|ref|YP_004261842.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321506|gb|ADY28971.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 348 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 61/188 (32%), Gaps = 50/188 (26%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYN 290 S RIG IAY P++ + K L LN T A+ A + Sbjct: 128 SDRIGIIAYAGQAYPQ--LPITTDYGAAKMFLQGLNTNMLSSQGTAINQALDLASTYYDD 185 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 +++++ + + I+DGE + +T E + G+KI+++ V Sbjct: 186 DEQTN----------RVLFIISDGE------DHSEGSTEGAVEKAVDQGIKIFTIGVGTE 229 Query: 351 PE---------------------------GQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 + +L D GQF ++ + + + Sbjct: 230 KGAPIPIKRNGVLQSLKKDLQGEVVITKLNKSVLEDIADEGNGQFIDGTNTNDAVTIIKE 289 Query: 383 ITDKIQEQ 390 +K+ + Sbjct: 290 ELNKMDKT 297 >gi|241672093|ref|XP_002411438.1| neurogenic locus notch, putative [Ixodes scapularis] gi|215504089|gb|EEC13583.1| neurogenic locus notch, putative [Ixodes scapularis] Length = 1597 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/124 (17%), Positives = 47/124 (37%), Gaps = 18/124 (14%) Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N+ + LN++ T T A+ +A L+N ++ + + V+ I+DG+ Sbjct: 105 TNMCKFVHELNQIPYRSGGTRTREALGYAGEILWNARQEA---------NRIVVLISDGQ 155 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + S +I +R G+ I+ V +D L S + + + Sbjct: 156 ANSGS------EPSEIARLLRVKGIVIF--GVGVAHINKDELLDVASSPAHTYMLRNFEY 207 Query: 376 LLES 379 + + Sbjct: 208 IKKV 211 >gi|220921017|ref|YP_002496318.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] gi|219945623|gb|ACL56015.1| von Willebrand factor type A [Methylobacterium nodulans ORS 2060] Length = 324 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 24/151 (15%), Positives = 49/151 (32%), Gaps = 29/151 (19%) Query: 247 IVGNQCTPLSNNLNEVKSRLN--KLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 + LS + V L+ ++ +T + A + L Sbjct: 148 DQADVAASLSFDTASVAHALDEAQIGLVGRSTGIGDGLGLALKRLDATPARE-------- 199 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA----------PPEG 353 K V+ ++DG N+ + R G++++++A+ Sbjct: 200 --KVVVLLSDGANNAGQTT-----PHDVAALARELGIRVHTIALGPRDLSDAEGDPDVVD 252 Query: 354 QDLLRKCTDS-SGQFFAVNDSRELLESFDKI 383 + LR + G+FF V + +L D I Sbjct: 253 TEALRDVATTSGGRFFRVRTTDDLAAVADSI 283 >gi|258624850|ref|ZP_05719778.1| conserved hypothetical protein [Vibrio mimicus VM603] gi|258582848|gb|EEW07669.1| conserved hypothetical protein [Vibrio mimicus VM603] Length = 128 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/96 (18%), Positives = 34/96 (35%), Gaps = 23/96 (23%) Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP----------------- 350 +I ++DG N+ L+ L+ + IY+V V A Sbjct: 1 MILLSDGSNTAGV-----LDPLEAANIAKQYQTTIYTVGVGAGEMIVKDFLFSRKVNTAQ 55 Query: 351 PEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITD 385 + L+ + GQ+F + ++L +D I Sbjct: 56 DLDEKTLQTIASTTGGQYFRARNQQDLQSIYDTINQ 91 >gi|281417929|ref|ZP_06248949.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum JW20] gi|281409331|gb|EFB39589.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum JW20] Length = 1050 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 40/290 (13%), Positives = 78/290 (26%), Gaps = 30/290 (10%) Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ S+ N S + + N N + L Sbjct: 646 SIGGNIEFQTSNSTINGIAYAPGNPANPNSGKIFFSGDKNTINGSIAANELDF---FAGG 702 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 N T+ ++ K L N + K K ++V + + N Sbjct: 703 LVVNHTEGQFDTVEEKYIDKSTYLKLVKDAAKNFVDKFAGSKTKMAVIQYSDSANDNDFK 762 Query: 250 NQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKESSHNTI-------- 299 L + +K ++K+ P +N M AY L + + Sbjct: 763 KYDLSLPDKGAALKETIDKIKPGTSGLSNMGDGMRRAYHILNDPPPKGQISKYIVVITGS 822 Query: 300 ---------GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 K D + +Y + + + + G+ + + S Sbjct: 823 VPNRWTAVDNKKNEPKTDNGRADFIKADNESYNSLDYAKDMGRIITSKGINLVFIDFSEE 882 Query: 351 PEGQDLLRKCTDSSG--------QFFAVNDSRELLESFDKITDKIQEQSV 392 G L +S ++ N+ ELL+ + +T KI V Sbjct: 883 DIGDVLEEIAAESGAKPLEGTDRHYYKANNFLELLDILNNMTLKIYYDVV 932 >gi|77465284|ref|YP_354787.1| von Willebrand factor domain-containing protein [Rhodobacter sphaeroides 2.4.1] gi|77389702|gb|ABA80886.1| Von Willebrand domain containing protein [Rhodobacter sphaeroides 2.4.1] Length = 328 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/156 (14%), Positives = 54/156 (34%), Gaps = 31/156 (19%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYE-NTNTYPAMHHAYRELYNEKE 293 RIG + + PL+ +L V + + +T + A + + Sbjct: 133 RIGLVLFANRAYVAA--PLTFDLAAVGRAIEEASIGITGRSTAIADGLGLALKSVTESSA 190 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-- 351 +S + ++ ++DG+++ ++ Q+ G++I+++A+ Sbjct: 191 AS----------RVIVLLSDGQDNAHQ-----IDARQVAGLAARHGVRIHTIALGPDDLE 235 Query: 352 --------EGQDLLRKCTD-SSGQFFAVNDSRELLE 378 LR + S G+ + V +L Sbjct: 236 TRPAARDAVDTATLRAIAEASGGRSYRVRGMEDLRA 271 >gi|116751108|ref|YP_847795.1| vault protein inter-alpha-trypsin subunit [Syntrophobacter fumaroxidans MPOB] gi|116700172|gb|ABK19360.1| Vault protein inter-alpha-trypsin domain protein [Syntrophobacter fumaroxidans MPOB] Length = 680 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 53/150 (35%), Gaps = 20/150 (13%) Query: 231 KKNLSVRIGTIAYNIGI---VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 K + + R + +N G N+ R+ ++ +T + + AYR Sbjct: 333 KMSANDRFRIVTFNTTAADFTGGYVPASPENVQTWMQRVKQIQAGGSTALFDGLDLAYRL 392 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L E+ + ++ +TDG + + L ++ +++++ + Sbjct: 393 LDGERTTG------------IVLVTDGVCNVGPTRHDEFLGL-----LKQHDVRLFTFVI 435 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + R +S G V++S ++ Sbjct: 436 GNSANQPLMDRLAKESGGFAMNVSESDDIA 465 >gi|298713908|emb|CBJ33776.1| conserved unknown protein [Ectocarpus siliculosus] Length = 977 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/113 (19%), Positives = 37/113 (32%), Gaps = 16/113 (14%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 +TN + L N + F+I ITDG+ + + Sbjct: 145 GDTNIEAGLSKGRELLANGTSTRT---------SFLILITDGDWNRGG------DPQIEA 189 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + R+ G +Y+V V LL F ++ EL + D+I Sbjct: 190 DAARDEGTIVYAVGVGPDVSEATLL-SIGGDLTNVFDASNFTELDNTLDEIVS 241 >gi|260814488|ref|XP_002601947.1| hypothetical protein BRAFLDRAFT_86431 [Branchiostoma floridae] gi|229287250|gb|EEN57959.1| hypothetical protein BRAFLDRAFT_86431 [Branchiostoma floridae] Length = 1386 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 58/162 (35%), Gaps = 21/162 (12%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNEKE 293 R+G + Y+ + S +N + +T T A+ A + Sbjct: 514 TRVGVLQYSTSSTLACNLGDHPDEASFVSAINTMTYQKGGSTYTGAALEFARQNA--AWR 571 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + + + +I +TDG++ + + ++++ V + Sbjct: 572 PAP-------VSRIMIVLTDGQSHDSVVAAAQALAAD--------QVTVFAIGVGSFDHS 616 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE-QSVRI 394 + LL ++ G F ++D + ++ +I + + ++R+ Sbjct: 617 E-LLEITSNKLGHVFELDDFNAMAQNITQIVRAVCKDTTIRL 657 >gi|149709406|ref|XP_001496048.1| PREDICTED: similar to calcium-activated chloride channel [Equus caballus] Length = 904 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 53/145 (36%), Gaps = 23/145 (15%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKESSHN 297 + ++ + N ++ + KL T+ + ++ + Sbjct: 349 MVTFDSSAEIQNNLTKITDDNAYQNIIAKLPQFAGGGTSICNGLKAGFQAI--------V 400 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 + +I +TDGE++ S+ E + +G I+S+A+ P L Sbjct: 401 YSNQSTSGSEIILLTDGEDNQMSSCF---------EEAKASGAIIHSIALG--PSAAKEL 449 Query: 358 RKCTD-SSGQFFAVN-DSRELLESF 380 ++ + G F+ N D L+++F Sbjct: 450 ETLSNMTGGLRFSANKDINGLIDAF 474 >gi|319426151|gb|ADV54225.1| Vault protein inter-alpha-trypsin domain protein [Shewanella putrefaciens 200] Length = 757 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 56/159 (35%), Gaps = 16/159 (10%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEV---KSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 I +N + T L N + + +++L T A++ A Sbjct: 413 FNIIEFNSELTQLSPTSLPANQTHLARARQFIHRLQADGGTEMALALNAAL------PRG 466 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + S++ + VIF+TDG A + + ++++V + + P Sbjct: 467 INRLSESSQSLRQVIFMTDGSVGNEQALFDLIRYQIGES-------RLFTVGIGSAPNSH 519 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + R G F + + E+ + ++ KIQ + Sbjct: 520 FMQRAAELGRGTFTYIGNVDEVEQKISQLLSKIQYPVLT 558 >gi|221119984|ref|XP_002166572.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 5008 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 66/176 (37%), Gaps = 20/176 (11%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT- 275 + + + V +G +++ G LS + +V+ ++ + +T Sbjct: 4364 IQQFMGDFISKVM--NDSKVSVGVVSF--GQFQQMDVQLSKDQQKVQLGISAMKYMGDTG 4419 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N A+ + +Y + + +++ I + + N +I + + Sbjct: 4420 NLTSALKFVDQNIYQTDQKR------SNVQQLCIIL-------GGIPHYSQNADEIADKL 4466 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 ++ G+++++V + + LL + G + L+ + + + +KI++ Sbjct: 4467 KSQGVEVFAVGIGKMFKTDTLLLQIASIPILGHTLFTDYENLLITATNSLYNKIKK 4522 >gi|119386037|ref|YP_917092.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119376632|gb|ABL71396.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 855 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 63/183 (34%), Gaps = 26/183 (14%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT-PLSNNLNEVKS 264 KI + + G++V+ ++ V G P E+ Sbjct: 44 GINKITIARDVVGDIVSDF---PADQNLGFVTYGHRERGQCADIETLVEPAPGTAAEIAG 100 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + LNP T A+ A + L + ++++ VI ++DG + Sbjct: 101 IVEGLNPRGMTPMTDAVVTAAQALRHTEQAAT-----------VILVSDGIET------C 143 Query: 325 TLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLR---KCTDSSGQFFAVNDSRELLES 379 + + AG+ + + E LL+ ++ G+F ++++EL E+ Sbjct: 144 NPDPCAAARALEEAGVDFTAHVIGFDVRGEADALLQMQCIAEETGGRFLTADNAQELNEA 203 Query: 380 FDK 382 + Sbjct: 204 LRE 206 >gi|167758708|ref|ZP_02430835.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] gi|167663904|gb|EDS08034.1| hypothetical protein CLOSCI_01050 [Clostridium scindens ATCC 35704] Length = 1865 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 54/186 (29%), Gaps = 45/186 (24%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 G S+N++ + ++ N T A+ A L G Sbjct: 1125 TDSGFYTLDTSDNVDAINRFISNKNASGGTPMGDALEEANSILS----------GRPNSS 1174 Query: 306 KFVIFITDGENSGASAYQNT-----LNTLQICEYMRNAGMKIYSVAVSAPPE-------- 352 K+ + TDG S+ + + + ++ K+Y++ Sbjct: 1175 KYALLFTDGMPGYNSSNNSFNCMVANHANNEAKEIKEY-AKLYTIGYKLSGSFKWEEGHS 1233 Query: 353 ---------------GQDLLRK-CTDSSG----QFFAVNDSRELLESFDKITDKIQEQSV 392 D L+ S + +++ L + F+ I +I + Sbjct: 1234 QDSTNNHGSHKTETKAADFLKNYLASSPEGDRTYAYTTDNTDGLTKIFEDIAGQIGDLY- 1292 Query: 393 RIAPNR 398 +I P + Sbjct: 1293 KINPEK 1298 >gi|221116226|ref|XP_002166419.1| PREDICTED: similar to collagen, type XXVIII [Hydra magnipapillata] Length = 680 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 29/245 (11%), Positives = 75/245 (30%), Gaps = 32/245 (13%) Query: 166 QKHNDNNNMTSN-KYLLPPPPKKSFWSKNTTKSKYAPAPAPAN-----RKIDVLIESAGN 219 K N + N++++ ++ + + + + + + + S Sbjct: 86 MKRNTSANISTSEPIDDDVTSEEVYSNGSVLLEERDKCSKTVDLLFLLDSSESVKYSNWK 145 Query: 220 LVNSIQKAIQEKKN-LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNT 277 +V K++ + + R+G I Y N+ + ++ + T T Sbjct: 146 IVIQFVKSLCNRFKLSTTRVGIIRYASDAEIALHLTRFNDTTSRDTAIDNIFYKTGGTRT 205 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-----ASAYQNTLNTLQIC 332 A+ A ++ + + +I +TDG + + Sbjct: 206 DIALKKAADVFQFSEQKN----------QVLILVTDGPTNSLEINKDHFVEGKDLVAGPV 255 Query: 333 EYMRNAGMKIYSVAVSAPPEGQD-------LLRKCTDSSG--QFFAVNDSRELLESFDKI 383 + +++AG+ I+ + + E D +R F + EL I Sbjct: 256 DRLKDAGVAIFCIGIVPDSETPDEIETMKEEMRVIASEPTKLHLFMSDGYHELQRKVHAI 315 Query: 384 TDKIQ 388 ++ Sbjct: 316 SEAAC 320 >gi|91082533|ref|XP_973629.1| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] gi|270007557|gb|EFA04005.1| hypothetical protein TcasGA2_TC014154 [Tribolium castaneum] Length = 824 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 18/139 (12%), Positives = 49/139 (35%), Gaps = 11/139 (7%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N+ K + + TN + + +E + + + ++F+TDG + Sbjct: 389 NIEAAKKIIEDKSRLGMTNMMYGLEVGLFLIKRTQEETPDK-----YQPMIVFLTDGHPN 443 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF----FAVNDS 373 + ++ + I+S++ + LRK + + F + +D+ Sbjct: 444 AGMSGRDEITNTVTSLNSGKKKASIFSLSFG-DFADKRFLRKISSKNSGFSRHIYESSDA 502 Query: 374 R-ELLESFDKITDKIQEQS 391 +L + + I+ + Sbjct: 503 SLQLQDFYRAISAPLLSNV 521 >gi|301762312|ref|XP_002916580.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Ailuropoda melanoleuca] Length = 3529 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 45/132 (34%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + + T T A A + L + +E+S K + ITDG ++G Sbjct: 111 LSREIPAITYRGGGTYTKGAFQQAAQILRHSRENST---------KVIFLITDGYSNGG- 160 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + + +R+ G++I++ L + + ++ E Sbjct: 161 ------DPRPVAASLRDFGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 209 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 210 -FEALARRALHE 220 >gi|281338501|gb|EFB14085.1| hypothetical protein PANDA_006133 [Ailuropoda melanoleuca] Length = 984 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 20/220 (9%), Positives = 65/220 (29%), Gaps = 20/220 (9%) Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 ++ N + P + S + + + + ++ S Sbjct: 88 YPASPWVDNSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVS 147 Query: 219 NLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 ++ ++ + + ++ +V N +K +N + T+ Sbjct: 148 EMLETLSDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAKGITDY 203 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ +L N S K ++ TDG +Y ++ Sbjct: 204 KKGFSFAFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFAKYNKD 249 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 +++++ +V + ++ ++ G ++ + + Sbjct: 250 KKVRVFTFSVGQHNYDRGPIQWMACENKGYYYEIPSIGAI 289 >gi|256425856|ref|YP_003126509.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256040764|gb|ACU64308.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 588 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 36/349 (10%), Positives = 98/349 (28%), Gaps = 24/349 (6%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + +V +DP + + ++ + +Y G A + T +N Sbjct: 86 AYSRMLVQMNAQEDPAEDAVKKAKAIARERSSNGSNPNYGNALMGTRAFFDETYGTLYEN 145 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + + + + + + + + + + N + V + Sbjct: 146 KFIAAETQIPSLFAVDVDRAAYSNIRRFVKLKERIPANAVRIEEMVNYFHYSYPLPPVGQ 205 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY----APAPAPANRKIDVLI 214 ++ + K + K+ +L Sbjct: 206 TLAIYSNYATCP-WAEDHRLLQIAVRGKSVNLDSLPPSNLVFLIDVSGSMAMPNKLPLLQ 264 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + LVN+++ + +AY + + +++ + ++ L+ Sbjct: 265 AAFRILVNNLRSND--------HVAIVAYAGVPGVILPSTPGSAKSKILNAIDYLSAGGA 316 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T A+ AY+ N VI TDG+ + + + L + + Sbjct: 317 TAGEAAIKLAYQIAEENFIKEGNNR--------VILATDGDFNVGQTSDHDMEQLILGK- 367 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 + G+ + + L + G F +++ E + F + Sbjct: 368 -KETGVLLTCLGFGMKNYKDSKLETLSSKGNGNFAYIDNLEEASKIFAR 415 >gi|156342098|ref|XP_001620878.1| hypothetical protein NEMVEDRAFT_v1g222619 [Nematostella vectensis] gi|156206294|gb|EDO28778.1| predicted protein [Nematostella vectensis] Length = 252 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/129 (17%), Positives = 48/129 (37%), Gaps = 12/129 (9%) Query: 257 NNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 ++ + +L ++ TNT A+ A++ K S V+ +TDG Sbjct: 102 SSRADAVRKLRGISRSGGKTNTQDALELAFQMFTTSKYGSTPGG-----LARVLVVTDGR 156 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSR 374 ++ ++ G++++ +A+ EG D L + ++ + V D + Sbjct: 157 SNIEKHRTERKAFK-----LKANGIEVFVIAIGDYLEGMDELARMANTKYAHMYRVEDVK 211 Query: 375 ELLESFDKI 383 L I Sbjct: 212 GLARVVKLI 220 >gi|162138242|gb|ABX82825.1| complement factor B [Sus scrofa] Length = 765 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/238 (13%), Positives = 71/238 (29%), Gaps = 30/238 (12%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR-----KIDVLIESAGNLVNSIQKAIQ 229 + +P +K + + S + + + + Sbjct: 246 DAEDGHIPGDQQKRKIVLDPSGSMNIYLVLDGSDSIGARNFTGAKNCLKDFIEKVASYGV 305 Query: 230 EKKNLSVRIGTIAYNIGIVG--NQCTPLSNNLNEVKSRLNKLNPYE-----NTNTYPAMH 282 + + G + Y P S + + V +L+K++ + TNT A+ Sbjct: 306 KP-----KYGLVTYATDPKVLIRVSNPKSADADWVTEQLDKISYDDHKLKAGTNTKKALL 360 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL-NTLQICEYMRNAGMK 341 Y + + + R + ++ +TDG ++ + + + RN Sbjct: 361 EVYNMMSWGVNNFPDNWN--RTRHVIVLLTDGLHNMGGDPVTVIHDIRDLLNIGRNRKNP 418 Query: 342 ------IYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDKIQEQ 390 IY V P Q+ + F + D L + F ++ D+ + Sbjct: 419 REDYLDIYVFGVG-PLVNQENINALASKKDKEQHVFKLKDVDNLEDVFFQMLDESRSL 475 >gi|156120152|ref|NP_001095294.1| complement factor B [Sus scrofa] gi|148724909|emb|CAN87697.1| B-factor, properdin [Sus scrofa] Length = 765 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/238 (13%), Positives = 71/238 (29%), Gaps = 30/238 (12%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR-----KIDVLIESAGNLVNSIQKAIQ 229 + +P +K + + S + + + + Sbjct: 246 DAEDGHIPGDQQKRKIVLDPSGSMNIYLVLDGSDSIGARNFTGAKNCLKDFIEKVASYGV 305 Query: 230 EKKNLSVRIGTIAYNIGIVG--NQCTPLSNNLNEVKSRLNKLNPYE-----NTNTYPAMH 282 + + G + Y P S + + V +L+K++ + TNT A+ Sbjct: 306 KP-----KYGLVTYATDPKVLIRVSNPKSADADWVTEQLDKISYDDHKLKAGTNTKKALL 360 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL-NTLQICEYMRNAGMK 341 Y + + + R + ++ +TDG ++ + + + RN Sbjct: 361 EVYNMMSWGVNNFPDNWN--RTRHVIVLLTDGLHNMGGDPVTVIHDIRDLLNIGRNRKNP 418 Query: 342 ------IYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDKIQEQ 390 IY V P Q+ + F + D L + F ++ D+ + Sbjct: 419 REDYLDIYVFGVG-PLVNQENINALASKKDKEQHVFKLKDVDNLEDVFFQMLDESRSL 475 >gi|148724910|emb|CAN87698.1| B-factor, properdin [Sus scrofa] Length = 549 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 31/238 (13%), Positives = 71/238 (29%), Gaps = 30/238 (12%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR-----KIDVLIESAGNLVNSIQKAIQ 229 + +P +K + + S + + + + Sbjct: 246 DAEDGHIPGDQQKRKIVLDPSGSMNIYLVLDGSDSIGARNFTGAKNCLKDFIEKVASYGV 305 Query: 230 EKKNLSVRIGTIAYNIGIVG--NQCTPLSNNLNEVKSRLNKLNPYE-----NTNTYPAMH 282 + + G + Y P S + + V +L+K++ + TNT A+ Sbjct: 306 KP-----KYGLVTYATDPKVLIRVSNPKSADADWVTEQLDKISYDDHKLKAGTNTKKALL 360 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL-NTLQICEYMRNAGMK 341 Y + + + R + ++ +TDG ++ + + + RN Sbjct: 361 EVYNMMSWGVNNFPDNWN--RTRHVIVLLTDGLHNMGGDPVTVIHDIRDLLNIGRNRKNP 418 Query: 342 ------IYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDKIQEQ 390 IY V P Q+ + F + D L + F ++ D+ + Sbjct: 419 REDYLDIYVFGVG-PLVNQENINALASKKDKEQHVFKLKDVDNLEDVFFQMLDESRSL 475 >gi|39937341|ref|NP_949617.1| hypothetical protein RPA4281 [Rhodopseudomonas palustris CGA009] gi|192293121|ref|YP_001993726.1| hypothetical protein Rpal_4760 [Rhodopseudomonas palustris TIE-1] gi|39651199|emb|CAE29722.1| conserved hypothetical protein [Rhodopseudomonas palustris CGA009] gi|192286870|gb|ACF03251.1| conserved hypothetical protein [Rhodopseudomonas palustris TIE-1] Length = 390 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 27/91 (29%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI + + A+D R +MQ+ALD+AVL+G I + Sbjct: 16 IFAIALIPLLGAVGSAVDYTIASNQRMKMQTALDSAVLAGVLEPTDAAKIARASAAFTAN 75 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQI 91 + + Q Sbjct: 76 FQPSWSAATASFTVNGGELGGSANSTVPTQF 106 >gi|317502373|ref|ZP_07960539.1| hypothetical protein HMPREF1026_02483 [Lachnospiraceae bacterium 8_1_57FAA] gi|316896246|gb|EFV18351.1| hypothetical protein HMPREF1026_02483 [Lachnospiraceae bacterium 8_1_57FAA] Length = 4107 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 66/209 (31%), Gaps = 50/209 (23%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + + N + IV + + N++++ S +N L T Sbjct: 261 VKFASDESDNIGNDFINNNY------NRSQIVTELKSYTTKNISDLTSTVNSLIAAGATR 314 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI--CEY 334 ++ A R +K V+F TDG+ + S + N++ I + Sbjct: 315 ADFGLNQAQRAFQLGGTREGA-------QKVVVFFTDGQPTSNSDWSNSVAAAAITNAKE 367 Query: 335 MRNAGMKIYSVAVSAPPEGQD----------LLRKCTD---------------------- 362 +++A IYS+ V D + + Sbjct: 368 LKDANALIYSIGVFRDANPNDTNTSAGNFNGYMHAVSSNYPDATATSSTRPNRYSCTLGK 427 Query: 363 ---SSGQFFAVNDSRELLESFDKITDKIQ 388 +S + A D+ EL F++I+ ++ Sbjct: 428 RTDNSDYYKAATDADELNNIFNEISSDLE 456 >gi|257785062|ref|YP_003180279.1| Cna B domain-containing protein [Atopobium parvulum DSM 20469] gi|257473569|gb|ACV51688.1| Cna B domain protein [Atopobium parvulum DSM 20469] Length = 863 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 34/248 (13%), Positives = 76/248 (30%), Gaps = 24/248 (9%) Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 ++ ++ + K + S ++D Sbjct: 59 TISPNTDGTYDLTLTIKGETSAASEEQKANVLVVFDNSSSMTAQTGGGEMRLDAAKRVVN 118 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 L ++I + + V + +++N + +LNE + N + + TN Sbjct: 119 QLSSTILGINRNAQKDVVEMALLSFNEKPNLE--CGWTADLNEFQRATNNMGFHTGTNWE 176 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN------TLQIC 332 A+ A + + + T +VIF+TDG + N L Sbjct: 177 SALERAKVLADQKAANGNPT--------YVIFVTDGLPTQDRNGWVRNNQIGYEHALDEA 228 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--------QFFAVNDSRELLESFDKIT 384 + +AG YSV + L + ++ N++ ++ ++F +I Sbjct: 229 RAIGSAGYHFYSVYMYGGHAYLRQLTNYAYTGNPFGNPGGTYYYEANNTAQMEQAFKEIA 288 Query: 385 DKIQEQSV 392 I + Sbjct: 289 SVITKSIT 296 >gi|254229333|ref|ZP_04922750.1| hypothetical protein VEx25_1585 [Vibrio sp. Ex25] gi|262393419|ref|YP_003285273.1| hypothetical protein VEA_002646 [Vibrio sp. Ex25] gi|151938141|gb|EDN56982.1| hypothetical protein VEx25_1585 [Vibrio sp. Ex25] gi|262337013|gb|ACY50808.1| hypothetical protein VEA_002646 [Vibrio sp. Ex25] Length = 422 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 22/230 (9%), Positives = 50/230 (21%), Gaps = 8/230 (3%) Query: 11 LFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFK---KQ 67 + IDL H + + ++Q+A+D A L+G + T T Sbjct: 30 GVAAFGIDLNHQVLNKTRLQNAVDTAALAGAVVADKTEDVNQAETAVRTTLGGISTEPGN 89 Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA 127 + + + D+ +I Sbjct: 90 SELTFSDSNTAVTFSHDMQTFVSAASFTPPLGEYDIYVRVAVT-DISLTQYLSGLFGIDK 148 Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP--- 184 + S + + I M D + + D + + + L Sbjct: 149 EVSASAVAGRSAAIAYTCNLTPIAMCGDPAGDVSDAWGYRPPGYDPNVDMDPSLVHELKV 208 Query: 185 -PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + + N ++ ++ N Sbjct: 209 GDQNNTDMGPGNFQLLDFGQGTGNSGAALVRDALSGAYNGCASIGDTVTT 258 >gi|125829720|ref|XP_698253.2| PREDICTED: collagen alpha-1(VI) chain [Danio rerio] Length = 1000 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 57/171 (33%), Gaps = 17/171 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC---TPLSNNLNEVKSRL 266 I+ + ++ ++ Q + V + A + L+ ++K+ + Sbjct: 56 INQIKTFTKLFIDELKDLRQ-PCDRDVTWNSGALHYSDDTELVMGLVDLNTKRADLKAAI 114 Query: 267 NKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +++ T T A+ EL H K+++ +TDG Sbjct: 115 DRIKYIGKGTYTDCAIKEGIAELLRAGSHYHEN-------KYIVVVTDGHPITGYKEPCG 167 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDS 373 + R +K+++VA+ +P + L F A ++S Sbjct: 168 -GIQEAANEARQHAIKVFAVAI-SPDQEDTRLSVIATDINYRQNFTAADNS 216 >gi|46395320|dbj|BAD16597.1| DEC-1 [Lymnaea stagnalis] Length = 919 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 33/356 (9%), Positives = 95/356 (26%), Gaps = 31/356 (8%) Query: 34 DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI 93 +A+V G + +++ + + L + + G A + + I Sbjct: 290 EASVAFGFDRYLDPISLQQAIRSVEYIGSATFTDKALSLALQVFTPGSYGARANENHVAI 349 Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 + N+ + G+ R + ++ + V Sbjct: 350 VLTDGLSTDRNKTLIEAARLRNANVTILGI----------RIGQYQIQELIDITGDVKKV 399 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 V L + S + +++ + Sbjct: 400 FSVEGFDAIHNLTYDITKAICKQVSNINCTQSYGD--IIFVMDSSSSITYPNYVKQLSFV 457 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 N + VR G + + + ++ +V+ + + Sbjct: 458 ANVTRNFL---------IGKDDVRYGALIFGSNVEKLFDLKKYDSPVDVEQHI--MEATY 506 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 + A + + +++ + G K +I +TDGE++ T Sbjct: 507 L-ASSTDTAAALQYILDQRMFADEQGGRPDAVKIIIVLTDGESTYPE------KTRAEAT 559 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 +++ G + S+ + + L ++ F + L + ++ + + Sbjct: 560 KLQSLGYHMMSIGIG-NEINELELNALASNTSNIFKAASYQVLDQLHKEVVTRACQ 614 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 55/186 (29%), Gaps = 16/186 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ID + + +V+ TI ++ + ++ + + Sbjct: 256 IDDYRLQLQFVADL--TQNFSIGRENVQFSTIVFSNEASVAFGFDRYLDPISLQQAIRSV 313 Query: 270 NPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + T T A+ A + + I +TDG ++ + T Sbjct: 314 EYIGSATFTDKALSLALQVFTPGSYGARANENH-----VAIVLTDGLSTDRN------KT 362 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 L +RNA + I + + L T + F+V + IT I Sbjct: 363 LIEAARLRNANVTI--LGIRIGQYQIQELIDITGDVKKVFSVEGFDAIHNLTYDITKAIC 420 Query: 389 EQSVRI 394 +Q I Sbjct: 421 KQVSNI 426 >gi|331089974|ref|ZP_08338865.1| hypothetical protein HMPREF1025_02448 [Lachnospiraceae bacterium 3_1_46FAA] gi|330403112|gb|EGG82675.1| hypothetical protein HMPREF1025_02448 [Lachnospiraceae bacterium 3_1_46FAA] Length = 4107 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 66/209 (31%), Gaps = 50/209 (23%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + + N + IV + + N++++ S +N L T Sbjct: 261 VKFASDESDNIGNDFINNNY------NRSQIVTELKSYTTKNISDLTSTVNSLIAAGATR 314 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI--CEY 334 ++ A R +K V+F TDG+ + S + N++ I + Sbjct: 315 ADFGLNQAQRAFQLGGTREGA-------QKVVVFFTDGQPTSNSDWSNSVAAAAITNAKE 367 Query: 335 MRNAGMKIYSVAVSAPPEGQD----------LLRKCTD---------------------- 362 +++A IYS+ V D + + Sbjct: 368 LKDANALIYSIGVFRDANPNDTNTSAGNFNGYMHAVSSNYPDATATSSTRPNRYSCTLGK 427 Query: 363 ---SSGQFFAVNDSRELLESFDKITDKIQ 388 +S + A D+ EL F++I+ ++ Sbjct: 428 RTDNSDYYKAATDADELNNIFNEISSDLE 456 >gi|309271626|ref|XP_003085374.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain [Mus musculus] Length = 3284 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L + +V S+ + VR+ + Y+ + V S Sbjct: 1036 GVRSGFPLLKDFVQRVVESL-----DVGPDRVRVALVQYSDRTRPEFYLNSHMDQQGVIS 1090 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S + + +I +T + Sbjct: 1091 AIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIA---EGVPQLLIVLTAEPSGDDVRG 1147 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ REL Sbjct: 1148 PSVV--------LKQGGAV--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQV 1197 Query: 383 ITDKIQEQ 390 I++++ + Sbjct: 1198 ISERVIQL 1205 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 59/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1645 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1704 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N + R+ + IT G+ + Sbjct: 1705 KVVYKGGRHANTRVGIEH---LLRNHFVPEAGSRLDERVPQIAFVITGGK--------SV 1753 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1754 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1804 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 34/299 (11%), Positives = 88/299 (29%), Gaps = 30/299 (10%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE--RSSENL 146 Q+ I A A+ + N+F G+ + L ++ + + Sbjct: 142 PQVIIVLTDGQSEDGFALPSAELKSADVNVFAVGVEGADERALGEVASEPLSMHVFNLEN 201 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 S+ ++ S + + + F + Sbjct: 202 VTSLHGLVGNLVSCIHSSVNPERAGDKESLKDITAQDSADIIFLID--------GSQNTG 253 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N DV+ + N++ + Q V++G + Y+ + + V + Sbjct: 254 NANFDVIRDFLVNVLERLSVGNQ-----QVQVGVVQYSEEPITMFSLNSYPSKAAVLDAV 308 Query: 267 NKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L+ N A+ + + + + + ++ I+ G +S ++ Sbjct: 309 KGLSLVGGESANIGQALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGPSSD--EIRD 363 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 ++ L+ ++S + A + L+ F V + R + ++I Sbjct: 364 SVVALKQAS--------VFSFGLGAQAASRAELQHIATDDSLVFTVPEFRSFGDLQEQI 414 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 60/181 (33%), Gaps = 20/181 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + A I + + +V + VRIG + ++ + + + Sbjct: 1440 SSDAVKPDGIAHIRDFVSRIVRRLN-----IGPSKVRIGVVQFSNDVFPEFYLKTHKSQS 1494 Query: 261 EVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V + +L + NT A+ R L+ S + + + ++ G++ Sbjct: 1495 SVLEAIRRLRFKGGSPLNTGRALEFVARNLF---VKSAGSRIEDGVPQHLVLFLGGKSQD 1551 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 A + + ++G+ S+ + + L+ T+ F V + REL Sbjct: 1552 DVARH--------AQVISSSGIV--SLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPN 1601 Query: 379 S 379 Sbjct: 1602 I 1602 >gi|309264114|ref|XP_003086228.1| PREDICTED: collagen alpha-3(VI) chain isoform 5 [Mus musculus] Length = 2349 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L + +V S+ + VR+ + Y+ + V S Sbjct: 830 GVRSGFPLLKDFVQRVVESL-----DVGPDRVRVALVQYSDRTRPEFYLNSHMDQQGVIS 884 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S + + +I +T + Sbjct: 885 AIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIA---EGVPQLLIVLTAEPSGDDVRG 941 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ REL Sbjct: 942 PSVV--------LKQGGAV--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQV 991 Query: 383 ITDKIQEQ 390 I++++ + Sbjct: 992 ISERVIQL 999 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 59/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1439 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1498 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N + R+ + IT G+ + Sbjct: 1499 KVVYKGGRHANTRVGIEH---LLRNHFVPEAGSRLDERVPQIAFVITGGK--------SV 1547 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1548 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1598 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 16/143 (11%), Positives = 45/143 (31%), Gaps = 15/143 (10%) Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIG 300 Y+ + + V + L+ N A+ + + + Sbjct: 79 YSEEPITMFSLNSYPSKAAVLDAVKGLSLVGGESANIGQALDFVVENHFT---RAGGSRV 135 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + ++ I+ G +S ++++ L+ ++S + A + L+ Sbjct: 136 EEGVPQVLVLISAGPSSD--EIRDSVVALKQAS--------VFSFGLGAQAASRAELQHI 185 Query: 361 TDSSGQFFAVNDSRELLESFDKI 383 F V + R + ++I Sbjct: 186 ATDDSLVFTVPEFRSFGDLQEQI 208 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 60/181 (33%), Gaps = 20/181 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + A I + + +V + VRIG + ++ + + + Sbjct: 1234 SSDAVKPDGIAHIRDFVSRIVRRLN-----IGPSKVRIGVVQFSNDVFPEFYLKTHKSQS 1288 Query: 261 EVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V + +L + NT A+ R L+ S + + + ++ G++ Sbjct: 1289 SVLEAIRRLRFKGGSPLNTGRALEFVARNLF---VKSAGSRIEDGVPQHLVLFLGGKSQD 1345 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 A + + ++G+ S+ + + L+ T+ F V + REL Sbjct: 1346 DVARH--------AQVISSSGIV--SLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPN 1395 Query: 379 S 379 Sbjct: 1396 I 1396 >gi|309264110|ref|XP_003086224.1| PREDICTED: collagen alpha-3(VI) chain isoform 1 [Mus musculus] Length = 3057 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L + +V S+ + VR+ + Y+ + V S Sbjct: 830 GVRSGFPLLKDFVQRVVESL-----DVGPDRVRVALVQYSDRTRPEFYLNSHMDQQGVIS 884 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S + + +I +T + Sbjct: 885 AIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIA---EGVPQLLIVLTAEPSGDDVRG 941 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ REL Sbjct: 942 PSVV--------LKQGGAV--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQV 991 Query: 383 ITDKIQEQ 390 I++++ + Sbjct: 992 ISERVIQL 999 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 59/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1439 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1498 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N + R+ + IT G+ + Sbjct: 1499 KVVYKGGRHANTRVGIEH---LLRNHFVPEAGSRLDERVPQIAFVITGGK--------SV 1547 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1548 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1598 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 16/143 (11%), Positives = 45/143 (31%), Gaps = 15/143 (10%) Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIG 300 Y+ + + V + L+ N A+ + + + Sbjct: 79 YSEEPITMFSLNSYPSKAAVLDAVKGLSLVGGESANIGQALDFVVENHFT---RAGGSRV 135 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + ++ I+ G +S ++++ L+ ++S + A + L+ Sbjct: 136 EEGVPQVLVLISAGPSSD--EIRDSVVALKQAS--------VFSFGLGAQAASRAELQHI 185 Query: 361 TDSSGQFFAVNDSRELLESFDKI 383 F V + R + ++I Sbjct: 186 ATDDSLVFTVPEFRSFGDLQEQI 208 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 60/181 (33%), Gaps = 20/181 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + A I + + +V + VRIG + ++ + + + Sbjct: 1234 SSDAVKPDGIAHIRDFVSRIVRRLN-----IGPSKVRIGVVQFSNDVFPEFYLKTHKSQS 1288 Query: 261 EVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V + +L + NT A+ R L+ S + + + ++ G++ Sbjct: 1289 SVLEAIRRLRFKGGSPLNTGRALEFVARNLF---VKSAGSRIEDGVPQHLVLFLGGKSQD 1345 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 A + + ++G+ S+ + + L+ T+ F V + REL Sbjct: 1346 DVARH--------AQVISSSGIV--SLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPN 1395 Query: 379 S 379 Sbjct: 1396 I 1396 >gi|309264106|ref|XP_003086227.1| PREDICTED: collagen alpha-3(VI) chain isoform 4 [Mus musculus] Length = 3062 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L + +V S+ + VR+ + Y+ + V S Sbjct: 830 GVRSGFPLLKDFVQRVVESL-----DVGPDRVRVALVQYSDRTRPEFYLNSHMDQQGVIS 884 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S + + +I +T + Sbjct: 885 AIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIA---EGVPQLLIVLTAEPSGDDVRG 941 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ REL Sbjct: 942 PSVV--------LKQGGAV--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQV 991 Query: 383 ITDKIQEQ 390 I++++ + Sbjct: 992 ISERVIQL 999 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 59/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1439 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1498 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N + R+ + IT G+ + Sbjct: 1499 KVVYKGGRHANTRVGIEH---LLRNHFVPEAGSRLDERVPQIAFVITGGK--------SV 1547 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1548 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1598 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 16/143 (11%), Positives = 45/143 (31%), Gaps = 15/143 (10%) Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIG 300 Y+ + + V + L+ N A+ + + + Sbjct: 79 YSEEPITMFSLNSYPSKAAVLDAVKGLSLVGGESANIGQALDFVVENHFT---RAGGSRV 135 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + ++ I+ G +S ++++ L+ ++S + A + L+ Sbjct: 136 EEGVPQVLVLISAGPSSD--EIRDSVVALKQAS--------VFSFGLGAQAASRAELQHI 185 Query: 361 TDSSGQFFAVNDSRELLESFDKI 383 F V + R + ++I Sbjct: 186 ATDDSLVFTVPEFRSFGDLQEQI 208 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 60/181 (33%), Gaps = 20/181 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + A I + + +V + VRIG + ++ + + + Sbjct: 1234 SSDAVKPDGIAHIRDFVSRIVRRLN-----IGPSKVRIGVVQFSNDVFPEFYLKTHKSQS 1288 Query: 261 EVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V + +L + NT A+ R L+ S + + + ++ G++ Sbjct: 1289 SVLEAIRRLRFKGGSPLNTGRALEFVARNLF---VKSAGSRIEDGVPQHLVLFLGGKSQD 1345 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 A + + ++G+ S+ + + L+ T+ F V + REL Sbjct: 1346 DVARH--------AQVISSSGIV--SLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPN 1395 Query: 379 S 379 Sbjct: 1396 I 1396 >gi|11414928|dbj|BAB18556.1| voltage dependent calcium channel alpha2c/delta subunit [Rana catesbeiana] Length = 1085 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/220 (10%), Positives = 66/220 (30%), Gaps = 20/220 (9%) Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 ++ K N + P + S + + + + ++ S Sbjct: 219 YPASPWVDKSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVT 278 Query: 219 NLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 ++ ++ + ++ +V N +K +N + T+ Sbjct: 279 EMLETLSDDDFVNVAAFNSNAHDVSCFHHLVQANVR----NKKVLKEAVNNITAKGTTDY 334 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ +L N S K ++ TDG A+ N +N Sbjct: 335 KQGFKFAFDQLRNTNVSRA------NCNKIIMLFTDGGEDKATETFKLYN--------KN 380 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 +++++ +V + ++ ++ G ++ + + Sbjct: 381 KTVRVFTFSVGQHNYDKGPIQWMACENKGYYYEIPSIGAI 420 >gi|11414926|dbj|BAB18555.1| voltage dependent calcium channel alpha2b/delta subunit [Rana catesbeiana] Length = 1090 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/220 (10%), Positives = 66/220 (30%), Gaps = 20/220 (9%) Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 ++ K N + P + S + + + + ++ S Sbjct: 219 YPASPWVDKSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVT 278 Query: 219 NLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 ++ ++ + ++ +V N +K +N + T+ Sbjct: 279 EMLETLSDDDFVNVAAFNSNAHDVSCFHHLVQANVR----NKKVLKEAVNNITAKGTTDY 334 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ +L N S K ++ TDG A+ N +N Sbjct: 335 KQGFKFAFDQLRNTNVSRA------NCNKIIMLFTDGGEDKATETFKLYN--------KN 380 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 +++++ +V + ++ ++ G ++ + + Sbjct: 381 KTVRVFTFSVGQHNYDKGPIQWMACENKGYYYEIPSIGAI 420 >gi|309264112|ref|XP_003086226.1| PREDICTED: collagen alpha-3(VI) chain isoform 3 [Mus musculus] Length = 2656 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L + +V S+ + VR+ + Y+ + V S Sbjct: 429 GVRSGFPLLKDFVQRVVESL-----DVGPDRVRVALVQYSDRTRPEFYLNSHMDQQGVIS 483 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S + + +I +T + Sbjct: 484 AIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIA---EGVPQLLIVLTAEPSGDDVRG 540 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ REL Sbjct: 541 PSVV--------LKQGGAV--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQV 590 Query: 383 ITDKIQEQ 390 I++++ + Sbjct: 591 ISERVIQL 598 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 59/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1038 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1097 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N + R+ + IT G+ + Sbjct: 1098 KVVYKGGRHANTRVGIEH---LLRNHFVPEAGSRLDERVPQIAFVITGGK--------SV 1146 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1147 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1197 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 60/181 (33%), Gaps = 20/181 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + A I + + +V + VRIG + ++ + + + Sbjct: 833 SSDAVKPDGIAHIRDFVSRIVRRLN-----IGPSKVRIGVVQFSNDVFPEFYLKTHKSQS 887 Query: 261 EVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V + +L + NT A+ R L+ S + + + ++ G++ Sbjct: 888 SVLEAIRRLRFKGGSPLNTGRALEFVARNLF---VKSAGSRIEDGVPQHLVLFLGGKSQD 944 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 A + + ++G+ S+ + + L+ T+ F V + REL Sbjct: 945 DVARH--------AQVISSSGIV--SLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPN 994 Query: 379 S 379 Sbjct: 995 I 995 >gi|309264108|ref|XP_003086225.1| PREDICTED: collagen alpha-3(VI) chain isoform 2 [Mus musculus] Length = 3263 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L + +V S+ + VR+ + Y+ + V S Sbjct: 1036 GVRSGFPLLKDFVQRVVESL-----DVGPDRVRVALVQYSDRTRPEFYLNSHMDQQGVIS 1090 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S + + +I +T + Sbjct: 1091 AIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIA---EGVPQLLIVLTAEPSGDDVRG 1147 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ REL Sbjct: 1148 PSVV--------LKQGGAV--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQV 1197 Query: 383 ITDKIQEQ 390 I++++ + Sbjct: 1198 ISERVIQL 1205 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 59/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1645 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1704 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N + R+ + IT G+ + Sbjct: 1705 KVVYKGGRHANTRVGIEH---LLRNHFVPEAGSRLDERVPQIAFVITGGK--------SV 1753 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1754 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1804 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 34/299 (11%), Positives = 88/299 (29%), Gaps = 30/299 (10%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE--RSSENL 146 Q+ I A A+ + N+F G+ + L ++ + + Sbjct: 142 PQVIIVLTDGQSEDGFALPSAELKSADVNVFAVGVEGADERALGEVASEPLSMHVFNLEN 201 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 S+ ++ S + + + F + Sbjct: 202 VTSLHGLVGNLVSCIHSSVNPERAGDKESLKDITAQDSADIIFLID--------GSQNTG 253 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N DV+ + N++ + Q V++G + Y+ + + V + Sbjct: 254 NANFDVIRDFLVNVLERLSVGNQ-----QVQVGVVQYSEEPITMFSLNSYPSKAAVLDAV 308 Query: 267 NKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L+ N A+ + + + + + ++ I+ G +S ++ Sbjct: 309 KGLSLVGGESANIGQALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGPSSD--EIRD 363 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 ++ L+ ++S + A + L+ F V + R + ++I Sbjct: 364 SVVALKQAS--------VFSFGLGAQAASRAELQHIATDDSLVFTVPEFRSFGDLQEQI 414 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 60/181 (33%), Gaps = 20/181 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + A I + + +V + VRIG + ++ + + + Sbjct: 1440 SSDAVKPDGIAHIRDFVSRIVRRLN-----IGPSKVRIGVVQFSNDVFPEFYLKTHKSQS 1494 Query: 261 EVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V + +L + NT A+ R L+ S + + + ++ G++ Sbjct: 1495 SVLEAIRRLRFKGGSPLNTGRALEFVARNLF---VKSAGSRIEDGVPQHLVLFLGGKSQD 1551 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 A + + ++G+ S+ + + L+ T+ F V + REL Sbjct: 1552 DVARH--------AQVISSSGIV--SLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPN 1601 Query: 379 S 379 Sbjct: 1602 I 1602 >gi|148708138|gb|EDL40085.1| mCG12867, isoform CRA_d [Mus musculus] Length = 2281 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L + +V S+ + VR+ + Y+ + V S Sbjct: 1036 GVRSGFPLLKDFVQRVVESL-----DVGPDRVRVALVQYSDRTRPEFYLNSHMDQQGVIS 1090 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S + + +I +T + Sbjct: 1091 AIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIA---EGVPQLLIVLTAEPSGDDVRG 1147 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ REL Sbjct: 1148 PSVV--------LKQGGAV--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQV 1197 Query: 383 ITDKIQEQ 390 I++++ + Sbjct: 1198 ISERVIQL 1205 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 59/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1645 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1704 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N + R+ + IT G+ + Sbjct: 1705 KVVYKGGRHANTRVGIEH---LLRNHFVPEAGSRLDERVPQIAFVITGGK--------SV 1753 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1754 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1804 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 33/299 (11%), Positives = 87/299 (29%), Gaps = 30/299 (10%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE--RSSENL 146 Q+ I A A+ + N+F G+ + L ++ + + Sbjct: 142 PQVIIVLTDGQSEDGFALPSAELKSADVNVFAVGVEGADERALGEVASEPLSMHVFNLEN 201 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 S+ ++ S + + + F + Sbjct: 202 VTSLHGLVGNLVSCIHSSVNPERAGDKESLKDITAQDSADIIFLID--------GSQNTG 253 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N DV+ + N++ + V++G + Y+ + + V + Sbjct: 254 NANFDVIRDFLVNVLERLSVGN-----QQVQVGVVQYSEEPITMFSLNSYPSKAAVLDAV 308 Query: 267 NKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L+ N A+ + + + + + ++ I+ G +S ++ Sbjct: 309 KGLSLVGGESANIGQALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGPSSD--EIRD 363 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 ++ L+ ++S + A + L+ F V + R + ++I Sbjct: 364 SVVALKQAS--------VFSFGLGAQAASRAELQHIATDDSLVFTVPEFRSFGDLQEQI 414 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 60/181 (33%), Gaps = 20/181 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + A I + + +V + VRIG + ++ + + + Sbjct: 1440 SSDAVKPDGIAHIRDFVSRIVRRLN-----IGPSKVRIGVVQFSNDVFPEFYLKTHKSQS 1494 Query: 261 EVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V + +L + NT A+ R L+ S + + + ++ G++ Sbjct: 1495 SVLEAIRRLRFKGGSPLNTGRALEFVARNLF---VKSAGSRIEDGVPQHLVLFLGGKSQD 1551 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 A + + ++G+ S+ + + L+ T+ F V + REL Sbjct: 1552 DVARH--------AQVISSSGIV--SLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPN 1601 Query: 379 S 379 Sbjct: 1602 I 1602 >gi|148708139|gb|EDL40086.1| mCG12867, isoform CRA_e [Mus musculus] Length = 2555 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L + +V S+ + VR+ + Y+ + V S Sbjct: 1036 GVRSGFPLLKDFVQRVVESL-----DVGPDRVRVALVQYSDRTRPEFYLNSHMDQQGVIS 1090 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S + + +I +T + Sbjct: 1091 AIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIA---EGVPQLLIVLTAEPSGDDVRG 1147 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ REL Sbjct: 1148 PSVV--------LKQGGAV--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQV 1197 Query: 383 ITDKIQEQ 390 I++++ + Sbjct: 1198 ISERVIQL 1205 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 59/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1645 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1704 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N + R+ + IT G+ + Sbjct: 1705 KVVYKGGRHANTRVGIEH---LLRNHFVPEAGSRLDERVPQIAFVITGGK--------SV 1753 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1754 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1804 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 34/299 (11%), Positives = 88/299 (29%), Gaps = 30/299 (10%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE--RSSENL 146 Q+ I A A+ + N+F G+ + L ++ + + Sbjct: 142 PQVIIVLTDGQSEDGFALPSAELKSADVNVFAVGVEGADERALGEVASEPLSMHVFNLEN 201 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 S+ ++ S + + + F + Sbjct: 202 VTSLHGLVGNLVSCIHSSVNPERAGDKESLKDITAQDSADIIFLID--------GSQNTG 253 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N DV+ + N++ + Q V++G + Y+ + + V + Sbjct: 254 NANFDVIRDFLVNVLERLSVGNQ-----QVQVGVVQYSEEPITMFSLNSYPSKAAVLDAV 308 Query: 267 NKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L+ N A+ + + + + + ++ I+ G +S ++ Sbjct: 309 KGLSLVGGESANIGQALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGPSSD--EIRD 363 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 ++ L+ ++S + A + L+ F V + R + ++I Sbjct: 364 SVVALKQAS--------VFSFGLGAQAASRAELQHIATDDSLVFTVPEFRSFGDLQEQI 414 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 60/181 (33%), Gaps = 20/181 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + A I + + +V + VRIG + ++ + + + Sbjct: 1440 SSDAVKPDGIAHIRDFVSRIVRRLN-----IGPSKVRIGVVQFSNDVFPEFYLKTHKSQS 1494 Query: 261 EVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V + +L + NT A+ R L+ S + + + ++ G++ Sbjct: 1495 SVLEAIRRLRFKGGSPLNTGRALEFVARNLF---VKSAGSRIEDGVPQHLVLFLGGKSQD 1551 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 A + + ++G+ S+ + + L+ T+ F V + REL Sbjct: 1552 DVARH--------AQVISSSGIV--SLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPN 1601 Query: 379 S 379 Sbjct: 1602 I 1602 >gi|148708136|gb|EDL40083.1| mCG12867, isoform CRA_b [Mus musculus] Length = 3261 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L + +V S+ + VR+ + Y+ + V S Sbjct: 1036 GVRSGFPLLKDFVQRVVESL-----DVGPDRVRVALVQYSDRTRPEFYLNSHMDQQGVIS 1090 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S + + +I +T + Sbjct: 1091 AIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIA---EGVPQLLIVLTAEPSGDDVRG 1147 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ REL Sbjct: 1148 PSVV--------LKQGGAV--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQV 1197 Query: 383 ITDKIQEQ 390 I++++ + Sbjct: 1198 ISERVIQL 1205 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 59/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1645 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1704 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N + R+ + IT G+ + Sbjct: 1705 KVVYKGGRHANTRVGIEH---LLRNHFVPEAGSRLDERVPQIAFVITGGK--------SV 1753 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1754 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1804 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 34/299 (11%), Positives = 88/299 (29%), Gaps = 30/299 (10%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE--RSSENL 146 Q+ I A A+ + N+F G+ + L ++ + + Sbjct: 142 PQVIIVLTDGQSEDGFALPSAELKSADVNVFAVGVEGADERALGEVASEPLSMHVFNLEN 201 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 S+ ++ S + + + F + Sbjct: 202 VTSLHGLVGNLVSCIHSSVNPERAGDKESLKDITAQDSADIIFLID--------GSQNTG 253 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N DV+ + N++ + Q V++G + Y+ + + V + Sbjct: 254 NANFDVIRDFLVNVLERLSVGNQ-----QVQVGVVQYSEEPITMFSLNSYPSKAAVLDAV 308 Query: 267 NKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L+ N A+ + + + + + ++ I+ G +S ++ Sbjct: 309 KGLSLVGGESANIGQALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGPSSD--EIRD 363 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 ++ L+ ++S + A + L+ F V + R + ++I Sbjct: 364 SVVALKQAS--------VFSFGLGAQAASRAELQHIATDDSLVFTVPEFRSFGDLQEQI 414 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 60/181 (33%), Gaps = 20/181 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + A I + + +V + VRIG + ++ + + + Sbjct: 1440 SSDAVKPDGIAHIRDFVSRIVRRLN-----IGPSKVRIGVVQFSNDVFPEFYLKTHKSQS 1494 Query: 261 EVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V + +L + NT A+ R L+ S + + + ++ G++ Sbjct: 1495 SVLEAIRRLRFKGGSPLNTGRALEFVARNLF---VKSAGSRIEDGVPQHLVLFLGGKSQD 1551 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 A + + ++G+ S+ + + L+ T+ F V + REL Sbjct: 1552 DVARH--------AQVISSSGIV--SLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPN 1601 Query: 379 S 379 Sbjct: 1602 I 1602 >gi|148708137|gb|EDL40084.1| mCG12867, isoform CRA_c [Mus musculus] Length = 2207 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L + +V S+ + VR+ + Y+ + V S Sbjct: 1036 GVRSGFPLLKDFVQRVVESL-----DVGPDRVRVALVQYSDRTRPEFYLNSHMDQQGVIS 1090 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S + + +I +T + Sbjct: 1091 AIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIA---EGVPQLLIVLTAEPSGDDVRG 1147 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ REL Sbjct: 1148 PSVV--------LKQGGAV--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQV 1197 Query: 383 ITDKIQEQ 390 I++++ + Sbjct: 1198 ISERVIQL 1205 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 59/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1645 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1704 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N + R+ + IT G+ + Sbjct: 1705 KVVYKGGRHANTRVGIEH---LLRNHFVPEAGSRLDERVPQIAFVITGGK--------SV 1753 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1754 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1804 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 33/299 (11%), Positives = 87/299 (29%), Gaps = 30/299 (10%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE--RSSENL 146 Q+ I A A+ + N+F G+ + L ++ + + Sbjct: 142 PQVIIVLTDGQSEDGFALPSAELKSADVNVFAVGVEGADERALGEVASEPLSMHVFNLEN 201 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 S+ ++ S + + + F + Sbjct: 202 VTSLHGLVGNLVSCIHSSVNPERAGDKESLKDITAQDSADIIFLID--------GSQNTG 253 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N DV+ + N++ + V++G + Y+ + + V + Sbjct: 254 NANFDVIRDFLVNVLERLSVGN-----QQVQVGVVQYSEEPITMFSLNSYPSKAAVLDAV 308 Query: 267 NKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L+ N A+ + + + + + ++ I+ G +S ++ Sbjct: 309 KGLSLVGGESANIGQALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGPSSD--EIRD 363 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 ++ L+ ++S + A + L+ F V + R + ++I Sbjct: 364 SVVALKQAS--------VFSFGLGAQAASRAELQHIATDDSLVFTVPEFRSFGDLQEQI 414 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 60/181 (33%), Gaps = 20/181 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + A I + + +V + VRIG + ++ + + + Sbjct: 1440 SSDAVKPDGIAHIRDFVSRIVRRLN-----IGPSKVRIGVVQFSNDVFPEFYLKTHKSQS 1494 Query: 261 EVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V + +L + NT A+ R L+ S + + + ++ G++ Sbjct: 1495 SVLEAIRRLRFKGGSPLNTGRALEFVARNLF---VKSAGSRIEDGVPQHLVLFLGGKSQD 1551 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 A + + ++G+ S+ + + L+ T+ F V + REL Sbjct: 1552 DVARH--------AQVISSSGIV--SLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPN 1601 Query: 379 S 379 Sbjct: 1602 I 1602 >gi|148708135|gb|EDL40082.1| mCG12867, isoform CRA_a [Mus musculus] Length = 2656 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L + +V S+ + VR+ + Y+ + V S Sbjct: 429 GVRSGFPLLKDFVQRVVESL-----DVGPDRVRVALVQYSDRTRPEFYLNSHMDQQGVIS 483 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S + + +I +T + Sbjct: 484 AIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIA---EGVPQLLIVLTAEPSGDDVRG 540 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ REL Sbjct: 541 PSVV--------LKQGGAV--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQV 590 Query: 383 ITDKIQEQ 390 I++++ + Sbjct: 591 ISERVIQL 598 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 59/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1038 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1097 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N + R+ + IT G+ + Sbjct: 1098 KVVYKGGRHANTRVGIEH---LLRNHFVPEAGSRLDERVPQIAFVITGGK--------SV 1146 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1147 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1197 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 60/181 (33%), Gaps = 20/181 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + A I + + +V + VRIG + ++ + + + Sbjct: 833 SSDAVKPDGIAHIRDFVSRIVRRLN-----IGPSKVRIGVVQFSNDVFPEFYLKTHKSQS 887 Query: 261 EVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V + +L + NT A+ R L+ S + + + ++ G++ Sbjct: 888 SVLEAIRRLRFKGGSPLNTGRALEFVARNLF---VKSAGSRIEDGVPQHLVLFLGGKSQD 944 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 A + + ++G+ S+ + + L+ T+ F V + REL Sbjct: 945 DVARH--------AQVISSSGIV--SLGIGDRNIDRTDLQTITNDPRLVFTVREFRELPN 994 Query: 379 S 379 Sbjct: 995 I 995 >gi|153815168|ref|ZP_01967836.1| hypothetical protein RUMTOR_01400 [Ruminococcus torques ATCC 27756] gi|145847427|gb|EDK24345.1| hypothetical protein RUMTOR_01400 [Ruminococcus torques ATCC 27756] Length = 4109 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 66/209 (31%), Gaps = 50/209 (23%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + + N + IV + + N++++ S +N L T Sbjct: 263 VKFASDESDNIGNDFINNNY------NRSQIVTELKSYTTKNISDLTSTVNSLIAAGATR 316 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI--CEY 334 ++ A R +K V+F TDG+ + S + N++ I + Sbjct: 317 ADFGLNQAQRAFQLGGTREGA-------QKVVVFFTDGQPTSNSDWSNSVAAAAITNAKE 369 Query: 335 MRNAGMKIYSVAVSAPPEGQD----------LLRKCTD---------------------- 362 +++A IYS+ V D + + Sbjct: 370 LKDANALIYSIGVFRDANPNDTNTSAGNFNGYMHAVSSNYPDATATSSTRPNRYSCTLGK 429 Query: 363 ---SSGQFFAVNDSRELLESFDKITDKIQ 388 +S + A D+ EL F++I+ ++ Sbjct: 430 RTDNSDYYKAATDADELNNIFNEISSDLE 458 >gi|229125374|ref|ZP_04254461.1| hypothetical protein bcere0016_56090 [Bacillus cereus 95/8201] gi|228658081|gb|EEL13834.1| hypothetical protein bcere0016_56090 [Bacillus cereus 95/8201] Length = 452 Score = 56.1 bits (133), Expect = 1e-05, Method: Composition-based stats. Identities = 26/216 (12%), Positives = 63/216 (29%), Gaps = 22/216 (10%) Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 P K S K++ ++ N ++ I + G+ Sbjct: 147 PKEKSLNVEILLDASGSMAGKVNGQVKMEAAKKAIYNYLDKIPDNANVMLRVYGHKGSNN 206 Query: 243 YNI----GIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 N PL + + L+ P T A+ + Sbjct: 207 ENDKSLSCGSSEVMYPLQPYKKEQFNAALSNFGPKGWTPLASAIESVNDDFKE------- 259 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQD 355 V ++DGE + + + + + + + + Q Sbjct: 260 -YTGEENLNVVYIVSDGEETCGG------DPVNAAKNLNQSSTHAVVNIIGFDVKNSEQQ 312 Query: 356 LLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 L+ ++ G + V+++ EL ++ +K +K+ ++ Sbjct: 313 QLKNTAEAGKGNYATVSNADELYQTLNKEYEKLYKE 348 >gi|311696337|gb|ADP99210.1| protein containing a von Willebrand factor type A (vWA) domain [marine bacterium HP15] Length = 704 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 59/183 (32%), Gaps = 24/183 (13%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 ++ + ++ ++ I + + NNL + +++L Sbjct: 363 ARDALQAGLGTLT-----PRDRFNVIQFNSQTHSLFMQPEVATGNNLARARQYVDRLRAD 417 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T PA+ A ++ + + VIFITDG N + Sbjct: 418 GGTEMAPALSRALEGGGETEDGARV--------RQVIFITDGAVG---------NEAALF 460 Query: 333 EYMRNA--GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +R ++++VA+ + P + G + A++ ++ + ++ Sbjct: 461 RQIRQQLGNQRLFTVAIGSAPNRHFMREAARWGRGTYTAIHSPSDVDGPLQALFSAMESP 520 Query: 391 SVR 393 + Sbjct: 521 VLT 523 >gi|310694574|gb|ADP05359.1| collagen type VI alpha 1 protein [Bubalus bubalis] Length = 1027 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 73/229 (31%), Gaps = 20/229 (8%) Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNT--TKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 + + T A P +D + +++++ Sbjct: 16 WASAQDDTVASRAIAFQDCPVDLFFVLDTSESVALRLKPYGALVDKVKSFTKRFIDNLKD 75 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNEVKSRLNKLNPYE-NTNTYPAMH 282 + + ++ A + L+ + +E+KS ++ + + T T A+ Sbjct: 76 -RYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKSSVDAVKYFGKGTYTDCAIK 134 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 EL SH K+++ +TDG ++ G+K+ Sbjct: 135 KGLEELLVG--GSHLKEN-----KYLVVVTDGHPLEGYKEPCG-GLEDAVNEAKHLGIKV 186 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQF---FAVNDSRELLESFDKITDKIQ 388 +SVA+ P + L + + F D + ++ + I+ I+ Sbjct: 187 FSVAI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGQSRDAEEVISQTIE 233 >gi|297714304|ref|XP_002833596.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H2-like, partial [Pongo abelii] Length = 384 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 28/222 (12%), Positives = 77/222 (34%), Gaps = 23/222 (10%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 L P PK + + + S + + ++ + +L ++ + N +VR Sbjct: 15 LDPIPKNILFVIDVSGSMWG---VKMKQTVEAMKTILDDLRAEDHFSVIDF-NQNVRTWR 70 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + + K + K+ P TN A+ A L Sbjct: 71 NDLISATKTQ--------VADAKRYIEKIQPSGGTNINEALLRAIFILNEASNLGLLDPN 122 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 S +I ++DG+ + + + + E +++ + ++S+ + D L++ Sbjct: 123 S---VSLIILVSDGDPTVGELKLSKIQ-KNVKENIQD-NISLFSLGMGFDV-DYDFLKRL 176 Query: 361 TDSSGQ-----FFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + + + S +L + +++++ + P+ Sbjct: 177 SSENRGIAQRIYGNQDTSSQLKKFYNQVSTPLLRNVQFNYPH 218 >gi|284029817|ref|YP_003379748.1| von Willebrand factor type A [Kribbella flavida DSM 17836] gi|283809110|gb|ADB30949.1| von Willebrand factor type A [Kribbella flavida DSM 17836] Length = 550 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 44/371 (11%), Positives = 94/371 (25%), Gaps = 28/371 (7%) Query: 21 HIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRE 80 + + A VL + K T S + Sbjct: 190 QAERAKTSASNTQVAQVL-VPLAQRLGSMAKQYTEVNGPLSKASADGSNVVVPVSEQAFV 248 Query: 81 NAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE 140 D +AQ+ L E + G + S Sbjct: 249 KHQDEHPEAQLKAVVPGTGTLVLDYPVVVTARNNAEAVTAAGKALAEEMLSDASSQARD- 307 Query: 141 RSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP----PKKSFWSKNTTK 196 +V + + + + K L S + + Sbjct: 308 ----EGGFRDHLVSPLGEGRGEGEVTQLTKPTVAAVEKILQSWTTLSLSAHSLAVIDVSG 363 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 S A ++ + IE+A N + + + + L+ Sbjct: 364 SMAEKVAGSAKTRMQLTIEAADNGLKMFPDSAELGLWVFSTKIGPDSADFRQLVPIGKLT 423 Query: 257 NNLNEVKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + + +L T Y A R + + ++ V+ Sbjct: 424 PGHR--QRMIGQLKAQSARVGGGTGLYDTAIAAVRAVRSSY--------NSGAVNTVLLF 473 Query: 312 TDGENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 TDG+ + + TLQI +++ ++I ++ + + +L R + GQ + Sbjct: 474 TDGK-NDDPGSLSLDRTLQILNGLKDPARPVRIIALGIGPDADADELKRLAQATGGQAYV 532 Query: 370 VNDSRELLESF 380 + +L + F Sbjct: 533 ARNPTDLKDVF 543 >gi|4104232|gb|AAD01978.1| collagen alpha3(VI) [Mus musculus] Length = 1703 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 56/188 (29%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L + +V S+ + VR+ + Y+ + V S Sbjct: 1036 GVRSGFPLLKDFVQRVVESL-----DVGPDRVRVALVQYSDRTRPEFYLNSHMDQQGVIS 1090 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S + + +I +T + Sbjct: 1091 AIRRLTLLGGPTPNTGAALEFVLRNILTSSTGSRIA---EGVPQLLIVLTAEPSGDDVRG 1147 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ REL Sbjct: 1148 PSVV--------LKQGGAV--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQV 1197 Query: 383 ITDKIQEQ 390 I++++ + Sbjct: 1198 ISERVIQL 1205 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 32/299 (10%), Positives = 87/299 (29%), Gaps = 30/299 (10%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE--RSSENL 146 Q+ + A A+ + N+F G+ + L ++ + + Sbjct: 142 PQVIVVLTDGQSEDGFALPSAELKSADVNVFAVGVEGADERALGEVASEPLSMHVFNLEN 201 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 S+ ++ S + + + F + Sbjct: 202 VTSLHGLVGNLVSCIHSSVNPERAGDKESLKDITAQDSADIIFLID--------GSQNTG 253 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 N DV+ + N++ + V++G + Y+ + + V + Sbjct: 254 NANFDVIRDFLVNVLERLSVGN-----QQVQVGVVQYSEEPITMFSLNSYPSKAAVLDAV 308 Query: 267 NKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L+ N A+ + + + + + ++ I+ G +S ++ Sbjct: 309 KGLSLVGGESANIGQALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGPSSD--EIRD 363 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 ++ L+ ++S + A + L+ F V + R + ++I Sbjct: 364 SVVALKQAS--------VFSFGLGAQAASRAELQHIATDDSLVFTVPEFRSFGDLQEQI 414 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 23/146 (15%), Positives = 52/146 (35%), Gaps = 15/146 (10%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VRIG + ++ + + + V + +L + NT A+ R L+ Sbjct: 1470 VRIGVVQFSNDVFPEFYLKTHKSQSSVLEAIRRLRFKGGSPLNTGRALEFVARNLF---V 1526 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ G++ A + + ++G+ S+ + Sbjct: 1527 KSAGSRIEDGVPQHLVLFLGGKSQDDVARH--------AQVISSSGIV--SLGIGDRNID 1576 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLES 379 + L+ T+ F V + REL Sbjct: 1577 RTDLQTITNDPRLVFTVREFRELPNI 1602 >gi|323650214|gb|ADX97193.1| inter-alpha-trypsin inhibitor heavy chain h2 [Perca flavescens] Length = 550 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 19/134 (14%), Positives = 39/134 (29%), Gaps = 6/134 (4%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + I S + + K + + P TN A+ A + Sbjct: 292 DLTIDDHFSIIDFNHNVRCWSEELVAGSSIQIADAKKYIQNIKPNGGTNINEALMRAVQM 351 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L R +I ++DG+ + +T+ + MR ++S+ + Sbjct: 352 LLKASNQGLI---DPRSVSMIILVSDGDPTVGEIKLSTIQ-KNVKRVMREE-FSLFSLGI 406 Query: 348 SAPPEGQDLLRKCT 361 D L + Sbjct: 407 GFDV-DYDFLERIA 419 >gi|327270780|ref|XP_003220166.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 906 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 44/352 (12%), Positives = 96/352 (27%), Gaps = 47/352 (13%) Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIP 125 + S + A + I N I + IP Sbjct: 166 DEYNNDAPFYSVGKNEAEATRCSSDITGKYIFRNDKGQIRTCNTERRSQL-FEAGCQFIP 224 Query: 126 SALTNLSLRSTGIIERSSENLAISIC----MVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + + SS ++ M D ++ Sbjct: 225 DKTQTTPVSIMYMQNLSSVTQFCDESSHNIKAPNMQNKMCDYRSTWEVIMDSADFVNSSP 284 Query: 182 ----PPPPKKS---FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 PP P S + +I L ++A + I + Sbjct: 285 RSAPPPHPTLSLLQTRDRVVCLVLDVSGSMRDFNRIHRLKQAAELFLLQIIET------G 338 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEK 292 S + + ++++ ++ + L T+ + ++ Sbjct: 339 SWAGIVVFNSKASTKAFLQQITSDSVR-QTLSDHLPTVADGGTSICSGIREGFQVFL--- 394 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 S+ ++ +TDGE+S S+ + +G I+++A+ P Sbjct: 395 -----QKYSSTEGCEIVLLTDGEDSSVSSCFAEVE---------RSGSTIHTIALG--PS 438 Query: 353 GQDLLRKCTD-SSGQFFAVN---DSRELLESFDKITD---KIQEQSVRIAPN 397 L + + G F+ DS L+++F I+ I +QS+++ Sbjct: 439 AAKELEMLANMTGGLTFSATDSLDSNGLIDAFSGISSGSGDISQQSIQLESK 490 >gi|295087036|emb|CBK68559.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bacteroides xylanisolvens XB1A] Length = 614 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 32/306 (10%), Positives = 86/306 (28%), Gaps = 26/306 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + E G+ + +++ + ++ A Y + L P Sbjct: 139 DAVNAEEYGEFQENGFKSVSDTPLSTFSIDVDA-ASYSNMRRFINKGELPPVDAIRTEEL 197 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + S + P + + N Sbjct: 198 VNYFSYDYPKPTGSDPVKITMESGVCPWNTNHRLVRIGLKAKEI------PTDNLPASNL 251 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 ++D++ S LVN+++ ++ + Y+ Sbjct: 252 VFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKD--------KVAIVTYSGSAGVKLEAT 303 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 ++ +++ +++L +T + AYR S+ N +I +DG Sbjct: 304 PGSDKQKIREAIDELTAGGSTAGGAGILLAYRIAKKNLISNGNNR--------IILCSDG 355 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + + L Q+ E R +G+ + + ++ + G +++ Sbjct: 356 DFNVGVSSAEGL--EQLIEKERKSGVFLTVLGYGMGNYKDKKIQVLAEKGNGNHAYIDNL 413 Query: 374 RELLES 379 +E Sbjct: 414 QEANRV 419 >gi|11414930|dbj|BAB18557.1| voltage dependent calcium channel alpha2d/delta subunit [Rana catesbeiana] Length = 1078 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 23/220 (10%), Positives = 66/220 (30%), Gaps = 20/220 (9%) Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 ++ K N + P + S + + + + ++ S Sbjct: 219 YPASPWVDKSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVT 278 Query: 219 NLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 ++ ++ + ++ +V N +K +N + T+ Sbjct: 279 EMLETLSDDDFVNVAAFNSNAHDVSCFHHLVQANVR----NKKVLKEAVNNITAKGTTDY 334 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ +L N S K ++ TDG A+ N +N Sbjct: 335 KQGFKFAFDQLRNTNVSRA------NCNKIIMLFTDGGEDKATETFKLYN--------KN 380 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 +++++ +V + ++ ++ G ++ + + Sbjct: 381 KTVRVFTFSVGQHNYDKGPIQWMACENKGYYYEIPSIGAI 420 >gi|11414932|dbj|BAB18558.1| voltage dependent calcium channel alpha2e/delta subunit [Rana catesbeiana] Length = 1083 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 23/220 (10%), Positives = 66/220 (30%), Gaps = 20/220 (9%) Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 ++ K N + P + S + + + + ++ S Sbjct: 219 YPASPWVDKSRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLKLIRTSVT 278 Query: 219 NLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 ++ ++ + ++ +V N +K +N + T+ Sbjct: 279 EMLETLSDDDFVNVAAFNSNAHDVSCFHHLVQANVR----NKKVLKEAVNNITAKGTTDY 334 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ +L N S K ++ TDG A+ N +N Sbjct: 335 KQGFKFAFDQLRNTNVSRA------NCNKIIMLFTDGGEDKATETFKLYN--------KN 380 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 +++++ +V + ++ ++ G ++ + + Sbjct: 381 KTVRVFTFSVGQHNYDKGPIQWMACENKGYYYEIPSIGAI 420 >gi|198426777|ref|XP_002120162.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 367 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 54/174 (31%), Gaps = 22/174 (12%) Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 L S + N + + I S + ++ KL T T Sbjct: 173 LDGSTSINEADPNNFNTVKNWVK---NITKRFDITSSGSAAVAMDQI-KLRL-GATFTAA 227 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A+ A N KK ++ +TDG+++ + +RN Sbjct: 228 ALSKATTVFKNSSR-----FNDPLTKKVLVLLTDGQSN------DREGLNASATQVRNLN 276 Query: 340 MKIYSVAVSAPPEGQDLLRKCT----DSSGQFFAVNDSRELLESFDKITDKIQE 389 + ++AV + L+ ++ + + + D L I +I++ Sbjct: 277 IT--TIAVGVKADVLQELQIIANGVIGNNDRVYQLRDFSNLDSIVQSIFQEIEK 328 >gi|332254494|ref|XP_003276364.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha-trypsin inhibitor heavy chain H5-like protein-like [Nomascus leucogenys] Length = 1313 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 31/214 (14%), Positives = 77/214 (35%), Gaps = 20/214 (9%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 LPP K + + + S + KI+ ++ +++ ++ Sbjct: 272 FAPRGLPPMEKNVVFVIDVSGSMFG-------TKIEQTKKAMNVILSDLKANDYFNIISF 324 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 + G + N++ K L+ + T+ A+ A L + + Sbjct: 325 SDTVNVWKAGGSIQATI----QNVHSAKDYLHCMEADGWTDINSALLAAASVLNHSNQ-E 379 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 S +IF+ DGE + ++ +C+ + + ++S+A Sbjct: 380 PGRGPSVGRIPLIIFLMDGEPTAGVT-TPSVILSNVCQAL-GHRVSLFSLAFG-DDADFT 436 Query: 356 LLRKCT-DSSG---QFFAVNDSR-ELLESFDKIT 384 LLR+ + ++ G + + D+ +L +++I+ Sbjct: 437 LLRRLSLENRGIAQRIYEDTDAALQLEGLYEEIS 470 >gi|291395815|ref|XP_002714336.1| PREDICTED: complement component 2 isoform 2 [Oryctolagus cuniculus] Length = 613 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 61/156 (39%), Gaps = 16/156 (10%) Query: 255 LSNNLNEVKSRLNKLNP--------YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 L +N ++ +N L TN Y A++ + N+ + + + Sbjct: 177 LHDNSRDMTEVINSLENAKYTDHENGTGTNIYKALNAVNIMMNNQMQRLGMETTGWQEIR 236 Query: 307 F-VIFITDGENSGASAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTD 362 +I +TDG+++ + + +++++ ++ + IY++ V L + Sbjct: 237 HAIILLTDGKSNMGGSPKPAVDSIKDVLNIKEKRNDYLDIYAIGVGKLDVDWRELNELAS 296 Query: 363 SSG---QFFAVNDSRELLESFDKITDKIQEQSVRIA 395 F + D+R L + F+ + D + + + I Sbjct: 297 KKDGERHAFILQDARALQQVFEHMLD-VSKLTDTIC 331 >gi|291395813|ref|XP_002714335.1| PREDICTED: complement component 2 isoform 1 [Oryctolagus cuniculus] Length = 744 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 61/156 (39%), Gaps = 16/156 (10%) Query: 255 LSNNLNEVKSRLNKLNP--------YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 L +N ++ +N L TN Y A++ + N+ + + + Sbjct: 308 LHDNSRDMTEVINSLENAKYTDHENGTGTNIYKALNAVNIMMNNQMQRLGMETTGWQEIR 367 Query: 307 F-VIFITDGENSGASAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTD 362 +I +TDG+++ + + +++++ ++ + IY++ V L + Sbjct: 368 HAIILLTDGKSNMGGSPKPAVDSIKDVLNIKEKRNDYLDIYAIGVGKLDVDWRELNELAS 427 Query: 363 SSG---QFFAVNDSRELLESFDKITDKIQEQSVRIA 395 F + D+R L + F+ + D + + + I Sbjct: 428 KKDGERHAFILQDARALQQVFEHMLD-VSKLTDTIC 462 >gi|116876155|gb|ABK30937.1| complement component 2/factor B variant 2 [Carcinoscorpius rotundicauda] Length = 889 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 52/154 (33%), Gaps = 13/154 (8%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAY 285 K R G ++++ + + EV + L+K + T A+ Sbjct: 454 TRMGVKEFGTRFGAVSFSSTVSASFLPQDYTTEEEVLNALDKFDFTEGGTAISSALDFVK 513 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA-GMKIYS 344 ++ + + + +TDG+ + + Q+ + ++ +IYS Sbjct: 514 TQMIPLSKHTFADRAMKT---IIFLLTDGKANMRG------DPKQVAKELKADVKAEIYS 564 Query: 345 VAVSAPPEGQDLLRKCTDS-SGQFFAVNDSRELL 377 +A+ + LR+ S + + D L Sbjct: 565 IAL-TGDYDINKLREVASSKKDHVYILKDYETLD 597 >gi|115315541|gb|AAV65032.2| complement component 2/factor B variant 1 [Carcinoscorpius rotundicauda] Length = 889 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 52/154 (33%), Gaps = 13/154 (8%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAY 285 K R G ++++ + + EV + L+K + T A+ Sbjct: 454 TRMGVKEFGTRFGAVSFSSTVSASFLPQDYTTEEEVLNALDKFDFTEGGTAISSALDFVK 513 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA-GMKIYS 344 ++ + + + +TDG+ + + Q+ + ++ +IYS Sbjct: 514 TQMIPLSKHTFADRAMKT---IIFLLTDGKANMRG------DPKQVAKELKADVKAEIYS 564 Query: 345 VAVSAPPEGQDLLRKCTDS-SGQFFAVNDSRELL 377 +A+ + LR+ S + + D L Sbjct: 565 IAL-TGDYDINKLREVASSKKDHVYILKDYETLD 597 >gi|73971950|ref|XP_532030.2| PREDICTED: similar to sushi, von Willebrand factor type A, EGF and pentraxin domain containing 1 [Canis familiaris] Length = 3569 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 46/132 (34%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++ + + T T A A + L + +E+S K + ITDG ++G Sbjct: 153 LRREIPAIAYRGGGTYTKGAFQQAAQILRHSRENST---------KVIFLITDGYSNGG- 202 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + + +R+ G++I++ L + + ++ E Sbjct: 203 ------DPRPVAASLRDFGVEIFT--FGIWQGNIRELNDMASTPKEEHCYLLHSFEE--- 251 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 252 -FEALARRALHE 262 >gi|268572467|ref|XP_002648969.1| Hypothetical protein CBG21291 [Caenorhabditis briggsae] Length = 427 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 56/166 (33%), Gaps = 26/166 (15%) Query: 235 SVRIGTIAYNI--GIVGNQCTPLSNNLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNE 291 ++ + + + EV ++KL T + A ++ Sbjct: 276 YTQVAAVTFATVGRTRVRFNLKKYSTQEEVLRGIDKLQSKGGTTAIGAGIEKALTQIDES 335 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV---- 347 + + K +I TDG ++ + + NAG ++Y+VA Sbjct: 336 EGARPGIAT-----KVMIVFTDGWSNKG------PDPEKRARDAVNAGFEMYTVAYTARA 384 Query: 348 -SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + + L + SSG F D F + DKI+++++ Sbjct: 385 PGSVTLNNETLSAISGSSGHAF--TDVT-----FQTLVDKIKQRNL 423 >gi|117921993|ref|YP_871185.1| von Willebrand factor, type A [Shewanella sp. ANA-3] gi|117614325|gb|ABK49779.1| von Willebrand factor, type A [Shewanella sp. ANA-3] Length = 335 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 54/150 (36%), Gaps = 23/150 (15%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + + ++T+ A+ + + S Sbjct: 149 GDAAFIQTPFTADQQVWLSLLEEAQTGMAGQSTHLGDAIGLGIKVFEQNPQPSE------ 202 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-PEGQDLL---- 357 ++ +I +TDG ++G + + + + G+KIY++A+ P G+ + Sbjct: 203 --QQVMIVLTDGNDTG-----SFVEPVDAAKIAAARGIKIYTIAMGDPTHVGEQPMDMEV 255 Query: 358 --RKCTDSSGQFFAVNDSRELLESFDKITD 385 R + + F D EL +++ I Sbjct: 256 VQRVSQLTQARAFIAIDQAELDKAYQLIDK 285 >gi|327277464|ref|XP_003223484.1| PREDICTED: von Willebrand factor A domain-containing protein 2-like, partial [Anolis carolinensis] Length = 750 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/154 (16%), Positives = 57/154 (37%), Gaps = 20/154 (12%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + V++G I ++ +++K +L ++ T T A+ + R+ Sbjct: 82 DISPEKVKVGAIEFSNTAYLEFSLDAYFTKHQIKDKLKRIVFKGGRTETGLALKYILRKG 141 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ +S + K +I +TDG++ G N + ++ G+ ++ V VS Sbjct: 142 FHGSRNST-------VPKILIILTDGKSQG--------NIAAPAKQLKEMGITVFVVGVS 186 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESF 380 P + L D+ + + F Sbjct: 187 FPRWEE--LHILASDPTEWHLLFAEDTDDAVNGF 218 >gi|297270401|ref|XP_002800091.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Macaca mulatta] Length = 3386 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 47/132 (35%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + ++ T T A A + L + +E+S K V ITDG ++G Sbjct: 153 LLQEIPAISYRGGGTYTKGAFQQAAQILLHARENST---------KVVFLITDGYSNGG- 202 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + I +R++G++I++ L + + ++ E Sbjct: 203 ------DPRPIAASLRDSGVEIFT--FGIWQGNIRELNDMASAPKEEHCYLLHSFEE--- 251 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 252 -FEALARRALHE 262 >gi|145491133|ref|XP_001431566.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398671|emb|CAK64168.1| unnamed protein product [Paramecium tetraurelia] Length = 636 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 21/151 (13%), Positives = 46/151 (30%), Gaps = 17/151 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 + R+ I ++ + N + + ++ A A+ +L + Sbjct: 193 DNDRLQLITFDNDAHRLTPLKTVTNQNKSYFTQIIKQIKANGGNRISEATKMAFYQLKS- 251 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 V ++DG + +N + T+ +++ Sbjct: 252 -------RKYINNVTSVFLLSDGVDYTYPEVKNQIQTVNEV-------FTLHTFGFGEDH 297 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + Q + + C SG F+ V D L E F Sbjct: 298 DAQMMTQLCNLKSGSFYFVQDVTLLDEFFAD 328 >gi|297618081|ref|YP_003703240.1| hypothetical protein Slip_1922 [Syntrophothermus lipocalidus DSM 12680] gi|297145918|gb|ADI02675.1| protein of unknown function DUF2134, membrane [Syntrophothermus lipocalidus DSM 12680] Length = 310 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 16/154 (10%), Positives = 45/154 (29%), Gaps = 28/154 (18%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ ++ F D + ++++ +A+D+AVL+G + + Sbjct: 18 LFALSLTALLGFTALVTDSGLLFLNQSRVANAVDSAVLAGAQELPGNPD-----AALAVA 72 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 T + + + Q + E+ + L+ Sbjct: 73 RTYAEAN------------------GVTDAEAVFSVSEDGRQIVGEATRRVG-----LYF 109 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 ++ + R+ + + + VL Sbjct: 110 ARVLGFTQGEVKARAAARVGALTAATGCAPLGVL 143 >gi|90407194|ref|ZP_01215382.1| type IV pilin biogenesis protein, putative [Psychromonas sp. CNPT3] gi|90311770|gb|EAS39867.1| type IV pilin biogenesis protein, putative [Psychromonas sp. CNPT3] Length = 1116 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/274 (10%), Positives = 73/274 (26%), Gaps = 60/274 (21%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 P + + A ++D+ E L++S + + Sbjct: 175 IFTSPDAVTLYTANYYYWYMNKRASNNTRSRVDIAKEVITTLISSTPSVDFGLSIFNSKG 234 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT--------------------- 277 + ++ + +N L +TNT Sbjct: 235 SGGRIIHRVKERDKPE----KEQLVASINNLKIESSTNTPLLETVFESYLYYAGLNVKSG 290 Query: 278 --YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA-----------YQN 324 + + + + K +++ +TDG+ S A Sbjct: 291 KTAKGSEPIRDMKAEDAGVYTSPFKTCQEKSYLVLMTDGDPSSPYALQKEIGSLIKRDNP 350 Query: 325 TLNTLQICEY------------------MRNAGMKIYSVAVS---APPEGQDLLRKCTD- 362 N I + Y++ + + DLL+K + Sbjct: 351 NSNLKGIANSYMPVLTQWMFNNDINPKLAEEQNITTYTIGFGDITSDTDATDLLKKTAEL 410 Query: 363 SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 G++F +++ L ++F K+ I ++ ++ Sbjct: 411 GGGKYFPASNASALQKAFKKMIISILNETGSLSS 444 >gi|26352386|dbj|BAC39823.1| unnamed protein product [Mus musculus] Length = 902 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 22/150 (14%), Positives = 61/150 (40%), Gaps = 25/150 (16%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S++ ++ + L + T+ + ++ + + + ++ +T Sbjct: 364 ITSSSDYQKITANLPQ-QASGGTSICHGLQAGFQAITSS--------DQSTSGSEIVLLT 414 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 DGE++G + E + +G I+++A+ + L +D + G F N Sbjct: 415 DGEDNGIRSCF---------EAVSRSGAIIHTIALGPSAARE--LETLSDMTGGLRFYAN 463 Query: 372 -DSRELLESFDKITD---KIQEQSVRIAPN 397 D L+++F +I+ + +Q++++ Sbjct: 464 KDLNSLIDAFSRISSTSGSVSQQALQLESK 493 >gi|71280467|ref|YP_270055.1| von Willebrand factor type A domain-containing protein [Colwellia psychrerythraea 34H] gi|71146207|gb|AAZ26680.1| von Willebrand factor type A domain protein [Colwellia psychrerythraea 34H] Length = 786 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 53/136 (38%), Gaps = 14/136 (10%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N+++ + ++ L+ T YR L N + S++ + ++FITDG Sbjct: 454 NISKAQQFIDGLSANGGT-------EMYRPLSNALMMKKDKTQSSKAIRQIVFITDGAV- 505 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 N +Q+ + ++Y+V + A P G + + G + + + E+ Sbjct: 506 -----ANEFELMQLLNTAQGD-FRLYTVGIGAAPNGYFMKKAAQFGRGSYVFIQNKSEVQ 559 Query: 378 ESFDKITDKIQEQSVR 393 KI + ++ Sbjct: 560 RKMSHFMTKISQPALT 575 >gi|313242832|emb|CBY39591.1| unnamed protein product [Oikopleura dioica] Length = 674 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 28/221 (12%), Positives = 59/221 (26%), Gaps = 15/221 (6%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 Y + I + N I E + S+R Sbjct: 382 YEPCNQDACDAGCSGPRDVLFVAHYTTYMGSTFADISA--FYENIISTINVEPSDSSIRF 439 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 +N + N+++E K + P Y A + + + Sbjct: 440 AFSFFNHAYIEFFAFDWLNSIDEYKWAFSSFPPASGNANYIG--RALKGAADTMTPAFGK 497 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 V+ +T+ +T ++ + ++ ++ V + GQD L Sbjct: 498 GRRIDTVGTVVLLTN--------AASTDEVNEMADQLKEKVDRVIVVGLGY-AFGQDELA 548 Query: 359 KCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 S + +S +L I D+I + + Sbjct: 549 GIASSPTKENLYTAEESSDLAGLVRTIADEICATELSNPSD 589 >gi|227833260|ref|YP_002834967.1| hypothetical protein cauri_1436 [Corynebacterium aurimucosum ATCC 700975] gi|262184244|ref|ZP_06043665.1| hypothetical protein CaurA7_09649 [Corynebacterium aurimucosum ATCC 700975] gi|227454276|gb|ACP33029.1| putative membrane protein [Corynebacterium aurimucosum ATCC 700975] Length = 688 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 24/226 (10%), Positives = 60/226 (26%), Gaps = 29/226 (12%) Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN----TTKSKYAPAPAPANRK 209 L V+ + S++ S + A + Sbjct: 30 LSVASVPAFAQNPTGVTPSAEPSSEAAPNGGAANSQSGATMLVLDSSGSMNVQDAGGQTR 89 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT-----PLSNNLNEVKS 264 +D ++ V+ + I A G Q P + + Sbjct: 90 LDAAKDATKKFVSELGGTIPLGLVTYGGTVDEAPENQEAGCQDIHVVSGPKEDVGDSFTG 149 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++ L T ++ A EL ++ ++DG ++ Sbjct: 150 PIDALQAKGYTPIGDSLKKAAEEL-------------GGQHGTIVLVSDGIDT-----CA 191 Query: 325 TLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 ++ + + G+ I ++ + E + L +++G + Sbjct: 192 PPPVCEVAKELHEQGIDLVINTIGFNVDEEARKELSCIAEAAGGEY 237 >gi|237721534|ref|ZP_04552015.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|293369166|ref|ZP_06615760.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|229449330|gb|EEO55121.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|292635749|gb|EFF54247.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] Length = 616 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/327 (10%), Positives = 92/327 (28%), Gaps = 26/327 (7%) Query: 54 TTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEI 113 K ++ + + E G I + +++ + ++ A Y Sbjct: 118 KATKSVSAAYMAVCPTPGIMYDAVNAEEYGQIQENGFKSVSDAPLSTFSIDVDA-ASYSN 176 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 + L P + + + + + + Sbjct: 177 MRRFINKGELPPVDAIRTEELVNYFSYDYPKPTGSDPVKITMEAGACPWNAVHRLVRIGL 236 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 P + + N ++D++ S LVN+++ Sbjct: 237 KAREI------PTDNLPASNLVFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKD----- 285 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 ++ + Y L ++ +++ +++L +T + AY+ Sbjct: 286 ---KVAIVTYAGSAGVKLEATLGSDKQKIREAIDELTAGGSTAGGAGILLAYKIAKKNFI 342 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S+ N +I +DG+ + + L Q+ E R +G+ + + Sbjct: 343 SNGNNR--------IILCSDGDFNVGVSSAEGL--EQLIEKERKSGVFLTVLGYGMGNYK 392 Query: 354 QDLLRKCTDSS-GQFFAVNDSRELLES 379 ++ + G +++ +E Sbjct: 393 DKKIQVLAEKGNGNHAYIDNLQEANRV 419 >gi|127512721|ref|YP_001093918.1| vault protein inter-alpha-trypsin subunit [Shewanella loihica PV-4] gi|126638016|gb|ABO23659.1| Vault protein inter-alpha-trypsin domain protein [Shewanella loihica PV-4] Length = 776 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/159 (20%), Positives = 57/159 (35%), Gaps = 11/159 (6%) Query: 237 RIGTIAYNIGIVGNQCTPLSN---NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 IA++ + LS NL + + L T PA+ A + Sbjct: 438 TFNVIAFDSSVRSLSPVALSATAANLGKANLFVQSLEADGGTEMAPALLRALSQ-PESGV 496 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 SS ++ K V+FITDG + +L L R +++V + A P G Sbjct: 497 SSISSAVKPERLKQVVFITDGAVGN----EASLFALIAANIGRQR---LFTVGIGAAPNG 549 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + R G + V E+ ++ +KI+ + Sbjct: 550 YFMERAARAGRGTYTYVGKISEVDAKIGELLEKIESPQI 588 >gi|305663060|ref|YP_003859348.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] gi|304377629|gb|ADM27468.1| von Willebrand factor type A [Ignisphaera aggregans DSM 17230] Length = 411 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 17/131 (12%), Positives = 47/131 (35%), Gaps = 6/131 (4%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 +++ + + + TNTY + Y ++ + + +I +TDG + Sbjct: 95 DMDRIVKAVAGIKLGGGTNTYGVLEQIYMDIPSVLDRVKKEESDKIPSIRMIMVTDGNPT 154 Query: 318 GASAYQNTLNTLQICEYMRNA-GMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRE 375 ++ + I E + + + + V + LL K + G F +++ + Sbjct: 155 VG--IRDEDRIIDIAERLGKYLSISL-IIGVG-DDYNERLLAKIALKTKGFFEHLDNPAK 210 Query: 376 LLESFDKITDK 386 + + + Sbjct: 211 TSSILENMVSR 221 >gi|301788518|ref|XP_002929675.1| PREDICTED: complement factor B-like [Ailuropoda melanoleuca] gi|281345621|gb|EFB21205.1| hypothetical protein PANDA_019913 [Ailuropoda melanoleuca] Length = 768 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 32/228 (14%), Positives = 73/228 (32%), Gaps = 24/228 (10%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDV-----LIESAGNLVNSIQKAIQEKKNLSV 236 P +K + + S + I + + + + + K V Sbjct: 253 PGEQQKRRIVLDPSGSMNIYLVLDGSDSIGIGNFTRAKNCLRDFIEKVASYGVKPKYGLV 312 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN----KLNPYENTNTYPAMHHAYRELYNEK 292 TI + + + ++ + + ++++ KL TNT A+ Y + Sbjct: 313 TYATIPKVWVKLRDDNSSDADWVTRILNQISYEDHKLKA--GTNTKKALQEVYNMM--SW 368 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGA-------SAYQNTLNTLQICEYMRNAGMKIYSV 345 + G + +I +TDG ++ ++ L + + R + +Y Sbjct: 369 PGNALLEGWNHTRHVIILMTDGLHNMGGDPVSVIHEIRDFLYIGRDRKNPREDYLDVYVF 428 Query: 346 AVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDKIQEQ 390 V P Q+ + F V D L + F ++ D+ + Sbjct: 429 GVG-PLVNQENINALASKKDKEQHVFRVKDMENLEDVFIQMLDETRTL 475 >gi|167590268|ref|ZP_02382656.1| putative transmembrane protein [Burkholderia ubonensis Bu] Length = 377 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 33/285 (11%), Positives = 73/285 (25%), Gaps = 19/285 (6%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAV----------LSGCASIVSDRTIK 51 A+ + + F AID+ +++ RN++Q++ DAA + + S Sbjct: 21 VALCMVILLGFAALAIDIGNLLIARNELQNSADAAAMAGAGCLIRRTACSNTSASQPDWP 80 Query: 52 DPTTKKDQTSTIFKKQ-------IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYI 104 ST + G + + + +Q Sbjct: 81 TADATASAFSTSATTNQVQGTSVQTSTVATGYWNTTGTPYGLESLPFTPGANDLPAVQVT 140 Query: 105 AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY 164 +FL + + + S +T ++ L +S+ M Y Sbjct: 141 IRKDGSNANGAVPIFLGRIFGARILKASAVATAVLSTPGSVGP-GGLFPLAISQCMYTNY 199 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT-KSKYAPAPAPANRKIDVLIESAGNLVNS 223 + + + N ++P + + + + Sbjct: 200 WDSSSQSPKIAPNSGVVPGFSWPNQIAGQPYIFQIGSSYHYGTCSSGQWTTFDVNDNSAG 259 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K + N + + I L NN + N Sbjct: 260 YAKTMLTNGNPNTLTIGASPGTWIQTGTENTLFNNTATCSAAGNG 304 >gi|296232325|ref|XP_002761549.1| PREDICTED: hypothetical protein LOC100408376 [Callithrix jacchus] Length = 912 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 66/190 (34%), Gaps = 18/190 (9%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNE 261 P +D + +++++ + + ++ A + L+ + Sbjct: 54 PYGALVDKVKSFTKRFIDNLRD-RYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDS 112 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +KS ++ + + T T A+ +L SH K++I +TDG Sbjct: 113 LKSSVDAVKYFGKGTYTDCAIKKGLEQLLVG--GSHLKEN-----KYLIVVTDGHPLEGY 165 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF---FAVNDSRELL 377 ++ G+K++SVA+ P + L + + F D + Sbjct: 166 KEPCG-GLEDAVNEAKHLGIKVFSVAI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGQSR 222 Query: 378 ESFDKITDKI 387 ++ + I+ I Sbjct: 223 DAEEAISQTI 232 >gi|21675084|ref|NP_663149.1| hypothetical protein CT2278 [Chlorobium tepidum TLS] gi|21648324|gb|AAM73491.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 332 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 29/134 (21%), Positives = 50/134 (37%), Gaps = 30/134 (22%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T A+ A L S+ +K ++ ITDGEN+ + Sbjct: 178 GTAIGTAILTATNRL----------KASSSKEKALVLITDGENNAG-----EVTPETAAR 222 Query: 334 YMRNAGMKIYSV-------AVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLES 379 N G++IY+V A G+ L S G+ F+ D L++S Sbjct: 223 LAANYGIRIYTVFAGKEARAFENTSNTALNRKGRSELETVARISGGRMFSAGDVFGLMKS 282 Query: 380 FDKITDKIQEQSVR 393 F I D++++ ++ Sbjct: 283 FRDI-DRLEKTRLK 295 >gi|47224105|emb|CAG13025.1| unnamed protein product [Tetraodon nigroviridis] Length = 983 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 21/219 (9%), Positives = 58/219 (26%), Gaps = 18/219 (8%) Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 ++ + + P + S + + + + ++ S Sbjct: 43 YPASPWMDARKTPSKIDLYDVRRRPWYIQGAASPKDMLILVDASGSVSGLTLKLIRTSVT 102 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 ++ ++ + N +K + + TN Sbjct: 103 EMLETLSDDDYVNVVYFNTQVKKTACFDHLVQANVR---NKKLLKDAVQNITAKGITNYT 159 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 + A+ +L S K ++ TDG A A N + Sbjct: 160 KGLEFAFEQLSVTNVSRA------NCNKIIMLFTDGGEERAQAILEKYNA--------DK 205 Query: 339 GMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 ++I++ +V + ++ + G F+ + + Sbjct: 206 KVRIFTFSVGQHNYDKGPIQWMACSNKGYFYEIPSIGAI 244 >gi|114566000|ref|YP_753154.1| Mg-chelatase subunit ChlD-like protein [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114336935|gb|ABI67783.1| Mg-chelatase subunit ChlD-like protein [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 592 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 30/271 (11%), Positives = 81/271 (29%), Gaps = 26/271 (9%) Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 ++P+ + + ++ N + L V ++ N Sbjct: 329 KMPSGGNSRFRKLGQVDQKQTQVEFTNRNKTVNNPDKNWSGDLAVPETIVQAMKNSFLRN 388 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + + K K + + + A K ++A L ++ + + Sbjct: 389 DPHFTIKKEDLHYYDKKSYVPIDVCLLIDASGSMAGDK----RQAACFLAQNLLLSGK-- 442 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 ++ + + P + N N + L+ ++P T + A + Sbjct: 443 ----EKVAVVTFQERS-SEVVVPFTRNQNILNKGLSTISPAGLTPMADGIMTAVNLI--- 494 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNT-LNTLQICEYMRNAGMKIYSVAVSAP 350 + ++ I+DG + + + L+ ++R KI+ + + Sbjct: 495 -------KNNRVRNPLLVLISDGIPNIPLWTLDAQADALEAATHIREN--KIHFICIGLE 545 Query: 351 PEGQDLLRKCTDSSGQFFAVND--SRELLES 379 L + ++ G + V+D L+ Sbjct: 546 SNRFYLEKLSANAGGALYLVDDLNKDNLINI 576 >gi|224171915|ref|XP_002198669.1| PREDICTED: similar to collagen, type XX, alpha 1, partial [Taeniopygia guttata] Length = 152 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 20/124 (16%), Positives = 42/124 (33%), Gaps = 16/124 (12%) Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIG 300 Y+ + +V + L NT T A+ H + + Sbjct: 3 QYSSDPRTEWELSTYSTREQVLEAVRNLRYKGGNTFTGLALTHVLEQ-----NLKPDAGA 57 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 +K VI +TDG++ + + ++N G++I+++ V + LR+ Sbjct: 58 RLEAEKLVILLTDGKSQD--------DANLAAQTLKNLGIEIFAIGV--KNADEAELRQV 107 Query: 361 TDSS 364 Sbjct: 108 ASEP 111 >gi|162424746|gb|ABX90059.1| hedgling [Amphimedon queenslandica] Length = 2416 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/233 (10%), Positives = 72/233 (30%), Gaps = 25/233 (10%) Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 + ++ +L ++ S ++ S + + + Sbjct: 144 TSRNVTRLLKMAVSAGFDWVMYDKKGYARMSVIPDACDTNLDVVFVLD---QSGSIGYYN 200 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 ++ L + K ++G I Y+ ++ + + +R Sbjct: 201 HQLALNFLSKVVEFFKIGANKT---------QVGLITYSTHAYVQFDLNDYHSKSTILNR 251 Query: 266 LNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++++ T T + A + N ++ S + + VI +TDG ++ Sbjct: 252 ISRIYYTGGWTATALGLFQA-GVILNPQQMRGARPISQGVPRVVILLTDGRSNRVPID-- 308 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 ++ + + G+++Y+V V L+ F ++ + Sbjct: 309 -----EVAPSLHDFGIQVYTVGVG--NIYLPELKFIASDPDPYHIFLLDSFSD 354 >gi|72093926|ref|XP_787298.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3, partial [Strongylocentrotus purpuratus] gi|115954441|ref|XP_001184332.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3, partial [Strongylocentrotus purpuratus] Length = 504 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 56/157 (35%), Gaps = 14/157 (8%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 RKID + +++ ++ + L + + +V S+N+ K +N Sbjct: 351 RKIDQTKRAFTTILDDVRPIDRINIVLFESDVRVWRSNQMVEAT----SDNIAAAKRHVN 406 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ TN Y + +A L +I +TDG+ + S + Sbjct: 407 RIRAGGGTNLYDGLRNAVDLLMEHGNGEAMP--------LIIMLTDGQPTSGSVKSTSEI 458 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 +I + + ++SV+ L K + S+ Sbjct: 459 IQRITNLIDGR-LSLFSVSFG-NGVDFSFLEKLSLSN 493 >gi|329954940|ref|ZP_08295957.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] gi|328527044|gb|EGF54055.1| von Willebrand factor type A domain protein [Bacteroides clarus YIT 12056] Length = 453 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 24/188 (12%), Positives = 61/188 (32%), Gaps = 19/188 (10%) Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 N S ++D++ S LVN+++ R+ + Y+ Sbjct: 91 NLPVSNLVFLIDVGAERLDLVKSSLKLLVNNLRDKD--------RVAIVVYSGAAGERLP 142 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + ++ ++K +++L +T + AY+ N +I T Sbjct: 143 STSGSDKQKIKDAIDELEAGGSTAGGAGIELAYKIARKNFVPGGNNR--------IILCT 194 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVN 371 DG+ + + ++ E R G+ + + ++ + G ++ Sbjct: 195 DGDFNVG--ISSGKALEKLIEKERETGVFLTVLGYGMGNYKDGKMQVLAEKGNGNHAYID 252 Query: 372 DSRELLES 379 + +E Sbjct: 253 NLQEANRV 260 >gi|77735553|ref|NP_001029472.1| calcium-activated chloride channel regulator 4 [Bos taurus] gi|74268230|gb|AAI03388.1| Chloride channel accessory 4 [Bos taurus] gi|296489202|gb|DAA31315.1| chloride channel accessory 4 [Bos taurus] Length = 933 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 17/133 (12%), Positives = 46/133 (34%), Gaps = 24/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + ++ + N ++ +TDGE+S Sbjct: 380 ASGGTSICSGIESGFQAIRNA--------DFQIDGSEIVLLTDGEDST---------AKS 422 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKITDK- 386 E ++ +G I+ +A+ + G++ +D + L+++F + + Sbjct: 423 CIEKVKQSGAIIHFIALGPDAAQAVKEMSIA-TGGKYIYASDEGQNNGLIDAFAALASEN 481 Query: 387 --IQEQSVRIAPN 397 +QS+++ Sbjct: 482 IDASQQSLQLESK 494 >gi|29375667|ref|NP_814821.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis V583] gi|29343128|gb|AAO80891.1| von Willebrand factor type A domain protein [Enterococcus faecalis V583] Length = 1103 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 151 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 208 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 209 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMVLKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|295106190|emb|CBL03733.1| hypothetical protein [Gordonibacter pamelaeae 7-10-1-b] Length = 929 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 21/240 (8%), Positives = 58/240 (24%), Gaps = 49/240 (20%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + + + + I ++ + + I A + L V + Y Sbjct: 271 DAANSFVQSVRSNSLGEDGIAGTVDDVPHRIAVIGFASGDNTELFVGSASYPYGQSAQSQ 330 Query: 251 QCTPLSN-----NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + + + + + +L+ T + A + Sbjct: 331 YGSAFQDMTTEMGFDNALASIGQLSADGGTLVDDGLDMANGVFSANPLVTGELRN----- 385 Query: 306 KFVIFITDGENS-GASAYQNTLNTLQICEYMRNAGMKIYSVAVSA--------------- 349 + + +TDG + + ++ AG ++S+ + Sbjct: 386 RVTVVLTDGAPGLYGNDRGVANEAISQASELKTAGSTVFSIGIFPGADASGDLPDQSAMG 445 Query: 350 --PPEGQDLLRKCT---------------------DSSGQFFAVNDSRELLESFDKITDK 386 + + + S + + +DS L F I+ Sbjct: 446 WGDNDSNRFMHLLSSNYPDASSMGSPGARFVDEEGSSPDYYLSASDSAGLNSIFQSISQS 505 >gi|57163755|ref|NP_001009219.1| integrin alpha-E [Felis catus] gi|20530606|gb|AAM27173.1|AF420018_1 alpha E integrin [Felis catus] Length = 1160 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 49/155 (31%), Gaps = 14/155 (9%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + Y I S + +R+ + +N T T A+ H ++ S Sbjct: 232 LVQYGEVIQTEFDLRDSQDALASLARVQNITQVKNVTKTASAIQHVLDNIFTPSHGSR-- 289 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ-DLL 357 K ++ ITDG+ + + LN + + G++ +++ V L Sbjct: 290 ---KNAPKVIVVITDGD-----IFGDPLNLTTVISSPKMQGVERFAIRVGNESTKTLKEL 341 Query: 358 RKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + F V + L K+ I Sbjct: 342 KLIASDPFERHAFTVTNYSALDGLLSKLQQNIIHT 376 >gi|120599090|ref|YP_963664.1| cell wall anchor domain-containing protein [Shewanella sp. W3-18-1] gi|120559183|gb|ABM25110.1| LPXTG-motif cell wall anchor domain [Shewanella sp. W3-18-1] Length = 757 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 56/159 (35%), Gaps = 16/159 (10%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEV---KSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 I +N + T L N + + +++L T A++ A Sbjct: 413 FNIIEFNSELTQLSPTSLPANQTHLARARQFIHRLQADGGTEMALALNAAL------PRG 466 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + S++ + VIF+TDG A + + ++++V + + P Sbjct: 467 INRLSESSQSLRQVIFMTDGSVGNEQALFDLIRYQIGES-------RLFTVGIGSAPNSH 519 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + R G F + + E+ + K+ KIQ + Sbjct: 520 FMQRAAELGRGTFTYIGNVDEVEQKISKLLSKIQYPVLT 558 >gi|3560547|gb|AAC35003.1| chloride channel CaCC [Mus musculus] Length = 901 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 22/150 (14%), Positives = 61/150 (40%), Gaps = 25/150 (16%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S++ ++ + L + T+ + ++ + + + ++ +T Sbjct: 364 ITSSSDYQKITANLPQ-QASGGTSICHGLQAGFQAITSS--------DQSTSGSEIVLLT 414 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 DGE++G + E + +G I+++A+ + L +D + G F N Sbjct: 415 DGEDNGIRSCF---------EAVSRSGAIIHTIALGPSAARE--LETLSDMTGGLRFYAN 463 Query: 372 -DSRELLESFDKITD---KIQEQSVRIAPN 397 D L+++F +I+ + +Q++++ Sbjct: 464 KDLNSLIDAFSRISSTSGSVSQQALQLESK 493 >gi|148680077|gb|EDL12024.1| mCG3350, isoform CRA_d [Mus musculus] Length = 902 Score = 55.7 bits (132), Expect = 1e-05, Method: Composition-based stats. Identities = 22/150 (14%), Positives = 61/150 (40%), Gaps = 25/150 (16%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S++ ++ + L + T+ + ++ + + + ++ +T Sbjct: 364 ITSSSDYQKITANLPQ-QASGGTSICHGLQAGFQAITSS--------DQSTSGSEIVLLT 414 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 DGE++G + E + +G I+++A+ + L +D + G F N Sbjct: 415 DGEDNGIRSCF---------EAVSRSGAIIHTIALGPSAARE--LETLSDMTGGLRFYAN 463 Query: 372 -DSRELLESFDKITD---KIQEQSVRIAPN 397 D L+++F +I+ + +Q++++ Sbjct: 464 KDLNSLIDAFSRISSTSGSVSQQALQLESK 493 >gi|213964310|ref|ZP_03392536.1| BatB protein [Capnocytophaga sputigena Capno] gi|213953052|gb|EEB64408.1| BatB protein [Capnocytophaga sputigena Capno] Length = 345 Score = 55.3 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 38/286 (13%), Positives = 90/286 (31%), Gaps = 35/286 (12%) Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 IP +F ++ S ++R N + V + +++ Sbjct: 17 IPLFIIFAFSMLRKRRLQKQFASPAALKRLVPNRSRFKLWV--KWSVLAVVFILLSIALA 74 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 N + + SK A A +++ A ++ ++ Sbjct: 75 NPKIGTKIETVKREGVDIVFAIDVSKSMLAEDVAPNRLEKAKRIAFETISQLK------- 127 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 R+G +AY L+ + + K L +N ++ A+ A R N Sbjct: 128 --GDRVGIVAYAASAYPQ--LALTTDHSAAKMFLQGMNTDMLSSQGTAIQEAIRMASNYF 183 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + T + + ++DGE+ + + +I + G+ IY++ + Sbjct: 184 D------DKTPTARLLFILSDGED-------HEMGATEIASEAQEKGVHIYTIGIGTEKG 230 Query: 353 GQDLLRKCTDSSGQFFAVNDSREL------LESFDKITDKIQEQSV 392 ++ + GQ + + + E+ E +I + + Sbjct: 231 APIPIK---EGGGQTYKRDSNGEVVITKLNSELLQQIASNAGGEYL 273 >gi|149620125|ref|XP_001521750.1| PREDICTED: similar to Collagen, type VI, alpha 1 [Ornithorhynchus anatinus] Length = 1163 Score = 55.3 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 44/365 (12%), Positives = 95/365 (26%), Gaps = 33/365 (9%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 S V+ R I+D + + I + G +I I Sbjct: 814 VSMSDVNIRNIRDLKEAVKKLNWIAGGTFTGEALD--FASNTLGRPDGNQRIAIVITDGR 871 Query: 100 PLQYI----AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 + Q + + + L + + + Sbjct: 872 SDTKRDPSPLNALCQTGAQVVAVGINDIFSKPSNQEGLSAVTCEAPAPGQKKGLSISKEN 931 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + ++D + N + + K F S + + + + + Sbjct: 932 YAELLDDAF--LKNITSQICIEKKCPDYTCPIKFTSSSDITILVDSSTSVGSHNFKTSKQ 989 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYEN 274 L A + VR+ + Y+ N E+ S ++ + + Sbjct: 990 FVKRLAERFLSADKT-DTADVRVSVVQYSGRNQQKLEAQFLQNYTEIASIIDDMEFINDA 1048 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T+ A+ + +KK ++ +DG + G + Sbjct: 1049 TDVNAAIRYVTTLYQKSSPRG--------VKKRLLLFSDGNSQG----ITGKAIEAAVQE 1096 Query: 335 MRNAGMKIYSVAVSAPPE---------GQDLLRKCTDSSGQFFAVNDSRELLE--SFDKI 383 + AG++IY + V G+ F V D LL+ + + Sbjct: 1097 AQRAGIEIYVLVVGRHANEPNIRVLVTGKTTEYDVAYGERHLFRVPDYHALLQGVFYQTV 1156 Query: 384 TDKIQ 388 + KI Sbjct: 1157 SRKIS 1161 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/165 (16%), Positives = 55/165 (33%), Gaps = 17/165 (10%) Query: 238 IGTIAYNIGIVGNQCTPLS---NNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 A + L+ + +K R+ + T+T A+ EL Sbjct: 234 WNAGALHYSDSVELIQGLTRMPSGQKNLKDRVEAVQYIGKGTHTDCAIKRGIEELL---- 289 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + K++I +TDG ++ G+K++S+A+S Sbjct: 290 ---IGGSHQKENKYLIVVTDGHPLEGYKEPCG-GLEDAVNEAKHLGIKVFSIAISPGHLE 345 Query: 354 QDLLRKCTDSSGQF---FAVNDSRELLESFDKITDKIQEQSVRIA 395 L + + F VN + ES D + + I++ ++ Sbjct: 346 PR-LSIIA-TDHTYRRNFTVNSEENVDESIDTMINMIKKNVEQVC 388 >gi|219848048|ref|YP_002462481.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219542307|gb|ACL24045.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 446 Score = 55.3 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 55/164 (33%), Gaps = 20/164 (12%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELY 289 R+ IA + + + ++ + +L E+TN + A + Sbjct: 114 TPDDRMALIACASDALVLAPSTPGHRRTDLIGAIARLPVLRLGESTNLAQGLQLALAQFV 173 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 E + + V+ +TDG + +T + + + I ++ + Sbjct: 174 VTDEPA---------VRRVVLLTDGFTT------DTTMCTALAREAADRSITISTIGLG- 217 Query: 350 PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + LL + D S G+ V ++ + + ++ ++ Sbjct: 218 NTFEETLLTQIADLSGGRASFVQEAGHIPTIISAELEHARQTTI 261 >gi|146292839|ref|YP_001183263.1| cell wall anchor domain-containing protein [Shewanella putrefaciens CN-32] gi|145564529|gb|ABP75464.1| LPXTG-motif cell wall anchor domain [Shewanella putrefaciens CN-32] Length = 757 Score = 55.3 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 27/159 (16%), Positives = 56/159 (35%), Gaps = 16/159 (10%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEV---KSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 I +N + T L N + + +++L T A++ A Sbjct: 413 FNIIEFNSELTQLSPTSLPANQTHLARARQFIHRLQADGGTEMSLALNAAL------PRG 466 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + S++ + VIF+TDG A + + ++++V + + P Sbjct: 467 INRLSESSQSLRQVIFMTDGSVGNEQALFDLIRYQIGES-------RLFTVGIGSAPNSH 519 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + R G F + + E+ + K+ KIQ + Sbjct: 520 FMQRAAELGRGTFTYIGNVDEVEQKISKLLSKIQYPVLT 558 >gi|74209191|dbj|BAE24978.1| unnamed protein product [Mus musculus] Length = 902 Score = 55.3 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 22/150 (14%), Positives = 61/150 (40%), Gaps = 25/150 (16%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S++ ++ + L + T+ + ++ + + + ++ +T Sbjct: 364 ITSSSDYQKITANLPQ-QASGGTSICHGLQAGFQAITSS--------DQSTSGSEIVLLT 414 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 DGE++G + E + +G I+++A+ + L +D + G F N Sbjct: 415 DGEDNGIRSCF---------EAVSRSGAIIHTIALGPSAARE--LETLSDMTGGLRFYAN 463 Query: 372 -DSRELLESFDKITD---KIQEQSVRIAPN 397 D L+++F +I+ + +Q++++ Sbjct: 464 KDLNSLIDAFSRISSTSGSVSQQALQLESK 493 >gi|89100236|ref|ZP_01173103.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89085086|gb|EAR64220.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 476 Score = 55.3 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 19/189 (10%), Positives = 55/189 (29%), Gaps = 24/189 (12%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLS----VRIGTIAYNIGIVGNQCTPL-SNN 258 +K+++ + + +I + + + ++ PL S N Sbjct: 187 VDGKQKMEIAKSAVRSFAKTIGEENDVSLYVYGHAGTQEDKDKQISCTTIDEVYPLQSYN 246 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + + T A+ A + ++DG + Sbjct: 247 EESFFKAVEGVEAKGWTPLAGAIKAARE-----------ASMDYEGDITLYIVSDGAETC 295 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELL 377 R + + +A + +D L+K ++ G++ +++ +L Sbjct: 296 DGNPVEEARLFAETNESRMVNI----IGFNADAKAEDQLKKVAEAGKGEYIGADNADQLN 351 Query: 378 ESFDKITDK 386 I+++ Sbjct: 352 ---SSISNE 357 >gi|125973772|ref|YP_001037682.1| carbohydrate-binding, CenC-like protein [Clostridium thermocellum ATCC 27405] gi|125713997|gb|ABN52489.1| Carbohydrate-binding, CenC-like protein [Clostridium thermocellum ATCC 27405] Length = 1050 Score = 55.3 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 40/290 (13%), Positives = 78/290 (26%), Gaps = 30/290 (10%) Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ S+ N + S + + N N + L Sbjct: 646 SIGGNIEFQTSNSTINGIVYAPGNPANPNSGKIFFSGDKNTINGSIAANELDF---FAGG 702 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 N T+ ++ K L N + K K ++V + + N Sbjct: 703 LVVNHTEGQFDTVEEKYIDKSTYLKLVKDAAKNFVDKFAGSKTKMAVIQYSDSANDNDFK 762 Query: 250 NQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKESSHNTI-------- 299 L + +K ++K+ P +N M AY L + Sbjct: 763 KYDLSLPDKGAALKETIDKIKPGTSGLSNMGDGMRRAYHILNGPPPKGQISKYIVVITGS 822 Query: 300 ---------GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 K D + +Y + + + + G+ + + S Sbjct: 823 VPNRWTAVDNKKNEPKTDNGRADFIKADNESYNSLDYAKDMGRIITSKGINLVFIDFSEE 882 Query: 351 PEGQDLLRKCTDSSG--------QFFAVNDSRELLESFDKITDKIQEQSV 392 G L +S ++ N+ ELL+ + +T KI V Sbjct: 883 DIGDVLEEIAAESGAKPLEGTDRHYYKANNFLELLDILNNMTLKIYYDVV 932 >gi|322697455|gb|EFY89235.1| U-box domain-containing protein [Metarhizium acridum CQMa 102] Length = 757 Score = 55.3 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 72/220 (32%), Gaps = 31/220 (14%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + P + + S AP P + L +L I E N R+G Sbjct: 49 IEHVPCDIVLVLDVSTSMEDDAPVPGEAEETGLT--VLDLTKHAALTIIETLNEKDRLGI 106 Query: 241 IAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 +++ Q + N E + ++ L+P +TN + + + E+ + Sbjct: 107 VSFATNSTIVQTLTHMDISNKEEARRKIKALDPNGSTNLWHGIRDGIQAFEQSSENGNI- 165 Query: 299 IGSTRLKKFVIFITDGENSG--------ASAYQNTLNTLQIC---------EYMRNAGMK 341 + ++ +TDG + + +++ N++ C + + Sbjct: 166 -------RAMMVLTDGMPNHMCVSPLLLWAHREHSSNSVDRCPQQGYIPKLKTLSRLPAT 218 Query: 342 IYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESF 380 I++ LL+ + G + + D+ + F Sbjct: 219 IHTFGFGYGLRS-GLLKSLAEYGHGNYAFIPDAGMIGTVF 257 >gi|149637338|ref|XP_001510478.1| PREDICTED: similar to ITI-like protein [Ornithorhynchus anatinus] Length = 1374 Score = 55.3 bits (131), Expect = 1e-05, Method: Composition-based stats. Identities = 38/308 (12%), Positives = 91/308 (29%), Gaps = 37/308 (12%) Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 E + + + L S L +LR I S++ C + + + ++ Sbjct: 211 SVEVTISERTGIDYIHVLPLRTSRLLTNTLRGEADIPPSTKIEKGEKCARIIFTPTPQEQ 270 Query: 164 YLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS 223 + +Y K + AP R + + ++ +++ Sbjct: 271 AAYSSSGIMGDFVVQY--DVSMKDIIGDVQIYNGYFVHYFAP--RGLPPVQKNVVFVIDV 326 Query: 224 IQKAIQEKKNLSVRIGTIAYNI--GIVGNQCTPLSN----------------NLNEVKSR 265 K + + + N S+ N+ K Sbjct: 327 SGSMFGTKMKQTKKAMHVILNDLHHDDYFNIVTFSDAVSVWKASGSIQATPPNIKSAKVY 386 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +NK+ T+ A+ A + G ++ +IF+TDGE + Sbjct: 387 VNKMEADGWTDINAALLVAASVFNQSTGETGRGKGLKKIP-LIIFLTDGEATAGVT---- 441 Query: 326 LNTLQICEYMRNA---GMKIYSVAVSAPPEGQDLLRKCTDSSGQ-----FFAVNDSRELL 377 +I + + + ++ +A L+R+ + + + + + +L Sbjct: 442 -VASRILSNAKQSLKGNISLFGLAFG-DDADYHLMRRLSLENRGVARRIYEDADATLQLK 499 Query: 378 ESFDKITD 385 +D+I Sbjct: 500 GFYDEIAS 507 >gi|163758555|ref|ZP_02165642.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] gi|162283845|gb|EDQ34129.1| von Willebrand factor type A domain protein [Hoeflea phototrophica DFL-43] Length = 549 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 24/193 (12%), Positives = 58/193 (30%), Gaps = 28/193 (14%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 S KI + E +L+ + ++ + V + Sbjct: 22 GSNSMWGQVDGEAKITIAKEVMTDLITNWDDSVDLGLTVYGHRRKGDCADIEVVAMPGKV 81 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + ++ + P T + A + G K V+ ++DG Sbjct: 82 --DRQALIDKVQSITPRGKTPISKTLSLAALSV-----------GFFSGKSSVVLVSDGL 128 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMK-----IYSVAVSAPPEGQDLLRKCTD-SSGQFFA 369 + + C ++ G+ ++ + E L+ + G+FF Sbjct: 129 ETCNADP---------CAQAKSLGIINPGFDVHVIGFDVTEEEFKSLQCIATETGGKFFR 179 Query: 370 VNDSRELLESFDK 382 N++ EL ++ + Sbjct: 180 ANNAEELKDALRQ 192 >gi|32964827|ref|NP_034029.2| chloride channel calcium activated 1 [Mus musculus] gi|3925281|gb|AAC79982.1| calcium-sensitive chloride conductance protein-1 [Mus musculus] gi|74183411|dbj|BAE36582.1| unnamed protein product [Mus musculus] gi|124376304|gb|AAI32343.1| Chloride channel calcium activated 1 [Mus musculus] Length = 902 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 22/150 (14%), Positives = 61/150 (40%), Gaps = 25/150 (16%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S++ ++ + L + T+ + ++ + + + ++ +T Sbjct: 364 ITSSSDYQKITANLPQ-QASGGTSICHGLQAGFQAITSS--------DQSTSGSEIVLLT 414 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 DGE++G + E + +G I+++A+ + L +D + G F N Sbjct: 415 DGEDNGIRSCF---------EAVSRSGAIIHTIALGPSAARE--LETLSDMTGGLRFYAN 463 Query: 372 -DSRELLESFDKITD---KIQEQSVRIAPN 397 D L+++F +I+ + +Q++++ Sbjct: 464 KDLNSLIDAFSRISSTSGSVSQQALQLESK 493 >gi|224051386|ref|XP_002199700.1| PREDICTED: coagulation factor C homolog, cochlin [Taeniopygia guttata] Length = 565 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 48/371 (12%), Positives = 98/371 (26%), Gaps = 36/371 (9%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 AA+ + S+ + R KK K K + N G Q N Sbjct: 214 AALEAVGRSVSTARPSTGKRPKKTLDKKAGNKDCKADIAFLIDGSYNIGQRRFNLQKNFV 273 Query: 95 KD-----KNNPLQYIAESKAQYEIPTENLFLKGLIPS-ALTNLSLRSTGIIERSSENLAI 148 E P +LK + + S+ + Sbjct: 274 GKVAVMLGIGTEGPHVGVVQASEHPKIEFYLKNFTATKEVLFAIKELGFRGGNSNTVIDF 333 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP----PPKKSFWSKNTTKSKYAPAPA 204 S+ + + + + L K + S N + Sbjct: 334 SLAVCRNNGFFSYQMPTWFGTTKYVKPLVQKLCSHEQMLCSKTCYNSVNIGFLIDGSSSI 393 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L V+++ KA + ++ ++ + + +V S Sbjct: 394 GEINFRLML-----EFVSNVAKAFE-ISDIGSKVAAVQFTYNQRKEFGFTDHVTKEKVLS 447 Query: 265 RLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++ + T T A+ R ++ + N F+I +TDG++ Sbjct: 448 AIHNIQYMSGGTATGDAISFTTRTVFGPVKDGPNK-------NFLIVLTDGQSYDDVTGP 500 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFD 381 G+ ++SV V+ P L++ F + L + Sbjct: 501 AAAAKKA--------GITVFSVGVAWAPLDD--LKEMASEPRESHTFFTREFTGLEQMVP 550 Query: 382 KITDKIQEQSV 392 I I + + Sbjct: 551 DIIRGICKDYL 561 >gi|161408065|dbj|BAF94136.1| Dual Intracellular Von Willebrand factor domain A [Homo sapiens] Length = 276 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 21/114 (18%), Positives = 42/114 (36%), Gaps = 12/114 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + VR+G YN I + + ++ L TNT A+ Sbjct: 156 DISSDRVRVGLAQYNDNIYPAFQLNQHPLKSMILEQIQNLPYRTGGTNTGSALEFIRTNY 215 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 E+ S R+ + VI +TDGE++ ++ + ++ G+ + Sbjct: 216 LTEESGSRAK---DRVPQIVILVTDGESND--------EVQEVADRLKEDGVVV 258 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 18/113 (15%), Positives = 41/113 (36%), Gaps = 12/113 (10%) Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 L + + + + S + + + I+ G + +R AG+ +Y++ Sbjct: 8 ILDHHFQEASGSRASQEVPQIAVVISSGPVEDHVHGP--------AKALRKAGILLYAIG 59 Query: 347 VSAPPEGQDLLRKCTDSSGQFF--AVNDSRELLESFDKITDKIQEQSVRIAPN 397 V LR+ S + F V + L K+ ++ + + AP+ Sbjct: 60 V--RDAVWAELREIASSPQENFTSFVPNFSGLSNLAQKLRQELCDTLAKAAPH 110 >gi|307196324|gb|EFN77937.1| Sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1 [Harpegnathos saltator] Length = 2255 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/126 (15%), Positives = 42/126 (33%), Gaps = 24/126 (19%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T T A+ A L +E + K V ITDG ++G + Sbjct: 149 SGGGTYTRGALLEALAILEKSREEA---------SKVVFLITDGFSNGG-------DPRP 192 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQ 388 ++NAG +++ + + L G + ++ E F+ + + Sbjct: 193 AANLLKNAGATVFTFGIRTG--NVEELHDIASFPGYTHSYLLDSFAE----FEALARRAL 246 Query: 389 EQSVRI 394 + ++ Sbjct: 247 HRDLKT 252 >gi|294055226|ref|YP_003548884.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293614559|gb|ADE54714.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 330 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/140 (18%), Positives = 50/140 (35%), Gaps = 25/140 (17%) Query: 254 PLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P +++ + L++ +T ++ A S + +I Sbjct: 155 PFTDDHETWLALLDESIVNMAGPSTALGDSIGLAIAHF----------RQSKTENRVLIV 204 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCT-DS 363 +TDG ++G+ L E + G+ IY+VAV P + L + Sbjct: 205 LTDGNDTGSRVP-----PLDAAEVAKVEGVTIYTVAVGDPTTVGEEALDMETLETVARLT 259 Query: 364 SGQFFAVNDSRELLESFDKI 383 G F +D L E++ +I Sbjct: 260 GGDSFVASDLVALRETYQRI 279 >gi|168698108|ref|ZP_02730385.1| protein containing a von Willebrand factor type A domain [Gemmata obscuriglobus UQM 2246] Length = 821 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 19/185 (10%), Positives = 55/185 (29%), Gaps = 17/185 (9%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K+ ++ ++ +Q ++ + ++ L+ ++ Sbjct: 287 KMQQAKKAVKFCLSQLQ-----PEDRFGVVRFSTTVTKFRSELVAANTDYLDLATKWIDG 341 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L T +PA++ A + S +F TDG+ + + + Sbjct: 342 LKTSGGTAIWPALNDALAM--RSSDPSRPFTM--------VFFTDGQPTVDETNADKIVK 391 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + + +I++ V L + + V ++ ++ + KI Sbjct: 392 NVLAKNTG--NTRIFTFGVGDDVNAAMLDQLADSTRAVSTYVREAEDIEVKVSGLYAKIS 449 Query: 389 EQSVR 393 + Sbjct: 450 NPVLT 454 >gi|227819319|ref|YP_002823290.1| hypothetical protein NGR_b10840 [Sinorhizobium fredii NGR234] gi|227338318|gb|ACP22537.1| conserved hypothetical protein [Sinorhizobium fredii NGR234] Length = 533 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/272 (11%), Positives = 67/272 (24%), Gaps = 27/272 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +T I + + F ID+ + +Q+A+DA L+G + + Sbjct: 21 LTLISMPLLLGFSLLVIDVGRTGNLHTDLQNAVDAMALAGARELDGRDDAITRADAAIEA 80 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 L + ++ + I + D N EIP ++ Sbjct: 81 -----------LANSAAFGGGGNGMSLGSHITVAYDAGNAAGSTVAVTYLKEIPADD--- 126 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 +T + + +A M + + + + Sbjct: 127 ----DDPITASMETTDPNEASYAWVIANDQAMTTIFPVPVGFNRDTINVAAEAVAVYRSS 182 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 T P AP N +A L + + + + + Sbjct: 183 ACD---------VTPIYICNPFEAPGNSTEAGNQTAADALHENFAAGNLYGRQIELHSTS 233 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + N N + L +P Sbjct: 234 SSAPGPGNFGFLATYGNGANVLAEALATGSPG 265 >gi|118364357|ref|XP_001015400.1| Helicase conserved C-terminal domain containing protein [Tetrahymena thermophila] gi|89297167|gb|EAR95155.1| Helicase conserved C-terminal domain containing protein [Tetrahymena thermophila SB210] Length = 2720 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/210 (10%), Positives = 64/210 (30%), Gaps = 24/210 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN------QCTPLS 256 + I + + + EK + + Q + Sbjct: 2497 TGSMSNLITQTKNTIQTTFEQTRDILTEKGYDPQCFQIMISCFRSYNSKWEEIFQTSTWE 2556 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE- 315 NN ++++S L K+ T ++ + + + + + + + Sbjct: 2557 NNPDKLRSFLQKITASGGTWPGESVEVGLWWANKQNDEDPISQVIILGDQPAHLMNEAQK 2616 Query: 316 --NSGASAYQNTLNTLQI------CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSG- 365 N +Y +T + C+ ++N + + + + + ++G Sbjct: 2617 FRNQFGQSYWDTTPLKGLTYYVPECQKLKNKNIPVNTFYLKQGA--KSTFENIAKLTNGI 2674 Query: 366 -QFFAVNDSR---ELLESF-DKITDKIQEQ 390 Q+ +N + +L F ++I I +Q Sbjct: 2675 SQYLDINSGQSSKQLTTLFVEQILKDIGKQ 2704 >gi|91216720|ref|ZP_01253685.1| aerotolerance-related membrane protein [Psychroflexus torquis ATCC 700755] gi|91185189|gb|EAS71567.1| aerotolerance-related membrane protein [Psychroflexus torquis ATCC 700755] Length = 349 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 53/163 (32%), Gaps = 42/163 (25%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 P++ + K+ L +N ++ A+ A + + K +I +++ Sbjct: 145 PITTDYASTKTFLQSMNTDMVSSQGTAISQAIDLAKSYYN------DDDQTNKVLIILSE 198 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA-------------------PPEGQ 354 GE+ + N + E G+KIY++ V G+ Sbjct: 199 GEDHDS-------NVESMAETAAAEGIKIYTIGVGTERGDPIPIKKEGRIQSYLKDDNGE 251 Query: 355 --------DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + L+K + + D + D I ++Q+ Sbjct: 252 IVITKRDTETLQKIAEIGNGAYI--DGTNTSNAVDDILKELQK 292 >gi|299529294|ref|ZP_07042734.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni S44] gi|298722738|gb|EFI63655.1| ferredoxin-dependent glutamate synthase [Comamonas testosteroni S44] Length = 1449 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/316 (11%), Positives = 83/316 (26%), Gaps = 16/316 (5%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ +I + + T++ K + + +A +T Sbjct: 601 LAAPGAISVSEALASGSGTFTITASAGLKSLTLDGPDNADATLTLDRLADLVNNPVTLT- 659 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 + + + + + + D+ Sbjct: 660 --TSKGTLTLTGYDATTGKVSYTYQTSGQQAHTGDDTNVQDHFQITVEDKFGGKATGDLG 717 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + D + + + S ++ VL S Sbjct: 718 VLITDTAPSLKPIAESSALSSHGTNIMLTLDTSGSMNYGSG-VYNGWTQLSRLAVLKSSV 776 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 NL++ +A VR+ + +N +S L E K+ ++ L TN Sbjct: 777 NNLLDKYGEA------GDVRVMIVEFNTSASQKGGGWMS--LAEAKALVSGLGYGGGTNY 828 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ A N + ++ F TDGE ++ N+ +++ + Sbjct: 829 QTALDTAMNAWNNSGTG---KLEGGNVQNISYFFTDGEP-DSNRSVNSAQQATWEKFLAD 884 Query: 338 AGMKIYSVAVSAPPEG 353 + Y + + G Sbjct: 885 NHINSYGIGLGTGATG 900 >gi|55378019|ref|YP_135869.1| hypothetical protein rrnAC1219 [Haloarcula marismortui ATCC 43049] gi|55230744|gb|AAV46163.1| unknown [Haloarcula marismortui ATCC 43049] Length = 788 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 46/138 (33%), Gaps = 20/138 (14%) Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 ++ L N E ++ +L T+ + A L + Sbjct: 417 NQNAYRVSEMQALGQNRAETADKIRRLESGGATDIAVGLQGADELLDDR----------- 465 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 + +I ++DG++ + + G ++ SV V + + ++ Sbjct: 466 --EGTIILLSDGQDRLGP-------PAAVANQLGREGTRVVSVGVGKRVGVATMRQIASE 516 Query: 363 SSGQFFAVNDSRELLESF 380 S G +FA +++ L F Sbjct: 517 SGGSYFAADETERLRLLF 534 >gi|309355882|emb|CAP38139.2| hypothetical protein CBG_21291 [Caenorhabditis briggsae AF16] Length = 430 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 27/166 (16%), Positives = 56/166 (33%), Gaps = 26/166 (15%) Query: 235 SVRIGTIAYNI--GIVGNQCTPLSNNLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNE 291 ++ + + + EV ++KL T + A ++ Sbjct: 279 YTQVAAVTFATVGRTRVRFNLKKYSTQEEVLRGIDKLQSKGGTTAIGAGIEKALTQIDES 338 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV---- 347 + + K +I TDG ++ + + NAG ++Y+VA Sbjct: 339 EGARPGIAT-----KVMIVFTDGWSNKG------PDPEKRARDAVNAGFEMYTVAYTARA 387 Query: 348 -SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + + L + SSG F D F + DKI+++++ Sbjct: 388 PGSVTLNNETLSAISGSSGHAF--TDVT-----FQTLVDKIKQRNL 426 >gi|219518504|gb|AAI45058.1| Chloride channel calcium activated 1 [Mus musculus] Length = 902 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 22/150 (14%), Positives = 61/150 (40%), Gaps = 25/150 (16%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S++ ++ + L + T+ + ++ + + + ++ +T Sbjct: 364 ITSSSDYQKITANLPQ-QASGGTSICHGLQAGFQAITSS--------DQSTSGSEIVLLT 414 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 DGE++G + E + +G I+++A+ + L +D + G F N Sbjct: 415 DGEDNGIRSCF---------EAVSRSGAIIHTIALGPSAARE--LETLSDMTGGLRFYAN 463 Query: 372 -DSRELLESFDKITD---KIQEQSVRIAPN 397 D L+++F +I+ + +Q++++ Sbjct: 464 KDLNSLIDAFSRISSTSGSVSQQALQLESK 493 >gi|160858155|emb|CAP19997.1| collagen type VI alpha 5 precursor [Homo sapiens] Length = 591 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 16/138 (11%), Positives = 50/138 (36%), Gaps = 16/138 (11%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHA 284 + VR+G + Y+ +N +++ + + T T A+ + Sbjct: 469 TEMFSIGPDKVRVGVVQYSDDTEVEFYITDYSNDIDLRKAIFNIKQLTGGTYTGKALDYI 528 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + + N + + + ++I +TDG ++ ++ + +R + +++ Sbjct: 529 LQIIKNGMKDRMSK-----VPCYLIVLTDGMSTD--------RVVEPAKRLRAEQITVHA 575 Query: 345 VAVSAPPEGQDLLRKCTD 362 V + + L++ Sbjct: 576 VGIG--AANKIELQEIAG 591 >gi|302405156|ref|XP_003000415.1| U-box domain containing protein [Verticillium albo-atrum VaMs.102] gi|261361072|gb|EEY23500.1| U-box domain containing protein [Verticillium albo-atrum VaMs.102] Length = 662 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 41/128 (32%), Gaps = 18/128 (14%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N K ++ L P TN + + + + SS ++ +TDG Sbjct: 166 DNKVSAKDKIENLQPLNGTNMWHGITEGIKLFSDCDSSSGRVP-------AMMVLTDGLP 218 Query: 317 SGASAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVND 372 + L +R+ G I++ LL+ + G + + D Sbjct: 219 NSGCPR------LGYIPKLRDMGQLPATIHTFGFGY-HIRSGLLKSIAEIGGGNYAFIPD 271 Query: 373 SRELLESF 380 + + F Sbjct: 272 AGMIGTVF 279 >gi|167752251|ref|ZP_02424378.1| hypothetical protein ALIPUT_00494 [Alistipes putredinis DSM 17216] gi|167660492|gb|EDS04622.1| hypothetical protein ALIPUT_00494 [Alistipes putredinis DSM 17216] Length = 344 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 29/222 (13%), Positives = 70/222 (31%), Gaps = 54/222 (24%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 S A +++ + L + +++ R+G + + V Sbjct: 94 LVVDVSNSMLAEDFQPNRLERTKYAIDKLFDGLKQ---------DRVGLVVFAGDAVVQ- 143 Query: 252 CTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P++++ K+ +++P T+ A+ A + ++ + Sbjct: 144 -LPITSDYRMAKAFARRISPSMVSVQGTDIGQALSLATMSFSEKGDNPAG--------RV 194 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--------------- 352 ++ ITDGE + A + + G++I+++ + P Sbjct: 195 IVLITDGEGHDSGAIEAAERAAE-------QGIRIFTIGIGTPEGAPIQIGGEFIKDDKG 247 Query: 353 -------GQDLLRKCT-DSSGQFFAVNDSR-ELLESFDKITD 385 G+ LL K + G + + L E I + Sbjct: 248 EMVVSKLGEPLLEKIAQATDGAYIRSTNQSIGLDEIVRTINN 289 >gi|114557515|ref|XP_524757.2| PREDICTED: calcium-activated chloride channel regulator 4 isoform 2 [Pan troglodytes] Length = 919 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 56/133 (42%), Gaps = 24/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T+ + +A++ + + S V+ +TDGE++ AS+ Sbjct: 379 PRGGTSICSGIKYAFQVI--------GELHSQLDGSEVLLLTDGEDNTASSC-------- 422 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKI---T 384 + ++ +G ++ +A+ + ++ + G F V+D + L+++F + Sbjct: 423 -IDEVKQSGAIVHFIALGRDA-DEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGN 480 Query: 385 DKIQEQSVRIAPN 397 + ++S+++ Sbjct: 481 TDLSQKSLQLESK 493 >gi|326430897|gb|EGD76467.1| hypothetical protein PTSG_07584 [Salpingoeca sp. ATCC 50818] Length = 985 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/216 (14%), Positives = 71/216 (32%), Gaps = 18/216 (8%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 L S T ++KY +D++ +A ++ N +Q ++ + + Sbjct: 223 ICLVIDVSGSMDRHATQRNKYNQLEKFEQTYLDIVKHAARSIANFLQN-----EDNYLSV 277 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 + N + ++ + L P TN + + L ++ Sbjct: 278 VAFSENASVTLPMERMTEAGVSRATDAVGALQPCSCTNLGDGVLRGMQLLLKGTDT---- 333 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR-NAGMKIYSVAVSAPPEGQDLL 357 T+ + ++ +TDGE + + ++ L + + + + + LL Sbjct: 334 ---TKAQPVLMVLTDGEPNEGNDARDVLRSFR--NTYKHARRFIVNTFGFGFEQIDSPLL 388 Query: 358 -RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 G + V DS + +F I + Q V Sbjct: 389 SELAALGGGTYAFVPDSSFVGTAF--INATVAAQLV 422 >gi|320352592|ref|YP_004193931.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] gi|320121094|gb|ADW16640.1| von Willebrand factor type A [Desulfobulbus propionicus DSM 2032] Length = 798 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 64/205 (31%), Gaps = 12/205 (5%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + A ++I+ L +A N V+ + + + Sbjct: 323 THDTVRVVLVLDESGSMNAETPKRIERLKVAAKNFVSLAENGTELGIVSYASDAAV--AS 380 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 G PL N + ++ L P TN + A + + NT Sbjct: 381 GRTEVAIAPLGANRAAWNNAIDGLGPSTRTNIGAGLQKARDLITAAGGVTANT------- 433 Query: 306 KFVIFITDGENSGASAYQN-TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC-TDS 363 +++ ++DG N+ + N + + G+ +Y + + T + Sbjct: 434 -YIVLMSDGLNNEPAPQANADADLNGKIAMLLADGIPVYVTCTGSDLGLASQCSEIGTGT 492 Query: 364 SGQFFAVNDSRELLESFDKITDKIQ 388 G + DS L E+F ++I Sbjct: 493 GGHYVDSADSARLPEAFADFHERIV 517 >gi|327261941|ref|XP_003215785.1| PREDICTED: hypothetical protein LOC100567114 [Anolis carolinensis] Length = 1225 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 21/143 (14%), Positives = 51/143 (35%), Gaps = 9/143 (6%) Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 N+ + K ++K+ T+ A+ A + + + R+ Sbjct: 327 PSQSIQATPQNIKKAKDYVSKMEADGWTDINAALLAAASVFNHSSPMAGKIMRDQRIP-L 385 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ- 366 +IF+TDGE + + + L + + ++ +A LLR+ + + Sbjct: 386 IIFLTDGEPTSGVTTGSRI--LSNAQQALKGTISLFGLAFG-DDADYGLLRRLSLENRGV 442 Query: 367 ----FFAVNDSRELLESFDKITD 385 + + + +L +D+I Sbjct: 443 ARRIYEDADATLQLKGFYDEIAS 465 >gi|254443293|ref|ZP_05056769.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198257601|gb|EDY81909.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 632 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 18/168 (10%), Positives = 56/168 (33%), Gaps = 14/168 (8%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + + I + I + N +++ + ++++ + +T + Y+ Sbjct: 171 GSDDQLSIVLYGSSTHIHLEPTKTSTENRDQIIASIDRIQSHGSTAMEAGLELGYQVARQ 230 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 ++ V+ TD + + + + E + + + ++ V Sbjct: 231 SADAFVGKT-------RVMLFTDERPNVGR--TDATGFMAMAESGSKSDIGLTTIGVGV- 280 Query: 351 PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 G +L K + G F +D + +F ++ + +A + Sbjct: 281 HFGAELAEKISSVRGGNLFFFDDDESMETTFR---KELDTMVLELAYD 325 >gi|26328325|dbj|BAC27903.1| unnamed protein product [Mus musculus] Length = 902 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 22/150 (14%), Positives = 61/150 (40%), Gaps = 25/150 (16%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S++ ++ + L + T+ + ++ + + + ++ +T Sbjct: 364 ITSSSDYQKITANLPQ-QASGGTSICHGLQAGFQAITSS--------DQSTSGSEIVLLT 414 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 DGE++G + E + +G I+++A+ + L +D + G F N Sbjct: 415 DGEDNGIRSCF---------EAVSRSGAIIHTIALGPSAARE--LETLSDMTGGLRFYAN 463 Query: 372 -DSRELLESFDKITD---KIQEQSVRIAPN 397 D L+++F +I+ + +Q++++ Sbjct: 464 KDLNSLIDAFSRISSTSGSVSQQALQLESK 493 >gi|118401451|ref|XP_001033046.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89287392|gb|EAR85383.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 680 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 18/139 (12%), Positives = 49/139 (35%), Gaps = 22/139 (15%) Query: 257 NNLNEVK--SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 ++ N++ ++ KL +TN + L K S + + +TDG Sbjct: 112 DDANKILAIDKIEKLEASGSTNLQHGIQVGLNILSKSK--------SQNRNQAMYVLTDG 163 Query: 315 ENSGASAYQNTLNTLQICEYMRNAG----MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV 370 + + N +Q + + I + + + + L + +G + + Sbjct: 164 QP-------DDRNVMQFLKKYKKDNPQLRCTISTFGFGSSCDSELLDEIAREYNGMYSFI 216 Query: 371 NDSRELLESF-DKITDKIQ 388 D+ + +F + + + + Sbjct: 217 PDATLIATAFANALANTLT 235 >gi|327266510|ref|XP_003218048.1| PREDICTED: complement C2-like [Anolis carolinensis] Length = 735 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 21/117 (17%), Positives = 49/117 (41%), Gaps = 10/117 (8%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ--- 330 TNTY A+ YR + + ++ +++ +I +TDG+++ ++ + +++ Sbjct: 341 GTNTYAALDAIYRMMI--NDRANVLEKWDKVRHAIILLTDGKSNLGRPPKDAVRSIEGLV 398 Query: 331 -ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSG--QFFAVNDSRELLESFDKI 383 + E ++ + IY V + + G F + L E+F+ I Sbjct: 399 NVGENRKDY-LDIYVFGVGNLDVDWSAMNEIASKKPGEKHAFKLEKLETLKEAFEDI 454 >gi|157694070|ref|YP_001488532.1| hypothetical protein BPUM_3319 [Bacillus pumilus SAFR-032] gi|157682828|gb|ABV63972.1| hypothetical protein YwmC [Bacillus pumilus SAFR-032] Length = 233 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 19/129 (14%), Positives = 45/129 (34%), Gaps = 18/129 (13%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 ++ LN L P T A+ HA + + + K V ITDGE Sbjct: 113 YEKESFENSLNGLGPNGWTPIARALEHAKQ---------TDEQLNNGTKHIVYLITDGEE 163 Query: 317 SGASAYQNTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + +++ + + N+ + + + + L++ + G ++ + Sbjct: 164 TCGG------DPVKVAKELHNSKGSTVVNVIGLDFNDGYEGQLKQVAKAGKGHYYQASTG 217 Query: 374 RELLESFDK 382 +E+ Sbjct: 218 KEMGSILSA 226 >gi|51244491|ref|YP_064375.1| hypothetical protein DP0639 [Desulfotalea psychrophila LSv54] gi|50875528|emb|CAG35368.1| hypothetical membrane protein (BatB) [Desulfotalea psychrophila LSv54] Length = 566 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 56/170 (32%), Gaps = 48/170 (28%) Query: 254 PLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + L L+ P TN + A + + + K +I Sbjct: 150 PLTLDYQAFTDSLKALDTKIIPRRGTNIAKVIALAEKTVADSSN-----------HKILI 198 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------- 352 +TDGEN + L+ + + G+ IY++ V Sbjct: 199 ILTDGENLQG-------DVLKAADLAKKNGLTIYTIGVGTAAGELIPGGPGGAFIRDSSG 251 Query: 353 -------GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ L++ + +G N+++ L + + I ++ +R Sbjct: 252 KYVKSKLDEETLQEIAEKTGGISVLLGNNNQGLKKIYTDKLRFIPKKELR 301 >gi|294054129|ref|YP_003547787.1| hypothetical protein Caka_0592 [Coraliomargarita akajimensis DSM 45221] gi|293613462|gb|ADE53617.1| conserved hypothetical protein [Coraliomargarita akajimensis DSM 45221] Length = 339 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 36/227 (15%), Positives = 71/227 (31%), Gaps = 25/227 (11%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 Y L + S + + K +E A +V E Sbjct: 74 YSLEVNEGIAIQMLVDVSSSMDMSVKNFDGKSTTRMEVAKEMVERFIAGDGEDLQGRPHD 133 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-----ENTNTYPAMHHAYRELYNEKE 293 + +PL+ + + + L + T A+ A L N +E Sbjct: 134 LIGLITFARYADTRSPLTFGHDALLQIVRHLTIQERPNEDGTAYGDALALAAARLKNPQE 193 Query: 294 SSHNTIGSTRL----KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 H + K +I +TDGEN+ S ++ + KIY++++ Sbjct: 194 LRHGKRPDAQAEAIESKVIILLTDGENNSGSH-----LPIEAAGLAKAWDCKIYAISLGE 248 Query: 350 PPEGQDLLRKCT-----------DSSGQFFAVNDSRELLESFDKITD 385 + ++ L + ++ G F +D LL +++I Sbjct: 249 SLDAENPLDALSPAERVLEHISIETGGVFRQAHDFESLLSVYEEIDR 295 >gi|148680076|gb|EDL12023.1| mCG3350, isoform CRA_c [Mus musculus] Length = 907 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 22/150 (14%), Positives = 61/150 (40%), Gaps = 25/150 (16%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S++ ++ + L + T+ + ++ + + + ++ +T Sbjct: 369 ITSSSDYQKITANLPQ-QASGGTSICHGLQAGFQAITSS--------DQSTSGSEIVLLT 419 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 DGE++G + E + +G I+++A+ + L +D + G F N Sbjct: 420 DGEDNGIRSCF---------EAVSRSGAIIHTIALGPSAARE--LETLSDMTGGLRFYAN 468 Query: 372 -DSRELLESFDKITD---KIQEQSVRIAPN 397 D L+++F +I+ + +Q++++ Sbjct: 469 KDLNSLIDAFSRISSTSGSVSQQALQLESK 498 >gi|156742722|ref|YP_001432851.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156234050|gb|ABU58833.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 452 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/273 (12%), Positives = 83/273 (30%), Gaps = 34/273 (12%) Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + ++ + ++ +V DVSRSM + + L ++ Sbjct: 33 VQASVTGAAPSRPVNWALVADVSRSMRIPIV------DETQFRSLLRTGSAQEMLVDGVP 86 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 P D + +V ++ ++ + R+ + + + Sbjct: 87 VWQLSGSVPQEVR---DTASSALDYVVRALHTIVERLDHHD-RLALVVFADHALLLIPGM 142 Query: 255 LSNNLNEVKSRLN---KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + + + + LN + TN + A ++ ++ +I + Sbjct: 143 VGADRVTLVRAIERLPGLNLGDGTNLADGIALALNQIRANRDGRCADR--------IILL 194 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAV 370 TDG +++ L + + + + I ++ + DLL D S G + Sbjct: 195 TDG------FTRDSAACLALADQAADEHIAITTIGLG-GEFQDDLLTAIADRSGGHALFL 247 Query: 371 NDSRELLESFDKI-----TDKIQEQSVRIAPNR 398 + + + ++ IAP R Sbjct: 248 KRVSAIPRAVSAELETVRAAAVSAVTIAIAPQR 280 >gi|315122409|ref|YP_004062898.1| hypothetical protein CKC_03305 [Candidatus Liberibacter solanacearum CLso-ZC1] gi|313495811|gb|ADR52410.1| hypothetical protein CKC_03305 [Candidatus Liberibacter solanacearum CLso-ZC1] Length = 411 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 49/400 (12%), Positives = 116/400 (29%), Gaps = 39/400 (9%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 A+++S + D ++ +RN +QS++D A+ + ++ ++ ++ Sbjct: 27 AVLLSSFLTIMDIMRDYTDMIRVRNMLQSSIDYAL----HNNPNELSVGTIKQREMLIKK 82 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL-K 121 + + K E I ++ ++IT+ P Q+ + I ++L L Sbjct: 83 RIGYFLDSNYKGTLLTEEQIKLIVNQSTVSITERSFYPQQFHINIELHKNIQLKSLILHM 142 Query: 122 GLIPSALTNLSLRSTGIIERSSE----NLAISICMVLDVSRSMEDLYLQKHNDNNNMT-- 175 + P N+S R + + +++ + + + + Q ++ T Sbjct: 143 AMNPKKDFNISQRKSSLYKKNVALMVVPFTWTGEWIPPSLFTTQFTVSQDLLPSDLKTEH 202 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA-NRKIDVLIESAGNLVNSIQKAIQEKKNL 234 K + F + + AP I E + Sbjct: 203 FKKTEYFNKRNQFFKMFLSKIKENNLCIAPYHYSAIVYWSEGIFSYKLPFSTTFLYSFRD 262 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA-----YRELY 289 N L L+ T P + + Sbjct: 263 IYVKQYSTIWDMKPSNYILDLFAGAE--------LHSNRLTPADPCFRRGVIQKKFMLII 314 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + S K T + Q + +YS+ +S Sbjct: 315 AAGNQISDRKNSAEYFKMKHGCTLMGKNMGKNPQEE--------------ITVYSLGISP 360 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 P+ + L +CT +++ + +++ D++ I Sbjct: 361 DPDTKRDLIQCTRHPDRYYEIQSYKDIAPVIDRLERNISS 400 >gi|268610218|ref|ZP_06143945.1| von Willebrand factor, type A [Ruminococcus flavefaciens FD-1] Length = 565 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 40/370 (10%), Positives = 98/370 (26%), Gaps = 39/370 (10%) Query: 12 FITYAIDLAHIMYIRNQMQSALDA-AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKK 70 +D A I + +D AV+S +P + + Sbjct: 190 VAGVLVDSATYSDIEQK-YGTVDINAVVSATLDGTLTTGYTNPLVSTTGLNFLASVLYNF 248 Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 + G + + I + A+ + +T Sbjct: 249 DSSDPLGNKAVDGFKSFQDNIPFVA----YNTLQMRTAAENGTFGCMMMEYQSYIQDVTL 304 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSR-SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 +N ++I + D ++E +D ++ Y Sbjct: 305 SRNYKFIPFGIRHDNPLVAIGKLSDTEMKTLEMFADFCKSDKAKKLADSYGFNQMNDYKG 364 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIE-SAGNLVNSIQKAIQEKKNLSVRIGTIA--YNIG 246 + + + + +A ++++ N + Y Sbjct: 365 DNIKVNGNSWTQMQKLWKTNKNSGKPIAAVFVLDTSGSMSGAPLNSLKASLRNSIKYINS 424 Query: 247 IVGNQCTPLSNNLN--------------EVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 S+N+N ++ L NT T+ A+ A L + Sbjct: 425 SNYIGVVSYSSNVNVDLELAKFDLNQQAYFMGAVDSLTASGNTATFSALSQAMIMLRD-- 482 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 T + + V ++DG+++ S + + + + + IY++ +A Sbjct: 483 ----FTKDNPNVSPMVFLLSDGQSNSGSEFSDIDGAIATAQ------IPIYTIGYNANLN 532 Query: 353 GQDLLRKCTD 362 L+ ++ Sbjct: 533 ---ELKAISE 539 >gi|185132132|ref|NP_001118067.1| complement factor B [Oncorhynchus mykiss] gi|3982897|gb|AAC83699.1| complement factor Bf-1 [Oncorhynchus mykiss gairdneri] Length = 743 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 30/231 (12%), Positives = 67/231 (29%), Gaps = 22/231 (9%) Query: 172 NNMTSNKYLLPPP-PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + K + + A + + L+ + Sbjct: 209 MAAPVDDTDQEGKKITIDKGGKLNIYIAMDISDSIAEEDFNSARNAVKKLITKVSSFSVS 268 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE----VKSRLNKLNPYE----NTNTYPAMH 282 + + S + + V + LN TN A Sbjct: 269 PNYE----IIFFASDVLEVVNIIDFSGDKRKPLVDVLAELNNFKYDARDNVGTNLNLAFK 324 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC-----EYMRN 337 + +K+ + +IF TDG + + +NT+ ++ + R Sbjct: 325 TILERMAIQKKRNEMLFMEI--HHVLIFFTDGAYNMGGSPENTMAKIRESVYMNNKTKRE 382 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 + +Y V + +D++ T +G +F + + +L +FD I D+ Sbjct: 383 KYLDVYVFGVGSDIFDEDIMPLVTKRNGERHYFKLKNVIDLERTFDDIIDE 433 >gi|302869502|ref|YP_003838139.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315504036|ref|YP_004082923.1| von willebrand factor type a [Micromonospora sp. L5] gi|302572361|gb|ADL48563.1| von Willebrand factor type A [Micromonospora aurantiaca ATCC 27029] gi|315410655|gb|ADU08772.1| von Willebrand factor type A [Micromonospora sp. L5] Length = 572 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 46/375 (12%), Positives = 105/375 (28%), Gaps = 28/375 (7%) Query: 28 QMQSALDAAVLSGCASIVS-DRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIA 86 + Q A+ A+ + + S + + + + I L + I N+ Sbjct: 201 KAQEAMVGALRALATNRSSLRQDLLARFPRSSDPTAIANGLGAAALSEEDVIAYNSTKPP 260 Query: 87 QKAQI------------NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + + ++ +E+ F L AL Sbjct: 261 IRLAALYLEPAPIPLDYPFAVLPGIEPTKASAARVLFEVLRTPGFKDRLASQALRAPDGN 320 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + + + V S + + + + + + + Sbjct: 321 WGRGFQAPTGAPSPANGGASQVPPSGQGGAADLDPGAISTATTTWSVATQSGRMLCVIDV 380 Query: 195 TKS-KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 + S K A A + V + +A + + + Sbjct: 381 SGSMKKPVATANGASREQVTVAAASQGLGLFDDS-WSIGLWTFSTNLQGSQDWSELVGIK 439 Query: 254 PLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 PLS+N ++ L + P NT Y M AY+++ + E ++ T Sbjct: 440 PLSSNRGSLQRGLASIKPSSGNTGLYDTMLAAYKKVQQDWEPGKVNS--------IVLFT 491 Query: 313 DGENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV 370 DG+N Q+ +++ +++ + + +L + G F Sbjct: 492 DGKNEDDDGISQKALLDQL-NKLKDDEQPVQVIIIGIGTEVNRAELESITKVTGGGAFVT 550 Query: 371 NDSRELLESF-DKIT 384 D ++ E F I Sbjct: 551 TDPSKIGEIFLRAIA 565 >gi|224051388|ref|XP_002199708.1| PREDICTED: coagulation factor C homolog, cochlin [Taeniopygia guttata] Length = 417 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 22/164 (13%), Positives = 50/164 (30%), Gaps = 20/164 (12%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYN 290 ++ ++ + + +V S ++ + T T A+ R ++ Sbjct: 267 SDIGSKVAAVQFTYNQRKEFGFTDHVTKEKVLSAIHNIQYMSGGTATGDAISFTTRTVFG 326 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + N F+I +TDG++ G+ ++SV V+ Sbjct: 327 PVKDGPNK-------NFLIVLTDGQSYDDVTGPAAAAKKA--------GITVFSVGVAWA 371 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSV 392 P L++ F + L + I I + + Sbjct: 372 PLDD--LKEMASEPRESHTFFTREFTGLEQMVPDIIRGICKDYL 413 >gi|89100218|ref|ZP_01173085.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] gi|89085068|gb|EAR64202.1| possible D-amino acid dehydrogenase, large subunit [Bacillus sp. NRRL B-14911] Length = 456 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 47/130 (36%), Gaps = 17/130 (13%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 P N E+ S L+K NP T ++ A ++L K V + Sbjct: 221 YPPQPYNEGELNSALDKFNPAGWTPLAQSLIEAQKDL--------AQFEGQDNKNMVYVV 272 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCT-DSSGQFF 368 +DG + N ++ + ++++G+ + + + Q L + + G + Sbjct: 273 SDGIET------CDGNPVEAAKDLKDSGVAPVVNIIGFDVKGKDQQQLEEVAKAAGGTYQ 326 Query: 369 AVNDSRELLE 378 V ++L Sbjct: 327 NVTSQQQLQN 336 >gi|163786709|ref|ZP_02181157.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] gi|159878569|gb|EDP72625.1| aerotolerance-related membrane protein [Flavobacteriales bacterium ALC-1] Length = 345 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 33/215 (15%), Positives = 76/215 (35%), Gaps = 60/215 (27%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++D + ++NS+ R+G IAY P++ + K L Sbjct: 110 NRLDKSKQLVTQIINSLAS---------DRVGIIAYAGKAFPQ--LPITTDYASAKMFLQ 158 Query: 268 KLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +N T A+ A +E++++ + +I I+DGE+ G Sbjct: 159 NMNTDMLSSQGTAISEAIELAKTYYDDEEQTN----------RVLIIISDGEDHGG---- 204 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSA------------------PPE---------GQDL 356 + I E G++I +V V + Sbjct: 205 ---EAVDIAEEANEEGIRILTVGVGDVKGGPIPIKRNGVVLNYKKDNKGETVITRLDETT 261 Query: 357 LRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L++ +++G + +++ E++++ + DK+ ++ Sbjct: 262 LKEIAEETNGIYINGSNTNEVVDAIKDVLDKMDKK 296 >gi|304382531|ref|ZP_07365026.1| aerotolerance protein BatB [Prevotella marshii DSM 16973] gi|304336362|gb|EFM02603.1| aerotolerance protein BatB [Prevotella marshii DSM 16973] Length = 340 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++++ K L ++P + T+ A+ A + + + K + Sbjct: 144 LPITSDFVSAKMFLQNIDPSLIATQGTDIAKAIDMAMKSFTQQ----------ENVGKAI 193 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP------------------ 350 I ITDGE+ L+ + + G ++ + V Sbjct: 194 IVITDGEDHEG-------GALEAAKTAKAKGYNVFILGVGTAKGAPIPTGDGSYLKDNTG 246 Query: 351 -----PEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQE 389 +D+ ++ + G + V+++ E + + K+Q+ Sbjct: 247 QTVITALNEDMCKEIAAAGSGTYIHVDNTNEAQKILNNELTKMQK 291 >gi|156390493|ref|XP_001635305.1| predicted protein [Nematostella vectensis] gi|156222397|gb|EDO43242.1| predicted protein [Nematostella vectensis] Length = 229 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 22/174 (12%), Positives = 55/174 (31%), Gaps = 9/174 (5%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 I + ++ +I + + V + ++++ Sbjct: 22 VDCTGSMGEYIRQAQKHVISISETISRTAYNVRLALVEYRDHPPQDKSFVTRVHDFTSDV 81 Query: 260 NEVKSRLNKLNPYENTN----TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 E+K ++K++ + A+ A + Y E + + + + F DG Sbjct: 82 KEMKVWVDKMSASGGGDCPESVADAIFKACKLGYREDATKMCVLIADAPPHGLGFAHDGF 141 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS--APPEGQDLLRKCTDSSGQF 367 +G + + L C M G+ +Y++ L+ + GQ+ Sbjct: 142 PNG---CPDNHDPLASCHVMAEKGITLYTIGCEPSVDSFRDFLVGMAAITGGQY 192 >gi|86131263|ref|ZP_01049862.1| aerotolerance-related exported protein BatB [Dokdonia donghaensis MED134] gi|85818674|gb|EAQ39834.1| aerotolerance-related exported protein BatB [Dokdonia donghaensis MED134] Length = 344 Score = 55.3 bits (131), Expect = 2e-05, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 70/196 (35%), Gaps = 42/196 (21%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +I+ + ++NS+ RIG IAY P++ + + K L+ Sbjct: 110 NRIEKSKQLVTQIINSLGS---------DRIGIIAYAGSAYPQ--LPITTDYSSAKLFLS 158 Query: 268 KLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++N T A+ A +E++++ + + I+DGE+ A Sbjct: 159 QMNTDMLSSQGTAIGEAIELAKTYYNDEEQTN----------RVLFIISDGEDHVGEASS 208 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK--CTDSSGQFFAVNDSRE----LL 377 + E G++I+++ V G L++ S + + L Sbjct: 209 -------LAEQANKEGIRIFTIGVGKTEGGPIPLKRNGIVQS---YKKDQNGETVITRLD 258 Query: 378 ES-FDKITDKIQEQSV 392 E+ I ++ + + Sbjct: 259 EATLKAIAEQANGEYI 274 >gi|327441394|dbj|BAK17759.1| uncharacterized protein containing a von Willebrand factor type A domain [Solibacillus silvestris StLB046] Length = 961 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 71/222 (31%), Gaps = 23/222 (10%) Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 N + N + + + T + D+ + ++ Sbjct: 15 LSINTPSINTSSKTVEAAYDYSNLSCTNEQGRDIVFVIQDTPDIQSHDPDQSRVTEVLSL 74 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN----KLNPYENTNTYPAMHHA 284 + + R G + +N + L+NN+ + KS+LN ++PY + + A Sbjct: 75 MDDASSKDRFGFVGFNKEVTKE--LALTNNIVQAKSKLNEFGKNISPYMANDLSKGLEKA 132 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 EL ST K ++ +T G + ++ + I++ Sbjct: 133 VDELT---------KKSTSNDKVIVIMTVGN------SIYNEVSKKLAAKAYEEDITIHT 177 Query: 345 VAVSAP-PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 ++ P L + + G + ++ L + K++ Sbjct: 178 ISFGDPLYADAPFLTEIAKLTGGNYTHSPNAAFLKDVLSKLS 219 >gi|149197817|ref|ZP_01874866.1| von Willebrand factor type A domain protein [Lentisphaera araneosa HTCC2155] gi|149139038|gb|EDM27442.1| von Willebrand factor type A domain protein [Lentisphaera araneosa HTCC2155] Length = 1078 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 18/141 (12%), Positives = 44/141 (31%), Gaps = 19/141 (13%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N +++++ + + N + AY+ S VI ++DG Sbjct: 793 SNHSQLETAVKNIEAGTVANLSVGIEEAYKLAAQNFRSGAVNR--------VILLSDGIA 844 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRE 375 S + L+ R G+ + V + L + G ++ + + Sbjct: 845 SLGEKEAQEV--LKTVSQYRKQGIGNTVIGVGSEDYDDSFLETLANKGDGVYYFGDSKEQ 902 Query: 376 LLES--------FDKITDKIQ 388 + + F I ++ Sbjct: 903 MNDILVNNFEASFKTIARDVK 923 >gi|116332274|ref|YP_801992.1| hypothetical protein LBJ_2829 [Leptospira borgpetersenii serovar Hardjo-bovis JB197] gi|116125963|gb|ABJ77234.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar Hardjo-bovis JB197] Length = 379 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 63/198 (31%), Gaps = 25/198 (12%) Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S +KI + + ++++ + Sbjct: 70 IVDASGSMNEYLGIYQKIHLAKKHVSRYISTLPTE----TEIGFIAYGNRIPGCSSSRLY 125 Query: 253 TPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 PL N K+RL L P T ++ A + K+ + +I I Sbjct: 126 EPLQRENHGTFKNRLFSLTPSGATPLAESIRIAGNLISQRKKETE-----------IILI 174 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTDSS-GQFF 368 TDG S + + + ++ G+ K + + + P+ + ++ + G++F Sbjct: 175 TDGVESCYG------DPKKELQALKQQGIYFKFHILGLGLKPDEERKMKILAEEGNGKYF 228 Query: 369 AVNDSRELLESFDKITDK 386 + D + D + ++ Sbjct: 229 GIEDDSSFYTALDSLKNQ 246 >gi|116327069|ref|YP_796789.1| hypothetical protein LBL_0242 [Leptospira borgpetersenii serovar Hardjo-bovis L550] gi|116119813|gb|ABJ77856.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar Hardjo-bovis L550] Length = 379 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/198 (13%), Positives = 63/198 (31%), Gaps = 25/198 (12%) Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S +KI + + ++++ + Sbjct: 70 IVDASGSMNEYLGIYQKIHLAKKHVSRYISTLPTE----TEIGFIAYGNRIPGCSSSRLY 125 Query: 253 TPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 PL N K+RL L P T ++ A + K+ + +I I Sbjct: 126 EPLQRENHGTFKNRLFSLTPSGATPLAESIRIAGNLISQRKKETE-----------IILI 174 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTDSS-GQFF 368 TDG S + + + ++ G+ K + + + P+ + ++ + G++F Sbjct: 175 TDGVESCYG------DPKKELQALKQQGIYFKFHILGLGLKPDEERKMKILAEEGNGKYF 228 Query: 369 AVNDSRELLESFDKITDK 386 + D + D + ++ Sbjct: 229 GIEDDSSFYTALDSLKNQ 246 >gi|221369290|ref|YP_002520386.1| von Willebrand factor, type A precursor [Rhodobacter sphaeroides KD131] gi|221162342|gb|ACM03313.1| von Willebrand factor, type A precursor [Rhodobacter sphaeroides KD131] Length = 328 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 19/139 (13%), Positives = 49/139 (35%), Gaps = 29/139 (20%) Query: 254 PLSNNLNEVKSRL--NKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ +L V + + +T + A + + +S + ++ Sbjct: 148 PLTFDLAAVGRAIEEASIGITGRSTAIADGLGLALKRVTESGAAS----------RVIVL 197 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP----------EGQDLLRKC 360 ++DG+++ ++ Q+ G++I+++A+ LR Sbjct: 198 LSDGQDNAHQ-----IDARQVAGLAARHGVRIHTIALGPDDLETRPAARDAVDTATLRAI 252 Query: 361 TD-SSGQFFAVNDSRELLE 378 + S G+ + V +L Sbjct: 253 AEASGGRSYRVRGMEDLRA 271 >gi|329573764|gb|EGG55354.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX1467] Length = 1103 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 151 AEARMAPANLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 208 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 209 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 324 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 325 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 384 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 385 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 434 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 494 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 DYAPDISDYLAKKAVQISGTVV 516 >gi|296127472|ref|YP_003634724.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] gi|296019288|gb|ADG72525.1| von Willebrand factor type A [Brachyspira murdochii DSM 12563] Length = 338 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 21/165 (12%), Positives = 48/165 (29%), Gaps = 41/165 (24%) Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 P + ++ L+ L+ T A+ A Sbjct: 139 TSFIACPFTQDMETFSYILDNLSTKSVTLQGTRIADALVTAKNTFN----------VDAV 188 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------ 351 KK ++ ITDGE+ G ++ + +++ + +Y++ V Sbjct: 189 SKKSIVLITDGEDHGGYFD-------EVLKELKDMNISVYTIGVGTSQGAAISTDLGVRE 241 Query: 352 ------EGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 + L+ + G+ + + L FD + + Sbjct: 242 KSVISKRDDNTLKLIADSTLGKSYIAENVS-LESIFDDMKKNMDS 285 >gi|327439430|dbj|BAK15795.1| uncharacterized protein containing a von Willebrand factor type A domain [Solibacillus silvestris StLB046] Length = 986 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/181 (14%), Positives = 62/181 (34%), Gaps = 30/181 (16%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 ++ N +++ I++ +N + ++ S ++ L Y Sbjct: 724 ARDAKNYTANKVKQTIKQIGAN--PSHVYRFNNRPNHEA-----TDKADIVSSIDSLLTY 776 Query: 273 EN----TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 +N TN A+ A + K ++ +TDG ++ Sbjct: 777 KNENRSTNIVKALETAIGNFTT----------NQYTSKAIVLVTDGYSNSN-------GL 819 Query: 329 LQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 Q+ + G+ I++V+V + + LL+ + ++G + + L S I Sbjct: 820 EQVLRDAKLKGIAIHTVSVGSYTTVNEKLLKDISSETNGTYQNITSIENLHGSLQAIITT 879 Query: 387 I 387 I Sbjct: 880 I 880 >gi|325661940|ref|ZP_08150560.1| hypothetical protein HMPREF0490_01298 [Lachnospiraceae bacterium 4_1_37FAA] gi|325471792|gb|EGC75010.1| hypothetical protein HMPREF0490_01298 [Lachnospiraceae bacterium 4_1_37FAA] Length = 1321 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 44/354 (12%), Positives = 101/354 (28%), Gaps = 46/354 (12%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP 100 + + K+ + + +T K + + ++ G + Q N + + + Sbjct: 383 AGELNRILSEKEDKEQLKEINTYAKLTVDHIVPIADNSQDIDGQKEETFQENKEERQEDF 442 Query: 101 LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM 160 QY+ ++ Q + + Sbjct: 443 TQYVTDTVNQKAASV------SISGIDTKEFETVRAVVSLEEGIADTEEKFRENVEILDC 496 Query: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL 220 + L S + KI+ L ++ Sbjct: 497 GVEIPDYKVKKLEYDTVNIALCCDNSGSMEGE----------------KIENLKKAVSTF 540 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 V + V IG + + G++ C P ++ +++ + TN Y Sbjct: 541 VGKLA--------DEVNIGIVPFGSGVLEGVCEP-GSSREKLEQSVESFRSDSGTNIYSG 591 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + + L EK++ + ++DG++ + + +I N + Sbjct: 592 VEYTLSMLAKEKDA----------LNIAVIMSDGQD----SIPSEEQLQKITSACENGNI 637 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 +YS+ + A E + L +G + V+DS L + I I + I Sbjct: 638 LLYSMGLGADVESEVLSTYSDAGNGAYVFVSDSNSLYSFYQYIYQ-ISKNRYEI 690 >gi|188583113|ref|YP_001926558.1| hypothetical protein Mpop_3896 [Methylobacterium populi BJ001] gi|179346611|gb|ACB82023.1| conserved hypothetical protein [Methylobacterium populi BJ001] Length = 473 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 20/139 (14%), Positives = 41/139 (29%), Gaps = 21/139 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + S + I+ A ++ R Q+QSA+DA V++G ++ Sbjct: 34 IFGLAASTLIGLVGGGIEYARVLSARTQLQSAVDAGVMAGGNAL---------------- 77 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 K + + KA ++ + S L Sbjct: 78 -----KLVVSSTDSIVGLTTQTIQTEAKAPADVPVTVQVTVAPDKTSVEARAEQVIKLTF 132 Query: 121 KGLIPSALTNLSLRSTGII 139 + A +S R+ + Sbjct: 133 GAFVGMASMPISARAKASV 151 >gi|149178272|ref|ZP_01856865.1| hypothetical protein PM8797T_16765 [Planctomyces maris DSM 8797] gi|148842921|gb|EDL57291.1| hypothetical protein PM8797T_16765 [Planctomyces maris DSM 8797] Length = 169 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/137 (16%), Positives = 43/137 (31%), Gaps = 16/137 (11%) Query: 255 LSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 L +++ L L+P TN P + + L L+ V+ + Sbjct: 27 LCHDVPTATDLLGHLHPVIGGGGTNMAPGLFISREILE-----RPIFPSQIYLRPVVVVL 81 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 +DG + T +I ++ I +VA + L SS F+ Sbjct: 82 SDGLT------SHPAKTSEIATQLKKD-ADIVTVAFG-DDADEPYLISLATSSEHFYHCR 133 Query: 372 DSRELLESFDKITDKIQ 388 +L F + + Sbjct: 134 TGTDLRAFFASVGTTLS 150 >gi|90410254|ref|ZP_01218271.1| hypothetical protein P3TCK_05786 [Photobacterium profundum 3TCK] gi|90329607|gb|EAS45864.1| hypothetical protein P3TCK_05786 [Photobacterium profundum 3TCK] Length = 370 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 65/189 (34%), Gaps = 30/189 (15%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---P 271 ++ ++ + R+G I + P + + + L++ Sbjct: 123 DAVKQVLAQFSQHRDG-----DRLGLILFGDAAYLQA--PFTADHETWLALLDETQVGMA 175 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK--------KFVIFITDGENSGASAYQ 323 ++T+ A+ A + ++ +++ K VI +TDG ++ + Sbjct: 176 GQSTHLGDAIGLAIKVFNDQADAAKQAATQQGSAVVTRPVKEKVVIVLTDGNDTDSLVP- 234 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAP------PEGQDLLRKCTD-SSGQFFAVNDSREL 376 + + + ++I+ +A+ P +++ + + GQ F EL Sbjct: 235 ----PIDAAKVAASRDIRIHMIAMGDPRTVGEQALDMEVIEQVASLTGGQSFQALSPAEL 290 Query: 377 LESFDKITD 385 + KI + Sbjct: 291 TRVYKKIGE 299 >gi|162456414|ref|YP_001618781.1| hypothetical protein sce8131 [Sorangium cellulosum 'So ce 56'] gi|161166996|emb|CAN98301.1| hypothetical protein sce8131 [Sorangium cellulosum 'So ce 56'] Length = 507 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 29/174 (16%), Positives = 65/174 (37%), Gaps = 16/174 (9%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K++ +A V+ + + +SV + + V + Sbjct: 138 KMENARAAAQAFVDRL----PDGDLVSVASFADTAQARVAPTVLGRST--RPAVARAIAA 191 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L P +TN + + A + ST + V+ I+DG+ + + + L Sbjct: 192 LGPDGSTNLFAGLKLAEQHALAAP--------STHAVRRVVLISDGQANIGPSSPDILG- 242 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + G++I S+ V A + + L SSG+ + + ++RE+ ++ Sbjct: 243 -ALAQRGAAHGVQITSIGVGADYDERTLNALAVGSSGRLYHLTEAREMSSVLER 295 >gi|254513911|ref|ZP_05125972.1| von Willebrand factor type A domain protein [gamma proteobacterium NOR5-3] gi|219676154|gb|EED32519.1| von Willebrand factor type A domain protein [gamma proteobacterium NOR5-3] Length = 330 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 21/142 (14%), Positives = 49/142 (34%), Gaps = 25/142 (17%) Query: 254 PLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P ++++ + L + +T A+ A + + + ++ Sbjct: 155 PFTDDIATWLTLLEESEVAMAGPSTALGDAIGLAISLFQASETRN----------RVLVV 204 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCT-DS 363 +TDG ++G+ + + IY+VAV P + L + Sbjct: 205 LTDGNDTGSRVP-----PVDAASIAAANDVTIYTVAVGDPSTIGEEALDLETLDAIASST 259 Query: 364 SGQFFAVNDSRELLESFDKITD 385 G F D+ L +++++I Sbjct: 260 RGASFLALDTLALKDAYEQINR 281 >gi|47218379|emb|CAG01900.1| unnamed protein product [Tetraodon nigroviridis] Length = 683 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 33/311 (10%), Positives = 85/311 (27%), Gaps = 23/311 (7%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 + + G + + D + + A + IPT+ +I + L Sbjct: 103 YYNAKEDFNQTDGKKNRYVPDDFEMDDDFKRLVSYNTTAVH-IPTDIYEGSTIILNELNW 161 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSR----SMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 + E+ ++ + + ++ N N + P Sbjct: 162 TEALEDVFRKNKEEDPSLLWQVFGSATGLARYYPASPWMDISNSANKIDLYDVRRRPWYI 221 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 + S + + + + ++ S ++ ++ V A Sbjct: 222 QGAASPKDMLILVDASGSVSGLTLKLIQISVSKMLETLSDDDYV---NVVYFNDKAKYAS 278 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 N N +K + + TN A+ +L + + K Sbjct: 279 CFENLVQANVRNKRMLKKAVQNITAKGTTNYSGGFELAFEQL------AQMNVSRANCNK 332 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSG 365 ++ TDG A N N ++I++ +V + ++ + G Sbjct: 333 IIMLFTDGGEEKAEEIFKKYNP--------NQEVRIFTFSVGQHNYDKGPIQWMACANKG 384 Query: 366 QFFAVNDSREL 376 ++ + + Sbjct: 385 YYYEIPSIGAI 395 >gi|326789712|ref|YP_004307533.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540476|gb|ADZ82335.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 404 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 25/167 (14%), Positives = 56/167 (33%), Gaps = 20/167 (11%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCT---PLSNNLNEVKSRLNKLN--PYENTNTY 278 ++ + + RI + ++ + VK+++ T Sbjct: 137 ATSSLIDNLEGNRRIAFMTFDDSPILQFDFMEATTKEQKEVVKAKIASYQQNDDGQTGVR 196 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 ++ AY + N S +I I+DG S SA N + Sbjct: 197 DMINEAYELIQN---------NSKNHSGSLIMISDGAPSDDSAS----NIPALVSNYVQN 243 Query: 339 GMKIYSVAVSAPPEG-QDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 + IY++ + + L + + GQ ++ +D+ + +F +I Sbjct: 244 NIPIYTIGMMYGDNSAEQYLIDIANLTGGQHYSTSDTTMIAGAFGQI 290 >gi|145500364|ref|XP_001436165.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124403304|emb|CAK68768.1| unnamed protein product [Paramecium tetraurelia] Length = 604 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 18/136 (13%), Positives = 40/136 (29%), Gaps = 14/136 (10%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N ++K+ T A +++ S + ++DG++ Sbjct: 244 NKTYYSDIISKIYANGGTVIGLGTQMALKQIK--------YRKSVNNVTAIFVLSDGQDE 295 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 A + + I+S + + + + + G F+ VN+ L Sbjct: 296 AAISSLQKQLAYY------KQTLTIHSFGFGSDHDAKLMTKISNLGKGSFYFVNNISLLD 349 Query: 378 ESFDKITDKIQEQSVR 393 E F + V Sbjct: 350 EFFVDALGALTSMVVT 365 >gi|153006798|ref|YP_001381123.1| hypothetical protein Anae109_3961 [Anaeromyxobacter sp. Fw109-5] gi|152030371|gb|ABS28139.1| hypothetical protein Anae109_3961 [Anaeromyxobacter sp. Fw109-5] Length = 381 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 20/141 (14%), Positives = 48/141 (34%), Gaps = 6/141 (4%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRT-IKDPTTKKDQ 59 + AI++ V F+ ++++ H++ +R ++Q+A DA L G + I Sbjct: 16 IVAIVLLVLGGFMALSLNVGHLLSVRGELQNASDAGALGGAWDLDCTVDGIAKARVAALD 75 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAE-----SKAQYEIP 114 +T + L + + + ++ I+ +NP + + Sbjct: 76 YATRHSTDYQPLLPSDTMVETGYWSVDKERFWPISDPGSNPELVSLTNAVRVRSTRVDAN 135 Query: 115 TENLFLKGLIPSALTNLSLRS 135 +F + T Sbjct: 136 AAPVFFPVFLGGNNTANVGAG 156 >gi|332221823|ref|XP_003260064.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 1 [Nomascus leucogenys] Length = 921 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 55/133 (41%), Gaps = 24/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + +A++ + + S V+ +TDGE++ AS+ Sbjct: 379 ALGGTSICSGIKYAFQVI--------GELHSQLDGSEVVLLTDGEDNTASSC-------- 422 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKI---T 384 + ++ +G ++ +A+ + + ++ + G F +D + L+++F + Sbjct: 423 -IDEVKQSGAIVHFIALGS-AADEAVIEMSNITGGSHFYASDEAQNNGLIDAFGALTSGN 480 Query: 385 DKIQEQSVRIAPN 397 + ++S+++ Sbjct: 481 TDLSQKSLQLESK 493 >gi|73972306|ref|XP_860269.1| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 3 [Canis familiaris] Length = 549 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/233 (13%), Positives = 73/233 (31%), Gaps = 20/233 (8%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV-----LIESAGNLVNSIQKAIQ 229 + +P +K + + S + I + + + Sbjct: 246 DAEDGHIPGEQQKRKIVLDPSGSMNIYLVLDGSDSIGAGNFTRAKNCLRDFIEKVASYGV 305 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRE 287 + K V T V ++ + ++ + ++ ++++ + TNT A+ Y Sbjct: 306 KPKYGLVTYATNPKVWVRVRDKNSSDADWVTKILNQVSYEDHMLKSGTNTKKALEAVYSM 365 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGA-------SAYQNTLNTLQICEYMRNAGM 340 + R + +I +TDG + +N L+ + + R + Sbjct: 366 MN--WPGETPPADWNRTRHVIILMTDGLYNMGGDPVSVIHNIRNFLDIGRDHKNPREDYL 423 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDKIQEQ 390 +Y V P Q+ + F V D L + F ++ D+ + Sbjct: 424 DVYVFGVG-PLVNQENINALASKKDKEQHVFKVKDMENLEDVFIQMLDETRTL 475 >gi|73972310|ref|XP_849985.1| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 2 [Canis familiaris] Length = 1112 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/233 (13%), Positives = 73/233 (31%), Gaps = 20/233 (8%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV-----LIESAGNLVNSIQKAIQ 229 + +P +K + + S + I + + + Sbjct: 591 DAEDGHIPGEQQKRKIVLDPSGSMNIYLVLDGSDSIGAGNFTRAKNCLRDFIEKVASYGV 650 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRE 287 + K V T V ++ + ++ + ++ ++++ + TNT A+ Y Sbjct: 651 KPKYGLVTYATNPKVWVRVRDKNSSDADWVTKILNQVSYEDHMLKSGTNTKKALEAVYSM 710 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGA-------SAYQNTLNTLQICEYMRNAGM 340 + R + +I +TDG + +N L+ + + R + Sbjct: 711 MN--WPGETPPADWNRTRHVIILMTDGLYNMGGDPVSVIHNIRNFLDIGRDHKNPREDYL 768 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDKIQEQ 390 +Y V P Q+ + F V D L + F ++ D+ + Sbjct: 769 DVYVFGVG-PLVNQENINALASKKDKEQHVFKVKDMENLEDVFIQMLDETRTL 820 >gi|73972308|ref|XP_532086.2| PREDICTED: similar to Complement factor B precursor (C3/C5 convertase) (Properdin factor B) (Glycine-rich beta glycoprotein) (GBG) (PBF2) isoform 1 [Canis familiaris] Length = 767 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/233 (13%), Positives = 73/233 (31%), Gaps = 20/233 (8%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV-----LIESAGNLVNSIQKAIQ 229 + +P +K + + S + I + + + Sbjct: 246 DAEDGHIPGEQQKRKIVLDPSGSMNIYLVLDGSDSIGAGNFTRAKNCLRDFIEKVASYGV 305 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRE 287 + K V T V ++ + ++ + ++ ++++ + TNT A+ Y Sbjct: 306 KPKYGLVTYATNPKVWVRVRDKNSSDADWVTKILNQVSYEDHMLKSGTNTKKALEAVYSM 365 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGA-------SAYQNTLNTLQICEYMRNAGM 340 + R + +I +TDG + +N L+ + + R + Sbjct: 366 MN--WPGETPPADWNRTRHVIILMTDGLYNMGGDPVSVIHNIRNFLDIGRDHKNPREDYL 423 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDKIQEQ 390 +Y V P Q+ + F V D L + F ++ D+ + Sbjct: 424 DVYVFGVG-PLVNQENINALASKKDKEQHVFKVKDMENLEDVFIQMLDETRTL 475 >gi|319652226|ref|ZP_08006344.1| hypothetical protein HMPREF1013_02957 [Bacillus sp. 2_A_57_CT2] gi|317396049|gb|EFV76769.1| hypothetical protein HMPREF1013_02957 [Bacillus sp. 2_A_57_CT2] Length = 463 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 19/181 (10%), Positives = 59/181 (32%), Gaps = 21/181 (11%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI----GIVGNQCTPLS-NN 258 A K+D+ + + +I ++ + + G+ A + P+ + Sbjct: 171 AGGKMKMDIAKSAVKSFAQTIGQSSEVSLVVYGHKGSEADADKEISCSGVEEVYPMGKYS 230 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 E ++ T A+ A + + + S + ++DG + Sbjct: 231 KKEFHEAVDSFESKGWTPLAGAIQKAAE-MSSGYDGSTT----------IYIVSDGAETC 279 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELL 377 + + + + + + I + + ++ L+ ++ G++ ++ EL Sbjct: 280 DGDPVS--ASKNLVKNNSSNSVNI--IGFGVDGKAENQLKAVAEAGNGEYLKADNPDELK 335 Query: 378 E 378 Sbjct: 336 N 336 >gi|149481218|ref|XP_001506719.1| PREDICTED: similar to matrilin 4, partial [Ornithorhynchus anatinus] Length = 312 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 24/179 (13%), Positives = 62/179 (34%), Gaps = 23/179 (12%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + ++ +++ G LV S+ + R+G + ++ + E Sbjct: 142 SKSVGAQQFELVKRWVGELVGSL-----DVSPAGTRVGLVQFSSRVRTEFPLGRHGTKAE 196 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++ + + P + T T A+ H ++E E + ++ + +TDG + Sbjct: 197 AEAAVRAVTPMDKGTMTGLALRHLVERGFSEAEGAR-----PGSRRVGLLVTDGRSQDDV 251 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELL 377 + ++ G+ +++V V ++ LR+ D L Sbjct: 252 SPW--------AARAKDRGIVMFAVGVGKAV--EEELREIASDPVERHVSYAADFGALT 300 >gi|228931399|ref|ZP_04094324.1| hypothetical protein bthur0010_60390 [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] gi|228828205|gb|EEM73915.1| hypothetical protein bthur0010_60390 [Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1] Length = 448 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 65/220 (29%), Gaps = 22/220 (10%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 Y + P K S K++ ++ N ++ I + Sbjct: 139 YSIKPKEKSLNVEILLDASGSMAGKVNGQVKMEAAKKAIYNYLDKIPDNSNVMLRVYGHK 198 Query: 239 GTIAYNI----GIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 G+ N PL N + + L+K P T A+ + Sbjct: 199 GSNNENDKSLSCGSSEVMYPLQPYNKEQFNAALSKFGPKGWTPLASAIESVNDDFKE--- 255 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPP 351 V ++DGE + + + + + + + Sbjct: 256 -----YTGEENLNVVYIVSDGEETCGG------EPVNAAKNLNQSSTHAVVNIIGFDVKN 304 Query: 352 EGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 Q L+ ++ G + V+ + EL ++ +K +K+ +Q Sbjct: 305 SEQLQLKNTAEAGKGNYATVSTADELYQTLNKEYEKLYKQ 344 >gi|149026142|gb|EDL82385.1| rCG29121 [Rattus norvegicus] Length = 905 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 51/132 (38%), Gaps = 22/132 (16%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ + + + + + ++ +TDGE+ S+ Sbjct: 383 EAAGGTSICNGLRKGFETITSS--------DQSTCGSEIVLLTDGEDDQISSCF------ 428 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQ-DLLRKCTDSSGQFFAVNDSRELLESFDKITDK-- 386 E ++++G I+++A+ + + L +F+A L+++F I+ K Sbjct: 429 ---EEVKHSGAVIHTIALGPDAARELETLSDM-TGGRRFYASEGINGLIDAFSGISSKSG 484 Query: 387 -IQEQSVRIAPN 397 + +Q++++ Sbjct: 485 SLSQQALQLESK 496 >gi|116487355|ref|NP_001070824.1| chloride channel calcium activated 4-like [Rattus norvegicus] gi|116013527|dbj|BAF34587.1| calcium-activated chloride channel [Rattus norvegicus] Length = 905 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 51/132 (38%), Gaps = 22/132 (16%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ + + + + + ++ +TDGE+ S+ Sbjct: 383 EAAGGTSICNGLRKGFETITSS--------DQSTCGSEIVLLTDGEDDQISSCF------ 428 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQ-DLLRKCTDSSGQFFAVNDSRELLESFDKITDK-- 386 E ++++G I+++A+ + + L +F+A L+++F I+ K Sbjct: 429 ---EEVKHSGAVIHTIALGPDAARELETLSDM-TGGRRFYASEGINGLIDAFSGISSKSG 484 Query: 387 -IQEQSVRIAPN 397 + +Q++++ Sbjct: 485 SLSQQALQLESK 496 >gi|156377912|ref|XP_001630889.1| predicted protein [Nematostella vectensis] gi|156217919|gb|EDO38826.1| predicted protein [Nematostella vectensis] Length = 372 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 62/184 (33%), Gaps = 19/184 (10%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNT 277 L + + V + + + +NNL +K + K+ T T Sbjct: 195 FLDKLVSELEIGPSKSHVGLVRYNQVADTLWDFNGAENNNLKSLKDAIEKIEYLPGGTRT 254 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT---------LNT 328 A+ ++++ + N + + ++ ITDG+ + S ++ L Sbjct: 255 DLALKKVNEDIFSPMGGARN-----DVPQVLVVITDGKTNQRSEPYSSVLQPLKVRILYP 309 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKI 387 + E + +KI +V V G+ L + F + +L+ + I + Sbjct: 310 RDVTEQ--DKDVKIVAVGVGH-SIGKKELDIIALGDASNVFMLETFDDLVRRLNAIMNSF 366 Query: 388 QEQS 391 + Sbjct: 367 CDTL 370 >gi|149021014|gb|EDL78621.1| rCG55860, isoform CRA_b [Rattus norvegicus] gi|149021015|gb|EDL78622.1| rCG55860, isoform CRA_b [Rattus norvegicus] Length = 919 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 59/156 (37%), Gaps = 10/156 (6%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 I + I + + P++ +N+ K + L+P T+ A+ A + L N Sbjct: 294 QDRFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQTAIKLLNNY 353 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + R +IF+TDG+ + NTL L + + I++V + Sbjct: 354 VAQNDIE---DRSVSLIIFLTDGKPTFGE--TNTLRILSNTKEATGGQICIFTVGIGNDV 408 Query: 352 EGQDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + + L L C + +L+ +D+I Sbjct: 409 DFRLLEKLSLENCGLTRRVHEEEKAGAQLIGFYDEI 444 >gi|293342867|ref|XP_001069890.2| PREDICTED: rCG55860-like [Rattus norvegicus] gi|149021013|gb|EDL78620.1| rCG55860, isoform CRA_a [Rattus norvegicus] Length = 953 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 59/156 (37%), Gaps = 10/156 (6%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 I + I + + P++ +N+ K + L+P T+ A+ A + L N Sbjct: 328 QDRFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQTAIKLLNNY 387 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + R +IF+TDG+ + NTL L + + I++V + Sbjct: 388 VAQNDIE---DRSVSLIIFLTDGKPTFGE--TNTLRILSNTKEATGGQICIFTVGIGNDV 442 Query: 352 EGQDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + + L L C + +L+ +D+I Sbjct: 443 DFRLLEKLSLENCGLTRRVHEEEKAGAQLIGFYDEI 478 >gi|257470753|ref|ZP_05634843.1| von Willebrand factor type A domain-containing protein [Fusobacterium ulcerans ATCC 49185] gi|317064958|ref|ZP_07929443.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] gi|313690634|gb|EFS27469.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] Length = 376 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 58/151 (38%), Gaps = 8/151 (5%) Query: 224 IQKAIQEKKNLSVRIGTIAYNI--GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 + + +Q K+ V+IG +AY + + T L+ NL+E+ S L + + Sbjct: 67 VNEVMQNHKDSKVKIGLVAYRDRGDVYVTKVTQLNENLDEIYSVLMDYKAQGGGDDPEDV 126 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 A E + S L + + + D Y ++ +T+ + ++ G+ Sbjct: 127 RKALHESLEIIQWSAPREN---LSQIIFLVGDAPPHDD--YNDSPDTVVTAKKAKSKGII 181 Query: 342 IYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 I ++ P + G++F ++ Sbjct: 182 INTIQCGNMPSTDRYWKAIAQFGGGEYFHIS 212 >gi|47230696|emb|CAF99889.1| unnamed protein product [Tetraodon nigroviridis] Length = 1031 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 31/169 (18%), Positives = 58/169 (34%), Gaps = 25/169 (14%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNE 291 + ++G + Y +V ++ EV R +N TNT ++ A + + Sbjct: 151 FTGQVGVLQYGEKVVHEFKLSDYKSVEEVVKRARSINQRGGEETNTALGINVACSQAFKH 210 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV-SA- 349 K +I ITDGE ++ + Q+ + G+ Y++AV Sbjct: 211 GGRRGAK-------KVMIVITDGE------SHDSADLQQVIKDCEKDGITRYAIAVLGYY 257 Query: 350 ---PPEGQDLL---RKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQ 390 + L + FF V D L + D + ++I Sbjct: 258 NRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDALGERIFSL 306 >gi|229577024|ref|NP_001153314.1| collagen type XXVIII alpha 1 a [Danio rerio] gi|228007387|emb|CAQ51228.1| collagen type XXVIII alpha 1 a [Danio rerio] Length = 1208 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 22/163 (13%), Positives = 52/163 (31%), Gaps = 19/163 (11%) Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRE 287 + LS R + Y+ ++ Q KS + + T T A+ + + Sbjct: 96 DTGRKLSWRAALLQYSSHVIIEQTLKQWKGTENFKSSIAPMAYIGHGTYTTYAITNMTKI 155 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 E K + +TDG + + +N G+K++++ + Sbjct: 156 FVEESSPERI--------KIALLLTDG-----FFHPRNPDIFSAMADAKNQGVKVFTIGI 202 Query: 348 SAPPEG---QDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 + LR + + + + D+ + + +I Sbjct: 203 TRTANDPVNAANLRLLSSTPASRFLYNLQDTNVMEKVITQIAQ 245 >gi|116619435|ref|YP_821591.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116222597|gb|ABJ81306.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 377 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 21/141 (14%), Positives = 50/141 (35%), Gaps = 17/141 (12%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + I+Y + + + +++ ++ K+ P Y A++ A Sbjct: 133 PGDKRYTGYLISYGNTA--DIAVNTTWDSDKIADKVRKMKPGGGAALYDAIYLACT---- 186 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV---AV 347 ++ ++ I DG ++ + N ++ E + + IY+V A Sbjct: 187 --RRELVKGEPYEPRRVIVVIGDGHDNA-----SKHNLEEVLELAQRNLVTIYAVSTMAF 239 Query: 348 SAPPEGQDLLRKCT-DSSGQF 367 E Q++L + T + G Sbjct: 240 GFSNEDQEVLERLTHKTGGHV 260 >gi|301626998|ref|XP_002942667.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Xenopus (Silurana) tropicalis] Length = 4207 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 22/198 (11%), Positives = 57/198 (28%), Gaps = 27/198 (13%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + + ++ + + + S + I ++ P Sbjct: 605 DESSSVGHSNFVNELRFVKKLLSDFPVVPSATRVAIITFSSKTNVQTRVDYISSSE--PH 662 Query: 256 SNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + + + + + T T A A + L + K + ITDG Sbjct: 663 QHKCSLLNREIPAITYKGGGTFTKGAFQQAAQIL---------RYSRSNSTKVIFLITDG 713 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND 372 ++G + I +R+ G++I++V + L + ++ Sbjct: 714 YSNGG-------DPRPIAANLRDLGVEIFTVGI--WQGNIRELHDMASHPKEEHCYLLHS 764 Query: 373 SRELLESFDKITDKIQEQ 390 E F+ + + + Sbjct: 765 FAE----FEALARRALHE 778 >gi|124004754|ref|ZP_01689598.1| von Willebrand factor, type A, putative [Microscilla marina ATCC 23134] gi|123989877|gb|EAY29406.1| von Willebrand factor, type A, putative [Microscilla marina ATCC 23134] Length = 354 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 57/169 (33%), Gaps = 40/169 (23%) Query: 254 PLSNNLNEVKSRLN----KLNPYE--NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 PL+ + + L P T+ Y + ++ +S+ K Sbjct: 160 PLTYDKGALNLFTQILNTNLMPIGNAGTDFYAPLELVLKKYQEANKSNRKQQNE--YAKV 217 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ------------- 354 V+ +DGE G I + + +++++V V + G+ Sbjct: 218 VVLFSDGEEFGDRY-------TAIVDQYKQNNIRVFTVGVGSLQGGKIPTSLGFKKDKKG 270 Query: 355 ---------DLLRKCTD-SSGQFFAV-NDSRELLESFDKITDKIQEQSV 392 L+ + ++G+FF V E+ E + I +I+ Q + Sbjct: 271 KVVLSKLSTTSLQTIAEQTNGRFFEVSETKNEIPELINTI-QEIKGQKL 318 >gi|118347390|ref|XP_001007172.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89288939|gb|EAR86927.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 669 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/178 (12%), Positives = 54/178 (30%), Gaps = 18/178 (10%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN--LNEVKSRLNKLNPYENTN 276 +LV K I N R+ + ++ ++ N + ++K+ TN Sbjct: 67 DLVKHTVKTIASSLNPQDRLALVGFSTHSKIYFELTEMDDQGKNVAFTEIDKMWAGGQTN 126 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + + + + + N + TDG + + +++ + Sbjct: 127 IWGGLQDSLEVIKKGFRPNQNV--------CIFLFTDGRPTM----IPAIGHVEMLRRWK 174 Query: 337 NAG----MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 I++ + +L + +G F ++DS L F I Sbjct: 175 EQHPAIQFSIFTFGFGNDLDTDLMLELSQEQNGIFSFISDSSMLGTVFSNALANILST 232 >gi|89068267|ref|ZP_01155677.1| von Willebrand factor type A domain protein [Oceanicola granulosus HTCC2516] gi|89046184|gb|EAR52242.1| von Willebrand factor type A domain protein [Oceanicola granulosus HTCC2516] Length = 1065 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 24/181 (13%), Positives = 53/181 (29%), Gaps = 25/181 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT-PLSNNLNEVKS 264 KI + E ++ ++ G P + ++ Sbjct: 41 GVNKIVIAREVVSEILGDF---PADQNLGLTVYGHRTRGDCTDIETVVAPATGTAGQIVE 97 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + +NP T A+ A L +E + V+ ++DG + Sbjct: 98 VVEGINPRGKTPMTDAIIAAAEALRYTEERAT-----------VVLVSDGIET------C 140 Query: 325 TLNTLQICEYMRNAGM--KIYSVAVS-APPEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 + + AG+ + + P+ ++ D + G F ++ EL E+ Sbjct: 141 NPDPCAAARALEQAGIDFTAHVIGFDVTDPDALGQMQCIADETGGTFTTAANAGELSEAM 200 Query: 381 D 381 Sbjct: 201 S 201 >gi|256004503|ref|ZP_05429482.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum DSM 2360] gi|255991508|gb|EEU01611.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum DSM 2360] gi|316940035|gb|ADU74069.1| Carbohydrate-binding CenC domain protein [Clostridium thermocellum DSM 1313] Length = 1050 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 40/290 (13%), Positives = 77/290 (26%), Gaps = 30/290 (10%) Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ S+ N S + + N N + L Sbjct: 646 SIGGNIEFQTSNSTINGIAYAPGNPANPNSGKIFFSGDKNTINGSIAANELDF---FAGG 702 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 N T+ ++ K L N + K K ++V + + N Sbjct: 703 LVVNHTEGQFDTVEEKYIDKSTYLKLVKDAAKNFVDKFAGSKTKMAVIQYSDSANDNDFK 762 Query: 250 NQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKESSHNTI-------- 299 L + +K ++K+ P +N M AY L + Sbjct: 763 KYDLSLPDKGAALKETIDKIKPGTSGLSNMGDGMRRAYHILNGPPPKGQISKYIVVITGS 822 Query: 300 ---------GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 K D + +Y + + + + G+ + + S Sbjct: 823 VPNRWTAVDNKKNEPKTDNGRADFIKADNESYNSLDYAKDMGRIITSKGINLVFIDFSEE 882 Query: 351 PEGQDLLRKCTDSSG--------QFFAVNDSRELLESFDKITDKIQEQSV 392 G L +S ++ N+ ELL+ + +T KI V Sbjct: 883 DIGDVLEEIAAESGAKPLEGTDRHYYKANNFLELLDILNNMTLKIYYDVV 932 >gi|240255540|ref|NP_476506.3| collagen alpha-3(VI) chain isoform 3 precursor [Homo sapiens] Length = 1237 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 833 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 887 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 888 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 944 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 945 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 994 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 995 ISERVTQLT 1003 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 53/180 (29%), Gaps = 22/180 (12%) Query: 206 ANRKIDVLIESA--GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + + LVN ++K +R+G + ++ + +V Sbjct: 43 GSNNTGSVNFAVILDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVL 100 Query: 264 SRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L N A+ + + + + + ++ I+ G Sbjct: 101 GAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGP------ 151 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV---NDSRELLE 378 + + + ++ A ++S + A + L+ F V +L E Sbjct: 152 SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 >gi|240255535|ref|NP_476507.3| collagen alpha-3(VI) chain isoform 4 precursor [Homo sapiens] Length = 2570 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 432 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 486 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 487 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 543 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 544 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 593 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 594 ISERVTQLT 602 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 866 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 922 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 923 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 972 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 973 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1005 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1041 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1100 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1101 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1150 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1151 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1200 >gi|219841772|gb|AAI44596.1| COL6A3 protein [Homo sapiens] gi|223462379|gb|AAI50626.1| COL6A3 protein [Homo sapiens] Length = 2570 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 432 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 486 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 487 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 543 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 544 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 593 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 594 ISERVTQLT 602 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 866 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 922 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 923 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 972 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 973 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1005 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1041 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1100 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1101 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1150 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1151 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1200 >gi|219521324|gb|AAI71790.1| COL6A3 protein [Homo sapiens] Length = 2570 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 432 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 486 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 487 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 543 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 544 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 593 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 594 ISERVTQLT 602 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 866 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 922 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 923 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 972 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 973 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1005 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1041 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1100 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1101 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1150 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1151 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1200 >gi|194389238|dbj|BAG65607.1| unnamed protein product [Homo sapiens] Length = 1237 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 833 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 887 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 888 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 944 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 945 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 994 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 995 ISERVTQLT 1003 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 52/180 (28%), Gaps = 22/180 (12%) Query: 206 ANRKIDVLIESA--GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + + LVN ++K +R+G + ++ + +V Sbjct: 43 GSNNTGSVNFAVILDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVL 100 Query: 264 SRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L N A+ + + + + + ++ I+ G Sbjct: 101 GAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGP------ 151 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV---NDSRELLE 378 + + + ++ A ++S + + L+ F V +L E Sbjct: 152 SSDEIRYGVVA--LKQA--SVFSFGLGVQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 >gi|194381994|dbj|BAG64366.1| unnamed protein product [Homo sapiens] Length = 1207 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 432 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 486 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 487 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 543 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 544 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 593 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 594 ISERVTQLT 602 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 26/179 (14%), Positives = 62/179 (34%), Gaps = 16/179 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1041 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1100 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1101 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1150 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + + + G+K+++V V + + K +S F V + +EL E +I+ Sbjct: 1151 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE-LSEIST 1206 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 866 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 922 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 923 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVVRSSGIV--SLGVGDRNID 972 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 973 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1005 >gi|240255542|ref|NP_476505.3| collagen alpha-3(VI) chain isoform 2 precursor [Homo sapiens] gi|193787261|dbj|BAG52467.1| unnamed protein product [Homo sapiens] Length = 1036 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 632 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 686 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 687 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 743 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 744 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 793 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 794 ISERVTQLT 802 >gi|119591515|gb|EAW71109.1| collagen, type VI, alpha 3, isoform CRA_g [Homo sapiens] Length = 2205 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 833 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 887 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 888 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 944 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 945 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 994 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 995 ISERVTQLT 1003 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1267 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1323 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1324 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1373 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1374 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1406 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1442 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1501 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1502 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1551 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1552 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1601 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 53/180 (29%), Gaps = 22/180 (12%) Query: 206 ANRKIDVLIESA--GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + + LVN ++K +R+G + ++ + +V Sbjct: 43 GSNNTGSVNFAVILDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVL 100 Query: 264 SRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L N A+ + + + + + ++ I+ G Sbjct: 101 GAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGP------ 151 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV---NDSRELLE 378 + + + ++ A ++S + A + L+ F V +L E Sbjct: 152 SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 >gi|119591516|gb|EAW71110.1| collagen, type VI, alpha 3, isoform CRA_h [Homo sapiens] Length = 2977 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 839 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 893 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 894 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 950 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 951 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 1000 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1001 ISERVTQLT 1009 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1273 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1329 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1330 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1379 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1380 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1412 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1448 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1507 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1508 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1557 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1558 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1607 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/314 (10%), Positives = 84/314 (26%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + +A A+ + N+F G+ + L Sbjct: 126 MQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDADEGAL 185 Query: 132 SLRSTGIIERSSEN--LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + N S+ ++ S + + T F Sbjct: 186 KEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIF 245 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + LVN ++K +R+G + ++ Sbjct: 246 LIDGSNNTGSVNFAVI-----------LDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRT 292 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + +V + L N A+ + + + + + Sbjct: 293 MFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQV 349 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G + + + ++ A ++S + A + L+ Sbjct: 350 LVLISAGP------SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLV 399 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 400 FTVPEFRSFGDLQE 413 >gi|119591509|gb|EAW71103.1| collagen, type VI, alpha 3, isoform CRA_a [Homo sapiens] Length = 2211 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 839 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 893 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 894 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 950 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 951 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 1000 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1001 ISERVTQLT 1009 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1273 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1329 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1330 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1379 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1380 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1412 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1448 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1507 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1508 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1557 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1558 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1607 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/314 (10%), Positives = 84/314 (26%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + +A A+ + N+F G+ + L Sbjct: 126 MQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDADEGAL 185 Query: 132 SLRSTGIIERSSEN--LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + N S+ ++ S + + T F Sbjct: 186 KEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIF 245 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + LVN ++K +R+G + ++ Sbjct: 246 LIDGSNNTGSVNFAVI-----------LDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRT 292 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + +V + L N A+ + + + + + Sbjct: 293 MFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQV 349 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G + + + ++ A ++S + A + L+ Sbjct: 350 LVLISAGP------SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLV 399 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 400 FTVPEFRSFGDLQE 413 >gi|119591514|gb|EAW71108.1| collagen, type VI, alpha 3, isoform CRA_f [Homo sapiens] Length = 2244 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 1039 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 1093 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 1094 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 1150 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 1151 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 1200 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1201 ISERVTQLT 1209 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1481 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1540 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1541 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1590 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1591 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1640 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/314 (10%), Positives = 84/314 (26%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + +A A+ + N+F G+ + L Sbjct: 126 MQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDADEGAL 185 Query: 132 SLRSTGIIERSSEN--LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + N S+ ++ S + + T F Sbjct: 186 KEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIF 245 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + LVN ++K +R+G + ++ Sbjct: 246 LIDGSNNTGSVNFAVI-----------LDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRT 292 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + +V + L N A+ + + + + + Sbjct: 293 MFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQV 349 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G + + + ++ A ++S + A + L+ Sbjct: 350 LVLISAGP------SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLV 399 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 400 FTVPEFRSFGDLQE 413 >gi|119591511|gb|EAW71105.1| collagen, type VI, alpha 3, isoform CRA_c [Homo sapiens] Length = 2971 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 833 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 887 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 888 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 944 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 945 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 994 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 995 ISERVTQLT 1003 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1267 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1323 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1324 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1373 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1374 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1406 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1442 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1501 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1502 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1551 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1552 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1601 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 53/180 (29%), Gaps = 22/180 (12%) Query: 206 ANRKIDVLIESA--GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + + LVN ++K +R+G + ++ + +V Sbjct: 43 GSNNTGSVNFAVILDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVL 100 Query: 264 SRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L N A+ + + + + + ++ I+ G Sbjct: 101 GAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGP------ 151 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV---NDSRELLE 378 + + + ++ A ++S + A + L+ F V +L E Sbjct: 152 SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 >gi|119591512|gb|EAW71106.1| collagen, type VI, alpha 3, isoform CRA_d [Homo sapiens] Length = 2411 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 1039 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 1093 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 1094 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 1150 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 1151 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 1200 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1201 ISERVTQLT 1209 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1473 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1529 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1530 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1579 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1580 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1612 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1648 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1707 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1708 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1757 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1758 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/314 (10%), Positives = 84/314 (26%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + +A A+ + N+F G+ + L Sbjct: 126 MQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDADEGAL 185 Query: 132 SLRSTGIIERSSEN--LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + N S+ ++ S + + T F Sbjct: 186 KEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIF 245 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + LVN ++K +R+G + ++ Sbjct: 246 LIDGSNNTGSVNFAVI-----------LDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRT 292 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + +V + L N A+ + + + + + Sbjct: 293 MFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQV 349 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G + + + ++ A ++S + A + L+ Sbjct: 350 LVLISAGP------SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLV 399 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 400 FTVPEFRSFGDLQE 413 >gi|119591513|gb|EAW71107.1| collagen, type VI, alpha 3, isoform CRA_e [Homo sapiens] Length = 3177 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 1039 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 1093 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 1094 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 1150 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 1151 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 1200 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1201 ISERVTQLT 1209 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1473 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1529 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1530 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1579 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1580 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1612 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1648 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1707 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1708 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1757 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1758 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/314 (10%), Positives = 84/314 (26%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + +A A+ + N+F G+ + L Sbjct: 126 MQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDADEGAL 185 Query: 132 SLRSTGIIERSSEN--LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + N S+ ++ S + + T F Sbjct: 186 KEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIF 245 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + LVN ++K +R+G + ++ Sbjct: 246 LIDGSNNTGSVNFAVI-----------LDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRT 292 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + +V + L N A+ + + + + + Sbjct: 293 MFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQV 349 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G + + + ++ A ++S + A + L+ Sbjct: 350 LVLISAGP------SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLV 399 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 400 FTVPEFRSFGDLQE 413 >gi|55743106|ref|NP_476508.2| collagen alpha-3(VI) chain isoform 5 precursor [Homo sapiens] Length = 2971 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 833 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 887 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 888 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 944 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 945 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 994 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 995 ISERVTQLT 1003 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1267 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1323 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1324 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1373 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1374 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1406 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1442 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1501 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1502 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1551 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1552 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1601 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 53/180 (29%), Gaps = 22/180 (12%) Query: 206 ANRKIDVLIESA--GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + + LVN ++K +R+G + ++ + +V Sbjct: 43 GSNNTGSVNFAVILDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVL 100 Query: 264 SRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L N A+ + + + + + ++ I+ G Sbjct: 101 GAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGP------ 151 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV---NDSRELLE 378 + + + ++ A ++S + A + L+ F V +L E Sbjct: 152 SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 >gi|55743098|ref|NP_004360.2| collagen alpha-3(VI) chain isoform 1 precursor [Homo sapiens] gi|311033499|sp|P12111|CO6A3_HUMAN RecName: Full=Collagen alpha-3(VI) chain; Flags: Precursor gi|225000446|gb|AAI72233.1| Collagen, type VI, alpha 3 [synthetic construct] gi|302313173|gb|ADL14511.1| collagen, type VI, alpha 3 [Homo sapiens] Length = 3177 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 1039 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 1093 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 1094 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 1150 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 1151 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 1200 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1201 ISERVTQLT 1209 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1473 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1529 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1530 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1579 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1580 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1612 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1648 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1707 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1708 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1757 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1758 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/314 (10%), Positives = 84/314 (26%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + +A A+ + N+F G+ + L Sbjct: 126 MQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDADEGAL 185 Query: 132 SLRSTGIIERSSEN--LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + N S+ ++ S + + T F Sbjct: 186 KEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIF 245 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + LVN ++K +R+G + ++ Sbjct: 246 LIDGSNNTGSVNFAVI-----------LDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRT 292 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + +V + L N A+ + + + + + Sbjct: 293 MFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQV 349 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G + + + ++ A ++S + A + L+ Sbjct: 350 LVLISAGP------SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLV 399 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 400 FTVPEFRSFGDLQE 413 >gi|62988748|gb|AAY24135.1| unknown [Homo sapiens] Length = 2588 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 1039 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 1093 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 1094 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 1150 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 1151 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 1200 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1201 ISERVTQLT 1209 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1473 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1529 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1530 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1579 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1580 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1612 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1648 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1707 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1708 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1757 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1758 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/314 (10%), Positives = 84/314 (26%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + +A A+ + N+F G+ + L Sbjct: 126 MQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDADEGAL 185 Query: 132 SLRSTGIIERSSEN--LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + N S+ ++ S + + T F Sbjct: 186 KEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIF 245 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + LVN ++K +R+G + ++ Sbjct: 246 LIDGSNNTGSVNFAVI-----------LDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRT 292 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + +V + L N A+ + + + + + Sbjct: 293 MFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQV 349 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G + + + ++ A ++S + A + L+ Sbjct: 350 LVLISAGP------SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLV 399 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 400 FTVPEFRSFGDLQE 413 >gi|52545928|emb|CAH56139.1| hypothetical protein [Homo sapiens] Length = 1222 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 833 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 887 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 888 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 944 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 945 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 994 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 995 ISERVTQLT 1003 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 53/180 (29%), Gaps = 22/180 (12%) Query: 206 ANRKIDVLIESA--GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + + LVN ++K +R+G + ++ + +V Sbjct: 43 GSNNTGSVNFAVILDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVL 100 Query: 264 SRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L N A+ + + + + + ++ I+ G Sbjct: 101 GAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGP------ 151 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV---NDSRELLE 378 + + + ++ A ++S + A + L+ F V +L E Sbjct: 152 SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 >gi|3127926|emb|CAA36267.1| collagen type VI, alpha 3 chain [Homo sapiens] Length = 3176 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 1039 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNKQDVVN 1093 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 1094 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 1150 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 1151 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 1200 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1201 ISERVTQLT 1209 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1473 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1529 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1530 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1579 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1580 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1612 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1648 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1707 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1708 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1757 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1758 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/314 (10%), Positives = 84/314 (26%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + +A A+ + N+F G+ + L Sbjct: 126 MQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDADEGAL 185 Query: 132 SLRSTGIIERSSEN--LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + N S+ ++ S + + T F Sbjct: 186 KEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIF 245 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + LVN ++K +R+G + ++ Sbjct: 246 LIDGSNNTGSVNFAVI-----------LDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRT 292 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + +V + L N A+ + + + + + Sbjct: 293 MFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQV 349 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G + + + ++ A ++S + A + L+ Sbjct: 350 LVLISAGP------SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLV 399 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 400 FTVPEFRSFGDLQE 413 >gi|301612325|ref|XP_002935678.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-2 [Xenopus (Silurana) tropicalis] Length = 524 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/340 (10%), Positives = 89/340 (26%), Gaps = 20/340 (5%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ A +++ K + E + +I+ D Sbjct: 68 LAEAAEKFQKAHRWQDNIREEDIEYYDSK--ADTEYEDLEGEEEPKEPTHSLKIDFADDP 125 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 N + A +IPT+ +I + L I+ E+ + + + Sbjct: 126 NFKAKVNYNYTA-VQIPTDIYKGSTVILNELNWTDALEDVFIQNRLEDPTLLWQVFGSAT 184 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + + P + S + + + + ++ S Sbjct: 185 GVTRYYPATPWRAPSKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGLTLKLMKTSV 244 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNT 277 +++++ + N +K + ++ T+ Sbjct: 245 MEMLDTLSDDDYVTVASFHEKADPVSCFRQLVQANVR---NKKVIKEAVQEMVARGTTDY 301 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 +A+ +L N + K ++ TDG N N Sbjct: 302 KAGFEYAFSQLQNTSITRA------NCNKMIMMFTDGGEDRVQDVFEKYNWP-------N 348 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 +++++ +V L+ + G +F + + Sbjct: 349 KTVRVFTFSVGQHNYDVTPLQWMACANKGYYFEIPSIGAI 388 >gi|126306129|ref|XP_001365364.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Monodelphis domestica] Length = 895 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 21/143 (14%), Positives = 55/143 (38%), Gaps = 27/143 (18%) Query: 263 KSRLNKLN--PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 S + +L T+ + A+ + + + ++ +TDGE++ S Sbjct: 369 NSLITRLPTVAGGGTSICSGLRTAFTVIKKKFSTD---------GSEIVLLTDGEDNTIS 419 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LL 377 + + + +G I++VA+ P L + + G D+ + L+ Sbjct: 420 TCFDEV---------KQSGAIIHTVALG-PSADPGLEKLAEMTGGMKTTATDNAQNNGLI 469 Query: 378 ESFDKITD---KIQEQSVRIAPN 397 ++F ++ I ++S+++ Sbjct: 470 DAFSALSSGNGAITQRSIQLVSK 492 >gi|126306100|ref|XP_001362237.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 959 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 64/169 (37%), Gaps = 27/169 (15%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKES 294 G + ++ + S +++L T+ + A+ + N+ + Sbjct: 343 WTGMVTFDSSATIQSALIQIETDAQRNSLISRLPTAAGGGTSICSGLRTAFTVIKNKFST 402 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 ++ +TDGE+S S+ + + + +G I++VA+ P Sbjct: 403 D---------GSEIVLLTDGEDSTISSCFDEV---------KQSGAIIHTVALG-PSADP 443 Query: 355 DLLRKCTDSSGQFFAVNDSRE---LLESFDKITDK---IQEQSVRIAPN 397 L + G + D+ + L+++F ++ + I ++S+++ Sbjct: 444 GLEELAKMTGGMKTSATDNAQNNGLIDAFSALSSENGAITQRSIQLDSK 492 >gi|308493174|ref|XP_003108777.1| hypothetical protein CRE_11006 [Caenorhabditis remanei] gi|308248517|gb|EFO92469.1| hypothetical protein CRE_11006 [Caenorhabditis remanei] Length = 425 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 28/166 (16%), Positives = 57/166 (34%), Gaps = 26/166 (15%) Query: 235 SVRIGTIAYNI--GIVGNQCTPLSNNLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNE 291 ++ + + EV ++KL T + A ++ Sbjct: 274 YTQVAAVTFATVGRTRVRFNLKKYTTQEEVLRGIDKLQSKGGTTAIGAGIEKALTQIDES 333 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + K +I TDG ++ + + + NAG ++Y+VA +A Sbjct: 334 EGARPGIAT-----KVMIVFTDGWSNKG------PDPEKRAKDAVNAGFEMYTVAYTARA 382 Query: 352 E-----GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + L + SSG F D F + DKI+++++ Sbjct: 383 PNSVTLNNETLSAISGSSGHAF--TDVT-----FQSLVDKIKQRNL 421 >gi|224065911|ref|XP_002191398.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H4 (plasma Kallikrein-sensitive glycoprotein) [Taeniopygia guttata] Length = 809 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 19/135 (14%), Positives = 41/135 (30%), Gaps = 14/135 (10%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N+ + + L T+ A+ A L R +I +TDG+ + Sbjct: 305 NVASAAALVQTLTARGGTDISGALLAAVGVLEKA------EGLPERSISMIILLTDGQPT 358 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ-----FFAVND 372 N + + N ++ + L K S+G + + Sbjct: 359 SGEK--NVEVIQEKVQEAINGKYALFCLGFGFDV-SYKFLEKMALSNGGIARRIYENADA 415 Query: 373 SRELLESFDKITDKI 387 + +L + ++ I Sbjct: 416 ALQLQGFYQEVATPI 430 >gi|54293612|ref|YP_126027.1| hypothetical protein lpl0665 [Legionella pneumophila str. Lens] gi|53753444|emb|CAH14899.1| hypothetical protein lpl0665 [Legionella pneumophila str. Lens] Length = 1169 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 47/418 (11%), Positives = 103/418 (24%), Gaps = 64/418 (15%) Query: 34 DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI 93 DA L+ I + + ++ + K I+ ++ + + Sbjct: 113 DANGLAPYTVIQGGNKVDNSASRLNVAKAGIKAIIENYMPTTDFALGIYSTSNISSYNTW 172 Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 + P + SA + +S + I L +S Sbjct: 173 VYYMSPPGSDFVFTNTPVAGNRYVTNPCYNYGSASSTVSSNCSSIGSLYGTTL-VSSSQY 231 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 L + S +D + + + Sbjct: 232 LQIGDSSDDPDINDALY-AGSGFPGIFVSYNGPTPSSPFPPNYTISNYNQGNIRISYANT 290 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP----------LSNNLNEVK 263 S GN +S A + V + + T + V Sbjct: 291 RPSIGNFSSSPTNAGFVPFSQQVMYVQRGFGYYSNQSYATGNMLVNMQTAGTNPTTTSVN 350 Query: 264 SRLNKLNP-----YENTNTY--------PAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + +N P +T T + + ++ T G+ K+++I Sbjct: 351 NAINAFLPHLKPETNSTATTEIKAAAVQSPLAGLLTRSRSFMKTVGTTSGNCPQKQYIIL 410 Query: 311 ITDGENSGASAYQNT-------------------------------LNTLQICEYMRNAG 339 I+DG + + + + + ++N G Sbjct: 411 ISDGLPTQDLQSRYWPPLGSAAATGYGVTATFNADGSLNNTNSQALSDAINEIKALKNDG 470 Query: 340 MKIYSVAVSAPPE------GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 + I+ + + A + LR + G ++ L+ S + I IQ Sbjct: 471 VLIFIIGMGAGVDPAVNPEAAATLRAMAVAGGTENYYPATSPEALVSSLNSILSNIQN 528 >gi|302917449|ref|XP_003052439.1| hypothetical protein NECHADRAFT_36251 [Nectria haematococca mpVI 77-13-4] gi|256733379|gb|EEU46726.1| hypothetical protein NECHADRAFT_36251 [Nectria haematococca mpVI 77-13-4] Length = 764 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 34/206 (16%), Positives = 65/206 (31%), Gaps = 22/206 (10%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 +P P + + S AP P + S +L + I E N S R+G Sbjct: 80 IPHVPCDIVLVIDVSGSMAGAAPVPGEETNESTGLSILDLTKHAARTIIETMNESDRLGI 139 Query: 241 IAYNIGIVGNQCT--PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 + + Q S N + + + P + TN + + + N K S+ Sbjct: 140 VTFASKAKVVQPLLSMTSENKERSRGNVTSMRPIDATNLWHGLLEGIKLFKNVKSSNVPA 199 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQD 355 ++ +TDG + + +R G I++ Sbjct: 200 ---------IMVLTDGMPNHMNP------AAGFVPKLRAMGQLPASIHTFGFGY-HLRSG 243 Query: 356 LLRKCTD-SSGQFFAVNDSRELLESF 380 LL+ + G + + D+ + F Sbjct: 244 LLKSIAEIGGGNYAFIPDAGMIGTVF 269 >gi|291395817|ref|XP_002714337.1| PREDICTED: complement factor B-like [Oryctolagus cuniculus] Length = 764 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 39/225 (17%), Positives = 75/225 (33%), Gaps = 18/225 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL--IESAGNLVNSIQKAIQEKKNLSVRIG 239 P +K + S + I + LVN I+K + Sbjct: 253 PGELQKRKIVLDPAGSMNIYLVLDGSDSIGASNFTGAKRCLVNLIEKVASYGVRPRYGLV 312 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-----NTNTYPAMHHAYRELYNEKES 294 T A ++ P S++ N V +LN+++ + TNT A+ Y + + Sbjct: 313 TYATYPNVLVRVSDPKSSDANWVTEKLNQISYEDHKLKTGTNTKRALVEVYNMMSWPGDV 372 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL-QICEYMRNAGMK------IYSVAV 347 R + +I +TDG ++ +N + + ++ +Y V Sbjct: 373 PPEGWN--RTRHVIILMTDGLHNMGGDPVTVINEIRDLLNIGKDRKNPREDYLDVYVFGV 430 Query: 348 S--APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 P + L ++ F V D L + F ++ D+ Q Sbjct: 431 GPLVEPANINALASKKENEQHVFRVKDMEHLEDVFFQMIDESQSL 475 >gi|149187720|ref|ZP_01866017.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] gi|148838600|gb|EDL55540.1| hypothetical protein VSAK1_23409 [Vibrio shilonii AK1] Length = 340 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 57/158 (36%), Gaps = 24/158 (15%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKE 293 R+G I + +P + + + L++ E+T+ A + + Sbjct: 140 RLGLILFGDSAYLQ--SPFTADHEAWLALLDQAQVGMAGESTHLGDA-------VGLTIK 190 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 + + + ++K I +TDG ++ + + + + G+++Y VA+ +P Sbjct: 191 TYIDNPENQTVEKVAIILTDGNDTDSLVP-----PIDAAKVAQAYGIRLYIVAMGSPNTT 245 Query: 353 -----GQDLLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 + + GQ F +L + I+ Sbjct: 246 GDQAIDFSTIETMATVTGGQAFLAMSQEDLDAVYQTIS 283 >gi|126340390|ref|XP_001364302.1| PREDICTED: similar to voltage-dependent calcium channel alpha-2 delta subunit [Monodelphis domestica] Length = 1092 Score = 54.9 bits (130), Expect = 2e-05, Method: Composition-based stats. Identities = 35/346 (10%), Positives = 98/346 (28%), Gaps = 29/346 (8%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIA--QKAQINITKDKN 98 ++ T + Q F + E + + +I ++ Sbjct: 94 SKALERLATKAEQFQAAHQWRDDFGSNEVVYYNAKDDFDELDKNESEPNSQRIKPVFVED 153 Query: 99 NPLQYIAESK-AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 + + A IPT+ ++ + L S + E+ + + + Sbjct: 154 AVFRRQTSYQHAAVHIPTDIYEGSTIVLNELNWTSALDEVFKKNRDEDPTLLWQVFGSAT 213 Query: 158 R----SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 ++ K N + P + S + + + + ++ Sbjct: 214 GLARYYPASPWVDKTRTPNKIDLYDVRRRPWYIQGAASPKDMLILVDVSGSVSGLTLRLI 273 Query: 214 IESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 S ++ ++ + + ++ +V N +K +N + Sbjct: 274 RTSVSEMLETLSDDDFVNVASFNSNAQDVSCFQHLVQANVR----NKKVLKDAVNNITAK 329 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+ A+ +L N S K ++ TDG Sbjct: 330 GITDYKKGFSFAFEQLLNYNVSRA------NCNKIIMLFTDG--------GEERAQEIFA 375 Query: 333 EYMRNAGMKIYSVAVSAPPEGQD--LLRKCTDSSGQFFAVNDSREL 376 +Y ++ +++++ +V +D + C + G ++ + + Sbjct: 376 KYNKDKKVRVFTFSVGQHNYDRDPVKWKAC-RNCGYYYEIPSIGAI 420 >gi|313244510|emb|CBY15285.1| unnamed protein product [Oikopleura dioica] Length = 486 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/221 (12%), Positives = 58/221 (26%), Gaps = 15/221 (6%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 Y + I + N I E + S+R Sbjct: 243 YEPCNQDACDAGCSGPRDVLFVAHYTTYMGSTFADISA--FFENIISTINVEPSDSSIRF 300 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 +N + N+++E K + P Y A + + Sbjct: 301 AFSFFNHAYIEFFAFDWLNSIDEYKWAFSSFPPASGNANYIG--RALKGAADTMTPEFGK 358 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 V+ +T+ +T ++ + ++ ++ V + GQD L Sbjct: 359 GRRIDTVGTVVLLTN--------AASTDEVNEMADQLKEKVDRVIVVGLGY-AFGQDELA 409 Query: 359 KCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + + +S +L I D+I + + Sbjct: 410 GIASNPTKENLYIAEESSDLAGLVKTIADEICATELSNPSD 450 >gi|194387934|dbj|BAG61380.1| unnamed protein product [Homo sapiens] Length = 589 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 66/190 (34%), Gaps = 18/190 (9%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNE 261 P +D + +++++ + + ++ A + L+ + Sbjct: 54 PYGALVDKVKSFTKRFIDNLRD-RYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDA 112 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +KS ++ + + T T A+ +L SH K++I +TDG Sbjct: 113 LKSSVDAVKYFGKGTYTDCAIKKGLEQLLVG--GSHLKEN-----KYLIVVTDGHPLEGY 165 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF---FAVNDSRELL 377 ++ G+K++SVA+ P + L + + F D + Sbjct: 166 KEPCG-GLEDAVNEAKHLGVKVFSVAI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGQSR 222 Query: 378 ESFDKITDKI 387 ++ + I+ I Sbjct: 223 DAEEAISQTI 232 >gi|194386506|dbj|BAG61063.1| unnamed protein product [Homo sapiens] Length = 404 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 66/190 (34%), Gaps = 18/190 (9%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNE 261 P +D + +++++ + + ++ A + L+ + Sbjct: 54 PYGALVDKVKSFTKRFIDNLRD-RYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDA 112 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +KS ++ + + T T A+ +L SH K++I +TDG Sbjct: 113 LKSSVDAVKYFGKGTYTDCAIKKGLEQLLVG--GSHLKEN-----KYLIVVTDGHPLEGY 165 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF---FAVNDSRELL 377 ++ G+K++SVA+ P + L + + F D + Sbjct: 166 KEPCG-GLEDAVNEAKHLGVKVFSVAI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGQSR 222 Query: 378 ESFDKITDKI 387 ++ + I+ I Sbjct: 223 DAEEAISQTI 232 >gi|119629725|gb|EAX09320.1| collagen, type VI, alpha 1, isoform CRA_a [Homo sapiens] gi|119629726|gb|EAX09321.1| collagen, type VI, alpha 1, isoform CRA_a [Homo sapiens] Length = 726 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 66/190 (34%), Gaps = 18/190 (9%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNE 261 P +D + +++++ + + ++ A + L+ + Sbjct: 54 PYGALVDKVKSFTKRFIDNLRD-RYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDA 112 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +KS ++ + + T T A+ +L SH K++I +TDG Sbjct: 113 LKSSVDAVKYFGKGTYTDCAIKKGLEQLLVG--GSHLKEN-----KYLIVVTDGHPLEGY 165 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF---FAVNDSRELL 377 ++ G+K++SVA+ P + L + + F D + Sbjct: 166 KEPCG-GLEDAVNEAKHLGVKVFSVAI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGQSR 222 Query: 378 ESFDKITDKI 387 ++ + I+ I Sbjct: 223 DAEEAISQTI 232 >gi|114684807|ref|XP_001158390.1| PREDICTED: collagen, type VI, alpha 1 isoform 1 [Pan troglodytes] gi|114684809|ref|XP_001158445.1| PREDICTED: collagen alpha-1(VI) chain isoform 2 [Pan troglodytes] Length = 1028 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 66/190 (34%), Gaps = 18/190 (9%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNE 261 P +D + +++++ + + ++ A + L+ + Sbjct: 54 PYGALVDKVKSFTKRFIDNLRD-RYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDA 112 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +KS ++ + + T T A+ +L SH K++I +TDG Sbjct: 113 LKSSVDAVKYFGKGTYTDCAIKKGLEQLLVG--GSHLKEN-----KYLIVVTDGHPLEGY 165 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF---FAVNDSRELL 377 ++ G+K++SVA+ P + L + + F D + Sbjct: 166 KEPCG-GLEDAVNEAKHLGVKVFSVAI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGQSR 222 Query: 378 ESFDKITDKI 387 ++ + I+ I Sbjct: 223 DAEEAISQTI 232 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 62/222 (27%), Gaps = 25/222 (11%) Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 N + +K +F S + + + D A L A Sbjct: 803 KNVTAQICVDKKCPDYTCPITFSSPADITILLDGSASVGSHNFDTTKRFAKRLAERFLTA 862 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHH 283 + VR+ + Y+ N + S ++ ++ + T+ A+ + Sbjct: 863 GRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGY 922 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 R + KK ++ +DG + G + + + AG++I+ Sbjct: 923 VTRFYREASSGAA--------KKRLLLFSDGNSQG----ATPAAIEKAVQEAQRAGIEIF 970 Query: 344 SVAVSAPPE---------GQDLLRKCTDSSGQFFAVNDSREL 376 V V G+ F V + L Sbjct: 971 VVVVGRQVNEPHIRVLVTGKTAEYDVAYGERHLFRVPSYQAL 1012 >gi|87196339|ref|NP_001839.2| collagen alpha-1(VI) chain precursor [Homo sapiens] gi|125987811|sp|P12109|CO6A1_HUMAN RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|30851190|gb|AAH52575.1| Collagen, type VI, alpha 1 [Homo sapiens] gi|119629727|gb|EAX09322.1| collagen, type VI, alpha 1, isoform CRA_b [Homo sapiens] Length = 1028 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 66/190 (34%), Gaps = 18/190 (9%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNE 261 P +D + +++++ + + ++ A + L+ + Sbjct: 54 PYGALVDKVKSFTKRFIDNLRD-RYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDA 112 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +KS ++ + + T T A+ +L SH K++I +TDG Sbjct: 113 LKSSVDAVKYFGKGTYTDCAIKKGLEQLLVG--GSHLKEN-----KYLIVVTDGHPLEGY 165 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF---FAVNDSRELL 377 ++ G+K++SVA+ P + L + + F D + Sbjct: 166 KEPCG-GLEDAVNEAKHLGVKVFSVAI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGQSR 222 Query: 378 ESFDKITDKI 387 ++ + I+ I Sbjct: 223 DAEEAISQTI 232 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 62/222 (27%), Gaps = 25/222 (11%) Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 N + +K +F S + + + D A L A Sbjct: 803 KNVTAQICIDKKCPDYTCPITFSSPADITILLDGSASVGSHNFDTTKRFAKRLAERFLTA 862 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHH 283 + VR+ + Y+ N + S ++ ++ + T+ A+ + Sbjct: 863 GRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGY 922 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 R + KK ++ +DG + G + + + AG++I+ Sbjct: 923 VTRFYREASSGAA--------KKRLLLFSDGNSQG----ATPAAIEKAVQEAQRAGIEIF 970 Query: 344 SVAVSAPPE---------GQDLLRKCTDSSGQFFAVNDSREL 376 V V G+ F V + L Sbjct: 971 VVVVGRQVNEPHIRVLVTGKTAEYDVAYGESHLFRVPSYQAL 1012 >gi|21322624|emb|CAC27151.2| inter-alpha-trypsin inhibitor heavy chain 2 [Platichthys flesus] Length = 304 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 51/167 (30%), Gaps = 7/167 (4%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + + + S + + K + + P TN A+ A + Sbjct: 20 DLTIDDHFIIVDFNHNVRCWNEELVHGSSIQVTDAKKYIQNIKPNGGTNINEALMRAVQM 79 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L R +I ++DG+ + +T+ + MR ++S+ + Sbjct: 80 LVRASNQGMI---DPRSVSMIILVSDGDPTVGEIKLSTIQ-KNVKRVMREE-FSLFSLGI 134 Query: 348 SAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 D L + ++ G + + + E ++ +R Sbjct: 135 GFDV-DYDFLERIAMENRGMAQRIYANHDAAEQLRAFYSQVSSPLLR 180 >gi|30032|emb|CAA33888.1| precursor polypeptide (AA -19 to 237) [Homo sapiens] Length = 256 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 66/190 (34%), Gaps = 18/190 (9%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNE 261 P +D + +++++ + + ++ A + L+ + Sbjct: 54 PYGALVDKVKSFTKRFIDNLRD-RYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDA 112 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +KS ++ + + T T A+ +L SH K++I +TDG Sbjct: 113 LKSSVDAVKYFGKGTYTDCAIKKGLEQLLVG--GSHLKEN-----KYLIVVTDGHPLEGY 165 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF---FAVNDSRELL 377 ++ G+K++SVA+ P + L + + F D + Sbjct: 166 KEPCG-GLEDAVNEAKHLGVKVFSVAI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGQSR 222 Query: 378 ESFDKITDKI 387 ++ + I+ I Sbjct: 223 DAEEAISQTI 232 >gi|166363914|ref|YP_001656187.1| hypothetical protein MAE_11730 [Microcystis aeruginosa NIES-843] gi|166086287|dbj|BAG00995.1| hypothetical protein MAE_11730 [Microcystis aeruginosa NIES-843] Length = 581 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 42/359 (11%), Positives = 99/359 (27%), Gaps = 26/359 (7%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+ I+ +K + T + + G + A ++ Sbjct: 232 LTVADVEKYQGQIQQIQSKITRYGTSTASLAQSMVANGPFWASVASVYESLVIAANSQAG 291 Query: 98 NNPLQYIA---ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 +N +Y A ++ + + I+ ++ +A + + Sbjct: 292 SNQTRYQAVYPKATFSSNMRAILAHAPWISDREKEAAEKVIEFILLPETQQIATDLGLRP 351 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 V N + P P+ + ++ A Sbjct: 352 GVPGVALGSKFSAEFGVNPQPTYDSYRSPQPEVVEAMLKSWQNYAKKPSQVAVVIDTSGS 411 Query: 215 ESAGNLVNSIQKAIQEKKN--LSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNP 271 L + + +N RI I++N I + + + +L Sbjct: 412 MEGQKLTSVKNTLLNYVQNLGPKERIALISFNSVINEPVIIEGTPQGRDRGIEFIGQLRS 471 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T Y + +A L T V+ +TDGE+SG+ + L Sbjct: 472 SGGTRLYDSALYARNWLSQNLR--------TDTINAVLILTDGEDSGSQINLDQLE---- 519 Query: 332 CEYMRNAG------MKIYSVAVS-APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + ++ +G + +++ L+K + +G ++ D + + Sbjct: 520 -QELQKSGFSSDQRIAFFTIGYGKEGEFNPQALQKIAEVNGGYYRQGDPATISTVMGDL 577 >gi|86609942|ref|YP_478704.1| hypothetical protein CYB_2508 [Synechococcus sp. JA-2-3B'a(2-13)] gi|86558484|gb|ABD03441.1| hypothetical protein CYB_2508 [Synechococcus sp. JA-2-3B'a(2-13)] Length = 249 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 44/132 (33%), Gaps = 10/132 (7%) Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + + ++ + + P NT Y A A R L + + + Sbjct: 126 TAQLLQDFTGDKELLRRGVLRAVPGGNTALYDATVEAGRFLSDFRPTERFNR-------R 178 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKCTDSSG 365 ++ TDG ++ S T ++ R G + +Y V + +L R + G Sbjct: 179 LVVFTDGIDNE-STRSINQATQELTTLARERGQKLTVYVVGLGVDLNLLELQRLAAATEG 237 Query: 366 QFFAVNDSRELL 377 F + +L Sbjct: 238 TFVLARFADQLS 249 >gi|47218290|emb|CAG04122.1| unnamed protein product [Tetraodon nigroviridis] Length = 993 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 17/125 (13%), Positives = 34/125 (27%), Gaps = 11/125 (8%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ A+ L + S +IF+TDG+ + L Sbjct: 381 TSGGGTDIDGAIQTGSSLLRDHLSGRDAGPNS---VSLIIFLTDGQPTVGEVR--PGAIL 435 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA-----VNDSRELLESFDKIT 384 I+++ + LL + + + S L +D+I Sbjct: 436 GNARAAVRDKFCIFTIGMG-DDVDYRLLERMALDNCGMMRRIPEEADASSMLKGFYDEIG 494 Query: 385 DKIQE 389 + Sbjct: 495 TPLLS 499 >gi|313233701|emb|CBY09871.1| unnamed protein product [Oikopleura dioica] Length = 663 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 44/335 (13%), Positives = 84/335 (25%), Gaps = 37/335 (11%) Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAE---SKAQYEIPTENLFLKGLIPSALT- 129 S I+Q +T P GLI S+ T Sbjct: 339 AESTTSSIGPTISQPPGPPVTTTMEPPGSSTTNQPNITNTLPPGPPATTTTGLIGSSTTA 398 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH---NDNNNMTSNKYLLPPPPK 186 L+ + S ++ S + + P Sbjct: 399 QLTTTNQSATTISPTCPPSVPDPCMNNSTCIMINATLTDCACQRGWAGEFCDVVCPATLT 458 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 A P + + L+N Q V++ +++ Sbjct: 459 GRLDLAVLLDVSGTIASNPNKDQDTF--DFFQALLNEFDTVNQ------VQLSITSFSDD 510 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 V + N ++ + ++ T+ + A + + Sbjct: 511 AVVDLPMG-HYNEPDLFGAVKNVDWVGRLTDINEGLQTALSTMNTT----------DDVP 559 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL-LRKCTDSS 364 +IF++DG + + + NAG+ + SV + L D+ Sbjct: 560 DIMIFVSDGFD-----SFDPGAIGDNAADISNAGVDVVSVGFGLNGFVNFMALVTVADNE 614 Query: 365 G-QFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 G F + ELL I + + AP R Sbjct: 615 GANVFTASTGDELLSQTTAILEALCTA---NAPTR 646 >gi|323700441|ref|ZP_08112353.1| hypothetical protein DND132_3035 [Desulfovibrio sp. ND132] gi|323460373|gb|EGB16238.1| hypothetical protein DND132_3035 [Desulfovibrio desulfuricans ND132] Length = 389 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 19/180 (10%), Positives = 46/180 (25%), Gaps = 7/180 (3%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M ++ ++ A+DL R+ +Q + ++ ++ D+ Sbjct: 18 MVSLCMAALMGLTALAVDLGRAYLKRSALQ------TAADAGALAGANSLLAAGKDLDKL 71 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEI-PTENLF 119 I ++L + D + D+ P Q E L+ Sbjct: 72 RLIVTNYTTRNLTDADGPAKALTDADIVFLRDGVPDEEQPNQVEVTVTLSGERENAFPLY 131 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 + ++ + S + + +V +D N Sbjct: 132 FGKAVGKPAMDIVVTSRAGLAGMCSSKCSKPFVVPTKYEWDDDAAPGTKYYQNGTLDVDS 191 >gi|332796610|ref|YP_004458110.1| von Willebrand factor type A [Acidianus hospitalis W1] gi|332694345|gb|AEE93812.1| von Willebrand factor type A [Acidianus hospitalis W1] Length = 381 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 16/123 (13%), Positives = 50/123 (40%), Gaps = 19/123 (15%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 ++ ++ + + + T+ + A+ A++ + +++ +TDG Sbjct: 91 SDPLDLTNEIQNIAANGQTSLFTALLTAFKIAVKYSMPA-----------YILLLTDGNP 139 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + + +T + G++I S + ++LL+ +D + F+ + ++ E Sbjct: 140 T------DVTDTRTYEKMSIPQGVQIISFGIG-DDYNEELLKILSDRTGSTFYHIQEASE 192 Query: 376 LLE 378 + E Sbjct: 193 IPE 195 >gi|313226593|emb|CBY21739.1| unnamed protein product [Oikopleura dioica] Length = 694 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 31/325 (9%), Positives = 87/325 (26%), Gaps = 28/325 (8%) Query: 69 KKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSAL 128 + + +Y N G + I + + + + Sbjct: 57 RDFFLRYAYTETNYGIVQHHHVQQIWSIDEAQFNTTKTVALKSSVSGKFNVDWDKLEYDE 116 Query: 129 TNLSLRST-GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 + S + I +D ++ + + D ++ + Sbjct: 117 LTIPTYSMLALFLYLEHLNEFPIPYSIDAQAAIYESITHHNVDGFKDGVDELDQITQDQC 176 Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 + + + + ++ + + + A RI ++ Sbjct: 177 RTNALDIVFV-VDESGSIGTPNFQLIKDFLEHFASDSTIAADA-----TRIAIRPFSTSN 230 Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 + + + + NTNT A+ A + ++ K Sbjct: 231 YLYFSLN-DFKTKNIINEIKNMPYNSGNTNTADALDAALTDYGTDRP---------ESVK 280 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSS 364 ++ ITDG ++ + L+T + ++N ++ +++ V L + Sbjct: 281 VMVTITDGASN------SFLSTSAAADRVKNDLRNIQSFAIGV--SGANMAELEAIAITD 332 Query: 365 GQFFAVNDSRELLESFDKITDKIQE 389 F +N + + K+ E Sbjct: 333 KHVFMLNGWADFEPIKSNLLQKVCE 357 >gi|238897540|ref|YP_002923219.1| putative tight adherance operon protein G [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] gi|229465297|gb|ACQ67071.1| putative tight adherance operon protein G [Candidatus Hamiltonella defensa 5AT (Acyrthosiphon pisum)] Length = 450 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 39/425 (9%), Positives = 111/425 (26%), Gaps = 64/425 (15%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 I+ SV + I + ++ + + + L+ L + ++ T D+ Sbjct: 36 FLIMFSVFLILIYATFEYCRLINEKIKTEQVLEQTSL----ILTAEDNGDTSGTPGDRNR 91 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + I+ + + E I I N L+ + + Sbjct: 92 FLALSHIESFIPSTTATNE---------SIEIISPNQNDLKRKYTITLRNAYSSIFNLFS 142 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN-------- 173 + + + + ++ + + + + N Sbjct: 143 TM----SVSGKAAAQKAYQEEPMDVVFVFSFYRHTIENPLRMSWETRQEINELKEVLIEK 198 Query: 174 -------MTSNKYLLPPPPKKSFWSKN-----------TTKSKYAPAPAPANRKIDVLIE 215 NK P + + + +I Sbjct: 199 INTILEMNPLNKVGFVPFSWGVKEGRYCLQPFVFLFKPNYDNMNFGPAQFKAPRQTEVIS 258 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGT---IAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNP 271 + + V ++ + +V + PL+ NE+ + ++L P Sbjct: 259 KSIDFVKTVNNIPFAMTDSNVISLASLDNRLCGLSLYRDPIPLTREKNEIFMIIKDELIP 318 Query: 272 YENTNTYPAMHHAYRELYNE----------KESSHNTIGSTRLKKFVIFITDGENSGASA 321 N + + L ++ + + N+ + Sbjct: 319 GYN-LISSGILSGVKLLSRGTNSKKLLIIISDTEEAPPMAINDDNKRMLRIRFNNNTLNI 377 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR-KCTDSSGQFFAVNDSRELLESF 380 ++ + +CE ++N+G+K+Y ++ E + + +C + + + Sbjct: 378 SESLIKA-GMCEKIKNSGIKMYFISTGGSIESHNYWKTQCIGEKNMH----NLNNVDDEL 432 Query: 381 DKITD 385 +I Sbjct: 433 KQILQ 437 >gi|256958585|ref|ZP_05562756.1| von Willebrand factor [Enterococcus faecalis DS5] gi|256949081|gb|EEU65713.1| von Willebrand factor [Enterococcus faecalis DS5] Length = 666 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 41/382 (10%), Positives = 101/382 (26%), Gaps = 30/382 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ +++ + D + + QG+ D Sbjct: 202 AEARMAPATLRANLAL--PLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGSAQWD 259 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 260 GQTSWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 315 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + +V+D S SM + L + S Sbjct: 316 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNINMGYVGYSSDGYNNN 375 Query: 205 PANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 D + N+ S + + G + L++ + Sbjct: 376 AIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPTFS 435 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +++++ T R + + + GST D N Sbjct: 436 YKVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRINS 485 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA-V 370 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 486 TFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYESA 545 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 546 DYAPDISDYLAKKAVQISGTVV 567 >gi|254436533|ref|ZP_05050027.1| von Willebrand factor type A domain protein [Octadecabacter antarcticus 307] gi|198251979|gb|EDY76293.1| von Willebrand factor type A domain protein [Octadecabacter antarcticus 307] Length = 613 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 38/359 (10%), Positives = 95/359 (26%), Gaps = 32/359 (8%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 A+ + + + + ++ F ++ E + + + + Sbjct: 111 AAPAGNDMAGNLRSMAEPSNDGFVHVLRDGSTFYEEYDETFANDTPNPLKITSDEPVSTF 170 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + A Y + +L L P+ + + Sbjct: 171 SIDVD-TAAYALIRSSLTRGQLPPTDAVRIEEMINYFPYAYPAPEGEAPFRPTINVFETP 229 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 + + PP + +T+ S + K+ +L +S ++ Sbjct: 230 WNADTQLVHIGIQGEMPAIEDRPPLNLVFLIDTSGS------MESADKLPLLRQSFRLML 283 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 ++++ + + + Y ++ + + LN LN +TN + Sbjct: 284 DNLRPEDE--------VAIVTYAGSTSIALEPTQASERATIIAALNALNAGGSTNGQGGL 335 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA--- 338 AY K + + VI TDG+ + + E R+ Sbjct: 336 EQAYALAETMKTAGDVSR--------VILATDGDFNVGL--SDPRGLQAYIEDKRDDAQQ 385 Query: 339 --GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 G + + ++ + L E+ + D + IA Sbjct: 386 TGGTYLSVLGFGRGNLQDATMQSLAQNGNG--TAAYIDTLSEAQKVLVDNLTGALFPIA 442 >gi|91203253|emb|CAJ72892.1| hypothetical protein kustd2147 [Candidatus Kuenenia stuttgartiensis] Length = 701 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 18/170 (10%), Positives = 58/170 (34%), Gaps = 14/170 (8%) Query: 233 NLSVRIGTIAYNIGIVGNQCT-PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 N + ++G + ++ + N +++ + Y + + Sbjct: 358 NDTHQVGVVEFHRPDEPPAILQDFTTYKNAAIEAVSQFSSG---KIYRDFSSCWDAVLKG 414 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + K ++F++DG ++ + + + + IY + + Sbjct: 415 LKQFP-EKPDPDIFKTLVFLSDGFDN-----SSFSTPGNVISLAKERDVHIYILGIGRGS 468 Query: 352 EGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQS-VR-IAPNR 398 +++L+ ++ G + + E F + I+ Q ++ I P + Sbjct: 469 -EEEVLKNIALETGGTYVHAENIAVFRERFKQTIKDIKGQYKIKYITPKK 517 >gi|47551047|ref|NP_999700.1| complement factor B [Strongylocentrotus purpuratus] gi|3928787|gb|AAC79682.1| factor B SpBf [Strongylocentrotus purpuratus] Length = 833 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 32/250 (12%), Positives = 73/250 (29%), Gaps = 25/250 (10%) Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK 198 + +V+D + ++ + K + + Sbjct: 344 SRVMNGLGKTFDEIVIDGGSWVGRNDTSSNSGLRRKRTIK---------LSDGMDIYFAF 394 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 A ++ A LV +Q R+G ++Y+ N+ Sbjct: 395 DASNSVGLKN-FEIGKTFAKQLVGKLQ---VNTSPGGTRVGAVSYSSEARRLFNV---ND 447 Query: 259 LNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + TN + A + + + G + K+ + ITDG ++ Sbjct: 448 FTSTVDVVKAIEANVNYTNKGTNLPAALETIGVMITETADESGYSSRKRILFIITDGFSN 507 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRE 375 A + L+ ++ I+ + +S + L + F ++D E Sbjct: 508 VGGAPSKSAQPLKEDAALK-----IHCIGISRN-TDKTALAEIASPPVSEHVFYLSDYNE 561 Query: 376 LLESFDKITD 385 L + + IT Sbjct: 562 LERAVEAITS 571 >gi|110632968|ref|YP_673176.1| hypothetical protein Meso_0611 [Mesorhizobium sp. BNC1] gi|110283952|gb|ABG62011.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 427 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/208 (12%), Positives = 61/208 (29%), Gaps = 2/208 (0%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI+ V + ++ + + ++Q A D AV + + + Sbjct: 29 IAAIVFPVVVGAMGLGVESGYWYLKQRKLQHAADVAVYAASVRYRA-GDARALMETAALR 87 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S + ++ + + + +T+ + + + L Sbjct: 88 SARVTGYQPSIGTITTGVQAGSTAGSGTVSVELTETHSRLFSSVFTTDPVVLSARAVAEL 147 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 KG + + LS + G + + + + VS S N + M+++ Sbjct: 148 KGGSRACVLALSPTAPGAVTVTGS-TDVQLNGCSVVSNSNASDAFLMRNGSALMSTDCVY 206 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + T SK + P Sbjct: 207 TVGGAITTTGLNLTGCSKPVVSSPPVPD 234 >gi|268608768|ref|ZP_06142495.1| hypothetical protein RflaF_04637 [Ruminococcus flavefaciens FD-1] Length = 453 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 51/154 (33%), Gaps = 11/154 (7%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT--IGSTR 303 T +N+++E+ S + K Y +TN A+ +A L + + S Sbjct: 230 DSGSKTLTDFTNDIDELDSAIAKTTAYGSTNYSAALRNAAELLSKVSADAVRNIVLCSDG 289 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQ--ICEYMRNAGMKIYSVAVSAPPEGQDL----- 356 + G+ + + I + ++ +IY++ G+DL Sbjct: 290 NPYGGEEKSTGKYTLSDYSDYEYANAAYDIAQEIKKD-YEIYTLGFFHSLSGEDLDFGRT 348 Query: 357 -LRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 L+ + VN +L + F + Sbjct: 349 YLKDVASYDSNYAEVNKVDDLQKVFADVAGNAVS 382 >gi|167041757|gb|ABZ06500.1| putative NHL repeat protein [uncultured marine microorganism HF4000_010L19] Length = 1148 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 39/336 (11%), Positives = 90/336 (26%), Gaps = 19/336 (5%) Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA 127 + SY + + A + KN ++ + + Sbjct: 179 STPYQSSLSYSWGMSVNSASNKLVIADYVKNRVVELNVSGSSLSYSQATSASYSSSNGYF 238 Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 + +L + + S + + + Y + Sbjct: 239 RRPTDAAYDSSGNIYALDLYNNRIQKFNSSLTYQ--AKTGSWSVSAGFRYPYGMHIDSSD 296 Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL--VNSIQKAIQEKKNLSVRIGTIAYNI 245 + + + N + L + K I +L+ Sbjct: 297 NIYVTDFYNYAVRKYDTSLNETATYGGGAGTRLDAAKKVIKKIVSNTDLTSGANFGLMEW 356 Query: 246 GIVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 G N +S+ + + ++ + T+ AM A + + ++ + Sbjct: 357 GTRHNIRVKISDTGAKTIYTNVDGVYASGGTDLAKAMSIARNYFTSGQVANW---NLSCS 413 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRN-AGMKIYSVAVSAPPEGQDLLRKCTDS 363 ++I I+DG SG + L I E ++N +K ++V + + Sbjct: 414 VNYLIVISDGYWSGHTT------VLSIAEQIKNVYNIKTFAVGFALGGANSNY-STLATK 466 Query: 364 SGQFF--AVNDSRE-LLESFDKITDKIQEQSVRIAP 396 G ++ E L + D I I + P Sbjct: 467 GGTTSPLYASNQSELLAKLTDAIKQAISGKLTFTTP 502 >gi|156409365|ref|XP_001642140.1| predicted protein [Nematostella vectensis] gi|156229281|gb|EDO50077.1| predicted protein [Nematostella vectensis] Length = 156 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 50/165 (30%), Gaps = 25/165 (15%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + + N+++ + + R+G + Y+ N Sbjct: 16 VDGSGSIKAARFKGVKRFIQNVISRFHISPK-----HTRVGLVLYSNNPYKIFGFNKYTN 70 Query: 259 LNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N ++ P T T A+ + R L+ + + +I +TDG + Sbjct: 71 KNAAMKATGRIPYPRRGTKTGRALAYTGRYLFRSSKRR----------RVLILLTDGRSY 120 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +R AG+ IY+V V LR Sbjct: 121 D--------RVSAPARKLRQAGIHIYAVGVGRN-YNIKQLRSIAS 156 >gi|159037814|ref|YP_001537067.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157916649|gb|ABV98076.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 427 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 70/195 (35%), Gaps = 30/195 (15%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-------N 258 +I V ++ +V+++ ++ L +R+ Y C + Sbjct: 56 GRSRISVAQQAFNEVVDAL----PDETQLGIRVLGATYPGENKERGCQDTQQIVPVGPVD 111 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + K+ + L P T A+ A ++L + ++ ITDGE++ Sbjct: 112 RVQAKAAVATLRPTGFTPVGLALRSAAQDLGTGS-----------TARRIVLITDGEDT- 159 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPE-GQDLLRKCTDSSGQFFAVNDSRE 375 + ++ + G K+ ++ ++ + + LL + G + A + E Sbjct: 160 ----CAPPDPCEVARELAAQGTKLVVDTLGLAPDEKVRRQLLCIAAATGGTYTAAQSADE 215 Query: 376 LLESFDKITDKIQEQ 390 L ++ D+ ++ Sbjct: 216 LTGRIKQLVDRARDT 230 >gi|326775386|ref|ZP_08234651.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] gi|326655719|gb|EGE40565.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] Length = 424 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 66/196 (33%), Gaps = 26/196 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ ++ +++++ + + + PL + Sbjct: 58 GQSRMAAAKQAFNDVLDAAPEEVHLGIRTLGADYPGEDRKVGCKDTKQLYPVGPL--DRT 115 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 E K+ + L P T PA+ A +L + + ++ ITDGE++ Sbjct: 116 EAKAAVATLAPTGFTPIGPALLGAADDLEGGEG-----------SRRIVLITDGEDTCG- 163 Query: 321 AYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 L+ ++ + G + ++ + + + L + + G + AV EL Sbjct: 164 ----PLDPCEVAREIAARGTHLVVDTLGLVPNAKIRQQLTCIAEATGGTYTAVQHKEELS 219 Query: 378 ESFDKITDKIQEQSVR 393 ++ D+ E V Sbjct: 220 GRVKQLVDRAAEPVVT 235 >gi|218679029|ref|ZP_03526926.1| hypothetical protein RetlC8_09174 [Rhizobium etli CIAT 894] Length = 151 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 49/157 (31%), Gaps = 27/157 (17%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + ++ L + +ID R +QSA+D+AVL+ A + Sbjct: 22 IVILVAVPLLLAVGASIDFIRAYNNRVDLQSAVDSAVLAAAAKYKH-------GMPEASI 74 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S + + A + ++ D+ + ++ Sbjct: 75 SGTVNAFLSAN---------GTLKSAVIGKPEVSSDEAELCLDVGDAV--------PTTF 117 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 + +S+RS + + I +VLDVS Sbjct: 118 MQVANIQSVPISVRSCAAL---PGVKQLEIALVLDVS 151 >gi|47229694|emb|CAG06890.1| unnamed protein product [Tetraodon nigroviridis] Length = 707 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 17/132 (12%), Positives = 39/132 (29%), Gaps = 14/132 (10%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 NL K+ + TN A+ L +I +TDG+ + Sbjct: 327 NLESAKTFARNIRANGATNINAAVLKGSSMLNAHPR--------EGSASILILLTDGDPT 378 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSR 374 N +Y + + L + ++G + + +D+ Sbjct: 379 TGE--TNPEAIQSNVRNAIAEKFPLYCLGFGFDVNFEFLEKMSLQNNGVARRIYEDSDAD 436 Query: 375 -ELLESFDKITD 385 +L ++++ Sbjct: 437 LQLKGFYEEVAT 448 >gi|301606773|ref|XP_002932992.1| PREDICTED: integrin alpha-11-like [Xenopus (Silurana) tropicalis] Length = 1188 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 46/368 (12%), Positives = 95/368 (25%), Gaps = 34/368 (9%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L ++ + D + I + AG + Sbjct: 7 LMAAWALCLLPGLTDTFNIDTKRPRIISGSKDAFFGYTVQQHQIAGKKWLVVGAPFDVNG 66 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 I + + + + LS S ++ D S Sbjct: 67 PQKTGDIYKCSVTNDTSNDCTK----LGIGRVTLSNVSERKDNMRLGMSLVTNPK--DNS 120 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY-----APAPAPANRKIDV 212 ++ T+ + + + I Sbjct: 121 FVACSPLWSHECGSSYYTTGMCSRVNSNFRFSRIVAPALQRCPTFMDIVIVLDGSNSIYP 180 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 +E L++ +QK +++G + Y +V ++ +V ++ Sbjct: 181 WVEVQSFLISILQKFYIAPGQ--IQVGVLQYGETVVHEFYLNNYRSVTDVVEAAKRIEQR 238 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T T A+ + + G KK +I ITDGE ++ + ++ Sbjct: 239 GGTETRTAL-----GIEKAVTEAFQRGGRKGAKKVMIVITDGE------SHDSPDLQRVI 287 Query: 333 EYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRELLESFDK 382 E + Y+VAV + L + FF V D L + D Sbjct: 288 ESSEKDNITRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAALKDIVDA 347 Query: 383 ITDKIQEQ 390 + ++I Sbjct: 348 LGERIFSL 355 >gi|260834079|ref|XP_002612039.1| hypothetical protein BRAFLDRAFT_94127 [Branchiostoma floridae] gi|229297412|gb|EEN68048.1| hypothetical protein BRAFLDRAFT_94127 [Branchiostoma floridae] Length = 794 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/261 (11%), Positives = 69/261 (26%), Gaps = 23/261 (8%) Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 + + +D++ + N + Sbjct: 114 SSSNGGCNQICTNTAGSFSCSCRPGYRLSGSRTCVDINECSTNNGGCSETCTNIVGGYYC 173 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 P + + + A + + + A + R+ Sbjct: 174 SCRPGYRLTGSGACVEFTTQAVDLVFLIARPRSSVSNADVKTFLKSVVAALNVSQTAARV 233 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN---TYPAMHHAYRELYNEKESS 295 I Y + + P +V S +N + + +T+ T A+ + L Sbjct: 234 AVIEYTSVMHSHFDLPTHLTNAQVTSAINSIPAWGSTSYRKTGSAIKYLTDYLSWR---- 289 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 + K ++ ITD ++ + + +NAG+ + SV V Sbjct: 290 ------DGIPKVLVVITDATSNDYVSGP--------AQSAKNAGLILSSVGVGT-SISST 334 Query: 356 LLRKCTDSSGQFFAVNDSREL 376 L +S + V+ ++ Sbjct: 335 ELNTIATNSSYRYTVSSYTDI 355 >gi|182434868|ref|YP_001822587.1| hypothetical protein SGR_1075 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178463384|dbj|BAG17904.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 424 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 27/196 (13%), Positives = 66/196 (33%), Gaps = 26/196 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQK-----AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ ++ +++++ + + + PL + Sbjct: 58 GQSRMAAAKQAFNDVLDAAPEEVHLGIRTLGADYPGEDRKVGCKDTKQLYPVGPL--DRT 115 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 E K+ + L P T PA+ A +L + + ++ ITDGE++ Sbjct: 116 EAKAAVATLAPTGFTPIGPALLGAADDLEGGEG-----------SRRIVLITDGEDTCG- 163 Query: 321 AYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 L+ ++ + G + ++ + + + L + + G + AV EL Sbjct: 164 ----PLDPCEVAREIAARGTHLVVDTLGLVPNAKIRQQLTCIAEATGGTYTAVQHKEELS 219 Query: 378 ESFDKITDKIQEQSVR 393 ++ D+ E V Sbjct: 220 GRVKQLVDRAAEPVVT 235 >gi|329850448|ref|ZP_08265293.1| hypothetical protein ABI_33520 [Asticcacaulis biprosthecum C19] gi|328840763|gb|EGF90334.1| hypothetical protein ABI_33520 [Asticcacaulis biprosthecum C19] Length = 395 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 43/354 (12%), Positives = 90/354 (25%), Gaps = 40/354 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + ++ A+DL + +R+ MQ+ALDAAVL+G + T Sbjct: 2 IYGLSATLVIAAGGGALDLFNASNLRHDMQNALDAAVLTGVRASSQMGTS---------A 52 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S FK+ + + S +A + + + A E + Sbjct: 53 SNAFKQNVADDMDGASQSYSSAVSSSSASSSSYVTT-------TLTGTASLE---SPTYF 102 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN-------- 172 L+ +S++S + VLD S S L N Sbjct: 103 MKLLGLNDLTVSVKSVAQGTTTIAPAGKPCIYVLDPSGSQALLVNSGANVQALSCEIHVK 162 Query: 173 -----------NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 + N L + T + A + L A + Sbjct: 163 STGNPAAIFNSGSSLNFKKLCVQGTNIIKNSVTVPNLVTGCAASGDPYAGTLPTPASSTC 222 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 + + + + + +K+ N T + Sbjct: 223 TYSNQNYSAATQNLTPGVYCGWFNFNNSSATVNFAPGVYVIKN--GGWNVNGGTWKGTGV 280 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 Y + + +S + + + + + + + Sbjct: 281 TFYYADTSKIQFNSGISADLSAPSSGTYANLLMYEASGLSKTQFVLNDSVANKL 334 >gi|301768024|ref|XP_002919431.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Ailuropoda melanoleuca] Length = 922 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 48/133 (36%), Gaps = 24/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ M A++ + + ++ +TDGE++ Sbjct: 378 ANGGTSICAGMRSAFQVIREVYPQIDGSE--------IVLLTDGEDNS---------AKD 420 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKIT--- 384 + + +G I+ +A+ P Q ++ + G F +D + L+++F + Sbjct: 421 CIDEVTQSGAIIHLIALG-PSADQAVIEMSAMTGGNHFFASDEAQNNGLIDAFGALASGN 479 Query: 385 DKIQEQSVRIAPN 397 + +Q +++ Sbjct: 480 TDLSQQPLQLESK 492 >gi|281350503|gb|EFB26087.1| hypothetical protein PANDA_008525 [Ailuropoda melanoleuca] Length = 961 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 22/150 (14%), Positives = 51/150 (34%), Gaps = 21/150 (14%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESS 295 R+G I Y+ + + ++ K ++++ T T A+H A + Sbjct: 665 RVGIINYSHKVEKVAHLTQFSTKDDFKLAVDRMQYLGEGTYTASALHEANHMFEAARPG- 723 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ- 354 +KK + ITDG+ ++ N ++ + + ++I+ + V + Sbjct: 724 --------VKKVALVITDGQTDT----RDEKNLTEVVKKASDINVEIFVIGVVKKNDPNF 771 Query: 355 ----DLLRKCTDSSG--QFFAVNDSRELLE 378 + + +D L E Sbjct: 772 EVFHKEMNLIATDPDSEHVYQFDDFITLQE 801 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 32/196 (16%), Positives = 70/196 (35%), Gaps = 30/196 (15%) Query: 210 IDVLIESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 D + +L + + + + +++ + ++ + + +L K R+ Sbjct: 60 FDKQKDFVDSLSDRVFQLTPVRSLKYDIKLAALQFSSSVQIDPSFSSWKDLQTFKQRVKS 119 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N T +Y A+ +A L E K + +TDG + + + Sbjct: 120 MNFIGQGTFSYYAISNATGLLKREGRKDGV--------KVALLMTDGID-----HPKNPD 166 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE----LLE--SFD 381 I E R AG+ ++ +S + LR + + S E L + D Sbjct: 167 VQSISEDARTAGILFITIGLST-VVNEAKLRLISG--------DSSSEPIPLLSDPTLVD 217 Query: 382 KITDKIQEQSVRIAPN 397 KI D++ ++ + P Sbjct: 218 KIRDRLFKRVLTTEPK 233 >gi|261876471|dbj|BAI47561.1| collagen type VI alpha 1 subunit [Mesocricetus auratus] Length = 1026 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 68/190 (35%), Gaps = 18/190 (9%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNE 261 P +D + +++++ + + ++ A + L+ + +E Sbjct: 53 PYGALVDKVKSFTKRFIDNLRD-RYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDE 111 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +K+ ++ + + T T A+ EL SH K++I +TDG Sbjct: 112 LKASVDAVKYFGKGTYTDCAIKKGLEELL--IGGSHLKEN-----KYLIVVTDGHPLEGY 164 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF---FAVNDSRELL 377 ++ G+K++SVA+ P + L + + F D + Sbjct: 165 KEPCG-GLEDAVNEAKHLGVKVFSVAI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGQSR 221 Query: 378 ESFDKITDKI 387 ++ + I+ I Sbjct: 222 DAEETISQTI 231 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 47/367 (12%), Positives = 106/367 (28%), Gaps = 36/367 (9%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 NQMQ +D + R +D + + Q + R Sbjct: 665 SHNQMQEHVD-------MRSPNIRNAQDFKEAVKKLQWMAGGTFTGEALQYTRDRLLPPT 717 Query: 85 IAQKAQINITKDKNNPLQYI--AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERS 142 + + IT +++ + +I ++ +K + + L Sbjct: 718 QNNRIALVITDGRSDTQRDTTPLSVLCGPDIQVVSVGIKDVFGFVAGSDQLNVISCQGLL 777 Query: 143 SENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA 202 S+ + + + ++D + N + +K +F S + Sbjct: 778 SQGRPGISLVKENYAELLDDGF--LKNITAQICIDKKCPDYTCPITFSSPTDITILLDGS 835 Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI---GIVGNQCTPLSNNL 259 + + + A L A + + VR+ + Y+ G N Sbjct: 836 ASVGSHNFETTKVFAKRLAERFLSADRTDPSQDVRVAVVQYSGLGQQQPGRTALQFLQNY 895 Query: 260 NEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + S ++ ++ + T+ A+ + R + KK V+ +DG + G Sbjct: 896 TVLASSVDSMDFINDATDVNDALSYVTRFYRENSLGA--------TKKRVLLFSDGNSQG 947 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------GQDLLRKCTDSSGQFFA 369 + + + AG++I+ V V G+ F Sbjct: 948 ----ATAEAIEKAVQEAQRAGIEIFVVVVGPQVNEPHIRVLVTGKTAEYDVAFGERHLFR 1003 Query: 370 VNDSREL 376 V + + L Sbjct: 1004 VPNYQAL 1010 >gi|223939937|ref|ZP_03631805.1| von Willebrand factor type A [bacterium Ellin514] gi|223891428|gb|EEF57921.1| von Willebrand factor type A [bacterium Ellin514] Length = 342 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 18/104 (17%), Positives = 38/104 (36%), Gaps = 22/104 (21%) Query: 254 PLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + + + L ++ P T+ A+ AYR + K ++ Sbjct: 150 PLTFDYDAFRDALLAIDEQTIPVGGTDIGRALDEAYRAMEKNDR-----------HKILV 198 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 ITDGE+ ++ + + G+ +Y++ V Sbjct: 199 LITDGEDL-------EKAGIKTAQALAEKGIVVYTIGVGTAAGS 235 >gi|77553311|gb|ABA96107.1| zinc finger family protein, putative, expressed [Oryza sativa Japonica Group] Length = 529 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 15/142 (10%), Positives = 39/142 (27%), Gaps = 24/142 (16%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + K ++ L NTN + + L T VI ++DG++ Sbjct: 150 DGKATAKRAVDALVANGNTNIRDGLDVDAKVLD--------GRRHTDAVASVILLSDGQD 201 Query: 317 S---GASAYQNTLNTLQICEYMR-------------NAGMKIYSVAVSAPPEGQDLLRKC 360 + G + + +++ + + Sbjct: 202 NQTMGYRGRFHMTDFKAAATSYDVLVPPSFTRAGGGERCAPVHAFGFGTDHDAAAMHSIS 261 Query: 361 TDSSGQFFAVNDSRELLESFDK 382 + G F + + + ++F + Sbjct: 262 EITGGTFSFIENLAVIQDTFAR 283 >gi|330952765|gb|EGH53025.1| von Willebrand factor, type A [Pseudomonas syringae Cit 7] Length = 262 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 19/106 (17%), Positives = 40/106 (37%), Gaps = 18/106 (16%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ + V+ L++ +NT A+ A + L + ++ Sbjct: 150 PLTYDRRTVRVWLDEARIGIAGKNTALGDAIGLALKRL----------RMRPATSRALVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 +TDG N+ ++ + G+KIY + + + P+ L Sbjct: 200 VTDGANNAGQ-----IDPITAARLAAEEGVKIYPIGIGSDPDKDAL 240 >gi|325287696|ref|YP_004263486.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324323150|gb|ADY30615.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 366 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 28/236 (11%), Positives = 78/236 (33%), Gaps = 27/236 (11%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK-----IDVLIESAGNLVNSIQKAI 228 Y T SK +D+ + ++ ++ A Sbjct: 89 DDFTIYEQGRNDDCFNKISTTESSKKISPNGQIFNNSTLLVLDLSKSVLDSSLDELKAAS 148 Query: 229 QEKKN----------LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 N S + ++ ++ PL+++ +E+ S + + +++ Sbjct: 149 ISFINNVMPEEEDKSDSFTMSIHWFDGEDKLHELNPLTSSRDELVSAIESITSTISSDAS 208 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKF--VIFITDGENSGASAYQNTLNTLQICEYMR 336 ++ A + + + V+ TDG + + ++ + Sbjct: 209 TDLYGAAIRSTDLASKYLEDNTAKDVIGAASVVLFTDGTDQASRY-TKAAALKKVTDA-- 265 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF--AVNDSRELLESFDKITDKIQEQ 390 + + +S+ + + ++L++ G+ F + EL +F++I+ KI E+ Sbjct: 266 DPNISFFSIGLGSEI-DAEILKEL----GKTFSVFATNKEELETTFNEISTKISER 316 >gi|194043712|ref|XP_001928130.1| PREDICTED: collagen alpha-3(VI) chain isoform 2 [Sus scrofa] Length = 2972 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 60/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKN--LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1441 INFRRDSFQEVLRFVSEIVDTVYEGGDSIQVGLVQYNSDPTDEFFLKDFSTKQQIIDAIN 1500 Query: 268 KLNPYENTNTYPAMHHAYREL-YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N+ + R+ + IT G+ + Sbjct: 1501 KVVYKGGRHANT--KVGLEHLRLNQFVPEAGSRLEQRVPQIAFVITGGK--------SVE 1550 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1551 DAQEASLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1600 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/290 (12%), Positives = 81/290 (27%), Gaps = 31/290 (10%) Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 S + P NL G++P L + + I L ++ Sbjct: 741 VAGRSSDSVDRPALNLKQSGVVPFILQAKNADPGELELIVPSPAFILAAESLPKIGDLQP 800 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + + P + S+ + P +L E +V Sbjct: 801 HIVNLL-----KSVQNGAPTPVSGEKDVVFLIDGSEGVRSGFP------LLKEFVQRVVE 849 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPA 280 S+ + VR+ + Y+ + V + + +L NT A Sbjct: 850 SL-----DVGPDRVRVAVVQYSDRTRPEFYLNSYMDQQSVVNAVRQLTLLGGPIPNTGAA 904 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + R++ E S + + +I +T A ++ + ++ G Sbjct: 905 LDFVLRDILTESAGSRIA---EGIPQLLIVLT--------ADRSGDDVRGPSVVLKRGGA 953 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ + A+ R+L I++++ Sbjct: 954 V--PIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQVISERVTRL 1001 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 16/154 (10%), Positives = 43/154 (27%), Gaps = 18/154 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRE 287 +R+G + Y+ +V + L N A+ Sbjct: 66 SVGAQQIRVGVVQYSDEPRTMFSLNSFATKAQVLDAVKALGFLGGELANVGLALDFVVDH 125 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + + + + ++ I+ A ++ ++ A ++S + Sbjct: 126 HFT---RAGGSRVEEGVPQVLVLIS--------AAPSSDKIRDAVLALKQA--SVFSFGL 172 Query: 348 SAPPEGQDLLRKCTDSSGQFFAV---NDSRELLE 378 A + L+ + F V +L + Sbjct: 173 GAQAASKAELQHIATNDNFVFTVPEFRSFGDLQD 206 Score = 43.7 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 21/148 (14%), Positives = 50/148 (33%), Gaps = 15/148 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNE 291 VR+G + ++ + + V + +L + NT A+ R + Sbjct: 1264 NKVRVGVVQFSNDVFPEFYLKTYRSQASVLDAIRRLRFKGGSPLNTGKALEFVARNFF-- 1321 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 S + + + ++ G++ + + + M ++G+ + V Sbjct: 1322 -VKSAGSRIEDGVPQHLVLFLGGKSQDDVSRYS--------QVMGSSGIVR--LGVGDRN 1370 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLES 379 + L+ T+ F V + REL Sbjct: 1371 IDRTELQTITNDPRLVFTVREFRELPNI 1398 >gi|57114202|ref|NP_001009169.1| complement factor B precursor [Pan troglodytes] gi|38502961|sp|Q864W0|CFAB_PANTR RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690185|gb|AAM10004.1| complement factor B precursor [Pan troglodytes] Length = 764 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/211 (17%), Positives = 71/211 (33%), Gaps = 31/211 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V + P S+N + Sbjct: 278 SDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATHPKIWVKVSD---PDSSNADW 334 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + G R + +I +TDG + Sbjct: 335 VTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--SWPDDIPPEGWNRTRHVIILMTDGLH 392 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 393 NMGG------DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKK 446 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D L + F ++ D+ Q S+ Sbjct: 447 DNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|221315959|ref|ZP_03597764.1| hypothetical protein BsubsN3_19772 [Bacillus subtilis subsp. subtilis str. NCIB 3610] Length = 235 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 66/218 (30%), Gaps = 23/218 (10%) Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 + + + S RKID+ +S + + K + Sbjct: 30 SPSFAAEKQADTNVAVLFDGSGSMVQKTGGERKIDIAKKSVKSFAELLPKDTNLMLRVFG 89 Query: 237 RIGTIAYN----IGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 G + L + + L++L P T A+ +E Sbjct: 90 HAGNNKLSGKALSCSTTETIYGLHPYEGSLFDNSLSELKPTGWTPIAKALADTRKEFEAF 149 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSA 349 K V ITDGE + + E +R + + + + + Sbjct: 150 DADG---------KNVVYLITDGEETCGG------DPAAEIEKLRASNVDTIVNIIGFNF 194 Query: 350 PPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDK 386 +G + +++ G++ + N + E ++++K K Sbjct: 195 DVKGNEEMKQAAVAGGGEYISANSADEFEQAWEKEAQK 232 >gi|332711435|ref|ZP_08431366.1| Mg-chelatase subunit ChlD [Lyngbya majuscula 3L] gi|332349413|gb|EGJ29022.1| Mg-chelatase subunit ChlD [Lyngbya majuscula 3L] Length = 579 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 16/129 (12%), Positives = 39/129 (30%), Gaps = 10/129 (7%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 E + L T Y + A L ++ V+ +TDG++ Sbjct: 455 QGKAEGIKFIGSLKAGGGTRLYDSAIFARNWLKQNFKTDAI--------NAVLILTDGQD 506 Query: 317 SGASAYQNTLNTLQICEYM-RNAGMKIYSVAVSA-PPEGQDLLRKCTDSSGQFFAVNDSR 374 SG+ + L+ + ++V +L+ + +G ++ + Sbjct: 507 SGSEITLDNLSKQLQSSNFEAEESIAFFTVGYGKQGEFSPKVLQTIAELNGGYYRQGNPE 566 Query: 375 ELLESFDKI 383 + + Sbjct: 567 TISNLIADL 575 >gi|297279075|ref|XP_001109489.2| PREDICTED: calcium-activated chloride channel regulator 4-like [Macaca mulatta] Length = 931 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 54/133 (40%), Gaps = 24/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + A++ + + S V+ +TDGE+ AS+ Sbjct: 379 ASGGTSICSGIKSAFQVI--------GELSSHLDGSEVVLLTDGEDYTASSC-------- 422 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKI---T 384 + ++ +G ++ +A+ + ++ + G+ F +D + L+++F + Sbjct: 423 -IDEVKRSGAIVHFIALGT-AADKAVIEMSKITGGRHFYASDKAQNNGLIDAFGDLTSGN 480 Query: 385 DKIQEQSVRIAPN 397 ++ ++S+++ Sbjct: 481 TELSQKSLQLESK 493 >gi|194016356|ref|ZP_03054970.1| YwmC [Bacillus pumilus ATCC 7061] gi|194011829|gb|EDW21397.1| YwmC [Bacillus pumilus ATCC 7061] Length = 233 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 19/129 (14%), Positives = 46/129 (35%), Gaps = 18/129 (13%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 ++ LN+L P T A+ HA + + + K V ITDGE Sbjct: 113 YEKESFENSLNELGPNGWTPIARALEHAKQA---------DEQLNNGTKHIVYLITDGEE 163 Query: 317 SGASAYQNTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + +++ + + N+ + + + + L++ + G ++ + Sbjct: 164 TCGG------DPVKVAKELHNSKGSTVVNVIGLDFNDGYEGQLKQVAKAGKGHYYQASTG 217 Query: 374 RELLESFDK 382 +E+ Sbjct: 218 KEMGSILSA 226 >gi|332246079|ref|XP_003272177.1| PREDICTED: complement factor B-like [Nomascus leucogenys] Length = 764 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 70/211 (33%), Gaps = 31/211 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V Q S+N + Sbjct: 278 SDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPRIWVKVSEQD---SSNADW 334 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LNK+N TNT A+ Y + G R + +I +TDG + Sbjct: 335 VTKQLNKINYEDHKLKSGTNTKKALQAVYSMM--SWPDDIPPEGWNRTRHVIILMTDGLH 392 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 393 NMGG------DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKK 446 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D L + F ++ D+ Q S+ Sbjct: 447 DNEQHVFKVKDMENLEDVFFQMIDESQSLSL 477 >gi|72007460|ref|XP_780292.1| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] gi|115939674|ref|XP_001195885.1| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] Length = 1500 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 52/166 (31%), Gaps = 26/166 (15%) Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 ++ + A+ + + + Y N+L V T T A Sbjct: 77 ASTTRVAVISYSSCNQIHIRVNYISSPENKNKCTFDNDLTSVNYH------PGGTCTAGA 130 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + A R++ + ++ V+ +TDG ++ + +++ G+ Sbjct: 131 LEAAGRDVLSHGRPGA--------QRVVMLLTDGASNDGGPPHAN------AQKLKSEGV 176 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAV----NDSRELLESFDK 382 KI+++ + L S ++ + D R L Sbjct: 177 KIFTIGIG--SIKLSELNAIATSVDEYVYILADFGDVRNLATVVKD 220 >gi|319777804|ref|YP_004134234.1| hypothetical protein Mesci_6053 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317171523|gb|ADV15060.1| hypothetical protein Mesci_6053 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 343 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 36/290 (12%), Positives = 75/290 (25%), Gaps = 12/290 (4%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + +++ I +++D + R+ Q ALDAAVLS + + T+ T T Sbjct: 4 LFVFMLAPIITAIGFSVDYTRAVQTRSNEQQALDAAVLSITG-MDTTSTLAQRQTMLQDT 62 Query: 61 STIFKKQIKKHL-----KQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT 115 L A + + + N + S A Sbjct: 63 FIANHGLGTPTLNSFVVSANGTATAQAMASYSMPTVFMQIARINTVPVAVGSAASKTPAL 122 Query: 116 ENLFL--KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 + ++L T ++ L + NN Sbjct: 123 VQTTFKVSKVSGWWNKTMTLYGTTFGATVAKPLMSIEYTYNGFGDPKGYGTTNVYTITNN 182 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 ++ L + +T A A + S L Sbjct: 183 GGADIKTLVQSQVCTTAGVSTFTGLTADA---ILQTSGTKKYSTTCLNTMYPANSAGASI 239 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMH 282 ++ ++ + + L ++ +RL + ++ T T Sbjct: 240 DVNQMAGLSLQMYVPSGNPKYLKSDDPTTSNRLYSGVDANNLTETPTGQK 289 >gi|221115448|ref|XP_002154505.1| PREDICTED: similar to polydom [Hydra magnipapillata] Length = 2514 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 35/118 (29%), Gaps = 16/118 (13%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TN A+ A L + E + + +TDG + + +T Sbjct: 246 GWTNINGALQKAKALLDSANEK---KFKRHNVNTVAVLLTDGGWNYGGSPYDT------A 296 Query: 333 EYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 +R + I+S+ V + L+ D + F + I+ Sbjct: 297 TNLRTGFHYVDIFSIGVGH-WLDRKQLKNIAGKEENVIIAKDFSD----FSGLATTIR 349 >gi|324513792|gb|ADY45651.1| C-type lectin protein 160 [Ascaris suum] Length = 497 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 19/159 (11%), Positives = 50/159 (31%), Gaps = 21/159 (13%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + +N +V + + +N +N MH A E++ Sbjct: 1 MVLFNDVARVGFPLTTYATNFDVTTAILGMNFNGGISNIAAGMHTAMTEVF-------AK 53 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 +++ ++ ITDG + + + Y + + +G + + Sbjct: 54 SSERGVQRVMVIITDGM--------DITDVGTEHALASKNNINTYVLGIG-EEQGYEEMV 104 Query: 359 KCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 K + + ++ +L + D++ + V P Sbjct: 105 KAAGDPSRLYDASNFTQLRDVI----DEVCNERVTHGPR 139 >gi|257387423|ref|YP_003177196.1| von Willebrand factor A [Halomicrobium mukohataei DSM 12286] gi|257169730|gb|ACV47489.1| von Willebrand factor type A [Halomicrobium mukohataei DSM 12286] Length = 788 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 46/138 (33%), Gaps = 20/138 (14%) Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 + ++ L N E ++ +L T+ + A L + Sbjct: 417 NHNAYRVSELRTLGQNRAETAEKIRQLESGGATDIAVGLQGADELLGDR----------- 465 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 + +I ++DG++ + + G ++ SV V + + + Sbjct: 466 --EGTIILLSDGQDRLGP-------PAAVANQLGREGTRVVSVGVGKRVGVPTMRQIAGE 516 Query: 363 SSGQFFAVNDSRELLESF 380 S G +FA +++ L F Sbjct: 517 SGGSYFAADETERLRLLF 534 >gi|126340361|ref|XP_001365240.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain2 [Monodelphis domestica] Length = 951 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 54/170 (31%), Gaps = 13/170 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + + + + + K + K+ P TN A+ A L Sbjct: 345 DQFSVVDFNHNVRNWRDDLVLASKAQITDAKKYIEKIQPNGGTNINEALLRAIFILNEAS 404 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 S +I ++DG+ + + + + + M++ + ++S+ + Sbjct: 405 NLGMLDPNS---VSLIILVSDGDPTVGELKLSQIQ-KNVKQSMQD-NISLFSLGIGFDV- 458 Query: 353 GQDLLRKCTDSSGQ------FFAVNDSRELLESFDKITDKIQEQSVRIAP 396 D L + + + S +L + +++++ + P Sbjct: 459 DYDFLERL-SQENHGVAQRIYGNQDTSLQLKQFYNQVSTPLLRNVQFNYP 507 >gi|221039656|dbj|BAH11591.1| unnamed protein product [Homo sapiens] Length = 237 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 20/130 (15%), Positives = 56/130 (43%), Gaps = 24/130 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T+ + +A++ + + S V+ +TDGE++ AS+ Sbjct: 92 PLGGTSICSGIKYAFQVI--------GELHSQLDGSEVLLLTDGEDNTASSC-------- 135 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKI---T 384 + ++ +G ++ +A+ + ++ + G F V+D + L+++F + Sbjct: 136 -IDEVKQSGAIVHFIALGR-AADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGN 193 Query: 385 DKIQEQSVRI 394 + ++S+++ Sbjct: 194 TDLSQKSLQV 203 >gi|149175890|ref|ZP_01854508.1| hypothetical protein PM8797T_24766 [Planctomyces maris DSM 8797] gi|148845337|gb|EDL59682.1| hypothetical protein PM8797T_24766 [Planctomyces maris DSM 8797] Length = 368 Score = 54.5 bits (129), Expect = 3e-05, Method: Composition-based stats. Identities = 23/144 (15%), Positives = 52/144 (36%), Gaps = 11/144 (7%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 TAI++ + +A+D +++ R +Q A DAAVL+ ++ D + + Sbjct: 26 TAIMLVPLLGMVAFAVDYGYLLKKRADLQRAADAAVLAAVRDLIPDANGTQDLS---KVR 82 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP-----LQYIAESKAQYEI--- 113 ++ ++ + + DI + N +++ Sbjct: 83 ATLRQYADLNIGEIEGFQVLDSDIQIGRYNPESVYDNFTILDWGTFDTVRVTLRFDTQAN 142 Query: 114 PTENLFLKGLIPSALTNLSLRSTG 137 +LF L+ ++L+ ST Sbjct: 143 SPVSLFFARLLGINESDLNATSTA 166 >gi|7145102|gb|AAA36225.2| MHC serum complement factor B [Homo sapiens] Length = 677 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 34/205 (16%), Positives = 67/205 (32%), Gaps = 19/205 (9%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V S+N + Sbjct: 191 SDSIGASNFTGAKKCLVNLIEKLASYGVKPRYGLVTYATXXXIWVKVSEAV---SSNADW 247 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + + R + +I +TDG + Sbjct: 248 VTKQLNEINYEDHKLKSGTNTEEALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLH 305 Query: 317 SGASAYQNTLNTLQICEYM-RNAGMK------IYSVAVS--APPEGQDLLRKCTDSSGQF 367 + ++ Y+ ++ +Y V + L D+ Sbjct: 306 NMGGDPITVIDXXXXXXYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKDNEQHV 365 Query: 368 FAVNDSRELLESFDKITDKIQEQSV 392 F V D L + F ++ D+ Q S+ Sbjct: 366 FKVKDMENLEDVFYQMIDESQSLSL 390 >gi|119383876|ref|YP_914932.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] gi|119373643|gb|ABL69236.1| von Willebrand factor, type A [Paracoccus denitrificans PD1222] Length = 282 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 52/156 (33%), Gaps = 31/156 (19%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKE 293 RIG + + PL+ +++ V +++ T + A + L Sbjct: 126 RIGLVIFGDRAY--FAQPLTFDVDAVARAVDEAQIGISGRATAISDGLGLAMKRLAA--- 180 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---- 349 S + V+ ++DG ++ + + + G++I+++A+ Sbjct: 181 -------SEAPTRVVVLMSDGVDTSGN-----VQAVDAARLAAGHGIRIHTIALGPEDLE 228 Query: 350 ------PPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 LR+ + S G F V +L Sbjct: 229 NQPRSRDAVDTKTLREVAELSGGTAFRVRGMADLEA 264 >gi|310817054|ref|YP_003965018.1| hypothetical protein EIO_2641 [Ketogulonicigenium vulgare Y25] gi|308755789|gb|ADO43718.1| conserved hypothetical protein [Ketogulonicigenium vulgare Y25] Length = 733 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 32/209 (15%), Positives = 64/209 (30%), Gaps = 23/209 (11%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN-----LVNSIQKAIQEKKN 233 Y P + S S + + + ++ G+ + + + E N Sbjct: 244 YFETPLEELSPLSARVPREAPEVTMVFVLDRSGSMQQAVGDSNRLGVAKNATLSALELLN 303 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 +IG I ++ + ++ + L++++ T YP + AYREL Sbjct: 304 PQSQIGVIVFDTEETTVVPLS-TLDIPAAQIALDRVDTGGGTAIYPGLVAAYRELQ---- 358 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S K +I +TDG + I + G + +VA+ + Sbjct: 359 ------RSESPAKHIIVMTDGLSQPGDW-------EGILRQITADGTTVSAVAIGVGADT 405 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDK 382 +G D L + Sbjct: 406 GAAENIARLGNGVAHISRDFEALPSILAQ 434 >gi|83648073|ref|YP_436508.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83636116|gb|ABC32083.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 261 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 62/204 (30%), Gaps = 26/204 (12%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 K + + + S RK+DV + V + A++ Sbjct: 72 KNYYIVFDGSGSMDNTDCGDGKRKLDVAKTAVKKFVEQLPADANVG--------VYAFDG 123 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 VG + + N VK +++L T + L + + Sbjct: 124 QGVGERTHLATQNRPLVKQMIDQLVAGGGTPLSAGLEDGKAALT-----AQAGKQLGYGE 178 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 ++ ITDG S + T+ +++ + I+++ L + G Sbjct: 179 YHLVIITDGLASLGYETDGAVQTI-----LQDTPINIHTIGFCIGD-DHSLHQ-----PG 227 Query: 366 QFFA--VNDSRELLESFDKITDKI 387 F +D L+ +++ + Sbjct: 228 LTFYRSASDPDSLMAGLNEVLAEA 251 >gi|48428050|sp|Q864V9|CFAB_GORGO RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690187|gb|AAM10005.1| complement factor B precursor [Gorilla gorilla] Length = 764 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 35/211 (16%), Positives = 71/211 (33%), Gaps = 31/211 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V + P S+N + Sbjct: 278 SDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSD---PDSSNADW 334 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + + R + +I +TDG + Sbjct: 335 VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLH 392 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 393 NMGG------DPITVIDEIRDLLYIGKDHKNPREDYLDVYVFGVGPLVNQVNINALASKK 446 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D L + F ++ D+ Q S+ Sbjct: 447 DNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|222055940|ref|YP_002538302.1| hypothetical protein Geob_2856 [Geobacter sp. FRC-32] gi|221565229|gb|ACM21201.1| conserved hypothetical protein [Geobacter sp. FRC-32] Length = 391 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 31/274 (11%), Positives = 83/274 (30%), Gaps = 8/274 (2%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 I++ V + AI++ ++ + + +A + + L+G +I + T Sbjct: 14 VGIMLVVFLVIAGLAINIGYMYVSEDDLHNAAELSALAGAQAIGQQMQLSARTGTGKLKE 73 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 TI + + +A I +N + E+ + Sbjct: 74 TI----YDQVQPAARAAAIDHVSGHHQASALIEIRNSNINRLTTENDLTMGFWNISSRTY 129 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ---KHNDNNNMTSNK 178 + + + +R+ E S L ++ +S S + Y + Sbjct: 130 TPGGTPVNAVQVRTRRTAESESAGLGTLGSILSKISGSQKLNYTPEAVAAIPALADANFS 189 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 + + + T + +P ++ ++D +S S+ + +LS I Sbjct: 190 VCVDACGTECTYPNICTIQERKLSPDSSDPRMDSPTKS-RYAYTSLSYPPGDTMSLSNLI 248 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + G + + + + + + Sbjct: 249 CMGMPPKEVCGKEIYTIRDRDDNALRDMESVMYN 282 >gi|99078203|ref|YP_611461.1| von Willebrand factor, type A [Ruegeria sp. TM1040] gi|99035341|gb|ABF62199.1| von Willebrand factor type A [Ruegeria sp. TM1040] Length = 477 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 21/143 (14%), Positives = 54/143 (37%), Gaps = 21/143 (14%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + P + + + + +++P T A+ A L + +E + Sbjct: 80 NDIEQLIAPAAGSRQAISQAVTQISPKGKTPLSAAVMQAADALRSSEEKAT--------- 130 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVS-APPEGQDLLRKCTD 362 VI I+DGE + L+ + + G+ ++++ A + L+ + Sbjct: 131 --VILISDGEETCG------LDPCAVGAELEARGVDFTLHAIGFGIADDAARAQLQCLAE 182 Query: 363 SSGQFF-AVNDSRELLESFDKIT 384 ++G F+ + + EL + ++ Sbjct: 183 NTGGFYRDASSASELTAALAQVA 205 >gi|301109920|ref|XP_002904040.1| conserved hypothetical protein [Phytophthora infestans T30-4] gi|262096166|gb|EEY54218.1| conserved hypothetical protein [Phytophthora infestans T30-4] Length = 2146 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 23/191 (12%), Positives = 55/191 (28%), Gaps = 35/191 (18%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL-----SNNLNEVKSRLNKLNP-- 271 +++ + N + I L +N ++ + Sbjct: 1906 FVLDCSGSMNGQPWNDLMAAWKEYVYNRIADGATLDLVSVVTFDNSAQIVYEARSITTVT 1965 Query: 272 -------YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 TN + A L + K ++F +DG + Sbjct: 1966 NARIQYRGGGTNYAAGLRSANEVLS--------RVNFDMFKPAIVFFSDGHP------CD 2011 Query: 325 TLNTLQICEYMR----NAGMKIYSVAVSAPPEGQDLLRKCTDS-SGQFFAVNDSRELLES 379 L ++ ++R G++ ++V ++L + + G + V EL + Sbjct: 2012 PLQGEELATHIRGCYERNGLQAFAVGFG--SINLNMLERVAEKLGGTYHHVLTGNELKAT 2069 Query: 380 FDKITDKIQEQ 390 F I+ + + Sbjct: 2070 FFSISASLSTR 2080 >gi|229051629|ref|ZP_04195099.1| hypothetical protein bcere0027_55330 [Bacillus cereus AH676] gi|228721740|gb|EEL73214.1| hypothetical protein bcere0027_55330 [Bacillus cereus AH676] Length = 452 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 25/202 (12%), Positives = 61/202 (30%), Gaps = 22/202 (10%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI----GIVGNQC 252 S K++ ++ N ++ I + G+ N Sbjct: 161 SGSMAGKVNGEVKMEAAKKAIYNYLDKIPDNANVMLRVYGHKGSNNENDKSLSCGSSEVM 220 Query: 253 TPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 PL N + + L+K P T A+ + V + Sbjct: 221 YPLQPYNKEQFNAALSKFGPKGWTPLASAIESVNADFKE--------YTGEENLNVVYIV 272 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCTDSS-GQFF 368 +DGE + + + + + + + + Q L+ ++ G + Sbjct: 273 SDGEETCGG------DPVNAAKNLNQSSTHAVVNIIGFDVKNSEQQQLKNTAEAGKGNYA 326 Query: 369 AVNDSRELLESFDKITDKIQEQ 390 V+ + EL ++ + +K+ ++ Sbjct: 327 TVSTADELYQTLNTEYEKLYKE 348 >gi|157817857|ref|NP_001102478.1| procollagen, type VI, alpha 3 [Rattus norvegicus] gi|149037629|gb|EDL92060.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_a [Rattus norvegicus] Length = 2207 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 34/290 (11%), Positives = 80/290 (27%), Gaps = 33/290 (11%) Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 S P +L +G+IP + + + + I L ++ Sbjct: 948 VAGRSTDAVAGPASSLKQRGVIPFIFQAKNANPSELEQIVPSPAFILAAESLPKIGDLQS 1007 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + + + S+ + P +L E +V Sbjct: 1008 EIVGLLKAKQTSDTV-------SGEKDVVFLIDGSEGVRSGFP------LLKEFVQRVVE 1054 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-T-NTYPA 280 S+ + VR+ + Y+ + V S + +L T NT A Sbjct: 1055 SL-----DVGPNQVRVALVQYSDRTRPEFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAA 1109 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + R + S + + +I +T + + + ++ G Sbjct: 1110 LDFVLRNILTSSTGSRIE---EGVPQLLIVLTAERSGDDVRGPSVV--------LKQGGA 1158 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ + A+ REL I++++ + Sbjct: 1159 V--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQVISERVIQL 1206 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 22/196 (11%), Positives = 61/196 (31%), Gaps = 20/196 (10%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + ++ E ++V S+ + + N ++ Sbjct: 43 VDSSWSAGKDRFLLVQEFLSDVVESLSVGDNDF-----HFALVRLNGNPHTEFLLNAYHS 97 Query: 259 LNEVKSRLNKLNPYENTN-TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 EV S + ++ +N T + + E S + + ++ +TDG++ Sbjct: 98 KQEVLSHILNMSYIGESNQTGKGLEYIIHSHLTEASGSRAA---DGVPQVIVVLTDGQSE 154 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRE 375 + +++A + +++V V + LR+ F + + Sbjct: 155 EDGFALPSA-------ELKSADVNVFAVGV--EDADERTLREIASEPLSMHVFNLENVTS 205 Query: 376 LLESFDKITDKIQEQS 391 L + + I Sbjct: 206 LHDMVGNLVSCIHSSV 221 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 53/182 (29%), Gaps = 23/182 (12%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + DV+ + N++ + N VR+G + Y+ + Sbjct: 250 SQNTGKANFDVIRDFLVNVLERLS-----VGNQQVRVGVVQYSDEPRTMFSLDSYPSKAA 304 Query: 262 VKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 V + +L+ N A+ + + + + + ++ I+ G Sbjct: 305 VLDAVKRLSFAGGELANIGQALDFVVENHFT---RTGGSRVEEGVPQVLVLISAG----- 356 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV---NDSREL 376 ++ ++ ++S + A + L+ F V +L Sbjct: 357 ---PSSDEIRDAVVALKQG--SVFSFGLGAQAASRVELQHIATDDNLVFTVPEFRSFGDL 411 Query: 377 LE 378 E Sbjct: 412 QE 413 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1645 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1704 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N S + R+ + IT G+ + Sbjct: 1705 KVIYKGGRHANTRVGIEH---LLKNHFVSEAGSRLDERVPQIAFVITGGK--------SV 1753 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1754 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1804 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 52/148 (35%), Gaps = 15/148 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNE 291 VRIG + ++ + + N V + +L + NT A+ R L+ Sbjct: 1469 NKVRIGVVQFSNDVFPEFYLKTHKSQNSVLEAIRRLRFKGGSPLNTGRALEFVARNLF-- 1526 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 S + + + ++ G++ + + + ++G+ S+ + Sbjct: 1527 -VKSAGSRIEDGVPQHLVLFLGGKSQDDVSRH--------AQVISSSGIM--SLGIGDRN 1575 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLES 379 + L+ T+ F V + REL Sbjct: 1576 IDRTDLQTITNDPRLVFTVREFRELPNI 1603 >gi|149037631|gb|EDL92062.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_c [Rattus norvegicus] Length = 2862 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 34/290 (11%), Positives = 80/290 (27%), Gaps = 33/290 (11%) Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 S P +L +G+IP + + + + I L ++ Sbjct: 948 VAGRSTDAVAGPASSLKQRGVIPFIFQAKNANPSELEQIVPSPAFILAAESLPKIGDLQS 1007 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + + + S+ + P +L E +V Sbjct: 1008 EIVGLLKAKQTSDTV-------SGEKDVVFLIDGSEGVRSGFP------LLKEFVQRVVE 1054 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-T-NTYPA 280 S+ + VR+ + Y+ + V S + +L T NT A Sbjct: 1055 SL-----DVGPNQVRVALVQYSDRTRPEFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAA 1109 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + R + S + + +I +T + + + ++ G Sbjct: 1110 LDFVLRNILTSSTGSRIE---EGVPQLLIVLTAERSGDDVRGPSVV--------LKQGGA 1158 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ + A+ REL I++++ + Sbjct: 1159 V--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQVISERVIQL 1206 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 22/196 (11%), Positives = 61/196 (31%), Gaps = 20/196 (10%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + ++ E ++V S+ + + N ++ Sbjct: 43 VDSSWSAGKDRFLLVQEFLSDVVESLSVGDNDF-----HFALVRLNGNPHTEFLLNAYHS 97 Query: 259 LNEVKSRLNKLNPYENTN-TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 EV S + ++ +N T + + E S + + ++ +TDG++ Sbjct: 98 KQEVLSHILNMSYIGESNQTGKGLEYIIHSHLTEASGSRAA---DGVPQVIVVLTDGQSE 154 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRE 375 + +++A + +++V V + LR+ F + + Sbjct: 155 EDGFALPSA-------ELKSADVNVFAVGV--EDADERTLREIASEPLSMHVFNLENVTS 205 Query: 376 LLESFDKITDKIQEQS 391 L + + I Sbjct: 206 LHDMVGNLVSCIHSSV 221 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 53/182 (29%), Gaps = 23/182 (12%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + DV+ + N++ + N VR+G + Y+ + Sbjct: 250 SQNTGKANFDVIRDFLVNVLERLS-----VGNQQVRVGVVQYSDEPRTMFSLDSYPSKAA 304 Query: 262 VKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 V + +L+ N A+ + + + + + ++ I+ G Sbjct: 305 VLDAVKRLSFAGGELANIGQALDFVVENHFT---RTGGSRVEEGVPQVLVLISAG----- 356 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV---NDSREL 376 ++ ++ ++S + A + L+ F V +L Sbjct: 357 ---PSSDEIRDAVVALKQG--SVFSFGLGAQAASRVELQHIATDDNLVFTVPEFRSFGDL 411 Query: 377 LE 378 E Sbjct: 412 QE 413 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1645 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1704 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N S + R+ + IT G+ + Sbjct: 1705 KVIYKGGRHANTRVGIEH---LLKNHFVSEAGSRLDERVPQIAFVITGGK--------SV 1753 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1754 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1804 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 52/148 (35%), Gaps = 15/148 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNE 291 VRIG + ++ + + N V + +L + NT A+ R L+ Sbjct: 1469 NKVRIGVVQFSNDVFPEFYLKTHKSQNSVLEAIRRLRFKGGSPLNTGRALEFVARNLF-- 1526 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 S + + + ++ G++ + + + ++G+ S+ + Sbjct: 1527 -VKSAGSRIEDGVPQHLVLFLGGKSQDDVSRH--------AQVISSSGIM--SLGIGDRN 1575 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLES 379 + L+ T+ F V + REL Sbjct: 1576 IDRTDLQTITNDPRLVFTVREFRELPNI 1603 >gi|149037630|gb|EDL92061.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_b [Rattus norvegicus] Length = 2867 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 34/290 (11%), Positives = 80/290 (27%), Gaps = 33/290 (11%) Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 S P +L +G+IP + + + + I L ++ Sbjct: 948 VAGRSTDAVAGPASSLKQRGVIPFIFQAKNANPSELEQIVPSPAFILAAESLPKIGDLQS 1007 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + + + S+ + P +L E +V Sbjct: 1008 EIVGLLKAKQTSDTV-------SGEKDVVFLIDGSEGVRSGFP------LLKEFVQRVVE 1054 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-T-NTYPA 280 S+ + VR+ + Y+ + V S + +L T NT A Sbjct: 1055 SL-----DVGPNQVRVALVQYSDRTRPEFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAA 1109 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + R + S + + +I +T + + + ++ G Sbjct: 1110 LDFVLRNILTSSTGSRIE---EGVPQLLIVLTAERSGDDVRGPSVV--------LKQGGA 1158 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ + A+ REL I++++ + Sbjct: 1159 V--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQVISERVIQL 1206 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 22/196 (11%), Positives = 61/196 (31%), Gaps = 20/196 (10%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + ++ E ++V S+ + + N ++ Sbjct: 43 VDSSWSAGKDRFLLVQEFLSDVVESLSVGDNDF-----HFALVRLNGNPHTEFLLNAYHS 97 Query: 259 LNEVKSRLNKLNPYENTN-TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 EV S + ++ +N T + + E S + + ++ +TDG++ Sbjct: 98 KQEVLSHILNMSYIGESNQTGKGLEYIIHSHLTEASGSRAA---DGVPQVIVVLTDGQSE 154 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRE 375 + +++A + +++V V + LR+ F + + Sbjct: 155 EDGFALPSA-------ELKSADVNVFAVGV--EDADERTLREIASEPLSMHVFNLENVTS 205 Query: 376 LLESFDKITDKIQEQS 391 L + + I Sbjct: 206 LHDMVGNLVSCIHSSV 221 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 53/182 (29%), Gaps = 23/182 (12%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + DV+ + N++ + N VR+G + Y+ + Sbjct: 250 SQNTGKANFDVIRDFLVNVLERLS-----VGNQQVRVGVVQYSDEPRTMFSLDSYPSKAA 304 Query: 262 VKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 V + +L+ N A+ + + + + + ++ I+ G Sbjct: 305 VLDAVKRLSFAGGELANIGQALDFVVENHFT---RTGGSRVEEGVPQVLVLISAG----- 356 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV---NDSREL 376 ++ ++ ++S + A + L+ F V +L Sbjct: 357 ---PSSDEIRDAVVALKQG--SVFSFGLGAQAASRVELQHIATDDNLVFTVPEFRSFGDL 411 Query: 377 LE 378 E Sbjct: 412 QE 413 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1645 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1704 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N S + R+ + IT G+ + Sbjct: 1705 KVIYKGGRHANTRVGIEH---LLKNHFVSEAGSRLDERVPQIAFVITGGK--------SV 1753 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1754 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1804 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 52/148 (35%), Gaps = 15/148 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNE 291 VRIG + ++ + + N V + +L + NT A+ R L+ Sbjct: 1469 NKVRIGVVQFSNDVFPEFYLKTHKSQNSVLEAIRRLRFKGGSPLNTGRALEFVARNLF-- 1526 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 S + + + ++ G++ + + + ++G+ S+ + Sbjct: 1527 -VKSAGSRIEDGVPQHLVLFLGGKSQDDVSRH--------AQVISSSGIM--SLGIGDRN 1575 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLES 379 + L+ T+ F V + REL Sbjct: 1576 IDRTDLQTITNDPRLVFTVREFRELPNI 1603 >gi|149037634|gb|EDL92065.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_e [Rattus norvegicus] Length = 2254 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 34/290 (11%), Positives = 80/290 (27%), Gaps = 33/290 (11%) Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 S P +L +G+IP + + + + I L ++ Sbjct: 340 VAGRSTDAVAGPASSLKQRGVIPFIFQAKNANPSELEQIVPSPAFILAAESLPKIGDLQS 399 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + + + S+ + P +L E +V Sbjct: 400 EIVGLLKAKQTSDTV-------SGEKDVVFLIDGSEGVRSGFP------LLKEFVQRVVE 446 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-T-NTYPA 280 S+ + VR+ + Y+ + V S + +L T NT A Sbjct: 447 SL-----DVGPNQVRVALVQYSDRTRPEFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAA 501 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + R + S + + +I +T + + + ++ G Sbjct: 502 LDFVLRNILTSSTGSRIE---EGVPQLLIVLTAERSGDDVRGPSVV--------LKQGGA 550 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ + A+ REL I++++ + Sbjct: 551 V--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQVISERVIQL 598 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1037 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1096 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N S + R+ + IT G+ + Sbjct: 1097 KVIYKGGRHANTRVGIEH---LLKNHFVSEAGSRLDERVPQIAFVITGGK--------SV 1145 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1146 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1196 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 52/148 (35%), Gaps = 15/148 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNE 291 VRIG + ++ + + N V + +L + NT A+ R L+ Sbjct: 861 NKVRIGVVQFSNDVFPEFYLKTHKSQNSVLEAIRRLRFKGGSPLNTGRALEFVARNLF-- 918 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 S + + + ++ G++ + + + ++G+ S+ + Sbjct: 919 -VKSAGSRIEDGVPQHLVLFLGGKSQDDVSRH--------AQVISSSGIM--SLGIGDRN 967 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLES 379 + L+ T+ F V + REL Sbjct: 968 IDRTDLQTITNDPRLVFTVREFRELPNI 995 >gi|149037632|gb|EDL92063.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_d [Rattus norvegicus] gi|149037633|gb|EDL92064.1| procollagen, type VI, alpha 3 (predicted), isoform CRA_d [Rattus norvegicus] Length = 2140 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 34/290 (11%), Positives = 80/290 (27%), Gaps = 33/290 (11%) Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 S P +L +G+IP + + + + I L ++ Sbjct: 948 VAGRSTDAVAGPASSLKQRGVIPFIFQAKNANPSELEQIVPSPAFILAAESLPKIGDLQS 1007 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + + + S+ + P +L E +V Sbjct: 1008 EIVGLLKAKQTSDTV-------SGEKDVVFLIDGSEGVRSGFP------LLKEFVQRVVE 1054 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-T-NTYPA 280 S+ + VR+ + Y+ + V S + +L T NT A Sbjct: 1055 SL-----DVGPNQVRVALVQYSDRTRPEFYLNSHMDQQGVISAIRRLTLLGGPTPNTGAA 1109 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + R + S + + +I +T + + + ++ G Sbjct: 1110 LDFVLRNILTSSTGSRIE---EGVPQLLIVLTAERSGDDVRGPSVV--------LKQGGA 1158 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ + A+ REL I++++ + Sbjct: 1159 V--PIGIGIGNADISEMQTISFIPDFAVAIPTFRELGTIQQVISERVIQL 1206 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 22/196 (11%), Positives = 61/196 (31%), Gaps = 20/196 (10%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + ++ E ++V S+ + + N ++ Sbjct: 43 VDSSWSAGKDRFLLVQEFLSDVVESLSVGDNDF-----HFALVRLNGNPHTEFLLNAYHS 97 Query: 259 LNEVKSRLNKLNPYENTN-TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 EV S + ++ +N T + + E S + + ++ +TDG++ Sbjct: 98 KQEVLSHILNMSYIGESNQTGKGLEYIIHSHLTEASGSRAA---DGVPQVIVVLTDGQSE 154 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRE 375 + +++A + +++V V + LR+ F + + Sbjct: 155 EDGFALPSA-------ELKSADVNVFAVGV--EDADERTLREIASEPLSMHVFNLENVTS 205 Query: 376 LLESFDKITDKIQEQS 391 L + + I Sbjct: 206 LHDMVGNLVSCIHSSV 221 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 21/182 (11%), Positives = 53/182 (29%), Gaps = 23/182 (12%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + DV+ + N++ + N VR+G + Y+ + Sbjct: 250 SQNTGKANFDVIRDFLVNVLERLS-----VGNQQVRVGVVQYSDEPRTMFSLDSYPSKAA 304 Query: 262 VKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 V + +L+ N A+ + + + + + ++ I+ G Sbjct: 305 VLDAVKRLSFAGGELANIGQALDFVVENHFT---RTGGSRVEEGVPQVLVLISAG----- 356 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV---NDSREL 376 ++ ++ ++S + A + L+ F V +L Sbjct: 357 ---PSSDEIRDAVVALKQG--SVFSFGLGAQAASRVELQHIATDDNLVFTVPEFRSFGDL 411 Query: 377 LE 378 E Sbjct: 412 QE 413 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 28/173 (16%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + S+R+G + YN + ++ +N Sbjct: 1645 INFRRDSFQEVLRFASEIVDTVYEDGDSIRVGLVQYNSDPTDEFFLRDFSTKRQIIDAIN 1704 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H L N S + R+ + IT G+ + Sbjct: 1705 KVIYKGGRHANTRVGIEH---LLKNHFVSEAGSRLDERVPQIAFVITGGK--------SV 1753 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V +EL E Sbjct: 1754 EDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1804 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/148 (15%), Positives = 52/148 (35%), Gaps = 15/148 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNE 291 VRIG + ++ + + N V + +L + NT A+ R L+ Sbjct: 1469 NKVRIGVVQFSNDVFPEFYLKTHKSQNSVLEAIRRLRFKGGSPLNTGRALEFVARNLF-- 1526 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 S + + + ++ G++ + + + ++G+ S+ + Sbjct: 1527 -VKSAGSRIEDGVPQHLVLFLGGKSQDDVSRH--------AQVISSSGIM--SLGIGDRN 1575 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLES 379 + L+ T+ F V + REL Sbjct: 1576 IDRTDLQTITNDPRLVFTVREFRELPNI 1603 >gi|260797475|ref|XP_002593728.1| hypothetical protein BRAFLDRAFT_199696 [Branchiostoma floridae] gi|229278956|gb|EEN49739.1| hypothetical protein BRAFLDRAFT_199696 [Branchiostoma floridae] Length = 186 Score = 54.1 bits (128), Expect = 3e-05, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 67/192 (34%), Gaps = 16/192 (8%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + + + ++V++ ++ + R+G + Y+ + Sbjct: 10 SGSVGPDNFETVKQFVVDVVSAFTISLTD-----TRVGVVQYSDFNTLACNLGDHPDEAS 64 Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + +N + T T AM +A +L + L + +I +TDG++ Sbjct: 65 FVTAINTMQYQGGGTATGDAMEYARVKLQAVWRPAPTPRK-FPLFQIMIVLTDGKSGDDV 123 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 G+ +Y++ V A + +LL + + + D L S Sbjct: 124 VAAAQALAAD--------GVTVYAIGV-ANFDTAELLEITNGNQDRVIELKDYTALTASI 174 Query: 381 DKITDKIQEQSV 392 + I + + ++ Sbjct: 175 NSIIRALCKGTI 186 >gi|332256817|ref|XP_003277513.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-3(VI) chain-like [Nomascus leucogenys] Length = 3172 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 1036 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNQQDVVN 1090 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 1091 AVRQLTLLGGPTPNTGAALEFVLRNMLVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 1147 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 1148 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 1197 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1198 ISERVTQLT 1206 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1470 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1526 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1527 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1576 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1577 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1609 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 30/314 (9%), Positives = 84/314 (26%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + +A A+ + N+F G+ + L Sbjct: 123 MQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFSIGVEDADEGAL 182 Query: 132 SLRSTGIIE--RSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + + S+ ++ S + + T F Sbjct: 183 KEIASEPLNMHVFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIF 242 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + L+N ++K +R+G + ++ Sbjct: 243 LIDGSNNTGSVNFAVI-----------LDFLINLLEKLPI--GTQQIRVGVVQFSDEPRT 289 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + +V + L N A+ + + + + + Sbjct: 290 MFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQV 346 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G + + + ++ A ++S + A + L+ Sbjct: 347 LVLISAGP------SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLV 396 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 397 FTVPEFRSFGDLQE 410 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1645 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1704 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1705 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1754 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1755 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1804 >gi|327299330|ref|XP_003234358.1| hypothetical protein TERG_04951 [Trichophyton rubrum CBS 118892] gi|326463252|gb|EGD88705.1| hypothetical protein TERG_04951 [Trichophyton rubrum CBS 118892] Length = 741 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 63/211 (29%), Gaps = 19/211 (9%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE-SAGNLVNSIQKAIQEKKNLSVR 237 +P P + + S + AP P + S +L K I E N R Sbjct: 63 NDVPHVPCDIVLVIDISGSMNSAAPIPTGERGGEDTGLSILDLTKHAAKTIIETLNEKDR 122 Query: 238 IGTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 + + + + N N + V ++KL +TN + M L Sbjct: 123 LAVVTFCTEVNVAFELDSMNKENKSTVLGAIDKLYGKSSTNLWHGMKKGLNILATNP--- 179 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK-----IYSVAVSAP 350 + + ++ +TDG + + + + I++ Sbjct: 180 -----AQGKIQSLLVLTDGAPNH-MCPAQGYVPKLRQTLLDHHNLTGTLPLIHTFGFGYY 233 Query: 351 PEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 LL+ + G F + D+ + F Sbjct: 234 LRSP-LLQSIAEIGGGTFAFIPDAGMIGTVF 263 >gi|304347707|gb|ADM25314.1| MIC2-like protein 1 [Neospora caninum] gi|325118031|emb|CBZ53582.1| hypothetical protein NCLIV_033690 [Neospora caninum Liverpool] Length = 756 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 29/204 (14%), Positives = 70/204 (34%), Gaps = 32/204 (15%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--PLS 256 + + +R + + + + ++ + + V + + + P + Sbjct: 73 VDDSASIGSRNFEQVRKFVLDFIDLVP-----ISSEEVHLSVVTFADSPQDVFTFKQPQA 127 Query: 257 NNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 N K L +T T + A R L + K ++ +TDG Sbjct: 128 TNKQLAKEAFKYLRYRRGGSTATDKGLIRARRYLT-----RPVYGTRANVPKVLVLMTDG 182 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC--TDSSGQF----- 367 E+ +T+Q + R G+ ++ V V + C G++ Sbjct: 183 ESDRH------YDTIQAADQARAEGISVFVVGVGMANPVE-----CRGVCGCGRYGPCPQ 231 Query: 368 FAVNDSRELLESFDKITDKIQEQS 391 F +++ EL+++ D I ++ ++ Sbjct: 232 FIMSNWNELVQTVDSIMGEVCKKL 255 >gi|108759903|ref|YP_633800.1| BatB protein [Myxococcus xanthus DK 1622] gi|108463783|gb|ABF88968.1| batB protein [Myxococcus xanthus DK 1622] Length = 343 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 25/166 (15%), Positives = 59/166 (35%), Gaps = 48/166 (28%) Query: 253 TPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 +PL+++ + VK L ++P TN A+ + + L N S ++ V Sbjct: 148 SPLTSDYSAVKLFLRAVDPEVMPQGGTNVGAALRLSRQVLENADRGS--------KERVV 199 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 + +TDGE+ + + E ++++G+++ +V V + Sbjct: 200 VLLTDGEDLVG-------DVAEATEALKDSGVQVLAVGVGSESGEPIPVFDRRGAFVDYK 252 Query: 353 ------------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L + + G F+ + + ++I Sbjct: 253 KDAAGETVITRLDRAGLTAIAEATGGTFYFQPRGVAMSQVVERIDQ 298 >gi|311251244|ref|XP_003124541.1| PREDICTED: integrin alpha-M-like [Sus scrofa] Length = 448 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 17/91 (18%), Positives = 32/91 (35%), Gaps = 10/91 (10%) Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---PPEGQDLLR 358 K ++ ITDGE + + L + G+ Y + V + ++ L Sbjct: 203 ENALKILVVITDGE-----KFGDPLGYEDVIPEADRKGVIRYVIGVGDAFNSWKSREELN 257 Query: 359 KCTD--SSGQFFAVNDSRELLESFDKITDKI 387 S F V + L +++ +KI Sbjct: 258 TIASKPSGDHVFQVTNFEALKTIQNQLQEKI 288 >gi|170739681|ref|YP_001768336.1| von Willebrand factor type A [Methylobacterium sp. 4-46] gi|168193955|gb|ACA15902.1| von Willebrand factor type A [Methylobacterium sp. 4-46] Length = 329 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 50/153 (32%), Gaps = 29/153 (18%) Query: 247 IVGNQCTPLSNNLNEVKSRL--NKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 + LS + V L ++ +T + A + L Sbjct: 153 DQADVAASLSFDTAAVVHALEEAQIGLVGRSTGIGDGLGLALKRLDAATARE-------- 204 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA----------PPEG 353 K VI ++DG N+ + R+ G++++++A+ Sbjct: 205 --KVVILLSDGANNAGQTT-----PHDVAGLARDLGIRVHTIALGPRDLSDADGDPDVVD 257 Query: 354 QDLLRKC-TDSSGQFFAVNDSRELLESFDKITD 385 + LR S G+FF V + +L D I + Sbjct: 258 TEALRDVSATSGGRFFRVRTTDDLAAVADAIDE 290 >gi|116748933|ref|YP_845620.1| hypothetical protein Sfum_1496 [Syntrophobacter fumaroxidans MPOB] gi|116697997|gb|ABK17185.1| conserved hypothetical protein [Syntrophobacter fumaroxidans MPOB] Length = 427 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 31/357 (8%), Positives = 82/357 (22%), Gaps = 49/357 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSAL--------------DAAVLSGCASIVS 46 + A+ + + F +A+D+ ++ +RN++Q+A D + A+++ Sbjct: 22 IVALALVMLLGFGAFAVDIGYLYVVRNELQNAADAGALAGAAALYNNDGTAVQPTANVIG 81 Query: 47 DRTIKDPTTKKDQTSTIFKKQIKK------HLKQGSYIRENAGDIAQKAQINITKDKNNP 100 T + G++ + ++ + N Sbjct: 82 QEAAMRNTAVRTAVEVTLNGNSGDVQRGHYSFATGTFTPNASLLPVSLWNVSTEELDANT 141 Query: 101 LQYI-AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + P F + L+ + I + +++ Sbjct: 142 DFINAVKVTTHRSAPAAPSFFSRIFGYDSFALAAEAVAYIGFAGTLYPLNV--------- 192 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + S + S A + N Sbjct: 193 --------DKPIAMCQESLLDAEGNYSCSIGRMINSGSGAGHNTAAWTNYSQPCETANAN 244 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 + + S I + + ++ L Sbjct: 245 DMAQL----LAGCGGSNPTPIILGTGMGTSGGMQDSTYRKSIFSCWIDSLRAG------N 294 Query: 280 AMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + + + ++ V++ITD N+ Y + + E Sbjct: 295 PLPLVLPVVECPGNNPGPCSKVVGAVEVTVVWITDVINTNKLNYDEVPSMMNTAEKA 351 >gi|315613111|ref|ZP_07888021.1| collagen adhesion protein [Streptococcus sanguinis ATCC 49296] gi|315314673|gb|EFU62715.1| collagen adhesion protein [Streptococcus sanguinis ATCC 49296] Length = 863 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 44/331 (13%), Positives = 89/331 (26%), Gaps = 57/331 (17%) Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 A+S + + LSL T + ++ + + +V D+S SM++ Sbjct: 34 NADSTTEPQTTLHKTITPISGQDDKYELSLDITSKLGTETQTDPLDVVLVADLSGSMQNQ 93 Query: 164 Y--LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR-KIDVLI-ESAGN 219 + + + K L + + KID + + Sbjct: 94 DVQSFDGRTISRIDALKNTLRGTNGRKGLIDTILSNSNNRLSMVGFGGKIDNKKVDQYWD 153 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN----- 274 + N SNN K+ + ++ Sbjct: 154 GNKWRLFRPYWPYERMTKYYDGVSPWDDA-NTILGWSNNARAAKTAVYNMSIAGGNSIGT 212 Query: 275 -------TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG--------- 318 TN + A + + + + ++ KK VI ++DG + Sbjct: 213 ESGIGTGTNIGAGLTLANQLMGSARSNA---------KKVVILLSDGFANMVYDANGYTI 263 Query: 319 ASAYQNTLNTLQIC--------EYMRNAGMKI-------YSVAVSAPPEGQD-------L 356 + N + + + YS+ + Sbjct: 264 YNYNNEDPNIETAPQWFWDRLNNNLNSLSYSLAPTLDGFYSIKFRYSNNVDSITSLQYYM 323 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + + F+ ND +L +SF ITDKI Sbjct: 324 RQHNASIPNEIFSANDEDQLRDSFKDITDKI 354 >gi|293365338|ref|ZP_06612055.1| collagen adhesion protein [Streptococcus oralis ATCC 35037] gi|307703880|ref|ZP_07640821.1| von Willebrand factor type A domain protein [Streptococcus oralis ATCC 35037] gi|291316788|gb|EFE57224.1| collagen adhesion protein [Streptococcus oralis ATCC 35037] gi|307622715|gb|EFO01711.1| von Willebrand factor type A domain protein [Streptococcus oralis ATCC 35037] Length = 863 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 44/331 (13%), Positives = 89/331 (26%), Gaps = 57/331 (17%) Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 A+S + + LSL T + ++ + + +V D+S SM++ Sbjct: 34 NADSTTEPQTTLHKTITPISGQDDKYELSLDITSKLGTETQTDPLDVVLVADLSGSMQNQ 93 Query: 164 Y--LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR-KIDVLI-ESAGN 219 + + + K L + + KID + + Sbjct: 94 DVQSFDGRTISRIDALKNTLRGTNGRKGLIDTILSNSNNRLSMVGFGGKIDNKKVDQYWD 153 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN----- 274 + N SNN K+ + ++ Sbjct: 154 GNKWRLFRPYWPYERMTKYYDGVSPWDDA-NTILGWSNNARAAKTAVYNMSIAGGNSIGT 212 Query: 275 -------TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG--------- 318 TN + A + + + + ++ KK VI ++DG + Sbjct: 213 ESGIGTGTNIGAGLTLANQLMGSARSNA---------KKVVILLSDGFANMVYDANGYTI 263 Query: 319 ASAYQNTLNTLQIC--------EYMRNAGMKI-------YSVAVSAPPEGQD-------L 356 + N + + + YS+ + Sbjct: 264 YNYNNEDPNIETAPQWFWDRLNNNLNSLSYSLAPTLDGFYSIKFRYSNNVDSITSLQYYM 323 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + + F+ ND +L +SF ITDKI Sbjct: 324 RQHNASIPNEIFSANDEDQLRDSFKDITDKI 354 >gi|86360582|ref|YP_472470.1| hypothetical protein RHE_PE00308 [Rhizobium etli CFN 42] gi|86284684|gb|ABC93743.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 533 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 24/196 (12%), Positives = 51/196 (26%), Gaps = 22/196 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +T I + + F ID+ + +Q+A+DA L+G + D T + Sbjct: 20 LTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGAREL--DGRDDAITRAQTAI 77 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I ++ + ++T + N LFL Sbjct: 78 EKIANSAA---------FSGGGTGMSLGSNSSVTYEAGNDAGNTVTV----------LFL 118 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLA-ISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 K + T + + I+ + + + + + Y Sbjct: 119 KSIPADDDTPIPASMETTEPSEASYAWVIAKPQAMQTIFPIPVGFNRDTINIAADAVAVY 178 Query: 180 LLPPPPKKSFWSKNTT 195 + N Sbjct: 179 HASACDVTPIFICNPF 194 >gi|241760758|ref|ZP_04758849.1| type IV fimbrial tip adhesin [Neisseria flavescens SK114] gi|241318655|gb|EER55207.1| type IV fimbrial tip adhesin [Neisseria flavescens SK114] Length = 1065 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 13/83 (15%), Positives = 33/83 (39%), Gaps = 7/83 (8%) Query: 310 FITDGENSGA-SAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--PEGQDLLRKCTDSSGQ 366 F TDG ++ S + + + + ++ ++V P G+ L+ Sbjct: 260 FKTDGTDAAGKSWNGDPKDPADYSKQL----VQTFTVGFGQGITPTGKRYLQLAASRPEY 315 Query: 367 FFAVNDSRELLESFDKITDKIQE 389 ++ + L + F+ I ++I+ Sbjct: 316 YYEADKPESLSKVFNDIVEQIKS 338 >gi|254456154|ref|ZP_05069583.1| type II Secretion PilY1 [Candidatus Pelagibacter sp. HTCC7211] gi|207083156|gb|EDZ60582.1| type II Secretion PilY1 [Candidatus Pelagibacter sp. HTCC7211] Length = 1384 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 47/385 (12%), Positives = 101/385 (26%), Gaps = 25/385 (6%) Query: 30 QSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKA 89 Q+ + A + + S ++ + L A A Sbjct: 179 QNLSNGASKMCGNPSGFRYSHAMTGSGNYLYSNYSRRIYRGTLTSSGSNLCPTNIKAHNA 238 Query: 90 QI-----NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 +T NP + ++ ++I + G + R + ++ Sbjct: 239 YGMSWVYGMTAHPTNPNELYVMARNMHQIFKITVNSAG-TGHTVNWRKGRWGYQKQSTAT 297 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + L + +N + + K + + +A A Sbjct: 298 QNYFYYPWGIHYDDDNHRLIVSGYNQKKIQVFDNNGVFIKDKGGMAATTRMAAAHAAIKA 357 Query: 205 PANRKIDVLIESAGNLVNSIQK--------AIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 L + ++ G+ + Sbjct: 358 IVTD--SNLTSGVNFGFGYWSSRWSARQWPPGFSSWSGNITTGSARPCDTQNCLKVRVHK 415 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + ++ ++ +N T+ Y ++ YN S S K +VI I DG+ Sbjct: 416 DGAAQINKIISSVNARGGTDPYT-FMKIAQDYYNHGSLSPIDKKSPCQKSYVIVIGDGDF 474 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--PEGQDLLRKCTDSSGQFFA--VND 372 + + +N + + + N G+K + VA P G + G A Sbjct: 475 FPSMGWNTHIN---MAKALNNKGIKTFGVAFGTGISPGGLKRFNDLAKAGGTTKAIIAPT 531 Query: 373 SRELLESFD-KITDKIQEQSVRIAP 396 + L I+ I + AP Sbjct: 532 AAALKTQLQAAISQIIASKLSFTAP 556 >gi|116620210|ref|YP_822366.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116223372|gb|ABJ82081.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 311 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 63/154 (40%), Gaps = 23/154 (14%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 PL+++ ++ L + +T A+ A E+ + K++ K Sbjct: 131 DSSPRLVVPLTSDTGTIEDHLTFSRSHGSTALLDAIFLALHEMKHSKKN----------K 180 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPPEGQD------LLR 358 K ++ I+DG ++ + + ++++ ++ + + IYS+ ++ L + Sbjct: 181 KALLIISDGGDNHSRYSEKEVSSV-----VKESDVLIYSIGVFGGGGSPEEAGGPGLLSK 235 Query: 359 KCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G+ F + EL + KI +++ + + Sbjct: 236 VSEQTGGRLFEA-SAVELPDIAKKIGIELRNRYI 268 >gi|302832626|ref|XP_002947877.1| hypothetical protein VOLCADRAFT_103646 [Volvox carteri f. nagariensis] gi|300266679|gb|EFJ50865.1| hypothetical protein VOLCADRAFT_103646 [Volvox carteri f. nagariensis] Length = 733 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 39/403 (9%), Positives = 98/403 (24%), Gaps = 54/403 (13%) Query: 18 DLAHIMYIRNQMQSA-----------LDAAV----LSGCASIVSDRTIKDPTTKKDQTST 62 ++ I R + A LDAA L+ + + S+ Sbjct: 26 EMNRIRATRALLAQAGLLPVPRRHELLDAAANLPELAALFQPAAAAAAAAAASDTGADSS 85 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 + + + A+ ++P L Sbjct: 86 ASTSPVSNIPSPSELQSKLQALEKAVRAAPASAVGRESNGVEGNGSAESKVPQTALESPQ 145 Query: 123 LIPSAL-------------TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 A VL +S E Sbjct: 146 PKDVDSGLTPGAAPGAAPAAAPGAAPGAAPAAVGAATAAEEVDVLSISVVPEYDQYGLEA 205 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 + + + P+++ + + I ++ E+ L++ + Sbjct: 206 EAVRAVVSIKAIADVPERARVALTCVLDRSGSMSGG---PIRLVRETCHFLIDQLTSDD- 261 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 +G I+Y + + + + + +L +T Y + R+ Sbjct: 262 -------FLGLISYAHDVREDLPLLRMTPASRTLAHAVVEELVAGGSTALYDGLVAGLRQ 314 Query: 288 LYNEKES------------SHNTIGSTRLKKFVIFITDGENSGA-SAYQNTLNTLQICEY 334 + ++ S L TDG+ + S + + LQ + Sbjct: 315 QMAAERDLGGGNGASGGASDSSSPSSLSLVHSCFLFTDGQATDGPSNPASIIEGLQAAQA 374 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + +++ + L + SG ++ ++ ++ Sbjct: 375 PSGQHVTVHTFGFGNGHSVELLQQVAEAQSGVYYYISCEEDIA 417 >gi|223694808|gb|ACN18090.1| von Willebrand factor type A [uncultured bacterium BLR5] Length = 347 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 20/135 (14%), Positives = 49/135 (36%), Gaps = 12/135 (8%) Query: 264 SRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 +++ ++ T N A + T R ++ +I +TDG+++ + Sbjct: 171 RKISGISSGPGTIANPREG-STAIWDAVTVSAGEILTRSPGRRRRAIILLTDGQDTSSRV 229 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVS---APPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 + T + A IY++ + + L + + G+ F +L Sbjct: 230 TRGT-----AIDKALEAETVIYAIGIGDSKYEGIDKGALNNVAERTGGRAFFPKRGADLT 284 Query: 378 ESFDKITDKIQEQSV 392 F +I +++ Q + Sbjct: 285 SVFTEIEKELRSQYL 299 >gi|254412101|ref|ZP_05025876.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] gi|196181067|gb|EDX76056.1| von Willebrand factor type A domain protein [Microcoleus chthonoplastes PCC 7420] Length = 570 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 45/133 (33%), Gaps = 20/133 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N ++ L Y T Y A +A L + V+ +TDGE+ Sbjct: 447 GRNRGLEFISGLQAYGGTKLYDAALYARNWLQDNPRPDAI--------NAVLILTDGED- 497 Query: 318 GASAYQNTLNTLQICEYMRNAG------MKIYSVAVS-APPEGQDLLRKCTDSSGQFFAV 370 + +N Q+ + ++ +G + +++ + L+ D + ++ Sbjct: 498 ----SGSQINLNQLEQELQQSGFNSDQRIAFFTIGYGKEGDFDPEALKAIADLNAGYYRK 553 Query: 371 NDSRELLESFDKI 383 D + D + Sbjct: 554 GDPETIATVMDDL 566 >gi|149414665|ref|XP_001516049.1| PREDICTED: similar to integrin alpha 11 subunit [Ornithorhynchus anatinus] Length = 1194 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 41/245 (16%), Positives = 75/245 (30%), Gaps = 34/245 (13%) Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 S N N +K + P + + + I +E Sbjct: 136 GSSYYTTGMCSRVNANFRFSKTVAPALQRCQTYMDII-------IVLDGSNSIYPWVEVQ 188 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT-- 275 L+N ++K +++G + Y +V ++ +V + + T Sbjct: 189 HFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEAASHIEQRGGTET 246 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T + A E + G KK +I ITDGE ++ + Q+ + Sbjct: 247 RTAYGIEFARSEAF-------QKGGRKGAKKVMIVITDGE------SHDSPDLEQVIDAS 293 Query: 336 RNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRELLESFDKITD 385 + Y+VAV + L + FF V D L + D + D Sbjct: 294 EKDNITRYAVAVLGYYNRRGINPEAFLSEIKYIASDPDDKHFFNVTDEAALKDIVDALGD 353 Query: 386 KIQEQ 390 +I Sbjct: 354 RIFSL 358 >gi|330508298|ref|YP_004384726.1| hypothetical protein MCON_2454 [Methanosaeta concilii GP-6] gi|328929106|gb|AEB68908.1| conserved hypothetical protein, extracellular or membrane bound [Methanosaeta concilii GP-6] Length = 726 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/171 (15%), Positives = 59/171 (34%), Gaps = 23/171 (13%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG-NQCTPLSNNLNEVKSRLN 267 K+D E+A +++ + N R +++ + + ++ K L+ Sbjct: 292 KMDQAKEAARYVLDHL--------NPLDRFAIVSFATTTRSFSPSLEPAAQADKGKDFLD 343 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +L +T+ AM A + ++ +IF+TDG + Sbjct: 344 RLEAMGSTDINRAMIEAVGLAEEVRPTT------------LIFLTDGLPTEGVTVT-GAI 390 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + ++I+S V + L + D+ G V E+ E Sbjct: 391 LDNVAREAPD-NVRIFSFGVGDDVDTDLLDQISMDNGGASTYVRPGEEIDE 440 >gi|281352223|gb|EFB27807.1| hypothetical protein PANDA_008059 [Ailuropoda melanoleuca] Length = 907 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 48/133 (36%), Gaps = 24/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ M A++ + + ++ +TDGE++ Sbjct: 379 ANGGTSICAGMRSAFQVIREVYPQIDGSE--------IVLLTDGEDNS---------AKD 421 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKIT--- 384 + + +G I+ +A+ P Q ++ + G F +D + L+++F + Sbjct: 422 CIDEVTQSGAIIHLIALG-PSADQAVIEMSAMTGGNHFFASDEAQNNGLIDAFGALASGN 480 Query: 385 DKIQEQSVRIAPN 397 + +Q +++ Sbjct: 481 TDLSQQPLQLESK 493 >gi|192359695|ref|YP_001982895.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190685860|gb|ACE83538.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 660 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 22/131 (16%), Positives = 54/131 (41%), Gaps = 18/131 (13%) Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA---- 319 S +N + +T+ A+ A +++ + ++ +TDG Sbjct: 111 SAINSVAL--HTHIGAALEKAAQDVVAGDDGFRRN---------LVLLTDGVVDIDPEAV 159 Query: 320 -SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELL 377 + + ++ ++ AG ++++A+S Q+L++K + G F + EL+ Sbjct: 160 VNIQERKRILTELLPQLKAAGYVVHTIALSQDA-DQELMKKLALTTDGVFAVAQSADELM 218 Query: 378 ESFDKITDKIQ 388 ++F I D+ Sbjct: 219 QAFLTIFDQAV 229 >gi|209546481|ref|YP_002278399.1| hypothetical protein Rleg2_4401 [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209537725|gb|ACI57659.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 534 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 19/120 (15%), Positives = 37/120 (30%), Gaps = 11/120 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +T I + + F ID+ + +Q+A+DA L+G + D + + Sbjct: 20 LTLIAMPMLLGFSLLVIDVGRSSNLHTDLQNAVDAMALAGAREL--DGRDDAISRAQTAI 77 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I ++ + I++T D N IP + Sbjct: 78 EKIANSAA---------FSGGGAGMSLGSHISVTYDAGNDAGSTVTVFFLKNIPANDDTS 128 >gi|126306106|ref|XP_001362488.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 911 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 63/169 (37%), Gaps = 27/169 (15%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKES 294 G + ++ + S +++L T+ + A+ + + + Sbjct: 343 WTGMVTFDSSATIQSALIQIETDAQRNSLISRLPTAAGGGTSICSGLRTAFTVIKKKFST 402 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 ++ +TDGE+S S+ + + + +G I++VA+ P Sbjct: 403 ---------HGSEIVLLTDGEDSTISSCFDEV---------KQSGAIIHTVALG-PSADP 443 Query: 355 DLLRKCTDSSGQFFAVNDSRE---LLESFDKITD---KIQEQSVRIAPN 397 L + + G + D+ + L+++F ++ I ++S+++ Sbjct: 444 GLEKLAEMTGGMKTSATDNAQNNGLIDAFSALSSGNGAITQRSIQLESK 492 >gi|109900221|ref|YP_663476.1| vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c] gi|109702502|gb|ABG42422.1| Vault protein inter-alpha-trypsin [Pseudoalteromonas atlantica T6c] Length = 701 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 20/181 (11%), Positives = 61/181 (33%), Gaps = 14/181 (7%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + + ++ ++ I + + ++ ++ + L+ Sbjct: 323 AKRAVDFALTQLR-----PEDNVNIIQFNDAPQALWKRAMPATAKHIQRARNWVASLHAD 377 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T PA+ A + ++ S + V+FITDG + + L Sbjct: 378 GGTEMAPALTLALNKPSLHRDDSDLLGSHKL--RQVVFITDGSV-------SNEDALMSL 428 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + A +++++ + + P + + G F + D +++ + +K+ + Sbjct: 429 IESKLADNRLFTIGIGSAPNSYFMTQAAQAGRGTFTYIGDIQQVQHKMTALFNKLTRPVM 488 Query: 393 R 393 + Sbjct: 489 Q 489 >gi|24375029|ref|NP_719072.1| type IV pilin biogenesis protein, putative [Shewanella oneidensis MR-1] gi|24349772|gb|AAN56516.1|AE015789_3 type IV pilin biogenesis protein, putative [Shewanella oneidensis MR-1] Length = 1168 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 38/303 (12%), Positives = 76/303 (25%), Gaps = 71/303 (23%) Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + S S + N ++ SK P +ID+ Sbjct: 194 NTSSSPGTTWANALAAAKNTGFGVGQPVTFYTDNYLRWY-WLSKEGRLPTIKVPRIDIAK 252 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN--EVKSRLNKLNPY 272 ++ N++ S + G + + + S ++ L Sbjct: 253 KAISNIIRSTPTVDFGLAVFNYNYPNEGNRDGGRIVSGITQMTDTSRASLLSTIDGLPAK 312 Query: 273 ENTNTYPAMHHAYREL------YNEKESSHNTIGSTRLKK-------------------- 306 NT M+ AYR + ++ + + R Sbjct: 313 TNTPLCETMYEAYRYFAGKGVVFGHADTDYGSYVGNRPPYDNSVETNGTYVSPFKVCTDI 372 Query: 307 -FVIFITDGENSGASAYQNTLNTLQIC--------------------------------- 332 +VI++TDG + + N + TL Sbjct: 373 AYVIYVTDGSPTVDGSANNNVKTLTAAASKSGNYSSFSQGLSTPSYLPALASYMFNNDLI 432 Query: 333 ------EYMRNAGMKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVNDSRELLESFDKIT 384 + ++ Y++ S + L G +FA +S EL + + Sbjct: 433 NKPDSSNTEQMQNVRTYTIGFSKGADDAAPLLAETAKRGGGLYFAAQNSLELQNALNDAL 492 Query: 385 DKI 387 I Sbjct: 493 SNI 495 >gi|162448822|ref|YP_001611189.1| putative secreted protein [Sorangium cellulosum 'So ce 56'] gi|161159404|emb|CAN90709.1| putative secreted protein [Sorangium cellulosum 'So ce 56'] Length = 563 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 20/190 (10%), Positives = 67/190 (35%), Gaps = 23/190 (12%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 ++++ ++A N++ + + +A + + + + Sbjct: 48 YGDRLELAKKAAKIYWNTLVSSNVPASQSFTELLGVASYSDTSSVTYPLTALPASGLDTA 107 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++ L +T+ + A L +E + ++ VI ++DG+++ A + Sbjct: 108 VDALVADGSTSIGAGLEEALDMLISESPT-------KSARECVILLSDGQHNTPPAPSDF 160 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD----SSGQFFAVNDSR-----EL 376 +++S+A+ + + ++ S G + + EL Sbjct: 161 YADYFSRVD------EVHSIALGSGA-DEAMMSDIAANYGFSPGLYLRADSDTAIDQLEL 213 Query: 377 LESFDKITDK 386 + +F+++ + Sbjct: 214 IGAFNRMAND 223 >gi|193788254|dbj|BAG53148.1| unnamed protein product [Homo sapiens] Length = 437 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 171 GSNGIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|260592519|ref|ZP_05857977.1| BatB protein [Prevotella veroralis F0319] gi|260535565|gb|EEX18182.1| BatB protein [Prevotella veroralis F0319] Length = 331 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 25/206 (12%), Positives = 71/206 (34%), Gaps = 45/206 (21%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + + + ++ + ++ N + P++++ K L+ +NP Sbjct: 104 LATDVVPSRLDKSKLMVEGLMNKFTKNKLGLIVFAGDAFVQLPITSDYVSAKMFLDNINP 163 Query: 272 Y----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T+ A++ A +T+ K ++ ITDGE++ Sbjct: 164 SLIGTQGTDIGKAINLAMHSFTP----------NTQTGKAIVVITDGEDNEG-------G 206 Query: 328 TLQICEYMRNAGMKIYSVAVSA---------------PPE--------GQDLLRKCTDSS 364 + + + G+K++ + + + +++ +K D+ Sbjct: 207 AEAMAKQAQEKGIKVFILGIGSTQGTTIPMPNGEDLRDANGNIVKTHLNEEMCKKIADAG 266 Query: 365 -GQFFAVNDSRELLESFDKITDKIQE 389 G + V++S ++ K+Q+ Sbjct: 267 HGVYIHVDNSSVADALLERELGKLQK 292 >gi|162453507|ref|YP_001615874.1| hypothetical protein sce5231 [Sorangium cellulosum 'So ce 56'] gi|161164089|emb|CAN95394.1| hypothetical protein sce5231 [Sorangium cellulosum 'So ce 56'] Length = 490 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/242 (11%), Positives = 66/242 (27%), Gaps = 18/242 (7%) Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 VS Q D P S+ + D+ Sbjct: 166 VSGPCALRACQSSFDARTGIQICVADADCPDGDPCIDIGVCSQNSNVVCG-----DIGER 220 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + + + T Y V + +E+ +++ P T Sbjct: 221 CPN-RAGTCNRVSRSTCAHPYSCVTEDYATPAVSIAPLDAAW-ADELIGSIDRTFPDGGT 278 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T A+ A + +++ + TDG + + E + Sbjct: 279 PTAGALSGAIAQARAHAQANPTHRV------VTVLATDGMPTECNPTSADGIAEIAAEGL 332 Query: 336 RNA-GMKIYSVA-VSAPPEG-QDLLRKCTDSSGQ--FFAVNDSRELLESFDKITDKIQEQ 390 + + + + +G Q + + + G + ++ S ++ ++F +I+ Sbjct: 333 GGSPSISTFVIGVFGRDDDGAQATMNRIAEGGGTKSAYFIDTSSDVTQAFLDALSEIRGT 392 Query: 391 SV 392 S+ Sbjct: 393 SL 394 >gi|159043014|ref|YP_001531808.1| von Willebrand factor type A [Dinoroseobacter shibae DFL 12] gi|157910774|gb|ABV92207.1| von Willebrand factor type A [Dinoroseobacter shibae DFL 12] Length = 320 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 26/195 (13%), Positives = 61/195 (31%), Gaps = 31/195 (15%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 ++ + L +++ + G +P + + V R Sbjct: 104 VRDDFNLDDRAVTRL-EAVKAVGADFARRRAGDRLALVVFGSEAYFASPFTFDTESVARR 162 Query: 266 LNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + T+ + A + L ST + VI ++DG N+ + Sbjct: 163 IEEATIGISGRATSISDGLGLALKRLST----------STATSRVVILLSDGINNAGAT- 211 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPP-----------EGQDLLRKCTD-SSGQFFAV 370 N + E G++++++A+ LR + S G+ F V Sbjct: 212 ----NPRGVAELAARYGVRVHTIALGPKDLTTAEVGERGVVDAATLRAISQISGGESFRV 267 Query: 371 NDSRELLESFDKITD 385 + +L+ + + Sbjct: 268 RTTEDLVAVTEALDR 282 >gi|268324283|emb|CBH37871.1| hypothetical secreted protein, CARDB family [uncultured archaeon] Length = 1149 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 47/134 (35%), Gaps = 14/134 (10%) Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 LSN+ + KS ++ LNP + + A ++ K+ + K ++ I+D Sbjct: 158 LSNSSSMFKSWIDSLNPSGGGDLPESTLAAL--MHTVKDQKWRGGDA---SKIIVLISDA 212 Query: 315 ENSGASAYQNTLNTL--QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVN 371 N + + +GM +Y V + + + G+FF + Sbjct: 213 YPHSDEHCCNQEKNTFDGVISALTRSGMTVYVVG-----PDEASMEMIANSTGGKFFHIR 267 Query: 372 DSR-ELLESFDKIT 384 L ++I Sbjct: 268 AEGVSLKPVLEEIA 281 >gi|242013157|ref|XP_002427281.1| dihydropyridine-sensitive L-type calcium channel subunits alpha-2/delta precursor calcium channel subunit, putative [Pediculus humanus corporis] gi|212511622|gb|EEB14543.1| dihydropyridine-sensitive L-type calcium channel subunits alpha-2/delta precursor calcium channel subunit, putative [Pediculus humanus corporis] Length = 1205 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 32/228 (14%), Positives = 75/228 (32%), Gaps = 16/228 (7%) Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 +DN+ + + P + + +K+ + + +L+ Sbjct: 197 PAMKWSTSDNDVDLYDCRMRPWFIEAATCTKDIVILMDNSGSMTGM-RNTIAKLVVNSLL 255 Query: 222 NSIQKAI-QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 + S + T+ Q TP L + ++ + P N + A Sbjct: 256 KTFGNNDFINVLKFSWKPETVMPCFKDSLVQATP--EVLKSFQEAVSLVKPEGNASFPNA 313 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG- 339 ++ L +E + T + ++ +TDG + LN + G Sbjct: 314 FSYSLNLLKKYREDRNATNNLGGCNQAIMLVTDGLPGNVTEVFENLN-------LDENGM 366 Query: 340 --MKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL-LESFDKI 383 ++I++ V +G + L++ + G + V+D E+ + I Sbjct: 367 PIVRIFTYLVGTEVKGVEDLQQMACSNRGYYVHVHDLDEVHDQVLKYI 414 >gi|313837215|gb|EFS74929.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA2] gi|314971986|gb|EFT16084.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA3] Length = 322 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 51/162 (31%), Gaps = 25/162 (15%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P S + V L+ + + T A+ + + + S R+ Sbjct: 142 SAHPEIRMPPSTDRPTVLRALDGIELQDGTALGEAIDKSLQAVKMAPGGS-----KDRVP 196 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PE 352 ++ ++DG N+ + + + +Y++A Sbjct: 197 AAIVMLSDGGNTQGGSPLVAATHAAAAK------VPVYTIAFGTETGYVDLDGQRERVAP 250 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 LL D + + + + + +L E + ++ + + VR Sbjct: 251 DTKLLSDVADRTDAKSWTADSADKLQEVYKQVHSSVGYEPVR 292 >gi|301604540|ref|XP_002931918.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Xenopus (Silurana) tropicalis] Length = 929 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 47/143 (32%), Gaps = 21/143 (14%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NN+ + K + L P TN +H + L N S+ S +IF+TDG Sbjct: 427 NNIRDAKKFIYSLYPTGETNINEGIHVGAQLLNNYLASNGKHEKS---VSLMIFLTDGRA 483 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------GQDLLRKCTDSSGQ 366 + + + ++S+ + ++R+ ++ Sbjct: 484 TIGEIESPKILGNT--KNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGMMRRIQENEDA 541 Query: 367 FFAVNDSRELLESFDKITDKIQE 389 + +L +D+I + Sbjct: 542 ------ASQLKGFYDEIGTPLLS 558 >gi|166796269|gb|AAI59125.1| LOC779593 protein [Xenopus (Silurana) tropicalis] Length = 973 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 47/143 (32%), Gaps = 21/143 (14%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NN+ + K + L P TN +H + L N S+ S +IF+TDG Sbjct: 395 NNIRDAKKFIYSLYPTGETNINEGIHVGAQLLNNYLASNGKHEKS---VSLMIFLTDGRA 451 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------GQDLLRKCTDSSGQ 366 + + + ++S+ + ++R+ ++ Sbjct: 452 TIGEIESPKILGNT--KNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGMMRRIQENEDA 509 Query: 367 FFAVNDSRELLESFDKITDKIQE 389 + +L +D+I + Sbjct: 510 ------ASQLKGFYDEIGTPLLS 526 >gi|282900951|ref|ZP_06308884.1| hypothetical protein CRC_02367 [Cylindrospermopsis raciborskii CS-505] gi|281194042|gb|EFA69006.1| hypothetical protein CRC_02367 [Cylindrospermopsis raciborskii CS-505] Length = 575 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 35/325 (10%), Positives = 98/325 (30%), Gaps = 18/325 (5%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 G + + + + +++F+ K+ L G E + + Sbjct: 241 GSGHQQDGQPLTTLDLQSPKVNSLFQAFQKQVLVTGLTTLELKDIAIRDPKKLSVFVSEG 300 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + ++ + + + L + + + + + Sbjct: 301 LTFNALKKIPEFA-NSIFIPFGVPHNNPLVKFKWTTPEQQQGLEQFAKFAQSDTMQNLAP 359 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + ++ + ++FW K A + + Sbjct: 360 QMPPEVAQYLAQKQVPPVPSGKVLSLGQTFWKTQKDTGKTVYLMAVIDTSGSMYGGPLNA 419 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSRLNKLNPYENTNT 277 + + ++ A Q+ + +G + Y V ++L + ++ L T Sbjct: 420 VKDGLRIASQQINPGNY-VGLVTYGDQPVNLVKLAPFDDLQHKRFLAAIDNLQADGATAM 478 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 Y M A EL +K+++ N +++ +TDG+ + N ++ E ++ Sbjct: 479 YDGMMVALSELVQQKKTNPNGKF------YLLLLTDGQTNQGF------NFEEVKEIIQY 526 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTD 362 +G+++Y +A + L Sbjct: 527 SGVRVYPIAYG--EVNEAELNAIAA 549 >gi|220678711|emb|CAX12780.1| novel protein similar to H.sapiens ANTXR1, anthrax toxin receptor 1 (ANTXR1) [Danio rerio] Length = 270 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 29/145 (20%), Positives = 46/145 (31%), Gaps = 19/145 (13%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G L+ N +++ LN L P +T + A + G+ Sbjct: 10 STRGTTIMRLTENRDDITRGLNTLKREIPGGDTYMNLGLEEA---------NVQIYHGNY 60 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +I +TDGE + Q + R+ G +Y V V + L D Sbjct: 61 GAASVIIALTDGELNDHQFVT----AQQEAQRARSMGAIVYCVGV--KDFNETQLATIAD 114 Query: 363 SSGQFFAV-NDSRELLESFDKITDK 386 + F V + L D I K Sbjct: 115 TIEHVFPVIGGFQALEGMIDSIIKK 139 >gi|162452618|ref|YP_001614985.1| keratin associated protein 5-1 [Sorangium cellulosum 'So ce 56'] gi|161163200|emb|CAN94505.1| keratin associated protein 5-1 [Sorangium cellulosum 'So ce 56'] Length = 1031 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 35/256 (13%), Positives = 64/256 (25%), Gaps = 38/256 (14%) Query: 132 SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH-----NDNNNMTSNKYLLPPPPK 186 + S +++ + LD S SM + D + P Sbjct: 578 ESPACIATASVSLVRPVTLFVALDRSNSMRETLSGTDISALRWDPARNALRAFFQDPASA 637 Query: 187 K-----SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 FW + + + + D + + Sbjct: 638 GLGVAFRFWPHDNPGNCNSSSSCGVAFG-DGCRNAL---------IPFDGTAARRLTEDP 687 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 A + +N L P T+ M+ A + N Sbjct: 688 APWDQQE--------------NALVNALFPGPGTSGDTPMYPALHGATTWAINYKNAHPE 733 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 + V+ +TDG S N + L N G+++++V + L Sbjct: 734 EEVA--VVLVTDGIPSDCDTVPNHIAALAR-NAFANHGVRVHAVGFGSSNAEIINL-IAD 789 Query: 362 DSSGQFFAVNDSRELL 377 GQ F +N L Sbjct: 790 QGGGQAFNLNAGTTLQ 805 >gi|149018695|gb|EDL77336.1| similar to procollagen, type VI, alpha 3 isoform 4 (predicted) [Rattus norvegicus] Length = 719 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 18/151 (11%), Positives = 47/151 (31%), Gaps = 20/151 (13%) Query: 244 NIGIVGNQCTPLSNNLNEVK---SRLNKL-NPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + L NN + ++ + T T A++ + Sbjct: 206 QFSSTPREEFTLKNNYSSKDEMCRAISNVTQINSGTETGKALNFTLPFFDISQGGRPG-- 263 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + +++I ITDG++ + +R+ + I+++ V + L Sbjct: 264 ----VHQYLIVITDGDSHDDIVSP--------AKALRDRNIIIFAIGVGK--IQRAQLLA 309 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 T+ + + + L +I ++ Sbjct: 310 ITNDQDKVYHEENFESLQNLEKEILYEVCTS 340 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 18/88 (20%), Positives = 36/88 (40%), Gaps = 12/88 (13%) Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 T + +++I ITDG++S A + +R+ G+ IY++ V L + Sbjct: 78 TNVAQYLIVITDGQSSDPVAD--------AAQGLRDTGINIYAIGV--RDANTTELEEIA 127 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQE 389 + + F +D L ++ I Sbjct: 128 --NNRVFFTDDFHFLKSIHQEVVRDICS 153 >gi|116284252|gb|AAI24051.1| LOC779593 protein [Xenopus (Silurana) tropicalis] Length = 954 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 22/143 (15%), Positives = 47/143 (32%), Gaps = 21/143 (14%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NN+ + K + L P TN +H + L N S+ S +IF+TDG Sbjct: 376 NNIRDAKKFIYSLYPTGETNINEGIHVGAQLLNNYLASNGKHEKS---VSLMIFLTDGRA 432 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------GQDLLRKCTDSSGQ 366 + + + ++S+ + ++R+ ++ Sbjct: 433 TIGEIESPKILGNT--KNAIQEKFCLFSIGFGNDVDFNLLEKLSLENCGMMRRIQENEDA 490 Query: 367 FFAVNDSRELLESFDKITDKIQE 389 + +L +D+I + Sbjct: 491 ------ASQLKGFYDEIGTPLLS 507 >gi|242078053|ref|XP_002443795.1| hypothetical protein SORBIDRAFT_07g002215 [Sorghum bicolor] gi|241940145|gb|EES13290.1| hypothetical protein SORBIDRAFT_07g002215 [Sorghum bicolor] Length = 423 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 16/195 (8%), Positives = 54/195 (27%), Gaps = 35/195 (17%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K++ + + G L++++ + + + + + + + K + Sbjct: 141 KMERVKRAMGFLIDNL------GSDDRLSVVAFSTDARRIIRLTRMSDDGKAAAKRAVES 194 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA--------- 319 L +TN + A L VI ++DG+++ + Sbjct: 195 LAASGSTNIRGGLDVAAMVLD--------GRRHKNAVASVILLSDGQDNQSMHHEYLPTS 246 Query: 320 ----------SAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 + L + +++ + + + F Sbjct: 247 WVPKHSPAFSKGGYDVLVPPSFQRTAGGDHRCVTVHTFGFGIDHDAAAMHYISEVTGSTF 306 Query: 368 FAVNDSRELLESFDK 382 + + + ++F + Sbjct: 307 SFIENHAVIQDAFAR 321 >gi|16080726|ref|NP_391554.1| hypothetical protein BSU36730 [Bacillus subtilis subsp. subtilis str. 168] gi|221311632|ref|ZP_03593479.1| hypothetical protein Bsubs1_19856 [Bacillus subtilis subsp. subtilis str. 168] gi|221320871|ref|ZP_03602165.1| hypothetical protein BsubsJ_19720 [Bacillus subtilis subsp. subtilis str. JH642] gi|221325156|ref|ZP_03606450.1| hypothetical protein BsubsS_19886 [Bacillus subtilis subsp. subtilis str. SMY] gi|321313222|ref|YP_004205509.1| hypothetical protein BSn5_09310 [Bacillus subtilis BSn5] gi|8928526|sp|P70961|YWMD_BACSU RecName: Full=Uncharacterized protein ywmD; Flags: Precursor gi|1648854|emb|CAB03681.1| unknown [Bacillus subtilis subsp. subtilis str. 168] gi|2636198|emb|CAB15690.1| putative exported protein [Bacillus subtilis subsp. subtilis str. 168] gi|320019496|gb|ADV94482.1| hypothetical protein BSn5_09310 [Bacillus subtilis BSn5] Length = 224 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 30/218 (13%), Positives = 66/218 (30%), Gaps = 23/218 (10%) Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 + + + S RKID+ +S + + K + Sbjct: 19 SPSFAAEKQADTNVAVLFDGSGSMVQKTGGERKIDIAKKSVKSFAELLPKDTNLMLRVFG 78 Query: 237 RIGTIAYN----IGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 G + L + + L++L P T A+ +E Sbjct: 79 HAGNNKLSGKALSCSTTETIYGLHPYEGSLFDNSLSELKPTGWTPIAKALADTRKEFEAF 138 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSA 349 K V ITDGE + + E +R + + + + + Sbjct: 139 DADG---------KNVVYLITDGEETCGG------DPAAEIEKLRASNVDTIVNIIGFNF 183 Query: 350 PPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDK 386 +G + +++ G++ + N + E ++++K K Sbjct: 184 DVKGNEEMKQAAVAGGGEYISANSADEFEQAWEKEAQK 221 >gi|88857524|ref|ZP_01132167.1| type IV pilin biogenesis protein, putative [Pseudoalteromonas tunicata D2] gi|88820721|gb|EAR30533.1| type IV pilin biogenesis protein, putative [Pseudoalteromonas tunicata D2] Length = 1220 Score = 54.1 bits (128), Expect = 4e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 51/184 (27%), Gaps = 24/184 (13%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI-----GIVGNQCTPLSNNLNEVKSRLNK 268 S + + +Y +++ + V S N Sbjct: 341 TNCYQRECGSFSYTGNTPPYDASTMSGSSYVSPFKEGCGSNAYIILITDGVPTVDSDANS 400 Query: 269 LNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 T T P N S+ L ++ + + + + Sbjct: 401 YIATLIATKTDPNNQT--TSSSPFYFKVGNKTNSSYLPNLAHWMNNNDVNTNVSG----- 453 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQD--LLRKCTDSSGQFFAVNDSRELLESFDKITD 385 N + +Y++ E + LL T G++F + S EL ++F K Sbjct: 454 ---------NQNVSLYTIGFGEGAEDAEELLLEAATKGGGKYFGASSSIELSQAFSKALS 504 Query: 386 KIQE 389 KI E Sbjct: 505 KILE 508 >gi|317455060|pdb|2XWB|F Chain F, Crystal Structure Of Complement C3b In Complex With Factors B And D gi|317455061|pdb|2XWB|H Chain H, Crystal Structure Of Complement C3b In Complex With Factors B And D Length = 732 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 69/211 (32%), Gaps = 31/211 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V S+N + Sbjct: 244 SGSIGASDFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEAD---SSNADW 300 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + + R + +I +TDG + Sbjct: 301 VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLH 358 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 359 NMGG------DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKK 412 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D L + F ++ D+ Q S+ Sbjct: 413 DNEQHVFKVKDMENLEDVFYQMIDESQSLSL 443 >gi|251837060|pdb|3HRZ|D Chain D, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|251837064|pdb|3HS0|D Chain D, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|251837068|pdb|3HS0|I Chain I, Cobra Venom Factor (Cvf) In Complex With Human Factor B gi|317455073|pdb|2XWJ|I Chain I, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455074|pdb|2XWJ|J Chain J, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455075|pdb|2XWJ|K Chain K, Crystal Structure Of Complement C3b In Complex With Factor B gi|317455076|pdb|2XWJ|L Chain L, Crystal Structure Of Complement C3b In Complex With Factor B Length = 741 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 69/211 (32%), Gaps = 31/211 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V S+N + Sbjct: 253 SGSIGASDFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEAD---SSNADW 309 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + + R + +I +TDG + Sbjct: 310 VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLH 367 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 368 NMGG------DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKK 421 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D L + F ++ D+ Q S+ Sbjct: 422 DNEQHVFKVKDMENLEDVFYQMIDESQSLSL 452 >gi|294997269|ref|NP_001025043.3| integrin alpha-D isoform 1 [Mus musculus] Length = 1202 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 61/207 (29%), Gaps = 41/207 (19%) Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTP------------LSNNLN 260 + L+ + I + + +P L+ + Sbjct: 177 KDFVKALMGQLASTSTSFSLMQYSNILKTHFTFTEFKSSLSPQSLVDAIVQLQGLTYTAS 236 Query: 261 EVKSRLNK-----------LNPY--ENTNTYPAMHHAYRELYNEKESSHNTIGS--TRLK 305 ++ ++ L P NT+ ++ + + ++ K Sbjct: 237 GIQKVVDSNRLLLLRVWSALKPGCHYNTHMP---WWVWQRVRFSWKELFHSKNGARKSAK 293 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC----- 360 K +I ITDG+ +++ L + AG+ Y++ V L++ Sbjct: 294 KILIVITDGQ-----KFRDPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIGS 348 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKI 387 S F V + L +I +KI Sbjct: 349 APSQDHVFKVGNFVALRSIQRQIQEKI 375 >gi|74220882|dbj|BAE33629.1| unnamed protein product [Mus musculus] Length = 1202 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 28/207 (13%), Positives = 61/207 (29%), Gaps = 41/207 (19%) Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTP------------LSNNLN 260 + L+ + I + + +P L+ + Sbjct: 177 KDFVKALMGQLASTSTSFSLMQYSNILKTHFTFTEFKSSLSPQSLVDAIVQLQGLTYTAS 236 Query: 261 EVKSRLNK-----------LNPY--ENTNTYPAMHHAYRELYNEKESSHNTIGS--TRLK 305 ++ ++ L P NT+ ++ + + ++ K Sbjct: 237 GIQKVVDSNRLLLLRVWSALKPGCHYNTHMP---WWVWQRVRFSWKELFHSKNGARKSAK 293 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC----- 360 K +I ITDG+ +++ L + AG+ Y++ V L++ Sbjct: 294 KILIVITDGQ-----KFRDPLEYRHVIPEAEKAGIIRYAIGVGDAFREPTALQELNTIGS 348 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKI 387 S F V + L +I +KI Sbjct: 349 APSQDHVFKVGNFVALRSIQRQIQEKI 375 >gi|120553803|ref|YP_958154.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like [Marinobacter aquaeolei VT8] gi|120323652|gb|ABM17967.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Marinobacter aquaeolei VT8] Length = 1056 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 39/282 (13%), Positives = 86/282 (30%), Gaps = 51/282 (18%) Query: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA---------NRKID 211 + + +NN WS T + + +++ Sbjct: 124 GRSWRSFNENNNASRDVDCEADNDVSGVNWSNITAHTYVSGNYRNWLSSSTETVRKTRME 183 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 ++ + A L +++ + G V N + + +N + K++++ L P Sbjct: 184 IMQDVAKRLADTVTGVNIGLMAFNQSQNG---EGGRVLNNVSNVKDNASAFKAKVDGLYP 240 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF---------------VIFITDGEN 316 T + A R K S + VI +TDG Sbjct: 241 SGQTPLSETLFGAMRYFQGGKPFLDRNPVSGTVDGSNNYKSPIELECQANNVILLTDGAP 300 Query: 317 SGASAYQNTLNTLQ---------------------ICEYMRNAGMKIYSVAVSAPPEGQD 355 + + + + + + + + +K Y+V S Sbjct: 301 TSDTNHNSFIGSAIGKTCSGNCLDEIAGYMATNDMSAAFSGDQTIKTYTVGFSID--DPL 358 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDK-IQEQSVRIAP 396 L T G+++ N++++L ++FD I + S +AP Sbjct: 359 LGAAATAGGGEYYVANNAQQLADAFDDILRSVMDTSSTFVAP 400 >gi|149916952|ref|ZP_01905453.1| hypothetical protein PPSIR1_21929 [Plesiocystis pacifica SIR-1] gi|149822230|gb|EDM81621.1| hypothetical protein PPSIR1_21929 [Plesiocystis pacifica SIR-1] Length = 416 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 49/144 (34%), Gaps = 16/144 (11%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY--------NEKESSHNTIGSTRLKKFV 308 ++ +S++ + + T+ + A +L + + + + V Sbjct: 57 DSRAAARSKIETMQAWGTTDLAGGLQQALAQLQVAQNIVGAGGSTGAQSGAPDPTVLERV 116 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQF 367 + + DG + AS +T+ L G +I ++ + LL + + G F Sbjct: 117 VLLGDGVPNDASTIPSTVGQLAA------RGTQITALGYGIE-YDETLLASLAEQTHGSF 169 Query: 368 FAVNDSRELLESFDKITDKIQEQS 391 V+D + F I+ Sbjct: 170 RFVDDPEAVASLFRDEVLDIERTV 193 >gi|6465945|gb|AAF12731.1|AF108501_1 Ca(2+)-sensitive chloride channel 2 [Mus musculus] Length = 902 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 22/150 (14%), Positives = 61/150 (40%), Gaps = 25/150 (16%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S++ ++ + L + T+ + ++ + + + ++ +T Sbjct: 364 ITSSSDYQKITANLPQ-QATGGTSICHGLQAGFQAITSS--------DQSTSGSEIVLLT 414 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 DGE++G S+ E + +G I+++A+ + L +D + G F N Sbjct: 415 DGEDNGISSCF---------EAVSRSGAIIHTIALGPSAARE--LETLSDMTGGLRFYAN 463 Query: 372 -DSRELLESFDKITD---KIQEQSVRIAPN 397 L+++F +I+ + +Q++++ Sbjct: 464 KHVSSLIDAFSRISSTSGSVSQQALQLESK 493 >gi|327538080|gb|EGF24770.1| conserved hypothetical protein, secreted [Rhodopirellula baltica WH47] Length = 359 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 50/155 (32%), Gaps = 1/155 (0%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPT-TKKDQ 59 + I++ F ID+ + MQS DAA L G + D+ Sbjct: 3 LVVILLFALFAIAGLLIDIGMARLTQAHMQSVSDAASLEGGWQLAMGADQTTTRIAVVDR 62 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 + + + ++ N + +Q +P++ + + ++ + + Sbjct: 63 AAEMSESWGPHRIELEDGYDLNDDGKPESSQTINRDTFGDPVRPMLDPNVDNDLEGDIVL 122 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 K ++ L + G + E I VL Sbjct: 123 GKYMVNEVPDELPGQPMGYDRHAFEPDVIDPNSVL 157 >gi|119485135|ref|ZP_01619520.1| hypothetical protein L8106_06794 [Lyngbya sp. PCC 8106] gi|119457363|gb|EAW38488.1| hypothetical protein L8106_06794 [Lyngbya sp. PCC 8106] Length = 564 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 47/151 (31%), Gaps = 11/151 (7%) Query: 236 VRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 RI I ++ I S N+ +N L T Y A A L ++ Sbjct: 418 ERIALIDFDSEIRPPVVVDGTSEGRNQGMLFINNLKAEGGTKLYDATLSAQTWLQQNQQP 477 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR-NAGMKIYSVAVS-APPE 352 VI +TDGE+S + + L + + +++ Sbjct: 478 DAI--------NAVIILTDGEDSASQISLDKLQQELQKTGFNTDKRIAFFTIGYGKEGEF 529 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 +L+K D + ++ D + + Sbjct: 530 NSTVLKKIADLNSGYYRQGDPETISTLMADL 560 >gi|78186669|ref|YP_374712.1| hypothetical protein Plut_0797 [Chlorobium luteolum DSM 273] gi|78166571|gb|ABB23669.1| putative membrane protein [Chlorobium luteolum DSM 273] Length = 349 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 24/163 (14%), Positives = 51/163 (31%), Gaps = 10/163 (6%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRT-----IKDPTT 55 + A+ + V F A+DLA I R ++Q+A DAA L G S+ + Sbjct: 23 LFALTLPVLLGFAALAVDLARIHLTRVELQNAADAAALGGARSLSDSGGNPYNWSAAGSA 82 Query: 56 KKDQTSTIFKKQI---KKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYE 112 D ++ G + ++ + + + Sbjct: 83 ALDIARRNVANGAGIQDALIETGYWNIQDPSEGLRAPGTPGVPAAGDVAAVQVTITISRT 142 Query: 113 IPTEN--LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 + LF ++ A ++ S +I + + ++ Sbjct: 143 LNNGPLRLFFAPVLGIAEQDVQGSSVAVIAPPAGGTGLFPFVI 185 >gi|296481522|gb|DAA23637.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus] Length = 940 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 54/132 (40%), Gaps = 11/132 (8%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N++ + K ++ ++P T+ A+ + L + + R V+F+TDG+ Sbjct: 353 NSIRDGKVYIHHMSPSGGTDINGALQRGIQLLNDYVAHNDIE---DRSVSLVVFLTDGKP 409 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL----LRKCTDSSGQFFAVND 372 + + + + I++V + A + + L L C + + +D Sbjct: 410 TVGETHTLKILNNT--REAARGRVCIFTVGIGADVDFKLLEKLSLENCGLTR-RVHEDHD 466 Query: 373 S-RELLESFDKI 383 + +L+ +D+I Sbjct: 467 ARAQLIGFYDEI 478 >gi|255039218|ref|YP_003089839.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] gi|254951974|gb|ACT96674.1| von Willebrand factor type A [Dyadobacter fermentans DSM 18053] Length = 320 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 33/177 (18%), Positives = 64/177 (36%), Gaps = 46/177 (25%) Query: 254 PLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + ++ + L P TN A+ AY +L N + + K ++ Sbjct: 133 PLTYDAAALELFIQSLQTDLLPTNGTNVCGAIEMAYNKLMNSADPTSRA-------KMMV 185 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------- 352 TDGENS + + +R G+ +YSVAV Sbjct: 186 LFTDGENSSSCTN-------ALFNNLRRFGIGVYSVAVGTKVGISIQENGKPLKDKNDKL 238 Query: 353 -----GQDLLRKCT-DSSGQFFAVNDS-RELLESFDKITDK----IQEQSVRIAPNR 398 ++ LR S G ++ +N+S ++ + I + +++ + N+ Sbjct: 239 VISKLDENFLRGIANSSRGSYYELNNSKNDIQKLISDINQAEGALVDSRTITVVSNK 295 >gi|218659662|ref|ZP_03515592.1| hypothetical protein RetlI_08405 [Rhizobium etli IE4771] Length = 81 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 11/57 (19%), Positives = 21/57 (36%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK 57 +T I + + F ID+ + +Q+A+DA L+G + Sbjct: 20 LTLIAMPMLLGFSLLIIDVGRSSNLHTDLQNAVDAMALAGARELDGRDDAITRAQTA 76 >gi|77457690|ref|YP_347195.1| VCBS [Pseudomonas fluorescens Pf0-1] gi|77381693|gb|ABA73206.1| putative secreted protein, hemolysin [Pseudomonas fluorescens Pf0-1] Length = 2887 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 28/317 (8%), Positives = 85/317 (26%), Gaps = 19/317 (5%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 ++ + + + G+ G + Sbjct: 1937 VASTDQDVTVYEKALDLTQDGQDLAPGTVTGS-DPGNTGETATGTLVGSVSGGSGAITYT 1995 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + + Q ++ + + L + T + G S + + + + S Sbjct: 1996 LVGSATGTYGQIQLNADGTYTYTLTSAPKTTPNAN-DGPNTLSESFTYKATDALGNSTTS 2054 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + + + + + S + P ++++ ++ Sbjct: 2055 TIVVNIVDDVPKAVASERSVAAVEIDSNILIVLDISGSMADASGVPGLSRLELAKQAISA 2114 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 L++ V++ + ++ + ++ K+ L L+ TN Sbjct: 2115 LLDKYDDL------GDVKVQLVTFSSNATDRTSVWV--DVATAKTLLAGLSAGGGTNYDA 2166 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A+ Y T + F +DG+ + TL+ ++ Sbjct: 2167 AVATMYNAFNTSG-------KLTGAQNVGYFFSDGKPNEGDIGTADEATLKA--FLDANN 2217 Query: 340 MKIYSVAVSAPPEGQDL 356 +K Y++ + + +L Sbjct: 2218 IKNYAIGLGSGVSNANL 2234 >gi|328951281|ref|YP_004368616.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451605|gb|AEB12506.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 328 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 28/142 (19%), Positives = 48/142 (33%), Gaps = 29/142 (20%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG--------STRLKKFVIFIT 312 EV+ L L P E T+ A+ A R L + + +G V+ I+ Sbjct: 149 EVRKALEALEPQEATSLGAAILAAVRALPGRERAGEELLGRDPVPPELQELPPATVLLIS 208 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--------------PEGQDLLR 358 DG ++ L+ L+ R ++IY+V V +P P L Sbjct: 209 DGVSTSG------LDPLEAARVARAHQVRIYTVGVGSPRGSIQEVDGQLSFIPFDPSGLE 262 Query: 359 KCTD-SSGQFFAVNDSRELLES 379 + + G++ L Sbjct: 263 QIAALTGGRYVYPPTPDALEAV 284 >gi|160887255|ref|ZP_02068258.1| hypothetical protein BACOVA_05272 [Bacteroides ovatus ATCC 8483] gi|156107666|gb|EDO09411.1| hypothetical protein BACOVA_05272 [Bacteroides ovatus ATCC 8483] Length = 616 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 31/306 (10%), Positives = 86/306 (28%), Gaps = 26/306 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + E G I + +++ + ++ A Y + L P Sbjct: 139 DAVNAEEYGQIQENGFKSVSDAPLSTFSIDVDA-ASYSNMRRFINKGELPPVDAIRTEEL 197 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + + + + P + + N Sbjct: 198 VNYFSYDYPKPTGSDPVKITMEAGACPWNANHRLVRIGLKAKEI------PTDNLPASNL 251 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 ++D++ S LVN+++ ++ + Y Sbjct: 252 VFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKD--------KVAIVTYAGSAGVKLEAT 303 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 ++ +++ +++L +T + AY+ S+ N +I +DG Sbjct: 304 PGSDKQKIREAIDELTAGGSTAGGTGILLAYKIAKKNFISNGNNR--------IILCSDG 355 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + + L Q+ E R +G+ + + ++ + G +++ Sbjct: 356 DFNVGVSSAEGL--EQLIEKERKSGVFLTVLGYGMGNYKDKKIQVLAEKGNGNHAYIDNL 413 Query: 374 RELLES 379 +E Sbjct: 414 QEANRV 419 >gi|332815788|ref|XP_003309587.1| PREDICTED: collagen alpha-3(VI) chain [Pan troglodytes] Length = 2570 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 432 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNQQDVVN 486 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 487 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 543 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 544 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 593 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 594 ISERVTQLT 602 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 866 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 922 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 923 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 972 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 973 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1005 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1041 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1100 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1101 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1150 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1151 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1200 >gi|297669807|ref|XP_002813080.1| PREDICTED: collagen alpha-3(VI) chain-like isoform 3 [Pongo abelii] Length = 2575 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 432 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNQQDVVN 486 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 487 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 543 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 544 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 593 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 594 ISERVTQLT 602 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 866 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 922 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 923 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 972 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 973 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1005 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1041 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1100 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H N + R+ + IT G+ + Sbjct: 1101 KVVYKGGRHANTRVGLEHLR---VNHFVPEAGSRLDQRVPQIAFVITGGK--------SV 1149 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1150 EDAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1200 >gi|297669805|ref|XP_002813079.1| PREDICTED: collagen alpha-3(VI) chain-like isoform 2 [Pongo abelii] Length = 2976 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 833 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNQQDVVN 887 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 888 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 944 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 945 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 994 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 995 ISERVTQLT 1003 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1267 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1323 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1324 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1373 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1374 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1406 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1442 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1501 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H N + R+ + IT G+ + Sbjct: 1502 KVVYKGGRHANTRVGLEHLR---VNHFVPEAGSRLDQRVPQIAFVITGGK--------SV 1550 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1551 EDAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1601 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 53/180 (29%), Gaps = 22/180 (12%) Query: 206 ANRKIDVLIESA--GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + + LVN ++K +R+G + ++ + +V Sbjct: 43 GSNNTGSVNFAVILDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVL 100 Query: 264 SRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L N A+ + + + + + ++ I+ G Sbjct: 101 GAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGP------ 151 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV---NDSRELLE 378 + + + ++ A ++S + A + L+ F V +L E Sbjct: 152 SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 >gi|297669803|ref|XP_002813078.1| PREDICTED: collagen alpha-3(VI) chain-like isoform 1 [Pongo abelii] Length = 3182 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 1039 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNQQDVVN 1093 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 1094 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 1150 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 1151 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 1200 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1201 ISERVTQLT 1209 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1473 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1529 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1530 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1579 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1580 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1612 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1648 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1707 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H N + R+ + IT G+ + Sbjct: 1708 KVVYKGGRHANTRVGLEHLR---VNHFVPEAGSRLDQRVPQIAFVITGGK--------SV 1756 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1757 EDAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/314 (10%), Positives = 84/314 (26%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + +A A+ + N+F G+ + L Sbjct: 126 MQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDADEGAL 185 Query: 132 SLRSTGIIERSSEN--LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + N S+ ++ S + + T F Sbjct: 186 KEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIF 245 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + LVN ++K +R+G + ++ Sbjct: 246 LIDGSNNTGSVNFAVI-----------LDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRT 292 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + +V + L N A+ + + + + + Sbjct: 293 MFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQV 349 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G + + + ++ A ++S + A + L+ Sbjct: 350 LVLISAGP------SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLV 399 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 400 FTVPEFRSFGDLQE 413 >gi|260171427|ref|ZP_05757839.1| hypothetical protein BacD2_06135 [Bacteroides sp. D2] Length = 608 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 31/306 (10%), Positives = 86/306 (28%), Gaps = 26/306 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + E G I + +++ + ++ A Y + L P Sbjct: 131 DAVNAEEYGQIQENGFKSVSDAPLSTFSIDVDA-ASYSNMRRFINKGELPPVDAIRTEEL 189 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + + + + P + + N Sbjct: 190 VNYFSYDYPKPTGSDPVKITMEAGACPWNANHRLVRIGLKAKEI------PTDNLPASNL 243 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 ++D++ S LVN+++ ++ + Y Sbjct: 244 VFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKD--------KVAIVTYAGSAGVKLEAT 295 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 ++ +++ +++L +T + AY+ S+ N +I +DG Sbjct: 296 PGSDKQKIREAIDELTAGGSTAGGTGILLAYKIAKKNFISNGNNR--------IILCSDG 347 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + + L Q+ E R +G+ + + ++ + G +++ Sbjct: 348 DFNVGVSSAEGL--EQLIEKERKSGVFLTVLGYGMGNYKDKKIQVLAEKGNGNHAYIDNL 405 Query: 374 RELLES 379 +E Sbjct: 406 QEANRV 411 >gi|114584071|ref|XP_001153479.1| PREDICTED: collagen alpha-3(VI) chain isoform 3 [Pan troglodytes] Length = 2971 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 833 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNQQDVVN 887 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 888 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 944 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 945 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 994 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 995 ISERVTQLT 1003 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1267 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1323 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1324 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1373 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1374 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1406 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1442 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1501 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1502 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1551 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1552 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1601 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 53/180 (29%), Gaps = 22/180 (12%) Query: 206 ANRKIDVLIESA--GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + + LVN ++K +R+G + ++ + +V Sbjct: 43 GSNNTGSVNFAVILDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVL 100 Query: 264 SRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L N A+ + + + + + ++ I+ G Sbjct: 101 GAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGP------ 151 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV---NDSRELLE 378 + + + ++ A ++S + A + L+ F V +L E Sbjct: 152 SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLVFTVPEFRSFGDLQE 207 >gi|114584073|ref|XP_001153410.1| PREDICTED: alpha 3 type VI collagen isoform 2 [Pan troglodytes] Length = 2977 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 839 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNQQDVVN 893 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 894 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 950 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 951 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 1000 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1001 ISERVTQLT 1009 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1273 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1329 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1330 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1379 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1380 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1412 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1448 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1507 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1508 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1557 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1558 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1607 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/314 (10%), Positives = 84/314 (26%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + +A A+ + N+F G+ + L Sbjct: 126 MQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDADEGAL 185 Query: 132 SLRSTGIIERSSEN--LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + N S+ ++ S + + T F Sbjct: 186 KEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIF 245 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + LVN ++K +R+G + ++ Sbjct: 246 LIDGSNNTGSVNFAVI-----------LDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRT 292 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + +V + L N A+ + + + + + Sbjct: 293 MFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQV 349 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G + + + ++ A ++S + A + L+ Sbjct: 350 LVLISAGP------SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLV 399 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 400 FTVPEFRSFGDLQE 413 >gi|114584069|ref|XP_001153544.1| PREDICTED: collagen alpha-3(VI) chain isoform 4 [Pan troglodytes] Length = 3177 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 1039 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNQQDVVN 1093 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 1094 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 1150 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 1151 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 1200 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1201 ISERVTQLT 1209 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 57/153 (37%), Gaps = 15/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1473 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---V 1529 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1530 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1579 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + L+ T+ F V + REL ++I + Sbjct: 1580 RTELQTITNDPRLVFTVREFRELPNIEERIMNS 1612 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1648 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1707 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1708 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1757 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1758 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1807 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/314 (10%), Positives = 84/314 (26%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + +A A+ + N+F G+ + L Sbjct: 126 MQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDADEGAL 185 Query: 132 SLRSTGIIERSSEN--LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + N S+ ++ S + + T F Sbjct: 186 KEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIF 245 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + LVN ++K +R+G + ++ Sbjct: 246 LIDGSNNTGSVNFAVI-----------LDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRT 292 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + +V + L N A+ + + + + + Sbjct: 293 MFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQV 349 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G + + + ++ A ++S + A + L+ Sbjct: 350 LVLISAGP------SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLV 399 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 400 FTVPEFRSFGDLQE 413 >gi|114584077|ref|XP_001153230.1| PREDICTED: alpha 3 type VI collagen isoform 1 [Pan troglodytes] Length = 3010 Score = 53.8 bits (127), Expect = 4e-05, Method: Composition-based stats. Identities = 23/189 (12%), Positives = 56/189 (29%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 1039 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNQQDVVN 1093 Query: 265 RLNKLNPYEN-T-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L T NT A+ R + S T + + +I +T + Sbjct: 1094 AVRQLTLLGGPTPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTADRSGDDVRN 1150 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++ + A+ R+L Sbjct: 1151 PSVVVKRGGAVP----------IGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 1200 Query: 383 ITDKIQEQS 391 I++++ + + Sbjct: 1201 ISERVTQLT 1209 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1481 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1540 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1541 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1590 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1591 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1640 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/314 (10%), Positives = 84/314 (26%), Gaps = 33/314 (10%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 ++ + Q+ + + +A A+ + N+F G+ + L Sbjct: 126 MQSHLTKAAGSRAGDGVPQVIVVLTDGHSKDGLALPSAELKSADVNVFAIGVEDADEGAL 185 Query: 132 SLRSTGIIERSSEN--LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 ++ + N S+ ++ S + + T F Sbjct: 186 KEIASEPLNMHMFNLENFTSLHDIVGNLVSCVHSSVSPERAGDTETLKDITAQDSADIIF 245 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + LVN ++K +R+G + ++ Sbjct: 246 LIDGSNNTGSVNFAVI-----------LDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRT 292 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + +V + L N A+ + + + + + Sbjct: 293 MFSLDTYSTKAQVLGAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQV 349 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ I+ G + + + ++ A ++S + A + L+ Sbjct: 350 LVLISAGP------SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLV 399 Query: 368 FAV---NDSRELLE 378 F V +L E Sbjct: 400 FTVPEFRSFGDLQE 413 >gi|239993926|ref|ZP_04714450.1| inter-alpha-trypsin inhibitor domain-containing protein [Alteromonas macleodii ATCC 27126] Length = 586 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 28/209 (13%), Positives = 64/209 (30%), Gaps = 31/209 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + + P + L + L + + N + R+ Sbjct: 311 DFDRDITFVIDTSGSMGGRPIVDAKESLQLAIDRLSEKDRFNVVAFNNDTTRLF------ 364 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + N + + LN T PA++ A + + Sbjct: 365 ---ETSVEGTTRNKQYARDFVKHLNAGGGTEMAPALNAALKR-----------TTTKDFI 410 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCTDS 363 K V+FITDG N + ++N ++++V + + P + R Sbjct: 411 KQVVFITDGAVG---------NEAALFSQIKNELGDARLFTVGIGSAPNSYFMTRAAQFG 461 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQSV 392 G + V ++ ++ + D + K++ + Sbjct: 462 LGSYVFVRNTADIKQQMDSLLYKLESPVL 490 >gi|47219688|emb|CAG12610.1| unnamed protein product [Tetraodon nigroviridis] Length = 717 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 57/162 (35%), Gaps = 16/162 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN-TYPAMHHAYREL 288 + ++ +AY+ ++ + V + + + + T A+ H + ++ Sbjct: 488 DINRDVAQLALVAYSRRATTVFNLDTHDSGSAVLTAIGEASYMGGVASTGTALLHVHSDV 547 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + K V+ +TDG + + + +R+ G+ ++ V + Sbjct: 548 LTVDKGARLGVN-----KAVVVVTDGSGGT--------DAVVPAQKLRDNGVSVFVVGIG 594 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + LL+ S V +L D + + + Sbjct: 595 -DMQREKLLQ-IAGSEEHLILVPSYEDLKYFEDVLVQMLCSE 634 >gi|298207016|ref|YP_003715195.1| hypothetical protein CA2559_02145 [Croceibacter atlanticus HTCC2559] gi|83849650|gb|EAP87518.1| hypothetical protein CA2559_02145 [Croceibacter atlanticus HTCC2559] Length = 346 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 54/176 (30%), Gaps = 58/176 (32%) Query: 253 TPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + K L +N T A+ A ++++++ + + Sbjct: 143 LPITTDYASAKMFLQNMNTDMLSSQGTAINEAIQLAKTYYNDDEQTN----------RVL 192 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------------------- 349 I+DGE+ +++ I E G++I+++ V Sbjct: 193 FIISDGEDHEG-------DSVNIAEEASEEGIRIFTIGVGTTKGGRIPIKRNGVVLNYKK 245 Query: 350 PPEGQDL--------LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 Q + L+ + + ++ IT + E+ I N Sbjct: 246 DQNDQTVITRLQEETLKDIAKEANGEY----------IYNNITKETVEKVTEILQN 291 >gi|327273523|ref|XP_003221530.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-1-like [Anolis carolinensis] Length = 1091 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 40/392 (10%), Positives = 108/392 (27%), Gaps = 47/392 (11%) Query: 13 ITYAID-LAHIMYIRNQM--------QSALDAAVLSGCASIVSDRTIKDPTTKKD----- 58 +D LA I + + Q +++A + + T + Sbjct: 55 TASGVDELAKIYQEKRDLYTIEANNAQQLVESAARDIEKLLSNRSTALVRLAEAAERFQK 114 Query: 59 --------QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ 110 + I K L + + G K Y + Sbjct: 115 EHQWQDEFAANDIVYYNAKDELNEYGRNESDLGMQRIKPVFEEDPVFGRQTSYQHAA--- 171 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLYLQ 166 IPT+ ++ + L + E+ ++ + + ++ Sbjct: 172 VHIPTDIYDGSTIVLNELNWTGALDEVFKKNREEDQSLLWQVFGSATGLARYYPASPWVD 231 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 K N + P + S + + + + ++ S ++ ++ Sbjct: 232 KSRTQNKIDLYDVRRRPWYIQGAASPKDMLILVDASGSVSGLTLKLIRTSVIEMLETLSD 291 Query: 227 AIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 + + ++ +V N ++K + K+ T+ +A+ Sbjct: 292 DDFVNVVSFNENAQNVSCFNHLVQANVR----NKKKLKEAVYKIQAKGITDYKKGFSYAF 347 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 +L N S + K ++ TDG A + N +++++ Sbjct: 348 EQLLNHNHSVFRANCN----KIIMLFTDGGEEKAQEIFDKYNV--------EKKVRVFTF 395 Query: 346 AVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 +V + ++ + G ++ + + Sbjct: 396 SVGQHNYDKGPIQWMACANKGYYYEIPSIGAI 427 >gi|326674128|ref|XP_002664631.2| PREDICTED: integrin alpha-E-like [Danio rerio] Length = 540 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 59/176 (33%), Gaps = 18/176 (10%) Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS--RLNKLNPYEN-TNTY 278 + QKA N+ G + L +N + S ++ ++ N T T Sbjct: 363 DDFQKAKDFIYNVMSNCNFAIVQYGSLIRTELLLLDNEDRAGSLLKVKQIKQIYNLTKTA 422 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A++H +++ K +I ++DGE + + + Sbjct: 423 SAINHVLTDIF-----IPEKGSKNNTAKIIIVLSDGEILEDPMTLDE-----VLNKPQMK 472 Query: 339 GMKIYSVAVSAP----PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 G+ YS+ V P + + G++++V+ L + + +I Sbjct: 473 GVTRYSIGVGDGILKKPNAVKEMMQIA-DPGKYYSVSSYGALNDILSSLEREIIGT 527 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/210 (15%), Positives = 68/210 (32%), Gaps = 27/210 (12%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY-------NIGIV 248 P VL S + QKA N+ + + G + Sbjct: 146 YQGGRINQGPGTEIAFVLDGSGSIQDDDFQKAKDFIYNVMSNVWKTCFDCNFAIVQYGSL 205 Query: 249 GNQCTPLSNNLNEVKS--RLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 L +N + V S ++ ++ N T T A++H +++ Sbjct: 206 IRTELSLLDNEDRVGSLLKVKQIKQIYNLTKTASAINHVLTDIF-----IPENGSKDNSA 260 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-----PEGQDLLRKC 360 K +I ++DG+ G + + + G+ YS+ V +++++ Sbjct: 261 KIIIVLSDGKILGDPMTLDE-----VLNKPQMKGVTRYSIGVGDGILKNLDATEEMMQIA 315 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 G+++ V+ R L + + I Sbjct: 316 --DPGKYYNVSSYRALNDIVSSLERGIIGT 343 >gi|148699893|gb|EDL31840.1| procollagen, type VI, alpha 1, isoform CRA_b [Mus musculus] Length = 643 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 71/206 (34%), Gaps = 27/206 (13%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNE 261 P +D + +++++ + + ++ A + L+ + +E Sbjct: 85 PYGALVDKVKSFTKRFIDNLRD-RYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDE 143 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +K+ ++ + + T T A+ EL SH K++I +TDG Sbjct: 144 LKASVDAVKYFGKGTYTDCAIKKGLEELL--IGGSHLKEN-----KYLIVVTDGHPLEGY 196 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA----------V 370 ++ G+K++SVA+ P + L + Sbjct: 197 KEPCG-GLEDAVNEAKHLGIKVFSVAI-TPDHLEPRLSIIAT--DHTYRRNFTAADWGHS 252 Query: 371 NDSRE-LLESFDKITDKIQEQSVRIA 395 D+ E + ++ D I D I+ ++ Sbjct: 253 RDAEEVISQTIDTIVDMIKNNVEQVC 278 >gi|26343093|dbj|BAC35203.1| unnamed protein product [Mus musculus] Length = 266 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 71/206 (34%), Gaps = 27/206 (13%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNE 261 P +D + +++++ + + ++ A + L+ + +E Sbjct: 53 PYGALVDKVKSFTKRFIDNLRD-RYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDE 111 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +K+ ++ + + T T A+ EL SH K++I +TDG Sbjct: 112 LKASVDAVKYFGKGTYTDCAIKKGLEELL--IGGSHLKEN-----KYLIVVTDGHPLEGY 164 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA----------V 370 ++ G+K++SVA+ P + L + Sbjct: 165 KEPCG-GLEDAVNEAKHLGIKVFSVAI-TPDHLEPRLSIIAT--DHTYRRNFTAADWGHS 220 Query: 371 NDSRE-LLESFDKITDKIQEQSVRIA 395 D+ E + ++ D I D I+ ++ Sbjct: 221 RDAEEVISQTIDTIVDMIKNNVEQVC 246 >gi|6753484|ref|NP_034063.1| collagen alpha-1(VI) chain precursor [Mus musculus] gi|543913|sp|Q04857|CO6A1_MOUSE RecName: Full=Collagen alpha-1(VI) chain; Flags: Precursor gi|50479|emb|CAA47032.1| collagen alpha1 type VI-precursor [Mus musculus] gi|148699892|gb|EDL31839.1| procollagen, type VI, alpha 1, isoform CRA_a [Mus musculus] gi|162318378|gb|AAI56501.1| Collagen, type VI, alpha 1 [synthetic construct] gi|225000678|gb|AAI72708.1| Collagen, type VI, alpha 1 [synthetic construct] Length = 1025 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 31/206 (15%), Positives = 71/206 (34%), Gaps = 27/206 (13%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNE 261 P +D + +++++ + + ++ A + L+ + +E Sbjct: 53 PYGALVDKVKSFTKRFIDNLRD-RYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDE 111 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +K+ ++ + + T T A+ EL SH K++I +TDG Sbjct: 112 LKASVDAVKYFGKGTYTDCAIKKGLEELL--IGGSHLKEN-----KYLIVVTDGHPLEGY 164 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA----------V 370 ++ G+K++SVA+ P + L + Sbjct: 165 KEPCG-GLEDAVNEAKHLGIKVFSVAI-TPDHLEPRLSIIAT--DHTYRRNFTAADWGHS 220 Query: 371 NDSRE-LLESFDKITDKIQEQSVRIA 395 D+ E + ++ D I D I+ ++ Sbjct: 221 RDAEEVISQTIDTIVDMIKNNVEQVC 246 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 47/367 (12%), Positives = 103/367 (28%), Gaps = 37/367 (10%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 NQMQ +D + R +D + + Q + R Sbjct: 665 SHNQMQEHVD-------MRSPNVRNAQDFKEAVKKLQWMAGGTFTGEALQYTRDRLLPPT 717 Query: 85 IAQKAQINITKDKNNPLQYI--AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERS 142 + + IT +++ + +I ++ +K + + L S Sbjct: 718 QNNRIALVITDGRSDTQRDTTPLSVLCGADIQVVSVGIKDVFGFVAGSDQLNVISCQGLS 777 Query: 143 SENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA 202 IS ++ + + N + +K +F S + Sbjct: 778 QGRPGIS---LVKENYAELLDDGFLKNITAQICIDKKCPDYTCPITFSSPADITILLDSS 834 Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNL 259 + + + A L A + + VR+ + Y+ N Sbjct: 835 ASVGSHNFETTKVFAKRLAERFLSAGRADPSQDVRVAVVQYSGQGQQQPGRAALQFLQNY 894 Query: 260 NEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + S ++ ++ + T+ A+ + R + KK V+ +DG + G Sbjct: 895 TVLASSVDSMDFINDATDVNDALSYVTRFYREASSGA--------TKKRVLLFSDGNSQG 946 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------GQDLLRKCTDSSGQFFA 369 + + + AG++I+ V V G+ F Sbjct: 947 ----ATAEAIEKAVQEAQRAGIEIFVVVVGPQVNEPHIRVLVTGKTAEYDVAFGERHLFR 1002 Query: 370 VNDSREL 376 V + + L Sbjct: 1003 VPNYQAL 1009 >gi|254413248|ref|ZP_05027019.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] gi|196179868|gb|EDX74861.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] Length = 1037 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/344 (10%), Positives = 85/344 (24%), Gaps = 43/344 (12%) Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSL 133 + + + + K + Y ++++ Sbjct: 520 SVGSAKAPTTNSVGSKHFSASSAKAPTTNKTLMTNVAYAAEVNPPIAPPGRSGHDIDVTV 579 Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 + SS S ++ + N +Y + ++ Sbjct: 580 EIDAGVPISSVRSPSHPVTTQQTSSTVRVELADQETIPNKDLILRYQVAGADTQATVLTQ 639 Query: 194 TTKSKYAPA----PAPANRKIDVLIESAGNLVNSIQKAIQEKK--------------NLS 235 + A PA ++ +++ + LV++ N Sbjct: 640 ADERGGHFATYLIPAIEYQQNEIVPKDVVFLVDTSGSQSGSPIVQSKELMRQFIQGLNPQ 699 Query: 236 VRIGTIAYNIGIVGNQCTPLSN---NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 I + PL+N N + + +N+L+ T + + Sbjct: 700 DTFTIIDFANSTTQLSDKPLANTPQNRKKALNYINRLDANGGTELMNGIDTVLN--FPAA 757 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA---GMKIYSVAVSA 349 + + V+ +TDG QI +R+ G ++YS V + Sbjct: 758 PAGRL--------RSVVLLTDGLIGDD---------EQIIAEIRDRLKPGNRLYSFGVGS 800 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + R G V + + +I + Sbjct: 801 STNRFLIERLAELGRGTAEVVPPNESAEVVAQEFFQEINNPVLT 844 >gi|311897983|dbj|BAJ30391.1| hypothetical protein KSE_46100 [Kitasatospora setae KM-6054] Length = 455 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 55/145 (37%), Gaps = 20/145 (13%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N E K+ + + P T A+ A ++L + + ++ ITDGE+ Sbjct: 124 TNKVEAKTAVATVRPTGWTPIGIALRAAAQDLGTGPTT-----------RRIVLITDGED 172 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDS 373 + + + + + G+ + ++ ++ + + L + G F V Sbjct: 173 T-----CAPPDPCDVARELASQGIHLVVDTLGLAHDDKTRQQLICIANATGGTFTDVRTQ 227 Query: 374 RELLESFDKITDKIQEQSVRIAPNR 398 +L + ++ ++ Q+ I P + Sbjct: 228 EQLTKRVKQLVNRAQDTHA-ITPAK 251 >gi|239622779|ref|ZP_04665810.1| von Willebrand factor type A [Bifidobacterium longum subsp. infantis CCUG 52486] gi|239514776|gb|EEQ54643.1| von Willebrand factor type A [Bifidobacterium longum subsp. infantis CCUG 52486] Length = 401 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 39/348 (11%), Positives = 101/348 (29%), Gaps = 30/348 (8%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A LS + + DQ++ L+ + ++ D + Sbjct: 78 ATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGSLEIMNALKAGGQDHDAVWPASSMW 137 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 ++I + A + A + +++ ++ + + Sbjct: 138 ISMGDTKHIVKDAASTSTTPI------VFGIAKSKAVKLGWADDTGAAKPVSTADILAAV 191 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + ++ + + L P + W + + S + ++ ++ Sbjct: 192 SDGKLTFSMTSATVIDSALNVYQTALR-KPSWTIWVVDYSGSMSGEGKNGVVKGLNAALD 250 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + +K+ E + V I + T S +++ + + T Sbjct: 251 -----PDQAKKSYIEPASGDVNILIPFETEAHRPVKATGTS--TSDLLHEADATDASGGT 303 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + Y + A EL +E E+S ++ +TDG ++ + Sbjct: 304 DIYEGLLSALDELPSESEASQ-------YTTAIVLMTDGRSNSDHQDEFESAYKS----- 351 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 R + I+S+ L+ S+ + F +L F + Sbjct: 352 RGRDLPIFSIMFG--DADPSQLKSLATLSNAKVFDGRS-GDLAAVFRQ 396 >gi|237715741|ref|ZP_04546222.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262408749|ref|ZP_06085295.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] gi|229444450|gb|EEO50241.1| conserved hypothetical protein [Bacteroides sp. D1] gi|262353614|gb|EEZ02708.1| conserved hypothetical protein [Bacteroides sp. 2_1_22] Length = 616 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 31/306 (10%), Positives = 87/306 (28%), Gaps = 26/306 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + E G+ + +++ + ++ A Y + L P Sbjct: 139 DAANAEEYGEFQENGFKSVSDAPLSTFSIDVDA-ASYSNMRRFINKGELPPVDAIRTEEL 197 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + S + + P + + N Sbjct: 198 VNYFSYDYPKPTGSDPVKITMESGACPWNTNHRLVRIGLKAKEI------PTDNLPASNL 251 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 ++D++ S LVN+++ ++ + Y+ Sbjct: 252 VFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKD--------KVAIVTYSGSAGVKLEAT 303 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 ++ +++ +++L +T + AY+ S+ N +I +DG Sbjct: 304 PGSDKQKIREAIDELTAGGSTAGGAGILLAYKIAKKNLISNGNNR--------IILCSDG 355 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + + L Q+ E R +G+ + + ++ + G +++ Sbjct: 356 DFNVGVSSAEGL--EQLIEKERKSGVFLTVLGYGMGNYKDKKIQVLAEKGNGNHAYIDNL 413 Query: 374 RELLES 379 +E Sbjct: 414 QEANRV 419 >gi|197118219|ref|YP_002138646.1| hypothetical protein Gbem_1835 [Geobacter bemidjiensis Bem] gi|197087579|gb|ACH38850.1| conserved hypothetical protein [Geobacter bemidjiensis Bem] Length = 356 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 37/266 (13%), Positives = 78/266 (29%), Gaps = 22/266 (8%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK--DQ 59 AI++ + F F+ A+D H+ +R ++Q+A DAA L G + + T + Sbjct: 19 VAILLMMMFGFLGLAVDGGHLFKVRGELQNAADAAALKGAWHLYTRPTDPTQLPTLQWEV 78 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIA-----------QKAQINITKDKNNPLQYIAESK 108 ++ I ++ + +++ D+ + ++ IA Sbjct: 79 ARFQAQQMITENSSDNTALKDAMVDVGYWNTNSNILQPTTLPTPVPGVDIPAVRVIASRS 138 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH 168 F + + +S R+ + V + Sbjct: 139 DGNNGGPVKNFFMQIFGKDYSQVSSRAA-------VAMLGFPYTVPPAVPAELFPLALSK 191 Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 + S + PPP+ S + + K D + + + K Sbjct: 192 CMTDQYFSQVPMPDPPPEIRISSPYIPGGDTCYSGQWTSFKAD--TNDVRTIKDLMYKGN 249 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTP 254 E + I G + N P Sbjct: 250 PEPLAIGDEIWIEPGVEGSLYNHIVP 275 >gi|148656642|ref|YP_001276847.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568752|gb|ABQ90897.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 452 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 28/262 (10%), Positives = 81/262 (30%), Gaps = 32/262 (12%) Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 + ++ +V D SRSM + + L +++ Sbjct: 32 TVRAAATGDAGVRPVNWALVADASRSMRIPIV------DETQFRSLLRNGSAQETLVDGV 85 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 P + + ++V + E+ + + R+ + + + Sbjct: 86 PVWQLSGSVPQEVRK---AASSALDHVV-HALHTVVERLDRNDRLSLVVFADHALLLIPG 141 Query: 254 PLSNNLNEVKSRLN---KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + ++ + + L+ + TN + A ++ +++ V+ Sbjct: 142 MVGSDRVTLVRAIERLPGLDLGDGTNLADGIALALNQIRANRDARRANR--------VLL 193 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFA 369 +TDG ++ L + + + + I ++ + DLL D S G Sbjct: 194 LTDG------FTRDPAACLTLADQAADEHIAITTIGLG-GEFQDDLLTGIADRSGGNALF 246 Query: 370 VNDSRELLESFDKITDKIQEQS 391 + + + + I+ +++ Sbjct: 247 LKRASAIPRA---ISAELESAR 265 >gi|159896782|ref|YP_001543029.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159889821|gb|ABX02901.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 950 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 27/146 (18%), Positives = 47/146 (32%), Gaps = 18/146 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 ++G + ++ + ++V + L + P TN M+ AY +L Sbjct: 458 DDKLGVVTFDDSAHWTIELDKVPSQDDVVAALAPVPPSGQTNVVSGMNAAYEQL------ 511 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 S K I +TDG I E M G+ + VA Sbjct: 512 ----RQSDAKIKHAILLTDGWGHATDIGS-------IAENMNKDGITLSVVAAG-NGSDN 559 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESF 380 L R G+++ E+ + F Sbjct: 560 ALQRYAELGGGRYYPARVMEEVPQIF 585 >gi|1706571|sp|P54281|ECLC_BOVIN RecName: Full=Epithelial chloride channel protein; AltName: Full=Calcium-activated chloride channel gi|1184066|gb|AAC48511.1| calcium-activated chloride channel [Bos taurus] Length = 903 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 20/113 (17%), Positives = 44/113 (38%), Gaps = 21/113 (18%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ + ++ + ++S+ +I +TDGE++ + Sbjct: 380 EANGGTSICRGLKAGFQAIIQSQQST--------SGSEIILLTDGEDNEIHSC------- 424 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN-DSRELLESF 380 E ++ +G+ I+++A+ P L +D + G F N D L +F Sbjct: 425 --IEEVKQSGVIIHTIALG--PSAAKELETLSDMTGGHRFYANKDINGLTNAF 473 >gi|291402773|ref|XP_002718214.1| PREDICTED: integrin, alpha 11 [Oryctolagus cuniculus] Length = 1188 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 64/195 (32%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDAVHEFHLNDYKSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + Y+VAV + L + FF V D Sbjct: 278 PDLKKVIRQSERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|157375507|ref|YP_001474107.1| hypothetical protein Ssed_2370 [Shewanella sediminis HAW-EB3] gi|157317881|gb|ABV36979.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3] Length = 461 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 20/194 (10%), Positives = 54/194 (27%), Gaps = 16/194 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M I + A+D H++ + ++Q+A+DA+ L + + Sbjct: 31 MFTIGLFSLIAVAALALDGGHLLLNKGRLQNAVDASALYAAKEL-------QDGASLYEA 83 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + ++L + + + I+++ N Q + E Sbjct: 84 REAATTLLLQNL-------QYQENGELNSSIDLSAPDYNSTQ--VAANLFIEFSEWPDPF 134 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ + +R + + ++ + S + + L Sbjct: 135 SPILVEGSEYVRIRIENVGLTNFLAQIMNFDKSVRASAIAGRSTDIECLNKAVPMMVCAL 194 Query: 181 LPPPPKKSFWSKNT 194 +S T Sbjct: 195 NEDEDDDFGFSPGT 208 >gi|157823041|ref|NP_001101626.1| integrin alpha-11 [Rattus norvegicus] gi|149041917|gb|EDL95758.1| integrin, alpha 11 (predicted) [Rattus norvegicus] Length = 1171 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 62/195 (31%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+ + K +++G + Y V ++ +V Sbjct: 147 GSNSIYPWVEVQHFLIEILTKFYIGPGQ--IQVGIVQYGEDAVHEFHLNDYKSVKDVVEA 204 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 205 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 253 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + Y+VAV + L + FF V D Sbjct: 254 PDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 313 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 314 LKDIVDALGDRIFSL 328 >gi|315919742|ref|ZP_07915982.1| conserved hypothetical protein [Bacteroides sp. D2] gi|313693617|gb|EFS30452.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 616 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 31/306 (10%), Positives = 86/306 (28%), Gaps = 26/306 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + E G I + +++ + ++ A Y + L P Sbjct: 139 DAVNAEEYGQIQENGFKSVSDAPLSTFSIDVDA-ASYSNMRRFINKGELPPVDAIRTEEL 197 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + + + + P + + N Sbjct: 198 VNYFSYDYPKPTGSDPVKITMEAGACPWNANHRLVRIGLKAKEI------PTDNLPASNL 251 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 ++D++ S LVN+++ ++ + Y Sbjct: 252 VFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKD--------KVAIVTYAGSAGVKLEAT 303 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 ++ +++ +++L +T + AY+ S+ N +I +DG Sbjct: 304 PGSDKQKIREAIDELTAGGSTAGGTGILLAYKIAKKNFISNGNNR--------IILCSDG 355 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + + L Q+ E R +G+ + + ++ + G +++ Sbjct: 356 DFNVGVSSAEGL--EQLIEKERKSGVFLTVLGYGMGNYKDKKIQVLAEKGNGNHAYIDNL 413 Query: 374 RELLES 379 +E Sbjct: 414 QEANRV 419 >gi|256852741|ref|ZP_05558111.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T8] gi|256711200|gb|EEU26238.1| von Willebrand factor type A domain-containing protein [Enterococcus faecalis T8] gi|315030743|gb|EFT42675.1| von Willebrand factor type A domain protein [Enterococcus faecalis TX4000] Length = 1105 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 42/384 (10%), Positives = 96/384 (25%), Gaps = 32/384 (8%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 +M A A L+ T T + + H + G Sbjct: 151 AEARMAPATLRANLALPLIAPQYTTDNSGTYPTANWQPTGNQNVLNHQGNKDGGSQWDGQ 210 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + T N+ ++Y + L++R E Sbjct: 211 T--SWNGDPTNRTNSYIEYGGTGDQADYAIRKYARETTTPGLFDVYLNVRG----NVQKE 264 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP---PPPKKSFWSKNTTKSKYAP 201 + + +V+D S SM + L S Sbjct: 265 ITPLDLVLVVDWSGSMNENNRIGEVQKGVNRFVDTLADSGITNNININMGYVGYSSDGYN 324 Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 A D + N+ S + + G + L++ + Sbjct: 325 NNAIQMGPFDTVKNPIKNITPSSTRGGTFTQKALRDAGDMLATPNGHKKVIVLLTDGVPT 384 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 ++++ T R + + + GST D N Sbjct: 385 FSYTVSRVQ----TEADG------RFYGTQFTNRQDQPGSTSYISGSYNAPDQNNINKRI 434 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD---SSGQFFA 369 + T+ ++ G++I+ + + + +D +R+ + ++ Sbjct: 435 NSTFIATIGEAMALKQRGIEIHGLGIQLQSDPRANLSKQQVEDKMREMVSADENGDLYYE 494 Query: 370 -VNDSRELLESFDKITDKIQEQSV 392 + + ++ + K +I V Sbjct: 495 SADYAPDISDYLAKKAVQISGTVV 518 >gi|13447394|ref|NP_085104.1| chloride channel calcium activated 2 [Mus musculus] gi|12043705|gb|AAG47626.1|AF115852_1 endothelial chloride channel [Mus musculus] gi|14198178|gb|AAH08147.1| Chloride channel calcium activated 2 [Mus musculus] gi|74208910|dbj|BAE21205.1| unnamed protein product [Mus musculus] gi|148680073|gb|EDL12020.1| mCG120735 [Mus musculus] Length = 902 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 22/150 (14%), Positives = 61/150 (40%), Gaps = 25/150 (16%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S++ ++ + L + T+ + ++ + + + ++ +T Sbjct: 364 ITSSSDYQKITANLPQ-QATGGTSICHGLQAGFQAITSS--------DQSTSGSEIVLLT 414 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 DGE++G S+ E + +G I+++A+ + L +D + G F N Sbjct: 415 DGEDNGISSCF---------EAVSRSGAIIHTIALGPSAARE--LETLSDMTGGLRFYAN 463 Query: 372 -DSRELLESFDKITD---KIQEQSVRIAPN 397 L+++F +I+ + +Q++++ Sbjct: 464 KHVSSLIDAFSRISSTSGSVSQQALQLESK 493 >gi|194205647|ref|XP_001498059.2| PREDICTED: similar to von Willebrand factor A domain-containing protein 2 precursor (A domain-containing protein similar to matrilin and collagen) (AMACO) (Colon cancer secreted protein 2) (CCSP-2) [Equus caballus] Length = 784 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 62/164 (37%), Gaps = 19/164 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G ++ + EVK+++ ++ T T A+ + R+ Sbjct: 82 DINLKRVRVGAFQFSSAPHLEFPLDSFSTQQEVKAKIKRMVFKGGRTETGLALKYLLRKG 141 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ + + +I ITDG + G A + ++ G+ +++V V Sbjct: 142 FPGGRNA-------SVPQILIIITDGRSQGHVALP--------AKQLKERGITVFAVGVR 186 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 P + L + V + ++ ++ + + + ++ Sbjct: 187 FPRWEE--LHTLASEP-REQHVLMAEQVEDAANGLFSTLSSSAI 227 >gi|126728411|ref|ZP_01744227.1| hypothetical protein SSE37_20512 [Sagittula stellata E-37] gi|126711376|gb|EBA10426.1| hypothetical protein SSE37_20512 [Sagittula stellata E-37] Length = 219 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 23/152 (15%), Positives = 44/152 (28%), Gaps = 31/152 (20%) Query: 245 IGIVGNQCTPLSNNLNE---VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 N L+ + S + L P T + A L Sbjct: 53 DRTCRNVDLRLTPQWQADAPIISEIEGLRPAGGTALTDGVRLAAETL-----------DY 101 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA-GMKIYSVAVSA----------P 350 + V+ +TDG+ + E+ R A G+ ++ + Sbjct: 102 RNVPGAVVLVTDGKETCGGTPC-----QLAAEFAREAPGLTVHVIGFKVRGDHWDWSTPD 156 Query: 351 PEGQDLLRKCTD-SSGQFFAVNDSRELLESFD 381 G+ + R D + GQ+ + EL+ + Sbjct: 157 APGESVARCLADDTGGQYLSAETVDELVGALR 188 >gi|145491135|ref|XP_001431567.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398672|emb|CAK64169.1| unnamed protein product [Paramecium tetraurelia] Length = 590 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 19/151 (12%), Positives = 43/151 (28%), Gaps = 17/151 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 + R+ I ++ + N + + ++ A + +L Sbjct: 201 DNDRLQLITFDNDAHRLTPLKTVTNQNKSYFTQIIKQIQADGGNRISEATKMTFYQLK-- 258 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 V ++DG + + + T+ +++ Sbjct: 259 ------GRKYINNVTSVFLLSDGVDVTYPEVKKQIKTVNEV-------FTLHTFGFGEDH 305 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + Q + + C SG F+ V D L E F Sbjct: 306 DAQMMTQLCNLKSGSFYFVQDVTLLDEFFAD 336 >gi|198435896|ref|XP_002123489.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1595 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/196 (18%), Positives = 57/196 (29%), Gaps = 26/196 (13%) Query: 206 ANRKIDVLIESA-----GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN--- 257 + + + L + ++ V + + S+ Sbjct: 421 VVDQSGSVNQCNFQKVKRWLRDIVRSFNLGVTEQDVGVVVYSKKATTSTVVDLGFSDYDS 480 Query: 258 ----NLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 E+ L KL T T A A L KK +I +T Sbjct: 481 DGHTKKQEMTKILKKLAYEGGTTYTGYAFKLANEMLT-------GNKSRPDAKKMIILLT 533 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DG + NTL + + R A + I +V V Q L + FFAV Sbjct: 534 DGATT----AANTLQLKEELDVSRAANVMILAVGVGK--FNQTELIQIAGDRKNFFAVTK 587 Query: 373 SRELLESFDKITDKIQ 388 EL + DK+ + Sbjct: 588 FSELEKVRDKLRSSVS 603 >gi|157694069|ref|YP_001488531.1| hypothetical protein BPUM_3318 [Bacillus pumilus SAFR-032] gi|157682827|gb|ABV63971.1| hypothetical protein YwmD [Bacillus pumilus SAFR-032] Length = 225 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 23/197 (11%), Positives = 51/197 (25%), Gaps = 23/197 (11%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE----KKNLSVRIGTIAYNIGIVGNQC 252 S +KID+ S + + + Q + Sbjct: 40 SGSMAQSVEGEKKIDIAKRSIQSFASILSDDTQVLLRVFGHEGTNKNAGKAISCASSEAV 99 Query: 253 TPL-SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 S + + LN P T A+ ++ + + K V + Sbjct: 100 YGFGSYESSTFQQALNVYKPTGWTPLAKALTDTKQDFEDHQAEG---------KNIVYVV 150 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCT-DSSGQFF 368 +DG+ + + Q + + G+ + + + L+ GQ+ Sbjct: 151 SDGKETCGGSPS------QAAKELHEDGIDTIVNIIGFDVNEKEAKSLKSVAKAGGGQYQ 204 Query: 369 AVNDSRELLESFDKITD 385 ++ EL Sbjct: 205 PAANAEELNHILQNEAS 221 >gi|117618496|ref|YP_856674.1| von Willebrand factor type A domain-containing protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117559903|gb|ABK36851.1| von Willebrand factor type A domain protein [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 337 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 29/190 (15%), Positives = 64/190 (33%), Gaps = 49/190 (25%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP--- 271 + ++ + A RI I + +PL+ N + +L+ Sbjct: 110 SAVRQQIDRLIAARPG-----DRIALIVFADHAYL--LSPLTQETNALLGLTRELDFELV 162 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T A+ A + E+ ++ ++ +TDG N+ S + L+ Sbjct: 163 GRTTALGEAILLARQHKAPERSTA------------LLLVTDGRNTAGS-----ADPLRE 205 Query: 332 CEYMRNAGMKIYSVAVSAPPE---------------------GQDLLRKCTDSS-GQFFA 369 + AG+++Y++ V A P+ + LL++ + G++F Sbjct: 206 AKQAAAAGIRLYTLGVGADPDTFAEAMTPAQTPAQSDPSAELDEALLQQLAEVGHGRYFR 265 Query: 370 VNDSRELLES 379 +L Sbjct: 266 ARTQGDLEAI 275 >gi|320106178|ref|YP_004181768.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924699|gb|ADV81774.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 365 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 18/168 (10%), Positives = 54/168 (32%), Gaps = 22/168 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 + + + ++++ +++ L + T Y A+ + L Sbjct: 148 SNPKDEAFLVDFSDEAFIDQDFTSDVKKLEDGLGYVKASGGTAIYDAVVASADYL----- 202 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA--VSAPP 351 + K+ ++ +TDG+++ + + ++ + + G IY V Sbjct: 203 ----AKNAKLPKQVLLIVTDGDDNA--SGSTLEDAIRRVQEL--DGPVIYCVGLLFGPDS 254 Query: 352 E------GQDLLRK-CTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + +L + G + E+ ++ I++Q Sbjct: 255 NKSESRHARRVLETLAAQTGGLAYFPRKLSEVDSIATEVAQDIRQQYT 302 >gi|225872657|ref|YP_002754114.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] gi|225793405|gb|ACO33495.1| von Willebrand factor type A domain protein [Acidobacterium capsulatum ATCC 51196] Length = 410 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 29/201 (14%), Positives = 70/201 (34%), Gaps = 32/201 (15%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 K++ + ++A +LV N + ++ +Q ++++ Sbjct: 189 VDNSGSMQNKLNAVDKAALDLVR--------ASNPDDEAFIVNFSDQAYLDQ--GFTSSI 238 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +++ L T Y A+ + EL + K+ ++ +TDGE+ Sbjct: 239 AKLEQGLAHTEARGGTALYDAIVASADELS---------KDARHPKQVLLVVTDGED--- 286 Query: 320 SAYQNTLNTLQICEYMRN-AGMKIYSVAVSAPPEGQDLLRK-------CTDSSGQFFAVN 371 +T+N Q + ++ G +IY++ + G + R + G + Sbjct: 287 --DASTMNLQQAIQRVQALHGPEIYAIGLLYDDSGDEAHRARKALEQLTEQTGGLAYFPR 344 Query: 372 DSRELLESFDKITDKIQEQSV 392 + E ++ I+ Q Sbjct: 345 SLENVDEVAAEVAKDIRNQYT 365 >gi|328884707|emb|CCA57946.1| hypothetical protein SVEN_4660 [Streptomyces venezuelae ATCC 10712] Length = 535 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 28/209 (13%), Positives = 58/209 (27%), Gaps = 16/209 (7%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 LL T +++ L + L + + Sbjct: 338 VADGLLDAYDNTLRRPSRTVYVLDTSGSMNG-DRLERLKTALVELTGDFRDREEVTLMPF 396 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 + +P L+ +++ KL T Y ++ AYR L Sbjct: 397 GSAVKRDEVRTHTVDPASPRQA-LDAIRADARKLTASGGTAIYSSLQEAYRSL------- 448 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 S ++ +TDGEN+ + + ++ + + Sbjct: 449 --GKSSGDTFTSIVLMTDGENTDGAPAAAFDSFYGSLP-AGQQRTPVFPILFG--DSDRA 503 Query: 356 LLRKCTD-SSGQFFAVNDSRELLESFDKI 383 L D + G+ F + L +F++I Sbjct: 504 ELGHLADLTGGKLFDAHQGS-LDGAFEEI 531 >gi|220897447|emb|CAX15331.1| complement component 2 (within H-2S) [Mus musculus] Length = 1276 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 76/235 (32%), Gaps = 18/235 (7%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL--IESAGNLVNSIQKAIQEK 231 + P +K + + S + I + L N I+K Sbjct: 757 ADAEDGHSPGEQQKRKIVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYG 816 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHHAYR 286 + T A ++ S++ + V +LN+++ TNT A+ Y Sbjct: 817 VRPRYGLLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS 876 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM-------RNAG 339 + ++ R + +I +TDG ++ + ++ + R Sbjct: 877 MMSWAGDAPPEGWN--RTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDY 934 Query: 340 MKIYSVAVS--APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + +Y V + L D+ F V D +L F ++ D+ + S+ Sbjct: 935 LDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 989 >gi|220897445|emb|CAX15329.1| complement component 2 (within H-2S) [Mus musculus] Length = 970 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 76/235 (32%), Gaps = 18/235 (7%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL--IESAGNLVNSIQKAIQEK 231 + P +K + + S + I + L N I+K Sbjct: 451 ADAEDGHSPGEQQKRKIVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYG 510 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHHAYR 286 + T A ++ S++ + V +LN+++ TNT A+ Y Sbjct: 511 VRPRYGLLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS 570 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM-------RNAG 339 + ++ R + +I +TDG ++ + ++ + R Sbjct: 571 MMSWAGDAPPEGWN--RTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDY 628 Query: 340 MKIYSVAVS--APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + +Y V + L D+ F V D +L F ++ D+ + S+ Sbjct: 629 LDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 683 >gi|218156291|ref|NP_001136178.1| complement factor B isoform 2 [Mus musculus] gi|220897443|emb|CAX15327.1| complement factor B [Mus musculus] Length = 713 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 76/235 (32%), Gaps = 18/235 (7%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL--IESAGNLVNSIQKAIQEK 231 + P +K + + S + I + L N I+K Sbjct: 244 ADAEDGHSPGEQQKRKIVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYG 303 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHHAYR 286 + T A ++ S++ + V +LN+++ TNT A+ Y Sbjct: 304 VRPRYGLLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS 363 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM-------RNAG 339 + ++ R + +I +TDG ++ + ++ + R Sbjct: 364 MMSWAGDAPPEGWN--RTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDY 421 Query: 340 MKIYSVAVS--APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + +Y V + L D+ F V D +L F ++ D+ + S+ Sbjct: 422 LDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 476 >gi|218156289|ref|NP_032224.2| complement factor B isoform 1 [Mus musculus] gi|220897442|emb|CAX15326.1| complement factor B [Mus musculus] Length = 763 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 76/235 (32%), Gaps = 18/235 (7%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL--IESAGNLVNSIQKAIQEK 231 + P +K + + S + I + L N I+K Sbjct: 244 ADAEDGHSPGEQQKRKIVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYG 303 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHHAYR 286 + T A ++ S++ + V +LN+++ TNT A+ Y Sbjct: 304 VRPRYGLLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS 363 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM-------RNAG 339 + ++ R + +I +TDG ++ + ++ + R Sbjct: 364 MMSWAGDAPPEGWN--RTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDY 421 Query: 340 MKIYSVAVS--APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + +Y V + L D+ F V D +L F ++ D+ + S+ Sbjct: 422 LDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 476 >gi|148694792|gb|EDL26739.1| complement factor B, isoform CRA_b [Mus musculus] Length = 760 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 76/235 (32%), Gaps = 18/235 (7%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL--IESAGNLVNSIQKAIQEK 231 + P +K + + S + I + L N I+K Sbjct: 241 ADAEDGHSPGEQQKRKIVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYG 300 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHHAYR 286 + T A ++ S++ + V +LN+++ TNT A+ Y Sbjct: 301 VRPRYGLLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS 360 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM-------RNAG 339 + ++ R + +I +TDG ++ + ++ + R Sbjct: 361 MMSWAGDAPPEGWN--RTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDY 418 Query: 340 MKIYSVAVS--APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + +Y V + L D+ F V D +L F ++ D+ + S+ Sbjct: 419 LDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 473 >gi|148694794|gb|EDL26741.1| complement factor B, isoform CRA_d [Mus musculus] Length = 731 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 76/235 (32%), Gaps = 18/235 (7%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL--IESAGNLVNSIQKAIQEK 231 + P +K + + S + I + L N I+K Sbjct: 212 ADAEDGHSPGEQQKRKIVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYG 271 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHHAYR 286 + T A ++ S++ + V +LN+++ TNT A+ Y Sbjct: 272 VRPRYGLLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS 331 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM-------RNAG 339 + ++ R + +I +TDG ++ + ++ + R Sbjct: 332 MMSWAGDAPPEGWN--RTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDY 389 Query: 340 MKIYSVAVS--APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + +Y V + L D+ F V D +L F ++ D+ + S+ Sbjct: 390 LDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 444 >gi|116126|sp|P04186|CFAB_MOUSE RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|192412|gb|AAA37379.1| factor B [Mus musculus] gi|192414|gb|AAA63293.1| factor B [Mus musculus] gi|2944424|gb|AAC05283.1| complement factor B [Mus musculus] gi|3986764|gb|AAC84160.1| Bf [Mus musculus] gi|13529437|gb|AAH05451.1| Complement factor B [Mus musculus] gi|74146351|dbj|BAE28943.1| unnamed protein product [Mus musculus] gi|148694791|gb|EDL26738.1| complement factor B, isoform CRA_a [Mus musculus] Length = 761 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 36/235 (15%), Positives = 76/235 (32%), Gaps = 18/235 (7%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL--IESAGNLVNSIQKAIQEK 231 + P +K + + S + I + L N I+K Sbjct: 242 ADAEDGHSPGEQQKRKIVLDPSGSMNIYLVLDGSDSIGSSNFTGAKRCLTNLIEKVASYG 301 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHHAYR 286 + T A ++ S++ + V +LN+++ TNT A+ Y Sbjct: 302 VRPRYGLLTYATVPKVLVRVSDERSSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYS 361 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM-------RNAG 339 + ++ R + +I +TDG ++ + ++ + R Sbjct: 362 MMSWAGDAPPEGWN--RTRHVIIIMTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDY 419 Query: 340 MKIYSVAVS--APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + +Y V + L D+ F V D +L F ++ D+ + S+ Sbjct: 420 LDVYVFGVGPLVDSVNINALASKKDNEHHVFKVKDMEDLENVFYQMIDETKSLSL 474 >gi|297465782|ref|XP_609132.4| PREDICTED: collagen, type VI, alpha 3-like isoform 1, partial [Bos taurus] Length = 1803 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKN--LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN ++ +N Sbjct: 219 INFRRDSFQEVLRFVSEIVDTVYEGGDSIQVGLVQYNSDPTDEFFLKDFPTKQQIIDAIN 278 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H R N + R+ + IT G+ + Sbjct: 279 KVVYKGGRHANTKVGLEHLRR---NHFVPEAGSRLDQRVPQIAFVITGGK--------SV 327 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 328 EDAQEASMALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 378 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 58/169 (34%), Gaps = 15/169 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRE 287 VRIG + ++ + + V + +L + NT A+ R Sbjct: 38 NVGPNKVRIGVLQFSNDVFPEFQLKTYKSQASVLDAIRRLRFKGGSPLNTGKALEFVARN 97 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + S + + + ++ G++ + + + +++AG+ S+ V Sbjct: 98 YF---VKSAGSRIEDGVPQHLVLFLGGKSQDDISRYS--------QVIKSAGI--ASLGV 144 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + L+ T F V + R+L +++ + V AP Sbjct: 145 GDRNIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNSFGSSGVTPAP 193 >gi|297473452|ref|XP_002686619.1| PREDICTED: collagen, type VI, alpha 3-like isoform 4 [Bos taurus] gi|296488814|gb|DAA30927.1| collagen, type VI, alpha 3-like isoform 4 [Bos taurus] Length = 2555 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 26/173 (15%), Positives = 60/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKN--LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN ++ +N Sbjct: 1040 INFRRDSFQEVLRFVSEIVDTVYEGGDSIQVGLVQYNSDPTDEFFLKDFPTKQQIIDAIN 1099 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H R N + R+ + IT G+ + Sbjct: 1100 KVVYKGGRHANTKVGLEHLRR---NHFVPEAGSRLDQRVPQIAFVITGGK--------SV 1148 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1149 EDAQEASMALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1199 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 58/169 (34%), Gaps = 15/169 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRE 287 VRIG + ++ + + V + +L + NT A+ R Sbjct: 859 NVGPNKVRIGVLQFSNDVFPEFQLKTYKSQASVLDAIRRLRFKGGSPLNTGKALEFVARN 918 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + S + + + ++ G++ + + + +++AG+ S+ V Sbjct: 919 YF---VKSAGSRIEDGVPQHLVLFLGGKSQDDISRYS--------QVIKSAGI--ASLGV 965 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + L+ T F V + R+L +++ + V AP Sbjct: 966 GDRNIDRTELQTITSDPRLVFTVREFRDLPSIEERMVNSFGSSGVTPAP 1014 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 20/161 (12%), Positives = 54/161 (33%), Gaps = 16/161 (9%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + + + + + E + +++ Y + + EV + + K+ + Sbjct: 54 NAIRDFIAKVIQ-RLEIRQDLIQVAVAQYADTVRPEFYFNTYPSKREVINAVRKMKALDG 112 Query: 275 TN--TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 + T A+ L+ E + + + K ++ +T G+ + Q Sbjct: 113 SALYTGSALDFVRNNLFTE---AAGYRAAEGVPKLLVLVTGGK--------SLDAVSQPA 161 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 + ++ +G+ + AV Q L + S F + Sbjct: 162 QELKRSGIL--AFAVGNKVADQAELEEIAFDSSLVFTATEF 200 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 34/290 (11%), Positives = 80/290 (27%), Gaps = 31/290 (10%) Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 +S + + P NL G++P L + + I + L ++ Sbjct: 340 VAGKSSDRVDTPALNLKQSGVVPFILQAKNADPAELELIVPSPAFILVAESLPKIGDLQP 399 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + + P + S+ + P +L E +V Sbjct: 400 QIVNLL-----KSVQNGAPAPVSVEKDVVFLIDGSEGVRSGFP------LLKEFVQRVVE 448 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPA 280 S+ + VR+ + Y+ + V + L NT A Sbjct: 449 SL-----DVGPDRVRVAVVQYSDRTRPEFYLNSYMDQQSVVGAIRGLTLLGGPAPNTGAA 503 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + R + S + + +I +T A ++ + +R G Sbjct: 504 LEFVLRNILVGSAGSRIA---EGVPQLLIVLT--------ADRSGDDVRGPSVVLRRGGA 552 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ + + R+L I++++ + Sbjct: 553 V--PIGIGIGNADITEMQTLSFVPDFAVVIPTFRQLGTIQQVISERVTQL 600 >gi|325297740|ref|YP_004257657.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] gi|324317293|gb|ADY35184.1| von Willebrand factor type A [Bacteroides salanitronis DSM 18170] Length = 341 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 19/168 (11%), Positives = 56/168 (33%), Gaps = 46/168 (27%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L + P + T+ A+ A + + + + Sbjct: 142 TQLPITSDYISAKMFLETITPSLISTQGTDIRGAIDLAMKSFTP----------NEGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 ++ ITDGEN ++ + G++++ + V +P Sbjct: 192 AIVLITDGENHEG-------GAVEAAQEAAKKGVRVFVLGVGSPDGSPIPTGNGSEFRRD 244 Query: 353 ----------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 + + ++ +G + V+++ + + +K+ + Sbjct: 245 KDGNVIVTKLNEQMCQEIAKAGNGMYVRVDNTNNAERALNAEINKLAK 292 >gi|291227856|ref|XP_002733898.1| PREDICTED: inter-alpha trypsin inhibitor, heavy chain 3-like, partial [Saccoglossus kowalevskii] Length = 627 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 32/241 (13%), Positives = 78/241 (32%), Gaps = 22/241 (9%) Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 DVS + +Q N LPP K + + + S K+ Sbjct: 269 DVSHDLSAGQIQVLNGYFVHYFAPTGLPPVQKNVLFVIDVSGSMDGA-------KMGQTK 321 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 E+ +++ ++ ++ N+ +++ + N+ E K+ +N L Sbjct: 322 EALRVILDDMRS--FDRFNILTFSYEVSFWKENMM--ILATQENILEAKNFVNNLRASGG 377 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 TN + L + + + + R VI +TDG+ + + + + Sbjct: 378 TNFNGGLVEGVEML---RRVTDDAENTERSAFLVIMLTDGQPTSGETQLTKI--QENAKT 432 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ-----FFAVNDSRELLESFDKITDKIQE 389 + ++ + L+K + + + + +L +D++ Q Sbjct: 433 YIDGQYSLFCLGFG-GDVNFKFLQKISLENQGIARRIYEDADGYLQLKGFYDEVATMTQS 491 Query: 390 Q 390 Sbjct: 492 T 492 >gi|315636668|ref|ZP_07891900.1| conserved hypothetical protein [Arcobacter butzleri JV22] gi|315479050|gb|EFU69751.1| conserved hypothetical protein [Arcobacter butzleri JV22] Length = 1209 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 33/320 (10%), Positives = 80/320 (25%), Gaps = 23/320 (7%) Query: 52 DPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQY 111 S IK + K + + Sbjct: 586 SAALTDTDGSETLSVIIKNVPASATLESSKYEVSKNSDGSYTVKVPQGETSISDKLTMKV 645 Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 + A + ++++ ++ S + D Sbjct: 646 PQEDAKNINLQIEAKATEARDNEDGQNFKTATDSTTDKTPTLVVGSNKDSVINGGAGKDI 705 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTK--------SKYAPAPAPANRKIDVLIESAGNLVNS 223 + L K++ S A +ID+L ++ NL +S Sbjct: 706 LIGDTGGTELNVQAGKNYNIALVVDTSGSMKEASGSKTAWGTTISRIDLLKDALKNLADS 765 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 ++ + V I N S N++++ ++++ L TN A Sbjct: 766 LKGHDGKIN---VSIIDFDTNAKEPITFNDLTSKNISDLITKIDALKAEGGTNYEDAFLK 822 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA----- 338 + S + + F+TDG+ + ++ + M++A Sbjct: 823 TTSWFDTQ---SVTYGKAQGYENLTYFLTDGDPTFSNRDTKNNGSTTEYSDMKDAVDAFK 879 Query: 339 ----GMKIYSVAVSAPPEGQ 354 ++++ + Sbjct: 880 TLSGQSTVHAIGIGNGINEN 899 >gi|254447511|ref|ZP_05060977.1| inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium HTCC5015] gi|198262854|gb|EDY87133.1| inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium HTCC5015] Length = 670 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 60/185 (32%), Gaps = 23/185 (12%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ ++ V + + + + + + + + N+ + + + + Sbjct: 336 RMYHAKQALSQAVERLS-----PDDRFNVVEFNNQHSRLFSSMRSASAINVKQALNWVGR 390 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L T + E + + + VI ITD N Sbjct: 391 LQGGGGT-----------MMLPAVEDALSVRSDPAYLRQVILITDASV------GNEAEI 433 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 L++ E + G ++++V + P L + G + + +E+ ++ K++ Sbjct: 434 LRVVER-QRKGARLFTVGIGVSPNSYLLRKAAQVGQGDYVYIASGQEVKARMQRLFAKLE 492 Query: 389 EQSVR 393 ++ Sbjct: 493 NPVLK 497 >gi|194211145|ref|XP_001494734.2| PREDICTED: chloride channel, calcium activated, family member 4 [Equus caballus] Length = 1022 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 29/205 (14%), Positives = 79/205 (38%), Gaps = 33/205 (16%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + +++ + ++A + + +Q +N S + + ++ ++++ N Sbjct: 314 SGSMAGSNRLNRMNQAAKHFL------MQTIENGSWVGMVHFDSTAYIKSKLIQITSS-N 366 Query: 261 EVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 E L L T+ + A++ L + ++ +TDGE++ Sbjct: 367 ERNKLLESLPTAASGGTSICRGIKSAFQVLTGTYPQIDGSE--------IVLLTDGEDNT 418 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE--- 375 A + + +R +G I+ +A+ P Q ++ T + G+ +D Sbjct: 419 AGSCVD---------EVRQSGAIIHFIALG-PSADQAVIEMSTITGGKHKYASDEAANNG 468 Query: 376 LLESFDKITD---KIQEQSVRIAPN 397 L+++F + + +QS+++ Sbjct: 469 LIDAFAALVSGNADLSQQSLQLESK 493 >gi|123428998|ref|XP_001307617.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121889256|gb|EAX94687.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 642 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 31/298 (10%), Positives = 82/298 (27%), Gaps = 38/298 (12%) Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 N Y + ++ ++ + S + N+ +S+ +V Sbjct: 101 DINENGYFYKFPLTHKYQQGSVTNDYSDKPESFHFSTSIMTQ--KEMSNVKVSVNGTTNV 158 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 S ++ + + + K S + + + Sbjct: 159 IDSHNATFVTNDTPTKDAIIIETQIKDKDKNIAISSDGYIAITTYPSFEGPITSNSEFYF 218 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV--------KSRLNK 268 + S+ + K +R+ + +G +N + + K+ N Sbjct: 219 IVDCSGSMTGSRILKAIECMRMFIQSLPVGCR-FSIIKFGSNFHTILKSCDYTDKNLANA 277 Query: 269 LNP-------YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + T+ Y + + K + +TDGE Sbjct: 278 MQLLYTINSDMGGTDIYSPLKYVSDI-----------KPKKGFVKQIFLLTDGEVQ---- 322 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLE 378 ++ + R+ +I+S+ + + L++ S G + + D+ L + Sbjct: 323 --DSDEICAMAYKNRSNN-RIFSIGLGSGA-DPGLIKGIARKSGGNYTIIGDNDNLNQ 376 >gi|90408685|ref|ZP_01216835.1| hypothetical protein PCNPT3_08475 [Psychromonas sp. CNPT3] gi|90310199|gb|EAS38334.1| hypothetical protein PCNPT3_08475 [Psychromonas sp. CNPT3] Length = 349 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 65/179 (36%), Gaps = 22/179 (12%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV-----KSRLNKLNPYE 273 + ++++++ + + G TP + + + ++R+N + Sbjct: 127 SRLDAVKEVLTDFIKTRQGDRLGLILFGDAAFVQTPFTADHDVWLDLLMQTRVN--MAGK 184 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 +T+ A+ + ++ + K I ++DG ++G+ + Sbjct: 185 STHLGDAIGLTIKRFNEATKNQTSEKTRE---KVAIILSDGNDTGSYVP-----PIDAAM 236 Query: 334 YMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + ++IY +A+ P + K S GQ + + +ELL ++ KI Sbjct: 237 VAKVNAVRIYMIAIGDPKSVGEQSLDMQTINKIASVSGGQAYQALNQQELLNAYAKIDK 295 >gi|332266371|ref|XP_003282183.1| PREDICTED: collagen alpha-4(VI) chain-like, partial [Nomascus leucogenys] Length = 1020 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/165 (18%), Positives = 56/165 (33%), Gaps = 16/165 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYREL 288 + + VR+G YN I + V ++ L TNT A+ Sbjct: 266 DISSDRVRVGLAQYNDNIYPAFQLNQHPLKSMVLEQIQNLPYRPGGTNTGSALEFIRTNY 325 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E+ S R+ + VI +TDGE++ ++ + ++ G+ +Y V Sbjct: 326 LTEESGSRAK---DRVPQIVILVTDGESND--------EVQEVADRLKEDGVVVY--VVG 372 Query: 349 APPEGQDLLRKCTDSS-GQF-FAVNDSRELLESFDKITDKIQEQS 391 + L+K +F F + L + I + Sbjct: 373 VNVQDVQELQKIASEPFEKFLFNTENFNILQDFSGSILQTLCSAV 417 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 24/171 (14%), Positives = 56/171 (32%), Gaps = 16/171 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYREL 288 + ++R+G Y+ + +V + + N A+ L Sbjct: 63 NVSSETIRVGLAQYSDVPHSEFLLSTYHRKGDVLRHIRQFQFKPGGKNMGLALKFI---L 119 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + S + + + I+ G E +R AG+ +Y V V Sbjct: 120 DHHFQEASGSRASQGVPQIAVVISSGPAEDHVHGP--------AEALRRAGILLYVVGV- 170 Query: 349 APPEGQDLLRKCTDSSGQFF--AVNDSRELLESFDKITDKIQEQSVRIAPN 397 +L+ S + F + + L K+ ++ ++ + AP Sbjct: 171 RDAVWAELME-IASSPQENFTSFLPNFSGLSNLAQKLRQELCDKLAKAAPR 220 >gi|208434899|ref|YP_002266565.1| phage/colicin/tellurite resistance cluster Y protein [Helicobacter pylori G27] gi|208432828|gb|ACI27699.1| phage/colicin/tellurite resistance cluster Y protein [Helicobacter pylori G27] Length = 183 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 27/184 (14%), Positives = 68/184 (36%), Gaps = 16/184 (8%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +I VL ++ ++++ + KK L ++ + + G G +++ + + Sbjct: 2 IRTRIGVLNVCIQKMIETLKQ--EAKKELFSKMAIVTF--GGNGANLHTPFDDIKNINFK 57 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 L+ T A A + + +T + K + I ++DGE + + Sbjct: 58 --PLSASGGTPLDQAFKLAKDLIED-----RDTFPTKFYKPYSILVSDGEPNNDKWQEPL 110 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 N ++A +S+ + + + F +D +L++ F+ +T Sbjct: 111 FNFHNDGRSAKSAC---WSIFIGDREANPQVNKD--FGKDGVFYADDVEKLVKLFEIMTQ 165 Query: 386 KIQE 389 I + Sbjct: 166 TISK 169 >gi|198438279|ref|XP_002126427.1| PREDICTED: similar to LOC779593 protein [Ciona intestinalis] Length = 1012 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 19/167 (11%), Positives = 57/167 (34%), Gaps = 38/167 (22%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE-------------------- 291 T + ++N+ + ++ ++ TN A+ A+ + Sbjct: 382 STASTRSINDAITYVDAVSARGGTNMLVALQTAFAIMEPYLPSLPENETMVEDTTPFPTP 441 Query: 292 ------------KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 K ++ + K ++F+TDG + + + + E + Sbjct: 442 VPLQPETNHFIRKRATETQTELSNYAKMIVFLTDGRPTKDDVGTDDIASR--IEKINGGR 499 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSR-ELLESFDK 382 + ++++ + + + L + + G + F D+ ++ FD+ Sbjct: 500 VNLHTIGFGSLVDMRFLEKLAALNGGVSRRVFESLDAATQIRHFFDE 546 >gi|153809330|ref|ZP_01961998.1| hypothetical protein BACCAC_03643 [Bacteroides caccae ATCC 43185] gi|149128100|gb|EDM19321.1| hypothetical protein BACCAC_03643 [Bacteroides caccae ATCC 43185] Length = 611 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 30/303 (9%), Positives = 81/303 (26%), Gaps = 28/303 (9%) Query: 79 RENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGI 138 E G + + + ++ A Y + L P Sbjct: 140 TEEYGSFQENGFKEVADAALSTFSIDVDA-ASYSNMRRFVNKGELPPVDAVRTEELVNYF 198 Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT-TKS 197 + + + N L + Sbjct: 199 SYDYPKPTGNDPVKI-------TMEAGACPWNPANRLVRIGLKAKEIPTDNLPASNLVFL 251 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 ++D++ S LVN+++ ++ + Y+ + + Sbjct: 252 IDVSGSMWGANRLDLVKSSLKLLVNNLRDKD--------KVAIVTYSGSAGVKLESTSGS 303 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + +++ +++L +T + AY+ S+ N +I +DG+ + Sbjct: 304 DKQKIREAIDELTAGGSTAGGAGIMLAYKIAKKNFISNGNNR--------IILCSDGDFN 355 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 + L Q+ E R +G+ + + ++ + G +++ +E Sbjct: 356 VGVSSAEGL--EQLIERERKSGVFLTVLGYGMGNYKDKKIQVLAEKGNGNHAYIDNLQEA 413 Query: 377 LES 379 Sbjct: 414 NRV 416 >gi|48428051|sp|Q864W1|CFAB_PONPY RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|29690183|gb|AAM10003.1| complement factor B precursor [Pongo pygmaeus] Length = 764 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 43/233 (18%), Positives = 77/233 (33%), Gaps = 30/233 (12%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDV--LIESAGNLVNSIQKAIQEKKNLSVRIG 239 P +K + + S + I + LVN I+K + Sbjct: 253 PGEQQKRKIVLDPSGSMNIYLVLDGSDSIGAGNFTGAKKCLVNLIEKVASYGVKPRYGLV 312 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKES 294 T A I P S+N + V +LN++N TNT A+ Y + Sbjct: 313 TYATYPKIWVKVSEPDSSNADWVTKQLNEINYEDHKLKSGTNTKKALQAVYSMM--SWPD 370 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN-------------AGMK 341 G R + +I +TDG ++ + + + + +R+ + Sbjct: 371 DIPPEGWNRTRHVIILMTDGLHNMGG------DPITVIDEIRDLLYIGKDRKNPREDYLD 424 Query: 342 IYSVAVS--APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 +Y V + L D+ F V D L + F ++ D+ Q S+ Sbjct: 425 VYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLEDVFFQMIDESQSLSL 477 >gi|330918891|ref|XP_003298384.1| hypothetical protein PTT_09104 [Pyrenophora teres f. teres 0-1] gi|311328422|gb|EFQ93524.1| hypothetical protein PTT_09104 [Pyrenophora teres f. teres 0-1] Length = 1367 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 38/274 (13%), Positives = 80/274 (29%), Gaps = 27/274 (9%) Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 S + S L S Q Sbjct: 862 SSTSLQASLVFWIKMRDRGDRHITGEWSDAHWEPISPYFNREFCTGTL--SEEFCFTKEQ 919 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 D+ N + P K + +++ SK ++DVL + +N + Sbjct: 920 DDEDSKNTYDADKVSQPLVFKVYLGLDSSASKQKS----HMTRLDVLKQMFDAYINRLLA 975 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 N +G + ++ + Q ++N + + +LN L +T + ++ A Sbjct: 976 -----YNFHSHVGLVTFSTKALVAQ--KITNAVENFRHKLNNLKASGDTAIWDSIALAQD 1028 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 ++ E + +I I+DGE++ + + + + + + S Sbjct: 1029 QIQQYAEQYPGSKL------RIICISDGEDNK-----SQNTAVDLASRLIRDDITVDS-- 1075 Query: 347 VSAPPEGQDLLRK-CTDSSGQFFAVNDSRELLES 379 L+ C+ S G FA E + Sbjct: 1076 FCLDDHSNKELQTLCSLSGGYSFAPKTLDEAMAI 1109 >gi|297473020|ref|XP_002686329.1| PREDICTED: chloride channel accessory 1 [Bos taurus] gi|296489230|gb|DAA31343.1| chloride channel accessory 1 [Bos taurus] Length = 911 Score = 53.8 bits (127), Expect = 5e-05, Method: Composition-based stats. Identities = 21/135 (15%), Positives = 55/135 (40%), Gaps = 27/135 (20%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ + A+ + + + +I +TDGE++ T+ Sbjct: 380 TASGGTSICSGLRSAFTVIKKKYPTDGAE---------IILLTDGEDN----------TI 420 Query: 330 QIC-EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND---SRELLESFDKIT- 384 C + ++ +G I++VA+ P Q+L + + G +D + L+++F ++ Sbjct: 421 SACFDEVKQSGAIIHTVALG-PSAAQELEQMSKMTGGLQTYASDQVQNNGLVDAFAALSS 479 Query: 385 --DKIQEQSVRIAPN 397 + ++S+++ Sbjct: 480 GNKAVSQRSIQLESR 494 >gi|317483399|ref|ZP_07942390.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] gi|316915154|gb|EFV36585.1| von Willebrand factor type A domain-containing protein [Bifidobacterium sp. 12_1_47BFAA] Length = 401 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 39/348 (11%), Positives = 102/348 (29%), Gaps = 30/348 (8%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A LS + + DQ++ L+ + ++ D + Sbjct: 78 ATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGSLEIMNALKAGGQDHDAVWPASSMW 137 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 ++I + A + A + +++ ++ + + Sbjct: 138 ISMGDTKHIVKDAASTSTTPI------VFGIAKSKAVKLGWADDTGAAKPVSTADILAAV 191 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + ++ + + L P + W + + S + ++ ++ Sbjct: 192 SDGKLTFSMTSATVIDSALNVYQTALR-KPSWTIWVVDYSGSMSGEGKNGVVKGLNAALD 250 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + +K+ E + V I + T S +++ + + T Sbjct: 251 -----PDQAKKSYIEPASGDVNILIPFETEAHRPVKATGTS--TSDLLHEADATDASGGT 303 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + Y + A EL +E E+S ++ +TDG ++ + + Sbjct: 304 DIYEGLLSALDELPSESEASQ-------YTTAIVLMTDGRSNSDHQDEFESSYKS----- 351 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 R + I+S+ L+ S+ + F +L F + Sbjct: 352 RGRDLPIFSIMFG--DADPSQLKSLATLSNAKVFDGRS-GDLAAVFRQ 396 >gi|239781743|pdb|2WIN|I Chain I, C3 Convertase (C3bbb) Stabilized By Scin gi|239781744|pdb|2WIN|J Chain J, C3 Convertase (C3bbb) Stabilized By Scin gi|239781745|pdb|2WIN|K Chain K, C3 Convertase (C3bbb) Stabilized By Scin gi|239781746|pdb|2WIN|L Chain L, C3 Convertase (C3bbb) Stabilized By Scin Length = 507 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 69/211 (32%), Gaps = 31/211 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V S+N + Sbjct: 19 SDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEAD---SSNADW 75 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + + R + +I +TDG + Sbjct: 76 VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLH 133 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 134 NMGG------DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKK 187 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D L + F ++ D+ Q S+ Sbjct: 188 DNEQHVFKVKDMENLEDVFYQMIDESQSLSL 218 >gi|194384366|dbj|BAG64956.1| unnamed protein product [Homo sapiens] Length = 1266 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 69/211 (32%), Gaps = 31/211 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V S+N + Sbjct: 780 SDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEAD---SSNADW 836 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + + R + +I +TDG + Sbjct: 837 VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLH 894 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 895 NMGG------DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKK 948 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D L + F ++ D+ Q S+ Sbjct: 949 DNEQHVFKVKDMENLEDVFYQMIDESQSLSL 979 >gi|168983786|emb|CAQ06837.1| complement factor B [Homo sapiens] gi|168984885|emb|CAQ08426.1| complement factor B [Homo sapiens] Length = 589 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 69/211 (32%), Gaps = 31/211 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V S+N + Sbjct: 278 SDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEAD---SSNADW 334 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + + R + +I +TDG + Sbjct: 335 VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLH 392 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 393 NMGG------DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKK 446 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D L + F ++ D+ Q S+ Sbjct: 447 DNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|134105218|pdb|2OK5|A Chain A, Human Complement Factor B Length = 752 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 69/211 (32%), Gaps = 31/211 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V S+N + Sbjct: 263 SDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEAD---SSNADW 319 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + + R + +I +TDG + Sbjct: 320 VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLH 377 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 378 NMGG------DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKK 431 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D L + F ++ D+ Q S+ Sbjct: 432 DNEQHVFKVKDMENLEDVFYQMIDESQSLSL 462 >gi|150397140|ref|YP_001327607.1| putative signal peptide protein [Sinorhizobium medicae WSM419] gi|150028655|gb|ABR60772.1| conserved hypothetical signal peptide protein [Sinorhizobium medicae WSM419] Length = 126 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 15/58 (25%), Positives = 22/58 (37%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 AI AID A + ++QSALDAA L + T ++ + Sbjct: 26 AIAFIPIIGAAALAIDFAGAYFEAEKIQSALDAAALGSVRAYGEGATEEEAYDAAQKF 83 >gi|57209925|emb|CAI41860.1| complement factor B [Homo sapiens] Length = 764 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 69/211 (32%), Gaps = 31/211 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V S+N + Sbjct: 278 SDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEAD---SSNADW 334 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + + R + +I +TDG + Sbjct: 335 VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLH 392 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 393 NMGG------DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKK 446 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D L + F ++ D+ Q S+ Sbjct: 447 DNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|13278732|gb|AAH04143.1| Complement factor B [Homo sapiens] gi|14124934|gb|AAH07990.1| Complement factor B [Homo sapiens] gi|62898361|dbj|BAD97120.1| complement factor B preproprotein variant [Homo sapiens] gi|119623955|gb|EAX03550.1| complement factor B [Homo sapiens] gi|123982996|gb|ABM83239.1| complement factor B [synthetic construct] gi|123997681|gb|ABM86442.1| complement factor B [synthetic construct] gi|307685187|dbj|BAJ20524.1| complement factor B [synthetic construct] Length = 764 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 69/211 (32%), Gaps = 31/211 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V S+N + Sbjct: 278 SDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEAD---SSNADW 334 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + + R + +I +TDG + Sbjct: 335 VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLH 392 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 393 NMGG------DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKK 446 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D L + F ++ D+ Q S+ Sbjct: 447 DNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|13560705|gb|AAK30167.1|AF349679_1 factor B [Homo sapiens] Length = 621 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 69/211 (32%), Gaps = 31/211 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V S+N + Sbjct: 278 SDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEAD---SSNADW 334 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + + R + +I +TDG + Sbjct: 335 VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLH 392 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 393 NMGG------DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKK 446 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D L + F ++ D+ Q S+ Sbjct: 447 DNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|291922|gb|AAA16820.1| complement factor B [Homo sapiens] gi|2347133|gb|AAB67977.1| complement factor B [Homo sapiens] Length = 764 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 69/211 (32%), Gaps = 31/211 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V S+N + Sbjct: 278 SDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEAD---SSNADW 334 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + + R + +I +TDG + Sbjct: 335 VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLH 392 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 393 NMGG------DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKK 446 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D L + F ++ D+ Q S+ Sbjct: 447 DNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|67782358|ref|NP_001701.2| complement factor B preproprotein [Homo sapiens] gi|584908|sp|P00751|CFAB_HUMAN RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; AltName: Full=Glycine-rich beta glycoprotein; Short=GBG; AltName: Full=PBF2; AltName: Full=Properdin factor B; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|4261689|gb|AAD13989.1|S67310_1 complement factor B [Homo sapiens] gi|297569|emb|CAA51389.1| complement factor B [Homo sapiens] gi|25070931|gb|AAN71991.1| B-factor, properdin [Homo sapiens] gi|55961819|emb|CAI17456.1| complement factor B [Homo sapiens] gi|123857994|emb|CAM25864.1| complement factor B [Homo sapiens] gi|168984418|emb|CAQ09274.1| complement factor B [Homo sapiens] gi|168985079|emb|CAQ07483.1| complement factor B [Homo sapiens] gi|168985957|emb|CAQ07113.1| complement factor B [Homo sapiens] Length = 764 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/211 (16%), Positives = 69/211 (32%), Gaps = 31/211 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + V T V S+N + Sbjct: 278 SDSIGASNFTGAKKCLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEAD---SSNADW 334 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + + R + +I +TDG + Sbjct: 335 VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLH 392 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 393 NMGG------DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKK 446 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D L + F ++ D+ Q S+ Sbjct: 447 DNEQHVFKVKDMENLEDVFYQMIDESQSLSL 477 >gi|254482897|ref|ZP_05096133.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214036769|gb|EEB77440.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 330 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 37/277 (13%), Positives = 91/277 (32%), Gaps = 42/277 (15%) Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 F L+ + + + + + + +++ +L + + + ++ + + Sbjct: 36 FFDKLVDLSEQKPEQGARILRRSTLQKILVNLTWLLLILAAAKPQWIGPPIEQQKSGRDL 95 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 + T S ++D + + L S R+ Sbjct: 96 MIAVDLSGSMEARDFTLPSGVTV------DRLDAVKQVLKELAA---------NRESDRL 140 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 G I + TP +++ + L++ T M + S Sbjct: 141 GLIVFGAAAYLQ--TPFTDDHQVWQQLLDE--------TEIGMAGPSTVFGDAIGLSIKL 190 Query: 299 IGSTRLK-KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDL 356 + + +I +TDG ++G +T+ + + G++IY++A+ P G+D Sbjct: 191 FSDSDSDNRVLIMLTDGNDTG-----STVPPVDAAKVAAANGVRIYTIAIGDPATVGEDA 245 Query: 357 L--------RKCTDSSGQFFAVNDSRELLESFDKITD 385 L K G+ F D E+ +++ I + Sbjct: 246 LDMDTITRVSKIA--DGRTFRALDQDEMRQAYITIGE 280 >gi|149918749|ref|ZP_01907236.1| batB protein [Plesiocystis pacifica SIR-1] gi|149820350|gb|EDM79766.1| batB protein [Plesiocystis pacifica SIR-1] Length = 421 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 54/177 (30%), Gaps = 39/177 (22%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGST--- 302 + P+++++ + L +P T A+ + L + Sbjct: 157 ARSFPVTSDMGVLSLFLAHADPRTENPGGTAIGKALDKSIDLLVAVRRDDSGARADQVEG 216 Query: 303 ----------------RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + ++ +TDGE++ ++ G++IY+V Sbjct: 217 EGEDESGAPEAAPALSEADQVIVLLTDGEDTVGR-------PEEVAARAEQLGIRIYTVG 269 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRE------LLE-SFDKITDKIQEQSVRIAP 396 + + + + D+ L E + + D+ + + VR+ P Sbjct: 270 IGSDSGEPIMRYDADGQPAGY--ATDADGKPVMTRLDEGTLKTLADQTRGKYVRVKP 324 >gi|149639247|ref|XP_001506420.1| PREDICTED: similar to voltage-dependent calcium channel alpha-2 delta subunit [Ornithorhynchus anatinus] Length = 1113 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 32/313 (10%), Positives = 90/313 (28%), Gaps = 25/313 (7%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK-AQYEIPTENLFLKGLIPSALT 129 + K E +I ++ + + A IPT+ ++ + L Sbjct: 148 NAKDNLNDPEKNESEPGSQRIKPVFVEDAVFRRQTSYQHAAVHIPTDIYEGSTIVLNELN 207 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLYLQKHNDNNNMTSNKYLLPPPP 185 S + E+ + + + ++ K N + P Sbjct: 208 WTSSLDEVFKKNRDEDPTLLWQVFGSATGLARYYPASPWVDKSRSPNKIDLYDVRRRPWY 267 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYN 244 + S + + + + ++ S ++ ++ + + ++ Sbjct: 268 IQGAASPKDMLILVDVSGSVSGLTLKLIRTSVSEMLETLSDDDFVNVASFNSNAQDVSCF 327 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 +V N +K +N + T+ A+ +L N S Sbjct: 328 QHLVQANVR----NKKVLKDAVNNITAKGITDYKKGFSFAFEQLLNYNVSRA------NC 377 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 K ++ TDG A N ++ +++++ +V + ++ + Sbjct: 378 NKIIMLFTDGGEERAQEIFKKYN--------QDKKVRVFTFSVGQHNYDRGPIQWMACKN 429 Query: 365 -GQFFAVNDSREL 376 G ++ + + Sbjct: 430 KGYYYEIPSIGAI 442 >gi|126309708|ref|XP_001376394.1| PREDICTED: similar to B-factor, properdin [Monodelphis domestica] Length = 764 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 31/234 (13%), Positives = 67/234 (28%), Gaps = 40/234 (17%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAP-----APAPANRKIDVLIESAGNLVNSIQKAI 228 ++ P + + S + + + +L++ + Sbjct: 249 ADADDEYSPGGQQNRKIVLDPAGSMNIYLVLDASDSIGKNNFTGAKKCLSSLIDKVASYG 308 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSRLNKLNPYE-----NTNTYPAM 281 E R + Y S+N + VK L K+ + TNT A+ Sbjct: 309 VEP-----RYAVVTYATEAKAVVKLSDKESSNADWVKQELEKIKYSDHRLKAGTNTKKAL 363 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN---- 337 Y + ++ + TR ++ +TDG + + + E +R Sbjct: 364 TMLYEMMILQESQNDINWNKTR--HVIVLMTDGNYNMGG------DPVAAIEQIREFLDI 415 Query: 338 ---------AGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESF 380 + +Y + + + + + G F V D +L F Sbjct: 416 GRNRKNPRENYLDVYVFGIGPLVDQEKINALASKKDGEKHVFKVKDMEDLENVF 469 >gi|283778201|ref|YP_003368956.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283436654|gb|ADB15096.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 786 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 36/317 (11%), Positives = 91/317 (28%), Gaps = 39/317 (12%) Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 + E+ + I ++++ +T E S+ Sbjct: 200 PMATARYTNTPIEKVSLEATIESSIAIKSVYSPTHAVDVKRPDEKHATVKFEASNYLPTT 259 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMT-SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN 207 ++ DV + + + +N+ L P + + + + Sbjct: 260 DFRLLYDVGDAPLAASVLSYRPDNSDEGFFLMLASPNHSQGEVDLTKKTVIFVVDRSGSM 319 Query: 208 --RKIDVLIESAGNLV------NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 +KI+ E+ ++ ++ + S + ++ Sbjct: 320 QGKKIEQAREAMRYVLNNLHEGDTFNIVAYDSTVESFKPELQKFDDAT-----------R 368 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + ++ L +TN A+ A+ L +++F+TDG + Sbjct: 369 KSALAYVDGLYAGGSTNISGALDSAFAMLTGSDRP-----------NYILFLTDGLPTAG 417 Query: 320 SAYQNTLNTLQICEYMRNAGM---KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 N +I E + + ++ + V + L R ++ GQ V L Sbjct: 418 -----ETNEGKIVELAKQKNVHRARMINFGVGYDVNSRLLDRMSRENFGQSQYVRPDENL 472 Query: 377 LESFDKITDKIQEQSVR 393 S ++ K+ + Sbjct: 473 EASVSRLYSKMSSPVLT 489 >gi|163735880|ref|ZP_02143308.1| hypothetical protein RLO149_07941 [Roseobacter litoralis Och 149] gi|161390816|gb|EDQ15157.1| hypothetical protein RLO149_07941 [Roseobacter litoralis Och 149] Length = 320 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 49/144 (34%), Gaps = 30/144 (20%) Query: 251 QCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P S ++ + ++ T+ + A + + S + Sbjct: 148 FAAPFSFDVEAIARQIESAQIGVSGRATSISDGLGIALKRM----------ENSEAASRV 197 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-----------EGQDL 356 VI ++DG N+ + N + E G++++++A+ Sbjct: 198 VILLSDGVNNAGAT-----NPRGVAELAAQMGVRVHTIALGPKDLSSADPGERGVVDAAT 252 Query: 357 LRKCTD-SSGQFFAVNDSRELLES 379 LR ++ S G+ F V + +L+ Sbjct: 253 LRAISEISGGESFRVRTTEDLVAV 276 >gi|307609426|emb|CBW98915.1| hypothetical protein LPW_07021 [Legionella pneumophila 130b] Length = 1169 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 45/418 (10%), Positives = 103/418 (24%), Gaps = 64/418 (15%) Query: 34 DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI 93 DA L+ + + + ++ + + I+ ++ + + Sbjct: 113 DANGLAPYTVVQGGNKVDNSASRLNVAKAGIQAIIENYMPTTDFALGTYSTSNISSYNTW 172 Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 + P + SA + +S + I L +S Sbjct: 173 VYYMSPPGSDFVFTNTPVAGNRYVTNPCYNYGSASSTVSSNCSSIGNLYGTTL-VSSSQY 231 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 L + S +D + + + Sbjct: 232 LQIGDSSDDPDINDVLY-AGSGFPGIFVSYNGPTPSSPFPPNYTISNYNQGNIRISYANT 290 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP----------LSNNLNEVK 263 S GN +S A + V + + T + V Sbjct: 291 RPSIGNFSSSPTNAGFVPFSQQVMYVQRGFGYYSNQSYATGNMLVNMQTAGTNPTTTSVN 350 Query: 264 SRLNKLNP-----YENTNTY--------PAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + +N P +T T + + ++ T G+ K+++I Sbjct: 351 NAINAFLPHLKPETNSTATTEIKAAAVQSPLAGLLTRSRSFMKTVGTTSGNCPQKQYIIL 410 Query: 311 ITDGENSGASAYQNT-------------------------------LNTLQICEYMRNAG 339 I+DG + + + + + ++N G Sbjct: 411 ISDGLPTQDLQSRYWPPLGSAAATGYGVTATFNADGSLNSTNSQALSDAINEIKALKNDG 470 Query: 340 MKIYSVAVSAPPE------GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 + I+ + + A + LR + G ++ L+ S + I IQ Sbjct: 471 VLIFIIGMGAGVDPAVNPEAAATLRAMAVAGGTENYYPATSPETLVSSLNSILSNIQN 528 >gi|312200955|ref|YP_004021016.1| von Willebrand factor type A [Frankia sp. EuI1c] gi|311232291|gb|ADP85146.1| von Willebrand factor type A [Frankia sp. EuI1c] Length = 618 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 15/123 (12%), Positives = 39/123 (31%), Gaps = 10/123 (8%) Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + + +N L T Y A+ L + + V+ +TDG+N Sbjct: 497 EALVAAVNGLKAKGGTGLYATALAAFESLSAQYQP--------DKPNQVVLLTDGQNDDP 548 Query: 320 SAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 ++ + + N ++ ++ A + L + + + + D + Sbjct: 549 TSSMTLTQLIATLKAEYNPKAPVHIITIGYGADADMDALRQISAATGSKTYPAQDPNSIF 608 Query: 378 ESF 380 + Sbjct: 609 QVM 611 >gi|222616426|gb|EEE52558.1| hypothetical protein OsJ_34813 [Oryza sativa Japonica Group] Length = 517 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 71/189 (37%), Gaps = 28/189 (14%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP--LSNNLNEVK 263 + ++D+L + ++ ++ A R+ +++N +V +++ +++ Sbjct: 87 MSSRLDLLKIAMKYIIKLVRDAD--------RLAIVSFNHAVVSEYGLTRNSADSRKKLE 138 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + ++KL NT+ PA+ A L + F++ ++DG Sbjct: 139 NLVDKLKASGNTDFRPALKKAVEILD-----GRGKEEKKKRVGFILLLSDG---VDQFQY 190 Query: 324 NTLNTLQICEY---------MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV-NDS 373 + +N ++ + +++ SA + L + S G + V + Sbjct: 191 SRINWEKVAKSTDVDHSEVGAMLRKYAVHTFGFSASHDPVPLRQISALSYGLYSFVCKNL 250 Query: 374 RELLESFDK 382 + E+F + Sbjct: 251 DNITEAFAR 259 >gi|313212349|emb|CBY36340.1| unnamed protein product [Oikopleura dioica] Length = 2306 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 63/173 (36%), Gaps = 20/173 (11%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TN 276 + + S A + + +++ I Y+ I ++ + + T Sbjct: 595 DRILSFVSATVDALSGNIQYAAIQYSDVITEEFNFKYRL-KKDLIEEIKAMKYDSGWSTY 653 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T AM A+ L++++ + N + K ++ +TDG + + +I E ++ Sbjct: 654 TGLAMEKAWSMLFDQQFGARNA-----VTKIMVILTDG--------RTKDDIEKISENIK 700 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 NAG +AV D L S F + EL + F + I + Sbjct: 701 NAG-DTTVLAVGLNSAPLDELLTLATSKDLAFY---THELADIFHLLAQLIDK 749 >gi|299148843|ref|ZP_07041905.1| von Willebrand factor type A domain protein [Bacteroides sp. 3_1_23] gi|298513604|gb|EFI37491.1| von Willebrand factor type A domain protein [Bacteroides sp. 3_1_23] Length = 616 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 31/306 (10%), Positives = 86/306 (28%), Gaps = 26/306 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + E G I + +++ + ++ A Y + L P Sbjct: 139 DAVNAEEYGQIQENGFKSVSDAPLSTFSIDVDA-ASYSNMRRFINKGELPPVDAIRTEEL 197 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + + + + P + + N Sbjct: 198 VNYFSYDYPKPTGSDPVKITMEAGACPWNADHRLVRIGLKAREI------PTDNLPASNL 251 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 ++D++ S LVN+++ ++ + Y Sbjct: 252 VFLIDVSGSMWGANRLDLVKSSLKLLVNNLRDKD--------KVAIVTYAGSAGVKLEAT 303 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 ++ +++ +++L +T + AY+ S+ N +I +DG Sbjct: 304 PGSDKQKIREAIDELTAGGSTAGGAGILLAYKIAKKNFISNGNNR--------IILCSDG 355 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + + L Q+ E R +G+ + + ++ + G +++ Sbjct: 356 DFNVGVSSAEGL--EQLIEKERKSGVFLTVLGYGMGNYKDKKIQVLAEKGNGNHAYIDNL 413 Query: 374 RELLES 379 +E Sbjct: 414 QEANRV 419 >gi|297467492|ref|XP_002705103.1| PREDICTED: polydom [Bos taurus] Length = 3396 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 44/132 (33%), Gaps = 25/132 (18%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + + T T A A + L + +E+S K + ITDG ++G Sbjct: 155 LSREIPAITYRGGGTYTKGAFQQAAQILRHSRENST---------KVIFLITDGYSNGG- 204 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLE 378 + I +R+ G++I++ L + ++ E Sbjct: 205 ------DPRPIAASLRDFGVEIFT--FGIWQGNIRELNDMASIPKEEHCYLLHSFEE--- 253 Query: 379 SFDKITDKIQEQ 390 F+ + + + Sbjct: 254 -FEALARRALHE 264 >gi|310824614|ref|YP_003956972.1| vault protein, inter-alpha-trypsin domain-containing protein [Stigmatella aurantiaca DW4/3-1] gi|309397686|gb|ADO75145.1| Vault protein, inter-alpha-trypsin domain protein [Stigmatella aurantiaca DW4/3-1] Length = 749 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 35/300 (11%), Positives = 97/300 (32%), Gaps = 20/300 (6%) Query: 95 KDKNNPLQYIAESKAQYEIPTENLFLKGL-IPSALTNLSLRSTGIIERSSENLAISICMV 153 + + +K ++P ++++ + + + G+ + + +L+ + + Sbjct: 178 VGGSAKNDFTFSAKVSSKVPLKSIYSPTHPMDVSRRGEAEAVVGLEQVNGADLSKDLDLY 237 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN--RKID 211 VS L L + + L+ P + S + + + + ++ Sbjct: 238 FSVSDKAVGLSLLTYKQADEPGYFIALIAPKTEVSASEIAAKRVTFVIDTSGSMQGSRMQ 297 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + ++ V + ++ + + + N+ + + + +L Sbjct: 298 IAKDALKYCVTRLN-----PQDTFNVVRFSTDVEALFPALKSAQPENIQKAVAFVEQLEA 352 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T A+ + + G + ++FITDG+ + + Q Sbjct: 353 IGGTAIDEALVRGLQ----------DNDGKSSAPHLLMFITDGQPTIGE--TDEGAIAQH 400 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + R A ++++ V + L R +D +G V D +E DK+ Sbjct: 401 AKDGRKAKTRLFTFGVGEDLNARLLDRLSSDGAGTSDFVRDGKEFETKISSFYDKVSNPV 460 >gi|148657452|ref|YP_001277657.1| hypothetical protein RoseRS_3346 [Roseiflexus sp. RS-1] gi|148569562|gb|ABQ91707.1| hypothetical protein RoseRS_3346 [Roseiflexus sp. RS-1] Length = 392 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 20/132 (15%), Positives = 41/132 (31%), Gaps = 6/132 (4%) Query: 10 FLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIK 69 F AID A R + ++ D A L +++ DR + + T + ++ Sbjct: 26 IAFAGLAIDGALAFAWRRNVMNSADGAALIATRALIVDRGSVNGIAITNAVRTYLQAEL- 84 Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 G + A ++ ++P A T + +L G++ Sbjct: 85 -----GVDNPDFELTYVNGAGQSMGTVDSSPAPVNARGIGIVVRHTFDTYLMGILGQPTL 139 Query: 130 NLSLRSTGIIER 141 + S Sbjct: 140 TVRGVSAARFGN 151 >gi|218528581|ref|YP_002419397.1| hypothetical protein Mchl_0537 [Methylobacterium chloromethanicum CM4] gi|218520884|gb|ACK81469.1| conserved hypothetical protein [Methylobacterium chloromethanicum CM4] Length = 480 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 20/214 (9%), Positives = 54/214 (25%), Gaps = 9/214 (4%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSG--CASIVSDRTIKDPTTKKD 58 + A+ + + AID A ++ + +Q A DA V++G +V T Sbjct: 41 IVALAATTLMGLVGGAIDYARLVSAQRHIQQATDAGVMAGGNALKLVVSNTASVIGLTTQ 100 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAE-SKAQYEIPTEN 117 K K + + + + + + + A+ + Sbjct: 101 TIQDEIKDSAKNPVTIQVDVASDKTSVTAVVEQTVHLSFGPFVGMSESKVSAKAKASVVG 160 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 ++ + ++ ++ S S + N++ Sbjct: 161 KMRLCMLALD------PAAAGAFNLEKSAQVTAYDCALYSNSSNSGGMVGRNNSMARAQT 214 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 ++ P ++ Sbjct: 215 ICSAGGFKDDRANFTPNPQTNCPVITDPLADRVA 248 >gi|329664002|ref|NP_001193105.1| calcium-activated chloride channel regulator 1 [Bos taurus] Length = 909 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 21/135 (15%), Positives = 55/135 (40%), Gaps = 27/135 (20%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ + A+ + + + +I +TDGE++ T+ Sbjct: 378 TASGGTSICSGLRSAFTVIKKKYPTDGAE---------IILLTDGEDN----------TI 418 Query: 330 QIC-EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND---SRELLESFDKIT- 384 C + ++ +G I++VA+ P Q+L + + G +D + L+++F ++ Sbjct: 419 SACFDEVKQSGAIIHTVALG-PSAAQELEQMSKMTGGLQTYASDQVQNNGLVDAFAALSS 477 Query: 385 --DKIQEQSVRIAPN 397 + ++S+++ Sbjct: 478 GNKAVSQRSIQLESR 492 >gi|15893764|ref|NP_347113.1| heat shock protein DnaK [Clostridium acetobutylicum ATCC 824] gi|15023331|gb|AAK78453.1|AE007561_14 DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Clostridium acetobutylicum ATCC 824] gi|325507887|gb|ADZ19523.1| DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Clostridium acetobutylicum EA 2018] Length = 698 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 23/146 (15%), Positives = 44/146 (30%), Gaps = 22/146 (15%) Query: 247 IVGNQCTPLSNNLNEVKSRLNKLN-PYENTNT-YPAMHHAYRELYNEKESSHNTIGSTRL 304 L+ + E+ ++ L T+T AY L + Sbjct: 557 DKVKTLINLTEDREEIFRAIDGLKKADVGTSTMSEPFSEAYNILKDA-----------YG 605 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 FV+ +TDG+ G + + + ++I ++ +D L K Sbjct: 606 DCFVVVLTDGQWYG------KKDIMAEVNKCKEYEIEIAAIGFG--NAKKDFLDKIATCE 657 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQ 390 + L +SF +I I Sbjct: 658 ENSIF-TEVSNLKQSFSRIAKVISRS 682 >gi|15897956|ref|NP_342561.1| hypothetical protein SSO1091 [Sulfolobus solfataricus P2] gi|284173931|ref|ZP_06387900.1| hypothetical protein Ssol98_04630 [Sulfolobus solfataricus 98/2] gi|13814281|gb|AAK41351.1| Hypothetical protein SSO1091 [Sulfolobus solfataricus P2] gi|261602668|gb|ACX92271.1| von Willebrand factor type A [Sulfolobus solfataricus 98/2] Length = 380 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 23/149 (15%), Positives = 55/149 (36%), Gaps = 33/149 (22%) Query: 253 TPLSNNLNEVKSRLN---------KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 S+N+N +K ++ ++ T Y A+ A + + Sbjct: 78 ITFSSNVNVIKEFVDPLDLTNEILQITAGGQTALYTAILTANSLAKKYQMPT-------- 129 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD- 362 +++ +TDG + + N L +++YS + + LL+ +D Sbjct: 130 ---YLLLLTDGNPTDETNIGNYLKLPYY------EKIQVYSFGIG-DDYNEQLLQSVSDK 179 Query: 363 SSGQFFAVNDSREL-----LESFDKITDK 386 + G + ++D+ E+ ++ +I K Sbjct: 180 TGGVMYHISDANEIPQKLPQKAVTQIAAK 208 >gi|313232459|emb|CBY24127.1| unnamed protein product [Oikopleura dioica] Length = 1632 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 62/187 (33%), Gaps = 33/187 (17%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K VL+ NL N Q V+IG +++ V N E++ ++ Sbjct: 1417 NKDQVLMNFTNNLANMYDTINQ------VKIGLTSFSESSVLEMPLDF-YNQLELQDGVS 1469 Query: 268 KLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + + TN + A ++ S + +I ITDG NT Sbjct: 1470 NMTWQGSFTNITSGVETALNDMDT----------SDAVDDVMILITDGFQ-----STNTT 1514 Query: 327 NTLQICEYMRNAGMKIYSVAVSAP----PEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 Q+ + + G+++ ++ L+ + + + ELL + Sbjct: 1515 LMFQMIDQAKADGVRLIALGFFGDFAFYSPNLYLMT------NEVYHAANYAELLAIDNT 1568 Query: 383 ITDKIQE 389 I + I Sbjct: 1569 IFETICS 1575 >gi|146303120|ref|YP_001190436.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] gi|145701370|gb|ABP94512.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] Length = 394 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 43/242 (17%), Positives = 86/242 (35%), Gaps = 49/242 (20%) Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 + + + + +VLDVS SM +N + L+ + +Y Sbjct: 33 KPVPLDLFIVLDVSGSMG-------IIDNPPEVDDSLIAGTAEVDGHV-----VRYLKDD 80 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 N +++V +E+ NL+ + + V I T + ++ ++ + TP + Sbjct: 81 IGVNNRLEVALEAIRNLLENADTST------RVTIITFSDHVNVLCRRVTP-----STAL 129 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG--ASA 321 L ++ P NT Y A+ A + V+ ITDG + Sbjct: 130 EHLEEIVPDGNTALYSAVKKAISLIDEH-------------PARVLLITDGYPTDVEDET 176 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 + L + + + + +LR D S+G+F+ VND E+ Sbjct: 177 EYSKLEVPRFSQ----------FIPIGVGEYNAKILRSLADLSNGRFYHVNDVSEISRIM 226 Query: 381 DK 382 ++ Sbjct: 227 EE 228 >gi|52549995|gb|AAU83844.1| cell surface protein [uncultured archaeon GZfos34G5] Length = 1357 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 26/167 (15%), Positives = 64/167 (38%), Gaps = 11/167 (6%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + R+G + +N G + L + N+ ++K + +++ + T M A E Sbjct: 1002 DHLEDDDRLGLVLFNTGAELAEPVSLIGAKNMQKLKGDVLEISATDGTRLSAGMQMA-TE 1060 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 LY+E + + + +IF+TD + + +L + N + + + Sbjct: 1061 LYDEF----LEVNQSEYENRIIFLTDAMPNLGQTSEESLLGMTEANA--NKNVYTTFIGI 1114 Query: 348 SAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +L+ T +++V+ +++ E D + + V Sbjct: 1115 GVD-FNTELVEYITKIRGANYYSVHSAKQFKERMDDEFEYMVTPLVF 1160 >gi|225703035|ref|NP_795896.4| integrin alpha-11 precursor [Mus musculus] Length = 1188 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 64/195 (32%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGIVQYGEDAVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|212722920|ref|NP_001131192.1| hypothetical protein LOC100192500 [Zea mays] gi|194690832|gb|ACF79500.1| unknown [Zea mays] Length = 650 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 65/194 (33%), Gaps = 24/194 (12%) Query: 187 KSFWSKNTTKSKYAPAPAPA-----NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 + + S PA AP ++D+L +A +V ++ R+ + Sbjct: 79 DVVAVLDVSGSMNNPAAAPTERTRTTSRLDLLKTAAKFMVAKLEDGD--------RLSIV 130 Query: 242 AYNIGIVGNQCTPL----SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 A++ V + L ++ L++L T PA A + L Sbjct: 131 AFSDRPVRELSSGLLYMTADGRRNAIRSLDQLEARGGTALVPAFEEAVKVLD------GR 184 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 F++ +TDG + ++ + ++ +++ + + LL Sbjct: 185 QGDGGDRLGFIVLLTDGAEDASGSFTLSERRREVIRGAL-RKYPVHAFGLGTAHGPEVLL 243 Query: 358 RKCTDSSGQFFAVN 371 +S G + V+ Sbjct: 244 YLAQESRGTYSFVD 257 >gi|148694080|gb|EDL26027.1| integrin, alpha 11 [Mus musculus] Length = 1172 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 64/195 (32%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y V ++ +V Sbjct: 148 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGIVQYGEDAVHEFHLNDYRSVKDVVEA 205 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 206 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 254 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + Y+VAV + L + FF V D Sbjct: 255 PDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 314 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 315 LKDIVDALGDRIFSL 329 >gi|32394646|gb|AAM62130.1| a11 integrin [Mus musculus] Length = 1188 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 64/195 (32%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGIVQYGEDAVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|48428278|sp|P61622|ITA11_MOUSE RecName: Full=Integrin alpha-11; Flags: Precursor gi|35193068|gb|AAH58716.1| Integrin alpha 11 [Mus musculus] Length = 1188 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 64/195 (32%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGIVQYGEDAVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIRQSEKDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|297183644|gb|ADI19770.1| hypothetical protein [uncultured gamma proteobacterium EB000_37F04] Length = 1181 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/178 (15%), Positives = 51/178 (28%), Gaps = 48/178 (26%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL------------- 304 N + + +N+L T ++ AYR GS Sbjct: 309 NRAALVTTINELTATTWTPLCESLFEAYRYFSGGGVLGGFNGGSLAPPADTSIMNNNRYQ 368 Query: 305 --------KKFVIFITDGENSGASAYQNTLNTLQICEY---------------------- 334 + +++ ITDG + Y + L + + Sbjct: 369 SPMRSCQKQSYLVVITDGVPFYDNDYDSLLRSELALKTGDRFDDSYLPGVAEWMQTRDVN 428 Query: 335 ---MRNAGMKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + + Y++ S L T GQ++A +D+ L S +I +I Sbjct: 429 PNLLGQQNIVTYTIGFSQGANDAADLLAETATRGGGQYYAASDALALQGSLQQIFSEI 486 >gi|149573051|ref|XP_001520334.1| PREDICTED: hypothetical protein, partial [Ornithorhynchus anatinus] Length = 581 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 34/98 (34%), Gaps = 10/98 (10%) Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP---P 351 + + K ++ ITDG+ G S + L + G+ Y++ V Sbjct: 5 NPISGAREDANKILVVITDGQKYGDSLKYSEAIPLAEAK-----GVIRYAIGVGDAFDFS 59 Query: 352 EGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 + L S F V++ L ++ DKI Sbjct: 60 STAEELITIASSPEQDHVFRVDNFGALDNIQQQLQDKI 97 >gi|67078187|ref|YP_245807.1| D-amino acid dehydrogenase, large subunit [Bacillus cereus E33L] gi|66970493|gb|AAY60469.1| conserved hypothetical protein [Bacillus cereus E33L] Length = 452 Score = 53.4 bits (126), Expect = 6e-05, Method: Composition-based stats. Identities = 27/216 (12%), Positives = 62/216 (28%), Gaps = 22/216 (10%) Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 P K S K++ ++ N ++ I + G+ Sbjct: 147 PKEKSLNVEILLDASGSMAGKVNGQVKMEAAKKAIYNYLDKIPDNANVMLRVYGHKGSNN 206 Query: 243 YNI----GIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 N PL + + L+K P T A+ + Sbjct: 207 ENDKSLSCGSSEVMYPLQPYKKEQFNAALSKFGPKGWTPLASAIESVNDDFKE------- 259 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSA-PPEGQ 354 V ++DGE + + + + + + + + E Q Sbjct: 260 -YTGEENLNVVYIVSDGEETCGG------DPVNAAKNLNQSSTHAVVNIIGFDVKNSEQQ 312 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L+ G + V+++ EL ++ + +K+ ++ Sbjct: 313 QLMNTAEAGKGNYATVSNADELYQTLNTEYEKLYKE 348 >gi|260467412|ref|ZP_05813583.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] gi|259028808|gb|EEW30113.1| conserved hypothetical protein [Mesorhizobium opportunistum WSM2075] Length = 354 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 34/337 (10%), Positives = 79/337 (23%), Gaps = 9/337 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + +++ + +++D + R+ Q ALDAA+LS + + T+ + Sbjct: 19 LFFLMLVPIISAVGFSVDYTSAVQTRSNQQQALDAALLSITT-MDTTSTLAQRQAALQDS 77 Query: 61 STIFKKQIKKHLKQ---GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 Q L G+ + K + + + + Sbjct: 78 FIANGGQGTATLNSFVAGTTTAAATARATASFAMPTIFMKIARIDTVPVAVVSAVSKPPS 137 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 L + ++ ++ + M +D + N ++ + Sbjct: 138 LVNATFKVTGVSGYWNKTMTLYGTQFGAAVAKPLMTIDYTYGNTKDPKGYGTTNVSVLTT 197 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI-QKAIQEKKNLSV 236 + +N + P K D + + ++S Sbjct: 198 DS---AGKTVTTLVQNQVCTVGNNPPTGVTLKTDASGTKYYCVDTMYPANSAGAAVDVST 254 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMH-HAYRELYNEKESS 295 G GN +SN+ N T T + + S Sbjct: 255 MGGLYLQMDVPSGNPKKLMSNDPATSNRLYNGAYANSLTETATGQTVDIFSIVPCGATSY 314 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 + D + Sbjct: 315 QAWEDGGNPVPAPVSNADFFYNVTGKCDFNKRPSATA 351 >gi|126277540|ref|XP_001376725.1| PREDICTED: similar to integrin alpha 11 subunit [Monodelphis domestica] Length = 1530 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 36/195 (18%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ EV Sbjct: 513 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKEVVEA 570 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 571 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 619 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ E + Y+VAV + L + FF V D Sbjct: 620 PDLEKVIEDSEKDNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAA 679 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 680 LKDIVDALGDRIFSL 694 >gi|188590759|ref|YP_001922423.1| von Willebrand factor type A domain protein [Clostridium botulinum E3 str. Alaska E43] gi|188501040|gb|ACD54176.1| von Willebrand factor type A domain protein [Clostridium botulinum E3 str. Alaska E43] Length = 984 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 35/238 (14%), Positives = 66/238 (27%), Gaps = 65/238 (27%) Query: 208 RKIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 KI + +A VN I+K + ++ Y + ++ + Sbjct: 105 SKIKKMKNAAMEFVNKIKKIPNLDIDIVTYSTSGYTYLNNGNTEE---------DLLKII 155 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD------------- 313 N + TNT + A L EK + + + + Sbjct: 156 NSIKADGGTNTGEGLRKANYILDLEKNKNADKSIVFMSDGMPTYYSIIAKYYWWDFLKLF 215 Query: 314 ---------------------------------GENSGASAYQNTLNTLQICEYMRNAGM 340 G+ + ++T I ++ Sbjct: 216 PYKSYYDEISNEMIYTDDHIFEINKRNLRVEGSGKEESNNVDKSTNYATTIGNIIKTKKY 275 Query: 341 KIYSVAVSAPPEGQD---LLRKCTDSSG------QFFAVNDSRELLESFDKITDKIQE 389 IYS+ + E L++K +S G F ++D + E F+ I DKI Sbjct: 276 NIYSIGYALGDENSTGNMLMKKIHESMGGIVGEDGTFFMSDENAINEVFNNIGDKIIS 333 >gi|291444745|ref|ZP_06584135.1| lipoprotein [Streptomyces roseosporus NRRL 15998] gi|291347692|gb|EFE74596.1| lipoprotein [Streptomyces roseosporus NRRL 15998] Length = 531 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 36/352 (10%), Positives = 94/352 (26%), Gaps = 25/352 (7%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFK---KQIKKHLKQGSYIRENAGDIAQKAQINI 93 VL+G + + D T ++S + + + Sbjct: 20 VLAGGMLLTACGGGSDGATADRESSGRNAPGVSGGGHPAPDAPATGQGSRESEADRVREG 79 Query: 94 TKDKNNPLQYIAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + + + + A Y L L + + + Sbjct: 80 RAEPDYLSTFALDVDTASYGYARRTLGDGRLPAAEDVRPEEFVNSFRQGYERPKGNGFSV 139 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT---TKSKYAPAPAPANRK 209 +D +R ++ + + L + T + Sbjct: 140 NIDGARINSGKGGGGGTGASDWSLLRVGLATKTAPPTSERPPAALTFVVDISGSMAETGR 199 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNK 268 +D++ +S L + ++ + + ++ + +N N +K +++ Sbjct: 200 LDLVRKSLTILADELRDDDS--------LSLVTFSDEAETRLPMTRVKDNRNRIKDVVSE 251 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + P ++TN + Y E V+ ++D + + Sbjct: 252 MQPAQSTNVEAGIKLGYEESVEGHREGATNR--------VVLLSDALANTGET-EAEGIL 302 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 +I R G+ ++ V V + + + G V D + + F Sbjct: 303 KKIDSARREYGITLFGVGVGSDYGDAFMEQLTNKGDGNTTYVGDETQARKVF 354 >gi|156741348|ref|YP_001431477.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156232676|gb|ABU57459.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 972 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 30/195 (15%), Positives = 64/195 (32%), Gaps = 32/195 (16%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 ++D+ E+ S+ + ++G + ++ ++ E+ Sbjct: 426 GGSRRNRLDLAKEAVYQ--ASLGLTPID------QVGLVVFDDAANWVLPLQRLPSVVEI 477 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + L TN P + +++ + K VI +TDG Sbjct: 478 ERALGSFGIGGGTNIRPGIE----------QAAQALASADAKVKHVILLTDGIA------ 521 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF-- 380 N + MR AG+ I +VA+ +L+ G+ + V ++ F Sbjct: 522 --ESNYSDLIAQMRAAGVTISTVAIGEDAN-PNLVDVANAGGGRSYRVTRIEDVPRIFLQ 578 Query: 381 DKI---TDKIQEQSV 392 + I I E+ + Sbjct: 579 ETIIAAGRDIVEERI 593 >gi|148655419|ref|YP_001275624.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148567529|gb|ABQ89674.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 824 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 37/307 (12%), Positives = 80/307 (26%), Gaps = 77/307 (25%) Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 N + ++LDVS SM + + N P + Sbjct: 400 NRPVQFLLILDVSGSMSWTFDGRGVQN---GQVVTCTNPTQGCVSIETAWPNVQ------ 450 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN-------------------- 244 R+I + V I + + + + + Sbjct: 451 --ERRIYTAKQVLRRFVQQIDQDRRSGLRPHDTVRMVTFTGRLGNYVNNEGRVGDNNRAL 508 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNK--------LNPYENTNTYPAMHHAYRELYNEKESSH 296 + +N+ +++ +N+ T + A A + E + Sbjct: 509 NDLTDVLPAGWTNDRATLEAAINEAGMVDGDPYMTAGATPSAVAFARASQVFAAAPERAP 568 Query: 297 NTIGSTRLKKFVIFITDGENSG-ASAYQNTLNT--------------------------- 328 + ++ VIF+TDG + + QN Sbjct: 569 ---NGMKYRRVVIFVTDGVANVLRNGMQNNYGPGCELGAENVGCQMGDPLPDGSLRPLNA 625 Query: 329 -LQICEYMRNAGMK-----IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ A ++ +Y VA+ +P L + +L + FD Sbjct: 626 MVAEAQALKEAYIRPSDGSVYVVAL-SPTFETTGLNLVASQPDYVKRADMPDQLQQIFDD 684 Query: 383 ITDKIQE 389 I + Sbjct: 685 IQVSAIQ 691 >gi|83643000|ref|YP_431435.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83631043|gb|ABC27010.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 733 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 20/135 (14%), Positives = 53/135 (39%), Gaps = 18/135 (13%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 L + + + L T A+ A + E + V+F+TDG Sbjct: 417 LQQARRFVRGLKADGGTEIAEALDRALSDAAPEG-----------YVRQVVFLTDGSVGN 465 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 A ++ + + ++ ++++V + P + + G + +ND+ E+ + Sbjct: 466 ELALFKQID-----QQLGDS--RLFTVGIGPSPNRFFMRKAAQFGRGAYSHINDTAEVSD 518 Query: 379 SFDKITDKIQEQSVR 393 ++T +++ ++R Sbjct: 519 KIAELTAALRQPALR 533 >gi|330466229|ref|YP_004403972.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328809200|gb|AEB43372.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 319 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 58/205 (28%), Gaps = 40/205 (19%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ +A + V+ + + +V + P S + + + Sbjct: 106 DRLSAAKSAARDFVDGL------PREFNVGLVAF----AGSAAVLVPPSTDREALHDGIR 155 Query: 268 KLNPYE-----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 +L T A+ + + E + +I ++DG N+ Sbjct: 156 RL-AEGITGVQGTAIGEAISTSLGAVKALDEQATT----QPPPARIIVLSDGANTSGMDP 210 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPP--------------EGQDLLRKCTDSSGQFF 368 E +++++ P +G+ L + G F Sbjct: 211 MEAAAEAVAFEV------PVHTISFGTPGGFVDRGGRPIQVPVDGETLQAVAEQTGGAFH 264 Query: 369 AVNDSRELLESFDKITDKIQEQSVR 393 + S EL +D I + + R Sbjct: 265 QADTSDELHAVYDDIGSSVGWRKER 289 >gi|310799477|gb|EFQ34370.1| von Willebrand factor type A domain-containing protein [Glomerella graminicola M1.001] Length = 698 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 30/204 (14%), Positives = 63/204 (30%), Gaps = 13/204 (6%) Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 P + + AP PA K + S +LV + + E N R+G Sbjct: 83 DHVPCDIVLVIDVSGSMGCNAPVPANPGEKAENYGLSVLDLVKHAARTVLETLNDGDRLG 142 Query: 240 TIAYNIGIVG-NQCTPL-SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 + + + TP+ + N + +N + P + TN + + + Sbjct: 143 IVTFASKAKVLQKLTPMDAKNKALAEKIINGMRPDDATNLWHGLLEGIKLFNT------C 196 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 + ++ +TDG + + M I++ LL Sbjct: 197 GEMNMGRVPAMMVLTDGMPNH---MCPNQGYVPKLRGMEQLSASIHTFGFGYSLRS-GLL 252 Query: 358 RKCTD-SSGQFFAVNDSRELLESF 380 + + G + + D+ + F Sbjct: 253 KSIAEIGGGNYSFIPDAGMIGTVF 276 >gi|170591963|ref|XP_001900739.1| Zona pellucida-like domain containing protein [Brugia malayi] gi|158591891|gb|EDP30494.1| Zona pellucida-like domain containing protein [Brugia malayi] Length = 447 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 19/115 (16%), Positives = 40/115 (34%), Gaps = 17/115 (14%) Query: 265 RLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + ++ TNT A+ A + + K + +TDG++ + A Sbjct: 4 AIQRIKYLSGATNTGAALKFALERGFQNARGG-------NIPKVAVVVTDGQSQDSVAES 56 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + +R+A + +Y+V V L + + + V EL + Sbjct: 57 --------AQQLRDAHVMVYAVGV-TNLVNVHQLHQIAGNPARVLTVESFDELSK 102 >gi|332256727|ref|XP_003277467.1| PREDICTED: collagen alpha-1(VI) chain, partial [Nomascus leucogenys] Length = 1104 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 66/190 (34%), Gaps = 18/190 (9%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNE 261 P +D + +++++ + + ++ A + L+ + Sbjct: 54 PYGALVDKVKSFTKRFIDNLRD-RYYRCDRNLVWNAGALHYSDEVEIIQGLTRMPGGRDA 112 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +KS ++ + + T T A+ +L SH K++I +TDG Sbjct: 113 LKSSVDAVKYFGKGTYTDCAIKKGLEQLLVG--GSHLKEN-----KYLIVVTDGHPLEGY 165 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF---FAVNDSRELL 377 ++ G+K++SVA+ P + L + + F D + Sbjct: 166 KEPCG-GLEDAVNEAKHLGVKVFSVAI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGQSR 222 Query: 378 ESFDKITDKI 387 ++ + I+ I Sbjct: 223 DAEEVISQTI 232 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 56/185 (30%), Gaps = 16/185 (8%) Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 N + +K +F S + + + D L A Sbjct: 896 KNVTAQICIDKKCPDYTCPITFSSPADITILLDGSASVGSHNFDTTRRFTKRLAERFLTA 955 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHH 283 + VR+ + Y+ N + S ++ ++ + T+ A+ + Sbjct: 956 GRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAIDAMDFINDATDVNDALGY 1015 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 R + + KK ++ +DG + G + + + AG++I+ Sbjct: 1016 VTRFYRDASSGAA--------KKRLLLFSDGNSQG----ATPAAIEKAVQEAQRAGIEIF 1063 Query: 344 SVAVS 348 V V Sbjct: 1064 VVVVG 1068 >gi|326478189|gb|EGE02199.1| U-box domain-containing protein [Trichophyton equinum CBS 127.97] Length = 741 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 64/211 (30%), Gaps = 19/211 (9%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE-SAGNLVNSIQKAIQEKKNLSVR 237 +P P + + S + AP P + S +L K I E N R Sbjct: 63 NDVPHVPCDIVLVIDISGSMNSAAPIPTGERGGEDTGLSILDLTKHAAKTIIETLNEKDR 122 Query: 238 IGTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 + + + + N N + V S ++KL +TN + + L Sbjct: 123 LAVVTFCTEVNVAFELDSMNKENKSTVLSAIDKLYGKSSTNLWHGIKKGLNVLATNPVRG 182 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK-----IYSVAVSAP 350 + + ++ +TDG + + + + I++ Sbjct: 183 NI--------QSLLVLTDGAPNH-MCPVQGYVPKLRQTLLDHRNLTGTLPLIHTFGFGYY 233 Query: 351 PEGQDLLRKCTDSSGQFF-AVNDSRELLESF 380 LL+ + G F + D+ + F Sbjct: 234 LRSP-LLQSIAEIGGGIFAFIPDAGMIGTVF 263 >gi|326474578|gb|EGD98587.1| hypothetical protein TESG_05957 [Trichophyton tonsurans CBS 112818] Length = 741 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 32/211 (15%), Positives = 64/211 (30%), Gaps = 19/211 (9%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE-SAGNLVNSIQKAIQEKKNLSVR 237 +P P + + S + AP P + S +L K I E N R Sbjct: 63 NDVPHVPCDIVLVIDISGSMNSAAPIPTGERGGEDTGLSILDLTKHAAKTIIETLNEKDR 122 Query: 238 IGTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 + + + + N N + V S ++KL +TN + + L Sbjct: 123 LAVVTFCTEVNVAFELDSMNKENKSTVLSAIDKLYGKSSTNLWHGIKKGLNVLATNPVRG 182 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK-----IYSVAVSAP 350 + + ++ +TDG + + + + I++ Sbjct: 183 NI--------QSLLVLTDGAPNH-MCPVQGYVPKLRQTLLDHRNLTGTLPLIHTFGFGYY 233 Query: 351 PEGQDLLRKCTDSSGQFF-AVNDSRELLESF 380 LL+ + G F + D+ + F Sbjct: 234 LRSP-LLQSIAEIGGGIFAFIPDAGMIGTVF 263 >gi|317122034|ref|YP_004102037.1| hypothetical protein Tmar_1197 [Thermaerobacter marianensis DSM 12885] gi|315592014|gb|ADU51310.1| hypothetical protein Tmar_1197 [Thermaerobacter marianensis DSM 12885] Length = 207 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 17/151 (11%), Positives = 42/151 (27%), Gaps = 6/151 (3%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDR-----TIKDPTTK 56 +++ V + +D ++ +R Q+ D A L+ I D Sbjct: 55 FVLVLPVLLAAVGLGLDAGRLVVVRAHAQAVADLAGLAAVQEIDEDAFARGEPALREAAA 114 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 + +++ + A + + +AE + Sbjct: 115 AATARQWAEDGLRRAFGDAVAEDATVDVVVVNASPASPRRHPWSGRRVAEPTVGVRL-VV 173 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLA 147 + L L A L++ + + + A Sbjct: 174 PVRLGWLPAVAAIPLTVTADASVAVERQTAA 204 >gi|261414506|ref|YP_003248189.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261370962|gb|ACX73707.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] Length = 227 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 14/85 (16%), Positives = 33/85 (38%), Gaps = 2/85 (2%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T AM+ A L K + G + +++ +TDG +G+ A + Sbjct: 88 QFYADGGTPMGEAMNMALDMLEKRKS-EYKASGVDYYQPWIVLMTDGMPNGSQAELSRSI 146 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPE 352 C+ + + + I+ + + + Sbjct: 147 QRT-CDMINDRKLTIFPIGIGEDAD 170 >gi|182438638|ref|YP_001826357.1| hypothetical protein SGR_4845 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178467154|dbj|BAG21674.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 578 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 32/277 (11%), Positives = 74/277 (26%), Gaps = 21/277 (7%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 A Y L L + + +D +R Sbjct: 130 TASYGYARRTLGDGRLPAPEEVRPEEFVNSFRQGYERPKGSGFSVNVDGARIGAGKGGGG 189 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNT---TKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 ++ + + L S + T ++D++ +S L + + Sbjct: 190 GTGASDWSLLRVGLATEAAPSTAERPPAALTFVVDISGSMAETGRLDLVRKSLAVLTDEL 249 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + + ++ L N N +K ++++ P ++TN + Sbjct: 250 RDDDS--------VSLVTFSDAAETRLPMTRLQGNRNRIKDAVDEMRPEQSTNVEAGITR 301 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 Y E V+ ++D + + + R G+ ++ Sbjct: 302 GYEESVEGHRKGATNR--------VVLLSDALANTGDTEADGILERIDSTR-REYGITLF 352 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 V V + + R G V D + + F Sbjct: 353 GVGVGSDYGDAFMERLTNKGDGNTTYVGDEAQARKVF 389 >gi|313215187|emb|CBY42862.1| unnamed protein product [Oikopleura dioica] Length = 289 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 52/174 (29%), Gaps = 13/174 (7%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 N I E + S+R +N + N+++E K + P Y Sbjct: 123 FFENIISTINVEPSDSSIRFAFSFFNHAYIEFFAFDWLNSIDEYKWAFSSFPPASGNANY 182 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A + + V+ +T+ +T ++ + ++ Sbjct: 183 IG--RALKGAADTMTPEFGKGRRIDTVGTVVLLTN--------AASTDEVNEMADQLKEK 232 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 ++ V + GQD L S + +S +L I D+I Sbjct: 233 VDRVIVVGLGY-AFGQDELAGIASSPTKENLYIAEESSDLAGLVKTIADEICAT 285 >gi|311245368|ref|XP_003121804.1| PREDICTED: integrin alpha-11-like [Sus scrofa] Length = 1055 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 177 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 234 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 235 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 283 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 284 PDLEKVIQQSEKDNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAA 343 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 344 LKDIVDALGDRIFSL 358 >gi|162449478|ref|YP_001611845.1| hypothetical protein sce1208 [Sorangium cellulosum 'So ce 56'] gi|161160060|emb|CAN91365.1| unnamed protein product [Sorangium cellulosum 'So ce 56'] Length = 607 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 42/134 (31%), Gaps = 10/134 (7%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 ++ L+ + NT+ E+ V+ +TDG+ + Sbjct: 101 AEAALDTVIARGNTDLGGGWLRGCAEVGAHLPEDAIGR--------VLLLTDGQANHGIT 152 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + L + + +R + ++ + L R + G F+ + EL Sbjct: 153 SPDELTSRARSQRLRR--VTTSTIGLGEGFNEFLLGRLSEEGGGNFYFAARADELPGFVG 210 Query: 382 KITDKIQEQSVRIA 395 + ++ R A Sbjct: 211 REIGEVLSVVARDA 224 >gi|86358602|ref|YP_470494.1| hypothetical protein RHE_CH03000 [Rhizobium etli CFN 42] gi|86282704|gb|ABC91767.1| hypothetical conserved protein [Rhizobium etli CFN 42] Length = 780 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 19/167 (11%), Positives = 48/167 (28%), Gaps = 16/167 (9%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + + I +N + + + L T PA+ A R Sbjct: 369 SRLNPNDRFNVIRFDDTMTDYFKGLVAATPDNREKAVAYVRSLTADGGTEMLPALEDALR 428 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + + V+F+TDG + ++++V Sbjct: 429 NQGPVASGA---------LRQVVFLTDGAIGNEQQLFQEI-------TANRGDARVFTVG 472 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + P + + G F + + ++ ++ K+Q ++ Sbjct: 473 IGSAPNTYFMTKAAEVGRGTFTQIGSTDQVASRMSELFAKLQNPTMT 519 >gi|194226345|ref|XP_001488401.2| PREDICTED: similar to Collagen alpha-1(VI) chain [Equus caballus] Length = 1027 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 33/225 (14%), Positives = 76/225 (33%), Gaps = 19/225 (8%) Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 ++ P F+ + T A P +D + +++++ Sbjct: 20 QDDTAAVRTVAFQDCPVDLFFVLD-TSESVALRLKPYGALVDKVKAFTKRFIDNLRD-RY 77 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNEVKSRLNKLNPYE-NTNTYPAMHHAY 285 + + ++ A + L+ + +E+K+ ++ + + T T A+ Sbjct: 78 YRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDELKASVDAVKYFGKGTYTDCAIKKGL 137 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 EL SH K++I +TDG ++ G+K++SV Sbjct: 138 EELLVG--GSHLKEN-----KYLIVVTDGHPLEGYKEPCG-GLEDAVNEAKHLGIKVFSV 189 Query: 346 AVSAPPEGQDLLRKCTDSSGQF---FAVNDSRELLESFDKITDKI 387 A+ P + L + + F D + ++ + I+ I Sbjct: 190 AI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGQSRDAEEIISQTI 232 >gi|114586161|ref|XP_001172548.1| PREDICTED: hypothetical protein [Pan troglodytes] Length = 543 Score = 53.4 bits (126), Expect = 7e-05, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 41/117 (35%), Gaps = 16/117 (13%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T A++ + K +++++I ITDG + Sbjct: 172 GTRTGKALNFTLPFFDSSKGGRP------SVQQYLIVITDGVAQDNVIIP--------AK 217 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +R+ + I+++ V + LL T+ + + + L +I K+ + Sbjct: 218 ALRDKNIIIFAIGVG-EAKKSQLLE-ITNDEDKVYHDVNFEALQNLEKEILSKVCDP 272 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 17/94 (18%), Positives = 37/94 (39%), Gaps = 12/94 (12%) Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + +++I ITDG++S + + E +R G+ IY++ + D Sbjct: 6 ADTSRINVARYLIVITDGKSSDS--------VAEAAEGLRANGVNIYAIGI--REANIDE 55 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L++ + F V + L + ++ I Sbjct: 56 LKEIAK--DKIFFVYEFDLLKDIQKEVVQDICSS 87 >gi|314927767|gb|EFS91598.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL044PA1] Length = 322 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 51/162 (31%), Gaps = 25/162 (15%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P S + V L+ + + T A+ + + + S R+ Sbjct: 142 SAHPEIRMPPSTDRPTVLRALDGIELQDGTALGEAIDKSLQAVKMAPGGS-----KDRVP 196 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PE 352 ++ ++DG N+ + + + +Y++A Sbjct: 197 AAIVMLSDGGNTQGGSPLVAATHAAAAK------VPVYTIAFGTETGYVDLDGQRERVAP 250 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 LL D + + + + + +L E + ++ + + VR Sbjct: 251 DTKLLSDVADRTDAKSWTADSADKLQEVYKQVHSSVGYEPVR 292 >gi|301755498|ref|XP_002913610.1| PREDICTED: von Willebrand factor A domain-containing protein 2-like [Ailuropoda melanoleuca] Length = 765 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 22/164 (13%), Positives = 63/164 (38%), Gaps = 19/164 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G + ++ ++ EVK+++ ++ T T A+ + R Sbjct: 82 DINPERVRVGALQFSSAPRLEFPLDSFSSQQEVKAKIKRMVFKGGRTETGLALKYLLRRG 141 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ + + ++ ITDG + G + ++ G+ +++V V Sbjct: 142 FPGGRNA-------SVPQILVVITDGRSQGPVELP--------AKQLKERGVTVFAVGVR 186 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 P + L + V + ++ ++ + + + + +V Sbjct: 187 FPRWEE--LHTLASEP-REQHVLMAEQVDDATNGLLSTLSKSAV 227 >gi|281347736|gb|EFB23320.1| hypothetical protein PANDA_001404 [Ailuropoda melanoleuca] Length = 708 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 22/164 (13%), Positives = 63/164 (38%), Gaps = 19/164 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G + ++ ++ EVK+++ ++ T T A+ + R Sbjct: 40 DINPERVRVGALQFSSAPRLEFPLDSFSSQQEVKAKIKRMVFKGGRTETGLALKYLLRRG 99 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ + + ++ ITDG + G + ++ G+ +++V V Sbjct: 100 FPGGRNA-------SVPQILVVITDGRSQGPVELP--------AKQLKERGVTVFAVGVR 144 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 P + L + V + ++ ++ + + + + +V Sbjct: 145 FPRWEE--LHTLASEP-REQHVLMAEQVDDATNGLLSTLSKSAV 185 >gi|239987768|ref|ZP_04708432.1| hypothetical protein SrosN1_10718 [Streptomyces roseosporus NRRL 11379] Length = 527 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 36/352 (10%), Positives = 94/352 (26%), Gaps = 25/352 (7%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFK---KQIKKHLKQGSYIRENAGDIAQKAQINI 93 VL+G + + D T ++S + + + Sbjct: 4 VLAGGMLLTACGGGSDGATADRESSGRNAPGVSGGGHPAPDAPATGQGSRESEADRVREG 63 Query: 94 TKDKNNPLQYIAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + + + + A Y L L + + + Sbjct: 64 RAEPDYLSTFALDVDTASYGYARRTLGDGRLPAAEDVRPEEFVNSFRQGYERPKGNGFSV 123 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT---TKSKYAPAPAPANRK 209 +D +R ++ + + L + T + Sbjct: 124 NIDGARINSGKGGGGGTGASDWSLLRVGLATKTAPPTSERPPAALTFVVDISGSMAETGR 183 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNK 268 +D++ +S L + ++ + + ++ + +N N +K +++ Sbjct: 184 LDLVRKSLTILADELRDDDS--------LSLVTFSDEAETRLPMTRVKDNRNRIKDVVSE 235 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + P ++TN + Y E V+ ++D + + Sbjct: 236 MQPAQSTNVEAGIKLGYEESVEGHREGATNR--------VVLLSDALANTGET-EAEGIL 286 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 +I R G+ ++ V V + + + G V D + + F Sbjct: 287 KKIDSARREYGITLFGVGVGSDYGDAFMEQLTNKGDGNTTYVGDETQARKVF 338 >gi|114586163|ref|XP_526141.2| PREDICTED: similar to alpha 3 type VI collagen isoform 1 precursor [Pan troglodytes] Length = 891 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 30/181 (16%), Positives = 62/181 (34%), Gaps = 19/181 (10%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---E 273 L + +VR G + Y+ N ++ L+KL Sbjct: 671 VNFLKTIVSSLSIRP--DTVRFGLVFYSEEPRLEFSLDAFQNPAKILEHLDKLTYRERKG 728 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T A+ E++ +++ S + +++ + IT+G + + Sbjct: 729 RTKTGAALDFLRNEVFIQEKGSW---SNHGVQQIAVVITEGFSQDRVSRP--------AS 777 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV--NDSRELLESFDKITDKIQEQS 391 +R AG+ IY+V + L K ++V +L K+T ++ + Sbjct: 778 RLRRAGVTIYAVG-THNVSESKDLEKIASYPPWKYSVPLESFLQLSVVGSKLTHQLCSEM 836 Query: 392 V 392 V Sbjct: 837 V 837 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 28/165 (16%), Positives = 55/165 (33%), Gaps = 16/165 (9%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + + VR+G YN I + + ++ L TNT A+ Sbjct: 232 DISSDRVRVGLAQYNDNIYPAFQLNQHPLKSMILEQIQNLPYRTGGTNTGSALEFIRTNY 291 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E+ S R+ + VI +TD E++ ++ + ++ G+ +Y V Sbjct: 292 LTEESGSRAK---DRVPQIVILVTDRESND--------EVQEVADRLKEDGVVVY--VVG 338 Query: 349 APPEGQDLLRKCTDSS-GQF-FAVNDSRELLESFDKITDKIQEQS 391 + L+K +F F + L + I + Sbjct: 339 VNVQDVQELQKIASEPFEKFLFNTENFNILQDFSGSILQTLCSAV 383 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 21/141 (14%), Positives = 48/141 (34%), Gaps = 16/141 (11%) Query: 260 NEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++V + + A+ L + + + + S + + + I+ G Sbjct: 59 SDVLRHIRQFQFKPGGKKMGLALKFI---LDHHFQEASGSRASQGVPQIAVVISSGPVED 115 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF--AVNDSREL 376 E +R AG+ +Y++ V LR+ S + F V + L Sbjct: 116 HVHGP--------VEALRRAGILLYAIGV--RDAVWAELREIASSPQENFTSFVPNFSGL 165 Query: 377 LESFDKITDKIQEQSVRIAPN 397 K+ ++ + + AP+ Sbjct: 166 SNLAQKLRQELCDMLAKAAPH 186 >gi|326674787|ref|XP_003200204.1| PREDICTED: integrin alpha-10-like [Danio rerio] Length = 1170 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 36/197 (18%), Positives = 64/197 (32%), Gaps = 25/197 (12%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I E L N + K + +++G + Y V +V Sbjct: 172 GSNSIYPWYEVQNFLSNILSKFHISPEQ--MQVGVLQYGEISVHEWSLRDYQTTADVVEA 229 Query: 266 LNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++ E T T A+ A E ++ + K +I +TDGE+ Sbjct: 230 AKNISRQEGRETRTAYAIQMACTEAFSPDRGAR-----EGATKVMIVVTDGESHDGE--- 281 Query: 324 NTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSSG--QFFAVNDS 373 + L CE + Y++AV P + ++ + FF V D Sbjct: 282 DLPEALIECE---KRNITRYAIAVLGHYIRRQQDPETFINEIKYISSDPDEKYFFNVTDE 338 Query: 374 RELLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 339 AALNDIVDALGDRIFSL 355 >gi|156602970|ref|XP_001618750.1| hypothetical protein NEMVEDRAFT_v1g153509 [Nematostella vectensis] gi|156200182|gb|EDO26650.1| predicted protein [Nematostella vectensis] Length = 133 Score = 53.0 bits (125), Expect = 7e-05, Method: Composition-based stats. Identities = 25/135 (18%), Positives = 51/135 (37%), Gaps = 20/135 (14%) Query: 257 NNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 NN +V ++K+ P +T T A+ + E+ + + +I +TDG+ Sbjct: 11 NNKKDVLEAVDKMPYPKGSTYTGRALQYMNDEI-------YRKATRVGVPNILIVLTDGK 63 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDS 373 + A + +R+ G++IYS+ V + L F+V D Sbjct: 64 AHDSVAEP--------AKALRDIGIEIYSIGVG-ESYDKAELDAIATDPDASHVFSV-DF 113 Query: 374 RELLESFDKITDKIQ 388 + + + +I Sbjct: 114 KNMNSIVSTLDARIC 128 >gi|260799772|ref|XP_002594858.1| hypothetical protein BRAFLDRAFT_124445 [Branchiostoma floridae] gi|229280095|gb|EEN50869.1| hypothetical protein BRAFLDRAFT_124445 [Branchiostoma floridae] Length = 930 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 55/134 (41%), Gaps = 21/134 (15%) Query: 258 NLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N+ S +N + T T A+ A +E E+ + K +I ++DG Sbjct: 247 NMTTTLSAINAIKYQKGATYTPKALDRARKEALWRGEA---------VPKVMIVLSDG-- 295 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRE 375 ++ ++ + + + +AG+ +Y++ V + LL ++ + ++D Sbjct: 296 ------RSAIDVTEASKALADAGIIVYAIGVGRADHDELLL--IANNDLSKVIELSDFNA 347 Query: 376 LLESFDKITDKIQE 389 L+ D + + + E Sbjct: 348 LIAEIDLLAEVVCE 361 Score = 44.5 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 17/87 (19%), Positives = 34/87 (39%), Gaps = 1/87 (1%) Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + S ++ + + + ITDG G + + + R+AG+ +YSV P Sbjct: 441 ADQSIGSVFRDDVPRAAVIITDGSAQGDADGLVMGDYADQADEARDAGITVYSVPNGIPG 500 Query: 352 EGQDL-LRKCTDSSGQFFAVNDSRELL 377 + L + F++ D +L Sbjct: 501 FEDIVALEAISGGPDNVFSMYDPCQLA 527 >gi|148676058|gb|EDL08005.1| inter-alpha (globulin) inhibitor H5, isoform CRA_a [Mus musculus] Length = 918 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 59/156 (37%), Gaps = 10/156 (6%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 I + I + + P++ +N+ K + L+P T+ A+ A + L N Sbjct: 294 QDRFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQAAIKLLNNY 353 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + R +IF+TDG+ + NTL L + + I++V + Sbjct: 354 VAQNDIE---DRSVSLIIFLTDGKPTFGE--TNTLKILSNTKEATRGQICIFTVGIGDDV 408 Query: 352 EGQDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + + L L C + +L+ +D+I Sbjct: 409 DFKLLEKLSLENCGLTRRVHEEDKAGAQLIGFYDEI 444 >gi|27369644|ref|NP_766059.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Mus musculus] gi|81873944|sp|Q8BJD1|ITIH5_MOUSE RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor gi|26352482|dbj|BAC39871.1| unnamed protein product [Mus musculus] gi|37589944|gb|AAH43314.2| Inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|38328214|gb|AAH62196.1| Inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|74145221|dbj|BAE22250.1| unnamed protein product [Mus musculus] gi|122889674|emb|CAM13913.1| inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|123858038|emb|CAM26660.1| inter-alpha (globulin) inhibitor H5 [Mus musculus] gi|148676059|gb|EDL08006.1| inter-alpha (globulin) inhibitor H5, isoform CRA_b [Mus musculus] Length = 952 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 59/156 (37%), Gaps = 10/156 (6%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 I + I + + P++ +N+ K + L+P T+ A+ A + L N Sbjct: 328 QDRFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQAAIKLLNNY 387 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + R +IF+TDG+ + NTL L + + I++V + Sbjct: 388 VAQNDIE---DRSVSLIIFLTDGKPTFGE--TNTLKILSNTKEATRGQICIFTVGIGDDV 442 Query: 352 EGQDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + + L L C + +L+ +D+I Sbjct: 443 DFKLLEKLSLENCGLTRRVHEEDKAGAQLIGFYDEI 478 >gi|74183702|dbj|BAE24467.1| unnamed protein product [Mus musculus] Length = 952 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 30/156 (19%), Positives = 59/156 (37%), Gaps = 10/156 (6%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 I + I + + P++ +N+ K + L+P T+ A+ A + L N Sbjct: 328 QDRFNIIGFSNRIKMWKDHLLPVTPDNIRNGKIYMYHLSPTGGTDINGALQAAIKLLNNY 387 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + R +IF+TDG+ + NTL L + + I++V + Sbjct: 388 VAQNDIE---DRSVSLIIFLTDGKPTFGE--TNTLKILSNTKEATRGQICIFTVGIGDDV 442 Query: 352 EGQDL----LRKCTDSSGQFFAVNDSRELLESFDKI 383 + + L L C + +L+ +D+I Sbjct: 443 DFKLLEKLSLENCGLTRRVHEEDKAGAQLIGFYDEI 478 >gi|332844134|ref|XP_510503.3| PREDICTED: integrin alpha-11 [Pan troglodytes] Length = 1188 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|332236004|ref|XP_003267196.1| PREDICTED: integrin alpha-11 [Nomascus leucogenys] Length = 1188 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|297296749|ref|XP_001083531.2| PREDICTED: integrin alpha-11 [Macaca mulatta] Length = 1149 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|296213559|ref|XP_002753319.1| PREDICTED: integrin alpha-11 [Callithrix jacchus] Length = 1188 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|239941305|ref|ZP_04693242.1| hypothetical protein SrosN15_09946 [Streptomyces roseosporus NRRL 15998] Length = 516 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 36/352 (10%), Positives = 94/352 (26%), Gaps = 25/352 (7%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFK---KQIKKHLKQGSYIRENAGDIAQKAQINI 93 VL+G + + D T ++S + + + Sbjct: 5 VLAGGMLLTACGGGSDGATADRESSGRNAPGVSGGGHPAPDAPATGQGSRESEADRVREG 64 Query: 94 TKDKNNPLQYIAES-KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + + + + A Y L L + + + Sbjct: 65 RAEPDYLSTFALDVDTASYGYARRTLGDGRLPAAEDVRPEEFVNSFRQGYERPKGNGFSV 124 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT---TKSKYAPAPAPANRK 209 +D +R ++ + + L + T + Sbjct: 125 NIDGARINSGKGGGGGTGASDWSLLRVGLATKTAPPTSERPPAALTFVVDISGSMAETGR 184 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNK 268 +D++ +S L + ++ + + ++ + +N N +K +++ Sbjct: 185 LDLVRKSLTILADELRDDDS--------LSLVTFSDEAETRLPMTRVKDNRNRIKDVVSE 236 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + P ++TN + Y E V+ ++D + + Sbjct: 237 MQPAQSTNVEAGIKLGYEESVEGHREGATNR--------VVLLSDALANTGET-EAEGIL 287 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 +I R G+ ++ V V + + + G V D + + F Sbjct: 288 KKIDSARREYGITLFGVGVGSDYGDAFMEQLTNKGDGNTTYVGDETQARKVF 339 >gi|159473306|ref|XP_001694780.1| flagellar associated protein [Chlamydomonas reinhardtii] gi|158276592|gb|EDP02364.1| flagellar associated protein [Chlamydomonas reinhardtii] Length = 4349 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 21/199 (10%), Positives = 70/199 (35%), Gaps = 21/199 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN-QCTPLSNNLNEVK-SRL 266 +I+++ E+ L++ + +G ++Y+ + + ++ + + + Sbjct: 990 RIELVRETCHFLIDQLTADDY--------LGIVSYSNTVREDVPLLRMTPEARRLAHTMI 1041 Query: 267 NKLNPYENTNTYPAMHHAYRE-------LYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + L + T Y + ++ L ++ S+R+ TDG+ + Sbjct: 1042 SSLTLHGGTALYAGLEAGVKQQMAAASELKALAAAAGGGSDSSRIVHSCFLFTDGQATTG 1101 Query: 320 SAYQNTL-NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 N + + + + + +++ + L SG ++ ++ + ++ Sbjct: 1102 PCTVNEIMGQMTSLQSPADQNITVHTFGFGDDHSVELLQGVAEAQSGVYYYISCADDIPS 1161 Query: 379 SFDKITDKIQEQSVRIAPN 397 F D + +A + Sbjct: 1162 GF---GDALGGLLAVVAKD 1177 >gi|158258322|dbj|BAF85134.1| unnamed protein product [Homo sapiens] Length = 1188 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|149692293|ref|XP_001495918.1| PREDICTED: integrin, alpha 11 [Equus caballus] Length = 1188 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|119598226|gb|EAW77820.1| integrin, alpha 11, isoform CRA_a [Homo sapiens] Length = 1189 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|5915662|gb|AAD51919.2|AF137378_1 integrin alpha 11 subunit precursor [Homo sapiens] gi|119598227|gb|EAW77821.1| integrin, alpha 11, isoform CRA_b [Homo sapiens] Length = 1188 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|6013141|gb|AAF01258.1|AF109681_1 integrin alpha-11 subunit precursor [Homo sapiens] Length = 1189 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|52485853|ref|NP_001004439.1| integrin alpha-11 precursor [Homo sapiens] gi|313104119|sp|Q9UKX5|ITA11_HUMAN RecName: Full=Integrin alpha-11; Flags: Precursor gi|189442879|gb|AAI67840.1| Integrin, alpha 11 [synthetic construct] Length = 1188 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|297265178|ref|XP_002799142.1| PREDICTED: collagen alpha-3(VI) chain [Macaca mulatta] Length = 2568 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 59/163 (36%), Gaps = 15/163 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 866 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGRALEFVARNLF---V 922 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 923 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 972 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + L+ T+ F V + REL ++I + AP Sbjct: 973 RAELQTITNDPRLVFTVREFRELPNIEERIMTSFGTSAATPAP 1015 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1041 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1100 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1101 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1150 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1151 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1200 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 58/189 (30%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 432 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNQQDVVN 486 Query: 265 RLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L NT A+ R + S T + + +I +T + Sbjct: 487 AVRQLTLLGGPIPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTAERSGDDVRN 543 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ R+L Sbjct: 544 PSVV--------LKRGGAV--PIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 593 Query: 383 ITDKIQEQS 391 I+D++ + + Sbjct: 594 ISDRVTQLT 602 >gi|109101588|ref|XP_001084624.1| PREDICTED: collagen alpha-3(VI) chain isoform 5 [Macaca mulatta] Length = 2969 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 26/163 (15%), Positives = 59/163 (36%), Gaps = 15/163 (9%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VR+G + ++ + + V + +L + NT A+ R L+ Sbjct: 1267 VRVGVVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGRALEFVARNLF---V 1323 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ + G++ + + +R++G+ S+ V Sbjct: 1324 KSAGSRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNID 1373 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + L+ T+ F V + REL ++I + AP Sbjct: 1374 RAELQTITNDPRLVFTVREFRELPNIEERIMTSFGTSAATPAP 1416 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 1442 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 1501 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 1502 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 1551 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 1552 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 1601 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 58/189 (30%), Gaps = 20/189 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ N +V + Sbjct: 833 GVRSGFPLLKEFVQRVVESL-----DVGQDRVRVAVVQYSDRTRPEFYLNSYMNQQDVVN 887 Query: 265 RLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L NT A+ R + S T + + +I +T + Sbjct: 888 AVRQLTLLGGPIPNTGAALEFVLRNILVSSAGSRIT---EGVPQLLIVLTAERSGDDVRN 944 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ R+L Sbjct: 945 PSVV--------LKRGGAV--PIGIGIGNADITEMQTISFIPDFAVAIPTFRQLGTVQQV 994 Query: 383 ITDKIQEQS 391 I+D++ + + Sbjct: 995 ISDRVTQLT 1003 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 54/180 (30%), Gaps = 22/180 (12%) Query: 206 ANRKIDVLIESA--GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + + LVN ++K +R+G + ++ + +V Sbjct: 43 GSNNTGSVNFAVILDFLVNLLEKLPI--GTQQIRVGVVQFSDEPRTMFSLDTYSTKAQVL 100 Query: 264 SRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L N A+ + + + + + ++ I+ G Sbjct: 101 GAVKALGFAGGELANIGLALDFVVENHFT---RAGGSRVEEGVPQVLVLISAGP------ 151 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV---NDSRELLE 378 + + + ++ A ++S + A + L+ F V + +L E Sbjct: 152 SSDEIRYGVVA--LKQA--SVFSFGLGAQAASRAELQHIATDDNLVFTVPEFHSFGDLQE 207 >gi|218458530|ref|ZP_03498621.1| hypothetical protein RetlK5_03343 [Rhizobium etli Kim 5] Length = 185 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 19/116 (16%), Positives = 39/116 (33%), Gaps = 11/116 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +T I + + F ID+ + +Q+A+DA L+G + + Sbjct: 20 LTLIAMPMLLGFSLLIIDVGRSSNLHTDLQNAVDAMALAGARELDGRDD------AITRA 73 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 T +K + + ++ + I++T D N +IP Sbjct: 74 QTAIEK-----ISNSAAFSAGGTGMSLGSHISVTYDAGNDAGSTVTVLFLKDIPAN 124 >gi|297290486|ref|XP_001113553.2| PREDICTED: complement factor B isoform 1 [Macaca mulatta] Length = 1266 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 38/237 (16%), Positives = 75/237 (31%), Gaps = 38/237 (16%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANR-----KIDVLIESAGNLVNSIQKAIQEKKNLSV 236 P +K + + S + + NL+ + + + V Sbjct: 755 PGEQQKRRIILDPSGSMNIYLVLDGSDSIGAGNFTGAKKCLVNLIEKVASYGVKPRYALV 814 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHHAYRELYNE 291 T V +Q S+N + V +L+++N TNT A+ Y + Sbjct: 815 TYATYPRIWVKVSDQE---SSNADWVTKKLSEINYEDHKLKSGTNTKRALQAVYSMM--S 869 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN-------------A 338 G R + +I +TDG ++ + + + + +R+ Sbjct: 870 WPEDIPPEGWNRTRHVIILMTDGLHNMGG------DPITVIDEIRDLLYIGKDRKNPRED 923 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDKIQEQSV 392 + +Y V P Q + F V D L + F ++ D+ Q S+ Sbjct: 924 YLDVYVFGVG-PLVDQVNINALASKKDNEQHVFKVKDMENLEDVFFQMIDESQSLSL 979 >gi|5726289|gb|AAD48398.1|AF127035_1 calcium-activated chloride channel protein 2 [Homo sapiens] Length = 917 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 56/133 (42%), Gaps = 24/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T+ + +A++ + + S V+ +TDGE++ AS+ Sbjct: 379 PLGGTSICSGIKYAFQVI--------GELHSQLDGSEVLLLTDGEDNTASSC-------- 422 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKI---T 384 + ++ +G ++ +A+ + ++ + G F V+D + L+++F + Sbjct: 423 -IDEVKQSGAIVHFIALGR-AADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGN 480 Query: 385 DKIQEQSVRIAPN 397 + ++S+++ Sbjct: 481 TDLSQKSLQLESK 493 >gi|328907234|gb|EGG27000.1| aerotolerance protein BatA [Propionibacterium sp. P08] Length = 307 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 20/162 (12%), Positives = 51/162 (31%), Gaps = 25/162 (15%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P S + V L+ + + T A+ + + + S R+ Sbjct: 127 SAHPEIRMPPSTDRPTVLRALDGIELQDGTALGEAIDKSLQAVKMAPGGS-----KDRVP 181 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PE 352 ++ ++DG N+ + + + +Y++A Sbjct: 182 AAIVMLSDGGNTQGGSPLVAATHAAAAK------VPVYTIAFGTETGYVDLDGQRERVAP 235 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 LL D + + + + + +L E + ++ + + VR Sbjct: 236 DTKLLSDVADRTDAKSWTADSADKLQEVYKQVHSSVGYEPVR 277 >gi|313241793|emb|CBY34008.1| unnamed protein product [Oikopleura dioica] Length = 694 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 31/325 (9%), Positives = 88/325 (27%), Gaps = 28/325 (8%) Query: 69 KKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSAL 128 + + +Y N G + I + + ++ + Sbjct: 57 RDFFLRYAYTETNYGLVQHHHVQQIWSIDEAQFNTTKTVALKSSVSSKFNVDWDKLEYDE 116 Query: 129 TNLSLRST-GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 + S + I +D ++ + + D ++ + Sbjct: 117 LTIPTYSMLALFLYLEHLNEFPIPYSIDAQAAIYESITHHNVDGFKDGVDELDQITQDQC 176 Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 + + + + ++ + + + A RI ++ Sbjct: 177 RTNALDIVFV-VDESGSIGTNNFQLIKDFLEHFASDSTIAADA-----TRIAIRPFSSSN 230 Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 + + + + TNT A+ A + ++ K Sbjct: 231 YLYFSLN-DFKTKNIINEIKNMPYNEGGTNTADALDAALTDYGTDRP---------ESVK 280 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSS 364 ++ ITDG ++ + L+T + ++N ++ +++ V L S+ Sbjct: 281 VMVTITDGASN------SFLSTSAAADRVKNDLRNIQSFAIGV--SGANMAELNAIAISA 332 Query: 365 GQFFAVNDSRELLESFDKITDKIQE 389 F +N + + K+ E Sbjct: 333 KHVFMLNGWADFGPIKSNLLQKVCE 357 >gi|260803822|ref|XP_002596788.1| hypothetical protein BRAFLDRAFT_73696 [Branchiostoma floridae] gi|229282048|gb|EEN52800.1| hypothetical protein BRAFLDRAFT_73696 [Branchiostoma floridae] Length = 547 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 25/117 (21%), Positives = 43/117 (36%), Gaps = 11/117 (9%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHN----------TIGSTRLKKFVIFITDGENS 317 + T + P + A L E S + T + + I ++DG Sbjct: 92 GIPLGMYTISNPGLPFAISNLTQEGGLSRTGHALSFMTDTSKFRTGIPRTAILLSDGFPQ 151 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-EGQDLLRKCTDSSGQFFAVNDS 373 + Q + E R+AG+ +Y+V V A D+L T SS + F ++ Sbjct: 152 SDANAQAMDDYEAQAEAARDAGIDLYAVGVGAAGLVNWDVLETITGSSDRVFRSDNP 208 >gi|109731121|gb|AAI13690.1| Chloride channel accessory 4 [Homo sapiens] gi|109731369|gb|AAI13688.1| Chloride channel accessory 4 [Homo sapiens] gi|313883598|gb|ADR83285.1| chloride channel accessory 4 [synthetic construct] Length = 917 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 56/133 (42%), Gaps = 24/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T+ + +A++ + + S V+ +TDGE++ AS+ Sbjct: 379 PLGGTSICSGIKYAFQVI--------GELHSQLDGSEVLLLTDGEDNTASSC-------- 422 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKI---T 384 + ++ +G ++ +A+ + ++ + G F V+D + L+++F + Sbjct: 423 -IDEVKQSGAIVHFIALGR-AADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGN 480 Query: 385 DKIQEQSVRIAPN 397 + ++S+++ Sbjct: 481 TDLSQKSLQLESK 493 >gi|298481574|ref|ZP_06999766.1| von Willebrand factor type A domain protein [Bacteroides sp. D22] gi|298272438|gb|EFI14007.1| von Willebrand factor type A domain protein [Bacteroides sp. D22] Length = 616 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 30/306 (9%), Positives = 86/306 (28%), Gaps = 26/306 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + E G+ + +++ + ++ A Y + L P Sbjct: 139 DAANAEEYGEFQENGFKSVSDAPLSTFSIDVDA-ASYSNMRRFINKGELPPVDAIRTEEL 197 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + S + + P + + N Sbjct: 198 VNYFSYDYPKPTGSDPVKITMESGACPWNTNHRLVRIGLKAKEI------PTDNLPASNL 251 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 ++D++ S L N+++ ++ + Y+ Sbjct: 252 VFLIDVSGSMWGANRLDLVKSSLKLLENNLRDKD--------KVAIVTYSGSAGVKLEAT 303 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 ++ +++ +++L +T + AY+ S+ N +I +DG Sbjct: 304 PGSDKQKIREAIDELTAGGSTAGGAGIMLAYKIAKKNLISNGNNR--------IILCSDG 355 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + + L Q+ E R +G+ + + ++ + G +++ Sbjct: 356 DFNVGVSSAEGL--EQLIEKERKSGVFLTVLGYGMGNYKDKKIQVLAEKGNGNHAYIDNL 413 Query: 374 RELLES 379 +E Sbjct: 414 QEANRV 419 >gi|332827674|gb|EGK00413.1| hypothetical protein HMPREF9455_03261 [Dysgonomonas gadei ATCC BAA-286] Length = 402 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 18/173 (10%), Positives = 54/173 (31%), Gaps = 7/173 (4%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 A + I E ++ +S+ ++ ++ V + + PL +++ Sbjct: 62 ATGSMSGLIGAAKEKIWSITSSLSQSEPVP-DIEVGMLFYRDRGDDFITRIIPLGTDMDN 120 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + +L ++ + LY + + + + D Sbjct: 121 LYEQLMAMDASGG---GDGPESVNQALYEGV-NKMQWDNLPNTYRAIFLVGDYPPHMD-- 174 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 Y+N ++ + C G+ I ++ + P + ++ + + D Sbjct: 175 YRNDVHYPETCSEGIKKGIVINTILMGNEPTAARIWKEIAGKTKGEYIQTDMS 227 >gi|288574994|ref|ZP_06393351.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Dethiosulfovibrio peptidovorans DSM 11002] gi|288570735|gb|EFC92292.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Dethiosulfovibrio peptidovorans DSM 11002] Length = 1057 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 40/351 (11%), Positives = 84/351 (23%), Gaps = 41/351 (11%) Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSL 133 GS +N + + L Y + +L L+ Sbjct: 75 DGSKPFKNQAYYGDDTDSSNNDPDDGSLSYYPPVTYLSDEEVSDLRYDTLLGLMGRKGHR 134 Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 R + + D S + ++ Sbjct: 135 YLHPNDSRMYILKKVLWSIFTDPSMVEGLKIGLCTYHQREKYGVPGSGYVSYEFPSYAWL 194 Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 K + + + ++++ A + Sbjct: 195 GWYWKRQKLSWQPTGENKAVKRLSLDVIDPFFYAPSSFSGGVPDDKLGTSHWYD----LL 250 Query: 254 PLSNNLNEVKSRLNKLNPYENT------NTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 L + + K+ ++L T + A AY + E + + Sbjct: 251 ALIDGVETSKN--DELRAVGATPLEKSIYSKGARDCAYEFIKEEIDY-------PCQDNW 301 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYM----------RNAGMKIYSVAVSAPPEGQDLL 357 +I +TDGE+S + A + + ++ + + + D L Sbjct: 302 LIVLTDGEDSSSDADPPAAVKKLYEANLDDTWPKPYGKKAQPVRTFVIGLV--DSQSDTL 359 Query: 358 RKCTDSS----------GQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 D + D+ LLE+F I IQ+ AP + Sbjct: 360 DAMADEGRAWEVDESIKKTAYYATDTESLLEAFRTIFRTIQKNRSSSAPPK 410 >gi|327285304|ref|XP_003227374.1| PREDICTED: sushi, von Willebrand factor type A, EGF and pentraxin domain-containing protein 1-like [Anolis carolinensis] Length = 3587 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 18/121 (14%), Positives = 39/121 (32%), Gaps = 24/121 (19%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T A A + L + + ++ K + ITDG ++G + I Sbjct: 168 GGGTYTKGAFQQAAQILLHSRANAT---------KVIFLITDGYSNGG-------DPRPI 211 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 +R G++I++ L + ++ E F+ + + Sbjct: 212 AASLREFGVEIFT--FGIWQGNIRELNDMASHPKEEHCYLLHSFTE----FEALARRALH 265 Query: 390 Q 390 + Sbjct: 266 E 266 >gi|198436525|ref|XP_002124218.1| PREDICTED: similar to laminin, beta 2 [Ciona intestinalis] Length = 671 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 26/189 (13%), Positives = 68/189 (35%), Gaps = 22/189 (11%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-------NLN 260 +++ E L + + V + Y+ + NQ + N + Sbjct: 50 SNFNIVKEWVKQLAAGM-HIGHGHIQIGVVQYSHWYHNRPLNNQRYIKTEIELGEHMNKD 108 Query: 261 EVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + ++++ T T A++ + ++ + K+ +I +TDG+ + + Sbjct: 109 DFDRAVDRIKYQGFKTYTAHAINKTLEFDFLGPKNRY-----PDAKRALILLTDGKATDS 163 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLE 378 S + + + R+ G+ IY+V V L+ T + F + + +L Sbjct: 164 SYLPDVIQHAE-----RDRGVIIYAVGVG--DFDSSELQLITHHHENREFELENFEDLDS 216 Query: 379 SFDKITDKI 387 + + ++ Sbjct: 217 IVNSLQFQL 225 >gi|119593590|gb|EAW73184.1| chloride channel, calcium activated, family member 4, isoform CRA_a [Homo sapiens] Length = 917 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 56/133 (42%), Gaps = 24/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T+ + +A++ + + S V+ +TDGE++ AS+ Sbjct: 379 PLGGTSICSGIKYAFQVI--------GELHSQLDGSEVLLLTDGEDNTASSC-------- 422 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKI---T 384 + ++ +G ++ +A+ + ++ + G F V+D + L+++F + Sbjct: 423 -IDEVKQSGAIVHFIALGR-AADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGN 480 Query: 385 DKIQEQSVRIAPN 397 + ++S+++ Sbjct: 481 TDLSQKSLQLESK 493 >gi|119593591|gb|EAW73185.1| chloride channel, calcium activated, family member 4, isoform CRA_b [Homo sapiens] Length = 918 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 56/133 (42%), Gaps = 24/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T+ + +A++ + + S V+ +TDGE++ AS+ Sbjct: 380 PLGGTSICSGIKYAFQVI--------GELHSQLDGSEVLLLTDGEDNTASSC-------- 423 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKI---T 384 + ++ +G ++ +A+ + ++ + G F V+D + L+++F + Sbjct: 424 -IDEVKQSGAIVHFIALGR-AADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGN 481 Query: 385 DKIQEQSVRIAPN 397 + ++S+++ Sbjct: 482 TDLSQKSLQLESK 494 >gi|150036262|ref|NP_036260.2| calcium-activated chloride channel regulator 4 [Homo sapiens] gi|205831469|sp|Q14CN2|CLCA4_HUMAN RecName: Full=Calcium-activated chloride channel regulator 4; AltName: Full=Calcium-activated chloride channel family member 4; Short=hCLCA4; AltName: Full=Calcium-activated chloride channel protein 2; Short=CaCC-2; Short=hCaCC-2; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 110 kDa form; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 30 kDa form; Flags: Precursor gi|37182063|gb|AAQ88834.1| CLCA4 [Homo sapiens] gi|56203696|emb|CAI22170.1| chloride channel, calcium activated, family member 4 [Homo sapiens] Length = 919 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 56/133 (42%), Gaps = 24/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T+ + +A++ + + S V+ +TDGE++ AS+ Sbjct: 379 PLGGTSICSGIKYAFQVI--------GELHSQLDGSEVLLLTDGEDNTASSC-------- 422 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKI---T 384 + ++ +G ++ +A+ + ++ + G F V+D + L+++F + Sbjct: 423 -IDEVKQSGAIVHFIALGR-AADEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGN 480 Query: 385 DKIQEQSVRIAPN 397 + ++S+++ Sbjct: 481 TDLSQKSLQLESK 493 >gi|156523144|ref|NP_001095986.1| inter-alpha-trypsin inhibitor heavy chain H5 precursor [Bos taurus] gi|187609595|sp|A2VE29|ITIH5_BOVIN RecName: Full=Inter-alpha-trypsin inhibitor heavy chain H5; Short=ITI heavy chain H5; Short=ITI-HC5; Short=Inter-alpha-inhibitor heavy chain 5; Flags: Precursor gi|126010782|gb|AAI33545.1| ITIH5 protein [Bos taurus] Length = 940 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 54/132 (40%), Gaps = 11/132 (8%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N++ + K ++ ++P T+ A+ + L + + R V+F+TDG+ Sbjct: 353 NSIRDGKVYIHHMSPSGGTDINGALQRGIQLLNDYVAHNDIE---DRSVSLVVFLTDGKP 409 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL----LRKCTDSSGQFFAVND 372 + + + + I++V + A + + L L C + + +D Sbjct: 410 TVGETHTFKILNNT--REATRGRVCIFTVGIGADVDFKLLEKLSLENCGLTR-RVHEDHD 466 Query: 373 S-RELLESFDKI 383 + +L+ +D+I Sbjct: 467 ARAQLIGFYDEI 478 >gi|120437735|ref|YP_863421.1| von Willebrand factor (vWA) type A domain-containing protein [Gramella forsetii KT0803] gi|117579885|emb|CAL68354.1| membrane protein containing von Willebrand factor (vWA) type A domain [Gramella forsetii KT0803] Length = 354 Score = 53.0 bits (125), Expect = 8e-05, Method: Composition-based stats. Identities = 25/120 (20%), Positives = 46/120 (38%), Gaps = 23/120 (19%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYN 290 S R+G IAY G P++ + + K L LN T A+ A + Sbjct: 128 SDRVGIIAYAGGAYPQ--LPITTDFSAAKMFLQALNTDMISSQGTAISDAIELATTYYDD 185 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 +++++ + + I+DGE+ N I E G++I+++ V Sbjct: 186 DQQTN----------RVLFIISDGEDHEG-------NVEDIAEQAAEKGIRIFTIGVGTE 228 >gi|313245449|emb|CBY40179.1| unnamed protein product [Oikopleura dioica] Length = 377 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 34/291 (11%), Positives = 68/291 (23%), Gaps = 26/291 (8%) Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 + + + L P+ + R+ N Sbjct: 65 EATTTADFNIVTDLPPATEAPETTEGPTCEWTIWTEWDKCSETCGGGQRNRYRNPTGDIN 124 Query: 170 DNN----NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 K P + K L N V Sbjct: 125 AAGCEGFAEDVEYCNTQDCETKQCKDNYVDVCFLLPVHNATDNKSVRL---MRNFVRETH 181 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN--EVKSRLNK---LNPYENTNTYPA 280 I + ++ Y+ S + + ++K+ L + + N A Sbjct: 182 NYIGNFGSEDLQFCVYQYSESAANVFSLSESADFDSLDLKTALEQGIEIPEDRGANIGAA 241 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + E +N + ++ +I +TD N+ + Sbjct: 242 FKTIHDEGFNAING---WRKNDQIPSVLIVLTDNLNTVDFYDDLQYVHNKA--------Y 290 Query: 341 KIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDKITDKIQE 389 ++ +V + E L S F V D EL D++ I + Sbjct: 291 RVVAVGIGENVENSS-LSSIASLPSDENVFTVRDFTELSNVVDEVGYDICQ 340 >gi|228954949|ref|ZP_04116966.1| Gram positive anchor protein [Bacillus thuringiensis serovar kurstaki str. T03a001] gi|228804676|gb|EEM51278.1| Gram positive anchor protein [Bacillus thuringiensis serovar kurstaki str. T03a001] Length = 997 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 35/254 (13%), Positives = 68/254 (26%), Gaps = 58/254 (22%) Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG----------TIAY 243 A + + ++ N+V+ + A S + + Sbjct: 66 VDFVIVQDASGSFKGTMPNVKKALSNIVDELNPATDRIMVTSYQDYKGYKASDGRVLESR 125 Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG--- 300 GI L+NN K+ +NK+ P T T + A E K + Sbjct: 126 GNGIKTTLQAGLTNNFTSAKNGINKITPDSGTPTASGLQFALAEYEKAKGQNDPDRETVF 185 Query: 301 --------STRLKKFVIFITDGENSGASAYQNTLNTLQI----------CEYMRNAGMKI 342 + R ++ +T+ + Y++ + ++NAG K+ Sbjct: 186 LLVTDGVANIRKDGYIYKLTNQLTDQGTGYRSNEYGQDYVGALKEVTTEAQNIKNAGYKL 245 Query: 343 YSVAV-------------------SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + A P + L+K F ND E Sbjct: 246 VT-AFWEDKSILAASDQYYTKYDKEVGPYARQELKKMATKPEWFVLANDIEEFT------ 298 Query: 384 TDKIQEQSVRIAPN 397 + E ++ Sbjct: 299 -KNLIETVTKVTKK 311 >gi|148694797|gb|EDL26744.1| complement factor B, isoform CRA_g [Mus musculus] Length = 541 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 30/205 (14%), Positives = 71/205 (34%), Gaps = 19/205 (9%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + NL+ + + + T+ + V ++ S++ + Sbjct: 55 SDSIGSSNFTGAKRCLTNLIEKVASYGVRPRYGLLTYATVPKVLVRVSDER---SSDADW 111 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN+++ TNT A+ Y + ++ R + +I +TDG + Sbjct: 112 VTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAPPEGWN--RTRHVIIIMTDGLH 169 Query: 317 SGASAYQNTLNTLQICEYM-------RNAGMKIYSVAVS--APPEGQDLLRKCTDSSGQF 367 + + ++ + R + +Y V + L D+ Sbjct: 170 NMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALASKKDNEHHV 229 Query: 368 FAVNDSRELLESFDKITDKIQEQSV 392 F V D +L F ++ D+ + S+ Sbjct: 230 FKVKDMEDLENVFYQMIDETKSLSL 254 >gi|56797867|emb|CAG27567.1| matrilin-4 [Danio rerio] Length = 548 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 48/147 (32%), Gaps = 18/147 (12%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKES 294 RIG + Y+ + + ++ +N++ P T T A+ +A ++ +E Sbjct: 59 TRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEEG 118 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + + +TDG A +G++IY+V V Sbjct: 119 AR-----PNVPHVAVIVTDGRPQDRVAEVAAAARE--------SGIEIYAVGV--ARADM 163 Query: 355 DLLRKCTDSS--GQFFAVNDSRELLES 379 LR F V + + Sbjct: 164 TSLRAMASPPFEDHVFLVESFDLIHQF 190 >gi|306922629|gb|ADN07507.1| collagen, type VII, alpha 1 [Microtus ochrogaster] Length = 2189 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 45/134 (33%), Gaps = 17/134 (12%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR T+ Y+ + + + +L+ NT T A+ H ++ Sbjct: 74 SAQGVRFATVQYSDDPQTEFGLDALGSGGDTVRAIRELSYKGGNTRTGAALRHVSDHVFL 133 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + K I ITDG++ + ++ G+K+++V + Sbjct: 134 PHLTRPG------IPKVCILITDGKSQDLVDP--------AAQKLKGQGVKLFAVGI--K 177 Query: 351 PEGQDLLRKCTDSS 364 + L++ Sbjct: 178 NADPEELKRVASQP 191 >gi|306922621|gb|ADN07500.1| collagen, type VII, alpha 1 [Microtus ochrogaster] Length = 2189 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 45/134 (33%), Gaps = 17/134 (12%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYN 290 VR T+ Y+ + + + +L+ NT T A+ H ++ Sbjct: 74 SAQGVRFATVQYSDDPQTEFGLDALGSGGDTVRAIRELSYKGGNTRTGAALRHVSDHVFL 133 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + K I ITDG++ + ++ G+K+++V + Sbjct: 134 PHLTRPG------IPKVCILITDGKSQDLVDP--------AAQKLKGQGVKLFAVGI--K 177 Query: 351 PEGQDLLRKCTDSS 364 + L++ Sbjct: 178 NADPEELKRVASQP 191 >gi|89098949|ref|ZP_01171829.1| hypothetical protein B14911_06266 [Bacillus sp. NRRL B-14911] gi|89086353|gb|EAR65474.1| hypothetical protein B14911_06266 [Bacillus sp. NRRL B-14911] Length = 940 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 30/327 (9%), Positives = 80/327 (24%), Gaps = 13/327 (3%) Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSL 133 + + + +I + + + A++ + +N +G I LT Sbjct: 10 SAILMLQLLILMLSYTGKDILAAEEAGVDFSAKASQSVIVKPQNSNAEGSIDFHLTPKGK 69 Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLY-LQKHNDNNNMTSNKYLLPPPPKKSFWSK 192 + + ++ S + K + K P + F Sbjct: 70 ATNANRDPIDVVFVFDKSGSMNDSGKNPQKFQSAKDAMTAAVNFFKENAGPNDRFGFVPF 129 Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 + + L + NS+ S+ + Sbjct: 130 DDDVETGKVVNFAPENNMASLNL-INSNSNSLSALGGTNYTQSLDAALGMFGNSTNNKYV 188 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 +++ + + N + + V T Sbjct: 189 LFMTDGEPTFSKVIERTTY----NERACFIVCWDTGKKITGNVVMNYEVYNQGFSVYHKT 244 Query: 313 DGENSGASAYQNTLNT------LQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSSG 365 +G + L + + + +K++S+ + E LR+ + ++G Sbjct: 245 NGHTYKQDFDLAETKSAIKKHGLSMAQKLAMKDIKLFSIGFGSNTELDMGYLRELSSTTG 304 Query: 366 QFFAVNDSRELLESFDKITDKIQEQSV 392 + F I+ + S+ Sbjct: 305 VTARQATQENISSIFRDISADMDTPSI 331 >gi|296331310|ref|ZP_06873782.1| hypothetical protein BSU6633_09411 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305676296|ref|YP_003867968.1| hypothetical protein BSUW23_18120 [Bacillus subtilis subsp. spizizenii str. W23] gi|296151425|gb|EFG92302.1| hypothetical protein BSU6633_09411 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305414540|gb|ADM39659.1| putative exported protein [Bacillus subtilis subsp. spizizenii str. W23] Length = 281 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 28/218 (12%), Positives = 65/218 (29%), Gaps = 23/218 (10%) Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 + + + S RKID+ +S + + K + Sbjct: 76 SPSFAAEKQADTNVAVLFDGSGSMIQKTGGERKIDIAKKSVKSFAELLPKDTNLMLRVFG 135 Query: 237 RIGTIAYN----IGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 G + L + + L+++ P T A+ +E Sbjct: 136 HAGNNKLSGKALSCSTTETIYGLHPYEGSLFDNSLSEIKPTGWTPIAKALSDTRKEFEAF 195 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSA 349 K V ITDGE + + E +R + + + + + Sbjct: 196 DADG---------KNVVYLITDGEETCGG------DPAAEIEKLRESNVDTIVNIIGFNF 240 Query: 350 PPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDK 386 +G + +++ G++ + N + E ++++ K Sbjct: 241 DIKGNEEMKQAAVAGGGEYISANSADEFEQAWEIEAQK 278 >gi|262195149|ref|YP_003266358.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262078496|gb|ACY14465.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 412 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 30/265 (11%), Positives = 75/265 (28%), Gaps = 36/265 (13%) Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 S+ + + + + P S ++ + + Sbjct: 90 DSLGQCQEGELPPDAAVDCPNIEVETDPIIPTVQFLIDFSGSMDQNFGGIKRSQAVRNAL 149 Query: 218 ----GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL----NKL 269 +V +Q ++ +L Q P NNL +++ + N Sbjct: 150 FDEDDGVVALLQSQVRFGASLYTSFDGNEAPPCPRLTQVAPAFNNLTALRADIGGPLNDP 209 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +T T ++ + + ++ TDGE + + Sbjct: 210 PNAGDTPTGESIDAIAENFPDNGPNDKP---------LIVLATDGEPDSCTDPDPNTDPG 260 Query: 330 QIC---------EYMRNAGMKIYSVAVSAPPEGQDLLRKC----------TDSSGQFFAV 370 + + AG+++Y ++V L R ++ + Sbjct: 261 RAATRRLSEEATQRAFEAGIELYVLSVGNDVGADHLQRVANAGVGKALDESNDPATVYIG 320 Query: 371 NDSRELLESFDKITDKIQEQSVRIA 395 N+ +EL+++F +I + +A Sbjct: 321 NNQQELVDAFSEIIRSARSCEFTLA 345 >gi|229595993|ref|XP_001013840.2| hypothetical protein TTHERM_00427610 [Tetrahymena thermophila] gi|225565664|gb|EAR93595.2| hypothetical protein TTHERM_00427610 [Tetrahymena thermophila SB210] Length = 2471 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 22/213 (10%), Positives = 64/213 (30%), Gaps = 24/213 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN------QCTPLS 256 + I + + +++K + + + + Sbjct: 2248 TGSMSNLITQTKNTIQTTFEQARDILKQKGYDPQCFQIMICCFRSYNSKFEEILEASSWE 2307 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT---- 312 NN ++++S L K+ T ++ + + + + Sbjct: 2308 NNPDKLRSYLQKITASGGTYPGESVEVGLWWANKQSDENPIGQVIVLGDQPAHLQNEAQS 2367 Query: 313 ----DGENSGASAYQNTL-NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSG- 365 G+ S L + C+ +++ + + + + + + + G Sbjct: 2368 HREKHGQLYWDSTPLKGLTYYVPECDKLQSKNVPVNTFYL--NQSAKQTYEEIASLTKGI 2425 Query: 366 -QFFAVNDSR---ELLESF-DKITDKIQEQSVR 393 QF +N ++ EL +F ++I I ++ R Sbjct: 2426 SQFLDINSAQSSKELTNAFVEQILKDIGKEDGR 2458 >gi|330469792|ref|YP_004407535.1| von willebrand factor type a [Verrucosispora maris AB-18-032] gi|328812763|gb|AEB46935.1| von willebrand factor type a [Verrucosispora maris AB-18-032] Length = 565 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 58/189 (30%), Gaps = 14/189 (7%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 A A + V E+A +N + PLS Sbjct: 381 AVPTANGATRQQVTAEAARRGLNLFDDS-WSIGLWVFSTRLDGSRDYRQVVPTGPLSRQR 439 Query: 260 NEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + ++ L+ + T Y + AY+++ E ++ TDG+ + Sbjct: 440 STLERSLDTITSSSGDTGLYDTLLAAYKDVQQNWEPGKV--------NSIVLFTDGK-NE 490 Query: 319 ASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + L + +++ +++ + + +L + G F D ++ Sbjct: 491 DADGISQRQLLAELKRIKDPDQPIQVIIIGIGTEVSKAELDTIAQSAGGGAFVAADPTKI 550 Query: 377 LESF-DKIT 384 + F I Sbjct: 551 GDIFLRAIA 559 >gi|56797873|emb|CAG27570.1| matrilin-4 [Danio rerio] Length = 428 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 48/147 (32%), Gaps = 18/147 (12%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKES 294 RIG + Y+ + + ++ +N++ P T T A+ +A ++ +E Sbjct: 59 TRIGVVQYSSQVQNVFSLKAFSKTEQMVKAINEIIPLAQGTMTGLAIRYAMNVAFSAEEG 118 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + + +TDG A +G++IY+V V Sbjct: 119 AR-----PNVPHVAVIVTDGRPQDRVAEVAAAARE--------SGIEIYAVGV--ARADM 163 Query: 355 DLLRKCTDSS--GQFFAVNDSRELLES 379 LR F V + + Sbjct: 164 TSLRAMASPPFEDHVFLVESFDLIHQF 190 >gi|332860822|ref|XP_001152090.2| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Pan troglodytes] Length = 1312 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 32/214 (14%), Positives = 74/214 (34%), Gaps = 20/214 (9%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 LPP K + + + S + K++ + +++ +Q Sbjct: 272 FAPRGLPPMEKNVVFVIDVSSSMFG-------TKMEQTKMAMNVILSDLQANDYFNIISF 324 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 + G + N++ K L+ + T+ A+ A L + + Sbjct: 325 SDTVNVWKAGGSIQATI----QNVHSAKDYLHCMEADGWTDINSALLAAASVLNHSNQ-E 379 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 S +IF+TDGE + + + L + ++S+A Sbjct: 380 PGRGPSVGRIPLIIFLTDGEPTAGVTTPSVI--LSNVRQALGHRVSLFSLAFG-DDADFT 436 Query: 356 LLRKCT-DSSG---QFFAVNDSR-ELLESFDKIT 384 LLR+ + ++ G + + D+ +L +++I+ Sbjct: 437 LLRRLSLENRGIARRIYEDTDAALQLKGLYEEIS 470 >gi|324503617|gb|ADY41568.1| Cuticlin-1 [Ascaris suum] Length = 786 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 56/156 (35%), Gaps = 15/156 (9%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEK 292 +I + Y+ + N +++ L K+ T T A+ A + ++ Sbjct: 71 DRTQIAVMQYSSYTRVEFGFTANPNKEKLRLALQKIRHISGTTRTGKALDKALQ-VFKHG 129 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 E+S + + + + ++DG + + + + +R AG++I ++ + A Sbjct: 130 ETSGARVNQEDVAQIAVVVSDGHSHD--------DPVPAAQRLRRAGVQILTLGIGA-HI 180 Query: 353 GQDLLRKCTDSSGQFFA-VNDSRELLES---FDKIT 384 L T F + L + F KI Sbjct: 181 NMGELIDITGDETLAFQNLTSQASLDKFVSQFRKIA 216 >gi|291486252|dbj|BAI87327.1| putative exported protein [Bacillus subtilis subsp. natto BEST195] Length = 224 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 31/218 (14%), Positives = 67/218 (30%), Gaps = 23/218 (10%) Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 + S + S RKID+ +S + + K + Sbjct: 19 SPSFAAEKQADSNVAVLFDGSGSMVQKTGGERKIDIAKKSVKSFAELLPKDTNLMLRVFG 78 Query: 237 RIGTIAYN----IGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 G + L + + L++L P T A+ +E Sbjct: 79 HAGNNKLSGKALSCSTTETIYGLHPYEGSLFDNSLSELKPTGWTPIAKALADTRKEFEAF 138 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSA 349 K V ITDGE + + E +R + + + + + Sbjct: 139 DADG---------KNVVYLITDGEETCGG------DPAAEIEKLRASNVDTIVNIIGFNF 183 Query: 350 PPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDK 386 +G + +++ + G++ + N + E ++++K K Sbjct: 184 DVKGNEEMKQAAVAGCGEYISANSADEFEQAWEKEAQK 221 >gi|205374347|ref|ZP_03227145.1| hypothetical protein Bcoam_14574 [Bacillus coahuilensis m4-4] Length = 1083 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 34/360 (9%), Positives = 99/360 (27%), Gaps = 15/360 (4%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP 100 + + +L A+ ++ Sbjct: 5 ARFSYLFTFLLLLNFIFSTNLASASNNVTVNLSVTPSQSVVILPTTSNAKASLNLMLTPT 64 Query: 101 LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM 160 E P + +F+ S ++ S+ S+++ + M D + Sbjct: 65 GNPQTER-----DPIDLVFVFDKSGSMDFKVASNSSVKRIDSAKSAMTNALMFFDGQNTS 119 Query: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL 220 + + N N S ++ + ++ + + E+ Sbjct: 120 DRFGFVPFSSNANTDVVSLTDSSGWGSSSYTNSKLQTIHNKTMGLSASGGTNYTEALDVA 179 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 + ++K + + GT ++ + R+ + + T Sbjct: 180 SKLFDSSSKDKNIIFLTDGTPTFSFSDEKVYYKS-YWGSSSGNDRVEYQSYTDLTY---- 234 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL---NTLQICEYMRN 337 Y K + +T+ + +G + L N ++ + + + Sbjct: 235 -QFTETTKYWYKMGDYFYKNNTKYNMNTTYNPNGNATTYKNNFEKLVKQNDKKVVQSLSS 293 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 +G+K+Y++ +G L + G + +++ E I++++ V + Sbjct: 294 SGIKLYTLGFGDSIDGDYLAELANLTGGSYKNAI-GQDINEVLMNISEEVAAPKVDVEVK 352 >gi|119901955|ref|XP_602058.3| PREDICTED: integrin, alpha 11 [Bos taurus] gi|297479009|ref|XP_002690571.1| PREDICTED: integrin alpha 11 subunit-like [Bos taurus] gi|296483745|gb|DAA25860.1| integrin alpha 11 subunit-like [Bos taurus] Length = 1194 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 177 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 234 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 235 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 283 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 284 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAA 343 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 344 LKDIVDALGDRIFSL 358 >gi|74000923|ref|XP_535527.2| PREDICTED: similar to integrin, alpha 11 precursor [Canis familiaris] Length = 1183 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 166 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 223 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 224 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 272 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 273 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAA 332 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 333 LKDIVDALGDRIFSL 347 >gi|86132310|ref|ZP_01050905.1| conserved hypothetical protein [Dokdonia donghaensis MED134] gi|85817229|gb|EAQ38412.1| conserved hypothetical protein [Dokdonia donghaensis MED134] Length = 351 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 64/169 (37%), Gaps = 8/169 (4%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 I + S ++ ++ V ++ PL+ + +++ ++ + + + ++ Sbjct: 140 ITNVMPAVPTESFKMAIYWFDGEDVLHELQPLTTSATQLQEAIDGVTDDISNDPSTDLYG 199 Query: 284 AYRELYNEKESSHNTIGSTRL--KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 A + E+ T+ + L V+ TDG + A + L + Sbjct: 200 AVIKAATNAENIVETLENEDLFAAASVVIFTDGTDQAARY--SEQEALDAVSNAGEE-IS 256 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +++ + + +++L + F +S EL F+ I++ + Q Sbjct: 257 FFTIGLGSEI-DEEVLTAIGKTGSAF--AENSNELEAVFNDISNGVAGQ 302 >gi|194388296|dbj|BAG65532.1| unnamed protein product [Homo sapiens] Length = 650 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 34/195 (17%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 171 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVEA 228 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK +I ITDGE ++ Sbjct: 229 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMIVITDGE------SHDS 277 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 278 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPETFLNEIKYIASDPDDKHFFNVTDEAA 337 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 338 LKDIVDALGDRIFSL 352 >gi|171681714|ref|XP_001905800.1| hypothetical protein [Podospora anserina S mat+] gi|170940816|emb|CAP66465.1| unnamed protein product [Podospora anserina S mat+] Length = 648 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 29/184 (15%), Positives = 61/184 (33%), Gaps = 10/184 (5%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP--LSN 257 AP PA + + S +LV K I E + R+G + ++ + Sbjct: 87 APVPAKNGTEGEHYGLSVLDLVRHAAKTILETLDDHDRLGIVTFSTSSKVVRELTYMTPA 146 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF--VIFITDGE 315 N ++ +L+ L P TN + + L+N + N + + ++ +TDG Sbjct: 147 NKAKILKQLDALQPLSMTNLWHGIRDGL-SLFNNNLKAVNDRRNPGSGRVPALLVLTDGM 205 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSR 374 + + I++ LL+ + G + + D+ Sbjct: 206 PNH---QCPNQGYVAKLRQWSTLPASIHTFGFGYSLRS-GLLKSIAEVGGGNYSFIPDAG 261 Query: 375 ELLE 378 + Sbjct: 262 MITT 265 >gi|113971716|ref|YP_735509.1| von Willebrand factor, type A [Shewanella sp. MR-4] gi|113886400|gb|ABI40452.1| von Willebrand factor, type A [Shewanella sp. MR-4] Length = 335 Score = 53.0 bits (125), Expect = 9e-05, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 53/150 (35%), Gaps = 23/150 (15%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + + ++T+ A+ + + S Sbjct: 149 GDAAFIQTPFTADQQVWLSLLEEAQTGMAGQSTHLGDAIGLGIKVFEQNPQPSE------ 202 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-PEGQDLL---- 357 ++ +I +TDG ++G + + + + G+KIY +A+ P G+ + Sbjct: 203 --QQVMIVLTDGNDTG-----SFVEPVDAAKIAAARGIKIYIIAMGDPTHVGEQPMDMEV 255 Query: 358 --RKCTDSSGQFFAVNDSRELLESFDKITD 385 R + + F D EL +++ I Sbjct: 256 VQRVSQLTQARAFIAIDQAELDKAYQLIDK 285 >gi|296208411|ref|XP_002751094.1| PREDICTED: epithelial chloride channel protein-like [Callithrix jacchus] Length = 904 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/114 (18%), Positives = 46/114 (40%), Gaps = 23/114 (20%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ + ++ + + L +I +TDGE++ + Sbjct: 380 EASGGTSICNGLKAGFQAISQS--------NQSTLGSEIILLTDGEDNQ----------I 421 Query: 330 QIC-EYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAV-NDSRELLESF 380 +C E +R +G I+++A+ P + L ++ + G F ND L+++F Sbjct: 422 SLCFEEVRQSGAIIHTIALG--PSAEKELETLSNMTRGHRFYAHNDINGLIDAF 473 >gi|303240107|ref|ZP_07326628.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302592376|gb|EFL62103.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 329 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/202 (11%), Positives = 62/202 (30%), Gaps = 52/202 (25%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN----KLNPYEN 274 + ++ + I+ + PL+++ + + L+ + Sbjct: 103 DRLSRAKNIIESIIDNLEGDRIGFIPFSSAAYIQMPLTDDYDLARMYLDVIDTDMIAGGG 162 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 TN A++ A + + VI ++DGE + N++ I + Sbjct: 163 TNVGTALNLAENSFEETSSA----------DRVVIILSDGEE-------HNSNSVDILKS 205 Query: 335 MRNAGMKIYSVAVSA---------------------PPEGQ--------DLLRKCTDSS- 364 + +K++++ + G+ + L+K + Sbjct: 206 FNDEHLKVFTIGIGTAKGGLVPDYGSDGGQKSGYKKDSNGEFVMSKLNSETLKKLASTGK 265 Query: 365 GQFFAVN-DSRELLESFDKITD 385 G ++ + E+ KI+ Sbjct: 266 GSYYQSSLTGDEIASLTKKISS 287 >gi|115963085|ref|XP_001182555.1| PREDICTED: similar to calcium activated chloride channel 1 precursor [Strongylocentrotus purpuratus] Length = 1245 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/166 (12%), Positives = 56/166 (33%), Gaps = 22/166 (13%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + +ID + +A VN + + ++ + T + Q + Sbjct: 531 TSGSMGTSNRIDKVNSAATAFVNLVD----DGISIGIVTFTGSPTTRHALTQI-NTQADR 585 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + ++ + +L T + L S + ++ +TDG+ Sbjct: 586 DSLRD-IFQLTASGGTCIGCGLEQGLEVLMAHPSGSAD-------GGIIVLMTDGQ---- 633 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 + + I + +++ G+++ +VA+ G+ L +G Sbjct: 634 ---DSGIQNHIIRQTLQDMGVRVNTVAIGEDAYGE--LSLIAQETG 674 >gi|297527229|ref|YP_003669253.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] gi|297256145|gb|ADI32354.1| von Willebrand factor type A [Staphylothermus hellenicus DSM 12710] Length = 333 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 36/191 (18%), Positives = 67/191 (35%), Gaps = 30/191 (15%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 KI I + ++ V IG I +N P +++ + +L Sbjct: 118 DKIITAINAVKKFIDQ--------TIDYVLIGLITFND--HVRIAIPPTSDQELLYKKLG 167 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ + T + AY L E L +IF+TDG Q+ Sbjct: 168 EIKAFGGTIYSKPLEIAYDWLVPFAE--------FNLSPTIIFVTDGLP----YSQDAPL 215 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEG-------QDLLRKCTD-SSGQFFAVNDSRELLES 379 ++ + IY + + P Q LR+ + + GQF+ V + L+ Sbjct: 216 YREVVYKCARYNITIYPIFIETPGMSIYETMMAQQRLREIANITKGQFYNVKQTNSLINL 275 Query: 380 FDKITDKIQEQ 390 F+K+ +K + Sbjct: 276 FEKLAEKTVSK 286 >gi|163754424|ref|ZP_02161546.1| aerotolerance-related membrane protein [Kordia algicida OT-1] gi|161325365|gb|EDP96692.1| aerotolerance-related membrane protein [Kordia algicida OT-1] Length = 344 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/123 (21%), Positives = 46/123 (37%), Gaps = 23/123 (18%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYR 286 S RIG IAY V P++ + + K LN LN T A+ A Sbjct: 122 NNLGSDRIGIIAYAGRAVPQ--LPITTDFSAAKMFLNNLNTNMLSSQGTAIDDAIRLAKT 179 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + ++++ + ++ I+DGE+ Q+ E G+K Y++ Sbjct: 180 YYDDVEQTN----------RVLVIISDGEDHTG-------GAGQLAEEATKEGIKTYTIG 222 Query: 347 VSA 349 V Sbjct: 223 VGT 225 >gi|47565090|ref|ZP_00236133.1| gram positive anchor protein, putative [Bacillus cereus G9241] gi|47557876|gb|EAL16201.1| gram positive anchor protein, putative [Bacillus cereus G9241] Length = 997 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 35/254 (13%), Positives = 68/254 (26%), Gaps = 58/254 (22%) Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG----------TIAY 243 A + + ++ N+V+ + A S + + Sbjct: 66 VDFVIVQDASGSFKGTMPNVKKALSNIVDELNPATDRIMVTSYQDYKGYKASDGRVLESR 125 Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG--- 300 GI L+NN K+ +NK+ P T T + A E K + Sbjct: 126 GNGIKTTLQAGLTNNFTSAKNGINKITPDSGTPTASGLQFALAEYEKAKGQNDPDRETVF 185 Query: 301 --------STRLKKFVIFITDGENSGASAYQNTLNTLQI----------CEYMRNAGMKI 342 + R ++ +T+ + Y++ + ++NAG K+ Sbjct: 186 LLVTDGVANIRKDGYIYKLTNQLTDQGTGYRSNEYGQDYVGALKEVTTEAQNIKNAGYKL 245 Query: 343 YSVAV-------------------SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + A P + L+K F ND E Sbjct: 246 VT-AFWEDKSILAASDQYYTKYDKEVGPYARQELKKMATKPEWFVLANDIEEFT------ 298 Query: 384 TDKIQEQSVRIAPN 397 + E ++ Sbjct: 299 -KNLIETVTKVTKK 311 >gi|260837260|ref|XP_002613623.1| hypothetical protein BRAFLDRAFT_93664 [Branchiostoma floridae] gi|229299009|gb|EEN69632.1| hypothetical protein BRAFLDRAFT_93664 [Branchiostoma floridae] Length = 655 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 22/167 (13%), Positives = 54/167 (32%), Gaps = 12/167 (7%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + L++ + + A + + + ++N + L+ L + Sbjct: 321 KHTTWFLLDRLTEDDYVATGYF---NAYAQAVSCLSSFVQATTHNKEVIHKSLDNLEAAD 377 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 N Y + +A++ N + K ++ +T+ A N Sbjct: 378 QANYYAGLEYAFKIFNNFEMEDRFENQGAECNKVIVLVTENAELYPEAVFQKYNP----- 432 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLES 379 + ++++ + V P +L+K D+ G F V E+ Sbjct: 433 ---DRNIRVFVIVVGEPIHDWSVLQKMACDNRGYFSTVRSDGAAREA 476 >gi|47229797|emb|CAG06993.1| unnamed protein product [Tetraodon nigroviridis] Length = 1160 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 20/165 (12%), Positives = 48/165 (29%), Gaps = 17/165 (10%) Query: 214 IESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 S +++++ + + + + N K + + Sbjct: 182 KTSVMEMLDTLSDDDYVNVARFNEKADAVVPCFRTLVQANVR---NKKIFKEAVMHMQAK 238 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+ A+ +L NE + K ++ TDG A N Sbjct: 239 GTTDYKSGFTFAFEQLLNESSAPRANCN-----KMIMMFTDGGEDRAQEIFEKYNWP--- 290 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 N +++++ +V L+ S+ G +F + + Sbjct: 291 ----NKTVRVFTFSVGQHNYDVTPLQWIACSNKGYYFEIPSIGAI 331 >gi|83644399|ref|YP_432834.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83632442|gb|ABC28409.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 687 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 37/258 (14%), Positives = 81/258 (31%), Gaps = 28/258 (10%) Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 + +S ++ N S + + + + Q+ N ++ Y P K Sbjct: 230 SLQSNAQSMEAVQEGDANAPASSPSAYRLDKDIVVYWRQQQNLPGSVDLITYKEPGKDKG 289 Query: 188 SFWSKNTTKS-----KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 +F T + + L+ ++K + R+ I Sbjct: 290 TFMLTVTPGDDLPAITEGRDWTLVLDRSGSMSGKFSTLLEGLRKGFAKFNRND-RVRVIM 348 Query: 243 YNIGIVGNQ---CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 +N NL +V + P TN A+ A L ++ ++ Sbjct: 349 FNDNATEVTNGWVQATPENLQQVVGAVENAGPSGGTNLMSAIQSALTGLDADRTNA---- 404 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + +TDGE + Q E + +++++ + + LL Sbjct: 405 --------IWLVTDGEANVGETKQ-----KAFIELLEKKDIRLFTF-IMGNSANRPLLEA 450 Query: 360 CTD-SSGQFFAVNDSREL 376 T S+G +V++S ++ Sbjct: 451 ITKHSNGFAISVSNSDDI 468 >gi|290987786|ref|XP_002676603.1| predicted protein [Naegleria gruberi] gi|284090206|gb|EFC43859.1| predicted protein [Naegleria gruberi] Length = 755 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 62/200 (31%), Gaps = 24/200 (12%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN- 257 + ++D++ S L+ + N +I I ++ + Sbjct: 151 SSSNENTGFSRLDLVKHSVRTLIELM--------NEKDQISLIPFSDSARMELPLTKMDA 202 Query: 258 -NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + +L L P +TN + + + + +T +I TDGE Sbjct: 203 VGKKKAIEKLEHLGPEGSTNVWDGLRLGME---SSLNNPLCAKTNTC----LILFTDGEP 255 Query: 317 SGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + +Y++ I+S LL+ + G + + D Sbjct: 256 N---INPPRGIVPTLEKYIKEHPLNSTIHSFGFGY-SLDSALLKDIAMNGSGAYSYIPDC 311 Query: 374 RELLESFDKITDKIQEQSVR 393 + +F + I +VR Sbjct: 312 SMVGTTFVNMMSNILCTAVR 331 >gi|322437151|ref|YP_004219363.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321164878|gb|ADW70583.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 324 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/194 (12%), Positives = 70/194 (36%), Gaps = 17/194 (8%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + +S + + R +++ Q +NN++ + Sbjct: 101 VGIMLDTSSSIRTRFKFEQDSAIDFFLSVMHQNDRAFVEGFDVQTYLPQDY--TNNIDLL 158 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + KL P T + +++ R + ++K +I ++DG+++ + A Sbjct: 159 DQGIRKLRPGGGTALFDSLYKTCR------DQMLALQADNEVRKALILVSDGDDNYSRAS 212 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTD-SSGQFFAVNDSRELLE 378 + +++C+ +Y+++ P G +L + + G+ F ++ Sbjct: 213 MT--DAIKMCQRA---DTIVYTISTDTSPTRGKGGAVLESISGATGGRTFFPIKLEDVAI 267 Query: 379 SFDKITDKIQEQSV 392 F I +++ Q + Sbjct: 268 GFKNIEIELRSQYL 281 >gi|73998866|ref|XP_535021.2| PREDICTED: similar to A-domain containing protein similar to matrilin and collagen [Canis familiaris] Length = 788 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 65/164 (39%), Gaps = 19/164 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G I ++ ++ EVK+++ ++ T T A+ + R Sbjct: 100 DINPERVRVGAIQFSSAPHLEFPLDSFSSQQEVKAKIKRMVFKGGRTETGLALKYLLRRG 159 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ + + +I ITDG + G A + ++ G+ I++V V Sbjct: 160 FPGGRNA-------SVPQILIVITDGRSQGHVAVPT--------KQLKERGVTIFAVGVR 204 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 P + L + + V + ++ ++ + + + + +V Sbjct: 205 FPRWEE--LHRLASEP-REQHVLMAEQVEDATNGLLSTLSKSAV 245 >gi|308270598|emb|CBX27210.1| hypothetical protein N47_A12390 [uncultured Desulfobacterium sp.] Length = 312 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 53/169 (31%), Gaps = 51/169 (30%) Query: 254 PLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + ++ L++L P T+ A+ + K ++ Sbjct: 124 PLTLDYGAIQMFLDELKPELIPVAGTDLGAAIEAGISSFD----------FKSVTDKVIM 173 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------- 352 ITDGE++ L + + G+KI+ + P Sbjct: 174 LITDGEDNEGKG-------LIAAQKAKEKGVKIFVFGMGDPSGGPIPATDGKGGFRKDRN 226 Query: 353 --------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 ++ L K + G++ + + +L + D I+ + Sbjct: 227 GNVIMSKMDEESLEKIASATGGRYTR-SVTGDLDLIY---FDGIKSLTT 271 >gi|238060728|ref|ZP_04605437.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237882539|gb|EEP71367.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 580 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 41/371 (11%), Positives = 93/371 (25%), Gaps = 25/371 (6%) Query: 30 QSALDAAVLS-GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQK 88 Q A A+ + + + + I L + + N Sbjct: 212 QQATTGALRALATGRSALREDLLARFPRSSDPAAIASGLGAAALSEEDVMAYNERKPPIP 271 Query: 89 AQI------------NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST 136 + +K +E+ T F L +L Sbjct: 272 LAALYLEPSPMPLDYPYAVLPGIEPAKQSAAKVLFEVLTTPGFRNRLAGRSLRAPDGNWG 331 Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK 196 + + + + + S+ + + + Sbjct: 332 DGFKAPRGAPSPAGRAASAPAPGGNGAGGLDPVAIDRAVSSWSIATQSGRMLCVIDVSGS 391 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 K A + V + +A +N + + + LS Sbjct: 392 MKQPVPSANNATREQVTVAAASRGLNLFDDS-WSIGLWTFSTELVGTLDYRELVPINLLS 450 Query: 257 NNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 +N + ++ L + P T Y M AY+ + + E V+ TDG+ Sbjct: 451 SNRSRLEQGLATIRPSSGDTGLYDTMLAAYKTVQEDWEPGRV--------NSVVLFTDGK 502 Query: 316 NSGASAYQNTLNTLQICEYMR-NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 N A+ ++ + +++ + + +L + G F D Sbjct: 503 NEDANGISQQKLLAELKQAADPERPVQVVIIGIGNDVSKSELDSITKVTGGGSFITEDPT 562 Query: 375 ELLESF-DKIT 384 ++ + F I Sbjct: 563 KIGDIFLKAIA 573 >gi|125975609|ref|YP_001039519.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|256003656|ref|ZP_05428645.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|281416621|ref|ZP_06247641.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|125715834|gb|ABN54326.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|255992447|gb|EEU02540.1| von Willebrand factor type A [Clostridium thermocellum DSM 2360] gi|281408023|gb|EFB38281.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|316939730|gb|ADU73764.1| von Willebrand factor type A [Clostridium thermocellum DSM 1313] Length = 565 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 50/130 (38%), Gaps = 14/130 (10%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLS--NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + V IG ++Y+ + N N + + L NT T+ A+ A + L Sbjct: 423 SSDVSIGLVSYSTDVNINLPIAKFDLNQRSLFVGAVESLAAGGNTATFDAIIVATKMLKE 482 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 EK + N + ++DG + + + + M+ G+ IY++ +A Sbjct: 483 EKAKNPNAKL------MLFVLSDGVTNYGHSLNDIKDM------MKTFGIPIYTIGYNAN 530 Query: 351 PEGQDLLRKC 360 + + L + Sbjct: 531 IKALETLSQI 540 >gi|303235711|ref|ZP_07322318.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] gi|302484158|gb|EFL47146.1| von Willebrand factor type A domain protein [Prevotella disiens FB035-09AN] Length = 341 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 31/207 (14%), Positives = 70/207 (33%), Gaps = 57/207 (27%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++D +LVN + +IG + + P++++ K LN Sbjct: 110 TRLDRSKRLVEDLVNRFT---------NDKIGIVVFAGDAFVQ--LPITSDYISAKMFLN 158 Query: 268 KLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++P + T+ A+ S H+ K +I ITDGEN Sbjct: 159 NISPELIGSQGTDIGKAIEL----------SEHSFSEKANFGKAIIIITDGEN------- 201 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPE-----------------------GQDLLRKC 360 + ++ + G++++ + + +P +D+ RK Sbjct: 202 HEKGAEEMAREAQKQGIRVFILGIGSPQGAPIPMGNGSYLQDTSGQTVMTRLNEDMCRKI 261 Query: 361 TDSS-GQFFAVNDSRELLESF-DKITD 385 ++ G + V+++ ++I Sbjct: 262 AEAGKGMYIHVDNTTAAETILDNEIGK 288 >gi|298246130|ref|ZP_06969936.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297553611|gb|EFH87476.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 412 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 62/176 (35%), Gaps = 24/176 (13%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K+ + E+ +++ ++ + I + ++ + +N+ +K+ +++ Sbjct: 60 KLRNVKEAVKMVIDRLEPSDY--------ISVVIFDDSAQVIIPSMPANDPVGMKAAIDR 111 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + T M + EL +I +TDG Y +T Sbjct: 112 IQDAGGTTMSLGMIQSLGEL--------RRWNIPNAVSRMILLTDG-----VTYGDTDRC 158 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQ-FFAVNDSRELLESFDK 382 Q+ AG+ IY + + A ++LL S G + + + F++ Sbjct: 159 RQLARDAAAAGISIYPLGIGADW-DENLLDDVGQLSGGTPAEFIRSPNDAMTIFEQ 213 >gi|48477361|ref|YP_023067.1| hypothetical protein PTO0289 [Picrophilus torridus DSM 9790] gi|48430009|gb|AAT42874.1| hypothetical protein PTO0289 [Picrophilus torridus DSM 9790] Length = 379 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 24/136 (17%), Positives = 47/136 (34%), Gaps = 20/136 (14%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + L NT Y A+ A + +I +TDG + + Sbjct: 96 ALEAIPSLKVAGNTAMYTALLTATKLADKYNMP-----------GRIILLTDGMPT-DVS 143 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ-FFAVNDSRELLESF 380 + LQ+ E G I + + DLL+ D F+ + + EL + Sbjct: 144 MNESYENLQVPE-----GFTIDCIGIG-DNYRDDLLKLLADKGNSIFYHLENPEELPKVM 197 Query: 381 DK-ITDKIQEQSVRIA 395 + ++ I ++V++ Sbjct: 198 ESTVSSDISAKNVQVD 213 >gi|62088852|dbj|BAD92873.1| alpha 3 type VI collagen isoform 5 precursor variant [Homo sapiens] Length = 1702 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 59/172 (34%), Gaps = 15/172 (8%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNL--SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ +S ++ + + + S+++G + YN + ++ +N Sbjct: 173 INFRRDSFQEVLRFVSEIVDTVYEDGDSIQVGLVQYNSDPTDEFFLKDFSTKRQIIDAIN 232 Query: 268 KLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 K+ + L N + R+ + IT G+ + Sbjct: 233 KVVYKGGRHANT--KVGLEHLRVNHFVPEAGSRLDQRVPQIAFVITGGK--------SVE 282 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + G+K+++V V + + K +S F V + +EL E Sbjct: 283 DAQDVSLALTQRGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGNVQELSE 332 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 21/149 (14%), Positives = 53/149 (35%), Gaps = 15/149 (10%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHN 297 + ++ + + V + +L + NT A+ R L+ S Sbjct: 2 VVQFSNDVFPEFYLKTYRSQAPVLDAIRRLRLRGGSPLNTGKALEFVARNLF---VKSAG 58 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 + + + ++ + G++ + + +R++G+ S+ V + L Sbjct: 59 SRIEDGVPQHLVLVLGGKSQDDVSRF--------AQVIRSSGIV--SLGVGDRNIDRTEL 108 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDK 386 + T+ F V + REL ++I + Sbjct: 109 QTITNDPRLVFTVREFRELPNIEERIMNS 137 >gi|149922101|ref|ZP_01910541.1| hypothetical protein PPSIR1_23234 [Plesiocystis pacifica SIR-1] gi|149817038|gb|EDM76520.1| hypothetical protein PPSIR1_23234 [Plesiocystis pacifica SIR-1] Length = 350 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 65/163 (39%), Gaps = 16/163 (9%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL----NPYENTNTYPAMHHAYREL 288 + ++ A++ ++ + + + ++R N L ++TN A+ L Sbjct: 129 ESNNKVAIYAFDGEEKIHKISDFTGSEGAAEARANSLAGFQPKDKSTNLNGAIVQGLEVL 188 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 E + + ++ TDG + + ++ + +++ +++++ + Sbjct: 189 DEGLEKA----KNPLRLGTLVVFTDGTDRASRVSEDE-----MLSAVKDTPYEVFAIGLG 239 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 A DL + SG DS E+ +F+ I ++I++ + Sbjct: 240 AELSEGDLQKI--GKSGTAL-AKDSGEIQTAFESIGERIEKLT 279 >gi|118593079|ref|ZP_01550466.1| von Willebrand factor type A domain protein [Stappia aggregata IAM 12614] gi|118434386|gb|EAV41040.1| von Willebrand factor type A domain protein [Stappia aggregata IAM 12614] Length = 858 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 29/230 (12%), Positives = 70/230 (30%), Gaps = 25/230 (10%) Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 + +S + ++ + + + + + S +A P +I+V Sbjct: 1 MPLSLPRLAVATSLLVASSAFSLVPSSAQAADRATILILDASGSMWAQLP-EGRSRIEVA 59 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNPY 272 + G+ + S + + G +P+ + + SRLN L+P Sbjct: 60 RDVLGDYLRSRDGSRPLGV---IAYGHNRKGDCQDIETISPVGVQDPASLGSRLNGLSPR 116 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T ++ A + E + ++ +TDG + L+ + Sbjct: 117 GKTPLAGSLRRAATLIPKTSEEAD-----------IVLVTDGLETCG------LDPCAVA 159 Query: 333 EYMRNAGMKI--YSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLES 379 + G+ + + V + + G A EL ++ Sbjct: 160 ASLAQEGIPVRAHVVGFGLTEGEVRQISCIAETTGGMVLAPQSGAELADA 209 >gi|313219850|emb|CBY30766.1| unnamed protein product [Oikopleura dioica] Length = 1473 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 62/187 (33%), Gaps = 33/187 (17%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K VL+ NL N Q V+IG +++ V N E++ ++ Sbjct: 1258 NKDQVLMNFTNNLANMYDTINQ------VKIGLTSFSESSVLEMPLDF-YNQLELQDGVS 1310 Query: 268 KLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + + TN + A ++ S + +I ITDG NT Sbjct: 1311 NMTWQGSFTNITSGVETALNDMDT----------SDAVDDVMILITDGFQ-----STNTT 1355 Query: 327 NTLQICEYMRNAGMKIYSVAVSAP----PEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 Q+ + + G+++ ++ L+ + + + ELL + Sbjct: 1356 LMFQMIDQAKAEGVRLIALGFFGEFAFYSPNLYLMT------NEVYHAANYAELLAIDNT 1409 Query: 383 ITDKIQE 389 I + I Sbjct: 1410 IFETICS 1416 >gi|212634223|ref|YP_002310748.1| Von Willebrand factor type A domain-containing protein [Shewanella piezotolerans WP3] gi|212555707|gb|ACJ28161.1| Von Willebrand factor type A domain protein [Shewanella piezotolerans WP3] Length = 342 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 19/142 (13%), Positives = 49/142 (34%), Gaps = 25/142 (17%) Query: 254 PLSNNL---NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P + ++ + +++ T A+ + R ++ ++ Sbjct: 165 PFTEDVRLWQTLLEQMDTQMAGPATAIGDAVGLSIRAFERSN----------TSQRILLL 214 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA------VSAPPEGQDLLRKCTD-S 363 +TDG ++ + L+ + G++I+++ V + L K + Sbjct: 215 VTDGSDTSSR-----LDPVDAARVAAAEGIEIFTLGMGSVDTVGDDQVDFNTLNKIAKIT 269 Query: 364 SGQFFAVNDSRELLESFDKITD 385 +G+ F N S + E +I Sbjct: 270 NGRAFEGNSSTAIAEILAQIDK 291 >gi|198430661|ref|XP_002124079.1| PREDICTED: similar to alpha-2 IX collagen [Ciona intestinalis] Length = 1221 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 30/162 (18%), Positives = 61/162 (37%), Gaps = 12/162 (7%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYN 290 + ++G I Y+ I + + +++S + ++ T T A+ A ++N Sbjct: 8 GSEDTQVGVIQYSSCIRPHIWLNNFTDTEQLQSAIARIKFEAKGTATGKALRFASNRMFN 67 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 K+ + G +++ +I ITDG S + + + ++ ++V V Sbjct: 68 WKKGARELSGG--VRRILILITDGRWSV------RDDIPGALASLESKQVERFAVGVGRV 119 Query: 351 PEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + D L F + EL E KI I +Q Sbjct: 120 VKTLD-LGALVSEPKSNHSFHIPFFDELDEVGRKILSTICQQ 160 >gi|305666869|ref|YP_003863156.1| hypothetical protein FB2170_11421 [Maribacter sp. HTCC2170] gi|88709093|gb|EAR01327.1| hypothetical protein FB2170_11421 [Maribacter sp. HTCC2170] Length = 365 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 34/241 (14%), Positives = 79/241 (32%), Gaps = 13/241 (5%) Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 VS D + N+ N + + S +N + + Sbjct: 84 PVSGLDADKFTIYEQGRNDDCFNTISKSESNARISSNSQVFSSNTILVLDLSNSVLSSSL 143 Query: 215 ESAGNLVNSI-QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 E S + S ++ ++ + L+++ E+ S +N + Sbjct: 144 EELKTASVSFVNNVMPLTTEDSYKMAIYWFDGEDELHLLNDLTSSKQELVSAINGITDDI 203 Query: 274 NTNTYPAMHHAYRELYNEKES--SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + + ++ A + E + + V+ TDG + + + T ++ Sbjct: 204 SNDPSTDLYGAVIKSTEIAEELLADSIKEEIIGAASVVIFTDGTDQASRYTEQT-ALKKV 262 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF--AVNDSRELLESFDKITDKIQE 389 + A + +S+ + A +L G+ F ++ EL +F+ I+ KI E Sbjct: 263 EDAA--ANISFFSIGLGAEI-DTQVLTDI----GKTFSVFAGNAEELENTFNDISIKISE 315 Query: 390 Q 390 + Sbjct: 316 R 316 >gi|325570952|ref|ZP_08146571.1| von Willebrand factor type A domain protein [Enterococcus casseliflavus ATCC 12755] gi|325156278|gb|EGC68462.1| von Willebrand factor type A domain protein [Enterococcus casseliflavus ATCC 12755] Length = 1176 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 25/269 (9%), Positives = 76/269 (28%), Gaps = 26/269 (9%) Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + I + +V+D S SM ++ + + S + S Sbjct: 328 NVQRQIDPIDVVLVVDWSGSMNEMGRITEVKKGVDRFLNQIEGSGIQDSVYMGYVGYSSD 387 Query: 200 APAPAPAN---RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 K + E+ + + + G + L+ Sbjct: 388 GNNYQNKTCQLGKFSEVKETIRTMTPETAAGGTFTQRGLRQAGDMLSTQNGHKKVIVLLT 447 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + + ++K++ A+ ++ ++ + F +D Sbjct: 448 DGVPTYSYHVSKVHT---QADGSYYGTAFSLSQDQPMNTSHLYNG-------YFASDQYG 497 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---------APPEGQDLLRKC--TDSSG 365 + + T+ ++ G++I+ + + + ++ +R+ D G Sbjct: 498 NSKWINNTFVATIGEAMALKERGIEIHGLGIQLQGDQTAGYTKADVENKMRQMVSADEEG 557 Query: 366 QFFA--VNDSRELLESFDKITDKIQEQSV 392 + N++ ++ + +K I Sbjct: 558 HLYYESANEAADIADYLEKKALHISATVT 586 >gi|326501022|dbj|BAJ98742.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 284 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/234 (9%), Positives = 72/234 (30%), Gaps = 19/234 (8%) Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + +N + + S T +KY + + + Sbjct: 21 FNNWATMSSELKSNNSQSTCPECLHSAYLIDSVTTRCNKYQSISFLVDESGSIGASAFQY 80 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTY 278 + + + + + + + + NN + + + T T Sbjct: 81 AKSFLYAYVNQTYDDLSIMSIHFF--DSTFDPYIYYGNNRATILNMIQSKAYRGAGTATG 138 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A++++ + N+ + + K ++ +TDG + + ++ R Sbjct: 139 NAINNSVALIKNKNFPN-------GVPKILVILTDGGSYDS--------VIEAANNARKN 183 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 G+ ++ V + + L++ +S + ++ L + I + +Q + Sbjct: 184 GIMLFCVGIGSNVNTAQLIQIAGSTSNIVY-ISSYSSLTNLVNLIENYFCKQII 236 >gi|288800164|ref|ZP_06405623.1| BatB protein [Prevotella sp. oral taxon 299 str. F0039] gi|288333412|gb|EFC71891.1| BatB protein [Prevotella sp. oral taxon 299 str. F0039] Length = 339 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 19/162 (11%), Positives = 57/162 (35%), Gaps = 40/162 (24%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 P++++ K L+ + P + + A R + + K +I IT Sbjct: 144 LPITSDYVSAKMFLSDITPNLISAQGTDIARAIRVSLSSFTQ------QKGVGKAIILIT 197 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---------------PPEGQDLL 357 DGE++ L+ + + G+ ++ + V GQ ++ Sbjct: 198 DGEDNEG-------GALEAVKEAKEKGVNVFILGVGDSKGAPIPLGNGEYLKDNHGQTVM 250 Query: 358 --------RKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 ++ + G + ++++ + +++ +++ + Sbjct: 251 TALNENMCKEIAQAGSGTYIHIDNTSL---AQEQLNNELSKL 289 >gi|262200403|ref|YP_003271611.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262083750|gb|ACY19718.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 423 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 15/138 (10%), Positives = 37/138 (26%), Gaps = 11/138 (7%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + G + ++ + + + P T+ +EL Sbjct: 71 DPRDVFGVVTFDDDAQVVLPAAPLADKARAVDAVGSIVPGGCTDLSSGYLRGLQELRRAT 130 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 S+ G+ V+ I+DG + + ++ G+ ++ Sbjct: 131 ASAGIRGGT------VLVISDGHVNRGIRDLDEFASITA--KAAADGIITSTLGYGRG-Y 181 Query: 353 GQDLLRKCT--DSSGQFF 368 + LL + F Sbjct: 182 DETLLSAIARSGNGNHVF 199 >gi|95930867|ref|ZP_01313598.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] gi|95133109|gb|EAT14777.1| von Willebrand factor, type A [Desulfuromonas acetoxidans DSM 684] Length = 698 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 32/237 (13%), Positives = 75/237 (31%), Gaps = 29/237 (12%) Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 N I LD + + D N + ++ P ++ T + Sbjct: 259 NKDIVFLYRLDDTTPARIELIPYKADRNATGTMMLVVTPAADLQPITEGTDWTFVLDVSG 318 Query: 205 PAN-RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + KI L + + + N + R I +N ++ V Sbjct: 319 SMDGHKIATLADGVSQTLGKL--------NSNDRFRIITFNQSAADLTRGFVTATPEAVG 370 Query: 264 SRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +N+ + +TN + + A R L +++ +S ++ +TDG + Sbjct: 371 QWINRVKTIAAGGSTNLFAGLETACRRLDDDRTTS------------IVLVTDGVANVGR 418 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + + +++++ + L R DS G ++D ++ Sbjct: 419 T-----EQREFLQLLTEYDVRLFTFVIGNSANRPLLDRLAKDSGGFAMQISDVDDIQ 470 >gi|47523388|ref|NP_999313.1| calcium-activated chloride channel regulator 1 precursor [Sus scrofa] gi|75051712|sp|Q9TUB5|CLCA1_PIG RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; AltName: Full=pCLCA1; Flags: Precursor gi|6002646|gb|AAF00077.1|AF095584_1 epithelial chloride channel protein [Sus scrofa] Length = 917 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 51/133 (38%), Gaps = 27/133 (20%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + A+ + + + ++ +TDGE++ T+ Sbjct: 379 ASGGTSICSGLRSAFTVIKKKYPTD---------GSEIVLLTDGEDN----------TIS 419 Query: 331 IC-EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKIT-- 384 C ++ G I++VA+ P ++L + G +D E L+++F ++ Sbjct: 420 ACFPEVKQNGAIIHTVALG-PSAAKELEELSQMTGGLQTYASDQAENNGLIDAFGALSSG 478 Query: 385 -DKIQEQSVRIAP 396 ++S+++ Sbjct: 479 NRAASQRSIQLES 491 >gi|149911407|ref|ZP_01900025.1| hypothetical protein PE36_11192 [Moritella sp. PE36] gi|149805515|gb|EDM65520.1| hypothetical protein PE36_11192 [Moritella sp. PE36] Length = 450 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/195 (10%), Positives = 52/195 (26%), Gaps = 10/195 (5%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTK--KDQ 59 I + + A+D H++ + ++Q+ +D+A L + T + Sbjct: 21 FTIALFALIGMASLALDGGHLLLNKGKLQNLVDSAALHAATELDEGATHEQARAAVVALI 80 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD--------KNNPLQYIAESKAQY 111 I + + N G AQ+N+ NN + Sbjct: 81 QLNIAHNDHHELASAIDFSIVNNGLDQMTAQLNVEFSQLPDPFIQDNNESAKYVKVSLSQ 140 Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 L ++ +L + + + + D + + + + Sbjct: 141 LELDNFLADVFSFNKQVSATALSGPSSDISNCYQDLVPMVVCADTANAGVNGANEYGLKE 200 Query: 172 NNMTSNKYLLPPPPK 186 ++ K Sbjct: 201 YSLNLMKIGSNSNSP 215 >gi|66472570|ref|NP_001018424.1| inter-alpha (globulin) inhibitor H3 [Danio rerio] gi|63100652|gb|AAH95235.1| Zgc:110377 [Danio rerio] Length = 868 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 18/197 (9%), Positives = 59/197 (29%), Gaps = 16/197 (8%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 K+ E+ G ++ + + + + + N+ E + Sbjct: 268 YGNKMAQTKEALGTILGELPEDDYFAIIVFSTTFVVWRPYLSKATE-----ENVKEAQEY 322 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T + A H LY + + +I +TDG+ + ++ Sbjct: 323 VKTIEVIGGTELHDATIHGVEMLYAAQRNGTAPKNMVL---MMILLTDGQPNQ--YPRSL 377 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ-----FFAVNDSRELLESF 380 + + + ++ +A L + + + + +L + Sbjct: 378 PEIQESIRKAIDGNITLFGLAFG-NDADYGFLDTLSKQNNGIVRRIYEDSDAPLQLKGFY 436 Query: 381 DKITDKIQEQSVRIAPN 397 ++++ + + P+ Sbjct: 437 EEVSSPLLSEVNFHYPD 453 >gi|20091285|ref|NP_617360.1| hypothetical protein MA2454 [Methanosarcina acetivorans C2A] gi|19916409|gb|AAM05840.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans C2A] Length = 551 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 15/106 (14%), Positives = 37/106 (34%), Gaps = 15/106 (14%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N +N L T T+ + A + + ++ + N + ++DGE Sbjct: 435 NQRASFAGAVNDLQAGGGTATFDGIAVAMKMIQEQRAADPNIRP------VIFVLSDGET 488 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 + + ++ + G+ IY++ +A L+ + Sbjct: 489 NKGHPLNDIKGIVE------DTGIPIYTIGYNANIPA---LQAISS 525 >gi|325982790|ref|YP_004295192.1| PEP motif putative anchor domain-containing protein [Nitrosomonas sp. AL212] gi|325532309|gb|ADZ27030.1| PEP motif putative anchor domain protein [Nitrosomonas sp. AL212] Length = 333 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 47/133 (35%), Gaps = 17/133 (12%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TPLS+ V S +N ++ TN + A EL ++ + ++ Sbjct: 121 QGLTPLSSGSAAVISAINAIDASGGTNIGAGIDSAAAELTGANHTA-------GSTQMMV 173 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 ++DG +SG A + I++V + P ++ S + Sbjct: 174 VVSDGFSSGDPASSALAALGAGVDA-------IHTVGL--PGHDAFTMQNIATSGNGIYT 224 Query: 370 -VNDSRELLESFD 381 + L++ F+ Sbjct: 225 NASSLTSLIDLFN 237 >gi|149027999|gb|EDL83450.1| complement factor B, isoform CRA_c [Rattus norvegicus] Length = 739 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/236 (13%), Positives = 75/236 (31%), Gaps = 24/236 (10%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAP-----APAPANRKIDVLIESAGNLVNSIQKAI 228 + P +K + + S + + NL+ + Sbjct: 220 ADAEDGHSPGEQQKRKIILDPSGSMNIYMVLDGSDSIGASNFTGAKRCLANLIEKVASYG 279 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHH 283 + + V T+ ++ S++ + V +LN+++ TNT A+ Sbjct: 280 VKPRYGLVTYATVP---KVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQA 336 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL-NTLQICEYMRNAGMK- 341 Y + ++ R + +I +TDG ++ + + + + R+ Sbjct: 337 VYSMMSWPGDAPPEGWN--RTRHVIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPR 394 Query: 342 -----IYSVAVS--APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +Y V P + L ++ F V D +L F K+ D+ + Sbjct: 395 EDYLDVYVFGVGPLVDPVNINALASKKNNEQHVFKVKDMEDLENVFYKMIDETKSL 450 >gi|218156285|ref|NP_997631.2| complement factor B [Rattus norvegicus] gi|149027998|gb|EDL83449.1| complement factor B, isoform CRA_b [Rattus norvegicus] Length = 763 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/236 (13%), Positives = 75/236 (31%), Gaps = 24/236 (10%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAP-----APAPANRKIDVLIESAGNLVNSIQKAI 228 + P +K + + S + + NL+ + Sbjct: 244 ADAEDGHSPGEQQKRKIILDPSGSMNIYMVLDGSDSIGASNFTGAKRCLANLIEKVASYG 303 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHH 283 + + V T+ ++ S++ + V +LN+++ TNT A+ Sbjct: 304 VKPRYGLVTYATVP---KVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQA 360 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL-NTLQICEYMRNAGMK- 341 Y + ++ R + +I +TDG ++ + + + + R+ Sbjct: 361 VYSMMSWPGDAPPEGWN--RTRHVIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPR 418 Query: 342 -----IYSVAVS--APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +Y V P + L ++ F V D +L F K+ D+ + Sbjct: 419 EDYLDVYVFGVGPLVDPVNINALASKKNNEQHVFKVKDMEDLENVFYKMIDETKSL 474 >gi|33086684|gb|AAP92654.1| Da1-24 [Rattus norvegicus] Length = 1116 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/236 (13%), Positives = 75/236 (31%), Gaps = 24/236 (10%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAP-----APAPANRKIDVLIESAGNLVNSIQKAI 228 + P +K + + S + + NL+ + Sbjct: 597 ADAEDGHSPGEQQKRKIILDPSGSMNIYMVLDGSDSIGASNFTGAKRCLANLIEKVASYG 656 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHH 283 + + V T+ ++ S++ + V +LN+++ TNT A+ Sbjct: 657 VKPRYGLVTYATVP---KVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQA 713 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL-NTLQICEYMRNAGMK- 341 Y + ++ R + +I +TDG ++ + + + + R+ Sbjct: 714 VYSMMSWPGDAPPEGWN--RTRHVIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPR 771 Query: 342 -----IYSVAVS--APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +Y V P + L ++ F V D +L F K+ D+ + Sbjct: 772 EDYLDVYVFGVGPLVDPVNINALASKKNNEQHVFKVKDMEDLENVFYKMIDETKSL 827 >gi|46237594|emb|CAE83972.1| B-factor, properdin [Rattus norvegicus] gi|56268879|gb|AAH87089.1| Complement factor B [Rattus norvegicus] Length = 761 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 33/236 (13%), Positives = 75/236 (31%), Gaps = 24/236 (10%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAP-----APAPANRKIDVLIESAGNLVNSIQKAI 228 + P +K + + S + + NL+ + Sbjct: 242 ADAEDGHSPGEQQKRKIILDPSGSMNIYMVLDGSDSIGASNFTGAKRCLANLIEKVASYG 301 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-----YENTNTYPAMHH 283 + + V T+ ++ S++ + V +LN+++ TNT A+ Sbjct: 302 VKPRYGLVTYATVP---KVLVRVSEERSSDADWVTEKLNQISYEDHKLKSGTNTKKALQA 358 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL-NTLQICEYMRNAGMK- 341 Y + ++ R + +I +TDG ++ + + + + R+ Sbjct: 359 VYSMMSWPGDAPPEGWN--RTRHVIIIMTDGLHNMGGDPVTVIEDIRDLLDIGRDRKNPR 416 Query: 342 -----IYSVAVS--APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +Y V P + L ++ F V D +L F K+ D+ + Sbjct: 417 EDYLDVYVFGVGPLVDPVNINALASKKNNEQHVFKVKDMEDLENVFYKMIDETKSL 472 >gi|319952790|ref|YP_004164057.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319421450|gb|ADV48559.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 348 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 27/186 (14%), Positives = 57/186 (30%), Gaps = 51/186 (27%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYN 290 S RIG IAY P++ + + K L +N T A+ A + Sbjct: 128 SDRIGIIAYAAQAYPQ--LPITTDYSAAKMFLQSMNTDMLSSQGTAINEAIELASTYYDD 185 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 E +++ + + I+DGE T E G++I+++ V Sbjct: 186 ETQTN----------RVLFIISDGE------DHAEGTTEDAVEKATEEGIRIFTIGVGKE 229 Query: 351 PE---------------------------GQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 + +L++ + + E+ + I Sbjct: 230 KGAPIPIKRNGIVESLKKDMNGEVVITKLNEVILKEIAAEGNGEYI--NGDNTSEAVEYI 287 Query: 384 TDKIQE 389 +++ + Sbjct: 288 KEQLNQ 293 >gi|149176271|ref|ZP_01854886.1| DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Planctomyces maris DSM 8797] gi|148844873|gb|EDL59221.1| DnaK protein (heat shock protein), C-terminal region has VWA type A domain [Planctomyces maris DSM 8797] Length = 715 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 20/147 (13%), Positives = 46/147 (31%), Gaps = 24/147 (16%) Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 N ++++ + L+ N+ A ++L + Sbjct: 564 DSTQTVLDACQNASKIEKAVKSLSIGMVGCGNSAQPFDTAMKKLKKVEGP---------- 613 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-S 363 +FVI + DG + + + + + +A + + ++ ++ LR Sbjct: 614 -RFVITLADGVW------ADQPHAVNRAKSLHSAEIDVIAIGFG--DADKNFLRDIASCD 664 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQ 390 G FF L +F I I + Sbjct: 665 EGSFF--TSLSGLSATFSSIAQVITKT 689 >gi|110636839|ref|YP_677046.1| outer membrane protein [Cytophaga hutchinsonii ATCC 33406] gi|110279520|gb|ABG57706.1| possible outer membrane protein [Cytophaga hutchinsonii ATCC 33406] Length = 1313 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 50/139 (35%), Gaps = 17/139 (12%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 V T + + ++ + L P T+ + + + Sbjct: 134 NDVSFINTDFTRDRFKLLQSIQTLVPAGGTDYNKGFI-------KSNAGGLDILKKGLHE 186 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 K +IF+TDG N +I + ++ G K+Y + + ++L R T ++G Sbjct: 187 KVLIFLTDGYGDV--------NPTEIIQQAKSIGAKVYVITLGMSAP-EELKRIVTATNG 237 Query: 366 QFFA-VNDSRELLESFDKI 383 ++ V +E+ + I Sbjct: 238 SYYENVISEQEINAVYMSI 256 >gi|88858354|ref|ZP_01132996.1| hypothetical protein PTD2_13229 [Pseudoalteromonas tunicata D2] gi|88819971|gb|EAR29784.1| hypothetical protein PTD2_13229 [Pseudoalteromonas tunicata D2] Length = 684 Score = 52.6 bits (124), Expect = 1e-04, Method: Composition-based stats. Identities = 21/170 (12%), Positives = 54/170 (31%), Gaps = 18/170 (10%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 A + ++ I + + N+ ++ + L T A Sbjct: 356 FALANLDPQDSFNIIEFNSKVNALNAQALPANDFNIRRARNFVYGLKADGGTEIGLAFEQ 415 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + ++F+TDG S T QI + ++ +I+ Sbjct: 416 VLD-----------NSEHADYLRQIVFLTDG-----SISNETEVFAQIKGSLGDS--RIF 457 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ + + P + R T G F + D ++ + + ++ +++ Sbjct: 458 TIGIGSAPNSYFMTRAATLGRGTFTFIGDVTDVQRTMKNLFVQLANAALK 507 >gi|322434933|ref|YP_004217145.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321162660|gb|ADW68365.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 347 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 30/248 (12%), Positives = 84/248 (33%), Gaps = 35/248 (14%) Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 + V+ + + + N + S T K Sbjct: 83 MTVTTTDTNRLVTGLEATNFEVFDNNTGQVIKTFSTQDAPVTIGIVFDLSGSMTSKFGRA 142 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 ++ + + A + + +N ++++ +V++R+ L P Sbjct: 143 RKALSEFLRTSNPADE--------FFVVGFNDK--PAVIVDYTSDVEDVEARMVMLKPEN 192 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T A++ +L + +K ++ ++DG ++ + + L Sbjct: 193 RTALIDAVYLGVNKL----------KEAKYDRKALLIVSDGGDNRSRYTEGELR-----R 237 Query: 334 YMRNAGMKIYSVAVSAPPEGQD--------LLRKCTD-SSGQFFAVNDSRELLESFDKIT 384 +R + ++IYS+ + LL+ ++ + G+ F V D ++ + +I+ Sbjct: 238 VVRESDVQIYSIGI-YDAYAPTEEEQLGPVLLKDISEMTGGRMFPVTDIADMADIASRIS 296 Query: 385 DKIQEQSV 392 +++ + V Sbjct: 297 AELRNEYV 304 >gi|325108015|ref|YP_004269083.1| von Willebrand factor A [Planctomyces brasiliensis DSM 5305] gi|324968283|gb|ADY59061.1| von Willebrand factor type A [Planctomyces brasiliensis DSM 5305] Length = 710 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 49/163 (30%), Gaps = 31/163 (19%) Query: 255 LSNNLNEVKSRLNKLNPYEN---TNTYPAMHHAYRELYNE-------------------K 292 +SN+ V + + T + L N Sbjct: 539 ISNDYTAVDTIQRRKQAGHYDIYTGLGYGIETGIELLDNHARPGSRPTLLVMTDGNANRS 598 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + G + + DG + + ++ + + G +++++V A Sbjct: 599 PNGWSLPGDWDWDELTDYDGDGVANYRTWDRHKQYAFYQAKQAIDKGYTVHTLSVGAGA- 657 Query: 353 GQDLLRKCTDSS-GQFFAVNDSRELL-------ESFDKITDKI 387 +DL+R + G++ V + ++F +I + Sbjct: 658 DRDLMRAIAFAGHGEWINVPGGTTIEEIEEQMLDAFARIAANV 700 Score = 36.4 bits (82), Expect = 7.4, Method: Composition-based stats. Identities = 6/30 (20%), Positives = 13/30 (43%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQ 30 + A + F+ ++DL I + +Q Sbjct: 3 LAAAGLVGMIGFVALSVDLGVISLTKTNLQ 32 >gi|145540134|ref|XP_001455757.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124423565|emb|CAK88360.1| unnamed protein product [Paramecium tetraurelia] Length = 522 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 15/136 (11%), Positives = 44/136 (32%), Gaps = 21/136 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N+ + ++ + T+ AM A L + + ++DGE Sbjct: 171 NMYKFLIAIDTIEANGATDIGNAMKMALSILKH--------RRFKNPIASIFLLSDGE-- 220 Query: 318 GASAYQNTLNTLQICEYMRNAGM----KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 + ++ +++ + I + + + GQF+ +++ Sbjct: 221 ------DEGAAGRVWNDIQSKNIKEPFTINTFGFGRDCCPKIMSEIAHFKEGQFYYISEI 274 Query: 374 RELLE-SFDKITDKIQ 388 ++ E F+ + + Sbjct: 275 SKIDECFFEALGGEAS 290 >gi|73992734|ref|XP_543096.2| PREDICTED: similar to Protein KIAA1510 precursor [Canis familiaris] Length = 1405 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 24/159 (15%), Positives = 49/159 (30%), Gaps = 18/159 (11%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKES 294 V++G Y+ +V + + +L NT T A+ H Sbjct: 243 VQVGLTQYSGDPQTEWDLNALRTKEDVLAAVRRLRYKGGNTFTGLALTHVLE-----HNL 297 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 K +I +TDG++ + +++ G+ I++V V + Sbjct: 298 RPAAGPRPEATKVLILVTDGKSQD--------DARAAGRILKDLGVAIFAVGV--KNADE 347 Query: 355 DLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 L+ V D +L + I ++ Sbjct: 348 AELQLLASQPLDITVHNVQDFPQLGTLAGLLGRLICQKV 386 >gi|156742542|ref|YP_001432671.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156233870|gb|ABU58653.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 547 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 22/150 (14%), Positives = 54/150 (36%), Gaps = 13/150 (8%) Query: 237 RIGTIAYNIGIVGNQCT-PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 R+G + ++ PLS+ + + + T Y A+ L + + Sbjct: 408 RVGLVTFSTESRLVVPPAPLSDTRIRLDDAIAVMRAQGRTALYDALIDGKEALDSLPST- 466 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 + ++ ++DG ++ + A + +G+ I+ VA A D Sbjct: 467 -----GDDRIRAIVLLSDGLDNSSRATLEQVRL-----AFEESGISIFPVAYGADA-DTD 515 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITD 385 L++ S D+ ++ + F+ ++ Sbjct: 516 ALQQIATFSRTILVQGDAGDIGQIFENLSR 545 >gi|301770509|ref|XP_002920678.1| PREDICTED: integrin alpha-11-like [Ailuropoda melanoleuca] Length = 1203 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 186 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVRDVVEA 243 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK ++ ITDGE ++ Sbjct: 244 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMVVITDGE------SHDS 292 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 293 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAA 352 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 353 LKDIVDALGDRIFSL 367 >gi|281343950|gb|EFB19534.1| hypothetical protein PANDA_009430 [Ailuropoda melanoleuca] Length = 1112 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 33/195 (16%), Positives = 66/195 (33%), Gaps = 23/195 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V Sbjct: 155 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVRDVVEA 212 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T T A + + + G KK ++ ITDGE ++ Sbjct: 213 ASHIEQRGGTETRTAFGIEF-----ARSEAFQKGGRKGAKKVMVVITDGE------SHDS 261 Query: 326 LNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDSRE 375 + ++ + + Y+VAV + L + FF V D Sbjct: 262 PDLEKVIQQSERDNVTRYAVAVLGYYNRRGINPEAFLNEIKYIASDPDDKHFFNVTDEAA 321 Query: 376 LLESFDKITDKIQEQ 390 L + D + D+I Sbjct: 322 LKDIVDALGDRIFSL 336 >gi|115453449|ref|NP_001050325.1| Os03g0403600 [Oryza sativa Japonica Group] gi|113548796|dbj|BAF12239.1| Os03g0403600 [Oryza sativa Japonica Group] Length = 416 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 19/145 (13%), Positives = 46/145 (31%), Gaps = 24/145 (16%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ + ++ L NTN + + ++ + V+ ++DG+ + Sbjct: 6 RADLGALVDGLAADGNTNIRAGLEIGLAVAAGRRLTAGRAVN-------VMLMSDGQQNR 58 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSREL 376 + G+ +++ + A +L+ G F V D L Sbjct: 59 GD-----------ATRLDPGGVPVHTFGLGADH-DPAVLQAIAGKSREGMFHYVADGVNL 106 Query: 377 LESFDKITD---KIQEQSVRIAPNR 398 F ++ I Q + + R Sbjct: 107 TAPFSQLLGGLLTIIAQDLELTVTR 131 >gi|332992330|gb|AEF02385.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas sp. SN2] Length = 1360 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 39/333 (11%), Positives = 89/333 (26%), Gaps = 16/333 (4%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 + + A N K + F G ++ Sbjct: 356 YSSYNQNNSSMLIRGVAANDADDFSSGNRYLLRNIGKTSGVTWSMGSFSSGNDYASPDLS 415 Query: 132 SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS 191 + + ++ + D + + K+ P ++ Sbjct: 416 GVVKEIVDRAGWQSGNAMAFVFSDFTGYRGAYTYSGSQSSAAKLVVKFNGSATPGQTSTV 475 Query: 192 KNTTKSKYAPAPAPANR----------KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 + SK A + L K + + Sbjct: 476 REHLISKVDELTASGYTPIVDTLYEAVNYYGGRDVDYGLTRGNYKVSNTVRKNTRVSHRS 535 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 AY G +NL++ +N+ P T P + + S + Sbjct: 536 AYLGGDAVQPNGCSDDNLSD-SDCINEYIPSGATYISPVNDLQCQTNNHIVLLSDGEANN 594 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQIC----EYMRNAGMKIYSVAVSAPPEGQDLL 357 + + +G+ + LN ++ + + + +++ +A + L Sbjct: 595 NHSVSLIESLLGDSCTGSGGEKCGLNLVKNVSDSDTSVIDRRVITHTIGFAANSTANNFL 654 Query: 358 RKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 + S G F+ +DS ELLE+F+ I +++ Sbjct: 655 NQLAVQSGGGFYQADDSTELLEAFNTILRSVKD 687 >gi|75907582|ref|YP_321878.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701307|gb|ABA20983.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 570 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 21/127 (16%), Positives = 44/127 (34%), Gaps = 20/127 (15%) Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++ L T Y A A L + V+ +TDGE+SG+ Sbjct: 453 QFISGLRADGGTKLYDAAIQARNWLQKNRRQGAI--------NAVLILTDGEDSGSKISL 504 Query: 324 NTLNTLQICEYMRNAG------MKIYSVAVS-APPEGQDLLRKCTDSSGQFFAVNDSREL 376 + L+ ++ +G + ++V D L+K + +G +++ D + Sbjct: 505 DNLSA-----ELQKSGFSTDQRIGFFTVGYGEEGEFNPDALKKIAELNGGYYSKGDPETI 559 Query: 377 LESFDKI 383 + Sbjct: 560 SRLMSDL 566 >gi|219886181|gb|ACL53465.1| unknown [Zea mays] Length = 561 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 58/170 (34%), Gaps = 19/170 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL----SNNLNE 261 ++D+L +A +V ++ R+ +A++ V + L ++ Sbjct: 14 TTSRLDLLKTAAKFMVAKLEDGD--------RLSIVAFSDRPVRELSSGLLYMTADGRRN 65 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 L++L T PA A + L F++ +TDG + + Sbjct: 66 AIRSLDQLEARGGTALVPAFEEAVKVLD------GRQGDGGDRLGFIVLLTDGAEDASGS 119 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 + + ++ +++ + + LL +S G + V+ Sbjct: 120 FTLSERRREVIRGAL-RKYPVHAFGLGTAHGPEVLLYLAQESRGTYSFVD 168 >gi|187918047|ref|YP_001883610.1| hypothetical membrane spanning protein [Borrelia hermsii DAH] gi|119860895|gb|AAX16690.1| hypothetical membrane spanning protein [Borrelia hermsii DAH] Length = 341 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 58/172 (33%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 PL+ + + +L+ ++ + + A L S K+ V Sbjct: 154 IVPLTIDRDFFSRKLDDIYIMDLGNGSALGLGVSIALSHL----------KHSEAPKRSV 203 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP----------------- 351 I +TDG + Y++ Q+ + +KIYS+ + + Sbjct: 204 IVLTDGVVNSDEVYKD-----QVINLAQGLNVKIYSIGIGSGEELSVGFKLRSGKFYQGT 258 Query: 352 ----EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + + G F++V+D + + K E+ VRI + Sbjct: 259 LREVYDPSMLLEIASKTGGLFYSVSDDFSFKLAIQDFSKKENVERKVRITVD 310 >gi|149546336|ref|XP_001514218.1| PREDICTED: similar to Procollagen, type VI, alpha 2 [Ornithorhynchus anatinus] Length = 1023 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 82/233 (35%), Gaps = 19/233 (8%) Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + ++ ++ + S+ P +D + + Sbjct: 21 PQYVNNIGLELTDSRISERRNCPEKTDCPINVYFVIDTSESVAMQPPIESLVDHIKDFVI 80 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNT 277 + ++ + +++ + + +PL +N +++L+ + + T T Sbjct: 81 QFTSQLENEFYQ-NQVAISWHYGGLHFSDLVEIFSPLPSNKEAFRNKLSGVKYFGRGTFT 139 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ + ++ + + F + ITDG +G+ L E R+ Sbjct: 140 DCAIANMTEQVTQSQVAG---------VNFAVVITDGHVTGSPCGGMKLQ----AERARD 186 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES--FDKITDKIQ 388 AG+K++SVAV+ + LR+ + + + N+ L +D I IQ Sbjct: 187 AGIKLFSVAVNENLY-ESGLREIANLPYELYR-NNYTTLKRGIDYDTINKIIQ 237 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 61/206 (29%), Gaps = 24/206 (11%) Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 D K+ + + + N+V+ + Sbjct: 591 TECDVMTYVRETCGCCDCEKRCGAVDIVFV--IDSSESIGFTNFSLEKNFVINVVSRLGA 648 Query: 227 AIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS---NNLNEVKSRLNKLNP-YENTNTYPAM 281 ++ K+ + R+G + Y+ ++L+ K + L T T A+ Sbjct: 649 IAKDPKSETGARVGVVQYSHEGTFEAIQLDDERIDSLSSFKEAVKNLEWIAGGTWTPSAL 708 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 AY +L E + K F + ITDG + N N +C + Sbjct: 709 QFAYNKLIKESRR-------KKTKVFAVVITDGRHD---PRDNDANLRALCHV----DVT 754 Query: 342 IYSVAVSA---PPEGQDLLRKCTDSS 364 + ++ + + + L+ + Sbjct: 755 VNAIGIGDMFHKRQEDETLKSIACGN 780 >gi|126737457|ref|ZP_01753192.1| von Willebrand factor type A domain protein [Roseobacter sp. SK209-2-6] gi|126722042|gb|EBA18745.1| von Willebrand factor type A domain protein [Roseobacter sp. SK209-2-6] Length = 479 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 20/140 (14%), Positives = 46/140 (32%), Gaps = 21/140 (15%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + P + + ++ + P T A+ A L+ +E + Sbjct: 80 SDIEQLIAPAAGTREAISKAVDAITPKGKTPLSAAVIQAAEGLHLSEEKAT--------- 130 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVS-APPEGQDLLRKCTD 362 VI I+DGE + + I + AG+ ++++ A + L+ + Sbjct: 131 --VILISDGEETCGR------DPCAIGAELEAAGVDFTLHAIGFGIADDAARAQLQCLAE 182 Query: 363 -SSGQFFAVNDSRELLESFD 381 + G + + L + Sbjct: 183 NTGGVYLDAKGAEGLSAALS 202 >gi|262117974|dbj|BAI48012.1| collagen type VI alpha 3 subunit [Mesocricetus auratus] Length = 2675 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 27/173 (15%), Positives = 59/173 (34%), Gaps = 17/173 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKN--LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 I+ ES G ++ + + + S+R+G + YN + ++ +N Sbjct: 1038 INFQRESFGEVLRFVSEIVDTVYEEGDSIRVGLVQYNSDPTDEFFLKDYSTKRQIIDAIN 1097 Query: 268 KLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 K+ + + H N + R+ + IT G+ + Sbjct: 1098 KVVYKGGRHANTKVGIEHLR---LNHFVPGAGSRLDERVPQIAFVITGGK--------SV 1146 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + G+K++++ V + + K +S F V +EL E Sbjct: 1147 EDAQGASLALTQKGVKVFAIGV--RNIDSEEVGKIASNSATAFRVGSVQELSE 1197 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 21/188 (11%), Positives = 55/188 (29%), Gaps = 20/188 (10%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +L E +V S+ + VR+ + Y+ + V + Sbjct: 429 GVRSGFPLLKEFVQRVVESL-----DVGPDRVRVALVQYSDRTRPEFYLNSHMDQQGVIN 483 Query: 265 RLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +L NT A+ R + S + + +I +T + Sbjct: 484 AIRRLTLLGGPIPNTGAALDFVLRNILTSSTGSRIA---EGVPQLLIVLTAERSGDDVRG 540 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + ++ G + + ++ + A+ REL Sbjct: 541 PSVV--------LKQGGAV--PIGIGIGNADISEMQTISFIPDFAVAIPSFRELGTVQQV 590 Query: 383 ITDKIQEQ 390 +++++ + Sbjct: 591 VSERVIQL 598 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 23/146 (15%), Positives = 52/146 (35%), Gaps = 15/146 (10%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKE 293 VRIG + ++ + + + V + +L + NT A+ R L+ Sbjct: 863 VRIGVVQFSNDVFPEFYLKTHKSQSLVLDAIRRLRFKGGSPLNTGKALEFVARNLF---V 919 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + ++ G++ A + + ++G+ S+ + Sbjct: 920 KSAGSRIEDGVPQHLVLFLGGKSQDDVARH--------AQVISSSGIM--SLGIGDRNID 969 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLES 379 + L+ T+ F V + REL Sbjct: 970 RADLQTITNDPKMVFTVREFRELPNI 995 >gi|90578363|ref|ZP_01234174.1| hypothetical protein VAS14_14969 [Vibrio angustum S14] gi|90441449|gb|EAS66629.1| hypothetical protein VAS14_14969 [Vibrio angustum S14] Length = 259 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 27/222 (12%), Positives = 64/222 (28%), Gaps = 22/222 (9%) Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 D S L+ N + + + S +KI Sbjct: 40 DASSGYGFDALKNDWPALNNDNALVSDNWLATNYLLIFDGSGSMDNTNCGNGQKKIVAAK 99 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 ++ +N I + ++ + +NN +K + + Sbjct: 100 QAIQTFINDIPNSANVG--------LYVFDNKDASLRVPLGNNNRATLKQAIYDVTAGGA 151 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T ++ +Y L + + V+ +TDG+ S Q +N Sbjct: 152 TPLKSSLDSSYSALE-----RQASKQLGYGEYNVVIVTDGDASQGENPQPAIN-----RI 201 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 R++ + I+++ + + T + + N+ +L Sbjct: 202 YRDSPVTIHTIGFCIGEQHALNAKGIT----YYQSANNPEKL 239 >gi|308048514|ref|YP_003912080.1| type IV pilin biogenesis protein [Ferrimonas balearica DSM 9799] gi|307630704|gb|ADN75006.1| type IV pilin biogenesis protein, putative [Ferrimonas balearica DSM 9799] Length = 1183 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 32/296 (10%), Positives = 87/296 (29%), Gaps = 53/296 (17%) Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 + M + L + + N + +++N + + + Sbjct: 195 PQNRFMRSPYDGTSPSASLSERERAADDARNSTQFGQNDVVTLFTENYLTYLHHHSGSVQ 254 Query: 207 NRKIDVLIESAGNLVNSIQKAIQE----KKNLSVRIGTIAYNIGIVGNQCTPLS-NNLNE 261 ++I + E+ +L+N+ +N + + + G + ++ +N + Sbjct: 255 RQRITIARETTNSLINTTTGVDFGLMVFNRNRNSGNTIGSDDGGRIVEGIREMTESNRAD 314 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYN---------------------EKESSHNTIG 300 + + ++ L +T+ ++ AYR ++ + + Sbjct: 315 LVNTVSSLEATSDTSLCESLFEAYRYFSGGAVLGGNKGGALLPAADDSVVSGDNYISPLS 374 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG--------------------- 339 S + +V+ +TDG ++ + + Sbjct: 375 SCQANSYVLLMTDGSPYRDNSLNTLIESELGLTAADKHNGSHLPGVAEWMYQNDMNTSAT 434 Query: 340 ----MKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + Y+V S L + G ++ ND+ L S +I +I Sbjct: 435 GHQKVVTYTVGFSQGAVDAAELLEQTAHRGGGLYYPANDAAALQSSLQQIVSEILS 490 >gi|119595693|gb|EAW75287.1| collagen, type XX, alpha 1, isoform CRA_c [Homo sapiens] Length = 1329 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 45/140 (32%), Gaps = 18/140 (12%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 A E V++G Y+ + +V + + +L NT T A+ H Sbjct: 214 APFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKEQVLAAVRRLRYKGGNTFTGLALTHVL 273 Query: 286 -RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + L K VI +TDG++ +++ G+ +++ Sbjct: 274 GQNLQPAAGLRPEA------AKVVILVTDGKSQDDVH--------TAARVLKDLGVNVFA 319 Query: 345 VAVSAPPEGQDLLRKCTDSS 364 V V + LR Sbjct: 320 VGV--KNADEAELRLLASPP 337 >gi|66347353|emb|CAI95066.1| collagen, type XX, alpha 1 [Homo sapiens] gi|119595691|gb|EAW75285.1| collagen, type XX, alpha 1, isoform CRA_a [Homo sapiens] Length = 1297 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 45/140 (32%), Gaps = 18/140 (12%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 A E V++G Y+ + +V + + +L NT T A+ H Sbjct: 214 APFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKEQVLAAVRRLRYKGGNTFTGLALTHVL 273 Query: 286 -RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + L K VI +TDG++ +++ G+ +++ Sbjct: 274 GQNLQPAAGLRPEA------AKVVILVTDGKSQDDVH--------TAARVLKDLGVNVFA 319 Query: 345 VAVSAPPEGQDLLRKCTDSS 364 V V + LR Sbjct: 320 VGV--KNADEAELRLLASPP 337 >gi|112734845|ref|NP_065933.2| collagen alpha-1(XX) chain [Homo sapiens] gi|292495087|sp|Q9P218|COKA1_HUMAN RecName: Full=Collagen alpha-1(XX) chain; Flags: Precursor gi|66347352|emb|CAI95065.1| collagen, type XX, alpha 1 [Homo sapiens] gi|119595695|gb|EAW75289.1| collagen, type XX, alpha 1, isoform CRA_e [Homo sapiens] Length = 1284 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 45/140 (32%), Gaps = 18/140 (12%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 A E V++G Y+ + +V + + +L NT T A+ H Sbjct: 207 APFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKEQVLAAVRRLRYKGGNTFTGLALTHVL 266 Query: 286 -RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + L K VI +TDG++ +++ G+ +++ Sbjct: 267 GQNLQPAAGLRPEA------AKVVILVTDGKSQDDVH--------TAARVLKDLGVNVFA 312 Query: 345 VAVSAPPEGQDLLRKCTDSS 364 V V + LR Sbjct: 313 VGV--KNADEAELRLLASPP 330 >gi|45946128|gb|AAH43183.1| Collagen, type XX, alpha 1 [Homo sapiens] Length = 1284 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 45/140 (32%), Gaps = 18/140 (12%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 A E V++G Y+ + +V + + +L NT T A+ H Sbjct: 207 APFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKEQVLAAVRRLRYKGGNTFTGLALTHVL 266 Query: 286 -RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + L K VI +TDG++ +++ G+ +++ Sbjct: 267 GQNLQPAAGLRPEA------AKVVILVTDGKSQDDVH--------TAARVLKDLGVNVFA 312 Query: 345 VAVSAPPEGQDLLRKCTDSS 364 V V + LR Sbjct: 313 VGV--KNADEAELRLLASPP 330 >gi|119595694|gb|EAW75288.1| collagen, type XX, alpha 1, isoform CRA_d [Homo sapiens] Length = 637 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 45/140 (32%), Gaps = 18/140 (12%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 A E V++G Y+ + +V + + +L NT T A+ H Sbjct: 207 APFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKEQVLAAVRRLRYKGGNTFTGLALTHVL 266 Query: 286 -RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + L K VI +TDG++ +++ G+ +++ Sbjct: 267 GQNLQPAAGLRPEA------AKVVILVTDGKSQDDVH--------TAARVLKDLGVNVFA 312 Query: 345 VAVSAPPEGQDLLRKCTDSS 364 V V + LR Sbjct: 313 VGV--KNADEAELRLLASPP 330 >gi|7959281|dbj|BAA96034.1| KIAA1510 protein [Homo sapiens] Length = 1140 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 45/140 (32%), Gaps = 18/140 (12%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 A E V++G Y+ + +V + + +L NT T A+ H Sbjct: 57 APFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKEQVLAAVRRLRYKGGNTFTGLALTHVL 116 Query: 286 -RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + L K VI +TDG++ +++ G+ +++ Sbjct: 117 GQNLQPAAGLRPEA------AKVVILVTDGKSQDDVH--------TAARVLKDLGVNVFA 162 Query: 345 VAVSAPPEGQDLLRKCTDSS 364 V V + LR Sbjct: 163 VGV--KNADEAELRLLASPP 180 >gi|149732066|ref|XP_001492602.1| PREDICTED: similar to complement factor B [Equus caballus] Length = 768 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 29/203 (14%), Positives = 71/203 (34%), Gaps = 19/203 (9%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + + + + + V T+ ++ S++ + Sbjct: 278 SDSIGAHNFTRAKNCLRDFIEKVASYGVKPRYGLVTYATVP---KVLIRVSQERSSDADW 334 Query: 262 VKSRLNKLNPYE-----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN+++ + TNT A+ Y + E + G R + ++ +TDG + Sbjct: 335 VTEKLNEISYEDHKLKTGTNTRKALQAVYSMM--SWEGNAPPEGWNRTRHVILLMTDGLH 392 Query: 317 SGA-------SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QF 367 + ++ L+ + + R + IY V +++ + G Sbjct: 393 NMGGDPVPVIHEIRDLLDIGRDRKNPREDYLDIYVFGVGPLVNQENINALASKKDGEQHV 452 Query: 368 FAVNDSRELLESFDKITDKIQEQ 390 F V D L + F ++ D+ + Sbjct: 453 FKVKDMENLEDVFFQMLDETRTL 475 >gi|254481635|ref|ZP_05094879.1| hypothetical protein GPB2148_2134 [marine gamma proteobacterium HTCC2148] gi|214038263|gb|EEB78926.1| hypothetical protein GPB2148_2134 [marine gamma proteobacterium HTCC2148] Length = 1306 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 40/336 (11%), Positives = 95/336 (28%), Gaps = 73/336 (21%) Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 A + S+ + +E + +++ V N+ Sbjct: 284 GNWAALSNSVHTPDHVECLDDVNSLNAGNGSGVGDGYPQSNAMDGNEYGGSVDGDISW-G 342 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 +F++ + + + KID+ + ++++ + R Sbjct: 343 NTAYTFYTSHYMNWWHDDSLVTPRPKIDIAADVITTIIDTNTSVDFGLLEFNYRE----- 397 Query: 244 NIGIVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE--------- 293 + ++ N + + ++ L P T + + AYR L E Sbjct: 398 GGRVTQRIIQNMTAANRTNLINLVDALEPAGWTPLCESTYEAYRYLAGESPVYSGSASDG 457 Query: 294 -------------------SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ---- 330 ++ + + ++I +TDG + + TL Sbjct: 458 IYAGWRYDIAASDSAAMSGGNYVSPNTDCAYTYIIIMTDGLPGRDGGANDAIETLTQKDC 517 Query: 331 ----------------ICEYMRNA------------GMKIYSVAVSAPPEGQDLLRKCTD 362 + EYM G+ Y++ Q LL+ + Sbjct: 518 DMYDSEEGNSKNCMPQLAEYMATNDLDGDTTNGSQFGIT-YTIGF---ATDQQLLQDTAE 573 Query: 363 SS-GQFFAVNDSRELLESFD-KITDKIQEQSVRIAP 396 G+++ + + EL E+F I + + +P Sbjct: 574 KGKGEYYTADSAAELTEAFQGAIVSILSRDTTFTSP 609 >gi|115622698|ref|XP_001202504.1| PREDICTED: similar to LOC594926 protein, partial [Strongylocentrotus purpuratus] gi|115631776|ref|XP_785188.2| PREDICTED: similar to LOC594926 protein, partial [Strongylocentrotus purpuratus] Length = 338 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 21/137 (15%), Positives = 51/137 (37%), Gaps = 15/137 (10%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 S+N+ K +N++ TN Y + +A L +I +TD Sbjct: 4 DTSDNIAAAKRHVNRIRAGGGTNLYDGLRNAVDLLMEHGNGEAMP--------LIIMLTD 55 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG----QFFA 369 G+ + S + +I + + ++SV+ L K + S+ + + Sbjct: 56 GQPTSGSVKSTSEIIQRITNLIDGR-LSLFSVSFG-NGVDFSFLEKLSLSNQALARKVYE 113 Query: 370 VNDSR-ELLESFDKITD 385 + + ++ +D++ + Sbjct: 114 DSSASLQMKGFYDEVAN 130 >gi|110678222|ref|YP_681229.1| hypothetical protein RD1_0875 [Roseobacter denitrificans OCh 114] gi|109454338|gb|ABG30543.1| conserved hypothetical protein [Roseobacter denitrificans OCh 114] Length = 320 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 21/141 (14%), Positives = 49/141 (34%), Gaps = 30/141 (21%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P S ++ + ++ T+ + A + + S + VI Sbjct: 151 PFSFDVEAIARQIEGAQIGISGRATSISDGLGLALKRM----------ENSEAASRVVIL 200 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-----------EGQDLLRK 359 ++DG N+ + N + E G++++++A+ LR Sbjct: 201 LSDGVNNAGAT-----NPRGVAELAAQMGVRVHTIALGPKDLSTADPGERGVVDAATLRA 255 Query: 360 CTD-SSGQFFAVNDSRELLES 379 ++ S G+ F V + +L+ Sbjct: 256 ISEISGGESFRVRTTEDLVAV 276 >gi|309792255|ref|ZP_07686727.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308225796|gb|EFO79552.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 391 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 17/178 (9%), Positives = 57/178 (32%), Gaps = 15/178 (8%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + + + + + + E + + ++ + L+ + + + Sbjct: 25 VLDRSGSMQGAKLQSMKAATRKVIETLTDQDVVSIVIFDDTVQTLVPATLATDRTALLAA 84 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++ ++ T M + ++ +TDG+ + + Sbjct: 85 IDTISEAGGTAMSLGMQ--------AGQVELQKHSGPDRLSHMLLLTDGQT-----WGDE 131 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 I + A ++I ++ + A + LL + S G + D+ ++ F + Sbjct: 132 ETCRNIARALGQADVRITALGLGAEWN-EQLLDDLAEFSDGTSDYIADANQIGTFFQR 188 >gi|261880540|ref|ZP_06006967.1| BatB protein [Prevotella bergensis DSM 17361] gi|270332763|gb|EFA43549.1| BatB protein [Prevotella bergensis DSM 17361] Length = 342 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 58/165 (35%), Gaps = 45/165 (27%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++++ K L ++P + T+ A+ R + ++ + + Sbjct: 146 LPITSDYVSAKMFLQNIDPSLVSTQGTDIARAITVGMRSFTQQ----------EKVGRAI 195 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---------------APPEG 353 I ITDGE+ ++ + R G+ ++ + V G Sbjct: 196 IVITDGEDHEG-------GAMEAAKEARKRGINVFILGVGSTKGAPIPTGNGGYMTDATG 248 Query: 354 QDLL--------RKCTDSS-GQFFAVNDSRELLESFDKITDKIQE 389 Q ++ ++ + G + V+++ + E + K+Q+ Sbjct: 249 QTVMSALNEEMCKEVAKAGSGTYIHVDNTSDAQEQLNDELTKLQK 293 >gi|257063307|ref|YP_003142979.1| hypothetical protein Shel_05710 [Slackia heliotrinireducens DSM 20476] gi|256790960|gb|ACV21630.1| hypothetical protein Shel_05710 [Slackia heliotrinireducens DSM 20476] Length = 1514 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 71/222 (31%), Gaps = 43/222 (19%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 + K +VL S ++ + R ++ +N+ EV + + Sbjct: 746 STKSEVLQNSIARFAATLGYLSPGSQIAMTRFSVDTFSNAEC--ALLNWTNDTGEVTAAM 803 Query: 267 N-----------------------KLNPYENTNTYPAMHHAYRELYNEKESSH-NTIGST 302 N +T+TY + + N + Sbjct: 804 NQEYGNPLAEGGRANQTLDGLRVYNYGITGSTHTYRGIESYIENMTNGASGGYVPNAPQG 863 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------EGQD 355 +++I TDG+++ + ++ +T + ++N G I +V + + Sbjct: 864 NNSRYLIIFTDGKDNSGNLQKSMDDT----DALKNNGYTIITVLMQSAGMTSEDVEHSTT 919 Query: 356 LLRKCTDS---SGQFFAV---NDSRELLESFDKITDKIQEQS 391 L++ S ++F ND L++ F I +I + Sbjct: 920 FLKRLASSNASGEKYFYTAMYNDPEGLVKVFQDIAHEIAKPL 961 >gi|291299307|ref|YP_003510585.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] gi|290568527|gb|ADD41492.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] Length = 626 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 46/128 (35%), Gaps = 20/128 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + + + +L P +T ++ A + K K ++ ++DGE + Sbjct: 108 DADAITTAATELKPEGDTPIAYSLEKAAGDFTEAKGP-----------KTILLVSDGEET 156 Query: 318 GASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSR 374 + +++ E + + G+ +++ + + L + + G ++ D Sbjct: 157 CGG------DPVKVAEKIASQGVDLRVHVIGFQVDDATRKQLTEIAKAGKGSYYDAQDGP 210 Query: 375 ELLESFDK 382 L + Sbjct: 211 ALASRLKR 218 >gi|110798896|ref|YP_694951.1| von Willebrand factor type A domain-containing protein [Clostridium perfringens ATCC 13124] gi|110673543|gb|ABG82530.1| von Willebrand factor type A domain protein [Clostridium perfringens ATCC 13124] Length = 580 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 40/268 (14%), Positives = 78/268 (29%), Gaps = 57/268 (21%) Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 + I MV D S SME N + + Sbjct: 75 NPVDIIMVADKSGSMEYEMPTLKRAMKNFLDDIESSFGDRANISLIEF-----------S 123 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 K + N N +K + ++ ++ + VKS+ Sbjct: 124 GENKTYIGRYCDFNGFNCYEKYFIKGSIDDAKVLCDY-------------TSEYSTVKSK 170 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN------SGA 319 ++K+ Y T+ A+ ++L + ++ KK+V+F TDG Sbjct: 171 IDKITAYGRTDIEAALELVKKKLDDRNSNN---------KKYVVFFTDGLPIQLLNIETR 221 Query: 320 SAYQNTLNTLQICEYMRN----------AGMKIYSVAVSAPPE-------GQDLLRKCTD 362 I + + + YS+ + +D ++ + Sbjct: 222 EYPSLDYIEKYIIPHTKEYFYEKGFLDKNKVNFYSIGLFTGRRFDSEKKIAKDFIKSINN 281 Query: 363 SSGQFFAVNDSRELLESFDKITDKIQEQ 390 S G +F +DS L ++ I I + Sbjct: 282 S-GSYFITDDSNRLDSVYNDIAMNIINE 308 >gi|301767380|ref|XP_002919113.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(VI) chain-like [Ailuropoda melanoleuca] Length = 1059 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 29/256 (11%), Positives = 75/256 (29%), Gaps = 33/256 (12%) Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNT--TKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 S T A P +D + +++++ Sbjct: 16 WAAAQDVSGNARAVAFQDCPVDLFFVLDTSESVALRLKPYGALVDKVKAFTKRFIDNLRD 75 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPL---SNNLNEVKSRLNKLNPYE-NTNTYPAMH 282 + + ++ A + +PL ++ + +K+ ++ + + T T A+ Sbjct: 76 -RYYRCDRNLVWNAGALHYSDEVEIISPLRPMPSDRDALKASVDAVKYFGKGTYTDCAIK 134 Query: 283 HAYRELY-------------------NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 L + + K++I +TDG Sbjct: 135 KGLEGLXXXXXXXXXXXXXASAGGRRPARSRGCQGGSHLKENKYLIVVTDGHPLEGYKEP 194 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF---FAVNDSRELLESF 380 ++ G+K++SVA+ P + L + + F D + ++ Sbjct: 195 CG-GLEDAVNEAKHLGVKVFSVAI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGQTRDA- 250 Query: 381 DKITDKIQEQSVRIAP 396 ++I + + ++P Sbjct: 251 EEIITQTIDTLTPLSP 266 >gi|153867800|ref|ZP_01998040.1| hypothetical protein BGS_0597 [Beggiatoa sp. SS] gi|152144880|gb|EDN71960.1| hypothetical protein BGS_0597 [Beggiatoa sp. SS] Length = 276 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 30/158 (18%), Positives = 58/158 (36%), Gaps = 14/158 (8%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 N+ +I N T L++++N LN L ++ A + L Sbjct: 52 TSNMPGQIRFALTVFDSHNNIKTSLNDDINTFYGGLNSLTIDGGSDISMAFAPINQLLSQ 111 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + K V+F TDG + +N E +RN G++I++V S P Sbjct: 112 ARPHA---------AKIVVFYTDGYVFHSGKMDAIVND---AEALRNQGVQIFAV--SPP 157 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + + T + N+ +++ F + D + Sbjct: 158 EDDASAMSLITGYPNRVLRPNNLPDIVNRFRYVADAVV 195 >gi|148680070|gb|EDL12017.1| mCG120740 [Mus musculus] Length = 752 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 30/287 (10%), Positives = 96/287 (33%), Gaps = 38/287 (13%) Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ + + E + S + N++ M + Sbjct: 237 ASIMFMESIDSVTEFCKKENHNREAPTLHNMKCNYRSTWEVISRSEDFNNSMPMETPP-- 294 Query: 181 LPPPPKKSFWS-KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 PP S + + ++D++ +A + ++ I + N S Sbjct: 295 APPFFSLLRISERIVCLVLDVSGSMASYDRLDLMNRAAKHFLSQIIE------NRSWVGM 348 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKL--NPYENTNTYPAMHHAYRELYNEKESSHN 297 ++ + ++ +++N+ L L + T+ + A++ N + Sbjct: 349 VHFNHLANIKSELIQMNSNIER-NQLLQTLPTSADGGTSICSGIKAAFQVFKNGGYETDG 407 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDL 356 T ++ ++DGE+S + ++++G ++ +A+ + Sbjct: 408 TE--------ILLLSDGEDST---------AKDCIDEVKDSGAIVHFIALGPSADLAVTN 450 Query: 357 LRKCTDSSGQFFAVNDSRE---LLESFDKITDK---IQEQSVRIAPN 397 + + G+ +D + L+++F + + + ++S+++ Sbjct: 451 MSIL--TGGKHMYASDEAQNNGLIDAFVALASENVDVTQKSLQLESK 495 >gi|89054212|ref|YP_509663.1| von Willebrand factor, type A [Jannaschia sp. CCS1] gi|88863761|gb|ABD54638.1| von Willebrand factor type A [Jannaschia sp. CCS1] Length = 1356 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 63/182 (34%), Gaps = 24/182 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 KI + E ++ + + + + P + + Sbjct: 44 GVNKIVIAREVIAEMLADMADDVSLGLTVYGHRQRGSCTDIETIVAPAPGTQGR--ILDA 101 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N +NP T A+ A + L + +E++ VI ++DG + Sbjct: 102 VNAINPRGRTPMTDAVIAAAQSLRSTEEAAT-----------VILVSDGIEN------CN 144 Query: 326 LNTLQICEYMRNAGM--KIYSVAVS--APPEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 + I + G+ + + + PE + ++ D + GQF +++ EL ++ Sbjct: 145 PDPCAIAAELEATGVDFTAHVIGFDVASEPEARAQMQCIADNTGGQFLTADNATELSQAL 204 Query: 381 DK 382 ++ Sbjct: 205 EQ 206 >gi|149031330|gb|EDL86328.1| rCG38899 [Rattus norvegicus] Length = 1029 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 63/183 (34%), Gaps = 13/183 (7%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ ++ +++ +Q + I G + N++ K+ ++ Sbjct: 261 TKLQQTKKAMDKILSDLQTSDSFNIITFSDTVNIWKAEGSIQATV----QNIHNAKNYVS 316 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ T+ A+ A L + + ++F+TDGE + + + Sbjct: 317 RMEANGWTDINAALLAAASVLNHSNQ-EPGKGRGVGQIPLIMFLTDGEPTAGETTPSVI- 374 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ-----FFAVNDSRELLESFDK 382 L + ++S+A LLR+ + + + + + +L + + Sbjct: 375 -LSNVRQALAHRVSLFSLAFG-DDADFSLLRRLSLENQGEARRIYEDADAALQLEGLYAE 432 Query: 383 ITD 385 I+ Sbjct: 433 ISR 435 >gi|313903839|ref|ZP_07837228.1| von Willebrand factor type A [Thermaerobacter subterraneus DSM 13965] gi|313466027|gb|EFR61552.1| von Willebrand factor type A [Thermaerobacter subterraneus DSM 13965] Length = 1151 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 39/128 (30%), Gaps = 17/128 (13%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + A + K + VI +TDG + Sbjct: 539 AQGGTSLGSGLAAALPLMEGVKADV----------RHVIALTDGV-------SEPFDVTG 581 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + R G+ + +VA+ + L + + G + D +L + T + Sbjct: 582 LARAFRRQGVTLSAVAIGPDADRNTLAQLAREGGGALYEAADPGQLPTLLARDTALAARR 641 Query: 391 SVRIAPNR 398 +R P R Sbjct: 642 FIRDEPFR 649 >gi|220922748|ref|YP_002498050.1| hypothetical protein Mnod_2796 [Methylobacterium nodulans ORS 2060] gi|219947355|gb|ACL57747.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 135 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 25/105 (23%), Positives = 40/105 (38%), Gaps = 19/105 (18%) Query: 308 VIFITDGENSGASAYQNTLN--------------TLQICEYMRNAGMKIYSVAVSAPPEG 353 I TDG N + + T Q + AG+ +Y++ S P + Sbjct: 30 YIVNTDGSNPSSRFPPTNQDLTTPINIRNALDALTTQAYTNAKAAGISVYTIGFSTPSDS 89 Query: 354 QD-----LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 D LL C SS Q F ND+ L+ +F++I + + Sbjct: 90 IDDKGLSLLSNCASSSSQAFVANDANTLISAFNQIAKSVGSLRLT 134 >gi|293344916|ref|XP_001079629.2| PREDICTED: collagen, type VI, alpha 1 [Rattus norvegicus] gi|293356747|ref|XP_215375.5| PREDICTED: collagen, type VI, alpha 1 [Rattus norvegicus] gi|149043683|gb|EDL97134.1| procollagen, type VI, alpha 1 (predicted), isoform CRA_b [Rattus norvegicus] Length = 1034 Score = 52.2 bits (123), Expect = 1e-04, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 70/205 (34%), Gaps = 25/205 (12%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL---SNNLNE 261 P +D + +++++ + + ++ A + L + +E Sbjct: 62 PYGALVDKVKSFTKRFIDNLRD-RYYRCDRNLVWNAGALHYSDEVEIIRGLMRMPSGRDE 120 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +K+ ++ + + T T A+ EL SH K++I +TDG Sbjct: 121 LKASIDAVKYFGKGTYTDCAIKKGLEELL--IGGSHLKEN-----KYLIVVTDGHPLEGY 173 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF---FAVND----- 372 ++ G+K++SVA+ P + L + + F D Sbjct: 174 KEPCG-GLEDAVNEAKHLGIKVFSVAI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGHSR 230 Query: 373 --SRELLESFDKITDKIQEQSVRIA 395 + ++ D I D I+ ++ Sbjct: 231 DAEETISQTIDTIVDMIKNNVEQVC 255 >gi|47207521|emb|CAG14087.1| unnamed protein product [Tetraodon nigroviridis] Length = 1048 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 31/146 (21%), Positives = 51/146 (34%), Gaps = 23/146 (15%) Query: 257 NNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 EV ++ E T T A+H A E ++ + + K +I +TDG Sbjct: 20 QTTQEVVEAAKNISRQEGRETRTAYAIHKACTEAFSPERGAR-----EGATKVMIVVTDG 74 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSSG- 365 E+ + LQ CE + + Y++AV P ++ Sbjct: 75 ESHDGEELPD---ALQECE---DRNITRYAIAVLGHYIRRQQDPETFISEIKYIASDPDD 128 Query: 366 -QFFAVNDSRELLESFDKITDKIQEQ 390 FF V D L + D + D+I Sbjct: 129 KYFFNVTDEAALNDIVDALGDRIFTL 154 >gi|17231895|ref|NP_488443.1| hypothetical protein all4403 [Nostoc sp. PCC 7120] gi|17133539|dbj|BAB76102.1| all4403 [Nostoc sp. PCC 7120] Length = 570 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 21/127 (16%), Positives = 44/127 (34%), Gaps = 20/127 (15%) Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++ L T Y A A L + V+ +TDGE+SG+ Sbjct: 453 QFISGLRADGGTRLYDAAMQARNWLQKNRREGAI--------NAVLILTDGEDSGSQISL 504 Query: 324 NTLNTLQICEYMRNAG------MKIYSVAVS-APPEGQDLLRKCTDSSGQFFAVNDSREL 376 + L+ ++ +G + ++V D L+K + +G +++ D + Sbjct: 505 DNLSA-----ELQKSGFSTDQRIGFFTVGYGEEGEFNPDALKKIAELNGGYYSKGDPETI 559 Query: 377 LESFDKI 383 + Sbjct: 560 SRLMSDL 566 >gi|66814064|ref|XP_641211.1| hypothetical protein DDB_G0280503 [Dictyostelium discoideum AX4] gi|60469236|gb|EAL67231.1| hypothetical protein DDB_G0280503 [Dictyostelium discoideum AX4] Length = 527 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 43/326 (13%), Positives = 93/326 (28%), Gaps = 18/326 (5%) Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 K + ++ + K + D ++ EI N Sbjct: 82 QYLDKSYPILVPNNEGLKISTLKFQIKNSLQEVIDGLESKSVNGIVLSKNEIELNNSSSL 141 Query: 122 -GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ + II + + + E +K + N K Sbjct: 142 QSVLNNGDMVEMKLIEEIIIPPTTTTTTTTTTSSPSNYLEEKKRKEKEIQDENDRLLKSF 201 Query: 181 LPPPPKKSFWSKN--TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 + + + + + P N + +V L+ I + Sbjct: 202 IENNESITSDVEIMFCFDTTGSMYPIIENVRKEVTKT-VKCLIKDIPNIKIGI------M 254 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 G Y G + L++ +++ + + ++ + A +A L K+ S Sbjct: 255 GLGDYCDGENLIKTLDLTSKESDIITFIKEIPRTSGGDCPEAYEYA---LLKAKQLSW-- 309 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 S+ K + I D S +N + C+ + N G+KIY V + Sbjct: 310 --SSHTSKAFVLIGDNVPHEPSYTNLNINWFKECDDLYNMGIKIYGVKAGTDSSVSCFYQ 367 Query: 359 KCTD-SSGQFFAVNDSRELLESFDKI 383 + + +SG + + F I Sbjct: 368 EIAERTSGISIDFKNFDLITRLFLAI 393 >gi|260832994|ref|XP_002611442.1| hypothetical protein BRAFLDRAFT_63926 [Branchiostoma floridae] gi|229296813|gb|EEN67452.1| hypothetical protein BRAFLDRAFT_63926 [Branchiostoma floridae] Length = 1121 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 31/189 (16%), Positives = 54/189 (28%), Gaps = 24/189 (12%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + D + +VN + + V + A NN N++ Sbjct: 949 SGSVSGQFDTVRNFIRGVVNCLT---IGGSHARVGVIKFA-GSNANRQISLTDYNNKNDL 1004 Query: 263 KSRLNKLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 R+ L+ + + E + K I +TDG ++ +S Sbjct: 1005 LVRIRNLDRSLGSAVGSVAGLSVMRNEFSTSGRPTAR--------KIGIVLTDGRDTSSS 1056 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 E +RN G I+SV V ++ L F L Sbjct: 1057 DAVIPD-----AETLRNEGTTIFSVGV--ANARRETLENMASRPVNENVFTAT-FASLQT 1108 Query: 379 SFDKITDKI 387 D + KI Sbjct: 1109 IVDSLRPKI 1117 >gi|209550318|ref|YP_002282235.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209536074|gb|ACI56009.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 794 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 19/164 (11%), Positives = 46/164 (28%), Gaps = 16/164 (9%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + + I +N + + + L T PA+ A R Sbjct: 383 SRLTPNDRFNVIRFDDTMTDYFKGLVAATPDNREKAIAYVRGLPADGGTEMLPALEDALR 442 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + + V+F+TDG + ++++V Sbjct: 443 NQGPVATGA---------LRQVVFLTDGAIGNEQQLFQEI-------TANRGDARVFTVG 486 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + P + + G F + + ++ ++ K+Q Sbjct: 487 IGSAPNTYFMTKAAEIGRGTFTQIGSTDQVASRMGELFAKLQNP 530 >gi|296208409|ref|XP_002751081.1| PREDICTED: calcium-activated chloride channel regulator 4 [Callithrix jacchus] Length = 931 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 19/134 (14%), Positives = 53/134 (39%), Gaps = 24/134 (17%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ + A++ + + S ++ +TDGE++ AS+ Sbjct: 378 RAQGGTSICSGIKSAFQVI--------GELYSQLDGSEIVLLTDGEDNTASSC------- 422 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKITD- 385 + ++ +G ++ +A+ + ++ + G +D E L+++F +T Sbjct: 423 --IDEVKQSGAIVHFIALGKDA-DKAVIEMSNITGGSHLYASDEAENNGLIDAFGALTSG 479 Query: 386 --KIQEQSVRIAPN 397 I ++S+++ Sbjct: 480 NAGISQKSLQLESK 493 >gi|125586596|gb|EAZ27260.1| hypothetical protein OsJ_11197 [Oryza sativa Japonica Group] Length = 540 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 19/145 (13%), Positives = 46/145 (31%), Gaps = 24/145 (16%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ + ++ L NTN + + ++ + V+ ++DG+ + Sbjct: 130 RADLGALVDGLAADGNTNIRAGLEIGLAVAAGRRLTAGRAVN-------VMLMSDGQQNR 182 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSREL 376 + G+ +++ + A +L+ G F V D L Sbjct: 183 GD-----------ATRLDPGGVPVHTFGLGADH-DPAVLQAIAGKSREGMFHYVADGVNL 230 Query: 377 LESFDKITD---KIQEQSVRIAPNR 398 F ++ I Q + + R Sbjct: 231 TAPFSQLLGGLLTIIAQDLELTVTR 255 >gi|302336645|ref|YP_003801851.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] gi|301633830|gb|ADK79257.1| von Willebrand factor type A [Spirochaeta smaragdinae DSM 11293] Length = 474 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 22/123 (17%), Positives = 43/123 (34%), Gaps = 12/123 (9%) Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA---------SAYQNTLNTLQ 330 +Y ELY+ +G ++ VI ++DGE+ L+ + Sbjct: 176 GTDESYTELYHALADMALPVGERSGRRAVIVLSDGEDYSYATHSGNPHPIYGNQQLSPDE 235 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 + E G+ +Y++ L + + G + D EL + I KI Sbjct: 236 VVEEYIRNGVTLYAIHFGLE--KDQYLGEMALKTGGAVYDAKDQEELTGIYHDIRQKIDG 293 Query: 390 QSV 392 + + Sbjct: 294 EYL 296 >gi|242007523|ref|XP_002424589.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212508032|gb|EEB11851.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 1003 Score = 52.2 bits (123), Expect = 2e-04, Method: Composition-based stats. Identities = 29/140 (20%), Positives = 57/140 (40%), Gaps = 6/140 (4%) Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 + N++ + + + + + N + + +L T+ A+ AY E Sbjct: 270 NDFVNVAWFNNDVKWVVPCLKTLVQATTQIKNLLADAIERLTESNLTSYVTALDFAYEEF 329 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 +E IGS K V+F++DG G + +N + C N +KI++ A Sbjct: 330 RKFEEIKKPWIGSNC-HKIVMFLSDG---GTEWPTDVIN--RHCNNSNNENIKIFTFACG 383 Query: 349 APPEGQDLLRKCTDSSGQFF 368 P +L++ S+G +F Sbjct: 384 PHPIPTVILKEMACSTGGYF 403 >gi|327260860|ref|XP_003215251.1| PREDICTED: collagen alpha-2(VI) chain-like [Anolis carolinensis] Length = 1019 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/196 (15%), Positives = 64/196 (32%), Gaps = 16/196 (8%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 S+ P + ++ T A P +D + + ++ + + Sbjct: 30 ASSCADKTDCPVRVYFV-IDTSESIALQTVPIQSLVDHIKRFVPEFITRLEN-ELYQNQV 87 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 S+ + V + +N+ +LN + T T A+ + ++ Sbjct: 88 SITWQFAGLHFSDVVIFYSDFTNSKEIYLDKLNNIQYIGRGTFTDCALSNMTAQILANTS 147 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + + ITDG +G+ E R AG+K+++VA S Sbjct: 148 PGIT--------NYAVVITDGHVTGSPCGG----MKHQAERAREAGIKLFAVAPSQNIY- 194 Query: 354 QDLLRKCTDSSGQFFA 369 + LR+ S + + Sbjct: 195 EQGLREIASSPHELYR 210 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 25/174 (14%), Positives = 60/174 (34%), Gaps = 22/174 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS--- 256 + + + N+V+ + ++ K+ + R+G + Y+ Sbjct: 619 SSESIGYNNFSLEKNFVINVVSRLGSIAKDPKSETGARVGVVQYSHEGTFEAIQLNDKRI 678 Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 ++L+ K + KL T T A+ AY L E + + F + +TDG Sbjct: 679 DSLSSFKEAVKKLEWIAGGTWTLSALQFAYNTLIKESQREKARV-------FAVVVTDGR 731 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA--PPEGQDL-LRKCTDSSGQ 366 + + + + + + + ++ + + +D LR + Q Sbjct: 732 H-------DPRDNDSHLQALCGRNVTVTAIGIGDMFNAKEEDETLRSIACNDNQ 778 >gi|218296567|ref|ZP_03497295.1| von Willebrand factor type A [Thermus aquaticus Y51MC23] gi|218243109|gb|EED09641.1| von Willebrand factor type A [Thermus aquaticus Y51MC23] Length = 706 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/138 (18%), Positives = 45/138 (32%), Gaps = 21/138 (15%) Query: 254 PLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 P++ E +S L L T A A R L + K ++ ++ Sbjct: 361 PMTAQGKKEAESLLLSLRAGGGTVLGGAFREALRLLQDVPVER----------KALLVLS 410 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 DG I AG+++ ++A+ P L G+F+ Sbjct: 411 DGIIFDPKEP--------ILALAATAGVEVSALALG-PDADAAFLEALAQRGGGRFYRAA 461 Query: 372 DSRELLESFDKITDKIQE 389 +EL F K ++ + Sbjct: 462 TPKELPRLFLKEGQEVFQ 479 >gi|328947150|ref|YP_004364487.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] gi|328447474|gb|AEB13190.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] Length = 333 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/138 (15%), Positives = 43/138 (31%), Gaps = 38/138 (27%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + T + A L S KK ++ ITDGEN+ + + +T Sbjct: 168 GDGTAIGTGLSSAIYHL----------EKSKSPKKSIVLITDGENNSGAVHPHT-----A 212 Query: 332 CEYMRNAGMKIYSVAVSAPP-----------------------EGQDLLRKCTDSSGQFF 368 N + +Y + V + + R ++ +G+FF Sbjct: 213 ARLAVNKDISLYILGVGTRGVVPIDYVDPKSNKIYSGYLESKFDTSSIARIASEGNGKFF 272 Query: 369 AVNDSRELLESFDKITDK 386 + L ++ I+ Sbjct: 273 EIESISALSQAISSISKS 290 >gi|149028000|gb|EDL83451.1| complement factor B, isoform CRA_d [Rattus norvegicus] Length = 543 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/203 (14%), Positives = 68/203 (33%), Gaps = 19/203 (9%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + NL+ + + + V T+ ++ S++ + Sbjct: 57 SDSIGASNFTGAKRCLANLIEKVASYGVKPRYGLVTYATVP---KVLVRVSEERSSDADW 113 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN+++ TNT A+ Y + ++ R + +I +TDG + Sbjct: 114 VTEKLNQISYEDHKLKSGTNTKKALQAVYSMMSWPGDAPPEGWN--RTRHVIIIMTDGLH 171 Query: 317 SGASAYQNTL-NTLQICEYMRNAGMK------IYSVAVS--APPEGQDLLRKCTDSSGQF 367 + + + + + R+ +Y V P + L ++ Sbjct: 172 NMGGDPVTVIEDIRDLLDIGRDRKNPREDYLDVYVFGVGPLVDPVNINALASKKNNEQHV 231 Query: 368 FAVNDSRELLESFDKITDKIQEQ 390 F V D +L F K+ D+ + Sbjct: 232 FKVKDMEDLENVFYKMIDETKSL 254 >gi|52840865|ref|YP_094664.1| type IV fimbrial biogenesis PilY1-related protein [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] gi|52627976|gb|AAU26717.1| type IV fimbrial biogenesis PilY1-related protein [Legionella pneumophila subsp. pneumophila str. Philadelphia 1] Length = 1172 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 46/418 (11%), Positives = 108/418 (25%), Gaps = 64/418 (15%) Query: 34 DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI 93 DA L+ I + + +++ + K I+ ++ + Sbjct: 116 DANGLAPYTVIQGGNKVDNSSSRLNVAKAGVKAIIENYMPTTDFALGTYSTSNISTYNTW 175 Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 + P + + SA + +S + I +S Sbjct: 176 VYYMSPPGSNFVFTNTPVAGNRYVINPCYNYGSASSTVS-SNCASIGSLYGTSLVSSSQY 234 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 L + + +D + + ++ P + S Sbjct: 235 LQIGGASDDPTINDVLYAGSGFPGVFVSFNGPSPAT-PYPPNFSLSNYNNGNVLLSYSST 293 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--------------PLSNNL 259 S G+ A + V + + T ++ Sbjct: 294 RPSIGSFATGPTNAGFVPFSQQVMYVQRGFGYYSNQSFNTGNMRVNMTTAGTNPTTTSVT 353 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHA---------YRELYNEKESSHNTIGSTRLKKFVIF 310 N + + L L P N+ + + A + ++ T G+ K+++I Sbjct: 354 NAINAFLPHLRPETNSTSTTEIKAAAVQSPLAGLLTRSRSFMKTVGTTSGNCPQKQYIIL 413 Query: 311 ITDGENSGA-------------------------------SAYQNTLNTLQICEYMRNAG 339 I+DG + + Q + + ++N G Sbjct: 414 ISDGLPTQDLQARYWPPLGSAAATGYGVTATFNADGSLNSTNSQALSDAINEINALKNDG 473 Query: 340 MKIYSVAVSAPPE------GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 + + + + A + LR + G ++ + L+ S + I IQ Sbjct: 474 VLTFVIGMGAGVDPAVNPEAAATLRAMAVAGGTENYYPATSPQALVNSLNSILSNIQN 531 >gi|54296651|ref|YP_123020.1| hypothetical protein lpp0682 [Legionella pneumophila str. Paris] gi|53750436|emb|CAH11830.1| hypothetical protein lpp0682 [Legionella pneumophila str. Paris] Length = 1169 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 46/418 (11%), Positives = 108/418 (25%), Gaps = 64/418 (15%) Query: 34 DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI 93 DA L+ I + + +++ + K I+ ++ + Sbjct: 113 DANGLAPYTVIQGGNKVDNSSSRLNVAKAGVKAIIENYMPTTDFALGTYSTSNISTYNTW 172 Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 + P + + SA + +S + I +S Sbjct: 173 VYYMSPPGSNFVFTNTPVAGNRYVINPCYNYGSASSTVS-SNCASIGSLYGTSLVSSSQY 231 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 L + + +D + + ++ P + S Sbjct: 232 LQIGGASDDPTINDVLYAGSGFPGVFVSFNGPSPAT-PYPPNFSLSNYNNGNVLLSYSST 290 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--------------PLSNNL 259 S G+ A + V + + T ++ Sbjct: 291 RPSIGSFATGPTNAGFVPFSQQVMYVQRGFGYYSNQSFNTGNMRVNMTTAGTNPTTTSVT 350 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHA---------YRELYNEKESSHNTIGSTRLKKFVIF 310 N + + L L P N+ + + A + ++ T G+ K+++I Sbjct: 351 NAINAFLPHLRPETNSTSTTEIKAAAVQSPLAGLLTRSRSFMKTVGTTSGNCPQKQYIIL 410 Query: 311 ITDGENSGA-------------------------------SAYQNTLNTLQICEYMRNAG 339 I+DG + + Q + + ++N G Sbjct: 411 ISDGLPTQDLQARYWPPLGSAAATGYGVTATFNADGSLNSTNSQALSDAINEINALKNDG 470 Query: 340 MKIYSVAVSAPPE------GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 + + + + A + LR + G ++ + L+ S + I IQ Sbjct: 471 VLTFVIGMGAGVDPAVNPEAAATLRAMAVAGGTENYYPATSPQALVNSLNSILSNIQN 528 >gi|387437|gb|AAA39549.1| MHC factor B [Mus musculus] Length = 477 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 56/151 (37%), Gaps = 16/151 (10%) Query: 256 SNNLNEVKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 S++ + V +LN+++ TNT A+ Y + ++ R + +I Sbjct: 42 SSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAPPEGWN--RTRHVIII 99 Query: 311 ITDGENSGASAYQNTLNTLQICEYM-------RNAGMKIYSVAVS--APPEGQDLLRKCT 361 +TDG ++ + ++ + R + +Y V + L Sbjct: 100 MTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALASKK 159 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D +L F ++ D+ + S+ Sbjct: 160 DNEHHVFKVKDMEDLENVFYQMIDETKSLSL 190 >gi|74195997|dbj|BAE30555.1| unnamed protein product [Mus musculus] Length = 436 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 56/151 (37%), Gaps = 16/151 (10%) Query: 256 SNNLNEVKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 S++ + V +LN+++ TNT A+ Y + ++ R + +I Sbjct: 1 SSDADWVTEKLNQISYEDHKLKSGTNTKRALQAVYSMMSWAGDAPPEGWN--RTRHVIII 58 Query: 311 ITDGENSGASAYQNTLNTLQICEYM-------RNAGMKIYSVAVS--APPEGQDLLRKCT 361 +TDG ++ + ++ + R + +Y V + L Sbjct: 59 MTDGLHNMGGNPVTVIQDIRALLDIGRDPKNPREDYLDVYVFGVGPLVDSVNINALASKK 118 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 D+ F V D +L F ++ D+ + S+ Sbjct: 119 DNEHHVFKVKDMEDLENVFYQMIDETKSLSL 149 >gi|34534804|dbj|BAC87116.1| unnamed protein product [Homo sapiens] Length = 725 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 49/136 (36%), Gaps = 18/136 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G ++ + EVK+R+ ++ T T A+ + Sbjct: 82 DISPERVRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVFKGGRTETGLALKYLLHRG 141 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ + + +I +TDG++ G A + + ++ G+ +++V V Sbjct: 142 LPGGRNA-------SVPQILIIVTDGKSQGDVALPS--------KQLKERGVTVFAVGVR 186 Query: 349 APPEGQDLLRKCTDSS 364 P + L Sbjct: 187 FPRWEE--LHALASEP 200 >gi|293597548|ref|NP_001170793.1| calcium channel, voltage-dependent, alpha 2/delta subunit 1 preproprotein [Gallus gallus] Length = 1082 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/313 (9%), Positives = 95/313 (30%), Gaps = 25/313 (7%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK-AQYEIPTENLFLKGLIPSALT 129 + K E + +I +++ + + A IPT+ ++ + L Sbjct: 126 NAKDDQNDPEKNDTESGSQRIRPVFEEDPVFRRQTSYQHAAVHIPTDIYEGSTIVLNELN 185 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLYLQKHNDNNNMTSNKYLLPPPP 185 + E+ + + + ++ N + P Sbjct: 186 WTAALDDVFKRNREEDPTLLWQVFGSATGLARYYPASPWVDNSRTPNKIDLYDVRRRPWY 245 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYN 244 + S + + + + ++ S ++ ++ + + ++ Sbjct: 246 IQGAASPKDMLILVDASGSVSGLTLKLIRTSVIEMLETLSDDDFVNVVSFNNNAQNVSCF 305 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 +V N ++K ++K++ T+ +A+ +L N S Sbjct: 306 NHLVQANVR----NKKKLKEAVDKISAKGITDYKKGFSYAFEQLLNHSVSRA------NC 355 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DS 363 K ++ TDG A + N + +++++ +V + ++ ++ Sbjct: 356 NKIIMLFTDGGEERAQEIFHKYNE--------DKKVRVFTFSVGQHNYDKGPIQWMACEN 407 Query: 364 SGQFFAVNDSREL 376 G ++ + + Sbjct: 408 KGYYYEIPSIGAI 420 >gi|262203982|ref|YP_003275190.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262087329|gb|ACY23297.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 461 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 20/180 (11%), Positives = 50/180 (27%), Gaps = 34/180 (18%) Query: 209 KIDVLIESAGNLVNSIQKAIQ-------EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 +ID + ++ + ++ + + P+ + Sbjct: 64 RIDAARNAVSTFISDLTSGTPFGLVAYGNTESAKTTPQAVGCEDVSTLARLGPI--DKEA 121 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 +S ++ + T A+ A L E V+ ++DGE + Sbjct: 122 ARSAIDGVRAQGWTPLSAALTRAAEMLGTEA-------------GSVVLVSDGEAN---- 164 Query: 322 YQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCTDSSGQFF-AVNDSRELLE 378 + +R + I +V + L+ G F +++ +L Sbjct: 165 --CLPDPCATARSLREQNPNLTISTVGFKSDA---AQLQCVAREGGGVFVTADNTAQLSA 219 >gi|195614282|gb|ACG28971.1| retrotransposon protein [Zea mays] Length = 650 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 58/170 (34%), Gaps = 19/170 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL----SNNLNE 261 ++D+L +A +V ++ R+ +A++ V + L ++ Sbjct: 103 TTSRLDLLKTAAKFMVAKLEDGD--------RLSIVAFSDRPVRELSSGLLYMTADGRRN 154 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 L++L T PA A + L F++ +TDG + + Sbjct: 155 AIRSLDQLEARGGTALVPAFEEAVKVLD------GRQGDGGDRLGFIVLLTDGAEDASGS 208 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 + + ++ +++ + + LL +S G + V+ Sbjct: 209 FTLSERRREVIRGALGR-YPVHAFGLGTAHGPEVLLYLAQESRGTYSFVD 257 >gi|116253186|ref|YP_769024.1| transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] gi|115257834|emb|CAK08932.1| putative transmembrane protein [Rhizobium leguminosarum bv. viciae 3841] Length = 797 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 48/164 (29%), Gaps = 16/164 (9%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + + I +N + + + L T PA+ A R Sbjct: 386 SKLNPDDRFNVIRFDDTMTDYFKGLVAATPDNREKAIAYVRGLTADGGTEMLPALQAALR 445 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + + V+F+TDG + + ++++V Sbjct: 446 NQGPVATGA---------LRQVVFLTDGAIGNERQLFQEI-------TANRSDARVFTVG 489 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + P + + G F A+ + ++ ++ K+Q Sbjct: 490 IGSAPNTYFMTKAAEIGRGTFTAIGSTDQVASRMGELFAKLQNP 533 >gi|312071611|ref|XP_003138688.1| CUTiclin family member [Loa loa] gi|307766146|gb|EFO25380.1| CUTiclin family member [Loa loa] Length = 450 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 20/114 (17%), Positives = 39/114 (34%), Gaps = 17/114 (14%) Query: 265 RLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + ++ TNT A+ A + + K I +TDG++ + A Sbjct: 4 AIQRIKYLSGATNTGAALKFALERGFQDARGGGI-------PKVAIVVTDGQSQDSVAES 56 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + +R+A + +Y+V V L + + + V EL Sbjct: 57 --------AQQLRDAHIMLYAVGV-TNLVNVHQLHQIAGNPARVLTVESFDELS 101 >gi|253569758|ref|ZP_04847167.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|251840139|gb|EES68221.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] Length = 621 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/290 (8%), Positives = 79/290 (27%), Gaps = 28/290 (9%) Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 + + + ++ A Y + L + ++ Sbjct: 159 QVGESPLSTFSIDVDA-ASYSNMRRMINSGTLPVADAIRTEELVNYFSYDYAKPTGSDPV 217 Query: 152 MVLDVSRSMEDLYLQKHNDN-NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 + + + P F ++ Sbjct: 218 KITMEAGVCPWNADHRLVRIGLKAREIPTDKLPESNLVFLID-------VSGSMWGPTRL 270 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 D++ S LVN++++ ++ + Y + ++ +++ +++L Sbjct: 271 DLVKSSLKLLVNNLREKD--------KVAIVVYAGNASVKLESTPGSDKQKIRDAIDELT 322 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 +T + AY+ + N +I +DG+ + + L Q Sbjct: 323 SGGSTAGGAGIQLAYKVAKHNFLPKGNNR--------IILCSDGDFNVGVSSVEGL--EQ 372 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLES 379 + E R +G+ + + + + + G +++ +E Sbjct: 373 LIEKERKSGVFLSVLGYGMGNYKDNKGQALAEKGNGNHAYIDNLQEANRV 422 >gi|90420796|ref|ZP_01228702.1| conserved hypothetical protein with von Willebrand factor domain [Aurantimonas manganoxydans SI85-9A1] gi|90335087|gb|EAS48848.1| conserved hypothetical protein with von Willebrand factor domain [Aurantimonas manganoxydans SI85-9A1] Length = 320 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 47/144 (32%), Gaps = 29/144 (20%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 PL+ ++ V +++ +T + A + L S + V+ Sbjct: 150 PLTFDVEAVGRLIDQATIGISGRSTAISDGLGLALKRL----------ARSDARSRVVVL 199 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA----------PPEGQDLLRKC 360 ++DG N+ + G++++++A+ LR Sbjct: 200 LSDGVNNAG-----AVQPRDAGSLAERLGIRVHTIALGPADLETDPKSRDAVDTATLRAI 254 Query: 361 T-DSSGQFFAVNDSRELLESFDKI 383 S G+ F V + +L + I Sbjct: 255 AETSGGETFRVRTTDDLRQVARAI 278 >gi|29788808|gb|AAP03354.1| hypothetical protein [Oryza sativa Japonica Group] gi|108708692|gb|ABF96487.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] Length = 540 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/145 (13%), Positives = 46/145 (31%), Gaps = 24/145 (16%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ + ++ L NTN + + ++ + V+ ++DG+ + Sbjct: 130 RADLGALVDGLAADGNTNIRAGLEIGLAVAAGRRLTAGRAVN-------VMLMSDGQQNR 182 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSREL 376 + G+ +++ + A +L+ G F V D L Sbjct: 183 GD-----------ATRLDPGGVPVHTFGLGADH-DPAVLQAIAGKSREGMFHYVADGVNL 230 Query: 377 LESFDKITD---KIQEQSVRIAPNR 398 F ++ I Q + + R Sbjct: 231 TAPFSQLLGGLLTIIAQDLELTVTR 255 >gi|148675553|gb|EDL07500.1| mCG120277 [Mus musculus] Length = 1031 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 63/183 (34%), Gaps = 13/183 (7%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ ++ +++ +Q + I G + N++ K+ ++ Sbjct: 263 TKLQQTKKAMDTILSDLQASDSFNIITFSDTVNIWKAEGSIQATV----QNIHSAKNYVS 318 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ T+ A+ A L + + ++F+TDGE + + + Sbjct: 319 RMEADGWTDINAALLAAASVLNHSNQ-EPGKGRGVGQIPLIMFLTDGEPTAGETTPSVI- 376 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ-----FFAVNDSRELLESFDK 382 L + ++S+A LLR+ + + + + + +L + + Sbjct: 377 -LSNIRQALAHRVSLFSLAFG-DDADFSLLRRLSLENQGEARRIYEDADAALQLEGLYAE 434 Query: 383 ITD 385 I+ Sbjct: 435 ISR 437 >gi|148675425|gb|EDL07372.1| mCG142757 [Mus musculus] Length = 1320 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 41/140 (29%), Gaps = 18/140 (12%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 V++G Y+ +V + ++ L NT T A+ H Sbjct: 205 TQFAIGPDKVQVGLTQYSGDPQTEWDLNSFQTKEQVLAAVHHLRYKGGNTFTGLALTHVL 264 Query: 286 -RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + L K +I +TDG++ + +++ + ++ Sbjct: 265 EQNLKPAAGVRPEA------AKVLILVTDGKSQD--------DVRTAARILKDQDIDVFV 310 Query: 345 VAVSAPPEGQDLLRKCTDSS 364 V V + L+ Sbjct: 311 VGV--KNVDEAELKLLASQP 328 >gi|123231311|emb|CAM23231.1| collagen, type XX, alpha 1 [Mus musculus] gi|220938197|emb|CAX15853.1| collagen, type XX, alpha 1 [Mus musculus] Length = 1280 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 41/140 (29%), Gaps = 18/140 (12%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 V++G Y+ +V + ++ L NT T A+ H Sbjct: 197 TQFAIGPDKVQVGLTQYSGDPQTEWDLNSFQTKEQVLAAVHHLRYKGGNTFTGLALTHVL 256 Query: 286 -RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + L K +I +TDG++ + +++ + ++ Sbjct: 257 EQNLKPAAGVRPEA------AKVLILVTDGKSQD--------DVRTAARILKDQDIDVFV 302 Query: 345 VAVSAPPEGQDLLRKCTDSS 364 V V + L+ Sbjct: 303 VGV--KNVDEAELKLLASQP 320 >gi|258679414|ref|NP_082794.1| collagen alpha-1(XX) chain [Mus musculus] gi|292495082|sp|Q923P0|COKA1_MOUSE RecName: Full=Collagen alpha-1(XX) chain; Flags: Precursor gi|123231310|emb|CAM23230.1| collagen, type XX, alpha 1 [Mus musculus] gi|220938196|emb|CAX15852.1| collagen, type XX, alpha 1 [Mus musculus] Length = 1320 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 41/140 (29%), Gaps = 18/140 (12%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 V++G Y+ +V + ++ L NT T A+ H Sbjct: 205 TQFAIGPDKVQVGLTQYSGDPQTEWDLNSFQTKEQVLAAVHHLRYKGGNTFTGLALTHVL 264 Query: 286 -RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + L K +I +TDG++ + +++ + ++ Sbjct: 265 EQNLKPAAGVRPEA------AKVLILVTDGKSQD--------DVRTAARILKDQDIDVFV 310 Query: 345 VAVSAPPEGQDLLRKCTDSS 364 V V + L+ Sbjct: 311 VGV--KNVDEAELKLLASQP 328 >gi|262199272|ref|YP_003270481.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082619|gb|ACY18588.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 430 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/189 (11%), Positives = 52/189 (27%), Gaps = 22/189 (11%) Query: 208 RKIDVLIESAGNLVNSI---QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 K+ + +A L+ ++ + V + + G + E + Sbjct: 66 DKLSDVKTAALELLETLQPEDTITLVSYSSDVSMHLMRTRADDAGQR---------EARR 116 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L L T P + A L + + ++ +DG + + Sbjct: 117 ALLALQARGGTALGPGLFRALEALEGASDRTRM--------SHLMLFSDGIANAGEVRPS 168 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 L + + + ++ V + R G++ + DS + D Sbjct: 169 VLGARAAGAFGAG--VSVSTMGVGVDYNEDLMTRLADQGGGRYHFIQDSEAIASILDDEM 226 Query: 385 DKIQEQSVR 393 + R Sbjct: 227 KGLVATVAR 235 >gi|227547429|ref|ZP_03977478.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. infantis ATCC 55813] gi|227212076|gb|EEI79972.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. infantis ATCC 55813] Length = 362 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/348 (11%), Positives = 101/348 (29%), Gaps = 30/348 (8%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A LS + + DQ++ L+ + ++ D + Sbjct: 39 ATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGSLEIMNALKAGGQDHDAVWPASSMW 98 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 ++I + A + A + +++ ++ + + Sbjct: 99 ISMGDTKHIVKDAASTSTTPI------VFGIAKSKAVKLGWADDTGATKPVSTADILAAV 152 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + ++ + + L P + W + + S + ++ ++ Sbjct: 153 SDGKLTFSMTSATVIDSALNVYQTALR-KPSWTIWVVDYSGSMSGEGKNGVVKGLNAALD 211 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + +K+ E + V I + T S +++ + + T Sbjct: 212 -----PDQAKKSYIEPASGDVNILIPFETEAHRPVKATGTS--TSDLLHEADATDASGGT 264 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + Y + A EL +E E+S ++ +TDG ++ + Sbjct: 265 DIYEGLLSALDELPSESEASQ-------YTTAIVLMTDGRSNSDHQDEFESAYKS----- 312 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 R + I+S+ L+ S+ + F +L F + Sbjct: 313 RGRDLPIFSIMFG--DADPSQLKSLATLSNAKVFDGRS-GDLAAVFRQ 357 >gi|309266960|ref|XP_003086909.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha (globulin) inhibitor H5-like, pseudogene [Mus musculus] gi|309271570|ref|XP_003085348.1| PREDICTED: LOW QUALITY PROTEIN: inter-alpha (globulin) inhibitor H5-like, pseudogene [Mus musculus] Length = 1321 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 63/183 (34%), Gaps = 13/183 (7%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ ++ +++ +Q + I G + N++ K+ ++ Sbjct: 297 TKLQQTKKAMDTILSDLQASDSFNIITFSDTVNIWKAEGSIQATV----QNIHSAKNYVS 352 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ T+ A+ A L + + ++F+TDGE + + + Sbjct: 353 RMEADGWTDINAALLAAASVLNHSNQ-EPGKGRGVGQIPLIMFLTDGEPTAGETTPSVI- 410 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ-----FFAVNDSRELLESFDK 382 L + ++S+A LLR+ + + + + + +L + + Sbjct: 411 -LSNIRQALAHRVSLFSLAFG-DDADFSLLRRLSLENQGEARRIYEDADAALQLEGLYAE 468 Query: 383 ITD 385 I+ Sbjct: 469 ISR 471 >gi|224079848|ref|XP_002194049.1| PREDICTED: similar to von Willebrand factor A domain containing 1 [Taeniopygia guttata] Length = 525 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/174 (12%), Positives = 56/174 (32%), Gaps = 21/174 (12%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHHAYR 286 V+ I + + V+ + + +TNT A+ A Sbjct: 65 PFTFGPRDVQTSIIHISTTPTMEFPFDQHLSSGTVRKAIRDTRQLMGDTNTGKALSFAKE 124 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 +L++ + + + K ++++TDG ++ + + +++ G+ ++ V+ Sbjct: 125 KLFSGEAGAR-----PDVPKVLVWVTDGFSTDDISEP--------MQLLKDMGVTVFIVS 171 Query: 347 VSAPPEGQDLLRKCTDSSG----QFFAVNDSRELLESFDK-ITDKIQEQSVRIA 395 L F V+D + + I I+ + + Sbjct: 172 TGRGNF--LELSAAASQPSDKHLHFVDVDDLPIITKELRDGILGVIRAKRLHAT 223 >gi|222080976|ref|YP_002540339.1| hypothetical protein Arad_7191 [Agrobacterium radiobacter K84] gi|221725655|gb|ACM28744.1| conserved hypothetical protein [Agrobacterium radiobacter K84] Length = 649 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/221 (14%), Positives = 61/221 (27%), Gaps = 4/221 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MT + V F +I+ + + +R+Q Q D A +G + + + T + Sbjct: 26 MTVFCLPVVIGFAALSIEYGYGLLVRDQNQRTADLASYAGALAYSNANSEDQMTDAALRV 85 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + S Q +++T L I ++ I TE Sbjct: 86 AKL--NGVDAANVVVSLTASPKDSRVQAVHVDVTTTNTLFLAPILGVDSKLNIATEAYSS 143 Query: 121 KGLIPSA-LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 G S + L ++G+ S C V S + + + + Sbjct: 144 LGAAESGCIIALDKSASGVTLSGGVQANASSCYVASNSNLVAPCGTKITAKSASYYEGSS 203 Query: 180 LLPPPPKK-SFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 P + P V S Sbjct: 204 QPCPWSTNIVRADGSPAPVTKQYTSDPLEGNAGVDKLSGRF 244 >gi|332879551|ref|ZP_08447246.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] gi|332682517|gb|EGJ55419.1| von Willebrand factor type A domain protein [Capnocytophaga sp. oral taxon 329 str. F0087] Length = 352 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 50/165 (30%), Gaps = 42/165 (25%) Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P++++ K L +NP T + A + + K + Sbjct: 142 TQLPITSDYVSAKIFLETINPSMITTQGTDIKQAIDLAMKSF------TSNQDVSKAIFV 195 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL------------- 357 ITDGE++ +++ + G+K+Y + V +P + Sbjct: 196 ITDGEDNEG-------GAVEMAKAAAEKGIKVYVLGVGSPQGAPIPMPGSSQYITDNTGN 248 Query: 358 ------------RKCTDSSGQFFAVNDS----RELLESFDKITDK 386 G + V++S +L DK+ K Sbjct: 249 VVVSKLNESMCREIAAAGQGAYIYVDNSSSAQEQLSGYVDKLAKK 293 >gi|296108821|ref|YP_003615770.1| Magnesium chelatase [Methanocaldococcus infernus ME] gi|295433635|gb|ADG12806.1| Magnesium chelatase [Methanocaldococcus infernus ME] Length = 283 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/219 (14%), Positives = 71/219 (32%), Gaps = 19/219 (8%) Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 N N + K L ++ S A R++++ + +L+ Sbjct: 81 KNKNLALYIEKEDLREKIREKKISSYILFVVDTSGSMGALRRMELAKGAIRSLLV----- 135 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + R+G I + + P ++++ + L + T A AY Sbjct: 136 --DAYQKRNRVGMIVFR-KDSADLILPFTSSVELAEKSLRDVPTGGRTPLSKAFLKAYET 192 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 E + + ++FI+D + + A +ICE + G+ ++ + Sbjct: 193 FEKE------LRKNPNIIPIMVFISDFKPNVAIKNDFIKEIYEICEKIHEKGIN--TIFI 244 Query: 348 SAPPEGQDLL---RKCTDSSGQFFAVNDSRELLESFDKI 383 P+ L + G + D +L + +I Sbjct: 245 DTEPKTFIKLGIGEELAKKFGFKYYKIDEIKLDDLLKEI 283 >gi|325688744|gb|EGD30753.1| von Willebrand factor type A [Streptococcus sanguinis SK115] Length = 551 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 21/133 (15%), Positives = 50/133 (37%), Gaps = 14/133 (10%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 N +IG ++Y+ + N S + S + L P T TY A + + + Sbjct: 409 NEENQIGLVSYSDDVTINVPIDTMNSTQKSYFTSAIKGLTPSGGTATYDGTLVAVKMILD 468 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + + ++DG+ +G ++ ++ G+ + ++ +A Sbjct: 469 KMKENPGARP------VIFVLSDGQTNGG------YEFERVEPIIKALGITVNTIGYNAD 516 Query: 351 PEGQDLLRKCTDS 363 + L K +S Sbjct: 517 LKELTKLSKINES 529 >gi|67468139|ref|XP_650129.1| elongation factor-2 kinase [Entamoeba histolytica HM-1:IMSS] gi|56466695|gb|EAL44742.1| elongation factor-2 kinase, putative [Entamoeba histolytica HM-1:IMSS] Length = 402 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 64/180 (35%), Gaps = 18/180 (10%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS-----NNLNEVKSRLNKL 269 SA + +N I I + + +R + Y Q L N++ ++++ +N+L Sbjct: 22 HSAQSNINKIVNTITNSEKIDLRFAIVEYKDHQPNQQQFALKKYDWMNDIKDIQNAINQL 81 Query: 270 NPYENTNTYP-----AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + Y P A A Y + + DG +G Sbjct: 82 SAYGGGMDGPESVTCAFDCAVNLGYRGYAAKVIIWIADAPPHGFNIQYDGYPNG---CPC 138 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSA----PPEGQDLLRKCT-DSSGQFFAVNDSRELLES 379 ++ ++ ++IYSVA +DL+R + GQF A+N + L + Sbjct: 139 GIDFQEVVLKAIKNDIQIYSVACEPIRPIYRHFRDLMRAVAMMTGGQFIALNSADCLADV 198 >gi|303246237|ref|ZP_07332517.1| Protein of unknown function DUF2134, membrane [Desulfovibrio fructosovorans JJ] gi|302492300|gb|EFL52172.1| Protein of unknown function DUF2134, membrane [Desulfovibrio fructosovorans JJ] Length = 382 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 32/308 (10%), Positives = 82/308 (26%), Gaps = 25/308 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + V F T A+D + Y R+Q+Q+A A++ + ++ Sbjct: 20 IVALSMIVLAGFATLAVDYGFLEYKRSQLQNAA------DAAALAGASVLVQYGANQEAV 73 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + +L + ++T ++ + P E +FL Sbjct: 74 TDTAVLYGQANLNDSDSKEMAIRNS------DVTYPDAVSVRATVGRTQERGNPVE-MFL 126 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ +++ + S + + S + + D + N Sbjct: 127 GRILGWNTQDIAATGVAALFCSKSSKCLK-----PWSPPAKFTWK-DDCDADKKYYNNNQ 180 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 L S P K + GT Sbjct: 181 LDAGSVCEMNSVEVQGYDNNDVGTPIILKFGDPSGTVTPGHYQPVDYPPASS------GT 234 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + + + + + + ++L P L + +S ++ Sbjct: 235 PMTGGDAYRDNIAGCTGSNDGIVNTGDELQVEPGDMVGPTKQGLKELLDQDPGASWDSAT 294 Query: 301 STRLKKFV 308 ++ + Sbjct: 295 NSIVDSAY 302 >gi|149377596|ref|ZP_01895335.1| hypothetical protein MDG893_19479 [Marinobacter algicola DG893] gi|149358138|gb|EDM46621.1| hypothetical protein MDG893_19479 [Marinobacter algicola DG893] Length = 718 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 61/183 (33%), Gaps = 15/183 (8%) Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 + + ++ L+ + + + I + + NN+ + + LN Sbjct: 369 ESIRQARQALLRGLGTLDAD--DRFNVIQFNSQTHSLFMESVPASGNNIARARRYVKGLN 426 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T PA+ A + S + V+FITDG SA + Sbjct: 427 ADGGTEMAPALDAALET------NGDGGEASRARVRQVVFITDGAVGNESALFGKIRDGL 480 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++++V + + P + G + A++D ++ D++ K++ Sbjct: 481 GSS-------RLFTVGIGSAPNMHFMREAARYGRGTYTAISDLSDVARPLDELFGKMEAP 533 Query: 391 SVR 393 + Sbjct: 534 VLT 536 >gi|91201647|emb|CAJ74707.1| hypothetical protein kuste3944 [Candidatus Kuenenia stuttgartiensis] Length = 339 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 54/154 (35%), Gaps = 43/154 (27%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNT----IGSTRLK---------------------K 306 T+ + + L+ ++ S T S + K Sbjct: 153 GYQTSIGEGIFTSITALFEKEMGSRFTFTELRNSINKQYLDDYAVSFVKEMKKRDLLKNK 212 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV------SAPPEG-----QD 355 +I TDG + + L ++ G+K Y VAV P+ Q+ Sbjct: 213 LIILFTDGIYNIGISPDRPLR------LLQRMGIKAYVVAVKASDVYGIDPDIAAQHIQE 266 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 L + G+++ ++ E+ + +D+I D+I++ Sbjct: 267 LQEAIESTGGKYYHADNFEEVAKFYDEI-DRIEK 299 >gi|330808169|ref|YP_004352631.1| hypothetical protein PSEBR_a1432 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327376277|gb|AEA67627.1| Conserved hypothetical protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 2855 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 30/292 (10%), Positives = 80/292 (27%), Gaps = 23/292 (7%) Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 + L GS + ++ A + + ++ + L S Sbjct: 1929 QDLAPGSVTGSDPSATSETASGTLVGSVSGATGAVSFTLVGNATGAYGQLLLHPDGSYTY 1988 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ-------KHNDNNNMTSNKYLLP 182 L+ + + +S + S+ + + Sbjct: 1989 TLTSPANTTPHANDGPNVLSESFTYQATDSLGNSTTGSLVVSIVDDVPKAVASERSVTAV 2048 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 + + S + P ++D+ ++ L++ V++ + Sbjct: 2049 EIDSNLLIVLDVSGSMADDSGVPGLSRLDLAKQAISALLDKYDDL------GDVKVQLVT 2102 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 ++ + ++ KS L+ L+ TN A+ A + T Sbjct: 2103 FSSSATDQTSVWV--DVATAKSLLSSLSADGGTNYDAAVATAKTAFVTSGQ-------LT 2153 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + F +DG+ + T + ++ G+K Y++ + Sbjct: 2154 GAQNIGYFFSDGKPNSGLE-TGTADEAAWKAFLDANGIKNYAIGLGDGVSND 2204 >gi|315186713|gb|EFU20471.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6578] Length = 331 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 51/156 (32%), Gaps = 50/156 (32%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T+ + A + ++ VI ++DGE + E Sbjct: 167 GTDVASGLSRALEAFPEQSNR----------QRLVILLSDGEALTGEIGP-------VLE 209 Query: 334 YMRNAGMKIYSVAVSAPPE-----------------------GQDLLRKCTD-SSGQFFA 369 RN G+ +++V + LL++ + + G+FF+ Sbjct: 210 LARNLGVAVHTVGIGTESGGPVPLEGEEVLKKPSGEPVISRLDASLLKRIAEITGGRFFS 269 Query: 370 VNDSRELLESFDKITDKIQEQS-------VRIAPNR 398 V ++ ++F + I+E +R+ P R Sbjct: 270 VENAEG--QTFQHVVSTIEETIAREEREGIRLVPAR 303 >gi|326505132|dbj|BAK02953.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 521 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 39/136 (28%), Gaps = 21/136 (15%) Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+K ++ L TN + A + + T V ++DG+ + Sbjct: 119 AELKGIVDGLVANGGTNIKAGLDTALAVIAGRATTKARTPN-------VFLMSDGQQTDG 171 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSRELL 377 A Q + +Y+ LL G F +V D + Sbjct: 172 DARQVDPG-----------NVAVYTFGFGKDA-DHALLSDVAKKSPGGTFNSVPDGGNVS 219 Query: 378 ESFDKITDKIQEQSVR 393 F ++ + + Sbjct: 220 APFSQLLGGLLSIVAQ 235 >gi|322688246|ref|YP_004207980.1| hypothetical protein BLIF_0055 [Bifidobacterium longum subsp. infantis 157F] gi|320459582|dbj|BAJ70202.1| conserved hypothetical protein [Bifidobacterium longum subsp. infantis 157F] Length = 362 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 39/348 (11%), Positives = 101/348 (29%), Gaps = 30/348 (8%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A LS + + DQ++ L+ + ++ D + Sbjct: 39 ATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGSLEIMNALKAGGQDHDAVWPASSMW 98 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 ++I + A + A + +++ ++ + + Sbjct: 99 ISMGDTKHIVKDAASTSTTPI------VFGIAKSKAVKLGWADDTGAAKPVSTADILAAV 152 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + ++ + + L P + W + + S + ++ ++ Sbjct: 153 SDGKLTFSMTSATVIDSALNVYQTALR-KPSWTIWVVDYSGSMSGEGKNGVVKGLNAALD 211 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + +K+ E + V I + T S +++ + + T Sbjct: 212 -----PDQAKKSYIEPASGDVNILIPFETEAHRPVKATGTS--TSDLLHEADATDASGGT 264 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + Y + A EL +E E+S ++ +TDG ++ + Sbjct: 265 DIYEGLLSALDELPSESEASQ-------YTTAIVLMTDGRSNSDHQDEFESAYKS----- 312 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 R + I+S+ L+ S+ + F +L F + Sbjct: 313 RGRDLPIFSIMFG--DADPSQLKSLATLSNAKVFDGRS-GDLAAVFRQ 357 >gi|270008950|gb|EFA05398.1| hypothetical protein TcasGA2_TC015570 [Tribolium castaneum] Length = 873 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/263 (11%), Positives = 72/263 (27%), Gaps = 29/263 (11%) Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + VLD S SM ++ + + +S+ Sbjct: 247 VHFFAPSGLQTFPKHVVFVLDHSGSMGGRKYEQLKQAMDKILSDLNPDDLFHIVRFSEIV 306 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 V K + + +N + I + Sbjct: 307 ----------------SVWNLEKNKFDKIRFKQMPDYENFDSVLAEFNLRDAIQVTE--- 347 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 +N+ + KS + + TN + + + + T+ + +IF+TDG Sbjct: 348 --DNIKKAKSIKDDIVDMACTNIIGGLVVGLYLVRRTLQKFYEKNIETKHQPMIIFLTDG 405 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVN 371 + ++ + + I+S++ + L + + G + Sbjct: 406 LPNVGLIIRDEITNVVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYEAA 465 Query: 372 DSR-ELLESFDKITDKIQEQSVR 393 D+ +L + + +R Sbjct: 466 DAALQLQNFY----RTVSSLLLR 484 >gi|91084773|ref|XP_972278.1| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] Length = 698 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 29/263 (11%), Positives = 72/263 (27%), Gaps = 29/263 (11%) Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + VLD S SM ++ + + +S+ Sbjct: 247 VHFFAPSGLQTFPKHVVFVLDHSGSMGGRKYEQLKQAMDKILSDLNPDDLFHIVRFSEIV 306 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 V K + + +N + I + Sbjct: 307 ----------------SVWNLEKNKFDKIRFKQMPDYENFDSVLAEFNLRDAIQVTE--- 347 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 +N+ + KS + + TN + + + + T+ + +IF+TDG Sbjct: 348 --DNIKKAKSIKDDIVDMACTNIIGGLVVGLYLVRRTLQKFYEKNIETKHQPMIIFLTDG 405 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVN 371 + ++ + + I+S++ + L + + G + Sbjct: 406 LPNVGLIIRDEITNVVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYEAA 465 Query: 372 DSR-ELLESFDKITDKIQEQSVR 393 D+ +L + + +R Sbjct: 466 DAALQLQNFY----RTVSSLLLR 484 >gi|320104266|ref|YP_004179857.1| hypothetical protein Isop_2740 [Isosphaera pallida ATCC 43644] gi|319751548|gb|ADV63308.1| protein of unknown function DUF1355 [Isosphaera pallida ATCC 43644] Length = 1239 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 26/178 (14%), Positives = 56/178 (31%), Gaps = 21/178 (11%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + AI N + N N + + ++++ P + P M Sbjct: 461 AKAAISTLSNYDYAGVLFWTGRDQWLFPLITVGPNRNRMLALIDQMIPGDMPAFGPGMTV 520 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A L + ++ + K +I I+DG+ + + + + Sbjct: 521 ASNSLLQKTDA---------ITKHMIIISDGDP--------APPPPGLINQLVRGKITVT 563 Query: 344 SV---AVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 +V A P + ++ + G+F+ V + R L + K I + N Sbjct: 564 TVLTAAHGNDPGSFNTMQSIAQATKGRFYNVTNPRALPRIYQKEIRLISRPLIHEEVN 621 >gi|156744078|ref|YP_001434207.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156235406|gb|ABU60189.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 429 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 22/181 (12%), Positives = 60/181 (33%), Gaps = 21/181 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 K + ++ L+N + + + + PL + + +R Sbjct: 188 GTPKTVIARQALIALINRLPETTNVALRTYGHRRADDCSDTELIQALAPLQRD--ALIAR 245 Query: 266 LNKLNP--YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +N + P T ++ ++L ++ ++DG+ + Sbjct: 246 INAIRPVNGGRTPIAQSLADMAQDLA-----------GIEGNVLIVLVSDGDETCGGDPV 294 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG-QDLLRKCT-DSSGQFFAVNDSRELLESFD 381 T + L+ N+ ++I + E + L G +F +++ +L ++ D Sbjct: 295 ATASMLRAA----NSQLRISVIGFDVEQEEWRRRLEGIAVAGGGAYFDASNAEQLADALD 350 Query: 382 K 382 + Sbjct: 351 Q 351 >gi|167535479|ref|XP_001749413.1| hypothetical protein [Monosiga brevicollis MX1] gi|163772041|gb|EDQ85698.1| predicted protein [Monosiga brevicollis MX1] Length = 2014 Score = 51.8 bits (122), Expect = 2e-04, Method: Composition-based stats. Identities = 16/137 (11%), Positives = 54/137 (39%), Gaps = 13/137 (9%) Query: 256 SNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + + + L+ + P T+ + RE++ + ++ + + + +I +TDG Sbjct: 402 TTDQYALNDLLDNVEFPAGATHLSWGLDFIDREMF--RLAAGMRSSNNSIPRVLIVLTDG 459 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVND 372 ++ + +++ G++IY++ V +++ F +++ Sbjct: 460 RSNPGF------EPDEYSTALKDKGIEIYAIGVG--DYYSIEVQEMASEPKDRHAFELSN 511 Query: 373 SRELLESFDKITDKIQE 389 +L D+++ + Sbjct: 512 QDDLARLVDRLSYQTCS 528 >gi|310824462|ref|YP_003956820.1| hypothetical protein STAUR_7237 [Stigmatella aurantiaca DW4/3-1] gi|309397534|gb|ADO74993.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 913 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 14/123 (11%), Positives = 36/123 (29%), Gaps = 17/123 (13%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + A+ ++ S + + V+ +D + Sbjct: 462 LNKVARGFSGGGGIFVGEALREGKTQIL----------RSDKATRHVLLFSDAAD----- 506 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAP-PEGQDLLRKCTD-SSGQFFAVNDSRELLES 379 + + +R + + + + P DLLR+ G+ + D+ L Sbjct: 507 SEEPDDYRATLAALRRENVTVSVIGLGTPKDSDADLLREVAQLGGGRIYFAEDALSLPRI 566 Query: 380 FDK 382 F + Sbjct: 567 FSQ 569 >gi|312092300|ref|XP_003147289.1| hypothetical protein LOAG_11723 [Loa loa] gi|307757546|gb|EFO16780.1| hypothetical protein LOAG_11723 [Loa loa] Length = 422 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 54/168 (32%), Gaps = 21/168 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 RI I ++ L + ++ + + +L T A+ R +K Sbjct: 268 FSRIALITFSSVGKSRTQFNLDRYFDGKDIVTAIRRLESSGGT---TAIGEGIRLGTEQK 324 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + H KK ++ TDG ++ + ++ + AG +Y++ Sbjct: 325 DKQHGGRPVEIAKKIMLVFTDGWSNKG------PDVEEMTRNAKGAGFTLYTIVYEGNGR 378 Query: 353 GQDLLRKCTDSSG-QFFAVND-SRELLESFDK--ITDKIQEQSVRIAP 396 +S G + + + + + T IQE R P Sbjct: 379 VD------ANSPGLNLYTIETMVDDHKHVYSERNFTQLIQELRQRNLP 420 >gi|262193497|ref|YP_003264706.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262076844|gb|ACY12813.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 583 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 22/164 (13%), Positives = 55/164 (33%), Gaps = 19/164 (11%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIG---- 300 + L+ + ++ +++L T Y + L + + Sbjct: 140 STDSDIRLGLTEDEEAIRGAIDELYILNGWTALYDGIRLGNETLGAAAATHSDYDSMDDF 199 Query: 301 -STRLKKFVIFITDGENSG--------ASAYQNTLNTLQI-CEYMRNAGMK--IYSVAVS 348 T K V+ TDG + + ++T +R A ++ IY+V + Sbjct: 200 CDTDRKLAVVAFTDGNENNSANERLRSDEYPGDGIDTTLEDLHDLRVADVRTPIYTVGLG 259 Query: 349 APPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + + G+ ++ + +L +F+ I++ + Sbjct: 260 -DEVDHGGLEELAGYTGGRHHRIDSAADLPATFEVISEYLASSV 302 >gi|170594383|ref|XP_001901943.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158590887|gb|EDP29502.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 415 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 28/186 (15%), Positives = 61/186 (32%), Gaps = 26/186 (13%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + K D+ I+ L+ S+ N + Y G + + Sbjct: 243 SGGVSDKRDIYIDYVSILIRSLD------LNRTAVHVAAIYYSGPKRARTLFHLRKHSRA 296 Query: 263 KSRLNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ + L T T A+++A E + + + K +I TDG + Sbjct: 297 ENAIKDLQRAPSNGGTTRTGEAIYYATNEFNEKFGARKDAR------KMIIIFTDGHSQD 350 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSREL 376 N + RN G+++ +V++ D + + ++ + +L Sbjct: 351 --------NPTEASRTARNKGIELKAVSIEDENIPPDTNQIIAITGDPSDAYSSKNFNKL 402 Query: 377 LESFDK 382 FD+ Sbjct: 403 QSFFDE 408 >gi|115377018|ref|ZP_01464236.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] gi|115365996|gb|EAU65013.1| von Willebrand factor type A domain protein [Stigmatella aurantiaca DW4/3-1] Length = 884 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 14/123 (11%), Positives = 36/123 (29%), Gaps = 17/123 (13%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + A+ ++ S + + V+ +D + Sbjct: 433 LNKVARGFSGGGGIFVGEALREGKTQIL----------RSDKATRHVLLFSDAAD----- 477 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAP-PEGQDLLRKCTD-SSGQFFAVNDSRELLES 379 + + +R + + + + P DLLR+ G+ + D+ L Sbjct: 478 SEEPDDYRATLAALRRENVTVSVIGLGTPKDSDADLLREVAQLGGGRIYFAEDALSLPRI 537 Query: 380 FDK 382 F + Sbjct: 538 FSQ 540 >gi|257062762|ref|YP_003142434.1| Mg-chelatase subunit ChlD [Slackia heliotrinireducens DSM 20476] gi|256790415|gb|ACV21085.1| Mg-chelatase subunit ChlD [Slackia heliotrinireducens DSM 20476] Length = 2281 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 30/238 (12%), Positives = 70/238 (29%), Gaps = 35/238 (14%) Query: 183 PPPKKSFWSKNTTKSKYAPAPAPAN-RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P P + + S + ++ + NL S+ + V++ + Sbjct: 71 PNPVNVIVILDNSGSMDTRTGGYGSQTRMAAAQNAVNNLARSLYAYNTTEFPDLVQMALV 130 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE------------LY 289 + G+ +N+ N +N+L+ TN A+ A + Sbjct: 131 GF--STTGSVVQGPTNSYNTFSGAVNRLDADGGTNWEDALQDAAGINFNDDDPTYVIFVS 188 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN-----------------TLNTLQIC 332 + + NT G+ + T G S Q + + Sbjct: 189 DGNPTFRNTRGNYNPMDNYYYNTWGVYGNGSDSQTVAGIAAATTIARCYEHAVDDAESLA 248 Query: 333 EYMRNAGMKIYSVA-VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + Y++ + L +G +F+ ++ +L + I +I++ Sbjct: 249 TSVGAD--HFYTIGAYGNVDRMRSLTTDAGAPAGNYFSAANTTDLQNALAAILAQIEK 304 >gi|123490500|ref|XP_001325627.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121908529|gb|EAY13404.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 688 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 44/365 (12%), Positives = 103/365 (28%), Gaps = 44/365 (12%) Query: 46 SDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIR----ENAGDIAQKAQINITKDKNNPL 101 + + + + + + I + +A + T +N Sbjct: 81 EAQDVTNVSKIQQYEGNGLNSFFLGSIGPNKTIEIHHKVSFMAMANENGFIYTFPLSNKG 140 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 Y + + E ++ +K L S+R T + + + D S ++E Sbjct: 141 IYGPFTYQRPENFEFSIHVKTLSELKEIINSVRGTINVIDPHNVIFATKEFPNDESITIE 200 Query: 162 DLYLQKHNDNNNMTSNKYLLPPP-PKKSFWSKNTTKSKYAPAPAPAN-RKIDVLIESAGN 219 K N+ + + ++ N+ + I Sbjct: 201 TQIKDKDNNIAIWSDGYIAISTFTYFETKVHSNSEFYFIIDCSGSMSGSCIQNAKLCLNI 260 Query: 220 LV-----NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YE 273 + I+ + V + Y N++E +LN ++ Sbjct: 261 FMHSLPIGCRFSIIKFGSDYEVALHPCDYTDE-----------NVSEAMKQLNNIDAEMG 309 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T+ + + K V +TDG+ NT + + Sbjct: 310 GTDILSPLKYVMEL-----------TPKQGFIKQVFLLTDGQ------DSNTNELCALAQ 352 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND--SRELLE-SFDKITDKIQEQ 390 R +I+S+ + + + ++ S G + V+D S +L E + + I Sbjct: 353 ENRTNN-RIFSIGIGSGADKDLIINVSQKSGGNYVFVDDDESEKLNEKVIELLNSAISYA 411 Query: 391 SVRIA 395 + + Sbjct: 412 LMHVC 416 >gi|32477849|ref|NP_870843.1| hypothetical protein RB13068 [Rhodopirellula baltica SH 1] gi|32448406|emb|CAD77921.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 499 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 12/59 (20%), Positives = 18/59 (30%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQ 59 + I++ F ID+ + MQS DAA L G + Sbjct: 115 LIVILLFALFAIAGLLIDIGMARLTQAHMQSVSDAASLEGGWQLAMGANQTTTRIAVVD 173 >gi|34540039|ref|NP_904518.1| von Willebrand factor type A domain-containing protein [Porphyromonas gingivalis W83] gi|34396350|gb|AAQ65417.1| von Willebrand factor type A domain protein [Porphyromonas gingivalis W83] Length = 1226 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 11/89 (12%), Positives = 28/89 (31%), Gaps = 4/89 (4%) Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--GQDLLRK 359 + +++ + + + ++ +N+G I+++ + L+ Sbjct: 336 DGVAGALVYEPRFPHPYYYYFPCN-AAINEAQFAKNSGYTIHTIGYDLGDFALANNSLKL 394 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQ 388 FF L +FD I I Sbjct: 395 TATDENHFFTAT-PANLAAAFDNIAQTIN 422 >gi|163848230|ref|YP_001636274.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526140|ref|YP_002570611.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669519|gb|ABY35885.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450019|gb|ACM54285.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 419 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 17/138 (12%), Positives = 48/138 (34%), Gaps = 16/138 (11%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + ++++ T PA+ +EL + + ++ +TDG+ Sbjct: 102 RTAILDLVHRIRDAGGTRIAPALEKGIQELQKMPQGV----------RRLVLLTDGQT-- 149 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 ++ L + G+ I ++ + L+ S G + E+++ Sbjct: 150 ----EHEKECLLRADDAGRLGIPITALGIGKDWNEDLLIEMANRSRGVADYIAQPGEIVQ 205 Query: 379 SFDKITDKIQEQSVRIAP 396 F + Q+ +++ + Sbjct: 206 YFQHTVQRAQQTTIQNSV 223 >gi|327270782|ref|XP_003220167.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 904 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 30/207 (14%), Positives = 69/207 (33%), Gaps = 35/207 (16%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 +I L ++ + I + + +N + Sbjct: 309 ISGSMNGFDRIYRLRQAGEQFLLQILETGSWAG-------IVVFNSQALTKTYLKQITGD 361 Query: 260 NEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + ++ L TN + ++ + S+ ++ +TDGE++ Sbjct: 362 SVRQTLSAYLPTAAGGGTNICSGIREGFQVFLKKYPSTEGCE--------IVLLTDGEDA 413 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN---DS 373 G S+ ++ +G I+++A+ P L D + G F+ DS Sbjct: 414 GVSSCF---------AEVQRSGSIIHTIALG--PSAAKELEMLADMTGGLKFSATDSLDS 462 Query: 374 RELLESFDKITD---KIQEQSVRIAPN 397 L+++F I+ I +QS+++ Sbjct: 463 NGLIDAFSGISSGSGDISQQSIQLESK 489 >gi|198274643|ref|ZP_03207175.1| hypothetical protein BACPLE_00795 [Bacteroides plebeius DSM 17135] gi|198272090|gb|EDY96359.1| hypothetical protein BACPLE_00795 [Bacteroides plebeius DSM 17135] Length = 339 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 19/168 (11%), Positives = 57/168 (33%), Gaps = 46/168 (27%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L ++P + T+ A+ A + + + + Sbjct: 142 TQLPITSDYVSAKMFLETISPSLITTQGTDIRGAIDLAMKSFTP----------NEGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 ++ ITDGEN ++ + GM+++ + V +P Sbjct: 192 AIVLITDGENHEG-------GAIEAAQEAAKKGMRVFVLGVGSPDGSPIPVEGTNEFRRD 244 Query: 353 ----------GQDLLRKCTDSSGQFF-AVNDSRELLESFDKITDKIQE 389 + + ++ + + V+++ ++ + +K+ + Sbjct: 245 KDGNVVVTRLNEQMCQEIAKAGNGIYVRVDNTNNAEKALNAEINKLAK 292 >gi|158319036|ref|YP_001511544.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158114441|gb|ABW16638.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 608 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 39/390 (10%), Positives = 107/390 (27%), Gaps = 29/390 (7%) Query: 10 FLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIK 69 +DL + + +A D +L+ + ++DP + + + + Sbjct: 235 LAAKGALLDLERS----SALVAASDTDLLTAVRAA----DLEDPAATRLTAFPLQESLVY 286 Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 ++ ++ + +D + ++ + LT Sbjct: 287 QYNRRVGIGAALPDGRGPELAAFYPRDGTELDEIRYTVLSRASDDPVKAEVARDFLRTLT 346 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL---PPPPK 186 + R + I+ + + + + + Sbjct: 347 SGPGRVALLGNGLRPPDGIADSFTARTGLTPRPRMTPERTLDATVLTALQGSFAGVHQRG 406 Query: 187 KSFWSKNTTKSKYAPAPAP-ANRKIDVLIESAGNLVNSI-QKAIQEKKNLSVRIGTIAYN 244 + +T+ S P ++ V +++A + + + + S R+ Sbjct: 407 NTLAVLDTSGSMNEEVPGSAGRSRLSVALDAAKSAIPLFAEDSDLGLWQFSTRLRGDQDW 466 Query: 245 IG----IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + V +N++ P +T Y A+R + Sbjct: 467 EELVPLGPMGERLGAGTRSQAVMDAVNRIEPRGDTGLYDTALAAFRYMNQHYVPGRPNQ- 525 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR-NAGMKIYSVAVSAPPEGQDLLRK 359 V+ +TDG+NS + + +++ ++ A + L R Sbjct: 526 -------VVLLTDGKNSDPGSIALDELVRILRREYSPQRPVQVITIGYGADTDLAALSRI 578 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + + + D F+ + D + E Sbjct: 579 SAATGAETYPALDP---NTIFEVLVDALTE 605 >gi|293346541|ref|XP_001058131.2| PREDICTED: alpha 1 type VII collagen-like [Rattus norvegicus] gi|293358308|ref|XP_230973.5| PREDICTED: collagen, type XX, alpha 1 [Rattus norvegicus] Length = 1320 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 44/140 (31%), Gaps = 18/140 (12%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 V++G Y+ + +V + ++ L+ NT T A+ H Sbjct: 205 THFAIGPDKVQVGLTQYSGDPQTEWDLNSFHTKEQVLAAVHHLHYKGGNTFTGLALTHVL 264 Query: 286 -RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + L K +I +TDG++ + +++ + +++ Sbjct: 265 GQNLKPAAGVRPEA------AKVLILVTDGKSQD--------DVRTAARVLKDQDIDVFT 310 Query: 345 VAVSAPPEGQDLLRKCTDSS 364 V V + L+ Sbjct: 311 VGV--KNVDEAELKLLASQP 328 >gi|198421589|ref|XP_002123523.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1306 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 50/128 (39%), Gaps = 13/128 (10%) Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + ++K+ +T T A+ E + S + ++ ++ +TDG+++ Sbjct: 277 FDAAVDKIKYHDRSTFTAYAIRKTVNEDFKGNMSRY-----PDSRRVMVLLTDGQST--- 328 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + + + G++ ++V V + L S + VND ELL Sbjct: 329 ---DKEDLSSAAAEAKQEGVETFAVGVGSKII-LSELVLIAGSPDKVITVNDFNELLGIV 384 Query: 381 DKITDKIQ 388 +++ IQ Sbjct: 385 NQLQGDIQ 392 >gi|188994155|ref|YP_001928407.1| hypothetical protein PGN_0291 [Porphyromonas gingivalis ATCC 33277] gi|188593835|dbj|BAG32810.1| conserved hypothetical protein [Porphyromonas gingivalis ATCC 33277] Length = 1228 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 11/89 (12%), Positives = 28/89 (31%), Gaps = 4/89 (4%) Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--GQDLLRK 359 + +++ + + + ++ +N+G I+++ + L+ Sbjct: 336 DGVAGALVYEPRFPHPYYYYFPCN-AAINEAQFAKNSGYTIHTIGYDLGDFALANNSLKL 394 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQ 388 FF L +FD I I Sbjct: 395 TATDENHFFTAT-PANLAAAFDNIAQTIN 422 >gi|149033988|gb|EDL88771.1| similar to Protein KIAA1510 precursor (predicted) [Rattus norvegicus] Length = 1320 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 19/140 (13%), Positives = 44/140 (31%), Gaps = 18/140 (12%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 V++G Y+ + +V + ++ L+ NT T A+ H Sbjct: 205 THFAIGPDKVQVGLTQYSGDPQTEWDLNSFHTKEQVLAAVHHLHYKGGNTFTGLALTHVL 264 Query: 286 -RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + L K +I +TDG++ + +++ + +++ Sbjct: 265 GQNLKPAAGVRPEA------AKVLILVTDGKSQD--------DVRTAARVLKDQDIDVFT 310 Query: 345 VAVSAPPEGQDLLRKCTDSS 364 V V + L+ Sbjct: 311 VGV--KNVDEAELKLLASQP 328 >gi|170743045|ref|YP_001771700.1| hypothetical protein M446_4937 [Methylobacterium sp. 4-46] gi|168197319|gb|ACA19266.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 440 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 10/106 (9%), Positives = 29/106 (27%), Gaps = 3/106 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + + A++ A I R ++Q A+D A L + + Sbjct: 19 IIGLSLPLLVAGSGAAVEYARIHKRRAELQKAVDVAALGAAGELSVAGSDVSVEAM---A 75 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAE 106 + + + + + I + + + Sbjct: 76 RRLAFDSARATDPGITRVSAAVVGRGTSVTVAINETVQSLFGRLLT 121 >gi|72180809|ref|XP_798930.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Strongylocentrotus purpuratus] gi|115975272|ref|XP_001180569.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 [Strongylocentrotus purpuratus] Length = 964 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/225 (13%), Positives = 70/225 (31%), Gaps = 16/225 (7%) Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 + S R+ + S + + DV + +Q ND + L K Sbjct: 291 EVHYSPRAREQLGVSPMGIMADYTVRYDVVHGNDAGDIQVLNDYFVQYFSPSGLSVLRKN 350 Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 + + + S K+ + ++ +++ + + +K N+ + + Sbjct: 351 IIFVIDISGSMSG-------TKLAQVKDALSTILDDMSET--DKFNILPFSDDVHFLEST 401 Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 T N+ K + L +NTN + A+ L E E + Sbjct: 402 GMLYSTK--ENVRRAKRFVMGLQEMDNTNLHKAIISGVNMLRAESEQDPQE---EEIVSM 456 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 +I +TDG + + + N ++ + A + Sbjct: 457 LIVLTDGNPNHGEI--DKTIIERNVHEAINGDFSLFCIGFGADAD 499 >gi|330447678|ref|ZP_08311326.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328491869|dbj|GAA05823.1| von Willebrand factor type A domain protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 257 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 32/248 (12%), Positives = 72/248 (29%), Gaps = 22/248 (8%) Query: 129 TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 + + + ++ + +S + DVS L+ + N T Sbjct: 12 LAVFFINKSMKDKPTSQQLVSAPSLPDVSTGYGFDALKNNWPTLNNTQTSSSDNWLAANY 71 Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 + + S RKI E+ +N I + ++ Sbjct: 72 LLIFDGSGSMDNTNCGNGQRKIVAAKEAMQTFINDIPQDANVG--------LYVFDNADS 123 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 + NN +K + + T ++ Y L + E + V Sbjct: 124 SLRVPLGINNRATLKQAIYDVKAGGTTPLKSSLTSGYTALEKQAEKQLGYG-----EYNV 178 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 + +TDG+ S + + I +N+ + I+++ T + Sbjct: 179 VIVTDGDASVGEEPE-----VAISRIYQNSPVTIHTIGFCIGNRHALNAEGIT----YYQ 229 Query: 369 AVNDSREL 376 + N+ +L Sbjct: 230 SANNPEKL 237 >gi|301618735|ref|XP_002938765.1| PREDICTED: hypothetical protein LOC100488728 [Xenopus (Silurana) tropicalis] Length = 672 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 62/181 (34%), Gaps = 17/181 (9%) Query: 214 IESAGNLVNSIQKAI-QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + N + I + ++ I ++ + Q + K +N + Sbjct: 76 KDFVLNFTDQISHLKLAKPWKTKTKMAIIQFSSSVRIEQSFNEWTGVENFKRIVNSMTYI 135 Query: 273 E-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T TY A+ +A K + + K I +TDG + + + + Q Sbjct: 136 GQGTYTYYAIMNATNIFKAHKSAGNV--------KVAILMTDGID-----HPKSPDARQA 182 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS--SGQFFAVNDSRELLESFDKITDKIQE 389 ++ R AG+ S+ +S + +L K + S + D L E +K+ Sbjct: 183 SDFARAAGINFISIGLSTQKANKTILFKISGQSLSEPVLILGDPNLLQEILEKLASIANT 242 Query: 390 Q 390 Q Sbjct: 243 Q 243 >gi|126306098|ref|XP_001362158.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1, partial [Monodelphis domestica] Length = 873 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 22/205 (10%), Positives = 70/205 (34%), Gaps = 36/205 (17%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 +++ L ++ + I + G + ++ + + Sbjct: 316 SMKVGNRLNRLRQALQFFLLQIIEKG-------SWTGIVTFDSSATIQSELMQIESDVQR 368 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 K+ +++L T+ + A+ + + + + +TDGE++ Sbjct: 369 KTLISRLPTVTVAGGGTHICSGLRTAFMVVKKKFLTD---------GSEMALLTDGEDNT 419 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND---SRE 375 + E ++ +G I+++ + P + L + + G D + Sbjct: 420 TNTCF---------EEVKQSGAIIHTIVLG-PSTEKGLEKLSEMTGGMKTTATDNVQNNG 469 Query: 376 LLESFDKITD---KIQEQSVRIAPN 397 L+++F ++ I ++S+++ Sbjct: 470 LIDAFSALSSGNAAITQRSIKLESK 494 >gi|115678877|ref|XP_794839.2| PREDICTED: similar to calcium activated chloride channel 1 precursor [Strongylocentrotus purpuratus] Length = 1031 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 21/166 (12%), Positives = 56/166 (33%), Gaps = 22/166 (13%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + +ID + +A VN + + ++ + T + Q + Sbjct: 317 TSGSMGTSNRIDKVNSAATAFVNLVD----DGISIGIVTFTGSPTTRHALTQI-NTQADR 371 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + ++ + +L T + L S + ++ +TDG+ Sbjct: 372 DSLRD-IFQLTASGGTCIGCGLEQGLEVLMAHPSGSAD-------GGIIVLMTDGQ---- 419 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 + + I + +++ G+++ +VA+ G+ L +G Sbjct: 420 ---DSGIQNHIIRQTLQDMGVRVNTVAIGEDAYGE--LSLIAQETG 460 >gi|224012789|ref|XP_002295047.1| predicted protein [Thalassiosira pseudonana CCMP1335] gi|220969486|gb|EED87827.1| predicted protein [Thalassiosira pseudonana CCMP1335] Length = 818 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 23/219 (10%), Positives = 73/219 (33%), Gaps = 18/219 (8%) Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE-----SAGNLVNSIQKAIQEKK 232 + S + + + + + + G++ + + Sbjct: 608 DLCIAVDMSGSVCNSGFFGNNCVGCSPFVFCQSLFVSQETCCANFGDVQQFARLMVYNLS 667 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNE 291 + + + + ++ ++ + L++L TN A++ L+ Sbjct: 668 QFGDKNTSFSVVSFASDAEILSGLSSADKTINVLDQLIYSGGSTNHGQAINACQESLFTS 727 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS--A 349 + + KKF++ ITDG ++ + + E +++G+ I + +S Sbjct: 728 DQ-------NINRKKFIMLITDGVSATDD-LNPEADAIDAAETAKSSGITIIPIFISPYN 779 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + + + G+ + V + L D++ +++ Sbjct: 780 DIDAVSFMSSL-SNDGEVY-VTNFDSLDSLKDQLVEQVS 816 >gi|184199785|ref|YP_001853992.1| hypothetical protein KRH_01390 [Kocuria rhizophila DC2201] gi|183580015|dbj|BAG28486.1| hypothetical protein [Kocuria rhizophila DC2201] Length = 455 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 41/352 (11%), Positives = 91/352 (25%), Gaps = 45/352 (12%) Query: 30 QSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKA 89 ++A + A+ ++ + + + AQ Sbjct: 141 RAATEQAMK---RRARTEGDAAPVQDAARRLQQVAEDPAPSTAPAADPSAPAYAAGAQHP 197 Query: 90 QINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAIS 149 + + T + + + + + + Sbjct: 198 AATVVTRGEWEDYRSRHADTTLVASTPGDRPAS--EAPTADDAALTQALGQWQHLAEPFH 255 Query: 150 ICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK 209 + +DVS SM L + + A P + Sbjct: 256 ALVAIDVSGSMGTKALPD------------------GSTRMDLTKAAATTAVGLFPEHDA 297 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + + + +K SV G Q LS ++ + Sbjct: 298 LGLWTFE--------RHLDGDKDYRSVTPVRELSASVDGGTQRDQLSQDVQSLT-----F 344 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG-ENSGASAYQNTLNT 328 +P T Y AYR++ ++ H + VI ++DG + S + L + Sbjct: 345 SPDGYTGLYDTTLAAYRQVLHDDAPGHL--------RTVIVLSDGMNHDPDSIALDELLS 396 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 E ++I +V VS + L + + G +++ + F Sbjct: 397 TLKAEQDAENPVRIITVGVSKDADATVLRQIAEATGGSSHVARTPQDIQKVF 448 >gi|198426626|ref|XP_002122822.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1823 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 50/170 (29%), Gaps = 22/170 (12%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSN---NLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 + + R+ YN + N + N +++ T T A+ HA Sbjct: 684 DLGEETGRMAVFRYNKRVDTNTQILFKDHLKNRTSFFKDYDRIPYNGGGTLTGQALRHAK 743 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + V+ ITDG I +R G Y + Sbjct: 744 NVIL-----AEENGNRPSVVDVVLTITDGRAQDDVG--------NISRELRANGALTYVI 790 Query: 346 AVSAPPE---GQDLLRKCTDSSGQFFAVND-SRELLESF-DKITDKIQEQ 390 V + ++ L + F V L + F +K++ I Sbjct: 791 GVQPGNKKKLDENELLEIAGRKENMFLVTSGFAALDQQFAEKLSQSICTS 840 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/120 (18%), Positives = 40/120 (33%), Gaps = 18/120 (15%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T A+ HA +++E+ + V+ ITDG + + Sbjct: 989 GTLTGQALRHAKDVMFSEENG-----NRPGVVDVVLTITDGRSQDH--------VRNVSR 1035 Query: 334 YMRNAGMKIYSVAVSAPPE---GQDLLRKCTDSSGQFFAVN-DSRELLESFD-KITDKIQ 388 +R G+ Y + + ++ L S F V L + F K++ I Sbjct: 1036 ALRANGVLTYVIGIQPGNNKKLDKNELLDIAGSRENMFLVTGGFSALDQQFAYKLSRSIC 1095 >gi|148360718|ref|YP_001251925.1| type IV fimbrial biogenesis PilY1-like protein [Legionella pneumophila str. Corby] gi|296106216|ref|YP_003617916.1| type IV pilus assembly protein PilY1 [Legionella pneumophila 2300/99 Alcoy] gi|148282491|gb|ABQ56579.1| type IV fimbrial biogenesis PilY1-related protein [Legionella pneumophila str. Corby] gi|295648117|gb|ADG23964.1| type IV pilus assembly protein PilY1 [Legionella pneumophila 2300/99 Alcoy] Length = 1169 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 45/418 (10%), Positives = 102/418 (24%), Gaps = 64/418 (15%) Query: 34 DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI 93 DA L+ + + + ++ + K I+ ++ + + Sbjct: 113 DANGLAPYTVVQGGNKVDNSASRLNVAKAGIKAIIENYMPTTDFALGTYSTSNISSYNTW 172 Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 + P + SA + +S + I L +S Sbjct: 173 VYYMSPPGSDFVFTNTPVAGNRYVTNPCYNYGSASSTVSSNCSSIGSLYGTTL-VSSSQY 231 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 L + S +D + + + Sbjct: 232 LQIGDSSDDPDINDVLY-AGSGFPGIFVSYNGPTPSSPFPPNYTISNYNQGNIRISYANT 290 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP----------LSNNLNEVK 263 S GN +S A + V + + T + V Sbjct: 291 RPSIGNFSSSPTNAGFVPFSQQVMYVQRGFGYYSNQSYATGNMLVNMQTAGTNPTTTSVN 350 Query: 264 SRLNKLNP-----YENTNTY--------PAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + +N P +T T + + ++ T G+ K+++I Sbjct: 351 NAINAFLPHLKPETNSTATTEIKAAAVQSPLAGLLTRSRSFMKTVGTTSGNCPQKQYIIL 410 Query: 311 ITDGENSGASAYQNT-------------------------------LNTLQICEYMRNAG 339 I+DG + + + + + ++N G Sbjct: 411 ISDGLPTQDLQSRYWPPLGSAAATGYGVTATFNADGSLNSTNSQALSDAINEIKALKNDG 470 Query: 340 MKIYSVAVSAPPE------GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQE 389 + + + + A + LR + G ++ L+ S + I IQ Sbjct: 471 VLTFIIGMGAGVDPAVNPEAAATLRAMAVAGGTENYYPATSPEALVSSLNSILSNIQN 528 >gi|118591412|ref|ZP_01548810.1| von Willebrand factor type A domain protein [Stappia aggregata IAM 12614] gi|118436084|gb|EAV42727.1| von Willebrand factor type A domain protein [Stappia aggregata IAM 12614] Length = 657 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 21/191 (10%), Positives = 50/191 (26%), Gaps = 23/191 (12%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 S K ++ + V + + + L Sbjct: 32 SSNSMWGQIDGTAKAEIARSAFEGFVAGLPDGTRAGVMAYGHRRKADCGDVETLVPVSDL 91 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + ++ + L P T + A L +I I+DG Sbjct: 92 --DRAKLVESVKALTPRGKTPITETLRQAAELLAQNDRP-----------GRLILISDGI 138 Query: 316 NSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPP-EGQDLLRKCT-DSSGQFFAVN 371 + + + E + ++G+ K + + Q + + G ++ Sbjct: 139 ETCGG------DPCALAEALASSGVDFKAHVIGFDIASKADQAKIACIAHLTGGTYWNAR 192 Query: 372 DSRELLESFDK 382 D+ L E+ + Sbjct: 193 DADGLNEALKE 203 >gi|123475372|ref|XP_001320864.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121903678|gb|EAY08641.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 665 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 40/358 (11%), Positives = 93/358 (25%), Gaps = 47/358 (13%) Query: 44 IVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY 103 I + ++ + + K +L + I + Y Sbjct: 65 IGDKQIRPQLRKSEEAAKEYQESKEKGYLSLIGSDSSGNEIYFFFGNLPIGTR--IEISY 122 Query: 104 IAESKAQYEIPTENLFLKGLIPSA-----LTNLSLRSTGIIERSSENLAISICMVLDVSR 158 A+ S+ I+ I ++ Sbjct: 123 TISYLAEINNKGYFFRFPIASKDQIDYNTTLPNSISFKINIKTDKNISEIEGNETAIINL 182 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + ++ + L+ K + S + + A Sbjct: 183 IDNHNAIVNLDEFEPAIFVQTLISDQDKSTAVSSDDYIAVSAYKEFSTKSNGYECKADYF 242 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIA------------YNIGIVGNQCTPLS-NNLNEVK-- 263 +++ ++ +V+ + + G Q P++ N V Sbjct: 243 FVIDCSGSMEGKRIQKAVKCLHLMLQSLPMKCRFSIFCFGTEFKQIMPIAEYNNENVLLA 302 Query: 264 -SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + + TN Y + + + K + +TDGE + Sbjct: 303 MNMIKNIQADMYGTNIYDPLKCIF--------------SLEGMTKKIFLLTDGEVN---- 344 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 N L + E + G IY+V + + Q L+R + + G++ V D+ + Sbjct: 345 --NAEEILNLAEENKKFG-NIYTVGIGSGA-DQCLIRDLAEITDGKWTYVLDNENFDQ 398 >gi|297559546|ref|YP_003678520.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296843994|gb|ADH66014.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 587 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 26/206 (12%), Positives = 63/206 (30%), Gaps = 18/206 (8%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 + + + S A P ++ V +A + + + + Sbjct: 373 DSRVLAIVDISGSMLAEVPGTGMTRMQVTSAAATQGLEMFTPSSELGLWEFSTNVNNELH 432 Query: 245 I------GIVGNQCTPLSNNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNEKESSH 296 + + + + + L L P +T Y AY+E+ + Sbjct: 433 YQEIAPIRELQAAADDGTAHRDVLAGALASLQPLPQGDTALYETYLAAYQEMSRTYQPDR 492 Query: 297 NTIGSTRLKKFVIFITDGEN-SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 ++ +TDG+N + + L + + + I ++A + + Sbjct: 493 T--------NVILMLTDGDNDNPGGLGLDELMSQIESLASPSRPIPIITIAFGPDVQNLE 544 Query: 356 LLRKC-TDSSGQFFAVNDSRELLESF 380 L++ + G + D E+ E F Sbjct: 545 PLQEIAAATGGAAYMTEDPTEIGEIF 570 >gi|218458154|ref|ZP_03498245.1| hypothetical protein RetlK5_01327 [Rhizobium etli Kim 5] Length = 156 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 21/147 (14%), Positives = 46/147 (31%), Gaps = 30/147 (20%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTI-GSTRLKKFVIFITDGENSGASAYQNTL---- 326 + + A + + E + + +K V F+ DG T Sbjct: 7 NYYNDQQTSFDEALKGIEGEITGNIGKGTSNADRQKIVFFVADGVADSYKPSGCTSPKGA 66 Query: 327 --------NTLQICEYMRNAGMKI---YSVAVSAPPEG-------------QDLLRKCTD 362 C+ +++ G+K+ Y+ + P G + +C Sbjct: 67 NGGRCIEPIDTTYCKKLKDRGIKVAVLYTTYLPLPDNGFYKDWVKPFETKIAAKMEECA- 125 Query: 363 SSGQFFAVNDSRELLESFDKITDKIQE 389 + G +FAV+ + + + + KI Sbjct: 126 TPGFYFAVSPTEGIEAAMKALFLKIVS 152 >gi|54302287|ref|YP_132280.1| hypothetical protein PBPRB0607 [Photobacterium profundum SS9] gi|46915709|emb|CAG22480.1| hypothetical protein PBPRB0607 [Photobacterium profundum SS9] Length = 436 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 11/50 (22%), Positives = 26/50 (52%), Gaps = 1/50 (2%) Query: 1 MTA-IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRT 49 + A + + V A+D+ +++ + ++Q+ +D+A LS +I S Sbjct: 20 IFATLAMVVLIGAGALALDVGNLILSKGKLQNLVDSAALSAAKAIDSGSD 69 >gi|330995093|ref|ZP_08319010.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] gi|329576669|gb|EGG58172.1| von Willebrand factor type A domain protein [Paraprevotella xylaniphila YIT 11841] Length = 340 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 24/139 (17%), Positives = 50/139 (35%), Gaps = 22/139 (15%) Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P++++ K L +NP T + A + + K + Sbjct: 142 TQLPITSDYVSAKMFLETINPSMITTQGTDIKQAIDLAMKSFTP------NQDVSKAIFV 195 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL----RKCTDSSGQ 366 ITDGE++ +++ + G+K+Y + V +P + + TD++G Sbjct: 196 ITDGEDNEG-------GVVEMAKAAAEKGIKVYVLGVGSPQGAPIPMPGSSQYITDNTGN 248 Query: 367 FFAVNDSRELLESF-DKIT 384 +L E+ +I Sbjct: 249 VV----VSKLNEAMCREIA 263 >gi|118346119|ref|XP_977009.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89288305|gb|EAR86293.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 685 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 21/131 (16%), Positives = 48/131 (36%), Gaps = 12/131 (9%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR---LKKFVIFITDG 314 NL +K ++N + +TN A+ A+ + +++ + T+ ++ +TDG Sbjct: 147 NLESIKGQINNIISTGDTNIIQALEVAHNIIKQDQQLENQKEEQTKKRIVRYSAFLLTDG 206 Query: 315 ENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 +++ + E +N I + LL T + G+F+ + Sbjct: 207 QDNM-----KEKAIFKFRENFKNKDMDYSINCLGFGIDH-DPLLLGAITSYTGGKFYYIK 260 Query: 372 DSRELLESFDK 382 + F Sbjct: 261 PEESVFSVFQD 271 >gi|37523026|ref|NP_926403.1| hypothetical protein gll3457 [Gloeobacter violaceus PCC 7421] gi|35214029|dbj|BAC91398.1| gll3457 [Gloeobacter violaceus PCC 7421] Length = 596 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 43/351 (12%), Positives = 95/351 (27%), Gaps = 33/351 (9%) Query: 32 ALDAAVLS----GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ALDAA ++ + S T T+ + + + + E + Sbjct: 246 ALDAADVARYQDAVKQLQSKVTR-YGTSTDSLAQAMARN--GPYWASVGSVYEAS---VI 299 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLA 147 A ++ D+ ++ + + P + S+ LA Sbjct: 300 AANGDLGSDEPRFEAVYPKATFTSNMRAILPSGPWVSPQEQAAAEQILAYLRTPESQKLA 359 Query: 148 ISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT--KSKYAPAPAP 205 + V + L P P+ ++ +K Sbjct: 360 AEHGLRPGVPGVPLGEKFTAQYGVDPNARYDSLRSPKPEVVAAMLDSWRSYTKKPSLVVL 419 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKS 264 + V +A + I I ++ I S + Sbjct: 420 VIDSSGSMKGDKLPAVQQTLQAYIDGLGPKETIALIDFDSDIRDPMLADASPAGRERAER 479 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + L T Y A +A L + + V+ +TDG++ G++ N Sbjct: 480 FIAGLEAEGGTRLYDAALYARDWLVKHRRAGAI--------NAVVVLTDGKDDGSTIDLN 531 Query: 325 TLNTLQICEYMRNAG------MKIYSVAVSAPP-EGQDLLRKCTDSSGQFF 368 L ++ +G + ++V + L+ + +G ++ Sbjct: 532 RLGA-----ELQKSGFSSDERVAFFTVGYGGEGQFNPEALKAIAELNGGYY 577 >gi|260800521|ref|XP_002595177.1| hypothetical protein BRAFLDRAFT_240912 [Branchiostoma floridae] gi|260800523|ref|XP_002595178.1| hypothetical protein BRAFLDRAFT_240996 [Branchiostoma floridae] gi|229280421|gb|EEN51189.1| hypothetical protein BRAFLDRAFT_240912 [Branchiostoma floridae] gi|229280422|gb|EEN51190.1| hypothetical protein BRAFLDRAFT_240996 [Branchiostoma floridae] Length = 153 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 43/136 (31%), Gaps = 15/136 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + +VR+ + Y +L E + ++ + T T A+ A Sbjct: 32 DIGENAVRVSIVQYAAQARTEFFLDQYYDLQEAQDAVDGIEYMGGYTLTGKAIDFATNLH 91 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ ++ + + K + ITDG + MR AG+ +V V Sbjct: 92 FDLRKGARA-----DVTKIAVVITDGRSYDDVNRP--------ARRMRQAGIVTIAVGVG 138 Query: 349 APPEGQDLLRKCTDSS 364 +D L Sbjct: 139 -NNLDRDQLTAIAGDP 153 >gi|29349873|ref|NP_813376.1| putative outer membrane protein [Bacteroides thetaiotaomicron VPI-5482] gi|29341784|gb|AAO79570.1| conserved hypothetical protein, putative outer membrane protein [Bacteroides thetaiotaomicron VPI-5482] Length = 621 Score = 51.4 bits (121), Expect = 2e-04, Method: Composition-based stats. Identities = 26/290 (8%), Positives = 77/290 (26%), Gaps = 28/290 (9%) Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 + + + ++ A Y + L ++ Sbjct: 159 QVGESPLSTFSIDVDA-ASYSNMRRMINSGTLPVPDAIRTEELVNYFSYDYAKPTGSDPV 217 Query: 152 MVLDVSRSMEDLYLQKHNDN-NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 + + + P F ++ Sbjct: 218 KITMEAGVCPWNADHRLVRIGLKAREIPTDKLPESNLVFLID-------VSGSMWGPTRL 270 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 D++ S LVN++++ ++ + Y + ++ +++ +++L Sbjct: 271 DLVKSSLKLLVNNLREKD--------KVAIVVYAGNASVKLESTPGSDKQKIRDAIDELT 322 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 +T + AY+ N +I +DG+ + + L Q Sbjct: 323 SGGSTAGGAGIQLAYKVAKQNFLPKGNNR--------IILCSDGDFNVGVSSVEGL--EQ 372 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLES 379 + E R +G+ + + + + + G +++ +E Sbjct: 373 LIEKERKSGVFLSVLGYGMGNYKDNKGQALAEKGNGNHAYIDNLQEANRV 422 >gi|302036308|ref|YP_003796630.1| hypothetical protein NIDE0941 [Candidatus Nitrospira defluvii] gi|300604372|emb|CBK40704.1| conserved exported protein of unknown function, contains von Willebrand factor, type A and vault protein inter-alpha-trypsin domain [Candidatus Nitrospira defluvii] Length = 712 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 55/162 (33%), Gaps = 27/162 (16%) Query: 237 RIGTIAYNIGIVGNQCTP---LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I +N + P + ++ + L T PA+ A + + Sbjct: 391 RFNIIQFNHTVRSLFPIPQPVTTKSMQQAIRYTEHLAADGGTEILPALRQALKSPQDSAR 450 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM---KIYSVAVSAP 350 + +I ITDG+ N ++ E + + +++++ + + Sbjct: 451 -----------LQQIILITDGQVG---------NEEELFELLHQR-VGSRRLFTIGIGST 489 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 P + + G F + + E+ + D + K++ + Sbjct: 490 PNSHLMRKAAETGRGTFTYIGNVNEVKDKLDGLFRKLEHPVL 531 >gi|114683021|ref|XP_001148451.1| PREDICTED: collagen alpha-1(XX) chain [Pan troglodytes] Length = 1284 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 44/139 (31%), Gaps = 16/139 (11%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 A E V++G Y+ + +V + + +L NT T A+ H Sbjct: 207 APFEIGPDKVQVGLTQYSGDAQTEWDLNSLSTKEQVLAAVRRLRYKGGNTFTGLALTHVL 266 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + K VI +TDG++ +++ G+ +++V Sbjct: 267 -----GQNLQPVAGLRPEAAKVVILVTDGKSQDDVH--------TAARVLKDLGVNVFAV 313 Query: 346 AVSAPPEGQDLLRKCTDSS 364 V + LR Sbjct: 314 GV--KNADEAELRLLASPP 330 >gi|47228041|emb|CAF97670.1| unnamed protein product [Tetraodon nigroviridis] Length = 1009 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 19/112 (16%), Positives = 44/112 (39%), Gaps = 13/112 (11%) Query: 259 LNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + ++ + T A+ + +E+ + +F + ITDG + Sbjct: 120 KAQFINGISGIRYLGKGTYIDCALTNMTQEMTQSPSP-------FKPLRFAVVITDGHVT 172 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 G ++ E R+AG++I++VA S + +R+ +S + Sbjct: 173 GNPCGGIKVS----AERARDAGIRIFAVAASRNI-DETGMREIANSPAMVYR 219 >gi|256823631|ref|YP_003147594.1| Vault protein inter-alpha-trypsin domain-containing protein [Kangiella koreensis DSM 16069] gi|256797170|gb|ACV27826.1| Vault protein inter-alpha-trypsin domain protein [Kangiella koreensis DSM 16069] Length = 689 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 54/169 (31%), Gaps = 19/169 (11%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + I + +NL K + L T A++ Sbjct: 355 YALSQLSINDTFNIIDFDNDANKLFDEAVPATLSNLEMAKYFVATLEADGGTEIAKAINL 414 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A + L + V+F+TDG + N Q+ E +++ Sbjct: 415 ALD------------KPDSSLLRQVVFLTDG------SIGNERQIFQMIENQLGNN-RLF 455 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 ++ + A P + + G F + + E+ +++ K++ ++ Sbjct: 456 TIGIGAAPNSYFMSKAANYGRGTFTYIGKASEVQTKLEQLFKKLRYPAL 504 >gi|189518186|ref|XP_001331201.2| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5 [Danio rerio] Length = 969 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 60/187 (32%), Gaps = 14/187 (7%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ ++ ++N ++ + N I TP+S + + K + Sbjct: 330 TKMKQTKQALFTIINELR--PNDNFNFVTFSNRIRVWQPGKLVPVTPIS--IRDAKKFIY 385 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ T+ + L + S +IF+TDG + T+ Sbjct: 386 MISVTGGTDINGGIQTGSALLSDYL--SSKDESHHHSVSLIIFLTDGRPTVGVLQSPTII 443 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAV-NDSRE---LLESFDK 382 + + ++++ + LL + + + G + D+ L +D+ Sbjct: 444 SNT--KTAVQEKFCLFTIGMG-DDVDYRLLERMSLDNCGTMRRIPEDADASLMLKGFYDE 500 Query: 383 ITDKIQE 389 I + Sbjct: 501 IGTPLLS 507 >gi|296128023|ref|YP_003635273.1| von Willebrand factor type A [Cellulomonas flavigena DSM 20109] gi|296019838|gb|ADG73074.1| von Willebrand factor type A [Cellulomonas flavigena DSM 20109] Length = 500 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 27/174 (15%), Positives = 57/174 (32%), Gaps = 19/174 (10%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K++ + LV+S+++ R+ + Y+ V + +++ Sbjct: 176 KMETTKYALRTLVSSLRRTD--------RVAMVCYSTEADVYLEPTPVAEREGVLAAIDR 227 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L P ++TN + Y + + T V+ ++DG + + Sbjct: 228 LAPRDSTNAAAGLALGYDLAMSMRTEGRLTR--------VVLVSDGVANVGETDPEGILA 279 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF-AVNDSRELLESFD 381 I + G+ + SV V LL + D + V+ E F Sbjct: 280 R-ISSQAKA-GISLISVGVGITTYNDHLLEQLADQGDGWHVYVDGEAEAERVFA 331 >gi|299116460|emb|CBN76178.1| similar to integrin alpha Hr1 precursor-like [Ectocarpus siliculosus] Length = 353 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 27/208 (12%), Positives = 54/208 (25%), Gaps = 31/208 (14%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + S + + + ++ + + NL G+ + Sbjct: 57 ANTVNVAVIIDSSGSVDDDEWDMSMAFAKDAVSSFADQ---------NLFTNGGSASIAQ 107 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 +L + + ++ T+ + L S Sbjct: 108 FSSSASEGGTFYSLEDFNAFVDGNTKYSSGGTDIIDGIAKGRELL----------KASPA 157 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 F+I TDG++S + R+ G +Y+V V P + LL Sbjct: 158 TTSFMIVTTDGQSSS---------PKAEADAARDEGTIVYAVGVGTGPTQEILL-DIGGE 207 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQS 391 F V+ EL + I Sbjct: 208 EANVFDVDGFDELDVALAGILSASGSSV 235 >gi|224531962|ref|ZP_03672594.1| von Willebrand factor type A domain protein [Borrelia valaisiana VS116] gi|224511427|gb|EEF81833.1| von Willebrand factor type A domain protein [Borrelia valaisiana VS116] Length = 333 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 61/172 (35%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + + +L+ ++ + + A L S K+ + Sbjct: 146 VVPITTDRDFFNKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEAPKRSI 195 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------------------- 349 I +TDG + Y++ Q+ + +KIYS+ + + Sbjct: 196 IVLTDGVVNSDEIYKD-----QVINLAQGLNVKIYSIGIGSFEEFGVEFKLRSGKFYQGS 250 Query: 350 --PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + ++ + G F++VND + + K E+ ++IA + Sbjct: 251 LKEVYDPSMLVEISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAVD 302 >gi|170743966|ref|YP_001772621.1| hypothetical protein M446_5903 [Methylobacterium sp. 4-46] gi|168198240|gb|ACA20187.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 417 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 31/291 (10%), Positives = 83/291 (28%), Gaps = 16/291 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + ++I+V F AID ++ Y + ++Q DAA L ++ + ++ Sbjct: 21 LFGMLIAV--GFAAVAIDSGNLYYSKLKLQKIADAAALGAVMALPTSSSVMAA------A 72 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEI-PTENLF 119 + K S + + + T ++ + Sbjct: 73 LDLVSKNTPVGFGTVSTSADIQIGVYDPSSKTFTPSAIGQNAVQVTTRRSSAYGNAVLTY 132 Query: 120 LKGLIPSALTNLSLRSTGII---ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + G++ + +++ S + S + S ++++ T+ Sbjct: 133 VAGILGVSSVDMAASSVAVKYGGACVMVLEPASAGSLQTKGSSALQTNCPIQVNSSSATA 192 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE--SAGNLVNSIQKAIQEKKNL 234 + S S +P P ++ + + + Sbjct: 193 ARTGGSSSITASQICVVGNYSGTGFSPWPKINCPSLVDPLANVPEPAQPVCTVNNPSISS 252 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 V Y+ + + L + L KS ++ +T+ + + Sbjct: 253 GVFPTNCTYSGTVSLSGNVTLQSGLYYFKSA--NISVTGSTSITGSGLTIF 301 >gi|46190338|ref|ZP_00121620.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Bifidobacterium longum DJO10A] gi|189440236|ref|YP_001955317.1| hypothetical protein BLD_1374 [Bifidobacterium longum DJO10A] gi|189428671|gb|ACD98819.1| Hypothetical protein BLD_1374 [Bifidobacterium longum DJO10A] Length = 380 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 39/348 (11%), Positives = 101/348 (29%), Gaps = 30/348 (8%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A LS + + DQ++ L+ + ++ D + Sbjct: 57 ATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGSLEIMNALKAGGQDHDAVWPASSMW 116 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 ++I + A + A + +++ ++ + + Sbjct: 117 ISMGDTKHIVKDAASTSTTPI------VFGIAKSKAVKLGWADDTGATKPVSTADILAAV 170 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + ++ + + L P + W + + S + ++ ++ Sbjct: 171 SDGKLTFSMTSATVIDSALNVYQTALR-KPSWTIWVVDYSGSMSGEGKNGVVKGLNAALD 229 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + +K+ E + V I + T S +++ + + T Sbjct: 230 -----PDQAKKSYIEPASGDVNILIPFETEAHRPVKATGTS--TSDLLHEADATDASGGT 282 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + Y + A EL +E E+S ++ +TDG ++ + Sbjct: 283 DIYEGLLSALDELPSESEASQ-------YTTAIVLMTDGRSNSDHQDEFESAYKS----- 330 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 R + I+S+ L+ S+ + F +L F + Sbjct: 331 RGRDLPIFSIMFG--DADPSQLKSLATLSNAKVFDGRS-GDLAAVFRQ 375 >gi|297380203|gb|ADI35090.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori v225d] Length = 217 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 70/185 (37%), Gaps = 16/185 (8%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +I+VL ++ ++++ + KK L ++ I + G G +++ + Sbjct: 32 GNGTRIEVLNLCIQKMIETLKQ--EAKKELFSKMAIITF--GENGAVLHTPFDDIKNINF 87 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + L+ T A A + ++ +T + K + I ++DGE + + Sbjct: 88 K--PLSASGGTPLDQAFRLAKDLIEDK-----DTFPTKFYKPYSILVSDGEPNDDKWQKA 140 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 + ++ +S+ + + + ++ F +D +L+ F+ +T Sbjct: 141 LSDFHHYGRSAKSVC---WSIFIGNRNDNPQVNKE--FGKDGVFYADDVEKLVGLFEIMT 195 Query: 385 DKIQE 389 I + Sbjct: 196 QTISK 200 >gi|259490072|ref|NP_001159273.1| hypothetical protein LOC100304363 [Zea mays] gi|223943141|gb|ACN25654.1| unknown [Zea mays] Length = 459 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 21/170 (12%), Positives = 51/170 (30%), Gaps = 26/170 (15%) Query: 237 RIGTIAYNIGIVGNQCTP--LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 R+ I + L + + + + + L +TN + L + Sbjct: 82 RLAIITFESKAHKVFDLSSMLPDQVKKAVAVVEGLKAGGDTNIKAGLEAGLDVLKTRRGH 141 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 SHN + ++DG + + + + + + + Sbjct: 142 SHNA-------SCIFLMSDGHENVDK-------ARTLLDRVGEHSVV--TFGFG-EKSDE 184 Query: 355 DLLRKCT--DSSGQFFAV---NDSRELLESFD--KITDKIQEQSVRIAPN 397 LL +G + V D +L+++F I I +++ + Sbjct: 185 QLLYDIAYHSHAGTYHHVREKEDENQLMKAFAFLAIYRSISMLDLKVTVS 234 >gi|260800519|ref|XP_002595176.1| hypothetical protein BRAFLDRAFT_140700 [Branchiostoma floridae] gi|229280420|gb|EEN51188.1| hypothetical protein BRAFLDRAFT_140700 [Branchiostoma floridae] Length = 153 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 22/136 (16%), Positives = 43/136 (31%), Gaps = 15/136 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + +VR+ + Y +L E + ++ + T T A+ A Sbjct: 32 DIGENAVRVSIVQYAAQARTEFFLDQYYDLQEAQDAVDGIEYMGGYTLTGKAIDFATNLH 91 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ ++ + + K + ITDG + MR AG+ +V V Sbjct: 92 FDLRKGARA-----DVTKIAVVITDGRSYDDVNRP--------ARRMRQAGIVTIAVGVG 138 Query: 349 APPEGQDLLRKCTDSS 364 +D L Sbjct: 139 -NNLDRDQLTAIAGDP 153 >gi|46205147|ref|ZP_00048963.2| COG2304: Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Magnetospirillum magnetotacticum MS-1] Length = 125 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 16/91 (17%), Positives = 35/91 (38%), Gaps = 17/91 (18%) Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA-----------PPEGQD 355 V+ ++DG N+ + + R G+++Y++A+ + Sbjct: 1 MVVLLSDGANNAGQT-----APKDVAQLARELGVRLYTIALGPIDMADNPDNEQDVVDVE 55 Query: 356 LLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 LR + S G+ F V + +L + I + Sbjct: 56 TLRAMAEVSGGKAFRVKTTDDLENVANAIDE 86 >gi|258651507|ref|YP_003200663.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] gi|258554732|gb|ACV77674.1| von Willebrand factor type A [Nakamurella multipartita DSM 44233] Length = 593 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 29/189 (15%), Positives = 58/189 (30%), Gaps = 18/189 (9%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL----- 255 +ID+ ++A + + A + + + L Sbjct: 401 VPGTNGADRIDLAKDAAARGLGLYR-ADSDIGLWEFSTRLSPTSDHRELIPISSLGPDGQ 459 Query: 256 -SNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S + + LN L P T Y + A R + + V+ +T Sbjct: 460 GSTGAARLAAALNGLQAIPDGGTGLYDTVLDATRTVRAGYDPDRV--------NVVLLLT 511 Query: 313 DGENSGASAYQ-NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 DG N ++ + L + E + + S+A + L + + G + Sbjct: 512 DGMNDDVNSITMDQLLSTLAAEQDPARPVPVISIAFGPDSDVAALQQISRATGGATYLSQ 571 Query: 372 DSRELLESF 380 D R++ E F Sbjct: 572 DPRQIGEIF 580 >gi|41386751|ref|NP_958822.1| calcium-activated chloride channel regulator 4 [Rattus norvegicus] gi|37703077|gb|AAR01113.1| parturition-related protein PRP3 [Rattus norvegicus] Length = 923 Score = 51.4 bits (121), Expect = 3e-04, Method: Composition-based stats. Identities = 24/206 (11%), Positives = 71/206 (34%), Gaps = 33/206 (16%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + +++ + ++A + I ++ G + ++ N+ Sbjct: 314 VSGSMGSYDRLNRMNQAAKFFLQQILES-------RSWAGMVHFHSSATVKSELIQINSD 366 Query: 260 NEVKSRLNKL--NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 E L L + T+ + A++ N G ++ ++DGE+S Sbjct: 367 VERNQLLETLPTSASGGTSICSGIRTAFQVFKN--------KGYQTGGNDILLLSDGEDS 418 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE-- 375 + ++++G ++ +A+ Q + + G+ D + Sbjct: 419 T---------AKDCLDEVKDSGAVVHFIALG-KAFDQSISNMANVTGGKQLFATDEAQNN 468 Query: 376 -LLESFDKITDK---IQEQSVRIAPN 397 L+++F + + + E+S+++ Sbjct: 469 GLIDAFGALASENADVTEKSLQLESK 494 >gi|295093780|emb|CBK82871.1| von Willebrand factor type A domain. [Coprococcus sp. ART55/1] Length = 549 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 51/163 (31%), Gaps = 13/163 (7%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN-LNEVKSRLNKLNPYENTN 276 L NS+ Q + + + V N + + +N L T Sbjct: 391 NQLKNSLTNGAQYINDNNYVGLVSYSSSVTVEVPIAQFDLNQRSYFQGSVNNLMASGGTA 450 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +Y A+ A + + + K + + ++DG + + +I +R Sbjct: 451 SYDAVVVAMKMITDAKAEHPDAK------CMLFLLSDGYANVGYSMD------EITSALR 498 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + +Y++ + +L + + + + + Sbjct: 499 QSNIPVYTIGYGGDADTDELTKLSGINEAASINADSDDIIYKI 541 >gi|196230799|ref|ZP_03129660.1| autotransporter-associated beta strand repeat protein [Chthoniobacter flavus Ellin428] gi|196225140|gb|EDY19649.1| autotransporter-associated beta strand repeat protein [Chthoniobacter flavus Ellin428] Length = 1545 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 47/351 (13%), Positives = 104/351 (29%), Gaps = 27/351 (7%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP 100 +++ T + + + +K K+ + A Q + N Sbjct: 1054 ASALYIKSTNRLVIRNTQDQLDLVDRIVKADAKEREDKAKETVPTAPIPQPEVQTSANAF 1113 Query: 101 LQYIAESK-AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + +++ +L + A R E + + + Sbjct: 1114 STFSLNVSDVSFKLAAASLEQGHMPDPASVRSEEFINAFDYRDPEPSPGAPLAFV----T 1169 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 Y N + + K N +++++ E+ Sbjct: 1170 ERARYPFAQNRDLLRFAVKTAAAGRQPGR--PLNIVLLLDRSGSMERADRVNIVREALSV 1227 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 L +Q Q+K ++ T V + +++V +R+N++ P TN Sbjct: 1228 LAKHLQ--PQDKLSIVTFARTPHLWADAVAG------DKVHDVIARVNEITPEGGTNLEA 1279 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A+ AY ++ VI TDG + + L + R G Sbjct: 1280 ALDLAYETAHHHFAVDSTNR--------VILFTDGAANLGDVNPDALTKKVEAQ--RKQG 1329 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDS-SGQFFAVNDSRELLESFD-KITDKIQ 388 + + + DLL + T + G++ +N + +F +I +Q Sbjct: 1330 IALDCFGIGWEGYNDDLLEQLTRNADGRYGFINTPEDAAANFATQIAGALQ 1380 >gi|123228966|emb|CAI21016.2| novel protein similar to vertebrate inter-alpha (globulin) inhibitor H5 (ITIH5) [Danio rerio] Length = 906 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 60/187 (32%), Gaps = 14/187 (7%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ ++ ++N ++ + N I TP+S + + K + Sbjct: 267 TKMKQTKQALFTIINELR--PNDNFNFVTFSNRIRVWQPGKLVPVTPIS--IRDAKKFIY 322 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 ++ T+ + L + S +IF+TDG + T+ Sbjct: 323 MISVTGGTDINGGIQTGSALLSDYL--SSKDESHHHSVSLIIFLTDGRPTVGVLQSPTII 380 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAV-NDSRE---LLESFDK 382 + + ++++ + LL + + + G + D+ L +D+ Sbjct: 381 SNT--KTAVQEKFCLFTIGMG-DDVDYRLLERMSLDNCGTMRRIPEDADASLMLKGFYDE 437 Query: 383 ITDKIQE 389 I + Sbjct: 438 IGTPLLS 444 >gi|332809376|ref|XP_003308229.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 1 [Pan troglodytes] Length = 682 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 20/133 (15%), Positives = 56/133 (42%), Gaps = 24/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T+ + +A++ + + S V+ +TDGE++ AS+ Sbjct: 142 PRGGTSICSGIKYAFQVI--------GELHSQLDGSEVLLLTDGEDNTASSC-------- 185 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKI---T 384 + ++ +G ++ +A+ + ++ + G F V+D + L+++F + Sbjct: 186 -IDEVKQSGAIVHFIALGRDA-DEAVIEMSKITGGSHFYVSDEAQNNGLIDAFGALTSGN 243 Query: 385 DKIQEQSVRIAPN 397 + ++S+++ Sbjct: 244 TDLSQKSLQLESK 256 >gi|218509026|ref|ZP_03506904.1| hypothetical protein RetlB5_16575 [Rhizobium etli Brasil 5] Length = 125 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 17/124 (13%), Positives = 40/124 (32%), Gaps = 28/124 (22%) Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ----------ICEYMRNAGMKI---Y 343 + +K + F++DG T T C+ +++ G+KI Y Sbjct: 2 TARRTPSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVLY 61 Query: 344 SVAVSAPPEG-------------QDLLRKCTDSSGQFFAVNDSRELLESFDKIT-DKIQE 389 + + P ++ C G +F V+ + + ++ + I+ Sbjct: 62 TTYLPLPSNSWYNTWIKPFQSEIPTKMQACAS-PGFYFEVSPTDGITDAMKALFLKVIRA 120 Query: 390 QSVR 393 + Sbjct: 121 PRIT 124 >gi|218510669|ref|ZP_03508547.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli Brasil 5] Length = 784 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 36/330 (10%), Positives = 84/330 (25%), Gaps = 30/330 (9%) Query: 69 KKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSAL 128 G+ + + +I + N I + Sbjct: 223 TVEFNNGAGFATPRDPVENREKIEASVLDPNENAKINPVSLTVNLKAGFPLGDVNSSFHA 282 Query: 129 TNLSLRSTGIIERSSENLAIS--------ICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 ++ S S + A+ L + S K + Sbjct: 283 VDIRQDSDQARTISLKGGAVPADKDFELTWKAALGKTPSAGLFREVKDGKTYLLAFVTPP 342 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 P + + I+ +S ++ + K + + Sbjct: 343 TAPDAAAAPTKREVVFVIDNSGSMSGQS-IEQARQSLALAISRLSKDDR-FNVIRFDDTM 400 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 Y G+V +N + + + L+ T PA+ A R + Sbjct: 401 TDYFNGLVAASP----DNREKAITYVRGLSADGGTEMLPALEDALRNQGPVASGA----- 451 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + V+F+TDG + ++++V + + P + + Sbjct: 452 ----LRQVVFLTDGAIGNEQQLFQEI-------SANRGDARVFTVGIGSAPNTYFMTKAA 500 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 G F A+ + ++ ++ K+Q Sbjct: 501 EIGRGTFTAIGSTDQVASRMGELFAKLQNP 530 >gi|322690259|ref|YP_004219829.1| hypothetical protein BLLJ_0067 [Bifidobacterium longum subsp. longum JCM 1217] gi|320455115|dbj|BAJ65737.1| conserved hypothetical protein [Bifidobacterium longum subsp. longum JCM 1217] Length = 380 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 39/348 (11%), Positives = 101/348 (29%), Gaps = 30/348 (8%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A LS + + DQ++ L+ + ++ D + Sbjct: 57 ATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGSLEIMNALKAGGQDHDAVWPASSMW 116 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 ++I + A + A + +++ ++ + + Sbjct: 117 ISMGDTKHIVKDAASTSTTPI------VFGIAKSKAVKLGWADDTGAAKPVSTADILAAV 170 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + ++ + + L P + W + + S + ++ ++ Sbjct: 171 SDGKLTFSMTSATVIDSALNVYQTALR-KPSWTIWVVDYSGSMSGEGKNGVVKGLNAALD 229 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + +K+ E + V I + T S +++ + + T Sbjct: 230 -----PDQAKKSYIEPASGDVNILIPFETEAHRPVKATGTS--TSDLLHEADATDASGGT 282 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + Y + A EL +E E+S ++ +TDG ++ + Sbjct: 283 DIYEGLLSALDELPSESEASQ-------YTTAIVLMTDGRSNSDHQDEFESAYKS----- 330 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 R + I+S+ L+ S+ + F +L F + Sbjct: 331 RGRDLPIFSIMFG--DADPSQLKSLATLSNAKVFDGRS-GDLAAVFRQ 375 >gi|156383257|ref|XP_001632751.1| predicted protein [Nematostella vectensis] gi|156219811|gb|EDO40688.1| predicted protein [Nematostella vectensis] Length = 161 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 47/140 (33%), Gaps = 20/140 (14%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYN 290 + +G + Y+ ++ + +N + P T T A+ A +L+ Sbjct: 39 SSERTHVGLVLYSSFTQLKFNFDKYSDSASIVKAINTTDYPKGGTRTGEALKMAKSQLFG 98 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 S + K +I +TDG +++ + +++ G+ I++V V Sbjct: 99 ASMRS--------VPKVLIVLTDG--------RSSDKVEAPSKALKDEGVVIFAVGVG-D 141 Query: 351 PEGQDLLRKCTDS--SGQFF 368 L S F Sbjct: 142 QIDPSELNVMASDSKSDHVF 161 >gi|291231970|ref|XP_002735935.1| PREDICTED: chloride channel accessory 2-like, partial [Saccoglossus kowalevskii] Length = 849 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 22/131 (16%), Positives = 46/131 (35%), Gaps = 19/131 (14%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + T + A L N G+ +I ++DGE + + + +T Sbjct: 375 GDATGIGSGLSEAIDVLENG--------GNDAAGGCIILVSDGEENRSPYIDDVQST--- 423 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS--RELLESFDKITDK--- 386 + + G+ ++++A+ F+ + L E+F I + Sbjct: 424 ---IVDKGVCVHTIALGVDASHNMEQLPLATDGKSFYYSENPYSNALNEAFITIAKQDTN 480 Query: 387 IQEQSVRIAPN 397 +QSV+I N Sbjct: 481 ALDQSVQIYSN 491 >gi|196007110|ref|XP_002113421.1| hypothetical protein TRIADDRAFT_57572 [Trichoplax adhaerens] gi|190583825|gb|EDV23895.1| hypothetical protein TRIADDRAFT_57572 [Trichoplax adhaerens] Length = 1343 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 20/179 (11%), Positives = 49/179 (27%), Gaps = 19/179 (10%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP--Y 272 + L+ + I + + IG I ++ NN + + L P Sbjct: 312 SNLQQLIQAATNVILQLGQIDGSIGIIIFSTSATVTCPLMAVNNDQDKNKLIGCLPPEAS 371 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+ + L +I ++DG+ + + Sbjct: 372 GGTSIGSGILKGIELLLGS------VGEQKPSGGHLIVMSDGQENA------NPRIKDVM 419 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELL--ESFDKITDKIQ 388 + + + S++ +L + G + + + L +F I + Sbjct: 420 SNITENDVVVTSISFGQSAS--KVLEDLAKSTGGSSYFASTNGTLTLMNAFTAIISNLV 476 >gi|221104611|ref|XP_002170515.1| PREDICTED: similar to Collagen alpha-1(XII) chain [Hydra magnipapillata] Length = 1137 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/188 (14%), Positives = 67/188 (35%), Gaps = 21/188 (11%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 A + ++ + + ++ + + + A I N P + Sbjct: 305 ASTSVTSTNWKKTVSFVQSFTKEFVMGPTGVRFAVIDFANEAQIQINILDPKYWSQEAFS 364 Query: 264 SRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 ++ + T T A+ A +++ +K + T + K +I +TDG+++ Sbjct: 365 RKVGSIEYSRGRTKTDLAIKLAREKIFCDKCNLRRT-----VPKLLIVLTDGQST----- 414 Query: 323 QNTLNTLQICEYMR-NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + T + ++ + + I ++ V ++ L +S F LL + Sbjct: 415 -DPDLTEKEANLIKTQSKVSIITMGVG-DKIDKNELTSMASNSDYVF-------LLNGYK 465 Query: 382 KITDKIQE 389 I DKI + Sbjct: 466 YINDKINQ 473 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 52/170 (30%), Gaps = 17/170 (10%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSRLNKLNP-YENT 275 V K + +R I ++ P + +++ + T Sbjct: 975 EFVQDFSKEFKMGSTG-IRFAVIDFSDAATLQIDILDPSLWSNEAFNEKISNIEYSMGKT 1033 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 T A+ A +++ + + K VI +TDG ++ TL + Sbjct: 1034 RTDLALKLAREKVFCRECGLRV-----NVPKLVIVVTDGRSTF------PFLTLPEARLI 1082 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 +N + + V + L F +N R + + ++I Sbjct: 1083 KNQ-ASVICMGVG-DEVDINELNTMATDKNHVFLLNGYRYINDKVNQILK 1130 >gi|306829469|ref|ZP_07462659.1| collagen adhesion protein [Streptococcus mitis ATCC 6249] gi|304428555|gb|EFM31645.1| collagen adhesion protein [Streptococcus mitis ATCC 6249] Length = 861 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 39/302 (12%), Positives = 85/302 (28%), Gaps = 44/302 (14%) Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSME--DLYLQKHNDNNNMTSNKYLLPPP 184 LSL T + +++ + + +V D+S SM D+ + + K L Sbjct: 57 DKYELSLDITSKLGTETQSEPLDVVLVADLSGSMNKRDVPSSTGRTITRLDALKNTLKGT 116 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK----AIQEKKNLSVRIGT 240 + + + + + +K Sbjct: 117 RDRQGLIDTILSNSNNRLSMVGFGGKIDNKFAEQSWNSYYRKWEWGYRYWPYEERTAFYD 176 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYE-----------NTNTYPAMHHAYREL 288 + +N+ + K+ ++K+ TN + A + + Sbjct: 177 GVSPWDDA-DTILNWNNDASGSKTAVSKMRIAGGQSIGTESGIGTGTNISAGIRIANQLI 235 Query: 289 YNEKESSHN------------TIGSTRLKKFVIFITDGENSGASAYQNTL----NTLQIC 332 + + ++ + + DG + + N L N L Sbjct: 236 DSARPNAKKVVIVLSDGFANMYYNDSGYTVYNYNNQDGSETAPEWFWNNLDVSINNLAYS 295 Query: 333 EYMRNAGMKIYSVAV--SAPPEGQDLLRK-----CTDSSGQFFAVNDSRELLESFDKITD 385 + G YS+ S + L+ + + + N+ +L +SF ITD Sbjct: 296 LAPKLDG--FYSIKFRYSNNVDSITSLQYYIRYHNSSIPNEILSANNEDQLRDSFKDITD 353 Query: 386 KI 387 KI Sbjct: 354 KI 355 >gi|256393600|ref|YP_003115164.1| Vault protein inter-alpha-trypsin domain-containing protein [Catenulispora acidiphila DSM 44928] gi|256359826|gb|ACU73323.1| Vault protein inter-alpha-trypsin domain protein [Catenulispora acidiphila DSM 44928] Length = 1033 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 28/206 (13%), Positives = 56/206 (27%), Gaps = 22/206 (10%) Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 T + A + + I + R + ++ + Sbjct: 310 PTGATRPRDVALILDRSGSMGGWKMTAARRAAARIVDTLTAEDRFAVLTFDDQMETPDGL 369 Query: 254 PLSNNLNE------VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P + L ++ T P + A L ++ + Sbjct: 370 PTGLSEATDRHRFRAVQHLATVDARGGTEMEPPLRRAATLLSDDNPDR---------DRV 420 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 +I ITDG+ TL+ + +++++V + L R T G Sbjct: 421 LILITDGQVGNEDRLLTTLSP-------KLTHIRVHTVGIDTAVNAAFLQRLSTLGGGHC 473 Query: 368 FAVNDSRELLESFDKITDKIQEQSVR 393 V L ++ D I +I V Sbjct: 474 ELVESEDRLDDAMDAIHHRIATPLVT 499 >gi|218672134|ref|ZP_03521803.1| hypothetical protein RetlG_11055 [Rhizobium etli GR56] Length = 125 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 18/125 (14%), Positives = 42/125 (33%), Gaps = 28/125 (22%) Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ----------ICEYMRNAGMKI--- 342 + +T +K + F++DG T T C+ +++ G+KI Sbjct: 1 GDGTSNTSAEKILFFVSDGVGDSYKPSTCTKKTTGGRCQEPIDTSFCKPLKDRGVKIAVL 60 Query: 343 YSVAVSAPPEG-------------QDLLRKCTDSSGQFFAVNDSRELLESFDKIT-DKIQ 388 Y+ + P ++ C G +F V+ + + ++ + I+ Sbjct: 61 YTTYLPLPSNSWYNTWIKPFQSEIPTKMQACAS-PGFYFEVSPTEGITDAMKALFLKVIR 119 Query: 389 EQSVR 393 + Sbjct: 120 APRIT 124 >gi|312070072|ref|XP_003137977.1| hypothetical protein LOAG_02391 [Loa loa] gi|307766862|gb|EFO26096.1| hypothetical protein LOAG_02391 [Loa loa] Length = 647 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 26/151 (17%), Positives = 51/151 (33%), Gaps = 16/151 (10%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELY 289 V I I Y N ++ + + +N T T A+ A EL+ Sbjct: 103 TDKDQVHIAMIQYAETPTIEFSLDTYRNPRDITNHIMTINFHSGGTRTGKALLAAKVELF 162 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN-AGMKIYSVAVS 348 +EK+ + K ++ TDG + + ++ + +R +KIY V V Sbjct: 163 SEKKGAR-----KNASKIIVLFTDGL--------SVDDPVKHAQQLREVEKVKIYVVYVG 209 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + + + F ++ L Sbjct: 210 SDGFEYE-MDRIAGGKSNIFGPHEFTRLKRI 239 >gi|301792481|ref|XP_002931207.1| PREDICTED: epithelial chloride channel protein-like [Ailuropoda melanoleuca] Length = 904 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 18/113 (15%), Positives = 41/113 (36%), Gaps = 21/113 (18%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ + ++ + + + +I +TDGE+ S+ Sbjct: 381 EASGGTSICSGLRAGFQAIIHS--------NQSTSGSEIILLTDGEDDQISSCF------ 426 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQ-DLLRKCTDSSGQFFAVN-DSRELLESF 380 E ++ +G I+++A+ + + L + G F N D L ++F Sbjct: 427 ---EEVKQSGAVIHTIALGPSAARELETLSNM--TGGYRFYANKDINGLTDAF 474 >gi|17538702|ref|NP_499959.1| hypothetical protein C18H7.1 [Caenorhabditis elegans] gi|14573846|gb|AAF98615.2| Hypothetical protein C18H7.1 [Caenorhabditis elegans] Length = 425 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 51/164 (31%), Gaps = 22/164 (13%) Query: 235 SVRIGTIAYNI--GIVGNQCTPLSNNLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNE 291 ++ + + EV ++ L T + A +L Sbjct: 274 YTQVAAVTFATVGRTRVRFNLKKYQTQEEVLRGIDNLKSRGGTTAIGAGIEKALTQLDES 333 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + K ++ TDG ++ + + ++G ++Y+VA +A Sbjct: 334 EGARPGIAT-----KVMVVFTDGWSNKG------PDPEKRARDAVSSGFEMYTVAYTAHT 382 Query: 352 EGQDLLRK---CTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 G L S D F + DKI+++++ Sbjct: 383 PGAVTLNNETLSAISGDVHHTFTDVT-----FQALIDKIKQRNL 421 >gi|327281097|ref|XP_003225286.1| PREDICTED: integrin alpha-M-like [Anolis carolinensis] Length = 292 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 47/136 (34%), Gaps = 18/136 (13%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + + +N+L ++ T + A +K + +F++ +TDGE Sbjct: 110 DPDHLMREVNQLR--GSSYTATGIRKATELFTTQKGARATAK------RFLVVVTDGE-- 159 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVS---APPEGQDLLRKCTDS--SGQFFAVND 372 + + L ++ E A + +++ V Q L F V Sbjct: 160 ---KFGDMLEYAEVIEEANRAKITRFAIGVGIVFTSRVAQRELHSIGSHPVPDHVFVVRH 216 Query: 373 SRELLESFDKITDKIQ 388 L + ++ +KI Sbjct: 217 FTGLRDIQTQLKEKIC 232 >gi|198424353|ref|XP_002120419.1| PREDICTED: similar to mCG120740 [Ciona intestinalis] Length = 1650 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 32/223 (14%), Positives = 68/223 (30%), Gaps = 36/223 (16%) Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + + ++ ++ ++ N ++++ N V I A + Sbjct: 371 FDRRIVLVLDISTSMENYGRMGLMRQAVSNFIDTV------PMNTWVGIVVFASRANTLA 424 Query: 250 NQCTPLSNNLNEVKSR--LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 S + + +N T+ + L Sbjct: 425 RLTEITSYDARNILKTRLVN--TTVVGTSIGSGIMKGLEVLETSGP-----RSLRGSGGS 477 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQ 366 +I +TDG +T+ E +R G+++ ++A+ + L S+G+ Sbjct: 478 IIILTDGLEHNNPKINDTI------ERVREFGVRVSTIALGSNVAKD--LEWLASVSNGR 529 Query: 367 FFAVNDSR-----ELLESF------DKITD-KIQEQSVRIAPN 397 A + + EL E+F + I I Q V I N Sbjct: 530 THAASSGQFGIDAELQEAFASHHEQEDIGSISILSQEVVIEKN 572 >gi|115372062|ref|ZP_01459374.1| inter-alpha-inhibitor H4 heavy chain, putative [Stigmatella aurantiaca DW4/3-1] gi|115371027|gb|EAU69950.1| inter-alpha-inhibitor H4 heavy chain, putative [Stigmatella aurantiaca DW4/3-1] Length = 540 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 24/184 (13%), Positives = 59/184 (32%), Gaps = 17/184 (9%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ + ++ V + ++ + + + N+ + + + Sbjct: 85 SRMQIAKDALKYCVTRLN-----PQDTFNVVRFSTDVEALFPALKSAQPENIQKAVAFVE 139 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +L T A+ + + G + ++FITDG+ + + Sbjct: 140 QLEAIGGTAIDEALVRGLQ----------DNDGKSSAPHLLMFITDGQPTIGE--TDEGA 187 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 Q + R A ++++ V + L R +D +G V D +E DK+ Sbjct: 188 IAQHAKDGRKAKTRLFTFGVGEDLNARLLDRLSSDGAGTSDFVRDGKEFETKISSFYDKV 247 Query: 388 QEQS 391 Sbjct: 248 SNPV 251 >gi|254445696|ref|ZP_05059172.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198260004|gb|EDY84312.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 923 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 38/362 (10%), Positives = 99/362 (27%), Gaps = 26/362 (7%) Query: 25 IRNQM--QSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENA 82 R ++ Q+ ++ S + + T + + + +L S Sbjct: 401 ARPRLPFQNTANSPTPSSPSLAADTFNPEPITENPPSSFNLAEGLDNPNLVAASTANATQ 460 Query: 83 GDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSL----RSTGI 138 + T++ Q ++ ++ F + + Sbjct: 461 TAPNNDEPLPRTQNPPPTTQLSEYPESNTATDPQSTFSLNVSDVSYRLTEAYLAQNVRPP 520 Query: 139 IERSSENLAISICMVLDVSRS----MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 ++ D + + + + H + S Sbjct: 521 AGTLRTEEFVNAFDYGDPTPPVARKIGFTWERAHWPFAHDRDVLRFSLQTAAHGRASSQP 580 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 A + + + D + ++VNS+ A+Q R+ ++++ Sbjct: 581 LHLTLAIDTSGSMSRPDRV-----DIVNSLATALQSNLTEKDRLSIVSFDRQPRLVLDGQ 635 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + + +LNP T+ A+ +Y+ + + VI ITDG Sbjct: 636 SVTAETNLATLATQLNPQGGTDLESALQLSYQTAQRHFQENAINR--------VILITDG 687 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDS 373 + + L T +R G+ + + L + + G++ + Sbjct: 688 AANLGNTNAEQLRTTVTENRIR--GIALDCFGIGFDGHDDTFLESLSRNGDGRYRFLRSP 745 Query: 374 RE 375 + Sbjct: 746 ED 747 >gi|2623767|gb|AAB86531.1| Lu-ECAM-1 [Bos taurus] Length = 820 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 42/112 (37%), Gaps = 21/112 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + ++ + + + +I +TDGE++ ++ Sbjct: 382 ANGGTSICRGLKAGFQAIIHS--------DQSTSGSEIILLTDGEDNEINSCF------- 426 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN-DSRELLESF 380 E ++ +G I+++A+ P L ++ + G F N D L +F Sbjct: 427 --EDVKRSGAIIHTIALG--PSAAKELETLSNMTGGYRFFANKDITGLTNAF 474 >gi|23465165|ref|NP_695768.1| hypothetical protein BL0580 [Bifidobacterium longum NCC2705] gi|23325787|gb|AAN24404.1| hypothetical protein BL0580 [Bifidobacterium longum NCC2705] Length = 383 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 39/348 (11%), Positives = 101/348 (29%), Gaps = 30/348 (8%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A LS + + DQ++ L+ + ++ D + Sbjct: 60 ATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGSLEIMNALKAGGQDHDAVWPASSMW 119 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 ++I + A + A + +++ ++ + + Sbjct: 120 ISMGDTKHIVKDAASTSTTPI------VFGIAKSKAVKLGWADDTGAAKPVSTADILAAV 173 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + ++ + + L P + W + + S + ++ ++ Sbjct: 174 SDGKLTFSMTSATVIDSALNVYQTALR-KPSWTIWVVDYSGSMSGEGKNGVVKGLNAALD 232 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + +K+ E + V I + T S +++ + + T Sbjct: 233 -----PDQAKKSYIEPASGDVNILIPFETEAHRPVKATGTS--TSDLLHEADATDASGGT 285 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + Y + A EL +E E+S ++ +TDG ++ + Sbjct: 286 DIYEGLLSALDELPSESEASQ-------YTTAIVLMTDGRSNSDHQDEFESAYKS----- 333 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 R + I+S+ L+ S+ + F +L F + Sbjct: 334 RGRDLPIFSIMFG--DADPSQLKSLATLSNAKVFDGRS-GDLAAVFRQ 378 >gi|326926921|ref|XP_003209645.1| PREDICTED: integrin alpha-11-like [Meleagris gallopavo] Length = 1195 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 67/197 (34%), Gaps = 27/197 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V + Sbjct: 174 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVAA 231 Query: 266 LNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + + T T + A E + G K+ +I ITDGE Sbjct: 232 ASHIEQRGGTETRTAYGIEFARSEAF-------QKGGRKGAKRVMIVITDGE------SH 278 Query: 324 NTLNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDS 373 ++ + ++ E + Y+VAV + L + FF V D Sbjct: 279 DSPDLEKVIEDSEKDNVTRYAVAVLGYYNRRGINPEAFLNEIKFIASDPDDKHFFNVTDE 338 Query: 374 RELLESFDKITDKIQEQ 390 L + D + ++I Sbjct: 339 AALKDIVDALGERIFSL 355 >gi|325282943|ref|YP_004255484.1| von Willebrand factor type A [Deinococcus proteolyticus MRP] gi|324314752|gb|ADY25867.1| von Willebrand factor type A [Deinococcus proteolyticus MRP] Length = 535 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 49/365 (13%), Positives = 92/365 (25%), Gaps = 31/365 (8%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A LSG ++ ++ K+ + L + + Sbjct: 181 AALSGKGDAITAGDVQAGALKQFFKGQVLTSGSSGWLADAYVADQGRSQLNGLINYESVL 240 Query: 96 DKNNPLQ---------YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG-IIERSSEN 145 N Y ++ + P L LRS E Sbjct: 241 LSLNRGGRLQEPLKLIYPSDGLVTADYPLMLLNDARRSEYQALVDRLRSPQVQQRIMQET 300 Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 L + +S L + + + +L NT Sbjct: 301 LRRPAAPGVALSSEFPPGMLVELPFPASAGTIDAILGSYLNDVRRPANTIFVLDVSGSME 360 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE---- 261 +++ L + GNL + + R N Sbjct: 361 GK-RLEALKAALGNLSGADTSLGWRFAAFADRERVTLIPFSGDVEAVRSFQVNKASRAAD 419 Query: 262 ---VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + + L TN Y A+ AYR+ S+ + V+ +TDGE + Sbjct: 420 LQAIAAAGGALQAGGGTNIYGALSEAYRQAAAAPAGSYTS---------VVLMTDGEGTA 470 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 N +K ++V + Q++ + G+ F + L Sbjct: 471 G-PSLNEFRDFYAALPAGARSVKTFTVLFG-DSDVQEMNEVAALTGGRTFDGQ--QNLAA 526 Query: 379 SFDKI 383 +F +I Sbjct: 527 AFKEI 531 >gi|300776965|ref|ZP_07086823.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] gi|300502475|gb|EFK33615.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] Length = 634 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 18/144 (12%), Positives = 46/144 (31%), Gaps = 11/144 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 ++G + Y + + ++ L+ L +T + AY+ Sbjct: 312 KVGIVVYAGNAGMVLPSTSAGEKEKIIKALDNLQAGGSTAGGAGIELAYKLAKENFIKGG 371 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 N V+ TDG+ + +T + + R G+ + + + Sbjct: 372 NNR--------VVLATDGDFNVG--ASSTSDIETLITEKRKTGIFLTCLGYGMGNYKDNT 421 Query: 357 LRKCTDSS-GQFFAVNDSRELLES 379 + D G + +++ +E + Sbjct: 422 MEVLADKGNGNYAYIDNMQEANKF 445 >gi|118096010|ref|XP_413930.2| PREDICTED: similar to integrin alpha 11 subunit [Gallus gallus] Length = 1191 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 34/197 (17%), Positives = 67/197 (34%), Gaps = 27/197 (13%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I +E L+N ++K +++G + Y +V ++ +V + Sbjct: 174 GSNSIYPWVEVQHFLINILKKFYIGPGQ--IQVGVVQYGEDVVHEFHLNDYRSVKDVVAA 231 Query: 266 LNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + + T T + A E + G K+ +I ITDGE Sbjct: 232 ASHIEQRGGTETRTAYGIEFARSEAF-------QKGGRKGAKRVMIVITDGE------SH 278 Query: 324 NTLNTLQICEYMRNAGMKIYSVAV-SA----PPEGQDLL---RKCTDSSG--QFFAVNDS 373 ++ + ++ E + Y+VAV + L + FF V D Sbjct: 279 DSPDLEKVIEDSEKDNVTRYAVAVLGYYNRRGINPEAFLNEIKFIASDPDDKHFFNVTDE 338 Query: 374 RELLESFDKITDKIQEQ 390 L + D + ++I Sbjct: 339 AALKDIVDALGERIFSL 355 >gi|282877522|ref|ZP_06286340.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] gi|281300346|gb|EFA92697.1| von Willebrand factor type A domain protein [Prevotella buccalis ATCC 35310] Length = 345 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 58/166 (34%), Gaps = 48/166 (28%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++++ K L +P + TN A+ + + ++ + + Sbjct: 144 LPITSDYVSAKMFLQNADPSLITTQGTNIAQAIRLSMSSFTQQ----------DKVGRAI 193 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---------------APPEG 353 I ITDGE+ L+ + R G+ +Y + V G Sbjct: 194 ILITDGEDHEG-------EALEAAKEARKKGINVYILGVGETKGAPIPTPDGGYMTDDRG 246 Query: 354 QDLL--------RKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 Q ++ ++ + G + V+++ ++ ++ D++ + Sbjct: 247 QTVMTALNEKMCQEVAKAGEGTYIHVDNTS---DAQKQLNDELAKL 289 >gi|242034241|ref|XP_002464515.1| hypothetical protein SORBIDRAFT_01g019910 [Sorghum bicolor] gi|241918369|gb|EER91513.1| hypothetical protein SORBIDRAFT_01g019910 [Sorghum bicolor] Length = 704 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 20/200 (10%), Positives = 62/200 (31%), Gaps = 32/200 (16%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--PLSNNLNEVKSRL 266 K+ +L + ++ +++ + R+ +A++ + + + + Sbjct: 250 KLALLKRAMRFVIENLEPSD--------RLSVVAFSSSACRLFPLRKMTAFGQQQSQQAV 301 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + L TN + A R + + + +I ++DG +S ++ Sbjct: 302 DSLVADGGTNIAEGLRKAARVVED--------RQARNPVCSIILLSDGVDSHNLPPRDGS 353 Query: 327 NTL-QICEYM-------RNAGMKIYSVAVSAPPE---GQDLLRKCTD-SSGQFFAVNDS- 373 + + I++ + + + SSG F ++ Sbjct: 354 APEPDYAPLVPRSILPGSEHHVPIHAFGLGMDHDHDHDSRAMHAVAQMSSGTFSFIDMVG 413 Query: 374 RELLESFDK-ITDKIQEQSV 392 + ++ + I + V Sbjct: 414 SSIQDALAQCIGGLLSVSVV 433 >gi|71988814|ref|NP_499400.2| CUTiclin family member (cut-6) [Caenorhabditis elegans] gi|35210138|emb|CAA97806.2| C. elegans protein M142.2, confirmed by transcript evidence [Caenorhabditis elegans] Length = 572 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 49/158 (31%), Gaps = 18/158 (11%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + + I Y N+ +++ + ++ TNT A+ Sbjct: 74 ASRLNISEDGSHMALIQYAETPKLEFSLGQFNHPTQLEWAIQRIEYQSGATNTGQALRLT 133 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + K I ITDG++ + + + +R+A + +Y+ Sbjct: 134 LEKGLQGARPGI--------PKVAIVITDGQSQDDVSEPS--------QLLRDADVMVYA 177 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + V L + T + + F V +L + Sbjct: 178 IGV-TNLVNVHQLHQMTGNPVRVFTVESFEQLDRALAD 214 >gi|300856050|ref|YP_003781034.1| hypothetical protein CLJU_c28840 [Clostridium ljungdahlii DSM 13528] gi|300436165|gb|ADK15932.1| conserved hypothetical protein [Clostridium ljungdahlii DSM 13528] Length = 297 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 16/210 (7%), Positives = 52/210 (24%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 ++ ++I+ F Y +D+ I + ++ +A+D+ L+ + + Sbjct: 13 ISCLLITALLGFTAYVLDIGMIYIEKTKLTNAIDSGALAAALELPDNEVRARTAAVDYLQ 72 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + ++ + N A++KA Sbjct: 73 KNNVDPSLALITVGADHKSIQIEEVKNVKHLFAQIIGINSSNIKAKTKAVVAPAKSVTGG 132 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + S ++ + + + + S ++ + T + Sbjct: 133 IRPFAVEVYKFSYGDLVTLKEDAGDGYSGNYGAVSLGGSGGSVFRANALYGYSGTISVGD 192 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 + Y + Sbjct: 193 YIDTEPGNMAGACNDIKNYINSEHSTFDNF 222 >gi|303240541|ref|ZP_07327057.1| protein of unknown function DUF2134, membrane [Acetivibrio cellulolyticus CD2] gi|302591943|gb|EFL61675.1| protein of unknown function DUF2134, membrane [Acetivibrio cellulolyticus CD2] Length = 305 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 21/111 (18%), Positives = 41/111 (36%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AII++V F ++D+ I + + ++ + +DAA L+G +V++ + + Sbjct: 18 FAIILTVIVAFAALSVDVGVIAFEKAKLSNTVDAAALAGAQELVTNVSNTNNVVNNYIAK 77 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYE 112 + S NN IA +KA+ E Sbjct: 78 NNSGLNESTIIVNESERSVKVTSSKTVENNFAKVFGNNSQDVIATAKAKVE 128 >gi|77552603|gb|ABA95400.1| von Willebrand factor type A domain containing protein [Oryza sativa Japonica Group] Length = 574 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 76/201 (37%), Gaps = 35/201 (17%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP--LSNNLNEVK 263 + ++D+L + ++ ++ A R+ +++N +V +++ +++ Sbjct: 127 MSSRLDLLKIAMKYIIKLVRDAD--------RLAIVSFNHAVVSEYGLTRNSADSRKKLE 178 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS------------TRLKKFVIFI 311 + ++KL NT+ PA+ A ++ + + + + + F++ + Sbjct: 179 NLVDKLKASGNTDFRPALKKAVEDMNIQNIKNSSAYNNFQILDGRGKEEKKKRVGFILLL 238 Query: 312 TDGENSGASAYQNTLNTLQICEY---------MRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +DG + +N ++ + +++ SA + L + Sbjct: 239 SDG---VDQFQYSRINWEKVAKSTDVDHSEVGAMLRKYAVHTFGFSASHDPVPLRQISAL 295 Query: 363 SSGQFFAV-NDSRELLESFDK 382 S G + V + + E+F + Sbjct: 296 SYGLYSFVCKNLDNITEAFAR 316 >gi|269126104|ref|YP_003299474.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268311062|gb|ACY97436.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 583 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 24/202 (11%), Positives = 62/202 (30%), Gaps = 11/202 (5%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + + + S P P ++ + V+ + + Sbjct: 391 ANVLFLVDVSGSMAEPLPGTGRTRMQQAQRALRQAVDDFVAVDRVGLWEFSTDLGGGRDY 450 Query: 246 GIVGNQCTPLSNN-LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 + + ++ ++ L P +T Y + A+R + ++ Sbjct: 451 RPLVPIRPMSTPGHRERLREQIAALRPRGDTGLYDSTLAAFRHVRAVRQDGAI------- 503 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 V+ +TDG N + L+ + G++I+++A A +G L + + Sbjct: 504 -NSVVVLTDGRNDDPGGGLSLEELLKELDGA--DGVRIFTIAYGAGADGGALKKISEATD 560 Query: 365 GQFFAVNDSRELLESFDKITDK 386 + D L + ++ Sbjct: 561 AAAYDSRDPATLDKVLTQVVSN 582 >gi|295669664|ref|XP_002795380.1| von Willebrand factor type A domain containing protein [Paracoccidioides brasiliensis Pb01] gi|226285314|gb|EEH40880.1| von Willebrand factor type A domain containing protein [Paracoccidioides brasiliensis Pb01] Length = 773 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 31/208 (14%), Positives = 64/208 (30%), Gaps = 15/208 (7%) Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANR---KIDVLIESAGNLVNSIQKAIQEKKNLSV 236 + P + + S AP P K + S +L + I E N + Sbjct: 68 DIRHVPCDIVLCIDVSGSMQLSAPLPTTDESGKREETGLSVLDLTKHAARTIIETLNEND 127 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNE--VKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 R+G + ++ ++ N+ + L P +TN + + L Sbjct: 128 RLGVVTFSNDAEVAYKISHMDDTNKKAALEAVEALQPLASTNLWHGLKLGLSVLGKVDLR 187 Query: 295 SHNTIGSTRLKKFVIFITDGENS-GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 N + + +TDG+ + I E ++ I++ Sbjct: 188 PQNV-------QALYVLTDGQPNHMCPRQGYVPKLRPILERQKDRLPLIHTFGFGYDIRS 240 Query: 354 QDLLRKCTD-SSGQFFAVNDSRELLESF 380 LL+ + G + + D+ + F Sbjct: 241 -GLLQSIAEVGGGTYSFIPDAGMIGTVF 267 >gi|126336627|ref|XP_001380264.1| PREDICTED: similar to Inter-alpha trypsin inhibitor, heavy chain 3 [Monodelphis domestica] Length = 894 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 18/164 (10%), Positives = 53/164 (32%), Gaps = 12/164 (7%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 KK+ + + ++ P + NL + ++++ TN + Sbjct: 311 SDVKKDDFLNFILFSSDVRTWKENLVPATPENLKAAEEFVHQIQATGGTNINDGLLRGIE 370 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + +E R +I +TDGE + + + + +Y++ Sbjct: 371 MVNKAREMGTVL---DRSTSIIIMLTDGEANVGESRVEKI--QENVRNAIGGKYPLYNLG 425 Query: 347 VSAPPEGQDLLRKCTDSSGQF----FAVNDSR-ELLESFDKITD 385 + L + + + +D+ ++ +D++ + Sbjct: 426 FGYDVN-YNFLERMALENNGVARRIYEDSDANLQMQGFYDEVAN 468 >gi|297567245|ref|YP_003686217.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] gi|296851694|gb|ADH64709.1| von Willebrand factor type A [Meiothermus silvanus DSM 9946] Length = 467 Score = 51.1 bits (120), Expect = 3e-04, Method: Composition-based stats. Identities = 41/268 (15%), Positives = 84/268 (31%), Gaps = 42/268 (15%) Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 + E+ + + +L+ P L+L+ E + + + V+D S SM ++ Sbjct: 3 VVEALLEASVQPHREYLQANQPGQKLFLALKIRPSAEATRSRPQLVVAFVVDTSGSMREV 62 Query: 164 YLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS 223 P + + A KID++IE+ NL++S Sbjct: 63 VT---------------EPTERTGQSVRVDGKDYEVVR---GAKSKIDLVIEALQNLLSS 104 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMH 282 Q + R+ + ++ Q +N + + +L T M Sbjct: 105 PQLQPSD------RLAIVKFDDVAEVVQPFTPANEKARLVAAAERLTQYSGGTQMGAGMR 158 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 R L E + +I +TDG+ + + + A + + Sbjct: 159 EGMRLLEREAG-----------SRRLILLTDGQT------FDEPLVETVAAQLAQARIPV 201 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAV 370 ++ V L + G+ F V Sbjct: 202 TAIGVGDEWNDDLLAEITDRTQGKPFHV 229 >gi|87310828|ref|ZP_01092954.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Blastopirellula marina DSM 3645] gi|87286343|gb|EAQ78251.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated [Blastopirellula marina DSM 3645] Length = 788 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 23/138 (16%), Positives = 48/138 (34%), Gaps = 19/138 (13%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + ++ L +TN A+ A L ++K +++F+TDG + Sbjct: 355 REKALGFVDNLYAGGSTNIDGALAKAMGMLKDDKRP-----------SYMLFLTDGLPTH 403 Query: 319 ASAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 N +I + + ++ S V + L R + GQ V + + Sbjct: 404 G-----EQNEAKIVDNAKQKNDVRARVISFGVGYDVNSRLLDRLSRECFGQSEYVRPNED 458 Query: 376 LLESFDKITDKIQEQSVR 393 + K+ +KI + Sbjct: 459 IETHVAKLYNKISAPVMT 476 >gi|260789946|ref|XP_002590005.1| hypothetical protein BRAFLDRAFT_224914 [Branchiostoma floridae] gi|229275192|gb|EEN46016.1| hypothetical protein BRAFLDRAFT_224914 [Branchiostoma floridae] Length = 159 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 49/155 (31%), Gaps = 22/155 (14%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS--NNLNEVKSRLNKLNP 271 +VN + R+G + ++ N L+ E+ + + + Sbjct: 23 KTFVQTVVNYFTLGEND-----TRVGVVTFSDRDRQNTRVTLNEHYTRVELLTEIRDIPY 77 Query: 272 Y-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 +T T + H + E N F+I ITD ++ + ++ Sbjct: 78 DRGHTYTGLGLDHVRNNSFLEVNGRRN-----NTLDFLIVITD--------DESEDDIVR 124 Query: 331 ICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSS 364 + +R G+ ++ V V + Q L Sbjct: 125 PAQLIRQMGITVFVVGVGQESDISQPTLETIAGDP 159 >gi|310119165|ref|XP_003118918.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] gi|310126588|ref|XP_003120448.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] Length = 535 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 19/94 (20%), Positives = 38/94 (40%), Gaps = 12/94 (12%) Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 G + ++VI ITDG++S + + E +R G+ IY++ + D Sbjct: 6 ADTGRINVARYVIVITDGKSSDS--------VAEAAEGLRANGVNIYAIGI--REANIDE 55 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L++ + F V + L + ++ I Sbjct: 56 LKEIAK--DKIFFVYEFDLLKDIQKEVVQDICSS 87 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 40/117 (34%), Gaps = 16/117 (13%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T A++ + K +++++I ITDG + Sbjct: 138 GTRTGKALNFTLPFFDSSKGGRP------SVQQYLIVITDGVAQDNVIIP--------AK 183 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +R+ I+++ V + LL T+ + + + L +I K+ + Sbjct: 184 ALRDKNTIIFAIGVG-EAKKSQLLE-ITNDEDKVYHDVNFEALQNLEKEILSKVCDP 238 >gi|171912901|ref|ZP_02928371.1| hypothetical protein VspiD_17015 [Verrucomicrobium spinosum DSM 4136] Length = 339 Score = 51.1 bits (120), Expect = 4e-04, Method: Composition-based stats. Identities = 21/159 (13%), Positives = 50/159 (31%), Gaps = 31/159 (19%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 G P + + + + + ++++ T M ++ Sbjct: 146 GGAPYNPCPPTLDHDWLLNNMDRIQ----TGI---MEDGTAIGSGIAAAARRLDQLEVKS 198 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-------------- 351 K ++ +TDG N+ L+ G++I+++++ P Sbjct: 199 KVILLMTDGANNSG-----KLSPQDAARLAATLGIRIHAISIGTPGMHPIYMPNGPPINS 253 Query: 352 ----EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + L++ + SG FF D L F + + Sbjct: 254 GRQEFDPETLQEVANIGSGSFFRAEDLSTLERIFKTVDE 292 >gi|291575288|gb|ADE10213.1| hypothetical protein [Actinoplanes liguriensis] Length = 432 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 24/196 (12%), Positives = 45/196 (22%), Gaps = 26/196 (13%) Query: 1 MTAIIIS-----VCFLFITYAIDLAHIMYIRNQMQSALDAA------VLSGCASIVSDRT 49 +TA++ V ID+ + R Q+QS DAA +G A Sbjct: 18 ITALVAVLAGAGVLLGMAALVIDIGALYAEREQLQSGADAASWKVAQACAGTAGRDLTSA 77 Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP--------- 100 ++D + K + + N Sbjct: 78 TCTVAAQRDNAQRYADRNAKDLVSDVQFCITTVSAAGVTTADAGCPSSWNTPVTCPAPPS 137 Query: 101 ---LQYIAESKAQYEIPTENLFLKGLIP---SALTNLSLRSTGIIERSSENLAISICMVL 154 E + + L + + + A++ + L Sbjct: 138 ASGPYRYVEVRTSTRNSDNTSVVPPLFGRGLAGSAYHGAKMGACGRVAWGAPAVTDVLAL 197 Query: 155 DVSRSMEDLYLQKHND 170 VSR + Sbjct: 198 GVSRCDFLRLTGDYTR 213 >gi|296221263|ref|XP_002756657.1| PREDICTED: von Willebrand factor A domain-containing protein 2 [Callithrix jacchus] Length = 725 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 49/136 (36%), Gaps = 18/136 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G ++ + EVK+++ ++ T T A+ + Sbjct: 82 DISPERVRVGAFQFSSTPHLEFPLDSFSTQQEVKAKIKRMIFKGGRTETGLALKNLLHRG 141 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ + + +I ITDG++ G A + ++ G+ +++V V Sbjct: 142 FPGGRNA-------SVPQILIIITDGKSQGHVALP--------AKQLKEKGVTVFAVGVR 186 Query: 349 APPEGQDLLRKCTDSS 364 P + L Sbjct: 187 FPRWEE--LHMLASEP 200 >gi|198430887|ref|XP_002124814.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 400 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 25/195 (12%), Positives = 56/195 (28%), Gaps = 20/195 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEK-----KNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 ++ L S Q+ + + + L N Sbjct: 188 GETNFGLVKNWVVALAKSFDIEKQDNIGIIQYSHWYPGVPYSQQPYMKTEVPLGLYKNFT 247 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + ++ T HA + + +S + K +I ITDG Sbjct: 248 LFEKIAQNISLQGFTTYT---AHALNKTVLDFMASERF-MHENVTKVMILITDGR----- 298 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS----SGQFFAVNDSREL 376 + ++ EY+R+ G+ +++ + +D L+ + + F V L Sbjct: 299 -ADDAIDLYSSAEYVRSQGIITFAIGIG-NSVLRDQLQIVANGKLGEDTRVFEVTTFTSL 356 Query: 377 LESFDKITDKIQEQS 391 + I++ Sbjct: 357 NSILSLLRSSIKQSL 371 >gi|260837103|ref|XP_002613545.1| hypothetical protein BRAFLDRAFT_208193 [Branchiostoma floridae] gi|229298930|gb|EEN69554.1| hypothetical protein BRAFLDRAFT_208193 [Branchiostoma floridae] Length = 184 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 14/125 (11%), Positives = 40/125 (32%), Gaps = 14/125 (11%) Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHH 283 + + + ++ + Y L ++ S +N + T T A+ + Sbjct: 73 TTSDFQIGPDNAQVSVVQYESSPTEEFPLDRYATLEDLLSAINLIPYRGGGTRTGRAIDY 132 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + + K +I +TDG+ + + + + +G+ + Sbjct: 133 VVTTTLTVSRGAR-----QGVPKVIIVVTDGQ--------SGDDVREPARRAKQSGIIMV 179 Query: 344 SVAVS 348 ++ V Sbjct: 180 AIGVG 184 >gi|300727143|ref|ZP_07060562.1| BatB protein [Prevotella bryantii B14] gi|299775687|gb|EFI72278.1| BatB protein [Prevotella bryantii B14] Length = 340 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 22/211 (10%), Positives = 69/211 (32%), Gaps = 59/211 (27%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ NLV+ + +IG + + P++++ K L Sbjct: 110 SRLTKSKMLVENLVDHFN---------NDQIGLVVFAGQSYVQ--LPITSDYVSAKMFLQ 158 Query: 268 KLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + P + T+ A++ + ++ K +I ITDGE+ Sbjct: 159 DIQPSLIQTQGTDIAGAINTCMHAFTP----------NDKVGKAIIVITDGEDHEG---- 204 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAP-----------------------PEGQDLLRKC 360 ++ + + G+ ++ + + + + ++ Sbjct: 205 ---GAIEAAKAAHDRGINVFILGIGDSKGAPIPMGNGDYLKDRSGNTVMTAVNEQMCKQI 261 Query: 361 TDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 ++ G++ V+++ ++ +++ D + Sbjct: 262 AEAGSGKYIHVDNT---KDAENQLNDDLSRL 289 >gi|156382057|ref|XP_001632371.1| predicted protein [Nematostella vectensis] gi|156219426|gb|EDO40308.1| predicted protein [Nematostella vectensis] Length = 297 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 67/192 (34%), Gaps = 20/192 (10%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + D L E ++ +I + Y+ + + + Sbjct: 37 VDGSGSIGSSNFDRLKEFVSTVIGGF-----VISPQGTQISVVVYHSSAKTHLSFGDAQD 91 Query: 259 LNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 L V+ ++ + T T + A++ E + S+R + V+ I DG++ Sbjct: 92 LISVRRIISSIAYPSGPQTYTDRGLVEAHQRFAKENGAR-----SSRTTRVVVVINDGKS 146 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 +G S + +++ G+ I ++ + ++ L+ S G + + L Sbjct: 147 NGES-------LEASSKPLKDEGIVIMALGFGSSVRVEE-LQTMASSQGDMHLYSTAIAL 198 Query: 377 LESFDKITDKIQ 388 + I + I Sbjct: 199 KNDANSIVNSIC 210 >gi|149371022|ref|ZP_01890617.1| hypothetical protein SCB49_05035 [unidentified eubacterium SCB49] gi|149355808|gb|EDM44366.1| hypothetical protein SCB49_05035 [unidentified eubacterium SCB49] Length = 325 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 20/165 (12%), Positives = 54/165 (32%), Gaps = 41/165 (24%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 P++ + K LN ++ ++ A+ A + + + + + I+D Sbjct: 124 PITTDFASTKLFLNSMDTDMVSSQGTAISEAVQMSTTYFD------DEEQKNRVLFVISD 177 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--------------------- 352 GE+ + E G+K+Y++ V Sbjct: 178 GEDHEGNF-------ESAIENATENGIKVYAIGVGTEKGGPIPIKRNGVLQSYKRDQDNQ 230 Query: 353 ------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 G+ L+ D G++ +++ +++ I + + ++ Sbjct: 231 QVITRLGEPTLQAIADEGNGKYINGTNTQAVVDEVKAILNGMDKK 275 >gi|332221825|ref|XP_003260065.1| PREDICTED: calcium-activated chloride channel regulator 4 isoform 2 [Nomascus leucogenys] Length = 684 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 55/133 (41%), Gaps = 24/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + +A++ + + S V+ +TDGE++ AS+ Sbjct: 142 ALGGTSICSGIKYAFQVI--------GELHSQLDGSEVVLLTDGEDNTASSC-------- 185 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKI---T 384 + ++ +G ++ +A+ + + ++ + G F +D + L+++F + Sbjct: 186 -IDEVKQSGAIVHFIALGS-AADEAVIEMSNITGGSHFYASDEAQNNGLIDAFGALTSGN 243 Query: 385 DKIQEQSVRIAPN 397 + ++S+++ Sbjct: 244 TDLSQKSLQLESK 256 >gi|254414399|ref|ZP_05028165.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] gi|196178629|gb|EDX73627.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] Length = 801 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 42/167 (25%), Gaps = 19/167 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSN---NLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 N I + PL+N N + + +N L T + Sbjct: 354 NGLNPDDTFTIIDFANTTTQLSPQPLANTAPNRTQALNYINGLQANGGTELMNGIRAVLN 413 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + + V+ +TDG N + + G ++YS Sbjct: 414 -FPPSAPNR---------LRSVVLLTDG------YIGNESQVIAEVQRQLKPGNRLYSFG 457 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 V + L R G + E + +I + Sbjct: 458 VGSSVNRFLLNRLAEVGRGTAKIIRQDEPTQEVAETFARQINNPVLT 504 >gi|149183229|ref|ZP_01861674.1| hypothetical protein BSG1_17256 [Bacillus sp. SG-1] gi|148849062|gb|EDL63267.1| hypothetical protein BSG1_17256 [Bacillus sp. SG-1] Length = 245 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 16/103 (15%), Positives = 40/103 (38%), Gaps = 10/103 (9%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----GQD 355 T + V+ ITDG ++ + + + + G+ + + V G + Sbjct: 1 MKTGTLRQVLLITDGCSNQGE------DPVAMAALAKEQGITVNVIGVMENDSIDERGME 54 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + S G + +++L ++ +T K Q+++ N+ Sbjct: 55 EIEGIAMSGGGVSQIVYAQQLSQTVQMVTRKAMTQTLQGVVNK 97 >gi|104779436|ref|YP_605934.1| surface adhesion protein [Pseudomonas entomophila L48] gi|95108423|emb|CAK13117.1| Surface adhesion protein [Pseudomonas entomophila L48] Length = 5862 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 36/318 (11%), Positives = 95/318 (29%), Gaps = 20/318 (6%) Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA 127 I +L + ++ ++ D + S Sbjct: 5205 ISTNLTDTDGSETLSIKLSGIPAGSVIADNAGHTFTVGASAVDVTGWNLGSLTIKPPAYY 5264 Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 ++ + S + + V+ + + +++ + Sbjct: 5265 YGQFDVKVSSTSTESVGGSTATTEGTIKVTVYPQTYSTSNLSSDSDNITGTDGNDVVVAD 5324 Query: 188 SFWSKNTTKSKYA------PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 Y + + + +D +S ++ ++ +++ ++ +V I + Sbjct: 5325 VSGLHVVPGQNYNLAFIVDTSGSMGSSGVDAAKKSLESVFKTLAASVKGDQSGTVNILLV 5384 Query: 242 AYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + + + L++ L + + LN L TN A N K+ + Sbjct: 5385 DFATQVKSSVAVTLNDAGLQTLLNALNNLRADGGTNYEDAFKTTANWFQNLKDGGNTGSN 5444 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-PEGQDLLRK 359 T FITDG+ + A +N+ TL +G+ + + S G Sbjct: 5445 QT------FFITDGKPTYYQANENSNPTLGT------SGITLDTFLASINYKSGMAYNGY 5492 Query: 360 CTDSSGQFFAVNDSRELL 377 S+ + +++++ L Sbjct: 5493 IDSSNTNYVSIDNAGNLQ 5510 >gi|327541351|gb|EGF27892.1| hypothetical protein RBWH47_01135 [Rhodopirellula baltica WH47] Length = 384 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/311 (10%), Positives = 73/311 (23%), Gaps = 24/311 (7%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVS---DRTIKDPTTKKDQT 60 I+ F + + A+I R ++ + D+A L+G ++ + + K Sbjct: 29 ILCVALFTVLAVVSEFAYINTARTELSRSADSAALAGAWALYDARVNGDHDGVSQVKSSA 88 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE---- 116 I ++ ++A +I + N + + Sbjct: 89 DLISSANHIGQIQAHVGSVDHAIEIGSYDHATQVFEPINDWASANAVRVNLHLTNSAHGE 148 Query: 117 -NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 L L GL +L T + ++S D + Sbjct: 149 LPLQLAGLTGRKSQSLHQSVTAAFQH-------------NISGFKRPPNNGTKIDILPIA 195 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 ++ K + + + + G + N S Sbjct: 196 LDEETWQLVSDKQTSDDLMYFNGSIVEGSDGFHECSLYPTGTGA-PGNRGTVDIGSSNNS 254 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 GI + L V +L+ +T + + EK Sbjct: 255 TSDLRRQILYGISQDDFDAL--GRELVFDENGELSLNGDTGISAGIKAELSSIIGEKRII 312 Query: 296 HNTIGSTRLKK 306 Sbjct: 313 PIFRKVQGNGN 323 >gi|281208974|gb|EFA83149.1| hypothetical protein PPL_03939 [Polysphondylium pallidum PN500] Length = 1103 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 64/220 (29%), Gaps = 12/220 (5%) Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 MT + L + S + + I + + + Sbjct: 144 TPTVAGAAAMTDGERDLLSNFVDTAVSSDVEIVFVFDTTGSMSSIIANVKSQVEATITRL 203 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 K I +G + G + L+++ ++ + + K+ + A A Sbjct: 204 TKDI--PNIKIGIMGLGDFCDGQNVLKTLDLTDDKTKLINFIKKVPMTGGGDAPEAYEFA 261 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + S+H K ++ I D S +N +Q + + G+KIY Sbjct: 262 LYMANSLSWSAH-------TSKALVMIGDEGPHPPSTTDLKINWIQQADDLAAKGVKIY- 313 Query: 345 VAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 + A + + + G + + E F I Sbjct: 314 -GIRAKCSQPAFYNEIAERTGGIAIDFDRFNLITEMFMAI 352 >gi|104782921|ref|YP_609419.1| RTX toxin [Pseudomonas entomophila L48] gi|95111908|emb|CAK16632.1| putative RTX toxin [Pseudomonas entomophila L48] Length = 2350 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/294 (10%), Positives = 81/294 (27%), Gaps = 23/294 (7%) Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 L GS G + A ++ + + S S Sbjct: 1440 NDLAAGSVTGSEPGSTGETASGSLVGSVSGGFGALTYSLVGNATGQYGQIQLNADGSYTY 1499 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ-------KHNDNNNMTSNKYLLP 182 L+ + + + ++ + + Sbjct: 1500 TLTSAPKTPGGTNDGANTVVEQFTYKATDALGNSVISTIAINIVDDVPKAESAVRSITPG 1559 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 + + S + + P ++++ ++ L++ +++ + Sbjct: 1560 EVDSNILLVVDVSSSMNSGSGVPGLTRLELAKQAINTLLDKYDDM------GDIKVQIVT 1613 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 ++ G +S ++E KS + L +T A A + + Sbjct: 1614 FSTGATMQTPVWVS--ISEAKSLIAGLTAGGSTYYDSAATKAQEAFVSAGKLVGA----- 1666 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + F +DGE SG + + ++ + G+K ++V + + +L Sbjct: 1667 --QNVSYFFSDGEPSGGHSIT-AVRETTWETFLDDNGIKSFAVGMGSGVNAGNL 1717 >gi|311271865|ref|XP_001927121.2| PREDICTED: LOW QUALITY PROTEIN: von Willebrand factor A domain-containing protein 2 [Sus scrofa] Length = 769 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 23/176 (13%), Positives = 58/176 (32%), Gaps = 25/176 (14%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G + + + EVK+ + ++ T T A+ + R+ Sbjct: 82 DVDPARVRVGAVQFGSTPRLEFPLDAFSTQQEVKAEIRRMAFKGGRTETGLALKYLLRKG 141 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ + + ++ +TDG + G A E ++ + +++V V Sbjct: 142 FPGGRNA-------SVPQVLLIVTDGRSQGHVAEP--------AEQLKQRDVTVFAVGVR 186 Query: 349 APPEGQDLLRKCTD--SSGQFFAVNDSRE-LLESFDKITDK----IQEQSVRIAPN 397 P + L + + F ++ I ++ P+ Sbjct: 187 FPRWEE--LHILASEPTEQHVLMAEQVEDAANGLFSSLSSSAICTITSPDCKVQPH 240 >gi|302392065|ref|YP_003827885.1| von Willebrand factor type A [Acetohalobium arabaticum DSM 5501] gi|302204142|gb|ADL12820.1| von Willebrand factor type A [Acetohalobium arabaticum DSM 5501] Length = 230 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 20/195 (10%), Positives = 58/195 (29%), Gaps = 21/195 (10%) Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 P + + + K + + L + + Sbjct: 36 VDQRQLPTFTDANNVEIIWDVSGSMWGKVEKNKKYL--RAKKALKDIVMMIPDHVNIGLR 93 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 G + +NN + + +++N L P + A+ A Sbjct: 94 TFGKGEDSDRSTNLVVNIATNNKSTLLAKINNLKPAGKSPIGKALSQA-----------G 142 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS-APPEGQD 355 + + ++ +TDG ++G + ++ + + G+ I+ + + + + Sbjct: 143 VDLINLDGNNHILLVTDGRDTG------NIVPSRVADRLSKNGIYIHVLKIGAVGNQQRA 196 Query: 356 LLRKCT-DSSGQFFA 369 +L+ G++F Sbjct: 197 VLKSIARLGGGRYFT 211 >gi|74001590|ref|XP_848825.1| PREDICTED: similar to Collagen alpha 1(VI) chain precursor [Canis familiaris] Length = 206 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 48/142 (33%), Gaps = 13/142 (9%) Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNEVKSRLNKLNPYE-NTNTYPA 280 + + + G +PL+ + + +K++++ + + T T A Sbjct: 74 RSCARTPRTRLPSSSVCTRKGGDEVEIISPLTPMPADRDALKAKVDAVKYFGKGTYTDCA 133 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + EL SH K++I +TDG ++ G+ Sbjct: 134 IKKGLEELLVG--GSHLKEN-----KYLIVVTDGHPLEGYKEPCG-GLEDAVNEAKHLGV 185 Query: 341 KIYSVAVSAPPEGQDLLRKCTD 362 K++SVA+ P + L Sbjct: 186 KVFSVAI-TPDHLEPRLSIIAT 206 >gi|225310539|emb|CAQ19230.1| collagen type XXVIII alpha 1 b precursor [Danio rerio] Length = 491 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 29/199 (14%), Positives = 62/199 (31%), Gaps = 18/199 (9%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL-SVRIGTIAYNIGIVGNQCTPLSNN 258 + A +LVN I + + R + Y+ ++ Q Sbjct: 46 VDSSESAKDNHGQEKSFVTDLVNHIPNIRLQTGQGLNFRTALLQYSSHVITEQSFKDWRG 105 Query: 259 LNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + +SR+ + T T A+ + R E K I + G Sbjct: 106 VPSFQSRVASIPFIGHGTYTTYAITNLTRIYLEESGPGTV--------KVAILMYGGA-- 155 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 ++ + +N G+K + V + + L+ + + V++ ++ Sbjct: 156 ---SHPKNPDIFSALADAKNQGIKFFIVGL-TSAANMEKLQLLASAPASRY-VHNIQD-K 209 Query: 378 ESFDKITDKIQEQSVRIAP 396 DKI +I + + P Sbjct: 210 GVVDKIIREITKVVDEVCP 228 >gi|94501046|ref|ZP_01307570.1| hypothetical protein RED65_05304 [Oceanobacter sp. RED65] gi|94426793|gb|EAT11777.1| hypothetical protein RED65_05304 [Oceanobacter sp. RED65] Length = 867 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 26/134 (19%), Positives = 50/134 (37%), Gaps = 8/134 (5%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N E +N TN AM A +L + + + V D + Sbjct: 109 TNAKEKAKMINSFGLR--TNIGEAMERATWKLAADSDFEQHA--ILLTDGIVDIAADDDP 164 Query: 317 SGASAYQNTLN--TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDS 373 + + + +N G+KI+++A+ + + LL K ++ G V +S Sbjct: 165 QKDNKNEAERQRILTDVLSEYKNLGVKIHTIAL-SNAADKVLLEKLALETGGMAEVVENS 223 Query: 374 RELLESFDKITDKI 387 +L+++F DK Sbjct: 224 EQLVKAFLNAFDKA 237 >gi|218193011|gb|EEC75438.1| hypothetical protein OsI_11969 [Oryza sativa Indica Group] Length = 516 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 19/145 (13%), Positives = 47/145 (32%), Gaps = 24/145 (16%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ + ++ L NTN + + ++ ++ V+ ++DG+ + Sbjct: 130 RADLGALVDGLAADGNTNIRAGLEIGLAVAAGRRLTAGRSVN-------VMLMSDGQQN- 181 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSREL 376 + G+ +++ + A +L+ G F V D L Sbjct: 182 ----------RADATRLDPGGVPVHTFGLGADH-DPAVLQAIAGKSREGMFHYVADGVNL 230 Query: 377 LESFDKITD---KIQEQSVRIAPNR 398 F ++ I Q + + R Sbjct: 231 TAPFSQLLGGLLTIIAQDLELTVTR 255 >gi|42524419|ref|NP_969799.1| putative secreted protein [Bdellovibrio bacteriovorus HD100] gi|39576628|emb|CAE80792.1| putative secreted protein [Bdellovibrio bacteriovorus HD100] Length = 469 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 41/265 (15%), Positives = 75/265 (28%), Gaps = 27/265 (10%) Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKK 187 S + S ++ D+ M + N K P Sbjct: 10 TLCAFASSAQEKDEPSVPQGMTQLKAEDLLDKM--DFDPSKIPANASGEVKAEDLPSTAI 67 Query: 188 SFWSKNTTKSKY-APAPAPANRKIDVLIESA-GNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 S P KI VL + L++ + L V + Sbjct: 68 PLVEYVIDSSGSMGQLMGPKKTKIYVLKKLLARYLMSQW--TEKTSSGLRVIGSRRKKDC 125 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P + L ++ + P T A+ AY+++ + K Sbjct: 126 KDNYLAIEPAQSKLGAIEGIVKGFEPVGMTPIGQALKDAYKDVEHYKGP----------- 174 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYM--RNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 K V+ TDGE + + +I + ++ +K + VA D+L K Sbjct: 175 KRVVLFTDGEETCG------QDPCKIAAELSGKDVDLKFFVVAFGL-QNQPDVLDKLACI 227 Query: 364 SGQFFAVNDSRELLESFDKITDKIQ 388 +D +L E F + ++ Sbjct: 228 GDM-SQADDEEKLEELFQDLDKQLN 251 >gi|328953619|ref|YP_004370953.1| Protein of unknown function DUF2134, membrane [Desulfobacca acetoxidans DSM 11109] gi|328453943|gb|AEB09772.1| Protein of unknown function DUF2134, membrane [Desulfobacca acetoxidans DSM 11109] Length = 333 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 24/209 (11%), Positives = 57/209 (27%), Gaps = 5/209 (2%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVS-DRTIKDPTTKKDQT 60 A+ +++ F+ A D+ H++ ++ ++Q DA L+G +V + T Q Sbjct: 24 FALTLAILCGFVALAFDIGHLVMVKAELQRTADAGALAGATGLVPYTGPVTSQTPDWLQG 83 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL----QYIAESKAQYEIPTE 116 + + I D A + + + A P Sbjct: 84 QSKAQTLINNTANLADNQVFTTTDSAVTYGYWLLNPPEGYVQSLPTVRPTTSAYLPQPAI 143 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + L + L S+ ++ + + + ++ + Sbjct: 144 RVTLSRNVDLYFAPLLGVSSPKTVNATATAILPETYRTKNTPPIAVARDIVYDIIGDSVV 203 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 +S + AP Sbjct: 204 INVDEQTITPRSNAGSAGWFNLSGENSAP 232 >gi|42526760|ref|NP_971858.1| batB protein, putative [Treponema denticola ATCC 35405] gi|41817075|gb|AAS11769.1| batB protein, putative [Treponema denticola ATCC 35405] Length = 322 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 39/158 (24%), Gaps = 43/158 (27%) Query: 252 CTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 PLS N + S +N L+P TN + A + K Sbjct: 139 SVPLSFEKNALSSAINALSPLILSSTGTNLEAGVLRALDSF----------GENRGNSKI 188 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---------PPEGQDLLR 358 ++ TDG + S L E ++ + V +G L+ Sbjct: 189 IVLCTDGGETSGS-------LLHAAEKIKKTDAILIIVGFGTLEETKIKVLDEKGNTQLK 241 Query: 359 K-------------CTDSSGQFFAVNDSRELLESFDKI 383 + + S + I Sbjct: 242 DARLEEAFLQKAASIAGGESTYISALSSGSIESILKII 279 >gi|315644678|ref|ZP_07897809.1| von Willebrand factor type A [Paenibacillus vortex V453] gi|315279937|gb|EFU43236.1| von Willebrand factor type A [Paenibacillus vortex V453] Length = 562 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 17/98 (17%), Positives = 40/98 (40%), Gaps = 15/98 (15%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 +N L NT T+ + A + L++E + +K + ++DGE + + + Sbjct: 454 AINSLEASGNTATFDGIVVAMKMLHDE------MAVNPDVKPLIFVLSDGETNVGHSLDD 507 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 I E ++ + +Y++ +A L+ + Sbjct: 508 ------IRELIKAFKIPVYTIGYNAN---IQALQSISS 536 >gi|72679950|gb|AAI00088.1| LOC498793 protein [Rattus norvegicus] Length = 568 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 17/134 (12%), Positives = 52/134 (38%), Gaps = 11/134 (8%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 K+ P TN A+ A L + +I ++DG+ + + + Sbjct: 1 KIQPSGGTNINEALLRAIFILNEASNMGLL---NPDSVSLIILVSDGDPTVGELKLSKIQ 57 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAV----NDSRELLESFDK 382 + + +++ + ++S+ + D L++ + ++ G + + S +L + +++ Sbjct: 58 -KNVKQNIQD-NISLFSLGIGFDV-DYDFLKRLSNENRGIAQRIYGNRDTSSQLKKFYNQ 114 Query: 383 ITDKIQEQSVRIAP 396 ++ + P Sbjct: 115 VSTPLLRNVQFNYP 128 >gi|325473817|gb|EGC77005.1| BatB protein [Treponema denticola F0402] Length = 286 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 39/158 (24%), Gaps = 43/158 (27%) Query: 252 CTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 PLS N + S +N L+P TN + A + K Sbjct: 103 SVPLSFEKNALSSAINALSPLILSSTGTNLEAGVLRALDSF----------GENRGNSKI 152 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---------PPEGQDLLR 358 ++ TDG + S L E ++ + V +G L+ Sbjct: 153 IVLCTDGGETSGS-------LLHAAEKIKKTDAILIIVGFGTLEETKIKVLDEKGNTQLK 205 Query: 359 K-------------CTDSSGQFFAVNDSRELLESFDKI 383 + + S + I Sbjct: 206 DARLEEAFLKKIASIAGGESTYISALSSGSIETILKII 243 >gi|310114429|ref|XP_003119946.1| PREDICTED: collagen alpha-4(VI) chain-like [Homo sapiens] Length = 481 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 18/117 (15%), Positives = 39/117 (33%), Gaps = 16/117 (13%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T T A++ + K +++++I ITDG + Sbjct: 138 GTRTGKALNFTLPFFDSSKGGRP------SVQQYLIVITDGVAQDNVIIP--------AK 183 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +R+ I+++ V + LL T+ + + L +I K+ + Sbjct: 184 ALRDKNTIIFAIGVG-EAKKSQLLE-ITNDEDNVYHDVNFEALQNLEKEILSKVCDP 238 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 18/94 (19%), Positives = 38/94 (40%), Gaps = 12/94 (12%) Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 G + +++I ITDG++S + + E +R G+ IY++ + D Sbjct: 6 ADTGRINVARYLIVITDGKSSDS--------VAEAAEGLRANGVNIYAIGI--REANIDE 55 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 L++ + F V + L + ++ I Sbjct: 56 LKEIAK--DKIFFVYEFDLLKDIQKEVVQDICSS 87 >gi|198419820|ref|XP_002120348.1| PREDICTED: similar to type A von Willebrand factor (VWFA) domain-containing protein [Ciona intestinalis] Length = 863 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 34/180 (18%), Positives = 66/180 (36%), Gaps = 27/180 (15%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 + + ++ G +SN ++ LN LN T PA+ ++ Sbjct: 430 SDFQLYDFDFIFSTGKTAPIPGTISNTRVQLSRALNSLNTSGGTALGPALLYSVALA--- 486 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC--------EYMRNAGMKIY 343 S R VI TDG + T + QIC E + G+ + Sbjct: 487 ---------SQRPGSQVIVCTDGRANMGIGSLETEDDYQICHHFYDSTTEMALSNGVIVS 537 Query: 344 --SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK---IQEQSVRIAPNR 398 S+ V +L R + ++G+ V D + ++F+ I + + SV+++ ++ Sbjct: 538 MMSI-VGTDCRLVELGRVASKTAGKVSIV-DLENISKTFEDILSESVLATQVSVQLSTHK 595 >gi|291008392|ref|ZP_06566365.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] Length = 538 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 63/179 (35%), Gaps = 13/179 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLS--VRIGTIAYNIGIVGNQCTPLS--NNLNEVKS 264 +I L + + A+ + + + + ++ + ++ +L V+ Sbjct: 365 RIAALRSAFAGFSGADPSAVGKFVRFYKGEMVTIMRFGGHVLDERDFTITGQADLKAVQD 424 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + T + +++ T ++ +TDGEN+ + Q+ Sbjct: 425 YIAADRFDQTTGVWS------ALEAAYAKAAAATRDHPEQPVTIMLMTDGENNAGISLQD 478 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 L Q + A + Y+V G+ L + + G+ ++ L E+F +I Sbjct: 479 FLRNHQARDPAAKA-VHTYTVRFGEANPGE-LDQAARATGGRMVDA-NATSLSEAFKEI 534 >gi|323498921|ref|ZP_08103904.1| hypothetical protein VISI1226_07138 [Vibrio sinaloensis DSM 21326] gi|323316033|gb|EGA69061.1| hypothetical protein VISI1226_07138 [Vibrio sinaloensis DSM 21326] Length = 418 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 20/108 (18%), Positives = 37/108 (34%), Gaps = 2/108 (1%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 + ++V +ID H+M +N++Q+ALD A L+G D + Sbjct: 23 VSMAVIVGVAALSIDANHLMVSKNRLQNALDTAALAGATVANRTYEEDDAKEAIVEAYNK 82 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQY 111 + + G + I + + N+ S A Y Sbjct: 83 VTSAAGND--ELVLAASDDGTSLKSLTIEYSDNANSGFSSNFPSSADY 128 >gi|257876693|ref|ZP_05656346.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC20] gi|257810859|gb|EEV39679.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC20] Length = 1195 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 28/270 (10%), Positives = 80/270 (29%), Gaps = 29/270 (10%) Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + I + +V+D S SM ++ + + S + S Sbjct: 347 NVQRQIDPIDVVLVVDWSGSMNEMGRIAEVKKGVDRFLNQIEGSGIQDSVYMGYVGYSSD 406 Query: 200 APAPAPAN---RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 K + E+ ++ + + G + L+ Sbjct: 407 GSNYQNKTCQLGKFSEVKETIRSMTPETAAGGTFTQRGLRQAGDMLSTQNGHKKVIVLLT 466 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + + ++K++ A+ ++ ++ F D +N Sbjct: 467 DGVPTYSYHVSKVHT---QADGSYYGTAFSLTQDQPMNTSFLYNG-------YFAFDQQN 516 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---------APPEGQDLLRKC--TDSSG 365 + + T+ ++ G++I+ + + + + +R+ D G Sbjct: 517 NYKWINNTFIATIGEAMALKERGIEIHGLGIQLQGDQTAGYTKADVEKKMRQMVSADEDG 576 Query: 366 QFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + + E + I D ++++++ IA Sbjct: 577 HLYY-ESANEAAD----IADYLEKKALHIA 601 >gi|15645056|ref|NP_207226.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori 26695] gi|2313537|gb|AAD07497.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori 26695] Length = 185 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 72/193 (37%), Gaps = 34/193 (17%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +I+VL ++ ++++ + KK L ++ + + L + ++K+ Sbjct: 6 IRTRIEVLNLCIQKMIETLKQ--EAKKELFSKMAIVTFGGNGAV-----LHTDFGDIKN- 57 Query: 266 LN--KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA-- 321 +N L+ T A A + ++ +T + K + I ++DGE + Sbjct: 58 INFKPLSTSGGTPLDQAFRLAKDLIEDK-----DTFPTKFYKPYSILVSDGEPNNDKWQE 112 Query: 322 -----YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + + +C +S+ + + + + F +D +L Sbjct: 113 PLFNFHHDGRSAKSVC----------WSIFIGDRNDNPQVNKD--FGKDGVFYTDDVEKL 160 Query: 377 LESFDKITDKIQE 389 ++ F+ +T I + Sbjct: 161 VKLFEIMTQTISK 173 >gi|331006836|ref|ZP_08330094.1| hypothetical protein IMCC1989_793 [gamma proteobacterium IMCC1989] gi|330419347|gb|EGG93755.1| hypothetical protein IMCC1989_793 [gamma proteobacterium IMCC1989] Length = 693 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 24/202 (11%), Positives = 63/202 (31%), Gaps = 16/202 (7%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 + N ++ + LV + K + + + V Q Sbjct: 27 SDVRLVIDISGSMKKNDPQNLRRPALDMLVQLLPKGSKAGIWTFGQYVNMLVPHKPVDAQ 86 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 ++ + + + TN A+ A + + VI + Sbjct: 87 WGRTASAASSEIKSIAQF-----TNIGAALEKA------AYDHKQQMKADQDYQTHVILL 135 Query: 312 TDGENSGASAYQNTLNTLQ-----ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 TDG + Q + + +G+ ++++A+S + + L + + G+ Sbjct: 136 TDGMVDIDRDNRLNKKERQRILNNVLPMYQQSGITLHTIALSDNADKKLLNKLALATDGK 195 Query: 367 FFAVNDSRELLESFDKITDKIQ 388 ++ EL+ F ++ ++ Sbjct: 196 VSVAKNAEELMNVFLRVFNQAV 217 >gi|313222038|emb|CBY39058.1| unnamed protein product [Oikopleura dioica] Length = 1721 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 32/273 (11%), Positives = 75/273 (27%), Gaps = 17/273 (6%) Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 + + + ++ + D+ + + + Sbjct: 146 LGNKAQYEKANKIDPYGEIYQFKDVASLLQQDIVDEIIEDICLSVEEKTITPHVDVQNIH 205 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 ++ + + + K ES V K E + G + Y Sbjct: 206 DHVGDCSIEHPSADIFFVLDGSYSTKRAGW-ESIIEFVQRFVKINFEHG-GDMNYGLLQY 263 Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLN-----PYENTNTYPAMHHAYRELYNEKESSHNT 298 + + E+ + L+ +T T AM +A + + + Sbjct: 264 SDFVEPILSFADKEGQREIAEFIEILSQIKYHSGFSTLTGTAMRYAAETEFMSERGAR-- 321 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 ++K +I +TDG + + E +R AG + VAV + L Sbjct: 322 ---KNVRKIMIVVTDGRA----KDIDDNTVKIVGESLRAAG-DLTVVAVGVNKAVESELV 373 Query: 359 KCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + S +L + +I +++ Sbjct: 374 EIASSPDFVHNTKKFEDLHNFLSPVFAEICKET 406 >gi|50913505|ref|YP_059477.1| collagen adhesion protein [Streptococcus pyogenes MGAS10394] gi|19108443|gb|AAL11466.1| FctX [Streptococcus pyogenes] gi|50902579|gb|AAT86294.1| Collagen adhesion protein [Streptococcus pyogenes MGAS10394] gi|198417345|gb|ACH87890.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417347|gb|ACH87891.1| ancillary protein 1 [Streptococcus pyogenes] Length = 1036 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 44/296 (14%), Positives = 83/296 (28%), Gaps = 55/296 (18%) Query: 144 ENLAISICMVLDVSRSMEDLYLQKHND---NNNMTSNKYLLPPPPKKSFWSK-------- 192 + + I +V+D S SM++ + S Sbjct: 436 KKNPLDILVVVDKSGSMQEGIGSVQRYRYYAQRWDDYYSQWVYHGTFDYSSYQGESFNRG 495 Query: 193 -NTTKSKYAPAPAPANRKIDVLIES---AGNLVNSIQKAIQEKK---------------- 232 + + + + R+ D + S L+ E K Sbjct: 496 QIHYRYRGIVSVSDGIRRDDAVKNSLLGVNGLLQRFVNINPENKLSVIGFQGSADYHAGK 555 Query: 233 ---NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP--YENTNTYPAMHHAYRE 287 + S R G N+ + + N + N L TN + A+ A Sbjct: 556 WYPDQSPRGGFYQPNLNNSRDAELLKGWSTNSLLDP-NTLTALHNNGTNYHAALLKAKEI 614 Query: 288 LYNEKESSHNTIGSTRLKKF--VIFITDGENSGASAYQNTLN-------TLQICEYMRNA 338 L K+ I F DG SG + + N + + + Sbjct: 615 LNEVKDDGRRKIMIFISDGVPTFYFGEDGYRSGNGSSNDRNNVTRSQEGSKLAIDEFKAR 674 Query: 339 --GMKIYSVAVSAPPEGQD------LLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + IYS+ VS +L+ ++ + D+ EL ++ +KI + Sbjct: 675 YPNLSIYSLGVSKDINSDTASSSPVVLKYL-SGEEHYYGITDTAELEKTLNKIVED 729 >gi|2623765|gb|AAB86530.1| Lu-ECAM-1 [Bos taurus] Length = 794 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 42/112 (37%), Gaps = 21/112 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + ++ + + + +I +TDGE++ ++ Sbjct: 382 ANGGTSICRGLKAGFQAIIHS--------DQSTSGSEIILLTDGEDNEINSCF------- 426 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN-DSRELLESF 380 E ++ +G I+++A+ P L ++ + G F N D L +F Sbjct: 427 --EDVKRSGAIIHTIALG--PSAAKELETLSNMTGGYRFFANKDITGLTNAF 474 >gi|116671476|ref|YP_832409.1| hypothetical protein Arth_2930 [Arthrobacter sp. FB24] gi|116611585|gb|ABK04309.1| hypothetical protein Arth_2930 [Arthrobacter sp. FB24] Length = 353 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 23/154 (14%), Positives = 48/154 (31%), Gaps = 12/154 (7%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI++ V F A+D+ + R Q+++ + K Sbjct: 28 IVAIMLVVLLGFGAVAVDVGMLYAERTQLRNG-----------ADAAALAVAQKCAKSAP 76 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT-ENLF 119 S+ S NA D + I N + A ++ + + P +LF Sbjct: 77 SSSDADCSNTSTLAASLANSNANDGRSNIKSIILDTTNRKVTVTAGAQEKGKSPNEVSLF 136 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 ++ ++ ST + + + V Sbjct: 137 FARVLGMNSAEVNAPSTVVWGSPEKGTSPFPITV 170 >gi|325089698|gb|EGC43008.1| U-box domain-containing protein [Ajellomyces capsulatus H88] Length = 759 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 69/206 (33%), Gaps = 15/206 (7%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE---SAGNLVNSIQKAIQEKKNLSVRI 238 P P + + S + AP P + E S +L + I E N + R+ Sbjct: 70 PHVPCDIVLCIDVSYSMQSSAPLPTTDESGEREETGLSVLDLTKHAARTIIETLNENDRL 129 Query: 239 GTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 G +A++ N N + L P +TN + + + NE+ + Sbjct: 130 GIVAFSTEAEVVYKISKMNESNKKAALKAVEALKPLSSTNLWHGLKLGLKAFENERRT-- 187 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK-IYSVAVSAPPEGQD 355 + + + +TDG + Q + L+ + M I++ Sbjct: 188 -----PQSVQALYVLTDGMPNHMCPKQGYVTKLRPILQLLGHRMPMIHTFGFGYNIRS-G 241 Query: 356 LLRKCTD-SSGQFFAVNDSRELLESF 380 LL+ + G F + D+ + F Sbjct: 242 LLQAIAEVGGGTFAFIPDAGMIGTVF 267 >gi|301780608|ref|XP_002925750.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XX) chain-like [Ailuropoda melanoleuca] Length = 1307 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 51/162 (31%), Gaps = 20/162 (12%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY-RELYNE 291 V++G Y+ V + + L NT T A+ H + L Sbjct: 209 NKVQVGLTQYSGDPQTEWDLNAFRTKEVVLAAVYSLRYKGGNTFTGLALTHVREQNLKPG 268 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 K VI +TDG++ + +++ G+ +++V V Sbjct: 269 AGPRPEA------AKVVILVTDGKSQD--------DARAAGRILKDLGVAVFAVGV--KN 312 Query: 352 EGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKIQEQS 391 + L+ V D +L D ++ I ++ Sbjct: 313 ADEAELQLLASPPLDITIHNVQDFPQLSTLADLLSRLICQKV 354 >gi|220909014|ref|YP_002484325.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219865625|gb|ACL45964.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 589 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 52/172 (30%), Gaps = 20/172 (11%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 N+ N++ I + + ++ L NT+ Y Sbjct: 427 NVQNTLNTYINGLSPQDQVALMRFSSDVGTPVVVDGTPAGRDRGLQFISSLRANGNTHLY 486 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A A L S V+ +TDGE++G++ L ++ + Sbjct: 487 DATLAARNWLTQNLRSDAI--------NAVLVLTDGEDTGSAISLEQLGP-----ELQKS 533 Query: 339 G------MKIYSVAVS-APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 G + ++V L++ + +G +++ D + + Sbjct: 534 GFNSDQRISFFTVGYGEEGEFDPQALQQIANVNGGYYSKGDPASIGRLMADL 585 >gi|260642112|ref|ZP_05414581.2| von Willebrand factor type A domain protein [Bacteroides finegoldii DSM 17565] gi|260623548|gb|EEX46419.1| von Willebrand factor type A domain protein [Bacteroides finegoldii DSM 17565] Length = 615 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 29/291 (9%), Positives = 76/291 (26%), Gaps = 30/291 (10%) Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 ++ + + A Y + L P+ + Sbjct: 155 SVADTPLSTFSIDVD-PASYSNMRRFINRGELPPADAIRTEELVNYFSYDYPKPTGNDPV 213 Query: 152 MVLDVSRSMEDLY--LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK 209 + + + LP + Sbjct: 214 KITVEAGTCTWNTAHRLVRIGLKAKEIPTEQLPASNLVFLID--------ISGSMWGANR 265 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 +D++ S LVN+++ ++ + Y + +++ +++L Sbjct: 266 LDLVKSSLKLLVNNLRNKD--------KVAIVTYAGSAGVKLEATSGGDKQKIREAIDEL 317 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +T +H AY+ S N +I +DG+ + + L Sbjct: 318 TAGGSTAGGAGIHLAYQIAKKNFISDGNNR--------IILCSDGDFNVGVSSAEGL--E 367 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLES 379 Q+ E R +G+ + + ++ + G +++ +E Sbjct: 368 QLIEKERKSGVHLTVLGYGMGNYKDKKIQVLAEKGNGNHAYIDNLQEANRV 418 >gi|47212619|emb|CAF92825.1| unnamed protein product [Tetraodon nigroviridis] Length = 533 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 59/217 (27%), Gaps = 47/217 (21%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 R+ + + ++ ++ K+ +R+ I ++ + L+ N ++ Sbjct: 28 YFLCRRSGSVKNHWEEIYYFVENLAEKFKSPMLRMSFITFSSRA--STIMKLTENRINIR 85 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 LN L P +T + + A ++ + +I +TDGE Sbjct: 86 KGLNALKREIPGGDTIMHLGLQKANEQI---------KRENFGPASVIIALTDGELQEDE 136 Query: 321 AYQNTLNTLQI------------------------------CEYMRNAGMKIYSVAVSAP 350 E R+ G +Y V V Sbjct: 137 LISAQQEVAASRSMSAVAAVSALAAFSIPDVMDPFPVFPQQAETARSLGAIVYCVGV--K 194 Query: 351 PEGQDLLRKCTDSSGQFFAV-NDSRELLESFDKITDK 386 + L D+ F V L + I K Sbjct: 195 DFNETQLATIADTIEHVFPVLGGFHALRGVINSIIKK 231 >gi|229489135|ref|ZP_04383001.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121] gi|229324639|gb|EEN90394.1| von Willebrand factor, type A [Rhodococcus erythropolis SK121] Length = 551 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 20/128 (15%), Positives = 47/128 (36%), Gaps = 11/128 (8%) Query: 256 SNNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 +++ +++ SR++ L+ T Y ++ AYR + + + VI +TD Sbjct: 423 TDHRSKLLSRIDSLSSIVGGGTGLYDSVLAAYRSMQQTYDPASI--------NSVILLTD 474 Query: 314 GENSGASAYQNTLNTLQICEYMR-NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 G N S+ + + I ++ V+ + L + + G Sbjct: 475 GANDDPSSISLQELLDTLTREQDPTRPVPIITIGVTDDADTDVLEQISALTGGNSHFAPT 534 Query: 373 SRELLESF 380 ++ + F Sbjct: 535 PADIPKVF 542 >gi|300788143|ref|YP_003768434.1| hypothetical protein AMED_6298 [Amycolatopsis mediterranei U32] gi|299797657|gb|ADJ48032.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 534 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 21/113 (18%), Positives = 37/113 (32%), Gaps = 11/113 (9%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 P T Y ++ YR E E VI +TDG N Sbjct: 421 TPDGGTGLYDSVLDTYRTARREWEPGRL--------NLVIVMTDGRNEDPRGISRADLLT 472 Query: 330 QIC--EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 ++ + R + + V + + +L + + GQ F D ++ + F Sbjct: 473 ELAGLQDAR-RPIPLIGVGIGPDADKAELDQLTAATGGQAFLAPDPAKITDVF 524 >gi|307719356|ref|YP_003874888.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6192] gi|306533081|gb|ADN02615.1| von Willebrand factor type A [Spirochaeta thermophila DSM 6192] Length = 331 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 51/156 (32%), Gaps = 50/156 (32%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T+ + A + ++ VI ++DGE + E Sbjct: 167 GTDVASGLSRALEAFPQQSNR----------QRLVILLSDGEALTGEIGP-------VLE 209 Query: 334 YMRNAGMKIYSVAVSAPPE-----------------------GQDLLRKCTD-SSGQFFA 369 RN G+ +++V + LL++ + + G+FF+ Sbjct: 210 LARNLGVAVHTVGIGTESGGPVPLEGEDVLKKPSGEPVISRLDASLLKRIAEITGGRFFS 269 Query: 370 VNDSRELLESFDKITDKIQEQS-------VRIAPNR 398 V D+ ++F + I+E +R+ P R Sbjct: 270 VRDAEG--QTFQHVVSTIEETIAREEREGIRLVPAR 303 >gi|198426244|ref|XP_002124558.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 430 Score = 50.7 bits (119), Expect = 4e-04, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 57/164 (34%), Gaps = 20/164 (12%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 S+RIG YN + + L+ N+ + + + + L TNT A+ H Sbjct: 269 TIDQSSLRIGAFRYNRRVHTDTQILLNQFNNDKDGLLAAIQSLPYNGGGTNTGRAIAHVT 328 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + ++ V+ ITDG + +R G ++ V Sbjct: 329 NVMLKAENG-----NRPDVQDLVVLITDGRAQD--------RVDLVSADLRATGAVVFVV 375 Query: 346 AVSAPPEGQDL--LRKCTDSSGQFFAVND-SRELLESFDKITDK 386 AV P L + + + ++ V+ L +F + K Sbjct: 376 AVILPGSTIRLSQMLEISGTNETLLIVDSGFDGLDTAFSSMLTK 419 >gi|88601603|ref|YP_501781.1| magnesium chelatase, ChlI subunit [Methanospirillum hungatei JF-1] gi|88187065|gb|ABD40062.1| protoporphyrin IX magnesium-chelatase [Methanospirillum hungatei JF-1] Length = 680 Score = 50.7 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 28/190 (14%), Positives = 71/190 (37%), Gaps = 26/190 (13%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 A ++++ + +L+ + R+G +A+ + PL++++ Sbjct: 497 ASGSMGVEQRMEAAKGAIFSLLE-------DSYQNRDRVGLVAFRGEGA-DVVLPLTSSI 548 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + RL++L T + + L EK+ L ++ ITDG + Sbjct: 549 DLAYQRLSELPTGGKTPLAAGLQKSLTILMREKQKYP------SLLPLLVLITDGRANVG 602 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK-------CTD-SSGQFFAVN 371 + + I + + AG++ +V + + + L + G++F ++ Sbjct: 603 NGGKLKDEIGLITDDLVKAGIE--TVIIDTESKQKGRLSIQLGFCPFIAQRTQGRYFQIS 660 Query: 372 D--SRELLES 379 D + +L + Sbjct: 661 DLTASDLSSA 670 >gi|66576258|gb|AAY51689.1| conserved hypothetical protein [Chlorobium tepidum TLS] Length = 350 Score = 50.7 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 18/164 (10%), Positives = 51/164 (31%), Gaps = 11/164 (6%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIK-DPTTKKDQ 59 + A+++ V I A+DL + ++ ++Q+A DAA L+G S++ + + + Sbjct: 20 LFALVLMVLVGLIALAVDLTRLHLVKAELQNAADAAALAGAGSLIDTSLQTFNWSAATAK 79 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA----------ESKA 109 G + +E ++A + + + Sbjct: 80 AQEFADVNSADGKTIGQHRQEQDVNVAIQPGYWNLITPSFTSNTGLVTHTGDGNIPAVQV 139 Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 + F ++ + + + + + + Sbjct: 140 TITLSHLKFFFAPILGIPEGTVQATAIAAVSPPTGGTGLFPMAI 183 >gi|45767838|gb|AAH67716.1| Col14a1 protein [Danio rerio] Length = 747 Score = 50.7 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 16/89 (17%), Positives = 35/89 (39%), Gaps = 16/89 (17%) Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 + K ++ +TDG + +I + M+ G I+++ + G+ L Sbjct: 15 RGIPKVLVVLTDGRSQDDVN--------KISKEMQMEGYIIFAIGFADADYGE--LVNIA 64 Query: 362 DSSG--QFFAVNDSRELLESFDKITDKIQ 388 F V+D ++F KI +++ Sbjct: 65 SKPSERHVFFVDDL----DAFKKIEEQLI 89 >gi|319780570|ref|YP_004140046.1| hypothetical protein Mesci_0829 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317166458|gb|ADV09996.1| hypothetical protein Mesci_0829 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 359 Score = 50.7 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 32/321 (9%), Positives = 73/321 (22%), Gaps = 25/321 (7%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLS--------------GCASIVSDRT 49 + + I +++D + R+ Q+ALDAA+LS Sbjct: 22 LAMIPLITAIGFSVDYTSAVQTRSTEQAALDAAILSITTMDTASTKPQRQVAMQASYMAN 81 Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI-TKDKNNPLQYIAESK 108 T + + IA+ + + N + E+ Sbjct: 82 GGQGTATLNSFDVSANGTATAQASASFAMPTVFMQIARIPSVAVGVTSAVNKAPALVEAT 141 Query: 109 AQYEIPTENLF-LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 + + L + + + I+ + + K Sbjct: 142 FKVTGVSGYWNKKMTLYGTMFGAAAGKPLMTIDYAYGKTGDPKGYGTTTVSVLTTDSAGK 201 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR------KIDVLIESAGNLV 221 L+ + +I+V + L Sbjct: 202 TVTTVAQKQVCKLVDSSTPAGAVIQTDGFQTKYYCVDTMYPADGAGAQINVSQMAGLYLQ 261 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN--KLNPYENTNTYP 279 + K + + G+ +Q + N ++ L P T Y Sbjct: 262 MDVPSGNPSKLMSNDPTTSNRLYSGLKSDQTGAIDYNEMATGQIVDIFGLVPCGATA-YQ 320 Query: 280 AMHHAYRELYNEKESSHNTIG 300 A ++ ++ Sbjct: 321 AWEDGGNKVPAPVSNADFFYN 341 >gi|134101426|ref|YP_001107087.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] gi|133914049|emb|CAM04162.1| von Willebrand factor, type A [Saccharopolyspora erythraea NRRL 2338] Length = 501 Score = 50.7 bits (119), Expect = 5e-04, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 63/179 (35%), Gaps = 13/179 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLS--VRIGTIAYNIGIVGNQCTPLS--NNLNEVKS 264 +I L + + A+ + + + + ++ + ++ +L V+ Sbjct: 328 RIAALRSAFAGFSGADPSAVGKFVRFYKGEMVTIMRFGGHVLDERDFTITGQADLKAVQD 387 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + + T + +++ T ++ +TDGEN+ + Q+ Sbjct: 388 YIAADRFDQTTGVWS------ALEAAYAKAAAATRDHPEQPVTIMLMTDGENNAGISLQD 441 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 L Q + A + Y+V G+ L + + G+ ++ L E+F +I Sbjct: 442 FLRNHQARDPAAKA-VHTYTVRFGEANPGE-LDQAARATGGRMVDA-NATSLSEAFKEI 497 >gi|327330197|gb|EGE71946.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL097PA1] Length = 322 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 51/154 (33%), Gaps = 25/154 (16%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 P S + V ++ + + T A+ + + + + ++ ++D Sbjct: 150 PPSTDRPTVLRAVDGIELQDGTALGGAIDKSLEAVK----MAPGGSKNPAPA-AIVMLSD 204 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKC 360 G+N+ + N + + +Y++A LL Sbjct: 205 GDNTQGGSPLVAANRAAAAK------VPVYTIAFGTETGYVDLNGQRERVAPDTKLLSTV 258 Query: 361 TD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 D + + + + + +L E + ++ + + VR Sbjct: 259 ADRTHAKSWTADSADKLQEVYQQVHSSVGYEPVR 292 >gi|50842461|ref|YP_055688.1| aerotolerance protein BatA [Propionibacterium acnes KPA171202] gi|289427106|ref|ZP_06428822.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|295130538|ref|YP_003581201.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|50840063|gb|AAT82730.1| conserved protein, putative BatA (bacteroides aerotolerance operon) [Propionibacterium acnes KPA171202] gi|289159575|gb|EFD07763.1| von Willebrand factor type A domain protein [Propionibacterium acnes J165] gi|291375881|gb|ADD99735.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK137] gi|313764513|gb|EFS35877.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA1] gi|313772104|gb|EFS38070.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL074PA1] gi|313792200|gb|EFS40301.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA1] gi|313801849|gb|EFS43083.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA2] gi|313807458|gb|EFS45945.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA2] gi|313809968|gb|EFS47689.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA1] gi|313813000|gb|EFS50714.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA1] gi|313816054|gb|EFS53768.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA1] gi|313818503|gb|EFS56217.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA2] gi|313820269|gb|EFS57983.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA1] gi|313822922|gb|EFS60636.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA2] gi|313825146|gb|EFS62860.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA1] gi|313827717|gb|EFS65431.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL063PA2] gi|313830297|gb|EFS68011.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL007PA1] gi|313833671|gb|EFS71385.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL056PA1] gi|313838673|gb|EFS76387.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL086PA1] gi|314915507|gb|EFS79338.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA4] gi|314918208|gb|EFS82039.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA1] gi|314920023|gb|EFS83854.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA3] gi|314925156|gb|EFS88987.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL036PA3] gi|314932037|gb|EFS95868.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL067PA1] gi|314955907|gb|EFT00307.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA1] gi|314958219|gb|EFT02322.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA1] gi|314960060|gb|EFT04162.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA2] gi|314962859|gb|EFT06959.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA1] gi|314967773|gb|EFT11872.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL037PA1] gi|314973302|gb|EFT17398.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA1] gi|314975980|gb|EFT20075.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL045PA1] gi|314978483|gb|EFT22577.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA2] gi|314984001|gb|EFT28093.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA1] gi|314989988|gb|EFT34079.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA3] gi|315078074|gb|EFT50125.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL053PA2] gi|315080702|gb|EFT52678.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL078PA1] gi|315084374|gb|EFT56350.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL027PA2] gi|315085715|gb|EFT57691.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL002PA3] gi|315088865|gb|EFT60841.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL072PA1] gi|315096217|gb|EFT68193.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL038PA1] gi|315098475|gb|EFT70451.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL059PA2] gi|315101165|gb|EFT73141.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL046PA1] gi|315108386|gb|EFT80362.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA2] gi|327326129|gb|EGE67919.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL096PA2] gi|327331996|gb|EGE73733.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL096PA3] gi|327443198|gb|EGE89852.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL013PA2] gi|327445983|gb|EGE92637.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA2] gi|327448037|gb|EGE94691.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL043PA1] gi|327450841|gb|EGE97495.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA3] gi|327453082|gb|EGE99736.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL092PA1] gi|327453813|gb|EGF00468.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL083PA2] gi|328753529|gb|EGF67145.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL020PA1] gi|328754260|gb|EGF67876.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL087PA1] gi|328754489|gb|EGF68105.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL025PA2] gi|328760649|gb|EGF74216.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL099PA1] gi|332675378|gb|AEE72194.1| hypothetical protein PAZ_c10190 [Propionibacterium acnes 266] Length = 322 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 51/154 (33%), Gaps = 25/154 (16%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 P S + V ++ + + T A+ + + + + ++ ++D Sbjct: 150 PPSTDRPTVLRAVDGIELQDGTALGGAIDKSLEAVK----MAPGGSKNPAPA-AIVMLSD 204 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKC 360 G+N+ + N + + +Y++A LL Sbjct: 205 GDNTQGGSPLVAANRAAAAK------VPVYTIAFGTETGYVDLNGQRERVAPDTKLLSTV 258 Query: 361 TD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 D + + + + + +L E + ++ + + VR Sbjct: 259 ADRTHAKSWTADSADKLQEVYQQVHSSVGYEPVR 292 >gi|308476046|ref|XP_003100240.1| hypothetical protein CRE_21951 [Caenorhabditis remanei] gi|308265764|gb|EFP09717.1| hypothetical protein CRE_21951 [Caenorhabditis remanei] Length = 879 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 30/145 (20%), Positives = 57/145 (39%), Gaps = 16/145 (11%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKE 293 VR+G I Y+ + N++ + L L P E+T T A+ A E++N Sbjct: 70 VRVGLIQYSDAAKTEFNLSRYSERNDIITHLETLTFMPGEDTRTGVALDKADEEMFNYIG 129 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + + +I TDG + + + +R G+KIY+++V++ Sbjct: 130 GARLKAT-----RLIILFTDGL--------SMDKPTKSAKTLRRKGVKIYTISVNSIGFV 176 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLE 378 ++L + F D + E Sbjct: 177 PEML-GIVGDADNVFGPTDEDRIEE 200 >gi|300776964|ref|ZP_07086822.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] gi|300502474|gb|EFK33614.1| von Willebrand factor type A domain protein [Chryseobacterium gleum ATCC 35910] Length = 800 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/276 (11%), Positives = 78/276 (28%), Gaps = 25/276 (9%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 A Y + ++ + + + + + + + K Sbjct: 360 NASYSNVRRMINNGQVVDKNAVRIEEMVNYFKYDYPQPKNENPFSI--NTEYSDAPWNPK 417 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 H K L S K+ +L S L+N ++ Sbjct: 418 HKLLKIGLQGKNLPMDKLPASNLVFLID----VSGSMSDENKLPLLKSSFKVLLNQLRPK 473 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 ++G + Y + +++ L++L +T + AY+ Sbjct: 474 D--------KVGIVVYAGSAGMVLPPTSAGEKDKIIEALDRLQAGGSTAGGAGIELAYKL 525 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 N VI TDG+ + ++ + + + E R +G+ + + Sbjct: 526 AQENFVKEGNNR--------VIIATDGDFNVGTSSIS--DLKTLIEDRRKSGVFLTCLGF 575 Query: 348 SAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 + L D G + +++ +E + K Sbjct: 576 GMGNYKDNTLETLADKGNGNYAYIDNMQEANKFLGK 611 >gi|66774330|gb|AAY55950.1| complement component Bf/C2-A [Ginglymostoma cirratum] Length = 753 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 35/309 (11%), Positives = 83/309 (26%), Gaps = 46/309 (14%) Query: 101 LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM 160 + E + ++ + S + S ++ + + Sbjct: 172 IGSKVEYACNDGLVLVGSNIRQCLESREWSGREPSCQHKSSFDTPGDVASAFTASFTSML 231 Query: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL 220 + + L P + + + + + NL Sbjct: 232 GLTQTGEGKQPASTARRIILAKDTPLHIYILLD-------ASESVGEANFEKAKNVIQNL 284 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSRL-NKLNPYEN--- 274 ++ I + + G ++Y + + N +EV +L N Sbjct: 285 IDKIASFD-----VRPKFGVVSYASEPITISSINDKETANADEVIYQLENAEEAKYGVHA 339 Query: 275 ----TNTYPAMHHAYRELYNEKESSHNTIGSTRLKK-FVIFITDGENSGASAYQNTLNTL 329 TN + A + K S + I TDG+++ + Sbjct: 340 DKRGTNIHAAFKKVMEMMVLSK---IIYKDSWEEIRFVTILFTDGKSNMGG------DPR 390 Query: 330 QICEYMRNAGMK----------IYSVAVSAPPEGQDLLRKCTD---SSGQFFAVNDSREL 376 +++ + +YS V + L + F V ++ +L Sbjct: 391 TAVREIKDFVIAQNKSLIDYLDMYSFGV-SDDANMLELNALSSQKHGEKHCFIVKNTLDL 449 Query: 377 LESFDKITD 385 +++FD+I D Sbjct: 450 IKAFDEILD 458 >gi|30794326|ref|NP_851361.1| epithelial chloride channel protein [Bos taurus] gi|2623763|gb|AAB86529.1| Lu-ECAM-1 [Bos taurus] Length = 905 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 42/112 (37%), Gaps = 21/112 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + ++ + + + +I +TDGE++ ++ Sbjct: 382 ANGGTSICRGLKAGFQAIIHS--------DQSTSGSEIILLTDGEDNEINSCF------- 426 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN-DSRELLESF 380 E ++ +G I+++A+ P L ++ + G F N D L +F Sbjct: 427 --EDVKRSGAIIHTIALG--PSAAKELETLSNMTGGYRFFANKDITGLTNAF 474 >gi|326779287|ref|ZP_08238552.1| Protein of unknown function DUF3520 [Streptomyces cf. griseus XylebKG-1] gi|326659620|gb|EGE44466.1| Protein of unknown function DUF3520 [Streptomyces cf. griseus XylebKG-1] Length = 575 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/311 (10%), Positives = 80/311 (25%), Gaps = 22/311 (7%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAES-KAQYEIPTENLFLKGLIPSALTNLSL 133 G+ ++ + + + A Y L L Sbjct: 93 EGRSAAPTGEPGDESGRPDVAAPDYLSTFALDVDTASYGYARRTLGDGRLPAPEEVRPEE 152 Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN 193 + + +D +R ++ + + L S + Sbjct: 153 FVNSFRQGYERPKGSGFSVNVDGARIGAGKGGGGGTGASDWSLLRVGLATEAAPSTAERP 212 Query: 194 T---TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 T ++D++ +S L + ++ + + ++ Sbjct: 213 PAALTFVVDISGSMAETGRLDLVRKSLAVLTDELRDDDS--------VSLVTFSDAAETR 264 Query: 251 QCTP-LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 L N N +K + ++ P ++TN + Y E V+ Sbjct: 265 LPMTRLQGNRNRIKDVVEEMRPEQSTNVEAGITRGYEESVEGHRKGATNR--------VV 316 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 ++D + + + R G+ ++ V V + + R G Sbjct: 317 LLSDALANTGDTEADGILERIDSTR-REYGITLFGVGVGSDYGDAFMERLTNKGDGNTTY 375 Query: 370 VNDSRELLESF 380 V D + + F Sbjct: 376 VGDEAQARKVF 386 >gi|124007374|ref|ZP_01692081.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] gi|123987207|gb|EAY26947.1| von Willebrand factor type A domain protein [Microscilla marina ATCC 23134] Length = 1088 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 54/173 (31%), Gaps = 23/173 (13%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + K+ +L ES L++ ++ + + Y ++N Sbjct: 920 VSGSMSSKDKLPLLKESFKYLISIMRPQDD--------VSIVIYAGDAAIVLKPTSASNQ 971 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++ + ++KL TN AY+ + + N +I TDGE + Sbjct: 972 EQINAVIDKLRSRGKTNVKAGFKLAYKWMSKNFKEGGNNR--------IILATDGEFPIS 1023 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVN 371 + G+ + + + + + L K G + VN Sbjct: 1024 KYIYKLVEKRAT------KGINLSVFSFGSMTKKFETLEKLVAKGKGNYEQVN 1070 >gi|257867801|ref|ZP_05647454.1| von Willebrand factor type A/Cna B-type domain-containing protein [Enterococcus casseliflavus EC30] gi|257874128|ref|ZP_05653781.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC10] gi|257801884|gb|EEV30787.1| von Willebrand factor type A/Cna B-type domain-containing protein [Enterococcus casseliflavus EC30] gi|257808292|gb|EEV37114.1| von Willebrand factor type A domain-containing protein [Enterococcus casseliflavus EC10] Length = 1191 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 42/407 (10%), Positives = 97/407 (23%), Gaps = 58/407 (14%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP 100 ++ S + T ++ + ++ + +E +Q + + + Sbjct: 198 ASTESSKEVSNEETKEETNETGTATSASEEETLIEPFKQERLYQKTTVSQTFASIEPDYT 257 Query: 101 LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM 160 EN + + + NL S V Sbjct: 258 TDQSGTYPVANWKTDENSNVLNHQGNKDAGETWDGITTWNGDPANLTHSYIEYGGVGDEA 317 Query: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA--PAPAPANRKIDVLIESAG 218 + + + N L +I + + Sbjct: 318 DFALRKFAKETNTPGLFDVYLNVRGNVQRQIDPIDVVLVVDWSGSMNEMGRIAEVKKGVD 377 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYN-IGIVGNQCTPLSNNLNEVKSRLNKLNP---YEN 274 +N I SV +G + Y+ G T +EVK + + P Sbjct: 378 RFLNQI---EGSGIQDSVYMGYVGYSSDGSNYQNKTCQLGKFSEVKETIRSMTPETAAGG 434 Query: 275 TNTYPAMHHAYRELYNEK--------------------ESSHNTIGSTRLKKFV------ 308 T T + A L + H + Sbjct: 435 TFTQRGLRQAGDMLSTQNGHKKVIVLLTDGVPTYSYHVSKVHTQADGSYYGTVFSLTQDQ 494 Query: 309 ----------IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---------A 349 F D +N+ + T+ ++ G++I+ + + Sbjct: 495 PMNTSFLYNGYFAFDQQNNYKWINNTFIATIGEAMALKERGIEIHGLGIQLQGDQTAGYT 554 Query: 350 PPEGQDLLRKC--TDSSGQFFA--VNDSRELLESFDKITDKIQEQSV 392 + + +R+ D G + N++ ++ + +K I Sbjct: 555 KADVEKKMRQMVSADEDGHLYYESANEAADIADYLEKKALHISATVT 601 >gi|260808845|ref|XP_002599217.1| hypothetical protein BRAFLDRAFT_64431 [Branchiostoma floridae] gi|229284494|gb|EEN55229.1| hypothetical protein BRAFLDRAFT_64431 [Branchiostoma floridae] Length = 600 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 33/212 (15%), Positives = 60/212 (28%), Gaps = 27/212 (12%) Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 P KI LV+ Q + + +IG Sbjct: 211 DCVDKVGCCRPDIVFVLDYSGSIPDSEFVKI---KNFVAALVDRFQVGVLDA-----QIG 262 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY---RELYNEKESSH 296 I YN ++ +N +V S ++ + T A Y L + Sbjct: 263 VIRYNHAVIHEFHLNTHDNKADVLSDVSAMPTATTGGTNTAAALTYVASTMLLPGNGNRP 322 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + VI +TDG +SG + + G++ +++ V A Sbjct: 323 DAPD------VVIVLTDGYSSG---------VVGPASVLHGMGVQTFAIGVGACANSAQ- 366 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 L + + + D L + D+I Sbjct: 367 LTQIASCPDYIYRLPDFSALRSITAGMHDQIC 398 >gi|197334600|ref|YP_002156233.1| von Willebrand factor, type A [Vibrio fischeri MJ11] gi|197316090|gb|ACH65537.1| von Willebrand factor, type A [Vibrio fischeri MJ11] Length = 356 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 64/151 (42%), Gaps = 16/151 (10%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYREL-YNEKESSHNTIGS 301 G TP + + + E+ ++ + ++T+ A+ A + + ++ + + Sbjct: 157 GDAAFVQTPFTADQSVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSSEDKASAEENA 216 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLL--- 357 +K I +TDG ++G+ + + +R I+ +A+ P G+ L Sbjct: 217 KPREKVAIVLTDGNDTGSYVEPIDAAKVAAAKGVR-----IHMIAMGDPRTVGEQALDMN 271 Query: 358 ---RKCTDSSGQFFAVNDSRELLESFDKITD 385 R +S G+ F + EL +++D+I + Sbjct: 272 IINRVAKESGGKAFQAINRDELEQAYDEIGE 302 >gi|13473814|ref|NP_105382.1| hypothetical protein mll4535 [Mesorhizobium loti MAFF303099] gi|14024565|dbj|BAB51168.1| mll4535 [Mesorhizobium loti MAFF303099] Length = 373 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 25/205 (12%), Positives = 49/205 (23%), Gaps = 23/205 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + +++ + +++D + R+ Q ALDAA+L T D T+ Q Sbjct: 33 LFFLMLVPIISAVGFSVDYTSAIQTRSNQQQALDAALL--------SITTMDTTSTLAQR 84 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 T + + A +N ++ A + Sbjct: 85 QTALQDSFIAN------------GGLGTATLNSFVAGTTTTPATGQASASFS---MPTIF 129 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + +++ S + N I V LY + Sbjct: 130 MKIARIDSVPIAVASAVSKPPALVNATFKIAKVSGWWNKTMTLYGTQFGATAAKPLMSIE 189 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAP 205 TT Sbjct: 190 YTYNGFGDPKGYGTTNVYTITNNGG 214 >gi|256374467|ref|YP_003098127.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] gi|255918770|gb|ACU34281.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 564 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 36/367 (9%), Positives = 86/367 (23%), Gaps = 50/367 (13%) Query: 32 ALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQI--KKHLKQGSYIRENAGDIAQKA 89 A + AVL+ + R + +I + A Sbjct: 240 ASEQAVLAANRAAGGLRVAASYPAEGTMLLDYPVVRIKRASDQPGTGVAASGFEQALRSA 299 Query: 90 QINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAIS 149 + ++ A + + + + + S + + +L Sbjct: 300 KTRERFVDAGFRTPDGQAAAGLSAERDGVGGDAVNAMPKPSPAEVSELLGTWGAVSLDSR 359 Query: 150 ICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK 209 + VLDVS SM +L + + S Sbjct: 360 MLAVLDVSGSMTELMGNGQTRMAAASEAALTALGMLPDTSEIGLWAFS------------ 407 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 N + + + ++ L Sbjct: 408 ----------------------TNKRPPNDWVELVPLGPLGEVLGSAPRRTRLQQGAKGL 445 Query: 270 NP--YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 T A+R + + + V+ ITDG N ++ Sbjct: 446 AALVGGGTALNDTTLAAFRRMQSTYDPEKI--------NSVVLITDGRNDDYASITTAQL 497 Query: 328 TLQICEYMR-NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + + + V + + + L + + G+ + ++ ++ + D Sbjct: 498 LQALESESDPARPIPLIMVGLGQEADMEALQEISSATGGKAYQALEAADIRSV---LLDA 554 Query: 387 IQEQSVR 393 I ++ R Sbjct: 555 ISQRRCR 561 >gi|170291084|ref|YP_001737900.1| von Willebrand factor type A [Candidatus Korarchaeum cryptofilum OPF8] gi|170175164|gb|ACB08217.1| von Willebrand factor type A [Candidatus Korarchaeum cryptofilum OPF8] Length = 328 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 28/167 (16%), Positives = 58/167 (34%), Gaps = 21/167 (12%) Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 L+ + AI K S R G + +N + + + +V + L P TN Sbjct: 182 LIAMMIAAIASHKLRSGRYGVVGFNSTAFVIKSPAENKDSVKVIEEILDLVPIGYTNISD 241 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 + S++ + ITDGE + + ++ +N Sbjct: 242 GLKKGLEI-------SYHLKNPKY-----LLITDGEYNVGE------DPRKVARRFKNLC 283 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSG-QFFAVNDSRELLESFDKITD 385 + I++ G L ++ G ++F ++D +++ I D Sbjct: 284 V-IHTRG-KRDSRGSVLCKEIARIGGSKYFVIDDIKQIQRVMKSILD 328 >gi|331694298|ref|YP_004330537.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] gi|326948987|gb|AEA22684.1| von Willebrand factor type A [Pseudonocardia dioxanivorans CB1190] Length = 362 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 18/185 (9%), Positives = 45/185 (24%), Gaps = 63/185 (34%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN------------EKESSHNTIGSTR 303 + + + + L+ L T A+ + + T Sbjct: 144 TTDRDAITRALDGLTTGRGTTVGSAILKSVDAISEIDPDVAPSDPAPGTVPPPPRAPGTY 203 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------ 351 + V+ +TDG N+ ++ + G+++Y + Sbjct: 204 APEIVVLLTDGANTTGVTPED------AAKTAAERGVRVYPIGFGTDEPSSMVCSAAQLG 257 Query: 352 --------------------------------EGQDLLRKCT-DSSGQFFAVNDSRELLE 378 + L++ + G++FA D+ L Sbjct: 258 RSSDGAFGGFGGYAGPSGGSGGGGGGGRGYLVADEGSLQQVADTTGGRYFAAADADRLQT 317 Query: 379 SFDKI 383 + Sbjct: 318 VLADL 322 >gi|296489197|gb|DAA31310.1| epithelial chloride channel protein [Bos taurus] Length = 905 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 42/112 (37%), Gaps = 21/112 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + ++ + + + +I +TDGE++ ++ Sbjct: 382 ANGGTSICRGLKAGFQAIIHS--------DQSTSGSEIILLTDGEDNEINSCF------- 426 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN-DSRELLESF 380 E ++ +G I+++A+ P L ++ + G F N D L +F Sbjct: 427 --EDVKRSGAIIHTIALG--PSAAKELETLSNMTGGYRFFANKDITGLTNAF 474 >gi|116622501|ref|YP_824657.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225663|gb|ABJ84372.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 337 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 41/304 (13%), Positives = 91/304 (29%), Gaps = 50/304 (16%) Query: 113 IPTENLFLKGLIPS------ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 +P P + + R E + + +D + + + + Sbjct: 15 LPAAYWAFAAFGPKLAQAQDPVVAIEPRVAARPESKGSANRTTPNIRVDSNLVLIPVMVT 74 Query: 167 KHNDNNNMTSNKY------LLPPPPKKSFWSKNTTKSKYAP--APAPANRKIDVLIESAG 218 H D K +F S++ S K+ + Sbjct: 75 DHQDRLITGLEKIHFHLFDDKVEQEITTFASEDVPVSIVIVFDCSGSMGPKLAKSRAAVA 134 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 ++S + + +N + + +E++S+L T Sbjct: 135 AFLSSANPEDE--------FSLVLFNDRA--QLVSGFNRQTDELQSKLFYAQSKGRTALL 184 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A++ A ++ + K S K V+ I+DG ++ + ++ ++ Sbjct: 185 DAIYLAMDQMKHAKHSR----------KAVLVISDGGDNCSRYS-----MREVKNRVKEG 229 Query: 339 GMKIYSVAV-------SAPP---EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 +IYS+ + G LL S G+ F +++ EL + KI + Sbjct: 230 DAQIYSIGILEAMGFRGRSAEELAGPALLDDIASQSGGRLFEIDNLNELSDVASKIGMAL 289 Query: 388 QEQS 391 + Q Sbjct: 290 RNQY 293 >gi|314923048|gb|EFS86879.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL001PA1] gi|314966820|gb|EFT10919.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL082PA2] gi|315093260|gb|EFT65236.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL060PA1] gi|315103482|gb|EFT75458.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL050PA2] gi|327327646|gb|EGE69422.1| putative von Willebrand factor type A domain protein [Propionibacterium acnes HL103PA1] Length = 322 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 19/154 (12%), Positives = 51/154 (33%), Gaps = 25/154 (16%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 P S + V ++ + + T A+ + + + + ++ ++D Sbjct: 150 PPSTDRPTVLRAVDGIELQDGTALGGAIDKSLEAVK----MAPGGSKNPAPA-AIVMLSD 204 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKC 360 G+N+ + N + + +Y++A LL Sbjct: 205 GDNTQGGSPLVAANRAAAAK------VPVYTIAFGTETGYVDLDGQRERVAPDTKLLSTV 258 Query: 361 TD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 D + Q + + + +L E + ++ + + VR Sbjct: 259 ADRTHAQSWTADSADKLQEVYQQVHSSVGYEPVR 292 >gi|327543524|gb|EGF29943.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 274 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 17/107 (15%), Positives = 36/107 (33%), Gaps = 22/107 (20%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T A+ + E + K +I +TDG ++ + + + Sbjct: 172 MAGPRTAFGDAIGLGVNLFDEDTERA----------KTIIALTDGNDTK-----SKVPPV 216 Query: 330 QICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFA 369 + +KIY+VA+ P + L+ + G++F Sbjct: 217 EAARVATQRDIKIYTVAIGDPTTVGEDKLDEQSLKDVASETGGKYFF 263 >gi|198423392|ref|XP_002124188.1| PREDICTED: similar to fibropellin Ia [Ciona intestinalis] Length = 1781 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 21/204 (10%), Positives = 58/204 (28%), Gaps = 24/204 (11%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI-AYNIGIVGN 250 R++ I++ + N + + + I Sbjct: 80 LVDGSGSVNYNAPGNWRRVLTWIKAVASGFNISNENVNIGVVQYSHWYRTLPMSSQIYLK 139 Query: 251 QCTPLSNNLNE--VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 L+ + + ++K+ T A + + + ++ K + Sbjct: 140 TEIALNRCRTQACFQYLVDKIQIMGYTTYTGAA--INKTILEDFSTTPARSN-----KIL 192 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS--SGQ 366 I +TDG + Y + R + I+ + V L+ + + Sbjct: 193 ILLTDGISKDDVNYASAF--------ARRQNITIFCIGVG--SYSLSQLQIIANGELNNN 242 Query: 367 F--FAVNDSRELLESFDKITDKIQ 388 + + +N +L + + +I+ Sbjct: 243 WRIYQLNSFNQLPTTVQNLQREIK 266 >gi|159899681|ref|YP_001545928.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159892720|gb|ABX05800.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 550 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 52/373 (13%), Positives = 111/373 (29%), Gaps = 45/373 (12%) Query: 36 AVLSGCASIVSDRTIKD-PTTKKDQTSTIFKKQIKKHLKQGSYIRENA-----GDIAQKA 89 A L SD T++D K Q + IK + +N I+ Sbjct: 198 AELYAANGKTSDLTVEDINQEKSQQFLRDLAQGIKHYGSNTLVFSQNMQKYGMAYISAFP 257 Query: 90 QINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAIS 149 IT N A Y + I + ++ + S Sbjct: 258 MEEITLIDFNKQAPNVPLVAIYPKEGTFIHDNPFIVMSDATADQKAAASVFYDFLLTPES 317 Query: 150 ICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK 209 + +M+ + + D + P + S T + A Sbjct: 318 Q------NLAMQQGFRPANVDVALASPLTAQFGVDPNQPRNSLATPPADVIVAAKN--AW 369 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSV--------------RIGTIAYNIGIVGNQCTPL 255 + + LV ++++ + + PL Sbjct: 370 ANNRKPANIMLVVDSSGSMRDDDKMDQAKLGVEVFLNRLPSKDNVGMIGFSSSPAVLVPL 429 Query: 256 ---SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S N+ ++ + L P NT+ Y A+ A +EL N K+ ++ ++ Sbjct: 430 ATRSENMANLQMQTQGLVPDGNTSLYDAIDLARQELENLKQPDRI--------NAIVVLS 481 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DG ++ + + Q+ + ++I+ +A A +L++ D S Sbjct: 482 DGADTASQLSID-----QMLGNFGESSIQIFPIAYGADA-ETSILQQIADFSRTELVQGS 535 Query: 373 SRELLESFDKITD 385 + ++ + F+ ++ Sbjct: 536 TGDIDKIFENLSR 548 >gi|21703186|gb|AAM76090.1| Vwa1 protein [Boltenia villosa] Length = 599 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 26/219 (11%), Positives = 62/219 (28%), Gaps = 25/219 (11%) Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 ++ RS D + + + P P + + + + Sbjct: 371 NLGQSFPKGRSTNKCLPSTRWDKD----HPFCTRPCPPNARMDAFV---ILDSSSSIGDE 423 Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ N++ S I + + N+ + + +K Sbjct: 424 NWLIMKAFIRNILGSF--TISDDTTHFAIVRYNGLVDTSTQVLLNDFPNSKAGLLAAFDK 481 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L T T A+ H + + ++ V+ ITDG++ + Sbjct: 482 LPYNGSGTKTGQAIAHVRDNMMSSANG-----NREGIQDLVMVITDGKSQDDVLKPSND- 535 Query: 328 TLQICEYMRNAGMKIYSVAVSAP--PEGQDLLRKCTDSS 364 +R G I+++ ++ P + L + S Sbjct: 536 -------LRKMGALIFAIGITPPRGALDEAQLLEIAGSP 567 >gi|239831900|ref|ZP_04680229.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301] gi|239824167|gb|EEQ95735.1| Poly [ADP-ribose] polymerase 4 [Ochrobactrum intermedium LMG 3301] Length = 777 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 52/171 (30%), Gaps = 18/171 (10%) Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMH 282 + + + I N+ + + L T PA+H Sbjct: 404 DYALSHLQPGDRFNVIRFDDTLTRFFEVSVEASQQNIASARHFVMSLEAQGGTAMLPALH 463 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 A L + + + + ++F+TDGE + L R +I Sbjct: 464 AA---LDDSHQGNGL--------RQIVFLTDGEI-------SNEQQLLDAIAARRGRSRI 505 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + V + P + G F + + E+ E + DK++ +V Sbjct: 506 FMVGIGTAPNSYLMNHAAELGRGTFTHIGSAAEVDERMRALFDKLENPAVT 556 >gi|125830338|ref|XP_692362.2| PREDICTED: anthrax toxin receptor 1 [Danio rerio] Length = 552 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 34/187 (18%), Positives = 61/187 (32%), Gaps = 21/187 (11%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 K + + + ++ Q+ + +R+ I ++ GN L+ N ++ Sbjct: 44 YFVLDKSGSVKHHWQEIYSFVELLEQKFISPMLRMSFIVFSTR--GNTIMRLTENRETIR 101 Query: 264 SRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 LN L P +T + + A ++Y E + +I +TDGE Sbjct: 102 KGLNVLRREIPGGDTFMHLGLEKANEQIYQE---------NYGTASVIIALTDGELQEHQ 152 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV-NDSRELLES 379 + G +Y V V + L D+S F V + L Sbjct: 153 LIAAQQEAARARTL----GAIVYCVGV--KDFNETQLATIADTSKHVFPVLGGFQALRGM 206 Query: 380 FDKITDK 386 D I K Sbjct: 207 IDSIIKK 213 >gi|310829014|ref|YP_003961371.1| hypothetical protein ELI_3449 [Eubacterium limosum KIST612] gi|308740748|gb|ADO38408.1| hypothetical protein ELI_3449 [Eubacterium limosum KIST612] Length = 684 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 41/314 (13%), Positives = 89/314 (28%), Gaps = 59/314 (18%) Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD--VSRSMEDLYLQKHNDNNNMT 175 + + + + LA + V+++ + N+ Sbjct: 1 MLNMKKTGKQIIAVIALLGMLWSSGFSVLAAPEPSSPNIEVTKTAALKDWESRTYQINLG 60 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + + + S K++VL ++A + + + Sbjct: 61 AKASSSAVGAADVVLVIDRSGSMGERYDGSRQTKMEVLKDTAKDFITQLSAQSPA----- 115 Query: 236 VRIGTIAYNIGIVGN---QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 ++ ++Y N N+ + ++KL T + + AY L Sbjct: 116 SQVSVVSYASDSKTNIGLTSLDTQENIQSLNRAIDKLWASGATRSDLGLEDAYSVLGAAD 175 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY--------MRNAG----- 339 + K+FVIF+TDGE + S + + ++ G Sbjct: 176 SGN---------KQFVIFLTDGEPNSYSGFDREIAARAESTASIIKGEDLIKRDGRIFGD 226 Query: 340 -----------------------MKIYSVAVSAPPEGQ---DLLRKCTDSSGQFFAVNDS 373 +I+S+ + Q D L DS + + Sbjct: 227 YDGSLDDGSSHNPDWEFEGDPLSAEIFSIGILKSWSSQRVHDYLNYI-DSQHSAALADTA 285 Query: 374 RELLESFDKITDKI 387 + L + F+ IT +I Sbjct: 286 QALQDIFEAITHQI 299 >gi|282897345|ref|ZP_06305347.1| hypothetical protein CRD_02269 [Raphidiopsis brookii D9] gi|281197997|gb|EFA72891.1| hypothetical protein CRD_02269 [Raphidiopsis brookii D9] Length = 449 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 20/160 (12%), Positives = 51/160 (31%), Gaps = 12/160 (7%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQE------KKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + ++KI I + + + + ++ S+ Sbjct: 143 SGGSKKIAGAINAIREFTKVSSDRGGDTQISVVPFGEAGKNCPEYTVNKDTLDKFLSASD 202 Query: 258 -NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS---HNTIGSTRLKKFVIFITD 313 L L+ LNP +TN Y + A L N ++ + +I ++D Sbjct: 203 FKLQNSLEYLSGLNPCGSTNLYQPLKKALEFLGNPEDPRFTLPENSSEPNPRLSIILLSD 262 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 G ++ + +Q+ + + + ++++ P Sbjct: 263 GYHNAMNEFQDFNELKSLLQSY--ENITVHTLGYGLTPSQ 300 >gi|317012798|gb|ADU83406.1| hypothetical protein HPLT_05065 [Helicobacter pylori Lithuania75] Length = 219 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 67/190 (35%), Gaps = 30/190 (15%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 +I VL ++ ++++ + KK L ++ + + G G +++ + + Sbjct: 39 RTRIGVLNLCIQKMIETLKQ--EAKKELFNKMAIVTF--GENGAVLHTPFDDIKNINFK- 93 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA----- 321 L+ T A A + +T + K + I ++DGE + Sbjct: 94 -PLSASGGTPLDQAFKLAKDLIE-----YKDTFPTKFYKPYSILVSDGEPNDDKWQEPLF 147 Query: 322 --YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + + +C +S+ + + + F +D +L++ Sbjct: 148 NFHHDGRSAKSVC----------WSIFIGDREANPQVNKD--FGKDGVFYADDVEKLVKL 195 Query: 380 FDKITDKIQE 389 F+ +T I + Sbjct: 196 FEIMTQTISK 205 >gi|319654932|ref|ZP_08009006.1| hypothetical protein HMPREF1013_05628 [Bacillus sp. 2_A_57_CT2] gi|317393357|gb|EFV74121.1| hypothetical protein HMPREF1013_05628 [Bacillus sp. 2_A_57_CT2] Length = 245 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 18/103 (17%), Positives = 41/103 (39%), Gaps = 10/103 (9%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS----APPEGQD 355 T K ++ ITDG ++ + + + + G+ + + V +G + Sbjct: 1 MKTGTLKQILLITDGCSNQGE------DPVAMAALAKEQGITVNVIGVMEQDVIDEQGMN 54 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + S G V S++L ++ +T K Q+++ N+ Sbjct: 55 EIEGIAMSGGGVSQVVYSQQLSQTVQMVTRKAMTQTLQGVVNK 97 >gi|198415894|ref|XP_002125076.1| PREDICTED: similar to calcium-dependent chloride channel-1, partial [Ciona intestinalis] Length = 863 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 31/331 (9%), Positives = 77/331 (23%), Gaps = 44/331 (13%) Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + H + + +NI Sbjct: 64 VFDEYATRGHGSHYVDSDGVLQGTRCPKSLRGLNIEFTPPYTNLCEVNRTTGLPQTDTCA 123 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 F + N S+ S S + C + + N Sbjct: 124 FYVHDVQHPELNTSMMSH-----SYVQQVVEFCHSDPSDPVNQHNTEADNEQNAKCNLRS 178 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 F + + P ++ ++ +SA + + V I Sbjct: 179 TWDVITSTSDFSGGSNPPNPTLTRP----DRMLMMRQSASTFITTSL-----PIGDCVGI 229 Query: 239 GTIAYNIGIVGNQC-TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 + ++ S + + + ++ T + A E+ + Sbjct: 230 TSFQSTSQLLLQISNITSSEDRANLTRMI-PISAGGTTCIGCGIKTAIDEMERDNPGG-- 286 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 + +I +TDG + ++++ ++L Sbjct: 287 ------VCGDIIVLTDGMENEEP-----------------KNCTVHAI-FFTTAHNRELA 322 Query: 358 RKCTDSSGQFFAVNDSR--ELLESFDKITDK 386 ++ GQ++ D L+ +F I + Sbjct: 323 ELAIETGGQWYFAQDRDIKRLMGAFTAIATE 353 >gi|119569860|gb|EAW49475.1| von Willebrand factor A domain containing 2 [Homo sapiens] Length = 576 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 49/136 (36%), Gaps = 18/136 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G ++ + EVK+R+ ++ T T A+ + Sbjct: 30 DISPERVRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVFKGGRTETELALKYLLHRG 89 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ + + +I +TDG++ G A + + ++ G+ +++V V Sbjct: 90 LPGGRNA-------SVPQILIIVTDGKSQGDVALPS--------KQLKERGVTVFAVGVR 134 Query: 349 APPEGQDLLRKCTDSS 364 P + L Sbjct: 135 FPRWEE--LHALASEP 148 >gi|38348304|ref|NP_940898.1| von Willebrand factor A domain-containing protein 2 [Homo sapiens] gi|34527908|dbj|BAC85505.1| unnamed protein product [Homo sapiens] Length = 725 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 49/136 (36%), Gaps = 18/136 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G ++ + EVK+R+ ++ T T A+ + Sbjct: 82 DISPERVRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVFKGGRTETELALKYLLHRG 141 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ + + +I +TDG++ G A + + ++ G+ +++V V Sbjct: 142 LPGGRNA-------SVPQILIIVTDGKSQGDVALPS--------KQLKERGVTVFAVGVR 186 Query: 349 APPEGQDLLRKCTDSS 364 P + L Sbjct: 187 FPRWEE--LHALASEP 200 >gi|74722595|sp|Q5GFL6|VWA2_HUMAN RecName: Full=von Willebrand factor A domain-containing protein 2; AltName: Full=A domain-containing protein similar to matrilin and collagen; Short=AMACO; AltName: Full=Colon cancer secreted protein 2; Short=CCSP-2; Flags: Precursor gi|50429312|gb|AAT77225.1| colon cancer secreted protein-2 [Homo sapiens] gi|50429314|gb|AAT77226.1| colon cancer secreted protein-2 [Homo sapiens] Length = 755 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 49/136 (36%), Gaps = 18/136 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G ++ + EVK+R+ ++ T T A+ + Sbjct: 82 DISPERVRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVFKGGRTETELALKYLLHRG 141 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ + + +I +TDG++ G A + + ++ G+ +++V V Sbjct: 142 LPGGRNA-------SVPQILIIVTDGKSQGDVALPS--------KQLKERGVTVFAVGVR 186 Query: 349 APPEGQDLLRKCTDSS 364 P + L Sbjct: 187 FPRWEE--LHALASEP 200 >gi|27657363|emb|CAD60276.1| AMACO [Homo sapiens] Length = 755 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 21/136 (15%), Positives = 49/136 (36%), Gaps = 18/136 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G ++ + EVK+R+ ++ T T A+ + Sbjct: 82 DISPERVRVGAFQFSSTPHLEFPLDSFSTQQEVKARIKRMVFKGGRTETELALKYLLHRG 141 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ + + +I +TDG++ G A + + ++ G+ +++V V Sbjct: 142 LPGGRNA-------SVPQILIIVTDGKSQGDVALPS--------KQLKERGVTVFAVGVR 186 Query: 349 APPEGQDLLRKCTDSS 364 P + L Sbjct: 187 FPRWEE--LHALASEP 200 >gi|241205700|ref|YP_002976796.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240859590|gb|ACS57257.1| Vault protein inter-alpha-trypsin domain protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 791 Score = 50.3 bits (118), Expect = 5e-04, Method: Composition-based stats. Identities = 21/164 (12%), Positives = 47/164 (28%), Gaps = 16/164 (9%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + + I T +N + + L T PA+ A R Sbjct: 383 SKLNPDDRFNVIRFDDTMTDYFKGLVTATPDNREKAIGYVRGLTADGGTEMLPALQAALR 442 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + + V+F+TDG + ++++V Sbjct: 443 NQGPVASGA---------LRQVVFLTDGAIGNEQQLFQEI-------TANRGDARVFTVG 486 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + P + + G F A+ + ++ ++ K+Q Sbjct: 487 IGSAPNTYFMTKAAEMGRGTFTAIGSTDQVASRMGELFAKLQNP 530 >gi|119505575|ref|ZP_01627647.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2080] gi|119458684|gb|EAW39787.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2080] Length = 316 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 23/139 (16%), Positives = 47/139 (33%), Gaps = 25/139 (17%) Query: 253 TPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 +P + + LN+ +T A+ A + + + ++ Sbjct: 141 SPFTEDHRTWLLLLNETRIRMAGPSTALGDAVGLAIKLF----------KDAETEHRVLL 190 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDLLRKCTD- 362 +TDG ++G+ + ++IY +AV P D L + + Sbjct: 191 LLTDGNDTGSLVP-----PVDAARVAATEDIRIYPIAVGDPTAVGEEAIDLDTLARMAEV 245 Query: 363 SSGQFFAVNDSRELLESFD 381 + GQ F S +L+ F Sbjct: 246 TGGQAFEALSSEDLIAVFK 264 >gi|194679013|ref|XP_608702.4| PREDICTED: matrilin 4-like [Bos taurus] gi|297490994|ref|XP_002698558.1| PREDICTED: matrilin 4-like [Bos taurus] gi|296472628|gb|DAA14743.1| matrilin 4-like [Bos taurus] Length = 788 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 61/164 (37%), Gaps = 19/164 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G + ++ + VK ++ ++ T T A+ + R+ Sbjct: 82 DINPRKVRVGAVQFSSVPRLEFPLDAFSTQQGVKGKIKRMVFKGGHTETSLALKYLLRKG 141 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + ++ + + +I ITDG + G A + ++ G+ ++SV + Sbjct: 142 FPGGRNA-------SVPQVLIIITDGRSQGHVALP--------AKQLKQRGITVFSVGIH 186 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 P + L + V + E+ ++ + + + S+ Sbjct: 187 FPRWEE--LHSLASEP-REQHVLMAEEVDDATNGLFSTLSGSSI 227 >gi|110832907|ref|YP_691766.1| von Willebrand factor type A domain-containing protein [Alcanivorax borkumensis SK2] gi|110646018|emb|CAL15494.1| protein containing a von Willebrand factor type A domain [Alcanivorax borkumensis SK2] Length = 698 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 58/165 (35%), Gaps = 29/165 (17%) Query: 235 SVRIGTIAYNIGI---VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 + R + ++ NN+ + ++ +L TN + + A L + Sbjct: 351 NDRFRIVLFDDRAEELTSGFVDATPNNIRQYTQKIMQLQSRGGTNLFGGLSLALTPLDAD 410 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + ++ +TDG + + + N +++++ V Sbjct: 411 RPTG------------IVLVTDGVANVGKT-----RQKDFIDLLENHDVRLFTF-VMGNS 452 Query: 352 EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + +L TD S+G +V++S + I +I + ++ Sbjct: 453 ANRPMLTAMTDASNGFAISVSNSDD-------IAGQILNATSKVT 490 >gi|114665186|ref|XP_001171182.1| PREDICTED: similar to leukointegrin alpha d chain, partial [Pan troglodytes] Length = 129 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 21/97 (21%), Positives = 37/97 (38%), Gaps = 10/97 (10%) Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---APPE 352 H KK +I ITDG+ Y++ L + AG+ Y++ V P Sbjct: 6 HKNGARKSAKKILIVITDGQ-----KYKDPLEYSDVIPQAEKAGIIRYAIGVGHAFQGPT 60 Query: 353 GQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 + L + + F V++ L ++ +KI Sbjct: 61 ARQELNTISSAPPQDHVFKVDNFAALGSIQKQLQEKI 97 >gi|194016210|ref|ZP_03054824.1| YwmD [Bacillus pumilus ATCC 7061] gi|194011683|gb|EDW21251.1| YwmD [Bacillus pumilus ATCC 7061] Length = 225 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 23/194 (11%), Positives = 51/194 (26%), Gaps = 23/194 (11%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE----KKNLSVRIGTIAYNIGIVGNQC 252 S +KID+ S + + + Q + Sbjct: 40 SGSMAQSVEGEKKIDIAKRSIQSFASILSDDTQVLLRVFGHEGTNKNAGKAISCESSEAV 99 Query: 253 TPL-SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 S + + LN P T A+ ++ + + K V + Sbjct: 100 FGFGSYESSTFQQALNVYKPTGWTPLAKALTDTKQDFEDHQAEG---------KNIVYVV 150 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRKCT-DSSGQFF 368 +DG+ + + Q + + G+ + + + L+ GQ+ Sbjct: 151 SDGQETCGGSPS------QAAKELHEGGIDTIVNIIGFDVNEKEARSLKSVAKAGGGQYQ 204 Query: 369 AVNDSRELLESFDK 382 ++ EL Sbjct: 205 PAANAEELNYILQN 218 >gi|197124353|ref|YP_002136304.1| von Willebrand factor A [Anaeromyxobacter sp. K] gi|196174202|gb|ACG75175.1| von Willebrand factor type A [Anaeromyxobacter sp. K] Length = 480 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 28/158 (17%), Positives = 52/158 (32%), Gaps = 21/158 (13%) Query: 248 VGNQCTPLSNN-LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 T ++ + + +K L +L P NTN + + + T + Sbjct: 88 TLAAPTEMTQDRKDALKVALGRLRPRHNTNLAGGLLAGL-------DHAKVTKVPDGMPV 140 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCTD-S 363 VI TDG + A +C + + + Q+LLR+ + Sbjct: 141 RVILFTDGLANEGPATS----PEGLCALLEANLGTASVSAFGYG-DDADQELLRELSTLG 195 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQS-----VRIAP 396 G + V + L +F + + VR+AP Sbjct: 196 RGNYAYVRSPEDALTAFARELGGLLSTYAQRIEVRVAP 233 >gi|282898869|ref|ZP_06306855.1| hypothetical protein CRC_00003 [Cylindrospermopsis raciborskii CS-505] gi|281196242|gb|EFA71153.1| hypothetical protein CRC_00003 [Cylindrospermopsis raciborskii CS-505] Length = 1499 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 50/382 (13%), Positives = 118/382 (30%), Gaps = 44/382 (11%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQT-STIFKKQIKKHLKQGSYIRENAGDIAQ 87 + +A D A ++ ++ + + + + + K ++ +GD Sbjct: 680 LSNATDGATITTATAVGTIQNDDVVAPGLAIAPTNAIQTEGNIGTKAFTFTVTRSGDTTS 739 Query: 88 KAQINITKDKNNPLQYIAE---------------SKAQYEIPTENLFLKGLIPSALTNLS 132 + N ++ Q + I + L + P ++ Sbjct: 740 SSSANWAVTGSSTNQADVTDFGGTLPTGTVNFTAGETSKTIIVDVLGDTTVEPDEGFTVT 799 Query: 133 LRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSK 192 L + ++ +I D + S++ + N N P Sbjct: 800 LSNPTNATITTATAVGTITNDDDNTGSIQGFKWKDINGNGVREDLIQGDSPNIVFVIDVS 859 Query: 193 NTTKSKYAPAPAPANRKI----DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 +T+ + P K +L + +++ ++ +++ Sbjct: 860 GSTRGPFQGIPVGDVNKDGIQNTILDAEIAGFIALNNSLVRKGFGSRAKVSIVSFASDAK 919 Query: 249 GNQCTPL-----SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 T N +V+ +L L TN A+ A + L + ++ N Sbjct: 920 TLLTTNPETDSNKNGTKDVEEKLISLKSGGETNFEIALQEAAKTLRDIGTTAGN------ 973 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 VIF++DG+ + + L+ ++ AG+K+ + V D L+ + Sbjct: 974 --GNVIFMSDGQPNQGNYTDEVLD-------LQKAGVKLSAFGVGTGA-SIDSLKLINPN 1023 Query: 364 SGQFFAVNDSRELLESFDKITD 385 + F + +LL FD + Sbjct: 1024 ASIF---TSTDQLLGVFDGLGS 1042 >gi|109129708|ref|XP_001116037.1| PREDICTED: integrin alpha-D-like, partial [Macaca mulatta] Length = 101 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 21/97 (21%), Positives = 36/97 (37%), Gaps = 10/97 (10%) Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---APPE 352 H KK +I ITDG+ Y++ L + AG+ Y++ V P Sbjct: 6 HKNGARKSAKKILIVITDGQ-----KYKDPLEYRDVIPQAEKAGIIRYAIGVGRAFQEPT 60 Query: 353 GQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 + L + F V++ L ++ +KI Sbjct: 61 ARQELNTIGSAPPQDHVFKVDNFAALSSIQKQLQEKI 97 >gi|268574618|ref|XP_002642288.1| C. briggsae CBR-CUT-6 protein [Caenorhabditis briggsae] gi|187025290|emb|CAP35757.1| CBR-CUT-6 protein [Caenorhabditis briggsae AF16] Length = 357 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 50/158 (31%), Gaps = 18/158 (11%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + + I Y N+ +++ + ++ TNT A+ Sbjct: 75 ASRLNISEEGSHMALIQYAETPKLEFSLGQFNHPTQLEWAIQRIEYQSGATNTGQALRLT 134 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + + K I ITDG++ + + + +R+A + +Y+ Sbjct: 135 LEKGLQGARTGI--------PKVAIVITDGQSQDDVSEPS--------QLLRDADVMVYA 178 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + V L + T + + F V +L + Sbjct: 179 IGV-TNLVNVHQLHQMTGNPVRVFTVESFEQLDRALAD 215 >gi|298372685|ref|ZP_06982675.1| BatB protein [Bacteroidetes oral taxon 274 str. F0058] gi|298275589|gb|EFI17140.1| BatB protein [Bacteroidetes oral taxon 274 str. F0058] Length = 345 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 23/165 (13%), Positives = 56/165 (33%), Gaps = 45/165 (27%) Query: 254 PLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 P++++ K L + P + T A+ A + + K+S + + Sbjct: 145 PITSDKVSAKMFLKTIQPDLIQRQGTAIGSAIDLAVKSFNDTKQSG---------GRAIF 195 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------------- 353 +TD EN + N ++ + R+ + + V + P Sbjct: 196 LLTDAEN-------HEDNAVEAAKMARDKNITVNVVGIGTPEGSPIPVKGTMSYIKDKDG 248 Query: 354 -------QDLL--RKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + L + +G + +++ +++ K DK+Q+ Sbjct: 249 NVVVSKLNEELGKQIAAAGNGIYVRASNTGATIKALAKEIDKMQK 293 >gi|330834639|ref|YP_004409367.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] gi|329566778|gb|AEB94883.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] Length = 383 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 22/125 (17%), Positives = 46/125 (36%), Gaps = 25/125 (20%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + ++ + + LN T + A+ A+ + +VI +TDG + Sbjct: 92 DPEDLTNEIVNLNAGGQTALFTALLTAFNL-----------HNKHGVPSYVILLTDGNPT 140 Query: 318 GASAYQNTLNTLQICEYMRNAGMK--IYSVAVSA-PPEGQDLLRKCTD-SSGQFFAVNDS 373 + E R + + +V+ + +L+ D S G F+ V+D+ Sbjct: 141 DDTNI----------ETYRRLQIPSSVQAVSFGLGDDYNESILKSLADKSGGVFYHVSDA 190 Query: 374 RELLE 378 E+ E Sbjct: 191 MEIPE 195 >gi|189347157|ref|YP_001943686.1| outer membrane adhesin like proteiin [Chlorobium limicola DSM 245] gi|189341304|gb|ACD90707.1| outer membrane adhesin like proteiin [Chlorobium limicola DSM 245] Length = 2825 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 35/308 (11%), Positives = 87/308 (28%), Gaps = 25/308 (8%) Query: 53 PTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYE 112 + + + ++ ++ GS G IT + Y + Sbjct: 2033 TSAGETVVYDVTGDRLTAYVDSGSSTGALDGTDRVVFTFEITPIRPTCAAYEFTPVLT-Q 2091 Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 I + + +LS G ++ +V+ V Sbjct: 2092 IDHPDAAADNIETLKTIDLSGSVYGYDNDGDRIGLGTVTLVITVQ---------DDVPIP 2142 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKS-KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + S + P + + S + ++ E+ +L+N Sbjct: 2143 TVVSESFTAKPIDTNLTIILDVSGSMNDVIPNSGGKTRLQFAKEAIASLINQYDAL---- 2198 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 V++ I ++ + + +++N L+ +TN A+ N Sbjct: 2199 --GDVKVQIITFSASAQPIIVSGQVWLDPGTAITQINGLSASGSTNYDDALVDVMTTYNN 2256 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQ--NTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + F++DG + + N+ Q ++ + Y++ V Sbjct: 2257 A-----GKLSTPQSQNVGYFLSDGVPNTPATSPGINSPEEQQWESFLTTNNINNYAIGVG 2311 Query: 349 APPEGQDL 356 A L Sbjct: 2312 AGATQATL 2319 >gi|145494949|ref|XP_001433468.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124400586|emb|CAK66071.1| unnamed protein product [Paramecium tetraurelia] Length = 611 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 51/150 (34%), Gaps = 27/150 (18%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N ++ +++++ T A + A+++L + + V ++DG Sbjct: 249 NKQYFQAVISQISAEGLTKISSATYIAFKQLKEK-----VYRNNVTS---VFLLSDGH-- 298 Query: 318 GASAYQNTLNTLQICEYMR--NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + +I + +R I + + Q + +G F+ V D Sbjct: 299 ------DGDALFEISDQIRHVKEVFTISTFGFGDDHDAQMMTSISNLKNGNFYYVKDITL 352 Query: 376 LLESF--------DKITDKIQ-EQSVRIAP 396 L E F I ++IQ S+ + Sbjct: 353 LDEFFAHALGGIVSVIAEQIQISLSLTLTK 382 >gi|163750668|ref|ZP_02157905.1| type IV pilin biogenesis protein, putative [Shewanella benthica KT99] gi|161329663|gb|EDQ00654.1| type IV pilin biogenesis protein, putative [Shewanella benthica KT99] Length = 1180 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 31/291 (10%), Positives = 76/291 (26%), Gaps = 56/291 (19%) Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 + + + + + + P ++++ Sbjct: 190 VPYTYITSSASSGDRAAALDKAFLTNFGLGKSVTLYTERYVN-WYHGRKPKEWKTRMEIA 248 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR-----LNK 268 ++V + + G + LN ++ + +N Sbjct: 249 KRVMEDVVVTTPSVDFGLAVFNYNTSNYNTTYRNDGGRIISGIQRLNAIEKKSLVQTINN 308 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK---------------------- 306 L+ T ++ AY + G + Sbjct: 309 LSAETWTPLCETLYEAYLYFGAKAPLYALEAGIASPPRDTSIEREGKYLSPFKESQCSNR 368 Query: 307 -FVIFITDGENSGASAYQNTLNTL---------------QICEYMRNAGMKI-------- 342 ++++ITDG + SA T++ L + M N + Sbjct: 369 SYIVYITDGAPTNDSAANYTVSQLPGYDKNNIVNGSYLPALTSIMNNEDINTKLAGDQFV 428 Query: 343 --YSVAVSAPPEGQD-LLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQE 389 +++ S +L+K + G ++A D+ +L + I +I E Sbjct: 429 STFTIGFSDGATSAAPILKKAAELGGGDYYAAKDAIKLQSALSDIFSQILE 479 >gi|161086980|ref|NP_631887.2| chloride channel calcium activated 4 [Mus musculus] Length = 1044 Score = 50.3 bits (118), Expect = 6e-04, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 47/112 (41%), Gaps = 21/112 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T+ + + + + + ++ +TDGE++ S+ Sbjct: 384 PNGGTSICNGLKKGFEAITSS--------DQSTSGSEIVLLTDGEDNRISSCF------- 428 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSG-QFFAVNDSRELLESF 380 + ++++G I+++A+ + L +D + G +F+A D L+++F Sbjct: 429 --QEVKHSGAIIHTIALGPSAARE--LETLSDMTGGLRFYAKEDVNGLIDAF 476 >gi|163797398|ref|ZP_02191350.1| hypothetical protein BAL199_28750 [alpha proteobacterium BAL199] gi|159177317|gb|EDP61874.1| hypothetical protein BAL199_28750 [alpha proteobacterium BAL199] Length = 683 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 41/138 (29%), Gaps = 18/138 (13%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++ L T A A + G + V+FITDG S + Sbjct: 391 AALKVIDGLKAGGGTEMAAAFELALQM-----------PGDPDRLQQVVFITDGAVSNEA 439 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 A N + ++++V + + P + G + + D+ Sbjct: 440 ALFNQIKGELGAR-------RLFTVGIGSAPNTFFMEEAARFGRGTYTYIGDTSSAERVM 492 Query: 381 DKITDKIQEQSVRIAPNR 398 + KI ++ R Sbjct: 493 RDLFTKISFPALTNIEVR 510 >gi|116624980|ref|YP_827136.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228142|gb|ABJ86851.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 331 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 30/163 (18%), Positives = 59/163 (36%), Gaps = 24/163 (14%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 Y + P S +L +RL ++ T Y A++ A + L Sbjct: 122 DMAALYTFNWEIREQQPFSRDLRAFDNRLKMMHGEAGTAMYDAVYLAAQRLEP------- 174 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV-------SAP 350 +K ++ +TDG ++ + L+ + E + A IY++ V Sbjct: 175 ----RDGRKVIVVVTDGGDTVSR-----LSVQKALEAAQLADAVIYAIVVVPITNDAGRN 225 Query: 351 PEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 G+ L + G+ F EL ++F I +++ Q + Sbjct: 226 IGGEHALDFMAKGTGGRIFMPTLGAELDKAFADIITELRTQYL 268 >gi|297618211|ref|YP_003703370.1| von Willebrand factor A [Syntrophothermus lipocalidus DSM 12680] gi|297146048|gb|ADI02805.1| von Willebrand factor type A [Syntrophothermus lipocalidus DSM 12680] Length = 587 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 45/134 (33%), Gaps = 17/134 (12%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 P + N + L ++ P T + + + + + + +I I Sbjct: 455 AVPFTRNQKVLSKGLARIRPGGLTPMADGIFTSVELIRSSRVHNP----------LLILI 504 Query: 312 TDGENSGASAYQN-TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFA 369 TDG + + + L+ + AG+K + + + + LR + G + Sbjct: 505 TDGMPNFPLWSFDAKSDALEAARKVAEAGIKFVCIGLESN---KVFLRDVAKQARGTLYI 561 Query: 370 VNDS--RELLESFD 381 V+D L+ Sbjct: 562 VDDLNRDSLINIVK 575 >gi|59712029|ref|YP_204805.1| von Willebrand factor type A domain-containing protein [Vibrio fischeri ES114] gi|59480130|gb|AAW85917.1| von Willebrand factor type A domain protein [Vibrio fischeri ES114] Length = 356 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 27/151 (17%), Positives = 64/151 (42%), Gaps = 16/151 (10%) Query: 246 GIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYREL-YNEKESSHNTIGS 301 G TP + + + E+ ++ + ++T+ A+ A + + ++ + + Sbjct: 157 GDAAFVQTPFTADQSVWLELLNQTDVAMAGQSTHLGDAIGLAIKVFEQSSEDKASAEENA 216 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLL--- 357 +K I +TDG ++G+ + + +R I+ +A+ P G+ L Sbjct: 217 KPREKVAIVLTDGNDTGSYVEPIDAAKVAAAKDVR-----IHMIAMGDPRTVGEQALDMN 271 Query: 358 ---RKCTDSSGQFFAVNDSRELLESFDKITD 385 R +S G+ F + EL +++D+I + Sbjct: 272 IINRVAKESGGKAFQAINRDELEQAYDEIGE 302 >gi|55981030|ref|YP_144327.1| hypothetical protein TTHA1061 [Thermus thermophilus HB8] gi|55772443|dbj|BAD70884.1| conserved hypothetical protein [Thermus thermophilus HB8] Length = 706 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 44/121 (36%), Gaps = 20/121 (16%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L T A A R L G +K V+ +TDG + A Sbjct: 373 SLRAGGGTVLGGAFREAVRLLQ----------GVPGERKAVLVLTDGLIADAKEP----- 417 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 I + + +G+++ ++A+ P L++ G+F+ REL F + + Sbjct: 418 ---ILDLAQTSGVEVSALALG-PDADAPFLKELARRGGGRFYQAPSPRELPRLFLREGQE 473 Query: 387 I 387 + Sbjct: 474 V 474 >gi|268324441|emb|CBH38029.1| putitive magnesium-chelatase subunit [uncultured archaeon] Length = 705 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 64/197 (32%), Gaps = 26/197 (13%) Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 S T A ANR+++ + +L+ + ++G +A+ G Sbjct: 503 KISTATMFVVDASGSMGANRRMESAKGAVLSLL-------LDSYQQRDKVGMVAFK-GDQ 554 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 + PL ++ + RL +L T + L EK + Sbjct: 555 ADVLLPLCSSSDLAVERLRELPTGGRTPLAAGLEQGLNLLMAEKHR------DEEAIPIL 608 Query: 309 IFITDGENSGASAYQNTLNTL--QICEYMRNAGMKIYSVAVSAPPEGQDLLR-------K 359 + I+DG + ++ L + E R G+ Y + + ++ Sbjct: 609 LLISDGRANVSAGGSKELEQELLALAEQARAKGI--YVIVIDTEIVSDSFIQMQLGYCRA 666 Query: 360 CTD-SSGQFFAVNDSRE 375 + S G+++ + D Sbjct: 667 IANYSGGKYYPIADLTS 683 >gi|170041024|ref|XP_001848278.1| sushi [Culex quinquefasciatus] gi|167864620|gb|EDS28003.1| sushi [Culex quinquefasciatus] Length = 2239 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 24/196 (12%), Positives = 62/196 (31%), Gaps = 28/196 (14%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + + +I + + + S S + + +Q Sbjct: 142 DASSSVGRQNFASEIKFVKKLLSDFNVSYNYTRVAVITFSSQKKI--FRHIDQISQSVED 199 Query: 256 SNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 ++ + ++ ++ T TY A+ A N + S KK + ITDG Sbjct: 200 NDKCLLLNYQVPRIAFSGGGTYTYGALKEAEEIFKNARLDS---------KKIIFLITDG 250 Query: 315 ENSGASAYQNTLNTLQICEYMR-NAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVN 371 ++G + + + ++ + + IYS+ + + + L + + ++ Sbjct: 251 FSNG-------RDPIPLAGRLKKDNNVVIYSIGIQSGNYAE--LHAIASAPEGDHCYLLD 301 Query: 372 DSRELLESFDKITDKI 387 F+ + K Sbjct: 302 SFDH----FETLARKA 313 >gi|325919992|ref|ZP_08181973.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] gi|325549526|gb|EGD20399.1| von Willebrand factor type A-like protein [Xanthomonas gardneri ATCC 19865] Length = 142 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 21/106 (19%), Positives = 43/106 (40%), Gaps = 18/106 (16%) Query: 251 QCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 TPL+ +L V+ +L T A+ + + L +K+ ++ Sbjct: 46 ALTPLTADLTSVRDQLADSVVGLAGRETAIGDAIALSVKRLREQKQG----------QRV 95 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 V+ +TDG N+ LN L+ E + G++++++A Sbjct: 96 VVLLTDGVNTAGV-----LNPLKAAELAKAEGVRVHTIAFGGSGGD 136 >gi|322378392|ref|ZP_08052846.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] gi|322380073|ref|ZP_08054329.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321147480|gb|EFX42124.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter suis HS5] gi|321149148|gb|EFX43594.1| phage/colicin/tellurite resistance cluster TerY protein [Helicobacter suis HS1] Length = 236 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 32/195 (16%), Positives = 68/195 (34%), Gaps = 13/195 (6%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 T S + +I L + +++ +++ + +V + TP Sbjct: 23 TSSSMSTNMNGGQTRIGCLNDCVQTMIDLLKEEAK---RENVSKLAVITFGAGGVKLQTP 79 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 LS + ++ + L NT A+ L + + +T +V+ ++DG Sbjct: 80 LSK-IESIQF--SPLGTGGNTPLGMALE-----LTRDYIQNKDTFPGKFYTPYVVMVSDG 131 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 E + + R++ YSV + E + + S Q + ND + Sbjct: 132 EPNDDWQGPLHDFIHN--KENRSSKSVRYSVFIGNEGEEPQAVHDFSGSPNQVYYANDVQ 189 Query: 375 ELLESFDKITDKIQE 389 L+ F IT + + Sbjct: 190 SLINCFKAITASVTQ 204 >gi|242016552|ref|XP_002428850.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212513586|gb|EEB16112.1| conserved hypothetical protein [Pediculus humanus corporis] Length = 1945 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 20/111 (18%), Positives = 42/111 (37%), Gaps = 20/111 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P +NTNT A+ A + L ++ + K + ITDG ++ N L Sbjct: 110 PGKNTNTKEALEKAEKILTRARKGA---------VKIIFLITDGFSNMG-------NPLP 153 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 + + +++ IY+ + ++LL+ + + E + Sbjct: 154 MAQILKDQDTIIYTFGI-INGNARELLK-ISSQPIEEHAYLFTSFDEFEKF 202 >gi|332262298|ref|XP_003280198.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-1(XX) chain-like [Nomascus leucogenys] Length = 1284 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 24/140 (17%), Positives = 45/140 (32%), Gaps = 18/140 (12%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAY 285 A E V++G Y+ +V + +++L NT T A+ H Sbjct: 207 APFEIGPDKVQVGLTQYSGDAQTEWDLNSLGTKEQVLAAVHRLRYKGGNTFTGLALTHVL 266 Query: 286 -RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + L K VI +TDG++ +++ G+ +++ Sbjct: 267 GQNLQPAAGLRPEA------AKVVILVTDGKSQDDVH--------TAARVLKDLGVNVFA 312 Query: 345 VAVSAPPEGQDLLRKCTDSS 364 V V + LR Sbjct: 313 VGV--KNADEAELRLLASPP 330 >gi|308497300|ref|XP_003110837.1| CRE-CUT-6 protein [Caenorhabditis remanei] gi|308242717|gb|EFO86669.1| CRE-CUT-6 protein [Caenorhabditis remanei] Length = 576 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 22/158 (13%), Positives = 49/158 (31%), Gaps = 18/158 (11%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHA 284 + + I Y N+ +++ + ++ TNT A+ Sbjct: 75 ASRLNISEEGSHMALIQYAETPKLEFSLGQFNHPTQLEWAIQRIEYQSGATNTGQALRLT 134 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + K I ITDG++ + + + +R+A + +Y+ Sbjct: 135 LEKGLQGARPGI--------PKVAIVITDGQSQDDVSEPS--------QLLRDADVMVYA 178 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + V L + T + + F V +L + Sbjct: 179 IGV-TNLVNVHQLHQMTGNPVRVFTVETFEQLDRALAD 215 >gi|301058344|ref|ZP_07199377.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] gi|300447580|gb|EFK11312.1| von Willebrand factor type A domain protein [delta proteobacterium NaphS2] Length = 598 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 30/150 (20%), Positives = 51/150 (34%), Gaps = 31/150 (20%) Query: 254 PLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + + LN L+P T+ A+ A + + E+ K ++ Sbjct: 144 PLTLDYGALMMFLNILHPNNIPHPGTDLGAAVLGAIKAFDPKSET----------DKVIL 193 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 ITDGE++ L G+KI+ + P G + F Sbjct: 194 LITDGEDN-------EKRGLDAAREAVRKGIKIFVFGMGDPAGGPIP----ASNGKGGFV 242 Query: 370 VNDSRE-----LLES-FDKITDKIQEQSVR 393 +D E L E+ +I + VR Sbjct: 243 KDDKGELVLSKLDEAGLQEIAAVTGGEYVR 272 >gi|213966336|ref|ZP_03394518.1| secreted Mg-chelatase subunit [Corynebacterium amycolatum SK46] gi|213951042|gb|EEB62442.1| secreted Mg-chelatase subunit [Corynebacterium amycolatum SK46] Length = 543 Score = 49.9 bits (117), Expect = 6e-04, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 65/182 (35%), Gaps = 22/182 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN----LNEVK 263 +I L + +V+ + L R + T + + ++ Sbjct: 374 DRIADLQATMRAIVDGSARTETGSVGLRNREIATILPFSTSVGEPTTTTIDGPESRAQLT 433 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + ++ L T Y A+ A+ L + ++S + ++ +TDG+ + + Sbjct: 434 AAVDGLYAEGETALYDALIQAFDLLGSSDKNSIPS---------IVVLTDGQVTSGKTFS 484 Query: 324 NTLNTLQICEYMRNAGMK-IYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFD 381 + Q + + ++ + G+ +++ + + G+ F + EL + F Sbjct: 485 EFRDFYQS----KGGNLPPVFVIRYGEADPGE--MQELANLTGGKVFE-SRETELADVFK 537 Query: 382 KI 383 +I Sbjct: 538 EI 539 >gi|268324906|emb|CBH38494.1| conserved hypothetical protein [uncultured archaeon] Length = 709 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 30/197 (15%), Positives = 64/197 (32%), Gaps = 26/197 (13%) Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 S T A ANR+++ + +L+ + ++G +A+ G Sbjct: 507 KISTATMFVVDASGSMGANRRMESAKGAVLSLL-------LDSYQQRDKVGMVAFK-GDQ 558 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 + PL ++ + RL +L T + L EK + Sbjct: 559 ADVLLPLCSSSDLAVERLRELPTGGRTPLAAGLEQGLNLLMAEKHR------DEEAIPIL 612 Query: 309 IFITDGENSGASAYQNTLNTL--QICEYMRNAGMKIYSVAVSAPPEGQDLLR-------K 359 + I+DG + ++ L + E R G+ Y + + ++ Sbjct: 613 LLISDGRANVSAGGSKELEQELLALAEQARAKGI--YVIVIDTEIVSDSFIQMQLGYCRA 670 Query: 360 CTD-SSGQFFAVNDSRE 375 + S G+++ + D Sbjct: 671 IANYSGGKYYPIADLTS 687 >gi|260800507|ref|XP_002595171.1| hypothetical protein BRAFLDRAFT_240983 [Branchiostoma floridae] gi|229280414|gb|EEN51182.1| hypothetical protein BRAFLDRAFT_240983 [Branchiostoma floridae] Length = 352 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 56/163 (34%), Gaps = 15/163 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYR 286 + +IG + Y+ ++ ++ + + TNT A+ + + Sbjct: 33 DFNIGYDATQIGLVQYSDEAETIFALDSFSSPTSLRDAIETIQYTGGATNTGNALDYMVQ 92 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 ++ + + K I +T G ++ + + MR + + Y++ Sbjct: 93 YMFASRNGAR-----QDSTKIAIVLTGG--------ASSDDIKAAAQRMRKSSVITYAIG 139 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + + D L + + S L++ + ++ K+ + Sbjct: 140 IGSE-LDYDQLDYIAGAPTSLTILQASTGLVQLRNTLSRKLCQ 181 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 37/335 (11%), Positives = 96/335 (28%), Gaps = 29/335 (8%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQK 88 +Q + +A + S S +++D T G Sbjct: 46 VQYSDEAETIFALDSFSSPTSLRDAIETIQYTGGATNTGNALDYMVQYMFASRNGARQDS 105 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 +I I I + + + + G+ + + +S + Sbjct: 106 TKIAIVLTGGASSDDIKAAAQRMRKSSVITYAIGIG--SELDYDQLDYIAGAPTSLTILQ 163 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + ++ + ++ Q N T+ L + + T S Sbjct: 164 ASTGLVQLRNTLSRKLCQGLYSNLPSTAGPSPLCSVQVDLVFVLDGTGS-------VGAT 216 Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 + + +++ + RIG + Y+ ++ ++ + Sbjct: 217 NFEKMKTFVQKMISDFDLGPEA-----TRIGVVVYSNRASLEISLDAYDDQEALQDAVAD 271 Query: 269 L-NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + P T T A+ + ++ + + ++K + +TDG + + Sbjct: 272 IAYPGGYTLTGAAIDYTTTFAFSTRNGAR-----DGVRKVAVILTDGVSYD--------D 318 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 + + MR A + Y+V + + +D L Sbjct: 319 PAEPAQSMRKAAIITYAVGIGSN-LDRDQLDVIAG 352 >gi|220941747|emb|CAX15448.1| novel protein similar to vertebrate collagen, type VI, alpha 3 (COL6A3) [Danio rerio] Length = 429 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 20/154 (12%), Positives = 47/154 (30%), Gaps = 15/154 (9%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAY 285 + VR+ + ++ N +EV +N L NT A+ Sbjct: 273 PLDIGINKVRVSVVQHSDRPSPNFFLDTYQTKDEVLRAVNGLTLAGGRGLNTGAALTFMK 332 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + S + +F+I +T G ++ + + ++ G+ + Sbjct: 333 NTVLSTARGSRAAQN---VPQFLIVLTAG--------RSRDSVREPAVALKTEGVVPF-- 379 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 V + + + F V + +L Sbjct: 380 GVGVKNADPKEIEAISHNPSFAFNVKEFSQLNTI 413 >gi|218662246|ref|ZP_03518176.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli IE4771] Length = 487 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 20/164 (12%), Positives = 47/164 (28%), Gaps = 16/164 (9%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + + I +N + + + L T PA+ A R Sbjct: 76 SRLNPDDRFNVIRFDDTMTDYFKGLVAATPDNREKAIAYVRGLTADGGTEMLPALEDALR 135 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + + V+F+TDG + ++++V Sbjct: 136 NQGPVASGA---------LRQVVFLTDGAIGNEQQLFQEI-------TANRGDARVFTVG 179 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + P + + G F A+ + ++ ++ K+Q Sbjct: 180 IGSAPNTYFMTKAAEIGRGTFTAIGSTDQVASRMGELFAKLQNP 223 >gi|126731725|ref|ZP_01747530.1| Von Willebrand domain containing protein [Sagittula stellata E-37] gi|126707891|gb|EBA06952.1| Von Willebrand domain containing protein [Sagittula stellata E-37] Length = 321 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 22/147 (14%), Positives = 52/147 (35%), Gaps = 30/147 (20%) Query: 254 PLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P + + + R+ + TN A+ A + + GS + VI Sbjct: 151 PFTFDTEAIARRIEEATIGISGRATNISDALGLALKRM----------AGSDADTRVVIL 200 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL-----------LRK 359 ++DG N+ + N + + G++++++A+ + L Sbjct: 201 LSDGANNAGAT-----NPRGVAQLAAQMGVRVHTIAMGPKSVDEAEEGERGVVDAETLDA 255 Query: 360 CTD-SSGQFFAVNDSRELLESFDKITD 385 + S G+ F V + +L+ + + Sbjct: 256 ISKVSGGETFRVRTTEDLIAVTEALDR 282 >gi|260786375|ref|XP_002588233.1| hypothetical protein BRAFLDRAFT_124700 [Branchiostoma floridae] gi|229273393|gb|EEN44244.1| hypothetical protein BRAFLDRAFT_124700 [Branchiostoma floridae] Length = 1313 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/266 (11%), Positives = 74/266 (27%), Gaps = 30/266 (11%) Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP---PPK 186 L + S + + + + + + PP Sbjct: 233 ELPTVTDFCNSDPSSPNYHNFHALGSDQNQFCNFRSTWDVISASDDFSGNNNPPRTVYNT 292 Query: 187 KSFWSKNTT--KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 + ++ + ID+ L K I+ + S+ + Sbjct: 293 IPNFKLVYYRQYAERVSIVLDVSGSIDMGTL-LPRLNQEASKYIRSFADGSMVGLITFSD 351 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKESSHNTIGST 302 V + T L+ + + +S + L +T+ + L + Sbjct: 352 TAAVDHALTELTAD-SHRQSLITALPSSTYGSTSIGAGIQAGLSMLKPTGQGGT------ 404 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 ++ +TDG+ + A Q+ ++ + + ++A+ L Sbjct: 405 -----IVLMTDGQENTAPMIQDVWPSVL------QQKVTLVTIAIG--EYADMSLEDLAS 451 Query: 363 --SSGQFFAVNDSRELLESFDKITDK 386 S F+ D+ L E F I+ + Sbjct: 452 QTSGLSFYDTEDASHLSEIFTAISSQ 477 >gi|327188854|gb|EGE56047.1| putative vault protein inter-alpha-trypsin domain-containing protein [Rhizobium etli CNPAF512] Length = 794 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 19/134 (14%), Positives = 44/134 (32%), Gaps = 16/134 (11%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N + + + L T PA+ A R + + V+F+TDG Sbjct: 413 DNREKAITYVRGLTADGGTEMLPALEDALRNQGPVASGA---------LRQVVFLTDGAI 463 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + ++++V + + P + + G F A+ + ++ Sbjct: 464 GNEQQLFQEI-------SANRGDARVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQV 516 Query: 377 LESFDKITDKIQEQ 390 ++ K+Q Sbjct: 517 ASRMGELFAKLQNP 530 >gi|219847650|ref|YP_002462083.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] gi|219541909|gb|ACL23647.1| von Willebrand factor type A [Chloroflexus aggregans DSM 9485] Length = 419 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 18/139 (12%), Positives = 44/139 (31%), Gaps = 16/139 (11%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N + ++++ T PA+ +EL + +I +TDG+ Sbjct: 101 NRAMILDLVHRIRDAGGTRIAPAVEKGLQELQKMPPGV----------RRLILLTDGQTE 150 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + G+ I ++ + L+ S G + E++ Sbjct: 151 HENECLLR------ADDAGRLGVPITALGIGKDWNEDLLIEMANRSKGVADYIAQPGEIV 204 Query: 378 ESFDKITDKIQEQSVRIAP 396 F + Q+ ++ + Sbjct: 205 NYFQHTVQRAQQTVIQNSV 223 >gi|314988185|gb|EFT32276.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL005PA2] Length = 322 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 51/154 (33%), Gaps = 25/154 (16%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 P S + V ++ + + T A+ + + + + ++ ++D Sbjct: 150 PPSTDRPTVLRAVDGIELQDGTALGGAIDKSLEAVK----MAPGGSKNPAPA-AIVMLSD 204 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKC 360 G+N+ + N + + +Y++A LL Sbjct: 205 GDNTQGGSPLVAANRAAAAK------VPVYTIAFGTETGYVDLNGQRERVAPDTKLLSTV 258 Query: 361 TD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 D + + + + + +L E + ++ + + VR Sbjct: 259 ADRTHAKSWTADSADKLREVYQQVHSSVGYEPVR 292 >gi|313676832|ref|YP_004054828.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312943530|gb|ADR22720.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 587 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 54/150 (36%), Gaps = 16/150 (10%) Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 + + ++ + P T A+ HA L + + + Sbjct: 451 RNHSSYLTIKDFDEKWSNARYKVGAIAPSGYTRIGTALRHAGTRLQSRESQN-------- 502 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRN---AGMKIYSVAVSAPPEGQDLLRKC 360 K+VI ++DG+ + Y+ + + ++ + Y++A+ + + L + Sbjct: 503 --KWVILLSDGKPNDYDRYEGKYGVQDVKQALKEFNQNNINSYAIAI--ESQAKYYLPQM 558 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + ++ +L+ S K+ KI+ Q Sbjct: 559 -FGQNHYQILSKPDDLIHSLVKLYTKIKNQ 587 >gi|189461338|ref|ZP_03010123.1| hypothetical protein BACCOP_01988 [Bacteroides coprocola DSM 17136] gi|189431867|gb|EDV00852.1| hypothetical protein BACCOP_01988 [Bacteroides coprocola DSM 17136] Length = 341 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 19/168 (11%), Positives = 56/168 (33%), Gaps = 46/168 (27%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L +NP + T+ A++ A + + + + Sbjct: 142 TQLPITSDYISAKMFLETINPSLISTQGTDIAGAINLAMKSFTP----------NEGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 ++ ITDGEN ++ + G++++ + V +P Sbjct: 192 AIVLITDGENHEG-------GAVEAAQEAAKKGVRVFVLGVGSPDGAPIPAEGTNEFRRD 244 Query: 353 ----------GQDLLRKCTDSSGQFF-AVNDSRELLESFDKITDKIQE 389 + + ++ + + V+++ + + K+ + Sbjct: 245 KDGNVIVTRLNEQMCQEIAKAGNGIYVRVDNTNNAERALNAEISKLAK 292 >gi|270008949|gb|EFA05397.1| hypothetical protein TcasGA2_TC015569 [Tribolium castaneum] Length = 2194 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 17/148 (11%), Positives = 48/148 (32%), Gaps = 8/148 (5%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N+ + K N TN + + + ++ T+ + +I +TDG Sbjct: 335 DNIKKAKGIKNDDMHMGCTNIIGGLVVGLFLVRRTLKKNYEQNVETKHQPMIILLTDGLP 394 Query: 317 SGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVN 371 + + + +I + I+S++ + L + + G + Sbjct: 395 NVGL--SDPVEITKIVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYEAA 452 Query: 372 DSR-ELLESFDKITDKIQEQSVRIAPNR 398 D+ +L + + + ++ Sbjct: 453 DAALQLQNFYRTVFSPLLRDVRFKYVDK 480 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/270 (12%), Positives = 80/270 (29%), Gaps = 29/270 (10%) Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + VLD S SME ++ +K L P F Sbjct: 991 VHFFAPSGLQTFPKHVVFVLDHSGSMEGRKYEQLMQAM----DKILSDLNPDDLF--HIV 1044 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 S+ K D + +Q+ + ++ +N+G Sbjct: 1045 RFSENVSVWNFEKNKFDQV------------SFLQKPEYRNLDSFLAEFNLGDAAQVTEG 1092 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 N+ + K + TN + + + + T+ + +IF+TDG Sbjct: 1093 ---NIKKAKKIKDHDVDMGCTNIIGGLVVGLYLVRRTLQKFYEKNVETKHQPMIIFLTDG 1149 Query: 315 ENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFA 369 + + + + +I + I+S++ + L + + G + Sbjct: 1150 LPNEGISNPDKI--TKIVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYE 1207 Query: 370 VNDSR-ELLESFDKITDKIQEQSVRIAPNR 398 D+ +L + ++ + ++ Sbjct: 1208 AADAALQLQNFYRTVSSPLLRDVRFKYVDK 1237 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 30/270 (11%), Positives = 81/270 (30%), Gaps = 29/270 (10%) Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + L + VLD S SME ++ + + +S Sbjct: 1779 VHFFAPSGLQTLPKHVVFVLDYSASMEGRKHEQLMQAMDKILSDLNPDDLFHIVRFSVIV 1838 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + + + D + + Q+ + ++ +N+G Sbjct: 1839 SVWNFE------KNRFDQIKFA------------QKPEYENLDSFLAEFNLGDAAQVSE- 1879 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 +N+ + K + + TN + + E + T+ + +IF+TDG Sbjct: 1880 --DNIKKAKEIKDHDVDMDCTNIIGGLVVGLYLVRQTLEKFYEKNIETKHQPMIIFLTDG 1937 Query: 315 ENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFA 369 + ++ + + + I+S++ + L + + G + Sbjct: 1938 LPNVGLIIRDEI--TDVVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYE 1995 Query: 370 VNDSR-ELLESFDKITDKIQEQSVRIAPNR 398 D+ +L + ++ + ++ Sbjct: 1996 AADAALQLQNFYRTVSSPLLRDVRFKYVDK 2025 >gi|189238315|ref|XP_972124.2| PREDICTED: similar to inter-alpha-trypsin inhibitor family heavy chain-related protein [Tribolium castaneum] Length = 750 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 17/148 (11%), Positives = 48/148 (32%), Gaps = 8/148 (5%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N+ + K N TN + + + ++ T+ + +I +TDG Sbjct: 380 DNIKKAKGIKNDDMHMGCTNIIGGLVVGLFLVRRTLKKNYEQNVETKHQPMIILLTDGLP 439 Query: 317 SGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVN 371 + + + +I + I+S++ + L + + G + Sbjct: 440 NVGL--SDPVEITKIVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYEAA 497 Query: 372 DSR-ELLESFDKITDKIQEQSVRIAPNR 398 D+ +L + + + ++ Sbjct: 498 DAALQLQNFYRTVFSPLLRDVRFKYVDK 525 >gi|224534421|ref|ZP_03674999.1| von Willebrand factor type A domain protein [Borrelia spielmanii A14S] gi|224514523|gb|EEF84839.1| von Willebrand factor type A domain protein [Borrelia spielmanii A14S] Length = 333 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 62/172 (36%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + + +L+ ++ + + A L S LK+ + Sbjct: 146 VVPITTDRDFFNKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEALKRSI 195 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------------------- 349 + +TDG + Y++ Q+ + +KIYS+ + + Sbjct: 196 VVLTDGVVNSDEIYKD-----QVINLAQGLNVKIYSIGIGSSEEFGVEFKLRSGKFYQGS 250 Query: 350 --PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + ++ + G F++VND + + K E+ ++IA + Sbjct: 251 LREVYDPSMLVEISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAVD 302 >gi|148680072|gb|EDL12019.1| mCG141954, isoform CRA_b [Mus musculus] Length = 972 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 47/112 (41%), Gaps = 21/112 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T+ + + + + + ++ +TDGE++ S+ Sbjct: 384 PNGGTSICNGLKKGFEAITSS--------DQSTSGSEIVLLTDGEDNRISSCF------- 428 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSG-QFFAVNDSRELLESF 380 + ++++G I+++A+ + L +D + G +F+A D L+++F Sbjct: 429 --QEVKHSGAIIHTIALGPSAARE--LETLSDMTGGLRFYAKEDVNGLIDAF 476 >gi|116626778|ref|YP_828934.1| hypothetical protein Acid_7751 [Candidatus Solibacter usitatus Ellin6076] gi|116229940|gb|ABJ88649.1| hypothetical protein Acid_7751 [Candidatus Solibacter usitatus Ellin6076] Length = 466 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 32/275 (11%), Positives = 72/275 (26%), Gaps = 26/275 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + I++ V F + +A+DL + +R ++ A +AA ++ + ++ D T Q Sbjct: 41 LAVIMVPVLFGMMGFALDLGRLYLVRGELNHAAEAAAIAAASHLIGTTGALDAATTAAQQ 100 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK----------DKNNPLQYIAESKAQ 110 + K L G I A + + + Sbjct: 101 TLTLNKYNFGSLTPGEGSGNLTSTITDPAYFSTVAGATGNDPNGSQADGTTARHVQVSLT 160 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSS-ENLAISICMVLDVSRSMEDLYLQKHN 169 + P L S T ++ ++ I I + S + Sbjct: 161 ADAPLLFWSLLSAGQSRKTPIAAQALAGISVPLCTACGIEPFAIAAKDASDLTDFGFGAP 220 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS------ 223 ++ + Y + A + D + + + Sbjct: 221 ADDVHYTFYYNCTGTAPAFL------PNSGQSAAYTIINRYDAGNTTVTDESDQLFRYGA 274 Query: 224 ---IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + +V +G + + PL Sbjct: 275 GGLLSSTTPNPTGSTVPLGCVGIGDSVEAVWNGPL 309 >gi|119774146|ref|YP_926886.1| type IV pilin biogenesis protein [Shewanella amazonensis SB2B] gi|119766646|gb|ABL99216.1| type IV pilin biogenesis protein, putative [Shewanella amazonensis SB2B] Length = 1148 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 35/285 (12%), Positives = 78/285 (27%), Gaps = 60/285 (21%) Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 S S E + N + K++ + +K K +ID+ E+ Sbjct: 177 SGSKEATKELAMSKAKNTGFGTGRVVTLYTKTYLTWYHSKKKQVN-----RTRIDIAKEA 231 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNPYENT 275 N++ + + + + G + P + +N ++ ++ L T Sbjct: 232 VTNVLLTTPGVDFGLAIFNSNVYEGYDDGGRIIAGIKPATASNKKDLIEAVDLLKGTTWT 291 Query: 276 NTYPAMHHAYRE--------------LYNEKESSHNTIGSTRLK--------KFVIFITD 313 ++ AYR L ++ S +++++TD Sbjct: 292 PLCETLYEAYRYFSGGEVWFGDDDPTLKPYRDKDAIDANSRYKSPFKTCQNRAYIVYVTD 351 Query: 314 GENSGASAYQNTLNTLQICEYM--------------------------RNAGMKIYSVAV 347 GE + S + L + Y++ Sbjct: 352 GEPTRDSNANQLVYDLTGGVDAYTSSPASYLSSLSSWMNTQDVNPNMTGKQSVSTYTIGF 411 Query: 348 SAPPEGQDLL--RKCTDSSGQFFAVNDSRELLE----SFDKITDK 386 S L G+++ + +L + F I +K Sbjct: 412 SQGAASAAGLLRHTAEKGGGKYYDATNVDDLQKSLMQVFKNILEK 456 >gi|297671393|ref|XP_002813835.1| PREDICTED: collagen alpha-1(VII) chain-like [Pongo abelii] Length = 2889 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 51/161 (31%), Gaps = 31/161 (19%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNE 291 VR T+ Y+ G + S++ T + L + Sbjct: 73 SAQGVRFATVQYSDDPRGQWASDASDSFE-------------GTGLSINVLMIRAILSSV 119 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 ++ + K I ITDG++ + ++ G+K+++V + Sbjct: 120 TQAR------DCVPKVCILITDGKSQD--------LVDTAAQRLKGQGIKLFAVGI--KN 163 Query: 352 EGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + L++ FF VND L ++ ++ Sbjct: 164 ADPEELKRVASQPTSDFFFFVNDFSILRTLLPLVSRRVCTT 204 Score = 45.3 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 30/206 (14%), Positives = 64/206 (31%), Gaps = 24/206 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 P E+ ++ + A+ +V++G ++Y+ S++ Sbjct: 1041 VVFLPHATQDNAHRA-EATRRVLERLVSALGPLGPQAVQVGLLSYSHRPSPLFPLNGSHD 1099 Query: 259 LNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 L + R+ + N A+ AYR + + ++ + D Sbjct: 1100 LGIILQRIRDMPYMDPSGNNLGTAVVTAYRYVLAPDAPGRR----QHVPGVMVLLVDEPL 1155 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-----QFFAVN 371 G + AG+ + V + + LR+ G FFAV+ Sbjct: 1156 RGDIFSP--------IREAQAAGLNV--VMLGMAGADPEQLRRLA--PGMDSVQTFFAVD 1203 Query: 372 DSRELLESFDKITDKIQEQSVRIAPN 397 D L ++ + + + S P Sbjct: 1204 DGPSLDQAVSGLAAALCQASFATQPR 1229 >gi|163815506|ref|ZP_02206879.1| hypothetical protein COPEUT_01671 [Coprococcus eutactus ATCC 27759] gi|158449143|gb|EDP26138.1| hypothetical protein COPEUT_01671 [Coprococcus eutactus ATCC 27759] Length = 550 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 19/136 (13%), Positives = 44/136 (32%), Gaps = 13/136 (9%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN-LNEVKSRLNKLNPYENTN 276 L NS+ Q + + N + N + + +N L T Sbjct: 392 NQLKNSLTNGAQYINDNNYVGLVSYSNSVTIEVPIAQFDLNQRSYFQGAVNNLIASGGTA 451 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 +Y A+ A + + K + + ++DG + + +I +R Sbjct: 452 SYDAVVVAVKMITEAKAQHPDAK------CMLFLLSDGYANNGYSMD------EITSALR 499 Query: 337 NAGMKIYSVAVSAPPE 352 +G+ +Y++ + Sbjct: 500 TSGIPVYTIGYGDDAD 515 >gi|296815358|ref|XP_002848016.1| U-box domain containing protein [Arthroderma otae CBS 113480] gi|238841041|gb|EEQ30703.1| U-box domain containing protein [Arthroderma otae CBS 113480] Length = 748 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 27/233 (11%), Positives = 70/233 (30%), Gaps = 26/233 (11%) Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 S + K D+ +L S S + + +D+ Sbjct: 49 NSMVVSIQPPLKPKDDVPHVPCDIVLVIDISASMNSAAPIPTGESGGEDTGLSILDLTKH 108 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYE 273 +A ++ ++ N + R+ + + I + N ++V + ++ L+ Sbjct: 109 AAKTIIQTL--------NENDRLAVVTFCTEIRVAFELEFMSEENKSKVLAAIDCLHGIS 160 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 +TN + + + L + + ++ +TDG + Sbjct: 161 STNLWHGIKEGLKVLATNSTQGNV--------QALLVLTDGAPNH-MCPAQGYVPKLRQT 211 Query: 334 YMRNAGMK-----IYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 + + + I++ LL+ + G F + D+ + F Sbjct: 212 LLDHRDLTGSLPLIHTFGFGYYLRSP-LLQSIAEIGGGTFAFIPDAGMIGTVF 263 >gi|62881|emb|CAA39981.1| type VI collagen subunit alpha2 [Gallus gallus] Length = 918 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/228 (13%), Positives = 78/228 (34%), Gaps = 26/228 (11%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 ++ P ++ T A P +D + + + ++ + + + Sbjct: 34 VTSCTEKTDCPISVYFV-IDTSESIALQTVPIQSLVDQIKQFIPRFIEKLENEVYQ-NQV 91 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 S+ + V +PL+ + + ++L + T T A+ + ++ ++ Sbjct: 92 SITWMFGGLHYSDVVEIYSPLTRSKDTYLTKLRAIRYLGRGTFTDCAISNMTQQFQSQTA 151 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 KF + ITDG +G+ E R+ G+K+++VA + Sbjct: 152 RD---------VKFAVVITDGHVTGSPCGG----MKMQAERARDMGIKLFAVA-PSEDVY 197 Query: 354 QDLLRKCTDSS-GQF---FAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ + + + L I + E+ ++ + Sbjct: 198 EQGLREIASPPHDLYRSNYTITPKDALH-----IDENTIERIIKAMKH 240 Score = 44.1 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 52/154 (33%), Gaps = 19/154 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS--- 256 + + + N+V+ + ++ K+ + R+G + Y+ Sbjct: 622 SSESIGYTNFTLEKNFVVNVVSRLGSIAKDPKSETGARVGVVQYSHEGTFEAIKLDDERI 681 Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N+L+ K + +L T T A+ AY +L E + F + ITDG Sbjct: 682 NSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREKA-------QVFAVVITDGR 734 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + N +C + + ++ + Sbjct: 735 ---YDPRDDDKNLGALC----GRDVLVNTIGIGD 761 >gi|45384382|ref|NP_990679.1| collagen alpha-2(VI) chain precursor [Gallus gallus] gi|115352|sp|P15988|CO6A2_CHICK RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor gi|62877|emb|CAA33144.1| type VI collagen alpha-2 subunit preprotein [Gallus gallus] gi|62882|emb|CAA39982.1| type VI collagen subunit alpha2 [Gallus gallus] Length = 1022 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 31/228 (13%), Positives = 78/228 (34%), Gaps = 26/228 (11%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 ++ P ++ T A P +D + + + ++ + + + Sbjct: 34 VTSCTEKTDCPISVYFV-IDTSESIALQTVPIQSLVDQIKQFIPRFIEKLENEVYQ-NQV 91 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 S+ + V +PL+ + + ++L + T T A+ + ++ ++ Sbjct: 92 SITWMFGGLHYSDVVEIYSPLTRSKDTYLTKLRAIRYLGRGTFTDCAISNMTQQFQSQTA 151 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 KF + ITDG +G+ E R+ G+K+++VA + Sbjct: 152 RD---------VKFAVVITDGHVTGSPCGG----MKMQAERARDMGIKLFAVA-PSEDVY 197 Query: 354 QDLLRKCTDSS-GQF---FAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ + + + L I + E+ ++ + Sbjct: 198 EQGLREIASPPHDLYRSNYTITPKDALH-----IDENTIERIIKAMKH 240 Score = 44.1 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 52/154 (33%), Gaps = 19/154 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS--- 256 + + + N+V+ + ++ K+ + R+G + Y+ Sbjct: 622 SSESIGYTNFTLEKNFVVNVVSRLGSIAKDPKSETGARVGVVQYSHEGTFEAIKLDDERI 681 Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N+L+ K + +L T T A+ AY +L E + F + ITDG Sbjct: 682 NSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREKA-------QVFAVVITDGR 734 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + N +C + + ++ + Sbjct: 735 ---YDPRDDDKNLGALC----GRDVLVNTIGIGD 761 >gi|218693343|gb|ACL01175.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 100/383 (26%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGETPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|220905850|ref|YP_002481161.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] gi|219862461|gb|ACL42800.1| von Willebrand factor type A [Cyanothece sp. PCC 7425] Length = 236 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 57/168 (33%), Gaps = 14/168 (8%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 KI L + + +Q+ + N V + I ++ G + TP + + R Sbjct: 37 KIQSLNAAIKETIPMLQQTAADNPNAQVLVRAIKFSDGAEWHIPTPTPVD----QFRWTD 92 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L T+ A+ +L S L ++ I+DG+ + Sbjct: 93 LTAGGVTDLGMALEMVAEQLRVPPMSERA------LPPVLVLISDGQPTDDFGSGLKAL- 145 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + +++ ++AV + L + S + N+ +L Sbjct: 146 --MAQPWGQKAVRV-AIAVGQDANHEVLQKFIGPSELRVLQANNPDQL 190 >gi|190892740|ref|YP_001979282.1| vault protein inter-alpha-trypsin domain [Rhizobium etli CIAT 652] gi|190698019|gb|ACE92104.1| putative vault protein inter-alpha-trypsin domain [Rhizobium etli CIAT 652] Length = 794 Score = 49.9 bits (117), Expect = 7e-04, Method: Composition-based stats. Identities = 19/134 (14%), Positives = 45/134 (33%), Gaps = 16/134 (11%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N + + + L+ T PA+ A R + + V+F+TDG Sbjct: 413 DNREKAITYVRGLSADGGTEMLPALEDALRNQGPVASGA---------LRQVVFLTDGAI 463 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + ++++V + + P + + G F A+ + ++ Sbjct: 464 GNEQQLFQEI-------SANRGDARVFTVGIGSAPNTYFMTKAAEIGRGTFTAIGSTDQV 516 Query: 377 LESFDKITDKIQEQ 390 ++ K+Q Sbjct: 517 ASRMGELFAKLQNP 530 >gi|309362046|emb|CAP28695.2| hypothetical protein CBG_09096 [Caenorhabditis briggsae AF16] Length = 516 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 63/173 (36%), Gaps = 14/173 (8%) Query: 211 DVLIESAGNLVNSI-QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + + LV+ + + + V + T + + + L N V+ + + Sbjct: 262 HLFSDQKKFLVDRVLGNINIHPEAVRVALITYSGQAFVHFKFNSFLYGNNTSVQGFVKNI 321 Query: 270 NPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T T A+ A+ L ++ S+ + K + +TDG + + Sbjct: 322 RSIKGTTATNVALMDAFDLLTSKDPSTGIR---EGVPKMALVLTDGHSHRS--------P 370 Query: 329 LQICEYMRNAGMKIYSVAVSA-PPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + E MR AG+ + +V+V+ P + LR S + F + E F Sbjct: 371 KDMAEKMRAAGIIMIAVSVTPRPLVDEAELRLIAGSEKRAFTPPNLHEFESEF 423 >gi|309362033|emb|CAP28676.2| hypothetical protein CBG_09117 [Caenorhabditis briggsae AF16] gi|309362045|emb|CAP28694.2| hypothetical protein CBG_09097 [Caenorhabditis briggsae AF16] Length = 787 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 63/173 (36%), Gaps = 14/173 (8%) Query: 211 DVLIESAGNLVNSI-QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + + LV+ + + + V + T + + + L N V+ + + Sbjct: 262 HLFSDQKKFLVDRVLGNINIHPEAVRVALITYSGQAFVHFKFNSFLYGNNTSVQGFVKNI 321 Query: 270 NPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T T A+ A+ L ++ S+ + K + +TDG + + Sbjct: 322 RSIKGTTATNVALMDAFDLLTSKDPSTGIR---EGVPKMALVLTDGHSHRS--------P 370 Query: 329 LQICEYMRNAGMKIYSVAVSA-PPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + E MR AG+ + +V+V+ P + LR S + F + E F Sbjct: 371 KDMAEKMRAAGIIMIAVSVTPRPLVDEAELRLIAGSEKRAFTPPNLHEFESEF 423 >gi|218693235|gb|ACL01121.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 99/383 (25%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFFNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|51893578|ref|YP_076269.1| magnesium chelatase [Symbiobacterium thermophilum IAM 14863] gi|51857267|dbj|BAD41425.1| magnesium chelatase [Symbiobacterium thermophilum IAM 14863] Length = 741 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 20/147 (13%), Positives = 51/147 (34%), Gaps = 18/147 (12%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 P + + + V+ L ++ P T + + +++ + ++ I Sbjct: 610 YVPFTRSYSAVEDGLARIQPMGLTPLAHGLIRSMELIHSARVRRP----------LLLLI 659 Query: 312 TDGENSGASAYQNTL-NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV 370 TDG + + L + ++ +R + + + + L + +G V Sbjct: 660 TDGIPTVPKWSVDPLADAVEAARQLRAQRIPFTCIGLQP---SRRYLEQLVRQAGGTLHV 716 Query: 371 NDSRELLESFDKITDKIQEQSVRIAPN 397 EL E + + + R+AP+ Sbjct: 717 --VDELSE--ESLIRIAHHERQRLAPH 739 >gi|268572089|ref|XP_002641231.1| Hypothetical protein CBG09097 [Caenorhabditis briggsae] gi|268572157|ref|XP_002641249.1| Hypothetical protein CBG09117 [Caenorhabditis briggsae] Length = 772 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 63/173 (36%), Gaps = 14/173 (8%) Query: 211 DVLIESAGNLVNSI-QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + + LV+ + + + V + T + + + L N V+ + + Sbjct: 262 HLFSDQKKFLVDRVLGNINIHPEAVRVALITYSGQAFVHFKFNSFLYGNNTSVQGFVKNI 321 Query: 270 NPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T T A+ A+ L ++ S+ + K + +TDG + + Sbjct: 322 RSIKGTTATNVALMDAFDLLTSKDPSTGIR---EGVPKMALVLTDGHSHRS--------P 370 Query: 329 LQICEYMRNAGMKIYSVAVSA-PPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + E MR AG+ + +V+V+ P + LR S + F + E F Sbjct: 371 KDMAEKMRAAGIIMIAVSVTPRPLVDEAELRLIAGSEKRAFTPPNLHEFESEF 423 >gi|268572085|ref|XP_002641230.1| Hypothetical protein CBG09096 [Caenorhabditis briggsae] Length = 564 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 31/173 (17%), Positives = 63/173 (36%), Gaps = 14/173 (8%) Query: 211 DVLIESAGNLVNSI-QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + + LV+ + + + V + T + + + L N V+ + + Sbjct: 262 HLFSDQKKFLVDRVLGNINIHPEAVRVALITYSGQAFVHFKFNSFLYGNNTSVQGFVKNI 321 Query: 270 NPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 T T A+ A+ L ++ S+ + K + +TDG + + Sbjct: 322 RSIKGTTATNVALMDAFDLLTSKDPSTGIR---EGVPKMALVLTDGHSHRS--------P 370 Query: 329 LQICEYMRNAGMKIYSVAVSA-PPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + E MR AG+ + +V+V+ P + LR S + F + E F Sbjct: 371 KDMAEKMRAAGIIMIAVSVTPRPLVDEAELRLIAGSEKRAFTPPNLHEFESEF 423 >gi|260785923|ref|XP_002588009.1| hypothetical protein BRAFLDRAFT_125403 [Branchiostoma floridae] gi|229273165|gb|EEN44020.1| hypothetical protein BRAFLDRAFT_125403 [Branchiostoma floridae] Length = 948 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 23/183 (12%), Positives = 60/183 (32%), Gaps = 18/183 (9%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIG---IVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 L + + ++ + G + P+ ++L +K ++ L+ T Sbjct: 492 QLFGDGRYPPTDVTGDTLNSYDGLISTGRHLAEKFELQPIKDSLATLKEKVCDLSAGGTT 551 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY--QNTLNTLQICE 333 PA+ + + + + +I TDG + ++ +I + Sbjct: 552 ALGPALAVCAGVVAD------------KPRSEIILCTDGAANVGVGDVKKDPGFYKKIGD 599 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD-KIQEQSV 392 + R+ + I + + + + +SG + EL+ +I + V Sbjct: 600 FARSHKIVISIIGIEGENVALEQVSAAAATSGGTVNILHPLELVRQIRQIAQNNLVATDV 659 Query: 393 RIA 395 R+ Sbjct: 660 RVT 662 >gi|290981305|ref|XP_002673371.1| predicted protein [Naegleria gruberi] gi|284086954|gb|EFC40627.1| predicted protein [Naegleria gruberi] Length = 353 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 21/178 (11%), Positives = 52/178 (29%), Gaps = 18/178 (10%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSRLNKLNPYENTN 276 + N + I + R+G + ++ + +L++ +K + K+ +TN Sbjct: 10 QVANQVICEIIDNLREFERLGIVLFDDKAETFLPLTIVQDLDKKSLKETVLKIKEKGSTN 69 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 M +I++TD + + T Sbjct: 70 FEAGMQRGIDLFST------LDSSDLSNSNRIIYLTDACPNVGGTDTLDILTKD----AN 119 Query: 337 NAGMKIYS--VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES----FDKITDKIQ 388 + I+S + + + +F+V + + + F+ I I Sbjct: 120 SGPYNIFSTFIGIGLDFNSDIVEELTQVRGCNYFSVKSTEDFTKILNQDFNYIVTPIC 177 >gi|270008952|gb|EFA05400.1| hypothetical protein TcasGA2_TC015572 [Tribolium castaneum] Length = 767 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 17/148 (11%), Positives = 51/148 (34%), Gaps = 8/148 (5%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N+ + K + TN + + + + +T+ + +IF+TDG Sbjct: 260 DNIKKAKRIKDDDVNMGCTNIIGGLAVGLYLVRRTLQKFYEKNVATKHQPMIIFLTDGLP 319 Query: 317 SGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVN 371 + + + + +I + I+S++ + L + + G + Sbjct: 320 NVGISNPDEI--TKIVTKINQGTNRAAIFSMSFGEDADKNFLKKLSAQNLGFSRHIYEAA 377 Query: 372 DSR-ELLESFDKITDKIQEQSVRIAPNR 398 D+ +L + ++ + ++ Sbjct: 378 DAALQLQNFYRTVSSPLLRDVRFKYVDK 405 >gi|189238321|ref|XP_972429.2| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] Length = 653 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 17/148 (11%), Positives = 51/148 (34%), Gaps = 8/148 (5%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N+ + K + TN + + + + +T+ + +IF+TDG Sbjct: 334 DNIKKAKRIKDDDVNMGCTNIIGGLAVGLYLVRRTLQKFYEKNVATKHQPMIIFLTDGLP 393 Query: 317 SGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVN 371 + + + + +I + I+S++ + L + + G + Sbjct: 394 NVGISNPDEI--TKIVTKINQGTNRAAIFSMSFGEDADKNFLKKLSAQNLGFSRHIYEAA 451 Query: 372 DSR-ELLESFDKITDKIQEQSVRIAPNR 398 D+ +L + ++ + ++ Sbjct: 452 DAALQLQNFYRTVSSPLLRDVRFKYVDK 479 >gi|311033467|sp|A8K7I4|CLCA1_HUMAN RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Short=hCLCA1; AltName: Full=Calcium-activated chloride channel protein 1; Short=CaCC-1; Short=hCaCC-1; Flags: Precursor gi|56203695|emb|CAI22169.1| CLCA family member 1, chloride channel regulator [Homo sapiens] Length = 914 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 58/151 (38%), Gaps = 26/151 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 ++ + + RL T+ + A+ + + + ++ +T Sbjct: 362 INSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTD---------GSEIVLLT 411 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DGE++ S N + + +G I++VA+ P Q+L + G +D Sbjct: 412 DGEDNTISGCFNEV---------KQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASD 461 Query: 373 ---SRELLESFDKITD---KIQEQSVRIAPN 397 + L+++F ++ + ++S+++ Sbjct: 462 QVQNNGLIDAFGALSSGNGAVSQRSIQLESK 492 >gi|110611231|ref|NP_001276.2| calcium-activated chloride channel regulator 1 precursor [Homo sapiens] gi|146327635|gb|AAI41452.1| Chloride channel accessory 1 [synthetic construct] gi|162318850|gb|AAI56806.1| Chloride channel accessory 1 [synthetic construct] Length = 914 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 58/151 (38%), Gaps = 26/151 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 ++ + + RL T+ + A+ + + + ++ +T Sbjct: 362 INSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTD---------GSEIVLLT 411 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DGE++ S N + + +G I++VA+ P Q+L + G +D Sbjct: 412 DGEDNTISGCFNEV---------KQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASD 461 Query: 373 ---SRELLESFDKITD---KIQEQSVRIAPN 397 + L+++F ++ + ++S+++ Sbjct: 462 QVQNNGLIDAFGALSSGNGAVSQRSIQLESK 492 >gi|46199004|ref|YP_004671.1| hypothetical protein TTC0696 [Thermus thermophilus HB27] gi|46196628|gb|AAS81044.1| hypothetical membrane associated protein [Thermus thermophilus HB27] Length = 706 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 24/121 (19%), Positives = 45/121 (37%), Gaps = 20/121 (16%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L T A A R L+ G +K V+ +TDG + A Sbjct: 373 SLRAGGGTVLGGAFREAVRLLH----------GVPGERKAVLVLTDGLIADAKEP----- 417 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 I + + +G+++ ++A+ P L++ G+F+ REL F + + Sbjct: 418 ---ILDLAQTSGVEVSALALG-PDADAPFLKELARRGGGRFYQAPSPRELPRLFLREGQE 473 Query: 387 I 387 + Sbjct: 474 V 474 >gi|332809378|ref|XP_003308230.1| PREDICTED: LOW QUALITY PROTEIN: epithelial chloride channel protein-like [Pan troglodytes] Length = 901 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 16/113 (14%), Positives = 42/113 (37%), Gaps = 21/113 (18%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ + ++ + + +I +TDGE+ + Sbjct: 379 EADGGTSICRGLKAGFQAISQS--------NQSTFGSEIILLTDGED----------YQI 420 Query: 330 QIC-EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN-DSRELLESF 380 +C ++ +G I+++A+ P ++L + G F + + L+++F Sbjct: 421 SLCFGEVKQSGTVIHTIALG-PSADEELETLSNMTGGHRFYAHKNINGLIDAF 472 >gi|320450000|ref|YP_004202096.1| hypothetical protein TSC_c09220 [Thermus scotoductus SA-01] gi|320150169|gb|ADW21547.1| conserved hypothetical protein [Thermus scotoductus SA-01] Length = 691 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 20/138 (14%), Positives = 50/138 (36%), Gaps = 21/138 (15%) Query: 254 PLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 P++ E +S L + T A A R L + ++ ++ Sbjct: 346 PMTEQGKKEAESLLLSVRAGGGTVLGSAFREAVRLL----QGVPVERKG------ILVLS 395 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 DG + + + +G+++ ++A+ A + L+ + G+++ Sbjct: 396 DGL------ISDPQDPILALAEA--SGLEVSAMALGADA-DRAFLKVLAERGGGRYYQAA 446 Query: 372 DSRELLESFDKITDKIQE 389 ++EL F K ++ + Sbjct: 447 TAQELPRLFLKEGQEVFQ 464 >gi|221123691|ref|XP_002160228.1| PREDICTED: similar to CnPolydom [Hydra magnipapillata] Length = 954 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 56/194 (28%), Gaps = 20/194 (10%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 D+ L++ + + RI + + L Sbjct: 66 YSTGSFQGVTMTGFDIGKTFINALLSKVHISFNA-----TRIAIGTFGTNHKIDINFILR 120 Query: 257 NNLN----EVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + + + K K+ Y TN A+ + + + + VI + Sbjct: 121 PDYSMHKCKFKKDFEKIRIYGGMTNLRGALQDSLNIFRELDSNPDTHKKRHKTNRVVILL 180 Query: 312 TDGENSG-------ASAYQNTL--NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +DGE + + + L N I +R +++Y++ V + +L Sbjct: 181 SDGEGNVMDNPNGRGVTHNDGLARNPHDIAHNLRLGLVEVYTIGV-TSAPDRAVLEGLAT 239 Query: 363 SSGQFFAVNDSREL 376 F D +L Sbjct: 240 EKNLFLFSKDFTDL 253 >gi|198420538|ref|XP_002128804.1| PREDICTED: similar to hedgling [Ciona intestinalis] Length = 420 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 21/96 (21%), Positives = 37/96 (38%), Gaps = 3/96 (3%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ-ICE 333 T A+ HAY E+ + + +++ +I ITDG + +T LQ + + Sbjct: 285 TCLVDALIHAYVEMSATINGGRQSQNN--VEQDIILITDGCANCHHYGVSTEQVLQNLAD 342 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 + GM IY + V + LR + Sbjct: 343 FFVQQGMHIYVIGVGLQHACRQKLRILAQGGRCYHF 378 >gi|221134029|ref|ZP_03560334.1| vault protein inter-alpha-trypsin [Glaciecola sp. HTCC2999] Length = 757 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 22/144 (15%), Positives = 51/144 (35%), Gaps = 22/144 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + E L ++ TN A+ + +L + VIF+TDG + Sbjct: 450 SKAEAMRFLRHVDSDGGTNMQDALALSLTQLLDSSTGLTQ----------VIFVTDGSIN 499 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSREL 376 QI E + + ++++V + A P + G + ++D E+ Sbjct: 500 NER-----ELLKQIAEQLGDK--RLFTVGIGAAPNSH-FMEYAAMLGKGTYTYIDDLTEI 551 Query: 377 LESFDKITDKIQEQS---VRIAPN 397 + +++ +++ P+ Sbjct: 552 QPKMAYLFSQLRSPMITDIQLTPS 575 >gi|116622522|ref|YP_824678.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225684|gb|ABJ84393.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 324 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 26/158 (16%), Positives = 61/158 (38%), Gaps = 27/158 (17%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 PL+ ++ +++++L T A+ A EL ++ Sbjct: 138 NDQPKMVVPLTRDVEQIQNQLTWAQSKGRTALLDAIFLAMSELKKSTKNR---------- 187 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV----------SAPPEGQD 355 K ++ I+DG ++ + ++ + +R + IY++ V G Sbjct: 188 KALLIISDGGDNSSRYTESEVR-----NLVRENDVLIYAIGVYEFAGGRMRTPEEAGGPG 242 Query: 356 LLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 LL + ++ + G+ D+ EL + KI +++ + V Sbjct: 243 LLTELSEQTGGRHLPA-DANELPDIAAKIGVELRNRYV 279 >gi|4585469|gb|AAD25487.1|AF127036_1 calcium-activated chloride channel protein 1 [Homo sapiens] gi|119593592|gb|EAW73186.1| chloride channel, calcium activated, family member 1 [Homo sapiens] gi|189067292|dbj|BAG37002.1| unnamed protein product [Homo sapiens] Length = 914 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 58/151 (38%), Gaps = 26/151 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 ++ + + RL T+ + A+ + + + ++ +T Sbjct: 362 INSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTD---------GSEIVLLT 411 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DGE++ S N + + +G I++VA+ P Q+L + G +D Sbjct: 412 DGEDNTISGCFNEV---------KQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASD 461 Query: 373 ---SRELLESFDKITD---KIQEQSVRIAPN 397 + L+++F ++ + ++S+++ Sbjct: 462 QVQNNGLIDAFGALSSGNGAVSQRSIQLESK 492 >gi|4009458|gb|AAC95428.1| calcium-dependent chloride channel-1 [Homo sapiens] Length = 914 Score = 49.9 bits (117), Expect = 8e-04, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 58/151 (38%), Gaps = 26/151 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 ++ + + RL T+ + A+ + + + ++ +T Sbjct: 362 INSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTD---------GSEIVLLT 411 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DGE++ S N + + +G I++VA+ P Q+L + G +D Sbjct: 412 DGEDNTISGCFNEV---------KQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASD 461 Query: 373 ---SRELLESFDKITD---KIQEQSVRIAPN 397 + L+++F ++ + ++S+++ Sbjct: 462 QVQNNGLIDAFGALSSGNGAVSQRSIQLESK 492 >gi|270007556|gb|EFA04004.1| hypothetical protein TcasGA2_TC014153 [Tribolium castaneum] Length = 813 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 20/157 (12%), Positives = 50/157 (31%), Gaps = 10/157 (6%) Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 Y + L V+ + + + + L + + Sbjct: 361 HYGNLEPYVKKLFLPRTSKAVRQNIEAARSTIYDKSGLGLSNPVYALEVGLFLAKRIQDN 420 Query: 302 --TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 R + +IF+TD + QN + + N + I+S++ ++ +R+ Sbjct: 421 LPNRYQPMIIFLTDSYPTVGMTSQNEIINT--VTKVNNNRIPIFSLSFG-EDVDKNFMRQ 477 Query: 360 CTDSS----GQFFAVNDSR-ELLESFDKITDKIQEQS 391 + G + D+ ++L + I+ + Q Sbjct: 478 LAAKNLGFSGHIYEALDASVQILNFYRSISSPVLSQV 514 >gi|226308012|ref|YP_002767972.1| hypothetical protein RER_45250 [Rhodococcus erythropolis PR4] gi|226187129|dbj|BAH35233.1| hypothetical protein RER_45250 [Rhodococcus erythropolis PR4] Length = 551 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 47/129 (36%), Gaps = 13/129 (10%) Query: 256 SNNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 +++ +++ SR++ L+ T Y ++ AYR + + + VI +TD Sbjct: 423 TDHRSKLLSRIDNLSSIVGGGTGLYDSVLAAYRSMQQSYDPASI--------NSVILLTD 474 Query: 314 GENSGASAYQNTLNTLQICEYMR-NAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 G N S + + I ++ V D+L + + + G Sbjct: 475 GANDDPSGIALQELLDTLTREQDPMRPVPIITIGV-TDDADTDVLEQISTLTGGNSHFAP 533 Query: 372 DSRELLESF 380 ++ + F Sbjct: 534 TPADIPKVF 542 >gi|189237279|ref|XP_973594.2| PREDICTED: similar to inter-alpha-trypsin inhibitor family heavy chain-related protein [Tribolium castaneum] Length = 750 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 20/157 (12%), Positives = 50/157 (31%), Gaps = 10/157 (6%) Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 Y + L V+ + + + + L + + Sbjct: 361 HYGNLEPYVKKLFLPRTSKAVRQNIEAARSTIYDKSGLGLSNPVYALEVGLFLAKRIQDN 420 Query: 302 --TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 R + +IF+TD + QN + + N + I+S++ ++ +R+ Sbjct: 421 LPNRYQPMIIFLTDSYPTVGMTSQNEIINT--VTKVNNNRIPIFSLSFG-EDVDKNFMRQ 477 Query: 360 CTDSS----GQFFAVNDSR-ELLESFDKITDKIQEQS 391 + G + D+ ++L + I+ + Q Sbjct: 478 LAAKNLGFSGHIYEALDASVQILNFYRSISSPVLSQV 514 >gi|2689175|emb|CAA06010.1| hypothetical protein [Borrelia burgdorferi] Length = 328 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 60/172 (34%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + +L+ ++ + + A L S LK+ + Sbjct: 146 VVPITTDREFFNKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEALKRSI 195 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------------------- 349 + +TDG + ++ Q+ + +KIYS+ + + Sbjct: 196 VVLTDGVVNSDEIDKD-----QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGS 250 Query: 350 --PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + ++ + G F++VND + + K E+ ++IA + Sbjct: 251 FKEVYDPSMLVEISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAVD 302 >gi|330794740|ref|XP_003285435.1| hypothetical protein DICPUDRAFT_76377 [Dictyostelium purpureum] gi|325084610|gb|EGC38034.1| hypothetical protein DICPUDRAFT_76377 [Dictyostelium purpureum] Length = 1962 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 58/189 (30%), Gaps = 19/189 (10%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEK---KNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ + G +V+ I+++ + +++L Sbjct: 1645 TGSMGSELTQVKTKLGQIVDRIKESKVNVRVANVFYNDHHQVYLQTDESPTVQVDFTSDL 1704 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE---- 315 +E+KS++ LN + A+ ++ K IFI D Sbjct: 1705 DEIKSKIESLNVDGGDDNAEAVADGLYQVSKLNFRR-------NSTKVCIFIADAPAHGF 1757 Query: 316 ----NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAV 370 ++ + + +++ + G+ Y+V+ D L D S G + Sbjct: 1758 DEAIDNFPNGCPCGHDCIELVRKIVKMGVTFYTVSCRPTHSSSDFLCAIADISEGACVEL 1817 Query: 371 NDSRELLES 379 S L + Sbjct: 1818 QSSEYLSDF 1826 >gi|198424960|ref|XP_002123531.1| PREDICTED: polydomain protein-like [Ciona intestinalis] Length = 3458 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 25/268 (9%), Positives = 65/268 (24%), Gaps = 25/268 (9%) Query: 126 SALTNLSLRSTGIIERSSENLAI--SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 + + + + + + + ++ + Sbjct: 3193 FNHIDNGQVTCTNSNYHHSQCSFQCNPGTLTNPITHHGSHVMTCQSNGRWGGTPPCCAES 3252 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 P ++ + D G + I++ + V + Sbjct: 3253 CPTRTKVDLYIVLESSTSG------ESDDWNRLLGFVETLIRRFSIDDSTTLVGLLRYHR 3306 Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGST 302 N+ ++G N+ E+ + L +N + H +S Sbjct: 3307 NVDVIGEVSLGRYRNIEELSRAVKFLPYGGYGSNIGQTLEHLAT------DSLLVDTNRP 3360 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 ++ VI TDG + N ++ + V + L + Sbjct: 3361 NVRDHVILFTDG--------SSDDNVTASAGLLKQR-ATVQVVGMDGTKTQLQQLEEIAT 3411 Query: 363 SSGQFFAVNDSREL-LESFDKITDKIQE 389 V++ +L D I + + Sbjct: 3412 KPSYVHTVSNISQLGTSVIDAIIMNMCQ 3439 >gi|86147465|ref|ZP_01065777.1| hypothetical protein MED222_21464 [Vibrio sp. MED222] gi|85834758|gb|EAQ52904.1| hypothetical protein MED222_21464 [Vibrio sp. MED222] Length = 421 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 20/177 (11%), Positives = 48/177 (27%), Gaps = 1/177 (0%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 I + V A+D+ H++ + ++Q+A+D+A L+ + + + + + T Sbjct: 22 IALLVFLAVSALAVDINHMLVNKTRLQNAVDSAALAAATILDNSKDQAAVSAEVTSTLNA 81 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 H S + T E Sbjct: 82 MAGASGNHEFNFSTAVVSVEYSNDPQSFAGTTTFGADDDVYVRVNVSSLDMDEFFIQM-F 140 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + + + + N + I + + S D+ ++D Sbjct: 141 GLAKDVSATAVAGPSSGLQVVNNIVPIGVCIGDGTSDNDINEDGYDDATGEAITNVF 197 >gi|315052466|ref|XP_003175607.1| U-box domain-containing protein [Arthroderma gypseum CBS 118893] gi|311340922|gb|EFR00125.1| U-box domain-containing protein [Arthroderma gypseum CBS 118893] Length = 740 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 31/211 (14%), Positives = 64/211 (30%), Gaps = 19/211 (9%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE-SAGNLVNSIQKAIQEKKNLSVR 237 +P P + + S + AP P + S +L + I E N + R Sbjct: 63 NDVPHVPCDIVLVIDISGSMNSSAPIPTGERGGEDTGLSILDLTKHAARTIIETLNENDR 122 Query: 238 IGTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 + + + + N N + V +NKL +TN + + + L Sbjct: 123 LAVVTFCTEVKVAFELDFMNKENKSMVLRAINKLYGTSSTNLWHGIKEGLKVLTATPVRE 182 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK-----IYSVAVSAP 350 + + ++ +TDG + + + I++ Sbjct: 183 NV--------QSLLVLTDGAPNH-MCPAQGYVPKLRQTLLDHRNSTGSLPLIHTFGFGYY 233 Query: 351 PEGQDLLRKCTDSSGQFF-AVNDSRELLESF 380 LL+ + G F + D+ + F Sbjct: 234 LRSP-LLQSIAEIGGGIFAFIPDAGMIGTVF 263 >gi|218693363|gb|ACL01185.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 100/383 (26%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|56418593|ref|YP_145911.1| hypothetical protein GK0058 [Geobacillus kaustophilus HTA426] gi|261417558|ref|YP_003251240.1| hypothetical protein GYMC61_0058 [Geobacillus sp. Y412MC61] gi|297528433|ref|YP_003669708.1| hypothetical protein GC56T3_0057 [Geobacillus sp. C56-T3] gi|319765215|ref|YP_004130716.1| hypothetical protein GYMC52_0058 [Geobacillus sp. Y412MC52] gi|56378435|dbj|BAD74343.1| hypothetical conserved protein [Geobacillus kaustophilus HTA426] gi|261374015|gb|ACX76758.1| conserved hypothetical protein [Geobacillus sp. Y412MC61] gi|297251685|gb|ADI25131.1| hypothetical protein GC56T3_0057 [Geobacillus sp. C56-T3] gi|317110081|gb|ADU92573.1| hypothetical protein GYMC52_0058 [Geobacillus sp. Y412MC52] Length = 246 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 14/103 (13%), Positives = 39/103 (37%), Gaps = 10/103 (9%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV----SAPPEGQD 355 + ++ ITDG ++ + + R G+ + + + + G+ Sbjct: 1 MRKGTLRQILLITDGCSNHGE------DPAAMAALAREQGITVNVIGILDQGAMDENGRR 54 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + + G V +++L ++ +T + Q+++ NR Sbjct: 55 EIEAIAAAGGGMSQVVYAKQLSQTVQMVTRQAMTQTLQGLVNR 97 >gi|330811037|ref|YP_004355499.1| hypothetical protein PSEBR_a4091 [Pseudomonas brassicacearum subsp. brassicacearum NFM421] gi|327379145|gb|AEA70495.1| Hypothetical protein; putative exported protein [Pseudomonas brassicacearum subsp. brassicacearum NFM421] Length = 426 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 18/140 (12%), Positives = 42/140 (30%), Gaps = 8/140 (5%) Query: 18 DLAHIMYIRNQMQSALDAAVLSGCASIV--------SDRTIKDPTTKKDQTSTIFKKQIK 69 D H++ + ++Q+A+DAA L G ++ + T + ++ Sbjct: 21 DGGHMLLNKTRLQNAVDAAALGGAKTLSQVTGGMNMASTTRAAALDTLSRNASAVGNAEL 80 Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 G+ A +++ + +Y+ S A Y++ + S Sbjct: 81 ATAVAGNPGAFAAVELSSSVYGPFSYPGPTDAKYVRVSVANYQLNGFFWSFVQSMGSGGL 140 Query: 130 NLSLRSTGIIERSSENLAIS 149 + S Sbjct: 141 GGKAVAAIATAGPSPTSPCD 160 >gi|332221819|ref|XP_003260062.1| PREDICTED: calcium-activated chloride channel regulator 1 [Nomascus leucogenys] Length = 914 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 59/152 (38%), Gaps = 28/152 (18%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 ++ + + RL T+ + A+ + + + ++ +T Sbjct: 362 INSGSDRDTLAKRLPA-AASGGTSICRGLRSAFTVIKKKYPTD---------GSEIVLLT 411 Query: 313 DGENSGASAYQNTLNTLQIC-EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 DGE++ T+ C ++ +G I++VA+ P Q+L + G + Sbjct: 412 DGEDN----------TISECFNEVKQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYAS 460 Query: 372 D---SRELLESFDKITD---KIQEQSVRIAPN 397 D + L+++F ++ + ++S+++ Sbjct: 461 DQVQNNGLIDAFGALSSGNGAVSQRSIQLESK 492 >gi|227114885|ref|ZP_03828541.1| hypothetical protein PcarbP_18070 [Pectobacterium carotovorum subsp. brasiliensis PBR1692] Length = 539 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 48/457 (10%), Positives = 108/457 (23%), Gaps = 102/457 (22%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 A+ + + +D + Q++ + + ++ + Sbjct: 41 ALGAMALLVTAAFIVDTSTATGDATQIKR--------ATDAAALAVAHQATINGEEYSQE 92 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL-- 120 K ++K + + D + + +T+ +N+ + +E L L Sbjct: 93 ETNKLAYDYVKNNLGMNKALSDKLEVGDVAVTEGRNSETRKTYTVTVAFETKPSLLNLGA 152 Query: 121 ---------------------------KGLIPSALTNLSLRSTGIIERSSENLAISICM- 152 RS SS + Sbjct: 153 RQQEVFSTAEVINRPTEVAFVMPVTGDMSEGDIRSLKSVSRSFVERMLSSADGKRDNLWL 212 Query: 153 -VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT---------------- 195 ++ S+S+ + N ++ L PP + F S + Sbjct: 213 SLVPYSQSVSVYDAEDANRIRRWSAPGALNPPELRSLFASGVVSSLADRRFPDRRANLLC 272 Query: 196 -KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT- 253 + V + Q +S R ++ + Sbjct: 273 MYRGLGREENFFWDEPPVGQFKVYYRHDLPQNGSPGAPPISWRGPNPDFDDTDAVDTRWI 332 Query: 254 ------------PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 PL+N +++ R+++ + N N M A L S + Sbjct: 333 VADRGCPNAALMPLTNEESKLNQRIDQFSARFNVNYAIGMSWAGAALSPNMRGSDGWGDA 392 Query: 302 TRL-----------KKFVIFITDGENSGASAYQNTLN--------------------TLQ 330 T +K ++ + + + N Sbjct: 393 TLPLDFNLDGNGDGQKVIVMLANTIGNWFDTDSYNFNRNAFSGQTGSDPARTFAAQRFRD 452 Query: 331 ICEYMRNAGMKIYSVAV--SAPPEGQDLLRKCTDSSG 365 +C R +K Y V + P + L + G Sbjct: 453 LCSSFRARNIKFYFVGIRPGDPEDFGRNLFDAEATPG 489 >gi|118093056|ref|XP_421769.2| PREDICTED: similar to AMACO [Gallus gallus] Length = 800 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 55/162 (33%), Gaps = 21/162 (12%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKE 293 VR+G + ++ EVK R+ + T T A+ + + + Sbjct: 86 RVRVGMVQFSSAPHLEFSLDSYLTKQEVKERIKRTAFRGGSTETGRALKYILHKGFPGGR 145 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + + K +I I+DG++ G++A ++ G +++V + P Sbjct: 146 -------NLTVPKILIIISDGKSQGSTAVP--------AMQVKERGTTVFAVGIKFPRWE 190 Query: 354 QDLLRKCTD--SSGQFFAVNDSRE-LLESFDKITDKIQEQSV 392 + L + D+ + + ++ I + Sbjct: 191 E--LHAVASEPTEQHVLFAGDASDAANGLYSALSSSICSVTT 230 >gi|47226324|emb|CAG09292.1| unnamed protein product [Tetraodon nigroviridis] Length = 752 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 23/174 (13%), Positives = 60/174 (34%), Gaps = 11/174 (6%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + E + + + A + R+ I + + PL+++ + Sbjct: 577 GQSNLQYVREFVQKVSDRLVLARSRTDRMRARMALIEFGKENETHVAFPLTHDQAAISDG 636 Query: 266 LNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + +L +++ PA+ H ++ + + +FITDG + + Sbjct: 637 IARLPYFDSSSSVGPAIIHTIDKVLGKGPARKTRRNVEIS---FVFITDG-------FTD 686 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + N + MR A + +A + + L++ + F +L + Sbjct: 687 SRNLEEAVSAMRGAQVVPTVIATGNDVDEEVLMKLAMGDAEAIFKAKAVTDLSQ 740 >gi|331266438|ref|YP_004326068.1| hypothetical protein SOR_1070 [Streptococcus oralis Uo5] gi|326683110|emb|CBZ00728.1| conserved hypothetical protein [Streptococcus oralis Uo5] Length = 863 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 43/331 (12%), Positives = 88/331 (26%), Gaps = 57/331 (17%) Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 A+S + + LSL T + ++ + + +V D+S SM++ Sbjct: 34 NADSTTEPQTTLHKTITPISGQDDKYELSLDITSKLGTETQTDPLDVVLVADLSGSMQNQ 93 Query: 164 Y--LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR-KIDVLI-ESAGN 219 + + + K L + + KID + + Sbjct: 94 DVQSFDGRTISRIDALKNTLRGTNGRKGLIDTILSNSNNRLSMVGFGGKIDNKKVDQYWD 153 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYE----- 273 + N SNN K+ + ++ Sbjct: 154 GNKWRLFRPYWPYERMTKYYDGVSPWDDA-NTILGWSNNARAAKTAVYNMSIAGGNSIGT 212 Query: 274 ------NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG--------- 318 TN + A + + + + ++ KK VI ++DG + Sbjct: 213 ESGIGTGTNIGAGLTLANQLMGSARSNA---------KKVVILLSDGFANMVYDANGYTI 263 Query: 319 ASAYQNTLNTLQIC--------EYMRNAGMKI-------YSVAVSAPPEGQDLL------ 357 + N + + + YS+ + Sbjct: 264 YNYNNEDPNIETAPQWFWDRLNNNLNSLSYSLAPTLDGFYSIKFRYSNNVDSITSLQYYI 323 Query: 358 -RKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + + + ND +L +SF ITDKI Sbjct: 324 RQHNASIPNEILSANDEDQLRDSFKNITDKI 354 >gi|224065915|ref|XP_002191423.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain H3, partial [Taeniopygia guttata] Length = 869 Score = 49.5 bits (116), Expect = 8e-04, Method: Composition-based stats. Identities = 19/133 (14%), Positives = 49/133 (36%), Gaps = 11/133 (8%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 NL+E + + ++ TN + + L E + S +I +TDG+ + Sbjct: 330 NLDEARKFVQHISAQGLTNLHGGLMRGIDILNAAHEENLVPKRS---ASIIIMLTDGQPN 386 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG----QFFAVNDS 373 NT + + +Y++ + L + + + + +D+ Sbjct: 387 VGL--SNTHEIENAVKKAIDGRYTLYNLGFGSGV-DYGFLERMALENKGLARRIYPDSDA 443 Query: 374 R-ELLESFDKITD 385 +L +D++++ Sbjct: 444 ALQLQGFYDEVSN 456 >gi|254779584|ref|YP_003057690.1| hypothetical protein HELPY_0994 [Helicobacter pylori B38] gi|254001496|emb|CAX29512.1| Conserved hypothetical protein [Helicobacter pylori B38] Length = 214 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 71/200 (35%), Gaps = 30/200 (15%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 S P + +I VL ++ ++++ + KK L ++ + + G G Sbjct: 24 SGSMSHPLGNSTRIGVLNLCIQKMIETLKQ--EAKKELFSKMAIVTF--GENGAVLHTPF 79 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +++ V L+ T A A + ++ +T + K + I ++DGE Sbjct: 80 DDIKNVNFE--PLSTSGGTPLDQAFRLAKDLIEDK-----DTFPTKFYKPYSILVSDGEP 132 Query: 317 SGASA-------YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 + + + + +C +S+ + + + F Sbjct: 133 NNDKWQEPLSSFHHDGRSAKSVC----------WSIFIGDREANPQVNKD--FGKDGVFY 180 Query: 370 VNDSRELLESFDKITDKIQE 389 +D +L++ F+ +T I + Sbjct: 181 ADDVEKLVKLFEIMTQTISK 200 >gi|15822539|gb|AAG23712.1| calcium-activated chloride channel CLCA4 [Mus musculus] gi|148680071|gb|EDL12018.1| mCG141954, isoform CRA_a [Mus musculus] gi|162317876|gb|AAI56643.1| Chloride channel calcium activated 4 [synthetic construct] Length = 909 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 18/112 (16%), Positives = 47/112 (41%), Gaps = 21/112 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T+ + + + + + ++ +TDGE++ S+ Sbjct: 384 PNGGTSICNGLKKGFEAITSS--------DQSTSGSEIVLLTDGEDNRISSCF------- 428 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSG-QFFAVNDSRELLESF 380 + ++++G I+++A+ + L +D + G +F+A D L+++F Sbjct: 429 --QEVKHSGAIIHTIALGPSAARE--LETLSDMTGGLRFYAKEDVNGLIDAF 476 >gi|289425109|ref|ZP_06426886.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] gi|289154087|gb|EFD02775.1| von Willebrand factor type A domain protein [Propionibacterium acnes SK187] Length = 322 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 17/152 (11%), Positives = 50/152 (32%), Gaps = 25/152 (16%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 S + V ++ + + T A+ + + + + ++ ++DG+ Sbjct: 152 STDRPTVLRAVDGIELQDGTALGGAIDKSLEAVK----MAPGGSKNPAPA-AIVMLSDGD 206 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKCTD 362 N+ + N + + +Y++A LL D Sbjct: 207 NTQGGSPLVAANRAAAAK------VPVYTIAFGTETGYVDLNGQRERVAPDTKLLSTVAD 260 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + + + + +L E + ++ + + VR Sbjct: 261 RTHAKSWTADSADKLQEVYQQVHSSVGYEPVR 292 >gi|159039503|ref|YP_001538756.1| von Willebrand factor type A [Salinispora arenicola CNS-205] gi|157918338|gb|ABV99765.1| von Willebrand factor type A [Salinispora arenicola CNS-205] Length = 583 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 66/189 (34%), Gaps = 14/189 (7%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + A A + V +++A ++ + + + + PLSN Sbjct: 399 SVAGAGGASRQQVTLDAARRGLSLFDDS-WQIGLWEFSTNLGSGRDYRRLVEIGPLSNQR 457 Query: 260 NEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + ++ L ++ P +T + + AY + E + ++ TDG+N Sbjct: 458 SRLEQALTQIQPTRGDTGLFDTVLAAYEAVQEEWDPGQV--------NSIVLFTDGKNDD 509 Query: 319 ASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + L E +++ +++ + + A +L + G F D ++ Sbjct: 510 DNGI-SQQQLLAELERIKDAERPVQVVLIGIGADVSKAELESITKVTGGGSFVTEDPTKI 568 Query: 377 LESF-DKIT 384 + F I Sbjct: 569 GDIFLKAIA 577 >gi|218693385|gb|ACL01196.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 100/383 (26%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPHNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|261253067|ref|ZP_05945640.1| hypothetical protein VIA_003092 [Vibrio orientalis CIP 102891] gi|260936458|gb|EEX92447.1| hypothetical protein VIA_003092 [Vibrio orientalis CIP 102891] Length = 424 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 18/155 (11%), Positives = 43/155 (27%), Gaps = 5/155 (3%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIV-----SDRTIKDPTTKKD 58 + + +ID++H + + ++Q+A+D L+G D + K Sbjct: 32 MSMVAFITIAALSIDVSHFVVNKTRLQNAVDTIALAGATVANRTNEKGDTDTAIIESYKK 91 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + +I+ + + + + + N Sbjct: 92 VIESPGNDEIELTATDDGNGLNLLSIEYSDSPNSGFSTTFPSSPDMVYVRVEVSEIGLNE 151 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 F L T + G + + + I M Sbjct: 152 FFLDLFGIEKTVSASAVAGPVFVDGTSNILPIGMC 186 >gi|218693409|gb|ACL01208.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 100/383 (26%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|218693353|gb|ACL01180.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 100/383 (26%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|195941904|ref|ZP_03087286.1| hypothetical protein Bbur8_03396 [Borrelia burgdorferi 80a] gi|312149118|gb|ADQ29189.1| von Willebrand factor type A domain protein [Borrelia burgdorferi N40] Length = 333 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 61/172 (35%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + +L+ ++ + + A L S LK+ + Sbjct: 146 VVPITTDREFFNKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEALKRSI 195 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------------------- 349 + +TDG + Y++ Q+ + +KIYS+ + + Sbjct: 196 VVLTDGVVNSDEIYKD-----QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGS 250 Query: 350 --PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + ++ + G F++VND + + K E+ ++IA + Sbjct: 251 FKEVYDPSMLVEISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAVD 302 >gi|218249312|ref|YP_002374701.1| von Willebrand factor type A domain protein [Borrelia burgdorferi ZS7] gi|223889245|ref|ZP_03623833.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 64b] gi|226320920|ref|ZP_03796471.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 29805] gi|226321491|ref|ZP_03797017.1| von Willebrand factor type A domain protein [Borrelia burgdorferi Bol26] gi|218164500|gb|ACK74561.1| von Willebrand factor type A domain protein [Borrelia burgdorferi ZS7] gi|223885278|gb|EEF56380.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 64b] gi|226232680|gb|EEH31433.1| von Willebrand factor type A domain protein [Borrelia burgdorferi Bol26] gi|226233692|gb|EEH32422.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 29805] gi|312147800|gb|ADQ30459.1| von Willebrand factor type A domain protein [Borrelia burgdorferi JD1] Length = 333 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 61/172 (35%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + +L+ ++ + + A L S LK+ + Sbjct: 146 VVPITTDREFFNKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEALKRSI 195 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------------------- 349 + +TDG + Y++ Q+ + +KIYS+ + + Sbjct: 196 VVLTDGVVNSDEIYKD-----QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGS 250 Query: 350 --PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + ++ + G F++VND + + K E+ ++IA + Sbjct: 251 FKEVYDPSMLVEISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAVD 302 >gi|216264497|ref|ZP_03436489.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 156a] gi|221217546|ref|ZP_03589016.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 72a] gi|224532807|ref|ZP_03673422.1| von Willebrand factor type A domain protein [Borrelia burgdorferi WI91-23] gi|224534090|ref|ZP_03674673.1| von Willebrand factor type A domain protein [Borrelia burgdorferi CA-11.2a] gi|225548563|ref|ZP_03769611.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 94a] gi|225549807|ref|ZP_03770771.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 118a] gi|215980970|gb|EEC21777.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 156a] gi|221192609|gb|EEE18826.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 72a] gi|224512196|gb|EEF82582.1| von Willebrand factor type A domain protein [Borrelia burgdorferi WI91-23] gi|224512789|gb|EEF83157.1| von Willebrand factor type A domain protein [Borrelia burgdorferi CA-11.2a] gi|225369615|gb|EEG99064.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 118a] gi|225370826|gb|EEH00261.1| von Willebrand factor type A domain protein [Borrelia burgdorferi 94a] Length = 333 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 61/172 (35%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + +L+ ++ + + A L S LK+ + Sbjct: 146 VVPITTDREFFNKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEALKRSI 195 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------------------- 349 + +TDG + Y++ Q+ + +KIYS+ + + Sbjct: 196 VVLTDGVVNSDEIYKD-----QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGS 250 Query: 350 --PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + ++ + G F++VND + + K E+ ++IA + Sbjct: 251 FKEVYDPSMLVEISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAVD 302 >gi|47207969|emb|CAF93030.1| unnamed protein product [Tetraodon nigroviridis] Length = 586 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 46/152 (30%), Gaps = 22/152 (14%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNT 298 + Y N +E + L +++ T T A+ + + L+ Sbjct: 449 AVQYTYDQRLEFGFLDQPNKDEALAALRRISYMSGGTATGEAISFSTQNLFRRTAPGR-- 506 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 F++ +TDG++ + G+ ++SV V+ P LR Sbjct: 507 -------NFLVVVTDGQSYDDVRRP--------AMEAQAQGISVFSVGVAWAPLDD--LR 549 Query: 359 KCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 + F + L + + I Sbjct: 550 AMSSEPKDSHTFFSREFSGLSDLVPVLVRGIC 581 >gi|218693397|gb|ACL01202.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 100/383 (26%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|218693287|gb|ACL01147.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 100/383 (26%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|260799304|ref|XP_002594637.1| hypothetical protein BRAFLDRAFT_217575 [Branchiostoma floridae] gi|229279872|gb|EEN50648.1| hypothetical protein BRAFLDRAFT_217575 [Branchiostoma floridae] Length = 181 Score = 49.5 bits (116), Expect = 9e-04, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 62/191 (32%), Gaps = 26/191 (13%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + + D N++N Q ++G I Y+ + Sbjct: 7 VDGSTSVGPNEFDKSKAFLRNVINQFQIGPDA-----TKVGVIQYSSTVREEFSLNRYLT 61 Query: 259 LNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 +V ++++ T T A+ + ++ + K I ITDG Sbjct: 62 KQDVMRAIDRIPFLGGFTRTGEAITFMKQ----------HSQFRGNVPKIAIVITDGRAQ 111 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 +Y + + AG+ +Y+++V + L+ + V+D L Sbjct: 112 DDVSYPS--------QMAHGAGVIMYAISVG--SVNMNELKLIASAERYITTVSDFDALS 161 Query: 378 ESFDKITDKIQ 388 +T++I Sbjct: 162 TLTLSLTEQIC 172 >gi|321460553|gb|EFX71594.1| hypothetical protein DAPPUDRAFT_326977 [Daphnia pulex] Length = 1000 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 20/110 (18%), Positives = 41/110 (37%), Gaps = 17/110 (15%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T + A + L G+ + ++ +TDG NS L+ + E Sbjct: 402 TCIGCGLDLAVQMLQE--------KGNNKTGGIIVLVTDGRNSAGY-----LDISDVEED 448 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFD 381 + AG+++ +VA + + R + G+ + + D L +F Sbjct: 449 IVKAGIRVVTVAFGSEADSNIE-RLADVTGGKSYYIKDGDSSEALQRAFT 497 >gi|218693373|gb|ACL01190.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 100/383 (26%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|156399648|ref|XP_001638613.1| predicted protein [Nematostella vectensis] gi|156225735|gb|EDO46550.1| predicted protein [Nematostella vectensis] Length = 1841 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 29/241 (12%), Positives = 79/241 (32%), Gaps = 18/241 (7%) Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 V S D+ + + T+ + + + N A + D+ Sbjct: 576 VTPSSAIQMDMPIAEEVVAPPKTTEEISADDKQQDECFCPNKVCDANADLAFVIDGSSDI 635 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 ++++ ++ + ++ +AY N +E+ ++ ++ Sbjct: 636 PQMYFSHVLDYLKALARGFNASQTQVSLMAYGDDATAAFNFGSVNTTSEIDDGIDAVSYM 695 Query: 273 ENT-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T A++ A + L+ + ++ +T G + + + + Sbjct: 696 GGEPKTGNALYKAKKGLFEVSSR-------PGVSHALVLLTSG------SASDEIGPPAL 742 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQ 390 +RNAG+K+ ++ + ++L+ ++ S F L E D+I + Sbjct: 743 --ELRNAGVKVTAIGLGKLANVKELIEIASEPKSSHVFTAF-LDTLPERMDEIVTGVCSD 799 Query: 391 S 391 Sbjct: 800 V 800 >gi|22536818|ref|NP_687669.1| cell wall surface anchor family protein [Streptococcus agalactiae 2603V/R] gi|25010692|ref|NP_735087.1| hypothetical protein gbs0632 [Streptococcus agalactiae NEM316] gi|76788219|ref|YP_329407.1| Cna B domain-containing protein [Streptococcus agalactiae A909] gi|77411235|ref|ZP_00787585.1| cell wall surface anchor family protein [Streptococcus agalactiae CJB111] gi|22533664|gb|AAM99541.1|AE014220_13 cell wall surface anchor family protein, putative [Streptococcus agalactiae 2603V/R] gi|23095046|emb|CAD46276.1| Unknown [Streptococcus agalactiae NEM316] gi|76563276|gb|ABA45860.1| cna B-type domain protein [Streptococcus agalactiae A909] gi|77162661|gb|EAO73622.1| cell wall surface anchor family protein [Streptococcus agalactiae CJB111] gi|218693223|gb|ACL01115.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693225|gb|ACL01116.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693227|gb|ACL01117.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693229|gb|ACL01118.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693231|gb|ACL01119.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693233|gb|ACL01120.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693237|gb|ACL01122.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693239|gb|ACL01123.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693243|gb|ACL01125.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693245|gb|ACL01126.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693249|gb|ACL01128.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693251|gb|ACL01129.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693253|gb|ACL01130.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693255|gb|ACL01131.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693257|gb|ACL01132.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693259|gb|ACL01133.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693261|gb|ACL01134.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693263|gb|ACL01135.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693265|gb|ACL01136.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693267|gb|ACL01137.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693269|gb|ACL01138.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693271|gb|ACL01139.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693273|gb|ACL01140.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693275|gb|ACL01141.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693277|gb|ACL01142.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693279|gb|ACL01143.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693281|gb|ACL01144.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693283|gb|ACL01145.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693285|gb|ACL01146.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693289|gb|ACL01148.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693291|gb|ACL01149.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693293|gb|ACL01150.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693295|gb|ACL01151.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693297|gb|ACL01152.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693299|gb|ACL01153.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693301|gb|ACL01154.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693303|gb|ACL01155.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693305|gb|ACL01156.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693307|gb|ACL01157.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693309|gb|ACL01158.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693311|gb|ACL01159.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693313|gb|ACL01160.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693315|gb|ACL01161.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693317|gb|ACL01162.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693319|gb|ACL01163.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693321|gb|ACL01164.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693323|gb|ACL01165.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693325|gb|ACL01166.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693327|gb|ACL01167.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693331|gb|ACL01169.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693333|gb|ACL01170.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693337|gb|ACL01172.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693339|gb|ACL01173.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693341|gb|ACL01174.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693345|gb|ACL01176.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693347|gb|ACL01177.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693349|gb|ACL01178.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693355|gb|ACL01181.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693359|gb|ACL01183.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693369|gb|ACL01188.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693371|gb|ACL01189.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693377|gb|ACL01192.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693379|gb|ACL01193.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693381|gb|ACL01194.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693383|gb|ACL01195.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693387|gb|ACL01197.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693389|gb|ACL01198.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693391|gb|ACL01199.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693393|gb|ACL01200.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693395|gb|ACL01201.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693399|gb|ACL01203.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693401|gb|ACL01204.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693403|gb|ACL01205.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693405|gb|ACL01206.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693407|gb|ACL01207.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693411|gb|ACL01209.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693415|gb|ACL01211.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693419|gb|ACL01213.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693421|gb|ACL01214.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693433|gb|ACL01220.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693435|gb|ACL01221.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693439|gb|ACL01223.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693447|gb|ACL01227.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693453|gb|ACL01230.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693455|gb|ACL01231.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693457|gb|ACL01232.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693459|gb|ACL01233.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693461|gb|ACL01234.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693463|gb|ACL01235.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693465|gb|ACL01236.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693467|gb|ACL01237.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693471|gb|ACL01239.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693473|gb|ACL01240.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693475|gb|ACL01241.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 100/383 (26%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|320164207|gb|EFW41106.1| conserved hypothetical protein [Capsaspora owczarzaki ATCC 30864] Length = 1176 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/192 (13%), Positives = 62/192 (32%), Gaps = 11/192 (5%) Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 + T +K A I ++ + + I++ K + IG + Sbjct: 23 FGNVPTTNKVEICFAFDTTGI---RQNLEATITRLFAEIKDIKISIMGIGDYCDDQSTYA 79 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + LS N +E+ + ++ + A + + S K ++ Sbjct: 80 VRYMDLSRNKDELCAFAKSVSATGGGDADECYEWALYK-------AQLLSWSEDTAKSLV 132 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFF 368 I D ++ L+ E + + G+K+Y V + + ++ + GQ+ Sbjct: 133 IIGDCCPHRPEYTTLGVDWLEEAELLASRGIKVYGVRALHNTTAEAFYKPLSEMTGGQYI 192 Query: 369 AVNDSRELLESF 380 + + F Sbjct: 193 QFTSFHLITDMF 204 >gi|291230030|ref|XP_002734973.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 454 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 20/185 (10%), Positives = 59/185 (31%), Gaps = 4/185 (2%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNS-IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + I+ E+ N+V+ + K + + + V + + Sbjct: 29 VIDCTSSMGSWINEAKENIHNIVDEIVAKEMSDIRLALVEYRDHPPQDSTFVTRVLDFTP 88 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAY-RELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +L +++ +++ + + + A+ + + D + Sbjct: 89 SLKDMQKQMDSMAAHGGGDGPEAVADGLHETFNLNWRPLATKVCVLIADAPPHGLGDSGD 148 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSR 374 + ++I M + +YSV + EG + + GQ+ ++ D++ Sbjct: 149 GFPKGCPAGHDPMKIARQMAEQNITLYSVVCGSYAEGFKDFFMAIAHVTGGQYVSLKDAK 208 Query: 375 ELLES 379 L + Sbjct: 209 LLSKV 213 >gi|281348290|gb|EFB23874.1| hypothetical protein PANDA_022043 [Ailuropoda melanoleuca] Length = 426 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 18/113 (15%), Positives = 41/113 (36%), Gaps = 21/113 (18%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T+ + ++ + + + +I +TDGE+ S+ Sbjct: 53 EASGGTSICSGLRAGFQAIIHS--------NQSTSGSEIILLTDGEDDQISSCF------ 98 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQ-DLLRKCTDSSGQFFAVN-DSRELLESF 380 E ++ +G I+++A+ + + L + G F N D L ++F Sbjct: 99 ---EEVKQSGAVIHTIALGPSAARELETLSNM--TGGYRFYANKDINGLTDAF 146 >gi|225559690|gb|EEH07972.1| U-box domain-containing protein [Ajellomyces capsulatus G186AR] Length = 759 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 36/206 (17%), Positives = 68/206 (33%), Gaps = 15/206 (7%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE---SAGNLVNSIQKAIQEKKNLSVRI 238 P P + + S + AP P + E S +L + I E N + R+ Sbjct: 70 PHVPCDIVLCIDVSYSMQSSAPLPTTDESGEREETGLSVLDLTKHAARTIIETLNENDRL 129 Query: 239 GTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 G +A++ N N + L P +TN + + + NE Sbjct: 130 GIVAFSTEAEVVYKISKMNESNKKAALKAVEALKPLSSTNLWHGLKLGLKAFENE----- 184 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK-IYSVAVSAPPEGQD 355 + + + + +TDG + Q + L+ + M I++ Sbjct: 185 --RHTLQSVQALYVLTDGMPNHMCPKQGYVTKLRPILQLLGHRMPMIHTFGFGYNIRS-G 241 Query: 356 LLRKCTD-SSGQFFAVNDSRELLESF 380 LL+ + G F + D+ + F Sbjct: 242 LLQAIAEVGGGTFAFIPDAGMIGTVF 267 >gi|4009460|gb|AAC95429.1| calcium-dependent chloride channel-1 [Homo sapiens] Length = 914 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 58/151 (38%), Gaps = 26/151 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 ++ + + RL T+ + A+ + + + ++ +T Sbjct: 362 INSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTD---------GSEIVLLT 411 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DGE++ S N + + +G I++VA+ P Q+L + G +D Sbjct: 412 DGEDNTISGCFNEV---------KQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASD 461 Query: 373 ---SRELLESFDKITD---KIQEQSVRIAPN 397 + L+++F ++ + ++S+++ Sbjct: 462 QVQNNGLIDAFGALSSGNGAVSQRSIQLESK 492 >gi|326922323|ref|XP_003207399.1| PREDICTED: collagen alpha-2(VI) chain-like [Meleagris gallopavo] Length = 1022 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 32/228 (14%), Positives = 79/228 (34%), Gaps = 26/228 (11%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 ++ P ++ T A P +D + + + ++ + + + Sbjct: 34 VTSCTEKTDCPISVYFV-IDTSESIALQTVPIQSLVDQIKQFIPRFIEKLENEVYQ-NQV 91 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKE 293 S+ + V +PL+ + + ++L +N T T A+ + ++ ++ Sbjct: 92 SITWMFGGLHYSDVVEIYSPLTRSKDTYLTKLRAINYLGRGTFTDCAISNMTQQFQSQTA 151 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 KF + ITDG +G+ E R+ G+K+++VA + Sbjct: 152 RD---------VKFAVVITDGHVTGSPCGG----MKMQAERARDMGIKLFAVA-PSEDVY 197 Query: 354 QDLLRKCTDSS-GQF---FAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ + + + L I + E+ ++ + Sbjct: 198 EQGLREIASPPHDLYRSNYTITPKDALH-----IDENTIERIIKAMKH 240 Score = 44.1 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 52/154 (33%), Gaps = 19/154 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS--- 256 + + + N+V+ + ++ K+ + R+G + Y+ Sbjct: 622 SSESIGYTNFTLEKNFVVNVVSRLGSIAKDPKSETGARVGVVQYSHEGTFEAIKLDDERI 681 Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N+L+ K + +L T T A+ AY +L E + F + ITDG Sbjct: 682 NSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREKA-------QVFAVVITDGR 734 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + N +C + + ++ + Sbjct: 735 ---YDPRDDDKNLGALC----GRDVLVNTIGIGD 761 >gi|123975106|ref|XP_001330196.1| Ubiquitin-conjugating enzyme family protein [Trichomonas vaginalis G3] gi|121896184|gb|EAY01343.1| Ubiquitin-conjugating enzyme family protein [Trichomonas vaginalis G3] Length = 963 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 38/320 (11%), Positives = 85/320 (26%), Gaps = 25/320 (7%) Query: 67 QIKKHLKQGSYIRENAGDIAQKAQINITKD-----KNNPLQYIAESKAQYEIPTENLFLK 121 + I D ++ I + N Q + + N Sbjct: 328 YFFLNFSDYESISTYNSDFSRCFATTIKRSLPEVKDLNSSQSQIFREFKIIEAISNPNPM 387 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + + LSL + + S ++ +D + + + + Sbjct: 388 IIRGNNTFLLSLGTCYKQQDSKSIVSQKQIQYIDPKEGEIKVEVIDTMAEALRSKGVVID 447 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P + S + +++ + +VN + Sbjct: 448 TPSETQQINIICIDTSGSMGGTS-----LEIAKQCFKIIVNRAYEVGP-------LSLWG 495 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 Y PLS NE ++ + + T Y + A E+ + + Sbjct: 496 LYIFDTTPKIILPLSPIPNEFHRAVDTIRAWGCTALYRCIELAQDEINAKVRQKPEFKNA 555 Query: 302 TRLKKFVIFITDGENSGASAYQNT--LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 K +I +TDG ++ N I + G+ + + PE + Sbjct: 556 I---KRIIALTDGGDNEYHYKTNNHMQELADIANGLVRDGIVFDYIELGDDPEYPP--QN 610 Query: 360 CT-DSSGQFFAVNDSRELLE 378 +S G + +D + + Sbjct: 611 IAINSGGCYLQFSDRDFIKK 630 >gi|284997525|ref|YP_003419292.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] gi|284445420|gb|ADB86922.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] Length = 380 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 56/149 (37%), Gaps = 33/149 (22%) Query: 253 TPLSNNLNEVKSRLNKLN---------PYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 S+N+N +K ++ L+ T Y A+ A + + Sbjct: 78 ITFSSNVNVIKEFVDPLDLTNEILQIAAGGQTALYTAILTANSLAKKYQMPT-------- 129 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD- 362 +++ +TDG + + N L M++YS + + LL+ +D Sbjct: 130 ---YLLLLTDGNPTDETNVGNYLKLPYF------EKMQVYSFGIG-DDYNEQLLQNISDK 179 Query: 363 SSGQFFAVNDSREL-----LESFDKITDK 386 +SG + ++D+ E+ ++ +I K Sbjct: 180 TSGVMYHISDANEIPQKLPQKAVTQIAAK 208 >gi|262201266|ref|YP_003272474.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] gi|262084613|gb|ACY20581.1| von Willebrand factor type A [Gordonia bronchialis DSM 43247] Length = 594 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 17/121 (14%), Positives = 44/121 (36%), Gaps = 10/121 (8%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T Y A++++ + + VI +TDG+N + T Sbjct: 478 RLGGGTGLYDTTLAAFKKVQSTYDP--------NYSNSVIIMTDGQNEDPGSITLTELLA 529 Query: 330 QICEYMR-NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF-DKITDKI 387 ++ E + I ++ +S + L + + G + + ++ + F + I ++ Sbjct: 530 ELKELEDPARPVLILTIGISEDADTNALRQIAQATGGTTYVAKTAADIKQVFTNAIAARV 589 Query: 388 Q 388 + Sbjct: 590 E 590 >gi|186686512|ref|YP_001869708.1| vault protein inter-alpha-trypsin subunit [Nostoc punctiforme PCC 73102] gi|186468964|gb|ACC84765.1| Vault protein inter-alpha-trypsin domain protein [Nostoc punctiforme PCC 73102] Length = 818 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 22/169 (13%), Positives = 45/169 (26%), Gaps = 25/169 (14%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE---VKSRLNKLNPYENTNTYPAMHHAYR 286 N I ++ PL N + +N+LN T + Sbjct: 327 NGLNPDDTFNIIDFSDTTQQLSPVPLPNTSANRLLAINYINRLNAGGGTEMLGGIRTVLN 386 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA---GMKIY 343 + + ++ +TDG + QI ++ G +++ Sbjct: 387 L----------KATNPGRLRNIVLLTDGYIGNEN---------QILAEVKQRLQPGTRLH 427 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 S + L R G + + E +K +I + Sbjct: 428 SFGAGSSVNRFLLNRIAELGRGIARIIRHDEPVDEVVEKFFRQINNPVL 476 >gi|218693367|gb|ACL01187.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693469|gb|ACL01238.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 100/383 (26%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|170591769|ref|XP_001900642.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158591794|gb|EDP30397.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 381 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/126 (20%), Positives = 54/126 (42%), Gaps = 13/126 (10%) Query: 258 NLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N V LN L T+T+ A+H AY+ L + + +KK +I TDG + Sbjct: 253 NNTSVIRHLNGLKSIKGTTSTHIALHQAYKLLTDTDNENGVR---EGVKKMIIIFTDGHS 309 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAV-SAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + + +++ G++I+++ + AP + L T ++ F + + Sbjct: 310 QRS--------PQDMALRLKDKGVEIFAITLTPAPYADEGELLSITQNTDHIFTPVNLKV 361 Query: 376 LLESFD 381 L+ ++ Sbjct: 362 LITTYK 367 >gi|94989622|ref|YP_597722.1| collagen adhesion protein [Streptococcus pyogenes MGAS10270] gi|94543130|gb|ABF33178.1| Collagen adhesion protein [Streptococcus pyogenes MGAS10270] gi|198417325|gb|ACH87880.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417327|gb|ACH87881.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417329|gb|ACH87882.1| ancillary protein 1 [Streptococcus pyogenes] gi|198417331|gb|ACH87883.1| ancillary protein 1 [Streptococcus pyogenes] Length = 890 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 38/381 (9%), Positives = 92/381 (24%), Gaps = 33/381 (8%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + +K I K + Sbjct: 217 ELNQPLDV-VLLLDNSNSMNNERAHNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADRNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + + + + QK N S Sbjct: 336 PKEAEHINGNRTLYQFGATFTQKALMKANEILETQSSNDRKKVIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y K + + + I V+ ++ + P Sbjct: 396 NPYISTSYQNQFKSFLNKTPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVPGG 455 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + N+L+ N + Y + + + T GE Sbjct: 456 TTQAAYQVPQNQLSVMSNEGYAINRGYIYLYWRDYNWVYPFDPKTKTVSATKQIKTHGEP 515 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQFFA 369 + N ++ G I++V + + ++ ++ + + + Sbjct: 516 TTLYFNGN----------IKPKGYDIFTVGIGVNGDPGATPLEAKEFMQSISSKTENYTN 565 Query: 370 VNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 566 VDDTNKIYDELNKYFKTIVEE 586 >gi|77407764|ref|ZP_00784518.1| cell wall surface anchor family protein [Streptococcus agalactiae COH1] gi|77173630|gb|EAO76745.1| cell wall surface anchor family protein [Streptococcus agalactiae COH1] gi|218693241|gb|ACL01124.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693247|gb|ACL01127.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693329|gb|ACL01168.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693351|gb|ACL01179.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693361|gb|ACL01184.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693365|gb|ACL01186.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693375|gb|ACL01191.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693413|gb|ACL01210.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693417|gb|ACL01212.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693437|gb|ACL01222.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693441|gb|ACL01224.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693443|gb|ACL01225.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693445|gb|ACL01226.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693449|gb|ACL01228.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693451|gb|ACL01229.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 100/383 (26%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|327265809|ref|XP_003217700.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H3-like [Anolis carolinensis] Length = 885 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 19/144 (13%), Positives = 44/144 (30%), Gaps = 11/144 (7%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N+ E K + + TN + L N + R I ++DGE + Sbjct: 342 NVQEAKYFIGNITESGLTNFNGGLMAGIEMLNNAHKLKIVP---ERSASLTIMLSDGEAN 398 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF----FAVNDS 373 + + + +YS+ L + + + + +D+ Sbjct: 399 VGE--TDQFRIQENAKNASQGKYPLYSLGFGYN-LDYGFLERLSKVNNGVARRIYDDSDA 455 Query: 374 R-ELLESFDKITDKIQEQSVRIAP 396 +L +D++ + + P Sbjct: 456 ALQLQGFYDEVANPLLTDVALEYP 479 >gi|219684622|ref|ZP_03539565.1| von Willebrand factor type A domain protein [Borrelia garinii PBr] gi|219685812|ref|ZP_03540621.1| von Willebrand factor type A domain protein [Borrelia garinii Far04] gi|219671984|gb|EED29038.1| von Willebrand factor type A domain protein [Borrelia garinii PBr] gi|219672645|gb|EED29675.1| von Willebrand factor type A domain protein [Borrelia garinii Far04] Length = 333 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 61/172 (35%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + + +L+ ++ + + A L S LK+ + Sbjct: 146 VVPITTDRDFFNKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEALKRSI 195 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP----------------- 351 + +TDG + Y++ Q+ + +KIYS+ + + Sbjct: 196 VVLTDGVVNSDEIYKD-----QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGS 250 Query: 352 ----EGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + + + G F++VND + + K E+ ++IA + Sbjct: 251 LKEIYDPSMLVEISHKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAVD 302 >gi|227830108|ref|YP_002831887.1| von Willebrand factor A [Sulfolobus islandicus L.S.2.15] gi|229578921|ref|YP_002837319.1| von Willebrand factor A [Sulfolobus islandicus Y.G.57.14] gi|229582327|ref|YP_002840726.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] gi|227456555|gb|ACP35242.1| von Willebrand factor type A [Sulfolobus islandicus L.S.2.15] gi|228009635|gb|ACP45397.1| von Willebrand factor type A [Sulfolobus islandicus Y.G.57.14] gi|228013043|gb|ACP48804.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] Length = 380 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 56/149 (37%), Gaps = 33/149 (22%) Query: 253 TPLSNNLNEVKSRLNKLN---------PYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 S+N+N +K ++ L+ T Y A+ A + + Sbjct: 78 ITFSSNVNVIKEFVDPLDLTNEILQIAAGGQTALYTAILTANSLAKKYQMPT-------- 129 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD- 362 +++ +TDG + + N L M++YS + + LL+ +D Sbjct: 130 ---YLLLLTDGNPTDETNVGNYLKLPYF------EKMQVYSFGIG-DDYNEQLLQNISDK 179 Query: 363 SSGQFFAVNDSREL-----LESFDKITDK 386 +SG + ++D+ E+ ++ +I K Sbjct: 180 TSGVMYHISDANEIPQKLPQKAVTQIAAK 208 >gi|119470035|ref|ZP_01612840.1| hypothetical protein ATW7_05334 [Alteromonadales bacterium TW-7] gi|119446745|gb|EAW28018.1| hypothetical protein ATW7_05334 [Alteromonadales bacterium TW-7] Length = 1090 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 42/364 (11%), Positives = 96/364 (26%), Gaps = 75/364 (20%) Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 ++ T + Y+ + E P + + ++ + + + Sbjct: 17 VSATASAEDIELYVNHNVETDENPRVLMIFDTSGSMDWSVINGDNQVCYIKKITQSNGNG 76 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 + + + + + + A A +R+I Sbjct: 77 NGNGNGNGNGNGNGNGGGGQVYYENVTCFASKDEYAEFNEQCYIGVNGVAVADC-HDRRI 135 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 DV + LVN N + G + + + G L + + S +N L Sbjct: 136 DVAKNAMTQLVND---------NSDIDFGLMRFRSNVGGYVVAKLGTDKTSLISDINSLP 186 Query: 271 PYENTNTYPAMHHAYRELYNEKE-------------------SSHNTIGSTRLKK----- 306 +T + AYR + + +S + + Sbjct: 187 ASGSTPMTETLWEAYRYITGQSLDYAFNVSDRDKSADNSVVYTSPFKPNNGDPLRCDNSI 246 Query: 307 FVIFITDGENSG----------------------ASAYQNTLNTLQICEYMR---NAGM- 340 VI +TDG+ + + + + + + + + Sbjct: 247 NVILMTDGDPTNDDGRDTSIAQTHNSYFNDDIPFDDGTYDDSYLVAMAKILHGTSDTKVD 306 Query: 341 ------------KIYSVAVSAP--PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 ++Y++ G DLL K GQ+ + L E+ Sbjct: 307 LYTPSTDVLDTGRLYTIGFGTGMSQGGIDLLDKAARVGGGQYIEARTAEALSEALKNTIS 366 Query: 386 KIQE 389 +I+E Sbjct: 367 RIRE 370 >gi|218693423|gb|ACL01215.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693429|gb|ACL01218.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693431|gb|ACL01219.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 100/383 (26%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|218693357|gb|ACL01182.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693425|gb|ACL01216.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] gi|218693427|gb|ACL01217.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 100/383 (26%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEAVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|156370955|ref|XP_001628532.1| predicted protein [Nematostella vectensis] gi|156215511|gb|EDO36469.1| predicted protein [Nematostella vectensis] Length = 737 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 22/193 (11%), Positives = 67/193 (34%), Gaps = 16/193 (8%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + +L++ + + + + ++ + + Sbjct: 95 ASYSITTTGWEGIATATKDLIDKL-----DISPAGTHVSLMKFSTDVETFYMFDNQATKD 149 Query: 261 EVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +KS ++K+ + T A++ A ++ + K ++ TDG+ G Sbjct: 150 RLKSLIDKMEYDGEWSRTDIALNAAKNHIFVPSTGARG-----DAPKAIVLFTDGKTDGP 204 Query: 320 SAYQNTLNTLQIC----EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + ++ + + +R+A + I+ VA + + F +D Sbjct: 205 TPIDRNIDWVASLIKPLQDLRDANVTIFCVASG-NHPDYSQINWLAGDRARVFTTSDMDA 263 Query: 376 LLESFDKITDKIQ 388 L+ S + + K+ Sbjct: 264 LVGSLGRASKKVC 276 >gi|110833328|ref|YP_692187.1| pilin biogenesis-like protein [Alcanivorax borkumensis SK2] gi|110646439|emb|CAL15915.1| pilin biogenesis related protein [Alcanivorax borkumensis SK2] Length = 1009 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 22/116 (18%), Positives = 46/116 (39%), Gaps = 7/116 (6%) Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM------RNAG 339 + + N S + ++ TDGE + + ++ C+ R+ Sbjct: 270 KIVDNGNYVSPRNTANECESNHIVLFTDGEANDVNLPCGGGSSYD-CQRAISDYLDRDFE 328 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 +K Y+V + D+ +D + + +D+ L +F I D I+++S IA Sbjct: 329 IKTYNVGLHMEDNRADMETVSSDGADGTYTASDAESLASAFLDIFDLIEKESRSIA 384 >gi|46445753|ref|YP_007118.1| putative batA protein [Candidatus Protochlamydia amoebophila UWE25] gi|46399394|emb|CAF22843.1| putative batA protein [Candidatus Protochlamydia amoebophila UWE25] Length = 362 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 27/169 (15%), Positives = 59/169 (34%), Gaps = 22/169 (13%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPY-----ENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 +PL+ + + ++LN L + T A++ + K + + Sbjct: 155 PKILSPLTLDHELLINQLNDLKAINSMEEDGTAMGYAIYKTAHLIVATKHFAQELQKKGK 214 Query: 304 LKK-----FVIFITDGENSGASAYQ----NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 ++ +TDG T+ + Y +NAG+ +Y + V Sbjct: 215 PAYEIKNAIMVVLTDGFQDPNRLDYGNRLRTIELDEAIAYAKNAGIHLYIINVDPKFSSP 274 Query: 355 DL------LRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + + GQ + N ++L FD I D++++ S+ + Sbjct: 275 QFAPHRRQIETLAESTGGQLYLANQEKDLKRVFDTI-DRLEKSSLPLDS 322 >gi|241191500|ref|YP_002968894.1| hypothetical protein Balac_1485 [Bifidobacterium animalis subsp. lactis Bl-04] gi|241196905|ref|YP_002970460.1| hypothetical protein Balat_1485 [Bifidobacterium animalis subsp. lactis DSM 10140] gi|240249892|gb|ACS46832.1| hypothetical sortase [Bifidobacterium animalis subsp. lactis Bl-04] gi|240251459|gb|ACS48398.1| hypothetical sortase [Bifidobacterium animalis subsp. lactis DSM 10140] gi|295794492|gb|ADG34027.1| hypothetical sortase [Bifidobacterium animalis subsp. lactis V9] Length = 671 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 24/202 (11%), Positives = 61/202 (30%), Gaps = 32/202 (15%) Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 D + L+ K + N + YN + +L + ++ +N L Sbjct: 24 DPGKKVQVALIKYAGKNSDKIGNDTYNEDGYNYNYSQTVHSLAWTPEDLQKEQAAVNSLK 83 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS--GASAYQNTLNT 328 T + HA ++L + + + +K +F +DG + + N Sbjct: 84 AGGATRADFGLQHAVKQLNSGRPGA---------QKLTVFYSDGSPTSSDGFEAKIANNA 134 Query: 329 LQICEYMRNAGMKIYSVAVSAP------PEGQDLLRKCTDS--------------SGQFF 368 ++ ++N ++ S+ + + + G ++ Sbjct: 135 IKAAAQLKNDHSQVISIGAMPGADPSGTDNANKFMNYVSSNYPKAQSMSEPHDRVEGTYY 194 Query: 369 AVNDS-RELLESFDKITDKIQE 389 + +L F +I + Sbjct: 195 YAVSARTDLQTIFKEIISIVTS 216 >gi|37676927|ref|NP_937323.1| hypothetical protein VVA1267 [Vibrio vulnificus YJ016] gi|37201471|dbj|BAC97293.1| uncharacterized protein [Vibrio vulnificus YJ016] Length = 688 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 22/121 (18%), Positives = 49/121 (40%), Gaps = 19/121 (15%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NN+ + +N++ TN Y A+ A L +++ + +I +TDG Sbjct: 367 NNVTQAIETINQIATGGGTNLYDALERAVSGLDSDRTTG------------IILVTDGVA 414 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + Q + M+ +++Y+ + LL T S+G ++++S + Sbjct: 415 NVGVT-----EKKQFLKLMQRYDVRLYTF-IMGNSANTPLLEPMTQVSNGFATSISNSDD 468 Query: 376 L 376 + Sbjct: 469 I 469 >gi|282854078|ref|ZP_06263415.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|282583531|gb|EFB88911.1| von Willebrand factor type A domain protein [Propionibacterium acnes J139] gi|314981158|gb|EFT25252.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA3] gi|315091981|gb|EFT63957.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL110PA4] Length = 322 Score = 49.5 bits (116), Expect = 0.001, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 50/154 (32%), Gaps = 25/154 (16%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 P S + V ++ + + T A+ + + + + ++ ++D Sbjct: 150 PPSTDRPTVLRAVDGIELQDGTALGGAIDKSLEAVK----MAPGGSKNPAPA-AIVMLSD 204 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKC 360 G N+ + N + + +Y++A LL Sbjct: 205 GNNTQGGSPLVAANRAAAAK------VSVYTIAFGTETGYVDLDGQRERVAPDTKLLSTV 258 Query: 361 TD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 D + + + + + +L E + ++ + + VR Sbjct: 259 ADRTHAKSWTADSADKLQEVYQQVHSSVGYEPVR 292 >gi|313902415|ref|ZP_07835818.1| hypothetical protein ThesuDRAFT_1385 [Thermaerobacter subterraneus DSM 13965] gi|313467346|gb|EFR62857.1| hypothetical protein ThesuDRAFT_1385 [Thermaerobacter subterraneus DSM 13965] Length = 170 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 8/47 (17%), Positives = 20/47 (42%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDR 48 +++ V + +D + ++ R Q+ D A L+G + + Sbjct: 25 FLLLLPVILAALGLVLDGSRLVLTRAHAQAVADFASLAGVQEVDEEA 71 >gi|224092976|ref|XP_002187320.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 1 [Taeniopygia guttata] Length = 1092 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 31/313 (9%), Positives = 94/313 (30%), Gaps = 25/313 (7%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK-AQYEIPTENLFLKGLIPSALT 129 + K E + +I + + + + A IPT+ ++ + L Sbjct: 126 NAKDDQNDPEKNETESGSQRIRPVFEDDPVFRRQTSYQHAAVHIPTDIYEGSTIVLNELN 185 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSR----SMEDLYLQKHNDNNNMTSNKYLLPPPP 185 + E+ + + + ++ K N + P Sbjct: 186 WTAALDDVFKRNREEDPTLLWQVFGSATGLARYYPASPWVDKSRTPNKIDLYDVRRRPWY 245 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYN 244 + S + + + + ++ S ++ ++ + + ++ Sbjct: 246 IQGAASPKDMLILVDASGSVSGLTLKLIRTSVIEMLETLSDDDFVNVVSFNNNAQNVSCF 305 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 +V N ++K + K++ T+ +A+ +L N S Sbjct: 306 NHLVQANVR----NKKKLKEAVYKISAKGITDYKKGFSYAFEQLLNHSVSRA------NC 355 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DS 363 K ++ TDG A + N + +++++ +V + ++ ++ Sbjct: 356 NKIIMLFTDGGEERAQEIFHKYNE--------DKKVRVFTFSVGQHNYDKGPIQWMACEN 407 Query: 364 SGQFFAVNDSREL 376 G ++ + + Sbjct: 408 KGYYYEIPSIGAI 420 >gi|218710404|ref|YP_002418025.1| hypothetical protein VS_2441 [Vibrio splendidus LGP32] gi|218323423|emb|CAV19600.1| hypothetical protein VS_2441 [Vibrio splendidus LGP32] Length = 422 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 16/171 (9%), Positives = 48/171 (28%), Gaps = 7/171 (4%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + V A+D+ H++ + ++Q+A+D+A L+ + + + + Sbjct: 19 LVTAALLVFLAVSALAVDVNHMLVNKTRLQNAVDSAALAAATILDNSKDKDAVDAEIGTA 78 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 H + + + + T + + + + F Sbjct: 79 LNAMAASTGNH--EIDFTTASINIDYSNDPQDFTGTATFGDDDDVYVRVRVDSLDMDEFF 136 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 + L + + + V+ + + D D Sbjct: 137 IQMFG-----LVKEVSASAVAGPSSGQEVVNNVVPIGVCIGDGTSDNDVDP 182 >gi|329963582|ref|ZP_08301061.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] gi|328528571|gb|EGF55542.1| von Willebrand factor type A domain protein [Bacteroides fluxus YIT 12057] Length = 342 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 21/168 (12%), Positives = 55/168 (32%), Gaps = 49/168 (29%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L ++P + T A++ A R + + + Sbjct: 142 TQLPITSDYISAKMFLESIDPSLISKQGTAIGAAINLATRSFTPQ----------EGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 +I ITDGEN ++ + G+++ + V P Sbjct: 192 AIIVITDGENHEG-------GAVEAAKAATEKGIQVNVLGVGMPDGAPIPMEGTNDYRRD 244 Query: 353 ----------GQDLLRKCTDSSGQFF-AVNDSRELLESFDKITDKIQE 389 +++ ++ + + V+++ + I+ +I + Sbjct: 245 RDGNVIVTRLNEEMCQEIAKAGNGIYVRVDNT---NSAQKAISQEINK 289 >gi|326916561|ref|XP_003204575.1| PREDICTED: matrilin-3-like [Meleagris gallopavo] Length = 363 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 20/86 (23%), Positives = 27/86 (31%), Gaps = 15/86 (17%) Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 T K VI +TDG + R AG++IY+V V LR Sbjct: 67 TYFDKVVIVVTDGRPQD--------QVENVAANARTAGIEIYAVGVGR--ADMQSLRIMA 116 Query: 362 DSS--GQFFAVND---SRELLESFDK 382 F V +L F + Sbjct: 117 SEPLDEHVFYVETYGVIEKLTAKFRE 142 >gi|225621507|ref|YP_002722766.1| hypothetical protein BHWA1_02609 [Brachyspira hyodysenteriae WA1] gi|225216328|gb|ACN85062.1| putative membrane protein containing von Willebrand factor (vWA) type A domain [Brachyspira hyodysenteriae WA1] Length = 324 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 45/165 (27%), Gaps = 41/165 (24%) Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 +P + ++ L+ L T A+ A Sbjct: 116 TSFIASPFTQDMETFTYILDNLTTKSVTLQGTRIADALVTAKNTFN----------VDAV 165 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------ 351 KK +I ITDGE+ G I + + + +Y+V V Sbjct: 166 SKKSIILITDGEDHGGYFDD-------ILKQLNEMNVSVYTVGVGTEVGATISTDLGVRE 218 Query: 352 ------EGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 + L+ + G+ + + L FD + + Sbjct: 219 KSVVSKRDDNTLKLIADSTHGKSYIAENVS-LESIFDDMKQSMDS 262 >gi|114557513|ref|XP_001143250.1| PREDICTED: calcium-activated chloride channel regulator 1 [Pan troglodytes] Length = 914 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 58/151 (38%), Gaps = 26/151 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 ++ + + RL T+ + A+ + + + ++ +T Sbjct: 362 INSGSDRDTLAKRLPA-AASGGTSICSGLRLAFTVIRKKYPTD---------GSEIVLLT 411 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DGE++ S N + + +G I++VA+ P Q+L + G +D Sbjct: 412 DGEDNTISGCFNEV---------KQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASD 461 Query: 373 ---SRELLESFDKITD---KIQEQSVRIAPN 397 + L+++F ++ + ++S+++ Sbjct: 462 QVQNNGLIDAFGALSSGNGAVSQRSIQLESK 492 >gi|51598434|ref|YP_072622.1| hypothetical protein BG0172 [Borrelia garinii PBi] gi|51573005|gb|AAU07030.1| hypothetical protein BG0172 [Borrelia garinii PBi] Length = 333 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/172 (15%), Positives = 61/172 (35%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + + +L+ ++ + + A L S LK+ + Sbjct: 146 VVPITTDRDFFNKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEALKRSI 195 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP----------------- 351 + +TDG + Y++ Q+ + +KIYS+ + + Sbjct: 196 VVLTDGVVNSDEIYKD-----QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGS 250 Query: 352 ----EGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + + + G F++VND + + K E+ ++IA + Sbjct: 251 LKEIYDPSMLVEISHKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAVD 302 >gi|238064101|ref|ZP_04608810.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] gi|237885912|gb|EEP74740.1| von Willebrand factor type A [Micromonospora sp. ATCC 39149] Length = 626 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 28/236 (11%), Positives = 59/236 (25%), Gaps = 57/236 (24%) Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 ++ LD S SM D + ++ T Sbjct: 443 PVAAYYCLDGSGSMGDNDGWTGIEAAATQVFDPEQAAQNLLQTHPQDVTTVAIFNGGVTG 502 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 V L + + + Sbjct: 503 GSPWQVRGNDGDALRD---------------------------------------LARSV 523 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 P TN Y + A EL ++ K+ V+ +TDG++ Sbjct: 524 ADYEPEGGTNMYACLLRATTELTGQQNGDR--------KRLVVLMTDGQSGAEQRDD--- 572 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + +R+A + + ++A + + L ++G F +L+ + + Sbjct: 573 ----ALDALRSADVPVVAIAFGRDADPRQLEEVAKATNGTFVR---EDDLVAALRQ 621 >gi|74136383|ref|NP_001028084.1| calcium-activated chloride channel regulator 1 precursor [Macaca mulatta] gi|75043731|sp|Q6PT52|CLCA1_MACMU RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Flags: Precursor gi|46371863|gb|AAS90562.1| calcium-activated chloride channel family member 1 [Macaca mulatta] Length = 913 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 58/151 (38%), Gaps = 26/151 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 ++ + + RL T+ + A+ + + + ++ +T Sbjct: 362 INSGSDRDTLTKRLPT-AASGGTSICSGLRLAFTVIKKKYPTD---------GSEIVLLT 411 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DGE++ S N + + +G I++VA+ P ++L + G +D Sbjct: 412 DGEDNTISGCFNEV---------KQSGAIIHTVALG-PSAARELEELSKMTGGLQTYASD 461 Query: 373 ---SRELLESFDKITDK---IQEQSVRIAPN 397 + L+++F ++ + E+S+++ Sbjct: 462 QVQNNGLIDAFGALSSGNGVVSERSIQLESK 492 >gi|86559780|gb|ABD04185.1| collagen alpha-1 chain, type XII-like protein [Anthopleura elegantissima] Length = 101 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 12/95 (12%), Positives = 30/95 (31%), Gaps = 4/95 (4%) Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + K ITDG + ++ G+++Y++ + + Sbjct: 8 TPAAGMRRNVPKMAFVITDGVQTRRDPITPLATASG---RLKAKGVRVYALGIGKNIKDN 64 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + L + + + + L + I I E Sbjct: 65 E-LNAIASNRKFVYRTSSFKLLTPLINGIVKSICE 98 >gi|332705169|ref|ZP_08425251.1| hypothetical protein LYNGBM3L_03330 [Lyngbya majuscula 3L] gi|332356119|gb|EGJ35577.1| hypothetical protein LYNGBM3L_03330 [Lyngbya majuscula 3L] Length = 571 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 20/157 (12%), Positives = 50/157 (31%), Gaps = 21/157 (13%) Query: 235 SVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 + +I I ++ I + + + + LNP Y A A L + Sbjct: 424 TEQIALIDFDSEIRPPVLVDSTPQKRDRAMAFILNLNPDGGNRLYDAALEARNWLQKNYQ 483 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG------MKIYSVAV 347 V+ +TDG++ + L Q+ + + +G + +++ Sbjct: 484 ERAI--------NAVVILTDGDD-----SDSKLRLEQLSQELEKSGFSSDNRIAFFTIGY 530 Query: 348 SA-PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 +L + + + ++ D +L + Sbjct: 531 GNEGEFNPKVLEQIAEFNWGYYRQGDPSTILNLMADL 567 >gi|296208407|ref|XP_002751080.1| PREDICTED: calcium-activated chloride channel regulator 1 [Callithrix jacchus] Length = 914 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 21/151 (13%), Positives = 56/151 (37%), Gaps = 24/151 (15%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKESSHN 297 + ++ N+ ++ + +L T+ + A+ + + + Sbjct: 346 MVTFDSAAYVQSELVQINSGSDRDTLAKRLPTAAAGGTSICTGLRSAFTVIRKKYPTD-- 403 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 ++ +TDGE++ S N + + +G I++VA+ P Q+L Sbjct: 404 -------GSEIVLLTDGEDNTISGCFNEV---------KQSGAVIHTVALG-PSAAQELE 446 Query: 358 RKCTDSSGQFFAVNDSRE---LLESFDKITD 385 + + G +D + L+++F ++ Sbjct: 447 QLSKMTGGFQTYASDQAQNNGLIDAFGALSS 477 >gi|320589835|gb|EFX02291.1| von willebrand factor type a domain containing protein [Grosmannia clavigera kw1407] Length = 735 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 25/195 (12%), Positives = 60/195 (30%), Gaps = 27/195 (13%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 +K P + +D++ +A ++ ++ + V Sbjct: 110 DAPVPATKGEPMESNGLTVLDLVKHAARTILETLNEHD-------CLGIVTFSEDANVLL 162 Query: 251 QCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 TP++ N + + L P TN + + ++ + S ++ Sbjct: 163 MLTPMTQVNKAKALQVILDLEPLTVTNLWKGLTAGIEIFSSKAQFSSVPS--------IM 214 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTD-SSG 365 +TDG + +R G I++ LL+ ++ + G Sbjct: 215 LLTDGLPNFMHP------PQGYIPKLRTFGKLPAPIHTFGFGYNLRS-GLLKSISELTGG 267 Query: 366 QFFAVNDSRELLESF 380 + ++D+ L F Sbjct: 268 NYAFISDAGMLGTVF 282 >gi|162449101|ref|YP_001611468.1| hypothetical protein sce0831 [Sorangium cellulosum 'So ce 56'] gi|161159683|emb|CAN90988.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 377 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/185 (16%), Positives = 56/185 (30%), Gaps = 17/185 (9%) Query: 208 RKIDVLIESAGNLVNSIQKAIQE------KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 K + + V A ++ + + + + L N E Sbjct: 107 SKWVGVTNALSAFVTDAASAGMNVGLTYFPRSSNGQSDCNHTSYDELAVAIGELPVNTPE 166 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + + +P T PA+ K+++ + V+ TDG+ SG S Sbjct: 167 LTASIQSTSPGGGTPMRPALQGVLTNATAYKDANPSHKV------IVVLATDGDPSGCSG 220 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVN-DSRELLE 378 +R G++ Y VAV L + + G + F V D Sbjct: 221 NTVASTAEMAQRALRYNGVQTYVVAVQ--GSTLTNLDQIAAAGGTTRAFDVTADITAFSA 278 Query: 379 SFDKI 383 +I Sbjct: 279 KMAEI 283 >gi|27367212|ref|NP_762739.1| hypothetical protein VV2_0803 [Vibrio vulnificus CMCP6] gi|27358780|gb|AAO07729.1| Uncharacterized protein [Vibrio vulnificus CMCP6] Length = 688 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/121 (18%), Positives = 49/121 (40%), Gaps = 19/121 (15%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NN+ + +N++ TN Y A+ A L +++ + +I +TDG Sbjct: 367 NNVTQAIETINQIATGGGTNLYDALERAVSGLDSDRTTG------------IILVTDGVA 414 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + Q + M+ +++Y+ + LL T S+G ++++S + Sbjct: 415 NVGVT-----EKKQFLKLMQRYDVRLYTF-IMGNSANTPLLEPMTQVSNGFATSISNSDD 468 Query: 376 L 376 + Sbjct: 469 I 469 >gi|30030|emb|CAA33889.1| alpha-1 collagen VI (AA 574-1009) [Homo sapiens] Length = 436 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 32/222 (14%), Positives = 62/222 (27%), Gaps = 25/222 (11%) Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 N + +K +F S P P + D A L A Sbjct: 211 KNVTAQICIDKKCPDYTCPITFSSPADITILLEPPPDVGSHNFDTTKRFAKRLAERFLTA 270 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHH 283 + VR+ + Y+ N + S ++ ++ + T+ A+ + Sbjct: 271 GRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGY 330 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 R + KK ++ +DG + G + + + AG++I+ Sbjct: 331 VTRFYREASSGAA--------KKRLLLFSDGNSQG----ATPAAIEKAVQEAQRAGIEIF 378 Query: 344 SVAVSAPPE---------GQDLLRKCTDSSGQFFAVNDSREL 376 V V G+ F V + L Sbjct: 379 VVVVGRQVNEPHIRVLVTGKTAEYDVAYGESHLFRVPSYQAL 420 >gi|291404848|ref|XP_002718766.1| PREDICTED: von Willebrand factor A domain containing 2 [Oryctolagus cuniculus] Length = 787 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 19/136 (13%), Positives = 46/136 (33%), Gaps = 18/136 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G + + EVK+++ ++ T T A+ Sbjct: 82 DISQERVRVGAMQFGSAPHLEFPLDAFPTRQEVKAQVKRMIFKGGRTETGLALKRLLHGG 141 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + ++ +TDG++ G A + ++ G+ +++V V Sbjct: 142 FPGGRNG-------SVPQLLVIVTDGKSQGHVASP--------AKQLKERGVTVFAVGVR 186 Query: 349 APPEGQDLLRKCTDSS 364 P + L Sbjct: 187 FPRWEE--LHALASEP 200 >gi|310825891|ref|YP_003958248.1| hypothetical protein ELI_0266 [Eubacterium limosum KIST612] gi|308737625|gb|ADO35285.1| predicted protein [Eubacterium limosum KIST612] Length = 838 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 40/271 (14%), Positives = 75/271 (27%), Gaps = 31/271 (11%) Query: 149 SICMVLDVSRSMEDLYLQKHNDNNN---MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 SI +VLD S SM D L + T S Sbjct: 106 SIVLVLDNSGSMGWGSSPTPADYARDALKEFANEFLKNGNSGNKLGLVTYSSGSGVPIYD 165 Query: 206 ANRKIDVLI----ESAGNLVNSIQKAIQEKKNLSVRIG-------TIAYNIGIVGNQCTP 254 A +I + ++ + +Q + +A + Sbjct: 166 AYDEIKYVQGYSMTENSDIFGQVVDGLQTPSGETDVQMGIKTARDILAADTSGNPQFILV 225 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 S+ +R N P + + G+ ++ Sbjct: 226 FSDGATNRSARPTSAGELGGANISPC-TFGDKIYDMTFKFDGFDYGAQGSA---VYNDGV 281 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL-------LRKCTDSSGQF 367 + + +T+ + ++ G+ IYSV P ++ S GQ+ Sbjct: 282 YTTNGNVNAHTVAAVSEALLAKDQGIDIYSVFYHNPALNDLEYGAGVFVMKNSASS-GQY 340 Query: 368 FAVN--DSRELLESFDKITDKIQEQSVRIAP 396 ++ ++ E F +I +IQE IAP Sbjct: 341 TEISPGNAGAFAEIFTEIEKQIQES---IAP 368 >gi|218666515|ref|YP_002427074.1| hypothetical protein AFE_2697 [Acidithiobacillus ferrooxidans ATCC 23270] gi|218518728|gb|ACK79314.1| conserved hypothetical protein [Acidithiobacillus ferrooxidans ATCC 23270] Length = 590 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 13/62 (20%), Positives = 32/62 (51%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI++ + L + + ID+ H+ Y++ +Q D A ++G + + +++ K+ Sbjct: 23 IAAIVMPIMILALAFGIDIGHMAYVQRNLQKIADMAAIAGAEDVPNAQSLATGNAVKNGL 82 Query: 61 ST 62 T Sbjct: 83 QT 84 >gi|156383253|ref|XP_001632749.1| predicted protein [Nematostella vectensis] gi|156219809|gb|EDO40686.1| predicted protein [Nematostella vectensis] Length = 184 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 18/161 (11%), Positives = 47/161 (29%), Gaps = 20/161 (12%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT-NTYPAMHHAYRELY 289 I Y + +++ +++ +N + + NT A+ ++ Sbjct: 38 VSESRSHISVTVYTDNPRIHFTLTQHYDISTIETAINSITYTGGSANTGKALSSVKTSVF 97 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + + + +I + + + + +R+AG+ I+SV + Sbjct: 98 DATGRA-------NVPRVLILLAHAVSLDGVSS--------ASQALRSAGVTIFSVGIG- 141 Query: 350 PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQ 388 L F EL + +I Sbjct: 142 NYIKIFQLNTIASDPDVDHVFTA-SFSELHSLEGSVRKRIC 181 >gi|145482457|ref|XP_001427251.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124394331|emb|CAK59853.1| unnamed protein product [Paramecium tetraurelia] Length = 568 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 57/169 (33%), Gaps = 22/169 (13%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTN 276 +LV + IQ+ + + RI + + N ++ N ++K + + ++TN Sbjct: 149 SLVKDSLRYIQKILSPNDRIALVTFGTYSGINLPWTINKPENKQKIKDAIIGMKIRDSTN 208 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM- 335 + R + K+ + T + +TDG+ + + C+ Sbjct: 209 IADGVKLGLRMIKERKQKNPVT--------CMFVLTDGQ-------DDNKGADERCQQAI 253 Query: 336 RNAGMK----IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 ++ I S + + + G F + + + E F Sbjct: 254 NEYQIQDTFVINSFGYGQDHDAKVMNNISNLKGGTFTFIENIAKASEHF 302 >gi|198284406|ref|YP_002220727.1| membrane protein [Acidithiobacillus ferrooxidans ATCC 53993] gi|198248927|gb|ACH84520.1| membrane protein [Acidithiobacillus ferrooxidans ATCC 53993] Length = 596 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 13/62 (20%), Positives = 32/62 (51%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI++ + L + + ID+ H+ Y++ +Q D A ++G + + +++ K+ Sbjct: 29 IAAIVMPIMILALAFGIDIGHMAYVQRNLQKIADMAAIAGAEDVPNAQSLATGNAVKNGL 88 Query: 61 ST 62 T Sbjct: 89 QT 90 >gi|89889806|ref|ZP_01201317.1| BatB [Flavobacteria bacterium BBFL7] gi|89518079|gb|EAS20735.1| BatB [Flavobacteria bacterium BBFL7] Length = 343 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 45/165 (27%), Gaps = 45/165 (27%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 P++ + + K L +N ++ A+ A + + T K ++ I+ Sbjct: 144 LPITTDYSSAKMFLQSMNTDLVSSQGTAIAEAIQLAESY------YSEDTEASKVLVIIS 197 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA-------------------PPEG 353 DGE+ L E G++I ++ V G Sbjct: 198 DGEDHEG-------EALDYAEAAAENGIRIITIGVGTEKGGTIPIKRNGIVREFKKDSNG 250 Query: 354 QDL--------LRKCTDSSGQFF-----AVNDSRELLESFDKITD 385 + L + + + +L E I Sbjct: 251 NTVITRLNSETLEEIASVGNGVYIDGTITASVIEKLKEELSGIDK 295 >gi|311271483|ref|XP_003133150.1| PREDICTED: anthrax toxin receptor-like [Sus scrofa] Length = 728 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/253 (13%), Positives = 70/253 (27%), Gaps = 28/253 (11%) Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + +LD+S S+ + ++ K P K S T ++ Sbjct: 82 QSTYDLYFILDMSGSINNNWM--DIYALVEDLVKKFDNP---KIRMSFITYSTQ-----G 131 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 K+ + + V + R + P+ S Sbjct: 132 HILMKLTSDKDLSEFEVEENESVGCYHSGGRKRGRV----GCGPLVEELPIREKKTTASS 187 Query: 265 R--LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + L + + + ++ G ++ +I +TDG Sbjct: 188 KAGIQNLKQESFGEAFEGLPDPLYRVEANEQIEQANSGENKVPSMIIALTDG----TLES 243 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + T Q + R G +Y + V + L + DSS F V+ F Sbjct: 244 ISLQETKQQADRARKLGANVYCIGV--KDYETEQLSEIADSSDHVFGVDQG------FKA 295 Query: 383 ITDKIQEQSVRIA 395 + + I + Sbjct: 296 LKNIIDPLMTKTC 308 >gi|294140611|ref|YP_003556589.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella violacea DSS12] gi|293327080|dbj|BAJ01811.1| inter-alpha-trypsin inhibitor domain protein [Shewanella violacea DSS12] Length = 747 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/141 (15%), Positives = 46/141 (32%), Gaps = 15/141 (10%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRE--LYNEKESSHNTIGSTRLKKFVIFITDGE 315 N+ + L T PA+ A + + K V+F+TDG Sbjct: 398 NIGRANQFIRTLKADGGTEMGPALTRALDNGNHSTSHQDEEDFDSDGVRLKQVLFMTDGA 457 Query: 316 NSGASAYQNTLNTLQICEYMRNAGM---KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 + + N + + +++++ + A P + R G F + Sbjct: 458 VANERSLFN----------LIEDKIGHSRLFTIGIGAAPNSHFMERAAEFGKGTFTYIGK 507 Query: 373 SRELLESFDKITDKIQEQSVR 393 E+ + + + KI+ V Sbjct: 508 LGEVQQKIESLLYKIEHPQVT 528 >gi|225174955|ref|ZP_03728952.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] gi|225169595|gb|EEG78392.1| von Willebrand factor type A [Dethiobacter alkaliphilus AHT 1] Length = 841 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 27/149 (18%), Positives = 63/149 (42%), Gaps = 19/149 (12%) Query: 257 NNLNEVKSRL-NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 +N K + L P T+ A+ A+ +L++ + + V+F+TDGE Sbjct: 98 DNKGTFKEAVEGNLVPRGFTDYVGALEEAFEQLHSVETGDAR--------QVVVFLTDGE 149 Query: 316 NSGASAYQNTLNTLQ--------ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQ 366 + +N ++ + AG+ +Y VA + G ++L + + Sbjct: 150 PNPHLDARNDDEFMEGYLGELWDLTGEYAAAGVPVYPVAF-SDEVGPEVLEQIAGHTGAD 208 Query: 367 FFAVNDSRELLESFDKITDKIQEQSVRIA 395 F + D +L+ +F ++ +++ +++ A Sbjct: 209 FVLMPDPGDLVVTFFELVSRLKNRNLFFA 237 >gi|254416017|ref|ZP_05029773.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] gi|196177192|gb|EDX72200.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] Length = 744 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/164 (15%), Positives = 46/164 (28%), Gaps = 19/164 (11%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSN---NLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 N + ++ PL+N N + + +N+L T + + Sbjct: 287 NPQDTFTILDFSDITTQLSAKPLANTPQNRIKALTYINQLKANGGTYLLNGIRAVLN--F 344 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + ++ ITDG N L + +G ++YS V + Sbjct: 345 PAAPEGRL--------RSIVLITDG------YIGNESEILAEVKQYLKSGNRLYSFGVGS 390 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 P L R G V E +K I + Sbjct: 391 SPNRFLLNRMAELGRGTSRIVRQDESTQEVTEKFLRHINNPVLT 434 >gi|18490111|gb|AAH22236.1| Unknown (protein for IMAGE:4178997) [Homo sapiens] Length = 439 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 62/222 (27%), Gaps = 25/222 (11%) Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 N + +K +F S + + + D A L A Sbjct: 214 KNVTAQICIDKKCPDYTCPITFSSPADITILLDGSASVGSHNFDTTKHFAKRLAERFLTA 273 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHH 283 + VR+ + Y+ N + S ++ ++ + T+ A+ + Sbjct: 274 GRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGY 333 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 R + KK ++ +DG + G + + + AG++I+ Sbjct: 334 VTRFYREASSGAA--------KKRLLLFSDGNSQG----ATPAAIEKAVQEAQRAGIEIF 381 Query: 344 SVAVSAPPE---------GQDLLRKCTDSSGQFFAVNDSREL 376 V V G+ F V + L Sbjct: 382 VVVVGRQVNEPHIRVLVTGKTAEYDVAYGESHLFRVPSYQAL 423 >gi|325845103|ref|ZP_08168414.1| von Willebrand factor type A domain protein [Turicibacter sp. HGF1] gi|325488845|gb|EGC91243.1| von Willebrand factor type A domain protein [Turicibacter sp. HGF1] Length = 315 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/255 (13%), Positives = 72/255 (28%), Gaps = 38/255 (14%) Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIER-----SSENLAI 148 T + N Q + + A P + G+ P + + + Sbjct: 81 TVIQTNLNQSLFVTSASNVKPQIDFTYLGITPINPMQGQEFTVRYKLTPNPFQHNISKPK 140 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 I +VLD S SM + N L K ++ + P Sbjct: 141 EIVLVLDGSGSMSGT-KLTNLKNAAKDFIDRLKGVDNLKVAIVVFSSNATINPISVSGTT 199 Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 KI + + + L N + + + +N Sbjct: 200 KI----------------------KSTDKSSESSIPNYKTLQNEYFLDINDSRLITMINN 237 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 ++ TNT + A L G + K +I ++DG + +T + Sbjct: 238 IDAQGGTNTGDGLRKAEYLLS--------QKGDSVANKTIILMSDGLPTY--YSGSTESG 287 Query: 329 LQICEYMRNAGMKIY 343 + + +++ + I+ Sbjct: 288 VNYYKEIKDDVVGIF 302 >gi|157738379|ref|YP_001491063.1| hypothetical protein Abu_2179 [Arcobacter butzleri RM4018] gi|157700233|gb|ABV68393.1| hypothetical protein Abu_2179 [Arcobacter butzleri RM4018] Length = 1866 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 34/320 (10%), Positives = 81/320 (25%), Gaps = 23/320 (7%) Query: 52 DPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQY 111 S IK + K + + Sbjct: 1243 SAALTDTDGSETLSVIIKNVPASATLESSKYEVSKNSDGSYTVKVPQGETSISDKLTMKV 1302 Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 + A + ++++ ++ S + D Sbjct: 1303 PQEDAKNINLQIEAKATEARDNEDGQNFKTATDSTTDKTPTLVVGSNKDSVINGGAGKDI 1362 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTK--------SKYAPAPAPANRKIDVLIESAGNLVNS 223 + L K++ S A +ID+L ++ NL +S Sbjct: 1363 LIGDTGGTQLNVQAGKNYNIALVVDTSGSMKEASGSKTAWGTTISRIDLLKDALKNLADS 1422 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 ++ + V I N S N++++ ++++ L TN A Sbjct: 1423 LKGHDGKIN---VSIIDFDTNAKEPITFNDLTSKNISDLITKIDALKAEGGTNYEDAFLK 1479 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA----- 338 + S + + F+TDG+ + ++ + T M++A Sbjct: 1480 TTSWFDTQ---SVTYGKAQGYENLTYFLTDGDPTFSNRNTTSTGTTTEYSDMKDAVDAFK 1536 Query: 339 ----GMKIYSVAVSAPPEGQ 354 ++++ + Sbjct: 1537 TLSGQSTVHAIGIGNGINEN 1556 >gi|304320960|ref|YP_003854603.1| hypothetical protein PB2503_06977 [Parvularcula bermudensis HTCC2503] gi|303299862|gb|ADM09461.1| hypothetical protein PB2503_06977 [Parvularcula bermudensis HTCC2503] Length = 433 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 19/219 (8%), Positives = 52/219 (23%), Gaps = 21/219 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+++ +D + + ++Q + + D Sbjct: 31 LMAVMLVPSVGLGALIVDGSRMRTAHLEIQ----------IVAEAAALAAAQNLPSVDDA 80 Query: 61 STIFKKQIKKHLKQ---GSYIRENAGDIAQKAQINITKDKNNPLQYIAES-KAQYEIPTE 116 + +L G+ +R + N T + + + Sbjct: 81 REAATDYAEANLDPTKYGNVVRSTDVEFGTYDDSNGTFSVGGTTAVRVTAGRTEDRSNAF 140 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + G+I +L+ + + E S N + + N + Sbjct: 141 STLFGGVIGRPSVDLTASAIAVAETSGGNPICILVLGF---GYYGLDMDGDIN----VDI 193 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + + S A ++D + Sbjct: 194 PDCGIQVNSDDDDAMNSKDDSYVNAAYIHVVGEVDGDTD 232 >gi|126273404|ref|XP_001377627.1| PREDICTED: similar to AMACO [Monodelphis domestica] Length = 784 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/171 (16%), Positives = 61/171 (35%), Gaps = 18/171 (10%) Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHH 283 +IG + Y + + + +++ +T A+ H Sbjct: 560 SSLQFNINRDVTQIGLVVYGSRVQTTFALDTHPTSSSLLQAISQAPYMDGAGSTGSALLH 619 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 Y E+ ++ + + K V+ IT+G T + + + +RN G+ + Sbjct: 620 VYEEVMTVQKGAR-----PGVSKAVVVITEG--------TGTEDAVVPAQKLRNNGISVL 666 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT-DKIQEQSVR 393 +AV P + LLR S V +L + + ++I E++ R Sbjct: 667 VIAVG-PVLKETLLR-LAGSPDFLIHVASYEDLEN-YQDLFIERICEEAKR 714 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 47/145 (32%), Gaps = 20/145 (13%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 VR+G + ++ EVK ++ K+ T T A+ + + + Sbjct: 86 DRVRVGVLQFSSVPQLEFPLDSFFTREEVKEKIKKIVFKGGSTETGLALKYLLHKGFPGG 145 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 +S + + +I +TDG++ G ++ G+ +++V V P Sbjct: 146 RNS-------SVPQLLIIVTDGKSQGNIDLP--------ANQLKERGVMVFAVGVRFP-- 188 Query: 353 GQDLLRKCTDSSGQFF--AVNDSRE 375 L + + Sbjct: 189 RWTELHTLASEPKDQYVLFAEHVDD 213 >gi|320159019|ref|YP_004191397.1| hypothetical protein VVM_02412 [Vibrio vulnificus MO6-24/O] gi|319934331|gb|ADV89194.1| uncharacterized protein [Vibrio vulnificus MO6-24/O] Length = 688 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 22/121 (18%), Positives = 49/121 (40%), Gaps = 19/121 (15%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NN+ + +N++ TN Y A+ A L +++ + +I +TDG Sbjct: 367 NNVTQAIETINQIATGGGTNLYDALERAVSGLDSDRTTG------------IILVTDGVA 414 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRE 375 + Q + M+ +++Y+ + LL T S+G ++++S + Sbjct: 415 NVGVT-----EKKQFLKLMQRYDVRLYTF-IMGNSANTPLLEPMTQVSNGFATSISNSDD 468 Query: 376 L 376 + Sbjct: 469 I 469 >gi|260061450|ref|YP_003194530.1| hypothetical protein RB2501_07615 [Robiginitalea biformata HTCC2501] gi|88785582|gb|EAR16751.1| hypothetical protein RB2501_07615 [Robiginitalea biformata HTCC2501] Length = 348 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 15/109 (13%), Positives = 38/109 (34%), Gaps = 20/109 (18%) Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIG 300 P++ + K L +N T + A+ A +E++++ Sbjct: 136 YAAQAFPQLPITTDYGAAKMFLQSMNTDMLSSQGTAIHEAIELAATYFDDEEQTN----- 190 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + ++DGE + E + G++I+++ V + Sbjct: 191 -----RILFLVSDGE------DHAEDQVMDAIEQATDQGIRIFTIGVGS 228 >gi|154496349|ref|ZP_02035045.1| hypothetical protein BACCAP_00637 [Bacteroides capillosus ATCC 29799] gi|150274432|gb|EDN01509.1| hypothetical protein BACCAP_00637 [Bacteroides capillosus ATCC 29799] Length = 1896 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 20/186 (10%), Positives = 60/186 (32%), Gaps = 25/186 (13%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 A ++ + + + N+V ++++ + K + ++ N ++ Sbjct: 205 SGKAQSRLKIAVSAVKNMVGTLREQLGGKLTA----KFVVFSSEGYKNGVDKRASAKVIT 260 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 +++L++L T+ + + + + ++ + DG++ Sbjct: 261 EAQLDQLTAVGGTDLSAGVALGVDQFKSSSAR-----------QVLVVVADGDSDDGYPN 309 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + +N IY+V + + + N EL E+ + Sbjct: 310 RT-------ANNFKNKDGIIYTVGFTFSSDS---FNNLATDADHALLANSDTELGEAMED 359 Query: 383 ITDKIQ 388 I+ I Sbjct: 360 ISTDIT 365 >gi|115689614|ref|XP_787130.2| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 variant [Strongylocentrotus purpuratus] gi|115969501|ref|XP_001184100.1| PREDICTED: similar to inter-alpha (globulin) inhibitor H3 variant [Strongylocentrotus purpuratus] Length = 902 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 25/225 (11%), Positives = 66/225 (29%), Gaps = 20/225 (8%) Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS---------KNTTK 196 Y +++ NN + + F+S Sbjct: 291 APGPREQTAANPNGFMGDYTVQYDTNNPEDGSDIQILDNHFVQFFSPSGLPVLRKNVIFI 350 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + + A + + ++ ++N + + +K N+ + + S Sbjct: 351 IDVSGSMAGVKLRQ--VKDALTTILNDMPET--DKFNIIPFSDDVNFLDR--NKMLFSTS 404 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N+ K + L +NTN + A+ R L +E + + + +I ++DG Sbjct: 405 SNVRRAKRFVKSLQERDNTNLHKAIIAGVRMLRDESDQN--VRPDENVVSMLIVLSDGNP 462 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 + + + E ++++ L + Sbjct: 463 NHGEI--DKEIIERNVEEAIRGDFSLFNLGFG-EDLDFPFLERMA 504 >gi|260801245|ref|XP_002595506.1| hypothetical protein BRAFLDRAFT_69088 [Branchiostoma floridae] gi|229280753|gb|EEN51518.1| hypothetical protein BRAFLDRAFT_69088 [Branchiostoma floridae] Length = 1641 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 44/152 (28%), Gaps = 20/152 (13%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE--NTNTYPAMHHAYRELYNEKESSHN 297 Y + N +V + + + T T A+ Y+ Sbjct: 627 VYQYGSDVRTEFPIGQYNTREDVLNAVLNIQYMNQWGTFTGKALEEVYKTF--------- 677 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 KK VI ITDG+ + + + ++ G I +V V L Sbjct: 678 -PAGDDAKKVVIIITDGK------AMDEEVLRKASQDVKADGAMICAVGVG--GFRLKEL 728 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 S F D ++ D + D + E Sbjct: 729 SLLASSQDLVFTATDFDKMDAIRDTVLDAVCE 760 >gi|320160484|ref|YP_004173708.1| hypothetical protein ANT_10740 [Anaerolinea thermophila UNI-1] gi|319994337|dbj|BAJ63108.1| hypothetical protein ANT_10740 [Anaerolinea thermophila UNI-1] Length = 802 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 45/133 (33%), Gaps = 14/133 (10%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + + L+ TN + A+ A + + +++ + ++IF+TDG + Sbjct: 358 QAQKWVAGLSAAGGTNIHRALLDAIQFVRSQRPT------------YLIFLTDGLPTVGI 405 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + G++++ V + L G V +L ++ Sbjct: 406 TDR-EQILDDFARQA-PRGLRLFVFGVGYDVDTFLLDELALAHHGLSLYVRPEEDLNQAV 463 Query: 381 DKITDKIQEQSVR 393 +KI + Sbjct: 464 AGFFEKISTPVLT 476 >gi|308063814|gb|ADO05701.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Sat464] Length = 217 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 70/203 (34%), Gaps = 30/203 (14%) Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 S +I+ L ++ ++++ + KK L ++ I + G G Sbjct: 21 VDTSGSMNESLGNCTRIEALNLCIQKMIETLKQ--EAKKELFSKMAIITF--GENGAVLH 76 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 +++ + + L+ T A A + ++ +T + K + I ++D Sbjct: 77 TPFDDVKNINFK--PLSASGGTPLDQAFRLAKNLIEDK-----DTFPTKFYKPYSILVSD 129 Query: 314 GENSGASA-------YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 GE + + + +T +C +S+ + + + Sbjct: 130 GELNDGKWQKALSDFHHDGRSTKSVC----------WSIFIGDREANPQVNKD--FGKDG 177 Query: 367 FFAVNDSRELLESFDKITDKIQE 389 F +D +L+ F+ +T I + Sbjct: 178 VFYADDVEKLVGLFEIMTQTISK 200 >gi|154280917|ref|XP_001541271.1| predicted protein [Ajellomyces capsulatus NAm1] gi|150411450|gb|EDN06838.1| predicted protein [Ajellomyces capsulatus NAm1] Length = 759 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 35/206 (16%), Positives = 68/206 (33%), Gaps = 15/206 (7%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE---SAGNLVNSIQKAIQEKKNLSVRI 238 P P + + S + AP P + E S +L + I E N + R+ Sbjct: 70 PHVPCDIVLCIDVSYSMQSSAPLPTTDESGEREETGLSVLDLTKHAARTIIETLNENDRL 129 Query: 239 GTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 G +A++ N + + L P +TN + + + NE Sbjct: 130 GIVAFSTEAEVVYEISKMNESSKKAALKAVEALKPLSSTNLWHGLKLGLKAFENE----- 184 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK-IYSVAVSAPPEGQD 355 + + + + +TDG + Q + L+ + M I++ Sbjct: 185 --RHTPQSVQALYVLTDGMPNHMCPKQGYVTKLRPILQLLGHRMPMIHTFGFGYNIRS-G 241 Query: 356 LLRKCTD-SSGQFFAVNDSRELLESF 380 LL+ + G F + D+ + F Sbjct: 242 LLQAIAEVGGGTFAFIPDAGMIGTVF 267 >gi|313230659|emb|CBY18875.1| unnamed protein product [Oikopleura dioica] Length = 524 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 65/172 (37%), Gaps = 16/172 (9%) Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-------ENTNTYPAM 281 + R+ + Y+ + + +N V + +L +T T AM Sbjct: 252 DGISPDTNRVAMLRYSSDVKEDLNFIEGSNEPTVMRNIQRLKYKPITDDRHGSTYTAHAM 311 Query: 282 HHAYRELYNEKES--SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A + ++ + + T +++ V+ ITDGE++ G Sbjct: 312 DKALKTIFTSEAGWRNGTTEDGIKVRTEVVIITDGESNDPDETFTIQGQKV---KYDEYG 368 Query: 340 MKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQ 390 +K+Y++ V +D +R+ T F + ++L +F++I + + E Sbjct: 369 IKVYALGVG--DIKKDEIRQLTSMDDESIFYLMSWKDL-AAFNRIIETLIET 417 >gi|15594518|ref|NP_212307.1| hypothetical protein BB0173 [Borrelia burgdorferi B31] gi|3915348|sp|O51195|Y173_BORBU RecName: Full=Uncharacterized protein BB_0173 gi|2688067|gb|AAC66565.1| predicted coding region BB0173 [Borrelia burgdorferi B31] Length = 341 Score = 49.1 bits (115), Expect = 0.001, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 60/172 (34%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + +L+ ++ + + A L S LK+ + Sbjct: 154 VVPITTDREFFNKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEALKRSI 203 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------------------- 349 + +TDG + ++ Q+ + +KIYS+ + + Sbjct: 204 VVLTDGVVNSDEIXKD-----QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGS 258 Query: 350 --PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + ++ + G F++VND + + K E+ ++IA + Sbjct: 259 FKEVYDPSMLVEISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAVD 310 >gi|260793652|ref|XP_002591825.1| hypothetical protein BRAFLDRAFT_125323 [Branchiostoma floridae] gi|229277036|gb|EEN47836.1| hypothetical protein BRAFLDRAFT_125323 [Branchiostoma floridae] Length = 2660 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 18/106 (16%), Positives = 38/106 (35%), Gaps = 12/106 (11%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + T T A+ + E + + KK + ITDG+++ + Sbjct: 2068 DAKGGTATRLALKFLRESVIPEAVAELDRPDE--SKKALFLITDGKSNTGG------DPS 2119 Query: 330 QICEYMRNA-GMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVND 372 + +R G++IY++ + + + L S F + D Sbjct: 2120 EEARKLREELGLEIYTIGI-SNDVSKTELASVASSPKKEHMFTLKD 2164 >gi|153012136|ref|YP_001373346.1| cell wall anchor domain-containing protein [Ochrobactrum anthropi ATCC 49188] gi|151564024|gb|ABS17517.1| LPXTG-motif cell wall anchor domain protein [Ochrobactrum anthropi ATCC 49188] Length = 750 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 26/136 (19%), Positives = 50/136 (36%), Gaps = 18/136 (13%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N+ + + L T PA+H A L + + + + ++F+TDGE Sbjct: 412 NIASARRFVTSLEAQGGTEMLPALHAA---LDDSNQGNGL--------RQIVFLTDGEI- 459 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + L R +I+ V + + P + R G F + + E+ Sbjct: 460 ------SNEQQLLDAVAARRGRSRIFMVGIGSAPNSYLMNRAAELGRGTFTHIGSAAEVD 513 Query: 378 ESFDKITDKIQEQSVR 393 E + DK++ +V Sbjct: 514 ERMRALFDKLENPAVT 529 >gi|119510959|ref|ZP_01630081.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] gi|119464398|gb|EAW45313.1| von Willebrand factor, type A [Nodularia spumigena CCY9414] Length = 464 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 25/189 (13%), Positives = 50/189 (26%), Gaps = 23/189 (12%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 Y + T K KID++ ES L+ S + + RI Sbjct: 57 YEIVAGDTTPTGKTYTVDGKEYTQVTGGKSKIDIVSESLLALIRSGRLGASD------RI 110 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 + ++ + +N+++ + +L T + A L Sbjct: 111 AIVQFDDTASQIIGLTSATEINKLEDAIAQLRTFSGGTRMGLGLRRALEMLN-------- 162 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 + + TDG+ + I + I ++ V L Sbjct: 163 --NQQMTVRRTLLFTDGQT------FDDDQCRAIASDFATNNIPITALGVGEDFNEDLLT 214 Query: 358 RKCTDSSGQ 366 + G Sbjct: 215 HLSDSTGGT 223 >gi|67473840|ref|XP_652669.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS] gi|56469543|gb|EAL47283.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS] Length = 390 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 66/189 (34%), Gaps = 20/189 (10%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC-----TPLSNNLNEVKSRLNKL 269 +A + I I + + +VR + Y + T ++++ +K +NK+ Sbjct: 21 NAAQQSIEKIINTIIQSEKCNVRFALVEYKDHPPQDSSFAFRLTDFTDSMKCIKEAVNKM 80 Query: 270 NPYENTNT----YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + +T AMH A Y EK + I DG G + Sbjct: 81 DANGGGDTPESVCCAMHCASNLKYREKAAKVIVWIGDAPPHGFIREGDGFPKGCPCGCDL 140 Query: 326 LNTLQICEYMRNAGMKIYSVAVS--APPEGQDLLRKCTD-SSGQFFAVNDSREL-----L 377 L ++ C N + IY V + L+R + GQ+ A+ + L Sbjct: 141 LKEVRECM---NKDIVIYCVGAEPLTSEYLRTLMRAVAQLTGGQYAALASANVLGELITN 197 Query: 378 ESFDKITDK 386 + ++I K Sbjct: 198 GAIEEIQMK 206 >gi|67482407|ref|XP_656553.1| hypothetical protein [Entamoeba histolytica HM-1:IMSS] gi|56473758|gb|EAL51167.1| hypothetical protein, conserved [Entamoeba histolytica HM-1:IMSS] Length = 382 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 36/189 (19%), Positives = 66/189 (34%), Gaps = 20/189 (10%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC-----TPLSNNLNEVKSRLNKL 269 +A + I I + + +VR + Y + T ++++ +K +NK+ Sbjct: 21 NAAQQSIEKIINTIIQSEKCNVRFALVEYKDHPPQDSSFAFRLTDFTDSMKCIKEAVNKM 80 Query: 270 NPYENTNT----YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + +T AMH A Y EK + I DG G + Sbjct: 81 DANGGGDTPESVCCAMHCASNLKYREKAAKVIVWIGDAPPHGFIREGDGFPKGCPCGCDL 140 Query: 326 LNTLQICEYMRNAGMKIYSVAVS--APPEGQDLLRKCTD-SSGQFFAVNDSREL-----L 377 L ++ C N + IY V + L+R + GQ+ A+ + L Sbjct: 141 LKEVRECM---NKDIVIYCVGAEPLTSEYLRTLMRAVAQLTGGQYAALASANVLGELITN 197 Query: 378 ESFDKITDK 386 + ++I K Sbjct: 198 GAIEEIQMK 206 >gi|227827415|ref|YP_002829194.1| von Willebrand factor A [Sulfolobus islandicus M.14.25] gi|229584630|ref|YP_002843131.1| von Willebrand factor A [Sulfolobus islandicus M.16.27] gi|238619571|ref|YP_002914396.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] gi|227459210|gb|ACP37896.1| von Willebrand factor type A [Sulfolobus islandicus M.14.25] gi|228019679|gb|ACP55086.1| von Willebrand factor type A [Sulfolobus islandicus M.16.27] gi|238380640|gb|ACR41728.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] Length = 380 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 56/149 (37%), Gaps = 33/149 (22%) Query: 253 TPLSNNLNEVKSRLNKLN---------PYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 S+N+N +K ++ L+ T Y A+ A + + Sbjct: 78 ITFSSNVNVIKEFVDPLDLTNEILQIAAGGQTALYTAILTANSLAKKYQMPT-------- 129 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD- 362 +++ +TDG + + N L M++YS + + LL+ +D Sbjct: 130 ---YLLLLTDGNPTDETNVGNYLKLPYF------EKMQVYSFGIG-DDYNEQLLQSISDK 179 Query: 363 SSGQFFAVNDSREL-----LESFDKITDK 386 +SG + ++D+ E+ ++ +I K Sbjct: 180 TSGVMYHISDANEIPQKLPQKAVTQIAAK 208 >gi|162452306|ref|YP_001614673.1| glycine-rich protein [Sorangium cellulosum 'So ce 56'] gi|161162888|emb|CAN94193.1| glycine-rich protein [Sorangium cellulosum 'So ce 56'] Length = 408 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 28/222 (12%), Positives = 64/222 (28%), Gaps = 36/222 (16%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI-AYNIGIVGNQCTPLSNNLNEVK 263 + + ++ + G ++ I+ + L + P + ++ +K Sbjct: 113 EGSDRWTMMKGALGTALDVIRDRMSVGLQLFPSDEHCGMPAGEDLSVAVAPGATSVPAIK 172 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELY-----------------NEKESSHNTIGSTRLKK 306 + L+ +P T T A+ A L + + + T Sbjct: 173 TLLDGTDPGGATPTADALARALGYLTVGAGGALEGDKYVLLATDGGPNCNQDPAMTCEAA 232 Query: 307 FVIFITDGENSGASAYQNTLNTLQIC----------EYMRNAGMKIYSVAVSAPPEGQDL 356 DG+ +C + +R AG+K + V + + Sbjct: 233 TCTTNMDGDCPSGVPNCCVAGLTDVCLDDVRTVQRVKDLRAAGIKTFVVGIPGATAYAHV 292 Query: 357 LRKCTDSSG--------QFFAVNDSRELLESFDKITDKIQEQ 390 L + ++F V D+ L ++ IT + Sbjct: 293 LDQLAVEGDTATSETSPRYFEVVDAASLGDTLTGITRDLVRT 334 >gi|119493487|ref|ZP_01624154.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] gi|119452670|gb|EAW33850.1| von Willebrand factor, type A [Lyngbya sp. PCC 8106] Length = 843 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 52/165 (31%), Gaps = 21/165 (12%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSN---NLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 N I ++ TPL+N N + + +N+L T + Sbjct: 374 NPDDTFSIIDFSDTTTALSATPLTNTVTNQQKAIAYINQLEANGGTELLNGIQTVMDFPS 433 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + ++ ITDG N L + + +G ++YS V + Sbjct: 434 PPVKR----------LRSIVLITDG------YIGNENEVLSVVKNQLKSGNRLYSFGVGS 477 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT-DKIQEQSVR 393 + LL + + + E E ++ ++I + Sbjct: 478 SVN-RFLLNRLAEIGRGTTQITRPDEPTEDQVELFVNQINNPVLT 521 >gi|72162665|ref|YP_290322.1| von Willebrand factor, type A [Thermobifida fusca YX] gi|71916397|gb|AAZ56299.1| von Willebrand factor, type A [Thermobifida fusca YX] Length = 596 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 37/353 (10%), Positives = 86/353 (24%), Gaps = 21/353 (5%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 + D S + L S + + + ++ Sbjct: 235 AALQNLQKSIAPDEEAAFTLLSEAADGESAAPLLVLSEQAAWRHSTRTGSTVYVIYPEDG 294 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL---DV 156 + P L + + + S + +I ++ Sbjct: 295 TYTLDYPYVLRTNDPEVQLAAEDFRAFLTSRTAQEVIRADGFRSADDSIDPAVLTEEHGF 354 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 + + + + S P ++ V + Sbjct: 355 RPEHPGELPAPGEQAAKSLIQAWNQLKLGTRLLTVIDISGSMAELVPGTGLTRMQVTTMA 414 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN------EVKSRLNKL- 269 A + + PL ++N + + L +L Sbjct: 415 ATQGLTLFPGNAEVGLWEFSVDLDEG-RDYRELVPIRPLDTDVNGATQHETLTAALAQLA 473 Query: 270 -NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 P +T Y + AY+EL ++ +TDG N + Sbjct: 474 PKPDGDTGLYDTILAAYQELTETYAPDRVNT--------LLVLTDGNNDDYDSITLEELL 525 Query: 329 LQICEYMR-NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 ++ E + + + I +++ + + L R + G + D E+ E F Sbjct: 526 KELGEAYQPDRPVSIIAISFGPDVDPEPLERIAEVTRGAAYTTEDPTEIGEIF 578 >gi|32476015|ref|NP_869009.1| inter-alpha-trypsin inhibitor domain-containing protein [Rhodopirellula baltica SH 1] gi|32446559|emb|CAD76394.1| inter-alpha-trypsin inhibitor family heavy chain-related protein-hypothetical secreted or membrane-associated protein containing vWFA domain [Rhodopirellula baltica SH 1] Length = 764 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 51/169 (30%), Gaps = 22/169 (13%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIV---GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 + + N + IA++ N + N+ K + L TN PA Sbjct: 366 FADHVLDHLNPNDEFRVIAFSNRTTAFQPNAVSATDANIQSAKQFVRGLRASGGTNLLPA 425 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + A + +++I +TD + L + Sbjct: 426 LKLAL----------GGEADESARPRYMILMTDALVGNDHSILRYLRQPEF------QDA 469 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR---ELLESFDKITDK 386 +++ +A A P + R G V + E+ F ++T + Sbjct: 470 RVFPIAFGAAPNDYLISRAAEMGRGFSMQVTNQDNTPEIARRFHELTSQ 518 >gi|260912479|ref|ZP_05919015.1| aerotolerance protein BatB [Prevotella sp. oral taxon 472 str. F0295] gi|260633398|gb|EEX51552.1| aerotolerance protein BatB [Prevotella sp. oral taxon 472 str. F0295] Length = 591 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 63/166 (37%), Gaps = 48/166 (28%) Query: 253 TPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + K L ++P + T+ A++ + R +K+ + K V Sbjct: 181 LPITTDYVSAKMFLQNIDPALIATQGTDIAKAINLSMRSFSQQKD----------IGKAV 230 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---------------PPEG 353 I ITDGE+ L+ + G++++ + + + G Sbjct: 231 IVITDGEDHEG-------GALEAAKAANERGIRVFILGIGSTKGSPIPLAEGGYLADRSG 283 Query: 354 QDLL--------RKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 Q +L ++ + G + V+++ ++ +K+ +++ + Sbjct: 284 QTVLTALNESMCKQIAQAGNGTYIHVDNT---NDAQEKLNNELAKL 326 >gi|221126641|ref|XP_002157470.1| PREDICTED: similar to microneme 1 [Hydra magnipapillata] Length = 403 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 53/153 (34%), Gaps = 14/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCT--PLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEK 292 VR I Y+ + P + +++ + T T A+ A + ++ Sbjct: 6 VRFAVIDYSDDAILQISVSDPRFWDHETFGEKVSSIEYSHGKTRTDLALKVARKHVFCN- 64 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + K +I +TDG+++ Q + +++ + I SV V + Sbjct: 65 ----ECSLQHNIPKLLIVLTDGQSTFPKLTQFEAHLIKV-----ENNLTIISVGV-SDQV 114 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + L+ F +N L + +KI Sbjct: 115 DIEELKSLATDRDHVFLLNSYSYLNDKINKILK 147 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 24/183 (13%), Positives = 53/183 (28%), Gaps = 25/183 (13%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS--NNLNEVKSRLN 267 + ++ + + ++ + I ++ S + +++ Sbjct: 235 SNNWKKTIQFVQDFSKEFKFGPTGA--QFAVIDFSDKATLQVNISDSRFWSNEGFTKKIS 292 Query: 268 KLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + T T A+ R+++ K + K VI +TDG ++ Sbjct: 293 NIKYKMGKTRTDLALRLTRRKVFCRKCGLRV-----DVSKLVIVVTDGRSTF------PF 341 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 TL ++ I + V + L F LL + I D Sbjct: 342 LTLSEARLIKKQS-SIICIGVG-DEVDINELNTIATDENHVF-------LLNGYRYINDN 392 Query: 387 IQE 389 I + Sbjct: 393 INQ 395 >gi|312879450|ref|ZP_07739250.1| von Willebrand factor type A [Aminomonas paucivorans DSM 12260] gi|310782741|gb|EFQ23139.1| von Willebrand factor type A [Aminomonas paucivorans DSM 12260] Length = 813 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 33/174 (18%), Positives = 62/174 (35%), Gaps = 13/174 (7%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + +A NLV+S+ K + + Y I ++ + + K+ +N L Sbjct: 348 LSYAKTAACNLVDSLPKNVYVGIVQFDDSTSQVYPITLIASNDAAAAATRAAAKAAINGL 407 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +T Y A +A + + T S L +TDGE++ +S + Sbjct: 408 TSGGSTAIYDAASYALSQF-----VAQKTALSADLLGVTYLLTDGEDNSSSKSVGEVIGE 462 Query: 330 QICEYMRNAGMKIYSVAVSAP--PEGQDLLRKCTDSSGQFFAVN-DSRELLESF 380 + + + +V A L + + GQ+FA D L + F Sbjct: 463 YQAQK-----VPLITVGYGAGGQAGSFALTQLADGTGGQYFASPVDQAALQQVF 511 >gi|308502682|ref|XP_003113525.1| hypothetical protein CRE_26515 [Caenorhabditis remanei] gi|308263484|gb|EFP07437.1| hypothetical protein CRE_26515 [Caenorhabditis remanei] Length = 861 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 25/173 (14%), Positives = 62/173 (35%), Gaps = 23/173 (13%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNT 277 I + K N R+G + +N + +L + + ++ + T+ Sbjct: 392 AFAKRIVAQYKYKDNDFTRVGVLTFNDIVTEKLTLQKGVDLATINAAIDSVEYLGGLTDV 451 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ A E +++H K +I ++D + + + + + + Sbjct: 452 TAALKAAKDLFSKESDNAH--------SKVLIVLSDAVPTVDTY----ADEIAAGQALSA 499 Query: 338 AGMKIYSVAVSAPPEGQDLLRKC--TDSSGQFFAVNDSRELLES-FDKITDKI 387 AG+ + V D+L++ + F ++ ++ F+ IT +I Sbjct: 500 AGVATFFVGY--NHYSDDVLKQLGQVTNPAYVF-----GDMSDASFNGITQQI 545 >gi|218693335|gb|ACL01171.1| PI-1 ancillary protein 1 [Streptococcus agalactiae] Length = 890 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 100/383 (26%), Gaps = 37/383 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ LD VL S + + + + + +K I K + Sbjct: 217 ELNQPLDVVVL-LDNSNSMNNERANNSQRALKAGEEVEKLIDKITSNKDNRVALVTYAST 275 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGLIPSALTNLSLRSTGIIERSSEN 145 ++ + F + NL+ + + S Sbjct: 276 IFDGTEATVSKGVADQNGKALNDSVSWDYHKTTFTATTHNYSYLNLTNDANEVNILKSRI 335 Query: 146 LAISICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPP--------PPKKSFWSKNTTK 196 + + D + + QK N S Sbjct: 336 PKEAEHINGDRTLYQFGATFTQKALMKANEILETQSSNARKKLIFHVTDGVPTMSYAINF 395 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y + + + + +Q+ + + + + ++ P++ Sbjct: 396 NPYISTS--YQNQFNSFLNKIPDRSGILQEDFIINGDDYQIVKGDGESFKLFSDRKVPVT 453 Query: 257 NNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + R+ N+L+ N + Y + + ++ T G Sbjct: 454 GGTTQAAYRVPQNQLSVMSNEGYAINSGYIYLYWRDYNWVYPFDPKTKKVSATKQIKTHG 513 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQF 367 E + N +R G I++V + + + ++ + + + Sbjct: 514 EPTTLYFNGN----------IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENY 563 Query: 368 FAVNDS----RELLESFDKITDK 386 V+D+ EL + F I ++ Sbjct: 564 TNVDDTNKIYDELNKYFKTIVEE 586 >gi|307726382|ref|YP_003909595.1| hypothetical protein BC1003_4370 [Burkholderia sp. CCGE1003] gi|307586907|gb|ADN60304.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. CCGE1003] Length = 350 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 52/155 (33%), Gaps = 8/155 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSD-------RTIKDP 53 MTA+ ++V A+DL +RN++Q+A DAA L+G S+ + + Sbjct: 21 MTALCLTVLIGTTALAVDLGRAWVVRNELQNAADAAALAGAGSLGPNYASPNWTQAQTKA 80 Query: 54 TTKKDQTSTIFKKQIKKHLKQGS-YIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYE 112 + T + ++ G + + + +Q A Sbjct: 81 QSAITLNRTEGTSLLTAQVQTGYWNVTGTPAGMQALPVPSPGAYDKPAVQVTVSRAAGQN 140 Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLA 147 +L L ++ + +S + +I Sbjct: 141 GGPLSLVLAPVLGISTMPISATAVAVISAPGYAGP 175 >gi|118359381|ref|XP_001012930.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89294697|gb|EAR92685.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 713 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 33/219 (15%), Positives = 71/219 (32%), Gaps = 25/219 (11%) Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 + SN S S+ TK + + +D++ S +V ++ + + Sbjct: 60 SKVSNSICCVVDVSGSMGSRAVTKQSGGNSELGYSV-LDIVKHSLNTIVQNLDEGDE--- 115 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + T + N +V N +N+ +N+ P +TN + + ++ N+ Sbjct: 116 ---FSMVTFSDNSKLVCNYQQMTESNIKSSVDLINQCQPDASTNIWAGIEQGLEQMQND- 171 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK-----IYSVAV 347 + + +I +TDG+ + L N + I + Sbjct: 172 -------SNKNKNQQLIVLTDGQPNVN----PPRGILTTLNNFYNKNIISPKPSINTFGF 220 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF-DKITD 385 + L D G + + DS + F + I Sbjct: 221 GYYLDSHLLFNIAQDCQGIYSFIPDSSFVGTIFTNSIAS 259 >gi|111114995|ref|YP_709613.1| hypothetical protein BAPKO_0175 [Borrelia afzelii PKo] gi|216263812|ref|ZP_03435806.1| von Willebrand factor type A domain protein [Borrelia afzelii ACA-1] gi|110890269|gb|ABH01437.1| hypothetical protein BAPKO_0175 [Borrelia afzelii PKo] gi|215979856|gb|EEC20678.1| von Willebrand factor type A domain protein [Borrelia afzelii ACA-1] Length = 333 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 62/172 (36%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + + +L+ ++ + + A L S LK+ + Sbjct: 146 VVPITTDRDFFNKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEALKRSI 195 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------------------- 349 + +TDG + Y++ Q+ + ++IYS+ + + Sbjct: 196 VVLTDGVVNSDEIYKD-----QVINLAQGLNVRIYSIGIGSSEEFSVEFKLRSGKFYQGS 250 Query: 350 --PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + ++ + G F++VND + + K E+ ++IA + Sbjct: 251 LKEVYDPSMLVEISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAVD 302 >gi|261415412|ref|YP_003249095.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261371868|gb|ACX74613.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302326806|gb|ADL26007.1| BatB protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 342 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 51/168 (30%), Gaps = 47/168 (27%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 PL+ + V+ L +LNP TN A+ N G Sbjct: 140 QVMVPLTLDYGTVQMVLRELNPGWLMPGTNLESAIRKGMTLFKNS--------GGASQHS 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 +I ++DGE A+A + G+KIY++ + + Sbjct: 192 VMILMSDGEELEAAAVNAAKEAAEF-------GIKIYTIGIGSREGVPIPLKDKNGGSVY 244 Query: 353 -------------GQDLLRKCTD-SSGQFFAVNDSR-ELLESFDKITD 385 + L++ + + +F + +L + +I Sbjct: 245 KKDMQGNIVTTRLEEGTLQEIANVTGALYFYASPGEFQLQKVLTEIAT 292 >gi|138893737|ref|YP_001124190.1| hypothetical protein GTNG_0058 [Geobacillus thermodenitrificans NG80-2] gi|196250779|ref|ZP_03149466.1| conserved hypothetical protein [Geobacillus sp. G11MC16] gi|134265250|gb|ABO65445.1| Conserved hypothetical protein [Geobacillus thermodenitrificans NG80-2] gi|196209729|gb|EDY04501.1| conserved hypothetical protein [Geobacillus sp. G11MC16] Length = 246 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 39/103 (37%), Gaps = 10/103 (9%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV----SAPPEGQD 355 + ++ ITDG ++ + + R G+ + + V + GQ Sbjct: 1 MRKGTLRQILLITDGCSNHGE------DPAAMAALAREQGITVNVIGVLDQDTIDENGQR 54 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + + G V +R+L ++ +T + Q+++ NR Sbjct: 55 EIEAIAAAGGGMSQVVYARQLSQTVQMVTRQAMTQTLQGLVNR 97 >gi|77359874|ref|YP_339449.1| pilin biogenesis protein [Pseudoalteromonas haloplanktis TAC125] gi|76874785|emb|CAI86006.1| conserved protein of unknown function ; putative pilin biogenesis protein [Pseudoalteromonas haloplanktis TAC125] Length = 1056 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 41/304 (13%), Positives = 92/304 (30%), Gaps = 74/304 (24%) Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN----TTKSKYAPAPAPA 206 M+ D S SM + Y + K+ +++N S Y + Sbjct: 41 MMIFDTSGSMAWDVSDGDACYMRSGNRYYEVDCFQSKNSYNRNEQCYKRSSYYNYEATCS 100 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 + ++ V + LVN N + G + +N G + + + ++ Sbjct: 101 DSRLKVAQNAIKQLVND---------NEDIEFGLMRFNGDNGGYVLARVGADKKSLLEKI 151 Query: 267 NKLNPYENTNTYPAMHHAY-----RELYNEKESSHNTIGSTRLKKF-------------- 307 +L T + AY +++Y ++ + + Sbjct: 152 EELPANGATPLTETLWEAYLYITGQKVYYGNNTNDRDKTAESSGYYNSPFVPVTGEPLRC 211 Query: 308 -----VIFITDGENSGASAYQNTLNTL------------------QICEYMRN------- 337 +I +TDG+ S S N + L + + + Sbjct: 212 DNSINIILMTDGDPSNDSNRNNDIYYLHRDYFSENPPIVSNSYLAALAKIIHGTDDVEVD 271 Query: 338 -----AGM----KIYSVAVSAP--PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + +++ + G++LL + D GQ+ N + +L ++ Sbjct: 272 LYKNTKNVLDIGRVFPIGFGTGMSNSGKNLLTETADYGGGQYLHANTAAQLSDALKNTIS 331 Query: 386 KIQE 389 +I+E Sbjct: 332 RIRE 335 >gi|268580761|ref|XP_002645363.1| Hypothetical protein CBG15420 [Caenorhabditis briggsae] Length = 862 Score = 48.7 bits (114), Expect = 0.001, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 56/145 (38%), Gaps = 16/145 (11%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL--NPYENTNTYPAMHHAYRELYNEKE 293 VR+G + Y+ + N++ + L L P E+T T A+ A E+++ Sbjct: 70 VRVGIVQYSDEAKTEFNLSRYSERNDIITHLETLKFMPGEDTRTGVALSKADDEIFDYDG 129 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + + +I TDG + + +R G+KIY+++V++ Sbjct: 130 GARLKAT-----RLIIVFTDGL--------SMDKPTLAAKALRRKGVKIYTISVNSIGFV 176 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLE 378 ++L + F D + E Sbjct: 177 PEML-GIVGDADNVFGPTDENRIEE 200 >gi|330805799|ref|XP_003290865.1| hypothetical protein DICPUDRAFT_155398 [Dictyostelium purpureum] gi|325078990|gb|EGC32613.1| hypothetical protein DICPUDRAFT_155398 [Dictyostelium purpureum] Length = 942 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/207 (16%), Positives = 64/207 (30%), Gaps = 12/207 (5%) Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 S S + I+ + V + + I +G Y Sbjct: 616 SSISSDVEIVFCFDTTGSMASVIESVKSKVNQTVTRLMQTI--PNIKIGIMGLGDYCDRE 673 Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 L+ N+ ++ + + K+ + A +A LY KE S S K Sbjct: 674 NVITTLDLTENVEKLTTFITKIPHTSGGDVPEAYEYA---LYKAKELSW----SKHTSKA 726 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 + I D S +N + C+ + + G+KIY + A E + G Sbjct: 727 FVMIGDSNPHEPSFTNLHINWFEECDNLFDMGIKIY--GIQAIKECCFYQEIAERTGGLC 784 Query: 368 FAVNDSRELLESFDKI-TDKIQEQSVR 393 N + E F I + ++ + Sbjct: 785 IKFNKFDLITEMFLAICYREANKEKFK 811 >gi|118593261|ref|ZP_01550646.1| von Willebrand factor type A like domain [Stappia aggregata IAM 12614] gi|118434152|gb|EAV40808.1| von Willebrand factor type A like domain [Stappia aggregata IAM 12614] Length = 772 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 18/137 (13%), Positives = 35/137 (25%), Gaps = 19/137 (13%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + + G N + + L+ T A++ A+ + Sbjct: 397 PDDYFRILHFSNDTSQFAGQAVLATERNKQKALKFVADLSAGGGTEINQAVNAAFDQAQP 456 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + V+F+TDG I + A +IY+ V Sbjct: 457 -----------DNTTRIVVFLTDGY-----IGDEATVIKSIANRIGKA--RIYAFGVG-N 497 Query: 351 PEGQDLLRKCTDSSGQF 367 + LL + Sbjct: 498 SVNRFLLDAMATEGRGY 514 >gi|308171956|ref|YP_003918661.1| hypothetical protein BAMF_0065 [Bacillus amyloliquefaciens DSM 7] gi|307604820|emb|CBI41191.1| conserved hypothetical protein [Bacillus amyloliquefaciens DSM 7] gi|328551766|gb|AEB22258.1| hypothetical protein BAMTA208_00330 [Bacillus amyloliquefaciens TA208] gi|328910026|gb|AEB61622.1| hypothetical protein LL3_00067 [Bacillus amyloliquefaciens LL3] Length = 245 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 13/98 (13%), Positives = 39/98 (39%), Gaps = 10/98 (10%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLR 358 + V+ ITDG ++ + + + + + G+ + + + Q+ ++ Sbjct: 1 MNKGHLNQVLLITDGCSNHGE------DPIAMAAFAKEQGITVNVIGIMEENAIDQEAMK 54 Query: 359 K---CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + G V + +L ++ +T K Q+++ Sbjct: 55 EVEGIAMAGGGVHQVVYASQLSQTVQMVTKKAMTQTLQ 92 >gi|154684584|ref|YP_001419745.1| YabS [Bacillus amyloliquefaciens FZB42] gi|63146677|emb|CAG28929.1| YabS protein [Bacillus amyloliquefaciens FZB42] gi|154350435|gb|ABS72514.1| YabS [Bacillus amyloliquefaciens FZB42] Length = 245 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 13/98 (13%), Positives = 39/98 (39%), Gaps = 10/98 (10%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLR 358 + V+ ITDG ++ + + + + + G+ + + + Q+ ++ Sbjct: 1 MNKGHLNQVLLITDGCSNHGE------DPIAMAAFAKEQGITVNVIGIMEENAIDQEAMK 54 Query: 359 K---CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + G V + +L ++ +T K Q+++ Sbjct: 55 EVEGIAMAGGGVHQVVYASQLSQTVQMVTKKAMTQTLQ 92 >gi|297664534|ref|XP_002810694.1| PREDICTED: calcium-activated chloride channel regulator 1-like isoform 2 [Pongo abelii] Length = 914 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 58/151 (38%), Gaps = 26/151 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 ++ + + RL T+ + A+ + + + ++ +T Sbjct: 362 INSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTD---------GSEIVLLT 411 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DGE++ S N + + +G I++VA+ P Q+L + G +D Sbjct: 412 DGEDNTISGCFNEV---------KQSGAIIHTVALG-PSAAQELEELSKMTGGLRTYASD 461 Query: 373 ---SRELLESFDKITD---KIQEQSVRIAPN 397 + L+++F ++ + ++S+++ Sbjct: 462 QVQNNGLIDAFGALSSGNGAVSQRSIQLESK 492 >gi|323474469|gb|ADX85075.1| von Willebrand factor type A [Sulfolobus islandicus REY15A] gi|323477206|gb|ADX82444.1| von Willebrand factor type A [Sulfolobus islandicus HVE10/4] Length = 380 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 25/149 (16%), Positives = 55/149 (36%), Gaps = 33/149 (22%) Query: 253 TPLSNNLNEVKSRLNKLN---------PYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 S+N+ +K ++ L+ T Y A+ A + + Sbjct: 78 ITFSSNVTVIKEFVDPLDLTNEILQIAAGGQTALYTAILTANSLAKKYQMPT-------- 129 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD- 362 +++ +TDG + + N L M++YS + + LL+ +D Sbjct: 130 ---YLLLLTDGNPTDETNVGNYLKLPYF------EKMQVYSFGIG-DDYNEQLLQSISDK 179 Query: 363 SSGQFFAVNDSREL-----LESFDKITDK 386 +SG + ++D+ E+ ++ +I K Sbjct: 180 TSGVMYHISDANEIPQKLPQKAVTQIAAK 208 >gi|321475774|gb|EFX86736.1| hypothetical protein DAPPUDRAFT_221972 [Daphnia pulex] Length = 891 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 40/341 (11%), Positives = 102/341 (29%), Gaps = 31/341 (9%) Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIP 125 + + Q E + K + + QY++ ++ Sbjct: 225 NSTENEIAQIIRTNETTVVSVVYEPAEAEQIKMSKDGLQGQFVVQYDVDRSSIE--KKGG 282 Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP 185 + L + VLD S SM +++ N + Sbjct: 283 EIHVVDGYFVHFFVPADLPTLPKHVIFVLDTSGSMAGTRIEQTKQAMN---SILDQLRKD 339 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + F +++ + + + ++ ++ + + Sbjct: 340 EDIFSV-----VEFSSGVTEWDLRKPYKGPDHYYFNSPPEETTEDATAVPQNNESEVKFG 394 Query: 246 GIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 P++ ++ K + ++ +TN A+ A + + + T Sbjct: 395 PYDDILAYPVTEQSVKRAKEFVAAMDVTSSTNINDALLLALKNSQSVQSRVRLTP----- 449 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG----MKIYSVAVSAPPEGQDLLRKC 360 +IF+TDGE + + +T +I + +R + I+ +A + Q L + Sbjct: 450 --IIIFLTDGEPTASVT-----DTTEILKNVRKGNSDDVVSIFCLAFGTGTDYQFLTKIS 502 Query: 361 TDSSGQF---FAVNDSR-ELLESFDKITDKIQEQSVRIAPN 397 + + G + D+ +L FD++ + + N Sbjct: 503 SQNRGFARKIYEAADATLQLKGFFDEVASPLLSNVNFVYNN 543 >gi|297664532|ref|XP_002810693.1| PREDICTED: calcium-activated chloride channel regulator 1-like isoform 1 [Pongo abelii] Length = 914 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 58/151 (38%), Gaps = 26/151 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 ++ + + RL T+ + A+ + + + ++ +T Sbjct: 362 INSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTD---------GSEIVLLT 411 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DGE++ S N + + +G I++VA+ P Q+L + G +D Sbjct: 412 DGEDNTISGCFNEV---------KQSGAIIHTVALG-PSAAQELEELSKMTGGLRTYASD 461 Query: 373 ---SRELLESFDKITD---KIQEQSVRIAPN 397 + L+++F ++ + ++S+++ Sbjct: 462 QVQNNGLIDAFGALSSGNGAVSQRSIQLESK 492 >gi|229819442|ref|YP_002880968.1| von Willebrand factor A [Beutenbergia cavernae DSM 12333] gi|229565355|gb|ACQ79206.1| von Willebrand factor type A [Beutenbergia cavernae DSM 12333] Length = 647 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 14/193 (7%), Positives = 55/193 (28%), Gaps = 12/193 (6%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + +++ + ++ + + + Sbjct: 464 SASMGYDNTAGQTRLEGAQHAITAALDHFTAGDRVGLAGFTTTDGT--ITPGLVAPVADI 521 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 ++N + +N L P +T Y A+ ++ + ++ ++DG Sbjct: 522 ADNRGALVEGVNSLEPVAHTPLYQAVADFAQQQAAMWDPDRI--------NAIVLLSDGV 573 Query: 316 NSGASAYQNT-LNTLQICEYMRNA-GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 N+ + + + + + ++++ S + + L + + ++ D Sbjct: 574 NATGDIETIGQDDMIHVLHGLHAETPVLVFTLGYSPDADVETLQAISSATGAHYYDATDP 633 Query: 374 RELLESFDKITDK 386 E+ + Sbjct: 634 TEVEAVLGDLVTS 646 >gi|313675311|ref|YP_004053307.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312942009|gb|ADR21199.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 322 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/145 (18%), Positives = 54/145 (37%), Gaps = 39/145 (26%) Query: 254 PLSNNLNEVKSRLN----KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + N + + L P +T+ A++ A+ +L +E S K +I Sbjct: 133 PLTYDQNALNLFIETLNTGLVPGSSTDFGSALNMAHEKLTSEAAPSSQQKS-----KIII 187 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----------------- 352 I+DGE+ G +T + ++G++++S+ V Sbjct: 188 LISDGEDFGD-------DTEGAVSKINDSGIRLFSLGVGTEQGSKIRTRRGYRKDKSGND 240 Query: 353 -----GQDLLRKCTD-SSGQFFAVN 371 L D + G++F +N Sbjct: 241 IVTKLDSRSLENLADQTDGEYFEIN 265 >gi|119952971|ref|YP_945180.1| hypothetical membrane spanning protein [Borrelia turicatae 91E135] gi|119861742|gb|AAX17510.1| hypothetical membrane spanning protein [Borrelia turicatae 91E135] Length = 341 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 28/172 (16%), Positives = 59/172 (34%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 PL+ + + +L+ ++ + + A L S K+ V Sbjct: 154 IVPLTIDRDFFSKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEAPKRSV 203 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP----------------- 351 I +TDG + Y++ Q+ + +KIYS+ + + Sbjct: 204 IVLTDGVVNSDEVYKD-----QVINLAQGLNVKIYSIGIGSSEELSVGFKLRSGRFYQGT 258 Query: 352 ----EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + + + G F++V D + + K E+ V+IA + Sbjct: 259 LREIYDPSMLFEISGKTGGLFYSVGDDFSFKLAIQDFSKKENVERKVKIAVD 310 >gi|157375629|ref|YP_001474229.1| vault protein inter-alpha-trypsin subunit [Shewanella sediminis HAW-EB3] gi|157318003|gb|ABV37101.1| vault protein inter-alpha-trypsin domain protein [Shewanella sediminis HAW-EB3] Length = 770 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 21/170 (12%), Positives = 51/170 (30%), Gaps = 18/170 (10%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY-- 289 + I + + + N+ ++ L T A+ HA + Sbjct: 409 DDTFNVIEFNSKVSSLSKGPIPASTKNIEMANRFVHSLTSDGGTEMALALEHALGQESGG 468 Query: 290 NEKESSHNTIGSTRLKKF---VIFITDGENSGASAYQNTLNTLQICEYMRNAGM---KIY 343 + + + V+F+TDG + + + +++ Sbjct: 469 SSWQETGLQGKDEESTSRLRQVLFMTDG----------AVGNEAELFKLIKYRIGKSRLF 518 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ + + P + R G F + D E+ E + KI+ + Sbjct: 519 TLGIGSAPNSHFMQRAAEFGRGTFTYIGDLDEVQEKIQGLLYKIEHPQIT 568 >gi|119476218|ref|ZP_01616569.1| hypothetical protein GP2143_07204 [marine gamma proteobacterium HTCC2143] gi|119450082|gb|EAW31317.1| hypothetical protein GP2143_07204 [marine gamma proteobacterium HTCC2143] Length = 750 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 53/150 (35%), Gaps = 18/150 (12%) Query: 238 IGTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 + +V + + N + +++ + TN A+ A ++ + Sbjct: 81 WTFGQWVNNLVPSNTVTTAWRANASAQAEKISSVALR--TNIPAALEKAMADVKS----- 133 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL-----QICEYMRNAGMKIYSVAVSAP 350 T +I +TDG + + + +I ++ AG+ I++VA+S Sbjct: 134 ----KGTDYSIHLILLTDGMVDVSLSSIDNETARQRIVAEILPALKGAGVTIHTVALSQN 189 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + + + ++ G + +L F Sbjct: 190 ADLELMELLAAETGGLSAVAETAEDLSRVF 219 >gi|120556589|ref|YP_960940.1| vault protein inter-alpha-trypsin subunit [Marinobacter aquaeolei VT8] gi|120326438|gb|ABM20753.1| Vault protein inter-alpha-trypsin domain protein [Marinobacter aquaeolei VT8] Length = 712 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/160 (18%), Positives = 59/160 (36%), Gaps = 19/160 (11%) Query: 237 RIGTIAYNIGIVGNQCTPLSNN---LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I +N P+ N L + + L T A+ A Sbjct: 390 RFNVIQFNSQAHALYTQPVPANGHYLARARDYVQDLTADGGTEMAGALSLAMGM------ 443 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + S+ + ++F+TDG SA + + T + N ++++VA+ + P Sbjct: 444 ---DGSESSGHVQQMVFMTDGAVGNESALFDQIRTG-----LGNR--RLFTVAIGSAPNM 493 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L GQ+ AV+ + E+ ++ K+ ++ + Sbjct: 494 HFLREAARWGRGQYTAVHSAAEVDKALGKLFAAMEAPVMT 533 >gi|313240178|emb|CBY32528.1| unnamed protein product [Oikopleura dioica] Length = 524 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 66/172 (38%), Gaps = 16/172 (9%) Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY-------ENTNTYPAM 281 + R+ + Y+ + + +N +V + +L +T T AM Sbjct: 252 DGISPDTNRVAMLRYSSDVKEDLNFIEGSNEPKVMRNIQRLKYKPITDDRHGSTYTAHAM 311 Query: 282 HHAYRELYNEKES--SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A + ++ + + T +++ V+ ITDGE++ G Sbjct: 312 DKALKTIFTSEAGWRNGTTEDGIKVRTEVVIITDGESNDPDETFTIQGQKV---KYDEYG 368 Query: 340 MKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQ 390 +K+Y++ V +D +R+ T F + ++L +F++I + + E Sbjct: 369 IKVYALGVG--DIKKDEIRQLTSMDDESIFYLMSWKDL-AAFNRIIETLIET 417 >gi|296166599|ref|ZP_06849029.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] gi|295898085|gb|EFG77661.1| conserved hypothetical protein [Mycobacterium parascrofulaceum ATCC BAA-614] Length = 897 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/282 (11%), Positives = 72/282 (25%), Gaps = 42/282 (14%) Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM-EDLYLQKHNDNNNMTSNK 178 L L + + S + + A + +++D S S+ D Sbjct: 31 LTPLAGADTPDRSAAVSRYGGCLASQKAGDLLLLVDESGSLKSTDPKAARVDAAKYLVKT 90 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 A R++D ++ + +Q Q Sbjct: 91 LGDYADRTNVNLDVAIAGFSEGYAVRQGWRRLDSAS--VDSVDDQLQTLAQHNSGADTDY 148 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK-ESSHN 297 L + + N N A+ + K + S Sbjct: 149 W-------------LALDGARQTLADHVQ-----GNPNRCQAIAW----FSDGKLDFSKR 186 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTL-NTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 G + DG +++ + + +R A + + V + D+ Sbjct: 187 PGGRPYAPGVDLGSDDGVAEMIRRATDSICRPGGLADQLRAAHVVMLGVGLGNDDSDFDV 246 Query: 357 LRKCTDS---------------SGQFFAVNDSRELLESFDKI 383 + + G F+ V++ ++L +FD + Sbjct: 247 MSAISTGAGPNGTRCGGITDPKPGAFYRVSNIDQMLFAFDAL 288 >gi|239906053|ref|YP_002952792.1| hypothetical protein DMR_14150 [Desulfovibrio magneticus RS-1] gi|239795917|dbj|BAH74906.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 391 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 31/215 (14%), Positives = 63/215 (29%), Gaps = 10/215 (4%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 A+I+ V T A+D + Y R+Q+Q A + + R D + Sbjct: 22 ALIMIVLAGLATLAVDYGFLQYKRSQLQ--TAADAAALAGAADLLRNGDDFDAVRATAVD 79 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 ++ + + S + ++ + + ++ A AQ P + +FL Sbjct: 80 FGQRNLGEQDTVASAVTTGDVELLKGETPAAGATPD-TVRVTAGRTAQRGNPVD-MFLGP 137 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 ++ +L+ ++ + S L D N N Sbjct: 138 VLGWNTQDLTATASASLFCS------DQTKCLKPFSPPAKFTWDDSCDTNKKFKNNDAFD 191 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 P S N P K+ ++ Sbjct: 192 PGSSCELSSVNVLGYDAGDLGTPIVLKLSDSHDTV 226 >gi|115488386|ref|NP_001066680.1| Os12g0431700 [Oryza sativa Japonica Group] gi|77554879|gb|ABA97675.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|113649187|dbj|BAF29699.1| Os12g0431700 [Oryza sativa Japonica Group] gi|125536450|gb|EAY82938.1| hypothetical protein OsI_38156 [Oryza sativa Indica Group] gi|125579179|gb|EAZ20325.1| hypothetical protein OsJ_35934 [Oryza sativa Japonica Group] Length = 524 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 23/198 (11%), Positives = 57/198 (28%), Gaps = 33/198 (16%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI-VGNQCTPLSNN-LNEVKSRL 266 KI+ + ++ ++ K R+ + + + ++ + E+ + Sbjct: 77 KIESVKKALQFVI--------MKLTPVDRLSIVTFESSAKRLTKLRAMTQDFRGELDGIV 128 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 L T+ + L + + T + ++DG+ G ++ T Sbjct: 129 KSLIANGGTDIKAGLDLGLAVLADRVFTESRTAN-------IFLMSDGKLEGKTSGDPTQ 181 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSRELLESFDKIT 384 + +Y+ LL G + V D L F + Sbjct: 182 VNPGE--------VSVYTFGFGHG-TDHQLLTDIAKNSPGGTYSTVPDGTNLSAPFATLL 232 Query: 385 DKI-----QEQSVRIAPN 397 + Q+ + + P Sbjct: 233 GGLVTVVAQDVRLTLTPK 250 >gi|307544656|ref|YP_003897135.1| hypothetical protein HELO_2066 [Halomonas elongata DSM 2581] gi|307216680|emb|CBV41950.1| hypothetical protein HELO_2066 [Halomonas elongata DSM 2581] Length = 612 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 13/111 (11%), Positives = 35/111 (31%) Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 Y + A + + + + + + + G G + Sbjct: 114 QYTDIETALEQAASAEANGWRHLILMTDGVIDLPPSRGNKPGIDEASRQRLVESMAPRFA 173 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + G+ ++++A S + + + + G LL +F I ++I Sbjct: 174 DRGVVVHAIAFSDEADLALVEQLAQRTGGLASVAKSPESLLGAFLDIIERI 224 >gi|170574976|ref|XP_001893043.1| Zona pellucida-like domain containing protein [Brugia malayi] gi|158601129|gb|EDP38122.1| Zona pellucida-like domain containing protein [Brugia malayi] Length = 664 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 26/125 (20%), Positives = 52/125 (41%), Gaps = 13/125 (10%) Query: 258 NLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N V LN L T+T+ A+H AY+ L + + +KK +I TDG + Sbjct: 23 NNTSVIRHLNGLKSIKGTTSTHIALHQAYKLLTDTDNENGVR---EGVKKMIIIFTDGHS 79 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAV-SAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + + +++ G++I+++ + AP + L T ++ F + ++ Sbjct: 80 QRS--------PQDMALRLKDKGVEIFAITLTPAPYADEGELLSITQNTDHIFTPVNLKD 131 Query: 376 LLESF 380 F Sbjct: 132 FEIKF 136 >gi|315225171|ref|ZP_07866988.1| von Willebrand factor [Capnocytophaga ochracea F0287] gi|314944854|gb|EFS96886.1| von Willebrand factor [Capnocytophaga ochracea F0287] Length = 449 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/291 (9%), Positives = 72/291 (24%), Gaps = 24/291 (8%) Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 I + N A +S + N Sbjct: 154 IAVLVTANGKPAVNVTVALYRNNTLLWTAKTDNTGKAELWVS----AFQKEKELNTEHLR 209 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 V D + S + + S + A + ++ Sbjct: 210 LKVNDQ---------WVSTEKAISESTLNRIALKNEVKKASNEVQIAFMVDATGSMSDEL 260 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 + L N+++ +++ + +S + + + ++N+ +N Sbjct: 261 EFLKMDLKNVISKVEEGNKN-LKISTVTIFYRDEGDEYVVKHSDFTKDINKTIQFINSQK 319 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 + A+H A +L + + D ++ Sbjct: 320 ADGGGDFPEAVHTALNQLNK------LQWDGEARTRIAFLVLDAPPHHEDKILKSVQAS- 372 Query: 331 ICEYMRNAGMKIYSV-AVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLES 379 + G+K+ + A + L+R ++G + + D + + Sbjct: 373 -VKTAAEKGIKLIPIVASGIDKPTEFLMRFMAIYTNGTYVFITDDSGIGNA 422 >gi|298249212|ref|ZP_06973016.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297547216|gb|EFH81083.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 420 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 18/165 (10%), Positives = 51/165 (30%), Gaps = 16/165 (9%) Query: 233 NLSVRIGTIAYNIGIVGNQCTP--LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + S++ + +N + N N + + + T A++ + Sbjct: 70 DASMQFMVVTFNDNARIIFGPAAGIEENKNRAIAAIQTVYAASGTRMSTALNTIVDKF-- 127 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + ++F+TDG+ + + C + I + V Sbjct: 128 --------GNNQSRATRILFLTDGK-NEGEPRVALDRAVARCSAA---NISISAWGVGTD 175 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + +LL + G + ++ +F +++ ++ A Sbjct: 176 WDAAELLHMAEATRGSADIIPTPNQVEAAFSSSFSEMRRTAITNA 220 >gi|38197044|gb|AAH05159.2| COL6A1 protein [Homo sapiens] Length = 445 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 62/222 (27%), Gaps = 25/222 (11%) Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 N + +K +F S + + + D A L A Sbjct: 220 KNVTAQICIDKKCPDYTCPITFSSPADITILLDGSASVGSHNFDTTKRFAKRLAERFLTA 279 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHH 283 + VR+ + Y+ N + S ++ ++ + T+ A+ + Sbjct: 280 GRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGY 339 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 R + KK ++ +DG + G + + + AG++I+ Sbjct: 340 VTRFYREASSGAA--------KKRLLLFSDGNSQG----ATPAAIEKAVQEAQRAGIEIF 387 Query: 344 SVAVSAPPE---------GQDLLRKCTDSSGQFFAVNDSREL 376 V V G+ F V + L Sbjct: 388 VVVVGRQVNEPHIRVLVTGKTAEYDVAYGESHLFRVPSYQAL 429 >gi|326382230|ref|ZP_08203922.1| von Willebrand factor type A [Gordonia neofelifaecis NRRL B-59395] gi|326198960|gb|EGD56142.1| von Willebrand factor type A [Gordonia neofelifaecis NRRL B-59395] Length = 552 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 35/379 (9%), Positives = 94/379 (24%), Gaps = 32/379 (8%) Query: 11 LFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKK 70 + +++ + L L S + T+ + ++ Sbjct: 184 AVVGALSEVSGGSLTHRDLVQVLTKRALM-MNDNDSSPDLVQVAASSGPTAALTTERAYT 242 Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 + + + + +++ N ++ A I L+ + Sbjct: 243 KFTRENPDAQLEAAVPGTGTVSLDYRAVNVAPAADQTVAGDAIEALVATLQTDKGKQIRQ 302 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 + E +N + V+ + + + Sbjct: 303 SEAVRSADGEPLPDNEGVDGVTVIKPPARELVDNILRKWTALTQPIRALVAQD------- 355 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT---IAYNIGI 247 S A + +L E++ + K Y + Sbjct: 356 -----VSGSMEQDAGGRSRAALLREASLFGLQQFPKNTALGYWEFSIDRGGKGQDYREVM 410 Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPY-----ENTNTYPAMHHAYRELYNEKESSHNTIGST 302 + ++ + + T Y A++ +Y+ + + Sbjct: 411 PIAPISDKTDGRTNRDLLADAIRQTLGNVHGGTGLYDTALAAFKTVYDSYDPA------- 463 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKC 360 VI +TDG + + + M+N + I +V +S + L + Sbjct: 464 -YSNSVIIMTDGR-NEDRDSITLQHLVSELNIMKNPARRIPIIAVGISEDADADALKQIA 521 Query: 361 TDSSGQFFAVNDSRELLES 379 + G F D +++ Sbjct: 522 DATGGSSFIARDPKDIPAI 540 >gi|225551939|ref|ZP_03772879.1| von Willebrand factor type A domain protein [Borrelia sp. SV1] gi|225370937|gb|EEH00367.1| von Willebrand factor type A domain protein [Borrelia sp. SV1] Length = 332 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 27/172 (15%), Positives = 61/172 (35%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + +L+ ++ + + A L S LK+ + Sbjct: 145 VVPITTDREFFNKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEALKRSI 194 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------------------- 349 + +TDG + Y++ Q+ + +KIYSV + + Sbjct: 195 VVLTDGVVNSDEIYKD-----QVINLAQGLNVKIYSVGIGSSEEFSVEFKLRSGKFYQGS 249 Query: 350 --PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + ++ + G F++VND + + K E+ ++IA + Sbjct: 250 FKEVYDPSMLVEISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAVD 301 >gi|13365521|dbj|BAB39134.1| CD11-1 [Cyprinus carpio] Length = 1196 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 32/248 (12%), Positives = 72/248 (29%), Gaps = 38/248 (15%) Query: 164 YLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG----- 218 + ++ ++ P N +++ + + E + Sbjct: 103 FGMSAAVSSAALTSCSPYFPHECDGNSYLNGVCYQFSSSLQAVSNFTAAYQECSKREVNL 162 Query: 219 ----NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + +S++ E ++ + + S ++ V N Sbjct: 163 VFLFDGSSSMKTVDFEMNKNFIKDIMKKLSNSSIKFAAVQFSTDVRTVFDF-NDYQSGSA 221 Query: 275 -------------TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 TNT+ A+ + + L N S ++ +K ++ ITDG+ + Sbjct: 222 EEKLMKETHMKSLTNTHKAIDYILKNLLNSMLSGADSK----AQKALVIITDGDP----S 273 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 + N L+ C+ + Y + V L + F + D L Sbjct: 274 DNDDYNVLKKCDE---QNILRYIIGVGK--VDLIELTQLASEPKRNNTFYIKDYSGLKGL 328 Query: 380 FDKITDKI 387 D + KI Sbjct: 329 LDNLQKKI 336 >gi|299532595|ref|ZP_07045984.1| hypothetical protein CTS44_17423 [Comamonas testosteroni S44] gi|298719398|gb|EFI60366.1| hypothetical protein CTS44_17423 [Comamonas testosteroni S44] Length = 412 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 13/137 (9%), Positives = 40/137 (29%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHL 72 + +A+D + ++ ++Q+A D+ L+ + + + + + Sbjct: 30 MGFALDFGRLFVVKTELQTATDSCALAAAQELDGASDARTRAVNAGLAAGNSNRVHFQGA 89 Query: 73 KQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS 132 G + + + A + T + E +P L +A + Sbjct: 90 SAGIAVSDISFSDALNGSYSTTFPVTSAKYARCEHARTGLLPWLLQALGAFSGNASYGAN 149 Query: 133 LRSTGIIERSSENLAIS 149 + + + Sbjct: 150 QSVKALAVATRAPSQTN 166 >gi|291242943|ref|XP_002741339.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 958 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 15/182 (8%), Positives = 48/182 (26%), Gaps = 28/182 (15%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV---GNQCTPLS 256 +I L ++ + + + ++ Sbjct: 321 VSGSMSLKSRITKLRQAVYTFIMDEISLGIDVG-------CVTFSDTATIISWLTPINSD 373 Query: 257 NNLNEVKSRLN-KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + E + + LN NT + + L + S + +TDG+ Sbjct: 374 EDREEFLALVMPTLNADGNTAIGSGLLTGLQVLSQNQTESVE-------GSIMFLVTDGQ 426 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS--SGQFFAVNDS 373 + + + + + +G+ + ++A ++ L ++ + Sbjct: 427 ENVPPYIDD------VTDNIIESGVVVDTLAWGV--FAEEKLETIASGTKGSSYYYSEQT 478 Query: 374 RE 375 + Sbjct: 479 QS 480 >gi|262193846|ref|YP_003265055.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262077193|gb|ACY13162.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 344 Score = 48.7 bits (114), Expect = 0.002, Method: Composition-based stats. Identities = 18/108 (16%), Positives = 34/108 (31%), Gaps = 22/108 (20%) Query: 254 PLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + + L+ ++ T A+ A R K ++ Sbjct: 150 PLTPDYGFFRMILDGVDTKSVSRGGTEIGQALRKAVRSFDPGPG-----------AKMIL 198 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 ITDGE+ G AG+++ ++ + Q L Sbjct: 199 LITDGEDHGG-------YAEDAAREALEAGVRVVAIGFGSEQGSQITL 239 >gi|223936328|ref|ZP_03628240.1| von Willebrand factor type A [bacterium Ellin514] gi|223894846|gb|EEF61295.1| von Willebrand factor type A [bacterium Ellin514] Length = 657 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/110 (17%), Positives = 34/110 (30%), Gaps = 23/110 (20%) Query: 254 PLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + ++ L+ T A++ A K ++ Sbjct: 145 PLTLDDAAFSQSIDSLDTRTISEGGTALAEAINTARETF----------KNEKDNHKVLV 194 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 TDGE+ A GM I+++ + P +LLR Sbjct: 195 LFTDGEDQDMGAVSAAEKAAA-------EGMLIFTIGIGTPDG--ELLRI 235 >gi|120554526|ref|YP_958877.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120324375|gb|ABM18690.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 715 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 56/166 (33%), Gaps = 21/166 (12%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKS--RLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 Y +V S ++ ++N + +TN AM A + + Sbjct: 74 WTFGQYVNMLVPYGVVDQSWRDTAIERSEQINSVAL--HTNLGLAMEKAANDWLSGGTLE 131 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ-----ICEYMRNAGMKIYSVAVSAP 350 + +I ++DG+ + + + +++ G I++V + + Sbjct: 132 NT---------HLIVLSDGKVDVPGGDDASQAEEKRIVDSLLPALKDKGATIHTVGL-SE 181 Query: 351 PEGQDLLRKCT-DSSGQFFAVNDSRELLESF-DKITDKIQEQSVRI 394 LR ++ G F + L +F + + + ++ V I Sbjct: 182 KADIRFLRNLARETGGSFQLAQSAEALNLAFANALNTAVPQEQVPI 227 >gi|326426493|gb|EGD72063.1| hypothetical protein PTSG_00082 [Salpingoeca sp. ATCC 50818] Length = 571 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/336 (9%), Positives = 88/336 (26%), Gaps = 28/336 (8%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 VL + T+ + ++Q ++ + + + Sbjct: 17 VLMVAVDVHGQTTLLNLLQAREQEVEQLATALRGLYDDAYCGSASCASDVAACESELLTS 76 Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 F ++ +A T + + I + Sbjct: 77 ---TCASTFGGCRSKTENRRLDFSSSVMRTASTPFDDDVRQEACWTRQLDNTFISINGGT 133 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP----------- 205 S + + ++ ++ Y P + + A P Sbjct: 134 SENTDTATKWQYVGTSSGFYRIYPGVPQQDCNAYDPRLRPWYVAATSGPKDIVIVLDRSG 193 Query: 206 ---ANRKIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 N + + +++A ++ ++ A + G+ ++N+ Sbjct: 194 SMATNNRWETAMDAAETVLETLTIADFVAIVVFDTSASQVCGTTIPCGSLVQATADNVGT 253 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 +++ L NP +TN A A+ L E + + ++F+TDG + Sbjct: 254 LRTLLANFNPDGSTNFESAFQVAFSVLKQTGERT------SNCHTAILFMTDGMITAGLE 307 Query: 322 YQNTL----NTLQICEYMRNAGMKIYSVAVSAPPEG 353 L + E +++ + + Sbjct: 308 GNAFLDFVDDEQDALEAAVGKRAVLFTFSFGTGADE 343 >gi|221127354|ref|XP_002168164.1| PREDICTED: similar to microneme 1, partial [Hydra magnipapillata] Length = 285 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/153 (16%), Positives = 54/153 (35%), Gaps = 14/153 (9%) Query: 236 VRIGTIAYNIGIVGNQCT--PLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEK 292 VR I Y+ + P + +++ + T T A+ A + ++ Sbjct: 6 VRFAVIDYSDDAILQISVSDPRFWDHETFGEKVSSIEYSHGKTRTDLALKVARKHVFCN- 64 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + K +I +TDG+++ + Q + +++ + I SV V + Sbjct: 65 ----ECSLQHNIPKLLIVLTDGQSTFPKSTQFEAHLIKV-----ENDLTIISVGV-SDQV 114 Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + L+ F +N L + +KI Sbjct: 115 DIEELKSLATDRDHVFLLNSYSYLNDKINKILK 147 >gi|148656025|ref|YP_001276230.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148568135|gb|ABQ90280.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 932 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/356 (10%), Positives = 95/356 (26%), Gaps = 39/356 (10%) Query: 53 PTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK----------NNPLQ 102 P + + + + E + + + + Q Sbjct: 260 PEAPAAFNRYTVRLDVPGDTRAQNNAVETFSVVRGRPRALLVAQSPEDAAGLERALRAAQ 319 Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI-----CMVLDVS 157 A +P L + AL N+ R+ + MV Sbjct: 320 IDVAVVAPAAMPDTLLAMSQYDAIALVNVPRRAFSESTLQHLATYVHDRGGGLIMVGGPR 379 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + + + + S K+ + ++ A Sbjct: 380 SFGPGGWRGTPVEAALPVTMDIPIYRTMPPVSVVIVIDISGSMAMTEDGIPKLSLALDGA 439 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTN 276 + + ++ + + + ++ + ++ +++++ N Sbjct: 440 RRIASLLRDEDE--------LTILPFDDRPGVVVGPLPGSQRDKAIEQMSQVRLGGSGIN 491 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + A+ A R + S R + +I ITDG ++ L I +R Sbjct: 492 IHDALVAAARYV----------RASDRPIRHIITITDGNDTV-----QQEGALDIVRALR 536 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + + + S+AV + G+ F + +L + + I + S+ Sbjct: 537 DERVTLTSIAVGQGSHVPFIRDMAAVGGGRTFLTERAADLPDLLLDEAEMIIQPSI 592 >gi|296412728|ref|XP_002836073.1| hypothetical protein [Tuber melanosporum Mel28] gi|295629876|emb|CAZ80230.1| unnamed protein product [Tuber melanosporum] Length = 997 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/299 (13%), Positives = 90/299 (30%), Gaps = 36/299 (12%) Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 + + + L + S S + + + S + Sbjct: 407 RALLDLNAVELASTMRSPRYEDKRGSNSDEDDDFRATKTTRRVSSVNSAYSGTHGGGRSV 466 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN--RKIDVLIESAGNLVNSIQKA 227 + P+ S + + KI+++ +S L++++ Sbjct: 467 VTAPTEYTASIRGAKPQISPAVHIPLDIVVVIPVSSSMQGLKINLIRDSLKFLIHNLG-- 524 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNPYE----NTNTYPAMH 282 R+G + + G TPLS + + +N + P T+ + Sbjct: 525 ------ERDRMGLVTFGSSSGGVALTPLSVKSWSGWAKVVNSIRPVGQKSLRTDVVDGAN 578 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 A L S+ ++ I+D S+ +T N + A + I Sbjct: 579 VAMDLL--------MQRKSSNPIASILLISD------SSTSDTENVDFVVSRAEAAKITI 624 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE-------SFDKITDKIQEQSVRI 394 +S + + ++ T + GQF V D L E + ++ + + +++ Sbjct: 625 HSFGLGLTHKPDTMIELSTRTKGQFTYVKDWMMLRECVAGCIGALQTLSHQNAKLRLKL 683 >gi|224823582|ref|ZP_03696691.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Lutiella nitroferrum 2002] gi|224604037|gb|EEG10211.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Lutiella nitroferrum 2002] Length = 995 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/189 (16%), Positives = 57/189 (30%), Gaps = 53/189 (28%) Query: 259 LNEVKSRLNKLNPYENT----------NTYPAMHHAYRELYNEKESSHNTIGSTRLKKF- 307 L+ VKS ++ L NT Y M AY + N S + ++ Sbjct: 154 LDRVKSAIDSLGADTNTPLAEAYYEMTRYYRGMSSAYYKPVNPASGSRGSGRYASPVQYR 213 Query: 308 -----VIFITDGENSGASAYQNTLNTLQI------------------------------- 331 V+ +TDGE + + + + + L Sbjct: 214 CQANFVLMMTDGEPTEDNEFPLSGDPLFAGLSLGKDNPLPSFAGRAAAGDLFTSGSDGEG 273 Query: 332 ----CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 E +K Y++ + L + G++F ++ +L SF D I Sbjct: 274 QGWDAETFARQSIKTYTIGFTVD--NTLLQKTAMQGGGRYFTASNREQLKASFTSAMDDI 331 Query: 388 QEQSVRIAP 396 Q+ P Sbjct: 332 YRQNSTSTP 340 >gi|313243983|emb|CBY14858.1| unnamed protein product [Oikopleura dioica] gi|313245509|emb|CBY40220.1| unnamed protein product [Oikopleura dioica] Length = 1393 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 20/174 (11%), Positives = 59/174 (33%), Gaps = 24/174 (13%) Query: 184 PPKKSFWSKNTTKSKYAPAPAPAN--RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 S + + + K+D ++ ++++S+ ++ +G Sbjct: 776 KSNAQSSSSAPYNVVFVMDKSGSMIGTKLDQTKDAFRSMISSL-----DRNAKFSIVGFN 830 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 + N+ E +S +++++ TN + A+ A +E S Sbjct: 831 YATTAWRNKLVRATNYNVEEARSFISRISAGGGTNMHAALLDAIELCNSE--------SS 882 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA-----GMKIYSVAVSAP 350 + + ++F+TDG + T + + + ++ + + + A Sbjct: 883 STVPCMIMFMTDGTATVGV----TEESRILADVTKSRQQGKANIALNVIGFGAG 932 >gi|312886237|ref|ZP_07745851.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] gi|311301262|gb|EFQ78317.1| von Willebrand factor type A [Mucilaginibacter paludis DSM 18603] Length = 348 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/211 (11%), Positives = 64/211 (30%), Gaps = 61/211 (28%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +++ + L++++ RIG I + P++ + + K LN Sbjct: 110 NRLENAKRAISQLIDNLHD---------DRIGIIVFAGQAYVQ--LPITTDYSAAKLFLN 158 Query: 268 ----KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + P + T A+ + + + K +I ITDGEN Sbjct: 159 TINTNMVPTQGTAIGAAIDLGMQSFDFKN----------GMSKAMIVITDGEN------- 201 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------------------GQDL 356 + + + + R+ + + + V + +++ Sbjct: 202 HEDDAVSAANHARDKDVTVNVIGVGSEEGAPIPIIKDGKQAGFHTDSAGKVVVSKLDENM 261 Query: 357 LRKCTDSSGQFF-AVNDSRE-LLESFDKITD 385 R+ + + ++ L +++ Sbjct: 262 GREIAAAGNGVYVRATNANSGLNIVMEQMGK 292 >gi|296232327|ref|XP_002807820.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-2(VI) chain-like [Callithrix jacchus] Length = 1018 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 33/220 (15%), Positives = 71/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ N D+L+ V +Q + L + Sbjct: 35 NNCPEKTDCPINVYFVLDTSESVAMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 94 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ Sbjct: 95 RYGGLHFSDQVEVFSPPGSDRASFIKSLQGISSFRRGTFTDCALANMTEQI--------R 146 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 GS F + ITDG +G+ L E R G+++++VA + L Sbjct: 147 LHGSKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNQNLKEQGL 201 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 202 RDIASTPHELYRSDYATMLPDS-TEIDQDTINRIIQVMKH 240 >gi|313902401|ref|ZP_07835804.1| Protein of unknown function DUF2134, membrane [Thermaerobacter subterraneus DSM 13965] gi|313467332|gb|EFR62843.1| Protein of unknown function DUF2134, membrane [Thermaerobacter subterraneus DSM 13965] Length = 308 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 10/96 (10%), Positives = 31/96 (32%), Gaps = 9/96 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI ++V + +D + R ++Q A + + + Sbjct: 26 LVAISLAVLLGMVGLVVDGGRLYAERTRLQ---------AAADAAALAGAPALPEEPARA 76 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 + ++ ++++ R G + ++ T Sbjct: 77 RNLAEEYLRRNGVDAGGARIQIGPGGRTLRVEATAT 112 >gi|198418440|ref|XP_002122148.1| PREDICTED: similar to integrin alpha Hr1 precursor-like [Ciona intestinalis] Length = 1270 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/190 (17%), Positives = 72/190 (37%), Gaps = 22/190 (11%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLN 260 + + ++ NL+ S I + + Y+ + + PLS + Sbjct: 188 STSVFPSNFELGKSWIKNLLQSFSSDIDKHN-----VVVGLYSFSNIIKREIPLSARTYS 242 Query: 261 EVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + ++ + PY T + A++ A +E +S + K +I ITDGE + Sbjct: 243 TLSGMIDAVRYPYGQTFIHTAINEAVQEYQRAGRAS--------VPKLLIVITDGEATVP 294 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 SA + N + AG+ + +V + + ++ L ++ + F V+D L Sbjct: 295 SAVAPSANAARA------AGIILTAVGIGSSVN-ENELTTIAGAAERVFRVSDFSSLGSI 347 Query: 380 FDKITDKIQE 389 + I + Sbjct: 348 LAPLQKVITD 357 >gi|147677783|ref|YP_001211998.1| flp pilus assembly protein TadD [Pelotomaculum thermopropionicum SI] gi|146273880|dbj|BAF59629.1| flp pilus assembly protein TadD [Pelotomaculum thermopropionicum SI] Length = 312 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 17/116 (14%), Positives = 33/116 (28%), Gaps = 4/116 (3%) Query: 1 MTAIII----SVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTK 56 + A+++ + F F D+ + R Q+ + +DAA L+G + + + Sbjct: 21 LAAVMLCAGMAALFGFAALVTDIGLLAAKRQQLINTMDAAALAGAQELPDNPAQAVQVAR 80 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYE 112 F G + I A S A + Sbjct: 81 DYAGKNGFAPDSLNISISGDNRTISVAGREVVNTIFARVLGIYSKTVSAGSSASVQ 136 >gi|219125320|ref|XP_002182931.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217405725|gb|EEC45667.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 523 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/182 (13%), Positives = 48/182 (26%), Gaps = 51/182 (28%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N ++ L TN A+ A +EL I + + + F+TDG Sbjct: 126 QNKASALQKIQSLTTRGCTNMSAALGLAVQELK--------IIEKSNPVRSLFFLTDGLA 177 Query: 317 SGASAYQN----------------------------------------TLNTLQICEYMR 336 + + + T + E + Sbjct: 178 NEGISDLDGLVSLTRNCLLPSDNPSNVLNSEVMIAECLDDLATSQHQITRLPVAEIESVC 237 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDS--SGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 A + +++ LL D+ G ++ + D + +F I + Sbjct: 238 RAPITLHTFGYGRDHNA-ALLESLADTTQGGAYYFIEDDSNVGSAFGNALGGIMSIVAQN 296 Query: 395 AP 396 A Sbjct: 297 AV 298 >gi|198421549|ref|XP_002127942.1| PREDICTED: similar to calcium activated chloride channel 4 [Ciona intestinalis] Length = 1075 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 38/298 (12%), Positives = 78/298 (26%), Gaps = 39/298 (13%) Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 N L S +S + V+D + + HN Sbjct: 215 TSDPNNNFLPLEGCLFFPFSELGQPDDLSASLLSHQFVDQVVDFCHNDTNDPTNLHNKEA 274 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA--PANRKIDVLIESAGNLVNSIQKAIQE 230 N+ + + + P P I+V + V + + Sbjct: 275 PNEHNRLCDQRSVWEIMMASRDFNAVNHPNPTLENILPTINVRQQPTNRYVLVLDTSGSM 334 Query: 231 KKNLSVRIG------TIAYNIGIVGNQCTPLSNNLNEVKSRLN------------KL--N 270 + + + Y S + ++ +L Sbjct: 335 SGSNYEYMMQAATDFIMTYIPKGAEAGIVEFSYTATTLSQLVSIENKADREYLASRLPGQ 394 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P +T + + L N+ +I +TDGE + + + + Sbjct: 395 PDGSTCIGCGILNGIEVLSNQGRDPAGGQ--------LIVLTDGEENYSPYVNDVRD--- 443 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA--VNDSRELLESFDKITDK 386 A + + S+ A G L + D+ G + V D L E+F ++ + Sbjct: 444 ---NAIEAHVVVDSIFFGASGNG-ALQQLTEDTKGTMYYNDVTDITGLKETFKQLAES 497 >gi|158257430|dbj|BAF84688.1| unnamed protein product [Homo sapiens] Length = 914 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 57/151 (37%), Gaps = 26/151 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 ++ + + RL T+ + A + + + ++ +T Sbjct: 362 INSGSDRDTLAKRLPA-AASGGTSICSGLRSASTVIRKKYPTD---------GSEIVLLT 411 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DGE++ S N + + +G I++VA+ P Q+L + G +D Sbjct: 412 DGEDNTISGCFNEV---------KQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASD 461 Query: 373 ---SRELLESFDKITD---KIQEQSVRIAPN 397 + L+++F ++ + ++S+++ Sbjct: 462 QVQNNGLIDAFGALSSGNGAVSQRSIQLESK 492 >gi|67459075|ref|YP_246699.1| hypothetical protein RF_0683 [Rickettsia felis URRWXCal2] gi|67004608|gb|AAY61534.1| unknown [Rickettsia felis URRWXCal2] Length = 194 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 23/137 (16%), Positives = 44/137 (32%), Gaps = 16/137 (11%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N+L ++K + LN T Y + A L + +I TDG+N Sbjct: 9 NSLEDIKKYVENLNADGYTRLYGTVKDALELL----------KEKIDIHSTIIVFTDGKN 58 Query: 317 SGASAYQNTLNTLQ-ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF----AVN 371 G + ++N +Y+V ++ + G + Sbjct: 59 EGTDCNTTQQEVIDSAISVIQNPQFNMYAVGFG-ENYNKEFFEEIAMRGGFTHISLRDPS 117 Query: 372 DSRELLESFDKITDKIQ 388 ++L + D I K+ Sbjct: 118 GMKQLEQYTDNIEQKVV 134 >gi|325108192|ref|YP_004269260.1| hypothetical protein Plabr_1627 [Planctomyces brasiliensis DSM 5305] gi|324968460|gb|ADY59238.1| protein of unknown function DUF1355 [Planctomyces brasiliensis DSM 5305] Length = 938 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/142 (16%), Positives = 47/142 (33%), Gaps = 20/142 (14%) Query: 246 GIVGNQCTPLSN--NLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGST 302 + PL+N N +++ ++ + Y A+ A EL S Sbjct: 447 DSTPHTIVPLTNVSNPDDIAQQVLGIQSMGGGIFVYEALVAAGNELMK----------SD 496 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCT 361 K +I +D + + + + NAG+ + + + + LL++ Sbjct: 497 LATKHIILFSDAAD-----SEEPGAYRSLIKDYENAGITVSVIGLGTTADVDAKLLQEIA 551 Query: 362 D-SSGQFFAVNDSRELLESFDK 382 SG D EL F + Sbjct: 552 TLGSGNIMFTQDVAELPRLFTE 573 >gi|91775988|ref|YP_545744.1| membrane protein-like protein [Methylobacillus flagellatus KT] gi|91709975|gb|ABE49903.1| membrane protein-like protein [Methylobacillus flagellatus KT] Length = 542 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/141 (10%), Positives = 40/141 (28%), Gaps = 4/141 (2%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVS-DRTIKDPTTKKDQTS 61 + + + LF+ A+D + + ++Q+ D A ++ + + K + Sbjct: 18 VLTLLMAVLFVAVAVDSGRLWMEKRKLQNIADMAAIAAGGQVGGCAQNNSSEAYKAAAQA 77 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + +L + G I N + + +LF Sbjct: 78 AAAANGYQGNLLAAPNAVQLGGYHTDSDGIRTFAANNERSAVRVLATQEV---PSSLFAG 134 Query: 122 GLIPSALTNLSLRSTGIIERS 142 G+ + + + Sbjct: 135 GIFNQRIVLRTEAVGAVRTPY 155 >gi|157818579|ref|NP_001100919.1| calcium-activated chloride channel regulator 1 [Rattus norvegicus] gi|149026146|gb|EDL82389.1| chloride channel calcium activated 3 (predicted) [Rattus norvegicus] Length = 910 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 22/166 (13%), Positives = 63/166 (37%), Gaps = 27/166 (16%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKESSHN 297 + ++ N+ + + +L T+ + A+ + + + Sbjct: 347 MVTFDSTAYVQSELTQLNSGADRDLLIKRLPTVASGGTSICSGLQAAFTSIKKKYPTDGA 406 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 ++ +TDGE++ S+ + + +N+G I++VA+ P ++L Sbjct: 407 E---------IVLLTDGEDNTISSCFDLV---------KNSGAIIHTVALG-PSAAKELE 447 Query: 358 RKCTDSSGQFFAVND---SRELLESFDKITD---KIQEQSVRIAPN 397 + + G +D + L+++F ++ I + S+++ Sbjct: 448 QLSKMTGGLQTYSSDQIQNNGLVDAFAALSSGNAAISQHSIQLESR 493 >gi|269926137|ref|YP_003322760.1| hypothetical protein Tter_1022 [Thermobaculum terrenum ATCC BAA-798] gi|269789797|gb|ACZ41938.1| hypothetical protein Tter_1022 [Thermobaculum terrenum ATCC BAA-798] Length = 490 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 37/342 (10%), Positives = 83/342 (24%), Gaps = 27/342 (7%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTT----- 55 + A++ V FI IDL R MQ+A+DAA L+ + + + Sbjct: 28 LFALLSVVLIGFIAMGIDLGMAYSQRRFMQNAVDAAALAATNQLADNLQGTSDGSWTFLV 87 Query: 56 KKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQ-------------INITKDKNNPLQ 102 ++ I + + + + N + Sbjct: 88 NDQNIRDTVRRYIDANSGITPPGSSYGAAGGCSSGPFCIEYLNVNKSLLARSPTSNGQVP 147 Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 T + FL ++ ++ + I S+ V + R Sbjct: 148 SGTAIVRVNIKRTYSTFLATVLGRNKMSVGATAAAQIFPVSKPKY-PPTGVWPMVRKYTG 206 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN-------TTKSKYAPAPAPANRKIDVLIE 215 L + N P S + + K+ I Sbjct: 207 NALTEFPTNGACPPPVVFWSPNDSGSTVGDFKGLFYVGKYSAYVQELNYDPSNKVAGTIP 266 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + ++ + + + + N L+ +K + Sbjct: 267 NHVQMITEFLSDNDDPYGDTEGPPANLNGSDVHDSLVQWFRNGLS-IKLTAGSYIAPPSD 325 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 +Y + + + + S K + + N+ Sbjct: 326 ADNEPTIPSYSGKLSSWQINRTSTYSNYGDKVEVITGNLGNN 367 >gi|312133570|ref|YP_004000909.1| protein [Bifidobacterium longum subsp. longum BBMN68] gi|311772822|gb|ADQ02310.1| Hypothetical protein BBMN68_1309 [Bifidobacterium longum subsp. longum BBMN68] Length = 362 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 39/348 (11%), Positives = 101/348 (29%), Gaps = 30/348 (8%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A LS + + DQ++ L+ + ++ D + Sbjct: 39 ATLSIASGSENKEVAVAIQKAADQSNVAVTMHYMGSLEIMNALKAGGQDHDAVWPASSMW 98 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 ++I + A + A + +++ ++ + + Sbjct: 99 ISMGDTKHIVKDAASTSTTPI------VFGIAKSKAVKLGWADDTGATKPVSTADILAAV 152 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + ++ + + L P + W + + S + ++ ++ Sbjct: 153 SDGKLTFSMTSATVIDSALNVYQTALR-KPSWTIWVVDYSGSMSGEGKNGVVKGLNAALD 211 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + +K+ E + V I + T S +++ + + T Sbjct: 212 -----PDQAKKSYIEPASGGVNILIPFETEAHRPVKATGTS--TSDLLHEADATDASGGT 264 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + Y + A EL +E E+S ++ +TDG ++ + Sbjct: 265 DIYEVLLSALDELPSESEASQ-------YTTAIVLMTDGRSNSDHQDEFESAYKS----- 312 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 R + I+S+ L+ S+ + F +L F + Sbjct: 313 RGRDLPIFSIMFG--DADPSQLKSLATLSNAKVFDGRS-GDLAAVFRQ 357 >gi|298291248|ref|YP_003693187.1| von Willebrand factor A [Starkeya novella DSM 506] gi|296927759|gb|ADH88568.1| von Willebrand factor type A [Starkeya novella DSM 506] Length = 313 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 49/156 (31%), Gaps = 31/156 (19%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKE 293 RIG + + Q P + ++ V + +T + A R L Sbjct: 130 RIGLVIFGDRAYVAQ--PPTFDVGSVAHAIEAAQIGISGRSTAISDGLGLATRRLLQ--- 184 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-E 352 S K V+ ++DG ++ + + G++++++A+ E Sbjct: 185 -------SDATSKVVVLLSDGVDTSG-----KVQAGDAARLAASHGIRVHTIALGPEDLE 232 Query: 353 GQDLLRKCTD----------SSGQFFAVNDSRELLE 378 Q R D G F V + +L Sbjct: 233 NQPESRDAVDAAALHAMAEAGGGTSFRVRNMEDLET 268 >gi|57956|emb|CAA79152.1| collagen alpha 1 chain type VI [Mus musculus] Length = 583 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 47/367 (12%), Positives = 100/367 (27%), Gaps = 38/367 (10%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 NQMQ + R +D + + Q + R Sbjct: 224 SHNQMQE-------HVTLRSPNVRNAQDFKEAVKKLQWMAGGTFTGEALQYARDRLLPPT 276 Query: 85 IAQKAQINITKDKNNPLQYI--AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERS 142 + + IT +++ + +I ++ +K + + L S Sbjct: 277 QNNRIALVITDGRSDTQRDTTPLSVLCGADIQVVSVGIKDVFGFVAGSDQLNVISCQGLS 336 Query: 143 SENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA 202 IS ++ + + N + +K +F S + Sbjct: 337 QGRPGIS---LVKENYAELLDDGFLKNITAQICIDKKCPDYTCPITFSSPADITILLDSS 393 Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNL 259 + + + A L A + + VR+ + Y+ N Sbjct: 394 ASVGSHNFETTKVFAKRLAERFLSAGRADPSQDVRVAVVQYSGQGQQQPGRAALQFLQNY 453 Query: 260 NEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + S ++ ++ + T+ A+ + R + KK V+ +DG Sbjct: 454 TVLASSVDSMDFINDATDVNDALSYVTRFYREASSGA--------TKKRVLLFSDG---- 501 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------GQDLLRKCTDSSGQFFA 369 S + + R AG++I+ V V G+ F Sbjct: 502 -SQGATAEAIEKAVQEARRAGIEIFVVVVGPQVNEPHIRVLVTGKTAEYDVAFGERHLFR 560 Query: 370 VNDSREL 376 V + + L Sbjct: 561 VPNYQAL 567 >gi|311028997|ref|ZP_07707087.1| hypothetical protein Bm3-1_00293 [Bacillus sp. m3-13] gi|311032266|ref|ZP_07710356.1| hypothetical protein Bm3-1_17297 [Bacillus sp. m3-13] Length = 245 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 13/96 (13%), Positives = 37/96 (38%), Gaps = 10/96 (10%) Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRK- 359 K ++ ITDG ++ + + + + G+ + + V + +++ Sbjct: 3 KGTLKQILLITDGCSNSG------DDPIAMAALAKEQGITVNVIGVMDEDTIDERGMQEI 56 Query: 360 --CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 S G + ++ L ++ +T K Q+++ Sbjct: 57 EGIAMSGGGVSQIVYAKNLSQTVQMVTRKAMTQTLQ 92 >gi|298370192|ref|ZP_06981508.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] gi|298281652|gb|EFI23141.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] Length = 1092 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 17/72 (23%), Positives = 27/72 (37%), Gaps = 4/72 (5%) Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--PEGQDLLRKCTDSSGQFFAVNDSRELL 377 S + + +N ++ Y+V + G+ LR S +F D L Sbjct: 281 SWDGDYSDPRG--TNFKNQLVQTYTVGFGSGVSATGEAYLRNGASGSNNYFNARDENGLF 338 Query: 378 ESFDKITDKIQE 389 +FD ITD I Sbjct: 339 AAFDAITDSIAN 350 >gi|194228584|ref|XP_001915384.1| PREDICTED: inter-alpha (globulin) inhibitor H5-like [Equus caballus] Length = 1313 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 56/176 (31%), Gaps = 13/176 (7%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 ++ ++ +Q ++ + N+ + K L + Sbjct: 302 KKAMNVILGDLQANDYFNIISFSDTVSVWKAGCSIQATI----QNVYDAKDYLGHMEAGG 357 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 T+ A+ A LY S +IF+TDGE + ++ I + Sbjct: 358 WTDINTALLAAASVLYPSN-XEPGRGPSVGRISLIIFLTDGEPTAD-MMTPSVILSNILQ 415 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-G---QFFAVNDSR-ELLESFDKIT 384 + N + ++S+ LLR + + G + D+ L + +I Sbjct: 416 ALGNR-VNLFSLVF-WDDADFPLLRHLSLENWGAAWHIYKDTDAALHLEGLYKEIF 469 >gi|159900457|ref|YP_001546704.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893496|gb|ABX06576.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 831 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/143 (13%), Positives = 38/143 (26%), Gaps = 8/143 (5%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 AI+ V F AID + Q Q+A AA ++G S+ S+ D ++ Sbjct: 23 AILFFVMIAFAALAIDTGEAFSRQRQQQAASTAASIAGLESMNSEIDGTDGAVQQAIRDA 82 Query: 63 IFKKQIKKHLKQGSYIRENAG--------DIAQKAQINITKDKNNPLQYIAESKAQYEIP 114 + I + + +++ + Sbjct: 83 LAANGITNAVYINGDWGNLDPSQNYYRAFYTQRGSRVEYPVGSGGQVSTEFNGLRVEVRS 142 Query: 115 TENLFLKGLIPSALTNLSLRSTG 137 N + +S + Sbjct: 143 ARNTIFGQALGIDTLEVSAENKA 165 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 25/183 (13%), Positives = 62/183 (33%), Gaps = 6/183 (3%) Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW---SKNTTKSK 198 S + I +V+D+S SM+ Y + + + + Y + K + + Sbjct: 371 SINTTSYDISLVVDISGSMQWCYDSQRTCSVDANARWYRVKDFLAKFSYKMLDVWNAPAG 430 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV---GNQCTPL 255 A ++ + N + +++ + + + + T + Sbjct: 431 QNMNNASLFPGEALVGKGGDNRIAAVRFSGNAVTSSPSFGFVTSPAGSDQASVSARTTTM 490 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 +N+N + S + K N +T+ + R N + +K ++ +TDG Sbjct: 491 RSNMNSLISWITKANMSGSTSGGRGLREGIRYFDNVSAHTRVDRFGRPIKLVMVMLTDGL 550 Query: 316 NSG 318 + Sbjct: 551 TNV 553 >gi|167763115|ref|ZP_02435242.1| hypothetical protein BACSTE_01484 [Bacteroides stercoris ATCC 43183] gi|167699455|gb|EDS16034.1| hypothetical protein BACSTE_01484 [Bacteroides stercoris ATCC 43183] Length = 342 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 22/168 (13%), Positives = 55/168 (32%), Gaps = 49/168 (29%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L ++P + T A++ A R + + + Sbjct: 142 TQLPITSDYISAKMFLESIDPSLISKQGTAIGAAINLASRSFTPQ----------EGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 VI ITDGEN + ++ + G+++ + V P Sbjct: 192 AVIVITDGENHEG-------DAVEAAKDAAEKGIQVNVLGVGMPEGAPIPAEGTNDYRRD 244 Query: 353 ----------GQDLLRKCTDSSGQFF-AVNDSRELLESFDKITDKIQE 389 + + ++ + + V+++ + I+ +I + Sbjct: 245 RDGNVIVTRLNEQMCQEIAKAGNGIYVRVDNT---NGAQKAISREINK 289 >gi|255531386|ref|YP_003091758.1| von Willebrand factor A [Pedobacter heparinus DSM 2366] gi|255344370|gb|ACU03696.1| von Willebrand factor type A [Pedobacter heparinus DSM 2366] Length = 344 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 21/150 (14%), Positives = 49/150 (32%), Gaps = 32/150 (21%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 +++ + L++++ + RIG I + P++ + + K LN Sbjct: 110 NRLENAKRAISQLIDNLH---------NDRIGIIIFAGEAYVQ--LPITTDYSAAKLFLN 158 Query: 268 KLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + P + T A+ + K +I +TDGEN Sbjct: 159 NITTDIVPTQGTAIGAAIDMGMKSFNFVN----------GTSKAMILMTDGEN------- 201 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + + + + + I+ + V + Sbjct: 202 HEDDAVSAAKRASAKDVAIHVIGVGSEEGA 231 >gi|330506652|ref|YP_004383080.1| von Willebrand factor, type A [Methanosaeta concilii GP-6] gi|328927460|gb|AEB67262.1| von Willebrand factor, type A [Methanosaeta concilii GP-6] Length = 551 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 14/94 (14%), Positives = 33/94 (35%), Gaps = 12/94 (12%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NN + + L T T+ + A + L + + ++DGE Sbjct: 434 NNRSYFVGAVGGLQAGGATATFDGIAVAMKMLEEQLALDPKLKPK------IFVLSDGET 487 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + ++ +G+ IY++ +A Sbjct: 488 NRGHSLNDIRKLVE------ESGIPIYTIGYNAD 515 >gi|307107471|gb|EFN55714.1| hypothetical protein CHLNCDRAFT_134017 [Chlorella variabilis] Length = 611 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 16/111 (14%), Positives = 37/111 (33%), Gaps = 23/111 (20%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 TN A+ A + L ++H + ++ +TDG + + Sbjct: 511 NGGTNIALAVQKAGQLLKPLSATAH---------RVLVLLTDGRIDSH----QSREARDM 557 Query: 332 CEYMRNA--GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + + +++++ V + Q+LLR C D + + Sbjct: 558 AARLGDEQANVRVHAYGVGRGVDKQELLRICA--------ARDPDSAEDRY 600 >gi|326795600|ref|YP_004313420.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] gi|326546364|gb|ADZ91584.1| von Willebrand factor type A [Marinomonas mediterranea MMB-1] Length = 635 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/135 (14%), Positives = 48/135 (35%), Gaps = 13/135 (9%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ S ++ + + L ++ S + I +TDG Sbjct: 90 KAKLASYVDNYVT---QDLKTDLESIIELLLKTPDTPSL---SDGFDRHWILVTDGMVDV 143 Query: 319 A-----SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVND 372 + + + E + G+ +++V+ +D+L++ ++ V Sbjct: 144 SLDKDVNEASRRRIMTTLTEALEKRGIHLHTVS-MTGYTDEDMLKQLAVKTNASHTEVAS 202 Query: 373 SRELLESFDKITDKI 387 ELL++FD+I + Sbjct: 203 PEELLDTFDRIFAQA 217 >gi|262073024|ref|NP_001159971.1| integrin alpha-2 [Bos taurus] gi|296475809|gb|DAA17924.1| integrin alpha-2 [Bos taurus] Length = 1179 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 40/343 (11%), Positives = 85/343 (24%), Gaps = 42/343 (12%) Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + L + + + + + ++ TE Sbjct: 49 AVQQFINPKGNWLLVGSPWSGFPKNRMGDVYKCPVDLSTTTCEKLNLQTSTSMSNVTEMK 108 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 L + N+ S V + + Sbjct: 109 TNMSLGLTLTRNVGTGGFLTCGPLWAQQCGSQYYTTGVCSDVSPDF----------QLRT 158 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 P + + + P D + V + + ++ Sbjct: 159 SFAPAVQTCPSFIDVVVVCDESNSIYPW----DAVKNFLEKFVQGL-----DIGPTKTQM 209 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSH 296 G I Y + +E+ ++ Y TNT+ A+ +A Y+ Sbjct: 210 GLIQYANNPRVVFNLNTFKSKDEMIKATSQTFQYGGDLTNTFKAIQYARDTAYSTAAGGR 269 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV----SAPPE 352 K ++ +TDGE+ S + + + + +AV + Sbjct: 270 PGAT-----KVMVVVTDGESHDGS------KLKAVIDQCNKDNILRFGIAVLGYLNRNAL 318 Query: 353 GQDLL----RKCTD--SSGQFFAVNDSRELLESFDKITDKIQE 389 L + + FF V+D +LLE I ++I Sbjct: 319 DTKNLIKEIKAIASIPTERHFFNVSDEADLLEKAGTIGEQIFS 361 >gi|116624267|ref|YP_826423.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116227429|gb|ABJ86138.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 306 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/146 (16%), Positives = 52/146 (35%), Gaps = 21/146 (14%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 ++++ +++S L KL T A+ L KK ++ ITD Sbjct: 136 DFTSDVKQLESSLRKLGSKGETAMRDALSLGLDHLRAPARKD---------KKVLVVITD 186 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-------SSGQ 366 GE++ + Q + + + IY + + A R + G+ Sbjct: 187 GEDNSSIQKQEN-----LIRAAHLSNVIIYGIGLLAAEAPASAQRAKASLDVLTLATGGR 241 Query: 367 FFAVNDSRELLESFDKITDKIQEQSV 392 + + ++ + +I +I+ Q V Sbjct: 242 SWYPENVADIEKITPEIAHEIRNQYV 267 >gi|317502941|ref|ZP_07961033.1| aerotolerance protein BatB [Prevotella salivae DSM 15606] gi|315665940|gb|EFV05515.1| aerotolerance protein BatB [Prevotella salivae DSM 15606] Length = 340 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 26/168 (15%), Positives = 53/168 (31%), Gaps = 41/168 (24%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 P++N+ K L +NP T + A + + + + +I IT Sbjct: 144 LPITNDYVSAKMFLQNINPSLITTQGTDLARAISLSQSCFTQREH------IGRAIIVIT 197 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---------------PPEGQDLL 357 DGE+ L+ G+ ++ + V GQ +L Sbjct: 198 DGEDHEG-------GALEAAREAYKKGINVFILGVGTSKGAPIPDGNGGYLKDNSGQTVL 250 Query: 358 ---------RKCTDSSGQFFAVNDSRE----LLESFDKITDKIQEQSV 392 + +G + V+++ + L + K+ I + V Sbjct: 251 STLNEQMCQQVAKAGNGVYIHVDNTSDAQEKLNKELSKLQSGISDTVV 298 >gi|306828192|ref|ZP_07461454.1| collagen adhesion protein [Streptococcus pyogenes ATCC 10782] gi|304429615|gb|EFM32662.1| collagen adhesion protein [Streptococcus pyogenes ATCC 10782] Length = 1036 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 64/198 (32%), Gaps = 22/198 (11%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYE- 273 + S+ ++K TI Y ++ + + + + L+ + Sbjct: 536 AVVGFQGSVAYRYYDEKPERTPWNTIMYQPSKSTSKDADVLKDWETSSNLSRDSLSYQDR 595 Query: 274 -NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI--------TDGENSGASAYQN 324 TN + A+ A L + H I F +G S + + Sbjct: 596 NGTNYHAALLKADEMLQKVANNGHRKIMVFISDGVPTFYFGADNYRSGNGTVSDGNIINS 655 Query: 325 TLNTLQICEYMRNA--GMKIYSVAVSAPPEGQD------LLRKCTDSSGQFFAVNDSREL 376 + + +N + IYS+ VS +L+ + + D+ +L Sbjct: 656 QKGSKLAIDEFKNKYPNLSIYSLGVSKDINSDTSSSSPVVLKYL-SGDDYYSGITDTEQL 714 Query: 377 LESFDKITD--KIQEQSV 392 ++ +KI + KI ++ Sbjct: 715 EKTANKIVEDSKISNLTI 732 >gi|261404225|ref|YP_003240466.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] gi|261280688|gb|ACX62659.1| von Willebrand factor type A [Paenibacillus sp. Y412MC10] Length = 562 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 13/98 (13%), Positives = 35/98 (35%), Gaps = 15/98 (15%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 +N L NT T+ + A + L +E ++ + + ++DGE + + + Sbjct: 454 AINSLEASGNTATFDGIVVAMKMLQDEMAANPDVKP------LIFVLSDGETNVGHSLDD 507 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 ++ + +Y++ L+ + Sbjct: 508 IRG------LIQAFKIPVYTIGY---NADIQALQSISS 536 >gi|75812633|ref|YP_320251.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75705389|gb|ABA25062.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 464 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/194 (14%), Positives = 60/194 (30%), Gaps = 25/194 (12%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 Y + T +K KID++IES LV S + + R+ Sbjct: 57 YEIVTGETTPTGVTYTQDAKEYSQVTGGKSKIDIVIESLLALVRSGRLEASD------RV 110 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 + ++ + ++++++ + +L T + A L Sbjct: 111 AIVQFDDTASQIIDLTPATQVSQLENAIAQLRSFSGGTRMGLGLRRALDMLS-------- 162 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 G + + TDG+ + + + I ++ V +DLL Sbjct: 163 --GQDMAVRRTLLFTDGQT------FDEDICRALASDFATKNIPITALGVG-EDFKEDLL 213 Query: 358 RKCT-DSSGQFFAV 370 + + G F V Sbjct: 214 SHLSDSTGGTLFYV 227 >gi|321466601|gb|EFX77596.1| hypothetical protein DAPPUDRAFT_30003 [Daphnia pulex] Length = 3316 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 14/91 (15%), Positives = 31/91 (34%), Gaps = 15/91 (16%) Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 K V ITDG ++G + + +R G++I++ L Sbjct: 11 KAVFLITDGYSNGG-------DPRPAAKRLREQGVRIFT--FGIRNGNVKELYDMASEPA 61 Query: 366 --QFFAVNDSRELLESFDKITDKIQEQSVRI 394 + V+ E F+ + + + ++ Sbjct: 62 QEHSYIVDSFEE----FEALARRALHEDLQT 88 >gi|195978918|ref|YP_002124162.1| fimbrial structural subunit protein FszD [Streptococcus equi subsp. zooepidemicus MGCS10565] gi|195975623|gb|ACG63149.1| fimbrial structural subunit protein FszD [Streptococcus equi subsp. zooepidemicus MGCS10565] Length = 967 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 34/261 (13%), Positives = 78/261 (29%), Gaps = 35/261 (13%) Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 + + + +V+D S SM++ Q + N + F + N Sbjct: 411 KKQPLDVLVVVDRSASMKEGISQNDIPRDQAVKNALTGAGGLLQKFININAENKLSVIGF 470 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 + + + +S R +I + + + Sbjct: 471 QGSLN---------------YNSREGKPERISWRSIIYQPSINNNKDADVLKNWESSSAL 515 Query: 264 SRLNKLNPYE--NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI--------TD 313 +R + L+ + TN + A+ A L + H I F + Sbjct: 516 NR-DDLSYKDKNGTNYHAALVKADEMLNKVADDGHRKIMVFVSDGVPTFYFGSDHYRAGN 574 Query: 314 GENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQD------LLRKCTDSSG 365 G + ++ + T + + + IYS+ VS +L+ Sbjct: 575 GTSDASNIKSSQDGTRAAIDDFKKKHPNLSIYSLGVSKDINSDTASSSPVVLKYL-SGED 633 Query: 366 QFFAVNDSRELLESFDKITDK 386 ++ + ++ EL + +KI + Sbjct: 634 HYYGITNTVELEKIANKIVED 654 >gi|149566038|ref|XP_001520798.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta subunit 4 [Ornithorhynchus anatinus] Length = 808 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/351 (6%), Positives = 98/351 (27%), Gaps = 19/351 (5%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + + + + D + + ++ D + + N Sbjct: 88 MLGRKVEAVQRLVEAAEEADLNHEFNASLVFDYYNAVLINEKDEKDEFVELGAEFVLEAN 147 Query: 99 NPLQYIA--ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + S + ++PT I + + + +E + ++ Sbjct: 148 EHFSDLLVNTSVSNVQLPTNVYNKDPNILNGVYMSEALNPVFVENFQRDPTLTWQYFGSS 207 Query: 157 SRSME--DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + + ++ + + + K+ ++ + Sbjct: 208 TGFFRLYPGIKWTPDQRGVISFDCRNRGWYIQAATSPKDIVIVVDVSGSMKG-LQMTIAK 266 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + +++++ + ++ G +N K L++L Sbjct: 267 HTVATILDTLGENDF-VNIIAYSDYVHYLEPCFQGILVQADRDNREHFKQLLDELQAKGV 325 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 + A+ A+ L +++ + + + ++ ITDG G + N Sbjct: 326 GSVSKALREAFTVLQQVRDAGQGALCN----QAIMLITDGAVDGYESIFEKYNWPG---- 377 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 +++++ + ++ ++ ++ L + + + + Sbjct: 378 ---RKVRVFTYLIGREVSFAANVKWIACNNKGYY--TQISTLADVQENVME 423 >gi|329928399|ref|ZP_08282269.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] gi|328937835|gb|EGG34241.1| von Willebrand factor type A domain protein [Paenibacillus sp. HGF5] Length = 562 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 13/98 (13%), Positives = 35/98 (35%), Gaps = 15/98 (15%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 +N L NT T+ + A + L +E ++ + + ++DGE + + + Sbjct: 454 AINSLEASGNTATFDGIVVAMKMLQDEMAANPDVKP------LIFVLSDGETNVGHSLDD 507 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 ++ + +Y++ L+ + Sbjct: 508 IRG------LIQAFKIPVYTIGY---NADIQALQSISS 536 >gi|210135186|ref|YP_002301625.1| phage/colicin/tellurite resistance cluster protein TerY [Helicobacter pylori P12] gi|210133154|gb|ACJ08145.1| phage/colicin/tellurite resistance cluster protein TerY [Helicobacter pylori P12] Length = 214 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 27/192 (14%), Positives = 67/192 (34%), Gaps = 30/192 (15%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +I VL ++ ++++ + KK L ++ + + G G +++ V Sbjct: 32 GNGTRIGVLNLCIQKMIETLKQ--EAKKELFSKMAIVTF--GENGVNLHTPFDDIKNVNF 87 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA--- 321 L+ T A A + ++ +T + K + I ++DGE + Sbjct: 88 E--PLSASGGTPLDQAFKLAKDLIEDK-----DTFPTKFYKPYSILVSDGEPNNDKWQEP 140 Query: 322 ----YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + + +C +S+ + + + F +D +L+ Sbjct: 141 LFNFHHDGRSAKSVC----------WSIFIGDREVNPQVNKD--FGKDGVFYADDVEKLV 188 Query: 378 ESFDKITDKIQE 389 F+ +T I + Sbjct: 189 GLFEIMTQTISK 200 >gi|56696061|ref|YP_166415.1| hypothetical protein SPO1165 [Ruegeria pomeroyi DSS-3] gi|56677798|gb|AAV94464.1| conserved domain protein [Ruegeria pomeroyi DSS-3] Length = 257 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 20/201 (9%), Positives = 57/201 (28%), Gaps = 36/201 (17%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 +I E+ + + I + + + P + + N Sbjct: 57 YNGLDRPRILDAREALHDALPRIAALRRLGLVTYGAAMDGDADGDLCKRVSMPFTPSPNA 116 Query: 262 ---VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + + ++ + P NT A++ A R ++ +TDG+ + Sbjct: 117 AGQILNLIDAIEPDGNTALTDAVNLAARVFDQPPRP-----------GVIVLVTDGDETC 165 Query: 319 ASAYQNTLNTLQICEYMRNA-GMKIYSVAV---------SAPPEGQDLLRKCT------- 361 A + R+ G+ ++ + + + Sbjct: 166 GGAPCAL-----AADLARDTPGLTVHVIGFRVRSQFFGWEGGDGNPTVPETISPAECLAQ 220 Query: 362 DSSGQFFAVNDSRELLESFDK 382 + G++ + EL+ + ++ Sbjct: 221 ATGGEYVSTETVEELIRALNQ 241 >gi|47203515|emb|CAG02000.1| unnamed protein product [Tetraodon nigroviridis] Length = 349 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 20/178 (11%), Positives = 64/178 (35%), Gaps = 16/178 (8%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 + + + K ++ + VR+ + Y+ ++ + N+ + + L L Sbjct: 18 AIREFIKRMAKDLETHDDA-VRMAVMQYSDDVMVHFNLKSHNSKKALINALRNLRHKGGR 76 Query: 276 N--TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 N T A+ ++ S + + + + +T G+ + + Sbjct: 77 NRKTGAALQFVRDRVFTTLFGSRHL---EGVPQILFLLTLGKAGDDVS--------KAAL 125 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 ++ G++ +++ + L++ S + + ELL ++ +Q+++ Sbjct: 126 SLKQFGVQTFAIGI--KKAKLQELQQIASPSRFLYNLPVFGELLSIQPQLAALVQQRT 181 >gi|325698104|gb|EGD39985.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK160] Length = 464 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/269 (10%), Positives = 69/269 (25%), Gaps = 62/269 (23%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 + W N ++ +L + A ++ ++ + N+ Sbjct: 196 NVAISFVFDKSGSMSWDLN----GNNTNYWGPKSRMSILKDKATIMMRDLK----DIGNV 247 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY----- 289 SV + + + V + L +++ +N L TN + + L Sbjct: 248 SVNLVSFSILGSYVQKDFSELDKGTTTIEASINALQTGGVTNPGDGLRYGMMSLQNHSAQ 307 Query: 290 -----------------NEKESSHNTIGSTRLKKF---------------VIFITDGENS 317 + ++S T+ Sbjct: 308 LKYVVLLTDGIPNAYTVDTNDTSWRNRNVQPYYNRWRETVGELVTFNNGPYDVTTNLTTD 367 Query: 318 GASAYQNTLNTLQICEYMRNA----------GMK-IYSVAV-SAP---PEGQDLLRKCTD 362 ++ + + + G+K + + P G+DL R Sbjct: 368 QNRVSYDSYSNEALRKKAIEYAGKVSQTFGAGVKRVNVIGFSGVPSEIAYGEDLTRSIGS 427 Query: 363 SS--GQFFAVNDSRELLESFDKITDKIQE 389 ++ D L ++F I +IQ+ Sbjct: 428 GGMEAKYVPAADEAALQQTFSDIKKQIQQ 456 >gi|325285570|ref|YP_004261360.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] gi|324321024|gb|ADY28489.1| von Willebrand factor type A [Cellulophaga lytica DSM 7489] Length = 235 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 28/227 (12%), Positives = 59/227 (25%), Gaps = 35/227 (15%) Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + ++D+S SME + D S SK + Sbjct: 36 TSTKNTLYLIDISGSMEGIDEGSVKDQVVRE------VGNKAGSQVSKAIGGKIGSILGK 89 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 ++ L + + K + + K V + Sbjct: 90 QVTKEATKL-GAVKRKLIPAIKGLPDGKKFLVFSF-NNNVTKQATEFRVASNTTRTSSNI 147 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + L TNT + A ST + ++ ++DG + Sbjct: 148 FVQNLKASGGTNTLEGLLEAL---------------STADVQEIVLMSDGLPNSG----- 187 Query: 325 TLNTLQICEYMRN---AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 + E ++ + + I+++A D +R + F Sbjct: 188 ---PKAVLEEIKKVNTSNIIIHTIAFGEDA-DLDFMRTLAQENNGTF 230 >gi|261194779|ref|XP_002623794.1| U-box domain-containing protein [Ajellomyces dermatitidis SLH14081] gi|239588332|gb|EEQ70975.1| U-box domain-containing protein [Ajellomyces dermatitidis SLH14081] Length = 756 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 66/204 (32%), Gaps = 14/204 (6%) Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANR---KIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 P + + S + AP P K + S +L + I E N + R+G Sbjct: 71 HVPCDIVLCIDISYSMSSSAPLPTTDDSGKPEDTGLSVLDLTKHAARTIIETLNDNDRLG 130 Query: 240 TIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 +A++ N N + L P +TN + + + L Sbjct: 131 VVAFSTDAEVVYKISNMNEDNKKAALKAVEALWPLSSTNLWHGLKLSLEALEEVTP---- 186 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 + + + +TDG + Q + L+ ++ I++ LL Sbjct: 187 ---IPQNVQALYILTDGMPNHMCPRQGYVPKLRSILQQKDRLPMIHTFGFGYYIRS-GLL 242 Query: 358 RKCTD-SSGQFFAVNDSRELLESF 380 + ++ G + + D+ + F Sbjct: 243 QAISEVGGGTYSFIPDAGMIGTVF 266 >gi|290975167|ref|XP_002670315.1| hypothetical protein NAEGRDRAFT_59739 [Naegleria gruberi] gi|284083872|gb|EFC37571.1| hypothetical protein NAEGRDRAFT_59739 [Naegleria gruberi] Length = 608 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 24/130 (18%), Positives = 49/130 (37%), Gaps = 6/130 (4%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 SN++ ++ S + L + A L K S + ++ ++ I Sbjct: 358 LDFSNDIEQIVSFVQSLKGTGGLDPAECYELA---LKEAKTLSWTPKDNAFTQRCLVMIG 414 Query: 313 DGENSGASAYQNTLNTLQICEYM-RNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAV 370 D A+ Y ++ + C+ + + +KI+SV A P + D + G+ F + Sbjct: 415 DEVPHTATEYFK-IDWQEECQSLFNDYHIKIHSVQCQARPHADQFYQYLADETGGKRFEL 473 Query: 371 NDSRELLESF 380 D + F Sbjct: 474 KDMDSMQSMF 483 >gi|301617755|ref|XP_002938301.1| PREDICTED: integrin alpha-E-like [Xenopus (Silurana) tropicalis] Length = 1148 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 46/132 (34%), Gaps = 15/132 (11%) Query: 262 VKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + ++ + N T T A+ H ++ E+ S S K ++ +TDG+ Sbjct: 250 ILQKVQDIKQVGNVTKTASALDHVLESVFTEEHGS-----SETATKIILVLTDGD----- 299 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAP---PEGQDLLRKCTDSS-GQFFAVNDSREL 376 + + ++ + + ++ + + V + L+ V+D +L Sbjct: 300 IFMDPMDINDVMNNSKMKKIERFVIGVGEAFQKEKALKTLKTIASQGEEHLLTVDDYSKL 359 Query: 377 LESFDKITDKIQ 388 + KI Sbjct: 360 EGLLTSLQQKII 371 >gi|291229678|ref|XP_002734799.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 1003 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 30/209 (14%), Positives = 63/209 (30%), Gaps = 26/209 (12%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 +P +K T N +ID L ++ N + + + Sbjct: 272 SSVPTFRTVKRRNKRTVLVMDTSGSMEENGRIDKLHQAVSNYI-------LNTLDDGEEV 324 Query: 239 GTIAYNIGIVGNQCTPLSNN--LNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESS 295 G + ++ L NN E+ SR+ + T+ + A+ L + ++ Sbjct: 325 GVVTFSTTATIQSHLVLINNESRTELLSRVPSMQSVGRWTSIGSGLLKAFDVLEEGERNA 384 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 ++ I+DGE + + I + G+ + S+ + Sbjct: 385 AG--------GVIVVISDGEENRDPLIAD------IIPMVLEKGVTVDSIGIGTDASTNL 430 Query: 356 LLRKCTDSSGQFFAVNDSRE--LLESFDK 382 + F+ DS L E+ Sbjct: 431 EVLPAATDGMTFYYSEDSNSNGLNEALAA 459 >gi|241672104|ref|XP_002411442.1| hypothetical protein IscW_ISCW011070 [Ixodes scapularis] gi|215504093|gb|EEC13587.1| hypothetical protein IscW_ISCW011070 [Ixodes scapularis] Length = 1021 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/97 (15%), Positives = 34/97 (35%), Gaps = 17/97 (17%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TNT + A+ + ++ KK +I +TDG + + ++ Sbjct: 127 GATNTGAGLQAAWEVFQRSRPTA---------KKLLILVTDGMATMG------PDPVKKA 171 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 E ++N G+ I+ + + L + + Sbjct: 172 EKLKNMGVDIFVFGIGRMLKQH--LEQLASTPANVSD 206 >gi|327351891|gb|EGE80748.1| U-box domain-containing protein [Ajellomyces dermatitidis ATCC 18188] Length = 756 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 29/277 (10%), Positives = 74/277 (26%), Gaps = 17/277 (6%) Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 +++ + + IP + + +E + S + Sbjct: 4 AQSVASTEDDFEIIDDQIPIRSRPSTGTNIAGERNPNEVAVQLHPLPDTNSMILSVHPPL 63 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNT--TKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 +L S S T +D+ +A ++ ++ Sbjct: 64 HPEKELRHVPCDIVLCIDISYSMSSSAPLPTTDDSGKPEDTGLSILDLTKHAARTIIETL 123 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 + + Y I + +N + L P +TN + + + Sbjct: 124 NDNDRLGVVAFSTDAEVVYKISNMNE------DNKKAALKAVEALWPLSSTNLWHGLKLS 177 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 L + + + +TDG + Q + L+ ++ I++ Sbjct: 178 LEALEEVTP-------IPQNVQALYILTDGMPNHMCPRQGYVPKLRSILQQKDRLPMIHT 230 Query: 345 VAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 LL+ ++ G + + D+ + F Sbjct: 231 FGFGYYIRS-GLLQAISEVGGGTYSFIPDAGMIGTVF 266 >gi|76798101|ref|ZP_00780356.1| cell wall surface anchor family protein [Streptococcus agalactiae 18RS21] gi|76586556|gb|EAO63059.1| cell wall surface anchor family protein [Streptococcus agalactiae 18RS21] Length = 420 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 15/124 (12%), Positives = 37/124 (29%), Gaps = 23/124 (18%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 + Y + + ++ T GE + N Sbjct: 5 GYAINSGYIYLYWR--DYNWVYPFDPKTKKVSATKQIKTHGEPTTLYFNGN--------- 53 Query: 334 YMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQFFAVNDS----RELLESFDK 382 +R G I++V + + + ++ + + + V+D+ EL + F Sbjct: 54 -IRPKGYDIFTVGIGVNGDPGATPLEAEKFMQSISSKTENYTNVDDTNKIYDELNKYFKT 112 Query: 383 ITDK 386 I ++ Sbjct: 113 IVEE 116 >gi|66819075|ref|XP_643197.1| hypothetical protein DDB_G0276383 [Dictyostelium discoideum AX4] gi|60471310|gb|EAL69272.1| hypothetical protein DDB_G0276383 [Dictyostelium discoideum AX4] Length = 2026 Score = 48.4 bits (113), Expect = 0.002, Method: Composition-based stats. Identities = 43/304 (14%), Positives = 92/304 (30%), Gaps = 32/304 (10%) Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 L LI ++ + + S ++I +LD + + K ++ + Sbjct: 1616 LLSNELIKTSRILENKNEKKPVFLDSLQVSILFGYLLDSDSLVACSMVCKLWRKCSLN-S 1674 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN---L 234 + SK+ + + +I + E ++V+ I+KA Sbjct: 1675 NIWTRFMSRDHQVSKSLDLVFCIDSTGSMSGEIKEVKEKITSIVDRIEKAKVNVNVGLVF 1734 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 I ++++ VK + + + + A+ E+ K Sbjct: 1735 YNDHEIIYLPTDKSPTIVYEFTDDIPLVKKNIGTIKAFGGNDHPEAVADGLYEVSKLKFR 1794 Query: 295 SHNTIGSTRLKKFVIFITDG--------------ENSGASAYQNTLNTLQICEYMRNAGM 340 S K I ITD E+ + +++++ + G+ Sbjct: 1795 S-------NASKICILITDAPAHGFDSKLNETYSEDYHPNGCPCGHDSIELVRSLVRRGI 1847 Query: 341 KIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE-----SFDKI-TDKIQEQSVR 393 Y+V+ P D L S G+ + S L + + D I D I + Sbjct: 1848 TFYTVSCRPTPNSSDYLNAIASISEGKCVNLTSSEILSDFIVGSAQDAIYLDSISALVRK 1907 Query: 394 IAPN 397 + Sbjct: 1908 EVEH 1911 >gi|260823627|ref|XP_002606182.1| hypothetical protein BRAFLDRAFT_126499 [Branchiostoma floridae] gi|229291521|gb|EEN62192.1| hypothetical protein BRAFLDRAFT_126499 [Branchiostoma floridae] Length = 951 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 56/205 (27%), Gaps = 24/205 (11%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL--- 255 P P + + + N L Sbjct: 64 QIPLPMEWTSLLQNKRTHVIVAADKSGSMSGNPWRQVQQALLYMIGDVASVNPSVALDVV 123 Query: 256 -SNNLNEV-------KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 N+ + + +N++N T+ A L E + + + Sbjct: 124 IYNDKASLLQYAGSYQDAVNRVNADGMTSFAAAFSCIKDCLKTEIQGTPVSKT------V 177 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDL--LRKCTDS 363 V+F+TDG ++ + E + G ++ V SA + L LR + Sbjct: 178 VVFMTDGADTCNRGADIDRSVRSWKEALARLGHEAIVHVVGFSAQHDYNFLGRLRNTGTT 237 Query: 364 SGQFFA---VNDSRELLESFDKITD 385 +G F + + L ++ D Sbjct: 238 AGLFRYTEPSDGTEALKAKLQELFD 262 >gi|163816539|ref|ZP_02207903.1| hypothetical protein COPEUT_02729 [Coprococcus eutactus ATCC 27759] gi|158448239|gb|EDP25234.1| hypothetical protein COPEUT_02729 [Coprococcus eutactus ATCC 27759] Length = 465 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 18/181 (9%), Positives = 57/181 (31%), Gaps = 23/181 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + K D + +A L+ I + + + L N + Sbjct: 154 SSSMKTSDKNDRRLTAANELLEHIDGNRKVGLIRFSKDIHCYIPMDY-------LKVNKS 206 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + L T+ A++ ++ + VI +TDG+++ Sbjct: 207 TLNHELENKAKEGGTDINDALYAVLNAFDKVGTATG--------SRSVILLTDGKSTT-- 256 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC-TDSSGQFFAVNDSRELLES 379 ++ + + ++I +++ + +++ + + G+ + L + Sbjct: 257 ----NVDEEYLINRANSMNIQINVISLG-NHTDKAFIKRITSSTGGKAAKTSSDFYLDAA 311 Query: 380 F 380 + Sbjct: 312 Y 312 >gi|126667415|ref|ZP_01738387.1| hypothetical protein MELB17_14151 [Marinobacter sp. ELB17] gi|126628171|gb|EAZ98796.1| hypothetical protein MELB17_14151 [Marinobacter sp. ELB17] Length = 774 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 26/219 (11%), Positives = 64/219 (29%), Gaps = 25/219 (11%) Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 P + + S P ++ + L + Sbjct: 37 SGMAQAQAEQADSPQLPGAVDVRIIVDISGSMKQNDPQ------NLRRPAVRLLARLLPD 90 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + + V + ++ ++ + + + TN A+ A Sbjct: 91 GATAGVWTFGQYVNMLVPHREVSDAWRDMAIEQSDAINSV-AMR----TNLGAAIETASD 145 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL-----QICEYMRNAGMK 341 + S+ I +TDG+ + + ++ G + Sbjct: 146 GYFTGGVLSNT---------HFIVLTDGKVDISRNPSANKAEANRILDTLVPPLKQQGAR 196 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 ++VA+SA + + L + +DS+G F ++ +L +F Sbjct: 197 FHAVALSAEADTEFLRKLASDSNGSFHVAENANDLSRAF 235 >gi|258515700|ref|YP_003191922.1| hypothetical protein Dtox_2493 [Desulfotomaculum acetoxidans DSM 771] gi|257779405|gb|ACV63299.1| hypothetical protein Dtox_2493 [Desulfotomaculum acetoxidans DSM 771] Length = 140 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 12/99 (12%), Positives = 36/99 (36%), Gaps = 8/99 (8%) Query: 9 CFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQ--------T 60 +F +D +++Q+Q+A DAA L+G +++ + + ++ Sbjct: 1 MLMFSALGLDYGRAYVLKHQLQAACDAASLAGSSAVSAKLITDGTGSVTNKKLLLDPIIA 60 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 +++ +I + + + +D + Sbjct: 61 EARATDVWNQNVSSMKFIDKGVTIVDTSNHSALDEDSDG 99 >gi|149921342|ref|ZP_01909796.1| pentapeptide repeat protein [Plesiocystis pacifica SIR-1] gi|149817775|gb|EDM77239.1| pentapeptide repeat protein [Plesiocystis pacifica SIR-1] Length = 739 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 25/125 (20%), Positives = 42/125 (33%), Gaps = 11/125 (8%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV-IFITDGENSGASAYQNTLNTLQ 330 T T+ + + L + + + + ++V I ITDG+ S Y Sbjct: 438 GSGTFTHRGLELVFDNLLDYQANPPALYPADEDTQYVNILITDGQ---YSTYSTDAQVQN 494 Query: 331 ICEYMRNAGMKIYSVAVSAPPE---GQDLLRKCTD--SSGQF--FAVNDSRELLESFDKI 383 E + +AG K Y + Q L S G F + EL + I Sbjct: 495 ALEALLDAGSKTYVIGFGDGLNTTEAQLQLMNMATWGSGGTEMPFDADSQAELELALGAI 554 Query: 384 TDKIQ 388 + I+ Sbjct: 555 IEDIE 559 >gi|327538509|gb|EGF25172.1| von Willebrand factor type A domain-containing protein [Rhodopirellula baltica WH47] Length = 764 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 23/169 (13%), Positives = 50/169 (29%), Gaps = 22/169 (13%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIV---GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 + + N + IA++ + N+ K + L TN PA Sbjct: 366 FADHVLDHLNPNDEFRVIAFSNRTTAFQPDAIAATDANIQSAKQFVRGLRASGGTNLLPA 425 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + A + ++++ +TD + L + Sbjct: 426 LKLAL----------GGEADESARPRYMVLMTDALVGNDHSILRYLRQPEF------QDA 469 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR---ELLESFDKITDK 386 +++ +A A P + R G V + E+ F ++T + Sbjct: 470 RVFPIAFGAAPNDYLISRAAEMGRGFSMQVTNQDNTPEIARRFHELTSQ 518 >gi|282897675|ref|ZP_06305674.1| von Willebrand factor, type A [Raphidiopsis brookii D9] gi|281197354|gb|EFA72251.1| von Willebrand factor, type A [Raphidiopsis brookii D9] Length = 464 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 40/227 (17%), Positives = 73/227 (32%), Gaps = 43/227 (18%) Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 S V+D S SM ++ D Y K Sbjct: 42 PPTSFTFVIDTSGSMYEVVAGDVEDT----GVTYQQDGKEYKQ--------------VTG 83 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 KID++IES LVNS + Q+ R+ + ++ + ++++ Sbjct: 84 GKSKIDIVIESLLRLVNSGKLKQQD------RVSIVQFDDSASQIIGLTSATETKQIETA 137 Query: 266 LNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + KL T + A+ L ++ + K + TDG+ + Sbjct: 138 IKKLRDFSGGTRMGLGLRRAFDILSEQEMTV----------KRALLFTDGQT------FD 181 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAV 370 I + + I ++ V +DLL +D + G+ F V Sbjct: 182 EDQCQSIANHFATRNIPITALGVG-EEFNEDLLTHLSDYTGGKLFYV 227 >gi|148699894|gb|EDL31841.1| procollagen, type VI, alpha 1, isoform CRA_c [Mus musculus] Length = 227 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 23/151 (15%), Positives = 55/151 (36%), Gaps = 13/151 (8%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS---NNLNE 261 P +D + +++++ + + ++ A + L+ + +E Sbjct: 53 PYGALVDKVKSFTKRFIDNLRD-RYYRCDRNLVWNAGALHYSDEVEIIRGLTRMPSGRDE 111 Query: 262 VKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +K+ ++ + + T T A+ EL SH K++I +TDG Sbjct: 112 LKASVDAVKYFGKGTYTDCAIKKGLEELL--IGGSHLKEN-----KYLIVVTDGHPLEGY 164 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 ++ G+K++SVA++ Sbjct: 165 KEPCG-GLEDAVNEAKHLGIKVFSVAITPDH 194 >gi|52078560|ref|YP_077351.1| YabS [Bacillus licheniformis ATCC 14580] gi|52783922|ref|YP_089751.1| YabS [Bacillus licheniformis ATCC 14580] gi|52001771|gb|AAU21713.1| YabS [Bacillus licheniformis ATCC 14580] gi|52346424|gb|AAU39058.1| YabS [Bacillus licheniformis ATCC 14580] Length = 245 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 12/90 (13%), Positives = 38/90 (42%), Gaps = 10/90 (11%) Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRK---CTDS 363 ++ +TDG ++ + + + + G+ + + + E Q+ +++ + Sbjct: 9 ILLLTDGCSNRGE------DPQAMAAFAKEQGITVNVIGIMDEHEMDQEAMKEVEGIALA 62 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G V + +L ++ +T K Q+++ Sbjct: 63 GGGVHQVVYTSQLSQTVQMVTKKAMTQTLQ 92 >gi|328872160|gb|EGG20527.1| hypothetical protein DFA_00388 [Dictyostelium fasciculatum] Length = 2097 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 48/346 (13%), Positives = 107/346 (30%), Gaps = 30/346 (8%) Query: 38 LSGCASIVSDRTIKDPTTKK-DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 LS + +++ +P+ + + + + + Q Y + + D Sbjct: 1636 LSKAMETLLKQSLDNPSLTEFVKIQKFVELVEQVSVDQSLYTSNAQKKQLAQDTASNEVD 1695 Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + L T L+ I+ ++L+ L Sbjct: 1696 AVGGSSTAVTTTH---------VSNSLTNREKTQLNEMELNILFGYVKDLSTLKACSLVN 1746 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 + H + + T S +I+ + Sbjct: 1747 KLWRKVTLYPTHW---TRFIGNIGGDERDLDLVFLVDNTGSMSG--------EIEQCKDK 1795 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN--QCTPLSNNLNEVKSRLNKLNPYEN 274 +V+ I N++VR+G + YN V N Q ++N+ ++K++L+ + Y Sbjct: 1796 IKEIVDDISAI----GNVNVRVGMVFYNDHPVSNVCQVFDFTDNIAKMKTQLSSVTVYGG 1851 Query: 275 TNTYPAMHHAYRELY--NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 + AM + E+ + +S + F +G++ N Sbjct: 1852 DDEPEAMADGFYEVNKLSFTPNSTRVLVLIGDANPHGFGGNGDHYPGGCPCNHDIIELAR 1911 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 + + + IYSV + +D S G+ F ++++ EL Sbjct: 1912 KLVIEKRVTIYSVLCRPANHTYQVFSALSDLSEGRLFTLSNAAELS 1957 >gi|218671335|ref|ZP_03521005.1| hypothetical protein RetlG_06538 [Rhizobium etli GR56] Length = 49 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 14/48 (29%), Positives = 20/48 (41%) Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 AP GQ LL+ C + +F +L +F I K Q R+ Sbjct: 1 MAPEGGQALLQYCASDASHYFQAEKMEDLFAAFKAIGAKASTQVTRLT 48 >gi|108758819|ref|YP_634592.1| hypothetical protein MXAN_6470 [Myxococcus xanthus DK 1622] gi|108462699|gb|ABF87884.1| conserved hypothetical protein [Myxococcus xanthus DK 1622] Length = 914 Score = 48.0 bits (112), Expect = 0.002, Method: Composition-based stats. Identities = 15/123 (12%), Positives = 40/123 (32%), Gaps = 17/123 (13%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + + A+ E+ S + + V+ +D + Sbjct: 462 LDAVARGFSGGGGIYVGEALRAGRTEIL----------RSEKPTRHVLLFSDAAD----- 506 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCT-DSSGQFFAVNDSRELLES 379 + + + ++R + + + + P + DLLR+ G+ + D+ L Sbjct: 507 SEEPDDYQRTLAHLREQEVTVSVIGLGVPSDPDADLLREVAHRGGGRVYFAEDAMSLPRI 566 Query: 380 FDK 382 F + Sbjct: 567 FSQ 569 >gi|315126093|ref|YP_004068096.1| pilin biogenesis protein [Pseudoalteromonas sp. SM9913] gi|315014607|gb|ADT67945.1| pilin biogenesis protein [Pseudoalteromonas sp. SM9913] Length = 1057 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 40/303 (13%), Positives = 89/303 (29%), Gaps = 74/303 (24%) Query: 152 MVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN----TTKSKYAPAPAPAN 207 M+ D S SM ++ Y + K + +N S Y P ++ Sbjct: 44 MIFDTSGSMAWDVNDGDACYKRSGNSYYEVDCFESKDSYKRNEQCYKRTSYYNYEPTCSD 103 Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++ V + LVN N + G + +N G + + + +++ Sbjct: 104 SRLRVAQNAITQLVND---------NDDIEFGLMRFNGSNGGYVLARVGAEKSFLLNKVE 154 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK--------------------- 306 +L +T + AY + + Sbjct: 155 ELPASGSTPLTETLWEAYLYITGQNVYYGGNTNERDFTAESSGIYKSPFAPVTGEPLRCD 214 Query: 307 ---FVIFITDGENSGASAYQNTLNTL------------------QICEYM---------- 335 +I +TDG+ + S N + L + + + Sbjct: 215 NSINIILMTDGDPTNDSNRNNDIYNLHNSYFSGNPPAVSNSYLAALAKIIHGTDDVKVDL 274 Query: 336 --RNAGM----KIYSVAVSAP--PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 + + +++ + G++LL + D GQ+ N + +L ++ + Sbjct: 275 YSKTKNVLDIGRVFPIGFGTGMSDSGKNLLTETADYGGGQYLHANTAAQLSDALKNTISR 334 Query: 387 IQE 389 I+E Sbjct: 335 IRE 337 >gi|256811138|ref|YP_003128507.1| Magnesium chelatase [Methanocaldococcus fervens AG86] gi|256794338|gb|ACV25007.1| Magnesium chelatase [Methanocaldococcus fervens AG86] Length = 246 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/253 (11%), Positives = 76/253 (30%), Gaps = 32/253 (12%) Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 + + + I+ + + +K N + + + ++ S + Sbjct: 8 FKPANDKIVDIAFDATFRRAAIHQKKRREKANKKLAIYLEREDIVEKVRQRKISSHILFV 67 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 A A ++++ + +L+ + +IG IA+ P ++ Sbjct: 68 VDASGSMGAMKRMEAAKGAIISLL-------LDAYQKRNKIGMIAFR-KDRAELILPFTS 119 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 ++ + L L T A +Y E + + +I I+D + + Sbjct: 120 SVELGEKLLKDLPTGGKTPLADAFIKSYEVFDREI------RKNPNIIPIMIVISDFKPN 173 Query: 318 GASAYQNTLNTLQICEYMRNAGMKI--------YSVAVSAPPEGQDLLRKCTDS-SGQFF 368 A CE + G+ + + + ++ D +++ Sbjct: 174 VAVKGDYIKEVFDACEKIVEKGINVVLIDTEPQSFIKIGIG-------KEIADRFGFKYY 226 Query: 369 AVN--DSRELLES 379 + +LL+ Sbjct: 227 KIEELSKDKLLDI 239 >gi|187251529|ref|YP_001876011.1| von Willebrand factor type A [Elusimicrobium minutum Pei191] gi|186971689|gb|ACC98674.1| Von Willebrand factor type [Elusimicrobium minutum Pei191] Length = 335 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 26/187 (13%), Positives = 57/187 (30%), Gaps = 56/187 (29%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEK 292 R G +A+ P++N++ +K +N+L P + T PA+ A L Sbjct: 131 RTGIVAFTSKAYTQ--CPITNDVEALKYFVNQLRPEMLNAKGTALAPAVQRAAEMLSKYP 188 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 K +I +TDGE + + + G+KI +V + Sbjct: 189 GK-----------KALILLTDGE------DHEPEQIEEAIKTAQKEGIKIIAVGIGTEEG 231 Query: 353 ----------------------------GQDLLRKCTDSSG----QFFAVND-SRELLES 379 + L++ +G ++ + ++ + Sbjct: 232 EPIPEKIEGGRVLEYKKDADGKTVITKLDEKSLKELASKTGGVYIKYKNAQTVAAQIAQV 291 Query: 380 FDKITDK 386 + + Sbjct: 292 LEDLDKN 298 >gi|149375210|ref|ZP_01892982.1| hypothetical protein MDG893_06314 [Marinobacter algicola DG893] gi|149360574|gb|EDM49026.1| hypothetical protein MDG893_06314 [Marinobacter algicola DG893] Length = 658 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 31/235 (13%), Positives = 73/235 (31%), Gaps = 27/235 (11%) Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 L P + + S P ++ + L + Sbjct: 16 TTALAQQAPTLQLPDSADVRIIVDISGSMKTNDPN------NLRRPAVRLLARMLPGQAN 69 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + + G V + L+ ++ +N + TN A+ A + Sbjct: 70 AGVWTFGQYVNMLVPHGKVTDDWRGLAVERSD---EINSVALR--TNLGEAIQVASDDYL 124 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL-----QICEYMRNAGMKIYS 344 +S NT I +TDG+ + + + + + G +++ Sbjct: 125 LGSDSLDNT--------DFILLTDGKVDISDNESANDRERERILGALLDELSSRGATLHT 176 Query: 345 VAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESF-DKITDKIQEQSVRIAPN 397 VA+ + LL+ + + G++ + + L +F + + + +Q + I N Sbjct: 177 VAL-SEEADLALLKSLAERTGGRYALASSADALTLAFLEALNTAVPQQQIPIEDN 230 >gi|327270778|ref|XP_003220165.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 925 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 64/188 (34%), Gaps = 34/188 (18%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 N +I L ++A + +G + +N + Sbjct: 310 VSGSMSGNNRIARLKQAAETFILQ-------NIEDGSWVGIVTFNNAATIQTGLQQVVS- 361 Query: 260 NEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + V+ LN ++ TN + ++ + ++ ++ +TDGE+ Sbjct: 362 DTVRKTLNGYLPISANGGTNICAGVQKGFQVFSS--------KYASTEGCEIVLLTDGED 413 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN---D 372 SG S+ ++ +G I+++A+ P L D + G F+ D Sbjct: 414 SGLSSCF---------AEVQRSGSVIHTIALG--PSAAKELEMLADMTGGLKFSATDSVD 462 Query: 373 SRELLESF 380 S L ++F Sbjct: 463 SSSLEDAF 470 >gi|301768026|ref|XP_002919432.1| PREDICTED: calcium-activated chloride channel regulator 1-like [Ailuropoda melanoleuca] Length = 913 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 52/133 (39%), Gaps = 25/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + A+ + + + ++ +TDGE++ S+ N + Sbjct: 379 ASGGTSICSGLRSAFAVIRKKFSTD---------GSEIVLLTDGEDNTISSCFNEV---- 425 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKITD-- 385 + +G I++VA+ P ++L + G +D + L+++F ++ Sbjct: 426 -----KQSGAVIHTVALG-PSAAKELEELSKMTGGLQTYASDQAQNNGLIDAFGALSSGN 479 Query: 386 -KIQEQSVRIAPN 397 ++++++ Sbjct: 480 GAASQRAIQLESK 492 >gi|212635869|ref|YP_002312394.1| Von Willebrand factor, type A [Shewanella piezotolerans WP3] gi|212557353|gb|ACJ29807.1| Von Willebrand factor, type A [Shewanella piezotolerans WP3] Length = 710 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/159 (15%), Positives = 57/159 (35%), Gaps = 20/159 (12%) Query: 238 IGTIAYNIGIVGNQCTPLSN---NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + +N + TPL+ N+ ++ + +L T A+ A + +E Sbjct: 386 FNILQFNSNVYALSDTPLNASAKNIGRAQAYVQRLQANGGTEMSLALDKALSQQDANRER 445 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + V+FITDG QI ++ + +++++ + P Sbjct: 446 ----------LRQVLFITDGAV-----GNEPQLFTQIRNQLQQS--RLFTIGIGDAPNAH 488 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + R G + + E+ + DK+++ +V Sbjct: 489 FMQRAAELGRGTYTYIGKQSEVKSKMVAMLDKLEKPTVT 527 >gi|116625363|ref|YP_827519.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116228525|gb|ABJ87234.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 317 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 37/211 (17%), Positives = 71/211 (33%), Gaps = 34/211 (16%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ID+ +A + + + T + N Sbjct: 79 VDVSGSQRNLIDIERSAASQFFREVLRKKDLAFLIMFGEETELLQDYTGSPRLLTEGLNH 138 Query: 260 NEVKSRLNKLNPYE--------NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 EV S ++ ++P T Y A++ A E + K ++ I Sbjct: 139 LEVSSGVSGIHPGPVPTMGGPRGTVLYDAVYLAANEKLKGEVGR----------KVIVVI 188 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV------AVS----APPEGQDLLRKCT 361 TDG + G+ +N Q E + + IYS+ A G+ LRK + Sbjct: 189 TDGVDQGSRMSRN-----QAIEAAQKSDCVIYSIDYSDPRAYGPFNMVGGGGEGELRKMS 243 Query: 362 D-SSGQFFAVNDSRELLESFDKITDKIQEQS 391 D + G+ + V+ L + F ++ ++++ Q Sbjct: 244 DETGGRVYKVDRRHTLDQVFKELQEEMRSQY 274 >gi|222635792|gb|EEE65924.1| hypothetical protein OsJ_21784 [Oryza sativa Japonica Group] Length = 578 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 65/179 (36%), Gaps = 27/179 (15%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL----SNNLNEV 262 ++DVL S ++ + R+ +A+N G V + L + + Sbjct: 72 ATRLDVLKASMKFIIRKLDDGD--------RLSIVAFNDGPVKEYSSGLLDVSGDGRSIA 123 Query: 263 KSRLNKLNPYENTNT--YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++++L P + A + L S F++ +TDG+++ Sbjct: 124 GKKIDRLQAVVAVALRLCPELQEAVKILDE------RQGNSRNRVGFILLLTDGDDTTGF 177 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + + + +++ A+ A + + LL +S G + V+D L + Sbjct: 178 RWSRDVIHGAVGK------YPVHTFALGAAHDPEALLHIAQESRGTYSFVDD-GNLDKI 229 >gi|239825644|ref|YP_002948268.1| hypothetical protein GWCH70_0060 [Geobacillus sp. WCH70] gi|239805937|gb|ACS23002.1| conserved hypothetical protein [Geobacillus sp. WCH70] Length = 244 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 40/103 (38%), Gaps = 10/103 (9%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV-SAPPEGQDLLR 358 + ++ ITDG ++ + + + + G+ + + V + LR Sbjct: 1 MRKGTLRQILLITDGCSNHGE------DPIAMASLAKEQGITVNVIGVLDQDTIDESGLR 54 Query: 359 K---CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + S G V +++L ++ +T K Q+++ NR Sbjct: 55 EIEGIALSGGGISQVVYAKQLSQTVQMVTRKAMTQTLQGVVNR 97 >gi|260824071|ref|XP_002606991.1| hypothetical protein BRAFLDRAFT_64975 [Branchiostoma floridae] gi|229292337|gb|EEN63001.1| hypothetical protein BRAFLDRAFT_64975 [Branchiostoma floridae] Length = 179 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 14/105 (13%), Positives = 30/105 (28%), Gaps = 15/105 (14%) Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A L + + V+ +TDG + + + G++ + Sbjct: 19 ANTMLLPLNGNRPDAPD------VVVVLTDGFSGDDVTGP--------ASLLHSMGVQTF 64 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + V L + + ++D L D I +I Sbjct: 65 VIGVG-GCVSDAQLGNIANCEDHVYKLSDFNGLDGITDSIHKQIC 108 >gi|126352405|ref|NP_001075268.1| calcium-activated chloride channel regulator 1 precursor [Equus caballus] gi|122142874|sp|Q2TU62|CLCA1_HORSE RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 1; Short=eCLCA1; Flags: Precursor gi|46578151|gb|AAT01505.1| putative calcium activated chloride channel-like protein 1 [Equus caballus] Length = 913 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 18/119 (15%), Positives = 44/119 (36%), Gaps = 24/119 (20%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + A+ + ++ +TDGE++ S+ N + Sbjct: 379 ASGGTSICSGLRSAFTVI---------RKKYKTDGSEIVLLTDGEDNTISSCFNEV---- 425 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQ-DLLRKCTDSSGQFFAVNDSRE---LLESFDKITD 385 + +G I++VA+ + + L K + G +D + L+++F ++ Sbjct: 426 -----KQSGAIIHTVALGPSAAAELEELSKM--TGGLQTYASDQAQNNGLIDAFGALSS 477 >gi|281352224|gb|EFB27808.1| hypothetical protein PANDA_008060 [Ailuropoda melanoleuca] Length = 911 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 18/133 (13%), Positives = 52/133 (39%), Gaps = 25/133 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + A+ + + + ++ +TDGE++ S+ N + Sbjct: 379 ASGGTSICSGLRSAFAVIRKKFSTD---------GSEIVLLTDGEDNTISSCFNEV---- 425 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKITD-- 385 + +G I++VA+ P ++L + G +D + L+++F ++ Sbjct: 426 -----KQSGAVIHTVALG-PSAAKELEELSKMTGGLQTYASDQAQNNGLIDAFGALSSGN 479 Query: 386 -KIQEQSVRIAPN 397 ++++++ Sbjct: 480 GAASQRAIQLESK 492 >gi|300781256|ref|ZP_07091110.1| secreted Mg-chelatase subunit [Corynebacterium genitalium ATCC 33030] gi|300532963|gb|EFK54024.1| secreted Mg-chelatase subunit [Corynebacterium genitalium ATCC 33030] Length = 528 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 15/126 (11%), Positives = 36/126 (28%), Gaps = 16/126 (12%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + + +N L T Y + A R ++ ++DGE + Sbjct: 414 KAKYQGYVNDLVADGQTAIYDTLFDALRSSDPNAG-----------ISSIVLLSDGEVTH 462 Query: 319 A-SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 Y + + ++ + ++ + G F + +L Sbjct: 463 GMDYYAFEKQYQGLSPE--QRSIPVFVILYG-EANASEMNNLAELTGGAVFDALN-GDLD 518 Query: 378 ESFDKI 383 +F +I Sbjct: 519 AAFKEI 524 >gi|156603358|ref|XP_001618818.1| hypothetical protein NEMVEDRAFT_v1g224783 [Nematostella vectensis] gi|156200502|gb|EDO26718.1| predicted protein [Nematostella vectensis] Length = 147 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 20/124 (16%), Positives = 47/124 (37%), Gaps = 9/124 (7%) Query: 260 NEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 VK+ LN++ T A+ A ++L+ +K +KK ++ + G + Sbjct: 6 ANVKTLLNRVRHQRGYTFADKALTLANQQLFTKKAGMRIA-----VKKVLLVLMHGRQTT 60 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 LN + + +++ G++I+++ V + L + V + Sbjct: 61 HKGPATPLN--RAAQGLKDKGVEIWALGVG-DEVSRAELIDVASDLDKVITVGSFGAMKG 117 Query: 379 SFDK 382 D+ Sbjct: 118 IVDE 121 >gi|47168593|pdb|1Q0P|A Chain A, A Domain Of Factor B Length = 223 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 34/209 (16%), Positives = 68/209 (32%), Gaps = 31/209 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + +S NL+ + + + V T V S+N + Sbjct: 25 SDSIGASNFTGAKKSLVNLIEKVASYGVKPRYGLVTYATYPKIWVKVSEAD---SSNADW 81 Query: 262 VKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 V +LN++N TNT A+ Y + + R + +I +TDG + Sbjct: 82 VTKQLNEINYEDHKLKSGTNTKKALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLH 139 Query: 317 SGASAYQNTLNTLQICEYMRN-------------AGMKIYSVAVS--APPEGQDLLRKCT 361 + + + + + +R+ + +Y V + L Sbjct: 140 NMGG------DPITVIDEIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKK 193 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQ 390 D+ F V D L + F ++ D+ Q Sbjct: 194 DNEQHVFKVKDMENLEDVFYQMIDESQSL 222 >gi|1708567|sp|P53710|ITA2_BOVIN RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|439696|gb|AAB59255.1| integrin alpha 2 subunit [Bos taurus] Length = 1170 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 40/343 (11%), Positives = 85/343 (24%), Gaps = 42/343 (12%) Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + L + + + + + ++ TE Sbjct: 40 AVQQFINPKGNWLLVGSPWSGFPKNRMGDVYKCPVDLSTTTCEKLNLQTSTSMSNVTEMK 99 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 L + N+ S V + + Sbjct: 100 TNMSLGLTLTRNVGTGGFLTCGPLWAQQCGSQYYTTGVCSDVSPDF----------QLRT 149 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 P + + + P D + V + + ++ Sbjct: 150 SFAPAVQTCPSFIDVVVVCDESNSIYPW----DAVKNFLEKFVQGL-----DIGPTKTQM 200 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSH 296 G I Y + +E+ ++ Y TNT+ A+ +A Y+ Sbjct: 201 GLIQYANNPRVVFNLNTFKSKDEMIKATSQTFQYGGDLTNTFKAIQYARDTAYSTAAGGR 260 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV----SAPPE 352 K ++ +TDGE+ S + + + + +AV + Sbjct: 261 PGAT-----KVMVVVTDGESHDGS------KLKAVIDQCNKDNILRFGIAVLGYLNRNAL 309 Query: 353 GQDLL----RKCTD--SSGQFFAVNDSRELLESFDKITDKIQE 389 L + + FF V+D +LLE I ++I Sbjct: 310 DTKNLIKEIKAIASIPTERHFFNVSDEADLLEKAGTIGEQIFS 352 >gi|333030668|ref|ZP_08458729.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] gi|332741265|gb|EGJ71747.1| von Willebrand factor type A [Bacteroides coprosuis DSM 18011] Length = 342 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/179 (13%), Positives = 65/179 (36%), Gaps = 51/179 (28%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++N+ K L+ ++P + T+ A+ A R ++ + + Sbjct: 142 TQLPITNDFVSAKMFLSSISPNLIERQGTSIGKAVDLATRSFTSQ----------EGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 +I ITDGE+ + L+ + +AG+++ + V +P Sbjct: 192 TIILITDGED-------HEPGALEAVKRAVDAGIQVNVMGVGSPDGAPIPVTERNGSEYR 244 Query: 353 ------------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 +++ ++ +G + V+++ + IT +I + + ++ Sbjct: 245 KDNEGNVVVTKLNEEMAQEIAKAGNGLYVRVDNT---NAAQKAITSEINKLTKTNIESK 300 >gi|321475776|gb|EFX86738.1| hypothetical protein DAPPUDRAFT_307863 [Daphnia pulex] Length = 829 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 40/338 (11%), Positives = 101/338 (29%), Gaps = 31/338 (9%) Query: 69 KKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSAL 128 + Q E + K + + QY++ ++ Sbjct: 161 DNEIAQIIRTNETTVVSVVYEPTEAEQIKMSKDGLQGQFVVQYDVDRSSIE--KKGGEIH 218 Query: 129 TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 + L + VLD S SM +++ N + + Sbjct: 219 VVDGYFVHFFVPADLPTLPKHVIFVLDTSGSMAGTRIEQTKQAMN---SILDQLRKDEDI 275 Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 F +++ + + + ++ ++ + + Sbjct: 276 FSV-----VEFSSGVTEWDLRKPYKGPDHYYFNSPPEETTEDATAVPQNNESEVKFGPYD 330 Query: 249 GNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P++ ++ K + ++ +TN A+ A + + + T Sbjct: 331 DILAYPVTEQSVKRAKEFVAAMDVTSSTNINDALLLALKNSQSVQSRVRLTP-------I 383 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAG----MKIYSVAVSAPPEGQDLLRKCTDS 363 +IF+TDGE + + +T +I + +R + I+ +A + Q L + + + Sbjct: 384 IIFLTDGEPTASVT-----DTTEILKNVRKGNSDDVVSIFCLAFGTGTDYQFLTKISSQN 438 Query: 364 SGQF---FAVNDSR-ELLESFDKITDKIQEQSVRIAPN 397 G + D+ +L FD++ + + N Sbjct: 439 RGFARKIYEAADATLQLKGFFDEVASPLLSNVNFVYNN 476 >gi|320537260|ref|ZP_08037220.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] gi|320145888|gb|EFW37544.1| von Willebrand factor type A domain protein [Treponema phagedenis F0421] Length = 319 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 16/98 (16%), Positives = 35/98 (35%), Gaps = 13/98 (13%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 PL+ + + L+P T + + A +++ + K +I Sbjct: 135 SVPLTFEHEVLLKAIESLSPSSYTASGTNLQKAL------LKAAAVFPKNRATAKTIILC 188 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 TDGE S + L+ + ++ G+++ V Sbjct: 189 TDGEQSEGNI-------LEAAKEIQRHGIQLIIVGFGT 219 >gi|260771476|ref|ZP_05880401.1| hypothetical protein VFA_000095 [Vibrio furnissii CIP 102972] gi|260613602|gb|EEX38796.1| hypothetical protein VFA_000095 [Vibrio furnissii CIP 102972] Length = 407 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 25/156 (16%), Positives = 49/156 (31%), Gaps = 4/156 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + I + V +AID+ H M R ++Q+++DAA L+ + D T T T Sbjct: 20 IVTIAMLVLIAVAAFAIDINHAMMNRTKLQNSVDAAALAAAIVLDKDGTEAQADTIARST 79 Query: 61 ST---IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 T + L + + + ++P ++ E+ Sbjct: 80 LTKMSTAAGNAELTLDVSNVVNVEVQFSNDPTVFPDSGYSSSPDGDRYVRVVINQLDLES 139 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 F ++ L+ + I V Sbjct: 140 FFFARVMGV-TKRLTASAVAGPSPGGNACNIVPMAV 174 >gi|156400007|ref|XP_001638792.1| predicted protein [Nematostella vectensis] gi|156225915|gb|EDO46729.1| predicted protein [Nematostella vectensis] Length = 974 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 57/196 (29%), Gaps = 35/196 (17%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-- 270 A + ++ K V + + +N ++ +LN + Sbjct: 90 YQACANYATSIVRNLDMPKTGARVGTVIYSKRSIV----LFDFKDNKTDIILQLNTIKYR 145 Query: 271 --PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 P +T A+ A + L+ KK I +T ++ + Sbjct: 146 EKPNRALSTGQALELAQKRLFQNARR--------NSKKIAILVTGEKSQDDVIIPS---- 193 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA--VNDSRELLESF------ 380 + M+++G+ IY+V V L S + V EL Sbjct: 194 ----KLMKDSGVIIYTVGVG-EGYFLPQLESIASSPSYVYTEKVKGIGELATRMVEGVCK 248 Query: 381 --DKITDKIQEQSVRI 394 + I K + V + Sbjct: 249 EKENIATKEGSKKVNV 264 >gi|123498822|ref|XP_001327483.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121910413|gb|EAY15260.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 729 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 40/355 (11%), Positives = 102/355 (28%), Gaps = 54/355 (15%) Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 K+QI + + +N + I N +Q ++ + +I F K Sbjct: 81 EFQKEQIGSQSDEYTSTIQNYFGSGYSSFILGCIVPNKEVQIHLKASSNADINENGYFYK 140 Query: 122 GLIPSALTN--LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 + + + + L + +++ ++ + N + Sbjct: 141 IPVNRQYIPGKFEFSTKIKTRKEIKELIAPVDGIMNAIDLHNISFVTQELPKENSIFIET 200 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPAN-------------------RKIDVLIESAGNL 220 + K S + S +I L Sbjct: 201 RIKDKDKSIAVSSDGYISISTYEYFEGKIYANSEFYFIIDCSGSMEESRIKNAKFCLNLL 260 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYP 279 ++S+ + + + ++ N+ + ++N+++ E T+ Sbjct: 261 IHSLPVGCRFSIIKFGSMYEVVLPTCDYTDE------NVAKAMEQINQMDANMEGTDILS 314 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 + + K V +TDGE+ + + + L N Sbjct: 315 PLKFVSDQ-----------STKEGFIKQVFLLTDGED----IHTDQIYALVQANRTNNR- 358 Query: 340 MKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 I+++ + + ++L++ S G + D+ E K+ +KI E + Sbjct: 359 --IFTIGIGSGA-DRNLIKNIARISGGNNALIEDNDE------KMNEKIIELLRK 404 >gi|118364359|ref|XP_001015401.1| Helicase conserved C-terminal domain containing protein [Tetrahymena thermophila] gi|89297168|gb|EAR95156.1| Helicase conserved C-terminal domain containing protein [Tetrahymena thermophila SB210] Length = 2730 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 17/226 (7%), Positives = 71/226 (31%), Gaps = 20/226 (8%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 ++ ++ + + + + + + +++ + + + I Sbjct: 2405 EQIYYKTLIAMDVTGSMSSLITQTKNTIQTTFEQTRDILKEKGYDPQCFLIMISCFRSYN 2464 Query: 246 GIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 + N ++++S L + T ++ + + + + Sbjct: 2465 SRWQEIFQTSTWENNPDKLRSFLQTITASGGTGPGESVEVGLWWANKQNDENPISQVIIL 2524 Query: 304 LKKFVIFITDGE---NSGASAYQNTLNTLQI------CEYMRNAGMKIYSVAVSAPPEGQ 354 + + + N +Y +T + C+ + + + + + + + + Sbjct: 2525 GDQPAHLQNEAQAHRNQFGQSYWDTTPLKGLTYYVPECQKLNSKKIHVNAFYL--HSQAK 2582 Query: 355 DLLRKCTD-SSG--QFFAVNDSR---ELLESF-DKITDKIQEQSVR 393 ++G Q+ +N ++ +L F ++I I ++ R Sbjct: 2583 STFENIAKLTNGISQYLDINSAQSSKQLTTLFVEQILKDIGKKDGR 2628 >gi|304393605|ref|ZP_07375533.1| conserved hypothetical protein [Ahrensia sp. R2A130] gi|303294612|gb|EFL88984.1| conserved hypothetical protein [Ahrensia sp. R2A130] Length = 170 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 11/62 (17%), Positives = 25/62 (40%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A + + A++++ +++Q+Q A D A LS A++ + Sbjct: 32 IFAAAALPMAIGVAGALEISQYSQLKSQLQEASDRAALSAMAALREGPRAMRQQARLVMK 91 Query: 61 ST 62 T Sbjct: 92 QT 93 >gi|126631586|gb|AAI34052.1| Col6a2 protein [Danio rerio] Length = 310 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/141 (16%), Positives = 50/141 (35%), Gaps = 16/141 (11%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYREL 288 N + G+ +Q + + + ++ L P T+ A+ + ++L Sbjct: 77 NYRGSVNITWAIGGLHFSQLQEFFSTITTKEKFISNLRPIRYLGRGTHIDCAITNMTKQL 136 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 K+F + ITDG + + E R+ ++I++VA S Sbjct: 137 V-------RFPSRPDAKRFAVVITDGHVTANPCGGIKV----AAERARDENIRIFAVA-S 184 Query: 349 APPEGQDLLRKCTDSSGQFFA 369 + + LR+ +S + Sbjct: 185 SRNLEETGLREIANSPAGVYR 205 >gi|125829706|ref|XP_696164.2| PREDICTED: collagen alpha-2(VI) chain [Danio rerio] Length = 1015 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 23/141 (16%), Positives = 50/141 (35%), Gaps = 16/141 (11%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYREL 288 N + G+ +Q + + + ++ L P T+ A+ + ++L Sbjct: 77 NYRGSVNITWAIGGLHFSQLQEFFSTITTKEKFISNLRPIRYLGRGTHIDCAITNMTKQL 136 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 K+F + ITDG + + E R+ ++I++VA S Sbjct: 137 V-------RFPSRPDAKRFAVVITDGHVTANPCGGIKV----AAERARDENIRIFAVA-S 184 Query: 349 APPEGQDLLRKCTDSSGQFFA 369 + + LR+ +S + Sbjct: 185 SRNLEETGLREIANSPAGVYR 205 >gi|145596106|ref|YP_001160403.1| von Willebrand factor, type A [Salinispora tropica CNB-440] gi|145305443|gb|ABP56025.1| von Willebrand factor, type A [Salinispora tropica CNB-440] Length = 576 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 53/145 (36%), Gaps = 13/145 (8%) Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGST 302 + PLS+ +E++ L ++ P +T + + AY + + + Sbjct: 435 RDYRRLVEIGPLSSQRSELEQALAQIQPTRGDTGLFDTVLAAYEAVQEDWDEGQVNS--- 491 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKC 360 ++ TDG+N + + + E +++ +++ + + A +L Sbjct: 492 -----IVLFTDGKNDDDNGI-SQQQLIAELERIKDPERPVQVVLIGIGADVSKAELESIT 545 Query: 361 TDSSGQFFAVNDSRELLESF-DKIT 384 + G F D ++ + F I Sbjct: 546 EVTGGGSFITEDPTKIGDIFLKAIA 570 >gi|319902110|ref|YP_004161838.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] gi|319417141|gb|ADV44252.1| von Willebrand factor type A [Bacteroides helcogenes P 36-108] Length = 342 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 21/168 (12%), Positives = 54/168 (32%), Gaps = 46/168 (27%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++N+ K L ++P + T A+ A R + + + Sbjct: 142 TQLPITNDYISAKMFLESISPSLISKQGTAIGAAISLATRSFTPQ----------EGIGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 +I ITDGEN + + G+++ + V P Sbjct: 192 AIIVITDGENHEG-------GVAEAAKTATEKGIQVNVLGVGMPDGAPIPVEGTNDYRRD 244 Query: 353 ----------GQDLLRKCTDSSGQFF-AVNDSRELLESFDKITDKIQE 389 +++ ++ + + V+++ ++ + +K+ + Sbjct: 245 REGNVIVTRLNEEMCQEIAKAGNGIYVRVDNTNSAQKAISQEMNKMAK 292 >gi|319639064|ref|ZP_07993821.1| PilC protein [Neisseria mucosa C102] gi|317399642|gb|EFV80306.1| PilC protein [Neisseria mucosa C102] Length = 1123 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 31/236 (13%), Positives = 66/236 (27%), Gaps = 25/236 (10%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID--VLIESAGNLVNSIQKAIQE- 230 + P + +I+ + S K I + Sbjct: 186 ANMSCSNQVPGEDPRLSRNTNFNYDRDYYYSNYYHRIERSANTFAYQYFGPSEVKTIDDA 245 Query: 231 ---KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK----SRLNKLNPYENTNTYPAMHH 283 K R G Y + + L N ++ + +P+ + + A Sbjct: 246 YGPGKLFDDRHGFKGYFDFPIYQYESFLPENERKLLCQSSKYAREYSPFYKRDIWVA--- 302 Query: 284 AYRELYNEKESSHNTI-------GSTRLKKFVIFITDGENSGA-SAYQNTLNTLQICEYM 335 A + + ++ T +K + DG + S + + + Sbjct: 303 AGEIIVPYWDRNYKDEKRGLRFFSRTLAEKDIKTAKDGLDDAGKSWDGDPSDPKGV--DY 360 Query: 336 RNAGMKIYSVAVSAPPE--GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 ++ ++V G++ L K +F +LLE+F I D I+ Sbjct: 361 SKQLVQTFTVGFGEGISEVGREYLEKGASRPDWYFNAAKKEDLLEAFKTIVDNIEN 416 >gi|301166645|emb|CBW26221.1| putative membrane protein [Bacteriovorax marinus SJ] Length = 287 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 32/204 (15%), Positives = 62/204 (30%), Gaps = 37/204 (18%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 L F KN K+ + L+ S+ +I Sbjct: 104 LDVDVGFKFKGKNIVFLIDVSGSMKTLDKMGQVKAGLKMLITSMPS--------DYQIDV 155 Query: 241 IAYNIGIVGNQCTPLSNN-------LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 I + + S +V LN+LNP T T A+ +A + + + Sbjct: 156 IHFPGKRGARYYSLWSYTQKLGERQKKDVYRFLNRLNPKGATPTRSALKYALTKYPDLTD 215 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAP- 350 V+ ++DG + ++ + + E ++ ++I ++ V A Sbjct: 216 --------------VVLLSDGAPTKMNSSEYDDIKDILSEVKKDNFKNIQINTIGVGAAF 261 Query: 351 -----PEGQDLLRKCTDSSGQFFA 369 L++ SG FF Sbjct: 262 SLQSTTPASVFLKELAKQSGGFFY 285 >gi|260837282|ref|XP_002613634.1| hypothetical protein BRAFLDRAFT_93675 [Branchiostoma floridae] gi|229299020|gb|EEN69643.1| hypothetical protein BRAFLDRAFT_93675 [Branchiostoma floridae] Length = 1460 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 26/204 (12%), Positives = 61/204 (29%), Gaps = 24/204 (11%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 P K + + S P ++++ ++A +++++ +S Sbjct: 1160 ASPKKKNVVIVMDVSGSMREPPGPEEQNRLNLAKQAALTVLDTL-TPRDWGGVVSFSARA 1218 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + + N+ +K +N+ P T A+ + Sbjct: 1219 ETPEGCLGDSLGEANPTNIGIMKDFINQRVPETITMYGVGFRKAFDMFAEARNKKPEQFE 1278 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA----------- 349 +IF++DG + LN + + + + + I++ + A Sbjct: 1279 DCY--NIIIFLSDGSPTDKDFA---LNAITQGQELMDRSVYIFTYGLGANLMWASSQWAP 1333 Query: 350 PPEGQ-------DLLRKCTDSSGQ 366 P Q D LR D + Sbjct: 1334 DPNNQYVYLPALDFLRTIADQNNH 1357 >gi|223940571|ref|ZP_03632417.1| von Willebrand factor type A [bacterium Ellin514] gi|223890763|gb|EEF57278.1| von Willebrand factor type A [bacterium Ellin514] Length = 965 Score = 48.0 bits (112), Expect = 0.003, Method: Composition-based stats. Identities = 19/103 (18%), Positives = 39/103 (37%), Gaps = 10/103 (9%) Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 S K +I +DG+ + S + + + + + +V + A G + Sbjct: 493 GLKKSHSNLKHIIVFSDGDPNAPSPS--------LMNDIVSDRITVSTVLI-AGHAGPET 543 Query: 357 LRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + D G+F+ V EL + F K I + ++ P + Sbjct: 544 MIWIADHGKGRFYPVTSPNELPQIFIKEAAVILKSAIYEDPFK 586 >gi|85701714|ref|NP_001028371.1| calcium activated chloride channel [Mus musculus] gi|74202052|dbj|BAE23018.1| unnamed protein product [Mus musculus] gi|148680069|gb|EDL12016.1| mCG120741 [Mus musculus] gi|187951335|gb|AAI39089.1| Expressed sequence AI747448 [Mus musculus] gi|187957592|gb|AAI39090.1| Expressed sequence AI747448 [Mus musculus] Length = 925 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 22/207 (10%), Positives = 77/207 (37%), Gaps = 35/207 (16%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++ +++ + ++A ++ I + N S + + ++ +++++ Sbjct: 315 VSGSMSSSDRLNRMNQAAKYFLSQIIE------NRSWVGMVHFSSQATIVHELIQMNSDI 368 Query: 260 NEVKSRLNKL--NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 L L + T+ + A++ N + + T ++ ++DGE+S Sbjct: 369 ER-NKLLQTLPTSAIGGTSICSGIKTAFQVFKNGEYQTDGTE--------ILLLSDGEDS 419 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSSGQFFAVNDSRE- 375 + ++++G ++ +A+ + + + G D + Sbjct: 420 T---------AKDCIDEVKDSGSIVHFIALGPSADLAVTNMSIL--TGGNHKLATDEAQN 468 Query: 376 --LLESFDKITDK---IQEQSVRIAPN 397 L+++F + + I ++S+++ Sbjct: 469 NGLIDAFGALASENTDITQKSLQLESK 495 >gi|73997822|ref|XP_534930.2| PREDICTED: similar to voltage-gated calcium channel alpha(2)delta-4 subunit isoform 1 [Canis familiaris] Length = 1121 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 25/351 (7%), Positives = 102/351 (29%), Gaps = 19/351 (5%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + + + + D + + + ++ + + N Sbjct: 116 MLRRKVEAVKNLVEAAEEADLNHEFNESLVFNYYNSVLINEKDENGDYVELGAEFVLESN 175 Query: 99 NPLQ--YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + S + ++PT I + + + +E + ++ Sbjct: 176 AHFSNLMVNTSISSVQLPTNVYNKDPDILNGVYMSEALNPVFVENFQRDPTLTWQYFGSS 235 Query: 157 SRSME--DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + ++N + + + + K+ ++ + Sbjct: 236 TGFFRIYPGIKWTPDENGVIAFDCRNRGWYIQAATSPKDIVIVVDTSGSMKG-LRMTIAK 294 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + +++++ + ++ G +N K +++L Sbjct: 295 HTISTILDTLGENDF-VNIIAYSDYIHYVEPCFKGTLVQADRDNREHFKQLVDELMVKGV 353 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 PA+ A++ L +E+ ++ + + ++ ITDG N Sbjct: 354 GIVNPALTEAFQILKQFQEARQGSLCN----QAIMLITDGAVEDYKPVFEKYNWP----- 404 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + +++++ + D ++ ++ ++ L ++ + + + Sbjct: 405 --DRKIRVFTYLIGREVTFADRMKWIACNNKGYY--TQISTLADAQENVME 451 >gi|332707319|ref|ZP_08427370.1| hypothetical protein LYNGBM3L_36250 [Lyngbya majuscula 3L] gi|332353913|gb|EGJ33402.1| hypothetical protein LYNGBM3L_36250 [Lyngbya majuscula 3L] Length = 464 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 30/225 (13%), Positives = 69/225 (30%), Gaps = 35/225 (15%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 Y + K T KID++ +L+N ++ + RI Sbjct: 58 YEVVEGDTKPTGRVYTQDGNDYEEVIGGKTKIDIV---IESLLNLVRSNQLGGSD---RI 111 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHN 297 I ++ + +++++ + KL T M L Sbjct: 112 AIIQFDDQASTIVGLTPATETSQLEAGIEKLRNYSGGTCMGEGMEQTLTMLS-------- 163 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 G T + + TDG+ + ++ + G+ I ++ V +DLL Sbjct: 164 --GQTMTSRHALIFTDGQ------AFDEEECRELAKQFSENGIPITALGVG--DYNEDLL 213 Query: 358 RKCT-DSSGQFFAVNDSR---------ELLESFDKITDKIQEQSV 392 + ++G + +L + + + Q++ + Sbjct: 214 INLSDTTAGHLHHIVSGDATGTDVAIRDLPQILFREFSQAQQEVI 258 >gi|262198733|ref|YP_003269942.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082080|gb|ACY18049.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 684 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 42/319 (13%), Positives = 91/319 (28%), Gaps = 27/319 (8%) Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 A+ + N+ L G+ A L + + S + +++ V + Sbjct: 7 TNLRTKSVARRPAARSSLRVGNIALAGVFAGAAL---LGACDMSVGQSTSSLVTVTAVPN 63 Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + + + Y PP + + + A A N ++ V+ Sbjct: 64 PPLDPRCGLDAVIVLDASSSVRNYNNPPDANGAVDLIAGAGNAFLGAFADTNSRVAVVSY 123 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 +A + A+ + IA + +E T Sbjct: 124 NADPRLQLDLTAVTTDSLAAGGAHGIAMGDPGGPQGPMSPTTGYSE------HARNGSGT 177 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI--FITDGENSG-ASAYQNTLNTLQIC 332 N + +A L N + + DG + + ++ Sbjct: 178 NWEAGLVYAQNVLENNGRADVPKLVIHVTDGRPTRHLTPDGTVTDEGGMAVHVAEAAEVA 237 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCT-------DSSGQFFA--------VNDSRELL 377 + ++ +G+ I++V V P+ + L+ + G F D +L Sbjct: 238 DQLKASGVHIFAVGVGRAPQFSEELQATSGPDVFDQTQPGDAFDVVNDDVILAADFDQLE 297 Query: 378 ESFDKITDKIQEQSVRIAP 396 E + D+I S+ I Sbjct: 298 ELLRGVADQICGASLTITK 316 >gi|295401970|ref|ZP_06811932.1| conserved hypothetical protein [Geobacillus thermoglucosidasius C56-YS93] gi|312109213|ref|YP_003987529.1| von Willebrand factor type A [Geobacillus sp. Y4.1MC1] gi|294975972|gb|EFG51588.1| conserved hypothetical protein [Geobacillus thermoglucosidasius C56-YS93] gi|311214314|gb|ADP72918.1| von Willebrand factor type A [Geobacillus sp. Y4.1MC1] Length = 244 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 41/103 (39%), Gaps = 10/103 (9%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV-SAPPEGQDLLR 358 S + ++ ITDG ++ + + + + G+ + + V + LR Sbjct: 1 MSRGTLRQILLITDGCSNHGE------DPIAMAALAKEQGITVNVIGVLDQDTIDESGLR 54 Query: 359 K---CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + S G V +++L ++ +T K Q+++ N+ Sbjct: 55 EIEGIALSGGGISQVVYAKQLSQTVQMVTRKAMTQTLQGVVNK 97 >gi|254427565|ref|ZP_05041272.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] gi|196193734|gb|EDX88693.1| Vault protein inter-alpha-trypsin [Alcanivorax sp. DG881] Length = 684 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 20/163 (12%), Positives = 56/163 (34%), Gaps = 29/163 (17%) Query: 237 RIGTIAYNIGI---VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R + ++ NN+ + ++ +L TN + + A L ++ Sbjct: 339 RFRIVLFDDRAEELTSGFVDATPNNIRQYTKKVMQLQSRGGTNLFGGLSLALNPLDADRP 398 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + ++ +TDG + + + N +++++ + Sbjct: 399 TG------------IVLVTDGVANVGKT-----QQKDFIDLLENHDVRLFTFVMG-NSAN 440 Query: 354 QDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + +L T S+G +V++S + I +I + ++ Sbjct: 441 RPMLTAMTNASNGFAISVSNSDD-------IAGQILNATAKLT 476 >gi|189346892|ref|YP_001943421.1| hypothetical protein Clim_1382 [Chlorobium limicola DSM 245] gi|189341039|gb|ACD90442.1| conserved hypothetical protein [Chlorobium limicola DSM 245] Length = 412 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 49/171 (28%), Gaps = 20/171 (11%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI------VGNQCTPLSNNLNEVK 263 I+ +VN + + + + +++ Y Q TP + +L+ + Sbjct: 87 INQAKSQLWRIVNETSRMHKRGEQIRLQVALYEYGNNSLSPASGYIRQVTPFTEDLDLLS 146 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 L L + + + S + K + + + Sbjct: 147 EALFSLTTNGGSEYCG-------HVIGSSLNRLRWNSSRQGLKMIYIAGNEPFNQG---- 195 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVND 372 +N C + + + ++ G D R G +FA++ Sbjct: 196 -EVNYEAACRWAAERDITVNTIYCGDYRTGVDTFWQRGAAIGRGGYFAIDS 245 >gi|114705831|ref|ZP_01438734.1| hypothetical protein FP2506_15234 [Fulvimarina pelagi HTCC2506] gi|114538677|gb|EAU41798.1| hypothetical protein FP2506_15234 [Fulvimarina pelagi HTCC2506] Length = 548 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 19/154 (12%), Positives = 37/154 (24%), Gaps = 15/154 (9%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+++ + F +A DL R +Q +D Sbjct: 21 FALLLPLILGFGGFATDLGLAYLERRSLQ-------------AATDAAALAAIRHPHDAE 67 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 I K ++ + G + + + P L Sbjct: 68 AIAAKVFAASSQKDATFDVVFGSYEPDRNRDERFRPEGSSAVSV-VRVEARKPHHYT-LA 125 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLD 155 ++ LS + + A S M +D Sbjct: 126 RVLGVNGVELSASADASLNAQVVFQAGSRLMKVD 159 >gi|312130321|ref|YP_003997661.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311906867|gb|ADQ17308.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 318 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 25/160 (15%), Positives = 52/160 (32%), Gaps = 31/160 (19%) Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 S+ A +ID L++ RIG I +N T Sbjct: 84 IDLSESMNATDVVPSRIDRAKNELQGLIDRFSA---------DRIGIILFNSNAYL--LT 132 Query: 254 PLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + +++ + L + +T+ P + +L ++ K I Sbjct: 133 PLTFDTENIRNTIGNLKTHMIDKGSTDFSPMLEMINEKLSVGTQNR---------GKVAI 183 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 +TDGE Q+ + ++ + ++ + V Sbjct: 184 VVTDGETHY-------QIDEQLAKRLKQNNIHLFWLGVGT 216 >gi|251799254|ref|YP_003013985.1| von Willebrand factor A [Paenibacillus sp. JDR-2] gi|247546880|gb|ACT03899.1| von Willebrand factor type A [Paenibacillus sp. JDR-2] Length = 562 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 18/107 (16%), Positives = 40/107 (37%), Gaps = 15/107 (14%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 +N + +N L T T+ + A + L ++ + NT + ++DGE Sbjct: 445 TNQQSLFVGAVNSLQSGGGTATFDGIVVALKLLEDQLATDPNTKP------VIFVLSDGE 498 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 + Q+ + E + + IY++ +A L+ + Sbjct: 499 TNQGYTLQD---IKGLIETYK---IPIYTIGYNAN---IQALQNISS 536 >gi|296453244|ref|YP_003660387.1| von Willebrand factor type A (vWA) domain-containing protein [Bifidobacterium longum subsp. longum JDM301] gi|296182675|gb|ADG99556.1| von Willebrand factor type A (vWA) domain protein [Bifidobacterium longum subsp. longum JDM301] Length = 565 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 30/213 (14%), Positives = 64/213 (30%), Gaps = 18/213 (8%) Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + L S Y+ + + + V +A + +K+ E Sbjct: 365 PAATVIDSALNVYQTALRKPSWTIWVVDYSGSMSGEGKNGVVKGLNAALDPDQAKKSYIE 424 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + V I + T S +++ + + T+ Y + A EL + Sbjct: 425 PASGDVNILIPFETEAHRPVKATGTS--TSDLLHEADATDASGGTDIYEGLLSALDELPS 482 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 E E+S ++ +TDG ++ + R + I+S+ Sbjct: 483 ESEASQ-------YTTAIVLMTDGRSNSDHQDEFESAYKS-----RGRDLPIFSIMFG-- 528 Query: 351 PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 L+ S+ + F +L F + Sbjct: 529 DADPSQLKSLATLSNAKVFDGRS-GDLAAVFRQ 560 >gi|74203017|dbj|BAE26210.1| unnamed protein product [Mus musculus] Length = 791 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 23/147 (15%), Positives = 45/147 (30%), Gaps = 17/147 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G + + + EVK + + T T A+ L Sbjct: 82 DISPGRVRVGALQFGSTPHLEFPLDSFSTRQEVKESIKGIVFKGGRTETGLALK----RL 137 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + +I ITDG++ G A + +R G+ +++V V Sbjct: 138 SRGFPGGR----NGSVPQILIIITDGKSQGPVALP--------AKQLRERGIVVFAVGVR 185 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRE 375 P + L + Sbjct: 186 FPRWDELLTLASEPKDQHVLLAEQVED 212 >gi|304315865|ref|YP_003851010.1| von Willebrand factor A [Thermoanaerobacterium thermosaccharolyticum DSM 571] gi|302777367|gb|ADL67926.1| von Willebrand factor type A [Thermoanaerobacterium thermosaccharolyticum DSM 571] Length = 230 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 14/88 (15%), Positives = 35/88 (39%), Gaps = 7/88 (7%) Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSS 364 + +I +TDG+++ N G+ + ++ + ++ + Sbjct: 7 RQIIVVTDGKSNVGG------NPADAAYLANKKGITVSAIGIVDDGNLSHKEIKDIANWG 60 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQSV 392 G + + S E ++S IT K E+++ Sbjct: 61 GGVYDIIYSDEFIKSLSAITQKSAEKTI 88 >gi|125625081|ref|YP_001033564.1| hypothetical protein llmg_2320 [Lactococcus lactis subsp. cremoris MG1363] gi|124493889|emb|CAL98883.1| hypothetical protein predicted by Glimmer/Critica [Lactococcus lactis subsp. cremoris MG1363] Length = 1444 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 27/267 (10%), Positives = 72/267 (26%), Gaps = 32/267 (11%) Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + + I +V+D+S SME + + S Sbjct: 296 NVQNPIKPVDIVLVVDMSGSMEGAREGAIKQGVKSFLSSIENTAYAQYVNVGLVGYSSPG 355 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKA------IQEKKNLSVRIGTIAYNIGIVGNQ-- 251 + + + + + V+++ KA L +R G GN+ Sbjct: 356 YISNSGYIT-VPMESLATDGHVSAMNKALERQFVGGTFTQLGIRQGAQMLKEDASGNEKM 414 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 +++ + ++++ L +S + + + + Sbjct: 415 IILMTDGVPTFSNKVSSAQLEGGVLYGTDFDS--NSLDEPSFTSQLWMMNGNNRTPAPYT 472 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS-------APPEGQDLLRK----- 359 GE + TL + ++ G +I+++ + L + Sbjct: 473 VSGETINDTW----AATLGEAKIAKDDGAEIHTLGIQLGKDSGYTNDSSNTYLSQEEVRK 528 Query: 360 ----CTDSSGQFFAVNDSRELLESFDK 382 S G + + + + + Sbjct: 529 RTSLIASS-GLYQDADSAENITDYLKN 554 >gi|119026487|ref|YP_910332.1| truncated clumping factor [Bifidobacterium adolescentis ATCC 15703] gi|118766071|dbj|BAF40250.1| truncated clumping factor [Bifidobacterium adolescentis ATCC 15703] Length = 431 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 30/240 (12%), Positives = 67/240 (27%), Gaps = 35/240 (14%) Query: 49 TIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK 108 T +D K + +A A +I +D+ + Sbjct: 223 TAEDADDADKNGKNDADKDAASAQNGNDNDKNDAAKDDTDADEHIMRDRFTFNRKTVMRA 282 Query: 109 AQYEIPTENLFLKGL----IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY 164 A+ + K + NL++ E I + +VLD S SM Sbjct: 283 ARNVAVPQPDHTKSITYNNGGKYTLNLNVVGKDTRESHETTEKIEVVLVLDTSGSMNYCM 342 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 +++ L E+A + +++ Sbjct: 343 DGSQRR-------------------------------CNKSNPKRLTALKEAATSFIDAT 371 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 + ++ + ++ G + L+++ +KS +++L+ T M A Sbjct: 372 ETTNDTIQDENSKVRIAIAQFGQTSGVVSSLTSDTAALKSSVSRLSANGATPADKGMAAA 431 >gi|315105441|gb|EFT77417.1| von Willebrand factor type A domain protein [Propionibacterium acnes HL030PA1] Length = 322 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 51/154 (33%), Gaps = 25/154 (16%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 P S + V ++ + + T A+ + + + + ++ ++D Sbjct: 150 PPSTDRPTVLRAVDGIELQDGTALGGAIDKSLEAVK----MAPGGSKNPAPA-AIVMLSD 204 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------------PEGQDLLRKC 360 G+N+ + N + + +Y++A LL Sbjct: 205 GDNTQGGSPLVAANRAAAAK------VPVYTIAFGTETGYVDLNGQRERVAPDTKLLSTV 258 Query: 361 TD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 D + + + + + +L E + ++ + + VR Sbjct: 259 ADRTHAKSWTADSADKLQEGYQQVHSSVGYEPVR 292 >gi|294673503|ref|YP_003574119.1| BatB/BatC protein [Prevotella ruminicola 23] gi|294471951|gb|ADE81340.1| putative BatB/BatC protein [Prevotella ruminicola 23] Length = 566 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 27/167 (16%), Positives = 57/167 (34%), Gaps = 46/167 (27%) Query: 253 TPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++++ K L+ ++P T+ A+ A +H+ + K + Sbjct: 144 LPITSDYVSAKMFLSSIDPSMMATQGTDIARAIDMA----------THSFTQEEGIGKAI 193 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA----------------PPE 352 I ITDGE+ L+ E + AGM++Y + V + Sbjct: 194 IVITDGEDHEG-------GALESAEAAKKAGMRVYVLGVGSTQGAPIPIPGTGDYMKDNT 246 Query: 353 GQDLL---------RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 G ++ + G + V ++ + D K+ ++ Sbjct: 247 GNTVMSALNEDMCRQVAQAGGGAYIHVENNSAAQDQLDNELSKLAKK 293 >gi|323529406|ref|YP_004231558.1| hypothetical protein BC1001_5117 [Burkholderia sp. CCGE1001] gi|323386408|gb|ADX58498.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. CCGE1001] Length = 353 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 51/155 (32%), Gaps = 8/155 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTA+ ++ A+DL +RN++Q+A DAA L+G S+ + + T + Sbjct: 21 MTALCLTALVGITALAVDLGRAWVVRNELQNAADAAALAGAGSLGPNYKSPNWTQAAAKA 80 Query: 61 STIFK-------KQIKKHLKQGS-YIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYE 112 + + ++ G ++ + +Q A Sbjct: 81 QSAITLNKTEGVALVTAQVQTGYWNVKGTPAGMQALPVPAPGAYDRPAVQVTVSRAAGQN 140 Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLA 147 +L L ++ +S + +I Sbjct: 141 GGPLSLVLAPVLGITTMPISATAVAVISAPGYAGP 175 >gi|218778178|ref|YP_002429496.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218759562|gb|ACL02028.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 480 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 26/182 (14%), Positives = 60/182 (32%), Gaps = 25/182 (13%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG-NQCTPLSNNL-NEVKSRL 266 K+ + LV ++ R + Y+ + G + L+ + N + + Sbjct: 109 KVRDAKAAVKGLVEGLRSQD--------RFSLVTYSNSVNGGDGLHYLTADKRNSLNWMV 160 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + + TN + L VI I+DG+ + N L Sbjct: 161 DSIPAGGGTNLGGGLEKGVGVLRAYGAPDRMGK--------VILISDGQANQGVTDPNQL 212 Query: 327 NTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKI 383 + +R+ G + +V + + L+ D G+++ + + + L F + Sbjct: 213 AAMAA---LRDDGLVYSVTTVGIGQD-FNEQLMATVADGGRGRYYYLENPGDFLAVFQEE 268 Query: 384 TD 385 + Sbjct: 269 AN 270 >gi|145551332|ref|XP_001461343.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124429177|emb|CAK93970.1| unnamed protein product [Paramecium tetraurelia] Length = 265 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 57/147 (38%), Gaps = 13/147 (8%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 S +++ + L + + T A+ + + + + K + +TDGE Sbjct: 8 SAPQSKLNNILPRFSCSGATAFRDAVVKGNQLMQQLLDLFCKKGAHDKFKFVHVVLTDGE 67 Query: 316 NSGASAYQNTLNTLQICEYMRN---AGM-KIYSVAVSAPPEG--QDLLR---KCTDSSGQ 366 ++ + Q+ ++N + + + + V+ Q +R +C+ S Q Sbjct: 68 DNRSQISLQEFLAYQV--RLKNELPENILQTFYIGVNVENNQTVQQEMRAILQCSGKSAQ 125 Query: 367 FFAVNDSRELLESFDKITDKIQEQSVR 393 +F ++ + E F KI +I V+ Sbjct: 126 YFPIHS-NGINEIFQKIQMQIG-LRVQ 150 >gi|123446482|ref|XP_001311991.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121893822|gb|EAX99061.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 722 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 36/303 (11%), Positives = 91/303 (30%), Gaps = 40/303 (13%) Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 Y+ P + + G + S ++ ++ + + +D ++ ++D + Sbjct: 139 YKFPLTHRYQSGAVTSNNSDKPESFHFATTIKTQRDIQDLKVSVDGTKDVKDAHNATFVT 198 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP-APAPANRKIDVLIESAGNLVNSIQKAIQ 229 NN + ++ K + + Y P +V+ Sbjct: 199 NNPPAKDAIVIETHIKDEDKNVAISSDGYIAITTYPYFEGPIDSNSEFYFIVDCSGSMSC 258 Query: 230 EKKNLSVRIGTIA------------YNIGIVGNQCTPLSN----NLNEVKSRLNKLNP-Y 272 + N +++ + G P + N+ + L+ ++ Sbjct: 259 SRINNAIKCMRLFIQSLPVGCRFSILRFGSHFETVLPPCDYTDENVANAMNLLDNISANM 318 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TN + H S K + F+TDGE + Sbjct: 319 GGTNILAPLQHVSDL-----------QASEGFVKQIFFLTDGEVDNSDIIC--------A 359 Query: 333 EYMRNAGM-KIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++N +I+S+ + + L++ S G + + D+ + E ++ Sbjct: 360 TALKNRSTNRIFSIGLGSGA-DPGLIKGMARKSGGNYAIIGDNDNMNEKVIEMLSSAISP 418 Query: 391 SVR 393 ++R Sbjct: 419 ALR 421 >gi|118357550|ref|XP_001012024.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89293791|gb|EAR91779.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 853 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 26/145 (17%), Positives = 48/145 (33%), Gaps = 29/145 (20%) Query: 256 SNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 S NL + +N+ + T Y +H+ + E K + +TDG Sbjct: 372 SQNLEQAVQIINQYSANLGGTEIYQPLHNVFNE-----------KKIEGYNKQIFLLTDG 420 Query: 315 ENSGASAYQNTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 + N Q+ ++ +I+S+ + Q + G VN Sbjct: 421 QVD---------NPKQVVNLIKKNNKFSRIHSIGFGNDADKQLIQETAVYGKGISKIVNQ 471 Query: 373 SRELLESFDKITDKIQEQSVRIAPN 397 + +L E I+ S+ I P Sbjct: 472 NCDLQEVV------IEMLSLSITPT 490 >gi|323491534|ref|ZP_08096713.1| hypothetical protein VIBR0546_18031 [Vibrio brasiliensis LMG 20546] gi|323314110|gb|EGA67195.1| hypothetical protein VIBR0546_18031 [Vibrio brasiliensis LMG 20546] Length = 411 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 23/153 (15%), Positives = 45/153 (29%), Gaps = 2/153 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M + + AID++H + + ++Q+++DAA L+ + S+ T + TT + T Sbjct: 21 MVTVAMLALVGVAALAIDVSHAVLNKARLQNSVDAAALAAAIVMDSEGTNAEATTAANTT 80 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 T S T + L + F Sbjct: 81 LTNLASATGNSEMDFSSSTVTVQYSNDPTIFPQTSGYDADLDTYVRVTVSAYS--LDNFF 138 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 L +S G + + + Sbjct: 139 AYLFGVDKQLVSSAVAGPSPGIDVVNVVPMAVC 171 >gi|300071886|gb|ADJ61286.1| hypothetical protein LLNZ_11975 [Lactococcus lactis subsp. cremoris NZ9000] Length = 1438 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 27/267 (10%), Positives = 72/267 (26%), Gaps = 32/267 (11%) Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + + I +V+D+S SME + + S Sbjct: 290 NVQNPIKPVDIVLVVDMSGSMEGAREGAIKQGVKSFLSSIENTAYAQYVNVGLVGYSSPG 349 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKA------IQEKKNLSVRIGTIAYNIGIVGNQ-- 251 + + + + + V+++ KA L +R G GN+ Sbjct: 350 YISNSGYIT-VPMESLATDGHVSAMNKALERQFVGGTFTQLGIRQGAQMLKEDASGNEKM 408 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 +++ + ++++ L +S + + + + Sbjct: 409 IILMTDGVPTFSNKVSSAQLEGGVLYGTDFDS--NSLDEPSFTSQLWMMNGNNRTPAPYT 466 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS-------APPEGQDLLRK----- 359 GE + TL + ++ G +I+++ + L + Sbjct: 467 VSGETINDTW----AATLGEAKIAKDDGAEIHTLGIQLGKDSGYTNDSSNTYLSQEEVRK 522 Query: 360 ----CTDSSGQFFAVNDSRELLESFDK 382 S G + + + + + Sbjct: 523 RTSLIASS-GLYQDADSAENITDYLKN 548 >gi|126457247|ref|YP_001076991.1| hypothetical protein BURPS1106A_A2962 [Burkholderia pseudomallei 1106a] gi|242311800|ref|ZP_04810817.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] gi|254192526|ref|ZP_04898965.1| conserved hypothetical protein [Burkholderia pseudomallei S13] gi|126231015|gb|ABN94428.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a] gi|169649284|gb|EDS81977.1| conserved hypothetical protein [Burkholderia pseudomallei S13] gi|242135039|gb|EES21442.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] Length = 418 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/322 (10%), Positives = 89/322 (27%), Gaps = 29/322 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 26 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 85 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + +Q ++ S + + N ++ + + Sbjct: 86 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 145 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM-------EDLYLQKHND 170 L + +TN S+ +T + + +I + + + + + Sbjct: 146 QTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 205 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR------------KIDVLIESAG 218 + + + N+ S + Sbjct: 206 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTNAYN 265 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR-------LNKLNP 271 ++ + AY+ +Q S+ +++ + + +N Sbjct: 266 TRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASYQGDLITGINT 325 Query: 272 YENTNTYPAMHHAYRELYNEKE 293 + N A R L E Sbjct: 326 GGSYNPSYYAAGADRRLALAPE 347 >gi|296124235|ref|YP_003632013.1| hypothetical protein Plim_4003 [Planctomyces limnophilus DSM 3776] gi|296016575|gb|ADG69814.1| protein of unknown function DUF1355 [Planctomyces limnophilus DSM 3776] Length = 1023 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 51/165 (30%), Gaps = 20/165 (12%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + V + YN G + E+ +N + + M L Sbjct: 488 GEQDEVGVLAYDYNDGEKWIFELTPAGKYEELSLLINSAEIGDMPSFQQTMQMGIDGLEA 547 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSA 349 S K +I I+DG+ + + + + +A + I +VA Sbjct: 548 ----------SDASSKHMIIISDGDP--------SPASPDLLKRFIDAKVTISTVAVFPH 589 Query: 350 PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + G+++ + +L F K + ++ ++ Sbjct: 590 GDVDTPTMTSIAQITGGRYYKPTNPNQLPAIFIKESKTLRRSMLQ 634 >gi|224368584|ref|YP_002602747.1| hypothetical protein HRM2_14740 [Desulfobacterium autotrophicum HRM2] gi|223691300|gb|ACN14583.1| conserved hypothetical protein [Desulfobacterium autotrophicum HRM2] Length = 222 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 76/200 (38%), Gaps = 18/200 (9%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + KID + ++ +L+++ + + + N + + I + PL+ Sbjct: 20 TSGSMSVDGKIDAMNQALRDLIDTF--SGESRLNAEIHLSVITFGGDGAKEH-LPLT--C 74 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 S + L + T A A + ++ I S + ++ ++DG + Sbjct: 75 AHTISGFSDLQAHGMTPMGGAFRIAKELIEDK-----EKIPSRAYRPVIVLVSDGYPNDD 129 Query: 320 S-AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ-FFAVNDSRELL 377 A ++L + + ++A+ + +++L+ + F N +R+++ Sbjct: 130 WEAAFSSLRGSERAQKATRM-----AMAIGSDA-DENMLKDFINDPETPVFRANGARDII 183 Query: 378 ESFDKITDKIQEQSVRIAPN 397 F ++ + +S APN Sbjct: 184 RFFRAVSMSVTSRSRSSAPN 203 >gi|218670347|ref|ZP_03520018.1| hypothetical protein RetlG_01180 [Rhizobium etli GR56] Length = 125 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 12/70 (17%), Positives = 27/70 (38%), Gaps = 6/70 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MTAI+ V A+ + ++ + Q+Q A A++ + + + T + Q Sbjct: 1 MTAILAPVLLGAAGMAVHVGDMLLSKQQLQEAA------DSAALATATALANGTIQTSQA 54 Query: 61 STIFKKQIKK 70 + + Sbjct: 55 EAFARNFVAG 64 >gi|194374787|dbj|BAG62508.1| unnamed protein product [Homo sapiens] Length = 677 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 58/151 (38%), Gaps = 26/151 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 ++ + + RL T+ + A+ + + + ++ +T Sbjct: 125 INSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTD---------GSEIVLLT 174 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DGE++ S N + + +G I++VA+ P Q+L + G +D Sbjct: 175 DGEDNTISGCFNEV---------KQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASD 224 Query: 373 ---SRELLESFDKITD---KIQEQSVRIAPN 397 + L+++F ++ + ++S+++ Sbjct: 225 QVQNNGLIDAFGALSSGNGAVSQRSIQLESK 255 >gi|149921114|ref|ZP_01909572.1| hypothetical protein PPSIR1_24789 [Plesiocystis pacifica SIR-1] gi|149818001|gb|EDM77460.1| hypothetical protein PPSIR1_24789 [Plesiocystis pacifica SIR-1] Length = 389 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 24/225 (10%), Positives = 68/225 (30%), Gaps = 32/225 (14%) Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 ++ ++ + +D + + ++ S +++ Sbjct: 102 NAWDDDGNPDTEDVTRWHSLHDTVDTVGHQYQDGMSLGLTLFPSVDAES------SFDGA 155 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 N+ + L ++ L + ++T+ + A +H Sbjct: 156 CPVNEVPEVGVGLGNAEALLAAIPAADDTDLHGATPA---AAGIATALAHLEALEDGRPA 212 Query: 307 FVIFITDGENSGA--------SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ---- 354 +I +TDG + + + + L + + AG+ Y V + + Sbjct: 213 AMILVTDGAANCSAGANDITKFSQYDEDLPLVVADAWDRAGIPTYVVGIDIQESSEHPFT 272 Query: 355 ---DLLRKCTDSSG--------QFFAVNDSRELLESFDKITDKIQ 388 + L + ++ G F+ D++ L + D+I + Sbjct: 273 NPREKLHEVAEAGGVARSDGEVGFYDAGDAQALTAALDEIAASVS 317 >gi|320170832|gb|EFW47731.1| predicted protein [Capsaspora owczarzaki ATCC 30864] Length = 1062 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 18/130 (13%), Positives = 43/130 (33%), Gaps = 14/130 (10%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + + + L L T+ A++ K + V+F+TDG+++ Sbjct: 152 TSTLSNVLPNLQAGGGTSFCAALNAISNACMPHKGTI-----------SVVFMTDGQDTD 200 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL--LRKCTDSSGQFFAVNDSRE 375 +S N ++ + + + + +V L L + + G Sbjct: 201 MSSLPAAKANFQRMVKSSPDRSVVVSTVGFGGEANLAFLRELSEIGTAPGYARFAETGDN 260 Query: 376 LLESFDKITD 385 ++K+ D Sbjct: 261 SAALYEKLVD 270 >gi|282900972|ref|ZP_06308905.1| hypothetical protein CRC_02388 [Cylindrospermopsis raciborskii CS-505] gi|281194063|gb|EFA69027.1| hypothetical protein CRC_02388 [Cylindrospermopsis raciborskii CS-505] Length = 464 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 34/169 (20%), Positives = 61/169 (36%), Gaps = 26/169 (15%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK- 263 KID++IES LVNS + Q+ RI + +N + + + K Sbjct: 82 GGKSKIDIVIESLLALVNSGRLKQQD------RIAIVQFNDSASSIIGLTSATEIKKTKK 135 Query: 264 SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +N L T + A+ L ++ + K V+ TDG+ Sbjct: 136 TAINNLRNFSGGTRMGLGLRRAFDILSEQEMTV----------KRVLLFTDGQT------ 179 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAV 370 + I + + ++ V +DLL +D + G+ F V Sbjct: 180 FDEDQCQSIANDFATRNIPVTALGVG-EEFNEDLLSHLSDCTGGKLFYV 227 >gi|203284094|ref|YP_002221834.1| hypothetical protein BDU_172 [Borrelia duttonii Ly] gi|201083537|gb|ACH93128.1| uncharacterized conserved protein [Borrelia duttonii Ly] Length = 341 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 60/172 (34%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 PL+ + + +L+ ++ + + A L S KK V Sbjct: 154 IVPLTIDRDFFSKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEAPKKSV 203 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP----------------- 351 I +TDG + Y++ Q+ + +KIYSV + + Sbjct: 204 IVLTDGVVNSDEVYKD-----QVINLAQGLNVKIYSVGIGSDEELNVGFKLRSGKFYQGV 258 Query: 352 ----EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + ++ + G F++V D + + K E+ V+I+ + Sbjct: 259 LKEVYDPSMLFEISNKTGGLFYSVGDDFSFKLAIQDFSKKENVERKVKISVD 310 >gi|149200158|ref|ZP_01877182.1| batB protein [Lentisphaera araneosa HTCC2155] gi|149136799|gb|EDM25228.1| batB protein [Lentisphaera araneosa HTCC2155] Length = 621 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 26/152 (17%), Positives = 56/152 (36%), Gaps = 30/152 (19%) Query: 254 PLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL++ + V L+ LN P TN A+ HA ++ + R + V+ Sbjct: 148 PLTSEPDMVLLYLSDLNSSLLPGGGTNIAAALDHAQKQF----------KENERDSRVVV 197 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK----C-TDSS 364 ++DGE G ++ E ++ + + +++ P +L + Sbjct: 198 LLSDGETDGNKWRESL-------EALQKKKIPVNVISLGDPKREGLVLNEKGHPIRNSKG 250 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 +++D+ L +I D+ + P Sbjct: 251 DYVMSLSDTSTL----KQIADETGGTYIPWDP 278 >gi|45332244|gb|AAS58046.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 655 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 58/164 (35%), Gaps = 14/164 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + N +R+ Y T L + + L KL+ T M + R L Sbjct: 77 DLDNTDIRLSLTTY-STPTRQIFTFLDAAASSTRLALTKLDWMNGTKARYGMTYTGRAL- 134 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 N + G + K ++ ITDG +S S T Q+ +R+ G+ + + V Sbjct: 135 NYVRKAILPYGRKNVPKALLLITDGVSSDGSY------TAQVAAMLRDEGVNVMVIGVG- 187 Query: 350 PPEGQDLLRKCTDSSGQ----FFAVNDSRELLESFDKITDKIQE 389 R G F + ++++ F+ + ++ + Sbjct: 188 -DVNVAECRGIVGCDGIMDCPMFKQTNWKDIMGLFNSLMKEVCD 230 >gi|298247107|ref|ZP_06970912.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] gi|297549766|gb|EFH83632.1| von Willebrand factor type A [Ktedonobacter racemifer DSM 44963] Length = 550 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 36/266 (13%), Positives = 81/266 (30%), Gaps = 28/266 (10%) Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM----TSNKYLLPP 183 T++ + + R + +S+ V + + + L Sbjct: 297 STDVQQKLLQLGRRPAGGGGLSLTGVDKTTFNPDWGIKATLRQQAITYPAPDVINAALDN 356 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS-------IQKAIQEKKNLSV 236 +T +N D L ES+ L + +Q Q+ ++ V Sbjct: 357 YQTVYRRPVHTIYCLDGSGSMGSNGGWDQLKESSELLFDQTKARQYLLQTHPQDLTSVMV 416 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 IA + + + P TN Y + + + Sbjct: 417 FNSDIAAGPDGSWTVEGNDPQKMRGLYDNIQAREPDGGTNMYACLQRSVELFKQQP---- 472 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + K+ +I +TDG++ + QI + + + G+ + SVA + + L Sbjct: 473 ----NENRKRLIIVMTDGQSEKGNGVD------QIIQSVASLGVPVISVAFGSDADVTQL 522 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDK 382 T + G +V ++++ + Sbjct: 523 NEISTATHG---SVTKKDNMVDAMRE 545 >gi|311743549|ref|ZP_07717355.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] gi|311312679|gb|EFQ82590.1| conserved hypothetical protein [Aeromicrobium marinum DSM 15272] Length = 445 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 48/180 (26%), Gaps = 10/180 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVL---------SGCASIVSDRTIK 51 +TA+I++ ID ++R Q Q+A DA + A Sbjct: 22 ITAVIMTSLLSIGALTIDYGAASHVRRQTQNAADATTRSIVENCAKQAAAAGQNVLDGAC 81 Query: 52 DPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQY 111 T STI + I +++ + L ++ Sbjct: 82 VSATGTADASTIVQGNAPGSAPDAPVIGGAGREVSVTVAEPVDYRLAQLLGKDSDVVRS- 140 Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 E L+ + + + + + L S ++ + +N Sbjct: 141 SATAEWTNLRPVEGFPVVPFGVAQCTYNDFRPSAGNVDGARTLIRSDVLQTVRSLVNNAT 200 >gi|262196568|ref|YP_003267777.1| FHA domain containing protein [Haliangium ochraceum DSM 14365] gi|262079915|gb|ACY15884.1| FHA domain containing protein [Haliangium ochraceum DSM 14365] Length = 564 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 19/115 (16%), Positives = 41/115 (35%), Gaps = 7/115 (6%) Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + A + ++K + ++DG + A + N ++ + + Sbjct: 149 LIEAVNRARDTLARLEPEREGVPMRKLIAVVSDGRD----ADPSPENYRRVAKRAARNDI 204 Query: 341 KIYSVAVSAPPEGQDL--LRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 +I+++ A L L + + S G F V F ++ +I EQ V Sbjct: 205 RIHTIGFPADRNRYPLYGLAEMSKQSEGTFRLVLTESAFGSHFGQLAREINEQYV 259 >gi|271964249|ref|YP_003338445.1| hypothetical protein Sros_2742 [Streptosporangium roseum DSM 43021] gi|270507424|gb|ACZ85702.1| conserved hypothetical protein [Streptosporangium roseum DSM 43021] Length = 774 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 23/136 (16%), Positives = 41/136 (30%), Gaps = 18/136 (13%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N L +L T + A L + ++ +TDG+ Sbjct: 356 NRYRAVEHLARLEARGGTEMLAPLEQAVALLSESGR-----------DRVLVLVTDGQVG 404 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + R AG+++++V + L R SG+ V L Sbjct: 405 NEDQILERIG-------ARLAGVRVHTVGIDRAVNAGFLGRLAGLGSGRCELVESEDRLD 457 Query: 378 ESFDKITDKIQEQSVR 393 E+ + I +I V Sbjct: 458 EAMEHIHRRIGAPLVT 473 >gi|167744177|ref|ZP_02416951.1| hypothetical protein Bpse14_39258 [Burkholderia pseudomallei 14] gi|167851185|ref|ZP_02476693.1| hypothetical protein BpseB_38456 [Burkholderia pseudomallei B7210] gi|167916479|ref|ZP_02503570.1| hypothetical protein Bpse112_38762 [Burkholderia pseudomallei 112] Length = 396 Score = 47.6 bits (111), Expect = 0.003, Method: Composition-based stats. Identities = 33/322 (10%), Positives = 89/322 (27%), Gaps = 29/322 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 4 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 63 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + +Q ++ S + + N ++ + + Sbjct: 64 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 123 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM-------EDLYLQKHND 170 L + +TN S+ +T + + +I + + + + + Sbjct: 124 QTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 183 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR------------KIDVLIESAG 218 + + + N+ S + Sbjct: 184 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTNAYN 243 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR-------LNKLNP 271 ++ + AY+ +Q S+ +++ + + +N Sbjct: 244 TRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASYQGDLITGINT 303 Query: 272 YENTNTYPAMHHAYRELYNEKE 293 + N A R L E Sbjct: 304 GGSYNPGYYAAGADRRLALAPE 325 >gi|254781007|ref|YP_003065420.1| hypothetical protein CLIBASIA_04540 [Candidatus Liberibacter asiaticus str. psy62] gi|254040684|gb|ACT57480.1| hypothetical protein CLIBASIA_04540 [Candidatus Liberibacter asiaticus str. psy62] Length = 411 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/334 (10%), Positives = 80/334 (23%), Gaps = 27/334 (8%) Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 Q + L I D + + + + Q + Sbjct: 84 QIEKALNTYNSRDLSNTGSIESIVKDAVILTKNVNSLPLQFTVDIALSTTVQLRGSLLQM 143 Query: 119 FLKGLIPSA-LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 F + + I A + + + + Sbjct: 144 FSQSKGKVDISRRKKVMYKQNIGLMIMPFAWDGYWLASRGKVADSKVHPPKYLEYSHYYQ 203 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 +YL K+F S+ K+ ++ + ++ N K Sbjct: 204 QYLNRNTLVKNFLSQIPYKNFCMAPYHYSSILYWAVGTLTYSVDNKTTTREYYKDPYYAT 263 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA---YRELYNEKES 294 Y+ T +V TNT H + + Sbjct: 264 WDHFPYSFIKNVFDMTSNQFGDGQVL-----------TNTNHCFPHGASQNKYMLMLAIG 312 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + S +K + D + ++ + + I+SV S + + Sbjct: 313 NQLSRSSVEKEKIEKVLQD-CHYMHKRHRTGRDA-----------ITIFSVGFSPDQDTR 360 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 LR+C +++ +N ++ + + Sbjct: 361 YTLRQCASDPSKYYEINSDENVMPIAKSLARNVI 394 >gi|126306104|ref|XP_001362407.1| PREDICTED: similar to calcium-dependent chloride channel-1 [Monodelphis domestica] Length = 870 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/205 (10%), Positives = 71/205 (34%), Gaps = 36/205 (17%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 +++ L +++ + I + G + ++ + + Sbjct: 316 SMKVGNRLNRLRQASQFFLLQIIEKG-------SWTGIVTFDSSATIQSELIQIESDVQR 368 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 K+ +++L + + A+ + + + ++ +TDGE++ Sbjct: 369 KTLISRLPNVTVAGGGAHICSGLRTAFMVVKKKFLTD---------GSEMVLLTDGEDNT 419 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND---SRE 375 + E ++ +G I+++ + P + L + + G D + Sbjct: 420 TNTCF---------EEVKQSGAIIHTIVLG-PSTEKGLEKLSEMTGGMKTTATDNVQNNG 469 Query: 376 LLESFDKITD---KIQEQSVRIAPN 397 L+++F ++ I ++S+++ Sbjct: 470 LIDAFSALSSGNAAITQRSIKLESK 494 >gi|32965155|gb|AAP91765.1| polydomain protein-like [Ciona intestinalis] Length = 1969 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 24/268 (8%), Positives = 64/268 (23%), Gaps = 25/268 (9%) Query: 126 SALTNLSLRSTGIIERSSENLAI--SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 + + + + + + + ++ + Sbjct: 1704 FNHIDNGQVTCTNSNYHHSQCSFQCNPGTLTNPITHHGSHVMTCQSNGRWGGTPPCCAES 1763 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 P ++ + D + I++ + V + Sbjct: 1764 CPTRTKVDLYIVLESSTSG------ESDDWNRLFAFVETLIRRFSIDDSTTLVGLLRYHR 1817 Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGST 302 N+ ++G N+ E+ + L +N + H +S Sbjct: 1818 NVDVIGEVSLGRYRNVEELSRAVKFLPYGGYGSNIGQTLEHLAT------DSLLVDTNRP 1871 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 ++ VI TDG + N ++ + V + L + Sbjct: 1872 NVRDHVILFTDG--------SSDDNVTASAGLLKQR-ATVQVVGMDGTKTQLQQLEEIAT 1922 Query: 363 SSGQFFAVNDSREL-LESFDKITDKIQE 389 V++ +L D I + + Sbjct: 1923 KPSYVHTVSNISQLGTSVTDAIIMNMCQ 1950 >gi|261876473|dbj|BAI47562.1| collagen type VI alpha 2 subunit [Mesocricetus auratus] Length = 1026 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 55/154 (35%), Gaps = 19/154 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS--- 256 + + + N+VN + ++ K+ + R+G + Y+ Sbjct: 629 SSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHEGTFEAIRLDDERV 688 Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N+L+ K + L T T A+ AY +L E + + F + ITDG Sbjct: 689 NSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRR-------QKTRVFAVVITDGR 741 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + LN +C + + + ++ + Sbjct: 742 HD---PRDDDLNLRALC----DRDVTVTAIGIGD 768 Score = 43.4 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 29/220 (13%), Positives = 68/220 (30%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ N D L+ V +Q + L + Sbjct: 42 NNCPEKADCPVNVYFVLDTSESVAMQSPTDSLLYHMQQFVPQFISQLQNEFYLDQVALSW 101 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L + T T A+ + +++ Sbjct: 102 RYGGLHFSDQVEVFSPPGSDRASFTKSLQSIRSFRRGTFTDCALANMTQQI--------R 153 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 + F + ITDG +G+ + E R G+++++VA + L Sbjct: 154 QHVGRGVVNFAVVITDGHVTGSPCGGIKMQ----AERAREEGIRLFAVA-PNRNLNEQGL 208 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R ++ + + N+ + +I + +++ + Sbjct: 209 RDIANTPHELYR-NNYATMRPDSTEIDQDTINRIIKVMKH 247 >gi|254780914|ref|YP_003065327.1| hypothetical protein CLIBASIA_04065 [Candidatus Liberibacter asiaticus str. psy62] gi|254780929|ref|YP_003065342.1| hypothetical protein CLIBASIA_04140 [Candidatus Liberibacter asiaticus str. psy62] gi|254040591|gb|ACT57387.1| hypothetical protein CLIBASIA_04065 [Candidatus Liberibacter asiaticus str. psy62] gi|254040606|gb|ACT57402.1| hypothetical protein CLIBASIA_04140 [Candidatus Liberibacter asiaticus str. psy62] Length = 408 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/334 (10%), Positives = 80/334 (23%), Gaps = 27/334 (8%) Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 Q + L I D + + + + Q + Sbjct: 84 QIEKALNTYNSRDLSNIGSIESIVKDAVILTKNVNSLPLQFTVDIALSTTVQLRGSLLQM 143 Query: 119 FLKGLIPSA-LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 F + + I A + + + + Sbjct: 144 FSQSKGKVDISRRKKVMYKQNIGLMIMPFAWDGYWLASRGKVADSKVHPPKYLEYSHYYQ 203 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 +YL K+F S+ K+ ++ + ++ N K Sbjct: 204 QYLNRNTLVKNFLSQIPYKNFCMAPYHYSSILYWAVGTLTYSVDNKTTTREYYKDPYYAT 263 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA---YRELYNEKES 294 Y+ T +V TNT H + + Sbjct: 264 WDHFPYSFIKNVFDMTSNQFGDGQVL-----------TNTNHCFPHGASQNKYMLMLAIG 312 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + + S +K + D + ++ + + I+SV S + + Sbjct: 313 NQLSRSSVEKEKIEKVLQD-CHYMHKRHRTGRDA-----------ITIFSVGFSPDQDTR 360 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 LR+C +++ +N ++ + + Sbjct: 361 YTLRQCASDPSKYYEINSDENVMPIAKSLARNVI 394 >gi|281427229|ref|NP_001094211.1| collagen, type VI, alpha 2 [Rattus norvegicus] gi|149043684|gb|EDL97135.1| procollagen, type VI, alpha 2, isoform CRA_a [Rattus norvegicus] Length = 1027 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 55/154 (35%), Gaps = 19/154 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS--- 256 + + + N+VN + ++ K+ + R+G + Y+ Sbjct: 630 SSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHEGTFEAIRLDDERV 689 Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N+L+ K + L T T A+ AY +L E + + F + ITDG Sbjct: 690 NSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRR-------QKTRVFAVVITDGR 742 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + LN +C + + + ++ + Sbjct: 743 HD---PRDDDLNLRALC----DRDVTVTAIGIGD 769 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 25/229 (10%), Positives = 71/229 (31%), Gaps = 16/229 (6%) Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 D + N S+ +P + + + + ++ +Q Sbjct: 35 DISTTDRNNNCPEKADCPVNVYFVLDTSESVAMQSPTDSLLYHMQQFVPQFISQLQN-EF 93 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + +++ + +P ++ L + T T A+ + +++ Sbjct: 94 YLEQVALSWRYGGLHFSDQVEVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQI 153 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + F + ITDG +G + E R G+++++VA Sbjct: 154 --------RQHVGRGVVNFAVVITDGHVTGNPCGGIKMQ----AERAREEGIRLFAVA-P 200 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR ++ + + N+ + +I + +++ + Sbjct: 201 NRNLNEQGLRDIANTPHELYR-NNYATMRPDSTEIDQDTINRIIKVMKH 248 >gi|22203747|ref|NP_666119.1| collagen alpha-2(VI) chain precursor [Mus musculus] gi|125987813|sp|Q02788|CO6A2_MOUSE RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor gi|21706759|gb|AAH34414.1| Collagen, type VI, alpha 2 [Mus musculus] gi|148699895|gb|EDL31842.1| procollagen, type VI, alpha 2 [Mus musculus] Length = 1034 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 55/154 (35%), Gaps = 19/154 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS--- 256 + + + N+VN + ++ K+ + R+G + Y+ Sbjct: 637 SSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHEGTFEAIRLDDERV 696 Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N+L+ K + L T T A+ AY +L E + + F + ITDG Sbjct: 697 NSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRR-------QKTRVFAVVITDGR 749 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + LN +C + + + ++ + Sbjct: 750 HD---PRDDDLNLRALC----DRDVTVTAIGIGD 776 Score = 43.4 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 68/220 (30%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ N D L+ V +Q + L + Sbjct: 50 NNCPEKADCPVNVYFVLDTSESVAMQSPTDSLLYHMQQFVPQFISQLQNEFYLDQVALSW 109 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L + T T A+ + +++ Sbjct: 110 RYGGLHFSDQVEVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQI--------R 161 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 + F + ITDG +G+ + E R G+++++VA + L Sbjct: 162 QHVGKGVVNFAVVITDGHVTGSPCGGIKMQ----AERAREEGIRLFAVA-PNRNLNEQGL 216 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R +S + + N+ + +I + +++ + Sbjct: 217 RDIANSPHELYR-NNYATMRPDSTEIDQDTINRIIKVMKH 255 >gi|49809|emb|CAA46541.1| alpha-2 collagen [Mus musculus] Length = 1029 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 55/154 (35%), Gaps = 19/154 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS--- 256 + + + N+VN + ++ K+ + R+G + Y+ Sbjct: 632 SSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHEGTFEAIRLDDERV 691 Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N+L+ K + L T T A+ AY +L E + + F + ITDG Sbjct: 692 NSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRR-------QKTRVFAVVITDGR 744 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + LN +C + + + ++ + Sbjct: 745 HD---PRDDDLNLRALC----DRDVTVTAIGIGD 771 >gi|49907|emb|CAA44206.1| alpha-2 collagen type VI, subunit [Mus musculus] Length = 764 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 55/154 (35%), Gaps = 19/154 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS--- 256 + + + N+VN + ++ K+ + R+G + Y+ Sbjct: 367 SSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHEGTFEAIRLDDERV 426 Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N+L+ K + L T T A+ AY +L E + + F + ITDG Sbjct: 427 NSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRR-------QKTRVFAVVITDGR 479 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + LN +C + + + ++ + Sbjct: 480 HD---PRDDDLNLRALC----DRDVTVTAIGIGD 506 >gi|332290703|ref|YP_004429312.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] gi|332168789|gb|AEE18044.1| von Willebrand factor type A [Krokinobacter diaphorus 4H-3-7-5] Length = 351 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 20/169 (11%), Positives = 62/169 (36%), Gaps = 8/169 (4%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 I + S ++ ++ V ++ PL+ + ++ ++ + + + ++ Sbjct: 140 ISNVMPAIPTDSFKMAIYWFDGEDVLHELQPLTTSAAMLQEAIDGITEDISNDPSTDLYG 199 Query: 284 AYRELYNEKESSHNTIGSTRL--KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 A + ++ + + L V+ TDG + A + + + Sbjct: 200 AVIKAAEIADNIIDVSENEDLFAAASVVIFTDGTDQAARY--TEEEAVDAVSNA-DDDVS 256 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +++ + + + +L K + F +++ EL F+ I+ + Q Sbjct: 257 FFTIGLGSEI-DEGILSKIGQTESVF--ASNAAELESVFNDISSGVAGQ 302 >gi|329851995|ref|ZP_08266676.1| hypothetical protein ABI_47640 [Asticcacaulis biprosthecum C19] gi|328839844|gb|EGF89417.1| hypothetical protein ABI_47640 [Asticcacaulis biprosthecum C19] Length = 399 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 20/151 (13%), Positives = 41/151 (27%), Gaps = 3/151 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 ++A+ + IT ID + ++ R +Q+A DA L G A + + Sbjct: 19 ISALCAAPLLYVITATIDHSAMVKDRFSLQAAADAGALMGAAKLALGSDELVFGAAEAAA 78 Query: 61 STIFKKQ---IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + + + I + P A + A+ + T Sbjct: 79 HQQIGDLRNPVTFEVVVDRSTGAVTVTGRSQHAPLIGFMSDGPTPISARATAEGLLKTPL 138 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 L+ + N + Sbjct: 139 CILQIGSSEMKIDDQATVRAGGCLVHSNKDV 169 >gi|294139780|ref|YP_003555758.1| type IV pilin biogenesis protein [Shewanella violacea DSS12] gi|293326249|dbj|BAJ00980.1| type IV pilin biogenesis protein, putative [Shewanella violacea DSS12] Length = 1195 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 10/73 (13%), Positives = 26/73 (35%), Gaps = 4/73 (5%) Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVNDSREL 376 N +NT + +++ S + L R G++++ + +L Sbjct: 420 GWMNTNDVNTNAAFPE--EQNVTTFTIGFSDGADDAAPLLTRTAELGGGEYYSAKSATQL 477 Query: 377 LESFDKITDKIQE 389 + ++ +I E Sbjct: 478 QAALSQVFSQILE 490 >gi|260824533|ref|XP_002607222.1| hypothetical protein BRAFLDRAFT_67980 [Branchiostoma floridae] gi|229292568|gb|EEN63232.1| hypothetical protein BRAFLDRAFT_67980 [Branchiostoma floridae] Length = 1897 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 53/179 (29%), Gaps = 26/179 (14%) Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + P K+ SK + +L +AG + Sbjct: 1567 TWSQGPVDIPAPTCRSKADIHVLVDGSKSVKTRNFPAVRQFILKLAAGFEIGP------- 1619 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYREL 288 R G + + NN + + K+ T T ++ Y E Sbjct: 1620 ---NKARFGVYQFAKDMQTEFKMNQYNNREALLDAIKKIEYMNQYQTKTGQSLKAVYEEF 1676 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + ++K +I ITDG+ + Q +Y++N G +++V V Sbjct: 1677 TKANGAR------DGVEKIIILITDGKATD--------QVRQPAQYVKNKGAHVFTVGV 1721 >gi|189238317|ref|XP_972173.2| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] Length = 604 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 34/270 (12%), Positives = 80/270 (29%), Gaps = 29/270 (10%) Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + VLD S SME ++ +K L P F Sbjct: 247 VHFFAPSGLQTFPKHVVFVLDHSGSMEGRKYEQLMQAM----DKILSDLNPDDLF--HIV 300 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 S+ K D + +Q+ + ++ +N+G Sbjct: 301 RFSENVSVWNFEKNKFDQV------------SFLQKPEYRNLDSFLAEFNLGDAAQVTEG 348 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 N+ + K + TN + + + + T+ + +IF+TDG Sbjct: 349 ---NIKKAKKIKDHDVDMGCTNIIGGLVVGLYLVRRTLQKFYEKNVETKHQPMIIFLTDG 405 Query: 315 ENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFA 369 + + + + +I + I+S++ + L + + G + Sbjct: 406 LPNEGISNPDKI--TKIVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYE 463 Query: 370 VNDSR-ELLESFDKITDKIQEQSVRIAPNR 398 D+ +L + ++ + ++ Sbjct: 464 AADAALQLQNFYRTVSSPLLRDVRFKYVDK 493 >gi|86134840|ref|ZP_01053422.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] gi|85821703|gb|EAQ42850.1| aerotolerance-related membrane protein [Polaribacter sp. MED152] Length = 349 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 25/128 (19%), Positives = 45/128 (35%), Gaps = 23/128 (17%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK----LNPYENTNTYPAMHHAYRELYN 290 S R+G I Y P++ + L + + T A+ A Sbjct: 128 SDRVGVIIYAGNSYP--LLPITTDHAAANMFLQNANPDMVSSQGTAINEALELAKTYYN- 184 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + +F++ I+DGE+ T Q+ + + N G+KIY++ V Sbjct: 185 ---------NDEQTNRFLVIISDGEDHQ-------EETKQVAQNLSNDGVKIYTIGVGTE 228 Query: 351 PEGQDLLR 358 G +R Sbjct: 229 KGGPIPMR 236 >gi|188527810|ref|YP_001910497.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Shi470] gi|188144050|gb|ACD48467.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Shi470] Length = 217 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 68/193 (35%), Gaps = 16/193 (8%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 S +I+ L ++ ++++ + KK L ++ I + G G Sbjct: 24 SGSMNESLGNCTRIEALNLCIQKMIETLKQ--EAKKELFSKMAIITF--GENGAVLHTPF 79 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +++ + + L+ T A A + ++ +T + K + I ++DGE Sbjct: 80 DDVKNINFK--PLSASGGTPLDQAFRLAKDLIEDK-----DTFPTKFYKLYSILVSDGEP 132 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + N ++ +S+ + + + F +D +L Sbjct: 133 NDDKWQKALSNFHHDGRSAKSVC---WSIFIGDRNTNPQVNKD--FGKDGVFYADDVEKL 187 Query: 377 LESFDKITDKIQE 389 + F+ +T I + Sbjct: 188 VGLFEIMTQTISK 200 >gi|119776240|ref|YP_928980.1| von Willebrand factor type A (vWA) domain-containing protein [Shewanella amazonensis SB2B] gi|119768740|gb|ABM01311.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Shewanella amazonensis SB2B] Length = 713 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 33/200 (16%), Positives = 72/200 (36%), Gaps = 21/200 (10%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE-KKNLSVRIGTIAYNIGIVGNQCTPL 255 K + LV +++ + + RI + P+ Sbjct: 327 VTQGRDWVFVLDKSGSMNGKYATLVEGVRQGLGKLPAQDRFRIILFDESTQEFSKGFVPV 386 Query: 256 -SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 SNN+N+ + + ++P T+ Y + A L ++ + V+ ITDG Sbjct: 387 DSNNINQALAWVEGISPGNGTDLYQGLKRALTPLDADRSTG------------VVLITDG 434 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDS 373 + + E M+ +++++ + LL T S+G +V+++ Sbjct: 435 VANVGVT-----EKRRFLELMQQQDVRLFTF-IMGNSANTPLLVPMTRLSNGVATSVSNA 488 Query: 374 RELLESFDKITDKIQEQSVR 393 +++ IT K+ Q+ R Sbjct: 489 DDIVGHLMNITSKLTHQAYR 508 >gi|301767378|ref|XP_002919104.1| PREDICTED: LOW QUALITY PROTEIN: collagen alpha-2(VI) chain-like [Ailuropoda melanoleuca] Length = 1011 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 38/224 (16%), Positives = 69/224 (30%), Gaps = 24/224 (10%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + P+K+ N ID L+ V +Q++ Sbjct: 26 IAPGNSERNSCPEKADCPINVYFVLDTSESVTMQSPIDSLLYHMKQFVRQFISQLQDETY 85 Query: 234 LSVRIGTIAY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELY 289 L + Y + V +P ++ L + + T T A+ + +E+ Sbjct: 86 LEQVALSWRYGGLHFSDVVRVFSPPDSDRASFTKSLESIVSIRKGTFTDCALANMTQEI- 144 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + S F + ITDG +G+ L E R G++I++VA Sbjct: 145 -------RQLKSKGGVHFAVVITDGYVTGSPCGGIKLQ----AERAREEGIRIFTVA-PD 192 Query: 350 PPEGQDLLRKCTDSS-----GQFFAVNDSRELLESFDKITDKIQ 388 + LR + V E+ + D I I+ Sbjct: 193 QVPNEQGLRDMASMPLELYRNNYATVRPDLEIDQ--DTINRIIK 234 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 55/154 (35%), Gaps = 19/154 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS--- 256 + + + N+VN + ++ K+ + R+G + Y+ Sbjct: 620 SSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHEGTFEAIQLDDERI 679 Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 ++L+ K + L T T A+ AY +L E + + F + ITDG Sbjct: 680 DSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRR-------QKTRVFAVVITDGR 732 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + LN +C N + + ++ + Sbjct: 733 HD---PRDDDLNLRALC----NHDVTVTAIGIGD 759 >gi|148669822|gb|EDL01769.1| von Willebrand factor A domain containing 2 [Mus musculus] Length = 748 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 44/128 (34%), Gaps = 17/128 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G + + + EVK + + T T A+ L Sbjct: 39 DISPGRVRVGALQFGSTPHLEFPLDSFSTRQEVKESIKGIVFKGGRTETGLALK----RL 94 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + +I +TDG++ G A + +R G+ +++V V Sbjct: 95 SRGFPGGR----NGSVPQILIIVTDGKSQGPVALP--------AKQLRERGIVVFAVGVR 142 Query: 349 APPEGQDL 356 P + L Sbjct: 143 FPRWDELL 150 >gi|110556625|ref|NP_997091.3| calcium-activated chloride channel regulator 4 [Mus musculus] gi|148680067|gb|EDL12014.1| mCG119588 [Mus musculus] gi|148922513|gb|AAI46305.1| Chloride channel calcium activated 6 [synthetic construct] gi|151556758|gb|AAI48748.1| Chloride channel calcium activated 6 [synthetic construct] Length = 925 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 23/207 (11%), Positives = 72/207 (34%), Gaps = 35/207 (16%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + +++ + ++A ++ I +G + ++ N+ Sbjct: 315 VSGSMTSYDRLNRMNQAAKYFLSQI-------IENRSWVGMVHFSSQATIVHELIQINSD 367 Query: 260 NEVKSRLNKL--NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 E L L + T+ + A++ N + + T ++ ++DGE+S Sbjct: 368 IERNQLLQTLPTSANGGTSICSGIKAAFQVFKNGEYQTDGTE--------ILLLSDGEDS 419 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSSGQFFAVNDSRE- 375 + ++++G ++ +A+ + + + G D + Sbjct: 420 T---------AKDCIDEVKDSGSIVHFIALGPSADLAVTNMSIL--TGGNHKLATDEAQN 468 Query: 376 --LLESFDKITDK---IQEQSVRIAPN 397 L+++F + + I ++S+++ Sbjct: 469 NGLIDAFGALASENADITQKSLQLESK 495 >gi|109733269|gb|AAI16637.1| Von Willebrand factor A domain containing 2 [Mus musculus] Length = 791 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 44/128 (34%), Gaps = 17/128 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G + + + EVK + + T T A+ L Sbjct: 82 DISPGRVRVGALQFGSTPHLEFPLDSFSTRQEVKESIKGIVFKGGRTETGLALK----RL 137 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + +I +TDG++ G A + +R G+ +++V V Sbjct: 138 SRGFPGGR----NGSVPQILIIVTDGKSQGPVALP--------AKQLRERGIVVFAVGVR 185 Query: 349 APPEGQDL 356 P + L Sbjct: 186 FPRWDELL 193 >gi|26325252|dbj|BAC26380.1| unnamed protein product [Mus musculus] Length = 721 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 44/128 (34%), Gaps = 17/128 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G + + + EVK + + T T A+ L Sbjct: 82 DISPGRVRVGALQFGSTPHLEFPLDSFSTRQEVKESIKGIVFKGGRTETGLALK----RL 137 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + +I +TDG++ G A + +R G+ +++V V Sbjct: 138 SRGFPGGR----NGSVPQILIIVTDGKSQGPVALP--------AKQLRERGIVVFAVGVR 185 Query: 349 APPEGQDL 356 P + L Sbjct: 186 FPRWDELL 193 >gi|42741661|ref|NP_766428.2| von Willebrand factor A domain-containing protein 2 precursor [Mus musculus] gi|81893914|sp|Q70UZ7|VWA2_MOUSE RecName: Full=von Willebrand factor A domain-containing protein 2; AltName: Full=A domain-containing protein similar to matrilin and collagen; Short=AMACO; Flags: Precursor gi|27657433|emb|CAD60277.1| AMACO [Mus musculus] gi|74225702|dbj|BAE21683.1| unnamed protein product [Mus musculus] Length = 791 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 22/128 (17%), Positives = 44/128 (34%), Gaps = 17/128 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + VR+G + + + EVK + + T T A+ L Sbjct: 82 DISPGRVRVGALQFGSTPHLEFPLDSFSTRQEVKESIKGIVFKGGRTETGLALK----RL 137 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + +I +TDG++ G A + +R G+ +++V V Sbjct: 138 SRGFPGGR----NGSVPQILIIVTDGKSQGPVALP--------AKQLRERGIVVFAVGVR 185 Query: 349 APPEGQDL 356 P + L Sbjct: 186 FPRWDELL 193 >gi|294460157|gb|ADE75661.1| unknown [Picea sitchensis] Length = 350 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 4/43 (9%), Positives = 15/43 (34%) Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + +++ + + S G F + + ++F + Sbjct: 3 IPVHTFGFGTDHDSASMHSISETSGGTFSFIETESIIQDAFAQ 45 >gi|254787962|ref|YP_003075391.1| PKD domain-containing protein [Teredinibacter turnerae T7901] gi|237687416|gb|ACR14680.1| PKD domain protein [Teredinibacter turnerae T7901] Length = 1083 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 21/206 (10%), Positives = 53/206 (25%), Gaps = 32/206 (15%) Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL-----SNNLNEVKSRLNKLNPYENTNT 277 +IG I ++ + L S ++++ +N ++ T Sbjct: 436 EAANTYVNISAAEDKIGIIDFDGSARVVKPFTLLGQQGSTERIQLENAINGIDAVGGTEI 495 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 ++ A L + + N+ + + I + + Sbjct: 496 AGSLGLACSTLMASWQDDIIEQQHMVNDIKALVEEFPNNTISHSNARRAAIPGIAHAINS 555 Query: 338 AGMKI-------------------------YSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 + I Y++ G+ L S G + A + Sbjct: 556 KKIAILLTDGDTPSSYSQANQCFIENGWMLYTIGFG-GANGEKLAPLAEASGGYYIAADS 614 Query: 373 S-RELLESFDKITDKIQEQSVRIAPN 397 S +L ++ +I + + Sbjct: 615 SLLDLNCAYQQIRSHAAGEQASECES 640 >gi|311252831|ref|XP_003125289.1| PREDICTED: vitrin-like isoform 1 [Sus scrofa] Length = 656 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/269 (10%), Positives = 71/269 (26%), Gaps = 26/269 (9%) Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERS--SENLAISICMVLDV 156 +S + + + L + + + + S ++ + V Sbjct: 163 GETTKAYQSPSVPGTTAQPVTLMQVPGTTAVEATHTALPKPSPSAGFTTSSLRLQPVGQR 222 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE- 215 SR ++ + ++ N P + S S + ID Sbjct: 223 SRELDLWCTTTYTNSQNSPQANPGFVPKEELSTQSLEPVSQGDPSCKVDLSFLIDGSSSI 282 Query: 216 -------SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 L + Q +G + Y N ++K+ + K Sbjct: 283 GKRRFRIQKQFLADVAQALDIGPAGPL--MGVVQYGDNPATQFNLKTHMNSRDLKTAIEK 340 Query: 269 LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + +N A+ + +++ + + + + DG + Sbjct: 341 ITQRGGLSNVGRAISFVTKNFFSKSNGNRGGAPN-----VAVVMVDGWPTD--------K 387 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + R +G+ I+ + + E + Sbjct: 388 VEEASRLARESGINIFFITIEGAVENEKQ 416 >gi|301614659|ref|XP_002936803.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Xenopus (Silurana) tropicalis] Length = 720 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 27/183 (14%), Positives = 60/183 (32%), Gaps = 20/183 (10%) Query: 214 IESAGNLVNSIQKA-IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + +V+ +++ ++ S R+ + Y+ +V Q KSR+ + Sbjct: 77 KKFVLQMVDKLKEVKPNSGRSFSWRMALLQYSSTVVIEQTFRDWKGPENFKSRIAPIAYI 136 Query: 273 E-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T +A L + K I ITDG + + + Sbjct: 137 GHGTYTS----YAITNLTQIYMNEGTHKS----VKVAILITDGVD-----HPRNPDIFAA 183 Query: 332 CEYMRNAGMKIYSVA---VSAPPEGQDLLRKCTDSSGQ--FFAVNDSRELLESFDKITDK 386 ++ +K+++V V+ LR F + D + + ++ + Sbjct: 184 TSNAKHHDIKLFTVGMTSVAKETANNAKLRLLASVPATRFVFHMLDPDVVDKILKEVKEL 243 Query: 387 IQE 389 +E Sbjct: 244 AEE 246 >gi|222637454|gb|EEE67586.1| hypothetical protein OsJ_25118 [Oryza sativa Japonica Group] Length = 755 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/260 (10%), Positives = 70/260 (26%), Gaps = 39/260 (15%) Query: 141 RSSENLAISICMVL---DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 + + + + D+S + + ++ +LLP + + K Sbjct: 279 ENWSSKDFNFSYSVYSGDLSGGVLVQPSTLRDYDDRDRFCIFLLPGGGNRKVFRKAVVFV 338 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + ++ + + ++ + + I +N + Sbjct: 339 IDTSGSMQGHP-LENVKNAMSTALSELTEGDY--------FNIITFNDELHSFSSCLEKV 389 Query: 258 NLNEVKSRLN----KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 N + S L+ T+ ++ A L + + + +TD Sbjct: 390 NEKSIASALDWINLNFVAGGGTDIMHPLNEAMASLSSAHDVLPQ----------IFLMTD 439 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 G T+ T I ++ + + + + LR + Sbjct: 440 GSVDDEHNICQTVKTELISRGSKSPRISTFGLGLYCNHY---FLRMLASIGRGHYDA--- 493 Query: 374 RELLESFDKITDKIQEQSVR 393 +F+ I+ Q +R Sbjct: 494 -----AFET--GSIESQVLR 506 >gi|218200012|gb|EEC82439.1| hypothetical protein OsI_26857 [Oryza sativa Indica Group] Length = 863 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/260 (10%), Positives = 70/260 (26%), Gaps = 39/260 (15%) Query: 141 RSSENLAISICMVL---DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 + + + + D+S + + ++ +LLP + + K Sbjct: 387 ENWSSKDFNFSYSVYSGDLSGGVLVQPSTLRDYDDRDRFCIFLLPGGGNRKVFRKAVVFV 446 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + ++ + + ++ + + I +N + Sbjct: 447 IDTSGSMQGHP-LENVKNAMSTALSELTEGDY--------FNIITFNDELHSFSSCLEKV 497 Query: 258 NLNEVKSRLN----KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 N + S L+ T+ ++ A L + + + +TD Sbjct: 498 NEKSIASALDWINLNFVAGGGTDIMHPLNEAMASLSSAHDVLPQ----------IFLMTD 547 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 G T+ T I ++ + + + + LR + Sbjct: 548 GSVDDEHNICQTVKTELISRGSKSPRISTFGLGLYCNHY---FLRMLASIGRGHYDA--- 601 Query: 374 RELLESFDKITDKIQEQSVR 393 +F+ I+ Q +R Sbjct: 602 -----AFET--GSIESQVLR 614 >gi|223462563|gb|AAI50654.1| Von Willebrand factor A domain containing 3B [Homo sapiens] Length = 1294 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/268 (9%), Positives = 80/268 (29%), Gaps = 32/268 (11%) Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 + +I + + ++ + + +++++ Y + Sbjct: 418 ADGVVDIKAKPENESVQTSAETNKKTVHAKYCSRFVHAPWKDGSLVHVNITKEKCKWYSE 477 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA--PAPAPANRKIDVLIESAGNLVNSI 224 + + + +S + + Y K+D++ + + Sbjct: 478 RIHTALARIRRRIKWLQDGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQ 537 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 K + V+ A + +NL + +S + + +TNT A+ A Sbjct: 538 LKYKSKFN--FVKFDGQAVAWREQLAEVNE--DNLEQAQSWIRDIKIGSSTNTLSALKTA 593 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN-AGMKIY 343 + + + + + +TDG + + ++ + IY Sbjct: 594 F---------------ADKETQAIYLLTDGRPDQP--------PETVIDQVKRFQEIPIY 630 Query: 344 SVAVSAPPE-GQDLLRKCTD-SSGQFFA 369 +++ + E L++ + G+F Sbjct: 631 TISFNYNDEIANRFLKEVAALTGGEFHF 658 >gi|221040994|dbj|BAH12174.1| unnamed protein product [Homo sapiens] Length = 951 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/268 (9%), Positives = 80/268 (29%), Gaps = 32/268 (11%) Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 + +I + + ++ + + +++++ Y + Sbjct: 75 ADGVVDIKAKPENESVQTSAETNKKTVHAKYCSRFVHAPWKDGSLVHVNITKEKCKWYSE 134 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA--PAPAPANRKIDVLIESAGNLVNSI 224 + + + +S + + Y K+D++ + + Sbjct: 135 RIHTALARIRRRIKWLQDGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQ 194 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 K + V+ A + +NL + +S + + +TNT A+ A Sbjct: 195 LKYKSKFN--FVKFDGQAVAWREQLAEVNE--DNLEQAQSWIRDIKIGSSTNTLSALKTA 250 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN-AGMKIY 343 + + + + + +TDG + + ++ + IY Sbjct: 251 F---------------ADKETQAIYLLTDGRPDQP--------PETVIDQVKRFQEIPIY 287 Query: 344 SVAVSAPPE-GQDLLRKCTD-SSGQFFA 369 +++ + E L++ + G+F Sbjct: 288 TISFNYNDEIANRFLKEVAALTGGEFHF 315 >gi|119622317|gb|EAX01912.1| hypothetical protein MGC26733, isoform CRA_a [Homo sapiens] Length = 1080 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/268 (9%), Positives = 80/268 (29%), Gaps = 32/268 (11%) Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 + +I + + ++ + + +++++ Y + Sbjct: 418 ADGVVDIKAKPENESVQTSAETNKKTVHAKYCSRFVHAPWKDGSLVHVNITKEKCKWYSE 477 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA--PAPAPANRKIDVLIESAGNLVNSI 224 + + + +S + + Y K+D++ + + Sbjct: 478 RIHTALARIRRRIKWLQDGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQ 537 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 K + V+ A + +NL + +S + + +TNT A+ A Sbjct: 538 LKYKSKFN--FVKFDGQAVAWREQLAEVNE--DNLEQAQSWIRDIKIGSSTNTLSALKTA 593 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN-AGMKIY 343 + + + + + +TDG + + ++ + IY Sbjct: 594 F---------------ADKETQAIYLLTDGRPDQP--------PETVIDQVKRFQEIPIY 630 Query: 344 SVAVSAPPE-GQDLLRKCTD-SSGQFFA 369 +++ + E L++ + G+F Sbjct: 631 TISFNYNDEIANRFLKEVAALTGGEFHF 658 >gi|118918435|ref|NP_659429.4| von Willebrand factor A domain-containing protein 3B [Homo sapiens] gi|296439299|sp|Q502W6|VWA3B_HUMAN RecName: Full=von Willebrand factor A domain-containing protein 3B Length = 1294 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/268 (9%), Positives = 80/268 (29%), Gaps = 32/268 (11%) Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 + +I + + ++ + + +++++ Y + Sbjct: 418 ADGVVDIKAKPENESVQTSAETNKKTVHAKYCSRFVHAPWKDGSLVHVNITKEKCKWYSE 477 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA--PAPAPANRKIDVLIESAGNLVNSI 224 + + + +S + + Y K+D++ + + Sbjct: 478 RIHTALARIRRRIKWLQDGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQ 537 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 K + V+ A + +NL + +S + + +TNT A+ A Sbjct: 538 LKYKSKFN--FVKFDGQAVAWREQLAEVNE--DNLEQAQSWIRDIKIGSSTNTLSALKTA 593 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN-AGMKIY 343 + + + + + +TDG + + ++ + IY Sbjct: 594 F---------------ADKETQAIYLLTDGRPDQP--------PETVIDQVKRFQEIPIY 630 Query: 344 SVAVSAPPE-GQDLLRKCTD-SSGQFFA 369 +++ + E L++ + G+F Sbjct: 631 TISFNYNDEIANRFLKEVAALTGGEFHF 658 >gi|118388811|ref|XP_001027501.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|89309271|gb|EAS07259.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 1543 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 10/149 (6%), Positives = 51/149 (34%), Gaps = 3/149 (2%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + I + + +++ +I+ + ++ + I S+ ++ Sbjct: 930 CTGSMSGWIQAVKDEILSIIAAIKDINKGNTSIRISFIGYRDYGSIQRFSIFDFSSEIDS 989 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG---STRLKKFVIFITDGENSG 318 ++ LN++ ++ + ++ ++ S + + ++ Sbjct: 990 FQNFLNQIQAEGGNDSEEDVAGGFKHANLQQWKSQAKYAVFIADCPAHGREYSDGHDDRY 1049 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + ++ Q + + G+++Y++ + Sbjct: 1050 PDGDPDGVDLKQEFKNLIKKGVQLYTIQI 1078 >gi|30268323|emb|CAD89964.1| hypothetical protein [Homo sapiens] Length = 1060 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 26/268 (9%), Positives = 80/268 (29%), Gaps = 32/268 (11%) Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 + +I + + ++ + + +++++ Y + Sbjct: 418 ADGVVDIKAKPENESVQTSAETNKKTVHAKYCSRFVHAPWKDGSLVHVNITKEKCKWYSE 477 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA--PAPAPANRKIDVLIESAGNLVNSI 224 + + + +S + + Y K+D++ + + Sbjct: 478 RIHTALARIRRRIKWLQDGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQ 537 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 K + V+ A + +NL + +S + + +TNT A+ A Sbjct: 538 LKYKSKFN--FVKFDGQAVAWREQLAEVNE--DNLEQAQSWIRDIKIGSSTNTLSALKTA 593 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN-AGMKIY 343 + + + + + +TDG + + ++ + IY Sbjct: 594 F---------------ADKETQAIYLLTDGRPDQP--------PETVIDQVKRFQEIPIY 630 Query: 344 SVAVSAPPE-GQDLLRKCTD-SSGQFFA 369 +++ + E L++ + G+F Sbjct: 631 TISFNYNDEIANRFLKEVAALTGGEFHF 658 >gi|291398581|ref|XP_002715573.1| PREDICTED: chloride channel accessory 4 [Oryctolagus cuniculus] Length = 874 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 20/119 (16%), Positives = 47/119 (39%), Gaps = 21/119 (17%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T + A++ + S ++ +TDGE+S AS+ N + Sbjct: 328 AGGGTQICLGIRAAFQVFKQQ--------NSEIDGSEIVLLTDGEDSTASSCVNEVI--- 376 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKITDK 386 +G I+ +A+ P +++ + G + +D+ + L+++F +T + Sbjct: 377 ------ESGTIIHFIALG-PSADASVIQMSNLTGGSHYYASDNAQNNGLIDAFGALTSE 428 >gi|332358821|gb|EGJ36643.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1056] Length = 434 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 45/354 (12%), Positives = 105/354 (29%), Gaps = 49/354 (13%) Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 ++ ++ I K+ + + E+ LI LT Sbjct: 88 NYIGLSPDGKKVINYIWNKSTKSWDESVLGTNSLYDMQLDLEFKADESYQDNRLINYNLT 147 Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 S + + A++ V+ + + ++ + Sbjct: 148 GQYKNSKNKLSIDTAISALNTKQVISKVAKGKKGVALAYRNDPIEGQVNT-AVTFVFDTS 206 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 S P + ++++L A LV+ +++ N+SV + + + + Sbjct: 207 GSMGYGLWNQKLEPTDSRTRMNILKTKANLLVDDLKEI----GNVSVNLVRFSGDASYIQ 262 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 L + + +K+++ L TN + + L + K+V+ Sbjct: 263 EDFVELDKDTDTIKTKIKALPTSWITNPGDGLRYGLVSLQ----------RNPAQLKYVV 312 Query: 310 FITDGENSGASAYQN-------TLNTLQICEYMRN-------------------AGMK-I 342 +TDG + +A + T N + ++ +G+K I Sbjct: 313 LLTDGIPNAYTASPDGIGKYDLTANFPTNGKQIKADQPVSLTTEYVGQVAKTFGSGVKRI 372 Query: 343 YSVAVSAPPEGQDLLRKCTDS-------SGQFFAVNDSRELLESFDKITDKIQE 389 + S + +K D F + L ++F I +IQ+ Sbjct: 373 SVIGFSGNVKEIKDGQKIADQIKTVGKVDSTFVIATNEAALEQTFADIKKQIQQ 426 >gi|195115748|ref|XP_002002418.1| GI17380 [Drosophila mojavensis] gi|193912993|gb|EDW11860.1| GI17380 [Drosophila mojavensis] Length = 1220 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 40/336 (11%), Positives = 96/336 (28%), Gaps = 21/336 (6%) Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQ--YIAESKAQYEIPT 115 T F K+ E ++ + + NP Y ++ Sbjct: 138 TYAETNFTYYSSKYSPFNGNSSEELEANVKEYEYMYREMMLNPDTHFYNISVDTEHSSVH 197 Query: 116 ENLFLKGLIPSALTNLSLRST---GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 + P+ L + + + A+S + + ++ Sbjct: 198 VPSNVWDRAPNVLKTIQWSEYLDEVFRQNYQSDPALSWQYFGSDTGILRHYP-AAQWYDS 256 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAP-----APAPANRKIDVLIESAGNLVNSIQKA 227 K+S++ + T SK + + + V + +++++ Sbjct: 257 RANKLDADTYDCRKRSWYIETATCSKDIVILLDHSGSMTGHRHHVAKFTIRSILDTFSNN 316 Query: 228 IQEK---KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHH 283 + V+ + +V N++ S + +L+ P N A H Sbjct: 317 DFFTIFRYSNDVQGIIPCFKDALVQATP----ENIDVFNSAIAELDDPEGYANLTLAYEH 372 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A++ L N ST + ++ +TDG + N ++I+ Sbjct: 373 AFQILRNYYIKRRCNETSTC-NQAIMLVTDGVAGNTTDIFEKYNWGNGENGTSRMNVRIF 431 Query: 344 SVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLE 378 + + ++ + G + V E+ E Sbjct: 432 TYLLGKEVTKVREIQWMACLNRGYYSHVQTLDEVHE 467 >gi|297619214|ref|YP_003707319.1| Magnesium chelatase [Methanococcus voltae A3] gi|297378191|gb|ADI36346.1| Magnesium chelatase [Methanococcus voltae A3] Length = 264 Score = 47.6 bits (111), Expect = 0.004, Method: Composition-based stats. Identities = 28/272 (10%), Positives = 85/272 (31%), Gaps = 35/272 (12%) Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 +++ S + + + +S+ N I+ + S + + N N ++ + Sbjct: 15 GKRVSSYSSKGSYVKYKSNSNNDIAFDATIKASAPFQKFRRENSNKNLSLYIESEDMKYK 74 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 KK S + A ++++ + +L+ + ++ +A+ Sbjct: 75 VKKKNISTHIMFGVDASGSMGVLKRMEASKGAVVSLL-------MDAYQNRDKVSMVAFR 127 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 P ++++ ++ L L T Y A AY + + Sbjct: 128 -KDKAELVVPFTSSVELAEANLVGLKTGGRTPLYDAFTKAYETF------EIEMRKNPNM 180 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM-KIYS---------VAVSAPPEGQ 354 ++ I+D + + + +I E + + I+ + + Sbjct: 181 IPILVMISDFKPNVNINKNYFVEICEIVEKLLEKNIRTIFIDTEKKSFVKIGIG------ 234 Query: 355 DLLRKCTDSSG-QFFAVNDSRELLESFDKITD 385 G ++ +++ + + + I+ Sbjct: 235 ---ENIAKKYGIDYYNIDNLTD-DDILNTISS 262 >gi|302381356|ref|YP_003817179.1| hypothetical protein Bresu_0241 [Brevundimonas subvibrioides ATCC 15264] gi|302191984|gb|ADK99555.1| hypothetical protein Bresu_0241 [Brevundimonas subvibrioides ATCC 15264] Length = 416 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 15/146 (10%), Positives = 36/146 (24%), Gaps = 10/146 (6%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ L A++L + R ++Q D A L+G + + Sbjct: 25 IFALSTPAVVLISVGAVELGSVQSNRAKLQDIADTAALAGANELALAIDDAAAIERAKVF 84 Query: 61 ST----------IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ 110 +I L+ + + + + Sbjct: 85 IDGHVSEWKSAPAVTPEIAVILRDKQRVIQVVLKGHTPSFFANMLPPGGWKYHAEARAVA 144 Query: 111 YEIPTENLFLKGLIPSALTNLSLRST 136 + + + G S + N+ Sbjct: 145 VGLTPLCVLITGSSGSKMLNVKDSGR 170 >gi|321460552|gb|EFX71593.1| hypothetical protein DAPPUDRAFT_255504 [Daphnia pulex] Length = 983 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 19/110 (17%), Positives = 41/110 (37%), Gaps = 18/110 (16%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T + A + L G++ ++ +TDG+NS + + + E Sbjct: 406 TCIGCGLELAVQMLNEN-------KGTSETGGVIVLVTDGKNSPGYLHISDVQ-----ED 453 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSR---ELLESF 380 + A +++ ++A + L + G+ + V D L E+F Sbjct: 454 ILKAKIRVITIAFGEKAD--KNLEDLARQTDGKSYFVKDEDGGAALQEAF 501 >gi|149925131|ref|ZP_01913438.1| hypothetical protein PPSIR1_40904 [Plesiocystis pacifica SIR-1] gi|149813994|gb|EDM73632.1| hypothetical protein PPSIR1_40904 [Plesiocystis pacifica SIR-1] Length = 413 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 29/297 (9%), Positives = 70/297 (23%), Gaps = 32/297 (10%) Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 + T S E+ C ++++ + D + N Sbjct: 44 STTTAEEDSGSTTAEEDESTTGPDCGTVEITPEYVPPNVMLVVDASGSMVNNSW--DHDL 101 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 + T + + + S S + +G Sbjct: 102 DPNTPQVTRWNTLHGVVDNVMTNFGSTMYAGVQRFPSADACPDATPQSSNCYNLGSCIVG 161 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPY-----ENTNTYPAMHHAYRELYNEKESSHNTIGS 301 ++N + + + + T ++ A L + E++ Sbjct: 162 TQPEVGVG-ADNGDAILAAIPGPGAGNTEIVGGTPATLGINSAVDHLLAQNETNPR-YIL 219 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV---------SAPPE 352 TD + + + G+ Y V + Sbjct: 220 FITDGAANCNTDLPYPEYIESYDETLPTTVEAAFEDEGITTYVVGIDIVDMLQGAGTDGS 279 Query: 353 GQDL----LRKCTDSSG----------QFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + L + G +FF + +ELL++ I + + E + + Sbjct: 280 PEANPFVRLNDVALAGGAPKNEGMDAEKFFNTTNQQELLDALQVILEGVTECVIDLT 336 >gi|7463254|pir||E70121 hypothetical protein BB0173 - Lyme disease spirochete Length = 340 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 25/172 (14%), Positives = 58/172 (33%), Gaps = 42/172 (24%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + +L+ ++ + + A L S LK+ + Sbjct: 154 VVPITTDREFFNKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEALKRSI 203 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------------------- 349 + +TDG + Q+ + +KIYS+ + + Sbjct: 204 VVLTDGVVNSDEIKD------QVINLAQGLNVKIYSIGIGSSEEFSVEFKLRSGKFYQGS 257 Query: 350 --PPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + ++ + G F++VND + + K E+ ++IA + Sbjct: 258 FKEVYDPSMLVEISNKTGGLFYSVNDDFSFQFAIQDFSKKENLERKIKIAVD 309 >gi|323444971|gb|EGB01833.1| hypothetical protein AURANDRAFT_69450 [Aureococcus anophagefferens] Length = 376 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 49/134 (36%), Gaps = 14/134 (10%) Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +V + L KL P +TN + + EL G+ + V+ +TDG + Sbjct: 66 AKVHAALEKLAPGTSTNLWGGLELGVDEL---------VGGAGDNARAVLLLTDGVPN-- 114 Query: 320 SAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 ++ + + G + +++ + LL + G F V D+ + Sbjct: 115 NSPPEGEVAALRAKRLTKDGSETVAVFAAGFGYALKSDLLLSLAREGGGLFSFVPDAGMV 174 Query: 377 LESFDKITDKIQEQ 390 SF+ + ++ Sbjct: 175 GTSFNHLVASLRSS 188 >gi|146343645|ref|YP_001208693.1| hypothetical protein BRADO6887 [Bradyrhizobium sp. ORS278] gi|146196451|emb|CAL80478.1| conserved hypothetical protein; protein containing a von Willebrand factor type A (VWA) domain; putative signal peptide [Bradyrhizobium sp. ORS278] Length = 755 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 55/164 (33%), Gaps = 20/164 (12%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR---LNKLNPYENTNTYPAMHHAYRELY 289 + R I ++ + + + + V + ++ L+ T PAM A Sbjct: 389 QPNDRFNVIRFDDTMTVLFPSSVPADAEHVGNATRFVSSLDARGGTEMVPAMRAALT--- 445 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + + V+F+TDG A N + MR +I+ V + + Sbjct: 446 -------DDGSDSDRMRQVVFLTDG------AIGNDQQLFETITAMRGRS-RIFMVGIGS 491 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 P + R G F + ++ E + K++ V Sbjct: 492 APNTYLMSRAAELGRGAFTHIGSVEQVEERMRDLFAKLENPVVT 535 >gi|196232430|ref|ZP_03131283.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] gi|196223502|gb|EDY18019.1| von Willebrand factor type A [Chthoniobacter flavus Ellin428] Length = 879 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 23/145 (15%), Positives = 55/145 (37%), Gaps = 14/145 (9%) Query: 259 LNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNEKESSHN----TIGSTRLKKFVIFITD 313 + ++ + Y +M A+++L + + + + +K ++D Sbjct: 484 KGAAEQKILSITAGGGGIYIYTSMVEAFQQLRDIPARVKHLLLFSDAADAEEKAAGEMSD 543 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSS-GQFFAVN 371 G +G ++ L + M A + V + + LR+ + G+F+ + Sbjct: 544 GIRTGGNS-------LDLASAMLAAKITTSVVGLGTEQDKDTPFLRQLAERGSGRFYLTD 596 Query: 372 DSRELLESFDKITDKIQEQSVRIAP 396 D+ L + F T K+ + S+ P Sbjct: 597 DATTLPQIFSTETMKVAQSSLIEEP 621 >gi|311070712|ref|YP_003975635.1| hypothetical protein BATR1942_18935 [Bacillus atrophaeus 1942] gi|310871229|gb|ADP34704.1| hypothetical protein BATR1942_18935 [Bacillus atrophaeus 1942] Length = 245 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 13/90 (14%), Positives = 35/90 (38%), Gaps = 10/90 (11%) Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS----APPEGQDLLRKCTDS 363 ++ ITDG ++ + L + + + G+ + + + E + + Sbjct: 9 ILLITDGCSNHGE------DPLAMAAFAKEQGITVNVIGIMEENTVDHEAMKEVEGIALA 62 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G V + +L ++ +T K Q+++ Sbjct: 63 GGGVHQVVYASQLSQTVQMVTKKAMTQTLQ 92 >gi|66809097|ref|XP_638271.1| hypothetical protein DDB_G0285405 [Dictyostelium discoideum AX4] gi|60466726|gb|EAL64777.1| hypothetical protein DDB_G0285405 [Dictyostelium discoideum AX4] Length = 923 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 32/208 (15%), Positives = 65/208 (31%), Gaps = 10/208 (4%) Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 K T K ++D ++ + +++ AI + Sbjct: 157 KSNFNDNQANEIVISFDTTGKMFSYFIEFKNQLDNILSILFKRLPTLKIAIMAHGDYCDD 216 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 +Y + P +NN+ E+KS +N + + ++ R + Sbjct: 217 SRQTSYFNRSIS--ILPFTNNIEEIKSFVNSIERTNGDDAPECYEYSMR-------KARE 267 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S KK I I D S Y+ G+K+Y V + Sbjct: 268 LPWSIYSKKSFIIIGDDVPHPPSYTDQHFYWRNELHYLTKIGIKVYGVQCHNNSHAKLFY 327 Query: 358 RKCTD-SSGQFFAVNDSRELLESFDKIT 384 + S+G++ + DS+++ + I Sbjct: 328 EELATLSNGRYVHLKDSKDIGDLIVLIA 355 >gi|194226347|ref|XP_001489610.2| PREDICTED: similar to Collagen, type VI, alpha 2 [Equus caballus] Length = 1019 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 27/154 (17%), Positives = 55/154 (35%), Gaps = 19/154 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS--- 256 + + + N+VN + ++ K+ + R+G + Y+ Sbjct: 622 SSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHEGTFEAIQLDDERI 681 Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N+L+ K + L T T A+ AY +L E + + F + ITDG Sbjct: 682 NSLSSFKEAVKNLEWIAGGTWTPSALKFAYNQLIKESRR-------QKTRVFAVVITDGR 734 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + LN +C N + + ++ + Sbjct: 735 HD---PRDDDLNLRALC----NHDVTVTAIGIGD 761 >gi|237737389|ref|ZP_04567870.1| BatB protein [Fusobacterium mortiferum ATCC 9817] gi|229421251|gb|EEO36298.1| BatB protein [Fusobacterium mortiferum ATCC 9817] Length = 322 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 28/176 (15%), Positives = 63/176 (35%), Gaps = 54/176 (30%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEK 292 R+G I ++ PL+++ N ++ +N ++ T Y A+ A + Sbjct: 120 RVGFIPFSDSAYIQ--MPLTDDYNITQNYINAIDTTLISGGGTELYQALELAEKSFKEIG 177 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + K VI I+DG + +L+ + + + +YS+ V Sbjct: 178 SEN----------KTVIVISDG----GDFDKKSLDFV------KENKIDVYSIGVGTKEG 217 Query: 353 ---------------------------GQDLLRKCT-DSSGQFFAVNDSRELLESF 380 D L+K + +++G+++ VN+ + ++F Sbjct: 218 NVIPEYLNGVKRGFIKDESGSAVISKLNSDFLQKISNENNGKYYEVNNLVDTSKNF 273 >gi|73960095|ref|XP_547299.2| PREDICTED: similar to calcium activated chloride channel 1 precursor [Canis familiaris] Length = 911 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 46/118 (38%), Gaps = 22/118 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + A+ + + + ++ +TDGE++ S+ N Sbjct: 379 ATGGTSICSGLRSAFAVIKKKYPTDGAE---------IVLLTDGEDNTISSCFN------ 423 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE---LLESFDKITD 385 ++ +G I++VA+ P ++L + G +D + L+++F ++ Sbjct: 424 ---EVKQSGAVIHTVALG-PSAAKELEELSKMTGGLQTYASDQAQNNGLIDAFGALSS 477 >gi|297490310|ref|XP_002698147.1| PREDICTED: integrin alpha X-like [Bos taurus] gi|296473225|gb|DAA15340.1| integrin alpha X-like [Bos taurus] Length = 889 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 11/67 (16%), Positives = 24/67 (35%), Gaps = 5/67 (7%) Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR---KCTDSSG--QFFAVNDSRELLESF 380 L + AG+ Y++ V + + L+ + F V + L + Sbjct: 112 LGYKDVIPRAEAAGIIRYAIGVGSAFRYKKSLKELIDIASTPSEEHVFQVENFDALRDIQ 171 Query: 381 DKITDKI 387 ++ +KI Sbjct: 172 KQLKEKI 178 >gi|163858556|ref|YP_001632854.1| hypothetical protein Bpet4238 [Bordetella petrii DSM 12804] gi|163262284|emb|CAP44587.1| hypothetical protein Bpet4238 [Bordetella petrii] Length = 244 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 64/177 (36%), Gaps = 24/177 (13%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 KI + ++ +++++ + + V I + + +Q ++++ Sbjct: 35 KIRNVNDAVRDMLDTFSDTENGETEIHV--AIITFGSQVALHQPLASASDI-----HWQD 87 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS----AYQN 324 L+ T A+ A + ++ + + S + V+ ++DG + A Sbjct: 88 LSAGGMTPLGTALQMAKAMIEDK-----DVVPSRAYRPTVVLVSDGGPNDAWEKPLNAFI 142 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + C+ + A+ A + L + +S + F ++++L + F Sbjct: 143 SDGRSAKCDRLAM--------AIGADADEAVLGKFIEGTSNRLFYAENAKQLRDFFK 191 >gi|326435505|gb|EGD81075.1| hypothetical protein PTSG_11020 [Salpingoeca sp. ATCC 50818] Length = 552 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 22/177 (12%), Positives = 62/177 (35%), Gaps = 18/177 (10%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+D+ + L+ ++ + + ++A+ + + + + L+ Sbjct: 80 SKLDLAKATLEFLIKNLSQTDHMGLVVYHSDVSVAFPLTRMDAE------GKRTATAALS 133 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L TN + + S V+ +TDG + N + Sbjct: 134 TLRAQRCTNLSGGLFKGIEMMQ-------GRERSAASVSSVLLMTDGIANEGVRGPNLIT 186 Query: 328 TLQICEYMRNA-GMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 + + M + +Y+ + ++LL+ ++ +G ++ + ++ + ESF Sbjct: 187 ATR--QLMGDNPSYSLYTFGYGSNH-EEELLKDLSEVGNGMYYYIENNDTIPESFGD 240 >gi|317419026|emb|CBN81064.1| Inter-alpha-trypsin inhibitor heavy chain H3 [Dicentrarchus labrax] Length = 836 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 16/152 (10%), Positives = 45/152 (29%), Gaps = 11/152 (7%) Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 + N+++ + ++N T+ A+ L R Sbjct: 267 DSWQKSLTKATKENVDQAMIYVQQINYRGGTDINQAVLTGVEMLLK---DRREKKLPERS 323 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 +I +TDG + ++ + M ++ + L + + Sbjct: 324 VDMIILLTDGMPNSGESHL--PRIQENVRSAIRGNMSLFCLGFG-NDVDYSFLDVMSKQN 380 Query: 365 G----QFFAVNDSR-ELLESFDKITDKIQEQS 391 + F +D+ +L +D+++ + Sbjct: 381 KGLARRIFEGSDATLQLQGFYDEVSSPLLSDV 412 >gi|329894134|ref|ZP_08270119.1| Inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium IMCC3088] gi|328923306|gb|EGG30626.1| Inter-alpha-trypsin inhibitor domain protein [gamma proteobacterium IMCC3088] Length = 460 Score = 47.2 bits (110), Expect = 0.004, Method: Composition-based stats. Identities = 18/135 (13%), Positives = 44/135 (32%), Gaps = 18/135 (13%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 L + L T A + ++ + +IFITDG S Sbjct: 165 LAHAIRFIRSLEADGGTEIEAAFDLTLALPTDAQK-----------LRQIIFITDGSVSN 213 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 S +N + + ++++V + + P + G F + + ++ + Sbjct: 214 ESELLAKIN-----RELEDR--RLFTVGIGSSPNRYFMEEAARAGRGTFSYIANVSDVED 266 Query: 379 SFDKITDKIQEQSVR 393 ++ K+ ++ Sbjct: 267 EIGRLLGKLSRPALT 281 >gi|311252837|ref|XP_003125292.1| PREDICTED: vitrin-like [Sus scrofa] Length = 595 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 27/269 (10%), Positives = 70/269 (26%), Gaps = 26/269 (9%) Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS--LRSTGIIERSSENLAISICMVLDV 156 +S + + + L + + + S ++ + V Sbjct: 102 GETTKAYQSPSVPGTTAQPVTLMQVPGTTAVEATHTALSKPSPSAGFTTSSLRLQPVGQR 161 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE- 215 SR ++ + ++ N P + S S + ID Sbjct: 162 SRELDLWCTTTYTNSQNSPQANPGFVPKEELSTQSLEPVSQGDPSCKVDLSFLIDGSSSI 221 Query: 216 -------SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 L + Q +G + Y N ++K+ + K Sbjct: 222 GKRRFRIQKQFLADVAQALDIGPAGPL--MGVVQYGDNPATQFNLKTHMNSRDLKTAIEK 279 Query: 269 LNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + +N A+ + +++ + + + + DG + Sbjct: 280 ITQRGGLSNVGRAISFVTKNFFSKSNGNRGGAPN-----VAVVMVDGWPTD--------K 326 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + R +G+ I+ + + E + Sbjct: 327 VEEASRLARESGINIFFITIEGAVENEKQ 355 >gi|304312981|ref|YP_003812579.1| hypothetical protein HDN1F_33640 [gamma proteobacterium HdN1] gi|301798714|emb|CBL46947.1| Hypothetical protein HDN1F_33640 [gamma proteobacterium HdN1] Length = 979 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 17/140 (12%), Positives = 51/140 (36%), Gaps = 17/140 (12%) Query: 259 LNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N+++ T A+ A + + ++ V+ ++DG Sbjct: 148 KQAAAREANRISSSSLYTAIGDALDAAIQ---------GDLKPDPAWERSVVLLSDGMVD 198 Query: 318 GASAYQNTLNTL-----QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 + + ++ + G K+++VA+ + + L++ + + G F + Sbjct: 199 ISKNPADNQREQQRIFQEVVPRLVAGGYKVHAVAL-SEQADIEFLKRLAEATKGHFSIAH 257 Query: 372 DSRELLESFDKITDKIQEQS 391 + +L+ F +D++ + Sbjct: 258 SADQLMHVFVDASDRVNQPL 277 >gi|317403331|gb|EFV83845.1| hemolysin-type calcium-binding region [Achromobacter xylosoxidans C54] Length = 1141 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/336 (10%), Positives = 89/336 (26%), Gaps = 21/336 (6%) Query: 32 ALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQI 91 A D A S A T + I L + N ++ + Sbjct: 478 AADNAT-SVDAGSSHTFTAAEFNFNDGAEGNQLDSVIITRLPTDGTLTLNGQAVSVNTVV 536 Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 + L Y + Q + G + + S I+ ++ + Sbjct: 537 SAADIAAGKLVYTPSASGQDTSFGFQVRDDGGTANGGKDTSGDYNFAIKTNNFISGDNDG 596 Query: 152 MVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 + + + + P +T+ S + ++ Sbjct: 597 SGTGTKPPINGGSGDDVILGDKGGTVTTVEPGKNYNIAIVVDTSGSMSEASGTKGLTRMQ 656 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + I++ NL N+++ V + ++N+ ++ + KL+ Sbjct: 657 LTIDALKNLANTLKGHDGIVN---VALIGFESTASTKYTINGLNASNVGDLIKAIEKLSA 713 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS-----------GAS 320 TN A A + + + + F+TDG+ + Sbjct: 714 SGGTNYEGAFDEAVKWFNKQ----PTSSNGQAFENVTYFLTDGDPTFSNRGSNGDWWSGG 769 Query: 321 AYQNTLNTLQICEYMR--NAGMKIYSVAVSAPPEGQ 354 + N + + + + ++++ + Sbjct: 770 STTNYYDMKDAVDKFKGLSGKSTVHAIGIGTGVNEA 805 >gi|308502223|ref|XP_003113296.1| CRE-DIG-1 protein [Caenorhabditis remanei] gi|308265597|gb|EFP09550.1| CRE-DIG-1 protein [Caenorhabditis remanei] Length = 13921 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 32/235 (13%), Positives = 74/235 (31%), Gaps = 27/235 (11%) Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 ++ S + S + + + + P Sbjct: 13160 PHYSLQIVPNVESARTWPTPRTKATTPAGSGRSCSSIDFESDVIIVLDSSENFTP----- 13214 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 + D + ++ ++V++ + +IG + Y+ + + E+ ++ Sbjct: 13215 -DEFDSMKDAVASIVDTGFDLAPDVS----KIGFVIYSDKVAVPVALGHYEDKIELIEKI 13269 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + A L N G K VI IT+G+N G Sbjct: 13270 VDAE-----KINDGVAIALYGL-NAARQQFQLHGRENATKIVILITNGKNRG-------- 13315 Query: 327 NTLQICEYMRN-AGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSRELLE 378 N E +R+ G+++++VAV + P+ +++ + V S E+ + Sbjct: 13316 NAAAAAEDLRDMYGVQLFAVAVGSNPDELATIKRLVGNSNPDHVIEVAQSTEIDD 13370 >gi|78357412|ref|YP_388861.1| hypothetical protein Dde_2369 [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] gi|78219817|gb|ABB39166.1| conserved hypothetical protein [Desulfovibrio desulfuricans subsp. desulfuricans str. G20] Length = 163 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 18/129 (13%), Positives = 38/129 (29%), Gaps = 7/129 (5%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 A+ + V + + I+ + ++ R +D A L G V+ + D T Sbjct: 24 ALTLPVLLMMVFGLIEFGYNLFART----TVDKAALIGARYAVTGQGFDDGTRHARIVQE 79 Query: 63 I--FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + Q + + I D P + + +Y + Sbjct: 80 ARRLTGVLAGSSPQSVTVTIGSIAAGAGDDALIEGDAGLPCDR-VQVRVEYRYTPVTPVV 138 Query: 121 KGLIPSALT 129 L+ +T Sbjct: 139 GSLLGPEIT 147 >gi|86138567|ref|ZP_01057140.1| von Willebrand factor type A domain protein [Roseobacter sp. MED193] gi|85824627|gb|EAQ44829.1| von Willebrand factor type A domain protein [Roseobacter sp. MED193] Length = 472 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 62/185 (33%), Gaps = 22/185 (11%) Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 S + S KI + L+ ++ ++ + Sbjct: 19 SRSTLVLDASGSMWGQVDGVAKITIAQTVIQQLLETLPATQEQGLMAYGHRRKGDCSD-- 76 Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + P ++ + + + + K++P T A+ A L + +E + Sbjct: 77 IEQLIAPAADTRDAIAAAVAKISPKGKTPISAAVRQAADALRHSEEKAT----------- 125 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVS-APPEGQDLLRKCTDSS 364 VI I+DGE + L+ + + +G+ ++++ A + L+ +++ Sbjct: 126 VIPISDGEETCG------LDPCAVGADLEASGVDFTLHAIGFGIADDTARAQLQCLAENT 179 Query: 365 GQFFA 369 G + Sbjct: 180 GGVYR 184 >gi|327266508|ref|XP_003218047.1| PREDICTED: complement factor B-like [Anolis carolinensis] Length = 767 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 33/265 (12%), Positives = 85/265 (32%), Gaps = 20/265 (7%) Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNN-NMTSNKYLLPPPPKKSFWSKNTTKSKYA 200 S D + ++ + + S+K + +K K+ + + Y Sbjct: 202 SGAEPECRSPFSYDTPEEVSSKFISSLTETAESSDSDKNVSTTGKRKIKIEKDGSLNIYI 261 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 A + K D + + I+K + + T A + + S + + Sbjct: 262 VLDASRSIKKDQFKHAQNMSIKLIEKISSYDISPRYAVITFATEVKELVRTTDDQSTDAS 321 Query: 261 EVKSRLNKLNPYE-----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK-----FVIF 310 V +L + E TN + Y + ++ + + ++ Sbjct: 322 WVIEKLEGMKYTEHKQKPGTNIQKGLSSVYSMMITQQAAERRRGLNPPPVSEKTRHVIVL 381 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMK-------IYSVAVSAPPEGQDLLRKCTDS 363 ++DG+ + + ++ + +Y AV +++ + + Sbjct: 382 LSDGDYNMGGDPIRVIRQIREFLNIGRNRTHPREDFLDVYVFAVGGTVVMENVNKIASQK 441 Query: 364 SG--QFFAVNDSRELLESFDKITDK 386 SG F + D +L +F+++ D+ Sbjct: 442 SGERHAFKIKDYSDLQLAFEEMIDE 466 >gi|254497958|ref|ZP_05110722.1| hypothetical protein LDG_2328 [Legionella drancourtii LLAP12] gi|254352852|gb|EET11623.1| hypothetical protein LDG_2328 [Legionella drancourtii LLAP12] Length = 607 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 28/200 (14%), Positives = 67/200 (33%), Gaps = 40/200 (20%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV---------------- 262 ++ S++ + ++ + SN+L ++ Sbjct: 39 DVSGSMKHTDPQNLRVTAVKLFNYLVNNRAVVSVSTFSNDLEQIIPPQIVTAKFQESFLK 98 Query: 263 -KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 K ++ TN A+ + ++ KK +I +TDG S Sbjct: 99 KKKQIKSDGA--WTNIDAAL-------------NGVNKSWSKNKKVIILLTDGMLDLGSD 143 Query: 322 YQNTLNTLQI----CEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N +T Q+ ++ +++Y++ + + LL + ++ F V +++L Sbjct: 144 ALNKKSTQQLNETTIPILQREHVQVYTIGL-SNEADSTLLSNISLKTNALFQPVISAKDL 202 Query: 377 LESFDKITDKIQEQSVRIAP 396 + I + V+ AP Sbjct: 203 DNALYAIFSSVIS--VQEAP 220 >gi|12805443|gb|AAH02194.1| Col6a1 protein [Mus musculus] Length = 406 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 47/367 (12%), Positives = 103/367 (28%), Gaps = 37/367 (10%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 NQMQ +D + R +D + + Q + R Sbjct: 46 SHNQMQEHVD-------MRSPNVRNAQDFKEAVKKLQWMAGGTFTGEALQYTRDRLLPPT 98 Query: 85 IAQKAQINITKDKNNPLQYI--AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERS 142 + + IT +++ + +I ++ +K + + L S Sbjct: 99 QNNRIALVITDGRSDTQRDTTPLSVLCGADIQVVSVGIKDVFGFVAGSDQLNVISCQGLS 158 Query: 143 SENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA 202 IS ++ + + N + +K +F S + Sbjct: 159 QGRPGIS---LVKENYAELLDDGFLKNITAQICIDKKCPDYTCPITFSSPADITILLDSS 215 Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNL 259 + + + A L A + + VR+ + Y+ N Sbjct: 216 ASVGSHNFETTKVFAKRLAERFLSAGRADPSQDVRVAVVQYSGQGQQQPGRAALQFLQNY 275 Query: 260 NEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + S ++ ++ + T+ A+ + R + KK V+ +DG + G Sbjct: 276 TVLASSVDSMDFINDATDVNDALSYVTRFYREASSGA--------TKKRVLLFSDGNSQG 327 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------GQDLLRKCTDSSGQFFA 369 + + + AG++I+ V V G+ F Sbjct: 328 ----ATAEAIEKAVQEAQRAGIEIFVVVVGPQVNEPHIRVLVTGKTAEYDVAFGERHLFR 383 Query: 370 VNDSREL 376 V + + L Sbjct: 384 VPNYQAL 390 >gi|238027555|ref|YP_002911786.1| membrane protein [Burkholderia glumae BGR1] gi|237876749|gb|ACR29082.1| Membrane protein [Burkholderia glumae BGR1] Length = 620 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 16/122 (13%), Positives = 38/122 (31%), Gaps = 1/122 (0%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 MT I ++V + +D+ ++ + R +Q D A L+G + + + + + Sbjct: 31 MTIIFMTVMIAVLGM-LDIGNVFFQRRDLQRIADMAALAGVQRLDATCSQAPVSASRSAA 89 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S + G + + A + + + + F Sbjct: 90 SNGLNTGAGDTISVGCGRWDPTANPAPSYYVPVANPGASSPAVQLNAVQVAVSRQVPYFF 149 Query: 121 KG 122 G Sbjct: 150 LG 151 >gi|123390689|ref|XP_001299929.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121880878|gb|EAX86999.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 661 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 35/358 (9%), Positives = 102/358 (28%), Gaps = 47/358 (13%) Query: 44 IVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK--DKNNPL 101 I + ++ + + + K +L G I + + + Sbjct: 65 IGDKQIRPQLRMSEEASKEYQESKEKGYLSLLGRNSSGNGIIFNFGNSPDETKIEVHYTI 124 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI---SICMVLDVSR 158 Y+AE Q + K + S + ++ +N++ + ++ Sbjct: 125 SYLAEVNNQGFFFRFPIASKYQHGYETSLPRSISFYLKIKTDKNISKIEANNSATINQFD 184 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + + + L+ K + S + + Sbjct: 185 NHNAFINLDKFEPAI--FVQTLISDQDKSTAVSSDDYIVVSTYKEFSSKSNCYECKADYF 242 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNI--GIVGNQCTPLSNNLNEVK------------- 263 +++ ++ +V+ + + + + Sbjct: 243 FVIDRSASMEGDRIEKAVKCMRLMLQSLPMMCRFSIVCFGSEFQSLLPIVEYNNENVLLA 302 Query: 264 -SRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + +N T+ Y + + + + K + +TDGE Sbjct: 303 MNLIKNINANMGGTDIYHPLEYIF--------------SQNGMTKKIFLLTDGE------ 342 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 N+ + +++ + + G IY+V + + L+R + ++G++ V D+ E Sbjct: 343 DSNSEDIIRLVQENKQFG-NIYTVGIGSGA-DSGLIRNLAEVTNGKWTYVLDNENFNE 398 >gi|291229809|ref|XP_002734863.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii] Length = 2065 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 26/193 (13%), Positives = 68/193 (35%), Gaps = 20/193 (10%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 + + I + + E + V + + + + + N Sbjct: 61 VDSSGSIGASNFHFEINFIREISTIFSMSPDEARVSVVTYSDSSKIVRQIDYIGSSVGKN 120 Query: 259 LNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 L+ + T+T A+ A R L + + + + V+ +TDG+++ Sbjct: 121 KCTFLGELSLIRYEAGWTDTKGALEEADRVLQHARSGA---------NRLVVLLTDGQST 171 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + I +RN G++I ++ V D L ++ F ++ ++ Sbjct: 172 EG-------DPVGIATRIRNKGIRIVAIGVG--NVNMDELTSIA-TAQYVFILDRLSYVV 221 Query: 378 ESFDKITDKIQEQ 390 + +I + ++E+ Sbjct: 222 DLATRIKNDVKEK 234 >gi|255557524|ref|XP_002519792.1| protein binding protein, putative [Ricinus communis] gi|223541031|gb|EEF42588.1| protein binding protein, putative [Ricinus communis] Length = 436 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 20/129 (15%), Positives = 44/129 (34%), Gaps = 33/129 (25%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N+ N+++ +N L+ TN + R L + S + ++ ++DGE Sbjct: 80 NSQNDLEILINGLHADAATNITAGLQTGLRVLNDRSLSGGRVVD-------IMLMSDGEQ 132 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS--SGQFFAVNDSR 374 + A + + +L+ D+ G F V ++ Sbjct: 133 NAGDD---------------AAQIPV---------GNMPVLKAIADNSMGGTFSDVQNTD 168 Query: 375 ELLESFDKI 383 L ++F + Sbjct: 169 NLSKAFSDL 177 >gi|307155059|ref|YP_003890443.1| Vault protein inter-alpha-trypsin domain-containing protein [Cyanothece sp. PCC 7822] gi|306985287|gb|ADN17168.1| Vault protein inter-alpha-trypsin domain protein [Cyanothece sp. PCC 7822] Length = 796 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 21/167 (12%), Positives = 46/167 (27%), Gaps = 19/167 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPL---SNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 N I ++ PL S N + + +N+L T + Sbjct: 328 NGLNPDDTFSIIDFSDTTQQLSPVPLANTSQNRSLALNYINRLTAGGGTELMRGIRAVLN 387 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + ++ + ++ +TDG N L + G ++YS Sbjct: 388 --FPITDAGRL--------RSIVLLTDG------YIGNENQILAEVQQHLKPGNRLYSFG 431 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + L R G + + + ++ +I + Sbjct: 432 AGSSVNRFLLNRIAEIGRGLARIIRHDEPVNQVVEQFFRQINNPVLT 478 >gi|115687249|ref|XP_792282.2| PREDICTED: similar to polydom protein [Strongylocentrotus purpuratus] Length = 2422 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 19/123 (15%), Positives = 48/123 (39%), Gaps = 22/123 (17%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T A A + L + +S + V +TDG ++G N + + Sbjct: 284 GGGTYTKGAFELAKKVLQGARANST---------QAVFLLTDGLSNGP-------NPVPV 327 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQ 390 +++ G++++S + +LL+ ++ + ++ E F+ + + + Sbjct: 328 AVSLKDDGVEVFSFGI-RDGYIPELLQMASEKKDEHCYILDSFAE----FEALARRALHE 382 Query: 391 SVR 393 +R Sbjct: 383 DLR 385 >gi|21740064|emb|CAD39048.1| hypothetical protein [Homo sapiens] Length = 803 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 56/170 (32%), Gaps = 30/170 (17%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 K+D++ + + K + V+ A + +NL + Sbjct: 25 SHSMKSKLDLVKDKIIQFIQEQLKYKSKFN--FVKFDGQAVAWREQLAEVNE--DNLEQA 80 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 +S + + +TNT A+ A+ + + + + +TDG Sbjct: 81 QSWIRDIKIGSSTNTLSALKTAF---------------ADKETQAIYLLTDGRPDQP--- 122 Query: 323 QNTLNTLQICEYMRN-AGMKIYSVAVSAPPE-GQDLLRKCTD-SSGQFFA 369 + + ++ + IY+++ + E L++ + G+F Sbjct: 123 -----PETVIDQVKRFQEIPIYTISFNYNDEIANRFLKEVAALTGGEFHF 167 >gi|291295700|ref|YP_003507098.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470659|gb|ADD28078.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 354 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 19/182 (10%), Positives = 51/182 (28%), Gaps = 43/182 (23%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT----------------- 298 + + ++ + +L P +NT+ A+ R L Sbjct: 144 TTDRQKLLEAIERLKPAQNTSIENAIITGVRMLPGRNTLRPPAELQPPGLSQPDPLQGIP 203 Query: 299 ---------IGSTRLKKFVIFITDGENSGASAYQNTLNTLQ--ICEYMRNAGMKIYSVAV 347 + ++ ++DG ++ +S T + +NA +++Y+ + Sbjct: 204 DLPLPQQAQPPANLPPGSLVILSDGASNVSSNPTLPTRTTLEVAARFAKNANVRLYTFPM 263 Query: 348 SAPPEGQDL--------------LRKCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 P L + + G+ L ++ I+ + Sbjct: 264 GQPGGAVTQIEGRHYYIPFEPRNLEQLAQATGGKNTYPPTEEALRAIVKELGVVIRWEGT 323 Query: 393 RI 394 + Sbjct: 324 KT 325 >gi|104779911|ref|YP_606409.1| hypothetical protein PSEEN0657 [Pseudomonas entomophila L48] gi|95108898|emb|CAK13594.1| conserved hypothetical protein [Pseudomonas entomophila L48] Length = 635 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 53/171 (30%), Gaps = 2/171 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVL-SGCASIVSDRTIKDPTTKKDQ 59 M A+ + + LF+ ++D + + ++Q D A L + S + + Sbjct: 1 MAALTMGLALLFMLLSVDSGRLYLEQRKLQRIADMAALEAAEHSASCNGSGPQAFALARN 60 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 +T + L D ++ + T+++ ++ + Sbjct: 61 AATRNGLAVTDPLIASCGYLRTGNDNLRRFVADNTRNEAIKVEVSNVVVTSVAAGVFAMV 120 Query: 120 LKGLIP-SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 +P + + S + + + S ++ S+ L K Sbjct: 121 QGNTVPRTTTLHASAVAASPMPPLAMLSIRSSLATVNASQGSLLNALTKAL 171 >gi|312068041|ref|XP_003137027.1| hypothetical protein LOAG_01440 [Loa loa] gi|307767808|gb|EFO27042.1| hypothetical protein LOAG_01440 [Loa loa] Length = 808 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 25/155 (16%), Positives = 55/155 (35%), Gaps = 15/155 (9%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 + +I + Y+ + N +++ L K+ T T A+ A ++ Sbjct: 77 NTQIAVMQYSSYTRVEFNFSANPNKESLRASLQKIRHISGTTKTGKALDKAL-HVFRHDS 135 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S + + + + +TDG + + + +R AG++I ++ + A Sbjct: 136 SFGARLNQDDVAQVAVVVTDGHSHD--------DPIPAAVRLRQAGVQILTLGIGAHINM 187 Query: 354 QDLLRKCTDSSGQFFA-VNDSRELLES---FDKIT 384 +L+ T F + L + F KI Sbjct: 188 GELME-ITGDENLAFQNLTSQASLDQFVYQFKKIA 221 >gi|282879638|ref|ZP_06288369.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] gi|281306586|gb|EFA98615.1| von Willebrand factor type A domain protein [Prevotella timonensis CRIS 5C-B1] Length = 346 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 53/162 (32%), Gaps = 40/162 (24%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 P++++ K L +P T + A R + ++ + +I IT Sbjct: 149 LPITSDYVSAKMFLQNTDPSLITTQGTDIARAIRLSMSSFTQ------QDKVGRAIILIT 202 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---------------PPEGQDLL 357 DGE+ L+ G+ ++ + V GQ ++ Sbjct: 203 DGEDHEG-------GALEAAADANKKGINVFILGVGNTQGAPIPVAEGGYMKDENGQTVM 255 Query: 358 --------RKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 + + G + V+++ ++ K+ D++ + Sbjct: 256 TALNEKMCQDIARAGKGTYIHVDNT---NDAQKKLNDELAKL 294 >gi|162455534|ref|YP_001617901.1| hypothetical protein sce7252 [Sorangium cellulosum 'So ce 56'] gi|161166116|emb|CAN97421.1| hypothetical protein predicted by Glimmer/Critica [Sorangium cellulosum 'So ce 56'] Length = 402 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 16/81 (19%), Positives = 31/81 (38%), Gaps = 7/81 (8%) Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-------QFF 368 SG + +T+ E + +G+++Y V + D+L + + G +F Sbjct: 252 PSGPEGCLDRDDTVAAIERLAASGIEVYVVGIPGSEFYGDVLDQMALAGGAAQFVSPFYF 311 Query: 369 AVNDSRELLESFDKITDKIQE 389 V+D L KI + Sbjct: 312 KVDDLDTLGNVLSKIASIVVS 332 >gi|543095|pir||B53274 complement factor B subunit Bb - pig (fragment) Length = 170 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 23/142 (16%), Positives = 50/142 (35%), Gaps = 18/142 (12%) Query: 254 PLSNNLNEVKSRLNKLNPYE-----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P S + + V +L+K++ + TNT A+ Y + + + R + + Sbjct: 28 PKSADADWVTEQLDKISYDDHKLKAGTNTKKALLEVYNMMSWGVNNFPDNWN--RTRHVI 85 Query: 309 IFITDGENSGASAYQNTLNTL-QICEYMRNAGMK------IYSVAVSAPPEGQDLLRKCT 361 + +TDG ++ ++ + + ++ IY V P Q+ + Sbjct: 86 VLLTDGLHNMGGDSVTVIDEIRDLLNIGKDRKNPREDYLDIYVFGVG-PLVNQENINALA 144 Query: 362 DSSG---QFFAVNDSRELLESF 380 F + D L + F Sbjct: 145 SKKDKEQHVFKLKDVDNLEDVF 166 >gi|288554674|ref|YP_003426609.1| hypothetical protein BpOF4_08295 [Bacillus pseudofirmus OF4] gi|288545834|gb|ADC49717.1| hypothetical protein BpOF4_08295 [Bacillus pseudofirmus OF4] Length = 246 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 16/103 (15%), Positives = 39/103 (37%), Gaps = 10/103 (9%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----GQD 355 S K ++ +TDG ++ + + I + G+ + + V G D Sbjct: 1 MSRGTLKQILLLTDGHSNQGE------DPVAIAALAKEQGITVNVIGVVDENHLNKQGID 54 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + + G + +++L ++ +T K Q++ N+ Sbjct: 55 EIEAIALAGGGVSQIVYAKQLAKTVQMVTRKAMTQTLHGVVNK 97 >gi|51245384|ref|YP_065268.1| hypothetical protein DP1532 [Desulfotalea psychrophila LSv54] gi|50876421|emb|CAG36261.1| unknown protein [Desulfotalea psychrophila LSv54] Length = 420 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 30/272 (11%), Positives = 71/272 (26%), Gaps = 26/272 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +TAI+++V +F A+DL ++ +RN++Q+ A +++ +D Sbjct: 21 LTAILMAVLIMFAALAVDLGYLYGVRNELQNGA---DAGALAGAHELLDVENGILTRDDA 77 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA--------------- 105 ++ + + ++ + + T N Sbjct: 78 IAEAERVVSLNSTGNDAVQFKPIETGHWSFTTSTFSPNPTDTQGEWQEKSFAELDADLNF 137 Query: 106 --ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 + F + A ++ + + L + E + Sbjct: 138 INAVRVVSFRDDTPAFFARIFGFASFLVNTEAIAY--IGFAGDLYPGEIDLPIGICKETI 195 Query: 164 YLQKH-NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 N N N + W T S+ + A + D+ +G+ + Sbjct: 196 TDGDSFNCNMGRMLNSGGNAATEMTAMW---TNFSQEPCSTASNSDMQDLTSGCSGSNIT 252 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 I V+ Sbjct: 253 DIHFGEGMGTQNGVQDNIFGNITDCWEAWIID 284 >gi|166366827|ref|YP_001659100.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] gi|166089200|dbj|BAG03908.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] Length = 724 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 15/161 (9%), Positives = 42/161 (26%), Gaps = 20/161 (12%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + K+ +A + ++ R + + G T L+++L Sbjct: 60 TSGSMNDDNKLQEAKNAAKAFIERQDPSVN-------RFAVVGF--GSQVQIGTGLTSDL 110 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK---------ESSHNTIGSTRLKKFVIF 310 + ++ L+ T + A +L + + S +I Sbjct: 111 ATLNQAIDNLSDGGGTRMDLGLATAIEQLESSSSDRHILLFTDGQPAPAPSPEQIDIMIV 170 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSA 349 + + + ++ +I +A Sbjct: 171 LDVTSSMNEEIAGVQQGIQNFAQELKKRKLDAQIGLIAFGD 211 >gi|158319480|ref|YP_001511987.1| hypothetical protein Clos_0429 [Alkaliphilus oremlandii OhILAs] gi|158139679|gb|ABW17991.1| conserved hypothetical protein [Alkaliphilus oremlandii OhILAs] Length = 244 Score = 47.2 bits (110), Expect = 0.005, Method: Composition-based stats. Identities = 12/97 (12%), Positives = 37/97 (38%), Gaps = 10/97 (10%) Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--SAPPEGQDLLRK--CT 361 K +I +TDG+++ + + E G+ + ++ + L Sbjct: 8 KQMILVTDGQSNVGGS------PIIAAEKAYRNGIIVNTIGIVDGKESNEDALNEIVEIA 61 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + G + + EL ++ +T + ++++ ++ Sbjct: 62 KAGGGTYEYSYINELFQTMQSLTYQTVNKTLQEVVSK 98 >gi|312106135|ref|XP_003150649.1| hypothetical protein LOAG_15109 [Loa loa] gi|307754186|gb|EFO13420.1| hypothetical protein LOAG_15109 [Loa loa] Length = 313 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 18/96 (18%), Positives = 31/96 (32%), Gaps = 9/96 (9%) Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 E + + I ITDG + N + R + + ++SV V Sbjct: 7 SERRGARKQGDDVARVSIVITDGRSQD--------NVTEAAISARKSHINMFSVGV-TDH 57 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 L S Q+F V+ ++L + K Sbjct: 58 VLGSELEAIAGSPLQWFHVDRFKDLDTRLRSLIQKA 93 >gi|307947552|ref|ZP_07662881.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] gi|307769249|gb|EFO28481.1| conserved hypothetical protein [Roseibium sp. TrichSKD4] Length = 212 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 25/178 (14%), Positives = 46/178 (25%), Gaps = 13/178 (7%) Query: 15 YAIDLAHIMYIRNQMQSALDAAVL-SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLK 73 ID++ R+Q Q D L + + + T +K + K Sbjct: 37 VGIDMSFAYNKRDQSQLVADEVSLFAVTTFRKYVADGMSKNQARKRAETDARKFLTARTK 96 Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSL 133 E INI + ++ ++E ++ + + + Sbjct: 97 SLDGTTEKFSIK-----INIVDREAKVVKANVNISGKHES-----YMTHAMGFDNIDYTA 146 Query: 134 RSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS 191 S S ++DVS SM + P S S Sbjct: 147 DSES--TISFGQGKYEFIFLVDVSPSMGIGASNRDRQIMQRAIGCQFACHEPWYSSVS 202 >gi|167045536|gb|ABZ10188.1| putative von Willebrand factor type A domain protein [uncultured marine microorganism HF4000_APKG10H12] Length = 356 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 19/115 (16%), Positives = 39/115 (33%), Gaps = 15/115 (13%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 Y + + + + + P NT Y A+ ES Sbjct: 132 DEIFLYRFDYTPELLQDWTVDRIRLSRAIRDIRPRGNTALYDAVA----------ESVPR 181 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 G KK ++ I+DG ++ + + ++ E +R + IY++ + P Sbjct: 182 VAGGQHFKKALLIISDGNDNN-----SETDVRELRELIRESEALIYAIGIDGPST 231 >gi|332798630|ref|YP_004460129.1| pilin isopeptide linkage domain-containing protein [Tepidanaerobacter sp. Re1] gi|332696365|gb|AEE90822.1| pilin isopeptide linkage domain protein [Tepidanaerobacter sp. Re1] Length = 925 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 40/266 (15%), Positives = 77/266 (28%), Gaps = 35/266 (13%) Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 L D ND S Sbjct: 186 PEYAKTIDYLGDGIPNPDTSANGLNDYRIYLDVTTEASETETDRDIIFVLDVSNSMDTAL 245 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 + +VL + + VNS+ +N S RI I + T + ++ + Sbjct: 246 GNTTRFNVLKNTVKSAVNSL------VQNPSNRISIITFGTRA--QIVTTRETDRTKLIN 297 Query: 265 RLNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +N L+ TN Y +M HA + + SH +K + F++DGE + + Sbjct: 298 CVNSLSLPGGTAGGTNYYESMLHAAQIVNGSINGSH--------EKVIFFVSDGEPTASL 349 Query: 321 AYQNTLNTLQICE--------YMRN-AGM-KIYSVAVSAPPEGQDLLRKC-----TDSSG 365 N + E + + + YSV + L+ ++ Sbjct: 350 PAANAMGYAAYAEVATIYAYHAAQEFQNVDRFYSVFIGDDSGSASTLQTITQMVEVNNEK 409 Query: 366 QFFAVNDSRELLESFDKITDKIQEQS 391 + + +L +F++ K+ Sbjct: 410 YMVQASSAEQLTSAFNRFVSKVGNSL 435 >gi|87308731|ref|ZP_01090870.1| hypothetical protein DSM3645_10847 [Blastopirellula marina DSM 3645] gi|87288442|gb|EAQ80337.1| hypothetical protein DSM3645_10847 [Blastopirellula marina DSM 3645] Length = 625 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 13/106 (12%), Positives = 35/106 (33%), Gaps = 9/106 (8%) Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 N + + + + F DG +++ ++ + I++++V Sbjct: 509 NRYPNRWSLPYNWNWADYTDFDGDGNADYSTSDRSKQYAFYEAVQAHKLDVTIHTMSVGL 568 Query: 350 PPEGQDLLRKCT--------DSSGQFFAVNDSRELLESFDKITDKI 387 + L+ G ++LL++F +I K+ Sbjct: 569 DA-DRSLMTAIAFACGGVHIAVPGGATVAEMEQQLLDAFGQIAAKV 613 Score = 39.9 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 29/337 (8%), Positives = 72/337 (21%), Gaps = 21/337 (6%) Query: 10 FLFITYAIDLAHIMYIRNQMQSALD----------AAVLSGCASIVSDRTIKDPTTKKDQ 59 F FI++ ID I + +MQ+A+D + ++ D + K Sbjct: 31 FGFISFGIDTGLISLEQTRMQNAVDAAALAASQEITSAVAQAGDSGGDPNSISISFAKQM 90 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES-----KAQYEIP 114 + + + E+ + Sbjct: 91 AVDVAAANGVYLNADRDIVFGKRTYDPGSGEWAYDWTTGPYNVVKVEAHRDQPNLEAPDG 150 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 L + L +T +E + + + S + ND Sbjct: 151 RVPLAFGWAVGVPSIPLVTSATSFVEARDMVVVLDFSGSM-NDDSQFKAINRLGNDAVTQ 209 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 P + T A D ++ K+ ++ N+ Sbjct: 210 NLRDIFTSMSPNVGSLPLDPTYLTVVGAAP--TSGCDSQVQVTFKGTEIYVKSSKDLSNV 267 Query: 235 SVRIG--TIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHHAYRELYNE 291 ++ + + N + ++ + R Sbjct: 268 VLQFDNGNKYKYDNLNQGKTGTFQGNGGNYNRVITKAWVKSGCNSSGEGSGYGERFEDTN 327 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + + + D + + + Sbjct: 328 SAVKNAFGLNHVPYPYASGSWDDYINYCRNDNDVRSA 364 >gi|226365428|ref|YP_002783211.1| hypothetical protein ROP_60190 [Rhodococcus opacus B4] gi|226243918|dbj|BAH54266.1| hypothetical protein [Rhodococcus opacus B4] Length = 548 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 16/110 (14%), Positives = 33/110 (30%), Gaps = 9/110 (8%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T Y A+R + + + VI +TDG N + + Sbjct: 437 GGGTGLYDTTLAAFRTVQDSYDPRAV--------NSVIILTDGANEDPDSITKEQLLGIL 488 Query: 332 CEYMR-NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 M + I ++ ++ + L + G + D ++ F Sbjct: 489 EREMDPARPVIIVTIGITDDADAATLAEISRVTGGSSYVAKDPADIANVF 538 >gi|212637907|ref|YP_002314427.1| hypothetical protein Aflv_0058 [Anoxybacillus flavithermus WK1] gi|212559387|gb|ACJ32442.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Anoxybacillus flavithermus WK1] Length = 247 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 40/103 (38%), Gaps = 10/103 (9%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV-SAPPEGQDLLR 358 + ++ ITDG ++ + + + + G+ + + V + LR Sbjct: 1 MRKGTLRQILLITDGCSNHGE------DPIAMAALAKEQGITVNVIGVLDQDIIDEQGLR 54 Query: 359 K---CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + S G V +++L ++ +T K Q+++ NR Sbjct: 55 EIEGIAQSGGGISQVVYAKQLSQTVQMVTRKAMTQTLQGVINR 97 >gi|223939936|ref|ZP_03631804.1| von Willebrand factor type A [bacterium Ellin514] gi|223891427|gb|EEF57920.1| von Willebrand factor type A [bacterium Ellin514] Length = 346 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 32/215 (14%), Positives = 69/215 (32%), Gaps = 50/215 (23%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE- 273 + G +++ N V +PL+ N + + L++L+ Sbjct: 111 DETGTRFGIASAVLEDFVNKRPNDRIGLIVFSGVPYLASPLTLNHDWLVENLHRLHIGII 170 Query: 274 ---NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T A A + L K+ + +I +TDG+N+ L Sbjct: 171 RELGTAIGDATAAATKRLQMSKD---------SKSRIIILLTDGDNNQGEIEPVPAAQLA 221 Query: 331 ICEYMRNAGMKIYSVAVS------APPEGQD----------------LLRKCTDS----- 363 + IY++ + P D +L+ S Sbjct: 222 AAIGAK-----IYTIGLGIEEPSHLPAFDVDTGKFKHGPGGELIPTIMLQPANYSVLGQM 276 Query: 364 ----SGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 G+F+ + R+L +++I D++++ V++ Sbjct: 277 SRLAHGKFYRATNRRDLENIYNEI-DRLEKTEVKL 310 >gi|301609300|ref|XP_002934201.1| PREDICTED: epithelial chloride channel protein-like [Xenopus (Silurana) tropicalis] Length = 919 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 26/219 (11%), Positives = 68/219 (31%), Gaps = 40/219 (18%) Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 S K T + P P + +++ + + + + + Sbjct: 270 NSTDIKATPPQADSNLPVPTFTLLQSSDRVVTLVLDVSGSMASDGRIGRLYQAAEVFVMQ 329 Query: 247 IVGNQC-------TPLSNNLNEVKSRLNKLN-----------PYENTNTYPAMHHAYREL 288 IV + + L+++ ++ TN + + Sbjct: 330 IVEEGSHVGIVSFSTSTTVLSKLVQVIDDTQRNHLKFLLPKTAVGGTNICAGIREGIKV- 388 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 +N + ++ +TDGE++ ++ + N+G+ ++ +A+ Sbjct: 389 -------NNQHDGSSYGTEIVLLTDGEDNYNTSLCFPD--------ISNSGIIVHFIALG 433 Query: 349 APPEGQDLLRKCTD-SSGQFFAVN---DSRELLESFDKI 383 P L D + G F D++ L+++F + Sbjct: 434 PNPNPN--LETIVDMTGGLRFLATDKVDAQGLIDAFSSL 470 >gi|291398583|ref|XP_002715574.1| PREDICTED: chloride channel accessory 1-like [Oryctolagus cuniculus] Length = 911 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 20/120 (16%), Positives = 50/120 (41%), Gaps = 24/120 (20%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T+ + A+ + + + ++ +TDGE++ T++ Sbjct: 379 PSGGTSICSGLRVAFSVIKKKYPTD---------GSEIVLLTDGEDN----------TIK 419 Query: 331 IC-EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND---SRELLESFDKITDK 386 +C + +R +G I++VA+ P +L + G +D + L+++F ++ + Sbjct: 420 VCFDEVRQSGAIIHTVALG-PSAALELEELSKMTGGLQTYASDQVQNNGLIDAFGALSSE 478 >gi|198430849|ref|XP_002120173.1| PREDICTED: similar to cGMP-dependent protein kinase 1, alpha isozyme (CGK 1 alpha) (cGKI-alpha) [Ciona intestinalis] Length = 1896 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 26/176 (14%), Positives = 59/176 (33%), Gaps = 20/176 (11%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + ++ +V + ++ +V +G +A++ + + +++R+ L Sbjct: 853 LSSYKQALRWVVELLTSFREDVDKGNVHVGVVAFHSWAGTRIALG-AFEFSNLQARIIAL 911 Query: 270 NPY---ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + TN PA+ RE + +K +I +TDG +S +A Sbjct: 912 SNGRNYGGTNIAPAIDETLREFNRNGRTGI--------QKQMILMTDGYSSYPNAISP-- 961 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + R G+ + V L + + F + L E + Sbjct: 962 ----AAQRARAQGVVTVA--VGVGGSSYAQLLNIAGNQTRVFYATNFNRLGEVVES 1011 >gi|170692557|ref|ZP_02883719.1| putative transmembrane protein [Burkholderia graminis C4D1M] gi|170142213|gb|EDT10379.1| putative transmembrane protein [Burkholderia graminis C4D1M] Length = 373 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 31/284 (10%), Positives = 74/284 (26%), Gaps = 18/284 (6%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAA----------------VLSGCASIV 45 A+ + AID+ +++ R+Q+Q+A DAA + + Sbjct: 18 VALCMLFLLGIAALAIDIGNLLVARSQLQNAADAAAMAGAGCLMRRTECGNTTATQPDWI 77 Query: 46 SDRTIKDPTTKKDQTSTIFKKQIKK-HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYI 104 + + T+ + +K + G + + +Q Sbjct: 78 TASAKASSFSTSTVTNKVQADYVKASTVATGYWNATGTPYGLESLPFTPGASDLPAVQVT 137 Query: 105 AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY 164 +F ++ + S +T ++ L +S+ + D Y Sbjct: 138 IHKDGSNANGAVPVFFGSVLGVQILKASAVATAVLSTPGNVGP-GGLFPLAMSKCLYDNY 196 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + ++ N LP + + A + + S Sbjct: 197 WNAATSSPKLSPNNNTLPGTSVAQQQGQPYFFQIGSSYHYGACESGQWTTFNVNDNSASY 256 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 + + N + + I L + + N Sbjct: 257 ARGLLTSGNSTTFSIGASPGTWIQTGTENTLFKGAGDCSAAGNG 300 >gi|160899637|ref|YP_001565219.1| von Willebrand factor type A [Delftia acidovorans SPH-1] gi|160365221|gb|ABX36834.1| von Willebrand factor type A [Delftia acidovorans SPH-1] Length = 244 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 64/177 (36%), Gaps = 24/177 (13%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 KI + ++ +++++ + + V I + + +Q ++++ Sbjct: 35 KIRNVNDAVRDMLDTFSDTENGETEIHV--AIITFGSQVALHQPLASASDI-----HWQD 87 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS----AYQN 324 L+ T A+ A + ++ + I S + V+ ++DG + A Sbjct: 88 LSAGGMTPLGTALQMAKAMIEDK-----DVIPSRAYRPTVVLVSDGGPNDAWEKPLNAFI 142 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + C+ + A+ A + L + +S + F ++++L + F Sbjct: 143 SDGRSAKCDRLAM--------AIGADADEAVLGKFIEGTSNRLFYAENAKQLRDFFK 191 >gi|221486991|gb|EEE25237.1| von willebrand factor type A domain-containing protein, putative [Toxoplasma gondii GT1] Length = 1109 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 68/171 (39%), Gaps = 15/171 (8%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYN 290 V + + + P+S ++ +R+++L PY TNT A+ AY+ Sbjct: 290 DQSHVSVVRFSTTARADWSLVQPVSWTEKQLTNRISRLPQPYGGTNTPAALEEAYKIFVT 349 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAY------QNTLNTLQICEYMRNAGMKIYS 344 ++ + S + + ++ TDG + + + + L+ ++N +K+ Sbjct: 350 S-MNNRDEHDSKHVHRVLLLATDGCVNQWDRFKFRTPEAHLHDVLERMSSLKNLHIKVLG 408 Query: 345 VAVSAPPEGQDLLRKC------TDSSGQFFAVNDS-RELLESFDKITDKIQ 388 + S L+ C + + ++ N EL E ++I ++++ Sbjct: 409 IGKSICHSEIRLIAGCDPKGTESCRNAKYTDFNSVLDELPEYLEEICEEVE 459 >gi|254296485|ref|ZP_04963941.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|157806387|gb|EDO83557.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] Length = 418 Score = 46.8 bits (109), Expect = 0.005, Method: Composition-based stats. Identities = 32/322 (9%), Positives = 88/322 (27%), Gaps = 29/322 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 26 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 85 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + +Q ++ S + + N ++ + + Sbjct: 86 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 145 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM-------EDLYLQKHND 170 L + + N S+ +T + + +I + + + + + Sbjct: 146 QTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 205 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR------------KIDVLIESAG 218 + + + N+ S + Sbjct: 206 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTNAYN 265 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR-------LNKLNP 271 ++ + AY+ +Q S+ +++ + + +N Sbjct: 266 TRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASYQGDLITGINT 325 Query: 272 YENTNTYPAMHHAYRELYNEKE 293 + N A R L E Sbjct: 326 GGSYNPSYYAAGADRRLALAPE 347 >gi|295841331|dbj|BAJ07080.1| von Willebrand factor, type A [uncultured bacterium] Length = 334 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 52/151 (34%), Gaps = 16/151 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTP-LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 R+G I + + L NN + + ++ +T A++ A++ L Sbjct: 191 DRERVGLIEFETSVKPVVQLDELGNNRAALDLAVQRMEAAGDTALLDAVYEAHQRL---- 246 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI--YSVAVSAP 350 +G V+ +TDG+ + + L + + + + +++A Sbjct: 247 ----RKLGDEERINAVVVMTDGQENNSWISLRKLVP----QLAEDWPVPVVVFAIAYG-D 297 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 LR T+ +G D + + + Sbjct: 298 DADIATLRAITEPTGGQVREGDPETIRDLYK 328 >gi|166366825|ref|YP_001659098.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] gi|166089198|dbj|BAG03906.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] Length = 460 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 56/166 (33%), Gaps = 26/166 (15%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 KID++IES L+NS + ++ RI I ++ ++ Sbjct: 78 VVGGIAKIDMVIESLRTLINSGRFTPED------RIALIQFDDQASTLIGLTPVTQTRQL 131 Query: 263 KSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + KL T ++ A L + + + TDG+ Sbjct: 132 EDAIAKLRNFSGGTCMGRGINQALALL----------ANQSMTSRHTMIFTDGDT----- 176 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQ 366 + + + + + G+ I ++ V ++LL + + G Sbjct: 177 -FDEDDCQNLAQQFASQGISITALGVG--EFNENLLTDISGKTGGH 219 >gi|156394499|ref|XP_001636863.1| predicted protein [Nematostella vectensis] gi|156223970|gb|EDO44800.1| predicted protein [Nematostella vectensis] Length = 175 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 49/169 (28%), Gaps = 19/169 (11%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + R + L + + R I Y+ Sbjct: 25 SSGSIGVRDYKKEKQFVQGLSDIF-----DISPGQSRASLIIYSDFPKLIFDLEDGVTNQ 79 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + S L L T A+ A + + + + + +TDG+ + Sbjct: 80 NITSVLKNLEYLRGRTRIDKALMMAEEVFADARPT---------VPRIAFILTDGKQTQD 130 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 L+ + ++ G+KIY + V P LR T+ G F Sbjct: 131 -YDAIPLDVSS--QRLKKMGVKIYVIGVG-PYVDISELRLLTEKPGDVF 175 >gi|34980918|gb|AAH57200.1| Itgax protein [Mus musculus] Length = 304 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 19/139 (13%), Positives = 48/139 (34%), Gaps = 11/139 (7%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PY 272 +++ ++ + + + S R + ++ + + + S L+ + Sbjct: 165 STDFEKMLDFVKAVMSQLQRPSTRFSLMQFSDYFRVHFTFNNFISTSSPLSLLDSVRQLR 224 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T T A+ H EL+ + + K +I ITDG G + +++ + Sbjct: 225 GYTYTASAIKHVITELFTTQSGAR-----QDATKVLIVITDGRKQGDNLSYDSVIPMAEA 279 Query: 333 EYMRNAGMKIYSVAVSAPP 351 + Y++ V Sbjct: 280 ASIIR-----YAIGVGHKD 293 >gi|42524203|ref|NP_969583.1| hypothetical protein Bd2793 [Bdellovibrio bacteriovorus HD100] gi|39576411|emb|CAE80576.1| conserved hypothetical protein [Bdellovibrio bacteriovorus HD100] Length = 354 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 48/150 (32%), Gaps = 37/150 (24%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST-RLKKFVIF 310 +PL+N+ +K L L P ++ A + E + T ++ + ++ Sbjct: 143 LSPLTNDPGAIKMYLESLEPSSVSSQGTNFTEALKISKEAFERGGVSTDETVKVTRVILI 202 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ---------------- 354 +DGE + L + M G++I+S+A G Sbjct: 203 ASDGE-------DHEQGALDEAKKMAGEGVRIFSLAYGTEKGGAIPVRDGMGFLKGYKKD 255 Query: 355 ------------DLLRKCTDSS-GQFFAVN 371 D LR ++ G F+ Sbjct: 256 RQGQTILTTVKGDALRALAEAGQGSFYFAT 285 >gi|319784283|ref|YP_004143759.1| hypothetical protein Mesci_4600 [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317170171|gb|ADV13709.1| hypothetical protein Mesci_4600 [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 407 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 34/324 (10%), Positives = 81/324 (25%), Gaps = 21/324 (6%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 TA + + + A+DL + +Q+Q++LDAA L+ D Sbjct: 19 TAFAMLPIMIGLAGAVDLIGTSHDASQLQNSLDAAGLAIGTKFSPD-------MAAGDVQ 71 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + ++ A A + + + S+ + + Sbjct: 72 QLGLQFFAANMSAADQQEYLGSVSAFAATASGSPSAYFISLSSSISRPSFISASAPWQAY 131 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 ++ + + A+++ +VS + ++ L+ Sbjct: 132 RSASV-KIKPGAQACVLALDPHASAAVNLQGSTNVSMDNCVIAANSDASDSVNRGGSALV 190 Query: 182 ---PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 + A P + A + + + + Sbjct: 191 SAGCVSTVGGTSGLLPPSASLA-CGTPHEHRYASFDPLADVVPPPYTLCLPVPNGKTYTL 249 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA------MHHAYRELYNEK 292 Y + + N R + P N + M A + + Sbjct: 250 SPGTYCDKTLSGNI---TLNPGVYLMRGTTIKPGGNGSLTGQGVTIFLMESAQIYINANE 306 Query: 293 ESSHNTIGSTRLKKFVIFITDGEN 316 + + + S IF G Sbjct: 307 KVNLSPPTSGPYAGITIFQDHGNT 330 >gi|298370193|ref|ZP_06981509.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] gi|298281653|gb|EFI23142.1| pilus-associated protein [Neisseria sp. oral taxon 014 str. F0314] Length = 1071 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 15/102 (14%), Positives = 39/102 (38%), Gaps = 4/102 (3%) Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY--MRNAGMKIYSVAVSAP 350 ++ + S +L + + +G S + + R+ ++ +++ Sbjct: 230 RNTGLSFFSRKLSSIDFKTSGLDKAGKSWQGDPKDPKTAKNPNGFRDQLVQTFTIGFGRD 289 Query: 351 --PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 G++ L +F+ +L +FD ITD I+++ Sbjct: 290 ISSAGRNYLTNGASRDDYYFSAESEDDLYRAFDTITDSIKDE 331 >gi|196233140|ref|ZP_03131987.1| protein of unknown function DUF1355 [Chthoniobacter flavus Ellin428] gi|196222784|gb|EDY17307.1| protein of unknown function DUF1355 [Chthoniobacter flavus Ellin428] Length = 992 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 20/158 (12%), Positives = 52/158 (32%), Gaps = 20/158 (12%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 G + +++ +N P + + + M A L Sbjct: 492 DYAWGAQNGYNWVFPLTPVSEYDKLVPLINAATPGDMPDFHTPMQLALTGLQQ------- 544 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA-PPEGQDL 356 S K +I I+DG+ S T + + ++ + + +A++ + Sbjct: 545 ---SDAALKHLIVISDGDPSPP--------TPALVQSFVDSKISVSMIAINPHGGRDISI 593 Query: 357 LRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ + + G+++ D L F K ++ ++ Sbjct: 594 MQAISQQTGGRYYFPEDPAALPSIFIKEAKTLKRSMLQ 631 >gi|218779355|ref|YP_002430673.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] gi|218760739|gb|ACL03205.1| von Willebrand factor type A [Desulfatibacillum alkenivorans AK-01] Length = 504 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 20/135 (14%), Positives = 43/135 (31%), Gaps = 21/135 (15%) Query: 246 GIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 PL+ + + S + K+ P A+ A+ + Sbjct: 83 CDNAEVLVPLAPYDRRALISAIKKVQPQGKAPLAAALRKAWEQ-----------GAGLSQ 131 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG--MKIYSVAVSAP-PEGQDLLRKCT 361 + ITDG + + + E ++ G + + + V+ + L+R Sbjct: 132 GCVITLITDG------WDDCWGDPVSMVEDLKARGAGIIVNIIGVAPNREDAAKLMRLAR 185 Query: 362 DSSGQFFAVNDSREL 376 S G + A + +L Sbjct: 186 ASGGAYRAADTRADL 200 >gi|211616|gb|AAA48705.1| type VI collagen, alpha-2 subunit [Gallus gallus] Length = 720 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 26/171 (15%), Positives = 61/171 (35%), Gaps = 24/171 (14%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYN 290 +S+ + V +PL+ + + ++L + T T A+ + ++ + Sbjct: 13 NQVSITWMFGGLHYSDVVEIYSPLTRSKDTYLTKLRAIRYLGRGTFTDCAISNMTQQFQS 72 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + KF + ITDG +G+ E R+ G+K+++VA + Sbjct: 73 QTARD---------VKFAVVITDGHVTGSPCGG----MKMQAERARDMGIKLFAVA-PSE 118 Query: 351 PEGQDLLRKCTDSS-GQF---FAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ + + + L I + E+ ++ + Sbjct: 119 DVYEQGLREIASPPHDLYRSNYTITPKDALH-----IDENTIERIIKAMKH 164 Score = 44.1 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 52/154 (33%), Gaps = 19/154 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS--- 256 + + + N+V+ + ++ K+ + R+G + Y+ Sbjct: 546 SSESIGYTNFTLEKNFVVNVVSRLGSIAKDPKSETGARVGVVQYSHEGTFEAIKLDDERI 605 Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N+L+ K + +L T T A+ AY +L E + F + ITDG Sbjct: 606 NSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREKA-------QVFAVVITDGR 658 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + N +C + + ++ + Sbjct: 659 ---YDPRDDDKNLGALC----GRDVLVNTIGIGD 685 >gi|329768306|ref|ZP_08259804.1| hypothetical protein HMPREF0428_01501 [Gemella haemolysans M341] gi|328837279|gb|EGF86914.1| hypothetical protein HMPREF0428_01501 [Gemella haemolysans M341] Length = 1880 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 49/168 (29%), Gaps = 8/168 (4%) Query: 160 MEDLYLQKHNDNNN-MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA--PANRKIDVLIES 216 D + + Y L ++ S S P I Sbjct: 130 TGDPDTTDRVMVASFKDLDGYRLYDKQQQPSASSYVDASGRNYDPEKRTWYSYSGNDIYW 189 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 + +N+ + N + ++ + + L+N+ + + +N + T Sbjct: 190 WNSSINNWVTKYEADTNFTGDYQEGTWDGSKYTLESSNLTNDKTAIHNFINSITTRGGTP 249 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 T PA+ + + K + + K + ITDG +G Sbjct: 250 TVPAIEDIINKYNSVKGN-----MNNDRKTVFLLITDGVANGVRKDGK 292 >gi|254172494|ref|ZP_04879169.1| PKD domain protein [Thermococcus sp. AM4] gi|214033423|gb|EEB74250.1| PKD domain protein [Thermococcus sp. AM4] Length = 1418 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 27/191 (14%), Positives = 60/191 (31%), Gaps = 21/191 (10%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + +I + + LV S++ + R + + P + N++ Sbjct: 92 TGSMDDEIGTMKRNVNELVESLE-----GYGIRARYALVTFKDSPSLR--LPFTTNVSLF 144 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG--AS 320 ++KL +T A S +K +I ITD Sbjct: 145 TQTVSKLYASGGGDTPEDDLDAIAMAL-------RLNYSRLSQKILILITDAPTHYAGDG 197 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVA--VSAPPEGQDLLRKCTDSSGQFFAVNDSR---E 375 + + +I E +R G+ + V+ + D+ + G + ++ + Sbjct: 198 SGYSDYTIPEIAEMLREQGVLLIVVSPNFGSINPKNDVRELAVLTGGLWIDIHSADFGRI 257 Query: 376 LLESFDKITDK 386 L + D I + Sbjct: 258 LQKVIDSIGES 268 >gi|162451937|ref|YP_001614304.1| hypothetical protein sce3664 [Sorangium cellulosum 'So ce 56'] gi|161162519|emb|CAN93824.1| hypothetical protein sce3664 [Sorangium cellulosum 'So ce 56'] Length = 641 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 25/132 (18%), Positives = 51/132 (38%), Gaps = 14/132 (10%) Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 ++ +N P T YPA+ A N K++ + V+F+TDG S + Sbjct: 312 QAFINFPTPNGETPMYPALDGATTWANNYKDAHPEEEVA------VVFLTDGVPSSCNTV 365 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELL---- 377 N + L N G+++++V +L+ D G+ + ++ L Sbjct: 366 PNAIGNLAR-NAFVNHGVRVHAVGFG--NSNAELINLIADQGGGRAYNLSAGSTLEGSVL 422 Query: 378 ESFDKITDKIQE 389 ++ I + + Sbjct: 423 DALVSIRGEARS 434 >gi|222101614|gb|ACM44012.1| thrombospondin-related anonymous protein [Babesia bovis] gi|222101618|gb|ACM44014.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 58/164 (35%), Gaps = 14/164 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + N +R+ Y T L + + L KL+ T M + R L Sbjct: 77 DLDNTDIRLSLTTY-STPTRQIFTFLDAAASSTRLALTKLDWMAGTKARSGMTYTGRAL- 134 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 N + G + K ++ ITDG +S S T Q+ +R+ G+ + + V Sbjct: 135 NYVRKAILPYGRKNVPKALLLITDGVSSDGSY------TAQVAAMLRDEGVNVMVIGVG- 187 Query: 350 PPEGQDLLRKCTDSSGQ----FFAVNDSRELLESFDKITDKIQE 389 R G F + ++++ F+ + ++ + Sbjct: 188 -DVNVAECRGIVGCDGVMDCPMFKHTNWKDIMGLFNSLMKEVCD 230 >gi|149636528|ref|XP_001511995.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Ornithorhynchus anatinus] Length = 800 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 17/115 (14%), Positives = 45/115 (39%), Gaps = 23/115 (20%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + A++ + + +++ + ++ +TDGE+ S+ Sbjct: 379 AGGGTSICSGVQAAFQAIKQKFQTTDGSE--------IVLLTDGEDVTVSSCF------- 423 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE----LLESFD 381 E ++ +G I++VA+ + L + +D +G L+++F Sbjct: 424 --EEVKQSGATIHTVALGTSAAQE--LERLSDMTGGISTAPSDEAQNNGLIDAFS 474 >gi|134281810|ref|ZP_01768517.1| conserved hypothetical protein [Burkholderia pseudomallei 305] gi|134246872|gb|EBA46959.1| conserved hypothetical protein [Burkholderia pseudomallei 305] Length = 418 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 33/322 (10%), Positives = 88/322 (27%), Gaps = 29/322 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 26 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 85 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + +Q ++ S + + N ++ + + Sbjct: 86 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 145 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM-------EDLYLQKHND 170 L + +TN S+ +T + + +I + + + + + Sbjct: 146 QTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 205 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR------------KIDVLIESAG 218 + + + N+ S + Sbjct: 206 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGNKAAATNAYN 265 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR-------LNKLNP 271 + + AY+ +Q S+ +++ + + +N Sbjct: 266 TRFGIYANPYKNPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASYQGDLITGINT 325 Query: 272 YENTNTYPAMHHAYRELYNEKE 293 + N A R L E Sbjct: 326 GGSYNPSYYAAGADRRLALAPE 347 >gi|303246180|ref|ZP_07332461.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] gi|302492576|gb|EFL52447.1| von Willebrand factor type A [Desulfovibrio fructosovorans JJ] Length = 329 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 49/161 (30%), Gaps = 42/161 (26%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKE 293 RIG +A+ P +++ + L++L+ T A+ A ++L Sbjct: 129 RIGLVAFGSRAYV--VLPPTDDRAALTQALSRLSVGAAGRRTAMGDAVGLAVKQLDRA-- 184 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-- 351 L + V+ DG ++ + ++ + G+ +++V V Sbjct: 185 --------PGLARLVVVFGDGLSNAG-----EVRPVEAAKAAAARGIAVFTVGVGGDGPA 231 Query: 352 --------------------EGQDLLRKCTDSSGQFFAVND 372 + L S G FF D Sbjct: 232 PFLVNHPLLGQEIVRENAAVDTAALTELAALSGGAFFRAED 272 >gi|182414211|ref|YP_001819277.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177841425|gb|ACB75677.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 611 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 17/105 (16%), Positives = 39/105 (37%), Gaps = 21/105 (20%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 +PLS++ ++ L L+P TN ++ A + +F+ Sbjct: 146 SPLSSDYEILREFLPALDPTFLPEGGTNYDALINTALTAFGATGAA----------DRFL 195 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 I ++DGE + + ++N G+++ ++ V Sbjct: 196 IILSDGEATEDDWRSHVA-------ELKNRGIRVIALGVGTTAGA 233 >gi|329901495|ref|ZP_08272833.1| Type IV fimbrial biogenesis protein PilY1 [Oxalobacteraceae bacterium IMCC9480] gi|327549103|gb|EGF33703.1| Type IV fimbrial biogenesis protein PilY1 [Oxalobacteraceae bacterium IMCC9480] Length = 1132 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 38/422 (9%), Positives = 104/422 (24%), Gaps = 67/422 (15%) Query: 31 SALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQ 90 A D A + + + + ++ + K ++ +++ + E Sbjct: 72 QAADGAGNAPYTVSSNGNLVDNGASRLNVAKAGIKAIVQAYMQNTDFALETYSTSGTSLY 131 Query: 91 INITKDKNNPLQYIAESK----------------AQYEIPTENLFLKGLIPSALTNLSLR 134 + + A ++ L++ Sbjct: 132 TTWVYYMSPAGSGFVFTSTKLAGNRYVANPCLNYALASSTVKSNCTGIAGAGLYAALAVS 191 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + ++ + + +I VL S + + Y L S Sbjct: 192 GSAYMQIGAASDDANINDVLYASGQPGVYINYGNTSPASPYPPSYSLANYNNGSVLITYP 251 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + + A + S + ++ S + + Sbjct: 252 SSVP-SANRATGPTNAGYVPYSPQVMYAQRGFGYGGSQSPSSGNVLVPMTSAGTVPTASS 310 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN--------TIGSTRLKK 306 ++ V + L L P NT + + + + + K+ Sbjct: 311 VT---TAVNTFLPPLAPETNTLQTTEIKAVAGQAPTAGLLAKAQSYLASVLSTANCAPKQ 367 Query: 307 FVIFITDGENSGA-------------------------------SAYQNTLNTLQICEYM 335 +V+ I+DG + + +T+ + Sbjct: 368 YVVLISDGLPTQDLAGKYWPPLGSAAAAGYGLTASFKADGSLNVTNDTALTDTITNLAKL 427 Query: 336 RNAGMKIYSVAVSAPPE------GQDLLRKCTDSSG--QFFAVNDSRELLESFDKITDKI 387 + AG+ Y + + A + L+ + G ++ + L++ + I + Sbjct: 428 KAAGINTYVIGLGAGVDPSVNPQAAATLQAMAIAGGTANYYPASSPAALVDGLNSILISV 487 Query: 388 QE 389 Q Sbjct: 488 QS 489 >gi|162450402|ref|YP_001612769.1| hypothetical protein sce2130 [Sorangium cellulosum 'So ce 56'] gi|161160984|emb|CAN92289.1| hypothetical protein sce2130 [Sorangium cellulosum 'So ce 56'] Length = 865 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 17/141 (12%), Positives = 41/141 (29%), Gaps = 19/141 (13%) Query: 254 PLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 P + L + ++ L T + A L + + + V+ +T Sbjct: 346 PFTQATLKAADAFVDGLRADGGTEMLNPLLAAVGMLGDAER-----------DRVVVLLT 394 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DG+ + + + R G++IY+ + + + G ++ Sbjct: 395 DGQVGNEAQIVDRVVQ-------RGKGVRIYTFGIGTNVSDVLVNDLARRTEGAAEFIHP 447 Query: 373 SRELLESFDKITDKIQEQSVR 393 + E + V Sbjct: 448 GERIDEKVTAQFARATAVRVT 468 >gi|88801582|ref|ZP_01117110.1| hypothetical protein PI23P_02947 [Polaribacter irgensii 23-P] gi|88782240|gb|EAR13417.1| hypothetical protein PI23P_02947 [Polaribacter irgensii 23-P] Length = 330 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 23/119 (19%), Positives = 42/119 (35%), Gaps = 23/119 (19%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK----LNPYENTNTYPAMHHAYRELYN 290 S R+G I Y P++ + L + + T A+ A Sbjct: 110 SDRVGIIVYAGNSYP--LLPITTDHAAANMFLQNANPDMVSSQGTAINEALELAKTYYN- 166 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + +F+I ++DGE+ T Q+ + + N G+KIY++ V Sbjct: 167 ---------NDEQTNRFLIILSDGEDHQ-------EETKQVAQNLANNGVKIYTIGVGT 209 >gi|291242484|ref|XP_002741138.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 765 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 18/119 (15%), Positives = 42/119 (35%), Gaps = 15/119 (12%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + A + L + +S ++ ITDG + + Sbjct: 355 ASGATSIGDGLRVALQVLQDGNVTSEGAS--------LLLITDGIENTY------PLLMN 400 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + + + ++G+++ ++A L ++ G +F V D+ D + I E Sbjct: 401 VMQEVYDSGVRVDTIAY-TEAAQSTLQELSDNTGGLYFYVPDNDTSTAFIDSLAATISE 458 >gi|254481786|ref|ZP_05095029.1| Vault protein inter-alpha-trypsin [marine gamma proteobacterium HTCC2148] gi|214037915|gb|EEB78579.1| Vault protein inter-alpha-trypsin [marine gamma proteobacterium HTCC2148] Length = 686 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 29/211 (13%), Positives = 70/211 (33%), Gaps = 18/211 (8%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 P ++ + + + ++ G+L +++ + I + + Sbjct: 305 PASQRAAETVPREIVFVVDTSGSMGGVSIKQAKGSLTRALRH--LGPNDRFNVIEFNSSH 362 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 + + +NL + L T PA+ A + + + + Sbjct: 363 RALFQHAVPASHHNLQLASEYVRHLEASGGTEMMPALQLALKL-----PGAQDELRPEPA 417 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCTD 362 + VIFITDG N + E++ ++ G ++++V + + P + + Sbjct: 418 LRQVIFITDGAVG---------NESALFEHIVDSLGGSRLFTVGIGSAPNAWFMRKAAEY 468 Query: 363 SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G F + D E+ E D + + Sbjct: 469 GRGTFTYIGDVAEVGEKMDALFLNLTRPVAT 499 >gi|91084771|ref|XP_972223.1| PREDICTED: similar to Inter-alpha-trypsin inhibitor heavy chain H4 precursor (ITI heavy chain H4) (Inter-alpha-inhibitor heavy chain 4) (Inter-alpha-trypsin inhibitor family heavy chain-related protein) (IHRP) (Plasma kallikrein sensitive glycoprotein 120) (P [Tribolium castaneum] Length = 606 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 30/270 (11%), Positives = 81/270 (30%), Gaps = 29/270 (10%) Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + L + VLD S SME ++ + + +S Sbjct: 237 VHFFAPSGLQTLPKHVVFVLDYSASMEGRKHEQLMQAMDKILSDLNPDDLFHIVRFSVIV 296 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + + + D + + Q+ + ++ +N+G Sbjct: 297 SVWNFE------KNRFDQIKFA------------QKPEYENLDSFLAEFNLGDAAQVSE- 337 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 +N+ + K + + TN + + E + T+ + +IF+TDG Sbjct: 338 --DNIKKAKEIKDHDVDMDCTNIIGGLVVGLYLVRQTLEKFYEKNIETKHQPMIIFLTDG 395 Query: 315 ENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFA 369 + ++ + + + I+S++ + L + + G + Sbjct: 396 LPNVGLIIRDEI--TDVVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYE 453 Query: 370 VNDSR-ELLESFDKITDKIQEQSVRIAPNR 398 D+ +L + ++ + ++ Sbjct: 454 AADAALQLQNFYRTVSSPLLRDVRFKYVDK 483 >gi|221506674|gb|EEE32291.1| von willebrand factor type A domain-containing protein, putative [Toxoplasma gondii VEG] Length = 931 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 68/171 (39%), Gaps = 15/171 (8%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYN 290 V + + + P+S ++ +R+++L PY TNT A+ AY+ Sbjct: 290 DQSHVSVVRFSTTARADWSLVQPVSWTEKQLTNRISRLPQPYGGTNTPAALEEAYKIFVT 349 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAY------QNTLNTLQICEYMRNAGMKIYS 344 ++ + S + + ++ TDG + + + + L+ ++N +K+ Sbjct: 350 S-MNNRDEHDSEHVHRVLLLATDGCVNQWDRFKFRTPEAHLHDVLERMSSLKNLHIKVLG 408 Query: 345 VAVSAPPEGQDLLRKC------TDSSGQFFAVNDS-RELLESFDKITDKIQ 388 + S L+ C + + ++ N EL E ++I ++++ Sbjct: 409 IGKSICHSEIRLIAGCDPKGTESCRNAKYTDFNSVLDELPEYLEEICEEVE 459 >gi|237831727|ref|XP_002365161.1| von Willebrand factor type A domain-containing protein [Toxoplasma gondii ME49] gi|211962825|gb|EEA98020.1| von Willebrand factor type A domain-containing protein [Toxoplasma gondii ME49] Length = 929 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 68/171 (39%), Gaps = 15/171 (8%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHAYRELYN 290 V + + + P+S ++ +R+++L PY TNT A+ AY+ Sbjct: 290 DQSHVSVVRFSTTARADWSLVQPVSWTEKQLTNRISRLPQPYGGTNTPAALEEAYKIFVT 349 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAY------QNTLNTLQICEYMRNAGMKIYS 344 ++ + S + + ++ TDG + + + + L+ ++N +K+ Sbjct: 350 S-MNNRDEHDSEHVHRVLLLATDGCVNQWDRFKFRTPEAHLHDVLERMSSLKNLHIKVLG 408 Query: 345 VAVSAPPEGQDLLRKC------TDSSGQFFAVNDS-RELLESFDKITDKIQ 388 + S L+ C + + ++ N EL E ++I ++++ Sbjct: 409 IGKSICHSEIRLIAGCDPKGTESCRNAKYTDFNSVLDELPEYLEEICEEVE 459 >gi|226510867|gb|ACO59960.1| matrilin-1 [Salmo salar] Length = 108 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 39/119 (32%), Gaps = 14/119 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYREL 288 + +G + Y+ + NN ++K + K+ E T T A+ + Sbjct: 3 DVSETKAHVGLVQYSSSVKQEFPLGRYNNKKDLKDAVKKMAYMERGTMTGQALRY----- 57 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + + K I TDG + + + G K+++V V Sbjct: 58 LTDSSFAPAGGARPGVAKVGIVFTDGRSQDYIGD--------AAKKAKEQGFKMFAVGV 108 >gi|47215752|emb|CAG05763.1| unnamed protein product [Tetraodon nigroviridis] Length = 565 Score = 46.8 bits (109), Expect = 0.006, Method: Composition-based stats. Identities = 14/99 (14%), Positives = 30/99 (30%), Gaps = 15/99 (15%) Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 T K + ITDG ++G + + +R G++I+++ + Sbjct: 166 TASTPQQILRTSRTNATKVIFLITDGYSNGG-------DPRPVAAALRERGVEIFTLGI- 217 Query: 349 APPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKITD 385 L + V + E F+ + Sbjct: 218 -WQGNIKELHDMASEPKEQHCYLVPNFAE----FEALAR 251 >gi|315181058|gb|ADT87972.1| hypothetical protein vfu_A02859 [Vibrio furnissii NCTC 11218] Length = 406 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 24/156 (15%), Positives = 46/156 (29%), Gaps = 4/156 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + I + +AID+ H M R ++Q+++DAA L+ + D T T T Sbjct: 20 IVTIAMLALIAVAAFAIDINHAMMNRTKLQNSVDAAALAAAIVLDKDGTEAQADTIARST 79 Query: 61 ST---IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 T + L + + ++P ++ E+ Sbjct: 80 LTKMSTAAGNAELTLDVSDVVNVEVQFSNDPTVFPDSGYSSSPDGDRYVRVVINQLDLES 139 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 F + L+ + I V Sbjct: 140 FFFARALGV-TKRLTASAVAGPSPGGNACNIVPMAV 174 >gi|313672125|ref|YP_004050236.1| von willebrand factor type a [Calditerrivibrio nitroreducens DSM 19672] gi|312938881|gb|ADR18073.1| von Willebrand factor type A [Calditerrivibrio nitroreducens DSM 19672] Length = 1174 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 32/229 (13%), Positives = 63/229 (27%), Gaps = 52/229 (22%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 + G ++ +QK E +N V + N S + + + Sbjct: 204 YSDVAGYDPEKGAVLGVLQKLSNEPRNPRVGAVLFSSNNIS----VIKPSYDYISLIKAI 259 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 N +TNT A+ + + ++ K F I ++DGE + T+ Sbjct: 260 NDTKAGGSTNTKGAIDTISNYYKSTEAYQFDSNVVPCAKNFAIVVSDGEWNVGGDPLPTI 319 Query: 327 N----TLQICEYMRNAGMKIYSVA--VSAPPEGQDLLRKCT------------------- 361 T + +K Y++A + + G L+ Sbjct: 320 RDMWKTDLMGNLTGKQNVKTYTLAMFMDSSSNGTRALKHMAVFGGYNDIDKNGLPCNYNK 379 Query: 362 -----------------------DSSGQFFAVNDSRELLESFDKITDKI 387 FF N+ E ++ + I +I Sbjct: 380 DSFNSLLENFPSTTCSEWDADNNGKPDGFFQGNNPDEFKQAIEDIFKQI 428 >gi|317125812|ref|YP_004099924.1| von Willebrand factor A [Intrasporangium calvum DSM 43043] gi|315589900|gb|ADU49197.1| von Willebrand factor type A [Intrasporangium calvum DSM 43043] Length = 577 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 14/113 (12%), Positives = 36/113 (31%), Gaps = 10/113 (8%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T Y + AY ++ + V+ +TDG+N + L Sbjct: 460 DGWTGLYDTIWAAYSKVKASYDPQRV--------NAVVILTDGKNEDPGGGLSLEQLLAK 511 Query: 332 CEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + I ++ + + + L + S ++ + ++ K Sbjct: 512 IKDATDPKRPIAITTIGIGPGVDAESLRKISRSSYSDYYGAENPADMTTVLAK 564 >gi|218515243|ref|ZP_03512083.1| hypothetical protein Retl8_16920 [Rhizobium etli 8C-3] Length = 94 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 13/70 (18%), Positives = 26/70 (37%), Gaps = 10/70 (14%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQM----------QSALDAAVLSGCASIVSDRTI 50 MTA+++ A+D AH + +R Q+ A + ++ ++ + TI Sbjct: 19 MTALLMVPLVGTAGMAVDFAHALSLRTQLYAAADAAAVGSIAEKSGAVAAAMAMNGNGTI 78 Query: 51 KDPTTKKDQT 60 T Sbjct: 79 SLGKTDARDI 88 >gi|1915902|emb|CAA67576.1| collagen (VI) alpha-1 chain [Homo sapiens] Length = 436 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 26/185 (14%), Positives = 56/185 (30%), Gaps = 16/185 (8%) Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 N + +K +F S + + + D A L A Sbjct: 211 KNVTAQICIDKKCPDYTCPITFSSPADITILLDGSASVGSHNFDTTKRFAKRLAERFLTA 270 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHH 283 + VR+ + Y+ N + S ++ ++ + T+ A+ + Sbjct: 271 GRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGY 330 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 R + KK ++ +DG + G + + + AG++I+ Sbjct: 331 VTRFYREASSGAA--------KKRLLLFSDGNSQG----ATPAAIEKAVQEAQRAGIEIF 378 Query: 344 SVAVS 348 V V Sbjct: 379 VVVVG 383 >gi|307109844|gb|EFN58081.1| hypothetical protein CHLNCDRAFT_142385 [Chlorella variabilis] Length = 654 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 35/353 (9%), Positives = 85/353 (24%), Gaps = 41/353 (11%) Query: 31 SALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQ 90 S LD++ + S S T + ++ + I Sbjct: 315 STLDSSTAAVSGSGASGSVANTLTMTGGE-------ELSGLQTDPVKQEQLKQAILASLN 367 Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 + +N + + ++ + S+ Sbjct: 368 LPPGYTTDNIQINVVSVTQEGGRRRLQAPQTKVVVEYTLTATPEVQ---VEVLAKAEASV 424 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 + + V + + + T A + Sbjct: 425 YLFIRVQQPAIPWWSG------PTATICISDSEGACNVNTPPITPPVVIQDCSANVCFLL 478 Query: 211 DVLI-----ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG---NQCTPLSNNLNEV 262 D + ++V S + + + + ++ + ++ N + V Sbjct: 479 DGSKSLTNVDGWNDVVASARSIMYSLNDPAAVFDVFWFSNDVEKIGHATGAEVAANNSFV 538 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 +N T A+ L E + + ++ ITDG+ + Sbjct: 539 SMVVNTTPDAHGTWMAQAITTCQGVLLEEDTQA---------SRTIVLITDGKPT----- 584 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + T + + + G+K+ V V A L+ FA + Sbjct: 585 -DPQPTFEAADAAKARGIKM--VVVGAGEIDYATLKALASGPQFVFANTNLDS 634 >gi|229825750|ref|ZP_04451819.1| hypothetical protein GCWU000182_01113 [Abiotrophia defectiva ATCC 49176] gi|229790313|gb|EEP26427.1| hypothetical protein GCWU000182_01113 [Abiotrophia defectiva ATCC 49176] Length = 1659 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 35/292 (11%), Positives = 78/292 (26%), Gaps = 39/292 (13%) Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 ++ L+ G E N + I +V+D S SME ++ LLP Sbjct: 66 GTYDIELKVKGSTEVVQNNKILDIVLVMDTSGSME-GKSLENAKKAANNFVDKLLPQNNN 124 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ----------------- 229 + ++ + R + L + L Q Sbjct: 125 --VNIGIVSFAEKGEIKSGLTRNVTTLKNAIKGLKADGGTYTQQGLEKAATVLNGAPAEH 182 Query: 230 --------EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 + + ++ G P + + N Sbjct: 183 KKVMVVIGDGEPTYANGEHPNFDKGGFYRIYNPATKKEGYEQWYGNAFKWLGKGYNSAHR 242 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 ++ + + + + + D TL+ + ++ + Sbjct: 243 NYLVKLINGGFGNGTKERKGWGHVSWPFNLMDDYF--------ENATLKAADTIK-NNTE 293 Query: 342 IYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 I +V + + ++ K S A + EL + D I +K+Q++ Sbjct: 294 IITVGIDIENNDLAKSIMNKLATSGKYLKAGAVAGELDKILDDIAEKLQKKV 345 >gi|95147674|ref|NP_001035616.1| complement factor B precursor [Bos taurus] gi|146345391|sp|P81187|CFAB_BOVIN RecName: Full=Complement factor B; AltName: Full=C3/C5 convertase; AltName: Full=EC-VMFB; Contains: RecName: Full=Complement factor B Ba fragment; Contains: RecName: Full=Complement factor B Bb fragment; Flags: Precursor gi|86438491|gb|AAI12505.1| Complement factor B [Bos taurus] gi|296474252|gb|DAA16367.1| complement factor B precursor [Bos taurus] Length = 761 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 68/227 (29%), Gaps = 30/227 (13%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANR-----KIDVLIESAGNLVNSIQKAIQEKKNLSV 236 P +K + + S + + + + + Sbjct: 253 PGEQQKRKIVLDPSGSMNIYLVLDGSDSVGAHNFTGAKNCLRDFIEKVASYGVKP----- 307 Query: 237 RIGTIAYNIGIVG--NQCTPLSNNLNEVKSRLNKLNPYE-----NTNTYPAMHHAYRELY 289 + G + Y P S+ + V +LN++N + TNT A+ Y + Sbjct: 308 KYGLVTYATEPKVLIRVSDPKSSEADWVTDQLNQINYADHKLKAGTNTKRALLEVYNMMS 367 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL-NTLQICEYMRNAGMK------I 342 E T R + +I +TDG ++ + + + + RN I Sbjct: 368 REVNQFKETWN--RTRHVIIIMTDGLHNMGGDPVTVIHDIRYLLDIGRNRKNPREDYLDI 425 Query: 343 YSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDK 386 Y V P Q+ + F + L + F ++ D+ Sbjct: 426 YVFGVG-PLVNQENINALASKKDKEKHVFKLQGMENLEDVFVQMLDE 471 >gi|332305539|ref|YP_004433390.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Glaciecola agarilytica 4H-3-7+YE-5] gi|332172868|gb|AEE22122.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Glaciecola agarilytica 4H-3-7+YE-5] Length = 1359 Score = 46.8 bits (109), Expect = 0.007, Method: Composition-based stats. Identities = 11/58 (18%), Positives = 29/58 (50%), Gaps = 1/58 (1%) Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 E + ++ + +++ +A + L + G F+ ++S+ELL +F I +++ Sbjct: 629 ESVIDSRIITHTIGFAANTQANSFLNQIALQGGGGFYQADNSQELLGAFQSILKTVKD 686 >gi|313203639|ref|YP_004042296.1| von willebrand factor type a [Paludibacter propionicigenes WB4] gi|312442955|gb|ADQ79311.1| von Willebrand factor type A [Paludibacter propionicigenes WB4] Length = 346 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 52/169 (30%), Gaps = 49/169 (28%) Query: 251 QCTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++ + K L+ ++P + T A+ A + + E+ K Sbjct: 142 TQLPITVDYVSAKMFLSNISPELVPRQGTAIGSALDLAIKSFGAKSEAG----------K 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 +I ITDGEN + + + + + + + + Sbjct: 192 AIILITDGEN-------HEDDAIGAAKLAAENNIIVNVIGMGKTDGAPIPVPGTMSFRKD 244 Query: 353 ----------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ + G + +++ ++ I ++ + Sbjct: 245 KDGNVVVSKLNEQMCKEIALAGKGVYVHADNT---NGAYKVINKELDKL 290 >gi|309812068|ref|ZP_07705828.1| Tat pathway signal sequence domain protein [Dermacoccus sp. Ellin185] gi|308433947|gb|EFP57819.1| Tat pathway signal sequence domain protein [Dermacoccus sp. Ellin185] Length = 597 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 32/187 (17%), Positives = 64/187 (34%), Gaps = 10/187 (5%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC--TPLSNNLN 260 KID L+ S A + + R+G A++ + N PL+N Sbjct: 402 SGSMQTKIDGGQSRI-ELMESTAIAALDVLPKTTRLGAWAFSSNLQKNHVDYLPLTNGEQ 460 Query: 261 EVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR---LKKFVIFITDGE 315 + N L +T A + LY+ +++ ++ T V+ +TDG Sbjct: 461 PILDDTYRNGLIAKAHTLPGLAAKNGDTALYDTIAAAYKSVTDTYDPNYVNSVVVLTDGT 520 Query: 316 NSGASAYQNTLNTLQICEY--MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 N + L + + +KI ++++ + L R + G + Sbjct: 521 NDDPNGGLALDQLLARLKSQYSADKPVKIVTISLGTGTDPDALKRIAKATDGLSYQTKTP 580 Query: 374 RELLESF 380 ++ F Sbjct: 581 EQISGVF 587 >gi|297746279|emb|CBI16335.3| unnamed protein product [Vitis vinifera] Length = 602 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 18/146 (12%), Positives = 41/146 (28%), Gaps = 21/146 (14%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE---VKSRLNKLNPYENTNTYPAMHHA 284 N +V I A N T L+ +N L TN + Sbjct: 200 STSHNNFTVLIHLKAPLTSGRQNSGTKLALLKRAMGFALQAVNSLVSNGGTNIAEGLRKG 259 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-----ASAYQNTLNTLQICEYMRNAG 339 + + + +I ++DG+++ A+ T +L + + G Sbjct: 260 AKVMLD--------RKWKNPVSSIILLSDGQDTYTVCSPGGAHSRTDYSLLLPFSIHRNG 311 Query: 340 -----MKIYSVAVSAPPEGQDLLRKC 360 + +++ + + Sbjct: 312 GTGFQIPVHAFGFAEGVIQDAFAQCI 337 >gi|326672458|ref|XP_695815.4| PREDICTED: von Willebrand factor A domain-containing protein 2 [Danio rerio] Length = 795 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 29/295 (9%), Positives = 73/295 (24%), Gaps = 31/295 (10%) Query: 84 DIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSS 143 +D + + + E + + + Sbjct: 432 GFQSTPVFADVQDDLPRVVVLLTGTPSADHVVEPAKYAR---DREIFIIGVAPEGMRAEI 488 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 N+ + + + + + + A + Sbjct: 489 NNITGNPQRTIMYQSPDRLSSKIPELRAKICSVDNQGCLGQALDLVFVLD------ASSG 542 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 I + V S + ++G + Y V N+ + V Sbjct: 543 VGKENFIH-----FQDFVRS-TSVQFDINRDVAQVGLVVYGRRPVTVFDLDKYNSGSAVL 596 Query: 264 SRL-NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + + A+ H + + + + K V+ +TDG Sbjct: 597 RAVGDAAFLGGKASVGSALLHVLSQSLTVGKGARPGVN-----KAVVVLTDG-------- 643 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + +R++G+ I+ + V Q+LL + + S V +L Sbjct: 644 TGVEDAAVPAQKIRDSGVSIFLIGVG--DIQQELLLRISGSEDHMITVPSYDDLK 696 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 24/161 (14%), Positives = 58/161 (36%), Gaps = 21/161 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + + VR+G + + E+K +L K++ T T A+ R+ Sbjct: 82 DVRQDKVRVGVVQFGSTPKLEVSLDSYKTKEELKKKLKKIHYRGGSTQTGLALKFVLRKG 141 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 ++ +S + + V+ ++DG++ G ++ +G+ +++V + Sbjct: 142 FSGGRNS-------SVPRVVVLLSDGKSQG--------AVQLPASELKLSGVLLFAVGIR 186 Query: 349 APPEGQDLLRKCTDSS--GQFFAVNDSRE-LLESFDKITDK 386 P + LR+ F + + F +T Sbjct: 187 YPRWDE--LRELASGPSDSHVFFAEHFSDAVNGLFTSLTTS 225 >gi|296329567|ref|ZP_06872053.1| hypothetical protein BSU6633_00600 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305672764|ref|YP_003864435.1| hypothetical protein BSUW23_00335 [Bacillus subtilis subsp. spizizenii str. W23] gi|296153310|gb|EFG94173.1| hypothetical protein BSU6633_00600 [Bacillus subtilis subsp. spizizenii ATCC 6633] gi|305411007|gb|ADM36125.1| conserved hypothetical protein [Bacillus subtilis subsp. spizizenii str. W23] Length = 245 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 12/98 (12%), Positives = 39/98 (39%), Gaps = 10/98 (10%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLR 358 + ++ ITDG ++ + L + + + G+ + + + + + ++ Sbjct: 1 MNNGHLNQILLITDGCSNHGE------DPLAMAAFAKEQGITVNVIGIMEENQIDPEAMK 54 Query: 359 K---CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + G V + +L ++ +T K Q+++ Sbjct: 55 EVEGIALAGGGVHQVVYASQLSQTVQMVTKKAMTQTLQ 92 >gi|291230028|ref|XP_002734972.1| PREDICTED: membrane-bound transcription factor peptidase, site 1-like [Saccoglossus kowalevskii] Length = 893 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 19/182 (10%), Positives = 58/182 (31%), Gaps = 4/182 (2%) Query: 202 APAPANRKIDVLIESAGNLVNS-IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 I+ ++ ++V+ + K + + + V + + +L Sbjct: 32 CTGSMGSWINEARKNIQSIVDEIVAKEMSDIRLALVEYRDHPPQESTFVTRVLDFTPSLK 91 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAY-RELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +++ +++ + + A+ L + + D + Sbjct: 92 DMQKQMDNMKANGGGDGPEAVADGLHEALNLNWRPLATKVCVLIADAPPHGLRDSGDGFP 151 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRELL 377 + ++I M + +YSV + EG + + GQ+ ++ D++ L Sbjct: 152 KGCPAGHDPMKIARQMAEQNITLYSVVCGSYAEGFKDFFMAIAHVTGGQYVSLKDAKLLS 211 Query: 378 ES 379 + Sbjct: 212 KV 213 >gi|16077133|ref|NP_387946.1| hypothetical protein BSU00650 [Bacillus subtilis subsp. subtilis str. 168] gi|221307875|ref|ZP_03589722.1| hypothetical protein Bsubs1_00330 [Bacillus subtilis subsp. subtilis str. 168] gi|221312197|ref|ZP_03594002.1| hypothetical protein BsubsN3_00330 [Bacillus subtilis subsp. subtilis str. NCIB 3610] gi|221317130|ref|ZP_03598424.1| hypothetical protein BsubsJ_00330 [Bacillus subtilis subsp. subtilis str. JH642] gi|221321393|ref|ZP_03602687.1| hypothetical protein BsubsS_00330 [Bacillus subtilis subsp. subtilis str. SMY] gi|321313735|ref|YP_004206022.1| hypothetical protein BSn5_11895 [Bacillus subtilis BSn5] gi|586890|sp|P37561|YABS_BACSU RecName: Full=Uncharacterized protein yabS gi|467454|dbj|BAA05300.1| unknown [Bacillus subtilis] gi|2632332|emb|CAB11841.1| conserved hypothetical protein [Bacillus subtilis subsp. subtilis str. 168] gi|291482437|dbj|BAI83512.1| hypothetical protein BSNT_00116 [Bacillus subtilis subsp. natto BEST195] gi|320020009|gb|ADV94995.1| hypothetical protein BSn5_11895 [Bacillus subtilis BSn5] Length = 245 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 12/98 (12%), Positives = 39/98 (39%), Gaps = 10/98 (10%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLR 358 + ++ ITDG ++ + L + + + G+ + + + + + ++ Sbjct: 1 MNNGHLNQILLITDGCSNHGE------DPLAMAAFAKEQGITVNVIGIMEENQIDPEAMK 54 Query: 359 K---CTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + G V + +L ++ +T K Q+++ Sbjct: 55 EVEGIALAGGGVHQVVYASQLSQTVQMVTKKAMTQTLQ 92 >gi|109899476|ref|YP_662731.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like [Pseudoalteromonas atlantica T6c] gi|109701757|gb|ABG41677.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Pseudoalteromonas atlantica T6c] Length = 1364 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 11/54 (20%), Positives = 27/54 (50%), Gaps = 1/54 (1%) Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 ++ + +++ +A E L + G F+ ++S+ELL +F I +++ Sbjct: 638 DSRIITHTIGFAANTEANSFLNQIALQGGGGFYQADNSQELLGAFQSILKTVKD 691 >gi|254780135|ref|YP_003064548.1| hypothetical protein CLIBASIA_00070 [Candidatus Liberibacter asiaticus str. psy62] gi|254039812|gb|ACT56608.1| hypothetical protein CLIBASIA_00070 [Candidatus Liberibacter asiaticus str. psy62] Length = 408 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 8/49 (16%), Positives = 21/49 (42%) Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + I+SV S + + LR+C +++ +N ++ + + Sbjct: 346 ITIFSVGFSPDQDTRYTLRQCASDPSKYYEINSDENVMPIAKSLARNVI 394 >gi|260837139|ref|XP_002613563.1| hypothetical protein BRAFLDRAFT_149227 [Branchiostoma floridae] gi|229298948|gb|EEN69572.1| hypothetical protein BRAFLDRAFT_149227 [Branchiostoma floridae] Length = 195 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 19/146 (13%), Positives = 46/146 (31%), Gaps = 17/146 (11%) Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTN 276 + + N ++G + Y + L+E+ + ++ T Sbjct: 66 NFVATTTSDFQIGPNNA--QVGIVQYANWLYEEVSLNQYKTLDELLPAIYNISYWGGGTY 123 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T A+ + E + + K VI +TDG+ + + Q + Sbjct: 124 TGWAIDYVVNATLTESRGAR-----QDVPKVVIVVTDGQ--------SADDVRQPALRAK 170 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTD 362 +G+ + ++ V + L + Sbjct: 171 QSGIIMVAIGVGS-IYDGTELVEIAT 195 >gi|203287632|ref|YP_002222647.1| hypothetical protein BRE_171 [Borrelia recurrentis A1] gi|201084852|gb|ACH94426.1| hypothetical protein BRE_171 [Borrelia recurrentis A1] Length = 341 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 29/172 (16%), Positives = 60/172 (34%), Gaps = 41/172 (23%) Query: 252 CTPLSNNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 PL+ + + +L+ ++ + + A L S KK V Sbjct: 154 IVPLTIDRDFFSKKLDDIYIMDLGNGSALGLGISIALSHL----------KHSEAPKKSV 203 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP----------------- 351 I +TDG + Y++ Q+ + +KIYSV + + Sbjct: 204 IVLTDGVVNSDEVYKD-----QVINLAQGLNVKIYSVGIGSDEELNVGFKLRSGKFYQGV 258 Query: 352 ----EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ-EQSVRIAPN 397 +L + ++ + G F++V D + + K E+ V+I+ + Sbjct: 259 LKEVYDPSMLFEISNKTGGLFYSVGDDFSFKLAIQDFSKKENVERKVKISVD 310 >gi|198417365|gb|ACH87900.1| ancillary protein 1 [Streptococcus pyogenes] Length = 1042 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 29/198 (14%), Positives = 65/198 (32%), Gaps = 22/198 (11%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNP--Y 272 + S+ ++K TI Y ++ + + + + L+ Sbjct: 542 AVVGFQGSVAYRYYDEKPERTPWNTIMYQPSKSTSKDADVLKDWETSSNLSRDSLSYEDR 601 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI--------TDGENSGASAYQN 324 TN + A+ A +L + H I F +G S ++ + Sbjct: 602 NGTNYHAALLKADEKLQKVANNGHRKIMVFISDGVPTFYFGADNYRSGNGTVSDSNIINS 661 Query: 325 TLNTLQICEYMRNA--GMKIYSVAVSAPPEGQD------LLRKCTDSSGQFFAVNDSREL 376 + + +N + IYS+ VS +L+ + + D+ +L Sbjct: 662 QKGSKLAIDEFKNKYPNLSIYSLGVSKDINSDTSSSSPVVLKYL-SGDDYYSGITDTEQL 720 Query: 377 LESFDKITD--KIQEQSV 392 ++ +KI + KI ++ Sbjct: 721 EKTANKIVEDSKISNLTI 738 >gi|53723209|ref|YP_112194.1| hypothetical protein BPSS2192 [Burkholderia pseudomallei K96243] gi|52213623|emb|CAH39677.1| putative membrane protein [Burkholderia pseudomallei K96243] Length = 396 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 33/322 (10%), Positives = 89/322 (27%), Gaps = 29/322 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 4 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 63 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + +Q ++ S + + N ++ + + Sbjct: 64 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 123 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM-------EDLYLQKHND 170 L + +TN S+ +T + + +I + + + + + Sbjct: 124 QTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 183 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR------------KIDVLIESAG 218 + + + N+ S + Sbjct: 184 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTNAYN 243 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR-------LNKLNP 271 ++ + AY+ +Q S+ +++ + + +N Sbjct: 244 TRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASYQGDLITGINT 303 Query: 272 YENTNTYPAMHHAYRELYNEKE 293 + N A R L E Sbjct: 304 GGSYNPSYYAAGADRRLALAPE 325 >gi|47218505|emb|CAF97239.1| unnamed protein product [Tetraodon nigroviridis] Length = 1060 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 19/190 (10%), Positives = 55/190 (28%), Gaps = 21/190 (11%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + + ++ S + ++ VR G + ++ E Sbjct: 470 SWSVGQNSFSHVKDFISAIITSFKDSV--VGTEGVRFGVTVFGDVPKMRIALTDYSSQEE 527 Query: 262 VKSRLNKLNPYENTN-TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 V + L + A+ + +++ + K + IT+G + Sbjct: 528 VLRAIRDLPYEGRSRRIGDALTFLVQHVFSPVIRRDHG------PKIAVLITNGRSDDPV 581 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLE 378 + ++G+ +++V V LR+ D L Sbjct: 582 D--------AAARLVADSGISLFAVGVG--GADASELRRMVSEPREEHLLLGADYSALEN 631 Query: 379 SFDKITDKIQ 388 +++ ++ Sbjct: 632 LLARLSRRVC 641 >gi|222101622|gb|ACM44016.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 58/164 (35%), Gaps = 14/164 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + N +R+ Y T L + + L KL+ T M + R L Sbjct: 77 DLDNTDIRLSLTTY-STPTRQIFTFLDAAASSTRLALTKLDWMAGTKARSGMTYTGRAL- 134 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 N + G + K ++ ITDG +S S T Q+ +R+ G+ + + V Sbjct: 135 NYVRKAILPYGRKNVPKALLLITDGVSSDGSY------TAQVAAMLRDEGVNVMVIGVG- 187 Query: 350 PPEGQDLLRKCTDSSGQ----FFAVNDSRELLESFDKITDKIQE 389 R G F + ++++ F+ + ++ + Sbjct: 188 -DVNVAECRGIVGCDGVMDCPMFKHTNWKDIMGLFNSLMKEVCD 230 >gi|149181776|ref|ZP_01860267.1| hypothetical protein BSG1_01140 [Bacillus sp. SG-1] gi|148850517|gb|EDL64676.1| hypothetical protein BSG1_01140 [Bacillus sp. SG-1] Length = 949 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 42/289 (14%), Positives = 81/289 (28%), Gaps = 56/289 (19%) Query: 133 LRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSK 192 + + E I + V D S SM+D + + + K L + Sbjct: 58 EVTPKGQATNEERKPIDVVFVHDTSGSMKDSFGGVKKATSAENALKESLRFFNQNQQSKD 117 Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI-QEKKNLSVRIGTIAYNIGIVGNQ 251 + D I+ A L + + A + Y + Sbjct: 118 KYFFVPFDSD-VSYKNYGDKRIQPAEGLSDILPMAEHLDFSEAYWVKKYSWYYGYYWSQE 176 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 S TN ++ +A + ++ K+++IF+ Sbjct: 177 IFDFS---------------VGGTNYTQSLEYALSKFSGMRD----------SKRYIIFL 211 Query: 312 TDGENSGASAYQNTL--------------------------NTLQICEYMRNAGMKIYSV 345 TDGE + + + E + +K+YS+ Sbjct: 212 TDGEPTSLNHDNKQYTLYTNGTARAGNVNANYNDVQKFIHEKAVASAEKLGVNDVKMYSI 271 Query: 346 AVSAPPE-GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 A + P E LL ++ + G+ + L F I+ + SV Sbjct: 272 AFAEPGEVNYQLLENMSNKTGGRAIQA-NPNSLSNVFTDISKEFNSPSV 319 >gi|148645283|gb|ABR01165.1| complement factor B [Ovis aries] Length = 761 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 33/227 (14%), Positives = 69/227 (30%), Gaps = 30/227 (13%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANR-----KIDVLIESAGNLVNSIQKAIQEKKNLSV 236 P +K + + S + + + + + Sbjct: 253 PGEQQKRKIVLDPSGSMNIYLVLDGSDSVGAHNFTGAKNCLRDFIEKVASYGVKP----- 307 Query: 237 RIGTIAYNIGIVG--NQCTPLSNNLNEVKSRLNKLNPYE-----NTNTYPAMHHAYRELY 289 + G + Y P S+ + V +LN++N + TNT A+ Y + Sbjct: 308 KYGLVTYATEPKVLIKVFDPKSSEADWVTEQLNRINYADHKLKAGTNTKRALLEVYNMMS 367 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL-NTLQICEYMRNAGMK------I 342 + + T R + +I +TDG ++ + + + + RN I Sbjct: 368 RDINNLKETWN--RTRHVIIIMTDGLHNMGGDPVTVIHDIRYLLDIGRNRKNPREDYLDI 425 Query: 343 YSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDK 386 Y V P Q+ + F + L + F ++ D+ Sbjct: 426 YVFGVG-PLVNQENINALASKKDKEQHVFKLQGMENLEDVFVQMLDE 471 >gi|73953968|ref|XP_853856.1| PREDICTED: similar to tumor endothelial marker 8 isoform 1 precursor [Canis familiaris] Length = 924 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 23/104 (22%), Positives = 37/104 (35%), Gaps = 7/104 (6%) Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 G + +I +TDG T E R G IYS+ V + Sbjct: 475 EEANSGGKKFPSMIIALTDGTLMPEPY----EETKIEAENSRQLGATIYSIGVM--DYRR 528 Query: 355 DLLRKCTDSSGQFFAVND-SRELLESFDKITDKIQEQSVRIAPN 397 D L DS F V++ + L + + +T K + + P+ Sbjct: 529 DQLLSIADSPDHVFGVDNGFKGLQDIVEPLTAKSCIEITNLEPS 572 >gi|315127492|ref|YP_004069495.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas sp. SM9913] gi|315016006|gb|ADT69344.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas sp. SM9913] Length = 666 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 50/158 (31%), Gaps = 21/158 (13%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPL---SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + I +N + TPL NL + + L T A+ Sbjct: 355 DSDDSFNIIGFNNQVTAMSDTPLVASDFNLRRARRFIYNLQADGGTEIQGALDAVLNGAQ 414 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + V+F+TDG S I + ++ ++++V + + Sbjct: 415 -----------FEGFVRQVVFLTDG-----SVSNEDELFKSIARTLGDS--RLFTVGIGS 456 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 P + R G + + + ++ ++ DK+ Sbjct: 457 APNRFFMRRAADIGKGSYTFIGSTFDVQPKMQQLFDKL 494 >gi|289606823|emb|CBI60997.1| unnamed protein product [Sordaria macrospora] Length = 599 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 56/227 (24%), Gaps = 54/227 (23%) Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE------ 230 + P KS W+ + K A ++ V+ + Sbjct: 13 DPSDDRPYEFKSVWNGCIEERKTNSAAINGGSSTTAPSDAYDLDVDLVPYNDDTRWRPMW 72 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPL--------SNNLNEVKSRLNKLNPYENTNTYPAMH 282 + +Y +G P NN + S LN L T M Sbjct: 73 NDVSYYPDWSWSYGVGRQPVAYCPTEAKRLQNYHNNRSGFVSYLNGLVARGGTYHDIGMI 132 Query: 283 HAYRELYNEK--------------ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 R L + I +KK++IF+TDG+ S + Sbjct: 133 WGARFLSTTGLFKSATPETNDVTDPDNPAKIRGFSVKKYMIFMTDGDMSPTWNDYSAYGI 192 Query: 329 LQ--------------------------ICEYMRNAGMKIYSVAVSA 349 C + G+ I+ +A S Sbjct: 193 EYLDGRVNGSPTTDNAALLARHLQRFRMACNAAKAKGIDIWVIAFST 239 >gi|115379114|ref|ZP_01466238.1| TPR domain protein [Stigmatella aurantiaca DW4/3-1] gi|310823566|ref|YP_003955924.1| Batb protein [Stigmatella aurantiaca DW4/3-1] gi|115363897|gb|EAU63008.1| TPR domain protein [Stigmatella aurantiaca DW4/3-1] gi|309396638|gb|ADO74097.1| BatB protein [Stigmatella aurantiaca DW4/3-1] Length = 352 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 19/103 (18%), Positives = 45/103 (43%), Gaps = 19/103 (18%) Query: 253 TPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 +PL+++ + VK L ++P +N A+ A + L N + ++ V Sbjct: 158 SPLTSDYSAVKLFLRAVDPEQMPQGGSNIGAALKLANQVLSNADRGA--------KERAV 209 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + ++DGE+ + E +++ G+++ +V V + Sbjct: 210 VLLSDGEDLFGEVG-------EATEALKDGGVQVLAVGVGSES 245 >gi|301614661|ref|XP_002936793.1| PREDICTED: collagen alpha-1(XXVIII) chain-like [Xenopus (Silurana) tropicalis] Length = 344 Score = 46.4 bits (108), Expect = 0.007, Method: Composition-based stats. Identities = 20/150 (13%), Positives = 50/150 (33%), Gaps = 37/150 (24%) Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T T A+ A +E + + ++K I +TDG+ + ++ Sbjct: 1 MMYMGEGTYTGTAIRKATQEGFFGARTG--------VRKVAIVLTDGQTDK----RESVK 48 Query: 328 TLQICEYMRNAGMKIYSVAV--SAPPEGQDLLRK---CTDSSG--QFFAVNDSRELLE-- 378 + A +++Y++ + ++ P D LR+ + ++D L Sbjct: 49 LDIAVREAQAANIEMYAIGIVNASDPTQVDFLRELNLIASDPDSEHMYLIDDFNTLPALE 108 Query: 379 ---------------SFDKITDKIQEQSVR 393 +++I I ++ Sbjct: 109 SKLVNQFCEDENGALIYNRIGSTINSMVMQ 138 >gi|148692793|gb|EDL24740.1| calcium channel, voltage-dependent, alpha2/delta subunit 3 [Mus musculus] Length = 1091 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 271 RLTIAKQTVSSILDTLGD-DDFFNIITYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 329 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 330 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 385 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 386 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 433 >gi|17231904|ref|NP_488452.1| hypothetical protein alr4412 [Nostoc sp. PCC 7120] gi|17133548|dbj|BAB76111.1| alr4412 [Nostoc sp. PCC 7120] Length = 820 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 46/166 (27%), Gaps = 19/166 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE---VKSRLNKLNPYENTNTYPAMHHAYR 286 N + ++ PL+NN + +N+L+ T + Sbjct: 328 NGLNPDDTFSIVDFSDTTRQLSPVPLANNAQNRTRAINYINQLSANGGTEMLRGIRAVLN 387 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + + + ++ +TDG N L + +G ++YS Sbjct: 388 --FPVTDPGRL--------RSIVLLTDG------YIGNENQILAEVQQHLKSGNRLYSFG 431 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + L R G + E DK +I + Sbjct: 432 AGSSVNRFLLNRIAELGRGIAQIIRHDEPTDEIVDKFYRQINNPVL 477 >gi|6753236|ref|NP_033915.1| voltage-dependent calcium channel subunit alpha-2/delta-3 [Mus musculus] gi|81872883|sp|Q9Z1L5|CA2D3_MOUSE RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-3; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-3; Flags: Precursor gi|4186073|emb|CAA09423.1| calcium channel alpha-2-delta-C subunit [Mus musculus] gi|147898121|gb|AAI40360.1| Calcium channel, voltage-dependent, alpha2/delta subunit 3 [synthetic construct] gi|151555329|gb|AAI48701.1| Calcium channel, voltage-dependent, alpha2/delta subunit 3 [synthetic construct] Length = 1091 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 271 RLTIAKQTVSSILDTLGD-DDFFNIITYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 329 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 330 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 385 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 386 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 433 >gi|88856118|ref|ZP_01130779.1| von Willebrand factor, type A [marine actinobacterium PHSC20C1] gi|88814686|gb|EAR24547.1| von Willebrand factor, type A [marine actinobacterium PHSC20C1] Length = 585 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 21/181 (11%), Positives = 62/181 (34%), Gaps = 14/181 (7%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN-LNEVKS 264 R+I++ SA + +++ + + L + + V + + + +++ Sbjct: 399 GQRRIEMFQNSAEDAFSTLSEDDELGMWLFSKERVGTEDWEDVAGIRSMNDPDHVKNLQA 458 Query: 265 RLNKLNP--YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 ++ L T + A + + ++ V+ ITDG N + Sbjct: 459 IIDSLPSRINGYTGLNDTVLAAVTHVREDY--------NSEKVNSVMLITDGRNEDDNGI 510 Query: 323 QNTLNTLQICEYMR---NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 ++ E ++ + + + V + + + + + + G + N+ EL Sbjct: 511 SLQKLLDKLTEMIQKDTDEPVPVVLVGIGPDTDVEAMRKIAQATGGTAYQANNPTELSNV 570 Query: 380 F 380 Sbjct: 571 M 571 >gi|194018005|ref|ZP_03056612.1| YabS [Bacillus pumilus ATCC 7061] gi|194010342|gb|EDW19917.1| YabS [Bacillus pumilus ATCC 7061] Length = 244 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 14/95 (14%), Positives = 38/95 (40%), Gaps = 10/95 (10%) Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-EGQDLLRK---CTDS 363 ++ ITDG ++ + L I + G+ + + + + +++ + Sbjct: 9 ILLITDGCSNHGE------DPLAIASLAKEQGITVNVIGIMEENRHDHEAMKEVEGIALA 62 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 G V ++L ++ +T K Q+++ N+ Sbjct: 63 GGGIHQVVYVQQLSQTVQMVTKKAMTQTLQGVVNK 97 >gi|157690847|ref|YP_001485309.1| hypothetical protein BPUM_0049 [Bacillus pumilus SAFR-032] gi|157679605|gb|ABV60749.1| hypothetical protein BPUM_0049 [Bacillus pumilus SAFR-032] Length = 244 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 14/95 (14%), Positives = 38/95 (40%), Gaps = 10/95 (10%) Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-EGQDLLRK---CTDS 363 ++ ITDG ++ + L I + G+ + + + + +++ + Sbjct: 9 ILLITDGCSNHGE------DPLAIASLAKEQGITVNVIGIMEENRHDHEAMKEVEGIALA 62 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 G V ++L ++ +T K Q+++ N+ Sbjct: 63 GGGIHQVVYVQQLSQTVQMVTKKAMTQTLQGVVNK 97 >gi|114684811|ref|XP_531503.2| PREDICTED: collagen, type VI, alpha 1 isoform 3 [Pan troglodytes] Length = 997 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 30/222 (13%), Positives = 62/222 (27%), Gaps = 25/222 (11%) Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 N + +K +F S + + + D A L A Sbjct: 772 KNVTAQICVDKKCPDYTCPITFSSPADITILLDGSASVGSHNFDTTKRFAKRLAERFLTA 831 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHH 283 + VR+ + Y+ N + S ++ ++ + T+ A+ + Sbjct: 832 GRTDPAHDVRVAVVQYSGTGQQRPERASLQFLQNYTALASAVDAMDFINDATDVNDALGY 891 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 R + KK ++ +DG + G + + + AG++I+ Sbjct: 892 VTRFYREASSGAA--------KKRLLLFSDGNSQG----ATPAAIEKAVQEAQRAGIEIF 939 Query: 344 SVAVSAPPE---------GQDLLRKCTDSSGQFFAVNDSREL 376 V V G+ F V + L Sbjct: 940 VVVVGRQVNEPHIRVLVTGKTAEYDVAYGERHLFRVPSYQAL 981 Score = 44.5 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 22/126 (17%), Positives = 45/126 (35%), Gaps = 14/126 (11%) Query: 266 LNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++ + + T T A+ +L SH K++I +TDG Sbjct: 86 VDAVKYFGKGTYTDCAIKKGLEQLLVG--GSHLKEN-----KYLIVVTDGHPLEGYKEPC 138 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF---FAVNDSRELLESFD 381 ++ G+K++SVA+ P + L + + F D + ++ + Sbjct: 139 G-GLEDAVNEAKHLGVKVFSVAI-TPDHLEPRLSIIA-TDHTYRRNFTAADWGQSRDAEE 195 Query: 382 KITDKI 387 I+ I Sbjct: 196 AISQTI 201 >gi|319778370|ref|YP_004129283.1| exopolysaccharide biosynthesis domain protein [Taylorella equigenitalis MCE9] gi|317108394|gb|ADU91140.1| exopolysaccharide biosynthesis domain protein [Taylorella equigenitalis MCE9] Length = 553 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 37/234 (15%), Positives = 73/234 (31%), Gaps = 42/234 (17%) Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + +V D S SM D + +L S P P+ Sbjct: 342 QMVLVFDNSISMLHSM-----DAATANFEQQMLNA------MLSGVIPSGGIPMPSNGPT 390 Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +I ++A +++NSI I R T + + +KS +N+ Sbjct: 391 RIQTAKKAANSIINSIDPYIDIGFISLDRCPTATNHGFFKP-------SKRGALKSTINR 443 Query: 269 LNPY--ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 ++ T + A + F++ ++DG ++ Sbjct: 444 MDTNNSSGTVLANGIQQAANMV-----------DGKNRDAFILVLSDGASTCG------P 486 Query: 327 NTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 N + + +KI V + + E L R + GQ + N + +L + Sbjct: 487 NICTVAANAKRNKPKLKINVVDIGSTGEANCLARV---TGGQVYKANTAAQLAK 537 >gi|269104660|ref|ZP_06157356.1| putative hemagglutinin/hemolysin-related protein [Photobacterium damselae subsp. damselae CIP 102761] gi|268161300|gb|EEZ39797.1| putative hemagglutinin/hemolysin-related protein [Photobacterium damselae subsp. damselae CIP 102761] Length = 3986 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 38/373 (10%), Positives = 105/373 (28%), Gaps = 34/373 (9%) Query: 30 QSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKA 89 Q ++ + L + + + ++ S +K + + + Sbjct: 3376 QITVEGSALLKDSDRIVEASVTTTDGAHHSASDTAEKSYQIDGVVIQADNGDNTIVGTVG 3435 Query: 90 QINITKDKNNPLQYIAESKAQYEIPTE-NLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 + D + + + I T +++ L+ +++ + + Sbjct: 3436 SDLLIGDLDPAKIVDVPTNVNFVIDTSGSMYYGRLLNLDSIHMNSAEKYKVFVNYGATLT 3495 Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR 208 + + S + ++ D S Y P+ Sbjct: 3496 AADGTQLYNGSSQSGWVTVTYDQMKAGLQYDGYRAEDPIYIKSSIGEDQTYKLTDFPSV- 3554 Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL------NEV 262 D+ ++ LV+ I +K +L+ + T +G + +N ++ Sbjct: 3555 -FDMTKQAYQVLVDEILTNTNDKSSLNFNVVTFNSTVGGDSSFHYDAESNSFVNSRGTDI 3613 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + LN L T + + + + V F+TDG+++ + Sbjct: 3614 HNYLNSLIAGGGTEFEAPLKTISDHIVTDGNTR----------NVVYFLTDGKDNTGFSN 3663 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT----------DSSGQFFAVND 372 + ++ ++ S+AV + + + + + Sbjct: 3664 SANNSDYAALKHA-----EVISIAVGPSGDADQVNQIAQLGEGYNNNNDSEPSYSKVITN 3718 Query: 373 SRELLESFDKITD 385 + EL + F I Sbjct: 3719 TNELTDIFKDIGQ 3731 >gi|15897953|ref|NP_342558.1| hypothetical protein SSO1089 [Sulfolobus solfataricus P2] gi|13814278|gb|AAK41348.1| Hypothetical protein SSO1089 [Sulfolobus solfataricus P2] Length = 436 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 45/112 (40%), Gaps = 19/112 (16%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A++ A + + +I +TDG+ + + N Sbjct: 184 GYTTRLHEAVNFALNLAKQSQ-----------VPNKIIMLTDGKPT------DKRNVKDY 226 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 ++ +I ++ + + + +L+K D SSG+F+ + D EL + F+ Sbjct: 227 EKFDIPPNTQIITIGIGSD-YNERILKKLADKSSGKFYHLKDISELPDVFES 277 >gi|254444377|ref|ZP_05057853.1| Vault protein inter-alpha-trypsin [Verrucomicrobiae bacterium DG1235] gi|198258685|gb|EDY82993.1| Vault protein inter-alpha-trypsin [Verrucomicrobiae bacterium DG1235] Length = 808 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 34/196 (17%), Positives = 70/196 (35%), Gaps = 26/196 (13%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 K+ L + ++ ++ + + + NL E Sbjct: 291 SGSMQGKLHTLASGVKKAIGQLK-----PEDRFRVVAFNNTAFDLNRGWVSATEANLRET 345 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 +RL++LN TN Y +H A L ++ ++ +I +TDG + Sbjct: 346 FARLDQLNSNGGTNVYAGVHLALERLDADRVAT------------LILVTDGVTNQGI-- 391 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL-ESFD 381 ++ + M ++ Y + + C S G + AV++S +++ E Sbjct: 392 ---VDPKAFYKLMHKQDLRFYGFLLGNSSNWPLMQLMCDASGGSYRAVSNSDDIIGEVM- 447 Query: 382 KIT-DKIQEQSVRIAP 396 I +KI +S+R A Sbjct: 448 -IAKNKIVYESMRHAE 462 >gi|145482919|ref|XP_001427482.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124394563|emb|CAK60084.1| unnamed protein product [Paramecium tetraurelia] Length = 280 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 20/137 (14%), Positives = 48/137 (35%), Gaps = 8/137 (5%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + L ++ T A+ + + + K + +TDGE++ Sbjct: 16 GKSRLNQLLPRVTCEGCTAFRDAVIKGNQLMLELFALFCQQGMHEKFKFVHVILTDGEDN 75 Query: 318 GASAY-QNTLNTLQICEYMRNAGM-KIYSVAVSAPPEGQDLLR-----KCTDSSGQFFAV 370 + Q+ L Q + + + + + V+ KC+ S ++ V Sbjct: 76 KSQTSLQDFLVYQQFLQQKLPPNILQTFYIGVNVENNNTVQQEMSAILKCSGKSASYYPV 135 Query: 371 NDSRELLESFDKITDKI 387 + ++ + F KI +I Sbjct: 136 SS-NQINDIFQKIQMQI 151 >gi|148252253|ref|YP_001236838.1| hypothetical protein BBta_0664 [Bradyrhizobium sp. BTAi1] gi|146404426|gb|ABQ32932.1| hypothetical protein BBta_0664 [Bradyrhizobium sp. BTAi1] Length = 755 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 28/164 (17%), Positives = 54/164 (32%), Gaps = 20/164 (12%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEV---KSRLNKLNPYENTNTYPAMHHAYRELY 289 + R I ++ + + + + V S ++ L T PAM A Sbjct: 389 QPNDRFNVIRFDDTMTVLFPSSVPADAEHVGSATSFVSALEARGGTEMVPAMRAALT--- 445 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + + V+F+TDG A N + MR +I+ V + + Sbjct: 446 -------DDGSDSDRVRQVVFLTDG------AIGNEQQLFETITAMRGRS-RIFMVGIGS 491 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 P + R G F + ++ E + K++ V Sbjct: 492 APNTYLMTRAAELGRGAFTPIGSVEQVEERMRDLFAKLENPVVT 535 >gi|198420433|ref|XP_002131388.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 438 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 24/197 (12%), Positives = 60/197 (30%), Gaps = 25/197 (12%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 WS++ + I S ++ +V+ + Sbjct: 219 NHCLSNGSWSESPPCCATDIVTLYHSSNIHAYEHMVRIAFLSAEQTAIAG--DAVKYAGL 276 Query: 242 AYNIGIVGNQCTPLS-NNLNEV-------KSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 Y+ + + N +++ + + NT AM A ++N+ Sbjct: 277 YYSGEVNMESIVHFNEYNDDKLTMQGLFMQRVIMNNTADRKVNTGAAMEFARTHMFNQSN 336 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE- 352 + + ++ ITD ++ + L+ E +R G+ Y++ ++ + Sbjct: 337 GVRGE-----VPRILMVITDENSTD--------DVLEPAEKLRAEGVLTYAIGLAEDGKI 383 Query: 353 -GQDLLRKCTDSSGQFF 368 +D L + F Sbjct: 384 LDRDQLMQIAGEDDHLF 400 >gi|288928459|ref|ZP_06422306.1| BatB protein [Prevotella sp. oral taxon 317 str. F0108] gi|288331293|gb|EFC69877.1| BatB protein [Prevotella sp. oral taxon 317 str. F0108] Length = 554 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 60/166 (36%), Gaps = 48/166 (28%) Query: 253 TPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + K L ++P + T+ A++ + R +K+ + K + Sbjct: 144 LPITTDYVSAKMFLQNIDPALIATQGTDIAKAINLSMRSFSQQKD----------IGKAI 193 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS---------------APPEG 353 I ITDGE+ L+ + G+ ++ + + G Sbjct: 194 IVITDGEDHEG-------GALEAAKAANERGIHVFILGIGSTKGSPIPTSEGGYLTDRSG 246 Query: 354 QDLL---------RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 Q +L + +G + V+++ ++ +K+ +++ + Sbjct: 247 QTVLTALNESMCKQIAQAGNGTYIHVDNT---NDAQEKLNNELAKL 289 >gi|170592037|ref|XP_001900776.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158591928|gb|EDP30531.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 817 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 24/155 (15%), Positives = 54/155 (34%), Gaps = 15/155 (9%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKE 293 + +I + Y+ + N +++ L K+ T T A+ A ++ Sbjct: 78 NTQIAVMQYSSYTRVEFNFSANPNKESLRASLQKIRHISGTTKTGKALDKAL-HVFCHDS 136 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + + + + + +TDG + + + +R AG++I ++ + A Sbjct: 137 NFGTRLNQDDVAQVAVVVTDGHSHD--------DPIPAAMRLRQAGVEILTLGIGA-HIN 187 Query: 354 QDLLRKCTDSSGQFFA-VNDSRELLES---FDKIT 384 L + T F + L + F KI Sbjct: 188 MGELVEITGDQNLAFQNLTSQASLDQFVHQFKKIA 222 >gi|149595900|ref|XP_001521428.1| PREDICTED: similar to anthrax toxin receptor, partial [Ornithorhynchus anatinus] Length = 96 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 20/95 (21%), Positives = 31/95 (32%), Gaps = 7/95 (7%) Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 +I +TDGE + + R+ G +Y V V + L DS Sbjct: 3 TASVIIALTDGELQRDQF----YYAEKEADRARSLGAIVYCVGV--KDFNETQLSTIADS 56 Query: 364 SGQFFAVN-DSRELLESFDKITDKIQEQSVRIAPN 397 F V L D I K + + + P+ Sbjct: 57 IDHVFPVTGGFHALRGVIDSILKKSCIEILAVEPS 91 >gi|84687514|ref|ZP_01015390.1| von Willebrand factor type A domain protein [Maritimibacter alkaliphilus HTCC2654] gi|84664423|gb|EAQ10911.1| von Willebrand factor type A domain protein [Rhodobacterales bacterium HTCC2654] Length = 760 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 49/132 (37%), Gaps = 21/132 (15%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + + +N LNP T A+ A L +E++ VI ++DG Sbjct: 18 TDRQAIADAVNGLNPKGKTPLSAAVLQAAETLKYTEEAAT-----------VILVSDGIE 66 Query: 317 SGASAYQNTLNTLQICEYMRNAGM--KIYSVAVS-APPEGQDLLRKCTD-SSGQFFAVND 372 + + + + G+ + + PE Q L+ + + G + + ++ Sbjct: 67 T------CDFDPCALGRQLEETGVGFTAHVIGFDVTEPEAQAQLQCLAEETGGMYRSASN 120 Query: 373 SRELLESFDKIT 384 + EL + +++ Sbjct: 121 ASELSGALEEVA 132 >gi|322434934|ref|YP_004217146.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321162661|gb|ADW68366.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 373 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/201 (15%), Positives = 69/201 (34%), Gaps = 34/201 (16%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + K + + +++ ++ ++ + + V T A+ LS L+ Sbjct: 148 SSGSMYDKRNAVDKAS---IDLVKLSNPMDEEFLVDFSTEAFIDTDFTTSVDKLSQGLSY 204 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 +KS T Y A+ + L + K+ +I ITDGE++ Sbjct: 205 IKS-------SGGTAAYDALVASADYLT---------KNAKNTKQVLIIITDGEDNA--- 245 Query: 322 YQNTLNTLQICEYMRN-AGMKIYSVA--VSAPPE------GQDLLRK-CTDSSGQFFAVN 371 ++ Q +++ G IY V + + +L ++ GQ + Sbjct: 246 --SSATLEQSIRRIQDLDGPVIYCVGLLFGEDTDRRESRHARKVLESLAAETGGQAYFPK 303 Query: 372 DSRELLESFDKITDKIQEQSV 392 +E+ ++ I+ Q Sbjct: 304 SLKEVDGIAAEVAADIRTQYT 324 >gi|78484767|ref|YP_390692.1| von Willebrand factor, type A [Thiomicrospira crunogena XCL-2] gi|78363053|gb|ABB41018.1| CbbO-m protein [Thiomicrospira crunogena XCL-2] Length = 757 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 26/133 (19%), Positives = 53/133 (39%), Gaps = 17/133 (12%) Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +EVKSR+ + +T AM HA L + KK ++ +TDGE + Sbjct: 639 DEVKSRIAAMEASYSTRMGAAMRHAAHYLEAQ----------QAEKKLMLILTDGEPADI 688 Query: 320 SAYQNTL---NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + +T + E +++ G+ Y + + P + + + ++ +L Sbjct: 689 DTKDPQVLIQDTHKAVEELKSKGIYSYCITL--DPNADEYVETI--FDNHYTVIDHVDKL 744 Query: 377 LESFDKITDKIQE 389 E ++ KI + Sbjct: 745 PEKLPQVFMKITQ 757 >gi|302495833|ref|XP_003009930.1| hypothetical protein ARB_03856 [Arthroderma benhamiae CBS 112371] gi|291173452|gb|EFE29285.1| hypothetical protein ARB_03856 [Arthroderma benhamiae CBS 112371] Length = 705 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 31/214 (14%), Positives = 60/214 (28%), Gaps = 44/214 (20%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE-SAGNLVNSIQKAIQEKKNLSVR 237 +P P + + S + AP P + S +L K I E N R Sbjct: 63 NDVPHVPCDIVLVIDISGSMNSAAPIPTGERGGEDTGLSILDLTKHAAKTIIETLNEKDR 122 Query: 238 IGTIAYNIGIVGNQCTPL----------SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + + + + N + V S ++KL +TN + + Sbjct: 123 LAVVTFCTEVNVRTIESSPVAFELDYMNKENKSTVLSAIDKLYGKSSTNLWHGIKKGLNV 182 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 L + + ++ +TDG +I++ Sbjct: 183 LTTNP--------AQGKIQSLLVLTDGTLP-----------------------QIHTFGF 211 Query: 348 SAPPEGQDLLRKCTDSSGQFF-AVNDSRELLESF 380 LL+ + G F + D+ + F Sbjct: 212 GYYLRS-SLLQSIAEIGGGIFAFIPDAGMIGTVF 244 >gi|119493558|ref|ZP_01624223.1| hypothetical protein L8106_25942 [Lyngbya sp. PCC 8106] gi|119452612|gb|EAW33794.1| hypothetical protein L8106_25942 [Lyngbya sp. PCC 8106] Length = 757 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 32/288 (11%), Positives = 82/288 (28%), Gaps = 38/288 (13%) Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 +++ + S + + + + N +Y + Sbjct: 192 QDIGVTVEIHAGLPISQVRSTSHQINITENGEIVTVKLDNEDTIPNKDLILRYQVSGDNT 251 Query: 187 KSFWSKNTTKSKYAPA----PAPANRKIDVLIESAGNLVNSIQKAIQEKK---------- 232 ++ + + A PA + +++ + L+++ + Sbjct: 252 QTTVLTQSDERGGHFALYFIPAIEYKTDEIVAKDVLFLMDTSGSQQGDPLFKCQELMRRF 311 Query: 233 ----NLSVRIGTIAYNIGIVGNQCTPLSN---NLNEVKSRLNKLNPYENTNTYPAMHHAY 285 N + + + TPL+N N + +N+L T Sbjct: 312 INGLNPNDTFNIMDFAHTTCTLSETPLANSPENRSLAIHYINQLRANGGT---------- 361 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 EL N T + ++ +TDG +A + + + ++ G +++S Sbjct: 362 -ELLNGIREVLKFPELTGRLRSIVLLTDGYIGNENAILSEVQ-----DNLK-PGNRLHSF 414 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 V + + R G V + + +K +I + Sbjct: 415 GVGSSVNRFLINRIAEIGRGISRVVRQNESTQKVAEKFFRQINNPVLT 462 >gi|327471789|gb|EGF17230.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK408] Length = 464 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 29/269 (10%), Positives = 69/269 (25%), Gaps = 62/269 (23%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 + W N ++ +L + A ++ ++ + N+ Sbjct: 196 NVAISFVFDKSGSMSWDLN----GNNTNYWGPKSRMSILQDKATIMMRDLK----DIGNV 247 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY----- 289 SV + + + V + L +++ +N L TN + + L Sbjct: 248 SVNLVSFSILGSYVQKDFSELDKGTTTIEASINALQTGGVTNPGDGLRYGMMSLQNHSAQ 307 Query: 290 -----------------NEKESSHNTIGSTRLKKF---------------VIFITDGENS 317 + ++S T+ Sbjct: 308 LKYVVLLTDGIPNAYTVDTNDTSWRNRNVHPYYNRWRETVGELVTFNNGPYDVTTNLTTD 367 Query: 318 GASAYQNTLNTLQICEYMRNA----------GMK-IYSVAV-SAP---PEGQDLLRKCTD 362 ++ + + + G+K + + P G+DL R Sbjct: 368 QNRVSYDSYSNEALRKKAIEYAGKVSQTFGAGVKRVNVIGFSGVPSEIAYGEDLTRSIGS 427 Query: 363 SS--GQFFAVNDSRELLESFDKITDKIQE 389 ++ D L ++F I +IQ+ Sbjct: 428 GGMEAKYVPAADEAALQQTFSDIKKQIQQ 456 >gi|154497289|ref|ZP_02035985.1| hypothetical protein BACCAP_01582 [Bacteroides capillosus ATCC 29799] gi|150273688|gb|EDN00816.1| hypothetical protein BACCAP_01582 [Bacteroides capillosus ATCC 29799] Length = 234 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 13/98 (13%), Positives = 36/98 (36%), Gaps = 3/98 (3%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++ L T + A L +++ + G + ++ ++DG +G Sbjct: 86 QVEPLEANGLTYMGEGLTMALD-LLEQRKERYKAAGVDYYQPILVVMSDGCPNGDPRVLR 144 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 +IC+ + + + +V + + LR+ + Sbjct: 145 E-AAQRICQMVEARRLTVVAVGIGEGA-DMEQLRRISG 180 >gi|163848376|ref|YP_001636420.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222526300|ref|YP_002570771.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] gi|163669665|gb|ABY36031.1| von Willebrand factor type A [Chloroflexus aurantiacus J-10-fl] gi|222450179|gb|ACM54445.1| von Willebrand factor type A [Chloroflexus sp. Y-400-fl] Length = 446 Score = 46.4 bits (108), Expect = 0.008, Method: Composition-based stats. Identities = 37/242 (15%), Positives = 73/242 (30%), Gaps = 38/242 (15%) Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK 196 I + L + + +V D SRSM L N + + Sbjct: 28 TIAAAGTTGLPLHMTIVADASRSMRIPIL------NEQQFRDVVRGSGAHEVLVDGVPVW 81 Query: 197 SKYAPAPAPANRK----IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 P A + ID + +L+ + R+G IA + Sbjct: 82 QLNNPLSAEQRSRYRSPIDYTTHALHSLIERLDHND--------RLGLIACASDAIVLAS 133 Query: 253 TPLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + E+ + + +L E TN + A + ++ + ++ Sbjct: 134 GIPGSRRAELVAAIARLPALRLGETTNLAQGLQLALAQFVAADDA---------TVRRIV 184 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFF 368 ITDG + + L + G+ + ++ + + LL + D S G+ Sbjct: 185 LITDGFTT------DQTLCLTLAREAAARGISLSTIGLG-GSFEEHLLTQLADLSGGRAS 237 Query: 369 AV 370 V Sbjct: 238 FV 239 >gi|296482552|gb|DAA24667.1| vitrin precursor [Bos taurus] Length = 652 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 18/187 (9%), Positives = 55/187 (29%), Gaps = 20/187 (10%) Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 + L P + K + + R+ + + ++ ++ Sbjct: 246 KEELSTQSLEPASQGDPSCKVDLSFLIDGSSSIGKRRFRIQKQFLTDVAQTLDIGPAGPL 305 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNE 291 +G + Y N +VK+ + K++ +N A+ + +++ Sbjct: 306 -----MGVVQYGDNPATQFNLKTHMNSQDVKAAIEKISQRGGLSNAGRAISFVTKNFFSK 360 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV-SAP 350 + + + + DG + + R +G+ I+ + + A Sbjct: 361 FNGNRGGAPN-----VAVVMVDGWPTD--------KVEEASRLARESGVNIFFITIEGAS 407 Query: 351 PEGQDLL 357 + + Sbjct: 408 ENEKQYM 414 >gi|27806781|ref|NP_776396.1| vitrin precursor [Bos taurus] gi|75054534|sp|Q95LI2|VITRN_BOVIN RecName: Full=Vitrin; Flags: Precursor gi|17941422|gb|AAL18262.2| vitrin [Bos taurus] Length = 652 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 18/187 (9%), Positives = 55/187 (29%), Gaps = 20/187 (10%) Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 + L P + K + + R+ + + ++ ++ Sbjct: 246 KEELSTQSLEPASQGDPSCKVDLSFLIDGSSSIGKRRFRIQKQFLTDVAQTLDIGPAGPL 305 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNE 291 +G + Y N +VK+ + K++ +N A+ + +++ Sbjct: 306 -----MGVVQYGDNPATQFNLKTHMNSQDVKAAIEKISQRGGLSNAGRAISFVTKNFFSK 360 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV-SAP 350 + + + + DG + + R +G+ I+ + + A Sbjct: 361 FNGNRGGAPN-----VAVVMVDGWPTD--------KVEEASRLARESGVNIFFITIEGAS 407 Query: 351 PEGQDLL 357 + + Sbjct: 408 ENEKQYM 414 >gi|295395230|ref|ZP_06805438.1| conserved hypothetical protein [Brevibacterium mcbrellneri ATCC 49030] gi|294971992|gb|EFG47859.1| conserved hypothetical protein [Brevibacterium mcbrellneri ATCC 49030] Length = 1021 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 35/271 (12%), Positives = 77/271 (28%), Gaps = 80/271 (29%) Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP-----APAPANRKIDVLIESAGNLVNSIQ 225 + L+ S + ++ + K L + + Sbjct: 70 AAAGGNIDVLILVDESGSLVDSDPNNARVTSGLHLVSRMSKLTKSGNLSVAVSGFGHEYS 129 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN---TNTYPAMH 282 VR N G ++ ++ +N L+ N T+ + AM Sbjct: 130 T---------VRDWQEVKNDG-----------DVESLRGAINDLSNRTNGIDTDYWTAMD 169 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL---------------- 326 A ++L +H + + +I+ +DGE N Sbjct: 170 EARKQL---ANRAHERGSDAKSCQAIIWFSDGELDYEVRKGNMASKYGETKPFAPDISLK 226 Query: 327 -----------------NTLQICEYMRNAGMKIYSVAVSAPPEGQ-DLLRKCTDS----- 363 + + + +R++ + ++ V + + + DL+R Sbjct: 227 TEDGAKKVEEKAREDICRSGGLADQLRSSRVAMFGVGLGSKDGKEFDLMRSIATGKDSGG 286 Query: 364 ----------SGQFFAVNDSRELLESFDKIT 384 G+F+ +D LL +FD I+ Sbjct: 287 KSCGDLTDPVPGEFYLASDIDSLLIAFDSIS 317 >gi|46359648|dbj|BAD15328.1| CbbOm [Hydrogenovibrio marinus] Length = 754 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 28/133 (21%), Positives = 51/133 (38%), Gaps = 23/133 (17%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN-S 317 ++VK+R+ + +T AM HA L + KK ++ +TDGE Sbjct: 635 TSDVKARIAAMEASYSTRMGAAMRHAAHYLSAQ----------QAEKKLMLILTDGEPAD 684 Query: 318 GASAYQNTL--NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND--- 372 S TL +T + E +++ G+ Y + + + G + V D Sbjct: 685 IDSKDPQTLIHDTHKAVEELKSEGIYSYCITLDPNAD-----EYVETIFGNQYTVIDQVE 739 Query: 373 --SRELLESFDKI 383 +L + F K+ Sbjct: 740 KLPEQLPQVFMKL 752 >gi|73976419|ref|XP_852853.1| PREDICTED: hypothetical protein XP_847760 [Canis familiaris] Length = 642 Score = 46.4 bits (108), Expect = 0.009, Method: Composition-based stats. Identities = 24/157 (15%), Positives = 53/157 (33%), Gaps = 16/157 (10%) Query: 210 IDVLIESAGNLVNS-IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 D + +L + Q +++ + ++ + + +L K ++ Sbjct: 64 FDKQKDFVNSLSDKVFQLTPVGFLKYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N T +Y A+ +A L E K + +TDG + + + Sbjct: 124 MNFIGQGTFSYYAISNATMLLKREGRKDGV--------KVALLMTDGID-----HPKNPD 170 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 I E R AG+ ++ +S + LR + Sbjct: 171 VQSISEDARTAGILFITIGLST-VVNEAKLRLISGDP 206 >gi|332706718|ref|ZP_08426779.1| hypothetical protein LYNGBM3L_23300 [Lyngbya majuscula 3L] gi|332354602|gb|EGJ34081.1| hypothetical protein LYNGBM3L_23300 [Lyngbya majuscula 3L] Length = 972 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 51/168 (30%), Gaps = 21/168 (12%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN---EVKSRLNKLNPYENTNTYPAMHHAYR 286 N I + PLSN+ + + +N+L T + H Sbjct: 324 NGLNPKDTFTIIDVSDRATQLSTKPLSNSPQNCRKAINYINQLKANGGTYLLKGIRHLLN 383 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 L E + ++ ++DG N L + G +IYS Sbjct: 384 -LPAAPEGR---------LRSIVLLSDG------YISNENQVLAEVQQQLKPGNRIYSFG 427 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF-DKITDKIQEQSVR 393 V + P + LL + + + E E+ ++ +I + Sbjct: 428 VGSSPN-RFLLNRLAEIGRGIARIVRQDEPTEAVAEQFFRQINNPVLT 474 >gi|167924336|ref|ZP_02511427.1| hypothetical protein BpseBC_37618 [Burkholderia pseudomallei BCC215] Length = 396 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 32/261 (12%), Positives = 83/261 (31%), Gaps = 5/261 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALD----AAVLSGCASIVSDRTIKDPTTK 56 + A++++V F+ A+DL + R+++Q++ D AA +I T Sbjct: 4 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 63 Query: 57 KDQTSTIFKKQ-IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT 115 +F++ ++ ++ + Q + ++ Sbjct: 64 GHLNHALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 123 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + L L + A ++S + + + AI + + +++ + +N + ++ Sbjct: 124 QTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 183 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + P +F S A + A + + G+ V + + Sbjct: 184 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTNAYN 243 Query: 236 VRIGTIAYNIGIVGNQCTPLS 256 R G A + Sbjct: 244 TRFGIYANPYKDPSYGTPDFT 264 >gi|148706513|gb|EDL38460.1| vitrin, isoform CRA_b [Mus musculus] Length = 643 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 33/338 (9%), Positives = 89/338 (26%), Gaps = 42/338 (12%) Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS-LRSTGI 138 E+ K Q + + A+ T+ + + + ++ ++ Sbjct: 148 ESFIVAESKPQKGVAYPSTLTYSSSKTAAAKAGETTKAYEKPSIPGTTIQPVTLTQAQAT 207 Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT--------SNKYLLPPPPKKSFW 190 + + S V+ S + + P + S Sbjct: 208 PVAEVTHRSTSKPFAASVTNSPRPQPVGHRSQEMEEVDGWKPGPVLLDSGFVPKEELSTQ 267 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIE--------SAGNLVNSIQKAIQEKKNLSVRIGTIA 242 S + ID L + +Q V G + Sbjct: 268 SSEPVPQGDPNCKIDLSFLIDGSTSIGKRRFRIQKQFLADVVQALDIGPAGPLV--GVVQ 325 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGS 301 Y N ++K+ + K+ +N A+ + +++ + + Sbjct: 326 YGDNPATQFNLKTHMNSQDLKTAIEKITQRGGLSNVGRAISFVTKTFFSKANGNRGGAPN 385 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL---- 357 + + DG + ++ R +G+ ++ + V E + Sbjct: 386 -----VAVVMVDGWPTD--------KVEEVSRVARESGINVFFITVEGAAEREKQHVVEP 432 Query: 358 ----RKCTDSSGQF-FAVNDSRELLESFDKITDKIQEQ 390 + ++G + F V L ++ + ++ + Sbjct: 433 GFASKAVCRTNGFYSFNVQSWLSLHKTVQPLVKRVCDT 470 >gi|272938026|gb|ACZ96962.1| vwa2 protein [Danio rerio] Length = 260 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 22/154 (14%), Positives = 50/154 (32%), Gaps = 16/154 (10%) Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHH 283 + ++G + Y V N+ + V + + + A+ H Sbjct: 31 TSVQFDINRDVAQVGLVVYGRRPVTVFDLDKYNSGSAVLRAVGDAAFLGGKASVGSALLH 90 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + + + + K V+ +TDG + + +R++G+ I+ Sbjct: 91 VLSQSLTVGKGARPGVN-----KAVVVLTDG--------TGVEDAAVPAQKIRDSGVSIF 137 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + V Q+LL + + S V +L Sbjct: 138 LIGVG--DIQQELLLRISGSEDHMITVPSYDDLK 169 >gi|126724455|ref|ZP_01740298.1| hypothetical protein RB2150_11506 [Rhodobacterales bacterium HTCC2150] gi|126705619|gb|EBA04709.1| hypothetical protein RB2150_11506 [Rhodobacterales bacterium HTCC2150] Length = 354 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 27/137 (19%), Positives = 45/137 (32%), Gaps = 29/137 (21%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + + P TN Y + AY L + + H +K VI +TDG + + Sbjct: 232 DNIQPGGTTNIYGPLREAYGWLSESERTDH--------QKAVILLTDGRANDD--AASES 281 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT----DSSGQFFAVNDSRELLESFDK 382 TL + + Y+ LR G S +L F+ Sbjct: 282 QTLAMKDDA-------YTFVYYMGDSDDRWLRSLADNYFSGGGHV-----SAQLERYFNV 329 Query: 383 ITDKIQEQSV---RIAP 396 ++D Q+V + P Sbjct: 330 VSDAYSAQTVLELQTCP 346 >gi|320161334|ref|YP_004174558.1| hypothetical protein ANT_19320 [Anaerolinea thermophila UNI-1] gi|319995187|dbj|BAJ63958.1| hypothetical protein ANT_19320 [Anaerolinea thermophila UNI-1] Length = 652 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 30/294 (10%), Positives = 71/294 (24%), Gaps = 62/294 (21%) Query: 143 SENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA 202 + + D +++ +N ++N + ++ N S+ Sbjct: 366 WGGVPCDDDNLFDAFNWVDENTDGDNNPDSNPEFSI-------ADHNFASNWLASRDPDG 418 Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 P + V G + + +++ + G+ TP ++N Sbjct: 419 AGPLTATLSVTSTCNGCGIRTASNLLRQFGRPGSVWVMVYLTDGVANMSDTPRTDN---- 474 Query: 263 KSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSH---NTIGSTRLKKFVIFITDGE 315 + P + N + + + + I D Sbjct: 475 --SIPSGYPNGFCQGSLNPGNPGFWPPLCVDRNISPRYCIDTNSNTCPPGSIHVNIPDPR 532 Query: 316 NSGASAYQNTLNTLQIC--------------EYMRNAGMKIYSVAVSAP---------PE 352 S A ++ + + + IY++ + A Sbjct: 533 YSAADYARDMADAAALTVNQSSNPLDSRYNPTEPIGNDIAIYTIGLGAVTPDNPPAWLDT 592 Query: 353 GQDLLRKCTDSS-------------------GQFFAVNDSRELLESFDKITDKI 387 G+ +LR G ++ L FD I +I Sbjct: 593 GEKILRYMAAVGDDGDRVTDPCASVGPKRSCGNYYYAPSGDALRAIFDDIASRI 646 >gi|255602535|ref|XP_002537872.1| conserved hypothetical protein [Ricinus communis] gi|223514758|gb|EEF24510.1| conserved hypothetical protein [Ricinus communis] Length = 120 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 9/34 (26%), Positives = 20/34 (58%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALD 34 M A+++ + + +AIDL+ + + ++QS D Sbjct: 30 MAALVLPIMIAMLGFAIDLSRVYNRKVELQSVAD 63 >gi|126344397|ref|XP_001365113.1| PREDICTED: similar to calcium-dependent chloride channel-1, partial [Monodelphis domestica] Length = 660 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 63/169 (37%), Gaps = 27/169 (15%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKES 294 G + ++ + S +++L T+ + A+ + N+ + Sbjct: 99 WTGMVTFDSSATIQSALIQIETDAQRNSLISRLPTAAGGGTSICSGLRTAFTVIKNKFST 158 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 ++ +TDGE+S S+ + + + +G I++VA+ P Sbjct: 159 D---------GSEIVLLTDGEDSTISSCFDEV---------KQSGAIIHTVALG-PSADP 199 Query: 355 DLLRKCTDSSGQFFAVNDSRE---LLESFDKITD---KIQEQSVRIAPN 397 L + G + D+ + L+++F ++ I ++S+++ Sbjct: 200 GLEELAKMTGGMKTSPTDNAQNNGLIDAFSALSSGNGAITQRSIQLESK 248 >gi|308062304|gb|ADO04192.1| phage/colicin/tellurite resistance cluster terY protein [Helicobacter pylori Cuz20] Length = 217 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 26/200 (13%), Positives = 68/200 (34%), Gaps = 30/200 (15%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 S +I+ L ++ +++ + KK L ++ I + G G Sbjct: 24 SGSMNESLGNCTRIEALNLCIQKMIEILKQ--EAKKELFSKMAIITF--GENGAVLHTPF 79 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +++ + + L+ T A A + ++ +T + K + I ++DGE Sbjct: 80 DDVKNINFK--PLSASGGTPLDQAFRLAKDLIEDK-----DTFPTKFYKLYSILVSDGEP 132 Query: 317 SGASA-------YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 + + + + +C +S+ + + + F Sbjct: 133 NDDKWQKALSNFHHDGRSAKSVC----------WSIFIGDRNTNPQVNKD--FGKDGVFY 180 Query: 370 VNDSRELLESFDKITDKIQE 389 ++ +L+ F+ +T I + Sbjct: 181 ADNVEKLVGLFEIMTQTISK 200 >gi|260820654|ref|XP_002605649.1| hypothetical protein BRAFLDRAFT_150471 [Branchiostoma floridae] gi|229290984|gb|EEN61659.1| hypothetical protein BRAFLDRAFT_150471 [Branchiostoma floridae] Length = 168 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 56/177 (31%), Gaps = 23/177 (12%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + + + ++V+ + + R+G + + + + + Sbjct: 13 SASVGPLQFEKSKKFVRDMVDGF-----DIGSAQTRVGVVQFAWMVQAEFNLGDYLDGTD 67 Query: 262 VKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +++ + ++ + T A+ R L++E + + + + VI ITDG +S Sbjct: 68 LRNAIARIRYMDGPGTEIGKALVFTKRRLFSELYGARPE--TQDVPRIVILITDGRSSPE 125 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 S + + L S + + V D L Sbjct: 126 SQLHTAGVVVYAVGV--------------GEAVDEAELETAASDSSKVYHVTDFDSL 168 >gi|78189841|ref|YP_380179.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] gi|78172040|gb|ABB29136.1| von Willebrand factor, type A [Chlorobium chlorochromatii CaD3] Length = 329 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 28/191 (14%), Positives = 59/191 (30%), Gaps = 30/191 (15%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN----EVKSRLNKL 269 + ++A + +PL+ + + + +++ Sbjct: 113 QTDTQSRFEIARQAARNVVEQRSNDRIGLVVFRGEAYTLSPLTRDHTVLSLLLDNLSSRI 172 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + T A+ A L S + VI +TDGEN+ ++ L Sbjct: 173 IQDDGTAIGSALLVALNRLQA----------SESELQMVILLTDGENNAG-----EVSPL 217 Query: 330 QICEYMRNAGMKIYS--VAVSAPPEGQD--------LLRKCTD-SSGQFFAVNDSRELLE 378 G++ Y VA + + L++ + G +F VN+ EL Sbjct: 218 TAAALAARRGVRFYVLNVAFESVKDENAPRSALYAAELQEVARRTGGSYFTVNNKTELET 277 Query: 379 SFDKITDKIQE 389 + I + + Sbjct: 278 TIASIAARAKN 288 >gi|149034246|gb|EDL89016.1| calcium channel, voltage-dependent, alpha 2/delta 3 subunit, isoform CRA_b [Rattus norvegicus] Length = 924 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 58/177 (32%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 94 RLTIAKQTVSSILDTLGD-DDFFNIITYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 152 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 153 LFAKGIGMLDIALNEAFNVLSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 208 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 ++I++ + D L+ ++ FF L + + + + Sbjct: 209 P-------ERKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 256 >gi|149034245|gb|EDL89015.1| calcium channel, voltage-dependent, alpha 2/delta 3 subunit, isoform CRA_a [Rattus norvegicus] Length = 930 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 58/177 (32%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 94 RLTIAKQTVSSILDTLGD-DDFFNIITYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 152 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 153 LFAKGIGMLDIALNEAFNVLSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 208 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 ++I++ + D L+ ++ FF L + + + + Sbjct: 209 P-------ERKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 256 >gi|28212256|ref|NP_783185.1| voltage-dependent calcium channel subunit alpha-2/delta-3 [Rattus norvegicus] gi|81871225|sp|Q8CFG5|CA2D3_RAT RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-3; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-3; Flags: Precursor gi|27450708|gb|AAO14654.1|AF486278_1 calcium channel alpha-2 delta-3 subunit [Rattus norvegicus] Length = 1085 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 58/177 (32%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 271 RLTIAKQTVSSILDTLGD-DDFFNIITYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 329 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 330 LFAKGIGMLDIALNEAFNVLSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 385 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 ++I++ + D L+ ++ FF L + + + + Sbjct: 386 P-------ERKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 433 >gi|270008951|gb|EFA05399.1| hypothetical protein TcasGA2_TC015571 [Tribolium castaneum] Length = 902 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 16/148 (10%), Positives = 51/148 (34%), Gaps = 8/148 (5%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N+ + K + TN + + E ++ + + + +IF+TDG Sbjct: 326 DNIEKAKKSKRHVVGMGCTNIIGGLVVGLFLVRRTLEKNYEKNVALKHQPMIIFLTDGLP 385 Query: 317 SGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVN 371 + + + + ++ + I+S++ + L + + G + Sbjct: 386 NVGISNPDEI--TKLVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYEAA 443 Query: 372 DSR-ELLESFDKITDKIQEQSVRIAPNR 398 D+ +L + ++ + ++ Sbjct: 444 DAALQLQNFYRTVSSPLLRDVRFKYVDK 471 >gi|222101616|gb|ACM44013.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 58/164 (35%), Gaps = 14/164 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + N +R+ Y T L + + L KL+ T M + R L Sbjct: 77 DLDNTDIRLSLTTY-STPTRQIFTFLDAAASSTRLALTKLDWMAGTKARSGMTYTGRAL- 134 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 N + G + K ++ ITDG +S S T Q+ +R+ G+ + + V Sbjct: 135 NYVRKAILPYGRKNVPKALLLITDGVSSDGSY------TAQVAAMLRDEGVNVMVIGVG- 187 Query: 350 PPEGQDLLRKCTDSSGQ----FFAVNDSRELLESFDKITDKIQE 389 R G F + ++++ F+ + ++ + Sbjct: 188 -DVNVAECRGIVGCDGVMDCPMFKHTNWKDIMGLFNSLMKEVCD 230 >gi|189238319|ref|XP_972336.2| PREDICTED: similar to inter-alpha-trypsin inhibitor family heavy chain-related protein [Tribolium castaneum] Length = 713 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 16/148 (10%), Positives = 51/148 (34%), Gaps = 8/148 (5%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N+ + K + TN + + E ++ + + + +IF+TDG Sbjct: 342 DNIEKAKKSKRHVVGMGCTNIIGGLVVGLFLVRRTLEKNYEKNVALKHQPMIIFLTDGLP 401 Query: 317 SGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVN 371 + + + + ++ + I+S++ + L + + G + Sbjct: 402 NVGISNPDEI--TKLVTKINQGTNRAAIFSLSFGEDADKNFLKKLSAQNLGFSRHIYEAA 459 Query: 372 DSR-ELLESFDKITDKIQEQSVRIAPNR 398 D+ +L + ++ + ++ Sbjct: 460 DAALQLQNFYRTVSSPLLRDVRFKYVDK 487 >gi|156084610|ref|XP_001609788.1| thrombospondin-related anonymous protein [Babesia bovis] gi|154797040|gb|EDO06220.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 660 Score = 46.1 bits (107), Expect = 0.009, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 58/164 (35%), Gaps = 14/164 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + N +R+ Y T L + + L KL+ T M + R L Sbjct: 77 DLDNTDIRLSLTTY-STPTRQIFTFLDAAASSTRLALTKLDWMAGTKARSGMTYTGRAL- 134 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 N + G + K ++ ITDG +S S T Q+ +R+ G+ + + V Sbjct: 135 NYVRKAILPYGRKNVPKALLLITDGVSSDGSY------TAQVAAMLRDEGVNVMVIGVG- 187 Query: 350 PPEGQDLLRKCTDSSGQ----FFAVNDSRELLESFDKITDKIQE 389 R G F + ++++ F+ + ++ + Sbjct: 188 -DVNVAECRGIVGCDGVMDCPMFKHTNWKDIMGLFNSLMKEVCD 230 >gi|281420094|ref|ZP_06251093.1| BatB protein [Prevotella copri DSM 18205] gi|281405894|gb|EFB36574.1| BatB protein [Prevotella copri DSM 18205] Length = 345 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 29/218 (13%), Positives = 66/218 (30%), Gaps = 57/218 (26%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 S ++D NLV++ + +IG I + P++ Sbjct: 99 SNSMLCQDVVPSRLDKSKMLIENLVDNFN---------NDKIGLIVFAGDAFVQ--LPIT 147 Query: 257 NNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + K L + P + TN A+ A + + + + +I IT Sbjct: 148 TDYVSAKMFLQNITPGLIQTQGTNIGAAIDLASKSFTQQ----------ENVGRAIIVIT 197 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---------------PPEGQDLL 357 DGEN + + G+ ++ + + G ++ Sbjct: 198 DGEN-------HEPGAQEAAAAANKKGINVFILGIGNTKGAPIPMGDGSYLKDNAGNTVM 250 Query: 358 --------RKCTDSS-GQFFAVNDSRELLESFD-KITD 385 ++ + GQ+ V+++ + + + I Sbjct: 251 TALNEQMCKELAQAGKGQYIHVDNTSDAERALNDDIAK 288 >gi|309364361|emb|CAR98722.1| hypothetical protein CBG_25517 [Caenorhabditis briggsae AF16] Length = 342 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 63/164 (38%), Gaps = 12/164 (7%) Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA-M 281 I + ++ V T YN V + S + ++KS++ +L T T + M Sbjct: 33 QIGTGYADPRSTRVGFITYNYNATDVADFYKLQSYD--DLKSQIQRLKMTPLTTTTVSRM 90 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 A N S+ + KK VI T+ + S + + ++ G+ Sbjct: 91 DTALYAAMNMINSTAGFRDN--YKKVVIVFTNVHGTYKSNP-----PKDVSKSLQMKGIP 143 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQF--FAVNDSRELLESFDKI 383 + +V + + Q L+ ++ F + N ++E+ ++ I Sbjct: 144 VITVNTGSSSDTQSWLKNIASTNMAFAIYDGNVTQEIQKAMTDI 187 >gi|257469960|ref|ZP_05634052.1| von Willebrand factor (vWA) type A domain-containing protein [Fusobacterium ulcerans ATCC 49185] Length = 322 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 27/179 (15%), Positives = 61/179 (34%), Gaps = 55/179 (30%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEK 292 RIG I ++ PL+++ + K+ +N L+ T Y A+ A + Sbjct: 120 RIGFIPFSDSAYIQ--MPLTDDYSIGKNYINALDTNLISGGGTELYQALELAEKSFKEIN 177 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + K +I ++DG + + +++++ M ++S+ + Sbjct: 178 SDN----------KTIIVLSDGGDFDDKS----------LKFVKDNKMNVFSIGIGTDEG 217 Query: 353 ---------------------------GQDLLRKCTD-SSGQFFAVND-SRELLESFDK 382 D L+K + S+G+++ VN+ + F Sbjct: 218 TIIPEYVNGKKVGFIKDQKGSAVISKLNSDFLKKLSSESNGKYYEVNNLKDDTSNFFRD 276 >gi|225010242|ref|ZP_03700714.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] gi|225005721|gb|EEG43671.1| von Willebrand factor type A [Flavobacteria bacterium MS024-3C] Length = 351 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 56/187 (29%), Gaps = 50/187 (26%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYN 290 S RIG IAY P++ + K L +N T A+ A + Sbjct: 128 SDRIGIIAYAAQAYPQ--LPITTDFGAAKMFLQGMNTDMLSSQGTAISDAIELATTYYND 185 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 +++ + + ++DGE + + AG+KI+++ V Sbjct: 186 AAQTN----------RVLFIVSDGE------DHSEGGAVNAVSKATEAGIKIFTIGVGTE 229 Query: 351 ----------------------------PEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + L ++ +G + ++ + + Sbjct: 230 KGAPIPIKKKGIVESLKRDAEGEVVITKRNTETLFEIASEGNGIYIDGENTASAVAIIKE 289 Query: 383 ITDKIQE 389 +++ + Sbjct: 290 QLNQMDK 296 >gi|123509108|ref|XP_001329794.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121912842|gb|EAY17659.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 694 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 39/336 (11%), Positives = 95/336 (28%), Gaps = 55/336 (16%) Query: 88 KAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLA 147 + I+I+ L + + +L L LT + Sbjct: 99 NSDIDISFTMYTSLPTVFSPSNELFFNRVSLPLTLFPRYKLTPNTGSEQAPETVIGSTTY 158 Query: 148 -ISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS--------- 197 ++ + E + N MT F +KN Sbjct: 159 KFNLKFTIPKDSKFETKMEDYSIEGNVMTLKTIPTTDFNVDVFLNKNPVSVKEITGNTQV 218 Query: 198 -KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS---------------VRIGTI 241 + P ++S L++ + + + V+ + Sbjct: 219 VNFKINPLNYLSNRKTDVKSIVFLLDCSGSMTIDNRIENAIKAMDLFLHSLEPGVKFEIV 278 Query: 242 AYNIGIVGNQCTPLS----NNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSH 296 + L+ ++LN + + + T + + Y EL + Sbjct: 279 RFGSTFNSLFDFKLTEYNDDSLNTALAFIKGTSANLGGTEIFNPIKQIYNELSPD----- 333 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + +TDG N+ + +++R++ KI+S+ + A + + Sbjct: 334 ----------VLFVLTDGAVD---------NSQAVLDFVRDSSTKIFSLGLGAGADMNLV 374 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + + G V D+ +L +S ++ + ++ Sbjct: 375 RNLASFTGGVSEHVLDASQLRDSIIRLLEDSTNPTL 410 >gi|159900726|ref|YP_001546973.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893765|gb|ABX06845.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 455 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 20/129 (15%), Positives = 42/129 (32%), Gaps = 17/129 (13%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + +T P + L + ST + ++ +TDG ++ Sbjct: 157 DLGRSTRLAPGLRSTIDLLDSMP--------STGFSQRIVVLTDG------FVEDEQQAF 202 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK--- 386 ++ + + ++ + + Q L+ S GQ + D +L D K Sbjct: 203 AYARHLAARSIPVSTIGLGVEFQEQLLMSFADQSGGQSSFITDPSDLPNLLDVEFGKAHA 262 Query: 387 IQEQSVRIA 395 I Q R+ Sbjct: 263 IIAQKARLD 271 >gi|328865987|gb|EGG14373.1| hypothetical protein DFA_12145 [Dictyostelium fasciculatum] Length = 781 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 45/147 (30%), Gaps = 11/147 (7%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 +G Y G + L+ ++ + + + + A A + S H Sbjct: 260 MGLGDYCDGKLVLTTQDLTQDVTTLVKFIKNVPSTSGGDAPEAYEFALMKAKELTWSEH- 318 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV-AVSAPPEGQDL 356 K + I D + +N + C+ + G+KIY V A+ + D Sbjct: 319 ------TSKAFVMIGDSNPHQPTFTDLNINWFKECDDLHERGIKIYGVKALGSSIFYND- 371 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKI 383 + G + E F I Sbjct: 372 --IAERTGGICIDFKKFDLITEMFLAI 396 >gi|291295671|ref|YP_003507069.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] gi|290470630|gb|ADD28049.1| von Willebrand factor type A [Meiothermus ruber DSM 1279] Length = 744 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 27/150 (18%), Positives = 46/150 (30%), Gaps = 18/150 (12%) Query: 240 TIAYNIGIVGNQCT-PLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 + ++ P++ E +S L T A A L S Sbjct: 376 VVVFSDRPRWLFRPRPMTEQGRKEAESLLLSTQAGGGTMIRRAYLEALEALEQVPTES-- 433 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 K VI +TDG A T + +K +VA+ A +G+ L Sbjct: 434 --------KQVIALTDGL-----AADVTPDLFDAAREASPR-IKTNTVAIGADADGRFLR 479 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKI 387 G ++ V +L F + ++ Sbjct: 480 ELAQAGDGTYWDVPRPEDLPRFFLEEAQRV 509 >gi|149918084|ref|ZP_01906577.1| hypothetical protein PPSIR1_41829 [Plesiocystis pacifica SIR-1] gi|149821089|gb|EDM80495.1| hypothetical protein PPSIR1_41829 [Plesiocystis pacifica SIR-1] Length = 719 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 19/109 (17%), Positives = 32/109 (29%), Gaps = 10/109 (9%) Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + + + + I ITDG +AY E M N G+ + V Sbjct: 449 QYHLDGQAPDAEFPTHAQTPYVNILITDG---AYAAYSTDAQVQAALEAMFNEGVTTHVV 505 Query: 346 AVSAPPEGQD---LLRKCT----DSSGQFFAVNDSRELLESFDKITDKI 387 + L G + V+ +EL + +I I Sbjct: 506 GFGEGADTPQALIELSAMAAWGSGGEGAPYHVDTQQELQSALAQIAASI 554 >gi|222101620|gb|ACM44015.1| thrombospondin-related anonymous protein [Babesia bovis] Length = 657 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 30/164 (18%), Positives = 58/164 (35%), Gaps = 14/164 (8%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + N +R+ Y T L + + L KL+ T M + R L Sbjct: 77 DLDNTDIRLSLTTY-STPTRQIFTFLDAAASSTRLALTKLDWMAGTKARSGMTYTGRAL- 134 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 N + G + K ++ ITDG +S S T Q+ +R+ G+ + + V Sbjct: 135 NYVRKAILPYGRKNVPKALLLITDGVSSDGSY------TAQVAAMLRDEGVNVMVIGVG- 187 Query: 350 PPEGQDLLRKCTDSSGQ----FFAVNDSRELLESFDKITDKIQE 389 R G F + ++++ F+ + ++ + Sbjct: 188 -DVNVAECRGIVGCDGVMDCPMFKHTNWKDIMGLFNSLMKEVCD 230 >gi|218261918|ref|ZP_03476586.1| hypothetical protein PRABACTJOHN_02257 [Parabacteroides johnsonii DSM 18315] gi|218223694|gb|EEC96344.1| hypothetical protein PRABACTJOHN_02257 [Parabacteroides johnsonii DSM 18315] Length = 339 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 52/169 (30%), Gaps = 46/169 (27%) Query: 251 QCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L+ +NP T A++ A R S K Sbjct: 142 TQLPITSDYVSAKMFLSSINPSMVSTQGTAIGAAINLAVRSFTP----------SETSDK 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------PEGQDLLR- 358 +I ITDGEN + + ++ G+ + V + P + ++ Sbjct: 192 AIILITDGEN-------HEDDAVKAAAAAAEKGIHVNIVGMGDPKGSPIPVDGSNNYMKD 244 Query: 359 -----------------KCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 G + +++ L + K DK+ + Sbjct: 245 KDGNVVITKLNEEMCQEIAAAGHGTYVRADNTNSALRALQKEIDKMNKS 293 >gi|156358481|ref|XP_001624547.1| predicted protein [Nematostella vectensis] gi|156211334|gb|EDO32447.1| predicted protein [Nematostella vectensis] Length = 545 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 20/147 (13%), Positives = 47/147 (31%), Gaps = 16/147 (10%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 ++R G + + ++ EVK+ + ++ T T A R Sbjct: 298 GMTATIRFGFMIFGSSSQVVFDFTKFSSFAEVKTAVLGVSMVGGTCTAGAAITMCR---- 353 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 +S S+ + ++ + G+++ + ++ G+K+ V + Sbjct: 354 ---TSMFASSSSSSARVLVAMMAGKSTDS--------VTAPASAIKEIGIKMICVGMG-G 401 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELL 377 ++ L SS EL Sbjct: 402 QYDKEQLTGMASSSELILYAASWSELT 428 >gi|126334644|ref|XP_001371234.1| PREDICTED: similar to integrin, alpha L (antigen CD11A (p180), lymphocyte function-associated antigen 1; alpha polypeptide) [Monodelphis domestica] Length = 1139 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 19/115 (16%), Positives = 39/115 (33%), Gaps = 19/115 (16%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 TNTY + H +++N + + +I ITDG+ + E Sbjct: 182 TNTYQGIRHVVEDVFNMNRGARPEA-----VRVMIIITDGDATDTGNI----------EQ 226 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKI 387 + + Y + + + + L + ++ +L FD + KI Sbjct: 227 AK--HIIRYVIGIGGGIQEPEKLDDFASTPKSDFVKILDTYDKLKGLFDDLEKKI 279 >gi|123468942|ref|XP_001317686.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121900426|gb|EAY05463.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 688 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 27/257 (10%), Positives = 72/257 (28%), Gaps = 36/257 (14%) Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 ++ + +S+ +V S + + + + K S + + Sbjct: 170 NTQKELSDFKVSVDGTKNVIDSHNATFETNEAPKKDAIFIETPIKDEDKSIAVSSDGYIA 229 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + + + S+ A +R+ + +G + ++ Sbjct: 230 ISTNPSFSGKIESNSEFYFVVDCSGSMSGARIINAVKCMRLFIQSLPLGCRFSIIKFGTS 289 Query: 258 -------------NLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 N+++ + L +N T+ + H Sbjct: 290 FETVLQPCDYSDENVDKALNLLKSVNAKMGGTDILSPLQHIA-----------GLKPQPG 338 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG-MKIYSVAVSAPPEGQDLLRKCT- 361 K + +TDGE + ++N +I+S+ + + L++ Sbjct: 339 FVKQIFLLTDGEVNNPDITC--------ATALKNRNENRIFSIGLGSGA-DPGLIKGLAK 389 Query: 362 DSSGQFFAVNDSRELLE 378 S G + + D + E Sbjct: 390 KSGGNYIMIADEDNMNE 406 >gi|77919348|ref|YP_357163.1| hypothetical protein Pcar_1750 [Pelobacter carbinolicus DSM 2380] gi|77545431|gb|ABA88993.1| conserved hypothetical protein [Pelobacter carbinolicus DSM 2380] Length = 421 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 25/231 (10%), Positives = 62/231 (26%), Gaps = 41/231 (17%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQS----------------------------A 32 + AI++ + AID+ H+ +RN++Q+ A Sbjct: 19 LVAILLILFLGIAALAIDVYHVYVVRNELQNAADAGALAGARELYLESGASVNPNANVIA 78 Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN 92 + A+ + + + D + ++ GS + G+ + N Sbjct: 79 NNTAIENISEDVPVEVNYNAAANTGDVQRGHWSFAARQFTPNGSLTAIDVGNYTTEDLDN 138 Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + N L + + + + F + ++ + I S + Sbjct: 139 PDPNINGGLINAVKVVVRRQDRPASSFFAQIFGFENFGITAEAIAYIGFSGTINPAELDQ 198 Query: 153 --------VLDVSRSMEDLY-----LQKHNDNNNMTSNKYLLPPPPKKSFW 190 +LD + + + + +N P Sbjct: 199 PIAICEESILDDNGNYNCNMGRMLNSGSNLNTSNTGGWTNFSQPCDTADAS 249 >gi|294496937|ref|YP_003560637.1| hypothetical protein BMQ_0087 [Bacillus megaterium QM B1551] gi|295702304|ref|YP_003595379.1| hypothetical protein BMD_0085 [Bacillus megaterium DSM 319] gi|39640|emb|CAA45619.1| unnamed protein product [Bacillus megaterium] gi|294346874|gb|ADE67203.1| conserved hypothetical protein [Bacillus megaterium QM B1551] gi|294799963|gb|ADF37029.1| conserved hypothetical protein [Bacillus megaterium DSM 319] Length = 246 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 13/103 (12%), Positives = 37/103 (35%), Gaps = 10/103 (9%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----GQD 355 K ++ ITDG ++ + + + + G+ + + V + Sbjct: 1 MKKGTLKQMLVITDGGSNTGE------DPVAMAALAKEQGISVNVIGVMEEDTIDEQSTN 54 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + S G + ++L ++ +T + Q+++ N+ Sbjct: 55 EIEGIAMSGGGVSQIVYVKQLSQTVQMVTRQAMTQTLQGVVNK 97 >gi|221121786|ref|XP_002165500.1| PREDICTED: similar to procollagen, type XIV, alpha 1 [Hydra magnipapillata] Length = 3126 Score = 46.1 bits (107), Expect = 0.010, Method: Composition-based stats. Identities = 22/159 (13%), Positives = 47/159 (29%), Gaps = 17/159 (10%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 + + + +G + Y N+ + + + + +T Y Sbjct: 782 ASAFKVDQENSHLGLVTYATDAQIMLNFHHFNDPDTLTEARDAVRVKPHTGKYTG----- 836 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + L KE + + +I +TDG +S + +R+ G+KI +V Sbjct: 837 QALSLAKEGLFDKGHRSDALDVLILMTDGPSSDDVTEPS--------RALRDMGVKIIAV 888 Query: 346 AVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDK 382 + + L F D L + Sbjct: 889 GIG-NQIDRKQLNDIASDPDDEHVFTA-DYDNLGTIIRR 925 >gi|77456864|ref|YP_346369.1| hypothetical protein Pfl01_0636 [Pseudomonas fluorescens Pf0-1] gi|77380867|gb|ABA72380.1| putative exported protein [Pseudomonas fluorescens Pf0-1] Length = 659 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 41/163 (25%), Gaps = 8/163 (4%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M A+ + + +FI +D + R +Q D A L T Sbjct: 20 MAALTLGMALVFILVVVDSGRLYLERRHLQQIADVAALEAATR---GGNCGAGATANAYA 76 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL-- 118 + G + G + A N + +P Sbjct: 77 QASVVRNNFPIPSAGRTLAVACGTLNLDASNLRVFAVNAASTEAIRVVVSHTVPQSFAGA 136 Query: 119 ---FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 G +A NLS + + +L I + + Sbjct: 137 IGGLFGGAGRNATINLSATAVAAVPPPLASLTIRSTALSVDTG 179 >gi|281205213|gb|EFA79406.1| hypothetical protein PPL_07824 [Polysphondylium pallidum PN500] Length = 760 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/205 (15%), Positives = 68/205 (33%), Gaps = 11/205 (5%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG-TIAYNIGIVG 249 S ++ + ++ + +++ + +++ N+ + I Y G + Sbjct: 210 SSISSDVEIVFCFDTTGSMASIIKNVKLQVESTVARLMRDIPNIRIGIMGMGDYCDGKLV 269 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 L+ N +E+ + + ++ A +A L+ KE S S K + Sbjct: 270 LSTLDLTQNTDELIRFIKSVQDTSGGDSPEAYEYA---LFRAKELSW----SHHTSKAFV 322 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 I D S +N + C+ + G+K+Y V A + G Sbjct: 323 MIGDSNPHPPSFTDLNINWFKECDDLYEKGIKVY--GVKALGNSVFYDEIAERTGGICIN 380 Query: 370 VNDSRELLESFDKI-TDKIQEQSVR 393 + E F I + +Q + Sbjct: 381 FKKFNLITEMFLAICYREASKQKFK 405 >gi|327270788|ref|XP_003220170.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 930 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 19/135 (14%), Positives = 51/135 (37%), Gaps = 26/135 (19%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +++ + A++ ++ ++ +T+GE S + + + Sbjct: 383 TASGDSDICEGVTTAFQVFS--------RKLTSTEGCEIVLLTNGEGLDLSPCLSKIQSQ 434 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN---DSRELLESFDKITD 385 + + I+++A + + L K D + G+ F DS L+++F I+ Sbjct: 435 E---------IIIHTIAFGSKASNE--LEKLADMTGGKTFYATDSLDSNGLIDAFGGISS 483 Query: 386 ---KIQEQSVRIAPN 397 +QS+++ Sbjct: 484 GSGDASQQSIQLESK 498 >gi|170741515|ref|YP_001770170.1| hypothetical protein M446_3336 [Methylobacterium sp. 4-46] gi|168195789|gb|ACA17736.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 407 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 27/267 (10%), Positives = 54/267 (20%), Gaps = 13/267 (4%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTK------------KDQTS 61 ++DL+ + R ++Q+ D A LS S + + D T+ Sbjct: 36 GSSVDLSRTLRERARLQALTDQAALSAVTSSSASQDPADTVQTFFPQPSDPDQRRLAPTA 95 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 T+ + + + I AQI IT + Sbjct: 96 TVTVEGSTVTVTSTQNVATAFTGILGVAQIPITARSTAAPGNDGPPVCVLALNPTATDAI 155 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 +A S ++ L + + Sbjct: 156 LFSGNATVVASNCVIYSNSSAANALTRQGSASVQATGYCAVGGTNLPTSTTPR-PQSGCP 214 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 + A + L + ++ Sbjct: 215 RLDDPFRNLPLASGSGCKDAGEADKPNRTQTLDPGVYCGLTLKGTVTLNPGLYLIKGPLD 274 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNK 268 V Q L + +N Sbjct: 275 IGAQASVSGQGVTLYLTGGDANFTING 301 >gi|254187133|ref|ZP_04893648.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254263081|ref|ZP_04953946.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] gi|157934816|gb|EDO90486.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254214083|gb|EET03468.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] Length = 418 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 32/261 (12%), Positives = 82/261 (31%), Gaps = 5/261 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 26 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 85 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY--IAESKAQYEIPT 115 + +Q ++ S + + N ++ Sbjct: 86 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 145 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + L L + A ++S + + + AI + + +++ + +N + ++ Sbjct: 146 QTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 205 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + P +F S A + A + + G+ V + + Sbjct: 206 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTNAYN 265 Query: 236 VRIGTIAYNIGIVGNQCTPLS 256 R G A + Sbjct: 266 TRFGIYANPYKNPSYGTPDFT 286 >gi|121535594|ref|ZP_01667401.1| von Willebrand factor, type A [Thermosinus carboxydivorans Nor1] gi|121305834|gb|EAX46769.1| von Willebrand factor, type A [Thermosinus carboxydivorans Nor1] Length = 586 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 18/116 (15%), Positives = 39/116 (33%), Gaps = 13/116 (11%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 PL+ + + +S L + + +T + L + + VI ITD Sbjct: 460 PLTRDFAQAESSLAHIESFGSTPLALGLKVGIEYLKESRAKNP----------LVILITD 509 Query: 314 GENSGASAYQNTL-NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 G + + L + L +++ G + + P L + + G + Sbjct: 510 GVPTVGDITGDPLADALTAAASIKSHGYGFTCIGL--KPHRDYLTQVAQAAGGNIY 563 >gi|323450885|gb|EGB06764.1| hypothetical protein AURANDRAFT_71955 [Aureococcus anophagefferens] Length = 1008 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 48/134 (35%), Gaps = 14/134 (10%) Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +V + L KL P +TN + + EL G+ + V+ +TDG + Sbjct: 434 AKVHAALEKLAPGTSTNLWGGLELGVDEL---------VGGAGDNARAVLLLTDGVPN-- 482 Query: 320 SAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 ++ + + G + +++ LL + G F V D+ + Sbjct: 483 NSPPEGEVAALRAKRLTKDGSETVAVFAAGFGYALRSDLLLSLAREGGGLFSFVPDAGMV 542 Query: 377 LESFDKITDKIQEQ 390 SF+ + ++ Sbjct: 543 GTSFNHLVASLRSS 556 >gi|319648589|ref|ZP_08002803.1| YabS protein [Bacillus sp. BT1B_CT2] gi|317389356|gb|EFV70169.1| YabS protein [Bacillus sp. BT1B_CT2] Length = 245 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 11/90 (12%), Positives = 38/90 (42%), Gaps = 10/90 (11%) Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRK---CTDS 363 ++ +TDG ++ + + + + G+ + + + E Q+ +++ + Sbjct: 9 ILLLTDGCSNRGE------DPQAMAAFAKEQGITVNVIGIMDEHEMDQEAMKEVEGIALA 62 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQSVR 393 G + + +L ++ +T K Q+++ Sbjct: 63 GGGVHQLVYTSQLSQTVQMVTKKAMTQTIQ 92 >gi|241267206|ref|XP_002406336.1| calcium activated chlorine channel, putative [Ixodes scapularis] gi|215496880|gb|EEC06520.1| calcium activated chlorine channel, putative [Ixodes scapularis] Length = 519 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 15/139 (10%), Positives = 43/139 (30%), Gaps = 20/139 (14%) Query: 234 LSVRIGTIAYNIGIVGNQ---CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 +G + ++ ++ + + ++ L T+ + A L Sbjct: 193 DYQALGIVTFSGRCQVAHPLVVLNTTDARDGIAKVIDGLVLGAGTSIGCGLSKATEMLEG 252 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 S+ V +TDG+ + ++ + ++G+K+ + A+ Sbjct: 253 NGTSARG--------GLVFLVTDGDENYKPWI------VEQLPILVSSGVKVSTFALGT- 297 Query: 351 PEGQDLLRKCT-DSSGQFF 368 + L + G + Sbjct: 298 -LAEKKLEDVALQTGGTAY 315 >gi|116750907|ref|YP_847594.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] gi|116699971|gb|ABK19159.1| von Willebrand factor, type A [Syntrophobacter fumaroxidans MPOB] Length = 268 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 28/204 (13%), Positives = 62/204 (30%), Gaps = 29/204 (14%) Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 + +++ S + K + + K + G Sbjct: 80 NYYVIMDSSGSMNEVRCSGNRTKSEAAKTALAQFARITPKDANMG-------LAVFDAYG 132 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 I L N ++ + +N P T + A+ YR L + Sbjct: 133 IAERVPLGL-ENRDKFIAAVNATAPGNGTPLHDALLLGYRRLEETARRQAGYG-----EY 186 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--SAPPEGQDLLRKCTDSS 364 ++ ITDG+ QN T + +R + + I+++ Sbjct: 187 HLVVITDGQA----YPQNQDPTPVVAYILRQSPVVIHTIGFCIGTDHSLNQ--------P 234 Query: 365 GQ--FFAVNDSRELLESFDKITDK 386 G+ + A ++ REL + +++ + Sbjct: 235 GRTVYRAADNPRELQQGLEEVLAE 258 >gi|198437168|ref|XP_002122703.1| PREDICTED: similar to thrombospondin type 1 repeat containing protein [Ciona intestinalis] Length = 1530 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 20/140 (14%), Positives = 49/140 (35%), Gaps = 13/140 (9%) Query: 259 LNEVKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTR--LKKFVIFI 311 +N ++ + A+ A ++ + + + G+ + ++ Sbjct: 1396 YETADDVVNGISATIFRGGYRSKIGSALKDANTWMFQQSQGMRSFPGNDEDEIDHEILIA 1455 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV-SAPPEGQDLLRKCTDSSGQFFAV 370 TDG + N + ++ G+K+ +A+ + P Q+ +R D + + Sbjct: 1456 TDGCRT-----SNGYYLQAGLKAIKKRGIKLSVLAIYARDPRCQETIRSMVDDPNDLYEI 1510 Query: 371 NDSRELLESFDKITDKIQEQ 390 D L+E +I Q Sbjct: 1511 GDWAALVEMTKQIDAGACRQ 1530 >gi|118363890|ref|XP_001015168.1| Helicase conserved C-terminal domain containing protein [Tetrahymena thermophila] gi|89296935|gb|EAR94923.1| Helicase conserved C-terminal domain containing protein [Tetrahymena thermophila SB210] Length = 714 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 22/213 (10%), Positives = 63/213 (29%), Gaps = 24/213 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN------QCTPLS 256 I + + +++K + + + + Sbjct: 491 TGSMGNLITQTKNTIQTTFEQARDILKQKGYDPQCFQIMICCFRSYNSKFEEILEASSWE 550 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD--- 313 NN ++++S L K+ T ++ + + + + + Sbjct: 551 NNPDKLRSYLQKITASGGTYPGESVEVGLWWANKQSDENPIGQVIVLGDQPAHLQQEAQS 610 Query: 314 -----GENSGASAYQNTL-NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSG- 365 G+ S L + C +++ + + + + + + + G Sbjct: 611 HRQQLGQLYWDSTPLKGLTYYVPECYKLQSKNVPVNTFYL--NQSAKQTYEEIASLTKGI 668 Query: 366 -QFFAVNDSR---ELLESF-DKITDKIQEQSVR 393 QF +N ++ EL +F ++I I ++ R Sbjct: 669 SQFLDINSAQSSKELKNAFVEQILKDIGKEDGR 701 >gi|167821376|ref|ZP_02453056.1| hypothetical protein Bpse9_40018 [Burkholderia pseudomallei 91] Length = 396 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 32/322 (9%), Positives = 88/322 (27%), Gaps = 29/322 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 4 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 63 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + +Q ++ S + + N ++ + + Sbjct: 64 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 123 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM-------EDLYLQKHND 170 L + + N S+ +T + + +I + + + + + Sbjct: 124 QTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 183 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR------------KIDVLIESAG 218 + + + N+ S + Sbjct: 184 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTNAYN 243 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR-------LNKLNP 271 ++ + AY+ +Q S+ +++ + + +N Sbjct: 244 TRFGIYANPYKDPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASYQGDLITGINT 303 Query: 272 YENTNTYPAMHHAYRELYNEKE 293 + N A R L E Sbjct: 304 GGSYNPSYYAAGADRRLALAPE 325 >gi|317122731|ref|YP_004102734.1| von Willebrand factor A [Thermaerobacter marianensis DSM 12885] gi|315592711|gb|ADU52007.1| von Willebrand factor type A [Thermaerobacter marianensis DSM 12885] Length = 1122 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 15/106 (14%), Positives = 32/106 (30%), Gaps = 17/106 (16%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T+ + A + + + + VI +TDG + Sbjct: 524 AGGGTSLGVGLEAALHLMKDVRADV----------RHVIALTDGV-------SEPFDVAG 566 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 G+ + +VA+ A + L + GQ + D ++ Sbjct: 567 TARAFHEQGITVSAVAIGADADTTTLGWLAQEGGGQLYVAADPGQI 612 >gi|198436180|ref|XP_002124514.1| PREDICTED: similar to von Willebrand factor A domain containing 3A [Ciona intestinalis] Length = 1107 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 18/160 (11%), Positives = 50/160 (31%), Gaps = 25/160 (15%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL---NKLNPYENTNTYPAMHHAYREL 288 +A++ Q + +N + + + L + + T A+ A Sbjct: 897 NGNKTAFNIVAFSNTSTKWQDSITESNQSACHDAVQWVSALTAHGGSATLKAIQVAL--- 953 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + +TDG+ + TL + + I++++ + Sbjct: 954 ------------ADEEAEAIYLLTDGKP-----DSSIKLTLSEASNLNKKNIPIHTISFN 996 Query: 349 APPE-GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 D L+ + S G+F + + + ++ Sbjct: 997 CDNREANDFLKSLSSNSGGRFHRCHGEADAQFAIHRLMQD 1036 >gi|332140408|ref|YP_004426146.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii str. 'Deep ecotype'] gi|327550430|gb|AEA97148.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii str. 'Deep ecotype'] Length = 1355 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 43/352 (12%), Positives = 94/352 (26%), Gaps = 36/352 (10%) Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 I ++ + I A + + N + I Sbjct: 346 ITDAYLEFTAYSTRTYGNPSMRIRGVADDDASDFHPNRRYRLRNLPKTSGITWSMPDFYN 405 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 ++S I++RS + V+D S + K+ Sbjct: 406 NYVYRTPDISGIVKQIVDRSGWQSGNDMAFVMDNFVSYRGAHTYNSPSKAPKLIVKFNGA 465 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKI-DVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P S + SK A I D L+E+A +++ V Sbjct: 466 ATPGASATVREHLVSKIDELSANGFTPIVDTLLEAANYYGGRSVDYGRKRGENDVNWSVR 525 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTI 299 + + + P+ T N + + S + Sbjct: 526 RSTRVSHRSSYLG-----------ADSILPWGCTEDNLSDSDCITEQIPTPATYISPVSD 574 Query: 300 GSTRLKKFVIFITDGENSGASAYQN---------------------TLNTLQICEYMRNA 338 + ++ ++DGE + + N + Sbjct: 575 LQCQTNNHIVLLSDGEANNNHSVSKIQTLLGQSCTGNGGEKCGLDLVRNISDTSTSVIGP 634 Query: 339 GMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 + +++ +A + L + S G F+ ++S +LLE+F+ I +++ Sbjct: 635 RVITHTIGFAANNTANNFLNQLALQSGGGFYQADNSTDLLEAFNTILRSVKD 686 >gi|301756599|ref|XP_002914168.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4-like [Ailuropoda melanoleuca] Length = 1127 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 24/328 (7%), Positives = 87/328 (26%), Gaps = 34/328 (10%) Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 D+ + + L+ ++ + + T N + + Sbjct: 158 DENGNYVELGAEFILESNTHFNNLMVNTSISNVQLPTNVYNKDPDILNGVYMSEALNPVF 217 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 + P+ STG + + + + Sbjct: 218 VENFQRDPTLTWQYFGSSTGFFRIYPGIKWTPDENGVIAFDCRNRGWYIQ----AATSPK 273 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 ++ S ++ + + +++++ + ++ Sbjct: 274 DIVIVVDTSGSMKGL----------------RMTIAKHTISTILDTLGENDF-VNIIAYS 316 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 G +N K +++L A+ A++ L +E+ Sbjct: 317 DYIHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVNQALTEAFQILKQFQEARQG 376 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 ++ + + ++ ITDG N + +++++ + D + Sbjct: 377 SLCN----QAIMLITDGAVEDYEPVFEKYNWP-------DRKVRVFTYLIGREVTFADRM 425 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++ ++ L ++ + + + Sbjct: 426 KWIACNNKGYY--TQISTLADAQENVME 451 >gi|297287371|ref|XP_001099130.2| PREDICTED: collagen alpha-2(VI) chain-like isoform 3 [Macaca mulatta] Length = 1029 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 71/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ + D+L+ V +Q + L + Sbjct: 35 NNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 94 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ Sbjct: 95 RYGGLHFSDQVEVFSPPGSDRASFIKSLQGISSFRRGTFTDCALANMTEQI--------R 146 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 G+ F + ITDG +G+ L E R G+++++VA + L Sbjct: 147 QHGTKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNRNLKEQGL 201 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 202 RDIASTPHELYRNDYATMLPDS-TEIDQDTINRIIKVMKH 240 >gi|281349604|gb|EFB25188.1| hypothetical protein PANDA_001994 [Ailuropoda melanoleuca] Length = 805 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 24/328 (7%), Positives = 87/328 (26%), Gaps = 34/328 (10%) Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 D+ + + L+ ++ + + T N + + Sbjct: 158 DENGNYVELGAEFILESNTHFNNLMVNTSISNVQLPTNVYNKDPDILNGVYMSEALNPVF 217 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 + P+ STG + + + + Sbjct: 218 VENFQRDPTLTWQYFGSSTGFFRIYPGIKWTPDENGVIAFDCRNRGWYIQ----AATSPK 273 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 ++ S ++ + + +++++ + ++ Sbjct: 274 DIVIVVDTSGSMKGL----------------RMTIAKHTISTILDTLGENDF-VNIIAYS 316 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 G +N K +++L A+ A++ L +E+ Sbjct: 317 DYIHYIEPCFKGILVQADRDNREHFKQLVDELMVKGVGVVNQALTEAFQILKQFQEARQG 376 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 ++ + + ++ ITDG N + +++++ + D + Sbjct: 377 SLCN----QAIMLITDGAVEDYEPVFEKYNWP-------DRKVRVFTYLIGREVTFADRM 425 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++ ++ L ++ + + + Sbjct: 426 KWIACNNKGYY--TQISTLADAQENVME 451 >gi|167538097|ref|XP_001750714.1| hypothetical protein [Monosiga brevicollis MX1] gi|163770738|gb|EDQ84419.1| predicted protein [Monosiga brevicollis MX1] Length = 748 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 29/262 (11%), Positives = 68/262 (25%), Gaps = 28/262 (10%) Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 Q+ T L ++ + + + S+ + + +C+VLD S SM Sbjct: 35 NTRPTDQHPQSTAPALSAQLYATSTADTLICALAASAASARDTPLHVCLVLDRSASMSVP 94 Query: 164 YLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS 223 + ++ + ++ A +I V+ + ++ Sbjct: 95 VTARDREHA---------IVEYDITVMDMVKYAARVALHCLAPGDRISVVSFADQARIDV 145 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + A L V + ++++ TN + + Sbjct: 146 PPTTLPSTPTTTTTTTGPAAQGNSE----------LAAVVAGIDQIAAEGRTNLWAGLRT 195 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A L I +TDG + A + + + Sbjct: 196 ALTLLREHHVPG--------THNLCIALTDGVPNCHPALGYVEAQRAF-QNDPDFRYVLN 246 Query: 344 SVAVSAPPEGQDLLRKCTDSSG 365 ++ LL + G Sbjct: 247 TIPFGYEDMDAKLLHQLALQGG 268 >gi|323474472|gb|ADX85078.1| von Willebrand factor type A [Sulfolobus islandicus REY15A] gi|323477209|gb|ADX82447.1| von Willebrand factor type A [Sulfolobus islandicus HVE10/4] Length = 356 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 19/98 (19%), Positives = 41/98 (41%), Gaps = 8/98 (8%) Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 L+ + N +++ +I +TDG+ + + N + +I ++ Sbjct: 107 TRLHEAVSFTINLAKQSQVPTKIIMLTDGKPT------DKRNVKDYEKLDIPPNTQIITI 160 Query: 346 AVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 + + +L+K D SSG+F+ + D EL F+ Sbjct: 161 GIG-NDYNERILKKLADRSSGKFYHIKDISELPNIFES 197 >gi|281207766|gb|EFA81946.1| hypothetical protein PPL_05180 [Polysphondylium pallidum PN500] Length = 1990 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 34/217 (15%), Positives = 75/217 (34%), Gaps = 33/217 (15%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN-------Q 251 +I + + N+V+ I ++ VR+ + YN + + Sbjct: 1667 VVDDTGSMGSEIAKVKQEIQNIVDDIVSI----GSIEVRVAMVFYNDHTPNSDHSKSVCK 1722 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 ++++ E++ L+ + + + AM + E+ + K I I Sbjct: 1723 VFKFTSDIPELRRGLDSVVVHGGADHPEAMADGFYEVTK-------LDFAKSSTKVCIVI 1775 Query: 312 TDGENSGASAYQNTL--------NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD- 362 D G S ++ + + + G+ Y+V + + L D Sbjct: 1776 GDAPPHGFSGSGDSFPQGCPCGHDLIASVRQLVQGGVTFYTVMCRGDSQTYETLNAIADL 1835 Query: 363 SSGQFFAVNDSRELLESF------DKITDKIQEQSVR 393 S G+F +N++ EL E + D I E+ ++ Sbjct: 1836 SEGRFVLLNNASELTEIITGSAKASILLDTIAEEVLK 1872 >gi|111018687|ref|YP_701659.1| hypothetical protein RHA1_ro01688 [Rhodococcus jostii RHA1] gi|110818217|gb|ABG93501.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 904 Score = 46.1 bits (107), Expect = 0.011, Method: Composition-based stats. Identities = 26/180 (14%), Positives = 51/180 (28%), Gaps = 31/180 (17%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH---------- 283 S + A + L + ++R + ++ A Sbjct: 94 FSEKFIVHAPWTRLDNGSLPALQGEVERFRTRTDGIDTDYWNALDGARRTLAERDGQSEA 153 Query: 284 ----AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL--NTLQICEYMRN 337 A + K + V +D A I + +R+ Sbjct: 154 NRCQAVAWFSDGKLDFTVRDAEKPYAQGVSLGSDQGVQQVVAAARESICRPAGIADQLRS 213 Query: 338 AGMKIYSVAVSAP---PEGQDLLRKCTD------------SSGQFFAVNDSRELLESFDK 382 +G+ ++V ++A P DL+R S G F+ + +LL +FD Sbjct: 214 SGIVTFAVGLAAGTAQPSDFDLMRSIATGGDGACGKTTSPSPGDFYLAQNIDDLLFAFDA 273 >gi|332974518|gb|EGK11438.1| PilC protein [Kingella kingae ATCC 23330] Length = 1328 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 23/209 (11%), Positives = 54/209 (25%), Gaps = 24/209 (11%) Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT-PLS 256 A + + E N + + + L+ Sbjct: 228 CNAAYRQSNSNAKNWCSELPQNFIYYPYSGVSDPSVNFGPSVRKRELGYWYTESPLRNLT 287 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS-----TRLKKFVIFI 311 ++L+ + S P + T + G + + Sbjct: 288 SDLDFLDSDSAFYTPSKRTAYGLYGMYPIFNYQPILAPQTQYAGEMLSFFSEPINKIDLR 347 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--PEGQDLLR----KCTDSSG 365 T G + ++ + + +++ + G LL+ G Sbjct: 348 TVGNDDTGKSWNDPAFPT--------QNITTFTIGFGSGLTSSGLALLKGGASDIKLKDG 399 Query: 366 Q----FFAVNDSRELLESFDKITDKIQEQ 390 ++ ND L ++FD I D+I+ + Sbjct: 400 TVQKAYYPANDQAGLQKAFDSIFDQIERE 428 >gi|313669375|ref|YP_004049802.1| von Willebrand factor type A [Sulfuricurvum kujiense DSM 16994] gi|313156572|gb|ADR35249.1| von Willebrand factor type A [Sulfuricurvum kujiense DSM 16994] Length = 588 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 14/133 (10%), Positives = 49/133 (36%), Gaps = 16/133 (12%) Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG- 318 + ++ R++ + P T + + + L + K ++ ++DG+ + Sbjct: 469 DTIRGRIHAIKPGYYTRMGAGIRESAKILDKQ----------QSANKLLLILSDGKPNDV 518 Query: 319 --ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 +T + E ++ G+ + + + ++ L + + D+++L Sbjct: 519 DRYDGRYGIEDTKKAIEEVKKKGITPFCITIDIDA--KEYLPYL-FGRNSYAVIRDAKKL 575 Query: 377 LESFDKITDKIQE 389 + +I + + Sbjct: 576 PKVLPEIYMNLTK 588 >gi|260813733|ref|XP_002601571.1| hypothetical protein BRAFLDRAFT_141158 [Branchiostoma floridae] gi|229286869|gb|EEN57583.1| hypothetical protein BRAFLDRAFT_141158 [Branchiostoma floridae] Length = 161 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 27/177 (15%), Positives = 60/177 (33%), Gaps = 26/177 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + D + + A ++VNS + + R+G Y+ + Sbjct: 9 SGSVGTANFDKVKQFAADVVNSF-----DVSPTATRVGVAQYSDRNSLVFNLGDHADKPS 63 Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 S +N ++ T T A+ + + K +I +TDG++ A Sbjct: 64 TVSAINGISYQRGGTKTGAALEFVRQN---------AAWRGGAVPKVMIVLTDGKSGDAV 114 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSREL 376 A + + + G+ +Y++ V L++ +S + + D L Sbjct: 115 AAPS--------QSLAADGVAVYAIGVG--NFDHAELQQIANSDQDKVIELTDFNAL 161 >gi|240169099|ref|ZP_04747758.1| hypothetical protein MkanA1_07284 [Mycobacterium kansasii ATCC 12478] Length = 741 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 38/362 (10%), Positives = 85/362 (23%), Gaps = 63/362 (17%) Query: 32 ALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQI 91 A D + + + R D + Q S + + + + + Sbjct: 180 ARDTDAVPDASRVTPPRLADDDERPELQISLTVESAGLPVSDLRTSLPTAVLEDSADGPT 239 Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 + + + + + + + + + T + + + Sbjct: 240 RLRVEPGARADRDFVLRFRLDRGALSSSALLVPDADGDEGTWSVTLVPPAEPSSAPRDVV 299 Query: 152 MVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 +VLD S SM + L Sbjct: 300 VVLDRSGSMG-GWKMVAARRAAGRIVDMLDTVDRFCVL---------------------- 336 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + + +++ S + N S L L Sbjct: 337 ----AFDDRIDTPTDMAPGLVEGSDQ--------------------NRFAAASWLGSLRS 372 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A L E + V+ +TDG+ +G + Sbjct: 373 RGGTEMAEPLRRAVELLAGSDEGRQAS---------VVLVTDGQITGEDHLL-----RSL 418 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + + ++IY V + L R G+ V L E+ ++ I + Sbjct: 419 AQSLGR--IRIYCVGIDRAVNAGFLDRLARLGRGRAELVESEARLDEAMSRLARTIGRPA 476 Query: 392 VR 393 + Sbjct: 477 LT 478 >gi|126442905|ref|YP_001064078.1| hypothetical protein BURPS668_A3087 [Burkholderia pseudomallei 668] gi|126222396|gb|ABN85901.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 418 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 32/261 (12%), Positives = 82/261 (31%), Gaps = 5/261 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 26 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 85 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY--IAESKAQYEIPT 115 + +Q ++ S + + N ++ Sbjct: 86 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 145 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + L L + A ++S + + + AI + + +++ + +N + ++ Sbjct: 146 QTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 205 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + P +F S A + A + + G+ V + + Sbjct: 206 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTNAYN 265 Query: 236 VRIGTIAYNIGIVGNQCTPLS 256 R G A + Sbjct: 266 TRFGIYANPYKNPSYGTPDFT 286 >gi|254182581|ref|ZP_04889175.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] gi|184213116|gb|EDU10159.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] Length = 418 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 32/261 (12%), Positives = 82/261 (31%), Gaps = 5/261 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 26 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 85 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY--IAESKAQYEIPT 115 + +Q ++ S + + N ++ Sbjct: 86 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 145 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + L L + A ++S + + + AI + + +++ + +N + ++ Sbjct: 146 QTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 205 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + P +F S A + A + + G+ V + + Sbjct: 206 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGNKAATTNAYN 265 Query: 236 VRIGTIAYNIGIVGNQCTPLS 256 R G A + Sbjct: 266 TRFGIYANPYKNPSYGTPDFT 286 >gi|317064189|ref|ZP_07928674.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] gi|313689865|gb|EFS26700.1| conserved hypothetical protein [Fusobacterium ulcerans ATCC 49185] Length = 325 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 27/182 (14%), Positives = 62/182 (34%), Gaps = 55/182 (30%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEK 292 RIG I ++ PL+++ + K+ +N L+ T Y A+ A + Sbjct: 123 RIGFIPFSDSAYIQ--MPLTDDYSIGKNYINALDTNLISGGGTELYQALELAEKSFKEIN 180 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + K +I ++DG + + +++++ M ++S+ + Sbjct: 181 SDN----------KTIIVLSDGGDFDDKS----------LKFVKDNKMNVFSIGIGTDEG 220 Query: 353 ---------------------------GQDLLRKCTD-SSGQFFAVND-SRELLESFDKI 383 D L+K + S+G+++ VN+ + F Sbjct: 221 TIIPEYVNGKKVGFIKDQKGSAVISKLNSDFLKKLSSESNGKYYEVNNLKDDTSNFFRDT 280 Query: 384 TD 385 + Sbjct: 281 AN 282 >gi|288921206|ref|ZP_06415492.1| von Willebrand factor type A [Frankia sp. EUN1f] gi|288347413|gb|EFC81704.1| von Willebrand factor type A [Frankia sp. EUN1f] Length = 625 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 22/130 (16%), Positives = 42/130 (32%), Gaps = 13/130 (10%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 V ++++ P +T Y A+R L E V+ +TDG NS Sbjct: 502 RQAVIDEMSRIEPRGDTGLYETTLAAFRHLNQHYEDGWPNQ--------VVLLTDGRNSD 553 Query: 319 ASAYQNTLNTLQICEYMRN-AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE-- 375 + + ++I ++ + L R + Q + D Sbjct: 554 PGSMSLDELVRTLRREYSALHPVRIITIGYGEDADLGALARISDATGAQSYPALDPNSIF 613 Query: 376 --LLESFDKI 383 L+ +F +I Sbjct: 614 VVLVGAFTEI 623 >gi|167908132|ref|ZP_02495337.1| hypothetical protein BpseN_38271 [Burkholderia pseudomallei NCTC 13177] Length = 396 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 33/322 (10%), Positives = 88/322 (27%), Gaps = 29/322 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 4 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 63 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + +Q ++ S + + N ++ + + Sbjct: 64 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 123 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM-------EDLYLQKHND 170 L + +TN S+ +T + + +I + + + + + Sbjct: 124 QTLNLVPGVTVTNASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 183 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR------------KIDVLIESAG 218 + + + N+ S + Sbjct: 184 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGNKAAATNAYN 243 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR-------LNKLNP 271 + + AY+ +Q S+ +++ + + +N Sbjct: 244 TRFGIYANPYKNPSYGTPDFTGFAYDATTWPSQSNAYSDFVSKRLTFASYQGDLITGINT 303 Query: 272 YENTNTYPAMHHAYRELYNEKE 293 + N A R L E Sbjct: 304 GGSYNPSYYAAGADRRLALAPE 325 >gi|226194158|ref|ZP_03789758.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] gi|225933851|gb|EEH29838.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] Length = 418 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 32/261 (12%), Positives = 82/261 (31%), Gaps = 5/261 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 26 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 85 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY--IAESKAQYEIPT 115 + +Q ++ S + + N ++ Sbjct: 86 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 145 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + L L + A ++S + + + AI + + +++ + +N + ++ Sbjct: 146 QTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 205 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + P +F S A + A + + G+ V + + Sbjct: 206 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTNAYN 265 Query: 236 VRIGTIAYNIGIVGNQCTPLS 256 R G A + Sbjct: 266 TRFGIYANPYKNPSYGTPDFT 286 >gi|134298678|ref|YP_001112174.1| hypothetical protein Dred_0811 [Desulfotomaculum reducens MI-1] gi|134051378|gb|ABO49349.1| hypothetical protein Dred_0811 [Desulfotomaculum reducens MI-1] Length = 183 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 19/160 (11%), Positives = 50/160 (31%), Gaps = 11/160 (6%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCAS----------IVSDRTI 50 + + + LF+ ID R ++Q+ DA L+G ++ D ++ Sbjct: 16 LFVPVFLIIMLFMARGIDWGMATVARGKLQTISDAGSLAGASAVEPITKVELVNNDDGSL 75 Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK-NNPLQYIAESKA 109 + + Q + L + T+++ Y +S A Sbjct: 76 ELREKVTGIKINSEEAQRRARLARELNGGTQDYWQGVGGHWEGTEERIEGDDIYCVKSTA 135 Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAIS 149 + ++P + + + +++ + + I Sbjct: 136 RVKLPFLSRIYEKVSGHKDLTITMPGDARSVLIKQKMNID 175 >gi|282896561|ref|ZP_06304580.1| hypothetical protein CRD_00534 [Raphidiopsis brookii D9] gi|281198552|gb|EFA73434.1| hypothetical protein CRD_00534 [Raphidiopsis brookii D9] Length = 587 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 15/125 (12%), Positives = 45/125 (36%), Gaps = 16/125 (12%) Query: 240 TIAYNIGIVGNQCTPLSNNLNE--VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 ++Y + ++L + ++ L T Y + EL +++++ N Sbjct: 451 LVSYGDQPINLVKLAPFDDLQHKRFLAGIDGLEADGATAMYDGVMVGLSELLQQRKTNPN 510 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 +++ +TDG+ + ++ ++ +G+++Y +A + L Sbjct: 511 GKF------YLLLLTDGQTNQGFNFEQVKEIIEY------SGVRVYPIAYG--EVNEAEL 556 Query: 358 RKCTD 362 Sbjct: 557 NAIAA 561 >gi|193788521|dbj|BAG53415.1| unnamed protein product [Homo sapiens] Length = 328 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 58/151 (38%), Gaps = 26/151 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 ++ + + RL T+ + A+ + + + ++ +T Sbjct: 75 INSGSDRDTLAKRLPA-AASGGTSICSGLRSAFTVIRKKYPTD---------GSEIVLLT 124 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 DGE++ S N + + +G I++VA+ P Q+L + G +D Sbjct: 125 DGEDNTISGCFNEV---------KQSGAIIHTVALG-PSAAQELEELSKMTGGLQTYASD 174 Query: 373 ---SRELLESFDKITD---KIQEQSVRIAPN 397 + L+++F ++ + ++S+++ Sbjct: 175 QVQNNGLIDAFGALSSGNGAVSQRSIQLESK 205 >gi|21359974|ref|NP_444506.2| vitrin isoform 1 [Homo sapiens] gi|62702118|gb|AAF19243.2|AC007363_1 unknown [Homo sapiens] gi|16552271|dbj|BAB71279.1| unnamed protein product [Homo sapiens] gi|119620821|gb|EAX00416.1| vitrin, isoform CRA_a [Homo sapiens] Length = 693 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 39/120 (32%), Gaps = 14/120 (11%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 347 MGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 404 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 405 ---NRSGAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 453 >gi|66805993|ref|XP_636718.1| hypothetical protein DDB_G0288381 [Dictyostelium discoideum AX4] gi|60465117|gb|EAL63216.1| hypothetical protein DDB_G0288381 [Dictyostelium discoideum AX4] Length = 549 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 56/149 (37%), Gaps = 11/149 (7%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 + +S N + L ++ +T Y A++HA E+ ++ + + + Sbjct: 185 DTAFEISKNFDSFSQELGEVVANQGSTRLYEAIYHAANEIEKY-RNNPKEKLAPDVCCRI 243 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ-DLLRKCTDSSGQF 367 +TDG+++ N ++ + + ++ + L K + G Sbjct: 244 FLLTDGQDTSNINPYNVYQY------LKPLNIILDAIPIGRDDNSTLSTLTKA--TGGSC 295 Query: 368 FAVNDSRELLESFDKITDKIQEQSVRIAP 396 F N ++E +E F++ I Q +P Sbjct: 296 FMANSTQEGVELFEREALLIPTQRDSFSP 324 >gi|297475370|ref|XP_002707867.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 4-like [Bos taurus] gi|296487070|gb|DAA29183.1| calcium channel, voltage-dependent, alpha 2/delta subunit 4-like [Bos taurus] Length = 1111 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/354 (7%), Positives = 105/354 (29%), Gaps = 25/354 (7%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + + + + D + + + +++ + + N Sbjct: 111 MLRRKVEAVQNLVEAAEEADLNHEFNESLVFDYYNSVLLNEKDSSGAYVELGAEFLLEAN 170 Query: 99 NPLQYIA--ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + + + ++PT I + + + +E + ++ Sbjct: 171 AHFSDLLVNTTLSSVQLPTNVYNKDPDILNGIYMSEALNAVFVENFQRDPTLTWQYFGSS 230 Query: 157 SRSME--DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + ++N +T + + + K+ ++ + Sbjct: 231 TGFFRIYPGIKWTPDENGVITFDCRNRGWYIQAATSPKDIVIVVDTSGSMKG-LRMTIAK 289 Query: 215 ESAGNLVNSIQKAIQEK---KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + +++++ + N + + +V +N K +++L Sbjct: 290 HTVSTILDTLGENDFVNIIAYNDYIHYIEPCFKGILVQADR----DNREHFKQLVDELMV 345 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 A+ A++ L +E+ ++ + + ++ ITDG N Sbjct: 346 KGVGVVDRALREAFQILQQFQEAGQGSLCN----QAIMLITDGAVEDYEPVLEKYNWP-- 399 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + +++++ + D L+ ++ ++ L ++ + + + Sbjct: 400 -----DRKVRVFTYLIGREVSFADRLKWIACNNKGYY--TQISTLADAQENVME 446 >gi|194667489|ref|XP_001787557.1| PREDICTED: voltage-gated calcium channel alpha(2)delta-4 subunit [Bos taurus] Length = 1111 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 26/354 (7%), Positives = 105/354 (29%), Gaps = 25/354 (7%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + + + + D + + + +++ + + N Sbjct: 111 MLRRKVEAVQNLVEAAEEADLNHEFNESLVFDYYNSVLLNEKDSSGAYVELGAEFLLEAN 170 Query: 99 NPLQYIA--ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + + + ++PT I + + + +E + ++ Sbjct: 171 AHFSDLLVNTTLSSVQLPTNVYNKDPDILNGIYMSEALNAVFVENFQRDPTLTWQYFGSS 230 Query: 157 SRSME--DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + ++N +T + + + K+ ++ + Sbjct: 231 TGFFRIYPGIKWTPDENGVITFDCRNRGWYIQAATSPKDIVIVVDTSGSMKG-LRMTIAK 289 Query: 215 ESAGNLVNSIQKAIQEK---KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + +++++ + N + + +V +N K +++L Sbjct: 290 HTVSTILDTLGENDFVNIIAYNDYIHYIEPCFKGILVQADR----DNREHFKQLVDELMV 345 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 A+ A++ L +E+ ++ + + ++ ITDG N Sbjct: 346 KGVGVVDRALREAFQILQQFQEAGQGSLCN----QAIMLITDGAVEDYEPVLEKYNWP-- 399 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + +++++ + D L+ ++ ++ L ++ + + + Sbjct: 400 -----DRKVRVFTYLIGREVSFADRLKWIACNNKGYY--TQISTLADAQENVME 446 >gi|13603394|gb|AAA52056.2| type VI collagen alpha 2 chain precursor [Homo sapiens] Length = 1019 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 72/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ + D+L+ V +Q + L + Sbjct: 35 NNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 94 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ ++ Sbjct: 95 RYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR----- 149 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S F + ITDG +G+ L E R G+++++VA + L Sbjct: 150 ---SKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNQNLKEQGL 201 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 202 RDIASTPHELYRNDYATMLPDS-TEINQDTINRIIKVMKH 240 >gi|332141921|ref|YP_004427659.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii str. 'Deep ecotype'] gi|327551943|gb|AEA98661.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii str. 'Deep ecotype'] Length = 1355 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 43/352 (12%), Positives = 94/352 (26%), Gaps = 36/352 (10%) Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 I ++ + I A + + N + I Sbjct: 346 ITDAYLEFTAYSTRTYGNPSMRIRGVADDDASDFHPNRRYRLRNLPKTSGITWSMPDFYN 405 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 ++S I++RS + V+D S + K+ Sbjct: 406 NYVYRTPDISGIVKQIVDRSGWQSGNDMAFVMDNFVSYRGAHTYNSPSKAPKLIVKFNGA 465 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKI-DVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P S + SK A I D L+E+A +++ V Sbjct: 466 ATPGASATVREHLVSKIDELSANGFTPIVDTLLEAANYYGGRSVDYGRKRGENDVNWSVR 525 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNEKESSHNTI 299 + + + P+ T N + + S + Sbjct: 526 RSTRVSHRSSYLG-----------ADSILPWGCTEDNLSDSDCITEQIPTPATYISPVSD 574 Query: 300 GSTRLKKFVIFITDGENSGASAYQN---------------------TLNTLQICEYMRNA 338 + ++ ++DGE + + N + Sbjct: 575 LQCQTNNHIVLLSDGEANNNHSVSKIQTLLGQSCTGNGGEKCGLDLVRNISDTSTSVIGP 634 Query: 339 GMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 + +++ +A + L + S G F+ ++S +LLE+F+ I +++ Sbjct: 635 RVITHTIGFAANNTANNFLNQLALQSGGGFYQADNSTDLLEAFNTILRSVKD 686 >gi|23500008|ref|NP_699448.1| norD protein [Brucella suis 1330] gi|81751586|sp|Q8FX38|NORD_BRUSU RecName: Full=Protein norD gi|23463592|gb|AAN33453.1| norD protein [Brucella suis 1330] Length = 633 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 45/132 (34%), Gaps = 20/132 (15%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T AM HA +L + KK ++ +TDG+ + Sbjct: 515 TVEHRIAALKPGFYTRMGAAMRHATAKLAEQPNR----------KKLLLLLTDGKPNDVD 564 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ--DLLRKCTDSSGQFFAVNDSRE 375 ++ + R G+ +++V V L G + V + + Sbjct: 565 HYEGRFALEDSRRAAGEARAKGVNVFAVTVDREASAHLPALF-----GRGGYALVANLAK 619 Query: 376 LLESFDKITDKI 387 L + I + Sbjct: 620 LPVALPAIYRML 631 >gi|89068016|ref|ZP_01155433.1| CpaB family protein [Oceanicola granulosus HTCC2516] gi|89046255|gb|EAR52312.1| CpaB family protein [Oceanicola granulosus HTCC2516] Length = 281 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 18/199 (9%), Positives = 47/199 (23%), Gaps = 3/199 (1%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDP---TTKKD 58 +++ V +A+ +A Q Q A + AV + V + + Sbjct: 5 FGLVLIVGVGLAGFAVYMAQGYLQDTQAQLARERAVAAQATPTVEVYAVTRQIGYGEQLT 64 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 K + L G + E Q+ + + + K + Sbjct: 65 PDDVTLVKYAEPFLPDGVFRAEEELFPEGDTQLRVVLRPMEVNEPVLAVKVTEPGEDAGI 124 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 + + + + + + + N N+ + Sbjct: 125 TSRLGRGMRAFAIRVDVASGVSGFLRPGDKVDIYWTGRNSRGGEEVTKLIESNVNLIAID 184 Query: 179 YLLPPPPKKSFWSKNTTKS 197 + ++ T Sbjct: 185 QSADGSALNTTIARTVTVE 203 >gi|307720337|ref|YP_003891477.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978430|gb|ADN08465.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 599 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 17/131 (12%), Positives = 50/131 (38%), Gaps = 16/131 (12%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 ++ R+ + P T A+ + + L + K ++ I+DG+ + Sbjct: 482 IRGRIESIKPQYYTRMGAAIRESAKILDKQ----------QSANKLLLIISDGKPNDEDR 531 Query: 322 YQNTL---NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 Y +T + + ++ G+ + + + ++ L +G + V D ++L + Sbjct: 532 YDGRYGIEDTKKALQEIKKKGITPFCITIDLDA--KEYLNYLFGQNG-YAIVRDGQKLPK 588 Query: 379 SFDKITDKIQE 389 ++ + + Sbjct: 589 VLTEVYINLTK 599 >gi|119620823|gb|EAX00418.1| vitrin, isoform CRA_c [Homo sapiens] Length = 700 Score = 45.7 bits (106), Expect = 0.012, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 39/120 (32%), Gaps = 14/120 (11%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 354 MGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 411 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 412 ---NRSGAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 460 >gi|156408868|ref|XP_001642078.1| predicted protein [Nematostella vectensis] gi|156229219|gb|EDO50015.1| predicted protein [Nematostella vectensis] Length = 257 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 26/146 (17%), Positives = 52/146 (35%), Gaps = 12/146 (8%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNT 298 + + + N ++ +LN L TNT A++ A +N Sbjct: 96 VYSTITFSTEAEISFRYANRSQAIEKLNDLPYMACKTNTQLALNLAEMIFFNNTLGPLRP 155 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 ++ ++ TDG+++ TL ++ G++IY VAV G + Sbjct: 156 G-----RRRILIFTDGQSNV-----KEQMTLYRAFRLKKRGVEIYVVAVGKYLYGMHEII 205 Query: 359 KCTDSSGQF-FAVNDSRELLESFDKI 383 SS + V ++ ++ I Sbjct: 206 GLATSSSHHLYRVRSMKDFVKIVQLI 231 >gi|89100511|ref|ZP_01173372.1| hypothetical protein B14911_02110 [Bacillus sp. NRRL B-14911] gi|89084777|gb|EAR63917.1| hypothetical protein B14911_02110 [Bacillus sp. NRRL B-14911] Length = 245 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 18/103 (17%), Positives = 39/103 (37%), Gaps = 10/103 (9%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS----APPEGQD 355 T K ++ ITDG ++ + + + + G+ + + V G + Sbjct: 1 MKTGTLKQILLITDGCSNQGE------DPVAMSALAKEQGITVNVIGVMEKDVIDERGMN 54 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + S G V S+ L ++ +T K Q+++ N+ Sbjct: 55 EIDGIALSGGGVSQVVYSQALSQTVQMVTRKAMTQTLQGVVNK 97 >gi|332872319|ref|XP_003319171.1| PREDICTED: collagen alpha-2(VI) chain isoform 2 [Pan troglodytes] Length = 828 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 72/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ + D+L+ V +Q + L + Sbjct: 35 NNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 94 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ ++ Sbjct: 95 RYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR----- 149 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S F + ITDG +G+ L E R G+++++VA + L Sbjct: 150 ---SKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNQNLKEQGL 201 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 202 RDIASTPHELYRNDYATMLPDS-TEIDQDTINRIIKVMKH 240 >gi|332872317|ref|XP_003319170.1| PREDICTED: collagen alpha-2(VI) chain isoform 1 [Pan troglodytes] Length = 918 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 72/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ + D+L+ V +Q + L + Sbjct: 35 NNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 94 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ ++ Sbjct: 95 RYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR----- 149 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S F + ITDG +G+ L E R G+++++VA + L Sbjct: 150 ---SKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNQNLKEQGL 201 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 202 RDIASTPHELYRNDYATMLPDS-TEIDQDTINRIIKVMKH 240 >gi|332872315|ref|XP_531504.3| PREDICTED: collagen alpha-2(VI) chain isoform 3 [Pan troglodytes] Length = 1019 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 72/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ + D+L+ V +Q + L + Sbjct: 35 NNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 94 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ ++ Sbjct: 95 RYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR----- 149 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S F + ITDG +G+ L E R G+++++VA + L Sbjct: 150 ---SKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNQNLKEQGL 201 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 202 RDIASTPHELYRNDYATMLPDS-TEIDQDTINRIIKVMKH 240 >gi|313892786|ref|ZP_07826367.1| von Willebrand factor type A domain protein [Veillonella sp. oral taxon 158 str. F0412] gi|313442717|gb|EFR61128.1| von Willebrand factor type A domain protein [Veillonella sp. oral taxon 158 str. F0412] Length = 230 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 33/189 (17%), Positives = 72/189 (38%), Gaps = 16/189 (8%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 KID L ++ +++ + A +++ + V I T G T + + + K Sbjct: 32 KIDSLYDATIDMIETFSAAQAKEQVIDVAIITF----GTHVELHTKYTPVKDLQAKGICK 87 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + T A+ A + ++ + S + V+ ++DG + Sbjct: 88 FSASGLTPMGTALRMAKDMIEDK-----DVTPSRIYRPAVVLVSDGAPNDDWKSP----M 138 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + R+A + ++VA+ + +L + T D++++ E F I+ I Sbjct: 139 DKFINDGRSAKCQRFAVAIG-NDADRSILERFTQDPNAVLFAEDAKDISEQFKTISMSIS 197 Query: 389 EQSVRIAPN 397 +V APN Sbjct: 198 TMAV--APN 204 >gi|297708173|ref|XP_002830853.1| PREDICTED: hypothetical protein LOC100461231, partial [Pongo abelii] Length = 885 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 72/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ + D+L+ V +Q + L + Sbjct: 657 NNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 716 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ ++ Sbjct: 717 RYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR----- 771 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S F + ITDG +G+ L E R G+++++VA + L Sbjct: 772 ---SKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNQNLKEQGL 823 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 824 RDIASTPHELYRNDYATMLPDS-TEIDQDTINRIIKVMKH 862 >gi|119629723|gb|EAX09318.1| collagen, type VI, alpha 2, isoform CRA_c [Homo sapiens] Length = 1019 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 72/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ + D+L+ V +Q + L + Sbjct: 35 NNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 94 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ ++ Sbjct: 95 RYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR----- 149 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S F + ITDG +G+ L E R G+++++VA + L Sbjct: 150 ---SKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNQNLKEQGL 201 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 202 RDIASTPHELYRNDYATMLPDS-TEIDQDTINRIIKVMKH 240 >gi|119629721|gb|EAX09316.1| collagen, type VI, alpha 2, isoform CRA_a [Homo sapiens] Length = 828 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 72/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ + D+L+ V +Q + L + Sbjct: 35 NNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 94 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ ++ Sbjct: 95 RYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR----- 149 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S F + ITDG +G+ L E R G+++++VA + L Sbjct: 150 ---SKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNQNLKEQGL 201 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 202 RDIASTPHELYRNDYATMLPDS-TEIDQDTINRIIKVMKH 240 >gi|119629722|gb|EAX09317.1| collagen, type VI, alpha 2, isoform CRA_b [Homo sapiens] Length = 918 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 72/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ + D+L+ V +Q + L + Sbjct: 35 NNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 94 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ ++ Sbjct: 95 RYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR----- 149 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S F + ITDG +G+ L E R G+++++VA + L Sbjct: 150 ---SKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNQNLKEQGL 201 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 202 RDIASTPHELYRNDYATMLPDS-TEIDQDTINRIIKVMKH 240 >gi|115527066|ref|NP_478054.2| collagen alpha-2(VI) chain isoform 2C2a precursor [Homo sapiens] Length = 918 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 72/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ + D+L+ V +Q + L + Sbjct: 35 NNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 94 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ ++ Sbjct: 95 RYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR----- 149 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S F + ITDG +G+ L E R G+++++VA + L Sbjct: 150 ---SKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNQNLKEQGL 201 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 202 RDIASTPHELYRNDYATMLPDS-TEIDQDTINRIIKVMKH 240 >gi|115527070|ref|NP_478055.2| collagen alpha-2(VI) chain isoform 2C2a' precursor [Homo sapiens] Length = 828 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 72/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ + D+L+ V +Q + L + Sbjct: 35 NNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 94 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ ++ Sbjct: 95 RYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR----- 149 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S F + ITDG +G+ L E R G+++++VA + L Sbjct: 150 ---SKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNQNLKEQGL 201 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 202 RDIASTPHELYRNDYATMLPDS-TEIDQDTINRIIKVMKH 240 >gi|115527062|ref|NP_001840.3| collagen alpha-2(VI) chain isoform 2C2 precursor [Homo sapiens] gi|125987812|sp|P12110|CO6A2_HUMAN RecName: Full=Collagen alpha-2(VI) chain; Flags: Precursor Length = 1019 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 72/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ + D+L+ V +Q + L + Sbjct: 35 NNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 94 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ ++ Sbjct: 95 RYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR----- 149 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S F + ITDG +G+ L E R G+++++VA + L Sbjct: 150 ---SKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNQNLKEQGL 201 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 202 RDIASTPHELYRNDYATMLPDS-TEIDQDTINRIIKVMKH 240 >gi|41350923|gb|AAH65509.1| Collagen, type VI, alpha 2 [Homo sapiens] gi|190690005|gb|ACE86777.1| collagen, type VI, alpha 2 protein [synthetic construct] gi|190691377|gb|ACE87463.1| collagen, type VI, alpha 2 protein [synthetic construct] Length = 1019 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 72/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ + D+L+ V +Q + L + Sbjct: 35 NNCPEKTDCPIHVYFVLDTSESVTMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 94 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ ++ Sbjct: 95 RYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR----- 149 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S F + ITDG +G+ L E R G+++++VA + L Sbjct: 150 ---SKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNQNLKEQGL 201 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 202 RDIASTPHELYRNDYATMLPDS-TEIDQDTINRIIKVMKH 240 >gi|32394566|gb|AAM93981.1| alpha 1 type VII collagen precursor [Griffithsia japonica] Length = 194 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 58/170 (34%), Gaps = 19/170 (11%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + L + + N+ A + + + N G VG +PL++++ + L Sbjct: 43 TEFAQLNAALVEITNTFSTAAP-GSSFAAVDFSGLQNGGGVGEVVSPLTSDVTTFLAALA 101 Query: 268 KLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 T++ ++ +EL + + K ++ ITDG ++ + Sbjct: 102 ANPKRGGFTSSGTGLNLCDQELDGQAQP-----------KVILLITDGVDNRNPIGVDVE 150 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 ++ G + V + DL + S + V+D L Sbjct: 151 AA------IKANGNTVAVVGIGDNLNDDDLRDTISSSPDLYTFVDDFSAL 194 >gi|319442698|ref|ZP_07991854.1| hypothetical protein CvarD4_13132 [Corynebacterium variabile DSM 44702] Length = 916 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 33/233 (14%), Positives = 75/233 (32%), Gaps = 23/233 (9%) Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 N + + + S P ++ +++ +LV + + Sbjct: 55 QANINELGSCIAATKTADILLILDQSASLKGFDGKPPTDPDNIRVDATRDLVKQLGTHAE 114 Query: 230 EKK---NLSVRIGTIAYNIGI-VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 + N+ + Y+ + N ++ ++ NT+ Y A Sbjct: 115 DLGADINVKLAGFGEGYHNSTGDYGDWVSVGANSGDLDKAVDGFGRR-NTDMYTNYEDAL 173 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE------YMRNAG 339 E + T G + + V+F +DG+ + N +C +RN+G Sbjct: 174 AGASREFAGAPTTDGKSSDCQAVLFFSDGKVTHPEKS-NEQAAADVCRPNSPLVSLRNSG 232 Query: 340 MKIYSVAV---SAPPEGQDLLRKCTDSS-------GQFFAV-NDSRELLESFD 381 ++ ++V + ++LL + + G +F D LL +F Sbjct: 233 VRFFTVGLIPEDETDSPRELLTEMAEGPCGGGEANGAYFDAGTDPAGLLSAFR 285 >gi|311252833|ref|XP_003125290.1| PREDICTED: vitrin-like isoform 2 [Sus scrofa] Length = 634 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 26/266 (9%), Positives = 64/266 (24%), Gaps = 25/266 (9%) Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + + + L + + + + S+ S+ + RS Sbjct: 145 TYPSALTYSSVPGTTAQPVTLMQVPGTTAVEATHTALPKPSPSAGFTTSSLRLQPVGQRS 204 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE---- 215 E P + S S + ID Sbjct: 205 RELGEEPDLWKP-GSVLLDAGFVPKEELSTQSLEPVSQGDPSCKVDLSFLIDGSSSIGKR 263 Query: 216 ----SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 L + Q +G + Y N ++K+ + K+ Sbjct: 264 RFRIQKQFLADVAQALDIGPAGPL--MGVVQYGDNPATQFNLKTHMNSRDLKTAIEKITQ 321 Query: 272 YEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 +N A+ + +++ + + + + DG + + Sbjct: 322 RGGLSNVGRAISFVTKNFFSKSNGNRGGAPN-----VAVVMVDGWPTD--------KVEE 368 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDL 356 R +G+ I+ + + E + Sbjct: 369 ASRLARESGINIFFITIEGAVENEKQ 394 >gi|312072922|ref|XP_003139287.1| von Willebrand factor domain-containing protein [Loa loa] gi|307765555|gb|EFO24789.1| von Willebrand factor domain-containing protein [Loa loa] Length = 2142 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 21/154 (13%), Positives = 52/154 (33%), Gaps = 29/154 (18%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENT 275 V I++ N +R+ I+++ N NN E+ + L L NT Sbjct: 921 LQFAVTLIKQIPPSAFNNRIRVAAISFSSEAQINFQFNEFNNRTEILNALLSLTHSGGNT 980 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 ++ ++ A +E+ G +++ ++ ++DG + + L + + Sbjct: 981 SSVSGINLAIKEILE--------RGREDVRRMIVLMSDGNSQD-----CWEDLLDASDRL 1027 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 +Y++ + +F Sbjct: 1028 HATNTIVYAI---------------AANPDYYFR 1046 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 59/156 (37%), Gaps = 16/156 (10%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + A +L++S+ K+ +++ + + +S NE Sbjct: 130 SSGSVFNVFEDERKLAHDLIDSL--VPVTLKDGRIQVSVMRFASSAEVVIPFKISRTPNE 187 Query: 262 VKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + +L+K+ +T A+ A +L + + + I I+DG + Sbjct: 188 IMEKLDKIKFTGGSTRIAKAVDLALTDLSRWRRNDAI--------QIFILISDG-----N 234 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 ++ ++NA +++++V VS +L Sbjct: 235 GHELWHVAQTAGRKLQNANIEVFAVPVSQDHNLNEL 270 >gi|149377773|ref|ZP_01895506.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Marinobacter algicola DG893] gi|149357945|gb|EDM46434.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Marinobacter algicola DG893] Length = 973 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 29/313 (9%), Positives = 70/313 (22%), Gaps = 52/313 (16%) Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 + + +A I +LDVS SM + Sbjct: 9 TTVLSAAYFTMAGTAAIADDTEIFFTDADGVVKPNI--MFILDVSGSMGTADVGGKTRLR 66 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 M L + A I+ + + + + Sbjct: 67 VMKDVTKDLFADMEDVNVGLMVFGGNEGGYFKSAVSPIENKRAALIDSIEDLSDGGNTPL 126 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + ++ + + + + + + Sbjct: 127 SETLFQSMRYFQGEDY----------------FIRYWDEPYQDANGNWVADVPPGVTEDG 170 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL---QIC-----EYMR-------- 336 + + + +TDGE + + ++ + C + + Sbjct: 171 KYKSPIEYECQPNSV-VLLTDGEPTEDTNHEGDFEAVLGSGACVDNCLDEIAGYMWENDM 229 Query: 337 -------------NAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 + + Y+V Q LL G +F ++ L +FD Sbjct: 230 IPPSNDPDDDFRGDQRISTYTVGFKT---DQKLLGDAAAKGNGTYFKAENAASLKSAFDD 286 Query: 383 ITDKIQEQSVRIA 395 + + +S A Sbjct: 287 LFTDVLARSTTFA 299 >gi|254786772|ref|YP_003074201.1| lipoprotein [Teredinibacter turnerae T7901] gi|237685641|gb|ACR12905.1| putative lipoprotein [Teredinibacter turnerae T7901] Length = 1051 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 33/275 (12%), Positives = 86/275 (31%), Gaps = 17/275 (6%) Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 L +++ S + VS + ++ + ND + P Sbjct: 157 GYNLVPFGIQTQAPSIVVSGLRVVDRATNKPVSGLTLNDFVVRENDEALGSEAFLDSEPV 216 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 + K + + + I L ++A ++ + L Y Sbjct: 217 SNANLIIKTVLL--LDISTSLSESDIITLKQAARAVIYEENNHGVKVSRLIPGQRVAIYT 274 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLN---PYENT---NTYPAMHHAYRELYNEKESSHNT 298 + S+++ + ++ + T + A + + + Sbjct: 275 FDSQVERIIGFSSDVGALADAIDSIPEKIVEGGTTLRGNSTNLIGAVQTGVAQWTNRFGL 334 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK--IYSVAVSAPPEGQDL 356 + + I +TDGE++ A ++ + AG++ +Y++AV Sbjct: 335 DDAET--GYAILVTDGEHTSDDATPASIQGQLV----NAAGVRKDVYAIAV-RNNADMAA 387 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 L + T ++ + + D +L ++ + EQ+ Sbjct: 388 LSEITGTTNKVYQALDISDLNLRLQEVQAEAIEQT 422 >gi|237507530|ref|ZP_04520245.1| flp pilus assembly protein TadG [Burkholderia pseudomallei MSHR346] gi|234999735|gb|EEP49159.1| flp pilus assembly protein TadG [Burkholderia pseudomallei MSHR346] Length = 418 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 32/261 (12%), Positives = 82/261 (31%), Gaps = 5/261 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 26 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 85 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY--IAESKAQYEIPT 115 + +Q ++ S + + N ++ Sbjct: 86 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 145 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + L L + A ++S + + + AI + + +++ + +N + ++ Sbjct: 146 QTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 205 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + P +F S A + A + + G+ V + + Sbjct: 206 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGNKAATTNAYN 265 Query: 236 VRIGTIAYNIGIVGNQCTPLS 256 R G A + Sbjct: 266 TRFGIYANPYKNPSYGTPDFT 286 >gi|261416578|ref|YP_003250261.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|261373034|gb|ACX75779.1| von Willebrand factor type A [Fibrobacter succinogenes subsp. succinogenes S85] gi|302325972|gb|ADL25173.1| von Willebrand factor type A domain protein [Fibrobacter succinogenes subsp. succinogenes S85] Length = 236 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 33/210 (15%), Positives = 74/210 (35%), Gaps = 24/210 (11%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV-----GNQCTP 254 K+D L + N+++S + A + + I + + Sbjct: 23 VSGSMNEIGKLDSLKHALNNMISSFKDASSSSLEAEIYVSIITFGNQAANIILEPQSASE 82 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 ++N+ +++ + +NK+ NT A+ L N S + F++ +DG Sbjct: 83 IANDPSKM-NVINKMQAIGNTPLGKALTSLVDLLEN-----REIYPSRAYRPFIVLASDG 136 Query: 315 ENSG-ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ-FFAVND 372 + + L + + ++A+ A + +L+K ++ F N+ Sbjct: 137 MPNDLWQQPLDRLLNSERSKKANRL-----ALAIGADA-DESMLKKFVNNEEMPIFKANN 190 Query: 373 SRELLESFD-----KITDKIQEQSVRIAPN 397 + E+ + F I + IAPN Sbjct: 191 AIEIQKFFKCVTMSAIKSSQSAKPGEIAPN 220 >gi|150024647|ref|YP_001295473.1| outer membrane protein precursor YfbK [Flavobacterium psychrophilum JIP02/86] gi|149771188|emb|CAL42655.1| Probable outer membrane protein precursor YfbK [Flavobacterium psychrophilum JIP02/86] Length = 631 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 23/150 (15%), Positives = 45/150 (30%), Gaps = 11/150 (7%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 ++ + Y N + + L KLN +T + AY+ Sbjct: 309 KVAIVVYAGAAGLVLPPTAGNEKQTIINALEKLNAGGSTAGGAGIELAYKTAQENFIKDG 368 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 N VI TDG+ + S + + E R +G+ + + Sbjct: 369 NNR--------VILATDGDFNVGSTSDS--AMQTLIEDKRESGVFLTCLGYGMGNYKDSK 418 Query: 357 LRKCTDSS-GQFFAVNDSRELLESFDKITD 385 + D G + +++ +E K Sbjct: 419 MEILADKGNGNYAYIDNIQEANRFLGKEFK 448 >gi|306845564|ref|ZP_07478133.1| norD protein [Brucella sp. BO1] gi|306273885|gb|EFM55712.1| norD protein [Brucella sp. BO1] Length = 633 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 45/132 (34%), Gaps = 20/132 (15%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T AM HA +L + KK ++ +TDG+ + Sbjct: 515 TVEHRIAALKPGFYTRMGAAMRHATAKLAEQPNR----------KKLLLLLTDGKPNDVD 564 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRE 375 ++ + R G+ +++V V L G + V + + Sbjct: 565 HYEGRFALEDSRRAAGEARAKGVNVFAVTVDREASAYLPALF-----GRGGYALVANLAK 619 Query: 376 LLESFDKITDKI 387 L + I + Sbjct: 620 LPVALPAIYRML 631 >gi|261313650|ref|ZP_05952847.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10] gi|261302676|gb|EEY06173.1| conserved hypothetical protein [Brucella pinnipedialis M163/99/10] Length = 391 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 45/132 (34%), Gaps = 20/132 (15%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T AM HA +L + KK ++ +TDG+ + Sbjct: 273 TVEHRIAALKPGFYTRMGAAMRHATAKLAEQPNR----------KKLLLLLTDGKPNDVD 322 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRE 375 ++ + R G+ +++V V L G + V + + Sbjct: 323 HYEGRFALEDSRRAAGEARAKGVNVFAVTVDREASAYLPALF-----GRGGYALVANLAK 377 Query: 376 LLESFDKITDKI 387 L + I + Sbjct: 378 LPVALPAIYRML 389 >gi|256015034|ref|YP_003105043.1| nitric-oxide reductase NorD protein [Brucella microti CCM 4915] gi|255997694|gb|ACU49381.1| nitric-oxide reductase NorD protein [Brucella microti CCM 4915] Length = 633 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 45/132 (34%), Gaps = 20/132 (15%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T AM HA +L + KK ++ +TDG+ + Sbjct: 515 TVEHRIAALKPGFYTRMGAAMRHATAKLAEQPNR----------KKLLLLLTDGKPNDVD 564 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRE 375 ++ + R G+ +++V V L G + V + + Sbjct: 565 HYEGRFALEDSRRAAGEARAKGVNVFAVTVDREASAYLPALF-----GRGGYALVANLAK 619 Query: 376 LLESFDKITDKI 387 L + I + Sbjct: 620 LPVALPAIYRML 631 >gi|254719982|ref|ZP_05181793.1| hypothetical protein Bru83_10644 [Brucella sp. 83/13] gi|265984989|ref|ZP_06097724.1| norD [Brucella sp. 83/13] gi|306837885|ref|ZP_07470746.1| norD protein [Brucella sp. NF 2653] gi|264663581|gb|EEZ33842.1| norD [Brucella sp. 83/13] gi|306407055|gb|EFM63273.1| norD protein [Brucella sp. NF 2653] Length = 633 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 45/132 (34%), Gaps = 20/132 (15%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T AM HA +L + KK ++ +TDG+ + Sbjct: 515 TVEHRIAALKPGFYTRMGAAMRHATAKLAEQPNR----------KKLLLLLTDGKPNDVD 564 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRE 375 ++ + R G+ +++V V L G + V + + Sbjct: 565 HYEGRFALEDSRRAAGEARAKGVNVFAVTVDREASAYLPALF-----GRGGYALVANLAK 619 Query: 376 LLESFDKITDKI 387 L + I + Sbjct: 620 LPVALPAIYRML 631 >gi|254712101|ref|ZP_05173912.1| hypothetical protein BcetM6_01657 [Brucella ceti M644/93/1] gi|254715172|ref|ZP_05176983.1| hypothetical protein BcetM_01672 [Brucella ceti M13/05/1] gi|261216880|ref|ZP_05931161.1| conserved hypothetical protein [Brucella ceti M13/05/1] gi|261319749|ref|ZP_05958946.1| conserved hypothetical protein [Brucella ceti M644/93/1] gi|260921969|gb|EEX88537.1| conserved hypothetical protein [Brucella ceti M13/05/1] gi|261292439|gb|EEX95935.1| conserved hypothetical protein [Brucella ceti M644/93/1] Length = 633 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 45/132 (34%), Gaps = 20/132 (15%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T AM HA +L + KK ++ +TDG+ + Sbjct: 515 TVEHRIAALKPGFYTRMGAAMRHATAKLAEQPNR----------KKLLLLLTDGKPNDVD 564 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRE 375 ++ + R G+ +++V V L G + V + + Sbjct: 565 HYEGRFALEDSRRAAGEARAKGVNVFAVTVDREASAYLPALF-----GRGGYALVANLAK 619 Query: 376 LLESFDKITDKI 387 L + I + Sbjct: 620 LPVALPAIYRML 631 >gi|260818553|ref|XP_002604447.1| hypothetical protein BRAFLDRAFT_122285 [Branchiostoma floridae] gi|229289774|gb|EEN60458.1| hypothetical protein BRAFLDRAFT_122285 [Branchiostoma floridae] Length = 497 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 16/70 (22%), Positives = 28/70 (40%), Gaps = 1/70 (1%) Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 + + ITDG S Q T + R+AG+++Y+V V + L+ Sbjct: 147 DGVPSAAVVITDGSAQSDSEGQVTDDYAAQAAEARDAGIQLYAVGVGDEVDDAA-LQAIA 205 Query: 362 DSSGQFFAVN 371 S + F + Sbjct: 206 GSPDRVFDSD 215 >gi|161620327|ref|YP_001594213.1| von Willebrand factor type A [Brucella canis ATCC 23365] gi|163844435|ref|YP_001622090.1| hypothetical protein BSUIS_B0256 [Brucella suis ATCC 23445] gi|225628708|ref|ZP_03786742.1| Protein norD [Brucella ceti str. Cudo] gi|254699541|ref|ZP_05161369.1| hypothetical protein Bsuib55_01579 [Brucella suis bv. 5 str. 513] gi|254702662|ref|ZP_05164490.1| hypothetical protein Bsuib36_01702 [Brucella suis bv. 3 str. 686] gi|254706206|ref|ZP_05168034.1| hypothetical protein BpinM_04195 [Brucella pinnipedialis M163/99/10] gi|254711500|ref|ZP_05173311.1| hypothetical protein BpinB_14857 [Brucella pinnipedialis B2/94] gi|256029869|ref|ZP_05443483.1| hypothetical protein BpinM2_04290 [Brucella pinnipedialis M292/94/1] gi|256059516|ref|ZP_05449716.1| hypothetical protein Bneo5_04090 [Brucella neotomae 5K33] gi|256158038|ref|ZP_05455956.1| hypothetical protein BcetM4_04255 [Brucella ceti M490/95/1] gi|256253005|ref|ZP_05458541.1| hypothetical protein BcetB_01592 [Brucella ceti B1/94] gi|260167029|ref|ZP_05753840.1| hypothetical protein BruF5_01342 [Brucella sp. F5/99] gi|260568430|ref|ZP_05838899.1| von Willebrand factor [Brucella suis bv. 4 str. 40] gi|261220099|ref|ZP_05934380.1| protein norD [Brucella ceti B1/94] gi|261319110|ref|ZP_05958307.1| norD [Brucella pinnipedialis B2/94] gi|261323486|ref|ZP_05962683.1| protein norD [Brucella neotomae 5K33] gi|261749993|ref|ZP_05993702.1| protein norD [Brucella suis bv. 5 str. 513] gi|261753246|ref|ZP_05996955.1| protein norD [Brucella suis bv. 3 str. 686] gi|261756415|ref|ZP_06000124.1| von Willebrand factor [Brucella sp. F5/99] gi|265986887|ref|ZP_06099444.1| protein norD [Brucella pinnipedialis M292/94/1] gi|265996553|ref|ZP_06109110.1| protein norD [Brucella ceti M490/95/1] gi|294853265|ref|ZP_06793937.1| norD [Brucella sp. NVSL 07-0026] gi|161337138|gb|ABX63442.1| von Willebrand factor type A [Brucella canis ATCC 23365] gi|163675158|gb|ABY39268.1| Hypothetical protein, conserved [Brucella suis ATCC 23445] gi|225616554|gb|EEH13602.1| Protein norD [Brucella ceti str. Cudo] gi|260155095|gb|EEW90176.1| von Willebrand factor [Brucella suis bv. 4 str. 40] gi|260918683|gb|EEX85336.1| protein norD [Brucella ceti B1/94] gi|261298333|gb|EEY01830.1| norD [Brucella pinnipedialis B2/94] gi|261299466|gb|EEY02963.1| protein norD [Brucella neotomae 5K33] gi|261736399|gb|EEY24395.1| von Willebrand factor [Brucella sp. F5/99] gi|261739746|gb|EEY27672.1| protein norD [Brucella suis bv. 5 str. 513] gi|261742999|gb|EEY30925.1| protein norD [Brucella suis bv. 3 str. 686] gi|262550850|gb|EEZ07011.1| protein norD [Brucella ceti M490/95/1] gi|264659084|gb|EEZ29345.1| protein norD [Brucella pinnipedialis M292/94/1] gi|294818920|gb|EFG35920.1| norD [Brucella sp. NVSL 07-0026] Length = 633 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 45/132 (34%), Gaps = 20/132 (15%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T AM HA +L + KK ++ +TDG+ + Sbjct: 515 TVEHRIAALKPGFYTRMGAAMRHATAKLAEQPNR----------KKLLLLLTDGKPNDVD 564 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRE 375 ++ + R G+ +++V V L G + V + + Sbjct: 565 HYEGRFALEDSRRAAGEARAKGVNVFAVTVDREASAYLPALF-----GRGGYALVANLAK 619 Query: 376 LLESFDKITDKI 387 L + I + Sbjct: 620 LPVALPAIYRML 631 >gi|148558518|ref|YP_001257290.1| norD protein [Brucella ovis ATCC 25840] gi|148369803|gb|ABQ62675.1| norD protein [Brucella ovis ATCC 25840] Length = 633 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 45/132 (34%), Gaps = 20/132 (15%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T AM HA +L + KK ++ +TDG+ + Sbjct: 515 TVEHRIAALKPGFYTRMGAAMRHATAKLAEQPNR----------KKLLLLLTDGKPNDVD 564 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRE 375 ++ + R G+ +++V V L G + V + + Sbjct: 565 HYEGRFALEDSRRAAGEARAKGVNVFAVTVDREASAYLPALF-----GRGGYALVANLAK 619 Query: 376 LLESFDKITDKI 387 L + I + Sbjct: 620 LPVALPAIYRML 631 >gi|55377967|ref|YP_135817.1| von Willebrand factor type A like metal binding protein [Haloarcula marismortui ATCC 43049] gi|55230692|gb|AAV46111.1| von Willebrand factor type A like metal binding protein [Haloarcula marismortui ATCC 43049] Length = 394 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 12/132 (9%), Positives = 47/132 (35%), Gaps = 11/132 (8%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + + ++ T+ Y + A L + + V+ ++DG+ Sbjct: 95 TISRETAVDAVADISAGGGTDMYSGLLEAKASLQD-------LPTDDNTARRVLLLSDGK 147 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 ++ + + + G++I + + + + + T + G++ ++ + + Sbjct: 148 DNSH----DPEAFGTLAREIDTEGIRIKAAGIGSDYREETIRTLGTVARGEWTHLDAAGD 203 Query: 376 LLESFDKITDKI 387 + F ++ Sbjct: 204 IESFFGDAVEEA 215 >gi|256819778|ref|YP_003141057.1| hypothetical protein Coch_0941 [Capnocytophaga ochracea DSM 7271] gi|256581361|gb|ACU92496.1| hypothetical protein Coch_0941 [Capnocytophaga ochracea DSM 7271] Length = 458 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 36/379 (9%), Positives = 94/379 (24%), Gaps = 37/379 (9%) Query: 16 AIDLAHI--MYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFK-------- 65 +D++ + +M A D A P + S + Sbjct: 75 TVDVSRNDESTMSAKMSVAKDVATSKPAKRTEPKHRKHRPESDPQPQSGLVTAGEWNDLN 134 Query: 66 --KQIKKHLKQGSYIRENA-GDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 +K L + + + +I + N A Sbjct: 135 SWDFYQKTLNKNEFASFPEHWQMYTNHRIAVLVTANGKPAVNATVALYRNNTLLWTAKTD 194 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 A +S + N V D + + + Sbjct: 195 NTGKAELWVS----AFQKEKELNTEHLRLKVNDQ---------WVSTEKAISENTLNRIA 241 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 + S + A + +++ L N+++ +++ + +S Sbjct: 242 LKNEVKKASNEVQIAFMVDATGSMSDELEFLKMDLKNVISKVEEGNKN-LKISTATVFYR 300 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 + + + ++N+ +N + A+H A +L Sbjct: 301 DEGDEYVVKHSDFTKDINKTIQFINSQKADGGGDFPEAVHTALNQLNK------LQWDGE 354 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV-AVSAPPEGQDLLRKCT 361 + + D + + + G+K+ + A + L+R Sbjct: 355 ARTRIAFLVLDAPP--HHEDKVLKSVQASVKTAAEKGIKLIPIVASGIDKPTEFLMRFMA 412 Query: 362 D-SSGQFFAVNDSRELLES 379 ++G + + D + + Sbjct: 413 IYTNGTYVFITDDSGIGNA 431 >gi|227827418|ref|YP_002829197.1| von Willebrand factor A [Sulfolobus islandicus M.14.25] gi|229584633|ref|YP_002843134.1| von Willebrand factor A [Sulfolobus islandicus M.16.27] gi|238619574|ref|YP_002914399.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] gi|227459213|gb|ACP37899.1| von Willebrand factor type A [Sulfolobus islandicus M.14.25] gi|228019682|gb|ACP55089.1| von Willebrand factor type A [Sulfolobus islandicus M.16.27] gi|238380643|gb|ACR41731.1| von Willebrand factor type A [Sulfolobus islandicus M.16.4] Length = 356 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 41/97 (42%), Gaps = 8/97 (8%) Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 L+ + N +++ +I +TDG+ + + N + +I ++ Sbjct: 107 TRLHEAVNFTINLAKQSQVPTKIIMLTDGKPT------DKRNVKDYEKLDIPPNTQIITI 160 Query: 346 AVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFD 381 + + +L+K D SSG+F+ + D EL F+ Sbjct: 161 GIG-NDYNERILKKLADRSSGKFYHIKDISELPNIFE 196 >gi|295789109|ref|NP_001171442.1| vitrin isoform 4 [Homo sapiens] gi|119620822|gb|EAX00417.1| vitrin, isoform CRA_b [Homo sapiens] Length = 656 Score = 45.7 bits (106), Expect = 0.013, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 39/120 (32%), Gaps = 14/120 (11%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 310 MGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 367 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 368 ---NRSGAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 416 >gi|296448099|ref|ZP_06890001.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] gi|296254413|gb|EFH01538.1| conserved hypothetical protein [Methylosinus trichosporium OB3b] Length = 618 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 30/304 (9%), Positives = 73/304 (24%), Gaps = 15/304 (4%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAA----VLSGCASIVSDRTIKDPTTK 56 M A ++ + + ++ + + I+ + Q D A L+ + + Sbjct: 1 MFAFVLPLVIGVCSLVVEFGNALLIKARYQRVADIASFSGALAYSGTSSTTTMTNASVAV 60 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 + +A +A A + + A Sbjct: 61 AALNQVPAAAVSANLTVSPASSSNSAVLVAISAAHPLFLTPVLNGPRSLQIAASAYSQIA 120 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 L+ + + +G ++ + V S + + Sbjct: 121 VSASSCLLALDSSKSGVTLSGGTSVTATSC---------VVASNSSVTVPCGTSIAAKAV 171 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 + Y P + Y P + + + S Sbjct: 172 DYYSSSAPSAPCSGISGSIVKAYTTDPVAGTSGLTTATARLTTVAAQTAPSAPSVSTGSN 231 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 +GN CT +++ K + P T T+ A+ + + ++ Sbjct: 232 VSFGYTAASMSIGNGCTATTSSAYSGKWTVT--CPAGGTYTFGAITTSATLSFATSGTAA 289 Query: 297 NTIG 300 T Sbjct: 290 TTYN 293 >gi|291242482|ref|XP_002741137.1| PREDICTED: chloride channel calcium activated 3-like [Saccoglossus kowalevskii] Length = 975 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 34/280 (12%), Positives = 81/280 (28%), Gaps = 26/280 (9%) Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 I + +F + + S + + S L D N Sbjct: 260 IKSSIMFAQWIHGVEEFCHDDPSDMYSFHNRMAPNRQNIICEGQSAWGVMLENTDFKDGN 319 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 N T + + +S + + + L S I+ + Sbjct: 320 NPTRII-------DDTTPNFIIKRSGVRRVVLVLDTSGSMDGDRIQRLYQSATYFIENRI 372 Query: 233 NLSVRIGTIAYNIGI---VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 +G + ++ +EV S++ + T+ + A + L Sbjct: 373 EDGSFVGIVGFSSYAVILASMTELKYGYQRSEVSSKVPQ-EADGATSIGGGVRLALQVLQ 431 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + +S ++ ITDG + N + + + ++G+++ ++A Sbjct: 432 DGNVTSEGAS--------LLLITDGVENTYPFLMN------VMQEVYDSGVRVDTIAF-T 476 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 L + +++ G + V + D + I E Sbjct: 477 EAAQSTLQQLSSNTGGLYNYVPEDDNSTAFIDSLAATINE 516 >gi|307107982|gb|EFN56223.1| hypothetical protein CHLNCDRAFT_35166 [Chlorella variabilis] Length = 329 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 35/271 (12%), Positives = 84/271 (30%), Gaps = 45/271 (16%) Query: 117 NLFLKGLIPSALTNLSLRSTGIIER---------SSENLAISICMVLDVSRSMEDLYLQK 167 L+ GL P L S + +++ +VLDVS SM + Sbjct: 78 PLYSVGLSPDPLLGTPASSEFYMAVGLDSGMKAADFARKQLNLVVVLDVSGSMGSPF--- 134 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 ++ + + P +KIDV E +V ++ Sbjct: 135 ------------------DSYYYDQTVQPTAGVPDEGETKKKIDVAKEVLAGIVGLLR-- 174 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPL--SNNLNEVKSRLN-KLNPYENTNTYPAMHHA 284 + + ++ + +++++K +++ + TN + Sbjct: 175 ------PDDSLSVVLFSDAACVPKPLGPVRCADVDKLKEQISADVVEMGGTNFQAGIDAG 228 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 +L + ++ ++ V+F+TD + + + L + + G+ Sbjct: 229 GAQLTGCA--ACMEANASLVENRVVFLTDAQPNAG--DDSEQGLLARIKALSADGIYTTI 284 Query: 345 VAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + V Q + +F+V+ E Sbjct: 285 IGVGLDFNTQLVESIGKVRGSNYFSVHTPGE 315 >gi|226309704|ref|YP_002769598.1| hypothetical protein BBR47_01170 [Brevibacillus brevis NBRC 100599] gi|226092652|dbj|BAH41094.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599] Length = 253 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 13/103 (12%), Positives = 39/103 (37%), Gaps = 10/103 (9%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV----SAPPEGQD 355 + ++ +TDG ++ + + R G+ + + V +G+ Sbjct: 1 MKEATLRQILVVTDGCSNSGMS------PVAAAALAREQGITVNVIGVIDKSELGEKGEK 54 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 +R ++ G + ++L ++ +T K +++ N+ Sbjct: 55 EIRDIAEAGGGLCDIVYPQQLAQTVQMLTRKAMTRTIHQVVNK 97 >gi|118390982|ref|XP_001028302.1| hypothetical protein TTHERM_02533180 [Tetrahymena thermophila] gi|89281272|gb|EAR80639.1| hypothetical protein TTHERM_02533180 [Tetrahymena thermophila SB210] Length = 202 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 13/70 (18%), Positives = 28/70 (40%), Gaps = 5/70 (7%) Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF-DKITDK----IQ 388 ++ I+S +G + + G F+ V + ++ E F D + Q Sbjct: 18 KLQEESFTIHSFGFGNDHDGPLMQKIAQIKDGSFYFVEKNDQVDEFFIDALGGLFSVVAQ 77 Query: 389 EQSVRIAPNR 398 + +++I NR Sbjct: 78 DLTIKIEINR 87 >gi|293571190|ref|ZP_06682227.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291608742|gb|EFF38027.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1219 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 26/274 (9%), Positives = 74/274 (27%), Gaps = 37/274 (13%) Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT---------------SNKYLLPPP 184 + + I +V+D+S SME ND Y+ Sbjct: 330 NEQQDIKPVDIVLVVDMSGSMESSQSNGWNDRAGAARNGVKNFLQTIKDAGIGDYVNVGL 389 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 S T + Y P + + + + + + + Sbjct: 390 VGFSSPGYVTGPNGYLTVPIGKASDTSHI-NAINDALKPKFTGGTYTQIGIEQGQQMLAG 448 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 L++ + ++ + T + E N + + + + Sbjct: 449 SSNENKMMIVLTDGVPTFSKKVTAAQTIDGTTYATKFGNTLDEPRNTSKLNSSYEVGSW- 507 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK----- 359 + + TL + ++AG+ I+++ + +G L + Sbjct: 508 -----------GNRTNINSTWPATLGAAKIAKDAGLTIHTLGIQLSKDGNFLTEQQVRDR 556 Query: 360 ---CTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + G++ + ++ + ++ + + Sbjct: 557 ASLIA-TPGKYKDAETTNDVSDYLNEQAKNVVKS 589 >gi|229523713|ref|ZP_04413118.1| hypothetical protein VCA_001283 [Vibrio cholerae bv. albensis VL426] gi|229337294|gb|EEO02311.1| hypothetical protein VCA_001283 [Vibrio cholerae bv. albensis VL426] Length = 886 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 33/296 (11%), Positives = 84/296 (28%), Gaps = 28/296 (9%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 ++ +SI + + + T+ Q + + + G+Y A + T Sbjct: 132 IADGSSISNSQDTTNNTSADGQLNLSNVAHLSMGIPTGNYTSNGAAISWVLSADKQTLTG 191 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGL-----IPSALTNLSLRSTGIIERSSENLAISICM 152 + + E + + + +++ + + Sbjct: 192 SAGGNKVVEFTLDNQGKVHSTLHSPIDHANKSGEDSLAINIPLEAKNAAGAIGTGRVTLV 251 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 + D + +D++ ++ + + +L ++ V Sbjct: 252 IEDDAPIAKDIFHMTESETKQGANVQLMLDV------------SGSMGRDAGNGKTRLQV 299 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNP 271 + ESA L+ Q Q ++ I ++ + L + E K+ +N L+ Sbjct: 300 MKESAIQLIEQYQALGQT------KVQLILFSSDASIKTASGLLWMTVAEAKNYINALSA 353 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 T+ A+ A + N + F++DG G Sbjct: 354 NGGTDYDDAIKLA----QESWSGTINGQPLSGATNVSYFLSDGVPEGYDWELKNSQ 405 >gi|322436659|ref|YP_004218871.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321164386|gb|ADW70091.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 316 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 29/173 (16%), Positives = 70/173 (40%), Gaps = 20/173 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYENTNTYPAMHHAYRELYNEK 292 + + + I + + +N+ E+K ++K +N T + L + Sbjct: 106 TPKDEAFLISFDINVDLLSDYTNSPREIKRSIDKATINTGAGTGSVTGNSTPKGTLLYDA 165 Query: 293 E--SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA- 349 ++H+ + +K ++ +TDG + G + E + A +Y + ++ Sbjct: 166 VYLAAHDKLRQEAGRKILVMLTDGGDQG-----SQETLKTATEAAQKANAIVYVILIADR 220 Query: 350 ---------PPEGQDLLRKCTDSSGQFFAV-NDSRELLESFDKITDKIQEQSV 392 +D+ D+ G+ V N+ R+L ++FD+I D+++ Q + Sbjct: 221 GFYSGGGFSFGGDRDMESLAHDTGGRVINVGNNGRKLEDAFDQIQDELRTQYL 273 >gi|227830111|ref|YP_002831890.1| von Willebrand factor A [Sulfolobus islandicus L.S.2.15] gi|227456558|gb|ACP35245.1| von Willebrand factor type A [Sulfolobus islandicus L.S.2.15] Length = 356 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 41/97 (42%), Gaps = 8/97 (8%) Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 L+ + N +++ +I +TDG+ + + N + +I ++ Sbjct: 107 TRLHEAVSFTINLAKQSQVPTKIIMLTDGKPT------DKRNVKDYEKLDIPPNTQIITI 160 Query: 346 AVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFD 381 + + +L+K D SSG+F+ + D EL F+ Sbjct: 161 GIG-NDYNERILKKLADRSSGKFYHIKDISELPNIFE 196 >gi|126306102|ref|XP_001362319.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Monodelphis domestica] Length = 911 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 21/205 (10%), Positives = 70/205 (34%), Gaps = 36/205 (17%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 +++ L +++ + I + G + ++ + + Sbjct: 316 SMKVGNRLNRLRQASQFFLLQIIEKG-------SWTGVVTFDSSATIQSELIQIESDVQR 368 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 K+ +++L + + A+ + + + + +TDGE++ Sbjct: 369 KTLISRLPTVTVAGGGAHICSGLRTAFMVVKKKFLTD---------GSEMALLTDGEDNT 419 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND---SRE 375 + E ++ +G I+++ + P + L + + G D + Sbjct: 420 TNTCF---------EEVKQSGAIIHTIVLG-PSTEKGLEKLSEMTGGMKTTATDNVQNNG 469 Query: 376 LLESFDKITD---KIQEQSVRIAPN 397 L+++F ++ I ++S+++ Sbjct: 470 LIDAFSALSSGNAAITQRSIKLESK 494 >gi|295789104|ref|NP_001171440.1| vitrin isoform 2 [Homo sapiens] gi|74739159|sp|Q6UXI7|VITRN_HUMAN RecName: Full=Vitrin; Flags: Precursor gi|37181801|gb|AAQ88704.1| VIT [Homo sapiens] gi|119620824|gb|EAX00419.1| vitrin, isoform CRA_d [Homo sapiens] Length = 678 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 39/120 (32%), Gaps = 14/120 (11%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 332 MGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 389 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 390 ---NRSGAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 438 >gi|70606980|ref|YP_255850.1| hypothetical protein Saci_1211 [Sulfolobus acidocaldarius DSM 639] gi|68567628|gb|AAY80557.1| conserved protein [Sulfolobus acidocaldarius DSM 639] Length = 380 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 24/153 (15%), Positives = 51/153 (33%), Gaps = 26/153 (16%) Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 I++ + + ++ ++ T Y A+ A Sbjct: 75 ISFLTFSNNVNILSEYADAPSLVQQIKQIRSGGQTVLYRALERAIEIAKKH--------- 125 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA-PPEGQDLLRK 359 L ++I +TDG+ + + L E + +A + LL+ Sbjct: 126 --DLPGYIILLTDGQPT-DVPETDAYEKLNYPEAYK-------VIAFGIGDDYNERLLKV 175 Query: 360 CTD-SSGQFFAVNDSRELLE-----SFDKITDK 386 TD ++G + V D++E+ E + +I K Sbjct: 176 ITDKTAGILYHVEDAKEIAEMLPQSAVTEIGAK 208 >gi|8567336|ref|NP_059502.1| calcium-activated chloride channel regulator 1 precursor [Mus musculus] gi|81881572|sp|Q9D7Z6|CLCA1_MOUSE RecName: Full=Calcium-activated chloride channel regulator 1; AltName: Full=Calcium-activated chloride channel family member 3; Short=mCLCA3; AltName: Full=Protein gob-5; Flags: Precursor gi|3721912|dbj|BAA33743.1| gob-5 [Mus musculus] gi|15919901|dbj|BAB25815.2| unnamed protein product [Mus musculus] gi|74201990|dbj|BAE22995.1| unnamed protein product [Mus musculus] gi|109731429|gb|AAI16320.1| Chloride channel calcium activated 3 [Mus musculus] gi|109732845|gb|AAI16319.1| Chloride channel calcium activated 3 [Mus musculus] gi|148680065|gb|EDL12012.1| chloride channel calcium activated 3 [Mus musculus] Length = 913 Score = 45.7 bits (106), Expect = 0.014, Method: Composition-based stats. Identities = 19/131 (14%), Positives = 53/131 (40%), Gaps = 25/131 (19%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+ + A+ + + + ++ +TDGE++ S+ + + Sbjct: 382 GGTSICSGLRTAFTVIKKKYPTD---------GSEIVLLTDGEDNTISSCFDLV------ 426 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND---SRELLESFDKITD---K 386 + +G I++VA+ P ++L + + G +D + L+++F ++ Sbjct: 427 ---KQSGAIIHTVALG-PAAAKELEQLSKMTGGLQTYSSDQVQNNGLVDAFAALSSGNAA 482 Query: 387 IQEQSVRIAPN 397 I + S+++ Sbjct: 483 IAQHSIQLESR 493 >gi|123428709|ref|XP_001307556.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121889192|gb|EAX94626.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 667 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 43/329 (13%), Positives = 104/329 (31%), Gaps = 45/329 (13%) Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA 127 + + + I N G++ +I + + Y+A Q + K Sbjct: 95 VLGYSLSENAIIFNFGNLPIDTKIEVHY----TISYLATINNQGFFFRFPIASKDQYGYD 150 Query: 128 LTNLSLRSTGIIERSSENL-AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 + + S + ++ +N+ I ++ + N+ + + L+ K Sbjct: 151 SSLPNTISFSLKIKTDKNIRKIEANNSAIINEIDKHNASINLNEFESSIFVQTLISDQDK 210 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA---- 242 + S + + + R + +++ + + +V+ + Sbjct: 211 CTAVSSDDYIAVSVYKEFISKRNDYECLSDYFFVIDCSGSMEGKLIDKAVKCMRLMLQSL 270 Query: 243 --------YNIGIVGNQCTPL--SNNLNEVK--SRLNKLNP-YENTNTYPAMHHAYRELY 289 Y G Q P+ NN N + + + + TN Y + + Sbjct: 271 PMKCRFSIYCFGYNFRQLLPIVEYNNENVLLAMNLIKNIKANMGGTNIYNPLKDIF---- 326 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + K + +TDGE N+ + + E + G IY+V + + Sbjct: 327 ----------SQDGMLKKIFLLTDGEVD------NSEEIINLVEKNKAFG-NIYTVGIGS 369 Query: 350 PPEGQDLLRKCTD-SSGQFFAVNDSRELL 377 L+R + ++G++ V D+ Sbjct: 370 GA-DPGLIRNLAEVTNGKWTYVLDNENFN 397 >gi|309364927|emb|CAP23537.2| CBR-CLEC-143 protein [Caenorhabditis briggsae AF16] Length = 666 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 26/174 (14%), Positives = 60/174 (34%), Gaps = 9/174 (5%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTI---AYNIGIVGNQCTPLSNNLNEVKSRL-NK 268 + + ++ SI K K R + Y+ S + +++ + + + Sbjct: 330 VRNTLTQVLGSISKIGPVKYPADPRSTCVGIVTYDDNATTQSQLDASKSFSDLYNVIQSS 389 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L +NTNT K+ + T K + F D + L+ Sbjct: 390 LISVDNTNTSYLSLALLAAEKALKDGRNRTYRFNYKKVIIAFAAD-----YQGHGTALDA 444 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + I +++ + I +VA ++ + Q ++ D+ +L +F + Sbjct: 445 MPIANRLKDNAVTIITVACTSNSDKQTAIQGIASPGFDLVDEMDTPKLPTAFAQ 498 >gi|297265788|ref|XP_001107747.2| PREDICTED: vitrin-like isoform 3 [Macaca mulatta] Length = 693 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 38/367 (10%), Positives = 90/367 (24%), Gaps = 40/367 (10%) Query: 46 SDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA 105 S T + Q K + + + + T A Sbjct: 148 SALTYSSSKSPAAQAGETTKAYQRPPIPATTAQPVTLMQLLAVTVAVATPTILPRPSPSA 207 Query: 106 ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGI-------IERSSENLAISICMVLDVSR 158 S P + T S + A + DVS Sbjct: 208 ASTTSIRRPEPVGHRSQEMDLWSTTTYTSSQNRPGADPGIQRQDPSGAAFEKPVRADVSL 267 Query: 159 SMEDLYLQKHN-----DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 D + + L P K + + R+ + Sbjct: 268 GEMDSWKPGSVLLDEGRVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQ 327 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + + ++ +G + Y + N ++K+ + K+ Sbjct: 328 KQLLAGVAQALDIGPAGPL-----MGVVQYGDNPATHFNLKTHMNSRDLKTAIEKITQRG 382 Query: 274 N-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 +N A+ + +++ + V+ + DG + + Sbjct: 383 GLSNVGRAISFVTKNFFSKANG-----NRSSAPNVVVVMVDGWPTD--------KVEEAS 429 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLL--------RKCTDSSGQF-FAVNDSRELLESFDKI 383 R +G+ I+ + V E + + ++G + F V L ++ + Sbjct: 430 RLARESGINIFFITVEGAAENEKQYVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPL 489 Query: 384 TDKIQEQ 390 ++ + Sbjct: 490 VKRVCDT 496 >gi|47219205|emb|CAG11223.1| unnamed protein product [Tetraodon nigroviridis] Length = 950 Score = 45.7 bits (106), Expect = 0.015, Method: Composition-based stats. Identities = 35/362 (9%), Positives = 97/362 (26%), Gaps = 41/362 (11%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 AAVL + + + + +L + Q+ + Sbjct: 209 AAVLDAVKGLTYPG-------GDESNLGAALEDVAMNLLNERAGGRAEDGV---PQMLVV 258 Query: 95 KDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 + A + + F G+ +A +L +T S + ++ Sbjct: 259 ITAGSSDDDTAAGDRALKGASVVTFGLGIGDTAAADLEAVATDKSFILSAPDFRTAAGIV 318 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 D +N+ + + + A P + Sbjct: 319 DQLLPYVL-----DVIRHNIIIQTEFREVSGVERDVAFLIDGTDNVRADFPY------IK 367 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + +++ + + VRI + + N +EV + L Sbjct: 368 DFIIKVIDPL-----DIGQNKVRIAVVQQSEQPYANFYLNTYQTKDEVMQAVRDLALIGG 422 Query: 275 TNTYPAM--HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 + +++ + + + +F+I ++ G ++ N + Sbjct: 423 RSLNTGFSLKFMKETIFS---KRYGSRAEDMVPQFLIVLSGG--------RSRDNVKEPA 471 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 ++ G+ + V + + + + F V + EL K+ + + Sbjct: 472 GVLKTGGVLPF--GVGVKDADRKQIEAISHNPAFAFTVKEFSELNTIPQKLNNYVSLSRE 529 Query: 393 RI 394 ++ Sbjct: 530 QL 531 Score = 37.6 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 37/103 (35%), Gaps = 12/103 (11%) Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + + + V+ +TDG + A + +R AG+++++V V + Sbjct: 33 NTGSGSRAVEGAVQVVVVLTDGRSQDDVAMP--------AQVLRLAGVELFAVGVQDAVD 84 Query: 353 GQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQEQSVR 393 + LR+ F+V+ L + + I R Sbjct: 85 SE--LREMASQPYDTHVFSVDSFLALRDIIQDLVVGICGAVTR 125 >gi|226360795|ref|YP_002778573.1| hypothetical protein ROP_13810 [Rhodococcus opacus B4] gi|226239280|dbj|BAH49628.1| hypothetical protein [Rhodococcus opacus B4] Length = 891 Score = 45.3 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 24/176 (13%), Positives = 50/176 (28%), Gaps = 31/176 (17%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH-------------- 283 A + L + + ++R + ++ A Sbjct: 98 FTVHAPWTRLDNGSLPALQGEVEKFRTRTDGIDTDYWNALDGARRTLAEHDSQSEANRCQ 157 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL--NTLQICEYMRNAGMK 341 A + K + + +D A I + +R++G+ Sbjct: 158 AVAWFSDGKLDFTVRDAEKPYAQGISLRSDQGVQQVVAAARESICRPAGIADQLRSSGIV 217 Query: 342 IYSVAVSAP---PEGQDLLRKCTD------------SSGQFFAVNDSRELLESFDK 382 ++V ++A P DL+R S G F+ + +LL +FD Sbjct: 218 TFAVGLAAGTAQPSDFDLMRSIATGGDGACGKTTSPSPGDFYLAQNIDDLLFAFDA 273 >gi|81892746|sp|Q6Q473|CLCA4_MOUSE RecName: Full=Calcium-activated chloride channel regulator 4; AltName: Full=Calcium-activated chloride channel regulator 6; Short=mClca6; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 110 kDa form; Contains: RecName: Full=Calcium-activated chloride channel regulator 4, 30 kDa form; Flags: Precursor gi|50882459|gb|AAS86332.2| calcium activated chloride channel [Mus musculus] Length = 924 Score = 45.3 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 23/207 (11%), Positives = 72/207 (34%), Gaps = 35/207 (16%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + +++ + ++A ++ I +G + ++ N+ Sbjct: 314 VSGSMTSYDRLNRMNQAAKYFLSQI-------IENRSWVGMVHFSSQATIVHELIQINSD 366 Query: 260 NEVKSRLNKL--NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 E L L + T+ + A++ N + + T ++ ++DGE+S Sbjct: 367 IERNQLLQTLPTSANGGTSICSGIKAAFQVFKNGEYQTDGTE--------ILLLSDGEDS 418 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSSGQFFAVNDSRE- 375 + ++++G ++ +A+ + + + G D + Sbjct: 419 T---------AKDCIDEVKDSGSIVHFIALGPLADLAVTNMSIL--TGGNHKLATDEAQN 467 Query: 376 --LLESFDKITDK---IQEQSVRIAPN 397 L+++F + + I ++S+++ Sbjct: 468 NGLIDAFGALASENADITQKSLQLESK 494 >gi|324504675|gb|ADY42017.1| Collagen alpha-5(VI) chain [Ascaris suum] Length = 898 Score = 45.3 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 26/149 (17%), Positives = 54/149 (36%), Gaps = 14/149 (9%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELY 289 VRI I Y + N ++++ L+++ T T A+ A EL+ Sbjct: 33 VDRDGVRIAAIQYAGFPLTEFALGTYLNADDIRQHLSQIKFQSGVTRTGYALRKADSELF 92 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 ++ + + K ++ TDG + + L ++ +KIY V+V + Sbjct: 93 RQERGAR-----SDAIKIIVLFTDGL-----SIDDPLKPAHELRDIKR--VKIYVVSVGS 140 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + + F N+ L + Sbjct: 141 DGF-EPEMNRIAGDKRNVFGPNELSRLRD 168 >gi|148555259|ref|YP_001262841.1| hypothetical protein Swit_2344 [Sphingomonas wittichii RW1] gi|148500449|gb|ABQ68703.1| hypothetical protein Swit_2344 [Sphingomonas wittichii RW1] Length = 625 Score = 45.3 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 27/255 (10%), Positives = 68/255 (26%), Gaps = 6/255 (2%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 A+ ++V +A++++ Q D A L+G + + +P+ Sbjct: 29 ALSLTVLVGMGAFAVEISRGYAADTANQRIADMAALAGALAYNVN---SNPSEMTATAKA 85 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 + Q + ++ Q Q+++T L + S Y++ Sbjct: 86 VVVAQGLPASAATVALVTDSATSKQLVQVSVTTSVPIALGRVFSSALAYDVTATGSATTT 145 Query: 123 LIPSAL---TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 + + S ++I+ + + + + + K Sbjct: 146 ATTTTAPPCIAALSSTPTYGITLSGGVSITSPGCAVNTNAGVTVPWGTTITAKQVNAGKG 205 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 + P + + + A K D ++ VN + R Sbjct: 206 IDNPGKGITTSPTANDIVQNKASAATDWMKDDSTLKGLLCKVNQLSGYSDPDYADGNRSC 265 Query: 240 TIAYNIGIVGNQCTP 254 T + Sbjct: 266 TTPLVTPATQTSAST 280 >gi|148225160|ref|NP_001089228.1| hypothetical protein LOC734275 [Xenopus laevis] gi|58047691|gb|AAH89181.1| MGC98917 protein [Xenopus laevis] Length = 1014 Score = 45.3 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 31/159 (19%), Positives = 56/159 (35%), Gaps = 24/159 (15%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSH 296 + + ++ N E KS+LN + T T A+ ++ Sbjct: 87 WQYGGLHYSDEVIIFSDITTNKQEYKSKLNAVTYIGRGTFTDCAL---------SNMTAL 137 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 F + ITDG +G+ + + RNAG+K++SVA S + Sbjct: 138 IQRQGGDAINFAVVITDGHVTGSPCGG----MMHQADRARNAGIKLFSVAASHDVY-ESG 192 Query: 357 LRKCTDSSGQFF-------AVNDSRELLESFDKITDKIQ 388 LR+ ++ + F +D + + I IQ Sbjct: 193 LREIANAPYELFRNSYSLTRADDRTVIND--KTIDKIIQ 229 >gi|125975601|ref|YP_001039511.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] gi|125715826|gb|ABN54318.1| von Willebrand factor, type A [Clostridium thermocellum ATCC 27405] Length = 1300 Score = 45.3 bits (105), Expect = 0.015, Method: Composition-based stats. Identities = 31/260 (11%), Positives = 83/260 (31%), Gaps = 29/260 (11%) Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + SI V+D+ R + ++ D + + P + Sbjct: 25 TIKTTVDHFSIYGVIDIVRFAQSWGIKDILDKLLNPGGETIPPVAEIGQADIVFVIDTTG 84 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY----NIGIVGNQCTPL 255 + I+ + + N N++ + N+ VR+G I Y G+ + Sbjct: 85 SMGSV-----INNVKNNITNFANTLME-----NNVDVRLGLIDYKDLEEDGMDSTKNLGW 134 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 +N+++ + +N + A L + + KF++ TD Sbjct: 135 FDNVSDFIASVNNMRATGG---GDAPESTVDALEEARRMDFR----PGVNKFIMLFTDVS 187 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 ++ +++ + + E ++ + + ++ P + L R +G + Sbjct: 188 YKESTRFEDVQSMKTVIEKLKEDKIVVSAI---VPSGYESLYRNLYTETGGVY-----AN 239 Query: 376 LLESFDKITDKIQEQSVRIA 395 + ++F + + Sbjct: 240 ITQAFSSALQSLISNIASVT 259 >gi|327270784|ref|XP_003220168.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 952 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 21/135 (15%), Positives = 51/135 (37%), Gaps = 26/135 (19%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T + ++ ++ ++ +TDGE+S S + + Sbjct: 373 TAGGGTRICNGVEAGFKVFK--------QKYASEKGCEIVLLTDGEDSTISYCLDEV--- 421 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN---DSRELLESFDKITD 385 + +G I+++A+ + L + D + G F+ DS L+++F I+ Sbjct: 422 ------KRSGSIIHTIALGRSAD--PGLEELADMTGGLKFSATDSLDSNSLIDAFTGISS 473 Query: 386 ---KIQEQSVRIAPN 397 + +QS+++ Sbjct: 474 SDGNLTQQSIQLESK 488 >gi|145491137|ref|XP_001431568.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398673|emb|CAK64170.1| unnamed protein product [Paramecium tetraurelia] Length = 591 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 26/157 (16%), Positives = 52/157 (33%), Gaps = 20/157 (12%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTP--LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + N + R I + N K + ++N T A A+++ Sbjct: 197 DFLNENDRYQLITFESQAQRLTPLKRVTDGNKQYFKQVIQQINSGGGTTIGTATEIAFKQ 256 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG--MKIYSV 345 L K ++ T + ++DG+ + +I E ++ +++ Sbjct: 257 LQERKYRNNVTS--------IFLLSDGQ--------DGQANQRIQEQIKTVNEVFTLHTF 300 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + Q + + C SG F+ V D L E F Sbjct: 301 GFGEDHDAQMMTQLCNLKSGSFYFVQDVTLLDEFFAD 337 >gi|77465024|ref|YP_354527.1| hypothetical protein RSP_3006 [Rhodobacter sphaeroides 2.4.1] gi|77389442|gb|ABA80626.1| conserved hypothetical protein containing Von Willebrand factor, type A domain [Rhodobacter sphaeroides 2.4.1] Length = 222 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 14/96 (14%), Positives = 38/96 (39%), Gaps = 5/96 (5%) Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + + L T A+ A L+ KE + G + +++ +TDG + + Sbjct: 83 RQTVPSLTAGGMTPMGEAVETALELLHTRKE-EYKRAGVDYYQPWLVIMTDGAPTDNISK 141 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 + L + +R + ++++ + + +L + Sbjct: 142 ASRLVD----DLVREKKLAVFAIGIGKDADMNELAK 173 >gi|281416613|ref|ZP_06247633.1| von Willebrand factor type A [Clostridium thermocellum JW20] gi|281408015|gb|EFB38273.1| von Willebrand factor type A [Clostridium thermocellum JW20] Length = 1363 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 31/260 (11%), Positives = 83/260 (31%), Gaps = 29/260 (11%) Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + SI V+D+ R + ++ D + + P + Sbjct: 88 TIKTTVDHFSIYGVIDIVRFAQSWGIKDILDKLLNPGGETIPPVAEIGQADIVFVIDTTG 147 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY----NIGIVGNQCTPL 255 + I+ + + N N++ + N+ VR+G I Y G+ + Sbjct: 148 SMGSV-----INNVKNNITNFANTLME-----NNVDVRLGLIDYKDLEEDGMDSTKNLGW 197 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 +N+++ + +N + A L + + KF++ TD Sbjct: 198 FDNVSDFIASVNNMRATGG---GDAPESTVDALEEARRMDFR----PGVNKFIMLFTDVS 250 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 ++ +++ + + E ++ + + ++ P + L R +G + Sbjct: 251 YKESTRFEDVQSMKTVIEKLKEDKIVVSAI---VPSGYESLYRNLYTETGGVY-----AN 302 Query: 376 LLESFDKITDKIQEQSVRIA 395 + ++F + + Sbjct: 303 ITQAFSSALQSLISNIASVT 322 >gi|163752614|ref|ZP_02159791.1| type IV pilin biogenesis protein, putative [Shewanella benthica KT99] gi|161327493|gb|EDP98702.1| type IV pilin biogenesis protein, putative [Shewanella benthica KT99] Length = 564 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 10/73 (13%), Positives = 29/73 (39%), Gaps = 4/73 (5%) Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVNDSREL 376 N +NT M++ + +++ S + L + G++++ + +L Sbjct: 236 GWMNTNDVNTNAAFPEMQS--VTTFTIGFSDGADDAAPLLTKTAELGGGEYYSAKSATQL 293 Query: 377 LESFDKITDKIQE 389 + ++ +I E Sbjct: 294 QAALSQVFSQILE 306 >gi|154252742|ref|YP_001413566.1| vault protein inter-alpha-trypsin subunit [Parvibaculum lavamentivorans DS-1] gi|154156692|gb|ABS63909.1| Vault protein inter-alpha-trypsin domain protein [Parvibaculum lavamentivorans DS-1] Length = 755 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 22/163 (13%), Positives = 51/163 (31%), Gaps = 16/163 (9%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + I + + NL K + L T PA+ + + Sbjct: 380 PGDTFNVIRFDDTLTVLFPDAVPAHGENLAVAKKFVKSLEANGGTEMLPALRASLID--- 436 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + ++F+TDG +I + + ++++V + + Sbjct: 437 ------RNVNDGTRLRQIVFLTDGA-----ISNEAELFHEITSNLGRS--RLFTVGIGSA 483 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 P + R G F + E+ E ++ +K+Q + Sbjct: 484 PNSYFMTRASEAGRGTFTHIGKETEVTERMAELFEKLQNPVMT 526 >gi|94968893|ref|YP_590941.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94550943|gb|ABF40867.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 628 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 29/202 (14%), Positives = 72/202 (35%), Gaps = 28/202 (13%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 A + ++AG + + ++ + ++ I+ Q +++ Sbjct: 403 VIDTSASIAGRFKFEQDAAGEFLQRVLTGPEDLG------FVVGFSNSILMAQ--DFTHD 454 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ + P T + A++ A +L + E + K +I I+DGE++ Sbjct: 455 SKQIAHSIQAFAPSGGTALWDAVNFAAEKLASHPERQP-------VAKILIVISDGEDN- 506 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAV-------SAPPEGQDLLRKCTD-SSGQFFAV 370 + Q + ++ + +Y++ PP G L+ + + G F Sbjct: 507 ----SSATTAKQAIQRAQSEEVAVYAINTLEITQRSEEPPVGVRALKTLAEMTGGAAFTP 562 Query: 371 NDSRELLESFDKITDKIQEQSV 392 R L S + + I+ + + Sbjct: 563 GSVRWLNSSLNDLQQVIRSRYL 584 >gi|332237899|ref|XP_003268144.1| PREDICTED: integrin alpha-10 isoform 2 [Nomascus leucogenys] Length = 1036 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 65/219 (29%), Gaps = 25/219 (11%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S + I E L + K + + +++G + Y V Sbjct: 28 SDGCPTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHE 85 Query: 251 QCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 EV L+ E T T A+ A E +++ + + Sbjct: 86 WSLGDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLL 140 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKC 360 + +TDGE+ L+ CE R + Y +AV P +R Sbjct: 141 VVVTDGESHDGE---QLPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTI 194 Query: 361 TDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 195 ASDPDERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 233 >gi|170571356|ref|XP_001891697.1| von Willebrand factor type A domain containing protein [Brugia malayi] gi|158603658|gb|EDP39502.1| von Willebrand factor type A domain containing protein [Brugia malayi] Length = 319 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 16/100 (16%), Positives = 35/100 (35%), Gaps = 9/100 (9%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 N ++ + + +L T A+ R +K+ H + K Sbjct: 220 KTRTQFNLDRYFNGKDIVTAIRRLESSGGT---TAVGEGIRLGIEQKDKQHGGRPNEIAK 276 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 K ++ TDG ++ + ++ + AG +Y+V Sbjct: 277 KAMLVFTDGWSNKG------PDVEEMSRNAKGAGFTLYTV 310 >gi|300795696|ref|NP_001178680.1| voltage-dependent calcium channel subunit alpha-2/delta-4 [Rattus norvegicus] Length = 1145 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 28/332 (8%), Positives = 92/332 (27%), Gaps = 40/332 (12%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 KD + + L+ ++ +++ + T N + + Sbjct: 181 KDDKGNYVELGAEFLLESDAHFNNLRVNVSMSSVQLPTNVYNKDPDILNGVYMSEALNPV 240 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + P+ STG + + + + Sbjct: 241 FVENFQRDPTLTWQYFGSSTGFFRIYPGIKWTPDENGVIAFDCRNRGWYIQ----AATSP 296 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK---KN 233 ++ S ++ + + +++++ + N Sbjct: 297 KDIVILVDMSGSMKGL----------------RMAIAKHTVTTILDTLGENDFVNIIAYN 340 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 V + +V +N K +++L A+ A++ L +E Sbjct: 341 DYVHYIEPCFKGILVQADR----DNREHFKQLVDELMVKGVGIVSQALIEAFQILKQFQE 396 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S ++ + + ++ +TDG T N + +++++ + Sbjct: 397 SRQGSLCN----QAIMLVTDGAVEDYEPVFETYNWP-------DRKVRVFTYLIGREVTF 445 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 D ++ ++ ++ L ++ + + + Sbjct: 446 ADRMKWIACNNKGYY--TQISTLADAQENVME 475 >gi|293347024|ref|XP_001056965.2| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 4-like [Rattus norvegicus] Length = 1179 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 28/332 (8%), Positives = 92/332 (27%), Gaps = 40/332 (12%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 KD + + L+ ++ +++ + T N + + Sbjct: 181 KDDKGNYVELGAEFLLESDAHFNNLRVNVSMSSVQLPTNVYNKDPDILNGVYMSEALNPV 240 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + P+ STG + + + + Sbjct: 241 FVENFQRDPTLTWQYFGSSTGFFRIYPGIKWTPDENGVIAFDCRNRGWYIQ----AATSP 296 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK---KN 233 ++ S ++ + + +++++ + N Sbjct: 297 KDIVILVDMSGSMKGL----------------RMAIAKHTVTTILDTLGENDFVNIIAYN 340 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 V + +V +N K +++L A+ A++ L +E Sbjct: 341 DYVHYIEPCFKGILVQADR----DNREHFKQLVDELMVKGVGIVSQALIEAFQILKQFQE 396 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S ++ + + ++ +TDG T N + +++++ + Sbjct: 397 SRQGSLCN----QAIMLVTDGAVEDYEPVFETYNWP-------DRKVRVFTYLIGREVTF 445 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 D ++ ++ ++ L ++ + + + Sbjct: 446 ADRMKWIACNNKGYY--TQISTLADAQENVME 475 >gi|299138149|ref|ZP_07031329.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298600079|gb|EFI56237.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 349 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 21/106 (19%), Positives = 47/106 (44%), Gaps = 13/106 (12%) Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV------AVS 348 +H+ + S +K ++ +TDG + G + E + A +Y + S Sbjct: 206 AHDKLQSQTGRKILVLLTDGGDQG-----SQETLKSATEAAQKANAILYVILIADRANFS 260 Query: 349 APPEGQDLLRKCT-DSSGQFFAV-NDSRELLESFDKITDKIQEQSV 392 + + ++ G+ V N+ ++L E+FD+I D+++ Q + Sbjct: 261 YGFNADGQMEQLAHETGGRVINVGNNGKKLEEAFDQIQDELRTQYL 306 >gi|254456981|ref|ZP_05070409.1| phage/colicin/tellurite resistance cluster TerY protein [Campylobacterales bacterium GD 1] gi|207085773|gb|EDZ63057.1| phage/colicin/tellurite resistance cluster TerY protein [Campylobacterales bacterium GD 1] Length = 229 Score = 45.3 bits (105), Expect = 0.016, Method: Composition-based stats. Identities = 29/196 (14%), Positives = 71/196 (36%), Gaps = 29/196 (14%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN- 267 ID L ++ +++NS +KA +++ I + + TPL+ S+++ Sbjct: 32 NIDTLNKAVESMLNSFKKAETM--ETFIKLSIITFGSENGVDLHTPLTE-----VSKIDF 84 Query: 268 -KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS----AY 322 L +T A + ++ + + ++ ++DGE + Sbjct: 85 KPLTVSGSTPMGAAFKMGKAMIEDK-----DIFKGRDYRPTIVLLSDGEPNDDWRQPLDD 139 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK-CTDSSGQFFAVNDSRELLESFD 381 + + C+ M A++ + +L F D+ +++ F Sbjct: 140 FVSTGRTKKCDRM----------ALAIGAADKTVLNMFIEGCENSLFYAEDAENIIDEFK 189 Query: 382 KITDKIQEQSVRIAPN 397 KIT + +++ + N Sbjct: 190 KITMSVTQRTKSVNKN 205 >gi|289524039|ref|ZP_06440893.1| type IV pilus assembly protein PilY1 [Anaerobaculum hydrogeniformans ATCC BAA-1850] gi|289502695|gb|EFD23859.1| type IV pilus assembly protein PilY1 [Anaerobaculum hydrogeniformans ATCC BAA-1850] Length = 1071 Score = 45.3 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 17/158 (10%), Positives = 46/158 (29%), Gaps = 39/158 (24%) Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTI---------GSTRLKKFVIFITDGENSGA 319 L + T +++ A + + + + ++I +TDG ++ Sbjct: 256 LRAHGATPLAASIYGADAHPDKKNPYRGDVRDFFLMDEAINAWCQQNWLIVLTDGADTQT 315 Query: 320 SAYQNTLNTLQICEYM---------------RNAGMKIYSVA-VSAPPEGQDLLRK---- 359 + + + + + ++ + ++ G L+ Sbjct: 316 W--PYNTSPVTAVKNLYDEHAKTSWPSFYGKKAQPVRTMVIGLINPNASGVSTLKNTLNR 373 Query: 360 CTDSSG--------QFFAVNDSRELLESFDKITDKIQE 389 D + D EL+++F I +IQ Sbjct: 374 MADMGDDGQENGSSHAYFATDVDELMQAFKDIFKQIQS 411 >gi|118356595|ref|XP_001011553.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|89293320|gb|EAR91308.1| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 899 Score = 45.3 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 14/167 (8%), Positives = 48/167 (28%), Gaps = 10/167 (5%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + ID + +V +I+ + V S +L Sbjct: 272 VDCTGSMSSWIDAVKLEITGIVAAIKNQHH-GSQIRVSFVGYRDYGDSERYSIFNFSEDL 330 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK------KFVIFITD 313 + + ++K+ + + +++ ++ S ++ D Sbjct: 331 EKFQDFISKVQACGGNDAAEDVAGGFKQANSQNWKSQAKYAVLLADAPAHGIQYHGDKAD 390 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + ++ + + + G+K+Y+V + ++ Sbjct: 391 FYDRYPKGDPDGIDLKKEFQNLIKKGVKLYAVEIM---NSTKMMYDI 434 >gi|300789363|ref|YP_003769654.1| hypothetical protein AMED_7539 [Amycolatopsis mediterranei U32] gi|299798877|gb|ADJ49252.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 526 Score = 45.3 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 16/123 (13%), Positives = 41/123 (33%), Gaps = 10/123 (8%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 P T Y ++ AY+ + V+ +TDG N + Sbjct: 407 KPGGATGLYDSILAAYQNARQSWQLGRI--------NVVVVLTDGRNEDDDSIGLPGLLA 458 Query: 330 QICEYMR-NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF-DKITDKI 387 ++ + + + + + +L + + G+ F D R++ + F ++ + Sbjct: 459 ELGRLQDPRKPLPVIGIGIGPDIDASELRQVSAATGGESFTTPDPRKISDVFYQALSKLM 518 Query: 388 QEQ 390 + Sbjct: 519 CQP 521 >gi|317133199|ref|YP_004092513.1| von Willebrand factor type A [Ethanoligenens harbinense YUAN-3] gi|315471178|gb|ADU27782.1| von Willebrand factor type A [Ethanoligenens harbinense YUAN-3] Length = 535 Score = 45.3 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 16/103 (15%), Positives = 40/103 (38%), Gaps = 14/103 (13%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 ++++ + + KL T+ Y + A ++L ++ VI +TDG++ Sbjct: 420 SSMSTLDQNIQKLQAGGGTDIYTPVMTALQQL--------AGADVSQCNPAVILMTDGQS 471 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + + N +T + + I+S+ L++ Sbjct: 472 NTGRTFTNVQSTYKSI----GKDIPIFSIEFG--AADPTQLKQ 508 >gi|154492260|ref|ZP_02031886.1| hypothetical protein PARMER_01894 [Parabacteroides merdae ATCC 43184] gi|154087485|gb|EDN86530.1| hypothetical protein PARMER_01894 [Parabacteroides merdae ATCC 43184] Length = 339 Score = 45.3 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 26/169 (15%), Positives = 52/169 (30%), Gaps = 46/169 (27%) Query: 251 QCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L+ +NP T A++ A R S K Sbjct: 142 TQLPITSDYVSAKMFLSSINPSMVSTQGTAIGAAINLAMRSFTP----------SETSDK 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-------PEGQDLLR- 358 +I ITDGEN + + ++ G+ + V + P + ++ Sbjct: 192 AIILITDGEN-------HEDDAVKAAAAAAEKGIHVNIVGMGDPKGSPIPVDGSNNYMKD 244 Query: 359 -----------------KCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 G + +++ L + K DK+ + Sbjct: 245 KDGNVVITKLNEEMCQEIAAAGHGTYVRADNTNSALRALQKEIDKMNKS 293 >gi|328868036|gb|EGG16417.1| hypothetical protein DFA_09452 [Dictyostelium fasciculatum] Length = 946 Score = 45.3 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 24/147 (16%), Positives = 49/147 (33%), Gaps = 10/147 (6%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 + Y G L+++++++ ++ + + A +A L K+ + Sbjct: 74 MAMGDYCDGSAVLSTLNLTSDVDQLTKFIHSVTGTGGGDLPEAYEYA---LMKAKDLAW- 129 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S K + I D S +N + + + G+KIY V A + Sbjct: 130 ---SPNASKAFVVIGDAAPHPPSYTDLNINWFKEVDDLAKMGVKIY--GVKAKCDQPIFY 184 Query: 358 RKCTD-SSGQFFAVNDSRELLESFDKI 383 + + S G + E F I Sbjct: 185 EEIAERSGGISINFEKFSLITELFLAI 211 >gi|290975425|ref|XP_002670443.1| vWFA domain-containing protein [Naegleria gruberi] gi|284084002|gb|EFC37699.1| vWFA domain-containing protein [Naegleria gruberi] Length = 348 Score = 45.3 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 22/168 (13%), Positives = 57/168 (33%), Gaps = 25/168 (14%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNI--GIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 A + V SI + + +R ++Y + P + N+ + K +N++ Sbjct: 55 AKDTVQSIITTLHDHFQSDLRFSAVSYRDHTDDYVVKEYPFTKNVEKAKGYVNEMFAKGG 114 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD----GENSGASAYQNT----- 325 + A+ A + + + K I+I D G + + + Sbjct: 115 GDMPEALASALKVVNEIPFNKKGR-------KICIWIADAPPHGMGASGDYFPDGCKDEQ 167 Query: 326 ---LNTLQICEYMRNAGMKIYSVAVSAPPEGQDL---LRKCTD-SSGQ 366 ++ +++ +++ + YS+ Q L + + G+ Sbjct: 168 GEIIDWIKLASHLQEKNVVFYSIICGRSKNDQQLSLFMDYLATKTDGK 215 >gi|67968521|dbj|BAE00622.1| unnamed protein product [Macaca fascicularis] Length = 480 Score = 45.3 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 21/119 (17%), Positives = 41/119 (34%), Gaps = 19/119 (15%) Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A +++ + ++ +I +TDGE +TL+ + R Sbjct: 1 MQAGFRKAIQQIETFNSGN-------KVPSMIIAMTDGE----LVAHAFQDTLREAQKAR 49 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 G +Y+V V D + DS FAV + F + D + + ++ Sbjct: 50 KLGANVYTVGV--ADYKLDQITAIADSPEHVFAVENG------FKAMRDTVDALTSKVC 100 >gi|167899815|ref|ZP_02487216.1| hypothetical protein Bpse7_39190 [Burkholderia pseudomallei 7894] Length = 396 Score = 45.3 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 32/261 (12%), Positives = 82/261 (31%), Gaps = 5/261 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 4 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 63 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY--IAESKAQYEIPT 115 + +Q ++ S + + N ++ Sbjct: 64 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 123 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + L L + A ++S + + + AI + + +++ + +N + ++ Sbjct: 124 QTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 183 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + P +F S A + A + + G+ V + + Sbjct: 184 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTNAYN 243 Query: 236 VRIGTIAYNIGIVGNQCTPLS 256 R G A + Sbjct: 244 TRFGIYANPYKNPSYGTPDFT 264 >gi|218192066|gb|EEC74493.1| hypothetical protein OsI_09963 [Oryza sativa Indica Group] Length = 641 Score = 45.3 bits (105), Expect = 0.017, Method: Composition-based stats. Identities = 11/124 (8%), Positives = 41/124 (33%), Gaps = 14/124 (11%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 K+ +L + ++ ++ + + + + + L+ + ++ Sbjct: 265 KLSLLKRAMSFVIQTLG-----PNDRLSVVAFSSTAQRLFPLRRMTLT-GRQQALQAISS 318 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L TN A+ + + + + +I ++DG+++ + + + Sbjct: 319 LVASGGTNIADALKKGAKVVKDRRR--------KNPVSSIILLSDGQDTHSFLSGSIQDA 370 Query: 329 LQIC 332 C Sbjct: 371 FAQC 374 >gi|332809815|ref|XP_001161858.2| PREDICTED: integrin alpha-10 isoform 1 [Pan troglodytes] Length = 1036 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 65/219 (29%), Gaps = 25/219 (11%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S + I E L + K + + +++G + Y V Sbjct: 28 SDGCPTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHE 85 Query: 251 QCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 EV L+ E T T A+ A E +++ + + Sbjct: 86 WSLGDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLL 140 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKC 360 + +TDGE+ L+ CE R + Y +AV P +R Sbjct: 141 VVVTDGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTI 194 Query: 361 TDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 195 ASDPDERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 233 >gi|326385750|ref|ZP_08207379.1| hypothetical protein Y88_2807 [Novosphingobium nitrogenifigens DSM 19370] gi|326209729|gb|EGD60517.1| hypothetical protein Y88_2807 [Novosphingobium nitrogenifigens DSM 19370] Length = 543 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 26/183 (14%), Positives = 54/183 (29%), Gaps = 16/183 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 +TA+ I + + ID A +++++ A + TT +T Sbjct: 33 ITALAIIPALFGLGFTIDYARAEMLQSRIN-----------AVADAAALAATDTTYISKT 81 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 ST+ + + + + +T L + ++ + NL Sbjct: 82 STVAQAASTQIFTTQVTDYADFVYTPASD-LTVTITDGGTLNLGRTALVKWRGSSTNL-F 139 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 G++ A + ST I+ +V+D S SM + Sbjct: 140 SGILGVATLPIGGSSTAYASFF---PNINFYLVMDKSPSMLLPSTSSGISAIQKVTGCAF 196 Query: 181 LPP 183 Sbjct: 197 ACH 199 >gi|194385334|dbj|BAG65044.1| unnamed protein product [Homo sapiens] Length = 1036 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 65/219 (29%), Gaps = 25/219 (11%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S + I E L + K + + +++G + Y V Sbjct: 28 SDGCPTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHE 85 Query: 251 QCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 EV L+ E T T A+ A E +++ + + Sbjct: 86 WSLGDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLL 140 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKC 360 + +TDGE+ L+ CE R + Y +AV P +R Sbjct: 141 VVVTDGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTI 194 Query: 361 TDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 195 ASDPDERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 233 >gi|149049604|gb|EDM02058.1| similar to putative voltage-gated calcium channel alpha(2)delta-4 subunit (predicted) [Rattus norvegicus] Length = 700 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 28/332 (8%), Positives = 92/332 (27%), Gaps = 40/332 (12%) Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 KD + + L+ ++ +++ + T N + + Sbjct: 181 KDDKGNYVELGAEFLLESDAHFNNLRVNVSMSSVQLPTNVYNKDPDILNGVYMSEALNPV 240 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + P+ STG + + + + Sbjct: 241 FVENFQRDPTLTWQYFGSSTGFFRIYPGIKWTPDENGVIAFDCRNRGWYIQ----AATSP 296 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK---KN 233 ++ S ++ + + +++++ + N Sbjct: 297 KDIVILVDMSGSMKGL----------------RMAIAKHTVTTILDTLGENDFVNIIAYN 340 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 V + +V +N K +++L A+ A++ L +E Sbjct: 341 DYVHYIEPCFKGILVQADR----DNREHFKQLVDELMVKGVGIVSQALIEAFQILKQFQE 396 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S ++ + + ++ +TDG T N + +++++ + Sbjct: 397 SRQGSLCN----QAIMLVTDGAVEDYEPVFETYNWP-------DRKVRVFTYLIGREVTF 445 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 D ++ ++ ++ L ++ + + + Sbjct: 446 ADRMKWIACNNKGYY--TQISTLADAQENVME 475 >gi|253584082|ref|ZP_04861280.1| batA protein [Fusobacterium varium ATCC 27725] gi|251834654|gb|EES63217.1| batA protein [Fusobacterium varium ATCC 27725] Length = 325 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 28/179 (15%), Positives = 59/179 (32%), Gaps = 55/179 (30%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEK 292 RIG I ++ PL+++ + K+ +N L+ T Y A+ A + Sbjct: 123 RIGFIPFSDSAYIQ--MPLTDDYSIGKNYINALDTNLISGGGTELYQALELAEKSFKEIN 180 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 + K +I ++DG + +L + ++ M ++S+ + Sbjct: 181 SDN----------KTIIILSDG----GDFDEKSLKFV------KDNKMNVFSIGIGTEEG 220 Query: 353 ---------------------------GQDLLRKCTD-SSGQFFAVND-SRELLESFDK 382 D L+K + S G+++ VN+ + F Sbjct: 221 TIIPEYVNGKKVGFIKDQNGSAVISKLNSDFLKKLSSESDGKYYEVNNLKDDSSNFFKD 279 >gi|260837445|ref|XP_002613714.1| hypothetical protein BRAFLDRAFT_130689 [Branchiostoma floridae] gi|229299103|gb|EEN69723.1| hypothetical protein BRAFLDRAFT_130689 [Branchiostoma floridae] Length = 1875 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 13/81 (16%), Positives = 30/81 (37%), Gaps = 9/81 (11%) Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 +I +TDG++ + A + + G+ +Y++ V + LL + Sbjct: 1 MIVVTDGKSGDSVASS--------ANDLASQGVDVYAIGVG-NYDATQLLEIAAGNQNNV 51 Query: 368 FAVNDSRELLESFDKITDKIQ 388 + D L ++I + Sbjct: 52 IELTDFNALSAEINQIAQTVC 72 >gi|118347184|ref|XP_001007069.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89288836|gb|EAR86824.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 821 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 22/151 (14%), Positives = 51/151 (33%), Gaps = 23/151 (15%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR---LNKLNP-YENTNTYPAMHHAYRELY 289 I++ G N +KS ++K + T Y + ++ Sbjct: 369 DDSYFNVISFGSGYQFLFEEAKKKNKQSMKSALEQISKFSADMGGTEIYQPLEKIFQ--- 425 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + + +TDG+ S + + +++ + + + Sbjct: 426 -------CKNVNDLYQMQIFLLTDGQVSQPDMVVQLIRN-------NSHKARVHCIGLGS 471 Query: 350 PPEGQDLLRKCTDSS-GQFFAVNDSRELLES 379 + LLR+C++S G V+++ EL E Sbjct: 472 GV-DKQLLRRCSESGRGANRQVDNASELKEV 501 >gi|332216482|ref|XP_003257380.1| PREDICTED: LOW QUALITY PROTEIN: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Nomascus leucogenys] Length = 1398 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 614 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 672 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 673 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 728 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 729 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 776 >gi|297671074|ref|XP_002813673.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Pongo abelii] Length = 987 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 261 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 319 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 320 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 375 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 376 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 423 >gi|296225455|ref|XP_002758485.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3 isoform 1 [Callithrix jacchus] gi|296225457|ref|XP_002758486.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3 isoform 2 [Callithrix jacchus] Length = 1091 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 271 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 329 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 330 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 385 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 386 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 433 >gi|291393868|ref|XP_002713441.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta 3 subunit [Oryctolagus cuniculus] Length = 1352 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 538 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 596 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 597 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 652 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 653 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 700 >gi|194389314|dbj|BAG61618.1| unnamed protein product [Homo sapiens] Length = 525 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 177 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 235 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 236 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 291 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 292 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 339 >gi|119585707|gb|EAW65303.1| calcium channel, voltage-dependent, alpha 2/delta 3 subunit, isoform CRA_a [Homo sapiens] Length = 992 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 177 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 235 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 236 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 291 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 292 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 339 >gi|109039062|ref|XP_001082066.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like, partial [Macaca mulatta] Length = 691 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 271 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 329 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 330 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 385 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 386 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 433 >gi|73985413|ref|XP_533789.2| PREDICTED: similar to calcium channel, voltage-dependent, alpha 2/delta 3 subunit [Canis familiaris] Length = 1128 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 308 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 366 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 367 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 422 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 423 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 470 >gi|54112397|ref|NP_060868.2| voltage-dependent calcium channel subunit alpha-2/delta-3 [Homo sapiens] gi|74723683|sp|Q8IZS8|CA2D3_HUMAN RecName: Full=Voltage-dependent calcium channel subunit alpha-2/delta-3; AltName: Full=Voltage-gated calcium channel subunit alpha-2/delta-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit alpha-2-3; Contains: RecName: Full=Voltage-dependent calcium channel subunit delta-3; Flags: Precursor gi|22770596|gb|AAN06673.1| voltage-gated calcium channel alpha(2)delta-3 subunit [Homo sapiens] gi|187950675|gb|AAI37506.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Homo sapiens] gi|187953583|gb|AAI37503.1| Calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Homo sapiens] Length = 1091 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 271 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 329 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 330 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 385 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 386 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 433 >gi|7105926|emb|CAB75962.1| calcium channel alpha2-delta3 subunit [Homo sapiens] Length = 997 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 177 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 235 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 236 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 291 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 292 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 339 >gi|7024361|emb|CAB75878.1| calcium channel alpha2-delta3 subunit [Homo sapiens] Length = 519 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 177 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 235 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 236 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 291 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 292 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 339 >gi|290976446|ref|XP_002670951.1| predicted protein [Naegleria gruberi] gi|284084515|gb|EFC38207.1| predicted protein [Naegleria gruberi] Length = 1082 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 24/138 (17%), Positives = 43/138 (31%), Gaps = 19/138 (13%) Query: 259 LNEVKSRLNKLNP---YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 ++S + + T+ A A + +ES + + F+TDGE Sbjct: 183 KGTLQSIIRNIETAFTGGGTDFASAFQLACTIIT--RESGQDRENLPFGNVVITFLTDGE 240 Query: 316 NSGASAYQNTLNTLQICEYMRN--AG-MKIYSVAVSAPPEGQ--DLLRKCTDSSGQFFAV 370 S + E + G + I++V + + D +RK G + Sbjct: 241 --DFSKVGKPGGLQYLSEEINRVYRGDITIHTVGFGSHHNLELLDNIRKVGTIEGAYRYA 298 Query: 371 NDSRE-------LLESFD 381 N L FD Sbjct: 299 NYDDNNDVICGKLTSIFD 316 >gi|296268733|ref|YP_003651365.1| hypothetical protein Tbis_0747 [Thermobispora bispora DSM 43833] gi|296091520|gb|ADG87472.1| hypothetical protein Tbis_0747 [Thermobispora bispora DSM 43833] Length = 587 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 10/81 (12%), Positives = 30/81 (37%), Gaps = 6/81 (7%) Query: 317 SGASAYQNTLNTLQICEYMR-----NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 + ++ ++ E +R + ++I V A + L ++G+ + Sbjct: 503 TYGQDDGRGISRQELAEALRKEWNPDRPVQIVVVMFGAGRDRAALEEAAAITNGEVYVAR 562 Query: 372 DSRELLESF-DKITDKIQEQS 391 E+++ F I ++ + Sbjct: 563 QPGEIIDVFLSAIARRLCHPT 583 >gi|148657455|ref|YP_001277660.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] gi|148569565|gb|ABQ91710.1| von Willebrand factor, type A [Roseiflexus sp. RS-1] Length = 429 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 19/181 (10%), Positives = 55/181 (30%), Gaps = 21/181 (11%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 K + ++ LV + + + P+ ++ +R Sbjct: 188 GAPKTVIARQALIALVERLPATTNVALRTYGHRRADDCSDTELVQAPAPIQ--RADLINR 245 Query: 266 LNKLNP--YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +N + P T ++ R+L + ++ ++DG+ + Sbjct: 246 INAIRPVNGGRTPIAQSLEDMARDLA-----------GVDGEVLIVLVSDGDETCGGDPV 294 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRELLESFD 381 T L +++ + + E + L G +F ++ +L ++ + Sbjct: 295 ATAAALHTANP----RLRVSVIGFNIEQEEWRRRLEGIAAYGGGAYFDAANAVQLADALE 350 Query: 382 K 382 + Sbjct: 351 Q 351 >gi|328949411|ref|YP_004366747.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] gi|328449735|gb|AEB15450.1| von Willebrand factor type A [Treponema succinifaciens DSM 2489] Length = 385 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 19/115 (16%), Positives = 49/115 (42%), Gaps = 8/115 (6%) Query: 281 MHHAYRELYNEKESSHNT-IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 + LY+ + +K VI ++DG + + ++ Q+ E + N+G Sbjct: 159 LSEGQPRLYDSIMNLIRKVEQKKGKRKVVIILSDGRDQNSRFSKD-----QLIETLSNSG 213 Query: 340 MKIYSVA--VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + +Y+V V + +L + G ++ +++ ++ K+ D I++ + Sbjct: 214 LPVYTVGMKVLSNQSLSNLDEISQLTGGTYYYSTRFKDIPDNLKKVVDCIKQSYI 268 >gi|323484869|ref|ZP_08090225.1| hypothetical protein HMPREF9474_01976 [Clostridium symbiosum WAL-14163] gi|323401865|gb|EGA94207.1| hypothetical protein HMPREF9474_01976 [Clostridium symbiosum WAL-14163] Length = 2032 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 43/282 (15%), Positives = 89/282 (31%), Gaps = 42/282 (14%) Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM--TSNKYLLPPPPKKSFWSKNTTKS 197 + + I V+D S SM D + N ++ + S SK + Sbjct: 1069 QTTIGGGTADIVFVIDKSSSMNDWDDSLDDYRWNKLESTVDRFINKLKITSPNSKISFIE 1128 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + D + + A S K ++ K R I VG++ + Sbjct: 1129 YQSSDLGTYQETYDDIRKVAVADTESADKNLEGNKLKYFRTLQKWTAISEVGSKPYGSAP 1188 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG--- 314 N+ T++ A R L K + + S K++I++ DG Sbjct: 1189 GYNQ------------GTHSAGGYLGAERALDRLKTYNPDEYNS--NSKYIIYLADGTAG 1234 Query: 315 --ENSGASAYQNTLNTLQICEYM----------RNAGMKIYSVAVSAPPEGQ-DLLRKCT 361 N+ + + ++ IY+VA + + ++ Sbjct: 1235 YYVNNNGTQAGSGSGGNANARRAAITQSGELKKKHPDATIYTVAFGSDSSANMNWMKPGA 1294 Query: 362 DSSGQ----------FFAVNDSRELLESFDKITDKIQEQSVR 393 + F++ +++EL E+FD + ++ +V Sbjct: 1295 YNGNSDNPYNPNVTAFYSAANTKELEETFDNLAAQVGSSAVT 1336 >gi|2342594|emb|CAA73785.1| matrilin-3 precursor [Homo sapiens] Length = 310 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 15/93 (16%), Positives = 30/93 (32%), Gaps = 15/93 (16%) Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + S+ + K I +TDG ++ + +G+++Y+V V Sbjct: 2 AGAREPSSNIPKVAIIVTDGRPQD--------QVNEVAARAQASGIELYAVGV--DRADM 51 Query: 355 DLLRKCTDSS--GQFFAVND---SRELLESFDK 382 L+ F V +L F + Sbjct: 52 ASLKMMASEPLEEHVFYVETYGVIEKLSSRFQE 84 >gi|167842450|ref|ZP_02469134.1| hypothetical protein Bpse38_37650 [Burkholderia thailandensis MSMB43] Length = 418 Score = 45.3 bits (105), Expect = 0.018, Method: Composition-based stats. Identities = 29/261 (11%), Positives = 80/261 (30%), Gaps = 5/261 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 26 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 85 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY--IAESKAQYEIPT 115 + +Q + S + + + ++ Sbjct: 86 GHLNYALFEQFPVQMLTNSNVTFSNSLSNPFQPKSSIASPSSIKYVKCTTSRTGIVNWFI 145 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + L + + A ++S + + + AI + + +++ + +N + + Sbjct: 146 QTLNMVPGVSVANASVSATAVATVGAAQTTCAIPVFVCKAGTQTSPPVAGATYNIGDWLA 205 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 P +F S A + + + G+ V + + + + Sbjct: 206 VKTGSPPSFGAGNFGWSALDGSNSAASIKSELTGNYCALPATGSQVGTPGNKVADSAAYN 265 Query: 236 VRIGTIAYNIGIVGNQCTPLS 256 R G A + Sbjct: 266 TRFGIYANPYKDPSYGTPDFT 286 >gi|118595076|ref|ZP_01552423.1| hypothetical protein MB2181_05370 [Methylophilales bacterium HTCC2181] gi|118440854|gb|EAV47481.1| hypothetical protein MB2181_05370 [Methylophilales bacterium HTCC2181] Length = 700 Score = 45.3 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 56/167 (33%), Gaps = 16/167 (9%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 +K + I + ++ N L T+ A+ A+ Sbjct: 367 MRLDKGDRFNIIDFDTQFTPLFTEAMPAININKKSGLQFTKSLEADGGTDPLEAIKFAFT 426 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 I S L + +IF+TDG+ S +T+ Q + + +++ Sbjct: 427 S---------KKIPSQPLLRQIIFLTDGQVSNEHEIIDTVR--QYIDQDK-----FFTIG 470 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + + P + + G F + + +E+ ++ K++ ++ Sbjct: 471 IGSAPNSYLMTKLADYGRGAFTYIGEVKEVRTKMTELFSKLESPALT 517 >gi|70606978|ref|YP_255848.1| hypothetical protein Saci_1209 [Sulfolobus acidocaldarius DSM 639] gi|68567626|gb|AAY80555.1| conserved protein [Sulfolobus acidocaldarius DSM 639] Length = 360 Score = 45.3 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 15/129 (11%), Positives = 44/129 (34%), Gaps = 19/129 (14%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 N L + + L + TN + A+ + + + + Sbjct: 83 SNDIETLYEGESGKQIELKSIKMGYTTNLHKAITKVLEKFKSSEIPVK-----------I 131 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQF 367 I ++DG+ + ++ + +++ ++ + + +++ D G F Sbjct: 132 ILLSDGKPTDKRYSRDYESLQVP------KNVQLITIGLG-EDYNEAIMKILADKGSGVF 184 Query: 368 FAVNDSREL 376 + +ND +L Sbjct: 185 YHINDPSQL 193 >gi|290976796|ref|XP_002671125.1| predicted protein [Naegleria gruberi] gi|284084691|gb|EFC38381.1| predicted protein [Naegleria gruberi] Length = 1058 Score = 45.3 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 17/156 (10%), Positives = 48/156 (30%), Gaps = 26/156 (16%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 N+ + + + +S +L E +++ +++ T+ A R Sbjct: 92 TNIDGHNERVLFIPYDTSAELIDMSRMSLTEKLNQVQRVHAGGGTDFACVFD-AIRLFTG 150 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA------GMKIYS 344 + ++F TDG+ N + M+ + ++ Sbjct: 151 ALDGQIA----------IVFFTDGQ------DGYNGNRETAIDMMKKRLTTESESFEFHT 194 Query: 345 VAVSAPPEGQDL--LRKCTDSSGQFFAVNDSRELLE 378 + S+ + + L + + + G F + + Sbjct: 195 IGFSSGHDARLLTDMTRLGSAQGTFQYAESASSIST 230 >gi|149918007|ref|ZP_01906501.1| protein containing a von Willebrand factor type A domain [Plesiocystis pacifica SIR-1] gi|149821273|gb|EDM80677.1| protein containing a von Willebrand factor type A domain [Plesiocystis pacifica SIR-1] Length = 1606 Score = 45.3 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 15/130 (11%), Positives = 42/130 (32%), Gaps = 13/130 (10%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + N++ L + T+ A +++ + + Sbjct: 593 YPEQGMTPASTENIDAALEALGERRHGGATDLGAIFERALDRVHDADQPA---------- 642 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 V++I DG + + L + ++ + + ++++V V + L R G Sbjct: 643 --VVYIGDGLATSGERGSDDL-SERLRRALSGSPARLFTVGVGPTIDDSLLERLARVGGG 699 Query: 366 QFFAVNDSRE 375 + V+ + Sbjct: 700 ESLRVSTPDQ 709 >gi|198421553|ref|XP_002122451.1| PREDICTED: similar to CLCA family member 1, chloride channel regulator [Ciona intestinalis] Length = 1034 Score = 45.3 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 36/212 (16%), Positives = 68/212 (32%), Gaps = 23/212 (10%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 Y+ + + T AP NR D N++ + I LS + Sbjct: 251 YMDQVVDFCHNDTNDPTNLHNKEAPNEHNRLCDQRSSRYENMMQAATDFIMTYVPLSAEV 310 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKESSH 296 G + + + N + E ++L P T + + L N Sbjct: 311 GIVEFESEASTLSWLVMINGIAERNYLKDRLPKPPSGGTCIGCGIEQGIKVLGNRA---- 366 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 +I +TDGE + + + E + + + S+ P DL Sbjct: 367 ---------GHLIVLTDGEETTEPYVNSVKD-----EVLAKQNVVVDSIFFG-PSGNLDL 411 Query: 357 LRKCTDSSG--QFFAVNDSRELLESFDKITDK 386 + D+ G + V+D L E+F ++ + Sbjct: 412 QQLTEDTEGIMYYNDVSDITGLKETFKQLAES 443 >gi|115905847|ref|XP_797936.2| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] gi|115976253|ref|XP_001179346.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length = 870 Score = 45.3 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 17/142 (11%), Positives = 46/142 (32%), Gaps = 20/142 (14%) Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P+S + ++K++++ L +T PA+ L + ++ Sbjct: 515 NIRPVSTAVGDLKTQVSNLRTAGSTALGPALCVCVGLL------------AKEPGSEIVL 562 Query: 311 ITDGENSGASAY---QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS---S 364 TDG+ + + + +I + I + + + +++ + Sbjct: 563 CTDGQPNVGVGSFGYSSRGDNPEIGNIAKEQQTTISIIGIQT--NQSLQMEHISNTVSIT 620 Query: 365 GQFFAVNDSRELLESFDKITDK 386 G + D EL ++ Sbjct: 621 GGNLNLLDPHELTRQIRQLGQD 642 >gi|307102430|gb|EFN50705.1| hypothetical protein CHLNCDRAFT_28788 [Chlorella variabilis] Length = 344 Score = 45.3 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 26/181 (14%), Positives = 60/181 (33%), Gaps = 15/181 (8%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 KIDV E +++ + N SV I + ++ ++++ Sbjct: 122 NTTKIDVAKEVLSGVLDLLAP------NDSVAIVLFSTRACTPQPLSRVSCLDIPALQAQ 175 Query: 266 LN-KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++ ++ +T+ + A EL E + T + ++ ITD + + Sbjct: 176 IDKDMHATSSTSLSAGLDLAIAELKKCSEG--MSASLTDTENRIMVITDQQPNSGDYTTG 233 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE----LLESF 380 L + + G+ + V + +++V+ E L + F Sbjct: 234 GLAARLRKDA--DDGIFTTIIGVGLDLNSELAESISKVRGANYYSVHRPGEFRRRLTDEF 291 Query: 381 D 381 D Sbjct: 292 D 292 >gi|306840900|ref|ZP_07473644.1| norD protein [Brucella sp. BO2] gi|306289103|gb|EFM60361.1| norD protein [Brucella sp. BO2] Length = 633 Score = 45.3 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 45/132 (34%), Gaps = 20/132 (15%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T AM +A +L + KK ++ +TDG+ + Sbjct: 515 TVEHRIAALKPGFYTRMGAAMRYATAKLAEQPNR----------KKLLLLLTDGKPNDVD 564 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRE 375 ++ + R G+ +++V V L G + V + + Sbjct: 565 HYEGRFALEDSRRAAGEARAKGVNVFAVTVDREASAYLPALF-----GRGGYALVANLAK 619 Query: 376 LLESFDKITDKI 387 L + I + Sbjct: 620 LPVALPAIYRML 631 >gi|288959068|ref|YP_003449409.1| hypothetical protein AZL_022270 [Azospirillum sp. B510] gi|288911376|dbj|BAI72865.1| hypothetical protein AZL_022270 [Azospirillum sp. B510] Length = 149 Score = 45.3 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 13/123 (10%), Positives = 30/123 (24%), Gaps = 24/123 (19%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHL 72 +D + ++ A+DAA L+G + + + + F K Sbjct: 38 AGCTVDALRSYAVEARLSQAVDAAALAGGRVMFDSQ-------RDGHIRSFFDKAFPNGF 90 Query: 73 KQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLS 132 + ++ + A + L L + Sbjct: 91 L--------------GSHLSPLTIAEDAAAGTLTVSAHASVNAIFL---RLFGKKEVTVE 133 Query: 133 LRS 135 +S Sbjct: 134 AQS 136 >gi|254454548|ref|ZP_05067985.1| conserved hypothetical protein [Octadecabacter antarcticus 238] gi|198268954|gb|EDY93224.1| conserved hypothetical protein [Octadecabacter antarcticus 238] Length = 237 Score = 45.3 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 16/124 (12%), Positives = 35/124 (28%), Gaps = 19/124 (15%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + N S + + N P + + +++ + +N L+ Y NT + Sbjct: 1 MSHFSYGGSNDSPINPDVTWCFTGDQNAIIPHTTSESDLHNAINDLHAYGNTAIDMGVKW 60 Query: 284 AYRELYNE-------------------KESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L + + K ++ +TDGEN+ + Sbjct: 61 GVALLDPSTKSLIASLAGASGSGVPSVAAGRPEQFTQSDVLKVLVLMTDGENTQQWDLYD 120 Query: 325 TLNT 328 Sbjct: 121 HYKN 124 >gi|167520728|ref|XP_001744703.1| hypothetical protein [Monosiga brevicollis MX1] gi|163777034|gb|EDQ90652.1| predicted protein [Monosiga brevicollis MX1] Length = 785 Score = 45.3 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 13/108 (12%), Positives = 36/108 (33%), Gaps = 9/108 (8%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + +V +++ + +TN + + + L + + L + +TDGE Sbjct: 216 TTVADVLGKIDAIYHGGSTNLWAGIETGLQLLASCAQPH--------LHNVCVALTDGEP 267 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 + + + M N ++++ LL+ + Sbjct: 268 NRHPEQGYETAHRRF-KQMPNFSYVLHTLPFGFGRIDSALLQSLARTG 314 >gi|134118676|ref|XP_771841.1| hypothetical protein CNBN0230 [Cryptococcus neoformans var. neoformans B-3501A] gi|50254445|gb|EAL17194.1| hypothetical protein CNBN0230 [Cryptococcus neoformans var. neoformans B-3501A] Length = 502 Score = 45.3 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 20/189 (10%), Positives = 52/189 (27%), Gaps = 26/189 (13%) Query: 202 APAPANRKIDVLIESAGNLVNSIQ-----KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + I+ + + + + I+ + + V + P + Sbjct: 63 CTGSMQKYINSVRDHIIGICDMIRGEEGLNGPDDLRVAVVNYRDHPPQDSTYVYKFHPFT 122 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAM--HHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 +++ EV++ L L + A+ A E + I +G Sbjct: 123 SDIPEVQNYLKGLTASGGGDGPEAVTAAMAATLTELEWRREAARMAVLVADAPPHGIGEG 182 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 + + + L + M G+ ++ + C D+ + Sbjct: 183 GDQFKQGDPDGHDPLVVARMMAQNGITMF-------------MVACEDTLSGYSHA---- 225 Query: 375 ELLESFDKI 383 ++ F I Sbjct: 226 --VDFFQAI 232 >gi|295789107|ref|NP_001171441.1| vitrin isoform 3 [Homo sapiens] Length = 657 Score = 45.3 bits (105), Expect = 0.019, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 39/120 (32%), Gaps = 14/120 (11%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 311 MGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 368 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 369 ---NRSGAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 417 >gi|260810987|ref|XP_002600204.1| hypothetical protein BRAFLDRAFT_118258 [Branchiostoma floridae] gi|229285490|gb|EEN56216.1| hypothetical protein BRAFLDRAFT_118258 [Branchiostoma floridae] Length = 421 Score = 45.3 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 18/81 (22%), Positives = 35/81 (43%), Gaps = 3/81 (3%) Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKC 360 + V+ +TDG G + T + + E R+AGM++Y+ A+ + L + Sbjct: 319 EGVPSIVVILTDGRIYGTIDGRYTEDVIGYAEAARDAGMEVYAGAIGREVFVDETALEEI 378 Query: 361 TDSSGQFFAV--NDSRELLES 379 + S + F+ D L+ Sbjct: 379 SGSEDRTFSTFDEDPSVLVAI 399 >gi|115496702|ref|NP_001068594.1| collagen alpha-2(VI) chain [Bos taurus] gi|94574217|gb|AAI16098.1| Collagen, type VI, alpha 2 [Bos taurus] gi|296490819|gb|DAA32932.1| collagen, type VI, alpha 2 [Bos taurus] Length = 917 Score = 45.3 bits (105), Expect = 0.020, Method: Composition-based stats. Identities = 26/154 (16%), Positives = 55/154 (35%), Gaps = 19/154 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS--- 256 + + + N+VN + ++ K+ + R+G + Y+ Sbjct: 621 SSESIGYTNFTLEKNFVINVVNRLGAIAKDPKSETGTRVGVVQYSHEGTFEAIQLDDERI 680 Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 ++L+ K + L T T A+ AY +L E + + F + ITDG Sbjct: 681 DSLSSFKEAVKNLEWIAGGTWTPSALKFAYNKLIKESRR-------QKTRVFAVVITDGR 733 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + LN +C N + + ++ + Sbjct: 734 HD---PRDDDLNLRALC----NHEVTVTAIGIGD 760 >gi|332256729|ref|XP_003277468.1| PREDICTED: collagen alpha-2(VI) chain [Nomascus leucogenys] Length = 1124 Score = 44.9 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 31/220 (14%), Positives = 72/220 (32%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ + D+L+ V +Q + L + Sbjct: 168 NNCPEKTDCPIHVYFVLDTSESVAMQSPTDILLFHMKQFVPQFISQLQNEFYLDQVALSW 227 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L ++ T T A+ + ++ ++ Sbjct: 228 RYGGLHFSDQVEVFSPPGSDRASFIKNLQGISSFRRGTFTDCALANMTEQIRQDR----- 282 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 S F + ITDG +G+ L E R G+++++VA + L Sbjct: 283 ---SKGTVHFAVVITDGHVTGSPCGGIKLQ----AERAREEGIRLFAVA-PNQNLKEQGL 334 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R + + + + + L +S +I + +++ + Sbjct: 335 RDIASTPHELYRNDYATMLPDS-TEIDQDTINRIIKVMKH 373 >gi|332164672|ref|NP_001193680.1| calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Bos taurus] gi|296474881|gb|DAA16996.1| calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Bos taurus] Length = 1091 Score = 44.9 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 271 RMTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 329 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I S + ++ ITDG N Sbjct: 330 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 385 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 386 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 433 >gi|319955584|ref|YP_004166851.1| von willebrand factor type a [Cellulophaga algicola DSM 14237] gi|319424244|gb|ADV51353.1| von Willebrand factor type A [Cellulophaga algicola DSM 14237] Length = 363 Score = 44.9 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 17/171 (9%), Positives = 67/171 (39%), Gaps = 12/171 (7%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + E ++ S ++ ++ + L+++ ++ + ++ + + + ++ Sbjct: 152 VNNVMPEVESESNKMAIYWFDGEDELHLLNDLTSSKEDLIASIDAITDDISNDPSTDLYG 211 Query: 284 AYRELYNEKESSHNTIGSTRLKKF--VIFITDGENSGASAYQNTLNTLQICEYMRNA--G 339 A + + E + + ++ TDG + + + + ++NA Sbjct: 212 AVIKSTEKAEKLIKETKNNDIIGAASIVIFTDGTDQASRYTE-----KAALDKVKNASLN 266 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + +++ + + ++L +S F + EL +F+ I+ + ++ Sbjct: 267 ISYFTIGLGSEI-DTEVLASIGKTSSVF--AGNKAELETTFNDISYLVSQR 314 >gi|300779171|ref|ZP_07089029.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] gi|300504681|gb|EFK35821.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] Length = 396 Score = 44.9 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 27/170 (15%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY-----NIGIVGNQCTPLSNNLNEVKS 264 ID N+VN++ K + I Y Q TPL+ +L+ V Sbjct: 70 IDQAKSRLWNIVNTLTTLKYNGKAPEIEIALYEYGNDGIRDENYIRQVTPLTQDLDLVSE 129 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 +L L + A + + ++ N + + K + + Sbjct: 130 KLFALRTNGGSEYCGA-------VIRDAAANLNWDSNDKSMKLIYIAGNEAFDQG----- 177 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEG-QDLLRKCTD-SSGQFFAVND 372 +N ++ +N + I ++ + EG Q + G++F ++ Sbjct: 178 KINYREVVSKAKNKNIYINTIFCGSREEGIQTFWQNGASLGGGKYFNIDS 227 >gi|150392228|ref|YP_001322277.1| hypothetical protein Amet_4546 [Alkaliphilus metalliredigens QYMF] gi|149952090|gb|ABR50618.1| conserved hypothetical protein [Alkaliphilus metalliredigens QYMF] Length = 245 Score = 44.9 bits (104), Expect = 0.020, Method: Composition-based stats. Identities = 14/104 (13%), Positives = 39/104 (37%), Gaps = 10/104 (9%) Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP----PEGQ 354 + + +I +TDGE++ + + + G+ I ++ + Sbjct: 2 KEQEAVIRQMILVTDGESNIGG------DPISKAKEACQRGIVINTIGIVDGKMQRENPL 55 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 D + + G + + +L E+ +T K ++++ N+ Sbjct: 56 DEVINIAKAGGGSYEFSYIDQLYETMQSLTYKTVNETLQSVVNK 99 >gi|172040750|ref|YP_001800464.1| hypothetical protein cur_1070 [Corynebacterium urealyticum DSM 7109] gi|171852054|emb|CAQ05030.1| hypothetical protein cu1070 [Corynebacterium urealyticum DSM 7109] Length = 413 Score = 44.9 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 20/178 (11%), Positives = 55/178 (30%), Gaps = 10/178 (5%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA-YNIGIVGNQCTPLSNNLNEVKSR 265 R++DV+ + + + + N + + Sbjct: 244 ARRLDVVRGALHSAFQRVGDNEGAVSLWNYSSQLSPGARTPYRVNVDLSARDGGAAAGAV 303 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 L++LN + ++ A++ + GS + ++ + G+N + + Sbjct: 304 LDQLNVGGGNHANVSISAAHKAAVDSA-----AAGSGKPAGRMVVVLAGKNQDELSVEQL 358 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 L +++ V + +L R + G+F+ D++ + I Sbjct: 359 KAQLAAANP----QVRVDIVGIGGDVAADELARIAEATGGKFYPARDAKAVDGVLKGI 412 >gi|269128868|ref|YP_003302238.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268313826|gb|ACZ00201.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 515 Score = 44.9 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 56/198 (28%), Gaps = 18/198 (9%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAG-NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + + S P + ++ VL ++A L + S ++ V Sbjct: 314 LDISGSMLQRVPGTGDTRMQVLAKAAQLGLKFQPDDTELGQWVFSTKLDGDRDWKETVPV 373 Query: 251 QCTP----LSNNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 ++ S L L P +T Y + A + + Sbjct: 374 GPLGERLGSGTRRQQILSSLASLQPKPDGDTGLYDTVLAALSYMRKTYKPDMVNT----- 428 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTD 362 V+ +TDG + TL + +++ + + ++L + Sbjct: 429 ---VLLMTDGR-NDDDDGPTLRQTLAKLRAGHDPERPVQLVIIGFGDEVDRKELQQLAEA 484 Query: 363 SSGQFFAVNDSRELLESF 380 + G + ++ F Sbjct: 485 TGGSVHFAKTAEDMRNIF 502 >gi|195033822|ref|XP_001988770.1| GH10400 [Drosophila grimshawi] gi|193904770|gb|EDW03637.1| GH10400 [Drosophila grimshawi] Length = 1180 Score = 44.9 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 33/335 (9%), Positives = 92/335 (27%), Gaps = 11/335 (3%) Query: 52 DPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK-AQ 110 + T+ K + K K + + + A + + +++ + Sbjct: 96 NLTSAKANFTYYSSKYSKFNGNSSEELEPGEKEYAFMYRDMTLNPDTHFYNISVDTEHSS 155 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 +P+ + + + + A+S + + Sbjct: 156 VHVPSNVWDRAPHVLKTIQWSEQLDEVFRQNYQSDPALSWQYFGSDTGILRHYP-ASLWS 214 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP-----APAPANRKIDVLIESAGNLVNSIQ 225 ++ K+S++ + T SK + + + V + +++++ Sbjct: 215 DSRANKLDADTYDCRKRSWYIETATCSKDIVILLDHSGSMTGHRNHVAKFTIRSILDTFS 274 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NPYENTNTYPAMHHA 284 N++ + + +L +P N A A Sbjct: 275 NNDF-FTIFRYSSEVEGIIPCFKNALVQATPENIDVFNTAIAELPDPEGYANLTLAYEQA 333 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 ++ L S ST + ++ +TDG + N ++I++ Sbjct: 334 FQILRTYYVSRRCNETSTC-NQAIMLVTDGVAGNTTDIFEKYNYGNGENGTSRMNVRIFT 392 Query: 345 VAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLE 378 + ++ + G + V E+ E Sbjct: 393 YLLGKEVTKVREIQWMACLNRGYYSHVQTLDEVHE 427 >gi|167829718|ref|ZP_02461189.1| hypothetical protein Bpseu9_38925 [Burkholderia pseudomallei 9] Length = 396 Score = 44.9 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 32/261 (12%), Positives = 82/261 (31%), Gaps = 5/261 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 4 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 63 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY--IAESKAQYEIPT 115 + +Q ++ S + + N ++ Sbjct: 64 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 123 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + L L + A ++S + + + AI + + +++ + +N + ++ Sbjct: 124 QTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 183 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + P +F S A + A + + G+ V + + Sbjct: 184 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGDKAATTNAYN 243 Query: 236 VRIGTIAYNIGIVGNQCTPLS 256 R G A + Sbjct: 244 TRFGIYANPYKNPSYGTPDFT 264 >gi|156743748|ref|YP_001433877.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] gi|156235076|gb|ABU59859.1| von Willebrand factor type A [Roseiflexus castenholzii DSM 13941] Length = 936 Score = 44.9 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 43/357 (12%), Positives = 90/357 (25%), Gaps = 41/357 (11%) Query: 53 PTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK----------NNPLQ 102 P ++ + + E + ++ + + Sbjct: 260 PEAPAGFNRYTAYLEVPNDARTQNNAIETFSYVRGTPRVLLVAQAPDDAISLERALRAAR 319 Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISIC-----MVLDVS 157 + IP L AL N+ + MV Sbjct: 320 IEVTVVSPASIPATFGELIRYDAIALINVPRALFSNETVQRIAAYVRDFGGGLLMVGGPQ 379 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + + + S A K+ + +E A Sbjct: 380 SFGPGGWRGTPVEAALPVTMDIPERQRQPPVSIVVVIDISGSMAATEDGIPKLSLALEGA 439 Query: 218 GNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV-KSRLNKLNPYE-NT 275 + ++ + T+ G PL + +V +LN++ Sbjct: 440 RRIAALLRDEDE---------LTVIPFDDRPGVIVGPLPGSRRDVAIEQLNQVRLGGSGI 490 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N + A+ A + T S R + +I ITDG ++ L I + Sbjct: 491 NIHDALRVA----------ARYTRASERPVRHIITITDGNDT-----TQQEGALDIVRSL 535 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G+ + SVA+ + G+ F + ++ + T I + Sbjct: 536 HDEGVTLTSVAIGQGDHVPFIRDMAAVGGGRTFLTERAADVPDLLTGETQTIMTPYI 592 >gi|284053489|ref|ZP_06383699.1| von Willebrand factor, type A [Arthrospira platensis str. Paraca] Length = 396 Score = 44.9 bits (104), Expect = 0.021, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 56/155 (36%), Gaps = 13/155 (8%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG---TIAYNIGIVGNQCTPLSNNLNE 261 R+ID +++ + I + K V G + N+ Sbjct: 89 SGKRRIDGALDATRRFLEQISDRGGDTKVAIVPFGKGGANCPGFEVTQRGINSKFFPAND 148 Query: 262 VK--SRLNKLNPYE---NTNTYPAMHHAYRELYNEKESS---HNTIGSTRLKKFVIFITD 313 +K + L+ L T+ Y + A R L N ++ G + VI ++D Sbjct: 149 IKQTNFLDYLAAQTLCAATDIYGPLSEAIRVLGNRQDPRFYVPEDSGRLEPRLSVILLSD 208 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 G ++ + Q+ N + + E RN + ++++ Sbjct: 209 GFHNQPNEQQDFDNLITLLE--RNNNIIVHTLGYG 241 >gi|325695689|gb|EGD37588.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK150] Length = 460 Score = 44.9 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 26/209 (12%), Positives = 63/209 (30%), Gaps = 49/209 (23%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR--- 286 N+SV + + V + L +++ +NKL+ TN + + Sbjct: 244 SIGNVSVNLVAFSTLGSYVQEDFSELDKGTTTIETSINKLDEGGYTNPGDGLRYGMVSLQ 303 Query: 287 ----------ELYNEKESSHNTIGSTRLKKFVI-------------------FITDGENS 317 L + ++ + + I T+ ++ Sbjct: 304 KNPAQLKYVVLLTDGVPNAFTAKTNDVSSRNRIKVYYPNGYYWIRFNNSTYDLTTNFTSN 363 Query: 318 GASAYQNTLNTLQICEYMRNA----------GMK-IYSVAVSAPPEG----QDLLRKCTD 362 A + + + + G+K + + S E QDL + + Sbjct: 364 EDYAAYDPYSQETLRKQSIEYSGKVSQTFGAGIKRVNVIGFSGREEDVRYGQDLTKAIKE 423 Query: 363 --SSGQFFAVNDSRELLESFDKITDKIQE 389 + ++ + + +L +F I +IQ+ Sbjct: 424 GKTDAEYTSAENEEKLQATFSDIKKQIQQ 452 >gi|269969412|sp|C7G0B5|PIF_PINFU RecName: Full=PIF; Contains: RecName: Full=Pif97; Contains: RecName: Full=Pif80; AltName: Full=Aragonite-binding protein; Flags: Precursor gi|256252203|dbj|BAH97338.1| Pif177 [Pinctada fucata] Length = 1007 Score = 44.9 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 20/161 (12%), Positives = 55/161 (34%), Gaps = 17/161 (10%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYREL 288 + +R+G + Y + PL + ++ + + P + + M A R Sbjct: 60 SIDDNQIRLGMVTYGSEVC--DSIPLQGDRLDLARTIRYMKKPTGPSKPFKGMGEARRMF 117 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + G + + + G + + + R+ +K+ ++ + Sbjct: 118 SS--------RGRYNVPHITMNLG-----GDIVDTEVKDLMDETDKARDEDIKVMAIGLG 164 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 A +D + Q + ++D +L+ +I D + + Sbjct: 165 AKV-DRDEIESIAYDRDQAYFMDDEDDLIRKVKEIPDYLCK 204 >gi|260793444|ref|XP_002591722.1| hypothetical protein BRAFLDRAFT_80817 [Branchiostoma floridae] gi|229276931|gb|EEN47733.1| hypothetical protein BRAFLDRAFT_80817 [Branchiostoma floridae] Length = 987 Score = 44.9 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 21/145 (14%), Positives = 44/145 (30%), Gaps = 17/145 (11%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + + ++ + E+ + T + A L Sbjct: 350 MVKFHQWATRLLDLTEIATEEDRQEIADAVPN-EASGGTCIGCGLTEALEVLSMN----- 403 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 G+ VI ++DG+ S + T+ T + AG+ ++SV + + Sbjct: 404 ---GADPAGGIVIILSDGDESLNVSPNLTVATQH----LVAAGVTVHSVTY-SSSADTRM 455 Query: 357 LRKCTDSSGQFFA---VNDSRELLE 378 + G+ F +S L E Sbjct: 456 EEVAASTHGRAFFYSGAANSNSLEE 480 >gi|182412149|ref|YP_001817215.1| von Willebrand factor type A [Opitutus terrae PB90-1] gi|177839363|gb|ACB73615.1| von Willebrand factor type A [Opitutus terrae PB90-1] Length = 859 Score = 44.9 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 25/136 (18%), Positives = 44/136 (32%), Gaps = 20/136 (14%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 TN A+ A ++ + ++ ITDG + +A L T Sbjct: 586 TGGTNLEAALSLAGELARRHHNAAA--------QNRIVLITDGAANLGNADPAQLATR-- 635 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV----NDSR-----ELLESFDK 382 E +R G+ + V +L T + V ++ +L +F Sbjct: 636 IETLRQQGIAFDACGVGTDGLDDAVLEALTRKGDGRYYVLDAPENADAGFARQLAGAFRP 695 Query: 383 ITDKIQEQSVRIAPNR 398 + I + VR P R Sbjct: 696 AAENI-KVQVRFNPAR 710 >gi|118763608|gb|AAI28261.1| VIT protein [Homo sapiens] Length = 657 Score = 44.9 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 39/120 (32%), Gaps = 14/120 (11%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 311 MGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 368 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 369 ---NRSGAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 417 >gi|317122048|ref|YP_004102051.1| hypothetical protein Tmar_1211 [Thermaerobacter marianensis DSM 12885] gi|315592028|gb|ADU51324.1| Protein of unknown function DUF2134, membrane [Thermaerobacter marianensis DSM 12885] Length = 344 Score = 44.9 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 11/64 (17%), Positives = 21/64 (32%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI ++ +D + R ++Q+A D A L+G + D Sbjct: 62 LVAICLAALMAMAGLVVDGGRLYTERARLQAAADVAALAGATELPEDPAAARQVALDYLA 121 Query: 61 STIF 64 Sbjct: 122 RNGV 125 >gi|284173928|ref|ZP_06387897.1| hypothetical protein Ssol98_04615 [Sulfolobus solfataricus 98/2] gi|261602665|gb|ACX92268.1| von Willebrand factor type A [Sulfolobus solfataricus 98/2] Length = 356 Score = 44.9 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 17/97 (17%), Positives = 41/97 (42%), Gaps = 6/97 (6%) Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 L+ + N +++ +I +TDG+ + + N ++ +I ++ Sbjct: 107 TRLHEAVNFALNLAKQSQVPNKIIMLTDGKPT------DKRNVKDYEKFDIPPNTQIITI 160 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + L + SSG+F+ + D EL + F+ Sbjct: 161 GIGSDYNERILKKLADKSSGKFYHLKDISELPDVFES 197 >gi|73980138|ref|XP_540147.2| PREDICTED: similar to vitrin isoform 1 [Canis familiaris] Length = 649 Score = 44.9 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 26/281 (9%), Positives = 71/281 (25%), Gaps = 38/281 (13%) Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLA--ISICMV 153 +P E+ Y+ P+ + + S + +S Sbjct: 147 SSKSPTAKTGEATKAYQKPSVPGTTAQPVTLMQVTGATASEASHTSLPKPSPSAVSTTSS 206 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSN------KYLLPPPPKKSFWSKNTTKSKYAPA----- 202 L + ++ S P + S S Sbjct: 207 LRSQPMGHRSWELDLWSTSSDASGQNSPRANAGFVPKEELSTQSLEPVSQGDPNCKIDLS 266 Query: 203 ------PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 R+ + + ++ ++ +G + Y Sbjct: 267 FLIDGSSGIGKRRFRIQKQFLADVAQTLDIGPAGPL-----MGVVQYGDNPAAQFNLRTH 321 Query: 257 NNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 N ++K+ + K+ +N A+ + + +++ + + + I DG Sbjct: 322 MNSRDLKTAIEKITQRGGLSNAGRAISYVTKNFFSKANGNRGGAPN-----VAVVIVDGW 376 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + + + R +G+ I+ + + E + Sbjct: 377 PTD--------KVEEASRFARESGINIFFITIEGATENEKQ 409 >gi|156404155|ref|XP_001640273.1| predicted protein [Nematostella vectensis] gi|156227406|gb|EDO48210.1| predicted protein [Nematostella vectensis] Length = 1128 Score = 44.9 bits (104), Expect = 0.022, Method: Composition-based stats. Identities = 27/193 (13%), Positives = 67/193 (34%), Gaps = 18/193 (9%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQE------KKNLSVRIGTIAYNIGIVGNQC 252 + + ++ + E+A +++++ + V++ + Sbjct: 217 VDYSGSMGGSRLPIAKEAAKTVLDTLNPRDRVAFLAFESGVRRVKVTSGDAKDEKCFESS 276 Query: 253 TPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 ++ N++ +K L+ T A + A+ L + +T + ++F Sbjct: 277 LAKASPVNIDILKKFLDGEYASGGTMYAIAFNAAFDILDK-----YYKEKNTTRRPVILF 331 Query: 311 ITDGENSGA-SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD---SSGQ 366 +TDG + NT+ T + + + + P G DLL+ + G Sbjct: 332 MTDGAPNDDPGTILNTVKTRNQGLSTKAD-ILTFGMGGGISPAGVDLLQSLAEQTLDGGA 390 Query: 367 FFAVNDSRELLES 379 F V+ + L + Sbjct: 391 RFEVSLTTALRDV 403 >gi|261854814|ref|YP_003262097.1| von Willebrand factor A [Halothiobacillus neapolitanus c2] gi|261835283|gb|ACX95050.1| von Willebrand factor type A [Halothiobacillus neapolitanus c2] Length = 339 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 26/156 (16%), Positives = 56/156 (35%), Gaps = 20/156 (12%) Query: 251 QCTPLSNNLNEVKSRLNKLNPYEN---TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 P++ + + + +L N T + A R + + ++ Sbjct: 165 TLVPMTTDHAVLDYWIRQLRAGINGSDTALGDGLAMAIRSIAAQSQAGQ-------PAPL 217 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV----SAPPEGQDLLRKCTD- 362 ++ TDG ++G + R G+K+++V + S P +GQ L + D Sbjct: 218 LVVWTDGFSTGGLMT-----PAEALALARAYGIKLFTVNLAPKGSPPDQGQPSLAQLADL 272 Query: 363 SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + G+ +D + D+I + +S R Sbjct: 273 TGGKPILASDLAAMNAVTDQIAASVAPKSATPTERR 308 >gi|167725250|ref|ZP_02408486.1| hypothetical protein BpseD_39891 [Burkholderia pseudomallei DM98] Length = 396 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 32/261 (12%), Positives = 82/261 (31%), Gaps = 5/261 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F+ A+DL + R+++Q++ DA L+ + + P Sbjct: 4 LVALMLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITA 63 Query: 61 ---STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY--IAESKAQYEIPT 115 + +Q ++ S + + N ++ Sbjct: 64 GHLNYALFEQFPVQMQTNSNVTFSDSLSNPFQPKNAIASPSSIKYVKCTTSRTGIVNWFI 123 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + L L + A ++S + + + AI + + +++ + +N + ++ Sbjct: 124 QTLNLVPGVTVANASVSATAVATVGAAQTTCAIPVFICKAGTQTSPPVAGATYNIGDWLS 183 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + P +F S A + A + + G+ V + + Sbjct: 184 AKTGSPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGNKAATTNAYN 243 Query: 236 VRIGTIAYNIGIVGNQCTPLS 256 R G A + Sbjct: 244 TRFGIYANPYKNPSYGTPDFT 264 >gi|332237901|ref|XP_003268145.1| PREDICTED: integrin alpha-10 isoform 3 [Nomascus leucogenys] Length = 1032 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 64/215 (29%), Gaps = 25/215 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + K + + +++G + Y V Sbjct: 28 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHEWSLG 85 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 86 DFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLLVVVT 140 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 141 DGESHDGE---QLPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDP 194 Query: 365 G--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 195 DERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 229 >gi|332237897|ref|XP_003268143.1| PREDICTED: integrin alpha-10 isoform 1 [Nomascus leucogenys] Length = 1175 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 64/215 (29%), Gaps = 25/215 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + K + + +++G + Y V Sbjct: 171 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHEWSLG 228 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 229 DFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLLVVVT 283 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 284 DGESHDGE---QLPAALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDP 337 Query: 365 G--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 338 DERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 372 >gi|160874283|ref|YP_001553599.1| type IV pilin biogenesis protein [Shewanella baltica OS195] gi|160859805|gb|ABX48339.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS195] gi|315266516|gb|ADT93369.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS678] Length = 1168 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 37/302 (12%), Positives = 81/302 (26%), Gaps = 71/302 (23%) Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 +S S + N ++ SK P ++++ + Sbjct: 195 ISSSPSSTWADALAAAKNTDFGVGQPVTFYADNYLRWY-WLSKAGKLPTVKVSRLEIAKK 253 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV-GNQCTPLSNNL-NEVKSRLNKLNPYE 273 + N+++S + G + T ++++ + + ++ L Sbjct: 254 AISNIISSTPTVDFGLAVFNYNYPNEGNRDGGRIVSGITQMTDSTRASLLTTIDNLLAKT 313 Query: 274 NTNTYPAMHHAYREL------YNEKESSHNTIGSTRLKK--------------------- 306 NT M+ AYR Y ++ + + + Sbjct: 314 NTPLCETMYEAYRYFAGKGVKYGHGDTDYGSYVGNKPPYDSLVEKGGSYESPFKVCTDIA 373 Query: 307 FVIFITDGENSGASAYQNT---------------------------------------LN 327 +VI++TDG + + N +N Sbjct: 374 YVIYVTDGAPTVDKSANNDVISLTSTGSKDGDYSSFSKNLDTASYLPALASYMFNNDLIN 433 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVNDSRELLESFDKITD 385 L + ++ Y++ S E L G +FA +S EL + + Sbjct: 434 KLDSSNTEQMQNVRTYTIGFSKGAEDAAPLLAETAKRGGGLYFAAQNSIELQNALNDALS 493 Query: 386 KI 387 I Sbjct: 494 NI 495 >gi|329663456|ref|NP_001192519.1| integrin alpha-10 [Bos taurus] gi|297472754|ref|XP_002686129.1| PREDICTED: integrin, alpha 10-like [Bos taurus] gi|296489504|gb|DAA31617.1| integrin, alpha 10-like [Bos taurus] Length = 1167 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 62/205 (30%), Gaps = 25/205 (12%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + + + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGRLFIDPEQ--IQVGLVQYGESSVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 221 DFRTKEEVVRAARNLSRREGRETKTAQAIMMACTEGFSQSRGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ LQ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELPT---ALQACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKI 387 FF V D L + D + D+I Sbjct: 330 DEKFFFNVTDEAALTDIVDALGDRI 354 >gi|118346233|ref|XP_977011.1| hypothetical protein TTHERM_00035110 [Tetrahymena thermophila] gi|89288362|gb|EAR86350.1| hypothetical protein TTHERM_00035110 [Tetrahymena thermophila SB210] Length = 603 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 31/268 (11%), Positives = 80/268 (29%), Gaps = 21/268 (7%) Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 +I + I+ + N IS ++LD S SM+ ++ + + + L P Sbjct: 53 MISIRGQSKLASVQSKIQSEASNKGISYLILLDRSESMQVNQKIQNAKKSVIELIQNLTP 112 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI----QKAIQEKKNLSVRI 238 P + + ++ +I + I ++ Sbjct: 113 YDRFCL-----IPFGGSNGVAIPFTDSNSINKQETFEIIQNIVCKGKTDIVSVIQTAINT 167 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 Q + + ++ +N + N L K+ + Sbjct: 168 IKQEQIHQNTIKQEFEKTTKKS-LQQSINSSLTRVSRNINVD---ELELLNMSKKHQNKN 223 Query: 299 IGSTRLKKFV--IFITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQ 354 + + + + ++DGE++ +I E ++N I +G Sbjct: 224 SSNQIVDRTYCFVLLSDGEDN----IHQNYALQRIRECIKNETLNYSINCFGFGIEHDGN 279 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDK 382 L + GQ++ + ++ + + + Sbjct: 280 LLSSIAQLTGGQYYYIKENESIYDYLKE 307 >gi|297265790|ref|XP_001107629.2| PREDICTED: vitrin-like isoform 1 [Macaca mulatta] Length = 678 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 53/163 (32%), Gaps = 23/163 (14%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 332 MGVVQYGDNPATHFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 389 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + V E + Sbjct: 390 ---NRSSAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITVEGAAENEKQ 438 Query: 357 L--------RKCTDSSGQF-FAVNDSRELLESFDKITDKIQEQ 390 + ++G + F V L ++ + ++ + Sbjct: 439 YVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDT 481 >gi|229578924|ref|YP_002837322.1| von Willebrand factor A [Sulfolobus islandicus Y.G.57.14] gi|284997528|ref|YP_003419295.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] gi|228009638|gb|ACP45400.1| von Willebrand factor type A [Sulfolobus islandicus Y.G.57.14] gi|284445423|gb|ADB86925.1| von Willebrand factor, type A [Sulfolobus islandicus L.D.8.5] Length = 356 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 41/97 (42%), Gaps = 8/97 (8%) Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 L+ + N +++ +I +TDG+ + + N + +I ++ Sbjct: 107 TRLHEAVSFTINLAKQSQVPTKIIMLTDGKPT------DKRNVKDYEKLDIPPNTQIITI 160 Query: 346 AVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFD 381 + + +L+K D SSG+F+ + D EL F+ Sbjct: 161 GIG-NNYNERILKKLADRSSGKFYHIKDISELPNIFE 196 >gi|171695786|ref|XP_001912817.1| hypothetical protein [Podospora anserina S mat+] gi|170948135|emb|CAP60299.1| unnamed protein product [Podospora anserina S mat+] Length = 345 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 18/120 (15%), Positives = 43/120 (35%), Gaps = 13/120 (10%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + + ++ + T + A EL + +TDG + Sbjct: 53 DPSGAGATIDGIPANGGTFIGGGISAAVDELTRSGNKPDAAG--------IFVLTDGADD 104 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 S +T++++ + AG+++ ++V A + + S G F VN + + Sbjct: 105 PPSLISDTIDSI---NRAQQAGIRVSFGFLSVDAEEQDSRITSAILSSGGTFTTVNTAED 161 >gi|115535038|ref|NP_509469.2| hypothetical protein K09E2.1 [Caenorhabditis elegans] gi|90568060|gb|AAC46572.2| Hypothetical protein K09E2.1 [Caenorhabditis elegans] Length = 915 Score = 44.9 bits (104), Expect = 0.023, Method: Composition-based stats. Identities = 27/145 (18%), Positives = 58/145 (40%), Gaps = 16/145 (11%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKE 293 VR+G + Y+ + N++ + + L E+T T A++ A E+++ Sbjct: 70 VRVGLVQYSESAKTEFNLSKYSERNDIIAHMETLTFMQVEDTRTGVALNKADEEIFDFNG 129 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + + +I TDG + + + +R G+KIY+++V++ Sbjct: 130 GARLKAT-----RLIIIFTDGL--------SMDKPSKAAKALRRKGVKIYTISVNSIGFI 176 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLE 378 ++L + F ND + E Sbjct: 177 PEML-GIVGDADNVFGPNDEERIEE 200 >gi|222625100|gb|EEE59232.1| hypothetical protein OsJ_11214 [Oryza sativa Japonica Group] Length = 718 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/262 (8%), Positives = 64/262 (24%), Gaps = 23/262 (8%) Query: 136 TGIIERSSENLAISICMVL---DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSK 192 + + N + + D+S M + ++ L P + Sbjct: 239 HEAVVENWSNKDFTFAYSVYSGDLSGGMLVQPSTSDDYDDRDMF-CIFLLPGNNQKRKVF 297 Query: 193 NTTKSKYAPAPAPANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 K ++ + + ++ + + ++ +++ + Sbjct: 298 RNASVFIIDTSGSMQGKPLESVKNAMYTTLSELVQGDY-FNIITFNDELHSFSSCLEQVN 356 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + N V + T+ + A L N + + + Sbjct: 357 EKTIENAREWVNT---NFIAEGGTDIMHPLSEAIALLSNSHNALPQ----------IFLV 403 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDG T+ ++ +I + + + LR + Sbjct: 404 TDGSVEDERNICRTVKEQLATRGSKSP--RISTFGLGSYCN-HYFLRMLASIGKGHYDAA 460 Query: 372 -DSRELLESFDKITDKIQEQSV 392 D+ + + K V Sbjct: 461 FDTGSIEGRMVQWFQKASSTIV 482 >gi|198426249|ref|XP_002120426.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 1937 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/187 (12%), Positives = 60/187 (32%), Gaps = 19/187 (10%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + +++ ++NS Q++ + +R + + + Sbjct: 98 SSSSVGSENWNIMKNFVRTIINSFQRSATSTQISVLRYNRV--VDTSTQILLNEYLTDQS 155 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + +++ T T A+ + + ++ +I +TDG Sbjct: 156 GFLAAYDRIPYNGGGTLTGNALRYVNDVILTGANG-----DRPGVRDVLITLTDGRAHD- 209 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSA--PPEGQDLLRKCTDSSGQFFAVN-DSREL 376 N L +R G++ Y V + + + L + + S + F + L Sbjct: 210 -------NVLAPSRMLRAKGVETYVVGIQSRLGALRESQLLEISGSRDRMFILTAGFASL 262 Query: 377 LESFDKI 383 SF + Sbjct: 263 SRSFANM 269 >gi|108708732|gb|ABF96527.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] gi|218193020|gb|EEC75447.1| hypothetical protein OsI_11983 [Oryza sativa Indica Group] Length = 751 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/262 (8%), Positives = 64/262 (24%), Gaps = 23/262 (8%) Query: 136 TGIIERSSENLAISICMVL---DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSK 192 + + N + + D+S M + ++ L P + Sbjct: 272 HEAVVENWSNKDFTFAYSVYSGDLSGGMLVQPSTSDDYDDRDMF-CIFLLPGNNQKRKVF 330 Query: 193 NTTKSKYAPAPAPANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 K ++ + + ++ + + ++ +++ + Sbjct: 331 RNASVFIIDTSGSMQGKPLESVKNAMYTTLSELVQGDY-FNIITFNDELHSFSSCLEQVN 389 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + N V + T+ + A L N + + + Sbjct: 390 EKTIENAREWVNT---NFIAEGGTDIMHPLSEAIALLSNSHNALPQ----------IFLV 436 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDG T+ ++ +I + + + LR + Sbjct: 437 TDGSVEDERNICRTVKEQLATRGSKSP--RISTFGLGSYCN-HYFLRMLASIGKGHYDAA 493 Query: 372 -DSRELLESFDKITDKIQEQSV 392 D+ + + K V Sbjct: 494 FDTGSIEGRMVQWFQKASSTIV 515 >gi|108708734|gb|ABF96529.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] Length = 680 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/262 (8%), Positives = 64/262 (24%), Gaps = 23/262 (8%) Query: 136 TGIIERSSENLAISICMVL---DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSK 192 + + N + + D+S M + ++ L P + Sbjct: 272 HEAVVENWSNKDFTFAYSVYSGDLSGGMLVQPSTSDDYDDRDMF-CIFLLPGNNQKRKVF 330 Query: 193 NTTKSKYAPAPAPANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 K ++ + + ++ + + ++ +++ + Sbjct: 331 RNASVFIIDTSGSMQGKPLESVKNAMYTTLSELVQGDY-FNIITFNDELHSFSSCLEQVN 389 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + N V + T+ + A L N + + + Sbjct: 390 EKTIENAREWVNT---NFIAEGGTDIMHPLSEAIALLSNSHNALPQ----------IFLV 436 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDG T+ ++ +I + + + LR + Sbjct: 437 TDGSVEDERNICRTVKEQLATRGSKSP--RISTFGLGSYCN-HYFLRMLASIGKGHYDAA 493 Query: 372 -DSRELLESFDKITDKIQEQSV 392 D+ + + K V Sbjct: 494 FDTGSIEGRMVQWFQKASSTIV 515 >gi|108708735|gb|ABF96530.1| von Willebrand factor type A domain containing protein, expressed [Oryza sativa Japonica Group] Length = 614 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/262 (8%), Positives = 64/262 (24%), Gaps = 23/262 (8%) Query: 136 TGIIERSSENLAISICMVL---DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSK 192 + + N + + D+S M + ++ L P + Sbjct: 272 HEAVVENWSNKDFTFAYSVYSGDLSGGMLVQPSTSDDYDDRDMF-CIFLLPGNNQKRKVF 330 Query: 193 NTTKSKYAPAPAPANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 K ++ + + ++ + + ++ +++ + Sbjct: 331 RNASVFIIDTSGSMQGKPLESVKNAMYTTLSELVQGDY-FNIITFNDELHSFSSCLEQVN 389 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + N V + T+ + A L N + + + Sbjct: 390 EKTIENAREWVNT---NFIAEGGTDIMHPLSEAIALLSNSHNALPQ----------IFLV 436 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDG T+ ++ +I + + + LR + Sbjct: 437 TDGSVEDERNICRTVKEQLATRGSKSP--RISTFGLGSYCN-HYFLRMLASIGKGHYDAA 493 Query: 372 -DSRELLESFDKITDKIQEQSV 392 D+ + + K V Sbjct: 494 FDTGSIEGRMVQWFQKASSTIV 515 >gi|53370749|gb|AAU89244.1| von Willebrand factor type A domain containing protein [Oryza sativa Japonica Group] Length = 801 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 23/262 (8%), Positives = 64/262 (24%), Gaps = 23/262 (8%) Query: 136 TGIIERSSENLAISICMVL---DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSK 192 + + N + + D+S M + ++ L P + Sbjct: 272 HEAVVENWSNKDFTFAYSVYSGDLSGGMLVQPSTSDDYDDRDMF-CIFLLPGNNQKRKVF 330 Query: 193 NTTKSKYAPAPAPANRK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 K ++ + + ++ + + ++ +++ + Sbjct: 331 RNASVFIIDTSGSMQGKPLESVKNAMYTTLSELVQGDY-FNIITFNDELHSFSSCLEQVN 389 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + N V + T+ + A L N + + + Sbjct: 390 EKTIENAREWVNT---NFIAEGGTDIMHPLSEAIALLSNSHNALPQ----------IFLV 436 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 TDG T+ ++ +I + + + LR + Sbjct: 437 TDGSVEDERNICRTVKEQLATRGSKSP--RISTFGLGSYCN-HYFLRMLASIGKGHYDAA 493 Query: 372 -DSRELLESFDKITDKIQEQSV 392 D+ + + K V Sbjct: 494 FDTGSIEGRMVQWFQKASSTIV 515 >gi|310657503|ref|YP_003935224.1| hypothetical protein CLOST_0189 [Clostridium sticklandii DSM 519] gi|308824281|emb|CBH20319.1| exported protein of unknown function [Clostridium sticklandii] Length = 466 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 36/325 (11%), Positives = 92/325 (28%), Gaps = 60/325 (18%) Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + ++++S S + + Sbjct: 136 FTLQGYIDGKTDPKMTIKSEIEAFNSVVVEDGGSPSYPATALAYRTDDTPGPQTKAGKKV 195 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 + + + N + + + +I +L A L++ + + + Sbjct: 196 VAAITLVLDESGSMANNMSGGRVTSSNP---SRISILRNRAKALIDQF--SGLGNIYVGI 250 Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + I + N+N +K++++ L TNT AM +Y + + S Sbjct: 251 IPYSDDAYISGTKSFVLANGTNVNTIKNKIDSLTAQGMTNTGDAMRVSY-YATKQFKDSP 309 Query: 297 NTIGSTRLK-----KFVIFITDGENSGAS-----------------AYQNTLNTLQICEY 334 N+I +T ++I ++DG+ + S + + I + Sbjct: 310 NSIDNTLPTDTKVIPYMILLSDGDPTVFSATTREWVGWWIFGHYEYSGYKQDSGYVISDR 369 Query: 335 MRNAG----------------------------MKIYSVAVSAPPEGQDLLRKCTDS-SG 365 + + +K + + S + + + Sbjct: 370 ISDDSESKNTSQWSSMGYSEYIGQNLVVGGDLEIKSHVIGFSNDATNA---QTIANYINA 426 Query: 366 QFFAVNDSRELLESFDKITDKIQEQ 390 ++ + N EL FD+I I ++ Sbjct: 427 EYHSANSDVELEAVFDEIGGTILKE 451 >gi|330834711|ref|YP_004409439.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] gi|329566850|gb|AEB94955.1| von Willebrand factor, type A [Metallosphaera cuprina Ar-4] Length = 379 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 18/129 (13%), Positives = 48/129 (37%), Gaps = 20/129 (15%) Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA--SAYQ 323 ++++ T+ Y A+ A + ++ + I +TDG + + Sbjct: 98 IDEIKAGSGTSLYKALEEASKLAERYRQP-----------SYFILLTDGVPTDRGCTHGL 146 Query: 324 NTLNTLQICEYMRN-----AGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELL 377 + L+ C + +++ S + + +L ++ G F+ + D +E++ Sbjct: 147 SRKFDLERCLPVYQGLSLPHNVQVISFGIGQD-YNEKILSLISEKGNGFFYHIKDPKEIV 205 Query: 378 ESFDKITDK 386 E K+ Sbjct: 206 EKMPKLAKS 214 >gi|297265794|ref|XP_001107688.2| PREDICTED: vitrin-like isoform 2 [Macaca mulatta] Length = 656 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 53/163 (32%), Gaps = 23/163 (14%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 310 MGVVQYGDNPATHFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 367 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + V E + Sbjct: 368 ---NRSSAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITVEGAAENEKQ 416 Query: 357 L--------RKCTDSSGQF-FAVNDSRELLESFDKITDKIQEQ 390 + ++G + F V L ++ + ++ + Sbjct: 417 YVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDT 459 >gi|1510110|dbj|BAA09092.1| ubiquitin-conjugating enzyme [Paramecium tetraurelia] Length = 425 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 50/149 (33%), Gaps = 14/149 (9%) Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNI-GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 VN+ A +K + +N+ N ++ +P T Y Sbjct: 17 VNAFFSAFADKTLAFEFNHIVKLVWFESFITDKCDFTNDFNNFIKLVDDASPRGGTKCYD 76 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A+ +A +L + +I +TDG+++ Q+ N + + Sbjct: 77 AIAYAIEQLKE------IKKKYPNIILRIIALTDGDDN-----QSKENPQSLVNRIFENQ 125 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 + I S V+ G L ++G+ + Sbjct: 126 IIIDSFVVNNDCVGLKTLTHA--TNGRCY 152 >gi|291569126|dbj|BAI91398.1| hypothetical protein [Arthrospira platensis NIES-39] Length = 396 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 27/164 (16%), Positives = 56/164 (34%), Gaps = 13/164 (7%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG---TIAYNIGIVGNQC 252 R+ID +++ + I + K V G + Sbjct: 80 SGSMNELDTSGKRRIDGALDATRRFLEQISDRGGDTKVAIVPFGEGGRNCPGFEVTQRGI 139 Query: 253 TPLSNNLNEVK--SRLNKLNPYE---NTNTYPAMHHAYRELYNEKESS---HNTIGSTRL 304 N++K + L+ L T+ Y + A R L N ++ G Sbjct: 140 NSKFFPANDIKQTNFLDYLAAQTLCAATDIYGPLSEAIRVLGNRQDPRFYVPEDSGRLEP 199 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + VI ++DG ++ + Q+ N + + E RN + ++++ Sbjct: 200 RLSVILLSDGFHNQPNEQQDFDNLITLLE--RNNNIIVHTLGYG 241 >gi|127512039|ref|YP_001093236.1| type IV pilin biogenesis protein, putative [Shewanella loihica PV-4] gi|126637334|gb|ABO22977.1| type IV pilin biogenesis protein, putative [Shewanella loihica PV-4] Length = 1182 Score = 44.9 bits (104), Expect = 0.024, Method: Composition-based stats. Identities = 19/210 (9%), Positives = 52/210 (24%), Gaps = 33/210 (15%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 Y + + G+ L+ T + + Sbjct: 339 YEAYQYFSGGPVTFGNDDKNAQGSYKIDGYIPNSPPSILTGGNYTTPFKKCPDTAYIIYI 398 Query: 256 SNNL----NEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 ++ + S + L NT Y +S ++ L + Sbjct: 399 TDGAPTIDSNADSAIISLASSAENTANYSGFDFV---------NSWGDTETSYLPALAAY 449 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSSGQ--- 366 + + + N ++ ++++++ S + LL + G Sbjct: 450 MYNNDMVKGVKDANGIDNK--------QNVRLFTIGFSEGADVAAKLLEEAAFRGGNPRD 501 Query: 367 -------FFAVNDSRELLESFDKITDKIQE 389 ++ + +L+ + D I Sbjct: 502 DSGVSKGYYVAKNGLDLVAAMDDALKSILS 531 >gi|326927888|ref|XP_003210120.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like, partial [Meleagris gallopavo] Length = 1069 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 58/177 (32%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 249 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 307 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + +I S + ++ ITDG N Sbjct: 308 LFAKGIGMLDIALNEAFNMLNEFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 363 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 364 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 411 >gi|304412583|ref|ZP_07394188.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS183] gi|307303599|ref|ZP_07583352.1| type IV pilin biogenesis protein, putative [Shewanella baltica BA175] gi|304349059|gb|EFM13472.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS183] gi|306912497|gb|EFN42920.1| type IV pilin biogenesis protein, putative [Shewanella baltica BA175] Length = 1168 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 37/301 (12%), Positives = 80/301 (26%), Gaps = 71/301 (23%) Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 S S + NN ++ SK P ++++ ++ Sbjct: 196 SSSPGTSWADALAAANNTDFGVGQPVTLYTDNYLRWY-GLSKAGKLPTVKVSRLEIAKKA 254 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV-GNQCTPLSNNL-NEVKSRLNKLNPYEN 274 N+++S + G + T ++++ + + ++ L N Sbjct: 255 ISNIISSTPTVDFGLAVFNYNYPNEGNRDGGRIVSGITQMTDSTRASLLTTIDNLLAKTN 314 Query: 275 TNTYPAMHHAYREL------YNEKESSHNTIGSTRLKK---------------------F 307 T M+ AYR Y ++ + + + + Sbjct: 315 TPLCETMYEAYRYFAGKGVKYGHGDTDYGSYVGNKPPYDSLVEKGGSYESPFKVCTDIAY 374 Query: 308 VIFITDGENSGASAYQNTLNTLQIC----------------------------------- 332 VI++TDG + N + +L Sbjct: 375 VIYVTDGTPTVDKNANNDVISLTASGSKEGNYSSFSKNLDTASYLPALASYMFNNDLINK 434 Query: 333 ----EYMRNAGMKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVNDSRELLESFDKITDK 386 + ++ Y++ S E L G +FA +S EL + + Sbjct: 435 LDSSNTEQMQNVRTYTIGFSKGAEDAAPLLAETAKRGGGLYFAAQNSLELQNALNDALSN 494 Query: 387 I 387 I Sbjct: 495 I 495 >gi|118096863|ref|XP_414338.2| PREDICTED: similar to voltage-gated calcium channel alpha(2)delta-3 subunit [Gallus gallus] Length = 1090 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 58/177 (32%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 270 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 328 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + +I S + ++ ITDG N Sbjct: 329 LFAKGIGMLDIALNEAFNMLNEFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 384 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 385 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 432 >gi|290474200|ref|YP_003467077.1| hypothetical protein XBJ1_1152 [Xenorhabdus bovienii SS-2004] gi|289173510|emb|CBJ80290.1| conserved hypothetical protein [Xenorhabdus bovienii SS-2004] Length = 201 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 21/168 (12%), Positives = 53/168 (31%), Gaps = 25/168 (14%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNT 277 L K I +K + I +A + P S +K ++++ P T Sbjct: 43 TLAKKSAKKIIDKLPSDMNISLVAAFDCRHVSASLPFSPTQRPVLKRAIDRITPDGKTPL 102 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 A+ A + ++ I+DG+ + + + ++ Sbjct: 103 ALALEKAGALV-----------DGVNRDAIILLISDGDETCGG------DPCAVARALKE 145 Query: 338 AG--MKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDK 382 +++ V + G + G+ F N++ + + ++ Sbjct: 146 NKPRLQVNVVDIMNSGAGN----CIASQTGGKVFTANNANQFNQVINQ 189 >gi|217974384|ref|YP_002359135.1| type IV pilin biogenesis protein [Shewanella baltica OS223] gi|217499519|gb|ACK47712.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS223] Length = 1168 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 37/301 (12%), Positives = 80/301 (26%), Gaps = 71/301 (23%) Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 S S + NN ++ SK P ++++ ++ Sbjct: 196 SSSPGTSWADALAAANNTDFGVGQPVTLYTDNYLRWY-GLSKAGKLPTVKVSRLEIAKKA 254 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV-GNQCTPLSNNL-NEVKSRLNKLNPYEN 274 N+++S + G + T ++++ + + ++ L N Sbjct: 255 ISNIISSTPTVDFGLAVFNYNYPNEGNRDGGRIVSGITQMTDSTRASLLTTIDNLLAKTN 314 Query: 275 TNTYPAMHHAYREL------YNEKESSHNTIGSTRLKK---------------------F 307 T M+ AYR Y ++ + + + + Sbjct: 315 TPLCETMYEAYRYFAGKGVKYGHGDTDYGSYVGNKPPYDSLVEKGGSYESPFKVCTDIAY 374 Query: 308 VIFITDGENSGASAYQNTLNTLQIC----------------------------------- 332 VI++TDG + N + +L Sbjct: 375 VIYVTDGTPTVDKNANNDVISLTASGSKEGNYSSFSKNLDTASYLPALASYMFNNDLINK 434 Query: 333 ----EYMRNAGMKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVNDSRELLESFDKITDK 386 + ++ Y++ S E L G +FA +S EL + + Sbjct: 435 LDSSNTEQMQNVRTYTIGFSKGAEDAAPLLAETAKRGGGLYFAAQNSLELQNALNDALSN 494 Query: 387 I 387 I Sbjct: 495 I 495 >gi|332809817|ref|XP_003308326.1| PREDICTED: integrin alpha-10 [Pan troglodytes] Length = 1024 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 64/215 (29%), Gaps = 25/215 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + K + + +++G + Y V Sbjct: 20 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHEWSLG 77 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 78 DFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLLVVVT 132 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 133 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDP 186 Query: 365 G--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 187 DERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 221 >gi|220672885|emb|CAX13868.1| novel protein similar to vertebrate vitrin (VIT) [Danio rerio] Length = 761 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 15/121 (12%), Positives = 43/121 (35%), Gaps = 14/121 (11%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESS 295 +G I Y V +N ++K ++K+ ++ A+ + ++ +++ + Sbjct: 414 MMGIIQYGDDPVTEFSLKQFSNSKDLKPAISKIVQKGGPSHVGKALSYINKQFFSDANGN 473 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 + + + DG + + R +G+ I+ V + P + + Sbjct: 474 RGGAPN-----VAVVLVDGWPTD--------RVEEASRLARESGINIFFVTIEGPDDNEK 520 Query: 356 L 356 Sbjct: 521 Q 521 >gi|194385372|dbj|BAG65063.1| unnamed protein product [Homo sapiens] Length = 1095 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 64/215 (29%), Gaps = 25/215 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + K + + +++G + Y V Sbjct: 129 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHEWSLG 186 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 187 DFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLLVVVT 241 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 242 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDP 295 Query: 365 G--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 296 DERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 330 >gi|187954851|gb|AAI40832.1| ITGA10 protein [Homo sapiens] gi|219519694|gb|AAI44638.1| ITGA10 protein [Homo sapiens] Length = 1024 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 64/215 (29%), Gaps = 25/215 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + K + + +++G + Y V Sbjct: 20 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHEWSLG 77 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 78 DFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLLVVVT 132 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 133 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDP 186 Query: 365 G--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 187 DERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 221 >gi|119591830|gb|EAW71424.1| integrin, alpha 10, isoform CRA_a [Homo sapiens] Length = 1177 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 64/215 (29%), Gaps = 25/215 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + K + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 221 DFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 364 >gi|114558438|ref|XP_514418.2| PREDICTED: integrin alpha-10 isoform 2 [Pan troglodytes] Length = 1167 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 64/215 (29%), Gaps = 25/215 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + K + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 221 DFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 364 >gi|90577458|ref|ZP_01233269.1| putative hemagglutinin/hemolysin-related protein [Vibrio angustum S14] gi|90440544|gb|EAS65724.1| putative hemagglutinin/hemolysin-related protein [Vibrio angustum S14] Length = 1679 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 28/212 (13%), Positives = 70/212 (33%), Gaps = 38/212 (17%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 K + ++Y+ ID+ +S L +SI ++++K ++ + T + ++ + Sbjct: 1235 IKASDGTEYSDNFNSLPSLIDMAKKSYQTLTSSIIDSVEDKSKITFNMVTFSSDVKGNTS 1294 Query: 251 QCTPLSNNL------NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 ++ + + ++ L T A+ R + + Sbjct: 1295 FHYDETSKTFVNDQHQTINNYIDSLVAGGGTQFEGALSDISRHITDPSMR---------- 1344 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 + F++DG+ + ++ L G +I S+AV P + + Sbjct: 1345 -NVIYFLSDGK-DEDKFHPQGIHFL--------KGTEIVSIAVG-PSADATQINQIAQMG 1393 Query: 365 GQF-----------FAVNDSRELLESFDKITD 385 + + ++ EL F I Sbjct: 1394 TGYDHNNPNAPSYSKIITNANELDGVFHNIGQ 1425 >gi|7385003|gb|AAF61638.1|AF172723_1 integrin alpha 10 subunit [Homo sapiens] Length = 517 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 64/215 (29%), Gaps = 25/215 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + K + + +++G + Y V Sbjct: 2 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHEWSLG 59 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 60 DFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLLVVVT 114 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 115 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDP 168 Query: 365 G--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 169 DERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 203 >gi|3420888|gb|AAC31952.1| integrin subunit alpha 10 precursor [Homo sapiens] Length = 1167 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 64/215 (29%), Gaps = 25/215 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + K + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 221 DFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 364 >gi|38569398|ref|NP_003628.2| integrin alpha-10 precursor [Homo sapiens] gi|115502407|sp|O75578|ITA10_HUMAN RecName: Full=Integrin alpha-10; Flags: Precursor gi|6650628|gb|AAF21944.1|AF112345_1 integrin alpha 10 subunit [Homo sapiens] gi|119591831|gb|EAW71425.1| integrin, alpha 10, isoform CRA_b [Homo sapiens] gi|182887771|gb|AAI60008.1| Integrin, alpha 10 [synthetic construct] gi|189055316|dbj|BAG36921.1| unnamed protein product [Homo sapiens] Length = 1167 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 64/215 (29%), Gaps = 25/215 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + K + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 221 DFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 364 >gi|291398101|ref|XP_002715428.1| PREDICTED: integrin, alpha 10 [Oryctolagus cuniculus] Length = 1169 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 62/205 (30%), Gaps = 25/205 (12%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + + + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGRLFIDPEQ--IQVGLVQYGESAVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 221 DFRTKEEVVRAARNLSRREGRETKTAQAIMVACTEGFSQSRGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELPT---ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLKEIRAIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKI 387 FF V D L + D + D+I Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRI 354 >gi|145502106|ref|XP_001437032.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404179|emb|CAK69635.1| unnamed protein product [Paramecium tetraurelia] Length = 556 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 24/149 (16%), Positives = 50/149 (33%), Gaps = 14/149 (9%) Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNI-GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 VN+ A +K + +N+ N ++ +P T Y Sbjct: 148 VNAFFSAFADKTLAFEFNHIVKLVWFESFITDKCDFTNDFNNFIKLVDDASPRGGTKCYD 207 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A+ +A +L + +I +TDG+++ Q+ N + + Sbjct: 208 AIAYAIEQLKE------IKKKYPNIILRIIALTDGDDN-----QSKENPQSLVNRIFENQ 256 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 + I S V+ G L ++G+ + Sbjct: 257 IIIDSFVVNNDCVGLKTLTHA--TNGRCY 283 >gi|169856457|ref|XP_001834887.1| elongation factor-2 kinase [Coprinopsis cinerea okayama7#130] gi|116504062|gb|EAU86957.1| elongation factor-2 kinase [Coprinopsis cinerea okayama7#130] Length = 406 Score = 44.9 bits (104), Expect = 0.025, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 51/185 (27%), Gaps = 12/185 (6%) Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 + + S+ A +KI A +S Sbjct: 27 GPKDAPAPKRPLDIVFLQDATG---SQGPYIQSAVKAIHSICQKIS--QSPALGGGDSGS 81 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT----NTYPAM 281 + + + + ++N++EVK L+ L + A+ Sbjct: 82 APTESIRFGLIAFRDHPPQDRSYVTKNFGFTSNVDEVKKHLSGLIASGGGDGPEASTAAL 141 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 A + E + + + + DG +G + + L I M G+ Sbjct: 142 AEALNMEWKENAVKIVVLITDAPPHGLGEVGDGFPNG---SPDQNDPLSIARQMAEHGIS 198 Query: 342 IYSVA 346 +Y +A Sbjct: 199 LYIIA 203 >gi|194758339|ref|XP_001961419.1| GF14941 [Drosophila ananassae] gi|190615116|gb|EDV30640.1| GF14941 [Drosophila ananassae] Length = 1177 Score = 44.9 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 39/383 (10%), Positives = 98/383 (25%), Gaps = 30/383 (7%) Query: 23 MYIRNQM-------QSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQG 75 R ++ QS D + + R I++ ++ + + Sbjct: 48 YNARVELKNGTELIQSITDNVGKMLARKMDAVRCIQERAETVNENFEFNLTWAETNFTYI 107 Query: 76 SYIRENAGDIAQKAQINITKD----------KNNPLQYIAESKAQYEIPTENLFLKGLIP 125 S + + D + Y ++ + Sbjct: 108 SSKYSTFNGNSSEELQPNEADYAYMYRKMDLNQDTHFYNTPVDTEHSSVHVPSNVWDRSE 167 Query: 126 SALTNLSLRST---GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 L + + + A+S + + ++ Sbjct: 168 RVLKTIQWSEQLDEVFRQNYQSDPALSWQYFGSDTGILRHYP-AAQWTDSRANRQDADTY 226 Query: 183 PPPKKSFWSKNTTKSKYAP-----APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 K+S++ + T SK + + + V + +++++ Sbjct: 227 DCRKRSWYIETATCSKDIVILLDHSGSMTGFRHHVAKFTIRSILDTFSNNDF-FTIFRYA 285 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSH 296 G N+ + KL+ P N A A+R L +S H Sbjct: 286 ADVEDIIPCFNGALVQATPENIEVFNEAIEKLDDPEGYANLTLAYDKAFRLLRTYYDSRH 345 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 S+ + ++ +TDG + N +++++ + Sbjct: 346 CN-NSSTCNQAIMLVTDGVAGNTTEVFQKYNWGDGENGTSEMNVRVFTYLLGKEVTKVRE 404 Query: 357 LRKCT-DSSGQFFAVNDSRELLE 378 ++ + G + V E+ E Sbjct: 405 IQWMACLNRGYYSHVQTLDEVHE 427 >gi|229582324|ref|YP_002840723.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] gi|228013040|gb|ACP48801.1| von Willebrand factor type A [Sulfolobus islandicus Y.N.15.51] Length = 356 Score = 44.9 bits (104), Expect = 0.026, Method: Composition-based stats. Identities = 19/97 (19%), Positives = 41/97 (42%), Gaps = 8/97 (8%) Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 L+ + N +++ +I +TDG+ + + N + +I ++ Sbjct: 107 TRLHEAVSFTINLAKQSQVPTKIIMLTDGKPT------DKRNVKDYEKLDIPPNTQIITI 160 Query: 346 AVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFD 381 + + +L+K D SSG+F+ + D EL F+ Sbjct: 161 GIG-NNYNERILKKLADRSSGKFYHIKDISELPNIFE 196 >gi|226531948|ref|NP_001141509.1| hypothetical protein LOC100273621 [Zea mays] gi|194704870|gb|ACF86519.1| unknown [Zea mays] Length = 357 Score = 44.5 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 20/126 (15%), Positives = 36/126 (28%), Gaps = 14/126 (11%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 TN + A L ++ + F+TDG +TL Sbjct: 4 NFVAQGGTNIMHPLSEAMTLLSTSHDALPQ----------IYFVTDGSVDDERNICHTLK 53 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDK 386 T I ++ +I + + + LR G + A D+ + + K Sbjct: 54 TQLIKSGSKSP--RISTFGLGSYCN-HYFLRMLASIGKGHYAAAFDTGSIEGRMLQWFQK 110 Query: 387 IQEQSV 392 V Sbjct: 111 ASSTIV 116 >gi|149030577|gb|EDL85614.1| integrin, alpha 10 (predicted) [Rattus norvegicus] Length = 746 Score = 44.5 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 62/205 (30%), Gaps = 25/205 (12%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + + + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGRLFIDPEQ--IQVGLVQYGENPVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E ++E + ++ +T Sbjct: 221 DFRTKEEVVRAARNLSRREGRETRTAQAIMVACTEGFSESRGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKI 387 FF V D L + D + D+I Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRI 354 >gi|164565428|ref|NP_001101169.2| integrin, alpha 10 [Rattus norvegicus] Length = 1167 Score = 44.5 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 62/205 (30%), Gaps = 25/205 (12%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + + + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGRLFIDPEQ--IQVGLVQYGENPVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E ++E + ++ +T Sbjct: 221 DFRTKEEVVRAARNLSRREGRETRTAQAIMVACTEGFSESRGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKI 387 FF V D L + D + D+I Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRI 354 >gi|125719088|ref|YP_001036221.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain-containing protein [Streptococcus sanguinis SK36] gi|125499005|gb|ABN45671.1| Nitric oxide reductase NorD / Von Willebrand factor type A (vWA) domain protein, putative [Streptococcus sanguinis SK36] gi|324989618|gb|EGC21563.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK353] Length = 444 Score = 44.5 bits (103), Expect = 0.026, Method: Composition-based stats. Identities = 35/269 (13%), Positives = 86/269 (31%), Gaps = 49/269 (18%) Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA-NRKIDVLI 214 V+ + + ++ + K ++ K+ P A ++++L Sbjct: 182 VALAYRNDPIEGQVNTAVSFVFDTSGSMAYGLRNEGKRNSQGKWGPLDADNPRARMNILK 241 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + A LV+ +++ N+SV + + + + L + ++K ++ L Sbjct: 242 KKANLLVDDLKEI----GNVSVNLVRFSGSASYIQEDFVELDKDTGKIKEKIKSLPTSWI 297 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC-- 332 TN + + L + K+V+ +TDG + + + + + Sbjct: 298 TNPGDGLRYGLVSLQ----------RNPAQLKYVVLLTDGIPNAYTGSPDGIGKYDLTAN 347 Query: 333 -----EYMRN-------------------AGMK-IYSVAVSAPPEGQDLLRKCTDS---- 363 + ++ +G+K I + S +K D Sbjct: 348 FPTDNKQIKADQPVSLTTEYVGQVAKTFGSGVKRISVIGFSGNVGEIKDGQKIADQIKTV 407 Query: 364 ---SGQFFAVNDSRELLESFDKITDKIQE 389 F + L ++F I +IQ+ Sbjct: 408 GNVESTFVIATNEAALEQTFADIKKQIQQ 436 >gi|322435057|ref|YP_004217269.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] gi|321162784|gb|ADW68489.1| VWFA-related domain protein [Acidobacterium sp. MP5ACTX9] Length = 393 Score = 44.5 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 28/168 (16%), Positives = 52/168 (30%), Gaps = 33/168 (19%) Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 + + T +NN + V LN L ++T Y L Sbjct: 183 YDLRTHILTDFTNNKDTVAQSLNSLQIPGFSDTNM-FDALYETLDRTSRIEGR------- 234 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA------PPEGQ---- 354 K++I I G ++ + + + + I+S+ A GQ Sbjct: 235 -KYIILIGSGRDTFSKLTLDKMLAKIKATP----NVTIFSIGTGALAQELGDARGQIGGI 289 Query: 355 ---------DLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + L+ + G +F L + F +I D I+ Q + Sbjct: 290 GRMNILQAQNQLKTFATMTGGLYFDPMFQGALPDIFSQINDSIRNQYI 337 >gi|7495466|pir||T32949 hypothetical protein C05G6.3 - Caenorhabditis elegans Length = 341 Score = 44.5 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 14/116 (12%), Positives = 35/116 (30%), Gaps = 14/116 (12%) Query: 235 SVRIGTIAYNI--GIVGNQCTPLSNNLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNE 291 ++ + + EV ++ L T + A +L Sbjct: 149 YTQVAAVTFATVGRTRVRFNLKKYQTQEEVLRGIDNLKSRGGTTAIGAGIEKALTQLDES 208 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + K ++ TDG ++ + + ++G ++Y+VA Sbjct: 209 EGARPGIAT-----KVMVVFTDGWSNKG------PDPEKRARDAVSSGFEMYTVAY 253 >gi|297171348|gb|ADI22352.1| hypothetical protein [uncultured nuHF2 cluster bacterium HF0500_02A10] Length = 266 Score = 44.5 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 24/127 (18%), Positives = 49/127 (38%), Gaps = 19/127 (14%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +++ TN YPA+ A+ EL + VI ++DG+ Sbjct: 1 SRIIASAQTNIYPALEMAFEELSE----------IDAEVRHVILLSDGQTYPDKY----- 45 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + M + + SVAV + LL + +G+ + + D+ + + F + T Sbjct: 46 --ESLVTRMAKDDISVSSVAVGQES-DRALLADIAEWGNGRSYFILDAARVPQVFIQETQ 102 Query: 386 KIQEQSV 392 +Q++ Sbjct: 103 IASQQTL 109 >gi|221109528|ref|XP_002169888.1| PREDICTED: similar to collagen type VI alpha 6 [Hydra magnipapillata] Length = 366 Score = 44.5 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 21/196 (10%), Positives = 58/196 (29%), Gaps = 14/196 (7%) Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 + P K S + + + R+ ++ + S V Sbjct: 171 DIIQSGKPSKESCSDAIVDVGFILDSSGSLRRDYKNVKEFLKTIASFFDIKINGSQAGV- 229 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNEKESSH 296 I ++ + ++++ + ++K+ +T A+ H+ ++ + Sbjct: 230 ---ITFSHRSEHSIKLNDFSDVDSFEKAVDKIPLMGSTTRIDKALRHSKNVMFTNQNGGR 286 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 K +I +TDG + + I + +RN G+ + + Sbjct: 287 LEAT-----KLLILLTDGSQT---FSAKQEDPSIIADEIRNDGV-LIIAIGIGEGINKTE 337 Query: 357 LRKCTDSSGQFFAVND 372 L + + + Sbjct: 338 LNRIAGKDENTYNADT 353 >gi|139948509|ref|NP_001077327.1| complement factor B [Danio rerio] gi|125858059|gb|AAI29235.1| Zgc:158446 protein [Danio rerio] Length = 751 Score = 44.5 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 20/117 (17%), Positives = 48/117 (41%), Gaps = 6/117 (5%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ--I 331 TN A + Y + E ++ + VI TDG+ + + + ++ ++ + Sbjct: 336 GTNIAQAYNKIYESMTIELMTN--KEDFKATQHIVIMFTDGQANMGGSPKPLVDKIKSLV 393 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDL--LRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + +++Y + +D+ L+ + FF + +L E+FD + D+ Sbjct: 394 RQNSVEEKLELYVFGLGNDVHAEDINDLKTDRANEKFFFKLKSLDDLKETFDNMIDE 450 >gi|75907575|ref|YP_321871.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] gi|75701300|gb|ABA20976.1| von Willebrand factor, type A [Anabaena variabilis ATCC 29413] Length = 819 Score = 44.5 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 26/166 (15%), Positives = 45/166 (27%), Gaps = 19/166 (11%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPL---SNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 N I ++ PL S N + +N+L T + Sbjct: 328 NGLNPDDTFSIIDFSDTTRQLSPVPLANNSQNRTRAINYINRLTANGGTEMLRGIRAVLN 387 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + +S + ++ +TDG N L + AG ++YS Sbjct: 388 --FPVTDSGRL--------RSIVLLTDG------YIGNENQILAEVQQHLQAGNRLYSFG 431 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + L R G + E DK +I + Sbjct: 432 AGSSVNRFLLNRIAELGRGIAQIIRHDEPTDEVVDKFYRQINNPVL 477 >gi|2326547|emb|CAA04501.1| Matrilin-3 [Homo sapiens] Length = 303 Score = 44.5 bits (103), Expect = 0.027, Method: Composition-based stats. Identities = 15/87 (17%), Positives = 29/87 (33%), Gaps = 15/87 (17%) Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 S+ + K I +TDG ++ + +G+++Y+V V L+ Sbjct: 1 SSNIPKVAIIVTDGRPQD--------QVNEVAARAQASGIELYAVGV--DRADMASLKMM 50 Query: 361 TDSS--GQFFAVND---SRELLESFDK 382 F V +L F + Sbjct: 51 ASEPLEEHVFYVETYGVIEKLSSRFQE 77 >gi|218709385|ref|YP_002417006.1| putative hemolysin-type calcium-binding region [Vibrio splendidus LGP32] gi|218322404|emb|CAV18557.1| putative hemolysin-type calcium-binding region [Vibrio splendidus LGP32] Length = 1883 Score = 44.5 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 31/322 (9%), Positives = 92/322 (28%), Gaps = 23/322 (7%) Query: 45 VSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK--------D 96 ++ D T + +L + + +I +T Sbjct: 1101 DNNGLPTDSTDAYSIQGDLVITDANNNLHDTVFSETSTINITTSTGQPLTSSGSLVTWNV 1160 Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 N + + Q I + + + + + + Sbjct: 1161 SNGGHTLVGSANGQNVIEATLTSDGHYEIHLKGPVDHTNVNGEDSLVLQIPVIAKDTSGL 1220 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 + + + + +D + + P K + + +N K+ ++ + Sbjct: 1221 TSTGGQISVSIEDDQPVAKNIDISVSPETKSNTNVQLI-LDTSGSMSNSSNGKLAIMKAA 1279 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPYENT 275 +++ VR+ I +N + + ++E K +N+L T Sbjct: 1280 VSKMLDQYHDM------GDVRVQLIDFNSRSTRLEFNGRAWMTVSEAKYLVNRLTAGGGT 1333 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ--ICE 333 + A+ A + +++ + + FI+DG+ + + Sbjct: 1334 DYDDAVKKARQSWDHDEHLQLDNANN-----VSYFISDGKPQDGHDDATISDNEETKWAN 1388 Query: 334 YMRNAGMKIYSVAVSAPPEGQD 355 ++ + G+ S+ +++ D Sbjct: 1389 HLISNGITSQSIGINSSGSLND 1410 >gi|158891|gb|AAA29076.1| em100 gene is homologous the Eimeria tenella gene et100 (accession number M73495) encoding the microneme protein Etp100 [Eimeria maxima] Length = 724 Score = 44.5 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 26/159 (16%), Positives = 51/159 (32%), Gaps = 23/159 (14%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + N ++ + + VR+G + + V LS++ + Sbjct: 55 SGSIGTSNYGKVRSFISNFAGTMPLSPDD-----VRVGLVTFGTSAVTR--WDLSDSRAQ 107 Query: 262 VKSRLNKLN-----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 L +T T+ + A L++ G K ++ +TDG Sbjct: 108 NADLLAAAAKKLPYAAGSTYTHLGLAKAEEILFS-----FQKGGRDNAPKMILVMTDG-- 160 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 A TL E +RN G+ I + V + Sbjct: 161 ----ASSRRSQTLSAAEKLRNRGVIIVVLGVGTGVNSAE 195 >gi|83643820|ref|YP_432255.1| von Willebrand factor type A domain-containing protein [Hahella chejuensis KCTC 2396] gi|83631863|gb|ABC27830.1| protein containing VWFA domain [Hahella chejuensis KCTC 2396] Length = 407 Score = 44.5 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 60/190 (31%), Gaps = 8/190 (4%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 I E+ ++ +++ A + + + + T LS +L+ + + L Sbjct: 73 IQAAKENIWSIASTMASAQPAP-EIKMGLVAFRDRGDSYITRVTDLSPDLDSMYATLMDY 131 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + L++ + + + + D YQN Sbjct: 132 QAEGG---GDGPESVNQALFDAVHKISWSQDKDSY-RVIFLVGDAPPHMD--YQNEQQYP 185 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ 388 Q + + G+ + ++ P Q R+ + G FF V S + + +++ Sbjct: 186 QTLKDALSKGIVVNTIQAGDDPFTQTEWRRIAQLNQGSFFQVEQSGQAVAVATPFDERLA 245 Query: 389 EQSVRIAPNR 398 S + R Sbjct: 246 SLSKEMDKTR 255 >gi|317126805|ref|YP_004093087.1| hypothetical protein Bcell_0064 [Bacillus cellulosilyticus DSM 2522] gi|315471753|gb|ADU28356.1| hypothetical protein Bcell_0064 [Bacillus cellulosilyticus DSM 2522] Length = 245 Score = 44.5 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 15/103 (14%), Positives = 40/103 (38%), Gaps = 10/103 (9%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV----SAPPEGQD 355 S + ++ +TDG ++ + + I + GM + + + G D Sbjct: 1 MSKGTLRQILLLTDGCSNSG------DDPVAIAALAKEEGMTVNVIGIVGEGEMSERGID 54 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + ++ G + +++L ++ +T K Q++ N+ Sbjct: 55 EIESIAEAGGGISQIVYAKQLSQTVQMVTRKAMTQTIHGVVNK 97 >gi|123482632|ref|XP_001323846.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121906718|gb|EAY11623.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 720 Score = 44.5 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 20/192 (10%), Positives = 55/192 (28%), Gaps = 29/192 (15%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN--EVKSR 265 +I+ L++S+ + R I + +N N S Sbjct: 256 SRIENAKFCLNILIHSLP--------IGCRFSIIQFGNSYKEVVSICDYSNKNVKYAMSA 307 Query: 266 LNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + ++N T+ + + ++ + + +TDGE + + Sbjct: 308 IARINADMGGTDILSPLEYVFK-----------KKLGKGFIRKIFLLTDGEVHNSDMICS 356 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 + +I+++ + + + + S G + + D + +I Sbjct: 357 RVQKE-------RENNRIFAIGLGSGADPGLIKNISAKSGGNYVLIADDDNMNNMIVEIM 409 Query: 385 DKIQEQSVRIAP 396 S+ Sbjct: 410 KSALSPSLSNIS 421 >gi|296224100|ref|XP_002757907.1| PREDICTED: vitrin [Callithrix jacchus] Length = 656 Score = 44.5 bits (103), Expect = 0.028, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 39/120 (32%), Gaps = 14/120 (11%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 310 MGVVQYGDNPATHFNLKTHMNSQDLKTAIEKIPQRGGLSNVGRAISFVTKNFFSKANG-- 367 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 368 ---NRSGAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 416 >gi|94968394|ref|YP_590442.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94550444|gb|ABF40368.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 491 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 24/166 (14%), Positives = 58/166 (34%), Gaps = 18/166 (10%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + ++L G L+ N ++ + ++ T A+ Sbjct: 336 LDALTDMIRHLRNDDEVFIMAYGKSLQFEQDLTGNPKLLEEAMEQIKAESGTALLDAVGF 395 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A + H +T + ++ I+DG N+ + N L + + + +++ Sbjct: 396 A---------AGHLERIATNKNRLLLVISDGRNT-----PSKDNPLTLSQKL--NTVRVD 439 Query: 344 SVAVSAP-PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 + + G+ L S GQ +D+R+L + ++ + I Sbjct: 440 CIGLDVDGDSGRRQLESLAAYSGGQVSFASDTRQLRTAAVQMAEAI 485 >gi|290971865|ref|XP_002668693.1| predicted protein [Naegleria gruberi] gi|284082192|gb|EFC35949.1| predicted protein [Naegleria gruberi] Length = 454 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 26/232 (11%), Positives = 68/232 (29%), Gaps = 34/232 (14%) Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSK-----YAPAPAPANRK---IDVLIESAGNLV 221 DNN + + +Y + + + + + + ID + NL Sbjct: 8 DNNTIETRQYEIDDSITCLQFDLISNIQRKEKQIVIALDVSGSMRGQGIDQAKIAISNLF 67 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV-KSRLNKLNPYENTNTYPA 280 + + + L E +S L ++ T+ Sbjct: 68 EQVVDTPD-----------VVLITYDTSAELYDLRKKPAETRQSTLEQIQAGGGTDFTCV 116 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + + + + ++F TDG++ + + + ++ + Sbjct: 117 FEAISNL----------DMFNRQSEVAILFFTDGQDGSSHKREKAIEQMKKVLETKTQSF 166 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS---GQFFAVNDSRELLESFDKITDKIQE 389 + +++ LL + T G F V D+ E+ +S + + + Sbjct: 167 EFHTIGF-TSSHDVALLTQITQLGSVQGTFQYVKDANEINQSMENLIGLLTS 217 >gi|224066048|ref|XP_002192868.1| PREDICTED: similar to calcium channel, voltage-dependent, alpha2/delta subunit 3 [Taeniopygia guttata] Length = 1090 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 58/177 (32%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 270 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRANKEHFREHLDK 328 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + +I S + ++ ITDG N Sbjct: 329 LFAKGIGMLDIALNEAFNMLNEFNHTGQGSICS----QAIMLITDGAVDTYDTIFAKYNW 384 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 385 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 432 >gi|329893975|ref|ZP_08269994.1| BatA [gamma proteobacterium IMCC3088] gi|328923374|gb|EGG30692.1| BatA [gamma proteobacterium IMCC3088] Length = 323 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 25/135 (18%), Positives = 51/135 (37%), Gaps = 25/135 (18%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G + L+ + V+ +++L ++T + +A L + Sbjct: 144 GEQSFVMSDLTAYGDTVRYMVSQLETGFAGDSTRLGDGVGYAVSLLADVDSERA------ 197 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDL 356 V+ ITDG ++G + L ++ + +K+Y AV A + L Sbjct: 198 ----IVVLITDGNDTG-----SDLPPVEAARLAKALDVKLYVAAVGADVSTDREPIDEAL 248 Query: 357 LRKCTD-SSGQFFAV 370 LR+ + + G FF + Sbjct: 249 LRRLAERTGGAFFRI 263 >gi|294054316|ref|YP_003547974.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] gi|293613649|gb|ADE53804.1| von Willebrand factor type A [Coraliomargarita akajimensis DSM 45221] Length = 678 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 25/193 (12%), Positives = 57/193 (29%), Gaps = 47/193 (24%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN--LNEVKSR 265 +++ + +LV + S RIG +A+ L + + Sbjct: 110 TRLERAKLAILDLVEQL---------ESDRIGLVAFAGSAFLQTPPTLDYGAFRESLDAT 160 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + ++ A+ A + K V+ +TDGE+ G Sbjct: 161 APDMMSRGGSDLGVALREATKAF-----------PVENNYKAVVLLTDGEDLGGH----- 204 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK--- 382 + + G+K++++ + P + + + E Sbjct: 205 --AIDEAKKASKEGVKVFAIGLGTPEGD-------------YLRQTNDAGIEEYMRDADG 249 Query: 383 --ITDKIQEQSVR 393 I K+ E ++R Sbjct: 250 QPIRTKLDEATLR 262 >gi|242035517|ref|XP_002465153.1| hypothetical protein SORBIDRAFT_01g032840 [Sorghum bicolor] gi|241919007|gb|EER92151.1| hypothetical protein SORBIDRAFT_01g032840 [Sorghum bicolor] Length = 756 Score = 44.5 bits (103), Expect = 0.029, Method: Composition-based stats. Identities = 32/302 (10%), Positives = 77/302 (25%), Gaps = 31/302 (10%) Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL---DV 156 +Q S A E+ + + + + + + D+ Sbjct: 237 KIQLTVNSGASKEVILQGTSHPLKEKNRQGEKLSFLHEAVVENWSTKDFTFAYTVYSGDL 296 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF-WSKNTTKSKYAPAPAPANRKIDVLIE 215 S + + ++ +LLP + K ++ + Sbjct: 297 SGGVLVQPSTLRDYDDRDMFCLFLLPGNNANRKVFRKAVVYIIDTSGSMQGKP-LESVKN 355 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN----KLNP 271 + ++ + + I +N + N + + + Sbjct: 356 AMSTTLSDLMQGDY--------FNIITFNDELHSFSSRLEQVNERTIGNAIEWMNLNFVA 407 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T+ + A L + + + F+TDG +TL T I Sbjct: 408 QGGTDMMHPLSEAMALLSSSHDVLPQ----------IYFVTDGSVDDERNICHTLKTQLI 457 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 ++ +I + + + LR G + A D+ + + K Sbjct: 458 KSGSKSP--RISTFGLGSYCN-HYFLRMLASIGKGHYAAALDTGSIEGRMLQWFQKASST 514 Query: 391 SV 392 V Sbjct: 515 IV 516 >gi|308176071|ref|YP_003915477.1| hypothetical protein AARI_02710 [Arthrobacter arilaitensis Re117] gi|307743534|emb|CBT74506.1| hypothetical membrane protein [Arthrobacter arilaitensis Re117] Length = 1066 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 28/300 (9%), Positives = 71/300 (23%), Gaps = 27/300 (9%) Query: 80 ENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 + ++ N + +L + + N + Sbjct: 213 QIEAGDNGICYFSVPNTNNGGANRTKFYIRELAPAAGSLAAQNYNVLSKFNTGANGSTES 272 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDN---NNMTSNKYLLPPPPKKSFWSKNTTK 196 ++ + + + + N N L + Sbjct: 273 TYQYRTKTLTSNSTISMPADHDGALRTADASSGTWANSIKNPTLPVTCELGIKVALVFDL 332 Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPL 255 S A+ + L + N S R GT + + T Sbjct: 333 SGSVKNAGAADTLGNAGKGFVDALSGTNSSVALFSFGNTSPRAGTPNFPELRNIDSGTNS 392 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N+++ +N + T+ T + + V+ +TDG Sbjct: 393 TRIKNDIQDYINSFSGEGYTSNGTNWDAGL----------WATAQNAKQYDLVVVLTDGN 442 Query: 316 NS-----------GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 + ++ ++ + ++ G +I +V + + L S Sbjct: 443 PTFSGIESTVGPGTSTYFRELEAAVFSANAVKAQGARIINVGIGEGLNADNNL--AATSG 500 >gi|269929102|ref|YP_003321423.1| hypothetical protein Sthe_3201 [Sphaerobacter thermophilus DSM 20745] gi|269788459|gb|ACZ40601.1| Protein of unknown function DUF2134, membrane [Sphaerobacter thermophilus DSM 20745] Length = 341 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 20/195 (10%), Positives = 48/195 (24%), Gaps = 11/195 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + AIDL + + +Q+A DA L+G + + + Sbjct: 22 LFAVALVGMIGMTGLAIDLGYTFSQKRAIQNAADAGALAGAQYLTKSTPDTRYPVRNEVA 81 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + Q + + + + F+ Sbjct: 82 QVVAQNGFGGATPQLVSCQYVDASDNLLGDCSQEPPDEATG-----VEVRVRETHRTFFI 136 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVL---DVSRSMEDLY---LQKHNDNNNM 174 + + + T + N V+ D + + N+ + Sbjct: 137 QIVPGAPDTASTEAMARARVFKVTNAPGDAPFVVCGYDTELAGGGTFSILKPGTNEIDPA 196 Query: 175 TSNKYLLPPPPKKSF 189 K P+ Sbjct: 197 AIGKTFRIHAPQGIS 211 >gi|254784280|ref|YP_003071708.1| von Willebrand factor A [Teredinibacter turnerae T7901] gi|237684173|gb|ACR11437.1| von Willebrand factor type A domain protein [Teredinibacter turnerae T7901] Length = 593 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 31/181 (17%), Positives = 67/181 (37%), Gaps = 17/181 (9%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK---- 268 L +A + S + + + L + Y+ T ++++N +++ ++ Sbjct: 255 LKTAAKASLISYENDTKTSRLLPGQQQVAIYSFDSEITLLTDYTSDINLLEAAIDTIPNS 314 Query: 269 -LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L +TN AM A ++ + G + + +TDGE++ ++ + Sbjct: 315 VLERGNSTNLLGAMEIAAERWNDQIDLIAVERG------YAVLLTDGEHNFD--SRSPAD 366 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV---NDSRELLESFDKIT 384 K+Y++AV ++ L T SS Q V + EL F ++ Sbjct: 367 IEADLTNFFGTRKKVYAIAVGNNVNLEN-LEAITASSEQVLTVNSFESAEELEAVFTEVA 425 Query: 385 D 385 Sbjct: 426 T 426 >gi|166366808|ref|YP_001659081.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] gi|166089181|dbj|BAG03889.1| von Willebrand factor type A [Microcystis aeruginosa NIES-843] Length = 456 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 34/293 (11%), Positives = 77/293 (26%), Gaps = 26/293 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 S + + I +T E +I + + + Sbjct: 16 SSSTIASLTHKLSLSMIMLTALITPAWGIKPEITDIEDIDDKVTLQIQVTGEESKPIMGL 75 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + + + + + + + +L S++ Sbjct: 76 AESDFQLKVLDKKNNKTY----QGKQLPFDWKSPRETTPPDAWIVVLIDFSGSMNCSQDL 131 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI-----GIVG 249 A A A RK+D I + G + + V G N Sbjct: 132 NTKCDAKAVAKGKRKLDAAINALGTFIKLASERKGNTYLSIVPFGVEGKNDKPGACDYYP 191 Query: 250 NQCTPLSNNLN-----EVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 + +N N ++ + L L P TN Y A+ + N+KE Sbjct: 192 KVTSETLDNFNLVQDVKLTNFLGSLADKTPCATTNFYQALKETVKFFKNDKEGRFYPKDK 251 Query: 302 TR------LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + +I ++DG ++ ++ + N + ++++ Sbjct: 252 EGKPLKPQPRLSIILLSDGFDNNSNY---QEVQKTLANLQNNKDIVVHTLGYG 301 >gi|47218289|emb|CAG04121.1| unnamed protein product [Tetraodon nigroviridis] Length = 979 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 21/186 (11%), Positives = 52/186 (27%), Gaps = 36/186 (19%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-------------- 273 + + + S + + K+ + + P Sbjct: 342 DLTVDDSFSIVDFNHNVRCWSEDLVPGSSVQVKDAKTYIENIKPNGGETQTQEAPPPPSP 401 Query: 274 ----------NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 TN A+ A + L +SH + R +I ++DGE + Sbjct: 402 HADCGVSVASGTNINEALMRAVQMLV---RASHQGLVDPRSVSMIILVSDGEP--HREIK 456 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ-----FFAVNDSRELLE 378 + + MR ++S+ + D L + + + + + +L Sbjct: 457 LSAIQKNVKRAMREE-FSLFSLGIGFDV-DFDFLERIATENRGVAQRIYANQDTADQLRS 514 Query: 379 SFDKIT 384 + ++ Sbjct: 515 FYRQVA 520 >gi|326672739|ref|XP_002664110.2| PREDICTED: vitrin [Danio rerio] Length = 868 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 15/121 (12%), Positives = 43/121 (35%), Gaps = 14/121 (11%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESS 295 +G I Y V +N ++K ++K+ ++ A+ + ++ +++ + Sbjct: 521 MMGIIQYGDDPVTEFSLKQFSNSKDLKPAISKIVQKGGPSHVGKALSYINKQFFSDANGN 580 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 + + + DG + + R +G+ I+ V + P + + Sbjct: 581 RGGAPN-----VAVVLVDGWPTD--------RVEEASRLARESGINIFFVTIEGPDDNEK 627 Query: 356 L 356 Sbjct: 628 Q 628 >gi|153876590|ref|ZP_02003832.1| von Willebrand factor, type A [Beggiatoa sp. PS] gi|152066946|gb|EDN66168.1| von Willebrand factor, type A [Beggiatoa sp. PS] Length = 305 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 21/121 (17%), Positives = 45/121 (37%), Gaps = 10/121 (8%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 +K + N P T Y A+ +AY L +I ++DG +S + Sbjct: 189 LKRQFNYQFPGGGTALYDAIFNAYTFLQKNSFPDKIA--------VMIVLSDGGDSHSEL 240 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 L + + ++I++V + + + L + G+F+ ++ + F Sbjct: 241 NFKDLLSKIPFNS-DTSPIRIFAVGYGSITDKKRLNEIAKMTQGKFYDGA-MVDVDKIFK 298 Query: 382 K 382 K Sbjct: 299 K 299 >gi|109732283|gb|AAI15771.1| Integrin, alpha 10 [Mus musculus] Length = 1166 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 62/205 (30%), Gaps = 25/205 (12%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + + + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGRLFIDPEQ--IQVGLVQYGENPVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 221 DFRTKEEVVRAARNLSRREGRETRTAQAIMVACTEGFSQSRGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKI 387 FF V D L + D + D+I Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRI 354 >gi|17533687|ref|NP_496745.1| C-type LECtin family member (clec-65) [Caenorhabditis elegans] gi|3876683|emb|CAB03057.1| C. elegans protein F35C5.8, confirmed by transcript evidence [Caenorhabditis elegans] Length = 372 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 57/164 (34%), Gaps = 11/164 (6%) Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA-M 281 I + ++ V T +N V + S ++ S++ +L +++ + M Sbjct: 62 QIGTGYDDPRSTRVGFITYNWNATDVADFYKLQSW--ADLNSQIQRLQYTPQSSSPASRM 119 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 +++ + KK VI T + + + +++ G+ Sbjct: 120 DTGLNAAIGMIDATAGFRDN--YKKIVIVFT-----SVHGSYKSNQPRDVSKILKSRGIP 172 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + +V + + Q L++ FA+ D E +TD Sbjct: 173 VVTVNTGSSSDTQAYLKQIAS-DNMSFAIADGNVTQEILKAMTD 215 >gi|124486692|ref|NP_001074522.1| integrin alpha-10 [Mus musculus] Length = 1167 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 62/205 (30%), Gaps = 25/205 (12%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + + + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGRLFIDPEQ--IQVGLVQYGENPVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 221 DFRTKEEVVRAARNLSRREGRETRTAQAIMVACTEGFSQSRGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKI 387 FF V D L + D + D+I Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRI 354 >gi|329898072|ref|ZP_08272282.1| hypothetical protein IMCC3088_280 [gamma proteobacterium IMCC3088] gi|328920971|gb|EGG28395.1| hypothetical protein IMCC3088_280 [gamma proteobacterium IMCC3088] Length = 490 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 19/112 (16%), Positives = 39/112 (34%), Gaps = 16/112 (14%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI--- 331 TN A+ A +L + V+ +TDG+ +S+ N ++ Sbjct: 105 TNIPAALDAALYDLN---------RLDPDYRTSVVLLTDGKVDVSSSAVANANAARMLLE 155 Query: 332 --CEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESF 380 + G+ ++++A+ + LRK + G + L F Sbjct: 156 EQAPILGQTGVPVHTIAL-SNEADWAFLRKLAESTQGLAEKAESADSLTRVF 206 >gi|311254427|ref|XP_003125838.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-10-like [Sus scrofa] Length = 1177 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 62/205 (30%), Gaps = 25/205 (12%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + + + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGRLFIDPEQ--IQVGLVQYGESSVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 221 DFRTKEEVVRAARNLSRREGRETKTAQAIMMACTEGFSQSHGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELP---EALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLQEIRAIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKI 387 FF V D L + D + D+I Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRI 354 >gi|257791506|ref|YP_003182112.1| hypothetical protein Elen_1758 [Eggerthella lenta DSM 2243] gi|257475403|gb|ACV55723.1| hypothetical protein Elen_1758 [Eggerthella lenta DSM 2243] Length = 213 Score = 44.5 bits (103), Expect = 0.030, Method: Composition-based stats. Identities = 8/141 (5%), Positives = 38/141 (26%), Gaps = 7/141 (4%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 ++ + F+ +A+D + + + +++ L+ ++ + Sbjct: 28 FLMLFFLVLGFLAFAVDASQFLTKKTLLENTLN-----LAREERMAPSVTLVAKNSQEPD 82 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + + L+ +Y + + + + K + + + Sbjct: 83 AVIARSVAGTLRDANYQGDIDVYFYEVPEAQLPPSKRDTERVY--AYQVVLTEQVKTVFA 140 Query: 122 GLIPSALTNLSLRSTGIIERS 142 + L + Sbjct: 141 SIFGVYDIPLKSTVVAVSNPY 161 >gi|146185719|ref|XP_001032387.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|146143099|gb|EAR84724.2| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 796 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 28/195 (14%), Positives = 60/195 (30%), Gaps = 39/195 (20%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 I+ ++ + S+ + T+ Y + N NL + + Sbjct: 293 VGSNIETAKQALIFFLKSLPEGSIYNIISFGTNYTVMYPQSVQVND-----QNLQDSIDK 347 Query: 266 LNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + K TN A+ + L ++ L+K + ITDGE Sbjct: 348 IEKFQANMGGTNISQALKYLMYNLQDQ----------YGLRKKIYIITDGEFQDYQP--- 394 Query: 325 TLNTLQICEYMRNAGMK--IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 E ++ K I ++ + + + ++ G F V D+ Sbjct: 395 ------ALEIVKKNKFKCDINALCIGSYEFLYAT-QILNETGGNFQKVTDTS-------- 439 Query: 383 ITDKIQEQSVRIAPN 397 +I Q +++ + Sbjct: 440 ---QIISQVIQLLKD 451 >gi|156744081|ref|YP_001434210.1| hypothetical protein Rcas_4161 [Roseiflexus castenholzii DSM 13941] gi|156235409|gb|ABU60192.1| conserved hypothetical protein [Roseiflexus castenholzii DSM 13941] Length = 392 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 19/125 (15%), Positives = 36/125 (28%), Gaps = 6/125 (4%) Query: 17 IDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGS 76 ID A R + ++ D A L +++ DR + T + ++ G Sbjct: 33 IDGALAFAWRRNVMNSADGAALIATRALIVDRGSVTGLELTNAVRTYLQTEL------GV 86 Query: 77 YIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRST 136 + A I + P A T + +L G++ + S Sbjct: 87 SDPDFELTYVNGAGQPIGPVGSGPAPANARGVTVVARHTFDTYLMGILGQPTLTVRGVSA 146 Query: 137 GIIER 141 Sbjct: 147 ARFGN 151 >gi|288576300|ref|ZP_05978557.2| pilus-associated protein [Neisseria mucosa ATCC 25996] gi|288565840|gb|EFC87400.1| pilus-associated protein [Neisseria mucosa ATCC 25996] Length = 1081 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 32/314 (10%), Positives = 72/314 (22%), Gaps = 17/314 (5%) Query: 82 AGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI-PSALTNLSLRSTGIIE 140 D +Q N + P + S E Sbjct: 48 FIDDSQSMNRNAVTGEYTPGPTRMQVTKNALNGILENHKDKFNWGLQTLYNGGSSDTTPE 107 Query: 141 RSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA 200 + + S ++D M+ L + ++P + S S Sbjct: 108 ETFDPEKASWQKMIDKVNKMKPTGLTPATSRYYEVVTQTVMPNIKYRCQKSYVVMMSDGD 167 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + AY L+ Sbjct: 168 ANFGCNFHSKGFSYNFDYDNPKGYYSIYPSPYRPFLPKNEAAYKYFGPSALAGLSQTTLS 227 Query: 261 EVKSR---LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT-DGEN 316 + S+ ++ + +++ + Y + K ++ DG+ Sbjct: 228 GLCSKAGGVDNMPYWDSADGYNGKVGGIEFFSRTLSWKDIKTAADGNGKDAAGVSWDGDP 287 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP--PEGQDLLRKCTDSSGQFFAVNDSR 374 S + L ++ ++V G+ L + +F + Sbjct: 288 SKDPKGVDYSKQL----------VQTFTVGFGQGISDAGKAFLTRGASQDDWYFNADKPE 337 Query: 375 ELLESFDKITDKIQ 388 +L +F+KI I Sbjct: 338 DLENAFNKIISLIS 351 >gi|311693026|gb|ADP95899.1| von Willebrand factor, type A-like protein [marine bacterium HP15] Length = 590 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 22/121 (18%), Positives = 41/121 (33%), Gaps = 25/121 (20%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + T AM +A REL + + +I ITDG + A L+ + Sbjct: 493 SSGGGTPLAEAMLYAARELSASHKPR----------QVLIVITDGSPNNGHAVNYLLDLM 542 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + Y++ + + G + +ND REL + +I + + Sbjct: 543 -------KHQIDTYAIGIGSNAVKSYF--------GNWTVINDVRELQSALFRIAGNVLD 587 Query: 390 Q 390 Sbjct: 588 L 588 >gi|118104463|ref|XP_424917.2| PREDICTED: similar to polydom protein [Gallus gallus] Length = 3520 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 13/90 (14%), Positives = 30/90 (33%), Gaps = 15/90 (16%) Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 K + ITDG ++G + I +R+ G++I++ L Sbjct: 132 NSTKVIFLITDGYSNGG-------DPRPIAASLRDFGVEIFT--FGIWQGNIRELNDMAS 182 Query: 363 SS--GQFFAVNDSRELLESFDKITDKIQEQ 390 + ++ E F+ + + + Sbjct: 183 HPKEDHCYLLHSFTE----FEALARRALHE 208 >gi|118365082|ref|XP_001015762.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89297529|gb|EAR95517.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 755 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 24/143 (16%), Positives = 51/143 (35%), Gaps = 19/143 (13%) Query: 246 GIVGNQCTPLSN----NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 PL+ N ++KS + ++N T P + L + KE T Sbjct: 91 SSTAKTLCPLTQVNDENKQQIKSAIKQINGQGGTFVIPGFKEVTKILNSRKEQREQT--- 147 Query: 302 TRLKKFVIFITDG---ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 F++ +TDG + QN E + IY+ ++L+ Sbjct: 148 -----FILLLTDGEFGDIDSGKVIQNINRLFTQSEIQKTP--YIYTYGYG-DDVNPEILQ 199 Query: 359 KCTDS-SGQFFAVNDSRELLESF 380 + G++ +++ +++ + F Sbjct: 200 EIAQKFQGKYCLISNVQQVTDWF 222 >gi|331239018|ref|XP_003332163.1| hypothetical protein PGTG_13530 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] gi|309311153|gb|EFP87744.1| hypothetical protein PGTG_13530 [Puccinia graminis f. sp. tritici CRL 75-36-700-3] Length = 501 Score = 44.5 bits (103), Expect = 0.031, Method: Composition-based stats. Identities = 34/219 (15%), Positives = 58/219 (26%), Gaps = 34/219 (15%) Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 D + +TT S + I ++ + Sbjct: 6 DAMQSDPVPTLEDRLANMDFSEKMLDLCFILDTTGSMGSY--------ITAATQNIELIC 57 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC-----TPLSNNLNEVKSRLNKLNPYENTN 276 + I + + +RIG IAY + ++N VK L L + Sbjct: 58 DEIINSERLASPECLRIGLIAYRDHPPQDMSYVTLKFAFTSNPKAVKENLKTLWASGGGD 117 Query: 277 TYPAMHHAY-RELYNEKESSHNTIGSTRLKKFVIFITDGENSG--------ASAYQNTLN 327 A+ A L + K + ITD G + + + Sbjct: 118 GPEAVTAAMHEALTLDWRPQA--------SKMAVLITDAPPHGIGEYGDGFSRGDPSGHD 169 Query: 328 TLQICEYMRNAGMKIYSV----AVSAPPEGQDLLRKCTD 362 LQ+ M G+ ++ V A S D R Sbjct: 170 PLQLARKMAQTGISLFVVACEPAFSGYSYSNDFFRAIAS 208 >gi|188580652|ref|YP_001924097.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium populi BJ001] gi|179344150|gb|ACB79562.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium populi BJ001] Length = 723 Score = 44.5 bits (103), Expect = 0.032, Method: Composition-based stats. Identities = 21/163 (12%), Positives = 52/163 (31%), Gaps = 16/163 (9%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + I + + ++L K ++ L T + A + Sbjct: 369 PGDRFNVIRFDHSFDTLFPDVVPADESHLARAKRFVSGLEASGGTEMLAPLRAALADATP 428 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 E + + ++F+TDG + + + E R+ ++ V + + Sbjct: 429 EDTAR---------LRQIVFLTDGAIGNEAQIFSAI----AAERGRSR---LFMVGIGSA 472 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 P G + G F ++ ++ E + K++ +V Sbjct: 473 PNGYLMSHAAELGRGSFTQIDTPDQVSERMRALLTKLESPAVT 515 >gi|321265474|ref|XP_003197453.1| hypothetical protein CGB_N0210C [Cryptococcus gattii WM276] gi|317463933|gb|ADV25666.1| Hypothetical Protein CGB_N0210C [Cryptococcus gattii WM276] Length = 445 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 18/151 (11%), Positives = 47/151 (31%), Gaps = 11/151 (7%) Query: 202 APAPANRKIDVLIESAGNLVNSIQ-----KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + I+ + + + + I+ + + V + P + Sbjct: 64 CTGSMQKYINSVRDHIIGICDMIRGEEGLNGPDDLRVAVVNYRDHPPQDSTYVYKFHPFT 123 Query: 257 NNLNEVKSRLNKLNPYENTNT----YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 +++ +V++ L L + AM EL +E++ + I Sbjct: 124 SDIPDVQNYLKGLTASGGGDGPEAVTAAMAATLTELEWRREAARMAVLVADAPPHGI--G 181 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 +G + + + L I M G+ ++ Sbjct: 182 EGGDQFKQGDPDGHDPLVIARMMAQNGITMF 212 >gi|198424466|ref|XP_002124191.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 1306 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 34/329 (10%), Positives = 81/329 (24%), Gaps = 23/329 (6%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 ++ + + TK N + Sbjct: 76 NVGSTGDGAVIKCSANVQPFSSTTKAGNTKWMTDHGCSVVLNTNAVPTGDNQNKSADAIG 135 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 +++ G + + + + S Y M + +P Sbjct: 136 MTMIPLGGNKTAVCSPGRRKICGRSLRLSPGACYTGTLGAAMTMKTGWGEIPCFKNYLDL 195 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S K ++ S + + Sbjct: 196 VYVVDSSNSISDANFTIMKQIIVNASEAFEASIGDTTQVAVLQYGNLDSAAFDHTDSKYY 255 Query: 251 QCTPLSNNLNEVKSRLNKLNPY------ENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 + + N++ + NT T A+ A + + Sbjct: 256 KSPTKLGDCNDIDCFNRAIKANMTHLNAANTFTSLAIRRAVEF------DFAQSKNKDKA 309 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 KK ++ ITDG+ + S + Q + + +Y++ V+ + + LR + Sbjct: 310 KKILVLITDGQANFQSQLIVSYRLTQ------SHNITVYAIGVALKSDAE--LRISANGG 361 Query: 365 ---GQFFAVNDSRELLESFDKITDKIQEQ 390 + N+ EL ++ +T+ I + Sbjct: 362 VSKERVLDANNYSELSKALRNLTETIAQS 390 >gi|180915|gb|AAA96439.1| type VII collagen [Homo sapiens] Length = 625 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 55/169 (32%), Gaps = 23/169 (13%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKE 293 V++G ++Y+ S++L + R+ + N A+ A+R + Sbjct: 277 VQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGNNLGTAVVTAHRYMLAPDA 336 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + ++ + D G + +G+ + V + Sbjct: 337 PGRR----QHVPGVMVLLVDEPLRGDIFSP--------IREAQASGLNV--VMLGMAGAD 382 Query: 354 QDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ G FFAV+D L ++ + + + S P Sbjct: 383 PEQLRRLA--PGMDSVQTFFAVDDGPSLDQAVSGLATALCQASFTTQPR 429 >gi|262309|gb|AAB24637.1| collagen VII [Homo sapiens] gi|170145164|gb|AAA36357.2| type VII collagen [Homo sapiens] Length = 887 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 55/169 (32%), Gaps = 23/169 (13%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKE 293 V++G ++Y+ S++L + R+ + N A+ A+R + Sbjct: 723 VQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGNNLGTAVVTAHRYMLAPDA 782 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + ++ + D G + +G+ + V + Sbjct: 783 PGRR----QHVPGVMVLLVDEPLRGDIFSP--------IREAQASGLNV--VMLGMAGAD 828 Query: 354 QDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ G FFAV+D L ++ + + + S P Sbjct: 829 PEQLRRLA--PGMDSVQTFFAVDDGPSLDQAVSGLATALCQASFTTQPR 875 >gi|294781746|ref|ZP_06747079.1| phage/colicin/tellurite resistance cluster TerY protein [Fusobacterium sp. 1_1_41FAA] gi|294481856|gb|EFG29624.1| phage/colicin/tellurite resistance cluster TerY protein [Fusobacterium sp. 1_1_41FAA] Length = 229 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 18/124 (14%), Positives = 46/124 (37%), Gaps = 12/124 (9%) Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-ASAYQN 324 LN T A+ A + ++ T S + V+ ++DG + + Sbjct: 85 LNPFLASGMTPLGTALRMAKDMIEDK-----ETTPSNIYRPAVVLVSDGVPTDEWRGPLD 139 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 R++ + ++VA+ +L+ + + FF + ++++ F +I+ Sbjct: 140 NFKNNG-----RSSKCQRFAVAIG-NDADNQMLKSFAECNENFFIAENVSDIVDKFKQIS 193 Query: 385 DKIQ 388 + Sbjct: 194 MSVS 197 >gi|257883753|ref|ZP_05663406.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,501] gi|257819591|gb|EEV46739.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,501] Length = 1475 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 33/372 (8%), Positives = 93/372 (25%), Gaps = 50/372 (13%) Query: 48 RTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES 107 + + +T++ + + I ++++ + Sbjct: 320 PSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNA 379 Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 I N K L+ ++ + +S + + D+ D+ Sbjct: 380 AL---IEAVNEMSKDLLSDPSLDIRI-GMVNFYHNSTAINNHEQISSDIFPLTNDINRLT 435 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 ++N + P NR + ++ G+ + A Sbjct: 436 GSENTALNRTPIGGTP----LTLGLKNGYETLYKDNGGENRNPEKILIVVGDGTPTFSYA 491 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 S R T + +N ++ NT+ H Sbjct: 492 ---PIQSSYRTSTNGAWSNWTVMEDKIAEDNGVLFRNF---EEFSGNTS-NAGFTHPVTY 544 Query: 288 LYNEKESSHNTIGSTRLKKFVI-------FITDGENSGASAYQNTLNTLQIC-------- 332 + R + ++ DG ++ + T Sbjct: 545 ASDFNRPEDEVNVHYRYGEVKEGDNKATHWVGDGSSNNNTNGSPTSQEKSSAINTVAYHH 604 Query: 333 ---EYMRNAGMKIYSVAVSAPPE----------GQDLLRKCTD------SSGQFFAVNDS 373 + I+S+ + G+++L+ D + +++ N+ Sbjct: 605 WLKNKYQENPPSIFSIGLGIDGNVSGRQRLDAIGRNVLKNIADLEEDGVTP-RYYNANNK 663 Query: 374 RELLESFDKITD 385 +++ + + I+ Sbjct: 664 NDIVTALEDISS 675 >gi|219127465|ref|XP_002183955.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217404678|gb|EEC44624.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 582 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 24/180 (13%), Positives = 60/180 (33%), Gaps = 12/180 (6%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 KN ++ +L ++A ++ ++ + ++ + Sbjct: 199 KNVVLVLDTSGSMTDGNRLSLLKQAAKQVIETL--TVGDRVAIVEFSSQAKLFAQDNKFL 256 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 T N + + ++ TN A A+ L + + ++ +T ++F+ Sbjct: 257 FTATQKNKELLATHIDSFTAAGATNFLDAFTAAFAVLNDSIDQEYHVGCTT----AILFL 312 Query: 312 TDGE----NSGASAYQNTLNTLQICE-YMR-NAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 TDGE + A L I R + +++ ++S ++ S+G Sbjct: 313 TDGEMTQPENVQEADVLDLVNTGISNLEARLGRSVFLFTFSISDNNNVHAFPKQIACSTG 372 >gi|297265792|ref|XP_002799252.1| PREDICTED: vitrin-like [Macaca mulatta] Length = 657 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 19/163 (11%), Positives = 53/163 (32%), Gaps = 23/163 (14%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 311 MGVVQYGDNPATHFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 368 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + V E + Sbjct: 369 ---NRSSAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITVEGAAENEKQ 417 Query: 357 L--------RKCTDSSGQF-FAVNDSRELLESFDKITDKIQEQ 390 + ++G + F V L ++ + ++ + Sbjct: 418 YVVEPNFANKAVCRTNGFYSFHVQSWFGLHKTLQPLVKRVCDT 460 >gi|88811039|ref|ZP_01126295.1| von Willebrand factor, type A [Nitrococcus mobilis Nb-231] gi|88791578|gb|EAR22689.1| von Willebrand factor, type A [Nitrococcus mobilis Nb-231] Length = 930 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 32/279 (11%), Positives = 71/279 (25%), Gaps = 24/279 (8%) Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 + + G + I D E + +A +NL + Sbjct: 300 TNVPAGMGGVGVTTYRAIRWDVVTCRSLTFEITVGPTGGFGTPQGASVTVTADSNLPTAA 359 Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 I + A R ++ N N N + Sbjct: 360 ARIWLSYTSTNAGDTASGSVTVRCVQTGQSWTININANTIARPRSAV-------SLVIDR 412 Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 K+ L E+A +N + + L T + I +P Sbjct: 413 SGSMNDDAGDGITKVQKLREAANVFINIM--LPGDGIGLVRFNDTAQRLMEITDVGASPG 470 Query: 256 SNNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 + + + + ++P T+ + + L + + + T ++ +TD Sbjct: 471 GAGRTDALNHIAGSDIDPSGATSIGDGIVNGRNMLNDAQAAPMPDYDVT----AMVVLTD 526 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 G + + + + Y+V + P Sbjct: 527 GMWN---------RPPSLADVAGSINANTYAVGLGIPSN 556 >gi|313216071|emb|CBY37449.1| unnamed protein product [Oikopleura dioica] Length = 107 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 13/104 (12%), Positives = 35/104 (33%), Gaps = 16/104 (15%) Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 +KK ++ +TDG++ +N ++ +++ V +D L Sbjct: 9 AGSRPYVKKHMVLLTDGQSQDDVGAP--------ARAAKNFNIRTFAIGVG--DAIEDEL 58 Query: 358 RKCTDSS--GQFFAVNDSRELLESFDKIT----DKIQEQSVRIA 395 + + V D + D + + + + + +A Sbjct: 59 KLVATPPFSDTLYHVEDYDGIRHLQDTLAFKFCEDLGKSLLTLA 102 >gi|218196035|gb|EEC78462.1| hypothetical protein OsI_18329 [Oryza sativa Indica Group] Length = 614 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 16/126 (12%), Positives = 41/126 (32%), Gaps = 16/126 (12%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 K ++ L +TN A+ L + K + S ++F++DG Sbjct: 88 QGRAVAKEIVDGLVADGSTNMGAALLEGAMILRDRKGARD---ESNGRVGCMMFLSDG-T 143 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN-DSRE 375 + ++ ++ + + + ++S + VN + + Sbjct: 144 NDEIYKEDISGEFPA-----------HTFGLGSDHNPNVMRHIADETSATYSFVNRNIAD 192 Query: 376 LLESFD 381 + +FD Sbjct: 193 IKGAFD 198 >gi|110833078|ref|YP_691937.1| hypothetical protein ABO_0217 [Alcanivorax borkumensis SK2] gi|110646189|emb|CAL15665.1| hypothetical protein ABO_0217 [Alcanivorax borkumensis SK2] Length = 151 Score = 44.5 bits (103), Expect = 0.033, Method: Composition-based stats. Identities = 10/127 (7%), Positives = 31/127 (24%), Gaps = 5/127 (3%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 ++ + + A + + +MQ ++D A + ++ + T + Sbjct: 20 FLLVSPLVIGLVYAAATYGVLFSWQMRMQVSVDRAA-AAATTLDRNTTSDPGVMAASLAN 78 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + ++I + + P + Sbjct: 79 GALANNVPTFMGSVPADACVTVGAEVVCDLSIALTDGGCSEASGVATG----PDQLGIFG 134 Query: 122 GLIPSAL 128 G P Sbjct: 135 GFPPLPD 141 >gi|325954651|ref|YP_004238311.1| von Willebrand factor type A [Weeksella virosa DSM 16922] gi|323437269|gb|ADX67733.1| von Willebrand factor type A [Weeksella virosa DSM 16922] Length = 336 Score = 44.1 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 18/104 (17%), Positives = 38/104 (36%), Gaps = 21/104 (20%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 +PL+N+ + S L L+ + T+ A A L G+ K Sbjct: 141 TISPLTNDYAAIDSYLGSLSTNLISNQGTDFSAAFREAVSVL----------KGAPNTSK 190 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 V+ ++DGE+ + Q+ + + + + S+ + Sbjct: 191 LVVLLSDGEDHESGENQS-------IKLANDNQIHVVSIGIGTD 227 >gi|293569033|ref|ZP_06680345.1| von Willebrand factor type A domain protein [Enterococcus faecium E1071] gi|291588214|gb|EFF20050.1| von Willebrand factor type A domain protein [Enterococcus faecium E1071] Length = 1502 Score = 44.1 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 31/372 (8%), Positives = 90/372 (24%), Gaps = 50/372 (13%) Query: 48 RTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES 107 + + +T++ + + I ++++ + Sbjct: 347 PSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNA 406 Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 I N K L+ ++ + +S + + D+ D+ Sbjct: 407 AL---IEAVNEMSKDLLSDPSLDIRI-GMVNFYHNSTAINNHEQISSDIFPLTNDINRLT 462 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 ++N + P T + + V+ + + ++ Sbjct: 463 GSENTALNRTPIGGTPLTLGLKNGYETLYKDNGGENRNPEKILIVVGDGTPTFSYAPIQS 522 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 I + L N E NT+ H Sbjct: 523 SYRTSTNGAWSNWTVMEDKIAEDNDV-LFRNFEEF---------SGNTS-NAGFTHPVTY 571 Query: 288 LYNEKESSHNTIGSTRLKKFVI-------FITDGENSGASAYQNTLNTLQIC-------- 332 + R + ++ DG ++ + T Sbjct: 572 ASDFNRPEDEVNVHYRYGEVKEGDNKATHWVGDGSSNNNTNGSPTSQEKSSAINTVAYHH 631 Query: 333 ---EYMRNAGMKIYSVAVSAPPE----------GQDLLRKCTD------SSGQFFAVNDS 373 + I+S+ + G+++L+ D + +++ N+ Sbjct: 632 WLKNKYQENPPSIFSIGLGIDGNVSGRQRLDAIGRNVLKNIADLEEDGVTP-RYYNANNK 690 Query: 374 RELLESFDKITD 385 +++ + + I+ Sbjct: 691 NDIVTALEDISS 702 >gi|254480742|ref|ZP_05093989.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] gi|214039325|gb|EEB79985.1| von Willebrand factor type A domain protein [marine gamma proteobacterium HTCC2148] Length = 493 Score = 44.1 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 19/125 (15%), Positives = 40/125 (32%), Gaps = 9/125 (7%) Query: 259 LNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT-DGEN 316 + + L TN A+ A +L ++ K + + Sbjct: 91 RQRAQQAIATLKAAGQRTNIPAALEQATADLEQPTSGYRTSVLLLTAGKVDVAESPIINV 150 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRE 375 + + N + G+ ++++A SA + LLR ++G + + E Sbjct: 151 NESRKLLNGR-----AVELGELGVPVHTIAFSAQADAM-LLRSLARQTNGTSRQADSADE 204 Query: 376 LLESF 380 L F Sbjct: 205 LSVMF 209 >gi|126666680|ref|ZP_01737657.1| 1-deoxy-D-xylulose 5-phosphate reductoisomerase [Marinobacter sp. ELB17] gi|126628725|gb|EAZ99345.1| 1-deoxy-D-xylulose 5-phosphate reductoisomerase [Marinobacter sp. ELB17] Length = 1318 Score = 44.1 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 42/364 (11%), Positives = 91/364 (25%), Gaps = 29/364 (7%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + A + + + S I K+ + S+ N + ++TK Sbjct: 91 AVKAIVNNFGDRTRIGLMAYKQSGINKRYLHDSQYDASFNPNNYDATFTGDRASVTKRNR 150 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + + Y + L S + + + R Sbjct: 151 VSNPTNSGAYIYYNVALPFYSSSNLGN--AFCYSSTADFDNGSETASGPWDNYDC--YQR 206 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + + S+++ T + Sbjct: 207 KTNTSDSGAAGFSQYLFSSRFEASDSDFAQNILDFGTYLTWQYVSLSWFSNSSPGQGMLH 266 Query: 219 NLVNSIQKAIQEK--KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 ++ + A + + L+ + + + + TPL+ L T Sbjct: 267 VAIDDVDAAHKTRLLNKLNTSQFSSSSDTPLRNAGLTPLAGTLESASKYF------GGTL 320 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY-- 334 + L + +V+ +TDG S + T + + Sbjct: 321 SNGEAASGVSTLAPTP-------NFCGTQDYVVLVTDGLPSVNKSGSKTTDVTSAVDAVA 373 Query: 335 -----MRNAGMKIYSVAVSAP-PEGQDLLRKCTDSSGQ--FFAVNDSRELLESFDKITDK 386 + + G++ Y V P + LL K + G NDS L + I Sbjct: 374 DEAAALLSQGVRTYVVGFGLPTGVDKSLLNKIAIAGGTKATLFANDSTTLDAALGSIFLN 433 Query: 387 IQEQ 390 I + Sbjct: 434 IFNR 437 >gi|115623672|ref|XP_785426.2| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] gi|115960633|ref|XP_001186588.1| PREDICTED: similar to inter-alpha-trypsin inhibitor heavy chain3 [Strongylocentrotus purpuratus] Length = 1028 Score = 44.1 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 61/177 (34%), Gaps = 24/177 (13%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + +++ ++ + L + +V S+N+ K +N L+ Sbjct: 330 KRAFTTILDDVRPIDRINIILFSSYAHVWREDQMVEAT----SDNIAAAKRHVNGLSVGG 385 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 TN Y ++ A L + +I +TDG+ A+A + ++ Sbjct: 386 GTNIYDSLMKAVEILLEH--------DTGDAMPLIIMLTDGQVGNAAAIVRDVTSVIGGR 437 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG----QFFAVNDSR-ELLESFDKITD 385 ++S+ L K + S+ + + + + ++ +D++ + Sbjct: 438 L------SLFSIGFG-NGVDFPFLEKLSLSNQALARKVYEDSSASLQMKGFYDEVAN 487 >gi|270016374|gb|EFA12820.1| hypothetical protein TcasGA2_TC001887 [Tribolium castaneum] Length = 1264 Score = 44.1 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 26/273 (9%), Positives = 61/273 (22%), Gaps = 40/273 (14%) Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 + + S L + Sbjct: 199 HNYEDDPSLSWQFFGSGTGFLRRYPGIAWPPVDMSTVWQRPRSSRNVYDFRSSAWYVSAA 258 Query: 171 NNNMTSNK-YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES--AGNLVNSIQKA 227 + + ++ N ++V S V + Sbjct: 259 TSPKDIVILIDNSGSMSGHKSNLARATTESILNTLGDNDFVNVFKFSDITEETVPCFKDM 318 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR-LNKLNPYENTNTYPAMHHAYR 286 + + N +VR L +L+ KS + A+ + Sbjct: 319 LVQANNENVRW----------------LKESLSTFKSENIANFTA--------ALVTGFE 354 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 L+ + + + ++ ITDG S N ++I++ Sbjct: 355 ILHKYNRTGQGCQCN----QAIMLITDGPPSSYQEIFKMYNFPH-------YPVRIFTYL 403 Query: 347 VSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLE 378 V +R + G + + + E+ + Sbjct: 404 VGKDSSSAHEMRWMACANKGYYTRIENFDEINQ 436 >gi|189242452|ref|XP_969952.2| PREDICTED: similar to voltage-gated calcium channel alpha2-delta subunit 1 [Tribolium castaneum] Length = 1217 Score = 44.1 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 26/273 (9%), Positives = 61/273 (22%), Gaps = 40/273 (14%) Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 + + S L + Sbjct: 199 HNYEDDPSLSWQFFGSGTGFLRRYPGIAWPPVDMSTVWQRPRSSRNVYDFRSSAWYVSAA 258 Query: 171 NNNMTSNK-YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES--AGNLVNSIQKA 227 + + ++ N ++V S V + Sbjct: 259 TSPKDIVILIDNSGSMSGHKSNLARATTESILNTLGDNDFVNVFKFSDITEETVPCFKDM 318 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR-LNKLNPYENTNTYPAMHHAYR 286 + + N +VR L +L+ KS + A+ + Sbjct: 319 LVQANNENVRW----------------LKESLSTFKSENIANFTA--------ALVTGFE 354 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 L+ + + + ++ ITDG S N ++I++ Sbjct: 355 ILHKYNRTGQGCQCN----QAIMLITDGPPSSYQEIFKMYNFPH-------YPVRIFTYL 403 Query: 347 VSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLE 378 V +R + G + + + E+ + Sbjct: 404 VGKDSSSAHEMRWMACANKGYYTRIENFDEINQ 436 >gi|303241521|ref|ZP_07328022.1| viral A-type inclusion repeat-containing protein [Acetivibrio cellulolyticus CD2] gi|302590939|gb|EFL60686.1| viral A-type inclusion repeat-containing protein [Acetivibrio cellulolyticus CD2] Length = 1061 Score = 44.1 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 12/35 (34%), Positives = 17/35 (48%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAV 37 AII+SV + +D A Q+QS +AA Sbjct: 16 AIILSVMLILTGVLVDGARARTAEAQVQSTTEAAA 50 >gi|119468931|ref|ZP_01611956.1| hypothetical protein ATW7_04187 [Alteromonadales bacterium TW-7] gi|119447583|gb|EAW28850.1| hypothetical protein ATW7_04187 [Alteromonadales bacterium TW-7] Length = 664 Score = 44.1 bits (102), Expect = 0.034, Method: Composition-based stats. Identities = 22/164 (13%), Positives = 51/164 (31%), Gaps = 21/164 (12%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPL---SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 I ++ + PL NL + + L T A+ Sbjct: 350 ESDDSFNIIGFDNNVTAMSDRPLIASDFNLRRAERFIYSLEADGGTEIQGALDAVLD--- 406 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + V+F+TDG S + + + ++++V + + Sbjct: 407 --------GSTFDGFVRQVVFLTDGSVSNEATLFKNI-------QAKLGDSRLFTVGIGS 451 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 P + R G F + ++E+ +++ DK+ ++ Sbjct: 452 APNSFFMRRAADIGKGTFTFIGSTQEVQPKMEQLFDKLAHPAIT 495 >gi|313904291|ref|ZP_07837669.1| von Willebrand factor type A [Eubacterium cellulosolvens 6] gi|313470841|gb|EFR66165.1| von Willebrand factor type A [Eubacterium cellulosolvens 6] Length = 371 Score = 44.1 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 31/287 (10%), Positives = 79/287 (27%), Gaps = 35/287 (12%) Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP 185 A L R S I + +DVS S+ + D + Sbjct: 73 IASLTLLARPYKTETTGSGVKKRDIFLCMDVSYSIY-NLNAELVDRLQKVVSGM------ 125 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI---QKAIQEKKNLSVRIGTIA 242 K + + P D +++ ++ ++ + + + Sbjct: 126 KDDRFGIAIYNTSTV-LYVPMTDDYDFVVKKLEDIKEYFILQKQYMDKFGKYNYTSEIPD 184 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 ++ T L + + + ++ + ++ + + Sbjct: 185 EDMDEYKRLRTEL--DYYDAGTLVDNMT-KGSSLIGEGLA--------SCMYDFPRLEKQ 233 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV-----AVSAP-----PE 352 + +I TD S L + E + + ++ + Sbjct: 234 NRTRVIIMSTDNAQEERSKPLVEL--QEAAELCKKNDITVFGIFPNKSQFDQDMSMDYET 291 Query: 353 GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 ++ +R+C + + G F+ +DS + + I K + I R Sbjct: 292 DKEAMRECVELTGGSFYEESDSLSVDDIITDIQRKEAMEVEEITTTR 338 >gi|120537195|ref|YP_957252.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] gi|120327030|gb|ABM21337.1| von Willebrand factor, type A [Marinobacter aquaeolei VT8] Length = 584 Score = 44.1 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 19/121 (15%), Positives = 40/121 (33%), Gaps = 25/121 (20%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + T AM +A REL ++ + +I ITDG+ + + Sbjct: 487 DSGGGTPLAEAMLYAARELTASHKAR----------QVMIVITDGDPN-------NPQAV 529 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + + Y++ + +P + C ++D +L + I + E Sbjct: 530 NYLNGLIKGHIDTYAIGIGSPAVKRFFENWCV--------ISDVSQLQSALFSIASNVLE 581 Query: 390 Q 390 Sbjct: 582 L 582 >gi|297667858|ref|XP_002812180.1| PREDICTED: vitrin-like isoform 1 [Pongo abelii] Length = 693 Score = 44.1 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 39/120 (32%), Gaps = 14/120 (11%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 347 MGVVQYGDSPATHFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 404 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 405 ---NRSSAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 453 >gi|194221204|ref|XP_001915782.1| PREDICTED: calcium channel, voltage-dependent, alpha 2/delta subunit 3 [Equus caballus] Length = 1055 Score = 44.1 bits (102), Expect = 0.035, Method: Composition-based stats. Identities = 20/177 (11%), Positives = 58/177 (32%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 235 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRTNKEHFREHLDK 293 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A++ A+ L + + +I + ++ ITDG N Sbjct: 294 LFAKGIGMLDIALNEAFNILSDFNHTGQGSICI----QAIMLITDGAVDTYDTIFAKYNW 349 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 350 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 397 >gi|297663822|ref|XP_002810364.1| PREDICTED: integrin alpha-10-like [Pongo abelii] Length = 1177 Score = 44.1 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 64/215 (29%), Gaps = 25/215 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + K + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 221 DFRTKEEVVRAAKNLSRREGRETKTAQAILVACTEGFSQSHGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 364 >gi|254695378|ref|ZP_05157206.1| Von Willebrand factor, type A [Brucella abortus bv. 3 str. Tulya] gi|261215750|ref|ZP_05930031.1| norD protein [Brucella abortus bv. 3 str. Tulya] gi|260917357|gb|EEX84218.1| norD protein [Brucella abortus bv. 3 str. Tulya] Length = 633 Score = 44.1 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 46/132 (34%), Gaps = 20/132 (15%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T AM HA +L + KK ++ +TDG+ + Sbjct: 515 TVEHRIAALKPGFYTRMGAAMRHATAKLAEQPNR----------KKLLLLLTDGKPNDVD 564 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRE 375 ++ + +R G+ +++V V L G + V + + Sbjct: 565 HYEGRFALEDSRRAAGEVRAKGVNVFAVTVDREASAYLPALF-----GRGGYALVANLAK 619 Query: 376 LLESFDKITDKI 387 L + I + Sbjct: 620 LPVAMPAIYRML 631 >gi|161524898|ref|YP_001579910.1| hypothetical protein Bmul_1725 [Burkholderia multivorans ATCC 17616] gi|160342327|gb|ABX15413.1| conserved hypothetical protein [Burkholderia multivorans ATCC 17616] Length = 626 Score = 44.1 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 23/155 (14%), Positives = 50/155 (32%), Gaps = 8/155 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M AI + + + + AID+ ++ + R +Q D A ++ S+ + ++ Sbjct: 22 MAAIWVMIAIVVLG-AIDVGNLYFQRRNLQRIADMAAIASVESMTDQCSQQNSPAMMAAQ 80 Query: 61 STIFKKQIKKHLK-QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 S Q I D + N T N + + Y +F Sbjct: 81 SNALANGFDYRASGQTLSIECGRWDTSATPYFNSTFTPLNAVSVSVTQQVPY------IF 134 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 L + + + + ++ + +I L Sbjct: 135 LGRFFGKSGSTGATVAAFSTAKAINIDSFTIGTTL 169 >gi|154411703|ref|XP_001578886.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121913087|gb|EAY17900.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 710 Score = 44.1 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 24/267 (8%), Positives = 73/267 (27%), Gaps = 47/267 (17%) Query: 150 ICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP-----PPPKKSFWSKNTTKSKYAPAPA 204 ++D ++ + +D + ++ S + Sbjct: 182 NMKIIDNHNAIFETKTTPRDDAIIIETSIKDEDKGIAISSDGYIAISTYPYFEGKVENNS 241 Query: 205 PAN-----------RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 +I+ L++S+ + R I + Sbjct: 242 EFYFLIDCSGSMYGSRIENAKFCLNLLIHSLP--------IDSRFSIIKFGTSYEEIFPI 293 Query: 254 PLSNNLNE--VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 N N ++ L+ + T+ + + Y + + + Sbjct: 294 CDYTNKNVKIAMRQIKDLDADMDGTDILSPLEYVYTQTTKNG-----------YHRKIFL 342 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFA 369 +TDG+ + + + + R+ +IY++ + + L++ + S G + Sbjct: 343 LTDGQVHNSDVICS------LAQEKRDNN-RIYAIGLGSGA-DPGLIKNVSLKSWGNYVL 394 Query: 370 VNDSRELLESFDKITDKIQEQSVRIAP 396 + D + E ++ + + Sbjct: 395 IADKDNMNEKVIELMESSISPYLTNIS 421 >gi|62317822|ref|YP_223675.1| NorD protein [Brucella abortus bv. 1 str. 9-941] gi|83269804|ref|YP_419095.1| von Willebrand factor, type A [Brucella melitensis biovar Abortus 2308] gi|189023075|ref|YP_001932816.1| Von Willebrand factor, type A [Brucella abortus S19] gi|237817362|ref|ZP_04596354.1| Protein norD [Brucella abortus str. 2308 A] gi|254691324|ref|ZP_05154578.1| Von Willebrand factor, type A [Brucella abortus bv. 6 str. 870] gi|254699113|ref|ZP_05160941.1| Von Willebrand factor, type A [Brucella abortus bv. 2 str. 86/8/59] gi|254732556|ref|ZP_05191134.1| Von Willebrand factor, type A [Brucella abortus bv. 4 str. 292] gi|256256509|ref|ZP_05462045.1| Von Willebrand factor, type A [Brucella abortus bv. 9 str. C68] gi|260545056|ref|ZP_05820877.1| von Willebrand factor [Brucella abortus NCTC 8038] gi|260756933|ref|ZP_05869281.1| norD protein [Brucella abortus bv. 6 str. 870] gi|260760365|ref|ZP_05872713.1| nitric oxide reductase activation protein [Brucella abortus bv. 4 str. 292] gi|260763605|ref|ZP_05875937.1| norD protein [Brucella abortus bv. 2 str. 86/8/59] gi|260882746|ref|ZP_05894360.1| protein norD [Brucella abortus bv. 9 str. C68] gi|297249867|ref|ZP_06933568.1| nitric-oxide reductase NorD protein [Brucella abortus bv. 5 str. B3196] gi|75495379|sp|Q576X0|NORD_BRUAB RecName: Full=Protein norD gi|114152095|sp|Q2YJT9|NORD_BRUA2 RecName: Full=Protein norD gi|62198015|gb|AAX76314.1| NorD protein [Brucella abortus bv. 1 str. 9-941] gi|82940078|emb|CAJ13118.1| Von Willebrand factor, type A [Brucella melitensis biovar Abortus 2308] gi|189021649|gb|ACD74370.1| Von Willebrand factor, type A [Brucella abortus S19] gi|237788175|gb|EEP62391.1| Protein norD [Brucella abortus str. 2308 A] gi|260098327|gb|EEW82201.1| von Willebrand factor [Brucella abortus NCTC 8038] gi|260670683|gb|EEX57623.1| nitric oxide reductase activation protein [Brucella abortus bv. 4 str. 292] gi|260674026|gb|EEX60847.1| norD protein [Brucella abortus bv. 2 str. 86/8/59] gi|260677041|gb|EEX63862.1| norD protein [Brucella abortus bv. 6 str. 870] gi|260872274|gb|EEX79343.1| protein norD [Brucella abortus bv. 9 str. C68] gi|297173736|gb|EFH33100.1| nitric-oxide reductase NorD protein [Brucella abortus bv. 5 str. B3196] Length = 633 Score = 44.1 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 46/132 (34%), Gaps = 20/132 (15%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T AM HA +L + KK ++ +TDG+ + Sbjct: 515 TVEHRIAALKPGFYTRMGAAMRHATAKLAEQPNR----------KKLLLLLTDGKPNDVD 564 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRE 375 ++ + +R G+ +++V V L G + V + + Sbjct: 565 HYEGRFALEDSRRAAGEVRAKGVNVFAVTVDREASAYLPALF-----GRGGYALVANLAK 619 Query: 376 LLESFDKITDKI 387 L + I + Sbjct: 620 LPVAMPAIYRML 631 >gi|12957162|dbj|BAB32650.1| complement factor B/C2-A3 [Cyprinus carpio] Length = 754 Score = 44.1 bits (102), Expect = 0.036, Method: Composition-based stats. Identities = 30/188 (15%), Positives = 62/188 (32%), Gaps = 18/188 (9%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 L+ I + T I + + + NNL E+ RL Sbjct: 270 AKGVIKTLIEKISYYEVSPNYEILIFATDVARIVSMRDFKSAQKNNLLEILKRLKDYEYN 329 Query: 273 E-----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 TN A + + E N + VI TDG+ + + ++ Sbjct: 330 SKGDRTGTNIAQAYRSILESM--QIEQMTNKEEFKTTQHIVIMFTDGQANMGGNPRPWVD 387 Query: 328 TLQICEYMRNA-------GMKIYSVAVSAPPEGQDL--LRKCTDSSGQFFAVNDSRELLE 378 ++ + ++ + +Y + +D+ L+ + FF + + +L E Sbjct: 388 QIK--DLVKKNSPSEEEENLDLYVFGMGDDVNAEDINDLKTDRGNEKFFFKLKNLEDLQE 445 Query: 379 SFDKITDK 386 +FD + D+ Sbjct: 446 TFDSMIDE 453 >gi|291221810|ref|XP_002730913.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 858 Score = 44.1 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 24/122 (19%), Positives = 49/122 (40%), Gaps = 20/122 (16%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + TN + A +EL + + ++ ITDG+ S S + + Sbjct: 428 SANGGTNFGAGIRAALQELKESQ--------LSLKGASLLIITDGQFSYTSDVTDEVYA- 478 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQ 388 +G+++ ++A +D LR +D + G ++ V+D E D +T I Sbjct: 479 --------SGVRVDTIAY--TQAAEDSLRVLSDRTGGSYYYVSDDETSTELLDSLTSTIT 528 Query: 389 EQ 390 ++ Sbjct: 529 DR 530 >gi|146338945|ref|YP_001203993.1| hypothetical protein BRADO1888 [Bradyrhizobium sp. ORS278] gi|146191751|emb|CAL75756.1| conserved hypothetical protein [Bradyrhizobium sp. ORS278] Length = 408 Score = 44.1 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 30/221 (13%), Positives = 56/221 (25%), Gaps = 9/221 (4%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLK 73 A+D + R Q+Q A+D+AVL+G + + F Sbjct: 41 GTAVDFSKSSDTRTQLQKAVDSAVLAGVVQPSGQQVSTANAIFRGDFGGRFGTAASASFT 100 Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIP---------TENLFLKGLI 124 S A +T + L A + A+ L + L+ Sbjct: 101 ANSDGSLTGSASATVNTSFLTVMGTSSLGISASATAKPGAQSKSPVCILLVSTLNSQSLL 160 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 ++ L+ + + S++N A L+V R N + Sbjct: 161 VNSGAQLNAPTCEVHVLSTQNPAALFNATLNVKRICIKGSTIIKNGGVTPPAETSCAAIS 220 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 + T + L + Sbjct: 221 DPFAGSLPTVTVGSCTTNNKVYDPGSVTLSPGVYCGSTNFN 261 >gi|326911040|ref|XP_003201870.1| PREDICTED: collagen alpha-1(XIV) chain-like [Meleagris gallopavo] Length = 438 Score = 44.1 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 16/113 (14%), Positives = 41/113 (36%), Gaps = 12/113 (10%) Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 ++ + SH K V+ IT G+ ++ + +++ Sbjct: 130 GSSLKVGSALAFAAHAMSHPDTLREDAAKVVVLITSGK--------SSDLVEDKAQVLQD 181 Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDKITDKIQ 388 AG+ +++V + + L K ++ V+D L + K++ ++ Sbjct: 182 AGVTVFAVGI--KDADKHELNKIASEPTAEHVIYVDDFHLLHNAAQKLSRRLC 232 >gi|307591436|ref|YP_003900235.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986290|gb|ADN18169.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 441 Score = 44.1 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 32/229 (13%), Positives = 79/229 (34%), Gaps = 22/229 (9%) Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 L+VS + + + L P + S +T+ + + + Sbjct: 2 LNVSITPHREFFPAD-TADQRLFIMLKLRPTKEVSNSRPSTSFAFVIDTSGSMDEVVTGG 60 Query: 214 IESAGNLVNSIQKAIQEKK-NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL-NP 271 ++ S+ + ++ + I I ++ + + ++++ + +L + Sbjct: 61 KSKKSIVIESLYQLVRSGRLTQDDHIAIIEFHDQASTLIGLTPATQVFQLENAIARLNDF 120 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T M+ A L N+ +S + V+ TDGE + + I Sbjct: 121 SGGTCMGKGMNEALVLLTNQSMTS----------RRVLIFTDGET------FDEEDCEII 164 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFA--VNDSRELL 377 + N G+ I ++ V L+ K +D + G+ + + ++ L Sbjct: 165 AQQFSNQGISITAMGVGDEFNEDLLIHKISDPTGGRLYPLVIGNATGLQ 213 >gi|307591429|ref|YP_003900228.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306986283|gb|ADN18162.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 426 Score = 44.1 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 56/160 (35%), Gaps = 17/160 (10%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 KI+ I++ + +S+ + +N + V Sbjct: 125 GTTKIEGAIKAIREFTEIAKD-RGGNTQVSIVPFGDPGKNCAGYPIDSNTLDNFSRVDDA 183 Query: 266 --------LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR------LKKFVIFI 311 L L+P +TN Y + A R L + +S + S+ + +I + Sbjct: 184 KLQIFLDNLASLSPCASTNLYEPLSKAVRFLGKKNDSRFYPLDSSGNPIEPQPRLSIILL 243 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 +DG ++ + Q+ + Q+ + +N + ++++ Sbjct: 244 SDGYHNKPNEAQDFQSLNQLLK--KNNQIIVHTLGYGLTA 281 >gi|91228647|ref|ZP_01262563.1| putative calcium-binding outer membrane-like protein [Vibrio alginolyticus 12G01] gi|91187798|gb|EAS74114.1| putative calcium-binding outer membrane-like protein [Vibrio alginolyticus 12G01] Length = 2510 Score = 44.1 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 32/318 (10%), Positives = 82/318 (25%), Gaps = 26/318 (8%) Query: 47 DRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAE 106 + + + + + + +++ Sbjct: 1912 TLNVSEEGLSNAIADDSALNGFDDTTDSSTDVEQLTLGQTV-DSVTLSEPSTALTSNGVS 1970 Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 + L + + + + S D + + Sbjct: 1971 VTWTLSNYDQMLIGSANGEEVIKISVNDTGVVSTELLGPIDHSNPSGEDTLNIEVPVLVS 2030 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFW-----------SKNTTKSKYAPAPAPANRKIDVLIE 215 N T+ ++ S S N + ++ ++ + Sbjct: 2031 NARGLTNSTTVNVIVEDDSPDSASIIHDVVAETKESANVQLIMDVSGSMRTDNRLQIMKD 2090 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPYEN 274 SA L+N + Q R+ I Y+ + + E K+ + L Sbjct: 2091 SATQLLNQYESIGQT------RVQIITYSSTASTYAIGAATWLTVEEAKAYIETLTAGGA 2144 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 TN A++ A + + T F++DG+ + AS++ N ++ Sbjct: 2145 TNYNNALNEAKQSWDDVG-------KLTSASNVSYFLSDGQPNPASSFINDAREQSWIDH 2197 Query: 335 MRNAGMKIYSVAVSAPPE 352 + + +I ++A Sbjct: 2198 LTDPDNQITALAYGMGVN 2215 >gi|77359583|ref|YP_339158.1| inter-alpha-trypsin inhibitor domain-containing protein [Pseudoalteromonas haloplanktis TAC125] gi|76874494|emb|CAI85715.1| conserved protein of unknown function ; putative Inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis TAC125] Length = 664 Score = 44.1 bits (102), Expect = 0.037, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 52/164 (31%), Gaps = 21/164 (12%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPL---SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + + I ++ + PL NL + + L T A+ Sbjct: 354 DSNDSFNIIGFDNVVTLMSDKPLVASGFNLRRAERFIYGLQADGGTEIQGALDAVLD--- 410 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + VIF+TDG S A ++ + ++++V + + Sbjct: 411 --------GSQFDGFVRQVIFLTDGSVSNEDALFKSI-------QAKLGDSRLFTVGIGS 455 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 P + R G F + + E+ ++ DK+ ++ Sbjct: 456 APNSFFMRRAADVGKGSFTFIGSTSEVQPKMQQLFDKLAHPAIT 499 >gi|330831165|ref|YP_004394117.1| TapY1 [Aeromonas veronii B565] gi|328806301|gb|AEB51500.1| TapY1 [Aeromonas veronii B565] Length = 1214 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 34/323 (10%), Positives = 84/323 (26%), Gaps = 78/323 (24%) Query: 132 SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS 191 + + V +R+ + + N + + + S ++ Sbjct: 164 EQNAKNQYSGPYSTYIVDGYPVDKPNRTTGSWFYTSNVANALASGSSTQI------SLFT 217 Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI--------QEKKNLSVRIGTIAY 243 N + Y ++ V ++ LV + + R G Sbjct: 218 ANYVRWYYGSTGVSTQTRLKVAQDAVKELVLATPAVDFGLAVFNYNDGNEPGSRNGGRII 277 Query: 244 NIGIVGNQCTPLS-NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY-------NEKESS 295 + + + ++ ++N L+ T ++ +YR ++ + Sbjct: 278 RNIVGNDAILTSGKSGAQDLIDKVNTLSSDTWTPLCETLYESYRYFGGLGVYFGDDDVNR 337 Query: 296 HNTIGS---------------TRLKKFVIFITDGENSGASAYQNTLNTL----------- 329 S ++I++TDGE + A + + L Sbjct: 338 TPKRDSNAEVTSGIYKTPYDKCSNNGYIIYMTDGEPTNDKAADSLVANLISTLSTEDKVA 397 Query: 330 -----------------QICEYMRNAGM----------KIYSVAVSAPP---EGQDLLRK 359 + YM+N + ++V G L Sbjct: 398 YGGLVSYGSSGEKSYLAALAGYMKNNDVNTSAIGKQTVTTFTVGFGDEAIKGAGALLAET 457 Query: 360 CTDSSGQFFAVNDSRELLESFDK 382 G+++ D+ +L E+ Sbjct: 458 ARRGGGKYYPATDATQLNEALKS 480 >gi|145502983|ref|XP_001437469.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124404619|emb|CAK70072.1| unnamed protein product [Paramecium tetraurelia] Length = 562 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 30/172 (17%), Positives = 65/172 (37%), Gaps = 15/172 (8%) Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNI-GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 VN+ A +K + G +++ N+ ++ NP +T Y Sbjct: 145 VNAFFSAFADKTLAFEFNHIVKLVWFGSTLFDKCEFTSDFNKFIKLVDDANPGGSTKCYD 204 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A+ +A +L K+ + ++ +TDGE++ +++ NT + + + + Sbjct: 205 AIDYAINKLLEVKQKYP------DIVLRILALTDGEDNASASKPNT-----LVQRIFDHK 253 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 + I S V G L S+G+ + D + + F +I + Sbjct: 254 IIIDSFVVGDNCVGLKTLTHA--SNGRCYCPRDLGQGMSLF-EIESILSTSR 302 >gi|307102442|gb|EFN50717.1| hypothetical protein CHLNCDRAFT_142575 [Chlorella variabilis] Length = 575 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 15/126 (11%), Positives = 31/126 (24%), Gaps = 4/126 (3%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + TN + A E+ + + ++ ITD + + L Sbjct: 197 DIADTSGTNMQAGLDAATGEM--RACRTCMEADRAATENRIVLITDAQPNQGDISDEGLL 254 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 G+ + V + + +F+V+ E D Sbjct: 255 ARLKANAA--DGIHTTIIGVGLDFNTELVEGISKVRGANYFSVHSPGEFRRRLTDEFDYA 312 Query: 388 QEQSVR 393 V Sbjct: 313 VSPLVF 318 >gi|332260454|ref|XP_003279304.1| PREDICTED: von Willebrand factor A domain-containing protein 3B [Nomascus leucogenys] Length = 1160 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 22/170 (12%), Positives = 56/170 (32%), Gaps = 30/170 (17%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 K+D++ + + K + V+ A + +NL + Sbjct: 430 SHSMKSKLDLVKDKIIQFIQEQLKYKSKFN--FVKFDGQAVAWREQLAEVNE--DNLEQA 485 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 +S + + +TNT A+ A+ + + + + +TDG Sbjct: 486 QSWIRDMKIGSSTNTLSALKTAF---------------ADKETQAIYLLTDGRPDQP--- 527 Query: 323 QNTLNTLQICEYMRN-AGMKIYSVAVSAPPE-GQDLLRKCTD-SSGQFFA 369 + + ++ + IY+++ + E L++ + G+F Sbjct: 528 -----PEMVIDQVKVFQEIPIYTISFNYNDEIANRFLKEVAALTGGEFHF 572 >gi|221113899|ref|XP_002160633.1| PREDICTED: similar to Collagen alpha-5(VI) chain [Hydra magnipapillata] Length = 9981 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 20/154 (12%), Positives = 50/154 (32%), Gaps = 13/154 (8%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 + + +G +A + + ++ + L L T + +A++ Sbjct: 9053 SYKISTDATYVGVVANGDQPIVSIKLNAVSSYESIVGYLKNLKY---TGESRKLSYAFQI 9109 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + S N + K ++F G + + + +++ G+KI +A+ Sbjct: 9110 VRTSLFSEENGGRESIPKTLIVFSNSG------FSIDMNDLSDEAQALKDMGVKIVFIAL 9163 Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + +L++ FF D L Sbjct: 9164 GEDARSE-MLKQVV---DVFFFAEDLPNLKYILQ 9193 Score = 43.0 bits (99), Expect = 0.090, Method: Composition-based stats. Identities = 14/65 (21%), Positives = 27/65 (41%), Gaps = 3/65 (4%) Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKI 383 N L E ++ +G+ I ++ + L K S+ VN+++ L E F I Sbjct: 4637 KDKNALLASEELKKSGVVINIISF---CYQKANLAKIATSNNHLVEVNENQTLPEIFADI 4693 Query: 384 TDKIQ 388 + + Sbjct: 4694 STNLV 4698 Score = 40.3 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 27/206 (13%), Positives = 57/206 (27%), Gaps = 14/206 (6%) Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS-IQKAIQEKKNLSVRIGTIAYNI 245 K T+ + + D + E + I + +++I + + Sbjct: 8386 KKTACGITSINIDSINIVFGFDSSDRVTEDVYTSIKEMAANLIPSFLSPNLQISVVLFGK 8445 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + S K + + + L E S ++ Sbjct: 8446 KLYHYPVLRESFIEATFKKEIINMPLLGG---KIGIKPLIEYLETEV-FSKQLGTKIEMR 8501 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 ++ T A ++ + N G+KI V ++ + S+G Sbjct: 8502 NILVLFT-------HALTLDVDAPLFEKLQNNRGVKIVVVGMNFNDNNIEKWNNAVSSNG 8554 Query: 366 QFFAVNDSRELLESFDKITDKIQEQS 391 + AV +EL F I I + Sbjct: 8555 KIVAVT--KELPLFFGAIESSISQLL 8578 >gi|198435757|ref|XP_002131969.1| PREDICTED: similar to polydomain protein-like [Ciona intestinalis] Length = 594 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 28/171 (16%), Positives = 61/171 (35%), Gaps = 25/171 (14%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLS--NNLNEVKSRLNKLNPYE-NTNTYPAMHHAYR 286 +R+G YN + L + +E+K++++K+ T T A+ HAY Sbjct: 432 TIGRDLMRVGAFRYNQRVDTATEVLLGEIDTFDELKTKVHKIPYNGSGTRTGNALLHAYN 491 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 N + N V+ TDG + + ++ +++ G I V Sbjct: 492 HSLNAPGNRPNVRD------IVLVFTDGVSHD--------DVIEPARLLQSRGADINVVG 537 Query: 347 VSAPPE--GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVRI 394 + + +R+ + V + + F+ +T + + R+ Sbjct: 538 IKNSRGRLDINQMREMVSKPIDEHITV-----ISQGFESLTSEFVNRISRL 583 >gi|553536|gb|AAA36226.1| MHC factor B [Homo sapiens] Length = 171 Score = 44.1 bits (102), Expect = 0.038, Method: Composition-based stats. Identities = 23/134 (17%), Positives = 46/134 (34%), Gaps = 23/134 (17%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 TNT A+ Y + + R + +I +TDG ++ + + + + Sbjct: 14 GTNTKKALQAVYSMMSWPDDVPPEGWN--RTRHVIILMTDGLHNMGG------DPITVID 65 Query: 334 YMRN-------------AGMKIYSVAVS--APPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 +R+ + +Y V + L D+ F V D L + Sbjct: 66 EIRDLLYIGKDRKNPREDYLDVYVFGVGPLVNQVNINALASKKDNEQHVFKVKDMENLED 125 Query: 379 SFDKITDKIQEQSV 392 F ++ D+ Q S+ Sbjct: 126 VFYQMIDESQSLSL 139 >gi|15921062|ref|NP_376731.1| hypothetical protein ST0828 [Sulfolobus tokodaii str. 7] gi|15621846|dbj|BAB65840.1| 360aa long hypothetical protein [Sulfolobus tokodaii str. 7] Length = 360 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 49/136 (36%), Gaps = 26/136 (19%) Query: 255 LSNNLNEVKS-------RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 SN+L ++ L + TN + A+ ++L + Sbjct: 82 FSNDLEKIYEGPSGNPIALQNVKKGYTTNLHKALSKILQQLAYVQLPVK----------- 130 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQ 366 +I ++DG+ + + N + ++I S+ + + +L++ D G Sbjct: 131 LIILSDGKPT------DKRNVKEYEGLQVPPNVQIISIGIGRD-YNEVILKRMADKGSGV 183 Query: 367 FFAVNDSRELLESFDK 382 ++ + D +L F + Sbjct: 184 YYHIEDPTQLPSVFAQ 199 >gi|305680383|ref|ZP_07403191.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] gi|305659914|gb|EFM49413.1| von Willebrand factor type A domain protein [Corynebacterium matruchotii ATCC 14266] Length = 880 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 52/128 (40%), Gaps = 27/128 (21%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-ASAYQNTLNTLQIC- 332 T+ A++ A +L + + K ++F TDG+ + + + IC Sbjct: 155 TDYGTALNGAMADLASRPDPESC--------KAILFFTDGKLTVQGDQKADIVAQKAICS 206 Query: 333 -----EYMRNAGMKIYSVAV--SAPPEGQDLLRKCTDS---------SGQFFAVN-DSRE 375 + +R+A +++++V + S + +LR ++ +G FF ++ Sbjct: 207 ADGQVKKLRDANIQLFTVGLIPSGEESPEQILRSMSEGNDCAIDTVPNGAFFNAESNAAS 266 Query: 376 LLESFDKI 383 L +F I Sbjct: 267 LFSAFRSI 274 >gi|225022540|ref|ZP_03711732.1| hypothetical protein CORMATOL_02580 [Corynebacterium matruchotii ATCC 33806] gi|224944663|gb|EEG25872.1| hypothetical protein CORMATOL_02580 [Corynebacterium matruchotii ATCC 33806] Length = 880 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 52/128 (40%), Gaps = 27/128 (21%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-ASAYQNTLNTLQIC- 332 T+ A++ A +L + + K ++F TDG+ + + + IC Sbjct: 155 TDYGTALNGAMADLASRPDPESC--------KAILFFTDGKLTVQGDQKADIVAQKAICS 206 Query: 333 -----EYMRNAGMKIYSVAV--SAPPEGQDLLRKCTDS---------SGQFFAVN-DSRE 375 + +R+A +++++V + S + +LR ++ +G FF ++ Sbjct: 207 ADGQVKKLRDANIQLFTVGLIPSGEESPEQILRSMSEGNDCAIDTVPNGAFFNAESNAAS 266 Query: 376 LLESFDKI 383 L +F I Sbjct: 267 LFSAFRSI 274 >gi|189485267|ref|YP_001956208.1| aerotolerance-related cytoplasmic membrane protein BatB [uncultured Termite group 1 bacterium phylotype Rs-D17] gi|170287226|dbj|BAG13747.1| aerotolerance-related cytoplasmic membrane protein BatB [uncultured Termite group 1 bacterium phylotype Rs-D17] Length = 330 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 52/170 (30%), Gaps = 54/170 (31%) Query: 254 PLSNNLNEVKSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ +L+ +K L + P T A+ A + E S + +I Sbjct: 142 PLTFDLHALKMFLQSVETTNLPLGGTRISSAIMLASKAASCESAGS----------RVMI 191 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP------------------ 351 I+DGEN + + + AG++I S+ + Sbjct: 192 LISDGENHDSKI-------KEAVNAAKKAGLRIISIGIGKKEGAPIPVKDETGTVIDYVK 244 Query: 352 ----------EGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITDKIQEQ 390 LL+ + G++F D + + + ++ Sbjct: 245 DVNGKVVISRANPVLLKNVAEEMGGKYFDALD----DDVYPSLVKAVRNL 290 >gi|146186309|ref|XP_001033348.2| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila] gi|146143017|gb|EAR85685.2| MHCK/EF2 kinase domain family protein [Tetrahymena thermophila SB210] Length = 1149 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 15/159 (9%), Positives = 43/159 (27%), Gaps = 7/159 (4%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + I++ +V ++K Q+ +G + + + ++ Sbjct: 155 CTGSMSSYINMCRTQLFLIVEQVKKQFQKSTLKIGFVGFRDFGDN-NQFEIYQFTTEYDK 213 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI----GSTRLKKFVIFITDGENS 317 +KS +N ++ +T + A+ + + ++ + Sbjct: 214 LKSFINSVSATGGNDTAEDVAGAFEKALEMNWTCEAKYAILITDAYPHGNKYCKSNSGDR 273 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 S N M G+ Y + + Sbjct: 274 YPSGDPNGRVPENQIIQMIKKGITFY--GLDVSSTSNQM 310 >gi|313229403|emb|CBY23990.1| unnamed protein product [Oikopleura dioica] Length = 625 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 36/286 (12%), Positives = 79/286 (27%), Gaps = 22/286 (7%) Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 + +L G + + +S + + + + + + + N Sbjct: 284 VQTEQWKAYLSGYYSDEVVSTLSKSIRALVVEWMSENMDVINCILNCGNPDCGGGNSVNS 343 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 ++ S + VL NLV+ I A +E Sbjct: 344 AGTGGGISIKALECEEQVDVVFMVDGSDSVSS----RDWPKVLK-WVTNLVDQISPADRE 398 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--------ENTNTYPAMH 282 K + V + + + + ++ + TNTY + Sbjct: 399 KSSTVVFQDFSMNPTTGAIPKEITGRFDPGD-QGSVDDFKAAVLAAKQSAQGTNTYQTLS 457 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 + + T+ + I ++DGE+ + Y + + R Sbjct: 458 ILLK--DDGIFGRLPTVQNKDGATVFIILSDGESRDRAKYYDKKTFDYFKDITRMR---- 511 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 +AV + L +D +L+E+ +I D IQ Sbjct: 512 --IAVGVADADKKELMDFAQDDEHLLHYDDFGQLMEAGQEIVDLIQ 555 >gi|73980136|ref|XP_851163.1| PREDICTED: similar to vitrin isoform 2 [Canis familiaris] Length = 645 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 26/277 (9%), Positives = 69/277 (24%), Gaps = 34/277 (12%) Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLA--ISICMV 153 +P E+ Y+ P+ + + S + +S Sbjct: 147 SSKSPTAKTGEATKAYQKPSVPGTTAQPVTLMQVTGATASEASHTSLPKPSPSAVSTTSS 206 Query: 154 LDVSRSMEDLYLQKHNDNNN--MTSNKYLLPPPPKKSFWSKNTTKSKYAPA--------- 202 L + D P + S S Sbjct: 207 LRSQPMGHRSWELGEMDLWKPGSVLLDAGFVPKEELSTQSLEPVSQGDPNCKIDLSFLID 266 Query: 203 --PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 R+ + + ++ ++ +G + Y N Sbjct: 267 GSSGIGKRRFRIQKQFLADVAQTLDIGPAGPL-----MGVVQYGDNPAAQFNLRTHMNSR 321 Query: 261 EVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++K+ + K+ +N A+ + + +++ + + + I DG + Sbjct: 322 DLKTAIEKITQRGGLSNAGRAISYVTKNFFSKANGNRGGAPN-----VAVVIVDGWPTD- 375 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + + R +G+ I+ + + E + Sbjct: 376 -------KVEEASRFARESGINIFFITIEGATENEKQ 405 >gi|57960|emb|CAA79156.1| collagen alpha 3 chain type VI [Mus musculus] Length = 959 Score = 44.1 bits (102), Expect = 0.039, Method: Composition-based stats. Identities = 20/124 (16%), Positives = 43/124 (34%), Gaps = 15/124 (12%) Query: 257 NNLNEVKSRLNKLNPYENTNTYP--AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + ++ +NK+ + + H L N + R+ + IT G Sbjct: 7 STKRQIIDAINKVVYKGGRHANTRVGIEH---LLRNHFVPEAGSRLDERVPQIAFVITGG 63 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 + + + + + G+K+++V V + + K +S F V + Sbjct: 64 K--------SVEDAQDVSLALTQKGVKVFAVGV--RNIDSEEVGKIASNSATAFRVGSVQ 113 Query: 375 ELLE 378 EL E Sbjct: 114 ELSE 117 >gi|115378470|ref|ZP_01465629.1| CglB [Stigmatella aurantiaca DW4/3-1] gi|115364532|gb|EAU63608.1| CglB [Stigmatella aurantiaca DW4/3-1] Length = 422 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 40/302 (13%), Positives = 80/302 (26%), Gaps = 68/302 (22%) Query: 149 SICMVLDVSRSMEDLYLQKHN---DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 ++ ++LD S SM + NN S + + ++++ Sbjct: 68 NLMVLLDTSGSMTLPVNTRDPNCYRANNTPSPDDDYCGQTPSTACDTSKCPTRWSELQGA 127 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +R + A + + +L T+ + L N+++S Sbjct: 128 MSRFLSESGGVARMGLTTYPGPAVGSNSLRCEASTVVNKNIPQSDADEALLGAANDIQSV 187 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + P TN L + + +T FV+ +TDG + QN Sbjct: 188 ILGI-PNAGTNAPSG--GTPTSLSLQFVGQQPDVQATDRDNFVLLLTDGLPNCNPDNQNA 244 Query: 326 LNTLQIC--------------------------------EYMRNAGMKIYSVAVSAP--- 350 + C E ++ G++ V A Sbjct: 245 GTNVDACQCTLANIGNNTGCQGDYVRRGCLDKDASVVAVEDLKRKGIRTIVVGFGAETAT 304 Query: 351 PEGQDLLRKCTDSSG---------------------------QFFAVNDSRELLESFDKI 383 G L + +F+ + EL E+ +I Sbjct: 305 GNGPATLNAMATAGDFARSCRDDPNACGAGDTCDAQTKLCGRRFYQAANQEELAEALREI 364 Query: 384 TD 385 D Sbjct: 365 ID 366 >gi|323705583|ref|ZP_08117157.1| von Willebrand factor type A [Thermoanaerobacterium xylanolyticum LX-11] gi|323535060|gb|EGB24837.1| von Willebrand factor type A [Thermoanaerobacterium xylanolyticum LX-11] Length = 229 Score = 44.1 bits (102), Expect = 0.040, Method: Composition-based stats. Identities = 14/79 (17%), Positives = 30/79 (37%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 L ++ +++ + K + Y + P ++ +N++++ L L Sbjct: 143 LKKAVNGVMDLLSTMKVRKGKSQFCLIGFPYGNDVYAKVICPFTSKINDIEAHLKNLKAG 202 Query: 273 ENTNTYPAMHHAYRELYNE 291 NT TY A+ A E Sbjct: 203 GNTPTYHAIKLATSLFDTE 221 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 15/88 (17%), Positives = 36/88 (40%), Gaps = 7/88 (7%) Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSS 364 K +I +TDG+++ N G+++ S+ + + ++ S Sbjct: 6 KQIIVVTDGKSNVGG------NPADAAFLAYRKGIRVSSIGIVDDGKLSIKEIKDIASSG 59 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQSV 392 G + + S + + S +T K E+++ Sbjct: 60 GGVYDIIYSDDFIRSLSMVTQKSAEKTL 87 >gi|313215532|emb|CBY16229.1| unnamed protein product [Oikopleura dioica] Length = 159 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 13/75 (17%), Positives = 27/75 (36%), Gaps = 3/75 (4%) Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 T ++ + ++ ++ V + GQD L S F+ +S +L Sbjct: 1 TDEVNEMADQLKEKVDRVIVVGLGY-AFGQDELAGIASSPTKENFYTAEESTDLAGLVKT 59 Query: 383 ITDKIQEQSVRIAPN 397 I D+I + + Sbjct: 60 IADEICTTELSNPSD 74 >gi|117928940|ref|YP_873491.1| hypothetical protein Acel_1733 [Acidothermus cellulolyticus 11B] gi|117649403|gb|ABK53505.1| hypothetical protein Acel_1733 [Acidothermus cellulolyticus 11B] Length = 177 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 12/140 (8%), Positives = 32/140 (22%), Gaps = 5/140 (3%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + V F T +DL+ + + +Q+ D A L+ + + Sbjct: 36 LIVFFGLVILGFTTVIVDLSTVFLAQRVLQATADGAALTAAQHVSL-----AGAYTTELA 90 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + Y+ E ++ + Sbjct: 91 EWLPLSDAEVYAAVADYVGEPGRAPQSCRSGTLSITAATLDATDRTVSVSLSCTVSLPIV 150 Query: 121 KGLIPSALTNLSLRSTGIIE 140 + S + + Sbjct: 151 NVITGSWWRGVPIARHADAR 170 >gi|327270790|ref|XP_003220171.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 866 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 51/129 (39%), Gaps = 26/129 (20%) Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N ++ A++ ++ ++ +T+GE S S + + + + Sbjct: 392 NICEGVNAAFQVFS--------QKLTSTEGCEIVLLTNGEGSDLSPCLSKIQSQE----- 438 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN---DSRELLESFDKITD---KIQ 388 + I+++A + + L K D + G+ F DS L+++F I+ Sbjct: 439 ----IIIHTIAFGSKASNE--LEKLADMTGGKTFYATDSLDSNGLIDAFGGISSGSGDAS 492 Query: 389 EQSVRIAPN 397 +QS+++ Sbjct: 493 QQSIQLESK 501 >gi|297537911|ref|YP_003673680.1| type 1 secretion target domain-containing protein [Methylotenera sp. 301] gi|297257258|gb|ADI29103.1| type 1 secretion target domain-containing protein [Methylotenera sp. 301] Length = 1187 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 43/358 (12%), Positives = 90/358 (25%), Gaps = 46/358 (12%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 ++ V ++ +++ + S I N + IT + Sbjct: 517 IAAAIHNVEVASVTVNVSEEGLAGGLQDTAGNPDTTNASSIDGNLQISNAAGNLTITLSQ 576 Query: 98 NNPLQYIAES------------KAQYEIPTENLFLKGLIPSALTNLSLRS-TGIIERSSE 144 N + A NL + Sbjct: 577 PNEVLTSGGVAVTWLGSGTQQLTAMAGTHAVATASVDNSGHYAFNLFAPVDHSGVNIEDV 636 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP-PKKSFWSKNTTKSKYAPAP 203 V D S D+ + N+ + +T+ S + Sbjct: 637 KSINFGVTVADGVTSATSTLSVNIEDDAPIALNRSDSFAMIDTNLLITLDTSNSMNDVSG 696 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 A ++ ++S L++ + VR+ T + N G + L + K Sbjct: 697 INAETRLQSAVKSIERLMDIYD----GFGEIRVRLVTFSNNAETQGTEWVTL----DTAK 748 Query: 264 SRLNKLNPYENT-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + L+ + T N A+ +A + + + F +DG + Sbjct: 749 TILDGILTIGGTTNYDGAIANAMTAFADPG-------KISGAQNISYFFSDGNPNRGDGS 801 Query: 323 QNTL----------NTLQICEYM------RNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 TL N +Q+ E + +K Y++ + L Sbjct: 802 NTTLSNVGSSVGPDNGIQVAEEAIWKGFLNDNQIKSYAIGMGTGLTDVSYLNPIAYDG 859 >gi|296223008|ref|XP_002757441.1| PREDICTED: von Willebrand factor A domain-containing protein 3B [Callithrix jacchus] Length = 1289 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 17/115 (14%), Positives = 42/115 (36%), Gaps = 26/115 (22%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 NL + +S + + +TNT A+ A+ + + + + +TDG Sbjct: 567 NLEQAQSWIRDMKIGSSTNTLSALKTAF---------------ADKETQAIYLLTDGRPD 611 Query: 318 GASAYQNTLNTLQICEYMRN-AGMKIYSVAVSAPPE-GQDLLRKCTD-SSGQFFA 369 + + ++ + IY+++ + E L++ + G+F Sbjct: 612 QP--------PEMVIDQVKVFQEIPIYTISFNYNDEIANRFLKEIAALTGGEFHF 658 >gi|261378471|ref|ZP_05983044.1| pilus-associated protein [Neisseria cinerea ATCC 14685] gi|269145264|gb|EEZ71682.1| pilus-associated protein [Neisseria cinerea ATCC 14685] Length = 1448 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 18/121 (14%), Positives = 43/121 (35%), Gaps = 7/121 (5%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T ++ ++ +K + +DG ++ ++ + ++ Sbjct: 644 PGAGQCTAGSVDTVLD-----GDNGMGMFSRALAEKDIKSASDGRDAAGVSWDSDASSDP 698 Query: 331 ICEYMRNAGMKIYSVAVSAPPE--GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 ++ Y+V +A ++ LR ++ V D + L + D+I KIQ Sbjct: 699 AGVDFGKQTVQTYTVNFNADEGTVRKEYLRNGASRDNMYYPVTDYKNLTKILDEILGKIQ 758 Query: 389 E 389 Sbjct: 759 S 759 >gi|164425557|ref|XP_960517.2| hypothetical protein NCU05555 [Neurospora crassa OR74A] gi|157070974|gb|EAA31281.2| predicted protein [Neurospora crassa OR74A] Length = 766 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 27/168 (16%), Positives = 58/168 (34%), Gaps = 17/168 (10%) Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG-NQCTPLSN-NLNEVKSRLNKLNPYE 273 S +LV + I N S R+G + ++ P++ N + + L + P+ Sbjct: 106 SVLDLVKHAARTIVSTLNSSDRLGIVTFSTEAKVLQPLMPMTALNKKKTERNLGGMQPFS 165 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE 333 TN + + + + ++ +TDG + Q + L+ E Sbjct: 166 ATNLWGGIVEGLKLFDGQSGRMPA----------LMVLTDGMPNHMCPAQGYVAKLRAME 215 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 + A I++ LL+ + G + + D+ + F Sbjct: 216 TLPAA---IHTFGFGYSLRS-GLLKSVAEIGGGGYSFIPDAGMIGTVF 259 >gi|160693|gb|AAA29768.1| sporozoite surface protein [Plasmodium yoelii] Length = 826 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 46/132 (34%), Gaps = 16/132 (12%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 N +P NTN A+ L NE+ + + I +TDG + + Sbjct: 118 NNYSPNGNTNLTSALL-VVDTLINERMYRPDA------IQLAIILTDGIPN------DLP 164 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQ--DLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 + + ++ + + + V A + +L C + +++ E Sbjct: 165 RSTAVVHQLKRKHVNVAIIGVGAGVNNEYNRILVGCDRYAPCPYYSSGSWNEAQNMIKPF 224 Query: 384 TDKIQEQSVRIA 395 K+ ++ RIA Sbjct: 225 LTKVCQEVERIA 236 >gi|83315954|ref|XP_731015.1| sporozoite surface protein 2 precursor [Plasmodium yoelii yoelii str. 17XNL] gi|45645179|sp|Q01443|SSP2_PLAYO RecName: Full=Sporozoite surface protein 2; Flags: Precursor gi|23490922|gb|EAA22580.1| sporozoite surface protein 2 precursor [Plasmodium yoelii yoelii] Length = 827 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 46/132 (34%), Gaps = 16/132 (12%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 N +P NTN A+ L NE+ + + I +TDG + + Sbjct: 118 NNYSPNGNTNLTSALL-VVDTLINERMYRPDA------IQLAIILTDGIPN------DLP 164 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQ--DLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 + + ++ + + + V A + +L C + +++ E Sbjct: 165 RSTAVVHQLKRKHVNVAIIGVGAGVNNEYNRILVGCDRYAPCPYYSSGSWNEAQNMIKPF 224 Query: 384 TDKIQEQSVRIA 395 K+ ++ RIA Sbjct: 225 LTKVCQEVERIA 236 >gi|297667864|ref|XP_002812183.1| PREDICTED: vitrin-like isoform 4 [Pongo abelii] Length = 656 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 39/120 (32%), Gaps = 14/120 (11%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 310 MGVVQYGDSPATHFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 367 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 368 ---NRSSAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 416 >gi|239613390|gb|EEQ90377.1| U-box domain-containing protein [Ajellomyces dermatitidis ER-3] Length = 766 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 30/212 (14%), Positives = 63/212 (29%), Gaps = 22/212 (10%) Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANR---KIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 P + + S + AP P K + S +L + I E N + R+G Sbjct: 71 HVPCDIVLCIDISYSMSSSAPLPTTDDSGKPEDTGLSVLDLTKHAARTIIETLNDNDRLG 130 Query: 240 TIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 +A++ N N + L P +TN + + + L Sbjct: 131 VVAFSTDAEVVYKISNMNEDNKKAALKAVEALWPLSSTNLWHGLKLSLEALEEVTP---- 186 Query: 298 TIGSTRLKKFVIFITDGENSGASA------YQNTLNTLQICEYM--RNAGMKIYSVAVSA 349 + + + +TDG + + ++ I++ Sbjct: 187 ---IPQNVQALYILTDGMYRIVRSRVPHANASKFRHAKSYVSKAGQKDRLPMIHTFGFGY 243 Query: 350 PPEGQDLLRKCTD-SSGQFFAVNDSRELLESF 380 LL+ ++ G + + D+ + F Sbjct: 244 YIRS-GLLQAISEVGGGTYSFIPDAGMIGTVF 274 >gi|255598079|ref|XP_002536925.1| conserved hypothetical protein [Ricinus communis] gi|223518102|gb|EEF25458.1| conserved hypothetical protein [Ricinus communis] Length = 451 Score = 44.1 bits (102), Expect = 0.041, Method: Composition-based stats. Identities = 38/433 (8%), Positives = 111/433 (25%), Gaps = 48/433 (11%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M + ++ + A +R+++ +A DAA L+ +I + T D Sbjct: 16 MVIVSLTTLLAVVGLAFSAGLSYLVRSKLNAATDAAGLAAARAISNGTTQADQIANAKAA 75 Query: 61 STIF-----------KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + E ++ A + +A + Sbjct: 76 GQRFFHANFPSNYLMSNATLNDISVTFSGSEVTIGVSASASLPAALFGGFGTSALAPAVV 135 Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 + + + S + S + + N + + + + Sbjct: 136 TETKRKDLDMIVVMDTSGSLSPSAANVRSSAITFLNQFNATRDRVGLVHFAFGAIVDDAI 195 Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 ++ + K +S +T ++ + + N+ + + Sbjct: 196 RQTARGFDRASMTNHIKAYAFSGSTASAEGMYTARQQINSVPTANLNRSNMRVIVFFSDG 255 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE---VKSRLNKLNPYENTNTYPAM----- 281 + + + + + + T T + Sbjct: 256 APNSFGAYLNWKPGVACADPGTIYTDDDGAGTPAGLYKLDQQYDDLGGTCTPADLPSKAA 315 Query: 282 -----HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 ++A+ + + V+ + A +N + + R Sbjct: 316 SLPDWYNAHNPAKTPNDPALREFPVVTTSPRVVTNAITYANVNRAARNLV--EAMAAKSR 373 Query: 337 NAGMKIYSVAVSA---------PPEGQDLLRKCTD---------SS----GQFFAVNDSR 374 + G+ ++++ + + +G+D L+ + + G + Sbjct: 374 DEGIYVFTLGLGSSLKTGTGVDGEKGEDTLKCMANAVDAPARCYNPAKPVGVYCFAATQN 433 Query: 375 ELLESFDKITDKI 387 +L F K+ I Sbjct: 434 DLTPCFSKLASAI 446 >gi|229593697|ref|XP_001027162.3| Sec23/Sec24 trunk domain containing protein [Tetrahymena thermophila] gi|225567370|gb|EAS06920.3| Sec23/Sec24 trunk domain containing protein [Tetrahymena thermophila SB210] Length = 558 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 39/261 (14%), Positives = 81/261 (31%), Gaps = 44/261 (16%) Query: 141 RSSENLAISICMVLDVSRSMEDL----YLQKHNDNNNMTSNKYLLPPPPKKSFWSKN--- 193 ++N + VLD S SM + + S + + +N Sbjct: 149 NFTKNSGEQLIYVLDNSGSMSSVESYVKNGGVVQLSRKQSVIQSIQEMSNQGNSLQNKEI 208 Query: 194 --TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 S K + S+ N + + ++Q + V + Sbjct: 209 QLIVFSDEVQIYTNIQDKPHNVSSSSLNNWDQLIASVQGIQEQMVGVPAEQV-------- 260 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 N+L + + NK+ +T PA+ A + + +I Sbjct: 261 -----NDLLTIYRKSNKM---GSTALGPAILSAVEIAKSNR------------GSQIILC 300 Query: 312 TDGENSGA-----SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSG 365 TDG + Q+T + +Y + G+++ + + G ++ K + ++G Sbjct: 301 TDGMANQGLGSISQQQQSTQFYQDVAQYAKQKGVQVSIIGIGDDKMGLQVIGKLAEKTNG 360 Query: 366 QFFAVNDSRELLESFDKITDK 386 F + D L + D I Sbjct: 361 SVFKI-DLSNLNDKLDNILQD 380 >gi|269125512|ref|YP_003298882.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268310470|gb|ACY96844.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 814 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 17/123 (13%), Positives = 43/123 (34%), Gaps = 12/123 (9%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 P +T Y + AYR + + ++ TDG+ + TL Sbjct: 701 KPTGDTGLYDTILAAYRYMSKTYKPEFG--------NSILLFTDGK-NEDDDGPTLRQTL 751 Query: 330 QICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF-DKITDK 386 + E M + +++ + + +L + + G + + E+ + F ++ + Sbjct: 752 RELESMIDPTRPIQVIMLGFGPGVDVNELKQIAKVTRGDVYVTQNPNEIQKIFLQALSKR 811 Query: 387 IQE 389 + Sbjct: 812 MAN 814 >gi|182439948|ref|YP_001827667.1| hypothetical protein SGR_6155 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178468464|dbj|BAG22984.1| conserved hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 596 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 39/398 (9%), Positives = 94/398 (23%), Gaps = 33/398 (8%) Query: 9 CFLFITYAIDLAHIMYIR-NQMQSALDAAVLS------GCASIVSDRTIKDPTTKKDQTS 61 A ++ A L+ + + + + Sbjct: 201 LLALAGIGASSARQGGDSDTRVAQT--AKALAERMSDGDAQVLGTLARSTSGAEEGNPKR 258 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 ++ + G++ + + P + E+ L Sbjct: 259 NQAVLVSEQAAFAHNAEATGGGELDLFYPRDGAPLLDYPYTLVNEADLSVAESRAALRFM 318 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDV--SRSMEDLYLQKHNDNNNMTSNKY 179 L+ G + T + Sbjct: 319 TLLGDRDARAVFAEHGFRAGDGSAEDSLVAAAGGRKPQPYATPAAEAPSAKELRETLGMW 378 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ------EKKN 233 + + + + S P P ++DV ES + + Sbjct: 379 TITVQSARLTTVVDASGSMATPVPGRGQSRMDVTKESLIQALEQFTPNDEIGLWEFATTL 438 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNE 291 R +G+ + E+ + L P T Y +Y+E + Sbjct: 439 DGDRDYRRLMPTKRLGDPAEGGGTHREELTAAFAGLRPVPGGATGLYDTTLASYKEARST 498 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN--AGMKIYSVAVSA 349 ++ +TDG N + + + + +R+ + + ++AV Sbjct: 499 FVKGKF--------NALVILTDGSNQDDRSI-SRSGLVAELKALRDPERPVPVIAIAVGP 549 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + ++ + G + V+D E+ I I Sbjct: 550 DADRDEVAEIARVTGGDGYEVSDPAEIRAV---ILQAI 584 >gi|319425465|gb|ADV53539.1| type IV pili-associated adhesin, PilY [Shewanella putrefaciens 200] Length = 1165 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 27/338 (7%), Positives = 78/338 (23%), Gaps = 35/338 (10%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + G + + + T ++ + + + Sbjct: 211 ADRLSSLSNTQFGSGQPVTLYSAHYLVWHKWATTTEEGKSSGGVGTRLDVAKSALISALE 270 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 IP E ++ I + S + +L N + Sbjct: 271 SLAIPIDAGLAIFNLNYPNEGDADGGRIVYDLTEMNSINKVNLTSLIKNMPAKTNTPLCE 330 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + F T N KID L+ T Sbjct: 331 TLYEAYQYFSGGQVTFGNKDKN-GTGNNKIDGY------------TPNNPPSILTSGSYT 377 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + +++ + + L A + A ++ N Sbjct: 378 TPFKKCPDTAYIIYITDGAPTLDKSADTLING---LVANAKNQAANYAAFSFTNAGNKTE 434 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRK 359 ++ + ++ + + + ++ ++++++ S + LL + Sbjct: 435 TSYMPALAAYMYNNDIVIGNKDSLGIDNK--------QNIRVFTIGFSDGADEAAALLEE 486 Query: 360 CTDSSGQ----------FFAVNDSRELLESFDKITDKI 387 G ++ + +L+ + + I Sbjct: 487 TAFRGGNPRGSNNISKGYYVAKNGLDLVSALEDALKSI 524 >gi|126173305|ref|YP_001049454.1| type IV pilin biogenesis protein [Shewanella baltica OS155] gi|125996510|gb|ABN60585.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS155] Length = 1168 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 41/396 (10%), Positives = 96/396 (24%), Gaps = 74/396 (18%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAE---SKAQYEIPTENLFLK 121 K+ K L S + ++ + + E + + Sbjct: 101 KRYFKASLNGCSQSKAALAELGRFTGYIREFTASGTTGIWKELPETNGTAISVVDCWQDI 160 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 ++ ++ + S S + NN Sbjct: 161 SKTDPNNSSSYSNGFAANGLTTGSGKKKKAYPYFTSSSPGTSWADALAAANNTDFGVGQP 220 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 ++ SK P ++++ ++ N+++S + Sbjct: 221 VTLYTDNYLRWY-GLSKAGKLPTVKVSRLEIAKKAISNIISSTPTVDFGLAVFNYNYPNE 279 Query: 242 AYNIGIV-GNQCTPLSNNL-NEVKSRLNKLNPYENTNTYPAMHHAYREL--------YNE 291 G + T ++++ + + ++ L NT M+ AYR + + Sbjct: 280 GNRDGGRIVSGITQMTDSTRASLLTTIDNLLAKTNTPLCETMYEAYRYFAGKGVKYGHGD 339 Query: 292 KESSHNTIGSTRLKK-------------------FVIFITDGENSGASAYQNTLNTLQIC 332 + + +VI++TDG + N + +L Sbjct: 340 TDYGSYVGNNPPYDSLVEKGGSYESPFKVCTDIAYVIYVTDGTPTVDKNANNDVISLTAS 399 Query: 333 ---------------------------------------EYMRNAGMKIYSVAVSAPPEG 353 + ++ Y++ S E Sbjct: 400 GSKEGNYSSFSKNLDTASYLPALASYMFNNDLINKLDSSNTEQVQNVRTYTIGFSKGAED 459 Query: 354 QDLL--RKCTDSSGQFFAVNDSRELLESFDKITDKI 387 L G +FA +S EL + + I Sbjct: 460 AAPLLAETAKRGGGLYFAAQNSLELQNALNDALSNI 495 >gi|291414463|ref|XP_002723479.1| PREDICTED: collagen, type VI, alpha 1 [Oryctolagus cuniculus] Length = 868 Score = 44.1 bits (102), Expect = 0.042, Method: Composition-based stats. Identities = 35/328 (10%), Positives = 85/328 (25%), Gaps = 23/328 (7%) Query: 25 IRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 NQMQ +D L + + + + F + ++ + + Sbjct: 507 SHNQMQEHVD---LRSPSIRNTQDLKEAIKKLQWMAGGTFTGEALQYTRDRLLPPTP--N 561 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 I + +I ++ +K + + L Sbjct: 562 TRIALVITDGRSDTQRDTTPLSVLCGPDIQVVSVGIKDVFGFTAGSDQLNVISCQG--LA 619 Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 ++ + + N + +K +F S + + Sbjct: 620 PQGRPGISLVKENYAELLDDSFLKNITAQICIDKKCPDYTCPITFSSPTDITILLDGSAS 679 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNLNE 261 + D A L A + VR+ + Y+ N Sbjct: 680 VGSHNFDTTKRFAKRLAERFLSAGRTDPAHDVRVAVVQYSGRGQQQPERAALQFLQNYTV 739 Query: 262 VKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + ++ + + T+ A+ + R ++ V+ +DG + G Sbjct: 740 LAGTVDTMGFFNDATDVNDALSYVTRFYREASPATAKKK--------VLLFSDGNSQG-- 789 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + + + AG++I+ V V Sbjct: 790 --ATAEAIERAVQEAQRAGIEIFVVVVG 815 >gi|312068806|ref|XP_003137386.1| CUTiclin-Like family member [Loa loa] gi|307767445|gb|EFO26679.1| CUTiclin-Like family member [Loa loa] Length = 691 Score = 44.1 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 25/114 (21%), Positives = 47/114 (41%), Gaps = 13/114 (11%) Query: 258 NLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N V LN L T+T+ A+H AY+ L + + +KK +I TDG + Sbjct: 26 NNTAVIGHLNALKSIKGTTSTHIALHQAYKLLMDTDSGNGAR---EGVKKMIIIFTDGHS 82 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAV-SAPPEGQDLLRKCTDSSGQFFA 369 + + ++N ++I+++ + AP + L T ++ F Sbjct: 83 QQS--------PQDMALRLKNESVEIFAITLTPAPYADEGELLSITQNTDHIFT 128 >gi|268529744|ref|XP_002629998.1| Hypothetical protein CBG13361 [Caenorhabditis briggsae] gi|187028722|emb|CAP32249.1| hypothetical protein CBG_13361 [Caenorhabditis briggsae AF16] Length = 359 Score = 44.1 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 25/151 (16%), Positives = 51/151 (33%), Gaps = 12/151 (7%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 S R+ I YN P +L+++K+ L L + + L + + Sbjct: 72 SSRVAIITYNNDATTVAQFPTFKSLDQLKTELTVLKNDGTSGNESYLDTG---LSSAQAI 128 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + T KK V+ ++ I +++ G+ + +V + Sbjct: 129 LNKTEDRMNYKKVVLVF------ASTYDLLYDRPDYIAHALKSNGVTVITVYTGNDKTVK 182 Query: 355 DLLRKCTDSSGQFF--AVNDSRELLESFDKI 383 L+ G F +VN + +L + I Sbjct: 183 GQLKSVAS-DGFAFQMSVNTTTDLQNALTMI 212 >gi|297667860|ref|XP_002812181.1| PREDICTED: vitrin-like isoform 2 [Pongo abelii] Length = 678 Score = 44.1 bits (102), Expect = 0.043, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 39/120 (32%), Gaps = 14/120 (11%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 332 MGVVQYGDSPATHFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 389 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 390 ---NRSSAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 438 >gi|291394751|ref|XP_002713732.1| PREDICTED: collagen, type XXVIII [Oryctolagus cuniculus] Length = 1132 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 56/176 (31%), Gaps = 18/176 (10%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKI---DVLIESAGNLVNSIQKAIQEKK-NLSVRIGTI 241 K FW+ + + KI D + L + I + +R+ + Sbjct: 37 KNDFWASICFIDIVFIVDSSESSKIVHFDKQKDFVERLSDKIFQVTPGHSLKYDIRLAAL 96 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIG 300 ++ + + +L K R LN T +Y A+ + R E + Sbjct: 97 QFSSSVQIDPPFSSWKDLKTFKQRAKSLNLIGQGTFSYYAISNVTRLFKREGRKNGV--- 153 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 K + +TDG + + + I E R +G+ ++ S L Sbjct: 154 -----KVALLMTDGID-----HPKNPDVKSISEDARTSGISFITIGHSTDVNEAKL 199 >gi|283850951|ref|ZP_06368236.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] gi|283573597|gb|EFC21572.1| von Willebrand factor type A [Desulfovibrio sp. FW1012B] Length = 330 Score = 44.1 bits (102), Expect = 0.044, Method: Composition-based stats. Identities = 18/149 (12%), Positives = 37/149 (24%), Gaps = 40/149 (26%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G P S + + L +L + T + A + L + Sbjct: 137 GSRAYAVMPPSADRAALTGALARLAVGAAGKRTAMGDGLGLAVKRLSDA----------P 186 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP------------ 350 L + + DG ++ ++ + G+ +YSV V Sbjct: 187 GLSRLAVVFGDGRSNAG-----EVSPEDAAKAASERGVTVYSVGVGGDEPAPFLVTHPLL 241 Query: 351 ---------PEGQDLLRKCT-DSSGQFFA 369 L + G ++ Sbjct: 242 GSQIVTEKAAVDATTLAAMAKATGGAYYR 270 >gi|120599894|ref|YP_964468.1| type IV pilin biogenesis protein [Shewanella sp. W3-18-1] gi|120559987|gb|ABM25914.1| type IV pilin biogenesis protein, putative [Shewanella sp. W3-18-1] Length = 1165 Score = 43.7 bits (101), Expect = 0.044, Method: Composition-based stats. Identities = 27/338 (7%), Positives = 78/338 (23%), Gaps = 35/338 (10%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + G + + + T ++ + + + Sbjct: 211 ADRLSSLSNTQFGSGQPVTLYSAHYLVWHKWATTTEEGKSSGGVGTRLDVAKSALISALE 270 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 IP E ++ I + S + +L N + Sbjct: 271 SLAIPIDAGLAIFNLNYPNEGDADGGRIVYDLTEMNSINKVNLTSLIKNMPAKTNTPLCE 330 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + F T N KID L+ T Sbjct: 331 TLYEAYQYFSGGQVTFGNKDKN-GTGNNKIDGY------------TPNNPPSILTSGSYT 377 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + +++ + + L A + A ++ N Sbjct: 378 TPFKKCPDTAYIIYITDGAPTLDKSADTLING---LVANAKNQAANYAAFSFTNAGNKTE 434 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRK 359 ++ + ++ + + + ++ ++++++ S + LL + Sbjct: 435 TSYMPALAAYMYNNDIVIGNKDSLGVDNK--------QNIRVFTIGFSDGADEAAALLEE 486 Query: 360 CTDSSGQ----------FFAVNDSRELLESFDKITDKI 387 G ++ + +L+ + + I Sbjct: 487 TAFRGGNPRGSNNISKGYYVAKNGLDLVSALEDALKSI 524 >gi|170726477|ref|YP_001760503.1| cell wall anchor domain-containing protein [Shewanella woodyi ATCC 51908] gi|169811824|gb|ACA86408.1| LPXTG-motif cell wall anchor domain protein [Shewanella woodyi ATCC 51908] Length = 739 Score = 43.7 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 19/189 (10%), Positives = 57/189 (30%), Gaps = 18/189 (9%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 I + + ++ I + + + N+ + L Sbjct: 380 IAQAKRALNYALAGLKAKDT-----FNVIEFNSNVGSLSPYSLPATAKNIGLANQYVRSL 434 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T A++ A + + + + V+F+TDG + + + Sbjct: 435 KANGGTEMQLALNAALDK------GTETEALGSERLRQVLFMTDGSVGDEQSLFHLIK-- 486 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + + + +++++ + + P + R G F + E+ + + +I+ Sbjct: 487 ---QKIGES--RLFTLGIGSAPNSHFMRRAAEFGRGTFTYIGKLDEVQSKIESLLYQIER 541 Query: 390 QSVRIAPNR 398 + R Sbjct: 542 PQLTDIKLR 550 >gi|283769330|ref|ZP_06342229.1| hypothetical protein HMPREF9013_0305 [Bulleidia extructa W1219] gi|283103987|gb|EFC05371.1| hypothetical protein HMPREF9013_0305 [Bulleidia extructa W1219] Length = 209 Score = 43.7 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 7/33 (21%), Positives = 18/33 (54%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSAL 33 + A+ ++V I+ A+D+ + +N+M + Sbjct: 19 IFALFLTVLLGMISLAVDVGMMYLKKNRMYEIV 51 >gi|115376470|ref|ZP_01463705.1| phage/colicin/tellurite resistance cluster TerY protein, putative [Stigmatella aurantiaca DW4/3-1] gi|310819725|ref|YP_003952083.1| hypothetical protein STAUR_2452 [Stigmatella aurantiaca DW4/3-1] gi|115366538|gb|EAU65538.1| phage/colicin/tellurite resistance cluster TerY protein, putative [Stigmatella aurantiaca DW4/3-1] gi|309392797|gb|ADO70256.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 218 Score = 43.7 bits (101), Expect = 0.045, Method: Composition-based stats. Identities = 30/201 (14%), Positives = 68/201 (33%), Gaps = 16/201 (7%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 A + +D I++ V + +A Q++ +L + G PL Sbjct: 18 VVLADVSGSMGVDGKIQALNLAVREMIEAFQDESDLRAEVHVSVITFGGQSRVHLPLGRA 77 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + + L T A A + + N + S + ++ ++DG+ + Sbjct: 78 RDAAWT---DLGANGGTPMGAAFDLARTMVED-----RNAVPSRAYRPTIVLVSDGQPTD 129 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 G K + ++A+ A + L D + + +++R++ Sbjct: 130 EWKQPLESLLK------NERGGKAFRMALAIGADADHAVLQAFLADPEARVYRADEARQI 183 Query: 377 LESFDKITDKIQEQSVRIAPN 397 + F +T + +S PN Sbjct: 184 RKFFQLVTMSVSARSRSANPN 204 >gi|300776752|ref|ZP_07086610.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] gi|300502262|gb|EFK33402.1| conserved hypothetical protein [Chryseobacterium gleum ATCC 35910] Length = 335 Score = 43.7 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 22/171 (12%), Positives = 51/171 (29%), Gaps = 52/171 (30%) Query: 252 CTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 PL+ + N ++ ++ + T+ M A + N + S Sbjct: 143 IMPLTTDYNSAETYISGIETSSMQIQGTDFLKGMQAAVEKFKNVSKGSRK---------- 192 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG-------------- 353 V+ ++DGE++ + G+ I SV + Sbjct: 193 VVLLSDGEDNEGNDN-------AAIRLANKEGVSITSVGIGTDEGAPVPEYVFGQLMGYK 245 Query: 354 --------------QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + L + + G + D + E+ D+I + + ++ Sbjct: 246 TDVNGGTVISKRQTEALKKMAESTDGTYI---DGNNINEAPDRIAEAVNKK 293 >gi|162451664|ref|YP_001614031.1| putative secreted protein [Sorangium cellulosum 'So ce 56'] gi|161162246|emb|CAN93551.1| putative secreted protein [Sorangium cellulosum 'So ce 56'] Length = 392 Score = 43.7 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 24/198 (12%), Positives = 52/198 (26%), Gaps = 14/198 (7%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE- 261 K D ++ + + + I PL+ + Sbjct: 104 WHDKALKWDPVVAATKQFFSDPGSEGLTASLSFFPADDDRCSSEIYATPDVPLTPLPSAA 163 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF-VIFITDGENSGAS 320 ++ + P + + A+ + K+ ++ +TDG G Sbjct: 164 FTEAIDAIEPASSDDWRGGTPTAW--VMRGTSGFIEAQRRQNPGKYAIVLVTDGYPQGCD 221 Query: 321 AYQNTLNTLQICEYMRN-AGMKIYSVAV------SAPPEGQDLLRKCTDS---SGQFFAV 370 +T++ + G+ Y + V AP DL Sbjct: 222 EASDTIDAVVADAQAALAEGVPTYVIGVENPPIDGAPDTLDDLHEIAAAGGTEGAVLIDT 281 Query: 371 NDSRELLESFDKITDKIQ 388 D + +F D+I+ Sbjct: 282 GDPSQTTAAFRAAVDRIR 299 >gi|148680074|gb|EDL12021.1| mCG3350, isoform CRA_a [Mus musculus] Length = 513 Score = 43.7 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 12/102 (11%), Positives = 38/102 (37%), Gaps = 18/102 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S++ ++ + L + T+ + ++ + + + ++ +T Sbjct: 364 ITSSSDYQKITANLPQ-QASGGTSICHGLQAGFQAITSS--------DQSTSGSEIVLLT 414 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 DGE++G + E + +G I+++A+ + Sbjct: 415 DGEDNGIRSCF---------EAVSRSGAIIHTIALGPSAARE 447 >gi|148680075|gb|EDL12022.1| mCG3350, isoform CRA_b [Mus musculus] Length = 527 Score = 43.7 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 12/102 (11%), Positives = 38/102 (37%), Gaps = 18/102 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S++ ++ + L + T+ + ++ + + + ++ +T Sbjct: 378 ITSSSDYQKITANLPQ-QASGGTSICHGLQAGFQAITSS--------DQSTSGSEIVLLT 428 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 DGE++G + E + +G I+++A+ + Sbjct: 429 DGEDNGIRSCF---------EAVSRSGAIIHTIALGPSAARE 461 >gi|16358975|gb|AAH10260.1| Clca1 protein [Mus musculus] Length = 513 Score = 43.7 bits (101), Expect = 0.046, Method: Composition-based stats. Identities = 12/102 (11%), Positives = 38/102 (37%), Gaps = 18/102 (17%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 S++ ++ + L + T+ + ++ + + + ++ +T Sbjct: 364 ITSSSDYQKITANLPQ-QASGGTSICHGLQAGFQAITSS--------DQSTSGSEIVLLT 414 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 DGE++G + E + +G I+++A+ + Sbjct: 415 DGEDNGIRSCF---------EAVSRSGAIIHTIALGPSAARE 447 >gi|156409371|ref|XP_001642143.1| predicted protein [Nematostella vectensis] gi|156229284|gb|EDO50080.1| predicted protein [Nematostella vectensis] Length = 332 Score = 43.7 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 19/138 (13%), Positives = 49/138 (35%), Gaps = 19/138 (13%) Query: 214 IESAGNLVNSIQKAIQEKK--NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN- 270 + G ++N I+ + R+ + Y+ ++ V +N + Sbjct: 17 KGNFGRMLNLIKSTLNAFSLRQRRTRVSVVLYSNRPFKVFGFNRYSSKLRVIRAINYMRY 76 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T A+++ R L+ ++ K+ ++ +TDG + Sbjct: 77 PRGGTKLRRALYYVKRYLFTRRQRGR--------KQVLVVLTDGISRRG--------VKA 120 Query: 331 ICEYMRNAGMKIYSVAVS 348 + AG++++S+ + Sbjct: 121 PAISLHRAGVEVHSIGIG 138 >gi|149921504|ref|ZP_01909956.1| hypothetical protein PPSIR1_30866 [Plesiocystis pacifica SIR-1] gi|149817707|gb|EDM77174.1| hypothetical protein PPSIR1_30866 [Plesiocystis pacifica SIR-1] Length = 560 Score = 43.7 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 39/136 (28%), Gaps = 16/136 (11%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 +V I + N + ++N + TN Y + Y + Sbjct: 254 DTVSICEWDTSNDWTLAGYAVTGPNDELLLEKINDVVHGGGTNLYGGLESGYELAQMVYD 313 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR---NAGMKIYSVAVSAP 350 ++ I+DG + + I E + G+ + V V P Sbjct: 314 PDAINR--------LVLISDGGANAGITDLDL-----IAENAAYGGSDGIYLVGVGVDDP 360 Query: 351 PEGQDLLRKCTDSSGQ 366 + D L +G+ Sbjct: 361 DDYNDELMDAVTDAGK 376 >gi|37499120|gb|AAQ91617.1| unknown [Antonospora locustae] Length = 824 Score = 43.7 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 14/116 (12%), Positives = 32/116 (27%), Gaps = 14/116 (12%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T + A L E +I +TDG N Sbjct: 373 PRGGTCIVAGLQRAVD-LKPAAEDGSIRRN-------IILLTDG------GDSNLREITS 418 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + + G + +++ + ++ G + D+ ++ + +K Sbjct: 419 LVQREAAKGTRFFAIGIGNGVSYDTVMEVARAGRGTHDFIRDACDVGSCLSSMLEK 474 >gi|51891532|ref|YP_074223.1| hypothetical protein STH394 [Symbiobacterium thermophilum IAM 14863] gi|51855221|dbj|BAD39379.1| hypothetical protein [Symbiobacterium thermophilum IAM 14863] Length = 252 Score = 43.7 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 13/44 (29%), Positives = 25/44 (56%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASI 44 + I +V L I +A+D+ + +R Q+++A +AA L+G Sbjct: 15 LFVFIWAVAVLAIGFALDVGRVFVLREQLRTAEEAAALAGVRQA 58 >gi|328951307|ref|YP_004368642.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] gi|328451631|gb|AEB12532.1| von Willebrand factor type A [Marinithermus hydrothermalis DSM 14884] Length = 744 Score = 43.7 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 30/152 (19%), Positives = 58/152 (38%), Gaps = 17/152 (11%) Query: 237 RIGTIAYNIGIVGNQCT-PLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 R+G + ++ G P++ E K+ L++L P +T A A L Sbjct: 373 RLGIVTFSSGPRWLFPPRPMTARGKLEAKTLLDRLRPGGSTRMLEAYRQAIEALEA---- 428 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 K ++ +TDG+ ++ + + E R G++ SVA+ + Sbjct: 429 ------LELETKQILVLTDGQV-----EEDPAALVALAEAARAQGIRTNSVALGGDADRA 477 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 L R G+F+ V +L F + ++ Sbjct: 478 LLARMSRVGEGRFWDVPTPEDLPRLFLEEAER 509 >gi|330816722|ref|YP_004360427.1| Membrane protein [Burkholderia gladioli BSR3] gi|327369115|gb|AEA60471.1| Membrane protein [Burkholderia gladioli BSR3] Length = 622 Score = 43.7 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 13/108 (12%), Positives = 30/108 (27%), Gaps = 1/108 (0%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M A+ + V +D+ + R +Q D A +G + + + T Sbjct: 31 MAAVFLVVIAAIFGV-LDVGNTYLQRRDLQQIADMAAAAGVQRVDNLCVQAPTSATNSAT 89 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK 108 + + + A + T +P + + Sbjct: 90 VNGLNTSQGDTIAVTCGRWDPTVNPAPSYYLANTNTSGDPNRLQLNAV 137 >gi|313207256|ref|YP_004046433.1| von willebrand factor type a [Riemerella anatipestifer DSM 15868] gi|312446572|gb|ADQ82927.1| von Willebrand factor type A [Riemerella anatipestifer DSM 15868] gi|315023480|gb|EFT36486.1| BatB [Riemerella anatipestifer RA-YM] gi|325335297|gb|ADZ11571.1| von Willebrand factor type A [Riemerella anatipestifer RA-GD] Length = 335 Score = 43.7 bits (101), Expect = 0.047, Method: Composition-based stats. Identities = 17/106 (16%), Positives = 39/106 (36%), Gaps = 21/106 (19%) Query: 252 CTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 PL+ + +S ++ L + T+ Y AM A + + S Sbjct: 143 VMPLTTDYLAAESFVSGLETSVVSTQGTDFYKAMQVAVSKFKAVSKGS----------GR 192 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 ++ I+DGE++ + + ++ G+++ +V V Sbjct: 193 IVLISDGEDNEGNEA-------AAIKEAQSNGIQVITVGVGTEEGA 231 >gi|328954590|ref|YP_004371924.1| VWFA-related domain-containing protein [Desulfobacca acetoxidans DSM 11109] gi|328454914|gb|AEB10743.1| VWFA-related domain-containing protein [Desulfobacca acetoxidans DSM 11109] Length = 543 Score = 43.7 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 16/92 (17%), Positives = 43/92 (46%), Gaps = 5/92 (5%) Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + VI +TDG++ G + ++ + + ++ A + IY++ + + L + Sbjct: 176 APTTRAAVILLTDGKDEG-----SPVSEEAVLDRIKGAQVPIYAMGFGSKAQVDYLQKVA 230 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + S G F + + +L + + D ++ Q + Sbjct: 231 SASQGAFLSTPQAADLTNLYQTVLDYLKNQYI 262 >gi|320105608|ref|YP_004181198.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319924129|gb|ADV81204.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 373 Score = 43.7 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 54/157 (34%), Gaps = 55/157 (35%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T Y A+ A E+ +++ K +I +TDGE++G+ ++ E Sbjct: 189 TALYDAIFLASDEVIHKQPFR----------KALILLTDGEDNGSKESLSS-----AIEA 233 Query: 335 MRNAGMKIYSVAV-SAPPEG--------------------------------------QD 355 + A +YS+ + Sbjct: 234 AQRADTAVYSIYFKGEEHNDTSSRRPSFGGGGFPGGGGRHGGGGGQGGGQPQRTHVDGKK 293 Query: 356 LLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQS 391 +L++ +D + G+FF V+ L E + KI +++ Q Sbjct: 294 ILQRISDETGGRFFEVSKKEPLAEIYKKIAQELRSQY 330 >gi|297279796|ref|XP_002801793.1| PREDICTED: integrin alpha-10-like isoform 3 [Macaca mulatta] Length = 1036 Score = 43.7 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 40/219 (18%), Positives = 65/219 (29%), Gaps = 25/219 (11%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S + I E L + K + + +++G + Y V Sbjct: 28 SDGCPTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHE 85 Query: 251 QCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 EV L+ E T T A+ A E +++ + + Sbjct: 86 WSLGDFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLL 140 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKC 360 + +TDGE+ L+ CE R + Y +AV P +R Sbjct: 141 VVVTDGESHDGEELP---AALKTCEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTI 194 Query: 361 TDSSG--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 195 ASDPDERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 233 >gi|156974653|ref|YP_001445560.1| hypothetical protein VIBHAR_02371 [Vibrio harveyi ATCC BAA-1116] gi|156526247|gb|ABU71333.1| hypothetical protein VIBHAR_02371 [Vibrio harveyi ATCC BAA-1116] Length = 2127 Score = 43.7 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 33/309 (10%), Positives = 83/309 (26%), Gaps = 32/309 (10%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + I + + + ++ S + + Sbjct: 1539 NAIADDFALNGFMDTTDSTLDMDQLTLGKTVDSVALNVPTA--NLTSNGEAITWALSNND 1596 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN-----MT 175 + L+ S ++ + + + D S +N Sbjct: 1597 QLLVGSGDGEEVIKISVNDTGAISTELLGPIDHPDTSGEDSLNIEVPVVVSNALGLTNSA 1656 Query: 176 SNKYLLPPPPKKSFW-----------SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 ++ S S N ++ ++ +SA L+N Sbjct: 1657 VANVVIEDDSPYSTSIVHDVVAETKESANVQLIMDVSGSMRTGNRLQIMKDSATQLLNQY 1716 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHH 283 + Q R+ I Y+ + ++E K+ ++ L TN A++ Sbjct: 1717 ESIGQT------RVQIIKYSGSATTYAIAGATWLTVDEAKAYIDTLTAGGATNYNRAINE 1770 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A + F++DG+ + AS++ N +++ ++ +I Sbjct: 1771 AKDSWDDAG-------KLPSASNVSYFLSDGQPNPASSFINDARENSWIDHLTDSDNQIT 1823 Query: 344 SVAVSAPPE 352 ++A Sbjct: 1824 ALAYGMGVN 1832 >gi|23097530|ref|NP_690996.1| hypothetical protein OB0075 [Oceanobacillus iheyensis HTE831] gi|22775753|dbj|BAC12031.1| hypothetical conserved protein [Oceanobacillus iheyensis HTE831] Length = 246 Score = 43.7 bits (101), Expect = 0.049, Method: Composition-based stats. Identities = 13/104 (12%), Positives = 36/104 (34%), Gaps = 11/104 (10%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-----EGQ 354 + K ++ +TDG ++ + + G+ + + + +G Sbjct: 1 MKSGTLKQILLLTDGCSNRGE------DPSAVASLASQQGITVNVIGILEDDQTESPDGL 54 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + S G + S L ++ +T + Q+++ N+ Sbjct: 55 QEVEDIALSGGGVSQIVYSENLSQTVQMVTRQAMTQTLQGFVNK 98 >gi|58262250|ref|XP_568535.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21] gi|57230709|gb|AAW47018.1| expressed protein [Cryptococcus neoformans var. neoformans JEC21] Length = 430 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 18/166 (10%), Positives = 49/166 (29%), Gaps = 8/166 (4%) Query: 202 APAPANRKIDVLIESAGNLVNSIQ-----KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + I+ + + + + I+ + + V + P + Sbjct: 63 CTGSMQKYINSVRDHIIGICDMIRGEEGLNGPDDLRVAVVNYRDHPPQDSTYVYKFHPFT 122 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAM--HHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 +++ EV++ L L + A+ A E + I +G Sbjct: 123 SDIPEVQNYLKGLTASGGGDGPEAVTAAMAATLTELEWRREAARMAVLVADAPPHGIGEG 182 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + + L + M G+ ++ + V ++L + Sbjct: 183 GDQFKQGDPDGHDPLVVARMMAQNGITMF-LRVRIDDCDRELHERL 227 >gi|327270796|ref|XP_003220174.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 948 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 17/71 (23%), Positives = 34/71 (47%), Gaps = 9/71 (12%) Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN---DSRELLESFDKITD---K 386 ++N+G I+++A + + L K D + G F DS L+++F I+ Sbjct: 413 QIQNSGSIIHTIAFGSGTSNE--LEKLADMTGGLAFYATDSLDSNGLMDAFSGISSGGGD 470 Query: 387 IQEQSVRIAPN 397 I +QS+++ Sbjct: 471 ISQQSIQLESK 481 >gi|198421146|ref|XP_002121270.1| PREDICTED: similar to cubilin [Ciona intestinalis] Length = 728 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 35/352 (9%), Positives = 82/352 (23%), Gaps = 26/352 (7%) Query: 55 TKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIP 114 K T I ++ T + Sbjct: 386 MPKSTTQAFSCGFNAVATSTSQLITSPNYPNNYPNNVDCTWTITASDGMRVQLNLIRFST 445 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 + + + + + Sbjct: 446 ERSYDYLTISSDGTQLARNSGNYSNRIYTSIGNTLTLRFTSDRSVTRPGFNATFVAVPGV 505 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE--------SAGNLVNSIQK 226 P ++ + TT + +D +A + + I Sbjct: 506 EVTTEPTPVEITQAIFQDQTTAKVCQNSHTDLMFLLDSSGSVTSSDFQLAANFVKDFITG 565 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLS-------NNLNEVKSRLNKLNPYENTNTYP 279 + V + + +Q + + K+ ++ + P+ T Sbjct: 566 IDLTSFQVGVMQYSHYLLNRELDDQPYITTEINIGEYTEADPFKTAMDTIQPHGYTTYT- 624 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 HA + ++ K ++ ITDG +S +S ++ +N G Sbjct: 625 --AHAVLKAIRVDFPRSTRFNNSCTSKIIVLITDGSSSDSSMLRD------AALEAKNLG 676 Query: 340 MKIYSVAVSAPPEGQD--LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + IY++ V + L R + + + F ++ L + KI Sbjct: 677 VDIYAIGVGDANTQELVVLTRPESGTKDKIFQIDQYSSLPSILQGLRTKILS 728 >gi|302755028|ref|XP_002960938.1| hypothetical protein SELMODRAFT_402429 [Selaginella moellendorffii] gi|300171877|gb|EFJ38477.1| hypothetical protein SELMODRAFT_402429 [Selaginella moellendorffii] Length = 2174 Score = 43.7 bits (101), Expect = 0.051, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 17/128 (13%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + T + A L + +IF++DG N+G S + + Sbjct: 2060 DSSGGTVYSAGLQLAEEILGRSAGDAKAP--------AIIFLSDGGNAGGS------DPV 2105 Query: 330 QICEYMR--NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN-DSRELLESFDKITDK 386 ++ + ++++ + + +L G F V+ D +L SF+ + Sbjct: 2106 AFVRKIKSVEPRLVVHTIVFGSDLLPRKVLVDMAREGGGVFQVSLDELQLGRSFEALAKS 2165 Query: 387 IQEQSVRI 394 + + Sbjct: 2166 LHPTVASL 2173 >gi|156408866|ref|XP_001642077.1| predicted protein [Nematostella vectensis] gi|156229218|gb|EDO50014.1| predicted protein [Nematostella vectensis] Length = 251 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 27/112 (24%), Positives = 40/112 (35%), Gaps = 10/112 (8%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TNT A++ A REL+ N+ + V+ ITDG ++ Sbjct: 135 GTTNTQEALNLAQRELF----GKKNSGATPGAIGRVLIITDGLSNVQRNLTLFNAYK--- 187 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKI 383 ++ AG +IY VAV G L + F R L + I Sbjct: 188 --LKMAGPEIYVVAVGQYLYGLHELVGLASSTENHLFRAQSMRGLEGAVRLI 237 >gi|146338399|ref|YP_001203447.1| hypothetical protein BRADO1313 [Bradyrhizobium sp. ORS278] gi|146191205|emb|CAL75210.1| hypothetical protein; putative signal peptide [Bradyrhizobium sp. ORS278] Length = 526 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 17/125 (13%), Positives = 40/125 (32%), Gaps = 19/125 (15%) Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + + +L+ T+ Y A + + + ++ +TDG++ G Sbjct: 414 AGLLKQALQLHAGGGTDFYQCGARALAAMKPTLDGGAHLA-------AIVIMTDGKSYGD 466 Query: 320 SAYQNTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 E R G + ++ V + + L + G+ F + L Sbjct: 467 R--------ATFEEPWRADGGRVPVFGVTFGDEADRKQLDALAKLTGGRVFDGT--KNLT 516 Query: 378 ESFDK 382 ++F Sbjct: 517 DAFRA 521 >gi|69244819|ref|ZP_00603043.1| von Willebrand factor, type A [Enterococcus faecium DO] gi|257882064|ref|ZP_05661717.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257889959|ref|ZP_05669612.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|260560224|ref|ZP_05832401.1| von Willebrand factor [Enterococcus faecium C68] gi|314947791|ref|ZP_07851198.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecium TX0082] gi|68196173|gb|EAN10603.1| von Willebrand factor, type A [Enterococcus faecium DO] gi|257817722|gb|EEV45050.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,502] gi|257826319|gb|EEV52945.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,410] gi|260073791|gb|EEW62116.1| von Willebrand factor [Enterococcus faecium C68] gi|313645771|gb|EFS10351.1| LPXTG-motif protein cell wall anchor domain protein [Enterococcus faecium TX0082] Length = 1345 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 32/371 (8%), Positives = 90/371 (24%), Gaps = 48/371 (12%) Query: 48 RTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES 107 + + +T++ + + I ++++ + Sbjct: 190 PSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNA 249 Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 I N K L+ ++ + +S + + D+ D+ Sbjct: 250 AL---IEAVNEMSKDLLSDPSLDIRI-GMVNFYHNSTAINNHEQISSDIFPLTNDINRLT 305 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 ++N + P T + + V+ + + ++ Sbjct: 306 GSENTALNRTPIGGTPLTLGLKNGYETLYKDNGGENRNPEKILIVVGDGTPTFSYAPIQS 365 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 I + L N E NT+ H Sbjct: 366 SYRTSTNGAWSNWTVMEDKIAEDNDV-LFRNFEEF---------SGNTS-NAGFTHPVTY 414 Query: 288 LYNEKESSHNTIGSTRLKKFVI-------FITDGENSGASAYQNTLNTLQIC-------- 332 + R + ++ DG ++ + T Sbjct: 415 ASDFNRPEDEVNVHYRYGEVKEGDNKATHWVGDGSSNNNTNGSPTSQEKSSAINTVAYHH 474 Query: 333 ---EYMRNAGMKIYSVAVSAPPE----------GQDLLRKCTD--SSG---QFFAVNDSR 374 + I+S+ + G+++L+ D G +++ N+ Sbjct: 475 WLKNKYQENPPSIFSIGLGIDGNVSGRQRLDAIGRNVLKNIADLEEDGVTPRYYNANNKN 534 Query: 375 ELLESFDKITD 385 +++ + + I+ Sbjct: 535 DIVTALEDISS 545 >gi|258615515|ref|ZP_05713285.1| hypothetical protein EfaeD_07377 [Enterococcus faecium DO] gi|293563519|ref|ZP_06677967.1| Bee1, putative [Enterococcus faecium E1162] gi|294622786|ref|ZP_06701740.1| Bee1, putative [Enterococcus faecium U0317] gi|291597744|gb|EFF28882.1| Bee1, putative [Enterococcus faecium U0317] gi|291604521|gb|EFF34007.1| Bee1, putative [Enterococcus faecium E1162] Length = 1344 Score = 43.7 bits (101), Expect = 0.052, Method: Composition-based stats. Identities = 32/371 (8%), Positives = 90/371 (24%), Gaps = 48/371 (12%) Query: 48 RTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES 107 + + +T++ + + I ++++ + Sbjct: 189 PSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNA 248 Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 I N K L+ ++ + +S + + D+ D+ Sbjct: 249 AL---IEAVNEMSKDLLSDPSLDIRI-GMVNFYHNSTAINNHEQISSDIFPLTNDINRLT 304 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 ++N + P T + + V+ + + ++ Sbjct: 305 GSENTALNRTPIGGTPLTLGLKNGYETLYKDNGGENRNPEKILIVVGDGTPTFSYAPIQS 364 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 I + L N E NT+ H Sbjct: 365 SYRTSTNGAWSNWTVMEDKIAEDNDV-LFRNFEEF---------SGNTS-NAGFTHPVTY 413 Query: 288 LYNEKESSHNTIGSTRLKKFVI-------FITDGENSGASAYQNTLNTLQIC-------- 332 + R + ++ DG ++ + T Sbjct: 414 ASDFNRPEDEVNVHYRYGEVKEGDNKATHWVGDGSSNNNTNGSPTSQEKSSAINTVAYHH 473 Query: 333 ---EYMRNAGMKIYSVAVSAPPE----------GQDLLRKCTD--SSG---QFFAVNDSR 374 + I+S+ + G+++L+ D G +++ N+ Sbjct: 474 WLKNKYQENPPSIFSIGLGIDGNVSGRQRLDAIGRNVLKNIADLEEDGVTPRYYNANNKN 533 Query: 375 ELLESFDKITD 385 +++ + + I+ Sbjct: 534 DIVTALEDISS 544 >gi|260820612|ref|XP_002605628.1| hypothetical protein BRAFLDRAFT_150512 [Branchiostoma floridae] gi|229290963|gb|EEN61638.1| hypothetical protein BRAFLDRAFT_150512 [Branchiostoma floridae] Length = 168 Score = 43.7 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 54/177 (30%), Gaps = 23/177 (12%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + + + ++V+ R+G + + + + + Sbjct: 13 SASVGPLQFEKSKKFVRDMVDGFN-----IGAAQTRVGVVQFAWMVQAEFNLGDYLDGTD 67 Query: 262 VKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +++ + ++ + T A+ R L++E + + + + VI ITDG +S Sbjct: 68 LRNAIARIRYMDGPGTEIGKALVFTKRRLFSELYGARPE--TQDVPRIVILITDGRSSPE 125 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 S + + L S + + V D L Sbjct: 126 SQLHTAGVVVYAVGV--------------GEAVDEAELETAASDSSKVYHVRDFDSL 168 >gi|83645400|ref|YP_433835.1| von Willebrand factor type A (vWA) domain-containing protein [Hahella chejuensis KCTC 2396] gi|83633443|gb|ABC29410.1| uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Hahella chejuensis KCTC 2396] Length = 749 Score = 43.7 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 25/194 (12%), Positives = 57/194 (29%), Gaps = 21/194 (10%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 N ++ + + + + Y + +Q Sbjct: 40 VSGSMKKNDPKNLRRPALNLVTELLPEGDSAG-----VWTFGQYVNELAPHQVVDPGW-R 93 Query: 260 NEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 K + +++ TN A+ A + K+ I +TDG Sbjct: 94 RLAKDKAREISSTALYTNIGAALEKASEDFVEGKD---------YSNTHFILLTDGVVDI 144 Query: 319 ASAYQNTLNTLQ-----ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 + + + + + G KI+++A+S + L R S+G +S Sbjct: 145 SQKPGENVAERDRVLTQVLKRVAGFGAKIHTIALSRNADQMLLQRLSIGSNGINAIAENS 204 Query: 374 RELLESFDKITDKI 387 +L F + ++ Sbjct: 205 EQLSRVFLQAFERA 218 >gi|254414923|ref|ZP_05028687.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] gi|196178412|gb|EDX73412.1| Vault protein inter-alpha-trypsin [Microcoleus chthonoplastes PCC 7420] Length = 928 Score = 43.7 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 23/164 (14%), Positives = 48/164 (29%), Gaps = 25/164 (15%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSN---NLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 N I ++ PL+N N N + +N+LN T + Sbjct: 455 NGLNPHDTFTIIDFSDTTRQLSPVPLANTVQNRNSAMNYINQLNASGGTQLRRGIQAVLN 514 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + E + + ++ +TDG N L + G +++S Sbjct: 515 --FPEVDPGRL--------RSIVLLTDG------YIGNENQILAEVQRHLKLGNRLHSFG 558 Query: 347 VSAPPEGQDLLRKCTDSSGQF-----FAVNDSRELLESFDKITD 385 + + LL + + + + F +I + Sbjct: 559 AGSSVN-RFLLNRIAEIGRGISRIVRYDEPTEEVAEQFFGQINN 601 >gi|156367148|ref|XP_001627281.1| predicted protein [Nematostella vectensis] gi|156214186|gb|EDO35181.1| predicted protein [Nematostella vectensis] Length = 166 Score = 43.7 bits (101), Expect = 0.053, Method: Composition-based stats. Identities = 21/135 (15%), Positives = 45/135 (33%), Gaps = 18/135 (13%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK--LNPYENTNTYPAMHHAYRELY 289 +VR G + Y N + + LN+ NP + A+ AY + Sbjct: 42 SRTNVRFGLVVYGTRPRVVFGLNGFRNNGGLFNALNRPIKNPQTGSRIGLALRAAYTRVL 101 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 T K ++ + DG ++ + + ++ G+KI+++ + Sbjct: 102 ARSPRRGAT-------KIIVVLADG--------RSEDDVRRPSNALQARGVKIFAIGIGR 146 Query: 350 PPEGQDLLRKCTDSS 364 G+ L + Sbjct: 147 YINGRQ-LDQLASRP 160 >gi|73970088|ref|XP_531792.2| PREDICTED: hypothetical protein XP_531792 [Canis familiaris] Length = 1465 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 56/170 (32%), Gaps = 30/170 (17%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 K+D++ + + K ++ V+ A + +NL Sbjct: 717 SHSMKSKLDLVKDKIIQFIQEQLKYKRKFN--FVQFDAQAVAWQEKLVEINE--DNLRGA 772 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 +S + + +TNT A+ A+ + + + + +TDG Sbjct: 773 QSWIRDIQIGSSTNTLHALQIAF---------------ADKETQVIYLLTDGRPDQP--- 814 Query: 323 QNTLNTLQICEYMRN-AGMKIYSVAVSAPPE-GQDLLRKCTD-SSGQFFA 369 + E ++ + IY+++ + E L++ + G+F Sbjct: 815 -----PEMVIEQVKVFQKIPIYTISFNYNDEIANGFLKELASLTGGEFHF 859 >gi|325267978|ref|ZP_08134626.1| hypothetical protein HMPREF9098_2354 [Kingella denitrificans ATCC 33394] gi|324980581|gb|EGC16245.1| hypothetical protein HMPREF9098_2354 [Kingella denitrificans ATCC 33394] Length = 1106 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 38/357 (10%), Positives = 81/357 (22%), Gaps = 20/357 (5%) Query: 37 VLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 L+ + S P IK ++ D + + Sbjct: 14 ALAVASVFQSAAAATQPYPSVPLIWQSGTAAIKPNIL-------LFLDTSGSMTTGGVRY 66 Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 I +KA N S+R + I + L+ Sbjct: 67 NGGTASRIVVAKAVARDVITATRENNNWGLMTFNGSMRDATHSRDIANGTGILVYDALNP 126 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 + S + ++ + L + + + Sbjct: 127 TASEQRIWSDQWYSIGGKLKQPVLDVSSTTG----AANYNQLISTINGLPANTNTPIPSA 182 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAY-NIGIVGNQCTPLSNNLNE---VKSRLNKLNPY 272 + + + S + N +S+ V+ +L +L Sbjct: 183 YYEAIRYFRGMPEGTTANSPNTQYTSPIKYRCQKNYIIFVSDGEPTGFPVRYQLGRLMRQ 242 Query: 273 EN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + + + A N S + + D N + Sbjct: 243 DYILRSDATLRQAINNGANGSWDSAYSTARIAYLAHNWDMMDSGRDAEGKSFNDATGSPV 302 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKI 387 + + Y+V A LL + GQ+F N+ +L + + I Sbjct: 303 GQDFSKQTITTYTVGFQANV---QLLSEMARQGGGQYFIANEGDQLKAALTSALNAI 356 >gi|34525892|emb|CAE46626.1| trombospondin-related protein [Plasmodium falciparum] Length = 331 Score = 43.7 bits (101), Expect = 0.054, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 58/170 (34%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVKHAVPLAMKLIQQLNLNENAIHLYLNIFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + ++ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENASQLIVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI V + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVVGIGQGINVAFNRFLVGCHPSDGK 211 >gi|153806292|ref|ZP_01958960.1| hypothetical protein BACCAC_00548 [Bacteroides caccae ATCC 43185] gi|149130969|gb|EDM22175.1| hypothetical protein BACCAC_00548 [Bacteroides caccae ATCC 43185] Length = 342 Score = 43.7 bits (101), Expect = 0.055, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 56/169 (33%), Gaps = 46/169 (27%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L ++P + T A++ A R + + + Sbjct: 142 TQLPITSDYISAKMFLESISPSLISKQGTAIGEAINLAVRSFTPQ----------EGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 ++ ITDGEN A + + G+++ + V P Sbjct: 192 AIVVITDGENHEGGAVEAAKVAAE-------KGIQVSVLGVGMPDGAPIPIEGTNDYRRD 244 Query: 353 ----------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ G + V+++ ++ ++ +K+ + Sbjct: 245 REGNVIVTRLNEAMCQEIAKEGKGIYVRVDNTNSAQKAINQEVNKMAKS 293 >gi|27263162|emb|CAD59479.1| leukocyte integrin alpha-M chain [Ovis aries] Length = 79 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 10/65 (15%), Positives = 19/65 (29%), Gaps = 5/65 (7%) Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEG---QDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + G+ Y + V + L F VN+ L ++ Sbjct: 1 YEDVIPEADRKGIIRYVIGVGDAFNSKKSRKELDTIASKPPADHVFQVNNFEALKTIQNQ 60 Query: 383 ITDKI 387 + +KI Sbjct: 61 LQEKI 65 >gi|157278345|ref|NP_001098275.1| complement factor B [Oryzias latipes] gi|1777334|dbj|BAA12207.1| Bf/C2 [Oryzias latipes] Length = 754 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 34/243 (13%), Positives = 72/243 (29%), Gaps = 31/243 (12%) Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 ++ T +K SKN T + Y + + + + ++ I+ Sbjct: 222 GSSIKDSLTTLQPTNDTQAGRKIRISKNGTLNIYIALDISESVEEEHFKRAKLAIITLIK 281 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL------NPYENTNTYP 279 K + + I + ++S + L + T+ Sbjct: 282 KIAAFTVSPNYEILFFS-ADVYEVVSIVEFYEGKITLESAIKNLEDFQIGDKSTGTDVNA 340 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA- 338 A+ + ++ + + R + TDG + + TL ++N Sbjct: 341 ALKKFEEGMAWIEQKTGDKFSEHR--HVFLLFTDGAYNMGGSPLPTLA------RIKNRV 392 Query: 339 --------GMKI-----YSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLESFDKI 383 G ++ Y + A DLL + G +F + L +FD I Sbjct: 393 YMSPTGDPGSRLDYLESYVFGIGANIFDDDLLPLTAGTEGELHYFRLKKETNLAATFDDI 452 Query: 384 TDK 386 D+ Sbjct: 453 IDE 455 >gi|327405047|ref|YP_004345885.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] gi|327320555|gb|AEA45047.1| von Willebrand factor type A [Fluviicola taffensis DSM 16823] Length = 541 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 23/130 (17%), Positives = 43/130 (33%), Gaps = 16/130 (12%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + +++ ++ + TN L + K V+ +TDGE Sbjct: 426 KKHILEKIDHIHAFGQTNIQSGFETVKTLLSSSKLQKGV--------NSVLLLTDGEFQL 477 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL--LRKCTDSSGQFFAVNDSREL 376 TL I ++ + I V + P + + L K G F + +L Sbjct: 478 ------KEETLAIINQLKANEIGICFVYLGEPLKKKTTKALEKKYSDLGVIFYDTNRIDL 531 Query: 377 LESFDKITDK 386 E+ KI + Sbjct: 532 KEALLKIATE 541 >gi|307152391|ref|YP_003887775.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] gi|306982619|gb|ADN14500.1| von Willebrand factor type A [Cyanothece sp. PCC 7822] Length = 305 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 27/195 (13%), Positives = 64/195 (32%), Gaps = 27/195 (13%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAG---NLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 S +T +S++ +++ + + ++ + V+ T+ + Sbjct: 51 SSSTYQSEFNAPGTTMAQEVQAVKDYVDKNLAILKQPNQIKILGFADQVKPLTVNFTTDS 110 Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + L N L+ +KS N + TN A+ L + + Sbjct: 111 QQIKQQ-LENTLDVLKS--NPEDLGGGTNIDLAISEGTDALAAISDRC----------RE 157 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 ++ +TDG+ + S + + +KI +V + + D+ G + Sbjct: 158 LLIVTDGQATINS---------ETITQAKEKNVKINAVVIGPDSPDIQ--QATQDTGGVY 206 Query: 368 FAVNDSRELLESFDK 382 N + L F + Sbjct: 207 LISNQASNLSPLFTE 221 >gi|262199490|ref|YP_003270699.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262082837|gb|ACY18806.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 808 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 22/116 (18%), Positives = 40/116 (34%), Gaps = 12/116 (10%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +L P TN + AYR + VI +TD + S + + Sbjct: 545 RLLPRGGTNLSAGIEVAYRVARRNYDPYRINR--------VIILTDAYANRGSIDPSLIG 596 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 + + G+ + V +D L TD G +F++ R+ +F + Sbjct: 597 DHVLIGD--DEGIHFSGLGVGYD-FNEDFLNTLTDVGRGTYFSLITERDAARAFGE 649 >gi|74146354|dbj|BAE28944.1| unnamed protein product [Mus musculus] Length = 452 Score = 43.7 bits (101), Expect = 0.056, Method: Composition-based stats. Identities = 21/143 (14%), Positives = 45/143 (31%), Gaps = 15/143 (10%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 L P PK + + + S + + ++ + +L Q ++ + N + Sbjct: 303 FAPENLDPIPKNILFVIDVSGSMWGI---KMKQTVEAMKTILDDLRTDDQFSVVDF-NHN 358 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 VR + + K + K+ P TN A+ A L Sbjct: 359 VRTWRNDLVSATKTQ--------IADAKRYIEKIQPSGGTNINEALLRAIFILNEASNMG 410 Query: 296 HNTIGSTRLKKFVIFITDGENSG 318 + +I ++DG+ + Sbjct: 411 LL---NPDSVSLIILVSDGDPTV 430 >gi|297667862|ref|XP_002812182.1| PREDICTED: vitrin-like isoform 3 [Pongo abelii] Length = 657 Score = 43.7 bits (101), Expect = 0.057, Method: Composition-based stats. Identities = 14/120 (11%), Positives = 39/120 (32%), Gaps = 14/120 (11%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N A+ + +++ Sbjct: 311 MGVVQYGDSPATHFNLKTHMNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANG-- 368 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 369 ---NRSSAPNVVVVMVDGWPTD--------KVEEASRLARESGINIFFITIEGAAENEKQ 417 >gi|290990556|ref|XP_002677902.1| vWFA domain-containing protein [Naegleria gruberi] gi|284091512|gb|EFC45158.1| vWFA domain-containing protein [Naegleria gruberi] Length = 312 Score = 43.4 bits (100), Expect = 0.057, Method: Composition-based stats. Identities = 25/218 (11%), Positives = 70/218 (32%), Gaps = 40/218 (18%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI--GIVGNQCTPLSNNL 259 + +I+ + +++S++ + +R I+Y + P + N Sbjct: 3 CTGSMSGEIEAAKTTVLTILDSLK----DHFKTDLRFSAISYRDHTDDYAVREFPFTKNF 58 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + K ++ ++ + A+ A + + + K I+I D G Sbjct: 59 EKAKGYIDTMSAQGGGDHPEALASALKVVNELPFNKKGK-------KICIWIADAPPHGM 111 Query: 320 SAYQNTL------------NTLQICEYMRNAGMKIYSVAVSAPPEGQDL---LRKCTD-S 363 ++ + + +++ ++ G+ Y++ Q L + + Sbjct: 112 NSSGDRYPEGCKDEEGNVIDWIRLGSDLQEKGVVFYTLICKRAQNDQQLALFMDFLATKT 171 Query: 364 SGQFFAVNDSRELLESF-----------DKITDKIQEQ 390 G+ + ++ +L I KI+E Sbjct: 172 DGKCMLLTNANKLPNLIINGSIENDEMDQLIAQKIEEL 209 >gi|313230976|emb|CBY18974.1| unnamed protein product [Oikopleura dioica] Length = 522 Score = 43.4 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 50/185 (27%), Gaps = 25/185 (13%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN--NLNEVKSR 265 ++ + + + K + + + I I N + + +++ S Sbjct: 292 SQVGDKQSAVKDFFKDLLKEFDTQTAVKISITDIGDGREGQVNTVLGPTQFVDTSDINSA 351 Query: 266 LNKLNPYENT-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 LN L Y T + ++ + +I ITDG + Sbjct: 352 LNSLTWYGTTTAIADGITEGASQMDTT----------DNVNDVMIVITDG------FDGD 395 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQ-DLLRKCTDS-SGQFFAVNDSRELL----E 378 + + AG+ ++ L + S + EL Sbjct: 396 LSSLQTASAAVATAGITAIAIGYDENGGIIGSTLEDIANGVSSNVIEATSTSELDGLALS 455 Query: 379 SFDKI 383 F+ I Sbjct: 456 VFNSI 460 >gi|209527388|ref|ZP_03275895.1| von Willebrand factor type A [Arthrospira maxima CS-328] gi|209492179|gb|EDZ92527.1| von Willebrand factor type A [Arthrospira maxima CS-328] Length = 396 Score = 43.4 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 27/155 (17%), Positives = 56/155 (36%), Gaps = 13/155 (8%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG---TIAYNIGIVGNQCTPLSNNLNE 261 R+ID +E+ + + + K V G + N+ Sbjct: 89 SGKRRIDGALEATRRFLEQMSDRGGDTKVAIVPFGKGGANCPGFEVTQRGIDSRFFPAND 148 Query: 262 VK--SRLNKLNPYE---NTNTYPAMHHAYRELYNEKESS---HNTIGSTRLKKFVIFITD 313 +K + L+ L T+ Y + A R L N ++ G + VI ++D Sbjct: 149 IKQTNFLDYLAAQTLCAATDIYGPLSEAIRVLGNRQDPRFYVPEDSGRPEPRLSVILLSD 208 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 G ++ + Q+ N + + E RN + ++++ Sbjct: 209 GFHNQPNEQQDFDNLITLLE--RNNNIIVHTLGYG 241 >gi|152999662|ref|YP_001365343.1| type IV pilin biogenesis protein [Shewanella baltica OS185] gi|151364280|gb|ABS07280.1| type IV pilin biogenesis protein, putative [Shewanella baltica OS185] Length = 1169 Score = 43.4 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 37/301 (12%), Positives = 80/301 (26%), Gaps = 71/301 (23%) Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 S S + NN ++ SK P ++++ ++ Sbjct: 196 SSSPGTSWADALAAANNTDFGVGQPVTLYTDNYLRWY-GLSKAGKLPTVKVSRLEIAKKA 254 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV-GNQCTPLSNNL-NEVKSRLNKLNPYEN 274 N+++S + G + T ++++ + + ++ L N Sbjct: 255 ISNIISSTPTVDFGLAVFNYNYPNEGNRDGGRIVSGITQMTDSTRASLLTTIDNLLAKTN 314 Query: 275 TNTYPAMHHAYREL------YNEKESSHNTIGSTRLKK---------------------F 307 T M+ AYR Y ++ + + + + Sbjct: 315 TPLCETMYEAYRYFAGKGVKYGHGDTDYGSYVGNKPPYDSLVEKGGSYESPFKVCTDIAY 374 Query: 308 VIFITDGENSGASAYQNTLNTLQIC----------------------------------- 332 VI++TDG + N + +L Sbjct: 375 VIYVTDGTPTVDKNANNDVISLTASGSKEGNYSSFSKNLDTASYLPALASYMFNNDLINK 434 Query: 333 ----EYMRNAGMKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVNDSRELLESFDKITDK 386 + ++ Y++ S E L G +FA +S EL + + Sbjct: 435 LDSSNTEQVQNVRTYTIGFSKGAEDAAPLLAETAKRGGGLYFAAQNSLELQNALNDALSN 494 Query: 387 I 387 I Sbjct: 495 I 495 >gi|58429499|gb|AAW78153.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 43.4 bits (100), Expect = 0.058, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRNNWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 S + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LSIIKSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|172060502|ref|YP_001808154.1| hypothetical protein BamMC406_1450 [Burkholderia ambifaria MC40-6] gi|171993019|gb|ACB63938.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6] Length = 648 Score = 43.4 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 23/203 (11%), Positives = 54/203 (26%), Gaps = 4/203 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M AI + V + + ID+A++ + +Q +D A L+ + SD + K + T Sbjct: 1 MAAIWVMVAIVVLGV-IDIANLYLQKRDLQRVVDLAALAAVQPMTSDPSGCLSDAKNNVT 59 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S+ S ++ Y+ + A + Sbjct: 60 SSANINDKGYAFTLISATATANPTAGNDQIAVSCGRWDSATAYVTPASASANAAQVTAYR 119 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + L + ++ + V ++ +L + + Sbjct: 120 QVNYFFLGLLSQLSGRQAVVSATATARAAAIDTFSVGSTLANLNTSS---SAILDPLLTG 176 Query: 181 LPPPPKKSFWSKNTTKSKYAPAP 203 L ++ Sbjct: 177 LLGATTNVNVGLANYQALAGANV 199 >gi|162448738|ref|YP_001611105.1| hypothetical protein sce0468 [Sorangium cellulosum 'So ce 56'] gi|161159320|emb|CAN90625.1| hypothetical protein sce0468 [Sorangium cellulosum 'So ce 56'] Length = 1041 Score = 43.4 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 20/126 (15%), Positives = 38/126 (30%), Gaps = 16/126 (12%) Query: 258 NLNEVKSRLNKLNPYENT-NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + ++ + + P + AY+ L T K V+ DG + Sbjct: 504 DGAAIEKAIRAVGPGGGGIYVDITLEAAYQALD----------RDTSSLKHVLLFADGSD 553 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + E G VA+ + +L G+F+ V D+ L Sbjct: 554 AENMGPC-----RAMVEAAMRRGTTTSVVALGQGSDVPELEALSRLGGGRFYLVEDATRL 608 Query: 377 LESFDK 382 F + Sbjct: 609 PAVFTQ 614 >gi|224054053|ref|XP_002190891.1| PREDICTED: collagen, type VI, alpha 2 [Taeniopygia guttata] Length = 1016 Score = 43.4 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 24/154 (15%), Positives = 53/154 (34%), Gaps = 19/154 (12%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGIVGNQCTPLS--- 256 + + + N+V+ + ++ K+L+ R+G + Y+ Sbjct: 616 SSESIGYTNFTLEKNFVINVVSRLGSIAKDPKSLTGARVGVVQYSHEGTFEAIKLDDERI 675 Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 ++L+ K + +L T T A+ AY +L E + F + ITDG Sbjct: 676 DSLSSFKEAVKRLEWIAGGTWTPSALQFAYNKLIKESRREKA-------QVFAVVITDGR 728 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + N +C + + ++ + Sbjct: 729 ---YDPRDDDKNLGALC----GRDVVVNTIGIGD 755 >gi|149919202|ref|ZP_01907685.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] gi|149819916|gb|EDM79338.1| von Willebrand factor, type A [Plesiocystis pacifica SIR-1] Length = 877 Score = 43.4 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 22/152 (14%), Positives = 51/152 (33%), Gaps = 16/152 (10%) Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 NL + ++ L T A+ A L E ++ Sbjct: 423 YEAAVPANEQNLVHAERFIDGLQAGGGTMMSGAVDAA---LSPEIGLGR--------HRY 471 Query: 308 VIFITDGE-NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 V F+TDG ++ + ++ + +++ + + + P + L G+ Sbjct: 472 VFFVTDGFISNEDEIARQASALVRAADKAGQR-ARVFGMGIGSSPNRELLASLSKAGKGR 530 Query: 367 FFAVNDSR---ELLESFDKITDKIQEQSVRIA 395 + AV + E +E++ ++ D + I Sbjct: 531 YLAVGNREHPREAVEAYTRMVDSAVLTDIHID 562 >gi|21616104|emb|CAD21938.1| putative complement factor Bf/C2 [Tetraodon nigroviridis] Length = 760 Score = 43.4 bits (100), Expect = 0.059, Method: Composition-based stats. Identities = 39/233 (16%), Positives = 73/233 (31%), Gaps = 16/233 (6%) Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 + T +K SKN T + Y + + D + + ++ I Sbjct: 224 GSAIKESLTTLESINDVQGERKIRISKNGTLNIYIAVDISESIQKDHVESAKKAILKLIT 283 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMH 282 K + + + + + V N N ++K RL K T+ A Sbjct: 284 KISSFSVSPNYELLFFSSELSEVVNILDFFENQPVDIKGRLTKFKVNAEHTGTDLNLAFK 343 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN-----------TLQI 331 + K+ + +I TDG + + T+ + Sbjct: 344 TILERMALIKQ-RVGEKAFEEHRHAIIVFTDGVYNMGGSPLPTVAKIKHMVYMNKIDEET 402 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-QFFAVNDSRELLESFDKI 383 + R+ + IY + A DL + G FF + + + L E+FD I Sbjct: 403 GQNPRDEYLDIYIFGIGAEIYDSDLRPLTAGTGGEHFFKLLEIQNLQETFDNI 455 >gi|113953124|ref|YP_730426.1| structural toxin protein RtxA [Synechococcus sp. CC9311] gi|113880475|gb|ABI45433.1| structural toxin protein RtxA [Synechococcus sp. CC9311] Length = 2154 Score = 43.4 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 34/289 (11%), Positives = 78/289 (26%), Gaps = 5/289 (1%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIREN-AGDIAQKAQINITKD 96 ++ ++ ++ + + ++E D +N Sbjct: 900 IASTKTLAEWNSLSPTDQADPNAADYVFTITLDPSQSNYTVKEYATIDGTTDTTLNYDVK 959 Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 + + + LK + S + I + Sbjct: 960 LTDEDLDFVDGDFEVTWAPSPAVLKVGENISDVASSTTPYKVDYDLVSKSDIIPTGSGTI 1019 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 + + L ++ + L S + + + + L + Sbjct: 1020 EGTNGEDILIGDVGGGSLVNQSINLSLVLDVSRSMILSNINFNNASV----TRFSALQTA 1075 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 +L++ I ++ K V+ T ++G LN+ ++ L TN Sbjct: 1076 TKDLLSEIAQSGATAKVQIVKYSTEGSDVGYYNFTSGDDQTVLNQAFQDIDDLQAGGGTN 1135 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + A + S+ + K VIFI+DGE S +T Sbjct: 1136 YEAGLVTALNWISGGITSNTPLNVNQTDKDKVIFISDGEPSFYYRGNDT 1184 >gi|284799403|ref|ZP_06390123.1| PilC protein [Neisseria subflava NJ9703] gi|284797751|gb|EFC53098.1| PilC protein [Neisseria subflava NJ9703] Length = 1126 Score = 43.4 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 32/235 (13%), Positives = 64/235 (27%), Gaps = 21/235 (8%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID--VLIESAGNLVNSIQKAIQEK 231 + P + R I+ + S KA + Sbjct: 186 ANMSCSNQIPGEDPRLSRNTNFNYDRDYYYSNYYRDIERSAGTSAYQYFGPSEVKANDDS 245 Query: 232 KNLSV------RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 RI + + L N ++ + +K + N M A Sbjct: 246 YGKGKFFNGGGRIKGYFDFLPYQYDNDFGLPENEKKLLCQSSKYKHEYDPNYGRYMWVAA 305 Query: 286 -RELYNEKESSHNTI-------GSTRLKKFVIFITDGENSGA-SAYQNTLNTLQICEYMR 336 + + ++ T +K + DG ++ S + + I Sbjct: 306 GEIIVPYWDRNYKDEKRGMRFFSQTLAEKDIKTAKDGSDAAGKSWDGDPSDPKGI--DYS 363 Query: 337 NAGMKIYSVAVSAPPE--GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 ++ ++V G++ L K +F +LLE+F I D I+ Sbjct: 364 KQLVQTFTVGFGEGISEVGREYLEKGASRPDWYFNAAKKEDLLEAFKTIVDNIEN 418 >gi|156393752|ref|XP_001636491.1| predicted protein [Nematostella vectensis] gi|156223595|gb|EDO44428.1| predicted protein [Nematostella vectensis] Length = 419 Score = 43.4 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 20/116 (17%), Positives = 39/116 (33%), Gaps = 14/116 (12%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TN A AY + + + +K VI ITDG + + + Sbjct: 124 GMTNMRGAFEKAYEV----CKGTWSGKKRLNIKTTVILITDG---HWNWPWQNPDPVPKA 176 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITD 385 + + G++I + V +L + F +++ E F+K+ Sbjct: 177 QQLIREGVEILAFGVGYGISLSNLQTITANQRAGHTYAFQISNFDE----FNKLAT 228 >gi|145297762|ref|YP_001140603.1| type IV pilin biogenesis protein [Aeromonas salmonicida subsp. salmonicida A449] gi|88866552|gb|ABD57328.1| TapY1 [Aeromonas salmonicida subsp. salmonicida A449] gi|142850534|gb|ABO88855.1| type IV pilin biogenesis protein [Aeromonas salmonicida subsp. salmonicida A449] Length = 1203 Score = 43.4 bits (100), Expect = 0.060, Method: Composition-based stats. Identities = 33/324 (10%), Positives = 79/324 (24%), Gaps = 79/324 (24%) Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 + V S + + Y + ++ SN + Sbjct: 166 QDIQNRTPANPFSGTLEVGSVSDGYPVDKTSNTRGEWYYTGNANDATSGSNTSVTLYTAN 225 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS---------IQKAIQEKKNLSVR 237 W Y P ++ + ++ ++++S I R Sbjct: 226 YIRW-------YYGPTGYAVESRLRIAKDAVKSVISSTPGVDFGLAIFNTNNSTSTNGGR 278 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR----------- 286 + + + T + + + L+ NT ++ AYR Sbjct: 279 VVRRILSNDDMVGTITA----EQNLLNTVEGLSATTNTPLCETLYEAYRFYGGQPVLYGK 334 Query: 287 -------ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL---------- 329 + + + + +VI++TDG + + + + L Sbjct: 335 QGGTRDKLAEDPEGTYTSPYDKCSNNGYVIYVTDGAPTSDTDADSAVKALMDSLSTDERA 394 Query: 330 ------------------QICEYMRNAGM----------KIYSVAVSAPP---EGQDLLR 358 + YM+N + ++V G L Sbjct: 395 AYGSTVGYGSGSAKSYLAAMAGYMKNKDVNTSSPGKQTVTTFTVGFGDEAISGAGNLLAE 454 Query: 359 KCTDSSGQFFAVNDSRELLESFDK 382 GQ++ + L ++ Sbjct: 455 TARRGGGQYYPAVSAEALTQALRS 478 >gi|163815330|ref|ZP_02206705.1| hypothetical protein COPEUT_01494 [Coprococcus eutactus ATCC 27759] gi|158449304|gb|EDP26299.1| hypothetical protein COPEUT_01494 [Coprococcus eutactus ATCC 27759] Length = 348 Score = 43.4 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 20/93 (21%), Positives = 35/93 (37%), Gaps = 10/93 (10%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--SAPPEGQDLL 357 K +IF TD + + + IC +N G+ +Y V + + Sbjct: 224 KDKTRTKIMIFSTDNDIQ-GNPVATLDDAASIC---KNNGVTVYGVGTKEMTDENRESMK 279 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + + GQFF E +FD+I I++ Sbjct: 280 KAVESTGGQFF----MEEESGTFDQIVTAIEKS 308 >gi|256071156|ref|XP_002571907.1| dihydropyridine-sensitive l-type calcium channel [Schistosoma mansoni] gi|238657056|emb|CAZ28137.1| dihydropyridine-sensitive l-type calcium channel, putative [Schistosoma mansoni] Length = 421 Score = 43.4 bits (100), Expect = 0.061, Method: Composition-based stats. Identities = 22/162 (13%), Positives = 51/162 (31%), Gaps = 18/162 (11%) Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 + + I N S I ++ N+ L +L+ +K ++ ++ Sbjct: 268 DHVAPMIVTANNESEPICFNSFVQATRRNK-LRLFYDLSTLK-------ARGYSDFPASL 319 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 AY N ES+ G K ++ +TD + + L + + Sbjct: 320 KFAYEMFRNLTESARGDRGKELRNKILVLLTDNAFVFDESVLSQLKQQ-------KSNIT 372 Query: 342 --IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 IYS+ + ++ C ++ + + + Sbjct: 373 TFIYSLGEPVGAAYEHKMKACA-TNDYYQYLPTVGAVSNLMK 413 >gi|327270794|ref|XP_003220173.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 917 Score = 43.4 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 51/129 (39%), Gaps = 26/129 (20%) Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N ++ A++ ++ ++ +T+GE S S + + + + Sbjct: 392 NICEGVNAAFQVFS--------QKLTSTEGCEIVLLTNGEGSDLSPCLSKIQSQE----- 438 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN---DSRELLESFDKITD---KIQ 388 + I+++A + + L K D + G+ F DS L+++F I+ Sbjct: 439 ----IIIHTIAFGSKASNE--LEKLADMTGGKTFYATDSLDSNGLIDAFGGISSGSGDAS 492 Query: 389 EQSVRIAPN 397 +QS+++ Sbjct: 493 QQSIQLESK 501 >gi|325267447|ref|ZP_08134103.1| von Willebrand factor type A [Kingella denitrificans ATCC 33394] gi|324981088|gb|EGC16744.1| von Willebrand factor type A [Kingella denitrificans ATCC 33394] Length = 238 Score = 43.4 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 68/175 (38%), Gaps = 14/175 (8%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 KID L ++ N++++ + + + + V + T G + P + ++V+ Sbjct: 30 DKIDNLNKAVENMLDTFAQEEKMETEILVSVITF----GGKVDLHVPFTK-ASQVQWH-- 82 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L +T A+ A + ++ T S + ++ ++DG+ + + + Sbjct: 83 GLQVNGDTPMGTALKMAKAMIEDK-----ETTPSRAYRPTIVLVSDGQPTDGNIW--KQA 135 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 R++ ++A+ + L R ++ F ++ +L E F + Sbjct: 136 MADFISEGRSSKCDRMAMAIGHDADETVLKRFIEGTAHDLFYAENAGQLHEFFQR 190 >gi|256111841|ref|ZP_05452806.1| NorD protein [Brucella melitensis bv. 3 str. Ether] gi|265993299|ref|ZP_06105856.1| protein norD [Brucella melitensis bv. 3 str. Ether] gi|262764169|gb|EEZ10201.1| protein norD [Brucella melitensis bv. 3 str. Ether] Length = 633 Score = 43.4 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 46/132 (34%), Gaps = 20/132 (15%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T AM HA +L + KK ++ +TDG+ + Sbjct: 515 TVEHRIAALKPGFYTRMGAAMRHATAKLAEQPNR----------KKLLLLLTDGKPNDVD 564 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRE 375 ++ + +R G+ +++V V L G + V + + Sbjct: 565 HYEGRFALEDSRRAAGEVRAKGVNVFAVTVDREASAYLPALF-----GRGGYALVANLAK 619 Query: 376 LLESFDKITDKI 387 L + I + Sbjct: 620 LPVALPAIYRML 631 >gi|17989341|ref|NP_541974.1| NorD protein [Brucella melitensis bv. 1 str. 16M] gi|256043169|ref|ZP_05446110.1| NorD protein [Brucella melitensis bv. 1 str. Rev.1] gi|260564388|ref|ZP_05834873.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. 16M] gi|265989599|ref|ZP_06102156.1| protein norD [Brucella melitensis bv. 1 str. Rev.1] gi|81850090|sp|Q8YBA4|NORD_BRUME RecName: Full=Protein norD gi|17985210|gb|AAL54238.1| nord protein [Brucella melitensis bv. 1 str. 16M] gi|260152031|gb|EEW87124.1| conserved hypothetical protein [Brucella melitensis bv. 1 str. 16M] gi|263000268|gb|EEZ12958.1| protein norD [Brucella melitensis bv. 1 str. Rev.1] Length = 633 Score = 43.4 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 46/132 (34%), Gaps = 20/132 (15%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T AM HA +L + KK ++ +TDG+ + Sbjct: 515 TVEHRIAALKPGFYTRMGAAMRHATAKLAEQPNR----------KKLLLLLTDGKPNDVD 564 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRE 375 ++ + +R G+ +++V V L G + V + + Sbjct: 565 HYEGRFALEDSRRAAGEVRAKGVNVFAVTVDREASAYLPALF-----GRGGYALVANLAK 619 Query: 376 LLESFDKITDKI 387 L + I + Sbjct: 620 LPVALPAIYRML 631 >gi|268532320|ref|XP_002631288.1| Hypothetical protein CBG03101 [Caenorhabditis briggsae] Length = 394 Score = 43.4 bits (100), Expect = 0.063, Method: Composition-based stats. Identities = 23/156 (14%), Positives = 53/156 (33%), Gaps = 9/156 (5%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTI---AYNIGIVGNQCTPLSNNLNEVKSRL-NK 268 + + ++ SI K K R + Y+ S + +++ + + + Sbjct: 52 VRNTLTQVLGSISKIGPVKYPADPRSTCVGIVTYDDNATTQSQLDASKSFSDLYNVIQSS 111 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L +NTNT K+ + T K + F D + L+ Sbjct: 112 LISVDNTNTSYLSLALLAAEKALKDGRNRTYRFNYKKVIIAFAAD-----YQGHGTALDA 166 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 + I +++ + I +VA ++ + Q ++ Sbjct: 167 MPIANRLKDNAVTIITVACTSNSDKQTAIQGIASPG 202 >gi|312138512|ref|YP_004005848.1| hypothetical protein REQ_10630 [Rhodococcus equi 103S] gi|325676202|ref|ZP_08155882.1| hypothetical protein HMPREF0724_13665 [Rhodococcus equi ATCC 33707] gi|311887851|emb|CBH47163.1| putative secreted protein [Rhodococcus equi 103S] gi|325552986|gb|EGD22668.1| hypothetical protein HMPREF0724_13665 [Rhodococcus equi ATCC 33707] Length = 545 Score = 43.4 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 37/351 (10%), Positives = 79/351 (22%), Gaps = 17/351 (4%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + ++V + + + + ++ AG A + + + Sbjct: 193 AVPGALVPLAQAQAANMHQTDPAARLAAVASDGGIAIATEQQVAGQNAGDSAVRLEATTP 252 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 N + P AL + G + LD R Sbjct: 253 NSGAVFLDYPVVATAPGSEHDDARSAGVALAEVMGSDAGRAALAQRGFRAPDRAPLDAGR 312 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + D+ + D S S A +I + ++++ Sbjct: 313 GVGDVAVLTVEDPTETAQVLRRYAVLALPSRALVVEDVSGSMAEQAGPETRIALTVQASE 372 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN------LNEVKSRLNKLNP- 271 Q L + + L Sbjct: 373 TGARLFPDNAQLGLWAFSIGLGGGSQDYKELAPIRRLDETVDGVSHRQRLTDAVRTLPSL 432 Query: 272 -YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T Y A+R++ + + VI +TDG N S Sbjct: 433 VKGGTGLYDTTLAAFRKVKEGYDPAAI--------NSVILLTDGANEDPSTISLDELLAT 484 Query: 331 ICEYMR-NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + + I ++ ++ + L + + G E+ F Sbjct: 485 LKREQDPARPVIIVTIGITEDADAAVLQKISAATGGTSHVARTPAEIPGVF 535 >gi|189220466|ref|YP_001941106.1| hypothetical protein Minf_2455 [Methylacidiphilum infernorum V4] gi|189187324|gb|ACD84509.1| Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [Methylacidiphilum infernorum V4] Length = 340 Score = 43.4 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 15/140 (10%), Positives = 37/140 (26%), Gaps = 41/140 (29%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P TN + A + + KK +I ++DGE+ G Q ++ Sbjct: 166 PNGGTNFAAMLDEALQFFSSSGR----------SKKMLILLSDGEDHGGGWQQRLVD--- 212 Query: 331 ICEYMRNAGMKIYSVAVSAPPE-----------------------GQDLLRKCT-DSSGQ 366 + + + S+ + + L + G Sbjct: 213 ----FKKESIPVLSIGIGSSNGAVIRNSNGSLYKDYNGEPIVSIFNPAALELIAHSTGGL 268 Query: 367 FFAVNDSRELLESFDKITDK 386 + + ++ + + Sbjct: 269 YIQADKYFDITTVVESLAKN 288 >gi|104780565|ref|YP_607063.1| hypothetical protein PSEEN1376 [Pseudomonas entomophila L48] gi|95109552|emb|CAK14253.1| hypothetical protein PSEEN1376 [Pseudomonas entomophila L48] Length = 864 Score = 43.4 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 47/427 (11%), Positives = 109/427 (25%), Gaps = 57/427 (13%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYI----RNQMQSA-LDA----------------AVLS 39 MT +++ + I+ A ++ ++ A LD+ A L+ Sbjct: 384 MTLMLLGASLGKVGLDIEAAMHAADEQSRKSALREAMLDSLFAALNMTDLGFESSFASLA 443 Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 A T+ + L G R + + Sbjct: 444 YKAPPDEVGANLSQWEVSTSTTLAVEGGESNQLITGDLGRSGRLRGIRVGSDGSCWIVLD 503 Query: 100 PLQYIAESKAQYEIP-----TENLFLKGLIPS-------------ALTNLSLRSTGIIER 141 L Y ++ L P Sbjct: 504 GLSYRVRYNHDLQVWQVVPAHNPFAFGPLYPVRLSESNEWKLLVPPRLVGGAPPVVEHMP 563 Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAP 201 S+ + + + +D +RS + S + + Sbjct: 564 STMSRFWNRHLTVDSARSGASAANALRRQKALLESRSIPQLARDQVPDLDERGLDCVQVD 623 Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 A + + D + +L+ + + K N R GT Y+ + + Sbjct: 624 GVAQYSYRYD--RQYFNSLIE-YYTSDESKVNDVFRSGTYRYDDEDEYIVDLVDTLDQLP 680 Query: 262 VKSRLNKLNPYENTNTYPA--MHHAYRELYNEKESSHNTIGSTRLKKFVIFIT----DGE 315 + +N T +L + ++ T + K F + + Sbjct: 681 KSNEVNLYRAGNGTRGTSGQHFRTGRLKLGDVLVNTDLTSFTENPYKVAEFASAPSLNAP 740 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + + +T ++ G I + ++ EG+ L + G +F +++ Sbjct: 741 GNLPGVFDDTSVVFELPMSHYQDGTPISAFSL-YWDEGETLFQ-----PGHYFRIDN--- 791 Query: 376 LLESFDK 382 L + + + Sbjct: 792 LEQVYGE 798 >gi|92113590|ref|YP_573518.1| von Willebrand factor, type A [Chromohalobacter salexigens DSM 3043] gi|91796680|gb|ABE58819.1| von Willebrand factor, type A [Chromohalobacter salexigens DSM 3043] Length = 596 Score = 43.4 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 18/113 (15%), Positives = 41/113 (36%), Gaps = 9/113 (7%) Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENS---------GASAYQNTLNTLQICEYM 335 Y++ + +++ + K+ VI +TDG A + + Sbjct: 102 YQQFTDIEQALREASQAAGGKRHVILLTDGMVDLPGSGEVKRKRDAASRETLIASLAPEL 161 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + ++++A+S + L R + G ELL +F + ++I Sbjct: 162 ATQDVVVHTIALSRNVDRDLLERVSQSTDGLAAVAETPEELLRAFLDVLERIV 214 >gi|256419952|ref|YP_003120605.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] gi|256034860|gb|ACU58404.1| von Willebrand factor type A [Chitinophaga pinensis DSM 2588] Length = 345 Score = 43.4 bits (100), Expect = 0.064, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 58/185 (31%), Gaps = 50/185 (27%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN----KLNPYENTNTYPAMHHAYRELY 289 + R+G + + PL+ + + K L + P + T A+ A Sbjct: 128 DNDRVGLVVFAGNAYLQ--MPLTIDYSAAKMYLTTVSPDMIPTQGTAIGQAIQVANDAFN 185 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 ++ K +I I+DGE+ + G+ I ++ + + Sbjct: 186 KKERK----------HKSLIIISDGEDHD-------EAAISKARAAFEDGVVINTIGIGS 228 Query: 350 PPE------------------------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKIT 384 P +D L+ + G + ++++ E E + +T Sbjct: 229 PTGSPLPDPETGTYKKDKEGNTVISKLNEDALKSIAAAGKGIYEHLDNNTE--EVVNSLT 286 Query: 385 DKIQE 389 KI Sbjct: 287 QKIDS 291 >gi|330904402|gb|EGH34974.1| von Willebrand factor type A domain-containing protein [Pseudomonas syringae pv. japonica str. M301072PT] Length = 64 Score = 43.4 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 11/57 (19%), Positives = 22/57 (38%), Gaps = 17/57 (29%) Query: 340 MKIYSVAVSAPPE----------------GQDLLRKCTD-SSGQFFAVNDSRELLES 379 +KIY + + + P+ + L++ S GQ+F D +L + Sbjct: 1 VKIYPIGIGSDPDKDALQSVLGLNPSLDLDEPTLKEIASLSGGQYFRARDGDQLEKI 57 >gi|123491786|ref|XP_001325914.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121908821|gb|EAY13691.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 753 Score = 43.4 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 26/261 (9%), Positives = 79/261 (30%), Gaps = 35/261 (13%) Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 N ++ + + +S ++V ++ K N+ + + K Sbjct: 160 NFEFSIAIKTKKEIQEIIVSAKGTMNVIDPHNVTFVTKTYPNDRSIIIEARIKDKDKSIA 219 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 S + S + + + S+ + + + + + IG Sbjct: 220 ISSDGYISISTYTFFEGKVQANTEFYFIIDCSGSMYGSRIKNAKSCLNVLLHSLPIGCRF 279 Query: 250 NQCTPLSN-------------NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + + N+++ +L+ ++ M + + + Sbjct: 280 SIIKFGTKFEVALEPCDYTDENMSKAMHQLDLIDAD---MCGNDMISPLKYISEHPQK-- 334 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE--YMRNAGMKIYSVAVSAPPEGQ 354 K V +TDGE+ + + IC +++++ + + + Sbjct: 335 -----KDYIKQVFLLTDGED----------DRISICAMVQANRDNFRVFTIGIGSDADRN 379 Query: 355 DLLRKCTDSSGQFFAVNDSRE 375 ++ + SG++ ++D E Sbjct: 380 LIIDVARNGSGRYIFIDDEDE 400 >gi|222055527|ref|YP_002537889.1| von Willebrand factor A [Geobacter sp. FRC-32] gi|221564816|gb|ACM20788.1| von Willebrand factor type A [Geobacter sp. FRC-32] Length = 569 Score = 43.4 bits (100), Expect = 0.065, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 39/119 (32%), Gaps = 19/119 (15%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + P+ T + A+ L F+I +TDG + + Sbjct: 466 SVMPHGRTASGEAIIATALNLKQSSRKP-----------FIIHLTDGAAN---WGCGVSD 511 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 ++ C + +++ ++ + P + L + VND +L F ++ Sbjct: 512 AIRYC---KARNVRLLTIGLECDPMNKTAL--AEEHGEMIQFVNDVDQLPTVFKQLLSS 565 >gi|330792171|ref|XP_003284163.1| hypothetical protein DICPUDRAFT_75136 [Dictyostelium purpureum] gi|325085860|gb|EGC39259.1| hypothetical protein DICPUDRAFT_75136 [Dictyostelium purpureum] Length = 794 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 29/158 (18%), Positives = 54/158 (34%), Gaps = 9/158 (5%) Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 S S + I + VN + K I +G Y GI Sbjct: 270 SSMSSDVQIVFCFDTTGSMATIIQSVRTQIVQTVNRLMKDI--PNIQIGIMGMGDYCDGI 327 Query: 248 VGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + LS+N++E+ + +N + + A +A + S H K Sbjct: 328 NVLKTLDLSSNIDEIVAFINSVPNTSGGDAPEAYEYALHKAKELSWSEH-------TSKA 380 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + I D + ++N + C+ + + G+K+Y V Sbjct: 381 FVMIGDSNPHEPTFTNLSINWFKECDDLYDMGIKVYGV 418 >gi|307294426|ref|ZP_07574270.1| hypothetical protein SphchDRAFT_1896 [Sphingobium chlorophenolicum L-1] gi|306880577|gb|EFN11794.1| hypothetical protein SphchDRAFT_1896 [Sphingobium chlorophenolicum L-1] Length = 556 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 29/296 (9%), Positives = 69/296 (23%), Gaps = 35/296 (11%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALD------------------AAVLSGCAS 43 A+ + A D A + + +++Q+A D A + A Sbjct: 20 VALSLFGLIAVGGIAFDYARMASLDSELQNAADQAALAAATQLDGKTGACSRAANAASAL 79 Query: 44 IVSDRTIKDPTTKKDQTSTIFKKQ------IKKHLKQGSYIRENAGDIAQKAQINITKDK 97 I +D + T+ + K K A + +T + Sbjct: 80 IRNDARFANDGNASGLAITVANEATCDRTGFIKFYKNKDRSDTGTLADADVNFVEVTVNS 139 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 + A + + + A+ + + N + + Sbjct: 140 RTARFALTPVVAMFSSGPLSAKAYAGLGEAICKVPPVMICNPDEPIGNTDVDYDFAVANR 199 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF-----------WSKNTTKSKYAPAPAPA 206 + N N+ + + W Sbjct: 200 IGKGLKLVTVGNGNSAWAPGNFGYLDTGSSTSNPNVELREALGWISVPGDCSSLEGVKTR 259 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + ++ + ++A ++ + G G C P N + +V Sbjct: 260 TGAGTTVTQAINTRFDIYERANNNAQSGTGNNGNGNNASCPSGGLCPPSINTVKDV 315 >gi|119614001|gb|EAW93595.1| hCG2042895 [Homo sapiens] Length = 713 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 52/156 (33%), Gaps = 16/156 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNK 268 D + +L + I + + V S +L K ++ Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N T +Y A+ +A R L E K V+ +TDG + + + Sbjct: 124 MNLIGQGTFSYYAISNATRLLKREGRKDGV--------KVVLLMTDGID-----HPKNPD 170 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 I E R +G+ ++A+S + LR + Sbjct: 171 VQSISEDARISGISFITIALST-VVNEAKLRLISGD 205 >gi|51095061|gb|EAL24305.1| similar to Matn2-prov protein [Homo sapiens] Length = 651 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 52/156 (33%), Gaps = 16/156 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNK 268 D + +L + I + + V S +L K ++ Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N T +Y A+ +A R L E K V+ +TDG + + + Sbjct: 124 MNLIGQGTFSYYAISNATRLLKREGRKDGV--------KVVLLMTDGID-----HPKNPD 170 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 I E R +G+ ++A+S + LR + Sbjct: 171 VQSISEDARISGISFITIALST-VVNEAKLRLISGD 205 >gi|51095062|gb|EAL24306.1| similar to Matn2-prov protein [Homo sapiens] Length = 668 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 28/156 (17%), Positives = 52/156 (33%), Gaps = 16/156 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNK 268 D + +L + I + + V S +L K ++ Sbjct: 64 FDKQKDFVDSLSDKIFQLTPGRSLEYDIKLAALQFSSSVQIDPPFSSWKDLQTFKQKVKS 123 Query: 269 LNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +N T +Y A+ +A R L E K V+ +TDG + + + Sbjct: 124 MNLIGQGTFSYYAISNATRLLKREGRKDGV--------KVVLLMTDGID-----HPKNPD 170 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 I E R +G+ ++A+S + LR + Sbjct: 171 VQSISEDARISGISFITIALST-VVNEAKLRLISGD 205 >gi|328704829|ref|XP_001947889.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-4-like [Acyrthosiphon pisum] Length = 1219 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 34/359 (9%), Positives = 86/359 (23%), Gaps = 54/359 (15%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ A + + L+ + + + F K Y + Sbjct: 110 RLMEAGEQSALA---------NQETAVKSERYVNDGFGDY-KSLESIDRYDQCAFNTTVN 159 Query: 88 KAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLA 147 + T + + Q+ + Sbjct: 160 CVLLQSTYVDPDSVDETVVRDMQWTDHLDP-IFVSNYEMDPVLSWQYFGSTKGTLRRFPT 218 Query: 148 ISICMVLDVSRSMEDLYLQKHN----DNNNMTSNKY---LLPPPPKKSFWSKNTTKSKYA 200 + +S S Y + + K + TT + Sbjct: 219 LRWPSYSGLSPSALFDYRLNPWFVEAATSAKDIVIIADFSIALSDYKLSLVRATTLAALD 278 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 A + L S +V ++ I + NL Sbjct: 279 TLGANDFVNVLSLESSNYEIVPCFKEMIVQANE-----------------------KNLR 315 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +++S + + ++N A+ A+ L+ + + + + ++ ITDG Sbjct: 316 DLRSAVAQSKFAGSSNFTGALARAFDILHKFNRTGQGSQCN----QAILIITDGPFGPYK 371 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF-AVNDSRELLE 378 +++++ + + ++ +F + D R L E Sbjct: 372 EILQHNKPH--------MPVRVFTYLIGKDDSNAADMNWIACNNKGYFEHIEDQRNLRE 422 >gi|309358766|emb|CAP33788.2| hypothetical protein CBG_15420 [Caenorhabditis briggsae AF16] Length = 876 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 26/148 (17%), Positives = 54/148 (36%), Gaps = 16/148 (10%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL--NPYENTNTYPAMHHAYRELYN 290 N +++I + N++ + L L P E+T T A+ A E+++ Sbjct: 54 NWTIKIVRDLPVHEDAVRVGIVQYSERNDIITHLETLKFMPGEDTRTGVALSKADDEIFD 113 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + +I TDG + + +R G+KIY+++V++ Sbjct: 114 YDGGARLKAT-----RLIIVFTDGL--------SMDKPTLAAKALRRKGVKIYTISVNSI 160 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLE 378 ++L + F D + E Sbjct: 161 GFVPEML-GIVGDADNVFGPTDENRIEE 187 >gi|309791117|ref|ZP_07685650.1| von Willebrand factor type A [Oscillochloris trichoides DG6] gi|308226815|gb|EFO80510.1| von Willebrand factor type A [Oscillochloris trichoides DG6] Length = 454 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 22/114 (19%), Positives = 41/114 (35%), Gaps = 16/114 (14%) Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +RL+ L E T+ M EL + + +I +TDG + Sbjct: 149 TRLHSLRLGEATDLSSGMRLGLAELAHAAAPGTV--------RRLILLTDG------FTE 194 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSREL 376 N + + AG+ I ++ + DLL D S G+ + + ++ Sbjct: 195 NADACMALARQAAQAGVSISTLGLG-GEFQDDLLTGLADVSGGRASFMRRADQI 247 >gi|156358483|ref|XP_001624548.1| predicted protein [Nematostella vectensis] gi|156211335|gb|EDO32448.1| predicted protein [Nematostella vectensis] Length = 186 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 21/155 (13%), Positives = 48/155 (30%), Gaps = 20/155 (12%) Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 + + ++ + +G I Y+ ++++ +++ Sbjct: 18 GTFKKCINFVKRVLR--AFDVSEKGTHVGAIIYSTDTKLAFDFNTYKERGDIEAAFDRVK 75 Query: 271 PYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T T + A ELY + ++ ITDG + + Sbjct: 76 FLGETTFTGKGLKMALSELYKTAR--------KDVSNLLVVITDGRSHDDVVKPS----- 122 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 E +RNAG++I +V + + L+ Sbjct: 123 ---EMLRNAGVRIVTVGLG-NSFDINQLKAMAGKP 153 >gi|114579064|ref|XP_515640.2| PREDICTED: hypothetical protein LOC459430 isoform 2 [Pan troglodytes] Length = 1294 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 26/268 (9%), Positives = 80/268 (29%), Gaps = 32/268 (11%) Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 + +I + + ++ + + +++++ Y + Sbjct: 418 ADGVVDIKAKPENESVQTSAETNKKTVHAKYCSRFVHAPWKDGSLVHVNITKEKCKWYSE 477 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA--PAPAPANRKIDVLIESAGNLVNSI 224 + + + +S + + Y K+D++ + + Sbjct: 478 RIHTALARIRRRIKWLQDGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQ 537 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 K + V+ A + +NL + +S + + +TNT A+ A Sbjct: 538 LKYQSKFN--FVKFDGQAVAWREQLAEVNE--DNLKQAQSWIRHIKIGSSTNTLSALKTA 593 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN-AGMKIY 343 + + + + + +TDG + + ++ + IY Sbjct: 594 F---------------ADKETRAIYLLTDGRPDQP--------PETVIDQVKLFQEIPIY 630 Query: 344 SVAVSAPPE-GQDLLRKCTD-SSGQFFA 369 +++ + E L++ + G+F Sbjct: 631 TISFNYNDEIANRFLKEVAALTGGEFHF 658 >gi|114579066|ref|XP_001156305.1| PREDICTED: hypothetical protein isoform 1 [Pan troglodytes] Length = 1059 Score = 43.4 bits (100), Expect = 0.066, Method: Composition-based stats. Identities = 26/268 (9%), Positives = 80/268 (29%), Gaps = 32/268 (11%) Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 + +I + + ++ + + +++++ Y + Sbjct: 418 ADGVVDIKAKPENESVQTSAETNKKTVHAKYCSRFVHAPWKDGSLVHVNITKEKCKWYSE 477 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA--PAPAPANRKIDVLIESAGNLVNSI 224 + + + +S + + Y K+D++ + + Sbjct: 478 RIHTALARIRRRIKWLQDGSQSLFGRLHNDCIYILIDTSHSMKSKLDLVKDKIIQFIQEQ 537 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 K + V+ A + +NL + +S + + +TNT A+ A Sbjct: 538 LKYQSKFN--FVKFDGQAVAWREQLAEVNE--DNLKQAQSWIRHIKIGSSTNTLSALKTA 593 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN-AGMKIY 343 + + + + + +TDG + + ++ + IY Sbjct: 594 F---------------ADKETRAIYLLTDGRPDQP--------PETVIDQVKLFQEIPIY 630 Query: 344 SVAVSAPPE-GQDLLRKCTD-SSGQFFA 369 +++ + E L++ + G+F Sbjct: 631 TISFNYNDEIANRFLKEVAALTGGEFHF 658 >gi|149773089|emb|CAO01894.1| collagen type VI alpha 6 [Mus musculus] Length = 226 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 24/162 (14%), Positives = 54/162 (33%), Gaps = 17/162 (10%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL--NKLNPYENTNTYPAMHHAYRELYNE 291 R+ Y+ + N N + + L N + A+ A+R ++ Sbjct: 61 NKYRVALAQYSDALHNEFQLGTFKNRNPMLNHLKKNFGFIGGSLKIGNALQEAHRTYFSA 120 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 + + V+ + ++ + + + +R G+KI SV V Sbjct: 121 PTNGRDKK--QFPPILVVLAS---------AESEDDVEEAAKALREDGVKIISVGVQK-- 167 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 ++ L+ S F + +R+L F +I + + Sbjct: 168 ASEENLKAMATSQ-FHFNLRTARDLS-VFAPNMTEIIKDVTQ 207 >gi|13475442|ref|NP_107006.1| hypothetical protein mlr6511 [Mesorhizobium loti MAFF303099] gi|14026194|dbj|BAB52792.1| mlr6511 [Mesorhizobium loti MAFF303099] Length = 537 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 17/131 (12%), Positives = 44/131 (33%), Gaps = 20/131 (15%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 P + + + + L T A+ A L ++ + V+ I Sbjct: 71 VPPQPGSASAITDAADSLKFLGKTPLTAAVKQAAEALKYTEDKAT-----------VVLI 119 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYS--VAVSAPPEGQDLLRKCTD-SSGQFF 368 TDG + + + + ++ +G+ + V + + + + G++ Sbjct: 120 TDGLETCGG------DPCALGKELKASGVDFTADVVGFGLTADEGKQIACLAENTGGKYI 173 Query: 369 AVNDSRELLES 379 +D + L E+ Sbjct: 174 QASDEKALQEA 184 >gi|260799828|ref|XP_002594886.1| hypothetical protein BRAFLDRAFT_124461 [Branchiostoma floridae] gi|229280123|gb|EEN50897.1| hypothetical protein BRAFLDRAFT_124461 [Branchiostoma floridae] Length = 394 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 12/65 (18%), Positives = 23/65 (35%), Gaps = 4/65 (6%) Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + + + R+ G+ +YSV + P LL + F D L ++ Sbjct: 234 DTPLDDHVAEADAARDDGITLYSVGIGDPVNSAVLL-DIAEIPPNVFDDTDPCAL---YN 289 Query: 382 KITDK 386 +I Sbjct: 290 RILQD 294 >gi|187607706|ref|NP_001120436.1| hypothetical protein LOC100145523 [Xenopus (Silurana) tropicalis] gi|170284610|gb|AAI61198.1| LOC100145523 protein [Xenopus (Silurana) tropicalis] Length = 911 Score = 43.4 bits (100), Expect = 0.067, Method: Composition-based stats. Identities = 27/127 (21%), Positives = 50/127 (39%), Gaps = 15/127 (11%) Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGST 302 + + ++ + E KS+LN +N T T A+ ++ Sbjct: 93 HYSDEVIIFSDITTSKQEYKSKLNAVNYIGRGTFTDCAL---------SNMTALIQKHGG 143 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 F + ITDG +G+ + + RNAG+K++SVA S + LR+ + Sbjct: 144 DAINFAVVITDGHVTGSPCGG----MMHQADRARNAGIKLFSVAASHDVY-ESGLREIAN 198 Query: 363 SSGQFFA 369 + + F Sbjct: 199 TPYELFR 205 >gi|110598613|ref|ZP_01386880.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] gi|110339782|gb|EAT58290.1| von Willebrand factor, type A [Chlorobium ferrooxidans DSM 13031] Length = 343 Score = 43.4 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 23/166 (13%), Positives = 55/166 (33%), Gaps = 44/166 (26%) Query: 254 PLSNNLNEVKSRLN----KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ +++ ++ L L + T+ A+ A+ L ES + K ++ Sbjct: 144 PLTADMDAFEALLGMASPDLIEAQGTDFRSALELAHNVLEPSSESRLASAAKGE--KIMV 201 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS--------------------- 348 ++DGE+ + ++N + ++++ V Sbjct: 202 LLSDGEDHAG-------DLPAAANRIKNGRIHVFAIGVGLIRPAVIPLGEGGAGLKMDER 254 Query: 349 ----APPEGQDLLRKCTDSSGQFFAVNDSRELLE--SFDKITDKIQ 388 + L+ SG F+ L E + +++ +I Sbjct: 255 GRVVTTSFRPETLQNLARQSGGFYF----HSLAERPVYTEVSARIN 296 >gi|78222187|ref|YP_383934.1| type IV pilin biogenesis protein [Geobacter metallireducens GS-15] gi|78193442|gb|ABB31209.1| type IV pilin biogenesis protein, putative [Geobacter metallireducens GS-15] Length = 1030 Score = 43.4 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 35/374 (9%), Positives = 88/374 (23%), Gaps = 16/374 (4%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 AA + ++ T K + AG + T Sbjct: 24 AAAPGIARADDTEIYSAGATMKPMVMIIMDNSGSMGDPVPYDNATAYAGTYTKDTVYQYT 83 Query: 95 KDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 N ++ Q + + + N + Sbjct: 84 CTARNKKGQCTATQWQVYTGVFTDQINRDGGVDVAGQDGIDDSSTALKTGNRLNYEALPT 143 Query: 155 DVSRSMEDLYLQKHNDNNNMTSN-KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 S L D + ++ + A + Sbjct: 144 TSKLSTAKGVLNNLVDLMYNDVDFGFMKFNTEDGGNIISKIGATITAMHGQISAINATTW 203 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 A L ++ + + + T + + + Sbjct: 204 TPLAEALTDAGKYFEDTYTGQYSPWNSNNWCQKAFIIIVTDGEPTHDTDTTIIGHFLDRG 263 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG--ENSGASAYQNTLNTLQI 331 T A ++ + + + ++ +T +++ D ++ + + + Sbjct: 264 QTGITDANRG--QKWDQDGDYNMHSTTTTDPLNNDVWVADDTYSDTVPQTFLDDVAKYLY 321 Query: 332 CEYMR-----NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE----SFDK 382 +R + Y++ + L R + G +F N++ EL + D Sbjct: 322 THDLRPDLQGTQNVTTYTIGY--AHDSPLLQRTAQNGGGLYFTANNAAELEHSLLMALDD 379 Query: 383 ITDKIQEQSVRIAP 396 I K+Q + + P Sbjct: 380 IAKKLQTYTAPVVP 393 >gi|297666856|ref|XP_002811720.1| PREDICTED: von Willebrand factor A domain-containing protein 3B-like [Pongo abelii] Length = 968 Score = 43.4 bits (100), Expect = 0.068, Method: Composition-based stats. Identities = 21/170 (12%), Positives = 56/170 (32%), Gaps = 30/170 (17%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 K+D++ + + K + V+ A + ++L + Sbjct: 376 SHSMKSKLDLVKDKIIQFIQEQLKYKSKFN--FVKFDGQAVAWREQLAEVNE--DSLEQA 431 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 +S + + +TNT A+ A+ + + + + +TDG Sbjct: 432 QSWIRDMKIGSSTNTLSALKTAF---------------ADKETQAIYLLTDGRPDQP--- 473 Query: 323 QNTLNTLQICEYMRN-AGMKIYSVAVSAPPE-GQDLLRKCTD-SSGQFFA 369 + + ++ + IY+++ + E L++ + G+F Sbjct: 474 -----PETVIDQVKLFQEIPIYTISFNYNDEIANRFLKEVAALTGGEFHF 518 >gi|297279798|ref|XP_002801794.1| PREDICTED: integrin alpha-10-like isoform 4 [Macaca mulatta] gi|297279800|ref|XP_002801795.1| PREDICTED: integrin alpha-10-like isoform 5 [Macaca mulatta] Length = 1024 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 64/215 (29%), Gaps = 25/215 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + K + + +++G + Y V Sbjct: 20 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHEWSLG 77 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 78 DFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLLVVVT 132 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 133 DGESHDGEELP---AALKTCEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDP 186 Query: 365 G--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 187 DERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 221 >gi|297279792|ref|XP_002801791.1| PREDICTED: integrin alpha-10-like isoform 1 [Macaca mulatta] gi|297279794|ref|XP_002801792.1| PREDICTED: integrin alpha-10-like isoform 2 [Macaca mulatta] Length = 1167 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 64/215 (29%), Gaps = 25/215 (11%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + K + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 221 DFRTKEEVVRAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELP---AALKTCEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 364 >gi|296269618|ref|YP_003652250.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] gi|296092405|gb|ADG88357.1| von Willebrand factor type A [Thermobispora bispora DSM 43833] Length = 223 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 26/153 (16%), Positives = 54/153 (35%), Gaps = 15/153 (9%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 PLS +LN+V S + +L P TN A + + + + G + Sbjct: 55 SDRAEVLLPLS-DLNDVHS-IPQLAPRGGTNYGAAFALLKSTIEQDVQ-ALKQAGHRPYR 111 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY-SV-AVSAPPEGQDLLRKCTDS 363 V F+TDG+ + + + ++G + ++ A L++ Sbjct: 112 PCVFFLTDGQPTYEWHQEY--------RSLTDSGFPPHPTILAFGFGDVDATTLQQVAT- 162 Query: 364 SGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 + F ND ++ + ++ V+ A Sbjct: 163 -FRAFIANDDISPAQALREFAKQLLNSVVQSAV 194 >gi|149928031|ref|ZP_01916279.1| hypothetical protein LMED105_14810 [Limnobacter sp. MED105] gi|149823219|gb|EDM82455.1| hypothetical protein LMED105_14810 [Limnobacter sp. MED105] Length = 543 Score = 43.4 bits (100), Expect = 0.069, Method: Composition-based stats. Identities = 10/136 (7%), Positives = 32/136 (23%), Gaps = 8/136 (5%) Query: 1 MTAIIISVCFLFIT-YAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQ 59 +T ++ + + A+ H+ + + ++Q D L+ + + + Sbjct: 18 LTVFVLIMSLGALGVLAV--GHMAWEKTRLQGVADLVALTAARQMSNGPEFAEAQAIALA 75 Query: 60 TSTIFKKQIK-----KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIP 114 + + +N + ++A Sbjct: 76 NGVSEDDDLTIECIIDDAPTNDCDNSITSRVTIVRPVNGLLVFLPNREVTVLAEATVAPT 135 Query: 115 TENLFLKGLIPSALTN 130 GL+ Sbjct: 136 VVGSVSSGLVSVDTNQ 151 >gi|312073180|ref|XP_003139404.1| hypothetical protein LOAG_03819 [Loa loa] gi|307765434|gb|EFO24668.1| hypothetical protein LOAG_03819 [Loa loa] Length = 444 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 27/189 (14%), Positives = 60/189 (31%), Gaps = 30/189 (15%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + K DV I+ L+ S+ N + Y G + + Sbjct: 272 SGGVSDKRDVYIDFVSILIRSLD------LNRTSAHVAAIYYSGPKRARTLFHLRKHSRT 325 Query: 263 KSRLNKL----NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + + L + T T A+++A E + + K +I TDG + Sbjct: 326 EEAIKDLHQAPSNGGTTRTGEAIYYAINEFSEKFGARKGAK------KMIIIFTDGYSQD 379 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-----QFF-AVND 372 N + G+++ +V+V D + + + +N+ Sbjct: 380 --------NPAEASRAAHIKGIELKAVSVEDENVPPDTKQIIAITGDPSERLNYKEDINN 431 Query: 373 SRELLESFD 381 + E+++ + Sbjct: 432 NDEVIQFLE 440 >gi|60551291|gb|AAH91051.1| Clca1 protein [Xenopus (Silurana) tropicalis] Length = 937 Score = 43.4 bits (100), Expect = 0.070, Method: Composition-based stats. Identities = 34/260 (13%), Positives = 75/260 (28%), Gaps = 35/260 (13%) Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN--MTSNKYLLPPPPKK 187 L + + + + + + + + + SN +P Sbjct: 242 ALQSVTEFCDKNHNTEAPTLQNRICNSRSTWDVIMNSTDIKSTPPQADSNIPAVPSFSLL 301 Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 + T +I L ++A V I + + + Sbjct: 302 QSSERVVTLVLDVSGSMGGGNRIGRLYQAAEVFVMQIVEMGSYVG-------IVQFESTA 354 Query: 248 VGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 L + ++RL L TN + + + + Sbjct: 355 SVRSSL-LQIVDDTQRNRLKSLLPKTATGGTNICAGIREGIKV--------NKKYDGSSY 405 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-S 363 ++ +TDGE++ A++ T N+G I+ +A+ P L D + Sbjct: 406 STELVLLTDGEDNYATSLCFPDVT--------NSGSIIHVIALG--PNAAKELETIVDMT 455 Query: 364 SGQFFAVN---DSRELLESF 380 G F D++ L+++F Sbjct: 456 GGLRFLATDKVDAQGLIDAF 475 >gi|326500868|dbj|BAJ95100.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 752 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 33/314 (10%), Positives = 79/314 (25%), Gaps = 37/314 (11%) Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 + K +Q S E+ + + + + Sbjct: 224 LPKIFTKKEKIQLTVNSGVSKEVLLQGTSHPLKEKIRQAEKLFFLHEAVVENWSIKDFNF 283 Query: 151 CMVL---DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN 207 + DVS + + ++ + L P ++ Sbjct: 284 SYSVYSGDVSGGVLVQRSTLRDYDDRDIFS-IFLLPGNNQNRKIFRKAVVFIVDTSGSMQ 342 Query: 208 RK-IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 K I+ + + V+ +++ + +N + N +S + Sbjct: 343 GKPIENVKNAISTAVSELEEGDY--------FNIVTFNDELHSFSSCLEKVNGKTTESAI 394 Query: 267 NKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + +N T+ + A L N + + +TDG Sbjct: 395 DWMNSKFVAQGGTDIMHPLTEALALLSNSHGALPQ----------IFLVTDGSVEDERNI 444 Query: 323 QNTLNTLQICEYMRNAG-M--KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN-DSRELLE 378 +T+ T + N G + +I + + + LR + D+ + Sbjct: 445 CHTVKT-----ELTNKGSISPRISTFGLGSYCN-HYFLRMLASIGKGHYDAAFDTGSIEG 498 Query: 379 SFDKITDKIQEQSV 392 + K V Sbjct: 499 RMLQWFQKASSTIV 512 >gi|282892469|ref|ZP_06300803.1| hypothetical protein pah_c260o014 [Parachlamydia acanthamoebae str. Hall's coccus] gi|281497751|gb|EFB40115.1| hypothetical protein pah_c260o014 [Parachlamydia acanthamoebae str. Hall's coccus] Length = 364 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 20/171 (11%), Positives = 62/171 (36%), Gaps = 27/171 (15%) Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 + + +P + + ++ + L E T ++ +A + + E Sbjct: 145 WGFAGQATRLSPATMDALFLRLMIRDLQINEGNVTGTSLINAVKAIQKEISELPQDRKL- 203 Query: 303 RLKKFVIFITDGENSGASAYQ-NTLNTLQICEYMR---NAGMKIYSVAVSA--------- 349 + ++DGE++ + + N + + ++ N + IY++ + + Sbjct: 204 ----VAVLLSDGEDTENISAEEKAKNLRVLLDDLKTKFNDRLTIYTIGIGSREGGEIPDV 259 Query: 350 --------PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 L++ +SSG++ + + E I K++++++ Sbjct: 260 LEQGQRIHSKRDDTWLKQIGESSGEYIIADQESSI-EIAQDILSKMKKKNL 309 >gi|192359982|ref|YP_001981471.1| von Willebrand factor type A domain-containing protein [Cellvibrio japonicus Ueda107] gi|190686147|gb|ACE83825.1| von Willebrand factor type A domain protein [Cellvibrio japonicus Ueda107] Length = 2103 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 51/387 (13%), Positives = 122/387 (31%), Gaps = 65/387 (16%) Query: 17 IDLAHIMYIRNQMQSAL-DAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQG 75 IDL +++ + + + + D AVL+ S + + + I + + Sbjct: 726 IDLGNVLMRKPGIDALIPDVAVLAIDRSALVSDVASFTASGTIDVTLINRGNASVEIPFI 785 Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 Y E+ + N D + L +P + + T++++ Sbjct: 786 IYAFEDLNN-------NGVYDAADTLLGQVSPVISSALPLT------VDGTLNTSIAVSG 832 Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKN-T 194 + ++ + VL N S+ L +K Sbjct: 833 NLSFRDAPISILLDATNVL------------VELSKANNLSSTAGLCSNQQKPNVDLALC 880 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 S + + A +++ ++ N+V ++ +VR+ + + Sbjct: 881 MDSSGSVSAANFQLQLEGTAQAIENVV---------PRDGTVRVSALQFASAATVELNPT 931 Query: 255 L--SNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + +N+ + ++ + T+ + + A + N +S + + Sbjct: 932 IIEEDNVQAIADKIRAIRKTGGGTSIHACIDSATTLIANALPAS--------SMQIIDVS 983 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKI-YSVAVSAPPEGQDLLRKCT------DSS 364 TDG+++ + + R AG+ + ++ V LL Sbjct: 984 TDGQSTQS-------QAVAASNRARAAGIDVLNAIGVGTGI-STALLNAIVFPQPVGGDR 1035 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQS 391 G V + +E D IT+KIQ ++ Sbjct: 1036 GFVLTVRNYQE---YMDGITNKIQRET 1059 >gi|160900629|ref|YP_001566211.1| FG-GAP repeat-containing protein [Delftia acidovorans SPH-1] gi|160366213|gb|ABX37826.1| FG-GAP repeat protein [Delftia acidovorans SPH-1] Length = 1182 Score = 43.4 bits (100), Expect = 0.071, Method: Composition-based stats. Identities = 34/233 (14%), Positives = 61/233 (26%), Gaps = 18/233 (7%) Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + + S N A + + Sbjct: 214 YHIMMTDGRWNGTATGVPDSTRRDNATNLTLSDNMVYGGSGAADQK-KSALYRDTSTGTT 272 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT-PLSNNLNEVKSRLNKLNPYENTNTY 278 L + + + GT+ +N+ + NP N T+ Sbjct: 273 LADWAFYSWATPMQTTGLTGTMQPTADYRKAPSVETFTNDSGSSVTLDRYWNPRYNPATW 332 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 P M Y +++ + TR + V F DG + T L R Sbjct: 333 PHMVT-YTIGFSKMAFDWSYPSITRPTQMVPFGYDGSFPDLAKGSTTWPNLTASTESR-- 389 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 DL + G+F+AV +L ++F +I +I Q+ Sbjct: 390 -------------NALDLWHAALNGRGRFYAVEKGEDLEKAFREIFGQINTQT 429 >gi|123444851|ref|XP_001311192.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121892992|gb|EAX98262.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 698 Score = 43.4 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 30/317 (9%), Positives = 77/317 (24%), Gaps = 46/317 (14%) Query: 97 KNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDV 156 N Y + ++ ++ + + E +++ L+ Sbjct: 131 DLNENGYFYKFPLTHKYQKGSVTNDYSDKPETFHFATTIKTQKEIQDVKVSVQGNKNLND 190 Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 + + + + + + + Y Sbjct: 191 PHNATFVTNEAPTKDAIIIEAQIKDEDKNVAVSSDGYIAVTTYPFFEGSIES-----NSE 245 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGT----------------IAYNIGIVGNQCTPLSNNLN 260 +V+ ++ +V+ V C N+ Sbjct: 246 FYFVVDCSGSMAGKRIENAVKCMRLFIQSLPVGCRFAILKFGSQFQTVLEPCDYTDENVA 305 Query: 261 EVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 L+ + T+ + H K + F+TDGE Sbjct: 306 RAMKLLDTIKADMGGTDILSPLQHVSEL-----------KAKEGFVKQIFFLTDGEVH-- 352 Query: 320 SAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSREL 376 N IC + G +I+S+ + + L++ S G + + D + Sbjct: 353 -------NPDMICATAQKNRSGNRIFSIGLGSGA-DPGLIKGMARKSGGNYSIIGDDDNM 404 Query: 377 LESFDKITDKIQEQSVR 393 E ++ +++ Sbjct: 405 NEKVIEMLSSAISPALK 421 >gi|58429531|gb|AAW78169.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 551 Score = 43.4 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 S + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LSIIKSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429491|gb|AAW78149.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 43.4 bits (100), Expect = 0.072, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 S + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LSIIKSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|269128710|ref|YP_003302080.1| Vault protein inter-alpha-trypsin domain-containing protein [Thermomonospora curvata DSM 43183] gi|268313668|gb|ACZ00043.1| Vault protein inter-alpha-trypsin domain protein [Thermomonospora curvata DSM 43183] Length = 795 Score = 43.4 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 18/130 (13%), Positives = 38/130 (29%), Gaps = 20/130 (15%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L L T + L + + ++ ITDG+ Sbjct: 370 HLAGLQARGGTELAAPLREGAALLDDAGR-----------DRVLVLITDGQVGNEDQLLA 418 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKI 383 ++ G++I++V + L + + G+ V L E+ + I Sbjct: 419 LIDPFL-------NGLRIHAVGIDQ-AVNAGFLGRLATAGQGRLELVESEDRLDEAMEHI 470 Query: 384 TDKIQEQSVR 393 +I + Sbjct: 471 HHRINAPLLT 480 >gi|300022610|ref|YP_003755221.1| von Willebrand factor A [Hyphomicrobium denitrificans ATCC 51888] gi|299524431|gb|ADJ22900.1| von Willebrand factor type A [Hyphomicrobium denitrificans ATCC 51888] Length = 638 Score = 43.4 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 25/190 (13%), Positives = 56/190 (29%), Gaps = 25/190 (13%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 S + KIDV+ E ++ + V G Sbjct: 39 GSMWGRLAPDNKPKIDVVREKLATILQTPSSTRV----GLVSFGHRRRGDCNDVELIASP 94 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + L KLNP A+ A + + + +I + DG Sbjct: 95 DSERAALLGPLAKLNPRGPGPVTAALEIAADAIGTSRPAQ------------IIIVGDGA 142 Query: 316 NSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVND 372 ++ ++ + G+ + + + P + + + G+++ V D Sbjct: 143 DN------CQQDSCAAANDFAKSAPGVAVQVIGIGVPATERPRIACVAQATGGRYYDVTD 196 Query: 373 SRELLESFDK 382 + L + D+ Sbjct: 197 AAGLNAALDE 206 >gi|74217197|dbj|BAC31374.2| unnamed protein product [Mus musculus] Length = 338 Score = 43.4 bits (100), Expect = 0.074, Method: Composition-based stats. Identities = 30/220 (13%), Positives = 68/220 (30%), Gaps = 18/220 (8%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P+K+ N D L+ V +Q + L + Sbjct: 45 NNCPEKADCPVNVYFVLDTSESVAMQSPTDSLLYHMQQFVPQFISQLQNEFYLDQVALSW 104 Query: 242 AY---NIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHN 297 Y + +P ++ L + T T A+ + +++ Sbjct: 105 RYGGLHFSDQVEVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQI--------R 156 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 + F + ITDG +G+ + E R G+++++VA + L Sbjct: 157 QHVGKGVVNFAVVITDGHVTGSPCGGIKMQ----AERAREEGIRLFAVA-PNRNLNEQGL 211 Query: 358 RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 R +S + + N+ + +I + +++ + Sbjct: 212 RDIANSPHELYR-NNYATMRPDSTEIDQDTINRIIKVMKH 250 >gi|221066731|ref|ZP_03542836.1| membrane protein-like protein [Comamonas testosteroni KF-1] gi|220711754|gb|EED67122.1| membrane protein-like protein [Comamonas testosteroni KF-1] Length = 723 Score = 43.0 bits (99), Expect = 0.075, Method: Composition-based stats. Identities = 12/48 (25%), Positives = 23/48 (47%), Gaps = 1/48 (2%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRT 49 A++ V F + +DL ++ Y + +Q D A L SI +++ Sbjct: 3 FALVAGVIFSMLGV-VDLGYMYYAKRNLQRIADLAALEAAQSINAEQD 49 >gi|220923701|ref|YP_002499003.1| hypothetical protein Mnod_3796 [Methylobacterium nodulans ORS 2060] gi|219948308|gb|ACL58700.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 439 Score = 43.0 bits (99), Expect = 0.075, Method: Composition-based stats. Identities = 18/227 (7%), Positives = 54/227 (23%), Gaps = 15/227 (6%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALD-AAVLSGCASIVSDRTIKDPTTKKDQ 59 + A+ V + + + + ++Q A D +A +G D + + Sbjct: 29 IAALAFPVVIGGMGLGAETGYWYLTQRKLQHAADLSAHAAGVRKRAGDPKSQIDAAALNI 88 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 ++ S + + + + P + Sbjct: 89 ALNSGMSSSLGNMLANSPPTSGI-KAGDTSSLEVILTEVRPR-------------LFSSV 134 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 S + L+ + + +S S +N+ ++ + Sbjct: 135 FSSEPVSIRARAVASIVAGSQACVLALSPAASGAVTLSGSTVVNLKGCDIASNSTAADAF 194 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 L+ ++ + + SA + + Sbjct: 195 LMSGSSAALSAGCASSVGGAVTTIQLKLTQCATVKTSAPVVRDPYAS 241 >gi|311978223|ref|YP_003987343.1| putative ariadne-like ring finger protein [Acanthamoeba polyphaga mimivirus] gi|82000061|sp|Q5UQ35|YR811_MIMIV RecName: Full=Putative ariadne-like RING finger protein R811 gi|55417421|gb|AAV51071.1| unknown [Acanthamoeba polyphaga mimivirus] gi|308205062|gb|ADO18863.1| putative ariadne-like ring finger protein [Acanthamoeba polyphaga mimivirus] Length = 990 Score = 43.0 bits (99), Expect = 0.076, Method: Composition-based stats. Identities = 18/175 (10%), Positives = 51/175 (29%), Gaps = 18/175 (10%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSN--NLNEVKSRLNKLNPYENTNTYPAMHHAY 285 I + N+ I I Y ++++ + L +T A A Sbjct: 36 IIDITNVIQNINIIMYRDYCDSVITASSGWVSKIDDLIPFIRGLRASGGGDTPEAGKTAA 95 Query: 286 RELYNE-KESSHNTIGSTRLKKFVIFITDG---ENSGASAYQNTLNTLQICEYMRNAGMK 341 L + K ++ + D E + + +++C+ + + Sbjct: 96 NNLLDVVKNNTIVIWYADAPPHHKSNARDNFAREINTLIGSDKIFDWIELCDTLAARNII 155 Query: 342 IYSVAVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESFDKITDKIQEQSVRI 394 +Y + ++ + G+ + + E I + ++++ Sbjct: 156 VYPI-INRHHFDTSSFYVAMSTITGGKTLYLESTNE---------KIITQTTIKL 200 >gi|19554206|ref|NP_602208.1| hypothetical protein NCgl2910 [Corynebacterium glutamicum ATCC 13032] gi|62391861|ref|YP_227263.1| hypothetical protein cg3341 [Corynebacterium glutamicum ATCC 13032] gi|21325786|dbj|BAC00407.1| Hypothetical protein [Corynebacterium glutamicum ATCC 13032] gi|41223008|emb|CAF18953.1| putative membrane protein [Corynebacterium glutamicum ATCC 13032] Length = 501 Score = 43.0 bits (99), Expect = 0.076, Method: Composition-based stats. Identities = 9/88 (10%), Positives = 29/88 (32%), Gaps = 8/88 (9%) Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQDLLRK 359 ++ ++DG + ++ + + G+ I +V + + L Sbjct: 3 EGQSGTIVLVSDGIAT-----CTPPPVCEVAAELADQGVDLVINTVGFNVDESARAELEC 57 Query: 360 CTDSS-GQFFAVNDSRELLESFDKITDK 386 + G + +D+ L+ + + Sbjct: 58 IAQAGNGTYADASDADSLVAELKRAATR 85 >gi|308238185|ref|NP_001184129.1| chloride channel accessory 1 [Xenopus (Silurana) tropicalis] Length = 933 Score = 43.0 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 34/260 (13%), Positives = 75/260 (28%), Gaps = 35/260 (13%) Query: 130 NLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN--MTSNKYLLPPPPKK 187 L + + + + + + + + + SN +P Sbjct: 238 ALQSVTEFCDKNHNTEAPTLQNRICNSRSTWDVIMNSTDIKSTPPQADSNIPAVPSFSLL 297 Query: 188 SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 + T +I L ++A V I + + + Sbjct: 298 QSSERVVTLVLDVSGSMGGGNRIGRLYQAAEVFVMQIVEMGSYVG-------IVQFESTA 350 Query: 248 VGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 L + ++RL L TN + + + + Sbjct: 351 SVRSSL-LQIVDDTQRNRLKSLLPKTATGGTNICAGIREGIKV--------NKKYDGSSY 401 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-S 363 ++ +TDGE++ A++ T N+G I+ +A+ P L D + Sbjct: 402 STELVLLTDGEDNYATSLCFPDVT--------NSGSIIHVIALG--PNAAKELETIVDMT 451 Query: 364 SGQFFAVN---DSRELLESF 380 G F D++ L+++F Sbjct: 452 GGLRFLATDKVDAQGLIDAF 471 >gi|296141040|ref|YP_003648283.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] gi|296029174|gb|ADG79944.1| von Willebrand factor type A [Tsukamurella paurometabola DSM 20162] Length = 527 Score = 43.0 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 40/386 (10%), Positives = 98/386 (25%), Gaps = 34/386 (8%) Query: 29 MQSALDAAVLS----GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYI----RE 80 +Q+A AV + A + + + Q T Q + K+ + Sbjct: 150 LQAATIRAVPADSPYATAPVTAGVSEAQQPGANRQALTAALAQYAQAAKRVDDDPVRSAK 209 Query: 81 NAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE 140 G + + K + A + L T + + Sbjct: 210 AQGGVVVVPEYAYLAAKKDEPGVSAVVPKSGAPRDDLLLTVTAGGDRATAAKTGADTLAA 269 Query: 141 RSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN-KYLLPPPPKKSFWSKNTTKSK- 198 + I + ++++ + Sbjct: 270 AFASEKGIVALGEAGLRGKDLSPAPPDGIGKVAGLPEPNTDELTKAEQAYATLAVPLKAL 329 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQ---KAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + + + G L + I + + + +I + P Sbjct: 330 VVVDTSGSMNESAGDTTRIGMLASGFTKVVTQIPDANAVGLWTFSIGSATRPDWTEVVPT 389 Query: 256 S------NNLNEVKSRLNKLNP-----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 + + ++ ++ L+ +N T Y AYR + + Sbjct: 390 ARLDARRGDKSQRQALLDGVNALPRKVGGATGLYDTTLAAYRRAVENFDPA--------Y 441 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMR-NAGMKIYSVAVSAPPEGQDLLRKCTDS 363 +I +TDG + Q+ + + I++V +S + L R + Sbjct: 442 SNSLILLTDGSDEKPGGMSLDDLVAQLRTLVDPARPVNIHTVGISKDADLPALKRIADAT 501 Query: 364 SGQFFAVNDSRE-LLESFDKITDKIQ 388 G + ++ L + I + + Sbjct: 502 GGTAQEADSEQQMLTDFVTAIAKRAK 527 >gi|326929209|ref|XP_003210761.1| PREDICTED: von Willebrand factor A domain-containing protein 3A-like, partial [Meleagris gallopavo] Length = 1180 Score = 43.0 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 18/104 (17%), Positives = 40/104 (38%), Gaps = 21/104 (20%) Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++K + + NT A+ A S + + + +TDG+ Sbjct: 947 VSKFHAHGNTCILMALQKAL---------------SFQGVEALYILTDGKPDT----SCN 987 Query: 326 LNTLQICEYMRNAGMKIYSVAVS-APPEGQDLLRKCTD-SSGQF 367 L +I + +KI++++ S E + L+K + G++ Sbjct: 988 LILKEIERLRKQQDIKIHTISFSCVDREANEFLKKLASQTGGRY 1031 >gi|290976237|ref|XP_002670847.1| vWFA domain-containing protein [Naegleria gruberi] gi|284084410|gb|EFC38103.1| vWFA domain-containing protein [Naegleria gruberi] Length = 389 Score = 43.0 bits (99), Expect = 0.077, Method: Composition-based stats. Identities = 19/179 (10%), Positives = 57/179 (31%), Gaps = 25/179 (13%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI--GIVG 249 + +I+V + +++++ + K +R ++Y Sbjct: 77 NIVDLVIVMDCTGSMSGEIEVAKRTVTTIISTLHE----KFQSDLRFSAVSYRDHTDDYA 132 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + P + +LN+ K +N ++ + A+ A + + K V+ Sbjct: 133 VKEFPFTKDLNKAKGYINTMSAQGGGDHPEALASALYVINEMPFNKKGK-------KIVV 185 Query: 310 FITDGENSGASAYQNTL------------NTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 ++ D G ++ + +++ ++ + Y + + Q L Sbjct: 186 WVADAPPHGMKTSSDSYPEGCKDQQGNVIDWIKLGSALQEKNVVFYGILCERAKDDQQL 244 >gi|27378609|ref|NP_770138.1| hypothetical protein blr3498 [Bradyrhizobium japonicum USDA 110] gi|27351757|dbj|BAC48763.1| blr3498 [Bradyrhizobium japonicum USDA 110] Length = 445 Score = 43.0 bits (99), Expect = 0.078, Method: Composition-based stats. Identities = 27/279 (9%), Positives = 67/279 (24%), Gaps = 20/279 (7%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASI--VSDRTIKDPTTKKDQ 59 T I + + + Q+A D+A SG + + Sbjct: 16 TVIALVPLIGAVALGAEAGSWYVTHQHAQNAADSAAYSGALRLSCTMAGAACGTQSVDYL 75 Query: 60 TSTIFKKQIKKHLKQGSY--------IRENAGDIAQKAQINITKDKNNPLQY----IAES 107 + + I++ QI+I + Sbjct: 76 AKEFAAQNGFCNSSPQDSTPYPGTQCAPSLPNRISRAVQIDIGTYTAGTFTTPPAGTGNA 135 Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAIS-ICMVLDVSRSMEDLYLQ 166 + +L ++ N+ ++ ++++ ++ A++ + + Sbjct: 136 VRARVSQQQPAYLAAVLGLTTVNIPAQAIALVQQPTKACALALGPDSGALKLAGNLSNNG 195 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKN-----TTKSKYAPAPAPANRKIDVLIESAGNLV 221 + TS + P S W+ T + P N + L Sbjct: 196 TGCALMSDTSVQLASTPSFTGSGWAVYGVSGCTPSGTCSNISVPHNYFMTYANNPLRKLD 255 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + S + + + T N Sbjct: 256 TESFNSRTGNTSPSCPVQADGWKHCAPNSAGTGAYGNFT 294 >gi|322433071|ref|YP_004210320.1| hypothetical protein AciX9_4225 [Acidobacterium sp. MP5ACTX9] gi|321165298|gb|ADW71002.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 457 Score = 43.0 bits (99), Expect = 0.079, Method: Composition-based stats. Identities = 19/141 (13%), Positives = 42/141 (29%), Gaps = 3/141 (2%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+ + V F+ +ID+ H + ++QSA D A + I ++ + Sbjct: 19 AAVSLLVILGFLGISIDVGHRQLSKLRLQSATDTAATAAALEIRVCGSLVSCPAMQSAVQ 78 Query: 62 TIF--KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 + F L + + + + + Q + Sbjct: 79 SSFIENGYPATPLLLNCATSSDDLTLVLNSPPCAMGSADPNYGKRGYVEVQVA-QHVPTY 137 Query: 120 LKGLIPSALTNLSLRSTGIIE 140 ++ + N+S RS Sbjct: 138 FMKMLGISQFNISARSEAARN 158 >gi|149919617|ref|ZP_01908096.1| putative outer membrane adhesin like protein [Plesiocystis pacifica SIR-1] gi|149819560|gb|EDM78988.1| putative outer membrane adhesin like protein [Plesiocystis pacifica SIR-1] Length = 1168 Score = 43.0 bits (99), Expect = 0.079, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 35/109 (32%), Gaps = 11/109 (10%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + ++ L + P +T A+ A L + N+ + + ITDGE + Sbjct: 659 DSTAIEEYLEDIEPDGSTAIGDAIDAAVAALMAHDDLDPNSSNN----NAIFLITDGEQT 714 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 + E + +Y +A +D + G+ Sbjct: 715 SG-----DKDVCDALEDAAKDDVPVY-IA-PVGSFKEDGFACVEGTGGK 756 >gi|332534874|ref|ZP_08410696.1| inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis ANT/505] gi|332035673|gb|EGI72162.1| inter-alpha-trypsin inhibitor domain protein [Pseudoalteromonas haloplanktis ANT/505] Length = 676 Score = 43.0 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 55/164 (33%), Gaps = 21/164 (12%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPL---SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY 289 + I ++ + PL NL + + L T A+ + Sbjct: 357 DSDDSFNIIGFDNIVTPMSDKPLIASDFNLRRAERFIYSLEADGGTEIQGAL----NAVL 412 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + E V+F+TDG S A + + + ++ ++++V + + Sbjct: 413 DGSEFDGFVRQ-------VVFLTDGSVSNEDALFKNIQSK-----LGDS--RLFTVGIGS 458 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 P + R G F + + E+ ++ DK+ ++ Sbjct: 459 APNSFFMRRAADIGKGSFTFIGSTSEVQPKMQQLFDKLAHPAIT 502 >gi|325689280|gb|EGD31286.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK115] Length = 471 Score = 43.0 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 30/243 (12%), Positives = 64/243 (26%), Gaps = 54/243 (22%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y+ P +ID+L E ++ +Q NL ++ L Sbjct: 225 NPYSNNPLKNRSRIDILREKTKKMMADLQPIGNVSVNLVQFNSHASFV----QQNFIELD 280 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK--------------------ESSH 296 L + S ++ LNP TN + + L + + Sbjct: 281 KGLTSINSAIDNLNPEHATNPGDGLRYGMVSLQSNAAQLKYVVLLTDGVPNSYMVGPQYN 340 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN------------------- 337 + F+ + +T Sbjct: 341 YWGNKIETRTRDGFVNQAYFNTWRYDLSTGFHSTRANSAFAGGNPSWAVDGSIEYSGEVA 400 Query: 338 ----AGMK-IYSVAVSAPPEGQDL--LRKCTDSSG----QFFAVNDSRELLESFDKITDK 386 G+K + + SA + G + V+D ++L ++F I + Sbjct: 401 KQFKKGIKRVNVIGFSAKEADKQQGSRLTAAIKEGVPETSYTDVSDDKQLEQTFADIKKQ 460 Query: 387 IQE 389 +++ Sbjct: 461 VEQ 463 >gi|118098040|ref|XP_414917.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 858 Score = 43.0 bits (99), Expect = 0.080, Method: Composition-based stats. Identities = 18/104 (17%), Positives = 41/104 (39%), Gaps = 21/104 (20%) Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++K + + NT+ A+ A S + + + +TDG+ Sbjct: 747 VSKFHAHGNTHILMALQKAL---------------SFQDVEALYILTDGKPDT----SCN 787 Query: 326 LNTLQICEYMRNAGMKIYSVAVSA-PPEGQDLLRKCTD-SSGQF 367 L +I + +KI++++ S E + L+K + G++ Sbjct: 788 LILKEIERLRKQQDIKIHTISFSYVDREANEFLKKLASQTGGRY 831 >gi|15669100|ref|NP_247905.1| magnesium chelatase [Methanocaldococcus jannaschii DSM 2661] gi|6226407|sp|Q58320|Y910_METJA RecName: Full=Uncharacterized protein MJ0910 gi|1591584|gb|AAB98912.1| magnesium chelatase [Methanocaldococcus jannaschii DSM 2661] Length = 174 Score = 43.0 bits (99), Expect = 0.081, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 63/197 (31%), Gaps = 36/197 (18%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 A R+++ + +L+ + +IG IA+ P ++++ + Sbjct: 1 MGAMRRMEAAKGAIISLL-------LDAYQKRNKIGMIAFR-KDKAELILPFTSSVELGE 52 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 L L T A +Y E + + +I I+D + + A Sbjct: 53 KLLKDLPTGGKTPLADAFIKSYEVFDREI------RKNPNIIPIMIVISDFKPNVAVKED 106 Query: 324 NTLNTLQICEYMRNAGMKI--------YSVAVSAPPEGQDLLRKCTDS-SGQFFAVNDSR 374 CE + G+ + + + ++ + +++ + Sbjct: 107 YVKEVFDACEKIAEKGINVILIDTEPQSFIKIGIG-------KEIANRFGFKYYKI---E 156 Query: 375 ELLESFDKITDKIQEQS 391 EL + DKI D I + Sbjct: 157 ELSK--DKILD-ICKSL 170 >gi|326671946|ref|XP_002663860.2| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3 [Danio rerio] Length = 1082 Score = 43.0 bits (99), Expect = 0.082, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 57/177 (32%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G S N + K L+K Sbjct: 266 RLTIARQTVASILDTLGD-DDFFNIIAYNQEIHYVEPCLNGTLVQADSTNKDHFKEHLDK 324 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A+ A+ L ++ + S + ++ ITDG N Sbjct: 325 LFAKGIGLLGNALSEAFTILNEINQTGRGSSCS----QAIMLITDGATEMYDDVFAKYNW 380 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 ++I+ + D L+ ++ +F + L + + + Sbjct: 381 P-------ERKVRIFPYLIGRESAFADNLKWMACANKGYF--SQISTLADVQENVMR 428 >gi|321472278|gb|EFX83248.1| hypothetical protein DAPPUDRAFT_195011 [Daphnia pulex] Length = 418 Score = 43.0 bits (99), Expect = 0.082, Method: Composition-based stats. Identities = 28/325 (8%), Positives = 79/325 (24%), Gaps = 17/325 (5%) Query: 4 IIISVCFLFITYAIDLAHIMYI--RNQM---QSALDAA---VLSGCASIVSDRTIKDPTT 55 I++ +++ +DL + + R +M Q+ LD A ++ ++ Sbjct: 64 IMVPP--AYVSLFVDLMYAYSMEYRVKMVDVQTVLDMARDEQVAVPRVGSPRYSVTWDNY 121 Query: 56 KKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQ--INITKDKNNPLQYIAESKAQYEI 113 T F +I + + + I+ + + Sbjct: 122 YDFNAITAFINEIAAAHPDKVTVSSIGKTFENRDMPLVKISTGGTGKKAIVVDGGIHARE 181 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 F+ LI + N + + + + + L + N N Sbjct: 182 WISPAFVTWLINELVENYAAHPQYVDNVDWYIMPVINPDGYQFTHDTNGDRLWRKNRKPN 241 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAP-----APANRKIDVLIESAGNLVNSIQKAI 228 + + + + ++ ++ + + ++ Sbjct: 242 AGIGGIPCIGTDMNRNFGFHWNEGGSSANGCSDTFHGGAAFSEIESQNVRDAILAVAGQA 301 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 + G + + S +NKL T + Sbjct: 302 EMYLTFHSYGQYWLTPWGYTSTLPSDYTQLYTLAVSAVNKLTAVYGTQYTIGSSTNVLYV 361 Query: 289 YNEKESSHNTIGSTRLKKFVIFITD 313 + G+ + + + D Sbjct: 362 ASGGSDDWAKGGAGIPFSYTVEMRD 386 >gi|114765751|ref|ZP_01444846.1| hypothetical protein 1100011001350_R2601_23570 [Pelagibaca bermudensis HTCC2601] gi|114541858|gb|EAU44894.1| hypothetical protein R2601_23570 [Roseovarius sp. HTCC2601] Length = 493 Score = 43.0 bits (99), Expect = 0.082, Method: Composition-based stats. Identities = 15/122 (12%), Positives = 38/122 (31%), Gaps = 5/122 (4%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 TA+++ + + DL + R+ +Q+ D A L + ++ + S Sbjct: 13 TAVLLPGILIGMAMLFDLLWLNNHRSHLQAQADMAALEAARYTGERPS----AVRQARVS 68 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + I + ++ + D P +++ + L + Sbjct: 69 VAVNDSFRAERLASRQIELGRWQDGSFSDMDAS-DPRRPNAARVTVRSEAKTNLTALLRR 127 Query: 122 GL 123 G Sbjct: 128 GF 129 >gi|308473880|ref|XP_003099163.1| CRE-CLEC-65 protein [Caenorhabditis remanei] gi|308267636|gb|EFP11589.1| CRE-CLEC-65 protein [Caenorhabditis remanei] Length = 376 Score = 43.0 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 23/163 (14%), Positives = 51/163 (31%), Gaps = 9/163 (5%) Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMH 282 I + ++ V T YN V + S ++ S++ +L P Sbjct: 66 QIGTGYTDPRSTRVGFITYNYNATDVADFYKLQSW--ADLDSQIQRLK------MTPLAR 117 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 ++ ++ N I ST + + + + ++ G+ + Sbjct: 118 TTLSKMDTALYAAINMINSTAGFRDNYKKMVIVFTSVHGSYQKNPPKDVSKILKARGIPV 177 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 +V + + Q L+ +A+ D E +TD Sbjct: 178 VTVNTGSSSDTQSYLKNIAS-DNMAYAMADGNATQEILKAMTD 219 >gi|150398881|ref|YP_001322648.1| von Willebrand factor type A [Methanococcus vannielii SB] gi|150011584|gb|ABR54036.1| von Willebrand factor type A [Methanococcus vannielii SB] Length = 272 Score = 43.0 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 17/124 (13%), Positives = 43/124 (34%), Gaps = 9/124 (7%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + K + + + + +P + + +K++LN L T A+ Sbjct: 129 VDKYTDDSYVKREYLSVVTFRGREALV-LSPFTKKYSLIKAQLNSLKTGGKTPLSAALKT 187 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--K 341 A + K+ + + I I+DG+ + ++ + ++ + K Sbjct: 188 ALKVFKQFKDK------NKKCTVEFILISDGKANVPLKENIKYEIEELSKAIQKRKISFK 241 Query: 342 IYSV 345 IY + Sbjct: 242 IYDI 245 >gi|169624118|ref|XP_001805465.1| hypothetical protein SNOG_15311 [Phaeosphaeria nodorum SN15] gi|111056124|gb|EAT77244.1| hypothetical protein SNOG_15311 [Phaeosphaeria nodorum SN15] Length = 1587 Score = 43.0 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 31/176 (17%), Positives = 65/176 (36%), Gaps = 21/176 (11%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 P ++DVL + +N + N IG +A+ Q P++N + + Sbjct: 1189 PHLSRLDVLKQMFDAFINRLLA-----YNFQTHIGLVAFGSKASVAQ--PITNAVENFRH 1241 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 +LN + +T + ++ A +L E +I I+DGE++ ++ + Sbjct: 1242 KLNNMLASGDTAIWDSIALAQDQLQTYAEKYPTARL------RIICISDGEDTKSNQDKV 1295 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLES 379 + T ++C G + S + L+ + + G F E + Sbjct: 1296 NV-TSRLC----RNGTMLDS--FCLGDASNESLQIMSYLTKGYTFEPKTLEEAMAI 1344 >gi|47218628|emb|CAG04957.1| unnamed protein product [Tetraodon nigroviridis] Length = 314 Score = 43.0 bits (99), Expect = 0.084, Method: Composition-based stats. Identities = 25/201 (12%), Positives = 62/201 (30%), Gaps = 24/201 (11%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAI-QEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 A A ++++ ++ Q + + R + Y+ ++ Q Sbjct: 46 NGNSESAKDNHAQEKRFATDVMDRLRGLRLQTGRGFTSRAALLQYSSHVIIEQTFNQWRG 105 Query: 259 LNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 ++ K+R+ + T T A+ + R E K + DG Sbjct: 106 ADDFKARVAPMVYIGHGTYTTYAITNLTRIYLEESPLGSI--------KVAFLLFDG--- 154 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP---EGQDLLRKCTDSSGQFFAVNDSR 374 ++ + +N G++ +++ ++ LR S + Sbjct: 155 --ISHPRNPDIFSAVADAKNQGVQFFAIGITPEAKEPGNIAQLRLIASSPASRY----LH 208 Query: 375 ELLESFDKITDKIQEQSVRIA 395 L + I +K+ + +A Sbjct: 209 NLQD--KGIVEKVINEITAVA 227 >gi|253996767|ref|YP_003048831.1| hypothetical protein Mmol_1398 [Methylotenera mobilis JLW8] gi|253983446|gb|ACT48304.1| conserved hypothetical protein [Methylotenera mobilis JLW8] Length = 463 Score = 43.0 bits (99), Expect = 0.085, Method: Composition-based stats. Identities = 15/132 (11%), Positives = 38/132 (28%), Gaps = 3/132 (2%) Query: 18 DLAHIMYIRNQMQSALDAAVLSGCASIV---SDRTIKDPTTKKDQTSTIFKKQIKKHLKQ 74 DL H+ + ++Q+A D++ LSG + + + F Sbjct: 39 DLGHLYVTKTELQNAADSSSLSGARELNGKVTGINSAITRAIEAAGKNNFNLNSTAVTVN 98 Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 S + + + + K + T N + ++P Sbjct: 99 ASNMWVGSCPSDGCMVPISSVTTDALAGDKTFLKVDTGLRTINTWFIQVLPGVANTTQTF 158 Query: 135 STGIIERSSENL 146 + + + ++ Sbjct: 159 GMAVAGKYAVDI 170 >gi|269126097|ref|YP_003299467.1| TadE family protein [Thermomonospora curvata DSM 43183] gi|268311055|gb|ACY97429.1| TadE family protein [Thermomonospora curvata DSM 43183] Length = 138 Score = 43.0 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 12/68 (17%), Positives = 22/68 (32%), Gaps = 4/68 (5%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A ++ + + I ID + ++ AV G + V + DP T Sbjct: 31 MAFLLPIMLMLIFMIIDFGRAFNAQLRINE----AVRQGVRTAVLTQPPADPKTAAADIM 86 Query: 62 TIFKKQIK 69 T + Sbjct: 87 TTALGGLT 94 >gi|313675093|ref|YP_004053089.1| von willebrand factor type a [Marivirga tractuosa DSM 4126] gi|312941791|gb|ADR20981.1| von Willebrand factor type A [Marivirga tractuosa DSM 4126] Length = 461 Score = 43.0 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 22/187 (11%), Positives = 55/187 (29%), Gaps = 25/187 (13%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ---CTPLSNNLNE 261 +ID +LV+S++ + + L + NN + Sbjct: 49 GNELRIDAAKRVLTDLVDSLRVNNKVELALRPYGHLTPAKERNCQDTKLEIPFAPNNNDR 108 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + RL + P T ++ + ++ + + +I ITDG Sbjct: 109 IIDRLKYIYPRGTTPIAYSLEQSAKDFPKDN----------NYRNIIIIITDGIE----- 153 Query: 322 YQNTLNTLQICEYMRNAGMKI--YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + ++ + + + + + + G++F D Sbjct: 154 -SCDGDPCAVSLELQKKDIFLRPFVIGLGMEEKFAAEFECM----GEYFNAKDISAFQAV 208 Query: 380 FDKITDK 386 + I D+ Sbjct: 209 LNGILDQ 215 >gi|307943680|ref|ZP_07659024.1| von Willebrand factor type A domain-containing protein [Roseibium sp. TrichSKD4] gi|307773310|gb|EFO32527.1| von Willebrand factor type A domain-containing protein [Roseibium sp. TrichSKD4] Length = 746 Score = 43.0 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 20/179 (11%), Positives = 49/179 (27%), Gaps = 23/179 (12%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK---SRLNKLNPYENT 275 + S A + + + + + + + +K + + L+ T Sbjct: 360 DASKSFMHAALDGLRENDQFRILRFANNTSAFAKSAMPATRANIKAGKNFVTGLSARGGT 419 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 A++ A+ + V+F+TDG G T+ Sbjct: 420 EMNNAINAAFDL-----------PPVPGTMRIVVFLTDGYIGGDREVIQTVYDRIG---- 464 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVR 393 +I++ + + LL G+ + E+ + I + Sbjct: 465 ---NARIHAFGIG-KAINRYLLEGLAREGRGRVRYIEPGETGQEAAAALASSIDAPLLT 519 >gi|123438167|ref|XP_001309871.1| Ubiquitin-conjugating enzyme family protein [Trichomonas vaginalis G3] gi|121891616|gb|EAX96941.1| Ubiquitin-conjugating enzyme family protein [Trichomonas vaginalis G3] Length = 957 Score = 43.0 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 17/146 (11%), Positives = 51/146 (34%), Gaps = 8/146 (5%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN-PYENTNTYPAMHHAYRELYNEKESSH 296 N PL+ +E +N+ + ++T + A+ A + ++ E Sbjct: 494 TMWGLINFSSTVKTVLPLTAIASEFSMAVNEDSELGDDTKLFEAIKVASETITSKSEYF- 552 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + K ++ +TDG ++ + ++ + + + + + + + + Sbjct: 553 -----DNVYKRIVVVTDGIDNDNHYKSDES-LQKLTKILTDNKIILDVIFIDESDSRAAV 606 Query: 357 LRKCTDSSGQFFAVNDSRELLESFDK 382 + + T FF ++ + F Sbjct: 607 MSQATGGLAFFFKGSEQNLMESVFSS 632 >gi|79607904|ref|NP_974433.2| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645764|gb|AEE79285.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 632 Score = 43.0 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSR 265 K+ +L + G ++ ++ + R+ IA++ ++ Sbjct: 257 TKLALLKRAMGFVIQNLGSSD--------RLSVIAFSSTARRLFPLTRMSDAGRQLALQA 308 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N L TN + + + + E + +I ++DG ++ + + + Sbjct: 309 VNSLVANGGTNIVDGLRKGAKVMEDRLERNSVAS--------IILLSDGRDTYTTNHPDP 360 Query: 326 LNTLQICE 333 + + Sbjct: 361 SYKDALAQ 368 >gi|79315048|ref|NP_001030861.1| zinc finger (C3HC4-type RING finger) family protein [Arabidopsis thaliana] gi|332645765|gb|AEE79286.1| C3HC4-type RING finger protein [Arabidopsis thaliana] Length = 633 Score = 43.0 bits (99), Expect = 0.086, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 43/128 (33%), Gaps = 18/128 (14%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSR 265 K+ +L + G ++ ++ + R+ IA++ ++ Sbjct: 258 TKLALLKRAMGFVIQNLGSSD--------RLSVIAFSSTARRLFPLTRMSDAGRQLALQA 309 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +N L TN + + + + E + +I ++DG ++ + + + Sbjct: 310 VNSLVANGGTNIVDGLRKGAKVMEDRLERNSVAS--------IILLSDGRDTYTTNHPDP 361 Query: 326 LNTLQICE 333 + + Sbjct: 362 SYKDALAQ 369 >gi|332519332|ref|ZP_08395799.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] gi|332045180|gb|EGI81373.1| von Willebrand factor type A [Lacinutrix algicola 5H-3-7-4] Length = 345 Score = 43.0 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 16/101 (15%), Positives = 38/101 (37%), Gaps = 21/101 (20%) Query: 253 TPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++ + K L +N T A+ A ++++++ + + Sbjct: 144 LPITTDYASAKMFLQNMNTDMLSSQGTAINEAIELAKTYYDDDQQTN----------RVL 193 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + I+DGE+ +A N G++I+++ V Sbjct: 194 VIISDGEDHSEAAANVAEEAS-------NEGIRIFTIGVGD 227 >gi|262196282|ref|YP_003267491.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] gi|262079629|gb|ACY15598.1| von Willebrand factor type A [Haliangium ochraceum DSM 14365] Length = 340 Score = 43.0 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 17/116 (14%), Positives = 38/116 (32%), Gaps = 26/116 (22%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + + PY T PA+ A + ++ ++DG + Sbjct: 247 GRDALVHFVLSTEPYGPTALAPALRTALSM----------------NPRRLVLLSDGLGN 290 Query: 318 GASAYQNTLNTLQICEYMRNA---GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV 370 + + R A G++I ++ + + + LLR + SG + Sbjct: 291 VGG------DASAVLRDAREAMLGGVRIDTIGIGQAQDDR-LLRALAEESGGLYQA 339 >gi|111220185|ref|YP_710979.1| hypothetical protein FRAAL0704 [Frankia alni ACN14a] gi|111147717|emb|CAJ59375.1| hypothetical protein FRAAL0704 [Frankia alni ACN14a] Length = 711 Score = 43.0 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 21/132 (15%), Positives = 40/132 (30%), Gaps = 10/132 (7%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI----GSTRLKKFVIFITDG 314 +++ + L L P +T+ Y A REL + + DG Sbjct: 573 RDDLTASLAALKPGGSTDLYSTAVAAVRELTDRYAADRLNRVVLFTDGDADTGGDDHPDG 632 Query: 315 ENSGASAYQNTLNTLQICEYMRN------AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 E G A + TL +++ + + L R + G + Sbjct: 633 EQPGRRATASAEPTLDQAVSALRAAADPHRPVQLIVIGCDPGADLASLQRLAGATGGHAY 692 Query: 369 AVNDSRELLESF 380 D+ L + + Sbjct: 693 LAPDADALFDIY 704 >gi|4154309|gb|AAD04919.1| von Willebrand factor [Canis lupus familiaris] Length = 2813 Score = 43.0 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 15/147 (10%), Positives = 43/147 (29%), Gaps = 17/147 (11%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKES 294 ++ + Y + ++ + S ++ + + A+ A R + +E Sbjct: 1728 TQVSVLQYGSITTIDVPWNVAYEKVHLLSLVDLMQQEGGPSQIGDALSFAVRYVTSEVHG 1787 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + K VI +TD + + E R+ + ++ + + + Sbjct: 1788 ARPGAS----KAVVILVTD---------VSVDSVDAAAEAARSNRVTVFPIGIG-DRYSE 1833 Query: 355 DLLRKCTD--SSGQFFAVNDSRELLES 379 L + + +L Sbjct: 1834 AQLSSLAGPKAGSNMVRLQRIEDLPTV 1860 >gi|1478046|gb|AAB05549.1| von Willebrand factor [Canis lupus familiaris] Length = 2813 Score = 43.0 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 15/147 (10%), Positives = 43/147 (29%), Gaps = 17/147 (11%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKES 294 ++ + Y + ++ + S ++ + + A+ A R + +E Sbjct: 1728 TQVSVLQYGSITTIDVPWNVAYEKVHLLSLVDLMQQEGGPSQIGDALSFAVRYVTSEVHG 1787 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + K VI +TD + + E R+ + ++ + + + Sbjct: 1788 ARPGAS----KAVVILVTD---------VSVDSVDAAAEAARSNRVTVFPIGIG-DRYSE 1833 Query: 355 DLLRKCTD--SSGQFFAVNDSRELLES 379 L + + +L Sbjct: 1834 AQLSSLAGPKAGSNMVRLQRIEDLPTV 1860 >gi|50950127|ref|NP_001002932.1| von Willebrand factor precursor [Canis lupus familiaris] gi|40786760|gb|AAB93766.2| von Willebrand factor [Canis lupus familiaris] Length = 2813 Score = 43.0 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 15/147 (10%), Positives = 43/147 (29%), Gaps = 17/147 (11%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKES 294 ++ + Y + ++ + S ++ + + A+ A R + +E Sbjct: 1728 TQVSVLQYGSITTIDVPWNVAYEKVHLLSLVDLMQQEGGPSQIGDALSFAVRYVTSEVHG 1787 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + K VI +TD + + E R+ + ++ + + + Sbjct: 1788 ARPGAS----KAVVILVTD---------VSVDSVDAAAEAARSNRVTVFPIGIG-DRYSE 1833 Query: 355 DLLRKCTD--SSGQFFAVNDSRELLES 379 L + + +L Sbjct: 1834 AQLSSLAGPKAGSNMVRLQRIEDLPTV 1860 >gi|12644030|sp|Q28295|VWF_CANFA RecName: Full=von Willebrand factor; Short=vWF; Flags: Precursor Length = 2813 Score = 43.0 bits (99), Expect = 0.087, Method: Composition-based stats. Identities = 15/147 (10%), Positives = 43/147 (29%), Gaps = 17/147 (11%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKES 294 ++ + Y + ++ + S ++ + + A+ A R + +E Sbjct: 1728 TQVSVLQYGSITTIDVPWNVAYEKVHLLSLVDLMQQEGGPSQIGDALSFAVRYVTSEVHG 1787 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 + K VI +TD + + E R+ + ++ + + + Sbjct: 1788 ARPGAS----KAVVILVTD---------VSVDSVDAAAEAARSNRVTVFPIGIG-DRYSE 1833 Query: 355 DLLRKCTD--SSGQFFAVNDSRELLES 379 L + + +L Sbjct: 1834 AQLSSLAGPKAGSNMVRLQRIEDLPTV 1860 >gi|326789665|ref|YP_004307486.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] gi|326540429|gb|ADZ82288.1| von Willebrand factor type A [Clostridium lentocellum DSM 5427] Length = 564 Score = 43.0 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 15/104 (14%), Positives = 36/104 (34%), Gaps = 15/104 (14%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + + L T T+ M A + L EK ++ + + ++DGE + Sbjct: 450 RSLFTGAVMDLEASGGTATFDGMIVALKMLMEEKAANPDAKL------MLFVLSDGETNR 503 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 + + I ++ + +Y++ +A L + Sbjct: 504 GHSLND---IEGILRTLK---IPVYTIGYNANISA---LNNISS 538 >gi|257879128|ref|ZP_05658781.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] gi|257813356|gb|EEV42114.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,230,933] Length = 1258 Score = 43.0 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 32/371 (8%), Positives = 90/371 (24%), Gaps = 48/371 (12%) Query: 48 RTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES 107 + + +T++ + + I ++++ + Sbjct: 103 PSSTASDLHPEDATTLYNVYLDVIGGEKKEISPIDIVFVLDKSASMSELTAGTNSQTKNA 162 Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 I N K L+ ++ + +S + + D+ D+ Sbjct: 163 AL---IEAVNEMSKDLLSDPSLDIRI-GMVNFYHNSTAINNHEQISSDIFPLTNDINRLT 218 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKA 227 ++N + P T + + V+ + + ++ Sbjct: 219 GSENTALNRTPIGGTPLTLGLKNGYETLYKDNGGENRNPEKILIVVGDGTPTFSYAPIQS 278 Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRE 287 I + L N E NT+ H Sbjct: 279 SYRTSTNGAWSNWTVMEDKIAEDNDV-LFRNFEEF---------SGNTS-NAGFTHPVTY 327 Query: 288 LYNEKESSHNTIGSTRLKKFVI-------FITDGENSGASAYQNTLNTLQIC-------- 332 + R + ++ DG ++ + T Sbjct: 328 ASDFNRPEDEVNVHYRYGEVKEGDNKATHWVGDGSSNNNTNGSPTSQEKSSAINTVAYHH 387 Query: 333 ---EYMRNAGMKIYSVAVSAPPE----------GQDLLRKCTD--SSG---QFFAVNDSR 374 + I+S+ + G+++L+ D G +++ N+ Sbjct: 388 WLKNKYQENPPSIFSIGLGIDGNVSGRQRLDAIGRNVLKNIADLEEDGVTPRYYNANNKN 447 Query: 375 ELLESFDKITD 385 +++ + + I+ Sbjct: 448 DIVTALEDISS 458 >gi|269126092|ref|YP_003299462.1| type II secretion system protein [Thermomonospora curvata DSM 43183] gi|268311050|gb|ACY97424.1| type II secretion system protein [Thermomonospora curvata DSM 43183] Length = 660 Score = 43.0 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 22/134 (16%), Positives = 44/134 (32%), Gaps = 16/134 (11%) Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 G + + ++ L ++ T Y AM A L + Sbjct: 135 TGTPTRPVIDPTTDRRSLRKALGRMRTGGETALYDAMSAAVDRLARANAAE--------- 185 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK-CTDS 363 ++ ++DG++ +T Q+ ++ + VA + LR+ DS Sbjct: 186 -GRLVVLSDGKD-----SASTSTLAQVLARLKRTRIAADVVAFKTAATSEGTLRQLAADS 239 Query: 364 SGQFFAVNDSRELL 377 G+ + D R L Sbjct: 240 GGRLLSSPDPRRLN 253 >gi|163755380|ref|ZP_02162500.1| von Willebrand factor type A like domain [Kordia algicida OT-1] gi|161324800|gb|EDP96129.1| von Willebrand factor type A like domain [Kordia algicida OT-1] Length = 718 Score = 43.0 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 22/163 (13%), Positives = 48/163 (29%), Gaps = 20/163 (12%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP---YENTNTYPAMHHAYRELYN 290 + + + G PL+ ++ +N L T A+ AY Sbjct: 321 ETDHYNILLFAGGSSVLAPEPLACTKENIQKGINFLTNERGGGGTRLLNALKTAYAL--- 377 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + ++ ITDG S I + + A +++ + + Sbjct: 378 -------PRMDKTSARSMVVITDGYVSVER-----KAFEMIEQNLGQAN--VFTFGIGSG 423 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 L S+ + F + E + +K + I+ + Sbjct: 424 VNRYLLEGMAKISNSETFIATEMNEANDVAEKFRNYIKSPLLT 466 >gi|156382085|ref|XP_001632385.1| predicted protein [Nematostella vectensis] gi|156219440|gb|EDO40322.1| predicted protein [Nematostella vectensis] Length = 1221 Score = 43.0 bits (99), Expect = 0.088, Method: Composition-based stats. Identities = 28/211 (13%), Positives = 64/211 (30%), Gaps = 20/211 (9%) Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES--AGNLVNSIQKAIQEKKNLSVRIGTI 241 T A + D+ LVN S+R G Sbjct: 58 CRPVVKNCFVTMDVALAVDTSDGMSDADLAKTKSLVTTLVNQFSD-----SENSIRFGIT 112 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN--TYPAMHHAYRELYNEKESSHNTI 299 Y + + ++++ + + A +L++ Sbjct: 113 TYGQEARTLANFKQNFDEAKLRTAIKGIQKTGVQARRHDLAAMAVKNDLFS-----LEGG 167 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 +FVIF + G N+G + + + + + G+ + ++ V++ Q L + Sbjct: 168 MRQGHPRFVIFFSAGANTG-----TADDLKKASKPLTDLGVNMIAIGVNSNA-DQASLAE 221 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + F+ N EL + I ++ ++ Sbjct: 222 LASENRFIFSANSPAELDALWPSIEAQMCQE 252 >gi|254443409|ref|ZP_05056885.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] gi|198257717|gb|EDY82025.1| von Willebrand factor type A domain protein [Verrucomicrobiae bacterium DG1235] Length = 257 Score = 43.0 bits (99), Expect = 0.089, Method: Composition-based stats. Identities = 16/132 (12%), Positives = 47/132 (35%), Gaps = 21/132 (15%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N ++++ T ++ YR L + + S + ++ +TDG +S Sbjct: 132 NRQAFIGSVSQIRAKGRTPLVESIVTGYRVLTEQAQ-----RQSGYGRYVLVIVTDGASS 186 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRE 375 + + R + +++ ++ +L G Q+ + + Sbjct: 187 DGNPAGVAMEVT------RESPIEVQTIGFGVADHALNL-------PGVTQYVTASSPKA 233 Query: 376 LLESFDK-ITDK 386 L+++ ++ I + Sbjct: 234 LIDALNQVIASE 245 >gi|328945098|gb|EGG39253.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1087] Length = 471 Score = 43.0 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 30/243 (12%), Positives = 64/243 (26%), Gaps = 54/243 (22%) Query: 197 SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + Y+ P +ID+L E ++ +Q NL ++ L Sbjct: 225 NPYSNNPLKNRSRIDILREKTKKMMADLQPIGNVSVNLVQFNSHASFV----QQNFIELD 280 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK--------------------ESSH 296 L + S ++ LNP TN + + L + + Sbjct: 281 KGLTSINSAIDNLNPEHATNPGDGLRYGMVSLQSNAAQLKYVVLLTDGVPNSYMVGPQYN 340 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN------------------- 337 + F+ + +T Sbjct: 341 YWGNKIETRTRDGFVNQAYFNTWKYDLSTGFHSTRANSAFAGGNPSWAVDGSIEYSGEVS 400 Query: 338 ----AGMK-IYSVAVSAPPEGQDL--LRKCTDSSG----QFFAVNDSRELLESFDKITDK 386 G+K + + SA + G + V+D ++L ++F I + Sbjct: 401 KQFKKGIKRVNVIGFSAKEADKQQGSRLTAAIKEGVPETSYTDVSDDKQLEQTFADIKKQ 460 Query: 387 IQE 389 +++ Sbjct: 461 VEQ 463 >gi|327266004|ref|XP_003217797.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Anolis carolinensis] Length = 1098 Score = 43.0 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 59/177 (33%), Gaps = 14/177 (7%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 ++ + ++ ++++++ ++ + G N + L+K Sbjct: 278 RLTIAKQTVSSILDTLGD-DDFFNIIAYNEELHYVEPCLNGTLVQADRANKEHFREHLDK 336 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L A+ A+ L + + +I S + ++ +TDG A N Sbjct: 337 LFAKGIGMLDIALVEAFNMLSDFNHTGQGSICS----QAIMLVTDGAVDTYDAVFEKYNW 392 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + ++I++ + D L+ ++ FF L + + + + Sbjct: 393 P-------DRKVRIFTYLIGREAAFADNLKWMACANKGFF--TQISTLADVQENVME 440 >gi|288940556|ref|YP_003442796.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] gi|288895928|gb|ADC61764.1| von Willebrand factor type A [Allochromatium vinosum DSM 180] Length = 610 Score = 43.0 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 44/118 (37%), Gaps = 16/118 (13%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA-----SAYQNTLNTL 329 T+ A+ A + ++ +I TDG + SA Sbjct: 107 TDIERAIRTATEDWTKTPPEG---------ERHLILFTDGLVDVSKDEAESAASRERILS 157 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK 386 + E +++ G+K++++ + + + L+R + G D+ L F ++ ++ Sbjct: 158 EQIESLKSEGVKVHAIGL-SDQIDEPLMRLLATQTDGWLEVAQDAETLQRLFLRVLEQ 214 >gi|3273257|dbj|BAA31172.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 43.0 bits (99), Expect = 0.091, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVKHAVPLAMKLIQQLNLNENAIHLYVNDFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|23016188|ref|ZP_00055947.1| hypothetical protein Magn03010637 [Magnetospirillum magnetotacticum MS-1] Length = 408 Score = 43.0 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 22/165 (13%), Positives = 44/165 (26%), Gaps = 17/165 (10%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M I ++ F+ + + +QS DAA L G + S + Sbjct: 21 MVGIGMTAMIGFLALGTETGLWYAAKRNLQSVADAAALGGAFELGSGSNSSVISAAA--- 77 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + IA K NP PT LF Sbjct: 78 --------IQDAGRNGFQATGGATIAVHTPPASGKYAGNPQMVEVSVSQ----PTTLLFS 125 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL 165 + ++ R+ + +++ V++ + Sbjct: 126 ALFLK--SLQVNARAVAKTGSVGDACILALDNVINDAAYFTGSAT 168 >gi|294140885|ref|YP_003556863.1| inter-alpha-trypsin inhibitor domain-containing protein [Shewanella violacea DSS12] gi|293327354|dbj|BAJ02085.1| inter-alpha-trypsin inhibitor domain protein [Shewanella violacea DSS12] Length = 765 Score = 43.0 bits (99), Expect = 0.092, Method: Composition-based stats. Identities = 21/137 (15%), Positives = 47/137 (34%), Gaps = 21/137 (15%) Query: 257 NNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 NNL V+ L L T A+ AY ++ + ++ ITDG+ Sbjct: 321 NNLQRVEKTLQNLRADFGGTEMDSALQAAY-----------SSKTPKNIPTDILLITDGQ 369 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + L + ++ + V V + L + +++ G V + Sbjct: 370 ------IWDQDYLLTNAQASKHRH---FVVGVGSAVSEAFLSKLASETGGASEFVTPNEN 420 Query: 376 LLESFDKITDKIQEQSV 392 + + ++I++ + Sbjct: 421 MSSRIVRHFERIKQPRL 437 >gi|327270792|ref|XP_003220172.1| PREDICTED: epithelial chloride channel protein-like [Anolis carolinensis] Length = 921 Score = 43.0 bits (99), Expect = 0.093, Method: Composition-based stats. Identities = 21/129 (16%), Positives = 50/129 (38%), Gaps = 26/129 (20%) Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N ++ A++ ++ ++ +T+GE S S + + + Sbjct: 392 NICEGVNAAFQVFS--------QKLTSTEGCEIVLLTNGEGSDLSPCLSKNQSQE----- 438 Query: 336 RNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN---DSRELLESFDKITD---KIQ 388 + I+++A + + L K D + G+ F DS L+++F I+ Sbjct: 439 ----IIIHTIAFGSKASNE--LEKLADMTGGKTFYATDSLDSNGLIDAFGGISSGSGDAS 492 Query: 389 EQSVRIAPN 397 +QS+++ Sbjct: 493 QQSIQLESK 501 >gi|290990289|ref|XP_002677769.1| predicted protein [Naegleria gruberi] gi|284091378|gb|EFC45025.1| predicted protein [Naegleria gruberi] Length = 754 Score = 43.0 bits (99), Expect = 0.094, Method: Composition-based stats. Identities = 28/232 (12%), Positives = 71/232 (30%), Gaps = 34/232 (14%) Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSK-----YAPAPAPANRK---IDVLIESAGNLV 221 DNNN+ + +Y + + + + + + ID + NL Sbjct: 8 DNNNIETRQYEIDDSITCLQFDLISNIQRKEKQIVIALDVSGSMRGQGIDQAKIAISNLF 67 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV-KSRLNKLNPYENTNTYPA 280 + + + L E +S L ++ T+ Sbjct: 68 EQVVDIPD-----------VVLIAYDTSAELYDLRKKPAETRQSTLEQIQAGGGTDFTCV 116 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 S + +++ + ++F TDG++ + + + ++ + Sbjct: 117 FEAI----------SKLDMFNSQSEVAILFFTDGQDGSSHKREKAIEQMKKVLETKTQSF 166 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS---GQFFAVNDSRELLESFDKITDKIQE 389 + +++ LL + T G F V D+ E+ +S + + + Sbjct: 167 EFHTIGF-TSSHDVALLTQITQLGSVQGTFQYVKDANEINQSMENLIGLLTS 217 >gi|114799760|ref|YP_759488.1| hypothetical protein HNE_0760 [Hyphomonas neptunium ATCC 15444] gi|114739934|gb|ABI78059.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444] Length = 576 Score = 43.0 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 13/44 (29%), Positives = 21/44 (47%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASI 44 + A+I + L + AIDL + R MQS D A ++ + Sbjct: 25 LFALIAPIATLMMAMAIDLGMVNLQRRNMQSMTDLAAITAAGDL 68 >gi|3273293|dbj|BAA31190.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 43.0 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVKHAVPLAMKLIQQLNLNESAIHLYVNDFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENASQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|152207243|gb|ABS30732.1| voltage-gated calcium channel alpha2-delta subunit 1 [Anopheles gambiae] Length = 1256 Score = 43.0 bits (99), Expect = 0.095, Method: Composition-based stats. Identities = 40/353 (11%), Positives = 96/353 (27%), Gaps = 26/353 (7%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQ 87 ++ + + A L S+ + T+K ++ ++I ++ G + + + Sbjct: 114 RIMDSAEQAAL-------SESDPESATSKAHPSAFYDARRINEYQSDGRLAEGSRQMLLR 166 Query: 88 K-AQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENL 146 + + + + + P +K L + S Sbjct: 167 HMRRFEGYPVNISLSSVLLPAGVSLDDPETQSAIKWSSHLD--PLFANNIERDSALSWQY 224 Query: 147 AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPA 206 S L + S Sbjct: 225 FGSSTGFLRRFPGTAWPPETSYGSKEINDFRSEDWFIQAASSPKDVIILLDSSGSMSGKE 284 Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL 266 + + + + + R+ + +V +N+ EVK+ + Sbjct: 285 YQLAVATASAILDTLGDDDFFNLISFSDQSRVIVPCFQDKMVRATP----DNVKEVKTAI 340 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 N + N A+ A+ L +SS + + + ++ ITDG + Sbjct: 341 NAVECENTANFSAALETAFELLRKYNQSSQGSQCN----QAIMLITDGPSDTFMEVIKHY 396 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLE 378 N + ++I++ + G L K ++ G F +N E + Sbjct: 397 NHPHM-------PVRIFTYLIGTDKSGGKNLYKMACENKGFFVQINSPEEAKK 442 >gi|196228665|ref|ZP_03127531.1| hypothetical protein CfE428DRAFT_0695 [Chthoniobacter flavus Ellin428] gi|196226946|gb|EDY21450.1| hypothetical protein CfE428DRAFT_0695 [Chthoniobacter flavus Ellin428] Length = 921 Score = 42.6 bits (98), Expect = 0.098, Method: Composition-based stats. Identities = 12/71 (16%), Positives = 31/71 (43%), Gaps = 6/71 (8%) Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 L ++S + + + V+ +DG+++ + L++ + + G+ IY+V Sbjct: 318 TNLATALKNSASGDTKEQQRGAVVLFSDGQHNEGES------PLEVAKILAARGLPIYTV 371 Query: 346 AVSAPPEGQDL 356 + +DL Sbjct: 372 GFGSDIRPRDL 382 >gi|123454693|ref|XP_001315098.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121897764|gb|EAY02875.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 665 Score = 42.6 bits (98), Expect = 0.098, Method: Composition-based stats. Identities = 39/358 (10%), Positives = 104/358 (29%), Gaps = 47/358 (13%) Query: 44 IVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK--DKNNPL 101 I + ++ + + + K +L I + + + Sbjct: 65 IGDKQIRPQLRMSEEASKEYQESKEKGYLSLLGRNLSGNRIIFNFGNFPEETKIEVHYTM 124 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI---SICMVLDVSR 158 Y+AE Q + K + S + ++ +N++ + ++ Sbjct: 125 SYLAEVNNQGFFFRFPIASKDQYGYETSLPGSISFYLKIKTDKNISKIEANKSATINQFD 184 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + + + L+ K + S + + + Sbjct: 185 NHNAFINLDKFEPAI--FVQTLISDQDKSTAVSSDDYIAVSTYKEFSSKSNGYECKADYF 242 Query: 219 NLVNSIQKAIQEKKNLSVRIGT---------IAYNIGIVGNQCTPL----SNNLNEVKSR 265 +++ ++ +V+ ++I G++ L N V Sbjct: 243 FVIDCSGSMKGDRIEKAVKCMRLILQSLPMKCRFSIVCFGSEFQTLLPIVEYNNENVLLA 302 Query: 266 LNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 +N + T+ Y + + + + K + +TDGE Sbjct: 303 MNLIKYIQAIMGGTDIYHPLEYIF--------------SQNGMTKKIFLLTDGE------ 342 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 N+ +++ + + G IY+V + L+R + ++G++ V D+ E Sbjct: 343 DSNSEEIIRLVQENKQFG-NIYTVGIGIGA-DSGLIRNLAEVTNGKWTYVLDNENFNE 398 >gi|91791025|ref|YP_551976.1| von Willebrand factor, type A [Polaromonas sp. JS666] gi|91700905|gb|ABE47078.1| von Willebrand factor, type A [Polaromonas sp. JS666] Length = 753 Score = 42.6 bits (98), Expect = 0.098, Method: Composition-based stats. Identities = 41/360 (11%), Positives = 97/360 (26%), Gaps = 39/360 (10%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A+L+ + + D T L + R + + + + + Sbjct: 414 ALLAQAYELKYGKPDIDAPGLAAPAQTTTTTDDFTQLVATALERAADDGESLERALELIE 473 Query: 96 DKNNPLQYIAESKAQYEIPTENLFLKGLIPSA--LTNLSLRSTGIIERSSENLAISICMV 153 + + + + P G+ + L + S + + Sbjct: 474 VALEAVSASEQGEGSEKEPQLLGLAAGIGNATGTAIPLDTSARMSGAVSRLVRIFTKELQ 533 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 R+++ +N + K + K S A + + Sbjct: 534 DKRRRTVKLASAGGQVASNRVWRLKAMGDTNVFKVTSS----VCGIDAAATILLDRSGSM 589 Query: 214 IESAGNLVNSIQKAIQEKKNLS-VRIGTIAYNIGI----VGNQCTPLSNNLNEVKSRLNK 268 + Q + +S V+ + + +V R+N+ Sbjct: 590 SRCIVEAAGAALSCSQALERISKVKTSIEMFPGYAKCVGNTVALQAFGQSARQVARRVNE 649 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 ++ T A+ L ++ K+ V +TDG + N Sbjct: 650 VDAEGGTPLAEALQEVMPRLLAQRVK----------KRIVFLVTDGIPN------NRPGA 693 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLL---RKCTDSSGQFFAVNDSRELLESFDKITD 385 L+ G++ + + + L C +ND+ EL ++F+K+ Sbjct: 694 LEEIGKAEKLGVEFVGIGIGVHGRAIEGLTPFSIC---------INDASELPDAFEKLFR 744 >gi|20306196|gb|AAH28343.1| Chloride channel calcium activated 3 [Mus musculus] Length = 913 Score = 42.6 bits (98), Expect = 0.098, Method: Composition-based stats. Identities = 18/131 (13%), Positives = 52/131 (39%), Gaps = 25/131 (19%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T+ + A+ + + + ++ +TDGE++ S+ + + Sbjct: 382 GGTSICSGLRTAFTVIKKKYPTD---------GSEIVLLTDGEDNTISSCFDLV------ 426 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND---SRELLESFDKITD---K 386 + +G I++VA+ P ++L + + G +D + +++F ++ Sbjct: 427 ---KQSGAIIHTVALG-PAAAKELEQLSKMTGGLQTYSSDQVQNNGFVDAFAALSSGNAA 482 Query: 387 IQEQSVRIAPN 397 I + S+++ Sbjct: 483 IAQHSIQLESR 493 >gi|229595667|ref|XP_001015169.2| Helicase conserved C-terminal domain containing protein [Tetrahymena thermophila] gi|225565750|gb|EAR94924.2| Helicase conserved C-terminal domain containing protein [Tetrahymena thermophila SB210] Length = 2371 Score = 42.6 bits (98), Expect = 0.100, Method: Composition-based stats. Identities = 16/213 (7%), Positives = 60/213 (28%), Gaps = 20/213 (9%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN- 257 + + +++ + + +++ + + + I + Sbjct: 2148 TGSMGSLITQTKNIIQTTFEQTRDILKEKGYDPQCFLIMISCFRSYNSKWEEIFQTSTWE 2207 Query: 258 -NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD--- 313 N + ++S L + Y T ++ + + + + + Sbjct: 2208 NNPDNLRSFLQTITAYGGTYPGESVEVGLWWANKKNDEDPISQVIILGDQPAHLQNEAQE 2267 Query: 314 --GENSGASAYQNTLNTLQI----CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSG- 365 + + L L C+ + + + + + + + G Sbjct: 2268 FRNQFGQSYWETTPLKGLTYYVPECQKLNTKNIPVNTFYLHQGA--KSTYEDIAKLTKGI 2325 Query: 366 -QFFAVNDSRE---LLESF-DKITDKIQEQSVR 393 ++ +N ++ L F ++I + +Q R Sbjct: 2326 CEYLDINSAQSSKKLTNLFVEQILKDVGKQDGR 2358 >gi|88857994|ref|ZP_01132636.1| hypothetical protein PTD2_11429 [Pseudoalteromonas tunicata D2] gi|88819611|gb|EAR29424.1| hypothetical protein PTD2_11429 [Pseudoalteromonas tunicata D2] Length = 974 Score = 42.6 bits (98), Expect = 0.100, Method: Composition-based stats. Identities = 32/254 (12%), Positives = 79/254 (31%), Gaps = 69/254 (27%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 ++ A+ +I V + LV N + G + +N G G L Sbjct: 78 YNRKGNQVYCADSRIKVAQAAMKALVEQ---------NTDIEFGLMRFNGGSGGYILAGL 128 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAY--------RELYNEKESSHNTIGSTRLKK- 306 ++ + +++ L +T ++ +Y N + + ++ Sbjct: 129 GSSKEAILTKIESLRAGGSTPLAETLYESYLYLTGGGVNYAKNIADRDKDIEDNSSYDSP 188 Query: 307 -------------------FVIFITDGENS---------------GASAYQNTLNTLQIC 332 +I +TDG+ + AY + + + Sbjct: 189 FKPKKDDSGLDILRCDNSINMIIMTDGDPTEDGGQNGNIKSLYNSKYGAYPKSKSGSYLN 248 Query: 333 EYMR----------NAGMK----IYSVAVSAP--PEGQDLLRKCTDSSG-QFFAVNDSRE 375 + +G+ +++ + +G DLL++ G ++ + + + Sbjct: 249 SLAKYMLNVDLFPTTSGVTDIARTFTIGFGSGMSDDGLDLLKQTASDGGGEYLLASTAEQ 308 Query: 376 LLESFDKITDKIQE 389 L E+ K KI++ Sbjct: 309 LTEALKKTITKIRQ 322 >gi|87199928|ref|YP_497185.1| TadE-like [Novosphingobium aromaticivorans DSM 12444] gi|87135609|gb|ABD26351.1| TadE-like protein [Novosphingobium aromaticivorans DSM 12444] Length = 153 Score = 42.6 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 12/126 (9%), Positives = 39/126 (30%), Gaps = 5/126 (3%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A++ V LF ++++ ++ R+++QSAL + ++ S + + Sbjct: 23 VAVLTPVLVLFGLGTVEVSSLVARRSELQSAL---AEAVAIALASKPDTQSKIDTIESVI 79 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + ++ + G + ++ + + + Sbjct: 80 SASTGVSTANIDTA--VIYRCGIDPGYVTLPGLCSQSGEISKYVQLTIRDTYTPMWTDFG 137 Query: 122 GLIPSA 127 P + Sbjct: 138 ISGPIS 143 >gi|291223809|ref|XP_002731900.1| PREDICTED: chloride channel calcium activated 2-like [Saccoglossus kowalevskii] Length = 992 Score = 42.6 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 20/156 (12%), Positives = 49/156 (31%), Gaps = 23/156 (14%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNEKESSHN 297 + ++ L N + + + +L T+ + L ++ Sbjct: 361 IVEFSEFAQELAPLTLVNGSDSREGLIRRLPHSVGGWTSIGAGIMKGIEVLSTNGQNPEG 420 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 ++ I+DG + A + + + +G+ I ++A + ++L Sbjct: 421 --------GLIMAISDGGENRAPTLS------EALQAVDESGVTIDTIAY-SEQADENLA 465 Query: 358 RKCTDSSG-QFFAVNDSRE--LLESFDKITDKIQEQ 390 + G FF D L ++F I + Sbjct: 466 SLAARTGGMSFFYSGDDDSTVLEDAF---ATSITSR 498 >gi|225686099|ref|YP_002734071.1| protein norD [Brucella melitensis ATCC 23457] gi|256262776|ref|ZP_05465308.1| protein norD [Brucella melitensis bv. 2 str. 63/9] gi|225642204|gb|ACO02117.1| Protein norD [Brucella melitensis ATCC 23457] gi|263092583|gb|EEZ16818.1| protein norD [Brucella melitensis bv. 2 str. 63/9] gi|326410427|gb|ADZ67491.1| protein NorD [Brucella melitensis M28] gi|326553720|gb|ADZ88359.1| protein NorD [Brucella melitensis M5-90] Length = 633 Score = 42.6 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 24/132 (18%), Positives = 45/132 (34%), Gaps = 20/132 (15%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T AM HA +L + KK ++ +TDG+ + Sbjct: 515 TVEHRIAALKPGFYTRMGAAMRHATAKLAEQPNR----------KKLLLLLTDGKPNDVD 564 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRE 375 + + +R G+ +++V V L G + V + + Sbjct: 565 HYEGRFALEDCRRAAGEVRAKGVNVFAVTVDREASAYLPALF-----GRGGYALVANLAK 619 Query: 376 LLESFDKITDKI 387 L + I + Sbjct: 620 LPVALPAIYRML 631 >gi|205371983|ref|ZP_03224801.1| hypothetical protein Bcoam_00436 [Bacillus coahuilensis m4-4] Length = 245 Score = 42.6 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 15/97 (15%), Positives = 34/97 (35%), Gaps = 10/97 (10%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE----GQD 355 S K ++ +TDG ++ + + + G+ I + V G Sbjct: 1 MSVGTIKQILLLTDGCSNSGG------DPVAMAALAYEQGLTINVIGVMDNDTIDEKGMS 54 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + S G + ++ L ++ +T K Q++ Sbjct: 55 EIEGIATSGGGVSQIVYAKNLSQTVQMVTRKAMTQTL 91 >gi|149064977|gb|EDM15053.1| rCG28280 [Rattus norvegicus] Length = 106 Score = 42.6 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 16/111 (14%), Positives = 35/111 (31%), Gaps = 18/111 (16%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 E T T A+ A + +KK + ITDG+ ++ + Sbjct: 5 GEGTYTATALQAANDMFKEARPG---------VKKVALVITDGQTD----SRDKRKLADV 51 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQ-----DLLRKCTDSSGQFFAVNDSRELL 377 + ++ ++I+ + V + + S + +D L Sbjct: 52 VKDANDSKVEIFVIGVVKKDDPNFEIFHKEMNLIATDSEHVYQFDDFFTLQ 102 >gi|320106711|ref|YP_004182301.1| VWFA-like domain-containing protein [Terriglobus saanensis SP1PR4] gi|319925232|gb|ADV82307.1| VWFA-related domain-containing protein [Terriglobus saanensis SP1PR4] Length = 384 Score = 42.6 bits (98), Expect = 0.10, Method: Composition-based stats. Identities = 25/200 (12%), Positives = 59/200 (29%), Gaps = 37/200 (18%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE- 273 S L + + + L + + T + + LN L Sbjct: 144 NSWYFLQDMRNTSDVFFRTLKPEDYIAIVTYDLRTHILTDFTQDKRVTAEALNSLTIPGF 203 Query: 274 -NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 +TN + A+ L +K++I + G ++ + + + + Sbjct: 204 SDTNMFDALSETLDRLT-----------RIDGRKYIILVGTGRDTFSKLTLDKI----LA 248 Query: 333 EYMRNAGMKIYSVAVS------APPEG-------------QDLLRKCTD-SSGQFFAVND 372 + + + I+++ A G + ++ + GQ + Sbjct: 249 KVKQAQNVTIFAIGTGQLVRELADSRGRMGGIARMDYLQADNQMKTFAQMTGGQAYFPLF 308 Query: 373 SRELLESFDKITDKIQEQSV 392 L + F +I I+ Q V Sbjct: 309 QGALPDVFSQINQSIRNQYV 328 >gi|291242941|ref|XP_002741338.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 788 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 18/181 (9%), Positives = 54/181 (29%), Gaps = 26/181 (14%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKK--NLSVRIGTIAYNIGIVGNQCTPLSN 257 +I L ++ + + S + I++ + I ++ Sbjct: 304 VSGSMSLKSRIIKLQQAVYTFIMDEISLGIDVGCVTFSDKANIISWLMPINSDE------ 357 Query: 258 NLNEVKSRLN-KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + E + + LN NT A+ + L + + +TDG+ Sbjct: 358 DREEFLALVMPTLNTGGNTAIGSALIAGVQVLSQNDTQPAD-------GGILFLVTDGQE 410 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS--SGQFFAVNDSR 374 + ++ + + +G+ + ++A ++ L ++ ++ Sbjct: 411 NVPQFI------EEVIDNVIESGVIVDTLAWGL--FAEEKLETIASGTKGSSYYYSEQAQ 462 Query: 375 E 375 Sbjct: 463 S 463 >gi|194333791|ref|YP_002015651.1| hypothetical protein Paes_0961 [Prosthecochloris aestuarii DSM 271] gi|194311609|gb|ACF46004.1| conserved hypothetical protein [Prosthecochloris aestuarii DSM 271] Length = 413 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 18/172 (10%), Positives = 49/172 (28%), Gaps = 19/172 (11%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGI------VGNQCTPLSNNLNEVK 263 I+ +VN + + + ++ +++ Y Q TP + +L+ + Sbjct: 88 INQAKSQLWRIVNELSRMHKRGGDIRLQVALYEYGNNDLFPSTGFIRQVTPFTEDLDRIS 147 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 L L+ + + + S K + + + Sbjct: 148 EALFSLDTNGGSEYCG-------HVIGSSLNRLRWNRSDEGLKLIYIAGNEPFNQG---- 196 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEG-QDLLRKCTDSSGQFFAVNDSR 374 ++ C + + + ++ G ++ D G + DS Sbjct: 197 -PVSYEVACRWASERDIAVNTIYCGDYRVGIDTFWQRGADVGGGSYFAIDSD 247 >gi|111022920|ref|YP_705892.1| hypothetical protein RHA1_ro05957 [Rhodococcus jostii RHA1] gi|110822450|gb|ABG97734.1| conserved hypothetical protein [Rhodococcus jostii RHA1] Length = 548 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 16/117 (13%), Positives = 35/117 (29%), Gaps = 10/117 (8%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 +++ L T Y A+R + + VI +TDG N + Sbjct: 431 KVDGLI-GGGTGLYDTTLAAFRTVQETYDPRAV--------NSVIILTDGANEDPDSITK 481 Query: 325 TLNTLQICEYMR-NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + + I ++ ++ + L + G + D ++ F Sbjct: 482 EQLLSILQRETDPARPVIIVTIGITGDADAATLAEISRVTGGSSYVAKDPADIANVF 538 >gi|229823545|ref|ZP_04449614.1| hypothetical protein GCWU000282_00843 [Catonella morbi ATCC 51271] gi|229786989|gb|EEP23103.1| hypothetical protein GCWU000282_00843 [Catonella morbi ATCC 51271] Length = 4167 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 29/182 (15%), Positives = 56/182 (30%), Gaps = 12/182 (6%) Query: 146 LAISICMVLDVSRSMEDLYLQKHN--DNNNMTSNKYLLPPPPKKSFWSKNTTKSK----- 198 I + ++ D S S ++ N P + ++ N + Sbjct: 76 KPIDLVILQDTSGSFKNTIGGVQNALRELTTPVPSTQYDPDHPRLVFTDNPKTTDRVMVA 135 Query: 199 --YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 + D S + N + Y+ T LS Sbjct: 136 TFQGLDGKVTYDRWD--TSSVYDSWRRAYPGYNPDSNFTDNPVYEKYDGTSYRINNTSLS 193 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 NN +E++ +N + T T PA+ ++ ++S + + R + ITDG Sbjct: 194 NNPSEIQDFINNMVTEGGTPTVPALEDIMTRYSSQVQASQGGMENGRKT-VFLLITDGVA 252 Query: 317 SG 318 +G Sbjct: 253 NG 254 >gi|108758937|ref|YP_629042.1| von Willebrand factor type A domain-containing protein [Myxococcus xanthus DK 1622] gi|108462817|gb|ABF88002.1| von Willebrand factor type A domain protein [Myxococcus xanthus DK 1622] Length = 860 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 22/146 (15%), Positives = 41/146 (28%), Gaps = 27/146 (18%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSR---LNKLNPYENTNTYPAMHHAYRELYNEKE 293 R IA+ Q P+ ++ + LN T M A + Sbjct: 318 RFNVIAFENRFQSFQPEPVPFTQRTLEEADRWVAALNADGGTELLAPMRAAVQAA----- 372 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 ++ +TDG+ N L+ R ++YS + Sbjct: 373 ----------PDGVIVLLTDGQV------GNEAEILRAVLEARKT-ARVYSFGIGTNVSD 415 Query: 354 QDLLRKCT-DSSGQFFAVNDSRELLE 378 LLR + G ++ + + Sbjct: 416 V-LLRDMAKQTGGDVEFIHPGERIDD 440 >gi|58429519|gb|AAW78163.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 565 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 58/170 (34%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 S + L P+ TN A+ + L + + V+ +TDG + Sbjct: 115 LSIIKSLLSTNLPFGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPN- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLSDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429497|gb|AAW78152.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 58/170 (34%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 S + L P+ TN A+ + L + + V+ +TDG + Sbjct: 115 LSIIKSLLSTNLPFGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPN- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLSDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429469|gb|AAW78138.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 58/170 (34%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 S + L P+ TN A+ + L + + V+ +TDG + Sbjct: 115 LSIIKSLLSTNLPFGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPN- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLSDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|293571291|ref|ZP_06682325.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] gi|291608698|gb|EFF37986.1| von Willebrand factor type A domain protein [Enterococcus faecium E980] Length = 1364 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 34/376 (9%), Positives = 99/376 (26%), Gaps = 46/376 (12%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN-NPLQYIAESK 108 + D +L ++ I ++ + N L+ +SK Sbjct: 207 VPSNNPTADLHPEDATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSK 266 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH 168 I N + L+ ++ + +S + + D+ D+ Sbjct: 267 NAALIEAVNEISENLLSDPNMDIRI-GMVNFYHNSTVINNQEQISSDIFPLTNDINRLTG 325 Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 ++N + P NR + ++ G+ + A Sbjct: 326 SENTALNRTPIGGTP----LTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAP 381 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 + ++ + G + + + ++ NT+ + Sbjct: 382 IQTRSRTSIWGAWSSWSVMGDKIAIDRGDLFKNFETF------SGNTS-NAGFTYPVTYA 434 Query: 289 YNEKESSHNTIGSTRLKKFVI-------FITDGENSGASAYQNTLNTLQIC--------- 332 + + T R + ++ G S + T Sbjct: 435 SDFDRPVNGTNVQYRYGEVKEGDDKATHWVGTGSASNDTNGSPTSQEKSSAINTVAYHHW 494 Query: 333 --EYMRNAGMKIYSVAVSAPPE----------GQDLLRKCTDSSG-----QFFAVNDSRE 375 + I+S+ + G+++L+ D + +++ N+ + Sbjct: 495 LKNKYQENPPSIFSIGLGIDGSIAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKND 554 Query: 376 LLESFDKITDKIQEQS 391 ++ + + I+ ++ Sbjct: 555 IITALEDISSTFKKTI 570 >gi|293402468|ref|ZP_06646604.1| putative porin-like protein [Erysipelotrichaceae bacterium 5_2_54FAA] gi|291304131|gb|EFE45384.1| putative porin-like protein [Erysipelotrichaceae bacterium 5_2_54FAA] Length = 1191 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 40/272 (14%), Positives = 81/272 (29%), Gaps = 23/272 (8%) Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 T + + + D DN + + Sbjct: 558 GITFTAFDNEAEYTKQIKERLASHFYQNGNVYGVISTDNNTLLFYDNTSEEWKYIDIMKN 617 Query: 185 PKKSFWSKNT-TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 S T T A P + VLI++ + V + + +V I T Sbjct: 618 GTVSTSGAKTLTNEIVAKYPIVYCDRSSVLIDAVSSFVKGMNA------DSNVSIITFGN 671 Query: 244 NIGIVGNQCTPLSNNLNEVKSRLN--KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 N V + N E+ + PY+ TN + A +++ N S++ Sbjct: 672 NATDVTKGFVNVGENKEELLQTILKCSGTPYQGTNMADGLTMAKKQIDNTPSSNNCYS-- 729 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK-C 360 I +DG + + + + + AV A +D ++ Sbjct: 730 -------IVFSDGAP----SLPKDIAEQAAVDAAKALKKVSLTYAVCAGEADKDFMKNKI 778 Query: 361 TDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 S QF + ++ ++ + I+D++ S+ Sbjct: 779 ASSPDQFLSTDNMTDIYDLLQIISDEVGAASI 810 >gi|126433420|ref|YP_001069111.1| von Willebrand factor, type A [Mycobacterium sp. JLS] gi|126233220|gb|ABN96620.1| von Willebrand factor, type A [Mycobacterium sp. JLS] Length = 233 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 14/89 (15%), Positives = 32/89 (35%), Gaps = 4/89 (4%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 TN +H A L + K ++ G + +++ +TDG+ + + Sbjct: 87 AFTASGTTNMAAGIHLALDILEDRKH-AYKAAGLQYYRPWILLLTDGKPNLDGFDEAVAR 145 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + G+ +++V + Q L Sbjct: 146 LNAVESA---RGVTVFAVGAGPRVDYQQL 171 >gi|229588184|ref|YP_002870303.1| hypothetical protein PFLU0636 [Pseudomonas fluorescens SBW25] gi|229360050|emb|CAY46904.1| putative exported protein [Pseudomonas fluorescens SBW25] Length = 651 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 14/128 (10%), Positives = 34/128 (26%), Gaps = 1/128 (0%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVL-SGCASIVSDRTIKDPTTKKDQ 59 M A ++V +F+ +D + + ++QS D + L + + T K+ Sbjct: 16 MAAGTLAVALVFMLLVVDSGRLYMEKRKLQSIADTSALEAAGRGGLCSPTTTANDYAKEN 75 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 + + +G + + T D + Sbjct: 76 ATRNGFTVVAGDNSRGLVVTCGLLTTNANSVRVFTPDATKNDAVRVVATRSVMTSIATGI 135 Query: 120 LKGLIPSA 127 + Sbjct: 136 WSMFSGAP 143 >gi|221116649|ref|XP_002154434.1| PREDICTED: similar to Ints6 protein [Hydra magnipapillata] Length = 854 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 31/225 (13%), Positives = 69/225 (30%), Gaps = 32/225 (14%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 T + +D+ + + + + I + + T + V Sbjct: 7 LVDTSASMNQKTCLGTTYLDLAKGAVESFLKIRARDINASRGDRYMLVTFDEHNKSVK-- 64 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY-----RELYNEKESSHNTIGSTRL-K 305 NL + L L Y T+ A+ A+ L + ++ L Sbjct: 65 -VGWRENLAQFLKELKNLEAYGLTDLGGALKQAFDLLNMTRLQSGIDNYGLGRNPYYLEP 123 Query: 306 KFVIFITDGENSGASA---------YQNTLNTLQICEYMRNAGMKIYSV-----AVSAPP 351 V+ +DG + N L ++ + + +++++ +A Sbjct: 124 ALVMLFSDGCDLINVNGITGEIIVPSNNQLPGAELTKEIYRWDQRLFALNLKIPGFAASV 183 Query: 352 E---------GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 L C D+ G+ +++ + LL+S + I KI Sbjct: 184 NEKLSTLQAEDMALSNLCEDTGGKLYSIGSYKTLLQSLESIAQKI 228 >gi|82703475|ref|YP_413041.1| hypothetical protein Nmul_A2358 [Nitrosospira multiformis ATCC 25196] gi|82411540|gb|ABB75649.1| putative membrane protein [Nitrosospira multiformis ATCC 25196] Length = 437 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 49/138 (35%), Gaps = 6/138 (4%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAV------LSGCASIVSDRTIKDPT 54 + A+ + V F A+DL + ++++Q++ DA L+G + Sbjct: 31 IVALSLVVLVGFAGLALDLGKLYVAKSELQNSADACALAAARELNGANTNQLVLAEAAGM 90 Query: 55 TKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIP 114 T + +F+ ++ S + +A+ I + +QY + + I Sbjct: 91 TAGMRHDVLFQDEMIALGVDDSVTFSQTLNGVYQAKAAIGPAEAVLMQYARCTVQRTGIA 150 Query: 115 TENLFLKGLIPSALTNLS 132 + + L+P Sbjct: 151 NWFIQVLNLLPGVTIGNQ 168 >gi|71280576|ref|YP_269044.1| von Willebrand factor type A domain-containing protein [Colwellia psychrerythraea 34H] gi|71146316|gb|AAZ26789.1| von Willebrand factor type A domain protein [Colwellia psychrerythraea 34H] Length = 618 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 14/95 (14%), Positives = 33/95 (34%), Gaps = 13/95 (13%) Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG- 318 + V+ R+ + P T A+ A + + + K ++ +TDG+ + Sbjct: 498 DHVRGRIQAITPGFYTRMGAAIRQATKVISEQ----------KTADKLLLILTDGKPNDI 547 Query: 319 --ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 +T Q + G+K + + + Sbjct: 548 DHYEGRFGIEDTHQAINEAKRLGIKPFCITIDVDA 582 >gi|3273275|dbj|BAA31181.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVKHAVPLAMKLIQQLNLNENAIHLYVNDFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273291|dbj|BAA31189.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVKHAVPLAMKLIQQLNLNENAIHLYVNDFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|309358136|emb|CAP34492.2| CBR-DIG-1 protein [Caenorhabditis briggsae AF16] Length = 13580 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 32/307 (10%), Positives = 80/307 (26%), Gaps = 28/307 (9%) Query: 82 AGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIER 141 + QI + + + + + ++ Sbjct: 12740 ISGVDGVPQIVLVVKNGKASDDYSSAVKSLKSERNVTIFVVDSGDDESQDQNSELTDADK 12799 Query: 142 SSENLAISICM--VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 VL + + + Sbjct: 12800 IVVIPQWRGADSEVLGPIADYICKIVPNVESARTWPTPRTKATTLAGSRRSCSTIDYESD 12859 Query: 200 A-----PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + + D + ++ ++V++ + +IG + Y+ + Sbjct: 12860 VIIVLDSSENFTPDEFDSMKDAVASIVDTGFDLAPDVS----KIGFVIYSDKVAVPVALG 12915 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + E+ ++ + A L N G K V+ IT+G Sbjct: 12916 HYEDKIELLEKIVDAE-----KINDGVAIALYGL-NAARQQFQLHGRENATKIVLLITNG 12969 Query: 315 ENSGASAYQNTLNTLQICEYMRN-AGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFFAVN 371 +N G N E +R+ G+++++VAV + P+ +++ + V Sbjct: 12970 KNRG--------NAAAAAEDLRDMYGVQLFAVAVGSNPDELATIKRLVGNANPDNAIEVA 13021 Query: 372 DSRELLE 378 S E+ + Sbjct: 13022 QSTEIDD 13028 >gi|290995707|ref|XP_002680424.1| vWFA domain-containing protein [Naegleria gruberi] gi|284094045|gb|EFC47680.1| vWFA domain-containing protein [Naegleria gruberi] Length = 382 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 18/145 (12%), Positives = 50/145 (34%), Gaps = 13/145 (8%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI--GIVG 249 + +ID ++ +++S++ + +R I+Y Sbjct: 63 NIVDLVIVMDCTGSMSGEIDAAKKTVQTILSSLK----DHFKTDLRFSAISYRDHSDDYA 118 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + P + ++ + KS ++ ++ + A+ A + + + K I Sbjct: 119 VREFPFTKDIQKAKSYIDTMSAQGGGDHPEALASALKVVNEMPFNKKGK-------KICI 171 Query: 310 FITDGENSGASAYQNTLNTLQICEY 334 +I D G ++ Q + + C+ Sbjct: 172 WIADAPPHGMNSNQGADSYPEGCKD 196 >gi|320105056|ref|YP_004180647.1| hypothetical protein Isop_3541 [Isosphaera pallida ATCC 43644] gi|319752338|gb|ADV64098.1| protein of unknown function DUF1355 [Isosphaera pallida ATCC 43644] Length = 818 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 14/101 (13%), Positives = 33/101 (32%), Gaps = 19/101 (18%) Query: 257 NNLNEVKSRLNKLNPYENTN-TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + + KL +++ + +L ++ +I TDG+ Sbjct: 198 DGIAPALEAIAKLEASGSSSKLGAGVRQVLTQLRGVAPTA------------IIMFTDGQ 245 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + + E + G+ I +V + P +DL Sbjct: 246 TTEGE------DLASAAELAKRKGVPIITVGLGDPDAPRDL 280 >gi|91788413|ref|YP_549365.1| hypothetical protein Bpro_2551 [Polaromonas sp. JS666] gi|91697638|gb|ABE44467.1| hypothetical protein Bpro_2551 [Polaromonas sp. JS666] Length = 533 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 19/146 (13%), Positives = 42/146 (28%), Gaps = 1/146 (0%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + V F +DL H+ + ++Q+A DAA L+G + D QT Sbjct: 18 IFGLTVVVLFAMGGVVLDLGHLYIAKAELQNAADAAALAGAKDLNETTPGIDAAVATAQT 77 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + K + + + Q + K + +L Sbjct: 78 ISAKNKY-NFSTDVTLALANIEFGPSPDGPWSSVATARAAPQGMTFIKVDTGLKVLGTYL 136 Query: 121 KGLIPSALTNLSLRSTGIIERSSENL 146 + + + ++ Sbjct: 137 MRVAGVDTVSTFGLAVAGRFVNNVTP 162 >gi|3273249|dbj|BAA31168.1| thrombospondin-related protein [Plasmodium falciparum] gi|3273271|dbj|BAA31179.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVKHAVPLAMKLIQQLNLNENAIHLYLNDFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLNTNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|453699|dbj|BAA02853.1| type VII collagen [Homo sapiens] Length = 1080 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 55/169 (32%), Gaps = 23/169 (13%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKE 293 V++G ++Y+ S++L + R+ + N A+ A+R + Sbjct: 896 VQVGLLSYSHRPSPLFPLNGSHDLGIILQRILDMPYMDPSGNNLGTAVVTAHRYMLAPDA 955 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + ++ + D G + +G+ + V + Sbjct: 956 PGRR----QHVPGVMVLLVDEPLRGDIFSP--------IREAQASGLNV--VMLGMAGAD 1001 Query: 354 QDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR+ G FFAV+D L ++ + + + S P Sbjct: 1002 PEQLRRLA--PGMDSVQTFFAVDDGPSLDQAVSGLATALCQASFTTQPR 1048 >gi|284029672|ref|YP_003379603.1| hypothetical protein Kfla_1709 [Kribbella flavida DSM 17836] gi|283808965|gb|ADB30804.1| hypothetical protein Kfla_1709 [Kribbella flavida DSM 17836] Length = 144 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 17/129 (13%), Positives = 44/129 (34%), Gaps = 12/129 (9%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 TAI++ + + D++ + +R + + D AVL+ + + + + Sbjct: 21 TAIVLLMIAVVT----DISKVFLVRRDLDATADGAVLAAANGLAAIYGQTSAGSTAELDV 76 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 ++ ++L + A++ T E + ++P + + Sbjct: 77 EQARRLAAEYL-DTVAASNRFDGLDWSAEVTGTT-------VTVELTSTVDLPFQPPGWQ 128 Query: 122 GLIPSALTN 130 G A T Sbjct: 129 GSAGIASTA 137 >gi|146304149|ref|YP_001191465.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] gi|145702399|gb|ABP95541.1| von Willebrand factor, type A [Metallosphaera sedula DSM 5348] Length = 363 Score = 42.6 bits (98), Expect = 0.11, Method: Composition-based stats. Identities = 18/127 (14%), Positives = 40/127 (31%), Gaps = 4/127 (3%) Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 +E+ L L T Y A+ A+ E ++ + + + + + G + Sbjct: 73 SEMGKELESLKVGSGTAMYKALQEAFNLARKYGEPTYVILLTDGVPSDMGCMP-GLSRKF 131 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + + ++I S + + L G F+ V D ++ E Sbjct: 132 DLNRCLPVYQGLSVP---ENVQIISFGIGDDYSEEILTEVSEKGRGFFYHVTDPAQIPEK 188 Query: 380 FDKITDK 386 K+ Sbjct: 189 MPKLVKS 195 >gi|242009952|ref|XP_002425745.1| calcium channel, putative [Pediculus humanus corporis] gi|212509649|gb|EEB13007.1| calcium channel, putative [Pediculus humanus corporis] Length = 652 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 36/360 (10%), Positives = 103/360 (28%), Gaps = 42/360 (11%) Query: 54 TTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK--NNPLQYIAESKAQY 111 + + ++ ++ + A + + N + Sbjct: 25 REMAAEVKNMMDIKMNAVMRIMDSAEQAALSQKFDPGPSNKYTQRYNVQNGKNVDGSRNM 84 Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLD----VSRSMEDLYLQK 167 + ++ F + + +++ L + I LD + M+ + Sbjct: 85 FLKSDERFDHLPVNANFSSIFLSPGVKETDPDVQMGIRWSEYLDLLFVNNYEMDPSLSWQ 144 Query: 168 ---------------HNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK-----YAPAPAPAN 207 N KY S++ T SK + + Sbjct: 145 YFGSSSGFLRRYPAIKWPPNEGLLEKYQFHDFRTSSWYIDAATSSKDIVILVDSSSSMGG 204 Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRL 266 +K + +++++ + N+ + + + + N+ E++ Sbjct: 205 KKKGIAKAIVNIILDTLG--NNDFVNIYRFSESATEIVPCFKDVLVQATAENIRELRIAF 262 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + + + N A+ + L+ + + + ++ ITDG +S Sbjct: 263 DFVKYEGSANFTSALVTGFEILHRYNRTGQGCQCN----QAIMLITDGPSSSYKEIFKQY 318 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA-VNDSRELLE-SFDKIT 384 N + +++++ V Q+ + ++ +FA V +S + E I Sbjct: 319 NWPHM-------PVRMFTYLVGKDGSNQEDMNWMACANKGYFAKVQNSEDAQEKVLQYIA 371 >gi|118096709|ref|XP_001233876.1| PREDICTED: similar to tumor suppressor candidate 4 [Gallus gallus] Length = 1208 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 24/252 (9%), Positives = 63/252 (25%), Gaps = 39/252 (15%) Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP 185 + L IE E+ ++ + + N + P Sbjct: 19 NELNWTQALEDVFIENRKEDPSLLWQVFGSATGVTLYYPATPWRAPNKIDLYDVRRRPWY 78 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + S + + + + ++ S +++++ V Sbjct: 79 IQGASSPKDMVIIVDVSGSVSGLTLKLMKTSVCEMLDTLSDDDYVNVASKV--------- 129 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 K + + T+ +A+ +L N + Sbjct: 130 ----------------FKEDVQGMVVKGTTDYKAGFEYAFDQLQNSNITRA------NCN 167 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSS 364 K ++ TDG N N +++++ +V L+ + Sbjct: 168 KMIMMFTDGGEDRVQDVFEKYNWP-------NKTVRVFTFSVGQHNYDVTPLQWMACANK 220 Query: 365 GQFFAVNDSREL 376 G +F + + Sbjct: 221 GYYFEIPSIGAI 232 >gi|3273247|dbj|BAA31167.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 56/170 (32%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVKHAVPLAMKLIQQLNLNENAIHLYANDFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|110632964|ref|YP_673172.1| TadE-like [Mesorhizobium sp. BNC1] gi|110283948|gb|ABG62007.1| TadE-like protein [Chelativorans sp. BNC1] Length = 140 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 13/97 (13%), Positives = 25/97 (25%), Gaps = 3/97 (3%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAV---LSGCASIVSDRTIKDPTTKKD 58 AI+ L ++ ++RN + A D A L G + + + Sbjct: 23 FAIVCMPLLLICLGIVEFGRAFFVRNDLSYAADVAARKVLIGQIPAGAPSSDAASGLETA 82 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 + G + A + T Sbjct: 83 VREAFVGDASLLQIAVGEETVDGARYRTLSIRYPFTF 119 >gi|294645315|ref|ZP_06723031.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294808775|ref|ZP_06767508.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] gi|292639394|gb|EFF57696.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CC 2a] gi|294444072|gb|EFG12806.1| von Willebrand factor type A domain protein [Bacteroides xylanisolvens SD CC 1b] Length = 340 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 17/147 (11%), Positives = 50/147 (34%), Gaps = 11/147 (7%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 ++ + Y+ ++ +++ +++L +T + AY+ Sbjct: 7 DKDKVAIVTYSGSAGVKLEATPGSDKQKIREAIDELTAGGSTAGGAGILLAYKIAKKNLI 66 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 S+ N +I +DG+ + + L Q+ E R +G+ + + Sbjct: 67 SNGNNR--------IILCSDGDFNVGVSSAEGL--EQLIEKERKSGVFLTVLGYGMGNYK 116 Query: 354 QDLLRKCTDSS-GQFFAVNDSRELLES 379 ++ + G +++ +E Sbjct: 117 DKKIQVLAEKGNGNHAYIDNLQEANRV 143 >gi|290980233|ref|XP_002672837.1| vWFA domain-containing protein [Naegleria gruberi] gi|284086416|gb|EFC40093.1| vWFA domain-containing protein [Naegleria gruberi] Length = 340 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 23/205 (11%), Positives = 68/205 (33%), Gaps = 25/205 (12%) Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 K + +I+V + ++ ++ + K +L + + Sbjct: 23 SDKIVDLVIVMDCTGSMSGEINVAKNTVATIITTLHEHF--KTDLRFTAVSYRDHTDDYA 80 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + P + ++N K +N ++ + A+ A + + + K V+ Sbjct: 81 VKEFPFTKDINNAKEYINTMSAQGGGDYPEALASALKVVNEMPFNKKGK-------KIVV 133 Query: 310 FITDGENSGASAYQNTL------------NTLQICEYMRNAGMKIYSVAVSAPPEGQDL- 356 ++ D G +A ++ + ++I ++ + Y++ E Q L Sbjct: 134 WVADAPPHGMNASGDSYPNGCLDEQGQKIDWVKIGTELQEKNVVFYNIICERAKEDQQLT 193 Query: 357 --LRKCTD-SSGQFFAVNDSRELLE 378 + + G+ + D+ ++ Sbjct: 194 LFMDYLATKTDGKCLLLTDANKIPN 218 >gi|156405002|ref|XP_001640521.1| predicted protein [Nematostella vectensis] gi|156227656|gb|EDO48458.1| predicted protein [Nematostella vectensis] Length = 308 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 43/262 (16%), Positives = 80/262 (30%), Gaps = 30/262 (11%) Query: 131 LSLRSTGIIERSSENLAISICMVLD--VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 + R + ++ D + + +L + Sbjct: 22 IWERKDSFERNYYDRISQEWRFAKDKWNGDDPGYQKIPPRIETARKFMENFLDRDIKELD 81 Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 +KN + + VL SA V + I + L R GI Sbjct: 82 DGAKNIRRRSVGDN--IFYDVVFVLDSSASVGVKDYKNGILALQTLITRAKEDTRYAGIT 139 Query: 249 GNQCTPLSNNLNEVKSRLNKL----NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 + ++ + + L TNT A+ +L+ K+S + Sbjct: 140 FSTEANITFYFTDPLDAMKGLGGITYAPGMTNTQAALDICRTQLWLNKKSGFRRLS---- 195 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--- 361 K ++ +TDG+++ N TL ++N G++I+ VAV LR Sbjct: 196 FKRILIVTDGQSNI-----NMERTLYNAFQLKNMGIEIFVVAVG------KYLRGIAEIV 244 Query: 362 ----DSSGQFFAVNDSRELLES 379 + + V + R LLE Sbjct: 245 GLASSTDAHLYRVRNLRGLLEV 266 >gi|149758640|ref|XP_001499491.1| PREDICTED: similar to integrin, alpha 10 isoform 1 [Equus caballus] Length = 1167 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 62/205 (30%), Gaps = 25/205 (12%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + + + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVRTFLRRLVGRLFIDPEQ--IQVGLVQYGESPVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 221 DFRTKEEVVRAARNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELPT---ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKI 387 FF V D L + D + D+I Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRI 354 >gi|149758642|ref|XP_001499504.1| PREDICTED: similar to integrin, alpha 10 isoform 2 [Equus caballus] Length = 1177 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 62/205 (30%), Gaps = 25/205 (12%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + + + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVRTFLRRLVGRLFIDPEQ--IQVGLVQYGESPVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E +++ + ++ +T Sbjct: 221 DFRTKEEVVRAARNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELPT---ALKACEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRAIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKI 387 FF V D L + D + D+I Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRI 354 >gi|15674025|ref|NP_268200.1| hypothetical protein L107379 [Lactococcus lactis subsp. lactis Il1403] gi|12725093|gb|AAK06141.1|AE006434_3 unknown protein [Lactococcus lactis subsp. lactis Il1403] Length = 1450 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 27/261 (10%), Positives = 71/261 (27%), Gaps = 30/261 (11%) Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS---KNTTK 196 + + I +V+D+S SM+ + + ++ Sbjct: 318 NVQNPIKPVDIVLVIDMSGSMQGAKETAVRQGVSDFLSTIQNTAYADYVNVGIVGYSSPG 377 Query: 197 SKYAPAPAPANRKIDVLIES------AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + A ID + L + L + + Sbjct: 378 NYVTGASGYITVPIDKVSSESHVKSINQALAPQFSGGTFTQLGLRKGTEMLEQDSSDNQK 437 Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 +++ + ++N + N A S + G+T + + Sbjct: 438 MMILMTDGVPTFSYKVNS--ASKVDNVIYGQSFA---------ESRDEPGNTSKIQSPYY 486 Query: 311 ITDGENSGASAYQNT-LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK--------CT 361 + D ++T TL E + +I+++ + +G L ++ Sbjct: 487 VKDINGGSNIEIRDTWAATLGEAEISKQEISEIHTLGIQLGNDGSYLSQEEVKSRTSLIA 546 Query: 362 DSSGQFFAVNDSRELLESFDK 382 + G + N + ++ + Sbjct: 547 TT-GLYQDANSANDITDYLKN 566 >gi|309774616|ref|ZP_07669641.1| cell wall surface anchor family protein [Erysipelotrichaceae bacterium 3_1_53] gi|308917647|gb|EFP63362.1| cell wall surface anchor family protein [Erysipelotrichaceae bacterium 3_1_53] Length = 613 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 28/270 (10%), Positives = 69/270 (25%), Gaps = 48/270 (17%) Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + +++DN + + + K T +++ +A Sbjct: 13 NKTAVKSENDDNTYDITLEVVTEQNLSKLTKKSATILVIDTSGSMGDYQRLINAKNTAKE 72 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 V + + I+ ++ + +N+L TN + Sbjct: 73 FVKKYAGDEPDSGRYLAIVNFATNTNIILNWTDVSSADGKASADNAINQLYADGGTNLHA 132 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN--------------- 324 + A + + + + I +TDG + Sbjct: 133 GIKQASQLFDDTAIQDIESKNT-------IVLTDGAPTYYLKNCTSDINCIFYTHVTISN 185 Query: 325 ----------------TLNTLQICEYMRNAGMKIYSVAVSA--------PPEGQDLLRK- 359 T + ++ +Y++ A P L+ Sbjct: 186 TRYHVGGDGDKGSETINDATAAEAKTLKGKS-TVYTICYGASREMTYQGGPTVSAYLKNN 244 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + +DS +L+++F IT+ I Sbjct: 245 IASEMKNAYDADDSDDLVDAFKAITETITS 274 >gi|256827824|ref|YP_003156552.1| von Willebrand factor type A [Desulfomicrobium baculatum DSM 4028] gi|256577000|gb|ACU88136.1| von Willebrand factor type A [Desulfomicrobium baculatum DSM 4028] Length = 2452 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 32/350 (9%), Positives = 94/350 (26%), Gaps = 17/350 (4%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN 92 D+ + + S I K + Sbjct: 1531 TDSVNFAAAIQSAVTPLYGSDGSGGTVISGFELAISGGGELDSGLNSNGLDIILSKDGDD 1590 Query: 93 ITKDKNNPLQYIAESKAQYEIPT-ENLFLKGLIPSALTNLSLRSTGIIERSSENLAIS-I 150 + + + + ++ + + + +L + + + Sbjct: 1591 VVGRTTAGEVFRISVDSTGTVVLKQSAEVDHIEGTPNDDLISLADSKVFLEATATVTDGD 1650 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 V + S++ + D+ N + + + + ++ Sbjct: 1651 NDVATRTLSVDLGGNIRFVDDVPSAENDSVSAAENW-TGAKTYNLMLIVDRSGSINQTEM 1709 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 + + +L++ + V++ + + + +P+S + E K+ L+ L+ Sbjct: 1710 QAAVVAMNSLLDKYAEVAL-GGEAGVQVQVVTFAVDGTLVHGSPVS--IAEAKAYLDILD 1766 Query: 271 PYEN---TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 TN A+ A + + + + V FI+DG + + N Sbjct: 1767 NSGGSGNTNYDAAVAAATPAI-DGWPVATADHDN-----VVYFISDGAPTVGNGTVGLTN 1820 Query: 328 TLQIC--EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 + + + G +++ V D L G ++ + Sbjct: 1821 AEEAAWETTLESRGATSWAIGVGTSNAVDDDLADVAYPDGNVLLASNFND 1870 >gi|149173297|ref|ZP_01851928.1| hypothetical protein PM8797T_28944 [Planctomyces maris DSM 8797] gi|148848103|gb|EDL62435.1| hypothetical protein PM8797T_28944 [Planctomyces maris DSM 8797] Length = 1020 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 17/96 (17%), Positives = 39/96 (40%), Gaps = 10/96 (10%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPPEGQDLLR 358 S K +I I+DG+ + I ++++N + + VA E +R Sbjct: 553 KSDASTKHMIIISDGDPQPPT-------PQLIGQFLKNK-VSVSMVAIFPHGGEDISKMR 604 Query: 359 KCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + G+++ +D +L F K + ++ ++ Sbjct: 605 DIAGVTGGRYYFPSDPNQLPSIFIKESKTLKRSMIQ 640 >gi|116694147|ref|YP_728358.1| hypothetical protein H16_B0192 [Ralstonia eutropha H16] gi|113528646|emb|CAJ94993.1| Hypothetical protein H16_B0192 [Ralstonia eutropha H16] Length = 562 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 15/113 (13%), Positives = 33/113 (29%), Gaps = 1/113 (0%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKD-PTTKKDQ 59 M A++I+ + +ID+ H+ + Q+Q+ +D A +S + + + Sbjct: 22 MAALLIATVAIAALVSIDVGHVFMRQRQLQNVVDLAAMSAAQQLKRADSAANLNAAVLGT 81 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYE 112 I K G D + + + Sbjct: 82 VRNIGAKNGYPSGIAMGCGDATGGGADAMTACLGVWDPASGGPKHFSATYEAT 134 >gi|89070480|ref|ZP_01157773.1| hypothetical protein OG2516_06379 [Oceanicola granulosus HTCC2516] gi|89043884|gb|EAR50075.1| hypothetical protein OG2516_06379 [Oceanicola granulosus HTCC2516] Length = 432 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 18/221 (8%), Positives = 50/221 (22%), Gaps = 28/221 (12%) Query: 2 TAIIISVC-FLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 A+ + + A+D++H R ++Q+ + + D+ Sbjct: 18 FALFLFILCLTVGAVALDVSHAYAARTRLQA--------ATDATAHAAIVAREYMTADEA 69 Query: 61 STIFKKQIKKHLK-----QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYE-IP 114 + + HL + D+ + + Sbjct: 70 RAKARAIGEGHLPAGRYGDIFAPQHVTFGTWDGDTRTFVPDETASGAVLVRGMRGLDGGT 129 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 FL L+ ++ + + + + + +N Sbjct: 130 PAPTFLFKLVGIDWWDIVVEALFESYVPACIQ-------------NGFFAEGEVDIQSNN 176 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 T + ++N P +D+ Sbjct: 177 TFRAGFCVHSNYRVSLNQNNVFEPGVIVSMPNVNDLDMPSS 217 >gi|146292169|ref|YP_001182593.1| type IV pilin biogenesis protein [Shewanella putrefaciens CN-32] gi|145563859|gb|ABP74794.1| type IV pilin biogenesis protein, putative [Shewanella putrefaciens CN-32] Length = 1165 Score = 42.6 bits (98), Expect = 0.12, Method: Composition-based stats. Identities = 26/333 (7%), Positives = 77/333 (23%), Gaps = 35/333 (10%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + G + + + T ++ + + + Sbjct: 211 ADRLSSLSNTQFGSGQPVTLYSAHYLVWHKWATTTEEGKSSGGVGTRLDVAKSALISALE 270 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 IP E ++ I + S + +L N + Sbjct: 271 SLAIPIDAGLAIFNLNYPNEGDADGGRIVYDLTEMNSINKVNLTSLIKNMPAKTNTPLCE 330 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + F T N KID L+ T Sbjct: 331 TLYEAYQYFSGGQVTFGNKDKN-GTGNNKIDGY------------TPNNPPSILTSGSYT 377 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + +++ + + L A + A ++ N Sbjct: 378 TPFKKCPDTAYIIYITDGAPTLDKSADTLING---LVANAKNQAANYAAFSFTNAGNKTE 434 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRK 359 ++ + ++ + + + ++ ++++++ S + LL + Sbjct: 435 TSYMPALAAYMYNNDIVIGNKDSLGIDNK--------QNIRVFTIGFSDGADEAAALLEE 486 Query: 360 CTDSSGQ----------FFAVNDSRELLESFDK 382 G ++ + +L+ + + Sbjct: 487 TAFRGGNPRGSNNISKGYYVAKNGLDLVSALED 519 >gi|195051568|ref|XP_001993124.1| GH13254 [Drosophila grimshawi] gi|193900183|gb|EDV99049.1| GH13254 [Drosophila grimshawi] Length = 1237 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/354 (9%), Positives = 89/354 (25%), Gaps = 32/354 (9%) Query: 28 QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIREN-----A 82 ++ + + A LS S + I + L G+ + Sbjct: 103 RLMDSAEQAALSELEGQSSTEGGALGQQQHYDARRINEYNADGKLADGARHMDIRFMRRF 162 Query: 83 GDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERS 142 + ++ + + K+ + L + Sbjct: 163 ERLPVNLSLSSILVPHGVDLDETDVKSALQWSAHLD-----------PLFQNNLERDPAL 211 Query: 143 SENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA 202 S S L + S + Sbjct: 212 SWQYFGSSSGFLRRFPGTAWPPEGSKGSKLIHDFRTHNWFVQAASSPKDIMILLDASSSM 271 Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + + + + V+ + +V +N+ E+ Sbjct: 272 SEKSFDLATSTAFNILDTLGEDDYVNLITFSDVVKTPVPCFKDRMVRATP----DNVQEI 327 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 KS + + + N + +A+ L+ +S + + + ++ IT+ + Sbjct: 328 KSAVKAIKLQDTANFTAGLEYAFSLLHKYNQSGAGSQCN----QAIMLITESTSESHKEI 383 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF-AVNDSRE 375 N + ++I++ + + + L + S+ FF +N+ E Sbjct: 384 IKQYNWPHM-------PVRIFTYLIGSDSGSRSNLHEMACSNKGFFVQINNYEE 430 >gi|115767164|ref|XP_001193368.1| PREDICTED: similar to calcium activated chloride channel variant [Strongylocentrotus purpuratus] gi|115976266|ref|XP_001179968.1| PREDICTED: similar to calcium activated chloride channel variant [Strongylocentrotus purpuratus] Length = 797 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 17/146 (11%), Positives = 45/146 (30%), Gaps = 28/146 (19%) Query: 246 GIVGNQCTPLSNNLNEV--KSRLNKLNPY--ENTNTYPAMHHAYRELYNEKESSHNTIGS 301 L++ + +N L P T + L + + Sbjct: 306 DSTSQISGNLTDITETASRQRLVNALPPSPVGGTCIGCGILSGIEVLGSYAQG------- 358 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT 361 ++I ++DGE + A +T + ++N+G+ I ++ + + + Sbjct: 359 ----GYIILLSDGEETDAPFIMDTYD------EIKNSGVIIDTITI--SDSADQQMEDLS 406 Query: 362 DSSGQFF-----AVNDSRELLESFDK 382 ++ L+++F Sbjct: 407 TNTSGIANFCSDDARTGIRLIQAFQS 432 >gi|33331711|gb|AAQ11020.1| mesocentin [Caenorhabditis briggsae] Length = 13133 Score = 42.6 bits (98), Expect = 0.13, Method: Composition-based stats. Identities = 33/250 (13%), Positives = 76/250 (30%), Gaps = 26/250 (10%) Query: 137 GIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK 196 + E S + +VL V + + + Sbjct: 12350 ALDELSISGVDGVPQIVLVVKNGKARYIVPNVESARTWPTPRTKATTLAGSRRSCSTIDY 12409 Query: 197 SKYA-----PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 + + D + ++ ++V++ + +IG + Y+ + Sbjct: 12410 ESDVIIVLDSSENFTPDEFDSMKDAVASIVDTGFDLAPDVS----KIGFVIYSDKVAVPV 12465 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + E+ ++ + A L N G K V+ I Sbjct: 12466 ALGHYEDKIELLEKIVDAE-----KINDGVAIALYGL-NAARQQFQLHGRENATKIVLLI 12519 Query: 312 TDGENSGASAYQNTLNTLQICEYMRN-AGMKIYSVAVSAPPEGQDLLRKCTD--SSGQFF 368 T+G+N G N E +R+ G+++++VAV + P+ +++ + Sbjct: 12520 TNGKNRG--------NAAAAAEDLRDMYGVQLFAVAVGSNPDELATIKRLVGNANPDNAI 12571 Query: 369 AVNDSRELLE 378 V S E+ + Sbjct: 12572 EVAQSTEIDD 12581 >gi|66818054|ref|XP_642720.1| hypothetical protein DDB_G0277329 [Dictyostelium discoideum AX4] gi|60470879|gb|EAL68851.1| hypothetical protein DDB_G0277329 [Dictyostelium discoideum AX4] Length = 798 Score = 42.2 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 23/108 (21%), Positives = 42/108 (38%), Gaps = 7/108 (6%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHN 297 +G Y G+ + LS+N++E+ S +N + + A +A S H Sbjct: 319 MGMGDYCDGVNVLKVLDLSSNIDEIVSFINMVPNTSGGDEPEAYEYALYRAKELSWSEH- 377 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 K + I D + +N + C+ + + G+KIY V Sbjct: 378 ------TSKAFVMIGDSNPHEPTYTNLNINWFKECDDLFDRGIKIYGV 419 >gi|118081932|ref|XP_414993.2| PREDICTED: hypothetical protein [Gallus gallus] Length = 360 Score = 42.2 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 18/155 (11%), Positives = 52/155 (33%), Gaps = 17/155 (10%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 +R+ Y + + E+ + +++ ++ + S Sbjct: 15 IRMAVALYGEKPRMSIELTDYVTIEEILVAIQEISIKG-----SSLKVGSALAFAAHAMS 69 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 K V+ IT G+ ++ + +++AG+ +++V + + Sbjct: 70 QPATLRDNAAKVVVLITSGK--------SSDLVEDKAQVLQDAGVTVFAVGI--KDADKH 119 Query: 356 LLRKCTD--SSGQFFAVNDSRELLESFDKITDKIQ 388 L K ++ V+D L + K++ ++ Sbjct: 120 ELNKIASEPTAEHVIYVDDFHLLHSAAPKLSRRLC 154 >gi|116619318|ref|YP_821474.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116222480|gb|ABJ81189.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 296 Score = 42.2 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 28/153 (18%), Positives = 59/153 (38%), Gaps = 27/153 (17%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 ++N +++RL P+ T A+ A L H ++ I Sbjct: 116 LLDYTSNCGTMQNRLVMAKPHGMTALLDAIPLAVEHLRKAAHPRHA----------ILII 165 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIY--SVAVSAPPE---------GQDLLRKC 360 +DG + + + + R A +Y ++ + A + G +LLR+ Sbjct: 166 SDGGENASRVRLHDVR-----RQAREANAPVYAATLGLEAEFDQGPYLDARRGPELLREI 220 Query: 361 TD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G+ F++ +SR + E+ I ++ +Q V Sbjct: 221 AQITGGRAFSIQESRRIEEAAAGIARELHDQYV 253 >gi|291299883|ref|YP_003511161.1| Vault protein inter-alpha-trypsin domain-containing protein [Stackebrandtia nassauensis DSM 44728] gi|290569103|gb|ADD42068.1| Vault protein inter-alpha-trypsin domain protein [Stackebrandtia nassauensis DSM 44728] Length = 831 Score = 42.2 bits (97), Expect = 0.13, Method: Composition-based stats. Identities = 26/186 (13%), Positives = 49/186 (26%), Gaps = 32/186 (17%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS--RLN 267 +D L + V A+ + + N ++ L Sbjct: 332 VDTLSSADRFAVRCFDTAMTSP--------------EGLDPNGLSAGTDRNRFRAVEHLA 377 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 T+ + A L ++ + +I +TDG+ L Sbjct: 378 GTETRGGTDILKPLSTAVDLLTAGEKGR---------DRVIILVTDGQVGNEDQILRELT 428 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 +GM+++ V + L R G+ V L E+ I +I Sbjct: 429 GRL-------SGMRVHVVGIDKAVNAGFLHRLALVGRGRCELVESEDRLDEATAHIHRRI 481 Query: 388 QEQSVR 393 V Sbjct: 482 VAPVVT 487 >gi|301789441|ref|XP_002930137.1| PREDICTED: von Willebrand factor-like [Ailuropoda melanoleuca] Length = 2813 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 19/177 (10%), Positives = 52/177 (29%), Gaps = 20/177 (11%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAY 285 + ++ + Y ++ + S ++ + ++ A+ +A Sbjct: 1719 SRANIGPQLTQVSVLQYGSTTTAAVPWNVAYEKAHLLSHVDLMQREGGLSHIGDALDYAV 1778 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 R + +E + K VI +TD A + + + ++ V Sbjct: 1779 RYVTSEVHGARPGAS----KAVVILVTDVSADTVDAAADAAT---------SNRVTVFPV 1825 Query: 346 AVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESF---DKITDKIQEQSVRIAPN 397 + + LR+ + + +L + K+ VR+ + Sbjct: 1826 GIG-DRYDEAQLRRLAGPNAGSNVLRLQRIEDLSAVATLGNSFFHKLCSGFVRVCVD 1881 >gi|281343742|gb|EFB19326.1| hypothetical protein PANDA_020489 [Ailuropoda melanoleuca] Length = 2801 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 19/177 (10%), Positives = 52/177 (29%), Gaps = 20/177 (11%) Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAY 285 + ++ + Y ++ + S ++ + ++ A+ +A Sbjct: 1707 SRANIGPQLTQVSVLQYGSTTTAAVPWNVAYEKAHLLSHVDLMQREGGLSHIGDALDYAV 1766 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 R + +E + K VI +TD A + + + ++ V Sbjct: 1767 RYVTSEVHGARPGAS----KAVVILVTDVSADTVDAAADAAT---------SNRVTVFPV 1813 Query: 346 AVSAPPEGQDLLRKCTD--SSGQFFAVNDSRELLESF---DKITDKIQEQSVRIAPN 397 + + LR+ + + +L + K+ VR+ + Sbjct: 1814 GIG-DRYDEAQLRRLAGPNAGSNVLRLQRIEDLSAVATLGNSFFHKLCSGFVRVCVD 1869 >gi|239831665|ref|ZP_04679994.1| Protein norD [Ochrobactrum intermedium LMG 3301] gi|239823932|gb|EEQ95500.1| Protein norD [Ochrobactrum intermedium LMG 3301] Length = 633 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 17/96 (17%), Positives = 36/96 (37%), Gaps = 13/96 (13%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-- 318 V+ R+ L P T A+ H+ +L + KK ++ +TDG+ + Sbjct: 515 AVEHRIAALKPGFYTRMGAAIRHSTAKLAEQPNR----------KKLLLVLTDGKPNDVD 564 Query: 319 -ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 ++ + + R G+ +++V V Sbjct: 565 HYEGRFALEDSRRAVQEARARGVNVFAVTVDREASS 600 >gi|323345326|ref|ZP_08085549.1| aerotolerance protein BatB [Prevotella oralis ATCC 33269] gi|323093440|gb|EFZ36018.1| aerotolerance protein BatB [Prevotella oralis ATCC 33269] Length = 340 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 24/165 (14%), Positives = 58/165 (35%), Gaps = 45/165 (27%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++++ K L ++P + T+ A++ A + + ++ K + Sbjct: 144 LPITSDYVSAKMFLQNIDPSLIATQGTDIAGAINLASKSFTQQ----------DKVGKAI 193 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---------------PPEG 353 I ITDGE+ A + G I+ + + + G Sbjct: 194 IVITDGEDHEGGAIEAAKAARA-------KGYNIFILGIGSTNGAPIPMANGGYLQDASG 246 Query: 354 QDLL--------RKCTDSS-GQFFAVNDSRELLESFDKITDKIQE 389 Q ++ ++ + G + V+++ + E + K+Q+ Sbjct: 247 QTVMTKLNEQMCKEIAQAGNGTYIHVDNTSDAQEKLNDELTKLQK 291 >gi|153834035|ref|ZP_01986702.1| conserved hypothetical protein [Vibrio harveyi HY01] gi|148869590|gb|EDL68580.1| conserved hypothetical protein [Vibrio harveyi HY01] Length = 423 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 28/244 (11%), Positives = 59/244 (24%), Gaps = 10/244 (4%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRT-IKDPTTKKDQ 59 + ++++ + + IDL H + + ++Q+A+DAA L+G Sbjct: 20 LISMVLLILLGVAAFGIDLNHQVLNKTRLQNAVDAAALAGAVVADETSDVAAAEAAAVTT 79 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 + I L + ++ +Y + F Sbjct: 80 LANISASAGNSELTFTDGNTAVTFSSDRATFVDAASFSTPAGEYDIYVRVAVSDIGLTQF 139 Query: 120 LKGLIPSA-LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 L + + S + + I M D + + K + Sbjct: 140 LSSIFGINKNISASAVAGRSAAINYSCNLTPIAMCGDPLGDASNAWGYKPPGYDPNVDTD 199 Query: 179 YLLPPP----PKKSFWSKNTTKSKYAPAPAPANRKI----DVLIESAGNLVNSIQKAIQE 230 L + + A N D L + I + Sbjct: 200 PSLVHELKVGDQDNTDMGPGNFQLLDFGQAGGNSGAALVRDALSGAYNGCAAIGDTVITK 259 Query: 231 KKNL 234 N Sbjct: 260 PGNS 263 >gi|282164464|ref|YP_003356849.1| hypothetical protein MCP_1794 [Methanocella paludicola SANAE] gi|282156778|dbj|BAI61866.1| conserved hypothetical protein [Methanocella paludicola SANAE] Length = 506 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 36/119 (30%), Gaps = 16/119 (13%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKES 294 R + +++N L+ L+ T+ A+ L Sbjct: 368 RDVKVYLFSSEGQTHEIEITDNKKMATEFLDFLSYTFEGGTDFDTALREGVESL------ 421 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + ++FITDG S + + + R G +++++ V G Sbjct: 422 ----KKKQYVNADILFITDGL----SVVNDKYVISGLEQMKRENGTRLFTIIVGNDNAG 472 >gi|149043685|gb|EDL97136.1| procollagen, type VI, alpha 2, isoform CRA_b [Rattus norvegicus] Length = 369 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 25/229 (10%), Positives = 71/229 (31%), Gaps = 16/229 (6%) Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 D + N S+ +P + + + + ++ +Q Sbjct: 35 DISTTDRNNNCPEKADCPVNVYFVLDTSESVAMQSPTDSLLYHMQQFVPQFISQLQN-EF 93 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYREL 288 + +++ + +P ++ L + T T A+ + +++ Sbjct: 94 YLEQVALSWRYGGLHFSDQVEVFSPPGSDRASFTKSLQGIRSFRRGTFTDCALANMTQQI 153 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + F + ITDG +G + E R G+++++VA Sbjct: 154 --------RQHVGRGVVNFAVVITDGHVTGNPCGGIKMQ----AERAREEGIRLFAVA-P 200 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + LR ++ + + N+ + +I + +++ + Sbjct: 201 NRNLNEQGLRDIANTPHELYR-NNYATMRPDSTEIDQDTINRIIKVMKH 248 >gi|299137493|ref|ZP_07030675.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] gi|298600898|gb|EFI57054.1| VWFA-related domain protein-like protein [Acidobacterium sp. MP5ACTX8] Length = 451 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 28/205 (13%), Positives = 67/205 (32%), Gaps = 37/205 (18%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 ++ S + + + +L + + T +NN + ++ L L Sbjct: 206 LEFASNSYAFIQDMQNASASFFHSLQPDDYVAVETYDMRMHVLTDFTNNKDTIRQALQSL 265 Query: 270 NPYE--NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 TN + A++ L + K++I I++G ++ + + Sbjct: 266 TMPGFRETNEFDALYETLDRLSRVEGR-----------KYIILISNGRDTMSRLTLD--- 311 Query: 328 TLQICEYMRNAGMKIYSVAVS------APPEGQ-------------DLLRKCTD-SSGQF 367 + + + IY+++ + GQ + +R + G Sbjct: 312 -QMMAKIKATPNVTIYTISTGGLARELSDARGQMGGSTRMNYLQADNQMRTFAQMTGGAS 370 Query: 368 FAVNDSRELLESFDKITDKIQEQSV 392 + EL + F +I + I+ Q V Sbjct: 371 YQPLFQGELPDIFSQINESIRTQYV 395 >gi|221117275|ref|XP_002154690.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 895 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 15/103 (14%), Positives = 34/103 (33%), Gaps = 13/103 (12%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + A+ Y + T V+ +T G+++ + + Sbjct: 5 NGTSAIDVALKFVYNNM-------PKTFKEDGSPYVVLLLTYGKSTV-----DGKTLYKA 52 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 + +R +G+ + +VA +LL T + F + D Sbjct: 53 AKPIRQSGIYVVTVAFGQQSNFSELL-SITRNKNAIFQITDIA 94 >gi|47228040|emb|CAF97669.1| unnamed protein product [Tetraodon nigroviridis] Length = 621 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 30/193 (15%), Positives = 55/193 (28%), Gaps = 33/193 (17%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 E N+ + A RI + Y PL++N + L+ L+ Sbjct: 16 AKEFIDNVARRLNLASSSSDERDARIALLQYGSATEQRVEFPLTHNFTVISDSLDNLSYM 75 Query: 273 -ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF----------------------VI 309 ++ A+ +A + K + + Sbjct: 76 DSSSALGSAIIYAINNIVTPKVQRDRQRQGNNTENVSLLPTLCPQVGERLARRHAELAFV 135 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF-F 368 FITDG S + R G+ +AV +++LRK F Sbjct: 136 FITDGITSREQLDEGVTAMR------RGGGVPT-VIAVGKD-TDEEVLRKVALGDATAIF 187 Query: 369 AVNDSRELLE-SF 380 ++ L + +F Sbjct: 188 RADNYHMLNKSAF 200 >gi|157112862|ref|XP_001657649.1| dihydropyridine-sensitive l-type calcium channel [Aedes aegypti] gi|108884615|gb|EAT48840.1| dihydropyridine-sensitive l-type calcium channel [Aedes aegypti] Length = 1184 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 32/368 (8%), Positives = 100/368 (27%), Gaps = 20/368 (5%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQK 88 ++ +DA V + + + ++ + K+ ++ G I I + Sbjct: 50 LKYMMDAKVSAVKRIMDTAENTAISYDEEPVNQSFQYYNAKEMIEPGEIITTPIPMIDED 109 Query: 89 AQINITKDKNNPLQYI---------AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 T + + N++ + S I Sbjct: 110 PADITTPIPPKEIVLTKKKHFFNEAVNTSVSSVHVPTNVYDRATEVIKAIKWSEALDSIF 169 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + S + + + L ++ S Sbjct: 170 YNNYIGDPTLTWQYFGSSTGFLRQFPATKWVEDPVDLYDCRLRSWYIEAANSPKDVIILV 229 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + + ++ D+ N+++++ + N+ Sbjct: 230 DSSGSMTGQRKDIAKHVVSNILDTLG-PNDYVNIFTFSEEVTEVVDCFRDTLVQANMGNI 288 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+K ++ + E N A+ A+ L +ES + + + ++ ++DG Sbjct: 289 RELKLGMDNIETTEIANVSAALTKAFELLETFRESRNGARCN----QAIMLVSDGVPYSF 344 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 N ++ +++++ + +++ + ++ L E Sbjct: 345 EDIFEQYNWKELPFI----PVRVFTYLIGREVADVKEIKEMACKNQGYYV--HLSTLAEV 398 Query: 380 FDKITDKI 387 +++ + I Sbjct: 399 REEVLNYI 406 >gi|108763155|ref|YP_629622.1| hypothetical protein MXAN_1365 [Myxococcus xanthus DK 1622] gi|108467035|gb|ABF92220.1| hypothetical protein MXAN_1365 [Myxococcus xanthus DK 1622] Length = 1494 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 44/352 (12%), Positives = 85/352 (24%), Gaps = 70/352 (19%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 + Y + + S A + V+D S + + Sbjct: 130 TSYYRFDGSGSTSQSSFGMTRNPVKFNEPPPNTVISGTAAEACAQVVD-SGNNNTQAAAR 188 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWSKN-TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 + + Y K+ S + ++ E V + Sbjct: 189 AACAQCLATKGYFQYTQEKRVASGNFLNFYSPRGHSAVNVISQVLKDSERTRFGVVTFSA 248 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLS-----NNLNEVKSRL-NKLNPYENTNTYPA 280 + + + + LS ++ N + S++ N L T A Sbjct: 249 SSEATDTAKWSGQDVVRFERFGPSCGDSLSGAKREDHRNNLLSKMRNGLRFNTGTPLTQA 308 Query: 281 MHHAYREL-------YNEKESSHNTIGSTRLK---------------KFVIFITDGENSG 318 M A + + S S +I +TDGE + Sbjct: 309 MWGASTYFRSAGSDPFPDWFGSDYLRDSGFNDEAAPGRAATCFTCGFNAMILLTDGEPNE 368 Query: 319 ASAYQ--------NTLNTLQICEYM------------------------------RNAGM 340 N C + + Sbjct: 369 PGGDSAQVPAQVRNLDVPCSNCAAASQGSNSGGSSSHIHRIAKWMWTNDLRPELSGSQAV 428 Query: 341 KIYSVAVS-APPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQ 390 Y+V + + +LLR D G+F+A +S +L + I D +Q + Sbjct: 429 ATYTVGFALTNTQAINLLRVTADAGGGRFYAATNSSQLKTALQAIVDDVQNR 480 >gi|118370680|ref|XP_001018540.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89300307|gb|EAR98295.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 930 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 19/170 (11%), Positives = 52/170 (30%), Gaps = 29/170 (17%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKE 293 ++ + V+ LN++ Y TN Y + + + Sbjct: 362 FNIYSFGTEFSKLFDQSQKYSNENVELALNEIITYSANYGGTNIYQPLSEIFNQ------ 415 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPP 351 + + +TDG+ N + +++ + +++++ + Sbjct: 416 -----PYVKGYGRQIYILTDGQI---------ENKENVMHLIQSNNISNRVHAIGIGLYV 461 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS---VRIAPNR 398 + +++ G V D + ES I V+++ N+ Sbjct: 462 DKDLIIQSAKSGKGCHAHVTDQSLIQESIINILQNSISPILEDVKLSYNK 511 >gi|58429481|gb|AAW78144.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 542 Score = 42.2 bits (97), Expect = 0.14, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 S + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LSIIKSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLSDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|255068089|ref|ZP_05319944.1| neisseria PilC protein [Neisseria sicca ATCC 29256] gi|255047687|gb|EET43151.1| neisseria PilC protein [Neisseria sicca ATCC 29256] Length = 1097 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 11/52 (21%), Positives = 24/52 (46%), Gaps = 2/52 (3%) Query: 340 MKIYSVAVSAP--PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 ++ ++V P G+ L + G +F +L ++F+KI +I + Sbjct: 320 VQTFTVGFGNGISPAGEQYLIRGASHDGWYFNAAKPDDLYKAFEKIISQISD 371 >gi|52080631|ref|YP_079422.1| von Willebrand factor type A domain-containing protein [Bacillus licheniformis ATCC 14580] gi|52786005|ref|YP_091834.1| YojO [Bacillus licheniformis ATCC 14580] gi|319645408|ref|ZP_07999640.1| YojO protein [Bacillus sp. BT1B_CT2] gi|52003842|gb|AAU23784.1| von Willebrand factor, type A domain containing protein [Bacillus licheniformis ATCC 14580] gi|52348507|gb|AAU41141.1| YojO [Bacillus licheniformis ATCC 14580] gi|317392294|gb|EFV73089.1| YojO protein [Bacillus sp. BT1B_CT2] Length = 637 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 25/165 (15%), Positives = 57/165 (34%), Gaps = 13/165 (7%) Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 V + + T N + V S + L + + ++ Sbjct: 477 VPHQIVGFWEDTNDATETSQPNYFHTVVSFADSLKAGAGPHI-----MQLEPEEDNRDGY 531 Query: 296 HNTIGST------RLKKFVIFITDGENSGASAYQNTL-NTLQICEYMRNAGMKIYSVAVS 348 + +KF+I +DGE + QN + +T + R +++ +V +S Sbjct: 532 AIRQMTKMLVQRSEAQKFLIVFSDGEPAAFDYEQNGIVDTHEAVMEARKRNIEVINVFLS 591 Query: 349 APPEGQDLLRKCTDSSGQFF-AVNDSRELLESFDKITDKIQEQSV 392 + ++ D G++ V D +L + + K+ +S+ Sbjct: 592 NSEIEESQMKTIQDMYGKYSLFVPDVDQLPDVLYPLLKKLLNKSI 636 >gi|84516632|ref|ZP_01003991.1| CpaB family protein [Loktanella vestfoldensis SKA53] gi|84509668|gb|EAQ06126.1| CpaB family protein [Loktanella vestfoldensis SKA53] Length = 277 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 14/157 (8%), Positives = 39/157 (24%), Gaps = 7/157 (4%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +++ V +A+ + + Q LD A ++V+ + Sbjct: 5 FGLVLLVGMGLAGFAVYMVNQYMATQTAQ--LDRARQIAAQALVTVDVYVTTRSVTYGEY 62 Query: 62 TIFKK-----QIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 + +L +G++ E + + + + +K + Sbjct: 63 LTADDVRLVPYPRDYLPEGTFATEQSLFPQGTQTPRVVVLPMIANELVLTTKVSEPGASR 122 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 L + R + I + Sbjct: 123 GLTALVEPGFRAFPVEGRVAAGFGILRPDDRIDVYWT 159 >gi|291523143|emb|CBK81436.1| fibro-slime domain [Coprococcus catus GD/7] Length = 1745 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 37/305 (12%), Positives = 75/305 (24%), Gaps = 30/305 (9%) Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA-ESKAQYEIPTENLFLKG 122 F K + E D N+T D + + E Sbjct: 744 FNKVTTQGAALAGAEFEIRDDSDSSKVYNVTSDSDGRVSVRLHEGTYTMTETQTPTGYLA 803 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK---- 178 + ++ T I ++ + V + + + + + Sbjct: 804 ASGTWKITVNADGTYTITKNGRPIDKGSDSVYKIVNKGQHEDAEANLTTSKTVKVTDYNK 863 Query: 179 --------YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE 230 + + + + + L+ +A +++++ A + Sbjct: 864 REYEITLGASTSGREAGTEAKAASVVLVLDRSGSMGADGMTALVNAADTFIDTLKTASPD 923 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-----NKLNPYENTNTYPAMHHAY 285 + V N +E + N Y T A+ A Sbjct: 924 SQVAVVYFNGTQDEDDNTTTSKNFTKLNTDEKVKSIKDFLSNNGYSYGGTPMGDALEKAK 983 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT---LNTLQICEYMRNAGMKI 342 L T +K+V+F TDG +S N+ C A I Sbjct: 984 GLLD---------ADQTGNQKYVLFFTDGLPGHSSDDAFNCMVANSAVNCATDIKANATI 1034 Query: 343 YSVAV 347 Y+V Sbjct: 1035 YTVGY 1039 >gi|227552322|ref|ZP_03982371.1| von Willebrand factor, type A [Enterococcus faecium TX1330] gi|227178545|gb|EEI59517.1| von Willebrand factor, type A [Enterococcus faecium TX1330] Length = 1518 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 36/362 (9%), Positives = 96/362 (26%), Gaps = 50/362 (13%) Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN-NPLQYIAESKAQYEIPTENLFLKG 122 +L ++ I ++ + N L+ +SK I N + Sbjct: 379 ATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVNEISEN 438 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 L+ ++ + +S + + D+ D+ ++N + Sbjct: 439 LLSDPNMDIRI-GMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGT 497 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 P NR + ++ G+ + A E+ N Sbjct: 498 P----LTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIERSNRPDFTNWAV 553 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA------YRELYNEKESSH 296 N I + L N NT+ + + + + ++ Sbjct: 554 MNNMIARDT-GDLFKNFET---------YSGNTS-GAGFSYPVVYPSEFNRPEDTWDYNY 602 Query: 297 NTIG-STRLKKFVIFITDGENSGASAYQNTLNTLQIC-----------EYMRNAGMKIYS 344 K ++ G S + + + I+S Sbjct: 603 RYGEVKEGDDKAFHWVGTGAASNGTTGEPDTQEKSSAINTAAYHHWLKNKYQENPPSIFS 662 Query: 345 VAVSAPPE----------GQDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQE 389 + + G+++L+ D + +++ N+ +++ + + I+ ++ Sbjct: 663 IGLGIDGSVAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDISSTFKK 722 Query: 390 QS 391 Sbjct: 723 TI 724 >gi|156408321|ref|XP_001641805.1| predicted protein [Nematostella vectensis] gi|156228945|gb|EDO49742.1| predicted protein [Nematostella vectensis] Length = 981 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 47/434 (10%), Positives = 115/434 (26%), Gaps = 51/434 (11%) Query: 7 SVCFLFITYAIDLAHIMYIRN---QMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 + + + +N Q+ ++ +A L + + S Sbjct: 9 LLVLGLCGCFVLVLGHTNTQNCNMQLLASQVSAQLHATFESNLHAAKLQKLYQNVEFSNA 68 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 L+ + E+ + A +N N +P Sbjct: 69 PSTNPHDILQAIARGIEHKLNETINALLNAKTAVTNTGASNTRIIECCAMPAIARANYSY 128 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 + + + +++ R+ ++ YL P Sbjct: 129 -RFRAMLNTSVACKTRNPKDSGIIQGENSLVNTFRNNLVDSSVISWQYFGTSTGNYLQFP 187 Query: 184 PPKK--------------------SFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS 223 K + N + ++ + ++A ++++ Sbjct: 188 ASGKVCNGSSSFDPRFQSWYVEAVTRMRTNIVVVIDRSSSMSTAGRMALARQAAVTVLDT 247 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE---------- 273 + + I GN L N+N +K+ + L P Sbjct: 248 LGPNDKVGVVAFSHF-IIKPPGCFGGNVAEALPKNINRIKAWVEALTPRGKVSLQKTNLR 306 Query: 274 ------NTNTYPAMHHAYRELYNEKESSH-----NTIGSTRLKKFVIFITDGENSGASAY 322 T PA+ A+ L + + + ++F+TDG+ + Sbjct: 307 YVSFPGATKYVPALEAAFEMLGGDFNIKILHHPLIALIKRSAENMILFLTDGDPFDRNPD 366 Query: 323 QNTLNTLQICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN--DSRELL 377 + ++I + A + +Y + S + + L++ ++G F +N D+ L Sbjct: 367 VSIFEAIRIGQRKLAFPARINVYGLGESLNIDNLNRLKQIASLNNGTFTQINDQDASSLS 426 Query: 378 ESFDKITDKIQEQS 391 K + Sbjct: 427 TIMGKYYTSTAKTR 440 >gi|262377085|ref|ZP_06070311.1| predicted protein [Acinetobacter lwoffii SH145] gi|262308123|gb|EEY89260.1| predicted protein [Acinetobacter lwoffii SH145] Length = 1223 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 21/138 (15%), Positives = 45/138 (32%), Gaps = 11/138 (7%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPL-SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 + + I + + + + S + KL + T AY Sbjct: 304 TDNTQDGSVINFGTSTNQKKFLAMDKEGKQKFLSEITKLKAFGGTPISQ----AYEVSRG 359 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSG-ASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 + + + L + + F+TDGE S + Y + ++T + G+ +V Sbjct: 360 LFDKTAVNAKQSCLGQGIYFLTDGEPSSAGNNYWSNIHTNAASKRTV-NGVLTATVGFGG 418 Query: 350 PPEGQDLLRKCTDSSGQF 367 + L++ G F Sbjct: 419 GYD----LKESGGKGGFF 432 >gi|220922039|ref|YP_002497340.1| hypothetical protein Mnod_2052 [Methylobacterium nodulans ORS 2060] gi|219946645|gb|ACL57037.1| conserved hypothetical protein [Methylobacterium nodulans ORS 2060] Length = 418 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 22/218 (10%), Positives = 68/218 (31%), Gaps = 12/218 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + ++++ + AIDL+ + ++++++ A AS+ + + D TT D+ Sbjct: 15 LASLLLPIGLGIAALAIDLSTLQLVKHRLKVAA------DAASLAAVAVLPDTTTALDRA 68 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA-QYEIPTENLF 119 +I + + A + T + Q L Sbjct: 69 LSIAADNAGTGAGTVTAASDVRFGSYNSAAKSFTPGATPANAVQVTASRNQAHGNPVVLA 128 Query: 120 LKGLIPSALTNLSLRSTGIIERSSE-----NLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 + + ++S+ + + + + + S + + + + ++ + Sbjct: 129 FAKALGWSTPDISVSAVAVRFSPAYCFLVLDPSASDALSVSGTGRLSVPNCGVQVNSTSA 188 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 T+ + + S + +P P + Sbjct: 189 TAATVGNNSTAQARSFCIVGGYSGTSFSPKPITKCAAA 226 >gi|159901411|ref|YP_001547658.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159894450|gb|ABX07530.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 337 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 26/175 (14%), Positives = 57/175 (32%), Gaps = 40/175 (22%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 ++G + + PL+++L +S + +NP + + E Sbjct: 128 QVGMLMFGSSSYVQF--PLTSDLAAARSLVEPINPRGLSLGGTDVEEVITEGLRSFPIGQ 185 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ-- 354 + +I ITDG +S + + + M G+ I+++ ++ GQ Sbjct: 186 IE------GRTMILITDGGDSDEQSDGEAVAAAREAAKM---GLTIHTIGMATEAGGQIP 236 Query: 355 ------------------------DLLRKCTD-SSGQFFAVN--DSRELLESFDK 382 LL + + G +F + D +L + D+ Sbjct: 237 IYDDLGNISYVEDQGQRVISKLNRPLLEQIASATGGTYFDGSTLDLNQLQTALDQ 291 >gi|170038754|ref|XP_001847213.1| dihydropyridine-sensitive l-type calcium channel [Culex quinquefasciatus] gi|167882459|gb|EDS45842.1| dihydropyridine-sensitive l-type calcium channel [Culex quinquefasciatus] Length = 1209 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 28/368 (7%), Positives = 102/368 (27%), Gaps = 20/368 (5%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQK 88 ++ +DA V + + + ++ + K+ ++ G I I + Sbjct: 61 LKYMMDAKVSAVKRIMDTAENTAISYDEEPVNQSFQYYNAKEMIEPGEIITTPIPMIDED 120 Query: 89 AQINITKDKNNPLQYI---------AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 T + + N++ + S I Sbjct: 121 PADITTPIPPKEIVLTKKKHFFNEAVNTSVSSVHVPTNVYDRATEVIRAIKWSEALDSIF 180 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + S + + + + L ++ S Sbjct: 181 YNNYIGDPTLTWQYFGSSTGFLRQFPATKWEEDPVDLYDCRLRSWYIEAANSPKDVIILV 240 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + + ++ D+ N+++++ + N+ + + N+ Sbjct: 241 DSSGSMTGQRKDIAKHVVSNILDTLG--PNDYVNIFSFSEEVTEVVECFSETLVQ--ANM 296 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++ ++ + T + A + + E+ + R + ++ ++DG Sbjct: 297 GNIRELKLGMDHIKTTEIAN-VSAALTKAFELLETFRESRNGARCNQAIMLVSDGVPYSF 355 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + N ++ +++++ + +++ + ++ L E Sbjct: 356 DEIFSEYNWKELPFI----PVRVFTYLIGREVADVKEIKEMACKNQGYYV--HLSTLAEV 409 Query: 380 FDKITDKI 387 +++ + I Sbjct: 410 REEVLNYI 417 >gi|47218989|emb|CAG02027.1| unnamed protein product [Tetraodon nigroviridis] Length = 849 Score = 42.2 bits (97), Expect = 0.15, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 52/154 (33%), Gaps = 14/154 (9%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 I + + N++E + + Y T+ + A L ++++ Sbjct: 311 FDHRIQFWNTSLSK---ATKENIDEAMVYVKAIQSYGGTDINAPVLKAVDMLKEDRKAKR 367 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 S +I +TDG+ + + + + + M ++S+ Sbjct: 368 LPEKSID---MIILLTDGDPNSGESRIPVI--QENVKAAIGGQMSLFSLGFG-NDVKYPF 421 Query: 357 LRKCTDSSG----QFFAVNDSR-ELLESFDKITD 385 L + + + + +D+ +L +D+++ Sbjct: 422 LDVMSRENNGLARRIYEGSDAALQLQGFYDEVSS 455 >gi|328885837|emb|CCA59076.1| hypothetical protein SVEN_5790 [Streptomyces venezuelae ATCC 10712] Length = 865 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 14/109 (12%), Positives = 32/109 (29%), Gaps = 12/109 (11%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + T+ A+ A L + + K V +TDG+ + + Sbjct: 146 DAARMGPGTDFPAAIRQAVSRLTADGTAGGTGAAGKPAPKVVFLLTDGKLDVKDSPEYGT 205 Query: 327 NTL-----------QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 + + R AG++I+ + + + L + Sbjct: 206 DPESRQSNGEKRLTEELARARAAGVQIWPLGFGSEI-DRAALTAMAEGG 253 >gi|3273251|dbj|BAA31169.1| thrombospondin-related protein [Plasmodium falciparum] Length = 574 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 56/170 (32%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVNHAVPLAMKLIQQLNLNENAIHLYANVFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|297841999|ref|XP_002888881.1| hypothetical protein ARALYDRAFT_476383 [Arabidopsis lyrata subsp. lyrata] gi|297334722|gb|EFH65140.1| hypothetical protein ARALYDRAFT_476383 [Arabidopsis lyrata subsp. lyrata] Length = 757 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 18/136 (13%), Positives = 36/136 (26%), Gaps = 33/136 (24%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L TN + A + L V +TDG + N Sbjct: 396 NLIANGGTNMLLPLKQAIKLLEGSNIGVP----------LVYLVTDG---------SVEN 436 Query: 328 TLQICEYMRN------AGM--KIYSVAVSA--PPEGQDLLRKCTDSSGQFFA-VNDSREL 376 +IC M+ + +I + + + L+ ++ N++ Sbjct: 437 EREICNAMKESCSRNGKSISPRISTFGIGSFCNHY---FLQMLARIGNGYYDGTNNTDSF 493 Query: 377 LESFDKITDKIQEQSV 392 ++ D V Sbjct: 494 EHQMSRLFDIASSTIV 509 >gi|160939191|ref|ZP_02086542.1| hypothetical protein CLOBOL_04085 [Clostridium bolteae ATCC BAA-613] gi|158438154|gb|EDP15914.1| hypothetical protein CLOBOL_04085 [Clostridium bolteae ATCC BAA-613] Length = 598 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 25/290 (8%), Positives = 74/290 (25%), Gaps = 45/290 (15%) Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 + L + + S + + I++ + + ++ +N Sbjct: 326 AFFMTAILLFGCSPMLSTYASEECYTWDTSSKKITLVDLSSYFDGYKGSFVLYDLQKDNW 385 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 + + S Y A + +++ + Q E N Sbjct: 386 NIYDIEQAT-------IRISPNSTYKIYDALFALEENIITSENSFISCPQQNYPFESWNE 438 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + + + Q + ++S + ++ N N EL + Sbjct: 439 DQTLFSAMNSSVNWYFQALDAKLGKSNLQSYIEQIGY-GNQNING-------ELSSYWME 490 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY----------- 343 S ++ + +++ + + N +N T + + + IY Sbjct: 491 S--SLKISPIEQVELLKSLYFNDFGFTPENIQTTKESIQLFSDVNCTIYGKTGTGCIEEK 548 Query: 344 -----SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + + +F + + + + I +I Sbjct: 549 NVNGWFIGF------------VESKNHTYFFATNIQAIDNATGSIASEIT 586 >gi|302796874|ref|XP_002980198.1| hypothetical protein SELMODRAFT_38056 [Selaginella moellendorffii] gi|300151814|gb|EFJ18458.1| hypothetical protein SELMODRAFT_38056 [Selaginella moellendorffii] Length = 55 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 5/42 (11%), Positives = 17/42 (40%) Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 +++ + + + +L + G F V + + +F + Sbjct: 1 PVHTFGFGSDHDPEAMLSIAEATGGSFCYVQEESTVQHAFAQ 42 >gi|260785816|ref|XP_002587956.1| hypothetical protein BRAFLDRAFT_87344 [Branchiostoma floridae] gi|229273111|gb|EEN43967.1| hypothetical protein BRAFLDRAFT_87344 [Branchiostoma floridae] Length = 1412 Score = 42.2 bits (97), Expect = 0.16, Method: Composition-based stats. Identities = 33/207 (15%), Positives = 68/207 (32%), Gaps = 25/207 (12%) Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG--NLVNSIQKAIQEKKNLSVRIGT 240 P S + A A + + + + A Q + +I Sbjct: 48 PFSTDSDVALRVYCYDMASAEPKEFLTLPSGPDENYAIFFGDRLSNAYQWQCTGPFQIRV 107 Query: 241 IAY-NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 I+Y TP++ ++ L + + T T A++H H + Sbjct: 108 ISYTCEAHTYFSLTPITMGMSYEIEHLMRGDGGGETRTGHAIYHM----------RHTSK 157 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + +TDG++ + + E R+AG+ +Y+V V + + Sbjct: 158 FGAESHHAAVILTDGQSDD--------DAIAEAEDARDAGIDLYAVVVG-NFQNRASFPA 208 Query: 360 CTDSSGQFFAVNDSRELLESFDKITDK 386 T+ + F D+ + ++ KI D Sbjct: 209 MTNDPDRVF---DTSQACDAAQKIVDD 232 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 40/356 (11%), Positives = 93/356 (26%), Gaps = 40/356 (11%) Query: 30 QSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKA 89 A DAA S T + I + QGS A Sbjct: 220 SQACDAAQKIVDDQCDSHTTSMLGKLLLTTAVALA---IHGYGAQGSCPDGWAAHEQSCY 276 Query: 90 QINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAIS 149 + + Y ++ +S + + + Sbjct: 277 RAFEIYKD---WASARDYCGTYNADLVSITTTAEQEFMTQLISNLGGSFLIGLHDTATEN 333 Query: 150 ICMVLDVSRS-MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSK-NTTKSKYAPAPAPAN 207 + +D + + +++ + + ++K + A Sbjct: 334 VFAWVDNTPYDPALRLTGNALPDPITSTSNQMFVKFTSDNVYTKPGFRFTYKAVCAIDIV 393 Query: 208 RKIDVLIE--------SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 +DV + ++ + A + N+ + + P+ N + Sbjct: 394 LVLDVSSSIPQDQFLLARDFMMAFVDCAAFQGLNIRIGVICYNCEAKTY-FGLQPIYNGM 452 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + + + T T A+++ T + + +TDG Sbjct: 453 SYSIHYV--MYKGGETRTGHAIYYM-----------TCTSDFEAKPRVAVILTDG----- 494 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRE 375 ++ N + E R+ G+ +Y+V + P +G F V D+ + Sbjct: 495 ---RSGDNEVAEAENARDMGITLYAVRIGDPRFVDSA--ALATMTGDFTRVFDTDQ 545 >gi|228471033|ref|ZP_04055877.1| BatB protein [Porphyromonas uenonis 60-3] gi|228307253|gb|EEK16276.1| BatB protein [Porphyromonas uenonis 60-3] Length = 342 Score = 42.2 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 21/105 (20%), Positives = 38/105 (36%), Gaps = 21/105 (20%) Query: 253 TPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 PL+ +L ++ L + P TN A+ + + L + K V Sbjct: 146 LPLTPDLPTARTFLADIQPGMVSNQGTNLGQALERSAQALSAPSRAG----------KAV 195 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 I +TDGE+ + + ++ G+K Y V + P Sbjct: 196 ILLTDGEDHEG-------GLEEGIKRLKEQGIKAYVVTIGLPEGA 233 >gi|182437627|ref|YP_001825346.1| hypothetical protein SGR_3834 [Streptomyces griseus subsp. griseus NBRC 13350] gi|178466143|dbj|BAG20663.1| hypothetical protein [Streptomyces griseus subsp. griseus NBRC 13350] Length = 758 Score = 42.2 bits (97), Expect = 0.17, Method: Composition-based stats. Identities = 15/102 (14%), Positives = 31/102 (30%), Gaps = 18/102 (17%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T+ A+ EL + + S + + + +TDGE + Q + Sbjct: 139 GTGTDFPSAIRQGVHELSSGTDPS--------VPRVLFLLTDGEMDVTGSPQYGDPAHRE 190 Query: 332 CE----------YMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 E ++I+ + P+ + L R Sbjct: 191 AEGKRQLELELKNAAAKNVQIWPLGFGPDPDKEQLDRIAAGG 232 >gi|300787704|ref|YP_003767995.1| hypothetical protein AMED_5848 [Amycolatopsis mediterranei U32] gi|299797218|gb|ADJ47593.1| conserved hypothetical protein [Amycolatopsis mediterranei U32] Length = 476 Score = 41.8 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 54/190 (28%), Gaps = 27/190 (14%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 KI+ + + S++ + + Y + E K+ + Sbjct: 64 TKIEAARHACRAAIGSLRDGVLFGVVECTSRARLVYPAEPPLAVAGERT--RREAKAAAS 121 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L T + A + + + + +TDG+N Sbjct: 122 GLIAGGGTAMSTWLDLARDLF----------GKTPAVIRHAVLLTDGKNESDHRGALDAA 171 Query: 328 TLQ-----ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + C+ R + E +DL R G AV + EL F + Sbjct: 172 LERCRGEFACD-AR---------GIGDDWEPRDLQRIAAALRGSADAVLEDAELAGDFRR 221 Query: 383 ITDKIQEQSV 392 + ++ ++V Sbjct: 222 LVERAMGRTV 231 >gi|78221957|ref|YP_383704.1| type IV pilus assembly protein PilY1 [Geobacter metallireducens GS-15] gi|78193212|gb|ABB30979.1| type IV pilus assembly protein PilY1 [Geobacter metallireducens GS-15] Length = 991 Score = 41.8 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 19/165 (11%), Positives = 47/165 (28%), Gaps = 36/165 (21%) Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYREL-----YNEKESSHNTIGSTRLKKFVIFITDG 314 V + ++ +N T + ++ + I + K + I +TDG Sbjct: 205 TTVDAAVDGINAETWTPLGETLSEIWQYFRGGDSLYNSGTYTTPITNGCQKNYTIIVTDG 264 Query: 315 ENSGASAYQ----------------NTLNTLQICEYMRN-------------AGMKIYSV 345 E + + YQ N + + ++ + Y++ Sbjct: 265 EPTYDNCYQGPFASYGCPKDPYDKENENAPSHLADVAKDMHDGNASPFSGRVQNVSTYTI 324 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++ D + + G++F L + I + Sbjct: 325 GLTLDSALLD--QTAVNGGGKYFTTTSGISLATAVQNALADIVSK 367 >gi|297559547|ref|YP_003678521.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] gi|296843995|gb|ADH66015.1| von Willebrand factor type A [Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111] Length = 588 Score = 41.8 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 19/122 (15%), Positives = 46/122 (37%), Gaps = 13/122 (10%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + Y + AY + ++ + VI +T G++ G+S + + Sbjct: 469 GGGSRLYDNILAAYDRVQDDYD--------EDKINSVILLTAGQDEGSSDIAHADLVAAL 520 Query: 332 CEYMR-NAGMKIYSVAVSAPP---EGQDLLRKCTDSSGQFFAVNDSRELLESF-DKITDK 386 + + ++ +A + ++L R +SG F +D E+ + F I+ + Sbjct: 521 QDRFDPERPVSMFIIAFGSREQQVAEEELRRIAAATSGSLFVTDDPDEIGDIFLSSISRR 580 Query: 387 IQ 388 + Sbjct: 581 LC 582 >gi|33333556|gb|AAQ11894.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 41.8 bits (96), Expect = 0.17, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVKHAVPLAMKLIQQLNLNENAIHLYVNIFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|160725|gb|AAA29777.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 559 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 56/170 (32%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVKHAVPLAMKLIQQLNLNENAIHLYANIFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|60681980|ref|YP_212124.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] gi|60493414|emb|CAH08200.1| aerotolerance-related membrane protein [Bacteroides fragilis NCTC 9343] Length = 341 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 55/168 (32%), Gaps = 49/168 (29%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L ++P + T A++ A R + + + Sbjct: 142 TQLPITSDYISAKMFLESISPSLISKQGTAIGAAINLAARSFTPQ----------EGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 ++ ITDGEN A + + G+++ + V P Sbjct: 192 AIVVITDGENHEGGAVEAAKEAAK-------KGIQVNVLGVGLPDGAPIPIEGSNDFRRD 244 Query: 353 ----------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 + + ++ + +G + V++S + I +I + Sbjct: 245 REGNVIVTRLNEAMCQEIAKEGNGIYIRVDNS---NSAQKAINQEINK 289 >gi|167381981|ref|XP_001735931.1| hypothetical protein [Entamoeba dispar SAW760] gi|165901874|gb|EDR27852.1| hypothetical protein EDI_222580 [Entamoeba dispar SAW760] Length = 327 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 23/133 (17%), Positives = 47/133 (35%), Gaps = 13/133 (9%) Query: 257 NNLNEVKSRLNKLNPYENTNTYP-----AMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 N++ ++++ +N+ + Y P A A Y + + Sbjct: 2 NDIKDIQNSINQFSAYGGGMDGPESVTCAFDCAVNLGYRGYAAKVIIWIADAPPHGFNIQ 61 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA----PPEGQDLLRKCT-DSSGQ 366 DG G ++ ++ +++YSVA +DL+R + GQ Sbjct: 62 YDGYPDG---CPCGIDFQEVVLKAIKNDIQVYSVACEPIRPIYKHFRDLMRAVAMMTGGQ 118 Query: 367 FFAVNDSRELLES 379 F A+N + L + Sbjct: 119 FIALNSADCLADV 131 >gi|293377912|ref|ZP_06624093.1| LPXTG-motif cell wall anchor domain protein [Enterococcus faecium PC4.1] gi|292643459|gb|EFF61588.1| LPXTG-motif cell wall anchor domain protein [Enterococcus faecium PC4.1] Length = 1498 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 36/362 (9%), Positives = 96/362 (26%), Gaps = 50/362 (13%) Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN-NPLQYIAESKAQYEIPTENLFLKG 122 +L ++ I ++ + N L+ +SK I N + Sbjct: 359 ATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVNEISEN 418 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 L+ ++ + +S + + D+ D+ ++N + Sbjct: 419 LLSDPNMDIRI-GMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGT 477 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 P NR + ++ G+ + A E+ N Sbjct: 478 P----LTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIERSNRPDFTNWAV 533 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA------YRELYNEKESSH 296 N I + L N NT+ + + + + ++ Sbjct: 534 MNNMIARDT-GDLFKNFET---------YSGNTS-GAGFSYPVVYPSEFNRPEDTWDYNY 582 Query: 297 NTIG-STRLKKFVIFITDGENSGASAYQNTLNTLQIC-----------EYMRNAGMKIYS 344 K ++ G S + + + I+S Sbjct: 583 RYGEVKEGDDKAFHWVGTGAASNGTTGEPDTQEKSSAINTVAYHHWLKNKYQENPPSIFS 642 Query: 345 VAVSAPPE----------GQDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQE 389 + + G+++L+ D + +++ N+ +++ + + I+ ++ Sbjct: 643 IGLGIDGSVAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDISSTFKK 702 Query: 390 QS 391 Sbjct: 703 TI 704 >gi|120601101|ref|YP_965501.1| hypothetical protein Dvul_0050 [Desulfovibrio vulgaris DP4] gi|120561330|gb|ABM27074.1| conserved hypothetical protein [Desulfovibrio vulgaris DP4] Length = 389 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 61/203 (30%), Gaps = 11/203 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ T A+DL + R+QMQ A A++ + + + + D Sbjct: 18 IVALSSFALLGLGTMAVDLGVVYTKRSQMQKAA------DIAALAGAQALINSSGNTDMA 71 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKNNPLQYIAESKAQYEIPTEN 117 T + +L QG D N T N + A+ + Sbjct: 72 RTQAITTARANLAQGDVPDRAVRDGDVTFSNNAAINTSFPMNRIDVHIRRNAEAG-NSVG 130 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 L L + ++L++ + + + ++ V D + + +N + Sbjct: 131 LIFANLFGDSYSDLTVMARAEAVPACRSC-VAPLSVPDKFTWNDKCAPDRKLNNGALDPT 189 Query: 178 KYLLPPPPKKSFWSKNTTKSKYA 200 + +S + Sbjct: 190 SSCEMASVQLIGYSPADFGTPVV 212 >gi|47218988|emb|CAG02026.1| unnamed protein product [Tetraodon nigroviridis] Length = 1039 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 18/154 (11%), Positives = 52/154 (33%), Gaps = 14/154 (9%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 I + + N++E + + Y T+ + A L ++++ Sbjct: 475 FDHRIQFWNTSLSK---ATKENIDEAMVYVKAIQSYGGTDINAPVLKAVDMLKEDRKAKR 531 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 S +I +TDG+ + + + + + M ++S+ Sbjct: 532 LPEKSID---MIILLTDGDPNSGESRIPVI--QENVKAAIGGQMSLFSLGFG-NDVKYPF 585 Query: 357 LRKCTDSSG----QFFAVNDSR-ELLESFDKITD 385 L + + + + +D+ +L +D+++ Sbjct: 586 LDVMSRENNGLARRIYEGSDAALQLQGFYDEVSS 619 >gi|186470651|ref|YP_001861969.1| putative transmembrane protein [Burkholderia phymatum STM815] gi|184196960|gb|ACC74923.1| putative transmembrane protein [Burkholderia phymatum STM815] Length = 372 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 29/251 (11%), Positives = 68/251 (27%), Gaps = 21/251 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALD------------AAVLSGCASIVSDR 48 +TA+ + AID+ +++ RN++Q+A D A + D Sbjct: 16 ITAVSMVSLLGLAALAIDIGNLLVSRNELQNAADAAALAGAPCLYQRAQCGNTTATEPDW 75 Query: 49 TIKDPTT---KKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT--KDKNNPLQY 103 T TS + + K + GS G + T + +Q Sbjct: 76 TTATQKASSFATASTSNTVQGSVIKVAQTGSGYWNITGSPGTLETVPFTPGTNDLPAIQV 135 Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAIS-ICMVLDVSRSMED 162 A + ++L ++ + S +T + R + + Sbjct: 136 TITKSAANANGSVPVYLASILGVTSLSASATATAAVSRPGYVGPGGLFPLAMSKCLFDNY 195 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSK---NTTKSKYAPAPAPANRKIDVLIESAGN 219 ++ ++ K + ++ + A + + Sbjct: 196 WDSSTNSPKLATSTAKIPGQDFNQTPNTPYIFQVSSSYQVNGCEAGQWTTLTSQQNNVTF 255 Query: 220 LVNSIQKAIQE 230 + I + Sbjct: 256 VRGLIAGQNTD 266 >gi|33333550|gb|AAQ11891.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNENTIHLYVNIFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|16416477|gb|AAL18263.1| vitrin [Homo sapiens] Length = 656 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 13/120 (10%), Positives = 38/120 (31%), Gaps = 14/120 (11%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + N ++K+ + K+ +N + + +++ Sbjct: 310 MGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRTISFVTKNFFSKANG-- 367 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V+ + DG + + R +G+ I+ + + E + Sbjct: 368 ---NRSGAPNVVVVMVDGWPTD--------KVEEASRLARVSGINIFFITIEGAAENEKQ 416 >gi|160710|gb|AAA29770.1| thrombospondin related anonymous protein [Plasmodium falciparum] gi|33333554|gb|AAQ11893.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 56/170 (32%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVKHAVPLAMKLIQQLNLNENAIHLYANIFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273265|dbj|BAA31176.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 52/162 (32%), Gaps = 23/162 (14%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--------VKSRL 266 + V K IQ+ I + + L ++ ++ +KS L Sbjct: 63 NWVNHAVPLAMKLIQQLNLNESAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLL 122 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 N PY TN A+ + L + + V+ +TDG + Sbjct: 123 NTNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD------SIQ 169 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 ++L+ + + G+KI + L C S G+ Sbjct: 170 DSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273301|dbj|BAA31194.1| thrombospondin-related protein [Plasmodium falciparum] Length = 568 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 52/162 (32%), Gaps = 23/162 (14%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--------VKSRL 266 + V K IQ+ I + + L ++ ++ +KS L Sbjct: 63 NWVNHAVPLAMKLIQQLNLNESAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLL 122 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 N PY TN A+ + L + + V+ +TDG + Sbjct: 123 NTNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD------SIQ 169 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 ++L+ + + G+KI + L C S G+ Sbjct: 170 DSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273303|dbj|BAA31195.1| thrombospondin-related protein [Plasmodium falciparum] Length = 568 Score = 41.8 bits (96), Expect = 0.18, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 52/162 (32%), Gaps = 23/162 (14%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--------VKSRL 266 + V K IQ+ I + + L ++ ++ +KS L Sbjct: 63 NWVNHAVPLAMKLIQQLNLNESAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLL 122 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 N PY TN A+ + L + + V+ +TDG + Sbjct: 123 NTNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD------SIQ 169 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 ++L+ + + G+KI + L C S G+ Sbjct: 170 DSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|315231887|ref|YP_004072323.1| hypothetical protein TERMP_02126 [Thermococcus barophilus MP] gi|315184915|gb|ADT85100.1| hypothetical protein TERMP_02126 [Thermococcus barophilus MP] Length = 1614 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 24/216 (11%), Positives = 62/216 (28%), Gaps = 25/216 (11%) Query: 133 LRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSK 192 E + ++ + +D S ++ L+ + + S + Sbjct: 19 GTVPAFQGNYVEAGSENVFITVDSSNFPSEVVLRISLNVSETLSAENFEVYENGVRQQIT 78 Query: 193 NTTKSKYA----PAPAPANRKIDVLIE---SAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + + D + + N ++ +VR + + Sbjct: 79 GVYYTGEGLIMPIDVVFIIDRSDSMDSYIEAIKNSAYQFSYDLERIGGENVRFALVTFAN 138 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTN-TYPAMHHAYRELYNEKESSHNTIGSTRL 304 PL+NN++E LN + T ++ + A + + Sbjct: 139 YDDARIDLPLTNNVSEFVEALNSIYTAGGTEWSFGGILKALDLEF-----------NPNA 187 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 +K I +TD + Q+ + ++ + G+ Sbjct: 188 QKVFIVVTDED------DQSPYSVEEVANNLTAEGV 217 >gi|58429529|gb|AAW78168.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 539 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 S + L P+ TN A+ + L + + V+ +TDG Sbjct: 115 LSIIKSLLSTNLPFGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRNLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|70951609|ref|XP_745031.1| von willebrand factor a-domain-related protein, [Plasmodium chabaudi chabaudi] gi|56525221|emb|CAH76847.1| von willebrand factor a-domain-related protein, putative [Plasmodium chabaudi chabaudi] Length = 294 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 22/176 (12%), Positives = 49/176 (27%), Gaps = 7/176 (3%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P P + N + +K + + L + + A K + I Sbjct: 83 PDGPNPGNYCDNYYDITLIVENSSFVQKDYWMKGTIPFLESMARNARVSKDKAHMSIILF 142 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 A ++ +S + ++ ++ L+ TN + +A + T Sbjct: 143 AERQNLIVPFTDEMSQDKEKLIDKIRSLD-DAATNQHTLYVYALEYALEKVIFGEGTRS- 200 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 K + G + N + E + +K+ V + L Sbjct: 201 -DAPKVAVLFYYGFD----YGANKSLIPDVVEDYKQNNIKLIIVGIGLANRENAFL 251 >gi|3273261|dbj|BAA31174.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVNHAVPLAMKLIQQLNLNENAIHLYLNVFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLNTNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|47228853|emb|CAG09368.1| unnamed protein product [Tetraodon nigroviridis] Length = 632 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 28/145 (19%), Positives = 39/145 (26%), Gaps = 30/145 (20%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 L + P +T + L E S +I +TDGE + Sbjct: 140 ALKNVIPGGDTLMNLGLEMVREYLLEENLKRLC---SAGTASVIIALTDGELNEWQFDL- 195 Query: 325 TLNTLQICEYMRNAGMKIYSVAV----SAPPEGQDL------------------LRKCTD 362 Q + R+ G +Y V V GQ L L D Sbjct: 196 ---AQQEAQRARSMGAIVYCVGVKDFNQTQVRGQRLRNTNTTAHFAHWMCVLLQLATIAD 252 Query: 363 SSGQFFAV-NDSRELLESFDKITDK 386 + F V + L D I K Sbjct: 253 TVEHVFPVWGGFQALRGIIDSIIKK 277 >gi|85707636|ref|ZP_01038702.1| hypothetical protein NAP1_00335 [Erythrobacter sp. NAP1] gi|85689170|gb|EAQ29173.1| hypothetical protein NAP1_00335 [Erythrobacter sp. NAP1] Length = 740 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 22/132 (16%), Positives = 45/132 (34%), Gaps = 18/132 (13%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N+ K+ + L T E+ ++ + VIF+TDG Sbjct: 403 SNIAAAKTFTHNLMANGGT-----------EMLPALRAALRDRAPDERVRQVIFLTDG-- 449 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 A N + ++ R +++ V + + P + R G F V E Sbjct: 450 ----ALSNEADMMEEINRNRKDS-RVFMVGIGSAPNTYLMRRMAEAGRGTFTHVGMGEEA 504 Query: 377 LESFDKITDKIQ 388 + ++ D++ Sbjct: 505 EDQMQRLLDRLS 516 >gi|148692918|gb|EDL24865.1| RIKEN cDNA 1700112N15 [Mus musculus] Length = 492 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 20/99 (20%), Positives = 35/99 (35%), Gaps = 7/99 (7%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 G + +I +TDG L+T++ + R G +Y+V V + L Sbjct: 18 GGRIVNSVIIALTDGLLLLKPY----LDTMEEAKKARRMGAIVYTVGVFM--YSKQQLVN 71 Query: 360 CTDSSGQFFAV-NDSRELLESFDKITDKIQEQSVRIAPN 397 + F V L D +T K + + + P Sbjct: 72 IAGDPDRCFGVDEGFSALEGVVDPLTSKSCTEILSVQPT 110 >gi|118361736|ref|XP_001014096.1| Glutathionylspermidine synthase family protein [Tetrahymena thermophila] gi|89295863|gb|EAR93851.1| Glutathionylspermidine synthase family protein [Tetrahymena thermophila SB210] Length = 1547 Score = 41.8 bits (96), Expect = 0.19, Method: Composition-based stats. Identities = 22/284 (7%), Positives = 67/284 (23%), Gaps = 46/284 (16%) Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 A + + +++ + + G + ++ +L VS S Sbjct: 198 DNTARFNLSHALLQKDIDFQLIFSYEGMFDPQVILGSSKIFHQDS--VKSAILPVSHSAM 255 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 ++ N+ + + + ++ L+ Sbjct: 256 VSFVPNFNEEITKEIDDTIRAAINNGDNILSD---EFQQKLNQELVDHLNSSRSEFIFLL 312 Query: 222 NSIQKAIQEKKN--------------LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + + + I++ N ++ ++ Sbjct: 313 DRSGSMSGQPIDRACQALTLFLKSLPTDSYFNVISFGSSFKLLFPQSEKYNSQSLEKAIS 372 Query: 268 KLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++ Y T Y + + + K V +TDGE Sbjct: 373 NISKYKADLGGTEIYKPLKNVF-----------VQNKIQGYNKQVFLLTDGEVDS----- 416 Query: 324 NTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLR-KCTDSS 364 Q+ +R +++S+ + + + + Sbjct: 417 ----PEQVISLIRKNNKFSRVHSIGFGSGADQYLINQSAIAGKG 456 >gi|326778278|ref|ZP_08237543.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] gi|326658611|gb|EGE43457.1| von Willebrand factor type A [Streptomyces cf. griseus XylebKG-1] Length = 758 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 15/102 (14%), Positives = 31/102 (30%), Gaps = 18/102 (17%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T+ A+ EL + + S + + + +TDGE + Q + Sbjct: 139 GTGTDFPSAIRQGVHELSSGTDPS--------VPRVLFLLTDGEMDVTGSPQYGDPAHRE 190 Query: 332 CE----------YMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 E ++I+ + P+ + L R Sbjct: 191 AEGKRQLELELKNAAAKNVQIWPLGFGPDPDKEQLDRIAAGG 232 >gi|3273269|dbj|BAA31178.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 52/162 (32%), Gaps = 23/162 (14%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--------VKSRL 266 + V K IQ+ I + + L ++ ++ +KS L Sbjct: 63 NWVNHAVPLAMKLIQQLNLNENAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLL 122 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 N PY TN A+ + L + + V+ +TDG + Sbjct: 123 NTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD------SIQ 169 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 ++L+ + + G+KI + L C S G+ Sbjct: 170 DSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273273|dbj|BAA31180.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 52/162 (32%), Gaps = 23/162 (14%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--------VKSRL 266 + V K IQ+ I + + L ++ ++ +KS L Sbjct: 63 NWVNHAVPLAMKLIQQLNLNENAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLL 122 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 N PY TN A+ + L + + V+ +TDG + Sbjct: 123 NTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD------SIQ 169 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 ++L+ + + G+KI + L C S G+ Sbjct: 170 DSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273295|dbj|BAA31191.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 28/162 (17%), Positives = 52/162 (32%), Gaps = 23/162 (14%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--------VKSRL 266 + V K IQ+ I + + L ++ ++ +KS L Sbjct: 63 NWVNHAVPLAMKLIQQLNLNENAIHLYLNDFSNNAREIIRLHSDASKNKEKALIIIKSLL 122 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 N PY TN A+ + L + + V+ +TDG + Sbjct: 123 NTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD------SIQ 169 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 ++L+ + + G+KI + L C S G+ Sbjct: 170 DSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|21229089|ref|NP_635011.1| hypothetical protein MM_2987 [Methanosarcina mazei Go1] gi|20907644|gb|AAM32683.1| hypothetical protein MM_2987 [Methanosarcina mazei Go1] Length = 641 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 27/145 (18%), Positives = 48/145 (33%), Gaps = 39/145 (26%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 +V+ R++ L P NT A+ H+ ++L + K +I ++DGE Sbjct: 519 DVEKRISLLEPIANTRLGAAIRHSIKKLDQVSSGT----------KILILLSDGEPYDTC 568 Query: 321 AY-------QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 +T + N G+ + + V S G++ Sbjct: 569 YGEGAYQGKYAEEDTKIAIQEGNNRGIHFFCITVD-------------SSPGEY------ 609 Query: 374 RELLESFDKITDKIQEQSVRIAPNR 398 L + F I + RI P R Sbjct: 610 --LDKIFSDFGYTIIDD-ARILPER 631 >gi|321478616|gb|EFX89573.1| hypothetical protein DAPPUDRAFT_310509 [Daphnia pulex] Length = 999 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 24/214 (11%), Positives = 62/214 (28%), Gaps = 30/214 (14%) Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 + S + N + + + S + + ++ + Sbjct: 294 DFAGNNNLPASIANLNPRFNVVRVLNGARYVLVTDVSSSMNSFNRIGRLYDSARRWIKYD 353 Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSR-----LNKL--NPYENTNTYPAMHHAYRELYN 290 I I + L + + V +NK+ +T + A L Sbjct: 354 IPDGTSLGIIKFSTAATLLSGMTVVNDATRQQLMNKIPNTAAGSTCIGCGLQIAIDILKP 413 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 ++ +TDG + +++ + A +++ S+A Sbjct: 414 SGNG-----------GIIVLVTDGVENVHPFI------IEMTPQLIAAQVQVVSIAFGRA 456 Query: 351 PEGQDLLRKCTD-SSGQFFAVNDSRE---LLESF 380 E + + ++G+ F ++D L ++F Sbjct: 457 AENE--IEVLATKTNGKSFFIDDEGSSDPLNDAF 488 >gi|58429471|gb|AAW78139.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 41.8 bits (96), Expect = 0.20, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVKHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 S + L P+ TN A+ + L + + V+ +TDG Sbjct: 115 LSIIKSLLSTNLPFGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLSDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|239832733|ref|ZP_04681062.1| Hypothetical protein OINT_1002014 [Ochrobactrum intermedium LMG 3301] gi|239825000|gb|EEQ96568.1| Hypothetical protein OINT_1002014 [Ochrobactrum intermedium LMG 3301] Length = 579 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 21/196 (10%), Positives = 49/196 (25%), Gaps = 16/196 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQS-----------ALDAAVLSGCASIVSDRT 49 M A++ + + +D + + R Q+Q+ +L A + + ++ Sbjct: 19 MAALVSPLFLAVAAFCVDTSSLFLERRQLQNMADLAAVAGAASLSQANEAVLRQLQANG- 77 Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 + +I + + D + + NP Sbjct: 78 VDPVLMTDGYDPSIVNGKADNKTRVWVEKGNYFPDKGRAVEDRFVAGGANPDAVRVR--- 134 Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 P F + I + + E + + + + D S L Sbjct: 135 -LARPGNLYFGQSFISRPALGATGTAATKAEAAFSVGSRLLSLNTDQSVLNGLLGGLLGT 193 Query: 170 DNNNMTSNKYLLPPPP 185 N + L Sbjct: 194 SLNLKLIDYNALAATD 209 >gi|149919193|ref|ZP_01907676.1| hypothetical protein PPSIR1_02211 [Plesiocystis pacifica SIR-1] gi|149819907|gb|EDM79329.1| hypothetical protein PPSIR1_02211 [Plesiocystis pacifica SIR-1] Length = 718 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 46/396 (11%), Positives = 89/396 (22%), Gaps = 57/396 (14%) Query: 32 ALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQI 91 ALD A A+ + +Q + + L G+ Q I Sbjct: 163 ALDRAQAEVAATRALVPDL-----PGEQVRSYVAVVLTDGLWTGADGTTALAPADQDPAI 217 Query: 92 NITK----DKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLA 147 T + AQ E + L G + + N Sbjct: 218 TATALLDEADVPTYVITIDGDAQVEEAADALAFAGGTGA-AFHGGTPGLLDGAIYGLNQE 276 Query: 148 I--------------SICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP-PPKKSFWSK 192 + + ++LD S S + + T+ + S + Sbjct: 277 LLHDHALPGCSRQHPRVMVLLDASSSTLNDFGGTVPGTMGETAWDEIRAGLTAPDSLFDV 336 Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL---SVRIGTIAYNIGIVG 249 + + ++ S +++ G Sbjct: 337 DEGLAPQDMLHVGLTIFGHDDAPEEQRVLASYGPCMRDNLAWALDPEVSCVEPGCDDPWG 396 Query: 250 NQCTPLSN-----NLNEVKSRLNKLNP----------YENTNTYPAMH----HAYRELYN 290 + + N P T T+ + H Sbjct: 397 GPTIAWTYQDGSAGPPGFDLQTNSHMPRCEGPADFCWGSGTFTHRGLETVRLHQIDYAAA 456 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA---- 346 + + I ITDG S Y + E M NAG+ Y + Sbjct: 457 SQTDDALYPTAPDTPYINILITDG---MYSGYSSDAQVQSELEAMFNAGITTYVIGYGDL 513 Query: 347 VSAPPEGQDLLRKC---TDSSGQFFAVNDSRELLES 379 P + + + + + + EL ++ Sbjct: 514 FGFPDAVAQVEQMADWGSGGANDAYLAQNQVELEDA 549 >gi|34481890|emb|CAE46493.1| trap [Plasmodium falciparum] Length = 331 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVKHAVPLAMKLIQQLNLNESAIHLYVNIFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIRSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENASQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|302783060|ref|XP_002973303.1| hypothetical protein SELMODRAFT_99409 [Selaginella moellendorffii] gi|300159056|gb|EFJ25677.1| hypothetical protein SELMODRAFT_99409 [Selaginella moellendorffii] Length = 99 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 5/43 (11%), Positives = 18/43 (41%) Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + +++ + + + +L + G F V + + +F + Sbjct: 21 VPVHTFGFGSDHDPEAMLSIAEATGGSFCYVQEEFAVQHAFAQ 63 >gi|289772819|ref|ZP_06532197.1| conserved hypothetical protein [Streptomyces lividans TK24] gi|289703018|gb|EFD70447.1| conserved hypothetical protein [Streptomyces lividans TK24] Length = 450 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 18/158 (11%), Positives = 51/158 (32%), Gaps = 27/158 (17%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y + + + + + + + T + + R L + Sbjct: 66 AVVYPDTPRMARASART--RSRAERAVRETVAGGGTCIGAWLDLSRRLLTEQ-------- 115 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQ-----ICEYMRNAGMKIYSVAVSAPPEGQ 354 V+ +TDG+N + +C+ + +G+ Sbjct: 116 --DAPIGHVLLLTDGKNQHDEQMPLARVLEECAGRFVCDAW----------GIGDGWDGR 163 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 +LLR + G +V + L ++++ +++ ++V Sbjct: 164 ELLRITSQLHGSASSVREEEALPSEYEQLMNRLLTKTV 201 >gi|256788931|ref|ZP_05527362.1| hypothetical protein SlivT_30978 [Streptomyces lividans TK24] Length = 477 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 18/158 (11%), Positives = 51/158 (32%), Gaps = 27/158 (17%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y + + + + + + + T + + R L + Sbjct: 93 AVVYPDTPRMARASART--RSRAERAVRETVAGGGTCIGAWLDLSRRLLTEQ-------- 142 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQ-----ICEYMRNAGMKIYSVAVSAPPEGQ 354 V+ +TDG+N + +C+ + +G+ Sbjct: 143 --DAPIGHVLLLTDGKNQHDEQMPLARVLEECAGRFVCDAW----------GIGDGWDGR 190 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 +LLR + G +V + L ++++ +++ ++V Sbjct: 191 ELLRITSQLHGSASSVREEEALPSEYEQLMNRLLTKTV 228 >gi|34481888|emb|CAE46492.1| trap [Plasmodium falciparum] Length = 331 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVKHAVPLAMKLIQQLNLNESAIHLYVNIFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENASQLVVMLTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLAGCHPSDGK 211 >gi|118372347|ref|XP_001019370.1| TPR Domain containing protein [Tetrahymena thermophila] gi|89301137|gb|EAR99125.1| TPR Domain containing protein [Tetrahymena thermophila SB210] Length = 1208 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 24/170 (14%), Positives = 62/170 (36%), Gaps = 18/170 (10%) Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR----LNKLNPYENTN 276 ++S + R+ I++N + +S ++ + L + Sbjct: 1035 IDSFVSIRENYIKQFDRLALISFNHNVNVCFELQVSGKNDKFIDKYFKDAKNLACTGDKA 1094 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 Y A++ + + ++++ ITD ++ + + L L Sbjct: 1095 LYNAIYDGIKLFKKTEP--------QPNSRWLVAITDNVDNYSRIDEEQLKPLFF----- 1141 Query: 337 NAGMKIYSVAVSAPPEGQDL-LRKCTDSSGQFFAVNDSRELLESFDKITD 385 +K+ + ++ +++ L+ C ++ G F DS +L +F IT+ Sbjct: 1142 QNNVKLILIGLNLKSNAKEVYLKLCKNTGGTFIENPDSMDLNVAFQSITN 1191 >gi|283787709|ref|YP_003367574.1| tight adherence protein TadG [Citrobacter rodentium ICC168] gi|282951163|emb|CBG90854.1| putative tight adherence protein TadG [Citrobacter rodentium ICC168] Length = 647 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 28/253 (11%), Positives = 63/253 (24%), Gaps = 14/253 (5%) Query: 18 DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY 77 D A ++ + ++ A A+L+ A + D I K + +L Sbjct: 46 DGALMINRKARLADASSEAILAISAVDNRLVD----SVAIDNNKQIAKDFVNYYLPNNQA 101 Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 + + I + + + Sbjct: 102 EQLKVVVTSFDRTIEKGYIDYKIAISATLPTLLPLGHLGFSAFDRSVTVGNFDNN-SGNA 160 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP-----PPPKKSFWSK 192 + + V+D S SM Y+ + + K ++ S ++ Sbjct: 161 RKFVTVISDPADYVFVVDFSDSMNSSYIDQGRVTTRLAMLKQVVREVISGNKNPDSQFAI 220 Query: 193 NTTKSKYAP----APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 + + + E G LV + + + +I + + Sbjct: 221 VPFDIGVPFRIKDSVNTTSSYANKENEGGGELVGCSALYVPKPQYAPDKIDYNFWANKNI 280 Query: 249 GNQCTPLSNNLNE 261 + PL NN Sbjct: 281 SQKQYPLFNNSKN 293 >gi|212633291|ref|YP_002309816.1| VCBS [Shewanella piezotolerans WP3] gi|212554775|gb|ACJ27229.1| VCBS [Shewanella piezotolerans WP3] Length = 1477 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 31/343 (9%), Positives = 87/343 (25%), Gaps = 28/343 (8%) Query: 31 SALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQ 90 +A D + + + T + + + D Sbjct: 856 AATDGDGDAVVFAGANTGEFSVTLTPDNYVPNALNNFYQVDYEGLVAGNVITDDTGSGVD 915 Query: 91 INITKDKNNPLQYIA--------ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERS 142 + D Q E++ + + G S Sbjct: 916 SDSNNDPLLLTQVNGIDLNFIGGEAEVILQGGLLTIQEDGSYTFEHDGNSSTPINFSYTI 975 Query: 143 SENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA 202 ++ + + ++ + ++ N ++ P + + Sbjct: 976 NDGNGGTDTANVSIAVYDSETLNPGDDNYIGTDGNDTIISDTPDILAGADYNLAFLIDSS 1035 Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN--LN 260 + + + +++ ++ + +V + + ++ LS+N L Sbjct: 1036 GSMGDSAVATAKAQILSVLATLITNANQPSAGTVNVLLVDFDQTAKILIAIDLSSNDPLA 1095 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 + + L ++ TN A AY + FITDGE + + Sbjct: 1096 SITTALEAMSSGGTTNYSAAFTAAYNWFNDNYPQG---------NNRTFFITDGEPNTDN 1146 Query: 321 -----AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 ++N N + + + ++ + L + Sbjct: 1147 GQPGDYFENAQNAFALLNALS----YVEAIGLGGNVNSSTLQQ 1185 >gi|170743237|ref|YP_001771892.1| cell wall anchor domain-containing protein [Methylobacterium sp. 4-46] gi|168197511|gb|ACA19458.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium sp. 4-46] Length = 761 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 53/160 (33%), Gaps = 19/160 (11%) Query: 237 RIGTIAYNIGIVGNQCTPLSNN---LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ + P+ + + + + L T P + A + + E+ Sbjct: 409 RFNVIRFDDTMDLLFPAPVPADEAHRDAARRFVAALEARGGTEMLPPLRAALADPHPEEG 468 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + ++F+TDG A N R +++ + + + P G Sbjct: 469 DR---------VRQIVFLTDG------AIGNEEQIFSAISAGRGRS-RLFMIGIGSAPNG 512 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + G + A+ ++ E ++ K++ V Sbjct: 513 HLMTHAAELGGGSYTAIGTIDQVAERTAELLAKLESPVVT 552 >gi|46581745|ref|YP_012553.1| hypothetical protein DVU3344 [Desulfovibrio vulgaris str. Hildenborough] gi|46451168|gb|AAS97813.1| hypothetical protein DVU_3344 [Desulfovibrio vulgaris str. Hildenborough] gi|311235373|gb|ADP88227.1| Protein of unknown function DUF2134, membrane [Desulfovibrio vulgaris RCH1] Length = 389 Score = 41.8 bits (96), Expect = 0.21, Method: Composition-based stats. Identities = 28/203 (13%), Positives = 61/203 (30%), Gaps = 11/203 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ T A+DL + R+QMQ A A++ + + + + D Sbjct: 18 IVALSSFALLGLGTMAVDLGVVYTKRSQMQKAA------DIAALAGAQALINSSGNTDMA 71 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQIN---ITKDKNNPLQYIAESKAQYEIPTEN 117 T + +L QG D N T N + A+ + Sbjct: 72 RTQAITTARANLAQGDVPDRAVRDGDVTFSNNAAINTSFPMNRIDVHIRRNAEAG-NSVG 130 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 L L + ++L++ + + + ++ V D + + +N + Sbjct: 131 LIFANLFGDSYSDLTVMARAEAVPACRSC-VAPLSVPDKFTWNDKCDPDRKLNNGALDPT 189 Query: 178 KYLLPPPPKKSFWSKNTTKSKYA 200 + +S + Sbjct: 190 SSCEMASVQLIGYSPADFGTPVV 212 >gi|320103513|ref|YP_004179104.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] gi|319750795|gb|ADV62555.1| von Willebrand factor type A [Isosphaera pallida ATCC 43644] Length = 342 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 21/139 (15%), Positives = 45/139 (32%), Gaps = 26/139 (18%) Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 + ++ L K V+ +TDG N + A N L+ E + Sbjct: 168 IARPLEDGTNLGDALALAADALRDQDALSKVVVLVTDGRN--SPALPNPLDPQVAAELLD 225 Query: 337 NAGMKIYSVAVSAPPEGQDL----------------------LRKCTD-SSGQFFAVNDS 373 + G+ ++ +A+ + L L + + GQ F + + Sbjct: 226 DLGITLHILALGTEDPTELLETETTSRPATEPRTNLASLGSELERLAQRAGGQVFEITNP 285 Query: 374 RELLESFDKITDKIQEQSV 392 L + I D+++ + Sbjct: 286 GALRAALAAI-DRLETSRL 303 >gi|260177188|gb|ACX33912.1| hypothetical CBS domain-containing protein [uncultured prokaryote AT3] Length = 433 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 16/118 (13%), Positives = 33/118 (27%), Gaps = 6/118 (5%) Query: 1 MTAIIISVCF----LFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTK 56 M+A+ I F A ++A R ++Q D+ LS ++ ++ Sbjct: 1 MSALGILAFLIFTCGFFALA-EMALASSRRAKLQQLADSGDLSAARALQI-KSTPSRFIA 58 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIP 114 QT + + + I Q + + + I Sbjct: 59 ATQTGLTAASLLAGIFGENALAIHVEHFIEQSLPVLNAAKSEIAITATVTVVTAFAIV 116 >gi|83720829|ref|YP_443052.1| hypothetical protein BTH_I2535 [Burkholderia thailandensis E264] gi|167582052|ref|ZP_02374926.1| hypothetical protein BthaT_28172 [Burkholderia thailandensis TXDOH] gi|167620215|ref|ZP_02388846.1| hypothetical protein BthaB_28164 [Burkholderia thailandensis Bt4] gi|83654654|gb|ABC38717.1| conserved hypothetical protein [Burkholderia thailandensis E264] Length = 602 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 18/106 (16%), Positives = 38/106 (35%), Gaps = 1/106 (0%) Query: 2 TAIIISVCFLFITYAI-DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 A I + + D+ ++ ++R +Q D A L+G + + + + + Sbjct: 29 VAAAIWMLVAIAALGVVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCSQPNAAAAANAS 88 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAE 106 S F + + S R + A + + T N +Q A Sbjct: 89 SNGFDRAASGNTLTVSCGRWDTQSNAAPSYFSTTSTPLNAVQVTAT 134 >gi|17229534|ref|NP_486082.1| hypothetical protein alr2042 [Nostoc sp. PCC 7120] gi|17131133|dbj|BAB73741.1| alr2042 [Nostoc sp. PCC 7120] Length = 608 Score = 41.8 bits (96), Expect = 0.22, Method: Composition-based stats. Identities = 24/228 (10%), Positives = 65/228 (28%), Gaps = 19/228 (8%) Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 ++ L P W S + S VN+ + + + + Sbjct: 29 FSRVLAQNPKGGIDWIVVVDTSASMRGVGGTRNIFTQVKNSINEFVNTAR--VGDTVTIY 86 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKES 294 + + + ++K +N L T+T A ++ + Sbjct: 87 NFDSDVTLQ---AQEIPIASNPDRGKLKQIINNLKADGVRTHTG----KAVQQALSTSAK 139 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK--IYSVAVSAPPE 352 + + ++F+TDG + +R K ++ V++ Sbjct: 140 LNQRPNTADRTVSIVFLTDGLEDVQGIPNPVPIPQST-QLLREQECKPYVFFVSLGLKEH 198 Query: 353 GQDLLRKCTDSS-----GQFFAVNDSRELLESFDKITDKIQEQSVRIA 395 + L + + G+ +L + I ++ + + ++ Sbjct: 199 -EKQLNEFASNPALCGKGRVLRDPGGVQLNKLAQNIRPRLIKPQIDVS 245 >gi|158298451|ref|XP_318625.3| AGAP009598-PA [Anopheles gambiae str. PEST] gi|157013887|gb|EAA14534.4| AGAP009598-PA [Anopheles gambiae str. PEST] Length = 1124 Score = 41.4 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 35/347 (10%), Positives = 85/347 (24%), Gaps = 21/347 (6%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN 92 +D+A + + + + + + + + Sbjct: 1 MDSAEQAALSESDPESATSKAHPSAFYDARRINEYQSDGRLAEGSRQMLLRHMRRFEGYP 60 Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + + + + + P +K L + S S Sbjct: 61 V---NISLSSVLLPAGVSLDDPETQSAIKWSSHLD--PLFANNIERDSALSWQYFGSSTG 115 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 L + S + Sbjct: 116 FLRRFPGTAWPPETSYGSKEINDFRSEDWFIQAASSPKDVIILLDSSGSMSGKEYQLAVA 175 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + + + + R+ + +V +N+ EVK+ +N + Sbjct: 176 TASAILDTLGDDDFFNLISFSDQSRVIVPCFQDKMVRATP----DNVKEVKTAINAVECE 231 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 N A+ A+ L +SS + + + ++ ITDG + N + Sbjct: 232 NTANFSAALETAFELLRKYNQSSQGSQCN----QAIMLITDGPSDTFMEVIKHYNHPHM- 286 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLE 378 ++I++ + G L K ++ G F +N E + Sbjct: 287 ------PVRIFTYLIGTDKSGGKNLYKMACENKGFFVQINSPEEAKK 327 >gi|123414681|ref|XP_001304538.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121885997|gb|EAX91608.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 709 Score = 41.4 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 19/193 (9%), Positives = 56/193 (29%), Gaps = 31/193 (16%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN--EVKSR 265 +I L++S+ + R I + +N N S Sbjct: 256 SRIGNAKFCLNILIHSLP--------IGCRFSIIQFGDSYKETVSICDYSNRNVRNAMSA 307 Query: 266 LNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 + +N T+ + + ++ + + +TDGE + + + Sbjct: 308 IAGINADMGGTDILSPLEYVFK-----------KKLEKGFIRKIFLLTDGEVNNSDEICS 356 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKI 383 + +I+++ + + L++ + + G + + D + ++ Sbjct: 357 KA-------QLERENNRIFAIGLGSGA-DPGLIKNVSIKNGGNYVLIADEYNMNNMIVEL 408 Query: 384 TDKIQEQSVRIAP 396 S+ Sbjct: 409 MKSAISPSLTNIS 421 >gi|269128332|ref|YP_003301702.1| TadE family protein [Thermomonospora curvata DSM 43183] gi|268313290|gb|ACY99664.1| TadE family protein [Thermomonospora curvata DSM 43183] Length = 149 Score = 41.4 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 10/99 (10%), Positives = 27/99 (27%), Gaps = 2/99 (2%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 T ++ L +D ++ R + + A +G I + D + Sbjct: 24 TVLLTPSVLLLAGLLVDGGLAIHARQRAADMAEQAARAGANEIDVQTLRATGEPRIDAAA 83 Query: 62 TIFKKQ--IKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + ++ + G + + I Sbjct: 84 ACASAYDLLDAYGQEVADRDCLPGAEEVRITVTIRVRPQ 122 >gi|281203348|gb|EFA77548.1| type A von Willebrand factor domain-containing protein [Polysphondylium pallidum PN500] Length = 816 Score = 41.4 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 16/175 (9%), Positives = 48/175 (27%), Gaps = 19/175 (10%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + ++ + + G P++ + + ++ + N Sbjct: 321 NANVLIIGDGTQKPITISGDDLHNAEKLIAHGAQHTLNKPITKTRSMLLKKVLAMYADGN 380 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC-- 332 T PA+ A + ++ TDG ++ +L+ Sbjct: 381 TALGPAIAVAVGM------------TAAVPGSTIVVATDGLSNAGIGSLEGWKSLESAKF 428 Query: 333 -----EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 E + G + + + + + + + + + D ++ E F Sbjct: 429 YNEIGEIAKKNGTTVSVLTIKGTDTKLEQIGQLAEMTNGTVDIVDPLKIKEEFKA 483 >gi|301755424|ref|XP_002913579.1| PREDICTED: voltage-dependent calcium channel subunit alpha-2/delta-3-like [Ailuropoda melanoleuca] Length = 816 Score = 41.4 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 22/142 (15%), Positives = 47/142 (33%), Gaps = 13/142 (9%) Query: 244 NIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 G + L L+ + L+KL A++ A+ L + + +I S Sbjct: 30 PSGSMAVGIPTLLPYLSHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQGSICS-- 87 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 + ++ ITDG N ++I++ + D L+ + Sbjct: 88 --QAIMLITDGAVDTYDTIFAKYNWP-------ERKVRIFTYLIGREAAFADNLKWMACA 138 Query: 364 SGQFFAVNDSRELLESFDKITD 385 + FF L + + + + Sbjct: 139 NKGFF--TQISTLADVQENVME 158 >gi|3273299|dbj|BAA31193.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.4 bits (95), Expect = 0.22, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVNHAVPLAMKLIQQLNLNENAIHLYVNDFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|145483523|ref|XP_001427784.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124394867|emb|CAK60386.1| unnamed protein product [Paramecium tetraurelia] Length = 293 Score = 41.4 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 13/118 (11%), Positives = 31/118 (26%), Gaps = 34/118 (28%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++ L T+ M L + K+ + + + Sbjct: 64 AISSLQARGGTDIGNGMKMVLSILKHRKDKKPVSAEEKVREDLYQY-------------- 109 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 +R++ I +V + GQF+ + + + E F + Sbjct: 110 ---------KIRDS-FTIKTVGFGGDCRPK----------GQFYFIPNLTNIDECFTE 147 >gi|116622792|ref|YP_824948.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225954|gb|ABJ84663.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 360 Score = 41.4 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 20/112 (17%), Positives = 44/112 (39%), Gaps = 31/112 (27%) Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS----------------- 348 K ++ ++DG ++G+ ++ E + A +YSV Sbjct: 218 KALVILSDGVDTGSKVSLSS-----AVESAQRADTLVYSVLFEDREAYGVPGFGGMGRGR 272 Query: 349 --------APPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQS 391 AP G+ +L + + + G+FF V+ L + + +I + ++ Q Sbjct: 273 GGRRPMPTAPANGKKVLERISTETGGRFFEVSKKEPLEKIYQEIEEDLRHQY 324 >gi|330470624|ref|YP_004408367.1| hypothetical protein VAB18032_03415 [Verrucosispora maris AB-18-032] gi|328813595|gb|AEB47767.1| hypothetical protein VAB18032_03415 [Verrucosispora maris AB-18-032] Length = 555 Score = 41.4 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 35/361 (9%), Positives = 85/361 (23%), Gaps = 43/361 (11%) Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN------NPLQY 103 D + ++ + G ++++ N L Sbjct: 192 ADDNKDSAATVAATLRRLATRTFSPGEQASSLTPGRDLPGRVDVVAASEQAVLEHNALTT 251 Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 + A Y + R + A S + + Sbjct: 252 SGKLVAAYPEIAVPWLDLPYVVLPAAQGRARDAAARLLTELLTATSRDIFVGHGFRTASG 311 Query: 164 YLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI--------- 214 + ++ + +++ P P + + T A +DV Sbjct: 312 FPPAMPSDDRLRADERRPVPLPAEETVTDVLTGWSGVQRSARILTLLDVSGSMAAQVPGG 371 Query: 215 ----ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP---LSNNLNEVKSRLN 267 ++ + + L V + + P L + + L Sbjct: 372 GTRLDATVRAAEEGAALLLDNSELGVWAFATNLDGERDHQEILPVAPLGAQRDRLAEALA 431 Query: 268 KLN--PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + P T Y + AYR+ V+ +TDG + ++ Sbjct: 432 AVRVEPQGGTGLYDSTLAAYRDARRNWTPGRI--------NLVLVMTDGR----NEDDDS 479 Query: 326 LNTLQICEYMRN-----AGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLES 379 + + ++ + I + + + L + G+ F + Sbjct: 480 IGRAALLAELKRLQEPRRPLPIIFIGLG-GAIDPEELEAIAKVTGGRVFVTAQPSGMRRI 538 Query: 380 F 380 F Sbjct: 539 F 539 >gi|160716|gb|AAA29773.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 568 Score = 41.4 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 S + L P+ TN A+ + L + + V+ +TDG Sbjct: 115 LSIIKSLLSTNLPFGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRNLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273277|dbj|BAA31182.1| thrombospondin-related protein [Plasmodium falciparum] gi|3273281|dbj|BAA31184.1| thrombospondin-related protein [Plasmodium falciparum] gi|3273283|dbj|BAA31185.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 41.4 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVNHAVPLAMKLIQQLNLNENAIHLYLNDFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|313233582|emb|CBY09754.1| unnamed protein product [Oikopleura dioica] Length = 620 Score = 41.4 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 25/217 (11%), Positives = 59/217 (27%), Gaps = 23/217 (10%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 + T + + I +ES + + ++ +I Sbjct: 348 NGGYNNQYAPAGPTTCQAQNLNIVFSVDISASMESVRDFWGWFRGFVRFFDFSKQKISIN 407 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 + ++ + + L +++ ++ L K S + G Sbjct: 408 TFADDANVILDLA-HHDSQHIMDTIFNLQQQGASSSQDSV-----LLKGMKTSRVSLKGL 461 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK-- 359 VI TDG + E G+ + SV + + L+++ Sbjct: 462 EGENSVVIVFTDG------WNSEPKTPVFHGEEAYKDGINMISVGIG-----EKLMKEYL 510 Query: 360 --CTDSSG-QFFAVNDSRELLESFDKITDKIQEQSVR 393 + + V EL + D+I + + Sbjct: 511 YTIATGNQRNVYEVKT-DELNSITASLQDQICSSTSK 546 >gi|304390827|ref|ZP_07372779.1| hypothetical protein HMPREF0574_1557 [Mobiluncus curtisii subsp. curtisii ATCC 35241] gi|304325710|gb|EFL92956.1| hypothetical protein HMPREF0574_1557 [Mobiluncus curtisii subsp. curtisii ATCC 35241] Length = 171 Score = 41.4 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 14/135 (10%), Positives = 39/135 (28%), Gaps = 4/135 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A ++ + ++ A+ + + + ++Q D A++ ++ D Sbjct: 35 LAACVLLCVIILMSMAV--SGVYLEQRRLQRLADQTASMAAANMADTAYYQNGIV--DGV 90 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + I++ + +A +IP + Sbjct: 91 PLEIEPYHASERAAEYLSGASISANSGLDGIDLVDVDVASTRVQVTLRATGKIPLVLPLV 150 Query: 121 KGLIPSALTNLSLRS 135 L LT S Sbjct: 151 SSLTQVELTATGAAS 165 >gi|297579199|ref|ZP_06941127.1| conserved hypothetical protein [Vibrio cholerae RC385] gi|297536793|gb|EFH75626.1| conserved hypothetical protein [Vibrio cholerae RC385] Length = 1778 Score = 41.4 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 33/296 (11%), Positives = 84/296 (28%), Gaps = 28/296 (9%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 ++ +SI + + + T+ Q + + + G+Y A + T Sbjct: 1024 IADGSSISNSQDTTNNTSADGQLNLSNVAHLSMGIPTGNYTSNGAAISWVLSADKQTLTG 1083 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGL-----IPSALTNLSLRSTGIIERSSENLAISICM 152 + + E + + + +++ + + Sbjct: 1084 SAGGNKVVEFTLDNQGKVHSTLHSPIDHANKSGEDSLAINIPLEAKNAAGAIGTGKVTLI 1143 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 + D + +D++ ++ + + +L ++ V Sbjct: 1144 IEDDAPIAKDIFHMTESETKQGANVQLMLDV------------SGSMGRDAGNGKTRLQV 1191 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNP 271 + ESA L+ Q Q ++ I ++ + L + E K+ +N L+ Sbjct: 1192 MKESAIQLIEQYQALGQT------KVQLILFSSDASIKTASGLLWMTVAEAKNYINALSA 1245 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 T+ A+ A + N + F++DG G Sbjct: 1246 NGGTDYDDAIKLA----QESWSGTINGQPLSGATNVSYFLSDGVPEGYDWELKNSQ 1297 >gi|270159275|ref|ZP_06187931.1| type IV fimbrial biogenesis PilY1-related protein [Legionella longbeachae D-4968] gi|289165901|ref|YP_003456039.1| type 4 fimbrial biogenesis protein PilY1 [Legionella longbeachae NSW150] gi|269987614|gb|EEZ93869.1| type IV fimbrial biogenesis PilY1-related protein [Legionella longbeachae D-4968] gi|288859074|emb|CBJ13003.1| putative type 4 fimbrial biogenesis protein PilY1 [Legionella longbeachae NSW150] Length = 1159 Score = 41.4 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 34/337 (10%), Positives = 78/337 (23%), Gaps = 43/337 (12%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + + + + A + P N L L+ Sbjct: 194 NYTSASSTIQSNCSSIASFYGSSVLSSNAYMQIGATSDNPAINDVLYAGSGFPGVFLTYS 253 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 N +IS +V S + + Sbjct: 254 GPSPATPYPPNFSISNYNNGNVLISYSKSLPN--------------IGGFGTSPTNAGFV 299 Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 S ++GN++ + A S+ + + + Sbjct: 300 PYSPQVLYSQRGFGYSGTQSATSGNVLVPMTSAGTNPTTTSINNAIGIFTPYLQPETNST 359 Query: 255 LSN--NLNEVKSRLNKLNPY--------ENTNTYPAMHH----------AYRELYNEKES 294 + + V+S + L NT+ ++L + Sbjct: 360 STKEIKASAVQSPVAGLMTTAKTYLTSLGNTSGNGCPQKKYVILISDGLPTQDLNGKLWP 419 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-- 352 + +T F DG + Q + + ++ G+ Y + + A Sbjct: 420 PLGSAAATGYGITATFNPDGSLQSTN-CQALTDAISAITALKQQGIPTYIIGLGAGVNPS 478 Query: 353 ----GQDLLRKCTDSSG--QFFAVNDSRELLESFDKI 383 L + G ++ +S +L+ + I Sbjct: 479 LNPQAAASLTAMAVAGGTTNYYPATNSTDLVNDLNAI 515 >gi|84996511|ref|XP_952977.1| thrombospondin-related protein [Theileria annulata strain Ankara] gi|71532874|emb|CAJ20069.1| thrombospondin-related protein, putative [Theileria annulata] Length = 606 Score = 41.4 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 31/188 (16%), Positives = 68/188 (36%), Gaps = 19/188 (10%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 I+ + L + ++ + + + T + +I + + P S + + L+KL Sbjct: 225 IEEWNKLIPFLKSLVRSINISPNYVHLSMVTFSTSIRWLISFLDPASKDEQLALAVLDKL 284 Query: 270 NPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 T T A++ + + K +I ITDG ++ + Sbjct: 285 KNSKPVFGYTFTGQALNFISEAV-------YMFGARRNSPKGIIIITDGSSTQTN----- 332 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQ-DLLRKCTDSSGQ-FFAVNDSRELLESFDKI 383 T Q +R+AG+ I V V E + + C+ F + + E++ ++ Sbjct: 333 -VTSQASALLRDAGVTILVVGVGKAKESECRGIVGCSTKGECPLFFMTNWDEIIRKVGEL 391 Query: 384 TDKIQEQS 391 ++ E Sbjct: 392 MAEVCETI 399 >gi|86137907|ref|ZP_01056483.1| hypothetical protein MED193_08593 [Roseobacter sp. MED193] gi|85825499|gb|EAQ45698.1| hypothetical protein MED193_08593 [Roseobacter sp. MED193] Length = 181 Score = 41.4 bits (95), Expect = 0.23, Method: Composition-based stats. Identities = 10/36 (27%), Positives = 23/36 (63%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAV 37 AI++ + +F+ ++L ++ ++Q++ ALD AV Sbjct: 25 FAILMPLFLMFLFSTVELGMLILRQSQLERALDIAV 60 >gi|296228555|ref|XP_002807723.1| PREDICTED: LOW QUALITY PROTEIN: integrin alpha-10-like [Callithrix jacchus] Length = 1197 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 39/215 (18%), Positives = 63/215 (29%), Gaps = 29/215 (13%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + K + + +++G + Y V Sbjct: 191 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGKLFIDPEQ--IQVGLVQYGESPVHEWSLG 248 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV S L+ E T T A+ A E +++ + + +T Sbjct: 249 DFRTKEEVVSAAKNLSRREGRETKTAQAIMVACTEGFSQSHGGRPEA-----ARLXVVVT 303 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ + CE R + Y +AV P +R Sbjct: 304 DGESHDG-------EDTKGCEAGR---VTRYGIAVLGHYLRRQRDPSSFLREIRTIASDP 353 Query: 365 G--QFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 FF V D L + D + D+I A N Sbjct: 354 DERFFFNVTDEAALTDIVDALGDRIFGLEGSHAEN 388 >gi|309361018|emb|CAR99586.1| hypothetical protein CBG_25533 [Caenorhabditis briggsae AF16] Length = 278 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 29/200 (14%), Positives = 59/200 (29%), Gaps = 26/200 (13%) Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNL-VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 T N + + + +N + + L G Q Sbjct: 94 TPYKGLIATVFEMNAAQYEIRDVVNFVKLNLFNSSNYDF--LYSTQAINIPYGGTHQFQK 151 Query: 253 TPLSN----NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 L N+ +++ ++ L + + P + L + ++ Sbjct: 152 LFLEYGRAKNVTDLQYNIDNLYRHSILDRNPTVADGLSWLLDYRDP-------PMYDTQG 204 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 + I G ++ ++ T +R KI SVAV L + DS +F Sbjct: 205 VIIVVGNHN----DESPSLTSAYLGSLRAQRYKIISVAVGKNHGD---LSQIADSPNFYF 257 Query: 369 AVNDSRELLESFDKITDKIQ 388 V++S + D+I Sbjct: 258 QVDESNG-----QDVADRIS 272 >gi|221112732|ref|XP_002163909.1| PREDICTED: similar to tyrosine kinase receptor, partial [Hydra magnipapillata] Length = 1516 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 14/65 (21%), Positives = 25/65 (38%), Gaps = 1/65 (1%) Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 G + ++ + MRN G+ I ++ + P Q L SS + Sbjct: 1079 GLTMSDKPHNDSNKLASLALKMRNKGVIIVAIGIG-PNVDQKKLTDIAGSSDKVLLAEKM 1137 Query: 374 RELLE 378 ++LLE Sbjct: 1138 QDLLE 1142 >gi|254506780|ref|ZP_05118920.1| conserved hypothetical protein [Vibrio parahaemolyticus 16] gi|219550361|gb|EED27346.1| conserved hypothetical protein [Vibrio parahaemolyticus 16] Length = 385 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 19/148 (12%), Positives = 44/148 (29%), Gaps = 3/148 (2%) Query: 6 ISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFK 65 + +ID+ H M + ++Q+++DAA L+ + + + T T T Sbjct: 1 MLALIGVAALSIDVNHAMLNKTRLQNSVDAAALAAAIVLDNGGSEAQATAAAKTTLTNVA 60 Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIP 125 + + A + + Y + + + + + F + Sbjct: 61 NATGNTEMDFTSAQVVVQFSNDPATFPFSGFDPDEDSYARVAVSNFAL---SNFFAHVFS 117 Query: 126 SALTNLSLRSTGIIERSSENLAISICMV 153 S G ++ I V Sbjct: 118 VDKNLASTAVAGPSPSVIDSNNIVPMAV 145 >gi|159900441|ref|YP_001546688.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] gi|159893480|gb|ABX06560.1| von Willebrand factor type A [Herpetosiphon aurantiacus ATCC 23779] Length = 978 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 15/108 (13%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 N + ++ A L G + +I + DG S Q N +++ Sbjct: 486 GGGINVHDSLVAAGNVL----------KGRNAPIRHIILLADG-----SDSQQQENAVRL 530 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 + R G+ ++A+ + L G+ F V D+ L + Sbjct: 531 TDEHRRLGITTSTIAIGNGGDVGFLNNVAVAGGGRHFLVEDALSLPDI 578 >gi|3273285|dbj|BAA31186.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVNHAVPLAMKLIQQLNLNENAIHLYLNDFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273297|dbj|BAA31192.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVNHAVPLAMKLIQQLNLNENAIHLYLNDFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273289|dbj|BAA31188.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVNHAVPLAMKLIQQLNLNENAIHLYLNDFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273259|dbj|BAA31173.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVNHAVPLAMKLIQQLNLNENAIHLYLNDFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273255|dbj|BAA31171.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVNHAVPLAMKLIQQLNLNENAIHLYLNDFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3273287|dbj|BAA31187.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 56/170 (32%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVNHAVPLAMKLIQQLNLNENAIHLYANDFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429455|gb|AAW78131.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 557 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 26/162 (16%), Positives = 52/162 (32%), Gaps = 23/162 (14%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--------VKSRL 266 + V K IQ+ I + L ++ ++ +KS L Sbjct: 63 NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKALIIIKSLL 122 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + P+ TN A+ + L + + V+ +TDG + Sbjct: 123 STNLPFGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD------SIQ 169 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 ++L+ +++ G+KI + L C S G+ Sbjct: 170 DSLKESRKLKDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|302767296|ref|XP_002967068.1| hypothetical protein SELMODRAFT_408444 [Selaginella moellendorffii] gi|300165059|gb|EFJ31667.1| hypothetical protein SELMODRAFT_408444 [Selaginella moellendorffii] Length = 2121 Score = 41.4 bits (95), Expect = 0.24, Method: Composition-based stats. Identities = 13/105 (12%), Positives = 35/105 (33%), Gaps = 16/105 (15%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + T + A L + +IF++DG N+G S + + Sbjct: 1980 DSSGGTVYSAGLQLAEEILGRSAGDAKAP--------AIIFLSDGGNAGGS------DPV 2025 Query: 330 QICEYMRNA--GMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 +++ + ++++ + + +L G F + + Sbjct: 2026 AFVRKIKSMEPRLVVHTIVFGSDLSPRKVLVDMAREGGGVFQITN 2070 >gi|24374985|ref|NP_719028.1| hypothetical protein SO_3479 [Shewanella oneidensis MR-1] gi|24349714|gb|AAN56472.1|AE015784_5 hypothetical protein SO_3479 [Shewanella oneidensis MR-1] Length = 596 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 9/60 (15%), Positives = 23/60 (38%), Gaps = 2/60 (3%) Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++ +++ S + L + + G++F D+ +L S + I E+ Sbjct: 471 NLAGKQTVRTFTIGFSEGAASAEHLLKQTAENGGGKYFDATDASQLRSSLQTALNNILEK 530 >gi|88801114|ref|ZP_01116660.1| hypothetical protein MED297_05449 [Reinekea sp. MED297] gi|88776143|gb|EAR07372.1| hypothetical protein MED297_05449 [Reinekea sp. MED297] Length = 553 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 14/95 (14%), Positives = 31/95 (32%), Gaps = 12/95 (12%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 ++ + +++ T T A+ A EL N K + ++DGE Sbjct: 439 KSQFLGAVERMSAGGGTATNDAILVAAHELLN------FAKTHPEHKLTIFVLSDGETRN 492 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + +Q+ + ++S+A Sbjct: 493 GLPLGDVEKVIQML------NIPVHSIAYGFESAD 521 >gi|291443250|ref|ZP_06582640.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] gi|291346197|gb|EFE73101.1| von Willebrand factor [Streptomyces roseosporus NRRL 15998] Length = 597 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 25/198 (12%), Positives = 59/198 (29%), Gaps = 28/198 (14%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQ------EKKNLSVRIGTIAYNIGIVGNQCTPLS 256 P ++DV ES ++ + + +G+ Sbjct: 403 PGRNQSRMDVTKESLIQALDQFTPNDEIGLWEFATTLDGEKDYRRLMETKRLGDPAAGGG 462 Query: 257 NNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + ++ + L P T Y L + KE+ + ++ +TDG Sbjct: 463 THREKLTAAFAGLQPVPGGATGLYD------TTLASYKEARSTYVKGKF--NALVILTDG 514 Query: 315 ENSGASAYQNTLNTLQICEYMRN-----AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 N + + + ++ + + ++AV + ++ + G + Sbjct: 515 SNQDTNGISRS----GLITELKELVDPERPVPVIAIAVGPDADRDEVAEIARITGGDGYE 570 Query: 370 VNDSRELLESFDKITDKI 387 V+D E+ I I Sbjct: 571 VSDPAEIQAV---ILQAI 585 >gi|62733745|gb|AAX95854.1| hypothetical protein LOC_Os11g11610 [Oryza sativa Japonica Group] gi|77549354|gb|ABA92151.1| hypothetical protein LOC_Os11g11610 [Oryza sativa Japonica Group] Length = 182 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 3/42 (7%), Positives = 14/42 (33%) Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 +++ + + + G F + + + + F + Sbjct: 89 PVHTFGFGKDHDAVAMHTIAEVTGGTFSFIENEAAIQDGFAQ 130 >gi|321464007|gb|EFX75018.1| hypothetical protein DAPPUDRAFT_323732 [Daphnia pulex] Length = 923 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 16/111 (14%), Positives = 41/111 (36%), Gaps = 23/111 (20%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T + A L+ ++ +TDG + ++ + Sbjct: 378 TCIGCVLQMAIDILHPGGNG-----------GVIVLVTDGLENEYPFIRD------VTPE 420 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVND---SRELLESFD 381 + +A +++ S+A E + + ++G+ + +ND + EL ++F Sbjct: 421 LIDAKIQVVSIAFGRDAENE--IENLATKTNGKSYFINDNGNNDELNDAFT 469 >gi|320162872|gb|EFW39771.1| ubiquitin [Capsaspora owczarzaki ATCC 30864] Length = 563 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 49/160 (30%), Gaps = 12/160 (7%) Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCT-----PLSNNLNEVKSRLNKLNPYENTNTYPA 280 S+RI A+ L++++ ++ S + K+ + Sbjct: 217 STKLMSNISSLRISICAFGDYCDQFSSYVLKQEDLTSDIKKLCSFVEKVGNTGGGDAPEC 276 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + R + + S K ++ I D + A ++ E + + G+ Sbjct: 277 YEYVLRRIQDLSW-------SEDASKALVLIGDADPHPADYTSLHIDWRLEAEQLAHMGI 329 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 K+YSV + G F + + + F Sbjct: 330 KVYSVQAGGSSGTHFYRDVAAMTGGYFVPFTQFQLITDMF 369 >gi|118361111|ref|XP_001013786.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89295553|gb|EAR93541.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 357 Score = 41.4 bits (95), Expect = 0.25, Method: Composition-based stats. Identities = 24/164 (14%), Positives = 49/164 (29%), Gaps = 26/164 (15%) Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV---GNQCTPLSNNLNEVKSRLNKLNPYE 273 + + K I E+ + I ++ + N+ + +NKL Sbjct: 203 IQFVKAQLTKTINEQLKKYQKFNIITFSNQATYWKPDVIDATPENILAAITYINKLGTSG 262 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS-AYQNTLNTLQIC 332 TN + A+R S + + ++DG + + L Sbjct: 263 ATNISGGLDLAFR--------------SKEVLNTIYLLSDGVPNSGVMTIEGIKKYLTDK 308 Query: 333 EYMRNAGMKIYSVAV--SAPPEGQ------DLLRKCTDSSGQFF 368 R +KI +++ P Q + L D++ F Sbjct: 309 NQNRQEKVKINTISFILGGPENQQERTLSFEFLNAIADATNGSF 352 >gi|293604651|ref|ZP_06687053.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] gi|292816982|gb|EFF76061.1| conserved hypothetical protein [Achromobacter piechaudii ATCC 43553] Length = 3744 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 38/344 (11%), Positives = 93/344 (27%), Gaps = 38/344 (11%) Query: 44 IVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY 103 D + + S+ + + G+ I + + T N Sbjct: 2923 TNDAPVAADNSANVEIGSSHVFTIAEFNFSDGAEGNSLQSVIISRLPTDGTLTLNGNPVT 2982 Query: 104 IAESKAQYEIPTENLFLK----GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + + + +I L GL S + + + + + D + Sbjct: 2983 VNTAVSAADIAAGKLVFTPSANGLDTSIGFQVRDNGGTDHGGQNTSGTYNFVLNTDNIVT 3042 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPA----------------- 202 E++ N + + K Y A Sbjct: 3043 GENVGSGTGNTPVLNGGSGNDIILGDKGGTVVTVEPGKNYNIALVVDTSGSMAYKLDGST 3102 Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 +I ++ ++ NL N + V + A G S N+ + Sbjct: 3103 NGSGQSRIALVKDALTNLANQLVGHDGIVN---VTLIGFATTAGTPVTLQNLTSANVQTL 3159 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA--- 319 + + L+ TN A + A + +++ + + F+TDG+ + Sbjct: 3160 LTAITNLSATGGTNYEAAFNSAVSWFNS--QTAAGKSVAAGYENVTFFLTDGDPTYYLKN 3217 Query: 320 ----SAYQNTLNTLQICEYMRN----AGM-KIYSVAVSAPPEGQ 354 +T + + E + +G+ ++ + + + Sbjct: 3218 NGNVGGDGSTTDMTTVQESVNAFAPLSGVSTVHGIGIGSGVNED 3261 >gi|156554405|ref|XP_001604762.1| PREDICTED: similar to GA20484-PA [Nasonia vitripennis] Length = 383 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 14/164 (8%), Positives = 52/164 (31%), Gaps = 21/164 (12%) Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 S Y ++ ++ + +S ++ E + Sbjct: 10 VDNSDYMRNGDFLPTRLQAQQDAVNLVCHSKTRSNPENN--------VGLMTLANVEVLA 61 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 L++++ + S+L+K+ P N + + A+ L + + +H + + Sbjct: 62 TLTSDVGRILSKLHKVQPNGNLSLITGIRIAHLALKHRQGKNHKMRIVAFVGSPIQI--- 118 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 + +++ + ++ + + ++ ++L Sbjct: 119 ----------DEKEAVKLAKRLKKEKVNVDIISFGEESVNNEVL 152 >gi|25149045|ref|NP_741200.1| DIsplaced Gonad family member (dig-1) [Caenorhabditis elegans] gi|74961937|sp|Q09165|DIG1_CAEEL RecName: Full=Mesocentin; Flags: Precursor gi|22331940|gb|AAM78593.1| mesocentin [Caenorhabditis elegans] gi|26251538|gb|AAA50715.2| Displaced gonad protein 1, isoform a, confirmed by transcript evidence [Caenorhabditis elegans] Length = 13100 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 31/230 (13%), Positives = 74/230 (32%), Gaps = 27/230 (11%) Query: 152 MVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKID 211 ++ + + + + + + + P + Sbjct: 12343 KIVPNVETSRTWPTPRTKATTTSGTGRSCSSIDYESDVIIVLDSSENFTP------DEFV 12396 Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + ++ ++V++ + +IG + Y+ + + E+ ++ Sbjct: 12397 SMKDAVASIVDTGFDLAPDVS----KIGFVIYSDKVAVPVALGHYEDKIELLEKITDAE- 12451 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + A L N G K VI IT+G+N G N Sbjct: 12452 ----KINDGVAIALYGL-NAARQQFQLHGRENATKVVILITNGKNRG--------NAAAA 12498 Query: 332 CEYMRN-AGMKIYSVAVSAPPEGQDLLRKCTDSSG--QFFAVNDSRELLE 378 E +R+ G+++++VAV + PE +++ +S V S E+ + Sbjct: 12499 AEDLRDMYGVQLFAVAVGSNPEELATIKRLVGNSNTENVIEVAQSTEIDD 12548 >gi|312129894|ref|YP_003997234.1| von willebrand factor type a [Leadbetterella byssophila DSM 17132] gi|311906440|gb|ADQ16881.1| von Willebrand factor type A [Leadbetterella byssophila DSM 17132] Length = 329 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 32/241 (13%), Positives = 69/241 (28%), Gaps = 19/241 (7%) Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT------TKSKYAPAPAPANRK 209 S+ + D + DNNN T + W ++ T++ + Sbjct: 72 YSKVIIDNFKILDRDNNNYTIQNIKAYEYRGDNVWKEDVEFKVQYTQTTDVGEMVLVLDR 131 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + L + E+ S I P++NN + + + L Sbjct: 132 SESLGTDF-ERIKQYAAEFVEQTFASHPEVKIGVVDFSAYPSSLPITNNKEVILNYIKNL 190 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 T Y AM L S + ++ TDG ++ + A N + + Sbjct: 191 EMENFTALYDAMDMGVDMLLRSASQS----------RILVTFTDGTDNFSRATLNDVLSK 240 Query: 330 QICEYMRNAGMKIYSVAV-SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + ++ + + + + + + ++ E F+K I Sbjct: 241 INSDK-NLNKIRGFFIGLAGKGDLDTSVPTLLSSKGWIVSVPQSATQVKEVFNKFGRLIS 299 Query: 389 E 389 Sbjct: 300 N 300 >gi|258543794|ref|ZP_05704028.1| conserved hypothetical protein [Cardiobacterium hominis ATCC 15826] gi|258520969|gb|EEV89828.1| conserved hypothetical protein [Cardiobacterium hominis ATCC 15826] Length = 1128 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 48/374 (12%), Positives = 107/374 (28%), Gaps = 63/374 (16%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 + IN T N PL +ESK + + L S Sbjct: 16 IASAVGFICQNTLAQSTPSINDTPFANRPLHLQSESKTTSAGGVKPNVMLFLDDSGSMLE 75 Query: 132 SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW- 190 + + N + C + + L + N + + L ++ Sbjct: 76 NANTGSYRIPDLWNTYLGNCTYNQNNLLISRNLLPGNGQQTNPSMGRCLYYYNADIIYYR 135 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK-----KNLSVRIGTIAYNI 245 +K + A ++D I + ++ ++ + + Y Sbjct: 136 NKGLPPGPKSFLRPAAQVRMDANINAINEVLTKTGDSVNWHLLTLWGSEFRHLNGGFYKF 195 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 +G LS + + +N ++P T A L ++ + K Sbjct: 196 DNIGTASVGLS--AEKARKLVNNMSPIAGTPATERYLKAANVLDHQIKYR-------CQK 246 Query: 306 KFVIFITDGENSGA--------------SAYQNTLNTLQICEYMR--------------- 336 +++F++DGE +G Q+ + + + Sbjct: 247 NYIVFMSDGEANGGGFPWPGGVYGTQPQWWSQHDSSGPNSGDPGKGVSVFSSKLYNMDMR 306 Query: 337 -----------------NAGMKIYSVAVSAP--PEGQDLLRKCTDSSGQFFAVNDSRELL 377 ++ ++ P+G++ L+ S + N++ EL Sbjct: 307 VGGTDVEGGSWDDPKYPKQNIETITIGYGNGLTPQGRNYLKNAAQPSEGAYFANNASELA 366 Query: 378 ESFDKITDKIQEQS 391 ++F K K+Q Q+ Sbjct: 367 DAFLKALAKVQTQT 380 >gi|218778089|ref|YP_002429407.1| OmpA/MotB domain protein [Desulfatibacillum alkenivorans AK-01] gi|218759473|gb|ACL01939.1| OmpA/MotB domain protein [Desulfatibacillum alkenivorans AK-01] Length = 377 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 24/189 (12%), Positives = 66/189 (34%), Gaps = 23/189 (12%) Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 + + + L + +++ Q +L ++ G + +Q + L Sbjct: 48 DNFLVILDKSGSMRGGKLGTAVN----TLRHMNQTIPDLDLQAGLRTFGSICPFSQVSKL 103 Query: 256 SNNLNEVKSR------LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + + ++ + ++ A+ A +L S+ VI Sbjct: 104 EYGMTQYVTKEMCGALAVQKKASGDSPMDLALSGAADDLARTNGST-----------AVI 152 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA 369 ++DG + A ++ + IY++ + + P G+ +L K + G F+ Sbjct: 153 LVSDGYKNAMDAAAAVAAAKELKSRY-MNKVCIYTIQIGSDPTGKAILDKIVKAGGCGFS 211 Query: 370 VNDSRELLE 378 V ++ ++ + Sbjct: 212 V-NAADIAD 219 >gi|149921119|ref|ZP_01909577.1| hypothetical protein PPSIR1_24814 [Plesiocystis pacifica SIR-1] gi|149818006|gb|EDM77465.1| hypothetical protein PPSIR1_24814 [Plesiocystis pacifica SIR-1] Length = 428 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 39/339 (11%), Positives = 90/339 (26%), Gaps = 66/339 (19%) Query: 94 TKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMV 153 T+ + + E+ G + S + G + + + +V Sbjct: 44 TESSSGDTTAGDTTADDTTTLDEDTTADGTAEEESSTTSDPNCGEVSIVPTYVPPQVMLV 103 Query: 154 LDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVL 213 +D S SM + D N +++ + +D Sbjct: 104 VDASGSMVNNSWDHDLDPNT--------------------PQVTRWNTLHGVVSTVMDNF 143 Query: 214 IESAGNLVNSIQK--AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + + A + +S I + +N V + + Sbjct: 144 GPAMYAGIQRFPSEEACPDATPMSSNCYNSGSCIVGTQPEVGVSLDNGASVIAAIPGPTA 203 Query: 272 Y-----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 T M+ A L + E+ ++V+ ITDG + A Sbjct: 204 GNTEIVGGTPATKGMNSAVSHLEQQPEAFP---------RYVLLITDGAANCDQALSFPD 254 Query: 327 NTLQI-------CEYMRNAGMKIYSVAV---------SAPPEGQDL----LRKCTDSSG- 365 Q + + G+ + V + + L + G Sbjct: 255 YIEQYDETLPTTVQAAFDGGITTFVVGIDIEDMLQGVGTDGSPEANPFERLNDVAIAGGA 314 Query: 366 ---------QFFAVNDSRELLESFDKITDKIQEQSVRIA 395 +F+ + +ELL++ I ++ + ++ + Sbjct: 315 PKNEGMDLEKFYNTTNQQELLDAIQAILGEVTDCTIDLT 353 >gi|239817420|ref|YP_002946330.1| hypothetical protein Vapar_4453 [Variovorax paradoxus S110] gi|239803997|gb|ACS21064.1| conserved hypothetical protein [Variovorax paradoxus S110] Length = 589 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 23/172 (13%), Positives = 51/172 (29%), Gaps = 3/172 (1%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQ-T 60 TAI +S + +L ++ Y++ ++Q A D A L+G I + T Sbjct: 25 TAIALS-LIVITLIGTELGYLFYMKRELQKATDLAALAGAKEISYAGSCPSAKTAAKLSA 83 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + ++L + + + Q T D + ++ + + L Sbjct: 84 NGTGSTDRNRNLPISFSLEDAEIECGQWDPAKTTSDHFDSAPPDQQNAIRITLNRTPATL 143 Query: 121 KGLI-PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 + +T + + S+ + L N Sbjct: 144 LSFFEGNRTIQTKAVATNDPIAAFSIGTGVASLDEGAVNSLLNGLLGTGNKI 195 >gi|145516504|ref|XP_001444144.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124411548|emb|CAK76747.1| unnamed protein product [Paramecium tetraurelia] Length = 829 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 19/166 (11%), Positives = 51/166 (30%), Gaps = 18/166 (10%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-Y 272 ++ L+ ++ Q+ + I + N L + +++ Sbjct: 343 KKAKEALILFLKSLPQD--SEYNIISFGTNFTKLWNVSQNYSQNTLETAIKHVEEMDADM 400 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T + N S V +TDG+++ + + + Sbjct: 401 GGTCIIAPLKQMIYH--------KNYGASKNTTLNVFLLTDGQDTADPII-DLVQKNNLA 451 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + +IY++ + + R +G++ V+D ++ E Sbjct: 452 QT------RIYTLGIGRECSQYLIRRVAEVGNGKYQIVSDKEDINE 491 >gi|330503959|ref|YP_004380828.1| hemolysin-type calcium-binding repeat-containing protein [Pseudomonas mendocina NK-01] gi|328918245|gb|AEB59076.1| hemolysin-type calcium-binding repeat-containing protein [Pseudomonas mendocina NK-01] Length = 3977 Score = 41.4 bits (95), Expect = 0.26, Method: Composition-based stats. Identities = 29/349 (8%), Positives = 86/349 (24%), Gaps = 42/349 (12%) Query: 47 DRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAE 106 D T T+ ++ + Q + N A+ Sbjct: 3006 GGVQDDEGATTLVTGTLGYDYGDDGFGSFAWQPSGLPSVTSGGQAVEYQVSGNGQVLTAQ 3065 Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSL-----RSTGIIERSSENLAISICMVLDVSRSME 161 + ++ + L ++ + + MV + + Sbjct: 3066 TVGSHQPVFTVTLTNPATGAFTFELHAPLDHPAPASGSVENNLDFQFAYQMVDGNGSTAQ 3125 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKK-------SFWSKNTTKSKYAPAPAPANRKIDVLI 214 +D++ + + + + + K+ + Sbjct: 3126 GALHISVDDDSPAQPKDIAKSASEPQGIHTNLMVVLDLSGSMDDAPSGVSGFSTKLALAK 3185 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG-IVGNQCTPLSNNLNEVKSRLNKLNPYE 273 ++ L++S V + + + +++ + L Sbjct: 3186 DAVQRLIDSYD------NLGDVMVRIVTFANTASAVGNVWMTASDAKAWLTALANNAGNG 3239 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS-----------AY 322 +TN A+ A + + T ++ F++DG+ + ++ Sbjct: 3240 STNYDDALIKAMNAYDST-----GKLTGTGVQSVSYFLSDGQPTLSNANPGSNNSGSQYN 3294 Query: 323 QNTLNTLQICEYMR------NAGMKIYSVAVSAPPE-GQDLLRKCTDSS 364 + + E G+K Y++ + + + LL Sbjct: 3295 PELGDGIGAGEEADWIAFLTAKGIKSYALGMGLGSDLDKTLLNPIAYDG 3343 >gi|110633696|ref|YP_673904.1| hypothetical protein Meso_1343 [Mesorhizobium sp. BNC1] gi|110284680|gb|ABG62739.1| conserved hypothetical protein [Chelativorans sp. BNC1] Length = 571 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 14/155 (9%), Positives = 36/155 (23%), Gaps = 9/155 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M ++ V L + +D + R + Q+ D A ++ A+I T ++ Sbjct: 23 MAGLLFPVMLLGAVFGVDQGSLYLERREAQALTDLAAVTAVANISRANAAAALTMADNRQ 82 Query: 61 -------STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEI 113 + + + + + Sbjct: 83 GNVQLIDRATLSTPMAAGTGAQMLVEPGRYSADPSTESPWRFTPGAEPANAVRVTFRKKG 142 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAI 148 F G + + + E + + Sbjct: 143 KLY--FGAGFFEPPVITTTGIAAARAEAAFSIGSR 175 >gi|291223815|ref|XP_002731903.1| PREDICTED: chloride channel accessory 2-like [Saccoglossus kowalevskii] Length = 996 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 44/119 (36%), Gaps = 17/119 (14%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T+ + A + L N +S ++ ITDG + + N + Sbjct: 422 DGFTSIGAGLELALQVLENGNVASEGAS--------LLLITDGAENRSPFIAN------V 467 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 + ++G+++ + + Q +L+ + + G +F V D+ D + I + Sbjct: 468 LPDIYDSGVRVDT--FAYTESAQLILQNLSDTTGGLYFYVPDNDNSTAFIDSLAATITD 524 >gi|283779589|ref|YP_003370344.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] gi|283438042|gb|ADB16484.1| von Willebrand factor type A [Pirellula staleyi DSM 6068] Length = 1740 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 16/97 (16%), Positives = 35/97 (36%), Gaps = 6/97 (6%) Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR-NAGMKIYSVAVSAPPE---- 352 +K ++ ITDG+N +A T R + +K++ V + P Sbjct: 1315 AQEDGESEKSIVVITDGKNYQFNAPSQLARTKDDVLAARGSRDIKVHIVGFNIEPSESEV 1374 Query: 353 -GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 ++ + G+F + L+ S + + + Sbjct: 1375 ASREFREIAEATGGEFLPATSAGSLVRSLESVLRAAE 1411 >gi|226226934|ref|YP_002761040.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] gi|226090125|dbj|BAH38570.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27] Length = 565 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 18/171 (10%), Positives = 51/171 (29%), Gaps = 47/171 (27%) Query: 252 CTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 TPL+ + ++ L+ L+P ++ A+ L + Sbjct: 152 LTPLTGDDGALELFLDNLDPGVVGQAGSSLSRAIRQGSELLLAS---------DGSADRA 202 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP---------------- 351 ++ ++DGE ++ + + + G+ + +V Sbjct: 203 LVLLSDGE-----SFDSAEDIESAASEAGSKGISVVTVGFGTRDGATIPVREGTIVQQKR 257 Query: 352 ----------EGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 +LL + S+ F ++ + ++ +I ++ Sbjct: 258 DDAGNVVVTRYAPELLEQAAKSANGTFIAAEASD--KA-SRIRAALRSLRT 305 >gi|164687487|ref|ZP_02211515.1| hypothetical protein CLOBAR_01128 [Clostridium bartlettii DSM 16795] gi|164603261|gb|EDQ96726.1| hypothetical protein CLOBAR_01128 [Clostridium bartlettii DSM 16795] Length = 273 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 30/190 (15%), Positives = 64/190 (33%), Gaps = 16/190 (8%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 KI L + L+ ++ +++ + ++ G P+S + + + Sbjct: 50 KIGTLNTTMEELLPELR--GLGGATTDIKLAVMTFSSGCEWITKEPMSVDDYQ---YWTR 104 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L T+ A +L S + +TDG + L T Sbjct: 105 LKAEGLTDLGEAFTELSNKLS---RKEFLNAPSLSYAPVIFLLTDGYATDD--ALEGLKT 159 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT---D 385 LQ G+K+ ++ + ++LL+K T + S +L + I Sbjct: 160 LQH-NNWYKYGLKV-ALGLG-EKFDEELLKKFTGNPELVVTAKTSDQLSKLVKTIAVTSS 216 Query: 386 KIQEQSVRIA 395 +I +S+ + Sbjct: 217 QIGSRSMTLT 226 >gi|301778383|ref|XP_002924606.1| PREDICTED: integrin alpha-10-like [Ailuropoda melanoleuca] Length = 1175 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 62/205 (30%), Gaps = 25/205 (12%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + + + + +++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGRLFIDPEQ--IQVGLVQYGESPVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 +EV L+ E T T A+ A E ++ + ++ +T Sbjct: 221 DFRTKDEVVRAARNLSRREGRETKTAQAILVACTEGFSLSRGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 276 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPTSFLREIRTIASDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKI 387 FF V D L + D + D+I Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRI 354 >gi|281353330|gb|EFB28914.1| hypothetical protein PANDA_013982 [Ailuropoda melanoleuca] Length = 1128 Score = 41.4 bits (95), Expect = 0.27, Method: Composition-based stats. Identities = 36/205 (17%), Positives = 62/205 (30%), Gaps = 25/205 (12%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + + + + +++G + Y V Sbjct: 146 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGRLFIDPEQ--IQVGLVQYGESPVHEWSLG 203 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 +EV L+ E T T A+ A E ++ + ++ +T Sbjct: 204 DFRTKDEVVRAARNLSRREGRETKTAQAILVACTEGFSLSRGGRPEA-----ARLLVVVT 258 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R Sbjct: 259 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPTSFLREIRTIASDP 312 Query: 365 G--QFFAVNDSRELLESFDKITDKI 387 FF V D L + D + D+I Sbjct: 313 DERFFFNVTDEAALTDIVDALGDRI 337 >gi|227818616|ref|YP_002822587.1| type IV pilus associated TadE family protein [Sinorhizobium fredii NGR234] gi|36958872|gb|AAQ87297.1| Hypothetical protein RNGR00523 [Sinorhizobium fredii NGR234] gi|227337615|gb|ACP21834.1| type IV pilus associated TadE family protein [Sinorhizobium fredii NGR234] Length = 140 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 14/92 (15%), Positives = 23/92 (25%), Gaps = 2/92 (2%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSG--CASIVSDRTIKDPTTKKDQ 59 A++ L + I+ Y+RN M A D A + Sbjct: 22 FALVCFPLLLLVLGVIEFGRAFYVRNDMSYAADVAAREVLIGKIARDAPDSEAQAKLASA 81 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQI 91 F L+ + G + I Sbjct: 82 VRDSFDSGDPARLEIAVTKQTVDGIDFRVLSI 113 >gi|27262847|emb|CAD59475.1| leukocyte integrin alpha-M chain [Bos taurus] Length = 79 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 9/65 (13%), Positives = 19/65 (29%), Gaps = 5/65 (7%) Query: 328 TLQICEYMRNAGMKIYSVAVSA---PPEGQDLLRKCTDSS--GQFFAVNDSRELLESFDK 382 + + Y + V + + L F VN+ L ++ Sbjct: 1 YEDVIPEADRKKIIRYVIGVGDAFRSRKSRQELDTIASKPPADHVFQVNNFEALKTIQNQ 60 Query: 383 ITDKI 387 + +KI Sbjct: 61 LQEKI 65 >gi|3273263|dbj|BAA31175.1| thrombospondin-related protein [Plasmodium falciparum] Length = 565 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVNHAVPLAMKLIQQLNLNESAIHLYVNVFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIRSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|149913213|ref|ZP_01901747.1| hypothetical protein RAZWK3B_04455 [Roseobacter sp. AzwK-3b] gi|149813619|gb|EDM73445.1| hypothetical protein RAZWK3B_04455 [Roseobacter sp. AzwK-3b] Length = 512 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 10/133 (7%), Positives = 32/133 (24%), Gaps = 6/133 (4%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 + + L A+D ++ + ++ D A +G + S + + Sbjct: 29 FAVVMILLLGGVALDATNLWRYQQMLKQTADVAAHAGTVQLASGGDATNAYNA---AFAL 85 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYE---IPTENLFL 120 + + + + A + + + + +L Sbjct: 86 VEANMPQSWYGNLFANPQADIEVVHYDPDTDLLGGSGPLNAVAVTLRRDGDSGNPVPTYL 145 Query: 121 KGLIPSALTNLSL 133 + S Sbjct: 146 LRIADIMNMGDST 158 >gi|145482427|ref|XP_001427236.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124394316|emb|CAK59838.1| unnamed protein product [Paramecium tetraurelia] Length = 1189 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 33/138 (23%), Positives = 61/138 (44%), Gaps = 16/138 (11%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 K +NK P TN Y A+ +A +L K++ + +I +TDG+++ Sbjct: 810 FKDLVNKAQPTGRTNLYRALKYAENQLLKFKQTYPKCLL------RIIALTDGQDNDNH- 862 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESF 380 L+ +++ E + + + S VS L+K T + GQ F+ +E L+ F Sbjct: 863 ---PLDPIKVAESILKNEILLDSFVVSDDCTD---LKKITKATGGQCFSPQTIQEGLKLF 916 Query: 381 DKITDKIQEQSVRIAPNR 398 + + I S+R N+ Sbjct: 917 E--FETILSASIRQKQNK 932 >gi|87122726|ref|ZP_01078601.1| hypothetical protein MED121_00165 [Marinomonas sp. MED121] gi|86162023|gb|EAQ63313.1| hypothetical protein MED121_00165 [Marinomonas sp. MED121] Length = 624 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 17/141 (12%), Positives = 49/141 (34%), Gaps = 13/141 (9%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 ++ + +K+ L + + + + N + + + I +TDG Sbjct: 81 TDLKSSLKNNLGGYVTE---DVQTDLESVLKLMLNTPPAPGIPNDAE---QHWILVTDGM 134 Query: 316 NSGA-----SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFA 369 + + I + + G+ +++V+ +LL+ + + + Sbjct: 135 VDISLDQEVNDLSRKAILTPILDELIERGIHLHTVS-MTGYTDAELLKTLSIKTDASYTE 193 Query: 370 VNDSRELLESFDKITDKIQEQ 390 V +LL +F++I + Sbjct: 194 VAVPEDLLSTFNRIFTSANKT 214 >gi|53713709|ref|YP_099701.1| hypothetical protein BF2418 [Bacteroides fragilis YCH46] gi|253565657|ref|ZP_04843112.1| BatB [Bacteroides sp. 3_2_5] gi|265764033|ref|ZP_06092601.1| BatB [Bacteroides sp. 2_1_16] gi|4838139|gb|AAD30859.1|AF116251_2 BatB [Bacteroides fragilis] gi|52216574|dbj|BAD49167.1| conserved hypothetical protein BatB [Bacteroides fragilis YCH46] gi|251945936|gb|EES86343.1| BatB [Bacteroides sp. 3_2_5] gi|263256641|gb|EEZ27987.1| BatB [Bacteroides sp. 2_1_16] gi|301163418|emb|CBW22969.1| aerotolerance-related membrane protein [Bacteroides fragilis 638R] Length = 341 Score = 41.4 bits (95), Expect = 0.28, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 55/168 (32%), Gaps = 49/168 (29%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L ++P + T A++ A R + + + Sbjct: 142 TQLPITSDYISAKMFLESISPSLISKQGTAIGAAINLAARSFTPQ----------EGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 ++ ITDGEN A + + G+++ + V P Sbjct: 192 AIVVITDGENHEGGAVEAAKEAAK-------KGIQVNVLGVGLPDGAPIPIEGSNDFRRD 244 Query: 353 ----------GQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 + + ++ + +G + V++S + I +I + Sbjct: 245 REGNVIVTRLNEAMCQEIAKEGNGIYVRVDNS---NSAQKAINQEINK 289 >gi|302669471|ref|YP_003829431.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] gi|302393944|gb|ADL32849.1| von Willebrand factor type A domain-containing protein [Butyrivibrio proteoclasticus B316] Length = 561 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 34/343 (9%), Positives = 88/343 (25%), Gaps = 30/343 (8%) Query: 46 SDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA 105 + + + G I T L+ Sbjct: 243 DNENPLSDNAVDQLVQYQNNAPTAAYTTSILKESASKGIIDAMVMEEQTYINTPELKNYV 302 Query: 106 ESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR-SMEDLY 164 + A L + + S+++A L S + Sbjct: 303 YTPAGVRHDHPVYAFPWTDEDELEAAKMFVDFCLTSDSQSMATERGFNLHEDYVSEDYGM 362 Query: 165 LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSI 224 + P + + + + S +I L S + + I Sbjct: 363 TGSDFLSAQKVWKTNKNGTRPTIAVFVTDISGSMNG-------TRIKSLKNSLLSTMQYI 415 Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSRLNKLNPYENTNTYPAMH 282 + IG ++Y+ + N +N + L+ T TY A+ Sbjct: 416 DSSSY--------IGLVSYSDKVYINLPIAQFDNKQRAYFSGAVKDLDVGGQTATYDAVL 467 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKI 342 + L + + + + ++DG + +I + G+ + Sbjct: 468 VGMQMLMEKSKEVPDANM------MLFVLSDGAQNAGF------ELKRITPIVGGLGISV 515 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 Y++ ++ L+ ++ + D+ +++ + + Sbjct: 516 YTIGYEMTDSDKEDLKALSEINEAVCIDADTEDIVNELRNLFN 558 >gi|257892784|ref|ZP_05672437.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,408] gi|257829163|gb|EEV55770.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium 1,231,408] Length = 677 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 20/82 (24%), Positives = 35/82 (42%), Gaps = 1/82 (1%) Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV- 370 TD S+ TL ++NAG+K YSVA + G+ + R S + ++ Sbjct: 271 TDPNAVSVSSSLINDATLGTIISIKNAGIKCYSVATAPSSRGEYIGRNLASSPNNYLSID 330 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + L + +I + I + V Sbjct: 331 ENLTGLGNALKEIANGIDKTIV 352 >gi|168179225|ref|ZP_02613889.1| conserved hypothetical protein [Clostridium botulinum NCTC 2916] gi|168184460|ref|ZP_02619124.1| conserved hypothetical protein [Clostridium botulinum Bf] gi|226950226|ref|YP_002805317.1| hypothetical protein CLM_3180 [Clostridium botulinum A2 str. Kyoto] gi|182669694|gb|EDT81670.1| conserved hypothetical protein [Clostridium botulinum NCTC 2916] gi|182672439|gb|EDT84400.1| conserved hypothetical protein [Clostridium botulinum Bf] gi|226844328|gb|ACO86994.1| conserved hypothetical protein [Clostridium botulinum A2 str. Kyoto] Length = 283 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 56/171 (32%), Gaps = 13/171 (7%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + +++ I K + + + + NN + ++ L +N Sbjct: 63 IYNSDIQDMMEQIYPLDTTGKLMDKDFDPGRFRVYPLLEDVYG--NNSSTIQKNLKNINT 120 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A L N + H S I+ +G + L + Sbjct: 121 SYGTVQFNNNSKAAESLKNVLDELHGISKSNGKLNSYIYPLNGTFNYRHIAGTNLLSPHA 180 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + +A+ + +D + T+S GQ + +E++E+F+K Sbjct: 181 -----------FGIAIDLVRDNRDYWKWATESQGQERIASYPKEIVETFEK 220 >gi|156340477|ref|XP_001620458.1| hypothetical protein NEMVEDRAFT_v1g223090 [Nematostella vectensis] gi|156205409|gb|EDO28358.1| predicted protein [Nematostella vectensis] Length = 396 Score = 41.0 bits (94), Expect = 0.29, Method: Composition-based stats. Identities = 12/71 (16%), Positives = 28/71 (39%), Gaps = 1/71 (1%) Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 T + ++ G++I VA+ P ++LL + + V +L ++ Sbjct: 284 YPPKTQGSTIASLPIKEKGVQIIVVAIGRSPSDRELL-PLATTKENLYRVATFEQLSKTV 342 Query: 381 DKITDKIQEQS 391 ++ I +Q Sbjct: 343 QPLSIHICKQI 353 >gi|332799397|ref|YP_004460896.1| von Willebrand factor type A [Tepidanaerobacter sp. Re1] gi|332697132|gb|AEE91589.1| von Willebrand factor type A [Tepidanaerobacter sp. Re1] Length = 547 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 13/105 (12%), Positives = 36/105 (34%), Gaps = 11/105 (10%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 N C P + N + ++ +NK+ T A+ + + + + Sbjct: 419 VNVCVPFTRNYDIMEEGINKIVATGLTPLALALDKGICHINKKNLKNP----------LI 468 Query: 309 IFITDGENSGASAYQNTLN-TLQICEYMRNAGMKIYSVAVSAPPE 352 + ITDG + + + +N + + + + + + + Sbjct: 469 MLITDGIPTVSLWTSDPINDAVSAADKIAKNKINFCCIGLQPNKD 513 >gi|239939820|ref|ZP_04691757.1| hypothetical protein SrosN15_02385 [Streptomyces roseosporus NRRL 15998] gi|239986306|ref|ZP_04706970.1| hypothetical protein SrosN1_03262 [Streptomyces roseosporus NRRL 11379] Length = 592 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 26/194 (13%), Positives = 60/194 (30%), Gaps = 20/194 (10%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQ------EKKNLSVRIGTIAYNIGIVGNQCTPLS 256 P ++DV ES ++ + + +G+ Sbjct: 398 PGRNQSRMDVTKESLIQALDQFTPNDEIGLWEFATTLDGEKDYRRLMETKRLGDPAAGGG 457 Query: 257 NNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 + ++ + L P T Y L + KE+ + ++ +TDG Sbjct: 458 THREKLTAAFAGLQPVPGGATGLYD------TTLASYKEARSTYVKGKF--NALVILTDG 509 Query: 315 ENSGASAYQNTLNTLQICEYMR-NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 N + + ++ E + + + ++AV + ++ + G + V+D Sbjct: 510 SNQDTNGISRSGLITELKELVDPERPVPVIAIAVGPDADRDEVAEIARITGGDGYEVSDP 569 Query: 374 RELLESFDKITDKI 387 E+ I I Sbjct: 570 AEIQAV---ILQAI 580 >gi|226307531|ref|YP_002767491.1| hypothetical protein RER_40440 [Rhodococcus erythropolis PR4] gi|226186648|dbj|BAH34752.1| hypothetical protein RER_40440 [Rhodococcus erythropolis PR4] Length = 233 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 22/160 (13%), Positives = 46/160 (28%), Gaps = 10/160 (6%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 R+ ++++ + + + + L T+ P + + Sbjct: 48 RVCVLSFSDEARIDVPMCDLADDTRITRE-DFLQVRGGTSFAPIFDLIGERIAADIADLK 106 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + V F+TDG + A N+ T + +++ Sbjct: 107 GHGEGKVFRPTVFFVTDGVPTDAVHEWNSAFTRLTSVKAYPNLVP-----FGLGDADEEV 161 Query: 357 LRKCT----DSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 LR T G FF N ++ IT + + V Sbjct: 162 LRAITFPPYRQDGYFFMANAGTSAEQAMQAITRIVTQSVV 201 >gi|260789950|ref|XP_002590007.1| hypothetical protein BRAFLDRAFT_224774 [Branchiostoma floridae] gi|229275194|gb|EEN46018.1| hypothetical protein BRAFLDRAFT_224774 [Branchiostoma floridae] Length = 142 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 22/145 (15%), Positives = 46/145 (31%), Gaps = 21/145 (14%) Query: 207 NRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN--EVKS 264 + ++VN + R+G + Y+ L+ N E+ + Sbjct: 15 TSGFEDAKTFIQSVVNYFTLGEND-----TRVGVVTYSNADAQITRVKLNENYTRVELLT 69 Query: 265 RLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + L +T T + H + E N +I +TD + Sbjct: 70 EIRNLPYDRGHTFTGLGLDHVRNNSFLEVNGRRN-----NTPDVLIVLTD--------DE 116 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVS 348 + + + + R G+K++ V V Sbjct: 117 SEDDVILPAQLTRQMGIKVFVVGVG 141 >gi|34481892|emb|CAE46494.1| trap [Plasmodium falciparum] Length = 331 Score = 41.0 bits (94), Expect = 0.30, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVKHAVPLAMKLIQQLNLNENAIHLYVNIFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LFIIKSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENASQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|257895102|ref|ZP_05674755.1| von Willebrand factor domain-containing protein [Enterococcus faecium Com12] gi|257831667|gb|EEV58088.1| von Willebrand factor domain-containing protein [Enterococcus faecium Com12] Length = 1341 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 36/362 (9%), Positives = 96/362 (26%), Gaps = 50/362 (13%) Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN-NPLQYIAESKAQYEIPTENLFLKG 122 +L ++ I ++ + N L+ +SK I N + Sbjct: 202 ATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIEAVNEISEN 261 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 L+ ++ + +S + + D+ D+ ++N + Sbjct: 262 LLSDPNMDIRI-GMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGT 320 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 P NR + ++ G+ + A E+ N Sbjct: 321 P----LTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIERSNRPDFTNWAV 376 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA------YRELYNEKESSH 296 N I + L N NT+ + + + + ++ Sbjct: 377 MNNMIARDT-GDLFKNFET---------YSGNTS-GAGFSYPVVYPSEFNRPEDTWDYNY 425 Query: 297 NTIG-STRLKKFVIFITDGENSGASAYQNTLNTLQIC-----------EYMRNAGMKIYS 344 K ++ G S + + + I+S Sbjct: 426 RYGEVKEGDDKAFHWVGTGAASNGTTGEPDTQEKSSAINTVAYHHWLKNKYQENPPSIFS 485 Query: 345 VAVSAPPE----------GQDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQE 389 + + G+++L+ D + +++ N+ +++ + + I+ ++ Sbjct: 486 IGLGIDGSVAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDISSTFKK 545 Query: 390 QS 391 Sbjct: 546 TI 547 >gi|237796252|ref|YP_002863804.1| hypothetical protein CLJ_B3044 [Clostridium botulinum Ba4 str. 657] gi|229263519|gb|ACQ54552.1| conserved hypothetical protein [Clostridium botulinum Ba4 str. 657] Length = 283 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 56/171 (32%), Gaps = 13/171 (7%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + +++ I K + + + + NN + ++ L +N Sbjct: 63 IYNSDIQDMMEQIYPLDTTGKLMDKDFDPGRFRVYPLLEDVYG--NNSSTIQKNLKNINT 120 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A L N + H S I+ +G + L + Sbjct: 121 SYGTVQFNNNSKAAESLKNVLDELHGISKSNGKLNSYIYPLNGTFNYRHIAGTNLLSPHA 180 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + +A+ + +D + T+S GQ + +E++E+F+K Sbjct: 181 -----------FGIAIDLVRDNRDYWKWATESQGQERIASYPKEIVETFEK 220 >gi|332798629|ref|YP_004460128.1| hypothetical protein TepRe1_0633 [Tepidanaerobacter sp. Re1] gi|332696364|gb|AEE90821.1| hypothetical protein TepRe1_0633 [Tepidanaerobacter sp. Re1] Length = 185 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 25/163 (15%), Positives = 45/163 (27%), Gaps = 16/163 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A + + FI +D++ I RN MQ+ L ++ T +D Sbjct: 19 IFAGSMVLIAFFIGICLDVSMIYVKRNSMQNIL-------------QIIREERFTYQDTI 65 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + S EN D + + N Y E P + Sbjct: 66 RYSDNPALTTYHIAYSAAAENGFDGIVTVYFHEEDPEPNYRSYQVRILLTDECPF---YF 122 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 + L++ G + + L VS Sbjct: 123 GRIFGLDTVPLNVSLDGGESYGEGSADVIWHSPLPVSNYNGAY 165 >gi|325067680|ref|ZP_08126353.1| hypothetical protein AoriK_07659 [Actinomyces oris K20] Length = 928 Score = 41.0 bits (94), Expect = 0.31, Method: Composition-based stats. Identities = 14/91 (15%), Positives = 33/91 (36%), Gaps = 22/91 (24%) Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC-------------------TDSSGQF 367 + + +R++ + + + +S+ D +++ +D G F Sbjct: 217 RPTGLADQLRSSHITLLGIGLSSGDSNFDFMKRITLGGGANAAKNSVESCGNVSDPPGGF 276 Query: 368 FAVNDSRELLESFDKI---TDKIQEQSVRIA 395 + V D LL +FD I + + +I Sbjct: 277 YPVTDIDSLLMAFDSISAPGRSVSSSTTKIC 307 >gi|298709908|emb|CBJ31633.1| conserved unknown protein [Ectocarpus siliculosus] Length = 304 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 24/185 (12%), Positives = 52/185 (28%), Gaps = 37/185 (20%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 ++D+ E+A ++N++ + + T +NL Sbjct: 122 VSGSMSQYGRLDLAKEAAETVINTLGADSFVNVVTFSETARVLLTNSTTLVRATE--DNL 179 Query: 260 NEVKSRLNKLN---PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 E+ S + L TN A + L + S + ++F+TDG Sbjct: 180 GELVSLVQNLEFDLANVGTNFGAAFETTFDILEASRTSEETSSNCQT---AIVFLTDGNT 236 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + + C+ + G + V D +L Sbjct: 237 NVGLSTDEVTSKQIACD-----------------------------TGGIYEHVEDGGDL 267 Query: 377 LESFD 381 ++ Sbjct: 268 SQAMA 272 >gi|242034177|ref|XP_002464483.1| hypothetical protein SORBIDRAFT_01g019250 [Sorghum bicolor] gi|241918337|gb|EER91481.1| hypothetical protein SORBIDRAFT_01g019250 [Sorghum bicolor] Length = 222 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 12/75 (16%), Positives = 26/75 (34%), Gaps = 13/75 (17%) Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS--SGQF 367 ++DG+ + ++ + +Y+ A +L + G F Sbjct: 2 LMSDGQQNHGGN----------AADVKISNAPVYTFGFGAD-YDPTVLTAVVGNSMGGTF 50 Query: 368 FAVNDSRELLESFDK 382 VND +L +F + Sbjct: 51 SVVNDVDKLTMAFSQ 65 >gi|332197210|gb|AEE35331.1| uncharacterized protein [Arabidopsis thaliana] Length = 756 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 17/137 (12%), Positives = 37/137 (27%), Gaps = 33/137 (24%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + L TN + A + L V +TDG + Sbjct: 395 SNLIANGGTNMLLPLKQAMKLLEGSNIGVP----------LVYLVTDG---------SVE 435 Query: 327 NTLQICEYMRN------AGM--KIYSVAVSA--PPEGQDLLRKCTDSSGQFFA-VNDSRE 375 N +IC M+ + +I + + + L+ ++ N++ Sbjct: 436 NEREICHAMKESCSRNGKSISPRISTFGIGSFCNHY---FLQMLARIGNGYYDGTNNTDS 492 Query: 376 LLESFDKITDKIQEQSV 392 ++ + V Sbjct: 493 FEHQMSRLFEIASSTIV 509 >gi|33333552|gb|AAQ11892.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 56/170 (32%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNENAIHLYANVFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|15218501|ref|NP_177394.1| inter-alpha-trypsin inhibitor heavy chain-related [Arabidopsis thaliana] gi|12325279|gb|AAG52586.1|AC016529_17 hypothetical protein; 14673-17893 [Arabidopsis thaliana] Length = 758 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 17/137 (12%), Positives = 37/137 (27%), Gaps = 33/137 (24%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + L TN + A + L V +TDG + Sbjct: 397 SNLIANGGTNMLLPLKQAMKLLEGSNIGVP----------LVYLVTDG---------SVE 437 Query: 327 NTLQICEYMRN------AGM--KIYSVAVSA--PPEGQDLLRKCTDSSGQFFA-VNDSRE 375 N +IC M+ + +I + + + L+ ++ N++ Sbjct: 438 NEREICHAMKESCSRNGKSISPRISTFGIGSFCNHY---FLQMLARIGNGYYDGTNNTDS 494 Query: 376 LLESFDKITDKIQEQSV 392 ++ + V Sbjct: 495 FEHQMSRLFEIASSTIV 511 >gi|127512111|ref|YP_001093308.1| type IV pilin biogenesis protein, putative [Shewanella loihica PV-4] gi|126637406|gb|ABO23049.1| type IV pilin biogenesis protein, putative [Shewanella loihica PV-4] Length = 1204 Score = 41.0 bits (94), Expect = 0.32, Method: Composition-based stats. Identities = 10/53 (18%), Positives = 24/53 (45%), Gaps = 2/53 (3%) Query: 337 NAGMKIYSVAVSAPP-EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKI 387 + + +++ S +DLL+K G +F D+ +L S + +++ Sbjct: 458 DQHVITHTIGFSEGAKNAEDLLKKVASKGGGSYFDATDATKLQGSIQQAVNEV 510 >gi|301622626|ref|XP_002940637.1| PREDICTED: integrin alpha-L-like [Xenopus (Silurana) tropicalis] Length = 1031 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 25/121 (20%), Positives = 47/121 (38%), Gaps = 17/121 (14%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TNTY A+ + ++ EK S + KK ++ + DGE++ +T + Sbjct: 236 GFTNTYKAIQYTLDRIFTEKYGSRPSA-----KKVLVILADGESTDD-------DTTKAI 283 Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN----DSRELLESFDKITDKIQ 388 E A + Y + V + +D L + D+++L F ++ KI Sbjct: 284 EKADKARVSRYIIGVGQNFKTED-LEAFVSWPAKEHTRTIEKFDAQQLTILFAELQRKIL 342 Query: 389 E 389 Sbjct: 343 S 343 >gi|168701952|ref|ZP_02734229.1| hypothetical protein GobsU_20668 [Gemmata obscuriglobus UQM 2246] Length = 864 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 12/106 (11%), Positives = 36/106 (33%), Gaps = 13/106 (12%) Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + + + + T + A ++ + ++ + TDG + Sbjct: 147 SRYQKDIKSIQTSCATCISQGLKLASEKVMAGELTA------------ITIHTDGYANDP 194 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 S+ L++C+ M+ + + ++A LL + ++ Sbjct: 195 SSTSEAATLLKLCDDMKGKDVFVNTLAYG-DYTDFRLLSRIANAGS 239 >gi|311254856|ref|XP_003125982.1| PREDICTED: calcium-activated chloride channel regulator 4-like [Sus scrofa] Length = 825 Score = 41.0 bits (94), Expect = 0.33, Method: Composition-based stats. Identities = 26/213 (12%), Positives = 55/213 (25%), Gaps = 37/213 (17%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 S S A A + D L+ S+ + N +V+ + Sbjct: 181 EATSSSSWCPVLEFSGVAAAYMYPEHDTKAPPLSRLIPSLNHNRLNRMNQAVKYFLLQTI 240 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLN---------------PYENTNTYPAMHHAYRELY 289 + +KS+L ++ + + A++ + Sbjct: 241 ENGSWVGVVDF-DTTAHIKSKLIQIKSNNERRKLLESLPTEASGGISICSGIESAFQVI- 298 Query: 290 NEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA 349 K + DG N + ++ +G I+S+A+ Sbjct: 299 ----------------KEIYPQVDGSEIILVVAGEDKNIRNCMDRVKQSGAIIHSIALGP 342 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRE---LLES 379 + + G F D E L ++ Sbjct: 343 NA-DPAVTEMSAVTGGMHFYTTDQSESRGLTDA 374 >gi|328772234|gb|EGF82273.1| hypothetical protein BATDEDRAFT_31537 [Batrachochytrium dendrobatidis JAM81] Length = 343 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 58/169 (34%), Gaps = 19/169 (11%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S++ +I+ ++A L N+ + E + + G N Sbjct: 7 VLVIDNSEWMRNGDYTPTRIEAQQDAAILLFNAKTQTNPENSVGLMTM------AGKNPN 60 Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 L+ ++ +V + L+ + + A L + + S H+ ++ ++F Sbjct: 61 VLVTLTKDIGKVLTALHNVQLSGGVKVNIGVQIAQLILKHREHSHHH-------QRIIVF 113 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + + ++ + + + + ++ + I V+ E Q L Sbjct: 114 VG------SPINEDEASLVTLGKKLKKNNIAIDVVSFGEDAENQTKLEA 156 >gi|58429467|gb|AAW78137.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 51/162 (31%), Gaps = 23/162 (14%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--------VKSRL 266 + V K IQ+ I + L ++ ++ +KS L Sbjct: 63 NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALIIIKSLL 122 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + PY TN A+ + L + + V+ +TDG + Sbjct: 123 STNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD------SIQ 169 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 ++L+ + + G+KI + L C S G+ Sbjct: 170 DSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429537|gb|AAW78172.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 545 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 51/162 (31%), Gaps = 23/162 (14%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--------VKSRL 266 + V K IQ+ I + L ++ ++ +KS L Sbjct: 63 NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALIIIKSLL 122 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + PY TN A+ + L + + V+ +TDG + Sbjct: 123 STNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD------SIQ 169 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 ++L+ + + G+KI + L C S G+ Sbjct: 170 DSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429535|gb|AAW78171.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 551 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 51/162 (31%), Gaps = 23/162 (14%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--------VKSRL 266 + V K IQ+ I + L ++ ++ +KS L Sbjct: 63 NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALIIIKSLL 122 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + PY TN A+ + L + + V+ +TDG + Sbjct: 123 STNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD------SIQ 169 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 ++L+ + + G+KI + L C S G+ Sbjct: 170 DSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429461|gb|AAW78134.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 551 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 27/162 (16%), Positives = 51/162 (31%), Gaps = 23/162 (14%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--------VKSRL 266 + V K IQ+ I + L ++ ++ +KS L Sbjct: 63 NWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAREIIRLHSDASKNKEKALIIIKSLL 122 Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 + PY TN A+ + L + + V+ +TDG + Sbjct: 123 STNLPYGRTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD------SIQ 169 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 ++L+ + + G+KI + L C S G+ Sbjct: 170 DSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|114567581|ref|YP_754735.1| hypothetical protein Swol_2070 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] gi|114338516|gb|ABI69364.1| hypothetical protein Swol_2070 [Syntrophomonas wolfei subsp. wolfei str. Goettingen] Length = 776 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 26/144 (18%), Positives = 49/144 (34%), Gaps = 24/144 (16%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 NL++ + + L+ TN PA+ A +E ++ +K VI TDG+ Sbjct: 337 NLDKASAWVKNLHAMGGTNILPAVQLALKEAGDQ-------------QKVVILATDGQVG 383 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + N + RN + ++S+ + + + +G L Sbjct: 384 NENEIINYVRK-------RNQNLCLFSLGIDTAVNSYFINQIAEAGNGCAEFSYPGESLE 436 Query: 378 ES----FDKITDKIQEQSVRIAPN 397 E F +I + PN Sbjct: 437 EKMLRHFARINATSMDNVTFSLPN 460 >gi|269468735|gb|EEZ80354.1| hypothetical protein Sup05_1046 [uncultured SUP05 cluster bacterium] Length = 682 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 19/181 (10%), Positives = 50/181 (27%), Gaps = 27/181 (14%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + +N ++ + + ++ + N L Sbjct: 352 ATNALIQAINRLKPTDRFNIIDFDSDFEVLFDT-----AIPAIDMNKRHGIRFAKHLVAS 406 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T A+ A S + + VIF+TDG+ + + Sbjct: 407 GGTEPLEAIKFAL---------LSKDEDSDKYLRQVIFLTDGQ----------VGNEKEL 447 Query: 333 EYMRNAGM---KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + + +++ + + P + + G F + D E+ ++ K++ Sbjct: 448 FRAVQQNIDDDRFFTIGIGSAPNDYLMTKMAEYGKGAFTYIGDIDEVEVKMGELFSKLES 507 Query: 390 Q 390 Sbjct: 508 P 508 >gi|260826343|ref|XP_002608125.1| hypothetical protein BRAFLDRAFT_91397 [Branchiostoma floridae] gi|229293475|gb|EEN64135.1| hypothetical protein BRAFLDRAFT_91397 [Branchiostoma floridae] Length = 1803 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 25/191 (13%), Positives = 61/191 (31%), Gaps = 32/191 (16%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT--PLSNN 258 + + D +I+ ++N + R+ +++ SN+ Sbjct: 53 RSGSVGRSNYDKIIDFVKAVLNHFSVSPTT-----TRVAVVSFGTSARVEFDLLRSSSND 107 Query: 259 LNE---VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG---STRLKKFVIFIT 312 N+ +++ L KL+ A + + + + K V IT Sbjct: 108 NNKCELLRTHLPKLSYTGG---------ATNTVGALRLALALLKNPGVRSYSTKVVFTIT 158 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN 371 DG + + + +++ G+ +++ + + L + SG++ V Sbjct: 159 DGYWNRGG------DPAYVVRELQSRGVIMFAFGIGSWGISSYRLSALGNRDSGRYKYVY 212 Query: 372 ---DSRELLES 379 D L E Sbjct: 213 LCLDFTVLSEI 223 >gi|47227140|emb|CAG00502.1| unnamed protein product [Tetraodon nigroviridis] Length = 762 Score = 41.0 bits (94), Expect = 0.34, Method: Composition-based stats. Identities = 40/236 (16%), Positives = 75/236 (31%), Gaps = 16/236 (6%) Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 + T +K SKN T + Y + + D + + ++ I Sbjct: 226 GSAIKESLTTLESINDVQGERKIRISKNGTLNIYIAVDISESIQKDHVESAKKAILKLIT 285 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMH 282 K + + + + + V N N ++K RL K T+ A Sbjct: 286 KISSFSVSPNYELLFFSSELSEVVNILDFFENQPVDIKGRLTKFKVNEEHTGTDLNLAFK 345 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN-----------TLQI 331 + K+ + +I TDG + + T+ + Sbjct: 346 TILVRMALIKQ-RVGEKAFEEHRHAIIVFTDGVYNMGGSPLPTVAKIKHMVYMNKIDEET 404 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-QFFAVNDSRELLESFDKITDK 386 + R+ + IY + A DL + G FF + + + L E+FD I D+ Sbjct: 405 GQNPRDEYLDIYIFGIGAEIYESDLRPLTAGTGGEHFFKLMEIQNLQETFDNIIDE 460 >gi|260463531|ref|ZP_05811730.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] gi|259030622|gb|EEW31899.1| von Willebrand factor type A [Mesorhizobium opportunistum WSM2075] Length = 552 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 18/131 (13%), Positives = 44/131 (33%), Gaps = 20/131 (15%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 P + + + + + L T A+ A L ++ + V+ I Sbjct: 87 VPPQAGSASAITDAADSLKFLGKTPLTAAVKQAAEALKYTEDKAT-----------VVLI 135 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYS--VAVSAPPEGQDLLRKCTD-SSGQFF 368 TDG + + + + + +G+ + V + + D + G++ Sbjct: 136 TDGLETCGG------DPCALGKELEASGVDFTADVVGFGLTADEGKQIACLADNTGGKYI 189 Query: 369 AVNDSRELLES 379 +D + L E+ Sbjct: 190 QASDEKALQEA 200 >gi|189190514|ref|XP_001931596.1| ubiquitin-conjugating enzyme E2E 3 [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187973202|gb|EDU40701.1| ubiquitin-conjugating enzyme E2E 3 [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 1331 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 29/183 (15%), Positives = 63/183 (34%), Gaps = 21/183 (11%) Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 ++DVL + +N + + IG + +N Q ++N Sbjct: 1059 STPSNETNHLTRLDVLKQMFDAYINRVLAYSFQP-----HIGLVTFNTKTQVAQ--KITN 1111 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + + +LN L Y +T + ++ A ++ + N +I I+DGE++ Sbjct: 1112 AVENSRHKLNNLAAYGDTAIWDSVALAQDQIQQHAKQYPNAKL------RIICISDGEDN 1165 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSREL 376 + + + + G+ + S G L+ + + G FA E Sbjct: 1166 -----TSLNTVEDVAKRLTRCGIVVDS--FCLGNTGNLRLQTLSSLTEGYVFAPKTLDEA 1218 Query: 377 LES 379 + Sbjct: 1219 MAI 1221 >gi|88800837|ref|ZP_01116392.1| hypothetical protein MED297_17512 [Reinekea sp. MED297] gi|88776410|gb|EAR07630.1| hypothetical protein MED297_17512 [Reinekea sp. MED297] Length = 716 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 23/128 (17%), Positives = 47/128 (36%), Gaps = 19/128 (14%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ---- 330 TN A+ A + + V+ ITDG+ A Sbjct: 90 TNLGRALDEAAYDFAYSTYTGPT---------HVVLITDGQVDIAPNADVNQVERGRILS 140 Query: 331 -ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDK-- 386 + +A +I+++A+ + LLR+ ++ + GQ+ N +LL + ++ Sbjct: 141 QLVPRYNSANARIHTIAL-SDDADHALLRQLSEQTGGQYLRANQGADLLPLLTSLGNEVA 199 Query: 387 -IQEQSVR 393 + + VR Sbjct: 200 PVSQLRVR 207 >gi|221113031|ref|XP_002168311.1| PREDICTED: similar to predicted protein [Hydra magnipapillata] Length = 928 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 15/147 (10%), Positives = 45/147 (30%), Gaps = 19/147 (12%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA--MHHAYREL 288 G + Y+ L+ +++ V + T+ + + + Sbjct: 220 PSMQGYHYGAVVYSDTAELKFNFDLAYDVSGVLEGFEDIKRTF-TSFKEGSRVDLGLQAI 278 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 + S + + +I +TD ++T I + +R++ + I+ + ++ Sbjct: 279 KELFDKSGR----DWVSQVLIVLTD--------SKSTEGVEVIAQSLRDSTVSIFVIGLA 326 Query: 349 --APPEGQDLLRKCTDSS--GQFFAVN 371 + K F ++ Sbjct: 327 NKDSHSDIGQMEKIASQPTQHHVFQID 353 >gi|224090451|ref|XP_002195054.1| PREDICTED: similar to Integrin alpha-2 [Taeniopygia guttata] Length = 1178 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 34/227 (14%), Positives = 67/227 (29%), Gaps = 32/227 (14%) Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 + P K S + + P + L + L + Sbjct: 152 EILRSFSPAVQKCSSVIDVVVVCDESNSIYPWDAVRAFLKKFVQGL---------DIGLN 202 Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEK 292 ++G I Y +EV + K TNT+ A+ +A + ++ + Sbjct: 203 KTQVGLIQYANDPRVVFNLNTYQTKDEVVKAMEKTFQKGGDLTNTFKAIDNARQYAFSAE 262 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV----- 347 T K ++ +TDGE+ S N + + + +AV Sbjct: 263 SGGRPTAT-----KVMVVVTDGESHDGS------NLKTVIGKCNEDNITRFGIAVLGYLI 311 Query: 348 --SAPPEG-QDLLRKCTD--SSGQFFAVNDSRELLESFDKITDKIQE 389 + ++ + FF V+ LLE + ++I Sbjct: 312 RHELDTKNLIKEIKGIASHPTEKYFFNVSSEAALLEEAGTLGERIFS 358 >gi|224073632|ref|XP_002304123.1| predicted protein [Populus trichocarpa] gi|222841555|gb|EEE79102.1| predicted protein [Populus trichocarpa] Length = 757 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 19/127 (14%), Positives = 33/127 (25%), Gaps = 14/127 (11%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +KL TN + A + L T + ITDG N + Sbjct: 401 DKLTADGGTNILGPLKQAIKLLAET----------TNSIPVIFLITDGAVEDERDICNFV 450 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITD 385 + ++I + + LR G F D+ + K+ Sbjct: 451 --KGYLPSGGSISLRISTFGIGTYCN-HHFLRMLAQIGRGHFDTAYDADSVDFRMQKLFT 507 Query: 386 KIQEQSV 392 + Sbjct: 508 TASSIIL 514 >gi|189350352|ref|YP_001945980.1| hypothetical protein BMULJ_01515 [Burkholderia multivorans ATCC 17616] gi|189334374|dbj|BAG43444.1| putative membrane protein [Burkholderia multivorans ATCC 17616] Length = 599 Score = 41.0 bits (94), Expect = 0.35, Method: Composition-based stats. Identities = 20/148 (13%), Positives = 46/148 (31%), Gaps = 7/148 (4%) Query: 8 VCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQ 67 + + + AID+ ++ + R +Q D A ++ S+ + ++ S Sbjct: 1 MIAIVVLGAIDVGNLYFQRRNLQRIADMAAIASVESMTDQCSQQNSPAMMAAQSNALANG 60 Query: 68 IKKHLK-QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPS 126 Q I D + N T N + + Y +FL Sbjct: 61 FDYRASGQTLSIECGRWDTSATPYFNSTFTPLNAVSVSVTQQVPY------IFLGRFFGK 114 Query: 127 ALTNLSLRSTGIIERSSENLAISICMVL 154 + + + + ++ + +I L Sbjct: 115 SGSTGATVAAFSTAKAINIDSFTIGTTL 142 >gi|332019999|gb|EGI60451.1| Voltage-dependent calcium channel subunit alpha-2/delta-3 [Acromyrmex echinatior] Length = 688 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 22/131 (16%), Positives = 49/131 (37%), Gaps = 24/131 (18%) Query: 258 NLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N++E+K L+ L TN A+ A+ L+ SS + + + + TDG Sbjct: 204 NIHEMKVALSSLKHEDTVTNISAALSTAFEILHKYNRSSQGSQCNQAIM-LITSDTDGPP 262 Query: 317 SGASAYQNTLNTLQICEYMRNAG-----MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 E ++ ++I++ + + DL + G + + Sbjct: 263 ---------------AEVIKRYNWPHMPVRIFTYLIG-GDKSLDLQNTACTTKGFYARIT 306 Query: 372 DSREL-LESFD 381 ++ E+ + F+ Sbjct: 307 NADEIHNKVFE 317 >gi|85706702|ref|ZP_01037794.1| hypothetical protein ROS217_08124 [Roseovarius sp. 217] gi|85668760|gb|EAQ23629.1| hypothetical protein ROS217_08124 [Roseovarius sp. 217] Length = 240 Score = 41.0 bits (94), Expect = 0.36, Method: Composition-based stats. Identities = 20/143 (13%), Positives = 41/143 (28%), Gaps = 29/143 (20%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 P+ + V + + L P T ++ A L G V+ +TD Sbjct: 93 PMPDAGAAVVAGIEALTPGGLTPIAASVAAAAEVL-----------GYRTHPGIVVLVTD 141 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA-------------PPEGQDLLRKC 360 G + L + ++ + +GQ + + Sbjct: 142 GNETCGGTPC----ALGTALAAEARDLTVHVIGFRVVHDPFSWNSPEAKGYDGQTVAKCL 197 Query: 361 TD-SSGQFFAVNDSRELLESFDK 382 D + G F + EL+ + + Sbjct: 198 ADATGGLFVSTETVDELVAALRE 220 >gi|118579649|ref|YP_900899.1| von Willebrand factor, type A [Pelobacter propionicus DSM 2379] gi|118502359|gb|ABK98841.1| von Willebrand factor, type A [Pelobacter propionicus DSM 2379] Length = 337 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 20/146 (13%), Positives = 45/146 (30%), Gaps = 20/146 (13%) Query: 243 YNIGIVGNQCTPLSNNLNE----VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 + G PL+N+ +K + ++ A+ +S Sbjct: 136 FLFASKGVPIVPLTNDYGYCQYILKHANDSTISTPGSDLGQAITTGIYLFEESSRTS--- 192 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 K ++ I+DGE+ +++ + + G+ IY+V +R Sbjct: 193 ------VKSIVLISDGED----INEDSSVMHEAAQRAAAKGIAIYTVGTGRGQGVMVPIR 242 Query: 359 KC--TDSSGQFFAVNDSRELLESFDK 382 G ++ D L ++ Sbjct: 243 DAIGAAIEG-YYQDEDGSYLKTRLEQ 267 >gi|240137956|ref|YP_002962428.1| hypothetical protein MexAM1_META1p1287 [Methylobacterium extorquens AM1] gi|240007925|gb|ACS39151.1| conserved hypothetical protein [Methylobacterium extorquens AM1] Length = 735 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 44/132 (33%), Gaps = 16/132 (12%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 KS + L T + A R+ E+ + V+F+TDG A Sbjct: 412 AKSFVAGLQASGGTEMLAPLQAALRDATPEETGR---------LRQVVFLTDG------A 456 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 N R +++ V + + P G + G F ++ ++ E Sbjct: 457 IGNEAQIFSAIATERGRS-RLFMVGIGSAPNGYLMRHAAELGRGSFTQIDTPDQVTERMR 515 Query: 382 KITDKIQEQSVR 393 + K++ +V Sbjct: 516 ALLVKLESPAVT 527 >gi|146306837|ref|YP_001187302.1| hemolysin-type calcium-binding region [Pseudomonas mendocina ymp] gi|145575038|gb|ABP84570.1| Hemolysin-type calcium-binding region [Pseudomonas mendocina ymp] Length = 3184 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 16/164 (9%), Positives = 43/164 (26%), Gaps = 21/164 (12%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 ++ +L ++ + + + + + A + + L Sbjct: 2761 SGGYGSRLALLKDAVNAFIGKLGTHTGQINIALISFSSSASLLLSGTLAQIQTA--LAAP 2818 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 + L L TN AM A + + F+TDG+ + + Sbjct: 2819 NNVLMALTASGATNYEAAMQQANAWFGGVEVNG--------YNNLAYFLTDGDPTTYNGD 2870 Query: 323 QNTL----------NTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + L + +++++ + L Sbjct: 2871 NSNSGSTVNFNDVNRALDDATTLMAR-AEVHAIGIGTGVNSNVL 2913 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 34/285 (11%), Positives = 76/285 (26%), Gaps = 22/285 (7%) Query: 45 VSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYI 104 V + + + D T I L S +GD + + P+ Y Sbjct: 2213 VDEDELAGGNSDYDGVGTQAAGSIASLLFGESPSVSWSGDTSSLPNL---TSGGKPVVYE 2269 Query: 105 AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLY 164 + + + + G ++ + + LD++ ++ Sbjct: 2270 VDGNQLLATAGGASIFTVTLN-PNGSFTFHLQGPVDHPQGDGNDEEMLTLDLTGMLQPSS 2328 Query: 165 LQKHN---DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN---RKIDVLIESAG 218 P S + + K+ L ++ Sbjct: 2329 GTTLLGEFKILIEDDVPITANDKPDCIVQSAPPLVNLTLVLDISLSMAGDKLTALKQAVI 2388 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN----NLNEVKSRLNKLNPYEN 274 +L + V + I +N G S+ + + +N L Sbjct: 2389 SLAQ-----GYAGLSAPVHVNLITFNSGAAEIGDFTFSSVGDAGYTALLTAVNGLTASGF 2443 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 TN A+ A ++ ++ + + + FI+DGE + Sbjct: 2444 TNYEQALSVAKAQVLSDISAPGA---DPAQQHKLYFISDGEPTVG 2485 >gi|146302265|ref|YP_001196856.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] gi|146156683|gb|ABQ07537.1| von Willebrand factor, type A [Flavobacterium johnsoniae UW101] Length = 2588 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 25/272 (9%), Positives = 73/272 (26%), Gaps = 38/272 (13%) Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 +I + + ++++ + + + + + S Sbjct: 27 TSSIGLAQTITTNKTVTANAGNCGIIDVKVDITGANPITRNSDVVLAIDISGSMGNTISG 86 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 +D ++A +N + + I + + + + ++ + Sbjct: 87 DFKTSMDYAKDAALAFLNQAKANP----QNRIAIVAYSTTASLKIGLTYLNATGVTQITN 142 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++N L +TN Y + + EL + +I +TDG + N Sbjct: 143 QINALQATNSTNIYAGIVRSETELETNGRF------DCSTARAIILLTDGVTNVTGTSGN 196 Query: 325 T-----------LNTLQICEYMRN-------AGMKIYSV----AVSAPPEGQDLLRKCTD 362 T + + + +++SV +S + + Sbjct: 197 TNCNVSKTSQCVTDAINAATNAKTTTKSSVVYNNQVFSVGLFGGISGNINTNNSDQNIAK 256 Query: 363 ------SSGQFFAVNDSRELLESFDKITDKIQ 388 + L +++I +I Sbjct: 257 YTLDNIQGSAAYITQSGANLTAIYNQIATQIS 288 >gi|268561224|ref|XP_002646394.1| Hypothetical protein CBG15363 [Caenorhabditis briggsae] gi|187027190|emb|CAP33690.1| hypothetical protein CBG_15363 [Caenorhabditis briggsae AF16] Length = 400 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 27/157 (17%), Positives = 55/157 (35%), Gaps = 10/157 (6%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + R+G + YN N ++N+V + + T + + S Sbjct: 81 TTRVGLVTYNSVAKVNADLNTFQSINDVYNGVFNYLSAVTDATDSYLATGLQAANALFAS 140 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 KK VI S L+ +++ + M+ +G+ I +VA GQ Sbjct: 141 QSFNSTRNHYKKVVIVYASEYKSYG-----ELDPVKVADEMKGSGVYIVTVAYDQGGNGQ 195 Query: 355 DLLRKCTD--SSGQFFAVNDSRELLESFDKITDKIQE 389 LL+ + G F +++ + +I + + Sbjct: 196 -LLKDLAGIATPGYSF--SNTDDSDNVIGEIQGALLQ 229 >gi|116622830|ref|YP_824986.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225992|gb|ABJ84701.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 329 Score = 41.0 bits (94), Expect = 0.37, Method: Composition-based stats. Identities = 13/91 (14%), Positives = 39/91 (42%), Gaps = 9/91 (9%) Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA---VSAPPEGQDLLRKCTD 362 K VI ++G ++ + + + + + G+ IY ++ V+ P + ++ Sbjct: 202 KVVIVFSNGPDNASMVAPDDVRAV-----AEDEGIPIYVISTSEVNKDPISSGVFKRIAT 256 Query: 363 -SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G+ + ++ +E+F+ I + + Sbjct: 257 RTGGKAYWAKTWQKQVEAFENIREDLGNSYT 287 >gi|239995187|ref|ZP_04715711.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Alteromonas macleodii ATCC 27126] Length = 1356 Score = 40.7 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 11/51 (21%), Positives = 27/51 (52%), Gaps = 1/51 (1%) Query: 340 MKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQE 389 + +++ +A + L + S G F+ ++S +LLE+F+ I +++ Sbjct: 637 VITHTIGFAANNNANNFLNQLALQSGGGFYQADNSTDLLEAFNTILRSVKD 687 >gi|198283210|ref|YP_002219531.1| hypothetical protein Lferr_1082 [Acidithiobacillus ferrooxidans ATCC 53993] gi|218665578|ref|YP_002425441.1| type IV pilin biogenesis protein, putative [Acidithiobacillus ferrooxidans ATCC 23270] gi|198247731|gb|ACH83324.1| hypothetical protein Lferr_1082 [Acidithiobacillus ferrooxidans ATCC 53993] gi|218517791|gb|ACK78377.1| type IV pilin biogenesis protein, putative [Acidithiobacillus ferrooxidans ATCC 23270] Length = 1137 Score = 40.7 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 47/432 (10%), Positives = 104/432 (24%), Gaps = 73/432 (16%) Query: 31 SALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKK-----QIKKHLKQGSYIRENAGDI 85 +A +A S AS S I + + + + +++ G +G + Sbjct: 119 AAAQSACASVGASGASSAYIDNSQSMLNVAEQAIGTIINTPEYNNNMQFGLMDYALSGRV 178 Query: 86 AQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSAL-------TNLSLRSTGI 138 + NN + + S + Sbjct: 179 SLYNTWAYYMSDNNGFSFGTSATGAPSGDVTVANPCYQSGSNSCINLQQNIFGQGNGSIS 238 Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK 198 + ++ DV ++ + + N Y L + S + Sbjct: 239 DPYLYVQASSDAPVINDVLYAIAAQWQPDATNAVGWGPNPYGLNLQSYMTGNSGVNFSNY 298 Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN- 257 + + + S + + ++ T ++N Sbjct: 299 SNGLVSGMTPTSAGYFPLSQQVWESQRGYAFNANTSYNKGNIVSPISATNSANATNIANA 358 Query: 258 --------NLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 + + + + + T A A + K+V Sbjct: 359 ILPEVFRTDKSINYAAEGPITASAGYSPTAGAFSTALSYYEGSLPNPPPATCG---SKYV 415 Query: 309 IFITDGENSGASAYQNTLNTLQIC------------------------------EYMRNA 338 IFITDG+ + + L + + Sbjct: 416 IFITDGQPTQGMEHGYVYPPLGSASAQMFGVTSITASTWSSTNNNAVVETINEIQALAQK 475 Query: 339 GMKIYSVAVSAPPE----------------GQDLLRKCTDSSG--QFFAVNDSRELLESF 380 G+K Y + V + GQ +L + G F+A + ++ + Sbjct: 476 GIKTYVLGVGSAVNPNVPGASAADQAEALQGQAVLTAMAQAGGTSNFYAATSASDVQSAM 535 Query: 381 DKITDKIQEQSV 392 + I I +SV Sbjct: 536 NSIIANILGKSV 547 >gi|254427823|ref|ZP_05041530.1| PQQ enzyme repeat domain protein [Alcanivorax sp. DG881] gi|196193992|gb|EDX88951.1| PQQ enzyme repeat domain protein [Alcanivorax sp. DG881] Length = 961 Score = 40.7 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 17/140 (12%), Positives = 46/140 (32%), Gaps = 28/140 (20%) Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ---------ICEYM 335 + +Y+ K S + + +I +TDG+ + S Y + + Sbjct: 182 WEYVYDSKYDSPTNLKNQCESNHIIVMTDGDPTSDSDYSSVTKITGGGCNGSYSCQAKLA 241 Query: 336 R------------NAGMKIY--SVAVSAPPEGQDLLRKCTDSS-----GQFFAVNDSREL 376 + +K + ++ V ++ ++ + +++ EL Sbjct: 242 KWLYNDDKDVEGSRKSVKTWQVALGVGKNSSQARNMKNVAEAGLGDPSAEVRFADNADEL 301 Query: 377 LESFDKITDKIQEQSVRIAP 396 F +I D + + S ++ Sbjct: 302 AAEFKEILDLVDKDSRTLSS 321 >gi|33333558|gb|AAQ11895.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 559 Score = 40.7 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNENAIHLYLNIFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|160718|gb|AAA29774.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 559 Score = 40.7 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNENAIHLYLNIFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|160723|gb|AAA29776.1| thrombospondin related anonymous protein [Plasmodium falciparum] Length = 559 Score = 40.7 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNENAIHLYLNIFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|3142286|gb|AAC18657.1| thrombospondin related adhesive protein [Plasmodium falciparum] Length = 562 Score = 40.7 bits (93), Expect = 0.37, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 56/170 (32%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNENAIHLYANIFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|261338458|ref|ZP_05966342.1| putative von Willebrand factor type A domain protein [Bifidobacterium gallicum DSM 20093] gi|270276443|gb|EFA22297.1| putative von Willebrand factor type A domain protein [Bifidobacterium gallicum DSM 20093] Length = 493 Score = 40.7 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 36/294 (12%), Positives = 91/294 (30%), Gaps = 23/294 (7%) Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 + L +A + + + A + + S + + + ++ + Sbjct: 1 MNLTRFRKAATALCATAALALTGMMGVATAHADDTTPVIGHSSALMGNGDGSYSLTVSVS 60 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVR 237 + +++ + + + V + L N + N +VR Sbjct: 61 STDMDTAQQQTESDVVV---LMDVSGSMTTTDMKVAKNAVNGLANQLLNDE----NDTVR 113 Query: 238 IGTIAYNIGIVGNQCTP---LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY-NEKE 293 + + ++ + + +++ V LN L NTN A+ +A + + Sbjct: 114 MSIVRFSSEAKTLEFSNGSEWTHSPALVAQALNTLTSRGNTNWDGALQNASALVQGDSAR 173 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA--GMKIYSVAVSAPP 351 S+ + S + + + N + N +IY+V S Sbjct: 174 KSYVVLMSDGYPNTINSCYPAVANCTDTSWSEPNAVPKAIEAANTMPNTQIYAV--STRT 231 Query: 352 EGQDLLRKCTDS--------SGQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 + +++ D Q D + L +FD I D I+++ + + Sbjct: 232 SASESMKELVDGINAKAPKYPAQIMYGTDQQSLNNAFDTIADAIRKRFTDVTVD 285 >gi|303327225|ref|ZP_07357667.1| hemolysin-type calcium-binding region [Desulfovibrio sp. 3_1_syn3] gi|302863213|gb|EFL86145.1| hemolysin-type calcium-binding region [Desulfovibrio sp. 3_1_syn3] Length = 1149 Score = 40.7 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 39/339 (11%), Positives = 89/339 (26%), Gaps = 48/339 (14%) Query: 36 AVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 A L G + + Q+K S I E + + Sbjct: 358 ATLDGDKEGDVLFSATLDKDGNWTMEQYEQFQVKTDGADTSNIFELKFETRDSDGDIAST 417 Query: 96 DKNNPLQYIAESKAQYEIPTEN----LFLKGLIPSALTNLSLRSTGIIERSSENLAISIC 151 PL+ + ++ + N + + G A T ++ + G+ E ++C Sbjct: 418 SAKVPLEVVEQTTTEGGDSISNSDDVINIAGGDGVAGTLVAGDTGGVTEGQQVGSNYNVC 477 Query: 152 MVLDVSRSMEDLYLQ--KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK 209 VLD S SM+ + ++ + F + A + K Sbjct: 478 FVLDTSGSMDGAVSGHETRLGVATQSIENFIKNSIHEGDFVGTVNLAVVPFASEAGSVIK 537 Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL 269 + + + G + + + + + ++ L L Sbjct: 538 VSITKTAQGER-YTFGEEVYDNYADFSK-----------------------AFETSLGNL 573 Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 N TN +A + S F++DG + Sbjct: 574 NANGGTNYEAGFSNAADWF------NGLEGTSNATGNITYFLSDGVPTYHGTSSYGGGNY 627 Query: 330 QICEYMR------------NAGMKIYSVAVSAPPEGQDL 356 + ++ A M++ ++ + + + Sbjct: 628 ATLDDVKGAWDGYQELLGSAANMQVNAIGFGKDLDDKAM 666 >gi|317123666|ref|YP_004097778.1| type II secretion system F domain [Intrasporangium calvum DSM 43043] gi|315587754|gb|ADU47051.1| Type II secretion system F domain [Intrasporangium calvum DSM 43043] Length = 652 Score = 40.7 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 16/137 (11%), Positives = 50/137 (36%), Gaps = 16/137 (11%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + + + +N L+ +T+ Y A+ A R + + + ++ ++DG Sbjct: 137 TVDRAAAQRVVNGLDARGDTSLYAAVRSAARAMPGDG------------DRSMVLLSDGA 184 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ--DLLRKCTDSSGQFFAVNDS 373 ++ + + + ++ G+++ V + L + S G ++ Sbjct: 185 DTV--SDDRQGDLAEANRELKRRGVRVDVVRFNTDDPDAVVALRSFASASGGSVIPATNA 242 Query: 374 RELLESFDKITDKIQEQ 390 ++ +F ++ Q Sbjct: 243 SDVGAAFKSAARALRSQ 259 >gi|187939945|gb|ACD39081.1| hypothetical protein PACL_0293 [Pseudomonas aeruginosa] Length = 223 Score = 40.7 bits (93), Expect = 0.38, Method: Composition-based stats. Identities = 20/130 (15%), Positives = 47/130 (36%), Gaps = 11/130 (8%) Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L T A+ A + ++ ++I S K ++ ++DG + Sbjct: 87 LVAEGMTPLGGALSLASEMIEDK-----DSIPSRAYKPVIVLVSDGYPNDDWQGPFARLV 141 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ-FFAVNDSRELLESFDKITDKI 387 R++ +++A+ A + +L + F ++R++ F +T + Sbjct: 142 NGE----RSSKATRFAMAIGADA-DEVMLSDFANDPEAPLFHAENARDIHRFFRAVTMSV 196 Query: 388 QEQSVRIAPN 397 +S PN Sbjct: 197 SARSQSATPN 206 >gi|156402481|ref|XP_001639619.1| predicted protein [Nematostella vectensis] gi|156226748|gb|EDO47556.1| predicted protein [Nematostella vectensis] Length = 154 Score = 40.7 bits (93), Expect = 0.39, Method: Composition-based stats. Identities = 15/131 (11%), Positives = 40/131 (30%), Gaps = 15/131 (11%) Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYN 290 +G I Y+ + + V +++K+ + L Sbjct: 39 VSKQGTHVGIIRYDHKAEILKPFGQVTDKQGVLDKISKIT------FTGGGTKTGQALTL 92 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + + +I +TDG + + ++N+G+ +++V + Sbjct: 93 AMDGLYQIDNRKEVPDVLIVLTDGASKDSVVDP--------ANKLKNSGVTVFAVGIGKE 144 Query: 351 PEGQDLLRKCT 361 +D L Sbjct: 145 -FKEDELNLIA 154 >gi|328876382|gb|EGG24745.1| hypothetical protein DFA_02989 [Dictyostelium fasciculatum] Length = 684 Score = 40.7 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 25/180 (13%), Positives = 56/180 (31%), Gaps = 14/180 (7%) Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQ 225 + +N + + + + + + V+ L+ I Sbjct: 170 PTEPEIDNSEFVAHFIENSISNDVEIVFVFDTTGSMSSVLKEVRTKVVET-ITRLMKEIP 228 Query: 226 KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAY 285 +G Y G+ L+++++++ + +NK + A +A Sbjct: 229 SIRIGI------MGLGDYCDGLNVLNTCDLTSDISKLVNFVNKTPSTGGGDEPEAYEYA- 281 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 L KE S S K ++ I D ++ + + + G+KIY V Sbjct: 282 --LLKAKELSW----SKHTSKALVMIGDSPPHSPQFTSLKIDWVAETDNLEAMGVKIYGV 335 >gi|296133928|ref|YP_003641175.1| hypothetical protein TherJR_2435 [Thermincola sp. JR] gi|296032506|gb|ADG83274.1| hypothetical protein TherJR_2435 [Thermincola potens JR] Length = 621 Score = 40.7 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 20/84 (23%), Positives = 32/84 (38%), Gaps = 2/84 (2%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS- 61 I+ + +FI IDLA I +NQ++ +AA S A + + Sbjct: 24 VIVFMIMVIFIGLFIDLARIKTAQNQLRRVANAAACSVLADYHTSTKQDFGLFTYKGANY 83 Query: 62 -TIFKKQIKKHLKQGSYIRENAGD 84 F K +K +L + N D Sbjct: 84 DQDFAKYVKANLTFSADQNFNLLD 107 >gi|293190491|ref|ZP_06608878.1| putative von Willebrand factor type A domain protein [Actinomyces odontolyticus F0309] gi|292820902|gb|EFF79858.1| putative von Willebrand factor type A domain protein [Actinomyces odontolyticus F0309] Length = 338 Score = 40.7 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 19/169 (11%), Positives = 46/169 (27%), Gaps = 32/169 (18%) Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESS------ 295 PL+++ L ++ T A +EL++ Sbjct: 135 WDAYSMTMFPLTDDYEMATDVLQDMSDTIDTGLTRIG-GRLSATQELFDYLAPVMDENQE 193 Query: 296 --------------HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 + + ++ TD E G N + E+ ++ G+ Sbjct: 194 VSSIVGDGLASCVMGFDHNDKQRSRTILLATDNEVYGDGV----YNLSEAIEFAKSQGVT 249 Query: 342 IYSVAVSAP---PEGQDLLRK-CTDSSGQFFAVNDSRELLESFDKITDK 386 + ++ + LR + G F+ + + +I + Sbjct: 250 VTALYPGSDITLSSEALQLRDEVRKTGGDFYDASSPSSVDRVVKQIEAE 298 >gi|260824043|ref|XP_002606977.1| hypothetical protein BRAFLDRAFT_64962 [Branchiostoma floridae] gi|229292323|gb|EEN62987.1| hypothetical protein BRAFLDRAFT_64962 [Branchiostoma floridae] Length = 219 Score = 40.7 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 20/107 (18%), Positives = 36/107 (33%), Gaps = 16/107 (14%) Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 L+N+ V+ ++ L P T + + A +EL + G R+ VI +TDG Sbjct: 48 LTNDYRCVQRAVDNLRPGGTTPMFEGLMEALKEL-CQNGGVLVLPGGIRMTPRVILMTDG 106 Query: 315 ENSGASAYQNTLNTLQICEYMRNAG---------MKIYSVAVSAPPE 352 + N + G + I V + + Sbjct: 107 KPD------NQDKVILAAAAFSRKGYQEVGLPYPVPIACVGCGSGVD 147 >gi|163850829|ref|YP_001638872.1| vault protein inter-alpha-trypsin subunit [Methylobacterium extorquens PA1] gi|163662434|gb|ABY29801.1| Vault protein inter-alpha-trypsin domain protein [Methylobacterium extorquens PA1] Length = 732 Score = 40.7 bits (93), Expect = 0.40, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 43/132 (32%), Gaps = 16/132 (12%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 KS + L T + A R E+ + V+F+TDG A Sbjct: 407 AKSFVAGLQASGGTEMLAPLQAALRGATPEETGR---------LRQVVFLTDG------A 451 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 N R +++ V + + P G + G F ++ ++ E Sbjct: 452 IGNEAQIFSAIATERGRS-RLFMVGIGSAPNGYLMRHAAELGRGSFTQIDTPDQVTERMR 510 Query: 382 KITDKIQEQSVR 393 + K++ +V Sbjct: 511 ALLVKLESPAVT 522 >gi|123445195|ref|XP_001311360.1| hypothetical protein [Trichomonas vaginalis G3] gi|121893166|gb|EAX98430.1| hypothetical protein TVAG_264790 [Trichomonas vaginalis G3] Length = 382 Score = 40.7 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 19/126 (15%), Positives = 45/126 (35%), Gaps = 25/126 (19%) Query: 257 NNLNEVKSRLNKLNPYE-NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 +N++E +++ ++ T+ ++ + K + +TDGE Sbjct: 9 DNVHEAMEKISHIDADYKGTDILKTLNFVF-----------GLKPQDGFVKQIFLLTDGE 57 Query: 316 NSGASAYQNTLNTLQICEYM--RNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVND 372 + N+ +IC +I+S+ + L++ D S G + + D Sbjct: 58 D---------RNSDKICAEAQMNRNQTRIFSIGLG-DGADPGLIKGVADKSGGSYTLITD 107 Query: 373 SRELLE 378 + E Sbjct: 108 EENMNE 113 >gi|34481896|emb|CAE46496.1| trap [Plasmodium falciparum] Length = 331 Score = 40.7 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVKHAVPLAMKLIQQLNLNESAIHLYVNIFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LFIIKSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENASQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|256028718|ref|ZP_05442552.1| von Willebrand factor type A [Fusobacterium sp. D11] gi|289766627|ref|ZP_06526005.1| von Willebrand factor type A [Fusobacterium sp. D11] gi|289718182|gb|EFD82194.1| von Willebrand factor type A [Fusobacterium sp. D11] Length = 218 Score = 40.7 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 21/126 (16%), Positives = 45/126 (35%), Gaps = 10/126 (7%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 N T A+ A + N I S ++ ++DG + Sbjct: 78 NDFTEGGMTPLGGALRIAKEMVEN-----REIIPSKSYAPIILLLSDGAPNDNGWENEMY 132 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + R+ S+ + D+L+ S+G+ + DS +++ F +T Sbjct: 133 RFI---NDGRSKKCMRMSLGIGRD-YDYDVLKGF-SSNGEVYEAKDSMNIIDFFKFMTMT 187 Query: 387 IQEQSV 392 I+E+++ Sbjct: 188 IKEKTL 193 >gi|39997259|ref|NP_953210.1| hypothetical protein GSU2161 [Geobacter sulfurreducens PCA] gi|39984149|gb|AAR35537.1| hypothetical protein GSU2161 [Geobacter sulfurreducens PCA] Length = 575 Score = 40.7 bits (93), Expect = 0.41, Method: Composition-based stats. Identities = 16/119 (13%), Positives = 36/119 (30%), Gaps = 23/119 (19%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 + T A+ A+ EL +E+ K I +DG+ Sbjct: 476 NITGSGGTRLGHALWWAWGELSLRRETR----------KICIAFSDGDTGDG------PV 519 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 T + MR G+++ + + Q L + + + + ++ + Sbjct: 520 TQAAIKRMREEGIEVIGIGIQDNSIKQYL-------PDNHRIIKNLDQFTPALLELLRE 571 >gi|255535988|ref|YP_003096359.1| BatB [Flavobacteriaceae bacterium 3519-10] gi|255342184|gb|ACU08297.1| BatB [Flavobacteriaceae bacterium 3519-10] Length = 335 Score = 40.7 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 17/108 (15%), Positives = 39/108 (36%), Gaps = 21/108 (19%) Query: 250 NQCTPLSNNLNEVKSRLNKLNP----YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + PL+ + V++ + + + T+ AM A + N + S Sbjct: 141 SSIMPLTTDFTAVETYVGGVETSIVKMQGTDFLKAMQTAADKFRNVAKGSRK-------- 192 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 V+ ++DGE++ N + G+++ SV + + Sbjct: 193 --VVLLSDGEDNEG-------NEKAAAKLANREGIRVISVGIGSEEGA 231 >gi|72168566|ref|XP_796840.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] gi|115961659|ref|XP_001187264.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] Length = 958 Score = 40.7 bits (93), Expect = 0.42, Method: Composition-based stats. Identities = 25/186 (13%), Positives = 61/186 (32%), Gaps = 22/186 (11%) Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 N ++ L ++A ++ I + V I + I+ N N+ Sbjct: 326 VSGSMGGNDRLTKLNQAATQY---LRYTIDDGS--FVGIAHFSDYSRIIENLTEITDNSR 380 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++ L + T + + L T +++ I+DG+ + Sbjct: 381 EDLVMGLPSI-ANGPTCIGCGVLDGIKILK------GETGMEDPAGGYILLISDGQQNRQ 433 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-QFFAVNDSRE--L 376 + ++ G+ + ++A + +LL ++G FF + + L Sbjct: 434 PYIDEVFDEVEEA------GVIVDTIAF-SDAADPNLLELSVRTNGLGFFYPDTATSTAL 486 Query: 377 LESFDK 382 ++F Sbjct: 487 NDAFTA 492 >gi|189347154|ref|YP_001943683.1| von Willebrand factor type A [Chlorobium limicola DSM 245] gi|189341301|gb|ACD90704.1| von Willebrand factor type A [Chlorobium limicola DSM 245] Length = 6006 Score = 40.7 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 33/301 (10%), Positives = 61/301 (20%), Gaps = 15/301 (4%) Query: 56 KKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT 115 + G+ + N A + N E+ T Sbjct: 5318 PVAVADDGGNVAEGGNTITGTSVLGNDQPGADQPATITGFVFTNESGVTQTGSLGSEVDT 5377 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 + T S S + V D + L D+ Sbjct: 5378 IYGRFTMNTDGSWTYTSDASVDHSSAEPLP-DVITYTVQDADDDISTAALTIDVDDVAPA 5436 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 + + T N + +++ + Sbjct: 5437 VSPTSNSYEVPLQNTNLLITLDVSGSMSRNLNNDSHPTGNDPTRMDIAVESIAEMLSQYD 5496 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSR---LNKLNPYENTNTYPAMHHAYRELYNEK 292 R + N + + V+ LN L TN A+ A N Sbjct: 5497 YRGDVSVKLVIFSTNGQSLTTAEWVTVEEAKIMLNSLVANGGTNYDGAIEAADDAFVNTN 5556 Query: 293 ESSHNTIGSTRLKKFVIFITDGEN-----SGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 FI+DG + ++ N + Y+V + Sbjct: 5557 ------GMIANADNIAYFISDGLPSLPTITAGDIGIQPAEQTIWETFLENNDVTSYAVGI 5610 Query: 348 S 348 Sbjct: 5611 G 5611 >gi|156383255|ref|XP_001632750.1| predicted protein [Nematostella vectensis] gi|156219810|gb|EDO40687.1| predicted protein [Nematostella vectensis] Length = 174 Score = 40.7 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 13/140 (9%), Positives = 43/140 (30%), Gaps = 17/140 (12%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYR 286 + + +G + ++ ++ ++ S +N + TNT + Sbjct: 47 NFDISVSATHVGLVLFSTTASVKITLNEFYDIVKLTSAINLITIQGGFTNTGKGLQAVKS 106 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 +L++ + + +I +T + + + +R+ + +Y V Sbjct: 107 DLFDATTR-------KNVPRVLIVLT--------VASSLDDVRAPSQALRDNSVTVYVVG 151 Query: 347 VSAPPEGQDLLRKCTDSSGQ 366 V + + + Sbjct: 152 VG-ERVDVEQMNVMGSDPDR 170 >gi|169624023|ref|XP_001805418.1| hypothetical protein SNOG_15261 [Phaeosphaeria nodorum SN15] gi|111056366|gb|EAT77486.1| hypothetical protein SNOG_15261 [Phaeosphaeria nodorum SN15] Length = 296 Score = 40.7 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 21/183 (11%), Positives = 59/183 (32%), Gaps = 24/183 (13%) Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 S+ + + + ++ + ++ A E + + G Sbjct: 6 SMIVVDNSEASRNGDYVPSRWEAQQDAVNLIFSAKTGANPESSVGLMSM------GGSTP 59 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 T L+ ++ +V L++ N++ ++ A L + + K+ +I Sbjct: 60 EILTTLTTDIGKVLDGLHRTKIKGNSHFVTGINVAALALKH--------RQNKSQKQRII 111 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG-----QDLLRKCTDSS 364 + + ++ N +++ + M+ G+ I VA + Q + Sbjct: 112 I-----FNCSPIEEDEKNLIKLAKKMKKTGINIDIVAFGELSDDTTKKLQAFSENVSSGE 166 Query: 365 GQF 367 G + Sbjct: 167 GSY 169 >gi|225435355|ref|XP_002285271.1| PREDICTED: hypothetical protein isoform 2 [Vitis vinifera] Length = 670 Score = 40.7 bits (93), Expect = 0.43, Method: Composition-based stats. Identities = 11/111 (9%), Positives = 34/111 (30%), Gaps = 14/111 (12%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ +L + G ++ S+ + + + + + + +N Sbjct: 292 TKLALLKRAMGFVIQSLGPCDRLSVISFSSTARRLFPLRRMTDT------GRQQALQAVN 345 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 L TN + + + + +I ++DG+++ Sbjct: 346 SLVSNGGTNIAEGLRKGAKVMLD--------RKWKNPVSSIILLSDGQDTY 388 >gi|262171974|ref|ZP_06039652.1| protein BatA [Vibrio mimicus MB-451] gi|261893050|gb|EEY39036.1| protein BatA [Vibrio mimicus MB-451] Length = 335 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 63/177 (35%), Gaps = 23/177 (12%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENT 275 + + + +K +++ G TP + + + + L++ ++T Sbjct: 122 SRLTAAKKVLRDFVTQRQGDRFGLILFGDAAFIQTPFTADQDVWLNLLDEAETGMAGQST 181 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N A+ + ST + ++ +TDG ++G+ + + + Sbjct: 182 NLGDAIGLGIKVFEQSP--------STSQDQIMLVLTDGNDTGSFVSPVDAAKIAAAKGI 233 Query: 336 RNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 R IY +A+ P D++ + + + + F D +L E++ I Sbjct: 234 R-----IYVIAMGDPENVGEQPLDMDVVSRVSSLTQARSFVAIDQSQLNEAYQVIDQ 285 >gi|295093844|emb|CBK82935.1| von Willebrand factor type A domain. [Coprococcus sp. ART55/1] Length = 343 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 27/246 (10%), Positives = 60/246 (24%), Gaps = 24/246 (9%) Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + + + K I + S Sbjct: 83 PYTSRRMQDEKYCRDIILCIDISTSVDYLNENLLDKLKKTVDELQGERFGIVIFNTSPVL 142 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN----- 274 L K L + + + + + ++ ++ N Sbjct: 143 LTPLTDDYEYVKDQLDLIAQCLKSRNEVNLDDAFSSGYDWIYYQAYISSGTLIGNEQRGS 202 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 + + A + + + K VIF TD + G + + Sbjct: 203 SLIGDGLAAAAIDFSDADKERT---------KVVIFSTDNDIQGTPVATLD----EAADI 249 Query: 335 MRNAGMKIYSVAV--SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + + +Y V P + + + G+F+ E SF +I I++ S Sbjct: 250 CVSNNVTVYGVGTKEMTPENKESMKNAVEKTGGKFY----LEEESGSFGEIVSSIEKLSK 305 Query: 393 RIAPNR 398 + R Sbjct: 306 NLVKVR 311 >gi|218515577|ref|ZP_03512417.1| hypothetical protein Retl8_18742 [Rhizobium etli 8C-3] Length = 54 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 7/26 (26%), Positives = 12/26 (46%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIR 26 MTAI+ V AI + ++ + Sbjct: 25 MTAILAPVLLGAAGMAIQVGDMLLSK 50 >gi|115374131|ref|ZP_01461419.1| hypothetical protein STIAU_3107 [Stigmatella aurantiaca DW4/3-1] gi|115368907|gb|EAU67854.1| hypothetical protein STIAU_3107 [Stigmatella aurantiaca DW4/3-1] Length = 1551 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 27/320 (8%), Positives = 81/320 (25%), Gaps = 37/320 (11%) Query: 81 NAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE 140 N + AQ A+ + + S + + + + + + Sbjct: 308 NTTNAAQYARCLTCLNTRGYFRVYDSSTSDNRVNPNFILWGRFLNFNPPKYVTVRAALKQ 367 Query: 141 R------SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + ++ + M+ + + S++ ++ T Sbjct: 368 VLKNLKGARAGISTFTQTSSANTLKMQPGCQGILANADAFDSSRTGFIATINSLTFNTAT 427 Query: 195 TKSKYAPAPAPANR-KIDVLIESAGNLVNSIQ---KAIQEKKNLSVRIGTIAYNIGIVGN 250 ++ V + G + + KN + + G + Sbjct: 428 PLARALLNTGYYFTSDQTVYKDVFGFGATNPTVGYAYPTDFKNEPLSSENRSVCWGDQAS 487 Query: 251 QCTPLSNNL-------NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 L++ + V ++ + P+ A + N + Sbjct: 488 AVIILTDGEPNTDTLGSAVVQKIRS-RNGGPVSCPPSAPCA---------DTPNDANAML 537 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD- 362 + T+ +T + +++Y+V + LL+ Sbjct: 538 DDVAKLLFTNDLQ------YSTPPIVGALNTSGQQSLRVYTVGFAIDSN---LLKNAAAV 588 Query: 363 SSGQFFAVNDSRELLESFDK 382 G+ + +D+ L ++ Sbjct: 589 GGGRSYTAHDAAGLRQALQD 608 >gi|3273267|dbj|BAA31177.1| thrombospondin-related protein [Plasmodium falciparum] Length = 559 Score = 40.7 bits (93), Expect = 0.44, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSYRRHNWVKHAVPLAMKLIQQLNLNESAIHLYVNDFSNNAREIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIKSLLNTNLPYGRTNLTDALLQVRKHLNDRI-------NRENASQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDLGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|308472879|ref|XP_003098666.1| hypothetical protein CRE_04169 [Caenorhabditis remanei] gi|308268266|gb|EFP12219.1| hypothetical protein CRE_04169 [Caenorhabditis remanei] Length = 382 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 31/170 (18%), Positives = 57/170 (33%), Gaps = 18/170 (10%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN------ 267 S ++ NSI + R+G + YN N + ++ +N Sbjct: 46 NNSVVDIQNSISNIFEIVPIPINRVGFVTYNSLATINADLNKFKSWGDLSQGVNDSYNNM 105 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L+ + + A L + + IG K +I N L+ Sbjct: 106 NLSSENTSFIGTGLITAGELL----QVQGSAIGRVYYPKVIIVYASAFNGTGL-----LD 156 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQFFAVNDSRE 375 L + +++AG+ I +VAV G Q L G F+++ + Sbjct: 157 PLSVANTLKSAGITIITVAVDTDNNGVIQKQLASIAS-PGSAFSLDPDDD 205 >gi|294789671|ref|ZP_06754904.1| pilus-associated protein [Simonsiella muelleri ATCC 29453] gi|294482383|gb|EFG30077.1| pilus-associated protein [Simonsiella muelleri ATCC 29453] Length = 948 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 11/63 (17%), Positives = 27/63 (42%), Gaps = 3/63 (4%) Query: 338 AGMKIYSVAVSAPPEGQDL--LRKCTDSSGQFF-AVNDSRELLESFDKITDKIQEQSVRI 394 + +++ A + L L K ++ ++ +EL +F I +I ++++I Sbjct: 126 QKIGTFTIGFGAGLSERGLTSLSKMATANKNVALNASNQKELDAAFTSIIKQITSENIQI 185 Query: 395 APN 397 P Sbjct: 186 PPK 188 >gi|86148746|ref|ZP_01067019.1| VCBS [Vibrio sp. MED222] gi|85833461|gb|EAQ51646.1| VCBS [Vibrio sp. MED222] Length = 2142 Score = 40.7 bits (93), Expect = 0.45, Method: Composition-based stats. Identities = 33/361 (9%), Positives = 97/361 (26%), Gaps = 30/361 (8%) Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT--------KDKNNPLQ 102 D T + +L + ++ +I +T + Sbjct: 1371 TDSTDGHSIQGNLVITDANNNLSDTVFSETSSINITTSTGQPLTSEGHTIDWAVSPDGHA 1430 Query: 103 YIAESKAQYEIPTENLFLKGL---IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + + I + + + + + + +A + Sbjct: 1431 LTGTANGKVVIEATLDRNGHYDIQLKAPVDHPNTNGEDNLVIQIPVIAKDTSGLKSSGGQ 1490 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGN 219 + + + + S + ++ +L ES Sbjct: 1491 ITVSIEDDQPVATAIDVPVTPETKSDTNIQLVIDVSGSMGYDSGVAGKTRLAILKESLAK 1550 Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTY 278 ++ V++ + + S +++E + +NKL P NT+ Sbjct: 1551 MLQQYDTL------GDVKVQIVTFTGNAKLIHDGSKSWFSVSEAITEINKLKPKNNTDYD 1604 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ A ++++S + FITDG + +++ + Sbjct: 1605 DALRKARTSWDHDEDSKLPDANN-----VSYFITDGIPNQDDRIDYW-EAKTWTKHLDDN 1658 Query: 339 GMKIYSVAVSAPPEGQDLLRKCT-----DSSGQFFAVNDSRELLESFDK-ITDKIQEQSV 392 G+ ++ ++ + + + + ++ S +L + + I + V Sbjct: 1659 GITSQAIGITGGYLDKGQIDLVSHDGLTGTDSSAIVIDSSSQLTDILTQPIIQTVNGSLV 1718 Query: 393 R 393 Sbjct: 1719 T 1719 >gi|319782123|ref|YP_004141599.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] gi|317168011|gb|ADV11549.1| von Willebrand factor type A [Mesorhizobium ciceri biovar biserrulae WSM1271] Length = 554 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 46/134 (34%), Gaps = 20/134 (14%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 P + + + + + + L T A+ A L ++ + V+ I Sbjct: 87 VPPQAGSGSAISAAADSLKFLGKTPLTAAVKQAAEALRYTEDKAT-----------VVLI 135 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYS--VAVSAPPEGQDLLRKCTD-SSGQFF 368 TDG + + + + + +G+ + V + + D + G++ Sbjct: 136 TDGLETCGG------DPCALGKELEASGVDFTADVVGFGLTADEGKQIACLADNTGGKYI 189 Query: 369 AVNDSRELLESFDK 382 +D + L E+ + Sbjct: 190 QASDEKALQEALAE 203 >gi|294011132|ref|YP_003544592.1| hypothetical protein SJA_C1-11460 [Sphingobium japonicum UT26S] gi|292674462|dbj|BAI95980.1| hypothetical protein SJA_C1-11460 [Sphingobium japonicum UT26S] Length = 157 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 16/138 (11%), Positives = 32/138 (23%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 AII+ V L A D+A + +Q A + A ++ T + + Sbjct: 18 AIIMPVLVLLTCMAGDVAMAFKAKIGLQRAAERTAQLAAAGGYTNDTTDTSKAYNNLAAD 77 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 Q A + + L Sbjct: 78 AAAAAGVPTGNVTVTPTLLCNATVQTASPEVPCPDGQQTKRYVAIAISGTYTPMFAKLMP 137 Query: 123 LIPSALTNLSLRSTGIIE 140 + ++L + + Sbjct: 138 NSLWSSQGIALTGSASVR 155 >gi|257898559|ref|ZP_05678212.1| von Willebrand factor domain-containing protein [Enterococcus faecium Com15] gi|257836471|gb|EEV61545.1| von Willebrand factor domain-containing protein [Enterococcus faecium Com15] Length = 412 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 13/150 (8%), Positives = 49/150 (32%), Gaps = 23/150 (15%) Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 L++ + ++ +++ + + Y + ++ + G+T + D Sbjct: 15 LTDGVPTFSYKVQRVHAQSSNDYYG----------TQFSNTQDQPGNTSRIARSYYAPDQ 64 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG---------QDLLRKCTD--- 362 N + T+ ++ G++I+ + + + + +R+ Sbjct: 65 NNQSRRIDSTFIATIGEAMALKERGIEIHGLGIQLQSDPAAGLSKAEVESRMRQMVSADE 124 Query: 363 SSGQFFA-VNDSRELLESFDKITDKIQEQS 391 ++ + + ++ E K +I Sbjct: 125 KGDLYYESADHATDISEYLAKKAVQISATV 154 >gi|156314124|ref|XP_001617882.1| hypothetical protein NEMVEDRAFT_v1g225712 [Nematostella vectensis] gi|156196293|gb|EDO25782.1| predicted protein [Nematostella vectensis] Length = 188 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 21/139 (15%), Positives = 43/139 (30%), Gaps = 19/139 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRE 287 + N S RI Y+ + +++ + K+ N A++++ R+ Sbjct: 60 DISNASARIAVAVYSNYTDSSFSFDSHLTHASLRNAIEKIVYPNGDPRNIGAALNNSMRQ 119 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 LYNE + + +I +S + MR G+ IY ++ Sbjct: 120 LYNEPRPR--------VPRVLIVTAHDRSSDGVYV--------ASDMMRRNGVIIYGIST 163 Query: 348 SAPPEGQDLLRKCTDSSGQ 366 + L Sbjct: 164 G-GGSDKKHLETIASDPDN 181 >gi|39996168|ref|NP_952119.1| hypothetical protein GSU1066 [Geobacter sulfurreducens PCA] gi|39982933|gb|AAR34392.1| hypothetical protein GSU1066 [Geobacter sulfurreducens PCA] gi|298505182|gb|ADI83905.1| type IV pilus tip-associated adhesin [Geobacter sulfurreducens KN400] Length = 1014 Score = 40.7 bits (93), Expect = 0.46, Method: Composition-based stats. Identities = 32/322 (9%), Positives = 79/322 (24%), Gaps = 46/322 (14%) Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 + + T + + + + + D Sbjct: 47 VMILLDNSGSMDIIMQHSAFDPTARYSGGFDNDRTYYQTTSNGYHYLSTGNDYIRDDKKG 106 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 N+ + + + + + I Sbjct: 107 NFTKNSVTIKLPLPYDDTRWDGNYLNWLFYHATSSQRSTVSTDATLQKTRIQTARGVISN 166 Query: 227 AIQEKK-----------NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 ++ + R + G + C L++ V + ++ ++ T Sbjct: 167 LVKTVSGVRFGLAKLNVDGYDRFDRKQTDGGSIVRNCGDLTS--ANVDTSVSGISAETWT 224 Query: 276 NTYPAMHHAYRE------LYNEKESSHNTIGSTRLKKFVIFITDGENSG-----ASAYQN 324 A+ ++ LYN S + I S+ K F I +TDGE + Sbjct: 225 PLGEALSEVWQYFKGGTSLYNTGVSYTSPITSSCQKSFTIVVTDGEPTYDGCYRGDFSSY 284 Query: 325 TLNTLQICE------YMRNAG------------MKIYSVAVSAPPEGQDLLRKCTDSSG- 365 + + G + Y++ ++ LLR ++ G Sbjct: 285 GCDNAADADSHLADVAAHMNGSDATSAYGGTQSVTTYTIGMTIDSS---LLRTTAENGGG 341 Query: 366 QFFAVNDSRELLESFDKITDKI 387 ++ +L + ++I Sbjct: 342 SYYTTTSGMDLATALQNAVNEI 363 >gi|255009406|ref|ZP_05281532.1| hypothetical protein Bfra3_09717 [Bacteroides fragilis 3_1_12] gi|313147165|ref|ZP_07809358.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] gi|313135932|gb|EFR53292.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12] Length = 341 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 53/168 (31%), Gaps = 49/168 (29%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L +NP + T A+ A R + + + Sbjct: 142 TQLPITSDYISAKMFLESINPSLISKQGTAIGAAISLAARSFTPQ----------EGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 ++ ITDGEN A + + G+++ + V P Sbjct: 192 AIVVITDGENHEGGAVEAAKEAAKK-------GIQVNVLGVGLPDGAPIPIEGSNDFRRD 244 Query: 353 ----------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQE 389 + + ++ +G + V++S + I +I + Sbjct: 245 REGNVIVTRLNEAMCQEIAKVGNGIYVRVDNS---NSAQKAINQEINK 289 >gi|317483712|ref|ZP_07942659.1| hemolysin-type calcium-binding protein [Bilophila wadsworthia 3_1_6] gi|316925028|gb|EFV46167.1| hemolysin-type calcium-binding protein [Bilophila wadsworthia 3_1_6] Length = 1111 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 44/361 (12%), Positives = 102/361 (28%), Gaps = 27/361 (7%) Query: 38 LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK 97 L+G ASI + T T S+ + + + +++KD Sbjct: 391 LTGSASIEVEHTADVFTPDNVVISSTVPSGEVPSTSITITFEDLLHNDMDRDDASVSKDG 450 Query: 98 NNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS 157 + + K + + + + +I ++ Sbjct: 451 LHITEITIGGKTYTSHDASTDISYNETTKISIDWQKGTISVTNTGKNSESIQFGYGVEDR 510 Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLP-----PPPKKSFWSKNTTKS--KYAPAPAPANRKI 210 D N + + S N + K + + Sbjct: 511 HGATDSADITVNVTATTGAGSIGDDLLQGATTTENVAMSYNISFVLDKSGSMGSSYSTAK 570 Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL---- 266 + + L + IQ N+ V + + G S E+++ L Sbjct: 571 EAVANYIEKLWDDIQNTD-AIINIQVVKFSSSVGWGDNNTFTLDKSTTYKELQAFLSAHV 629 Query: 267 -NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 N NTN A+ A +++E+ + FI+DGE + + Sbjct: 630 TNNDKASGNTNYEDALLKAESWFNSQEENGFANR--------LYFISDGEPNRP-YGKPV 680 Query: 326 LNTLQICEYM---RNAGMKIYSVAV-SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + + + + ++++ + D+L K ++ G + ++ EL ++ Sbjct: 681 ERAEAVYDRIVGDSAHPVDVHAIGILGNGANDLDVLNKFDNTDG-ADQIRNAGELYDAIA 739 Query: 382 K 382 Sbjct: 740 S 740 >gi|156409361|ref|XP_001642138.1| predicted protein [Nematostella vectensis] gi|156229279|gb|EDO50075.1| predicted protein [Nematostella vectensis] Length = 989 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 33/348 (9%), Positives = 77/348 (22%), Gaps = 46/348 (13%) Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKA---QINITKDKNNPLQYIAES 107 KK+ +K G + + + + E+ Sbjct: 156 AAKQEATADNKAEPKKEQASGIKDAPEATPKKGADKATPKEVEASGDEVASEAEAKPVEA 215 Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK 167 P + + +L S Sbjct: 216 AKTESNPAKEDKPAEKAKEDKAGPAKEDKPSKTSPKVGKPHPKGEILGGESSDPLDIAGD 275 Query: 168 HNDNNNMTSNKYLLPPPPKKSFWS-------KNTTKSKYAPAPA----PANRKIDVLIES 216 D + + + +W +S P P P + + Sbjct: 276 TVDTDAPAEEETTDEEGSGQDYWGPEEDSAYDYPPRSSCVPCPPGPELPISNTAVPAKTA 335 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY---- 272 A V+ + + + R G + I + S ++ +R+ + Sbjct: 336 ASANVDLV--ILIDGSRSVERSGVGNFRRAIDFARDLTSSFVVSPRHTRVGLMVYGKRAY 393 Query: 273 ---------ENTNTYPAMHHAYRELYNEKESSHN---------TIGSTRLKKFVIFITDG 314 +N + + R +++ +K ++ +TDG Sbjct: 394 KVFGFNDYRDNNRLFTGFNKPIRYPRERAQTATALRSAYRTFFGRNKRSAQKVLVLVTDG 453 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 + + + ++ G+KIY V L Sbjct: 454 KIRD-------AKAKRQSQSIKRRGVKIYVVGAG-KYFNIKQLEAMAS 493 >gi|34481898|emb|CAE46497.1| trap [Plasmodium falciparum] Length = 331 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNDNAIHLYVNVFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + K+ V+ +TDG Sbjct: 115 LIIIRSLLSTNLPYGRTNLTDALLQVRKHLNDRI-------NRENAKQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|66820260|ref|XP_643762.1| hypothetical protein DDB_G0275093 [Dictyostelium discoideum AX4] gi|60471873|gb|EAL69827.1| hypothetical protein DDB_G0275093 [Dictyostelium discoideum AX4] Length = 684 Score = 40.7 bits (93), Expect = 0.47, Method: Composition-based stats. Identities = 20/126 (15%), Positives = 48/126 (38%), Gaps = 7/126 (5%) Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 L+ ++ K ++E + + I + L+ +++++ + +N + + Sbjct: 217 LMETVTKLMKEIPLIRIGIMAMGDFCEERPLYILDLTTDVDKIVNFINGVKATSGGDEPE 276 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 A + SSH K ++ I D + LN ++ C+ + G Sbjct: 277 CYEFALHKAKFLSWSSH-------TSKALVMIGDSPPHPPYYSNHHLNWVRECDDLEALG 329 Query: 340 MKIYSV 345 +KIY + Sbjct: 330 VKIYGI 335 >gi|310639526|ref|YP_003944284.1| protein [Paenibacillus polymyxa SC2] gi|309244476|gb|ADO54043.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2] Length = 249 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 17/97 (17%), Positives = 42/97 (43%), Gaps = 10/97 (10%) Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS---VAVSA-PPEGQDLLRKCT 361 K ++ ITDG ++ + + + +R G+ + + G +++ Sbjct: 2 KQILIITDGCSNVGVSP-----VMAAAQALRE-GITVNVAGVIDYGTIGELGSAEIQEIA 55 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + G F + +++L ++ +T K Q+++ A NR Sbjct: 56 KAGGGFSQIVGTKQLAQTMQMMTRKTVVQTIQQAVNR 92 >gi|299067814|emb|CBJ39025.1| conserved protein of unknown function [Ralstonia solanacearum CMR15] Length = 348 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 41/271 (15%), Positives = 84/271 (30%), Gaps = 14/271 (5%) Query: 5 IISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTI-------KDPTTKK 57 ++ F A+D+A ++ +RN++Q+A DAA L+G A + T+ Sbjct: 25 MLIAIFSVGALAVDIARLIVVRNELQNAADAAALAGAAGLYPTNPTPNWSNGVTQGTSAV 84 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + + + ++ G + Q I + +Q A Sbjct: 85 KLNKSSNVQLVSGTVQAGYWNLTGTPAGLQSQSITPGANDVPAVQVTISRSAGNNGGPVA 144 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSE--NLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 L + + N S + +I A+ + + Y + T Sbjct: 145 TLLAPIFGALSANSSATAVAVIAAPGSAGPGALFPIAISKCLYDLYWNYSTGQPKIDPST 204 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK-----IDVLIESAGNLVNSIQKAIQE 230 Y+ S + T ++ P + + S ++ N I A+ Sbjct: 205 GKPYVFQINTSYPTSSSSCTSGEWTGFNGPTDASTLKGLVQNGNTSTLSIGNMINTALAT 264 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 SV + + + + PLS +E Sbjct: 265 GVKSSVYMAIPSTPLDVTIPVVNPLSPGASE 295 >gi|94312593|ref|YP_585802.1| hypothetical protein Rmet_3661 [Cupriavidus metallidurans CH34] gi|93356445|gb|ABF10533.1| conserved hypothetical protein [Cupriavidus metallidurans CH34] gi|222832771|gb|EEE71248.1| predicted protein [Populus trichocarpa] Length = 575 Score = 40.7 bits (93), Expect = 0.48, Method: Composition-based stats. Identities = 18/142 (12%), Positives = 37/142 (26%), Gaps = 20/142 (14%) Query: 1 MTAIIISVCFLFITY-AIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIK-------- 51 + A ++ ++D+ + Y + Q+Q +D A LSG + S Sbjct: 19 LMAALLIAAIGVAALVSLDVGFVFYTQRQLQKLVDVAALSGAQQLKSADDQATTNANVLS 78 Query: 52 ---DPTTKKDQTSTIFKKQIKKHLKQGSYIRENAG--------DIAQKAQINITKDKNNP 100 + T + +R G + + Sbjct: 79 SVTSAAAQNGYTKAVANDCTTAVAGAADGVRTCLGLWDPANPANGDSTRHFDPGYPATTV 138 Query: 101 LQYIAESKAQYEIPTENLFLKG 122 +A +P +F G Sbjct: 139 SANAVRVQATLTVPLLFMFQGG 160 >gi|332884780|gb|EGK05036.1| hypothetical protein HMPREF9456_03189 [Dysgonomonas mossii DSM 22836] Length = 342 Score = 40.3 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 53/170 (31%), Gaps = 47/170 (27%) Query: 253 TPLSNNLNEVK---SRLN-KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 PL+ + K +N L P + T A+ + + K + Sbjct: 146 LPLTPDNQSAKLFLETINPSLVPVQGTAIGSAIDMSM----------SCFSNDADIDKAI 195 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 + ITDGE +A + + G+ + V + Sbjct: 196 VLITDGEGHEGNAEEAAARAA-------SKGVHVNVVGIGTAEGAMIPEAENSRDIKRDT 248 Query: 353 ---------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSV 392 +++ R+ + G + ++S L+S DK+Q++ + Sbjct: 249 QGQPVVTKLNEEMCRQIAKAGEGLYAHADNSNSALKSLQAELDKLQKKEI 298 >gi|257897710|ref|ZP_05677363.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] gi|257835622|gb|EEV60696.1| von Willebrand factor type A domain-containing protein [Enterococcus faecium Com15] Length = 1104 Score = 40.3 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 33/362 (9%), Positives = 98/362 (27%), Gaps = 46/362 (12%) Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN-NPLQYIAESKAQYEIPTENLFLKG 122 +L ++ I ++ + N L+ +SK I + N + Sbjct: 379 ATTLYNVYLDVIGSEKQEISPIDIVFVLDKSASMNEGTLEGGGQSKNAALIESVNEISEN 438 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 L+ ++ + +S + + D+ D+ ++N + Sbjct: 439 LLSDPNMDIRI-GMVNFYHNSTVINNQEQISSDIFPLTNDINRLTGSENTALNRTPIGGT 497 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA 242 P NR + ++ G+ + A + ++ + G + Sbjct: 498 P----LTLGLKNGYETLYADNGGENRNPEKILIVVGDGTPTFSYAPIQTRSRTSIWGAWS 553 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 + + ++ NT+ + + + T Sbjct: 554 SWSVMGDKIAIDRGDLFKNFETF------SGNTS-NAGFTYPVTYASDFDRPVNGTNVEY 606 Query: 303 RLKKFVI-------FITDGENSGASAYQNTLNTLQIC-----------EYMRNAGMKIYS 344 R + ++ G S + T + I+S Sbjct: 607 RYGEVKEGDDKATHWVGTGSASNDTNGSPTSQEKSSAINTVAYHHWLKNKYQENPPSIFS 666 Query: 345 VAVSAPPE----------GQDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDKIQE 389 + + G+++L+ D + +++ N+ +++ + + I+ ++ Sbjct: 667 IGLGIDGSIAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDISSTFKK 726 Query: 390 QS 391 Sbjct: 727 TI 728 >gi|145490779|ref|XP_001431389.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398494|emb|CAK63991.1| unnamed protein product [Paramecium tetraurelia] Length = 272 Score = 40.3 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 18/139 (12%), Positives = 52/139 (37%), Gaps = 8/139 (5%) Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + ++++ L ++ +T A+ + + + + +TDG+ Sbjct: 14 TARTSKLRYLLPRIQCSGSTAFRDAVIQGNELMLKLFSLFVKEGLHDKFQIVHVVLTDGD 73 Query: 316 NSGASAYQNT--LNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLR---KCTDSSGQFF 368 + + L + + +K + + V+ + ++ C+ S Q++ Sbjct: 74 DCASQTSHQDFLKYQLYLYSQLPPQMLKTFYIGVNLENNSTVRQQMKQIIDCSCESAQYY 133 Query: 369 AVNDSRELLESFDKITDKI 387 ++ +S + E F KI +I Sbjct: 134 SI-NSNGINEIFQKIQMQI 151 >gi|158522693|ref|YP_001530563.1| von Willebrand factor type A [Desulfococcus oleovorans Hxd3] gi|158511519|gb|ABW68486.1| von Willebrand factor type A [Desulfococcus oleovorans Hxd3] Length = 913 Score = 40.3 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 17/106 (16%), Positives = 42/106 (39%), Gaps = 15/106 (14%) Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + P+++ V+ + L+P NT A+ H ++ + + +I Sbjct: 788 DFDQPMTD---TVRRGIGGLSPQRNTRMGAAIRH----------ATAQLKAAAAASRLLI 834 Query: 310 FITDGENSGASAYQNTL--NTLQICEYMRNAGMKIYSVAVSAPPEG 353 +TDG + + +T + R G+ ++++ V+ +G Sbjct: 835 IVTDGFPNDLDYKRERAIEDTRRALLEARAGGISVHTITVNIAGDG 880 >gi|21219969|ref|NP_625748.1| hypothetical protein SCO1467 [Streptomyces coelicolor A3(2)] gi|7209227|emb|CAB76889.1| conserved hypothetical protein SCL6.24c [Streptomyces coelicolor A3(2)] Length = 491 Score = 40.3 bits (92), Expect = 0.49, Method: Composition-based stats. Identities = 18/158 (11%), Positives = 51/158 (32%), Gaps = 27/158 (17%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y + + + + + + + T + + R L + Sbjct: 107 AVVYPDTPRMARASART--RSRAERAVRETVAGGGTCIGAWLDLSRRLLTEQ-------- 156 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQ-----ICEYMRNAGMKIYSVAVSAPPEGQ 354 V+ +TDG+N + +C+ + +G+ Sbjct: 157 --DAPIGHVLLLTDGKNQHDEQMPLARVLEECAGRFVCDAW----------GIGDGWDGR 204 Query: 355 DLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 +LLR + G +V + L ++++ +++ ++V Sbjct: 205 ELLRITSRLHGSASSVREEEALPGEYEQLMNRLLTKTV 242 >gi|218529650|ref|YP_002420466.1| Vault protein inter-alpha-trypsin domain protein [Methylobacterium chloromethanicum CM4] gi|218521953|gb|ACK82538.1| Vault protein inter-alpha-trypsin domain protein [Methylobacterium chloromethanicum CM4] Length = 738 Score = 40.3 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 25/133 (18%), Positives = 47/133 (35%), Gaps = 18/133 (13%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 KS + L T + A R+ E+ + V+F+TDG A Sbjct: 413 AKSFVAGLQASGGTEMLAPLQAALRDATPEETGR---------LRQVVFLTDG------A 457 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESF 380 N R +++ V + + P G L+R + G F ++ ++ E Sbjct: 458 IGNEAQIFSAIATERGRS-RLFMVGIGSAPNGY-LMRHAAEVGQGSFTQIDTPDQVTERM 515 Query: 381 DKITDKIQEQSVR 393 + K++ +V Sbjct: 516 RALLVKLESPAVT 528 >gi|87199929|ref|YP_497186.1| hypothetical protein Saro_1912 [Novosphingobium aromaticivorans DSM 12444] gi|87135610|gb|ABD26352.1| conserved hypothetical protein [Novosphingobium aromaticivorans DSM 12444] Length = 435 Score = 40.3 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 29/273 (10%), Positives = 57/273 (20%), Gaps = 8/273 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A +A+D A + +MQ A D A LS S + T Q Sbjct: 29 LMAFGAPALIATAGFAVDTAQWYLWKREMQYAADQAALSAAYSKSKGISTTAYETHAVQE 88 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQ-------YIAESKAQYEI 113 + + + G + + + AQ Sbjct: 89 YNANLQLVTFSDTPTVSLAAYNGGTNNSVVVRASATRELAFSGIVLGKPTTVSVSAQATY 148 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 + +I + T + G I+ + ++ N Sbjct: 149 TAGATYTSCIIATNATADGAITIGGSSILKSGCGIAALSNSTNAIKVDGSPTIDVNYVLA 208 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSK-YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 T+ + +A P N + + Sbjct: 209 AGGIDDWFNTNTDDVVKEYVTSLADPFASLTPPTNNTAATPYRCVRSGGVTQATLNPGTY 268 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 N GI + + + Sbjct: 269 TDITTSCNTVMNSGIYVINGGSFTIRAQDAVTA 301 >gi|75750453|ref|YP_319892.1| hypothetical protein ATV_gp61 [Acidianus two-tailed virus] gi|123849288|sp|Q3V4Q4|Y892_ATV RecName: Full=Putative VWFA domain-containing protein ORF892 gi|74474836|emb|CAI59910.1| hypothetical protein [Acidianus two-tailed virus] Length = 892 Score = 40.3 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 17/105 (16%), Positives = 41/105 (39%), Gaps = 7/105 (6%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELY-NEKESSHNTIGSTRLKKFVIFITDGE 315 N+ + + L + TN A+ +A + + + + S R + +I +TDGE Sbjct: 772 YNIKNIANVLGSMKF-GGTNIGSAVLYALKNIDKPDSDYDRKLRESLRKTRTLILLTDGE 830 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + +N+L+ + +++ + G L++ Sbjct: 831 DEIPDDIAREINSLK-----KKNKVELLCYGIDLGERGLKTLKEI 870 >gi|149005573|ref|ZP_01829312.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP18-BS74] gi|168482703|ref|ZP_02707655.1| cell wall surface anchor family protein [Streptococcus pneumoniae CDC1873-00] gi|307126680|ref|YP_003878711.1| cell wall surface anchor family protein [Streptococcus pneumoniae 670-6B] gi|147762513|gb|EDK69473.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP18-BS74] gi|154432930|gb|ABS82113.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432938|gb|ABS82120.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432946|gb|ABS82127.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432954|gb|ABS82134.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432970|gb|ABS82148.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|172043667|gb|EDT51713.1| cell wall surface anchor family protein [Streptococcus pneumoniae CDC1873-00] gi|306483742|gb|ADM90611.1| cell wall surface anchor family protein [Streptococcus pneumoniae 670-6B] gi|332076908|gb|EGI87370.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA17545] gi|332203636|gb|EGJ17703.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA47368] Length = 883 Score = 40.3 bits (92), Expect = 0.50, Method: Composition-based stats. Identities = 16/112 (14%), Positives = 29/112 (25%), Gaps = 23/112 (20%) Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 +LY S G+ + N M G +++V Sbjct: 480 TDLYLYWRDSILAY--PFNSSTDWITNHGDPTTWYYNGN----------MAQDGYDVFTV 527 Query: 346 AVSAPPE-------GQDLLRKCTDSSGQFFAVNDSR----ELLESFDKITDK 386 V + ++ + S + V D EL F I ++ Sbjct: 528 GVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNE 579 >gi|119593589|gb|EAW73183.1| hCG25234 [Homo sapiens] Length = 195 Score = 40.3 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 12/100 (12%), Positives = 34/100 (34%), Gaps = 21/100 (21%) Query: 258 NLNEVKSRLNKLN--PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 + N + L T+ + ++ + + +I +TDGE Sbjct: 100 DDNTYQKITANLPQEADGGTSICRGLKAGFQAI--------PQSNQSTFGSEIILLTDGE 151 Query: 316 NSGASAYQNTLNTLQIC-EYMRNAGMKIYSVAVSAPPEGQ 354 + + +C ++ +G I+++A+ + + Sbjct: 152 D----------YQISLCFGEVKQSGTVIHTIALGPSADEE 181 >gi|310822272|ref|YP_003954630.1| cglb protein [Stigmatella aurantiaca DW4/3-1] gi|309395344|gb|ADO72803.1| CglB protein [Stigmatella aurantiaca DW4/3-1] Length = 430 Score = 40.3 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 40/302 (13%), Positives = 80/302 (26%), Gaps = 68/302 (22%) Query: 149 SICMVLDVSRSMEDLYLQKHN---DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 ++ ++LD S SM + NN S + + ++++ Sbjct: 54 NLMVLLDTSGSMTLPVNTRDPNCYRANNTPSPDDDYCGQTPSTACDTSKCPTRWSELQGA 113 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 +R + A + + +L T+ + L N+++S Sbjct: 114 MSRFLSESGGVARMGLTTYPGPAVGSNSLRCEASTVVNKNIPQSDADEALLGAANDIQSV 173 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + P TN L + + +T FV+ +TDG + QN Sbjct: 174 ILGI-PNAGTNAPSG--GTPTSLSLQFVGQQPDVQATDRDNFVLLLTDGLPNCNPDNQNA 230 Query: 326 LNTLQIC--------------------------------EYMRNAGMKIYSVAVSAP--- 350 + C E ++ G++ V A Sbjct: 231 GTNVDACQCTLANIGNNTGCQGDYVRRGCLDKDASVVAVEDLKRKGIRTIVVGFGAETAT 290 Query: 351 PEGQDLLRKCTDSSG---------------------------QFFAVNDSRELLESFDKI 383 G L + +F+ + EL E+ +I Sbjct: 291 GNGPATLNAMATAGDFARSCRDDPNACGAGDTCDAQTKLCGRRFYQAANQEELAEALREI 350 Query: 384 TD 385 D Sbjct: 351 ID 352 >gi|110833087|ref|YP_691946.1| hypothetical protein ABO_0226 [Alcanivorax borkumensis SK2] gi|110646198|emb|CAL15674.1| hypothetical protein ABO_0226 [Alcanivorax borkumensis SK2] Length = 556 Score = 40.3 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 9/103 (8%), Positives = 31/103 (30%), Gaps = 10/103 (9%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 + ++ FL + +D +++ + ++Q AA + + + + + + + Sbjct: 19 VSLTFIFLMVALVVDGSNLYNEKRRLQ----AAANAVASELAASGQTCFGSELASDATDL 74 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAE 106 + R+ G + +Y Sbjct: 75 SNN------AETILARDYPGRDFVVTSAQVVTVDGVDGRYQVS 111 >gi|66524917|ref|XP_393112.2| PREDICTED: 26S proteasome non-ATPase regulatory subunit 4 [Apis mellifera] Length = 399 Score = 40.3 bits (92), Expect = 0.51, Method: Composition-based stats. Identities = 13/164 (7%), Positives = 52/164 (31%), Gaps = 21/164 (12%) Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 S Y ++ ++ + +S ++ E + Sbjct: 10 VDNSDYMRNGDFLPTRLQAQQDAVNLVCHSKTRSNPENN--------VGLITLANVEVLA 61 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 L++++ + S+L+++ P N + + A+ L + + +H + + Sbjct: 62 TLTSDVGRILSKLHQVQPNGNLSLITGIRIAHLALKHRQGKNHKMRIVAFIGSPIEI--- 118 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 + +++ + ++ + + ++ ++L Sbjct: 119 ----------DEKELVKLAKRLKKEKVNVDVISFGEESINNEVL 152 >gi|330818825|ref|YP_004351042.1| von Willebrand factor, type A [Burkholderia gladioli BSR3] gi|327374367|gb|AEA65719.1| von Willebrand factor, type A [Burkholderia gladioli BSR3] Length = 660 Score = 40.3 bits (92), Expect = 0.52, Method: Composition-based stats. Identities = 21/107 (19%), Positives = 37/107 (34%), Gaps = 19/107 (17%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T T AM A L E + ++ +TDG A + + T + Sbjct: 560 TATGMAMTVALSNLILRDEDR----------RMMVLLTDG------AAGDPVMTAASYQA 603 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKC--TDS-SGQFFAVNDSRELLE 378 + AG+++ ++ + + L R F VN EL + Sbjct: 604 AKEAGVEVVTIFIGRDIQAIALTRSILNATGFGQHFSNVNSPDELAK 650 >gi|73981421|ref|XP_850355.1| PREDICTED: similar to integrin, alpha 10 precursor [Canis familiaris] Length = 1165 Score = 40.3 bits (92), Expect = 0.52, Method: Composition-based stats. Identities = 37/205 (18%), Positives = 62/205 (30%), Gaps = 25/205 (12%) Query: 195 TKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTP 254 + I E L + + + + V++G + Y V Sbjct: 163 PTYMDVVIVLDGSNSIYPWSEVQTFLRRLVGRLFIDPEQ--VQVGLVQYGESPVHEWSLG 220 Query: 255 LSNNLNEVKSRLNKLNPYEN--TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 EV L+ E T T A+ A E ++ + ++ +T Sbjct: 221 DFRTKEEVVRAARNLSRREGRETKTAQAILVACTEGFSLSRGGRPEA-----ARLLVVVT 275 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--------SAPPEGQDLLRKCTDSS 364 DGE+ L+ CE R + Y +AV P +R + Sbjct: 276 DGESHDGEELP---AALKACEAGR---VTRYGIAVLGHYLRRQRDPTSFLREIRMIANDP 329 Query: 365 G--QFFAVNDSRELLESFDKITDKI 387 FF V D L + D + D+I Sbjct: 330 DERFFFNVTDEAALTDIVDALGDRI 354 >gi|315656396|ref|ZP_07909285.1| conserved hypothetical protein [Mobiluncus curtisii subsp. holmesii ATCC 35242] gi|315492955|gb|EFU82557.1| conserved hypothetical protein [Mobiluncus curtisii subsp. holmesii ATCC 35242] Length = 171 Score = 40.3 bits (92), Expect = 0.52, Method: Composition-based stats. Identities = 15/135 (11%), Positives = 39/135 (28%), Gaps = 4/135 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A ++ + ++ A L+ + + ++Q D A++ ++ D Sbjct: 35 LAACVLLCVIILMSMA--LSGVYLEQRRLQRLADQTASMAAANMADTAYYQNGIV--DGV 90 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + + I++ + +A +IP + Sbjct: 91 PLEIEPYHASERAAEYLSGASISANSSLDGIDLVDVDVASTRVQVTLRATGKIPLVLPLV 150 Query: 121 KGLIPSALTNLSLRS 135 L LT S Sbjct: 151 SSLTQVELTATGAAS 165 >gi|27376088|ref|NP_767617.1| hypothetical protein bll0977 [Bradyrhizobium japonicum USDA 110] gi|27349227|dbj|BAC46242.1| bll0977 [Bradyrhizobium japonicum USDA 110] Length = 754 Score = 40.3 bits (92), Expect = 0.52, Method: Composition-based stats. Identities = 30/160 (18%), Positives = 56/160 (35%), Gaps = 20/160 (12%) Query: 237 RIGTIAYNIGIVGNQCTPLSNNLNEV---KSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 R I ++ + + + V S ++ L T PAM A Sbjct: 391 RFNVIRFDDTMDVLFPASVPADAAHVGEATSFVSALQARGGTEMVPAMRAALT------- 443 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 + IG T + + V+F+TDG A N + MR +++ V + + P Sbjct: 444 ---DKIGDTGMVRQVVFLTDG------AIGNEQQLFETITAMRGRS-RVFMVGIGSAPNT 493 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 + R G F + ++ E + K++ +V Sbjct: 494 YLMTRASELGRGAFTHIGSVEQVEERMRGLFAKLENPAVT 533 >gi|118463538|ref|YP_880101.1| 17 kDa surface antigen family protein [Mycobacterium avium 104] gi|118164825|gb|ABK65722.1| 17 kDa surface antigen family protein [Mycobacterium avium 104] Length = 554 Score = 40.3 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 20/250 (8%), Positives = 66/250 (26%), Gaps = 2/250 (0%) Query: 12 FITYAIDLAHIMYIRNQMQSALDAAV-LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKK 70 + A+D A + + + SA+ +AV + +++ S + + + ++ Sbjct: 115 AVGSAVDSAVYSAVGSAVDSAVGSAVGSAVGSAVGSAVDSAVYSAVGSAVDSAVRSAVRS 174 Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 + Y + + + + + + + + + + Sbjct: 175 AVYSAVYSAVGSAVGSAVGSAVDSAVGSAVDSAVRSAVDSAVGSAVGSAVDSAVD-SAVD 233 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 ++ S S + V S D + + ++ + + + Sbjct: 234 SAVGSAVYSAVYSAVGSAVDSAVASAVGSAVDSAVDSAVGSAVYSAVRSAVDSAVDSAVD 293 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 S + A A + + + + + S + R Sbjct: 294 SAVRSAVDSAVYSAVGSAVYSAVYSAVYSALGSAPIKPFWHQLFGGRQWCWWPAFIAYFR 353 Query: 251 QCTPLSNNLN 260 L + + Sbjct: 354 DVVELQLDSD 363 >gi|58429477|gb|AAW78142.1| thrombospondin-related adhesive protein [Plasmodium falciparum] gi|58429505|gb|AAW78156.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 581 Score = 40.3 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVKHAVPLAMKLIQQLNLNENAIHLYVNVFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIRSLLSTNLPYGKTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLSDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|58429527|gb|AAW78167.1| thrombospondin-related adhesive protein [Plasmodium falciparum] Length = 575 Score = 40.3 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVKHAVPLAMKLIQQLNLNENAIHLYVNVFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIRSLLSTNLPYGKTNLTDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLSDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|117921567|ref|YP_870759.1| type IV pilin biogenesis protein [Shewanella sp. ANA-3] gi|117613899|gb|ABK49353.1| type IV pilin biogenesis protein, putative [Shewanella sp. ANA-3] Length = 1168 Score = 40.3 bits (92), Expect = 0.53, Method: Composition-based stats. Identities = 20/165 (12%), Positives = 46/165 (27%), Gaps = 19/165 (11%) Query: 225 QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHA 284 + + + + + +++ V N + T+T Sbjct: 348 NRPPYDTSVEKGGAYSSPFKVCTDIAYVIYVTDGSPTVDKSANN-DVISLTSTGS----- 401 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 + + S + + N+ ++ NT Q+ ++ Y+ Sbjct: 402 -----KDGDYSSFSKNLDTPSYLPALASYMFNNDLINKLDSSNTEQM------QNVRTYT 450 Query: 345 VAVSAPPEGQDLL--RKCTDSSGQFFAVNDSRELLESFDKITDKI 387 + S E L G +FA +S EL + + I Sbjct: 451 IGFSKGAEDAAPLLAETAKRGGGLYFAAQNSLELQNALNDALSNI 495 >gi|198426173|ref|XP_002130506.1| PREDICTED: similar to Vwa1 protein [Ciona intestinalis] Length = 384 Score = 40.3 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 12/96 (12%), Positives = 32/96 (33%), Gaps = 12/96 (12%) Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 ++S+ ++ + + R +++ + + ++ +TD NS Sbjct: 263 AIRSQAFNNTANGASSIAEVLRYVKRSMFSTRNG-----NRKNAENIIVLVTDQNNSAG- 316 Query: 321 AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 +R +G+K + V +DL Sbjct: 317 ------GMTSAAYELRRSGIKTFVVNFGETESNKDL 346 >gi|157825745|ref|YP_001493465.1| hypothetical protein A1C_03355 [Rickettsia akari str. Hartford] gi|157799703|gb|ABV74957.1| hypothetical protein A1C_03355 [Rickettsia akari str. Hartford] Length = 111 Score = 40.3 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 20/109 (18%), Positives = 37/109 (33%), Gaps = 12/109 (11%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N+L ++K + LN Y T Y + L + +I TDG+N Sbjct: 14 NSLADIKKYVETLNAYGYTRLYGTIKDTLELL----------KEKINIHSTIIEFTDGKN 63 Query: 317 SGASAYQNTLNTLQ-ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 +GA + + +RN +Y+V ++ + Sbjct: 64 TGADCNTTQQDVIDSAIAVIRNPQFNMYAVGFGKN-YNKEFFEEIAMRG 111 >gi|309777426|ref|ZP_07672384.1| hypothetical protein HMPREF0983_03048 [Erysipelotrichaceae bacterium 3_1_53] gi|308914822|gb|EFP60604.1| hypothetical protein HMPREF0983_03048 [Erysipelotrichaceae bacterium 3_1_53] Length = 365 Score = 40.3 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 5/31 (16%), Positives = 12/31 (38%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALD 34 I++ + +D+ + + SA D Sbjct: 14 ILLVPMLIVAGVFVDVGRYQLSKAAVTSAAD 44 >gi|296135127|ref|YP_003642369.1| hypothetical protein Tint_0639 [Thiomonas intermedia K12] gi|295795249|gb|ADG30039.1| hypothetical protein Tint_0639 [Thiomonas intermedia K12] Length = 1169 Score = 40.3 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 26/215 (12%), Positives = 58/215 (26%), Gaps = 12/215 (5%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + +T+ A P K + S+ + ++ G Sbjct: 341 NPSTSAVNTALTPFTTALKPETNNSSSSEIKALAYQSPTAGLVQGAGNVLSNLIASCAGQ 400 Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 +++ L + P + + + + N + Sbjct: 401 YVILVTDGLPTMDLNGKNWPPLG---SAAGQGYGVNAAFYGIAGNSNYGINDDSNNLPSG 457 Query: 311 ITDGENSGASA-YQNTLNTLQICEYMRNAGMKIYSVAVSAPPE--------GQDLLRKCT 361 T G A+ Q ++T+ + + G+K Y V + A + Sbjct: 458 QTQGALDAANTNDQALIDTITAIQALNKKGIKTYVVGLGAGVDANANPAAYAALNAMAIA 517 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAP 396 +GQ + N+ + I +I + AP Sbjct: 518 GGTGQEYPANNVTAFNSALGSIAAQIFSSTAISAP 552 >gi|10048261|gb|AAG12328.1|AF249739_1 sporozoite surface protein 2 [Plasmodium falciparum] Length = 559 Score = 40.3 bits (92), Expect = 0.54, Method: Composition-based stats. Identities = 25/170 (14%), Positives = 57/170 (33%), Gaps = 19/170 (11%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + R+ + + + + IQ+ + + + + + N + + S N + Sbjct: 55 CSGSIRRHNWVNHAVPLAMKLIQQLNLNESAIHLYVNIFSNNAKEIIRLHSDASKNKEKA 114 Query: 263 KSRLNKLN----PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + L PY TN A+ + L + + V+ +TDG Sbjct: 115 LIIIRSLLSTNLPYGRTNLSDALLQVRKHLNDRI-------NRENANQLVVILTDGIPD- 166 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQ 366 + ++L+ + + G+KI + L C S G+ Sbjct: 167 -----SIQDSLKESRKLNDRGVKIAVFGIGQGINVAFNRFLVGCHPSDGK 211 >gi|188586927|ref|YP_001918472.1| conserved hypothetical protein [Natranaerobius thermophilus JW/NM-WN-LF] gi|179351614|gb|ACB85884.1| conserved hypothetical protein [Natranaerobius thermophilus JW/NM-WN-LF] Length = 301 Score = 40.3 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 24/148 (16%), Positives = 56/148 (37%), Gaps = 8/148 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A++++V F +D+ + R+++ SA +A+ L+G ++ + D Sbjct: 17 IVALMLTVLMSFTALVVDVGLMYAERSRLVSAAEASALAGASNFPYRDDGEYREEDYDYA 76 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQ-------YIAESKAQYEI 113 +K +++ + IR D QK +I +T + + Q Sbjct: 77 RQAAEKIAEENGLEDYQIRLLPEDAQQKEKIEVTASEEVEFSFARVMGFQEGDVTGQAAA 136 Query: 114 PTENLF-LKGLIPSALTNLSLRSTGIIE 140 + L G++P + G+ + Sbjct: 137 KSVPLAGFTGVVPLGIPEQDFEGYGMYD 164 >gi|308066889|ref|YP_003868494.1| hypothetical protein PPE_00054 [Paenibacillus polymyxa E681] gi|305856168|gb|ADM67956.1| Conserved hypothetical protein [Paenibacillus polymyxa E681] Length = 249 Score = 40.3 bits (92), Expect = 0.55, Method: Composition-based stats. Identities = 17/97 (17%), Positives = 42/97 (43%), Gaps = 10/97 (10%) Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS---VAVSA-PPEGQDLLRKCT 361 K ++ ITDG ++ + + + +R G+ + + G +++ Sbjct: 2 KQILIITDGCSNVGVSP-----VMAAAQALRE-GITVNVAGVIDYGTIGELGSAEIQEIA 55 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + G F + +++L ++ +T K Q+++ A NR Sbjct: 56 KAGGGFSQIVGTKQLAQTMQMMTRKTVVQTIQQAVNR 92 >gi|301609308|ref|XP_002934204.1| PREDICTED: epithelial chloride channel protein [Xenopus (Silurana) tropicalis] Length = 906 Score = 40.3 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 51/134 (38%), Gaps = 26/134 (19%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 TN + + + + + ++ +TDGE+SG S+ Sbjct: 381 ATGGTNICAGVQQGLQV--------NRNLDQSTHGTEIVLLTDGEDSGISSCFPD----- 427 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN---DSRELLESFDKITD- 385 + +G I+++A+ + L K D + G + D+ L++SF I Sbjct: 428 ----ITKSGAIIHTIALGNNAD--PGLEKLADLTGGLKLYASDKVDANGLIDSFSGIVSN 481 Query: 386 --KIQEQSVRIAPN 397 + +QS++I + Sbjct: 482 TGNVTQQSLQIESS 495 >gi|239906055|ref|YP_002952794.1| hypothetical protein DMR_14170 [Desulfovibrio magneticus RS-1] gi|239795919|dbj|BAH74908.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 137 Score = 40.3 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 14/142 (9%), Positives = 41/142 (28%), Gaps = 22/142 (15%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+I+ + + ID + ++R+ + A ++ + D + Sbjct: 18 MALILPLLLTVVFAIIDYSRFFFLRSTV-----TAAVADATRLAVLPGTTDAMIAAAISQ 72 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + Q +++T + + Q +P L L Sbjct: 73 AL--------------LDPINQADGQTPNVSVTPSQRSAGQ---PVTVTASLPFSPLILP 115 Query: 122 GLIPSALTNLSLRSTGIIERSS 143 + +L ++ + + Sbjct: 116 QFLGMSLFPQNINAAATMVVEP 137 >gi|38637070|dbj|BAD03327.1| histone deacetylase HD2-like protein [Oryza sativa Japonica Group] gi|40253820|dbj|BAD05756.1| histone deacetylase HD2-like protein [Oryza sativa Japonica Group] Length = 169 Score = 40.3 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 3/40 (7%), Positives = 13/40 (32%) Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 +++ + + + G F + + + + F Sbjct: 68 PVHTFGFGKDHDAVAMHTIAEVTGGTFSFIENEAAIQDGF 107 >gi|313238853|emb|CBY13852.1| unnamed protein product [Oikopleura dioica] Length = 249 Score = 40.3 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 5/43 (11%), Positives = 16/43 (37%), Gaps = 2/43 (4%) Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 + +R+ G++I+++ V + L++ Sbjct: 24 SYDRAIVPAAARSLRDKGVRIFAIGVG--NAVESELKEIASEP 64 >gi|169832403|ref|YP_001693966.1| cell wall surface anchor family protein [Streptococcus pneumoniae Hungary19A-6] gi|154432882|gb|ABS82071.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432906|gb|ABS82092.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|168994905|gb|ACA35517.1| cell wall surface anchor family protein [Streptococcus pneumoniae Hungary19A-6] Length = 883 Score = 40.3 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 16/112 (14%), Positives = 29/112 (25%), Gaps = 23/112 (20%) Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 +LY S G+ + N M G +++V Sbjct: 480 TDLYLYWRDSILAY--PFNSSTDWITNHGDPTTWYYNGN----------MAQDGYDVFTV 527 Query: 346 AVSAPPE-------GQDLLRKCTDSSGQFFAVNDSR----ELLESFDKITDK 386 V + ++ + S + V D EL F I ++ Sbjct: 528 GVGVNGDPGTDEATATRFMQSISSSPDNYTNVADPSQILQELNRYFYTIVNE 579 >gi|117919536|ref|YP_868728.1| undecaprenol kinase., rRNA (guanine-N(1)-)-methyltransferase [Shewanella sp. ANA-3] gi|117611868|gb|ABK47322.1| Undecaprenol kinase., rRNA (guanine-N(1)-)-methyltransferase [Shewanella sp. ANA-3] Length = 1223 Score = 40.3 bits (92), Expect = 0.56, Method: Composition-based stats. Identities = 9/60 (15%), Positives = 23/60 (38%), Gaps = 2/60 (3%) Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++ +++ S + L + + G++F D+ +L S + I E+ Sbjct: 471 NSAGKQTVRTFTIGFSEGAASAEHLLKQTAENGGGKYFDATDASKLRSSLQTALNNILEK 530 >gi|330465087|ref|YP_004402830.1| hypothetical protein VAB18032_05520 [Verrucosispora maris AB-18-032] gi|328808058|gb|AEB42230.1| hypothetical protein VAB18032_05520 [Verrucosispora maris AB-18-032] Length = 141 Score = 40.3 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 13/122 (10%), Positives = 30/122 (24%), Gaps = 6/122 (4%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSG--CASIVSDRTIKDPTTKKDQT 60 AI + I A D A + + ++ A +G + + + Q Sbjct: 16 AIALVGVLAIIGMAFDGAGQLRTLQRAENLAAEAARAGGQAIDLATAIEGGPKQINRRQA 75 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD----KNNPLQYIAESKAQYEIPTE 116 + G + D + ++ ++ + E Sbjct: 76 RRAVADYLAAAGASGHTVSFPVVDGETQIRVRVSVTYRRAMLGLFGFSNTVTVSGEATAR 135 Query: 117 NL 118 L Sbjct: 136 PL 137 >gi|154507989|ref|ZP_02043631.1| hypothetical protein ACTODO_00475 [Actinomyces odontolyticus ATCC 17982] gi|153797623|gb|EDN80043.1| hypothetical protein ACTODO_00475 [Actinomyces odontolyticus ATCC 17982] Length = 338 Score = 40.3 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 18/169 (10%), Positives = 47/169 (27%), Gaps = 32/169 (18%) Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPY---ENTNTYPAMHHAYRELYNEKESS------ 295 PL+++ + L ++ T A +EL++ Sbjct: 135 WDAYSMTMFPLTDDYDMATDVLQDMSDTIDTGLTRIG-GRLSATQELFDYLAPVMDENQE 193 Query: 296 --------------HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 + + ++ TD E G N + ++ ++ G+ Sbjct: 194 VSSIVGDGLASCVMGFDHNDKQRSRTILLATDNEVYGDGV----YNLSEAIQFAKSQGVT 249 Query: 342 IYSVAVSAP---PEGQDLLRK-CTDSSGQFFAVNDSRELLESFDKITDK 386 + ++ + LR + G F+ + + +I + Sbjct: 250 VTALYPGSDITLSSEALQLRDEVRKTGGDFYDASSPSSVDRVVKQIEAE 298 >gi|229542265|ref|ZP_04431325.1| conserved hypothetical protein [Bacillus coagulans 36D1] gi|229326685|gb|EEN92360.1| conserved hypothetical protein [Bacillus coagulans 36D1] Length = 245 Score = 40.3 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 17/103 (16%), Positives = 39/103 (37%), Gaps = 10/103 (9%) Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS----APPEGQD 355 + K ++ ITDG ++ + AG+ I + V +G Sbjct: 1 MNAGTLKQILLITDGCSNQGGDPAAAAALARE------AGITINVIGVMERDIIDEKGAA 54 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 ++ S G + +++L ++ +T K Q+++ N+ Sbjct: 55 EIQNIAMSGGGVSQIVYTKQLSQTVQMVTRKAMTQTIQGIVNK 97 >gi|328858486|gb|EGG07598.1| hypothetical protein MELLADRAFT_77518 [Melampsora larici-populina 98AG31] Length = 503 Score = 40.3 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 27/176 (15%), Positives = 52/176 (29%), Gaps = 26/176 (14%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC-----TPLSN 257 I ++ + + I + + +RIG IAY ++ ++ Sbjct: 39 TGSMGSYITAATQNIELICDEIINSERLASPECLRIGLIAYRDHPPQDRSYITLKFDFTS 98 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAY-RELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N V+ L L + A+ A L + + + ITD Sbjct: 99 NPKIVQEHLKSLWASGGGDGPEAVTAAMHEALTLDWRPQA--------SRMAVLITDAPP 150 Query: 317 SG--------ASAYQNTLNTLQICEYMRNAGMKIYSV----AVSAPPEGQDLLRKC 360 G + + + L++ M G+ ++ V A S D R Sbjct: 151 HGIGEYGDGFSKGDPSGHDPLKLARTMAQNGISLFVVACEPAFSGYMYSNDFFRAI 206 >gi|87311196|ref|ZP_01093319.1| hypothetical protein DSM3645_16245 [Blastopirellula marina DSM 3645] gi|87286104|gb|EAQ78015.1| hypothetical protein DSM3645_16245 [Blastopirellula marina DSM 3645] Length = 179 Score = 40.3 bits (92), Expect = 0.57, Method: Composition-based stats. Identities = 11/121 (9%), Positives = 29/121 (23%), Gaps = 19/121 (15%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A + FLFI +++ + IR+ D A + + K++ + Sbjct: 61 FAFAAPILFLFILASVEFGRLTMIRH----TADNAAYEAARYAMVPG--ATSSEAKEKAT 114 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + + + + + A + Sbjct: 115 QLLATIGA-------------RKVEVDVDPAVLQPETKTITVTVRVPAAGNSWVVPKYFS 161 Query: 122 G 122 Sbjct: 162 S 162 >gi|284031227|ref|YP_003381158.1| hypothetical protein Kfla_3298 [Kribbella flavida DSM 17836] gi|283810520|gb|ADB32359.1| hypothetical protein Kfla_3298 [Kribbella flavida DSM 17836] Length = 279 Score = 40.3 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 19/184 (10%), Positives = 45/184 (24%), Gaps = 21/184 (11%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQ-MQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 I+ + F ID + +++ + A +AA + V + D Sbjct: 53 VVILALMIFTLAGLVIDGGRQLGAKSRAVGYAQEAA-----RAGVGTIDFNSAQARIDVA 107 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 K G++ + + ++ L+ + Sbjct: 108 --------KAGEAVGNFCAKVTENDPAVTSCATSELDPEHLKVQVQIA-------NKTSF 152 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 G+I + E+ S + V ++ D + + Sbjct: 153 LGMIGIQSLTANGEGEAHAEQGIVKADESPQIPPLVVQTTPDGPGVTVQPTASPPPIDFP 212 Query: 181 LPPP 184 P Sbjct: 213 CPVW 216 >gi|242061830|ref|XP_002452204.1| hypothetical protein SORBIDRAFT_04g021720 [Sorghum bicolor] gi|241932035|gb|EES05180.1| hypothetical protein SORBIDRAFT_04g021720 [Sorghum bicolor] Length = 369 Score = 40.3 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 13/77 (16%), Positives = 26/77 (33%), Gaps = 13/77 (16%) Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSG 365 V+ ++DG+ + ++ +Y+ A +L G Sbjct: 20 VMLMSDGQQNHGGN----------AADVKIGNAPVYTFGFGAD-YDPTVLNAVARNSMGG 68 Query: 366 QFFAVNDSRELLESFDK 382 F VND +L +F + Sbjct: 69 TFSVVNDVDKLTMAFSQ 85 >gi|118361105|ref|XP_001013783.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89295550|gb|EAR93538.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 368 Score = 40.3 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 14/133 (10%), Positives = 41/133 (30%), Gaps = 18/133 (13%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG---NQCTPLSNNLNEVKSRLNKLNPYENT 275 + + + K I ++ + I + + N+ + +NKL T Sbjct: 216 FVKSQLTKTIADQLKPFQKFNIIIFGNSASQWKTDYVDATPENVQAAIAYINKLTTNGAT 275 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N + A+ + + K + ++DG + + + + Sbjct: 276 NISSGLDLAF--------------NTKQALKAIYLLSDGVPNSGVQTVDGIKKYLADKNA 321 Query: 336 -RNAGMKIYSVAV 347 R + + +++ Sbjct: 322 SRTDKVVVNTISF 334 >gi|229596006|ref|XP_001013680.3| hypothetical protein TTHERM_00833730 [Tetrahymena thermophila] gi|225565656|gb|EAR93435.3| hypothetical protein TTHERM_00833730 [Tetrahymena thermophila SB210] Length = 557 Score = 40.3 bits (92), Expect = 0.58, Method: Composition-based stats. Identities = 15/128 (11%), Positives = 44/128 (34%), Gaps = 18/128 (14%) Query: 255 LSNNLNEVKS---RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 NN+++V + + ++ +T P + + + +I Sbjct: 244 FKNNVSQVNNMPQKYKEIRTNGSTALGPGLAVSLGLASQSPQ----------SGSSIILC 293 Query: 312 TDGENSGASAYQ----NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQ 366 TDG + + ++ + ++ G+ I+++ + +L K + + G+ Sbjct: 294 TDGLANEGIGQLEDKKDYETYEKMGQLAKSLGILIHTITMRGNESDVRVLGKLSDTTGGR 353 Query: 367 FFAVNDSR 374 V + Sbjct: 354 TSRVGPAD 361 >gi|225873376|ref|YP_002754835.1| hypothetical protein ACP_1760 [Acidobacterium capsulatum ATCC 51196] gi|225792625|gb|ACO32715.1| hypothetical protein ACP_1760 [Acidobacterium capsulatum ATCC 51196] Length = 363 Score = 40.3 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 19/111 (17%), Positives = 46/111 (41%), Gaps = 16/111 (14%) Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV------ 345 +++++ + +K +I +TDGE+ + LQ + A +Y + Sbjct: 215 AQAANDKLREQTGRKALILLTDGED-----LGSATKPLQAIADAQKANTIVYVILIADRG 269 Query: 346 ---AVSAPPEGQDLLRKCTD-SSGQFFAV-NDSRELLESFDKITDKIQEQS 391 + G +R+ + + G+ V N+ +L +F +I +++ Q Sbjct: 270 FYGGYTFGYTGDAQMRRLAEATGGRMINVGNNGAKLTAAFKEIARELRTQY 320 >gi|82913384|ref|XP_728622.1| micronemal protein WARP [Plasmodium yoelii yoelii str. 17XNL] gi|23485067|gb|EAA20187.1| micronemal protein WARP [Plasmodium yoelii yoelii] Length = 303 Score = 40.3 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 23/178 (12%), Positives = 53/178 (29%), Gaps = 8/178 (4%) Query: 189 FWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV 248 + N + +K + + L + ++ A K + I A ++ Sbjct: 91 NYCDNYYDITLIVENSSFIQKDYWMKGTIPFLESMVRNARVSKDKAHMAIILFAGRQDLI 150 Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 LS + ++ ++ LN T++ +A + + T K Sbjct: 151 VPFTDELSQDKEKLIDKIRTLN-DAATDSNTLYVYALEYAFEKVIFGEGTRS--DAPKVA 207 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL-LRKCTDSSG 365 + G + NT + E + +K+ V ++ + L C Sbjct: 208 VLFYYGFD----YGSNTSLIPDVVEDYKQNNIKLIIVGIALGNKQNAFILADCKSDGD 261 >gi|323693762|ref|ZP_08107958.1| hypothetical protein HMPREF9475_02821 [Clostridium symbiosum WAL-14673] gi|323502183|gb|EGB18049.1| hypothetical protein HMPREF9475_02821 [Clostridium symbiosum WAL-14673] Length = 1560 Score = 40.3 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 46/144 (31%), Gaps = 24/144 (16%) Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV--------IFITDGENSGASAYQNT 325 T++ A R L K+ + S + G GA + NT Sbjct: 721 GTHSAGGYLGAERALDRLKKYNPEEYNSNVKYVIYLADGTAGFYVDSYGYRDGAGSGGNT 780 Query: 326 LNTLQICE---YMRNA--GMKIYSVAVSAPPEGQ-DLLRKCTDSSG----------QFFA 369 ++ IY+VA + + ++ + F++ Sbjct: 781 NARRAAITQSGELKKKHPDATIYTVAFGSDSSANMNWMKPGAYNGNITNPYNPNVTAFYS 840 Query: 370 VNDSRELLESFDKITDKIQEQSVR 393 ++ EL ++FD + +++ +V Sbjct: 841 ATNTEELEKTFDSLAEQVGSSAVT 864 >gi|260824475|ref|XP_002607193.1| hypothetical protein BRAFLDRAFT_68010 [Branchiostoma floridae] gi|229292539|gb|EEN63203.1| hypothetical protein BRAFLDRAFT_68010 [Branchiostoma floridae] Length = 110 Score = 40.3 bits (92), Expect = 0.59, Method: Composition-based stats. Identities = 11/75 (14%), Positives = 26/75 (34%), Gaps = 3/75 (4%) Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCTDSSG--QFFAVNDS 373 + ++ ++ + M+N G+ I++ + D L F + D Sbjct: 18 DNGEIRKRSVEVERVAQEMKNDGVTIFAFGIDEHHNIDFDHLENVASPDDVKHLFQIEDI 77 Query: 374 RELLESFDKITDKIQ 388 E E ++ D + Sbjct: 78 DEFTELIIELQDHLT 92 >gi|188592039|ref|YP_001796637.1| hypothetical protein RALTA_B0200 [Cupriavidus taiwanensis LMG 19424] gi|170938413|emb|CAP63400.1| conserved hypothetical protein [Cupriavidus taiwanensis LMG 19424] Length = 562 Score = 40.3 bits (92), Expect = 0.60, Method: Composition-based stats. Identities = 18/119 (15%), Positives = 34/119 (28%), Gaps = 1/119 (0%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIV-SDRTIKDPTTKKDQ 59 M A++I+ + +ID+ H+ + Q+Q+ +D A +S + +D Sbjct: 22 MAAVLIATVAIAALVSIDVGHVFMRQRQLQNMVDLAAMSAAQQLKRADSPANLNAAVLGT 81 Query: 60 TSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 S I K G D + + N Sbjct: 82 VSNIGAKNGYPSGIAMGCAEAAGGGADAMTACVGVWDPATAGPRHFSAVYNAATVSPNA 140 >gi|320169699|gb|EFW46598.1| hypothetical protein CAOG_04556 [Capsaspora owczarzaki ATCC 30864] Length = 501 Score = 40.3 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 7/46 (15%), Positives = 20/46 (43%), Gaps = 2/46 (4%) Query: 338 AGMKIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDK 382 G +Y+ + + +L+ +D+ G ++ + + + SF Sbjct: 228 EGCTVYTFGYGSDADSN-MLKAISDAGSGTYYFIENKDTVATSFGD 272 >gi|304320961|ref|YP_003854604.1| hypothetical protein PB2503_06982 [Parvularcula bermudensis HTCC2503] gi|303299863|gb|ADM09462.1| hypothetical protein PB2503_06982 [Parvularcula bermudensis HTCC2503] Length = 432 Score = 40.3 bits (92), Expect = 0.62, Method: Composition-based stats. Identities = 33/377 (8%), Positives = 89/377 (23%), Gaps = 27/377 (7%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M +++ D I +++ + + + Sbjct: 29 MMMLMLVPALGIGGLVFDGKRIHTAHLELE----------IVAESAAIAAALQLPSESSA 78 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + +++L SY G++ + + + I + A + T Sbjct: 79 KSAAVDYAEQNLPPSSY-----GEVVRASDVEIGSYDEGTGTFTAGAGTSAVRVTAWRHE 133 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + S+ +A+S D + D + + Sbjct: 134 DRSNSLPTIFAGAIGRDSVNLSASAIAVSQSSSEDPICILVLGNKFYGLDLDLRSEVDIP 193 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 S S A A + ++ S L + + + + Sbjct: 194 DCGIQVNSDEYDAVQASSLATVNASFFNVVGSVLGSTSGLTPTPTEGADAVDDPYASLAE 253 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 ++ ++ L + + L + T T ++ + + Sbjct: 254 PTAKPCGGVDEIDGGTHTLVDTYRFCDGLEIDDATVT---FSPGEYQIDGDFDLKGTASI 310 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK- 359 S I + + + L Y G+ +++ + + R Sbjct: 311 SGTDVTIFIDGSRSDINFGRYSSFHLEAPTTGTYA---GVLLWT---ARDNDESFEFRSR 364 Query: 360 --CTDSSGQFFAVNDSR 374 + S +F Sbjct: 365 FGASSSGSFYFPAAKMD 381 >gi|302540662|ref|ZP_07293004.1| von Willebrand factor, type A [Streptomyces hygroscopicus ATCC 53653] gi|302458280|gb|EFL21373.1| von Willebrand factor, type A [Streptomyces himastatinicus ATCC 53653] Length = 340 Score = 39.9 bits (91), Expect = 0.63, Method: Composition-based stats. Identities = 31/329 (9%), Positives = 78/329 (23%), Gaps = 23/329 (6%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 F + + A+ + A + ++ + Sbjct: 13 QAAFAHNAEADDDSRLDLFYPKDGTAELDYPYTVLNDRELTTVRARAATRFLTLLTDARG 72 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 + + + + + S ++ +T Sbjct: 73 RRALKKDGFRTADGKVSEPVAQAAGGRTPQPYAVTPSPGPSAREVEAALGMWTITVQSAR 132 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV---- 236 L S + P ++DV S ++ + Sbjct: 133 LNTVVDASASMSDP------VPGRPGESRMDVTKASLRQALSRFNAGDEIGLWEFSTELD 186 Query: 237 --RIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP--YENTNTYPAMHHAYRELYNEK 292 R +G + + E+ + + L P +T Y AY++ Sbjct: 187 GDRDYRELVATRRLGARTPDGTGQRAELAAAFDALKPLPEGSTGLYDTTLAAYKKAQETF 246 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR-NAGMKIYSVAVSAPP 351 V+ +TDG N + ++ + + + + ++AV Sbjct: 247 VRGKF--------NAVVMLTDGANQDPGSISRGALVKELKRLVDPDRPVPLIAIAVGPDA 298 Query: 352 EGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + + G VND ++ + Sbjct: 299 DQAACREIAQATGGSAQQVNDPAQINTAM 327 >gi|110799635|ref|YP_695225.1| von Willebrand factor type A/Cna B-type domain-containing protein [Clostridium perfringens ATCC 13124] gi|110674282|gb|ABG83269.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens ATCC 13124] Length = 928 Score = 39.9 bits (91), Expect = 0.64, Method: Composition-based stats. Identities = 27/319 (8%), Positives = 81/319 (25%), Gaps = 44/319 (13%) Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 Y+ + + + +L +++ S+ + + + Sbjct: 109 GYLTKKAYTTDEDNVFDINLKIQGKKNQSLKKDVVFLLDNSNSMTTNNRAIKIKEQIKNV 168 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV-LIESAGNL 220 L +N + + + ++ +KY + + + + + Sbjct: 169 MDKLNTNNTRYALVTYASTILDGRYYHLIDRSIGDNKYTVYKGYTSNQCYLNFTSNIQEI 228 Query: 221 VNSIQKAIQEKKNLSVRIGTIA-----------YNIGIVGNQCTPLSNNLNEVKSRLNKL 269 N I + ++N GT N L++ L L + Sbjct: 229 YNKIPTTVPNQRNNGYVGGTFTQEGLLKAIELLKNSDADEKIIIHLTDGLPTFSFLLKEF 288 Query: 270 NPYENTNTYPAMHHAYREL------YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 N + Y + + ++ ++ V + + + Sbjct: 289 --GGNEKAIFDYNTQYNGIGVRGFGTSYFFNTKTQKPYIYSREEVYSALNRSINKNESIW 346 Query: 324 NTLNTLQI-CEYMRNAG--MKIYSVAVS-----APPEG----------------QDLLRK 359 N + E ++ + IY++ + + + L Sbjct: 347 NNGFPTTLEAENIKKENPDINIYTIGIELKKEVYKWDDYRKYYNAEGVVELPEIKKFLES 406 Query: 360 CTDSSGQFFAVNDSRELLE 378 + S + F + ++ E Sbjct: 407 ISSSPAEAFVNENVDDIDE 425 >gi|261252915|ref|ZP_05945488.1| putative outer membrane adhesin like proteiin [Vibrio orientalis CIP 102891] gi|260936306|gb|EEX92295.1| putative outer membrane adhesin like proteiin [Vibrio orientalis CIP 102891] Length = 3332 Score = 39.9 bits (91), Expect = 0.65, Method: Composition-based stats. Identities = 36/291 (12%), Positives = 75/291 (25%), Gaps = 15/291 (5%) Query: 67 QIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPS 126 + EN A++ D + L + A E + Sbjct: 2629 NSNGTYTVTLDVAENELTGRVSAELVAPHDYDGSLDFDLSVSATSEEKVGADTETTTVSE 2688 Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 +T + +L + + + N Sbjct: 2689 PVTLSGSDLVVGNNVDNTLEGHGGNDILIGDQGGYKTNVTPGVNYNIALVVDASGSMGD- 2747 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 + NT + A ++D++ E+ NLV S+ + Sbjct: 2748 ---YVYNTDGTVMRNPDGSAMTRMDMMQEALTNLVESLVTHDGSINIKLIGFDDNIDVTF 2804 Query: 247 IVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 + T S+ + E+ S++ N L T+ A + S + Sbjct: 2805 EALD-ITNSSDVVAELLSKIENNLPVGGGTDYGVGFEEANNWYASSSI------SSNGYE 2857 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 F+TDGE + + ++ K+ +V + + L Sbjct: 2858 NMTFFLTDGEPNSGTLNNGLTEYNELVST---HNAKVMAVGMGNDIDDSVL 2905 >gi|167041680|gb|ABZ06425.1| putative von Willebrand factor type A domain protein [uncultured marine microorganism HF4000_009L19] Length = 317 Score = 39.9 bits (91), Expect = 0.65, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 60/182 (32%), Gaps = 42/182 (23%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 R+G + ++ + P + + L ++ T Y A A+ Sbjct: 119 DDRVGLVTFSHNVRVVVEPP--SLPASLPDALRRVRATGGTALYDATFAAFAL------- 169 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV---AVSAPP 351 T + ++ +DG+++ + L+ + + + + +Y+V V+ Sbjct: 170 ----RERTVGRTLMLVFSDGDDTTSW-----LDPRDVLNTAQRSDVVVYAVNLAGVAPDS 220 Query: 352 EGQDLLRKCT---------------------DSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + R+ ++ G F D+ L +F ++ D+ + + Sbjct: 221 WQERQGRRSARRWFATEPHLFRGQYLPVLAEETGGSVFVAQDTGRLRAAFARVVDEFRRR 280 Query: 391 SV 392 + Sbjct: 281 YL 282 >gi|296269297|ref|YP_003651929.1| vault protein inter-alpha-trypsin domain-containing protein [Thermobispora bispora DSM 43833] gi|296092084|gb|ADG88036.1| Vault protein inter-alpha-trypsin domain protein [Thermobispora bispora DSM 43833] Length = 796 Score = 39.9 bits (91), Expect = 0.65, Method: Composition-based stats. Identities = 23/170 (13%), Positives = 53/170 (31%), Gaps = 23/170 (13%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK----SRLNKLNPYENTNTYPAMHHAY 285 + R+ +A++ I L+ + + L +L T Sbjct: 329 DTLTERDRLAVLAFDNVIERAFPDGLTAATDRARYRAVEFLARLEARGGT---------- 378 Query: 286 RELYNEKESSHNTIGST--RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 L +E+ + ++ +TDG+ ++ + G++++ Sbjct: 379 EMLAPLEEALTALAAAAEGGRDAVLVLVTDGQVGDEDRIL-----ERMASRIG--GVRVH 431 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVR 393 +V + L R +G+ V L E+ + I +I V Sbjct: 432 AVGIDRAVNAAFLGRLAVLGAGRCELVESEDRLDEAMEHIHRRIGAPLVT 481 >gi|183981216|ref|YP_001849507.1| hypothetical protein MMAR_1194 [Mycobacterium marinum M] gi|183174542|gb|ACC39652.1| conserved hypothetical protein [Mycobacterium marinum M] Length = 772 Score = 39.9 bits (91), Expect = 0.65, Method: Composition-based stats. Identities = 27/136 (19%), Positives = 42/136 (30%), Gaps = 16/136 (11%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 N S L L T + +A L + E + V+ +TDG+ S Sbjct: 371 NRFAASSWLGSLRSRGGTVMAQPLTNAVEMLADSGEDRQAS---------VVLVTDGQIS 421 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 G +L + +IY V V L R SG+ V L Sbjct: 422 GEDHLLRSLAPVVGRT-------RIYCVGVDRAVNAGFLERLAGLGSGRAELVESEDRLD 474 Query: 378 ESFDKITDKIQEQSVR 393 E ++ I ++ Sbjct: 475 EVMARLARTIGRPALT 490 >gi|187736264|ref|YP_001878376.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] gi|187426316|gb|ACD05595.1| von Willebrand factor type A [Akkermansia muciniphila ATCC BAA-835] Length = 754 Score = 39.9 bits (91), Expect = 0.65, Method: Composition-based stats. Identities = 29/221 (13%), Positives = 67/221 (30%), Gaps = 45/221 (20%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + + +++ + + A + + PL+++ N +K + Sbjct: 98 SRSMLSKDASPTRLGRAKTAAYDLLDALPGDNFGIIIFSGDAVLLMPLTHDHNALKETIE 157 Query: 268 KLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 +L TN + A + +KE+ K ++ ++DGE++ Sbjct: 158 QLQFGWVSQGGTNLENVVRLALQTFKRDKEA--------DAKNALVILSDGEDTVN---- 205 Query: 324 NTLNTLQICEYMRNAGMKIYSV-----------------AVSAPPEGQ--------DLLR 358 T + E R + I + + GQ + L+ Sbjct: 206 ---ITYKTAEAARQHQLIIVTAGIGTTIGTTIPDEQSPSGLYRDRRGQHVVSKLNPESLQ 262 Query: 359 KCT-DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + GQ+ ++D L I D++ + R Sbjct: 263 YLARQTEGQYVQLSDGAALNRFVKDIADRLDTTEGKEEVRR 303 >gi|149916833|ref|ZP_01905335.1| hypothetical protein PPSIR1_05713 [Plesiocystis pacifica SIR-1] gi|149822550|gb|EDM81939.1| hypothetical protein PPSIR1_05713 [Plesiocystis pacifica SIR-1] Length = 447 Score = 39.9 bits (91), Expect = 0.65, Method: Composition-based stats. Identities = 35/243 (14%), Positives = 69/243 (28%), Gaps = 41/243 (16%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 KS T P + + L + N+ + +I NL Sbjct: 151 DKSGSMFTNTWDHDNNGGTPQITRWNSLYDVVDNITTTFDDSINFGANLFPSTLAQNIYG 210 Query: 246 --GIVGNQCTPLS---NNLNEVKSRLNK---LNPYENTNTYPAMHHAYRELYNEKESSHN 297 + ++ NN ++ + + Y T + AY L + Sbjct: 211 PQACTTSNFPEVTVGENNSAQILATIPGPGVTASYGGTPATLGVTTAYNHLTS------- 263 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE--------YMRNAGMKIYSVAV-- 347 L + +I +TDG + N + + G+ Y V + Sbjct: 264 --LDPELPRAMILVTDGAANCDQNAANNFQLFDVYDDGLPVIVGTAAANGVPTYVVGIDI 321 Query: 348 ---------SAPPEGQD---LLRKCTDSSGQ--FFAVNDSRELLESFDKITDKIQEQSVR 393 P G + +L + + G FF D EL + + +Q ++ Sbjct: 322 INQTINDGIGGDPNGINPTVVLNEVAAAGGTGSFFNTEDQAELEAALTDVVASVQTCTIP 381 Query: 394 IAP 396 ++ Sbjct: 382 LSE 384 >gi|17537919|ref|NP_496258.1| C-type LECtin family member (clec-59) [Caenorhabditis elegans] gi|3881709|emb|CAA88984.1| C. elegans protein ZK666.5, partially confirmed by transcript evidence [Caenorhabditis elegans] Length = 396 Score = 39.9 bits (91), Expect = 0.65, Method: Composition-based stats. Identities = 20/184 (10%), Positives = 54/184 (29%), Gaps = 8/184 (4%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + + W + ++ N+ + R+ Sbjct: 24 ICGQDLSNLWLDVVAVVDNSAGMTKGGLTSVAANIASIFSKNTQIGTNPT-SPKTTRLAL 82 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + YN+ +++++ S +N +++ + + ++ + Sbjct: 83 VTYNVDATTAADLNKFQSIDDIYSGINSALATISSSEESYLARGLSQAEKVFQAGKHGFN 142 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL--LR 358 +K VI LN + + + ++ +G+ I +VA +G L L Sbjct: 143 RAHYQKVVIVYASTYKGSG-----DLNPVPVAQRLKTSGVTIITVAYDQNKDGDILVDLE 197 Query: 359 KCTD 362 K Sbjct: 198 KIAT 201 >gi|198473081|ref|XP_001356166.2| GA18279 [Drosophila pseudoobscura pseudoobscura] gi|198139291|gb|EAL33226.2| GA18279 [Drosophila pseudoobscura pseudoobscura] Length = 1138 Score = 39.9 bits (91), Expect = 0.66, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 55/162 (33%), Gaps = 14/162 (8%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + L + S + + TP +N+ E+KS + + + Sbjct: 178 TAFNILDTLGEDDFVNLITFSEVVKAPVPCFKDRMVRATP--DNIQEIKSAVKAIKLQDT 235 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 N + +A+ L+ +S + + + ++ IT+ + N + Sbjct: 236 ANFTAGLEYAFSLLHKYNQSGSGSQCN----QAIMLITESTSESHKDIIKQYNWPHM--- 288 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF-AVNDSRE 375 ++I++ + + + L S+ FF +ND E Sbjct: 289 ----PVRIFTYLIGSDSSSRSNLHDMACSNKGFFVQINDYEE 326 >gi|195175237|ref|XP_002028364.1| GL15442 [Drosophila persimilis] gi|194117953|gb|EDW39996.1| GL15442 [Drosophila persimilis] Length = 1149 Score = 39.9 bits (91), Expect = 0.66, Method: Composition-based stats. Identities = 23/162 (14%), Positives = 55/162 (33%), Gaps = 14/162 (8%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + L + S + + TP +N+ E+KS + + + Sbjct: 178 TAFNILDTLGEDDFVNLITFSEVVKAPVPCFKDRMVRATP--DNIQEIKSAVKAIKLQDT 235 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 N + +A+ L+ +S + + + ++ IT+ + N + Sbjct: 236 ANFTAGLEYAFSLLHKYNQSGSGSQCN----QAIMLITESTSESHKDIIKQYNWPHM--- 288 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF-AVNDSRE 375 ++I++ + + + L S+ FF +ND E Sbjct: 289 ----PVRIFTYLIGSDSSSRSNLHDMACSNKGFFVQINDYEE 326 >gi|124028121|ref|YP_001013441.1| hypothetical protein Hbut_1264 [Hyperthermus butylicus DSM 5456] gi|123978815|gb|ABM81096.1| conserved archaeal protein [Hyperthermus butylicus DSM 5456] Length = 439 Score = 39.9 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 20/126 (15%), Positives = 46/126 (36%), Gaps = 23/126 (18%) Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 ++ ++ + ++ T+ A+ A ++ + + S +I ITDGE Sbjct: 333 DVVKLLEYVARIRANGGTDITRAILTAVDDIATKLQRSKV--------SDIILITDGE-- 382 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELL 377 + + I + ++++V +S LR +DS + V + Sbjct: 383 ------DKIAIDTIRRSLNKVNARLHTVMISGNNPD---LRAISDS----YMVATKLDRE 429 Query: 378 ESFDKI 383 E+ I Sbjct: 430 EALRVI 435 >gi|321460551|gb|EFX71592.1| hypothetical protein DAPPUDRAFT_326968 [Daphnia pulex] Length = 950 Score = 39.9 bits (91), Expect = 0.67, Method: Composition-based stats. Identities = 15/109 (13%), Positives = 42/109 (38%), Gaps = 23/109 (21%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 T + A + L + ++ +TDG+NS + + + Sbjct: 371 TCIGCGLQLAMQMLKDG--------------GIIVLVTDGKNSPGYHDISDVK-----KD 411 Query: 335 MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND---SRELLESF 380 + +A +++ ++A + +++ + G+ + + D S L ++F Sbjct: 412 IVDAKIRVITIAYGSEA-DKNVEHLADVTGGKSYFIKDDDSSEALQQAF 459 >gi|330904068|gb|EGH34640.1| von Willebrand factor, type A [Pseudomonas syringae pv. japonica str. M301072PT] Length = 84 Score = 39.9 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 13/79 (16%), Positives = 28/79 (35%), Gaps = 15/79 (18%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 +NT A+ A + L + ++ +TDG N+ ++ + Sbjct: 21 AGKNTALGDAIGLALKRL----------RMRPATSRALVLVTDGANNAGQ-----IDPIT 65 Query: 331 ICEYMRNAGMKIYSVAVSA 349 G+KIY + + + Sbjct: 66 AARLAAEEGVKIYPIGIGS 84 >gi|224024928|ref|ZP_03643294.1| hypothetical protein BACCOPRO_01659 [Bacteroides coprophilus DSM 18228] gi|224018164|gb|EEF76162.1| hypothetical protein BACCOPRO_01659 [Bacteroides coprophilus DSM 18228] Length = 339 Score = 39.9 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 20/166 (12%), Positives = 57/166 (34%), Gaps = 46/166 (27%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P++++ K L ++P + T+ A+ A + + + + + Sbjct: 144 LPITSDYISAKMFLESISPSLIATQGTDIRGAIDLAMKSFTP----------NEGVGRAI 193 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 + ITDGEN A + + +++ + V +P Sbjct: 194 VLITDGENHEGGAVEAAKAAAEKGV-------RVFVLGVGSPDGSPIPVEGTNEFRRDKD 246 Query: 353 --------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQE 389 +++ R+ + G + V+++ ++ + +K+ + Sbjct: 247 GNVVVTRLNEEMCREIAQAGNGMYVRVDNTNNAEKALNAEINKLAK 292 >gi|116625492|ref|YP_827648.1| hypothetical protein Acid_6438 [Candidatus Solibacter usitatus Ellin6076] gi|116228654|gb|ABJ87363.1| hypothetical protein Acid_6438 [Candidatus Solibacter usitatus Ellin6076] Length = 754 Score = 39.9 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 8/58 (13%), Positives = 20/58 (34%), Gaps = 6/58 (10%) Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 S+ ++ ++DG ++ L +R + I++V +D Sbjct: 171 AESSSLPLGAIVLLSDGADNAGGID------LATIAAIRRQRIPIHTVGFGKEHPDRD 222 >gi|218670888|ref|ZP_03520559.1| hypothetical protein RetlG_04158 [Rhizobium etli GR56] Length = 133 Score = 39.9 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 5/30 (16%), Positives = 12/30 (40%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQ 30 + ++ L + +ID R ++Q Sbjct: 18 IVILVAVPMLLAVGASIDYIRAYNGRTELQ 47 >gi|38234574|ref|NP_940341.1| putative surface-anchored membrane protein [Corynebacterium diphtheriae NCTC 13129] gi|38200837|emb|CAE50541.1| Putative surface-anchored membrane protein [Corynebacterium diphtheriae] Length = 1872 Score = 39.9 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 26/190 (13%), Positives = 62/190 (32%), Gaps = 34/190 (17%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + + ++ ++A LV S++ + + + A + N+ Sbjct: 62 SNSLSASDVEKSKQAALELVKSLKGSPYRFGIYTFASHSPAAGNKNFTPVSLANDDGYNK 121 Query: 262 VKSRLNKLN-----------PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 V + +N + P TN + ++ + V F Sbjct: 122 VVAAINDIQMPAIRENKKGSPNGGTNWEGGLQAIANDIDRGIKYD-----------AVYF 170 Query: 311 ITDGENSGASAYQNTL----------NTLQICEYMRNAGMKIYSVAVSAPPEGQ--DLLR 358 ITDG+ + + +N L N + + + + G K+ V + + + DL + Sbjct: 171 ITDGQPTWDNNGRNWLGTTTEVVELENAVTQAKLISDKGAKLIPVGIGQLSDDKPFDLYK 230 Query: 359 KCTDSSGQFF 368 S ++ Sbjct: 231 PILPSEDDYY 240 >gi|113969431|ref|YP_733224.1| type IV pilin biogenesis protein [Shewanella sp. MR-4] gi|113884115|gb|ABI38167.1| type IV pilin biogenesis protein, putative [Shewanella sp. MR-4] Length = 1223 Score = 39.9 bits (91), Expect = 0.68, Method: Composition-based stats. Identities = 9/60 (15%), Positives = 23/60 (38%), Gaps = 2/60 (3%) Query: 333 EYMRNAGMKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++ +++ S + L + + G++F D+ +L S + I E+ Sbjct: 471 NSAGKQTVRTFTIGFSEGAASAEHLLKQTAENGGGKYFDATDASKLRSSLQTALNNILEK 530 >gi|290996510|ref|XP_002680825.1| vWFA domain-containing protein [Naegleria gruberi] gi|284094447|gb|EFC48081.1| vWFA domain-containing protein [Naegleria gruberi] Length = 395 Score = 39.9 bits (91), Expect = 0.69, Method: Composition-based stats. Identities = 19/179 (10%), Positives = 57/179 (31%), Gaps = 25/179 (13%) Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI--GIVG 249 + +I+V + +++++ + K +R ++Y Sbjct: 77 NFVDLVIVMDCTGSMSGEIEVAKRTVTTIISTLHE----KFQSDLRFSAVSYRDHTDDYA 132 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 + P + +LN+ K +N ++ + A+ A + + K V+ Sbjct: 133 VKEFPFTKDLNKAKGYINTMSAQGGGDHPEALASALYVINEMPFNKKGK-------KIVV 185 Query: 310 FITDGENSGASAYQNTL------------NTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 ++ D G ++ + +++ ++ + Y + + Q L Sbjct: 186 WVADAPPHGMKTSSDSYPEGCKDQQGNVIDWIKLGSALQEKNVVFYGILCERAKDDQQL 244 >gi|170093379|ref|XP_001877911.1| predicted protein [Laccaria bicolor S238N-H82] gi|164647770|gb|EDR12014.1| predicted protein [Laccaria bicolor S238N-H82] Length = 360 Score = 39.9 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 30/217 (13%), Positives = 67/217 (30%), Gaps = 28/217 (12%) Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 + + P I ++ + + + + + ++R Sbjct: 3 TTQDGKSTAAKSIDIVFLQDATGSQGPY---IRAARQAIQQICSKVSASAE-LSQGAIRF 58 Query: 239 GTIAYNIGIVGNQCT-----PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 G IA+ + + + +K L+ L L + Sbjct: 59 GLIAFRDHPPQDMSFVTKNFGFTAEQSVMKKNLDGLIASGG---GDGPEAQTAALADALN 115 Query: 294 SSHNTIGSTRLKKFVIFITD------GENSGASA-YQNTLNTLQICEYMRNAGMKIYSVA 346 + K V+ ITD GE++ A + + LQ+ M G+ ++ +A Sbjct: 116 LEW----AEGAAKMVVLITDAPPHGIGEDNDGFAESPDQNDPLQLARQMAERGITLFVIA 171 Query: 347 ----VSAPPEGQDLLRKCTD-SSGQFFAVNDSRELLE 378 +S D + D +SG+ F + + +L + Sbjct: 172 CEPTLSRYRNAVDFYKALADITSGKVFPLTLAEKLGD 208 >gi|153812898|ref|ZP_01965566.1| hypothetical protein RUMOBE_03305 [Ruminococcus obeum ATCC 29174] gi|149830976|gb|EDM86066.1| hypothetical protein RUMOBE_03305 [Ruminococcus obeum ATCC 29174] Length = 838 Score = 39.9 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 12/152 (7%), Positives = 43/152 (28%), Gaps = 5/152 (3%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALD---AAVLSGCASIVSDRTIKDPTTKKDQT 60 +I+ +F +D++ + + + A D A L+ + T D + Sbjct: 18 LILVPVLIFSGIIVDISRLYAAKTVISGAGDLTMNAALA-RYDKQLKDSYGLITMADDPS 76 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 S K +++ + + ++ + + + + + Sbjct: 77 SPSMKTYLEQSFLESCNASALKDTKSTDLH-SMIQLELGTEGVEVQGVKNSSLADTQVLQ 135 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICM 152 + ++ + I + + + Sbjct: 136 QQILEYMKFRAPVYMVSDILEKFKKMPLKNMN 167 >gi|308472813|ref|XP_003098633.1| hypothetical protein CRE_04227 [Caenorhabditis remanei] gi|308268233|gb|EFP12186.1| hypothetical protein CRE_04227 [Caenorhabditis remanei] Length = 395 Score = 39.9 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 25/162 (15%), Positives = 51/162 (31%), Gaps = 17/162 (10%) Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL------N 267 + ++I +E R+G + YN N + +++ + Sbjct: 56 RSFVSSTRDTINNIFREASIPRTRVGFVTYNSQATTNADLNKFKSYGDLQQGVYNSYNDM 115 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L+P + + A L + + G K +I N L+ Sbjct: 116 NLSPEKTPYIGTGLIAAGELL----QIQGSADGHVNHPKVIIAYATALNGTGL-----LD 166 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEG--QDLLRKCTDSSGQF 367 L + +++AG+ I ++AV G + L F Sbjct: 167 PLSVANTLKSAGITIITIAVDTDDNGVIEKQLAPLASPGAAF 208 >gi|269124456|ref|YP_003297826.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] gi|268309414|gb|ACY95788.1| von Willebrand factor type A [Thermomonospora curvata DSM 43183] Length = 432 Score = 39.9 bits (91), Expect = 0.70, Method: Composition-based stats. Identities = 25/207 (12%), Positives = 52/207 (25%), Gaps = 34/207 (16%) Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQC 252 S + + L + + A+ Sbjct: 48 IIDTSGSMASDGKLAEAKRAARTAVDTLRDGVHFAVIAG------FHRAEPVYPGGERLA 101 Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 ++ E K + +L T + A+ + + + I +T Sbjct: 102 VASASTKKEAKKAIGRLTSGGGTAIGSWLRMAHGLMSRQGAGG---------VRHAILLT 152 Query: 313 DGENSGASAYQNTLNTLQI-------CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 DG+N +A + + C V +L + + G Sbjct: 153 DGQNQHETAEELDAALRAVSGSFVCDCR------------GVGTDWRVAELRKIASALLG 200 Query: 366 QFFAVNDSRELLESFDKITDKIQEQSV 392 V D R+L F +T+ ++V Sbjct: 201 SVDIVADPRDLAADFRAMTENAMGKTV 227 >gi|183985704|gb|AAI66222.1| LOC100158554 protein [Xenopus (Silurana) tropicalis] Length = 899 Score = 39.9 bits (91), Expect = 0.71, Method: Composition-based stats. Identities = 24/134 (17%), Positives = 50/134 (37%), Gaps = 26/134 (19%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 TN + + + + + ++ +TDGE+SG S+ Sbjct: 381 ATGGTNICAGVQQGLQV--------NRNLDQSTHGTEIVLLTDGEDSGISSCFPD----- 427 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SSGQFFAVN---DSRELLESFDKITD- 385 + +G I+++A+ + L K D + G + D L++SF I Sbjct: 428 ----ITKSGAIIHTIALGNNAD--PGLEKLADLTGGLKLYASDKVDVNGLIDSFSGIVSN 481 Query: 386 --KIQEQSVRIAPN 397 + +QS++I + Sbjct: 482 TGNVTQQSLQIESS 495 >gi|220918111|ref|YP_002493415.1| hypothetical protein A2cp1_3014 [Anaeromyxobacter dehalogenans 2CP-1] gi|219955965|gb|ACL66349.1| conserved hypothetical protein [Anaeromyxobacter dehalogenans 2CP-1] Length = 408 Score = 39.9 bits (91), Expect = 0.72, Method: Composition-based stats. Identities = 10/38 (26%), Positives = 22/38 (57%) Query: 8 VCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIV 45 + F+ A+++ H+ +R ++Q+A DA L+G + Sbjct: 1 MLCAFLALALNVGHLFSVRGELQNASDAGALAGAIELD 38 >gi|329850149|ref|ZP_08264995.1| hypothetical protein ABI_30470 [Asticcacaulis biprosthecum C19] gi|328842060|gb|EGF91630.1| hypothetical protein ABI_30470 [Asticcacaulis biprosthecum C19] Length = 399 Score = 39.9 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 24/190 (12%), Positives = 51/190 (26%), Gaps = 4/190 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGC---ASIVSDRTIKDPTTKK 57 + AI + I A++ + + ++N++Q+A DA L+G + + Sbjct: 20 IFAIAAVPLLVAIGGALEYSDMSGLKNRLQTAADAGALAGAGRLSLASTTGDEDARRVAI 79 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 K A IT + + + + T Sbjct: 80 RTAQDNLKGDSATFTVDIDRTAGTLTVNAAAEHHGITGVLGDKVLRA-SAVGEALQKTPL 138 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 L+ + + + N IS+ ++ + N + Sbjct: 139 CVLQVETGGIVLSQTAAIKAPGCMIHANNNISVTQSAMITGKLIQASGTVTGATNPAGNG 198 Query: 178 KYLLPPPPKK 187 L P P Sbjct: 199 GALPIPDPFN 208 >gi|308270390|emb|CBX27002.1| hypothetical protein N47_A10310 [uncultured Desulfobacterium sp.] Length = 1056 Score = 39.9 bits (91), Expect = 0.73, Method: Composition-based stats. Identities = 31/294 (10%), Positives = 76/294 (25%), Gaps = 64/294 (21%) Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS-KNTTKSKYAPAPAPANRKIDVL 213 ++ + +++ P K + + ++DV Sbjct: 11 PITITSMPTDGWTGWITGKQVLVRFITDANPATVSTGWKIDSYQYQSKTDYNFFSRLDVA 70 Query: 214 IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE 273 + N++ S + + + + + LN + Sbjct: 71 TSAIINVIESTRGRVNWGIMGFGETNATPPMNASYNDDQQ-----KQSIITALNGFSATG 125 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN------TLN 327 ++N +M+ +E ++ + H K FVI ++DG + + Sbjct: 126 DSNIAESMNLVLKEFNSKAQQIHKDCN----KNFVIVLSDGYADHDTLGSDWGSPVADYA 181 Query: 328 TLQICEYMR--------------------------------------------NAGMKIY 343 L E + + ++ Sbjct: 182 ALNAAEASQYHFTQDPFQDANPPADYYDDIAGFMYSHKYTDYTSIPADERADSADNIIVH 241 Query: 344 SVAVSAPPEGQDLLRKCTDSSGQFF-AVNDSRELLESFDKITDKIQEQSVRIAP 396 ++ S +L+ +D G F +L+ +F + I E + AP Sbjct: 242 NIGFSTES---PMLKHASDLGGGIFLTAYSKSQLVNAFYSLGILIAEYTSYTAP 292 >gi|167044051|gb|ABZ08736.1| putative von Willebrand factor type A domain protein [uncultured marine crenarchaeote HF4000_APKG4H17] Length = 522 Score = 39.9 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 35/109 (32%), Gaps = 22/109 (20%) Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 L N + K RL ++ T Y LY++K + ++DG Sbjct: 407 LKWNNSCAK-RLAQIPANGGTPLAEVYDKLYPILYSKKPD------------IFLTLSDG 453 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 E +T + + ++ G+K+ ++ V + Sbjct: 454 EP------SDTFAARSMVKSFKSLGIKMVAIGVGRDTRNAT---IIATN 493 >gi|152988351|ref|YP_001350200.1| hypothetical protein PSPA7_4864 [Pseudomonas aeruginosa PA7] gi|150963509|gb|ABR85534.1| conserved hypothetical protein [Pseudomonas aeruginosa PA7] Length = 556 Score = 39.9 bits (91), Expect = 0.74, Method: Composition-based stats. Identities = 18/159 (11%), Positives = 42/159 (26%), Gaps = 7/159 (4%) Query: 17 IDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKH----- 71 +D + + +Q D A L + ++ + Sbjct: 31 VDTGRLYLEQRNLQRVADVAALESASQGALCGDQTSAQATSFAKASASLNGFDANAAGSG 90 Query: 72 -LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 L + ++ G + + N ++ + + L G +A N Sbjct: 91 LLAEVGGVQSLGGLRRFSSSASNVAVDNEAVRVLVSKSVPGSLVANLAGLFG-GANANVN 149 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 L + S+ A + L+ S+S + Sbjct: 150 LRAEAVARRLPSATISAGTGLASLNSSQSALLNPILSGL 188 >gi|153830633|ref|ZP_01983300.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae 623-39] gi|148873874|gb|EDL72009.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae 623-39] Length = 1426 Score = 39.9 bits (91), Expect = 0.76, Method: Composition-based stats. Identities = 25/210 (11%), Positives = 61/210 (29%), Gaps = 23/210 (10%) Query: 157 SRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 ++ + D + W+ + +++ A + ID L Sbjct: 813 VEPGKNYNIALIVDTSGSMRYDLAGNQNATYDGWNDSWSQTPEQYAASRMKLTIDALKVL 872 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTN 276 A L + + + NL+++ ++N L TN Sbjct: 873 ATQLADHDGTVNITLIGFNGTAADALPFNNLSAA-------NLSDLIGKINLLIADGGTN 925 Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 A A + + + + + F+TDG+ + + +T T + Sbjct: 926 YEDAFIEATKWFNTQ----PGSETDLKFENLTYFLTDGDPTVHNGDSSTGFTTDYADMQN 981 Query: 337 N----------AGMKIYSVAVSAPPEGQDL 356 +G+K ++ + ++L Sbjct: 982 AIDAFTSLSGLSGVK--AIGIGTGVTVENL 1009 >gi|315605762|ref|ZP_07880794.1| conserved hypothetical protein [Actinomyces sp. oral taxon 180 str. F0310] gi|315312460|gb|EFU60545.1| conserved hypothetical protein [Actinomyces sp. oral taxon 180 str. F0310] Length = 339 Score = 39.9 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 16/169 (9%), Positives = 43/169 (25%), Gaps = 31/169 (18%) Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKL--NPYENTNTYPAMHHAYRELYNEKESSHNTIGS- 301 PL+++ L ++ + Y L Sbjct: 135 WNAYSMTMFPLTDDYEMADDVLTEMADTIDGGLSIYGGRISVTPTLEKYLRPLLPDDDEE 194 Query: 302 --------------------TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 + ++ TD E Q + + E+ ++ + Sbjct: 195 KASLVGDGLASCVLGFDHTDKERSRTILLATDNEV----YGQGIYDLAEAIEFAKSQKVT 250 Query: 342 IYSVAVSAP----PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDK 386 + ++ + E +L + + G F+ + + E +I + Sbjct: 251 VSALYPGSEYAMTSEAYELRDQVRSTGGDFYDASSPSAVDEVVKQIESE 299 >gi|307294185|ref|ZP_07574029.1| hypothetical protein SphchDRAFT_1655 [Sphingobium chlorophenolicum L-1] gi|306880336|gb|EFN11553.1| hypothetical protein SphchDRAFT_1655 [Sphingobium chlorophenolicum L-1] Length = 157 Score = 39.9 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 13/139 (9%), Positives = 32/139 (23%), Gaps = 2/139 (1%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDA-AVLSGCASIVSDRTIKDPTTKKDQTS 61 A+I+ + L A D+A + +Q A + L+ +D + Sbjct: 18 ALIMPILVLLACMAGDVAMAFKAKIALQRAAERTGQLATAGGYTNDTSKTQAAYNNLAAD 77 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + Q A + + L Sbjct: 78 AAAAAGVSTN-NVTVTPTLLCDATVQTASPEVPCADGQQTKRYVAITISGSYTPMFAKLM 136 Query: 122 GLIPSALTNLSLRSTGIIE 140 + + + + + Sbjct: 137 PGSNWSTQGIPITGSASVR 155 >gi|167566937|ref|ZP_02359853.1| hypothetical protein BoklE_30536 [Burkholderia oklahomensis EO147] Length = 389 Score = 39.9 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 29/318 (9%), Positives = 85/318 (26%), Gaps = 29/318 (9%) Query: 5 IISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTI---KDPTTKKDQTS 61 +++V F+ A+DL + R+++Q++ DA L+ + + + + Sbjct: 1 MLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITSGHLN 60 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + ++ + + + ++ ++ + L Sbjct: 61 YALFQGFPVQMQTDLSVTFSDSVSGPFQPKSAISSPSSIKYVKCKTSMTGIVNWFIQALN 120 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVL------DVSRSMEDLYLQKHNDNNNMT 175 + ++ N S+ +T + + +I + + Y + Sbjct: 121 MVPGVSVANASVSATAVATIGAAQTTCAIPVFICKAGTQTSPPVAGATYNVGDWLSAKTG 180 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAP-------------APAPANRKIDVLIESAGNLVN 222 S S + + S A + + Sbjct: 181 SPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGNKAATTNAYNTRFG 240 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR-------LNKLNPYENT 275 ++ + AY+ Q S+ +++ ++ + +N + Sbjct: 241 IYHNPYKDPSYGTPDFTGYAYDATTWPAQSNAYSDFVSKRRTFTSYQGDLITGINTGGSY 300 Query: 276 NTYPAMHHAYRELYNEKE 293 + A R L E Sbjct: 301 SASYYQAGADRRLALAPE 318 >gi|94986631|ref|YP_594564.1| hypothetical protein LI0187 [Lawsonia intracellularis PHE/MN1-00] gi|94730880|emb|CAJ54243.1| hypothetical protein LI0187 [Lawsonia intracellularis PHE/MN1-00] Length = 530 Score = 39.9 bits (91), Expect = 0.78, Method: Composition-based stats. Identities = 19/124 (15%), Positives = 38/124 (30%), Gaps = 24/124 (19%) Query: 270 NPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 + +T A+ L E K V +TDG + Sbjct: 431 EAHGSTPMTEALWWVLGMLSTRPEHR----------KIVFVVTDGYPD------DPETAK 474 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + + G+++ + + AP + G + D REL + ++ +I Sbjct: 475 ETIAVAKRMGIEVLGIGIDAPA-------IISMIPGSE-NITDIRELAPAMFRLLQQIMT 526 Query: 390 QSVR 393 + R Sbjct: 527 EKGR 530 >gi|308162059|gb|EFO64486.1| Kinase, NEK [Giardia lamblia P15] Length = 1006 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 22/131 (16%), Positives = 42/131 (32%), Gaps = 6/131 (4%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE-- 261 A + + S +L++S + R + + P + Sbjct: 358 AYMTDNVGMYSLSVDDLISSHSNVPGVRNEPVTRSSLTPAAAATITSHVAPTYSREATRI 417 Query: 262 VKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + SR++ + NT+ + ++R L +SHN S + DG Sbjct: 418 ISSRVDSMRAGADLPGNTSINNGLSPSFRNLLEMNSTSHNYDNSIDDTSPIYNQNDGNVR 477 Query: 318 GASAYQNTLNT 328 AS Q +T Sbjct: 478 VASDTQGAADT 488 >gi|194220813|ref|XP_001500011.2| PREDICTED: vitrin [Equus caballus] Length = 662 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 13/120 (10%), Positives = 39/120 (32%), Gaps = 14/120 (11%) Query: 238 IGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEKESSH 296 +G + Y + + ++K+ + K+ +N A+ + +++ + Sbjct: 316 MGVVQYGDNPATHFNLKTHVSSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKVNGNR 375 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 + + I DG + + R +G+ I+ + + E + Sbjct: 376 GGAPN-----VAVVIVDGWPTD--------KVEEASRLARESGINIFFITIEGAVENEKQ 422 >gi|123473503|ref|XP_001319939.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] gi|121902734|gb|EAY07716.1| von Willebrand factor type A domain containing protein [Trichomonas vaginalis G3] Length = 161 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 20/123 (16%), Positives = 42/123 (34%), Gaps = 21/123 (17%) Query: 258 NLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N+ + L+ ++ TN + H S K + F+TDGE Sbjct: 48 NVANAMNLLDNISANMGGTNILAPLQHVSDL-----------QASEGFVKQIFFLTDGEV 96 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRE 375 + + R+ +I+S+ + + L++ S G + + D+ Sbjct: 97 DNSDIIC------ATAQKNRSTN-RIFSIGLGSGA-DPGLIKGMARKSGGNYAIIGDNDN 148 Query: 376 LLE 378 + E Sbjct: 149 MNE 151 >gi|310825519|ref|YP_003957877.1| hypothetical protein STAUR_8296 [Stigmatella aurantiaca DW4/3-1] gi|309398591|gb|ADO76050.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 1416 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 10/54 (18%), Positives = 22/54 (40%), Gaps = 7/54 (12%) Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVR 393 +++++ LL+ D G + +D+ L + I + I Q +R Sbjct: 429 RVHTIGFGIDSN---LLKNTADVGKGLYATADDAASLTNA---ILNAIAAQDIR 476 >gi|225012027|ref|ZP_03702464.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] gi|225003582|gb|EEG41555.1| von Willebrand factor type A [Flavobacteria bacterium MS024-2A] Length = 346 Score = 39.9 bits (91), Expect = 0.79, Method: Composition-based stats. Identities = 23/158 (14%), Positives = 48/158 (30%), Gaps = 41/158 (25%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 P++ + + K L LN ++ A+ A + + + + I+ Sbjct: 144 LPITTDYSAAKMFLQALNTEMLSSQGTALDSAIDLSGTFFD------DEDQTNRVIFLIS 197 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP--------------------- 351 DGE+ A + G+KI++ V Sbjct: 198 DGEDHSEDASNAASRAAAM-------GIKIFTFGVGTEAGAPIPIKRNNVVESYKKDFNG 250 Query: 352 ------EGQDLLRKCT-DSSGQFFAVNDSRELLESFDK 382 Q+ L + GQ+ ND++ +L+ + Sbjct: 251 EVVITKRNQETLEAIANATDGQYQDGNDTQAVLDFVSE 288 >gi|17545367|ref|NP_518769.1| transmembrane protein [Ralstonia solanacearum GMI1000] gi|17427659|emb|CAD14178.1| putative transmembrane protein [Ralstonia solanacearum GMI1000] Length = 347 Score = 39.9 bits (91), Expect = 0.80, Method: Composition-based stats. Identities = 43/322 (13%), Positives = 93/322 (28%), Gaps = 20/322 (6%) Query: 5 IISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTI-------KDPTTKK 57 ++ F A+D+A ++ +RN++Q+A DAA L+G A + T+ Sbjct: 25 MLIAIFSVGALAVDVARLIVVRNELQNAADAAALAGAAGLYPTNPTPNWSNGVAQGTSAV 84 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + + + ++ G + Q I + +Q A Sbjct: 85 KLNKSSNVQLVSGTVQAGYWNLTGTPAGLQSQSITPGANDVPAVQVTINRSAGNNGGPVA 144 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 L + + N S + +I L +S+ + DLY + + Sbjct: 145 TLLAPIFGALSANSSATAVAVIAAPGSAGP-GALFPLAISKCLYDLYWNYATGQPKIDPS 203 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS-------IQKAIQE 230 + + ++ + L N S I A+ Sbjct: 204 TGKPYVFQINTSYPSSSNCTSGEWTGFNGPTDASTLKGLVQNGNTSTLSIGNMINTALAS 263 Query: 231 KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE----NTNTYPAMHHA-Y 285 SV + + + + PL+ +E + + TN+ Sbjct: 264 GVKSSVYMAIPSTPLNVTIPVVNPLNPGASEPIYAFAGIQITKVVCCGTNSTIWASFISN 323 Query: 286 RELYNEKESSHNTIGSTRLKKF 307 ++ N + G+ + Sbjct: 324 WKVSNSGGGTGPYYGAYVPPRL 345 >gi|293342302|ref|XP_001059971.2| PREDICTED: anthrax toxin receptor 1-like [Rattus norvegicus] gi|293354106|ref|XP_344486.4| PREDICTED: anthrax toxin receptor 1-like [Rattus norvegicus] gi|149034121|gb|EDL88891.1| similar to hypothetical protein 4933430J11 (predicted) [Rattus norvegicus] Length = 535 Score = 39.9 bits (91), Expect = 0.81, Method: Composition-based stats. Identities = 16/82 (19%), Positives = 29/82 (35%), Gaps = 10/82 (12%) Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 L + P T+ + A ++ H ++ +TDG Sbjct: 107 LKNIEPQGLTHMQKGLIKANEQIQKSAARGHRA------VSVIVALTDGLLLLKPY---- 156 Query: 326 LNTLQICEYMRNAGMKIYSVAV 347 L+T++ + R G IY+V V Sbjct: 157 LDTMEEAKKARKLGAIIYTVGV 178 >gi|291166457|gb|EFE28503.1| hypothetical protein HMPREF0389_00418 [Filifactor alocis ATCC 35896] Length = 637 Score = 39.9 bits (91), Expect = 0.82, Method: Composition-based stats. Identities = 31/242 (12%), Positives = 66/242 (27%), Gaps = 21/242 (8%) Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 + ++ + N +K P + K L Sbjct: 399 NAATIIPTLEGNYISKNKSDFSKKKNTNPIYRDKNYDMDAFQKRQFKHFTNVTYEHRLDN 458 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 S ++K QE V+ I I + LS + + ++ N Sbjct: 459 SFNFKEPELRKYYQEGNACEVK--IIKPTNTIYDSVTQELSLDYEHIVTQQNGKE----- 511 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA-YQNTLNTLQICEY 334 ++ AY HN + +V + N+ + ++ + + + Sbjct: 512 ---ESIQEAYASQKGSGAWLHNNWSAATGNTYVSKVGKLYNTPDNYLKNSSPQSRKAIKD 568 Query: 335 MRNAGMKIYSVAV-------SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 M ++Y +A + S + F N +L + F +I + I Sbjct: 569 MVTKK-RVYIIAFSRSVSTSGLNAIADEF--SIKGDSKRVFRANSGEDLSKIFQEIGESI 625 Query: 388 QE 389 Sbjct: 626 NN 627 >gi|158294250|ref|XP_315491.4| AGAP005490-PA [Anopheles gambiae str. PEST] gi|157015476|gb|EAA10906.4| AGAP005490-PA [Anopheles gambiae str. PEST] Length = 1239 Score = 39.5 bits (90), Expect = 0.83, Method: Composition-based stats. Identities = 28/349 (8%), Positives = 90/349 (25%), Gaps = 18/349 (5%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQK 88 ++ +DA V + + + ++ + K+ ++ G I I + Sbjct: 90 LKYMMDAKVSAVKRIMDTAENTAISFDEEPVNQSFQYYNAKQMIEPGEIITTPIPMIDED 149 Query: 89 AQINITKDKNNPLQYI---------AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGII 139 T + + N++ + S I Sbjct: 150 PADITTPIPPKEIVLTKKRHFFNEAVNTTVSSVHVPTNVYDRATEVIKAIKWSEALDSIF 209 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + S + + + + L ++ S Sbjct: 210 YNNYIGDPTLTWQYFGSSSGFLRQFPATKWEQDPVDLYDCRLRSWYIEAANSPKDMLILV 269 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 + + ++ D+ N+++++ + N+ Sbjct: 270 DSSGSMTGQRKDIAKHVVSNILDTLG-PNDYVNIFTFSEEVAEVVPCFRDTLVQANMGNI 328 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 E+K ++ + TN + A + E T R + ++ ++DG Sbjct: 329 RELKLGMDNIE----TNEIANVSAALTRAFELLEQFRETRNGARCNQAIMLVSDGVPYSF 384 Query: 320 SAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 N ++ +++++ + +++ + ++ Sbjct: 385 DEVFEQFNWKELPFI----PVRVFTYLIGREVADVKEIKEMACRNQGYY 429 >gi|156382210|ref|XP_001632447.1| predicted protein [Nematostella vectensis] gi|156219503|gb|EDO40384.1| predicted protein [Nematostella vectensis] Length = 8745 Score = 39.5 bits (90), Expect = 0.84, Method: Composition-based stats. Identities = 20/209 (9%), Positives = 57/209 (27%), Gaps = 21/209 (10%) Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS---IQKAIQEKKNLSVRIGTIAYNIG 246 ++ S + K + L + + V + + Sbjct: 7889 FASRIKNSSEVGSTKTPINKSAAARFATNRLKGDRRFLYGSSANASRTRVLPTIVGTHST 7948 Query: 247 IVGNQCTPLSN---NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 + + +K+ ++ + N A+ A + Sbjct: 7949 TFTAAEVDFKQEFPDDSSLKTAIDSVKTRPGNNLTEALIKAKALFTRDSRP--------N 8000 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS 363 ++K ++ ITD + + + + + +G+++ VA + L + Sbjct: 8001 VRKVLVVITD-----KGSGHDEDDIKLAAKALVTSGVQVIPVAFGRSADD-GELVAIAGN 8054 Query: 364 SGQFFAVNDSRE-LLESFDKITDKIQEQS 391 D E + + +T I+ + Sbjct: 8055 KKDIVKTKDEDEPITSAEKVMTTIIESEL 8083 >gi|271963054|ref|YP_003337250.1| hypothetical protein Sros_1514 [Streptosporangium roseum DSM 43021] gi|270506229|gb|ACZ84507.1| hypothetical protein Sros_1514 [Streptosporangium roseum DSM 43021] Length = 594 Score = 39.5 bits (90), Expect = 0.84, Method: Composition-based stats. Identities = 12/78 (15%), Positives = 30/78 (38%), Gaps = 7/78 (8%) Query: 306 KFVIFITDGENS-GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 V+F+TDG+N + + L N ++++++ + L R + Sbjct: 518 NAVVFLTDGKNEKTGGSDLDNLLGKL------NPDVRLFTIGYGEGADQGVLKRIAEATD 571 Query: 365 GQFFAVNDSRELLESFDK 382 G + + + + + F Sbjct: 572 GAAYDSSRADTIDQVFTS 589 >gi|194228056|ref|XP_001914937.1| PREDICTED: similar to Gene model 784, (NCBI) [Equus caballus] Length = 1407 Score = 39.5 bits (90), Expect = 0.86, Method: Composition-based stats. Identities = 21/233 (9%), Positives = 58/233 (24%), Gaps = 26/233 (11%) Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 + + G ++ N P Y +S + + + + TN+ Sbjct: 231 IPSTYTVDYAPGTYTVDNTLSTFTADNAPHTYTGDSTSSTYTVDDTSGAYTVDNAPRTNI 290 Query: 132 SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS 191 S + + ++ L S + L H ++ ++ P + + Sbjct: 291 VGNSLSTYTVDNVPCSYTVENTL--STCTVNNTLSTHTVDSAPSTCTVDSAPATNTANNT 348 Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 N + P + + N + + + Sbjct: 349 LNIYTAHNTPNTY----------------------TVDDPPNTYIADN--TPSNSSTDHA 384 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 + + + + S ++ + +TN P + L + Sbjct: 385 PSTSTTDTSLPPSTIDSVPSPSSTNYAPQTSTSDGTLTPSSIDGTPGSSISDS 437 >gi|322807127|emb|CBZ04701.1| T. pallidum predicted coding region TP0592 [Clostridium botulinum H04402 065] Length = 283 Score = 39.5 bits (90), Expect = 0.86, Method: Composition-based stats. Identities = 25/171 (14%), Positives = 56/171 (32%), Gaps = 13/171 (7%) Query: 212 VLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP 271 + +++ I K + + + + NN + ++ L +N Sbjct: 63 IYNSDIQDMLEQIYPLDITGKLMDKDFDPGRFRVYPLLEDIYG--NNSSTIEKNLKNINT 120 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + A L N + H S I+ +G + L + Sbjct: 121 SYGTVQFNNNSKAAESLKNVLDELHGISKSNDKLNSYIYPLNGTFNYRHIAGTNLLSPHA 180 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + +A+ + +D + T+S GQ + +E++E+F+K Sbjct: 181 -----------FGIAIDLVRDNRDYWKWATESQGQERIASYPKEIVETFEK 220 >gi|113866862|ref|YP_725351.1| RND superfamily exporter [Ralstonia eutropha H16] gi|113525638|emb|CAJ91983.1| predicted exporter of the RND superfamily [Ralstonia eutropha H16] Length = 792 Score = 39.5 bits (90), Expect = 0.86, Method: Composition-based stats. Identities = 13/46 (28%), Positives = 20/46 (43%), Gaps = 7/46 (15%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSA-----LDAAVLSGC 41 +T + V L + +D A +Y R Q+Q A DA L+ Sbjct: 672 LTVATLPVMVLAVGIGVDYAFYIYNRLQVQLANGLAFTDA--LAVA 715 >gi|308472927|ref|XP_003098690.1| hypothetical protein CRE_04223 [Caenorhabditis remanei] gi|308268290|gb|EFP12243.1| hypothetical protein CRE_04223 [Caenorhabditis remanei] Length = 411 Score = 39.5 bits (90), Expect = 0.87, Method: Composition-based stats. Identities = 19/155 (12%), Positives = 51/155 (32%), Gaps = 13/155 (8%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL----NKLNPYENTNTYPAMHHAYRELYN 290 + R+G I YN N + +++ + + + + ++ + A + L Sbjct: 86 TTRVGLITYNFNATLNANLSQFQSYDDLSNGVFHSLSNVTNSTDSFIGTGLAMAEQLL-- 143 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + + NT K +++ + + + I + ++ +G+KI +V Sbjct: 144 -RRQNFNTTRDHYKKVIIVYASAFQRNEDET------PEWIADRLKGSGVKIITVGYGNS 196 Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 L F + D + + + Sbjct: 197 HGLIKSLSNIASPGLSFNSSGDGNLINQIQTSLLQ 231 >gi|115665362|ref|XP_001180845.1| PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus] gi|115941435|ref|XP_001179810.1| PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus] Length = 763 Score = 39.5 bits (90), Expect = 0.87, Method: Composition-based stats. Identities = 21/152 (13%), Positives = 56/152 (36%), Gaps = 29/152 (19%) Query: 257 NNLNEVKSRLNKLN--------PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 +N+ E+ S +++ + T + A + S ++ Sbjct: 217 SNMTELTSAISRKDLASLLPTYADGATCIGCGIQTAIQVAQYNGMDSRGV--------YL 268 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 I ++DG+ + + +TL+ ++ ++G+ ++S+A E L +G Sbjct: 269 ILLSDGQENSGTLIADTLDDIE------DSGVIVHSIAF---YEADTQLEDLAQMTGGIS 319 Query: 369 ----AVNDSRELLESFDKITDKIQEQSVRIAP 396 ++ ++ +F+ I + + AP Sbjct: 320 ATCADGGSAQCVISAFESIIAQRPQSVATSAP 351 >gi|115374114|ref|ZP_01461402.1| type IV pilin biogenesis protein, putative [Stigmatella aurantiaca DW4/3-1] gi|310825522|ref|YP_003957880.1| hypothetical protein STAUR_8299 [Stigmatella aurantiaca DW4/3-1] gi|115368890|gb|EAU67837.1| type IV pilin biogenesis protein, putative [Stigmatella aurantiaca DW4/3-1] gi|309398594|gb|ADO76053.1| uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 167 Score = 39.5 bits (90), Expect = 0.87, Method: Composition-based stats. Identities = 10/44 (22%), Positives = 19/44 (43%), Gaps = 4/44 (9%) Query: 340 MKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFDK 382 + +Y+V + LLR + G + VND L ++ + Sbjct: 119 VNVYTVGLGIDSN---LLRNAANEGGGLSYRVNDVAGLKQAIQE 159 >gi|77458974|ref|YP_348480.1| transporter, putative [Pseudomonas fluorescens Pf0-1] gi|77382977|gb|ABA74490.1| Putative transport protein [Pseudomonas fluorescens Pf0-1] Length = 804 Score = 39.5 bits (90), Expect = 0.87, Method: Composition-based stats. Identities = 10/33 (30%), Positives = 16/33 (48%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSAL 33 +T + V L + +D A +Y R Q+ AL Sbjct: 684 LTIATLPVMVLAVGIGVDYAFYIYNRLQLHQAL 716 >gi|255918329|gb|ABL86001.2| E cadherin [Tenebrio molitor] Length = 1625 Score = 39.5 bits (90), Expect = 0.88, Method: Composition-based stats. Identities = 27/200 (13%), Positives = 52/200 (26%), Gaps = 7/200 (3%) Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI---DVLIESAGNLVNSIQK 226 + +T N + + S + P I D ++ K Sbjct: 563 ASTQLTINVIDVNDENPRLVVSSTIAVEENQADNYPLETNIAASDEDSDANLEFSIDWSK 622 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + K + ++ I + K L + P NT Y Y Sbjct: 623 SYATKNSQRIKD--FENYHCINVETVPGDDLHTATAKLSLKETQP-GNTPDYETFDTLYI 679 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 +L + + T VI I D ++ NT + G+ I +V Sbjct: 680 QL-TVTDKNTTEGDGTDSALIVINIQDVNDNKPIFADNTAELKREVTENTKDGIIISTVT 738 Query: 347 VSAPPEGQDLLRKCTDSSGQ 366 + ++ + Sbjct: 739 ATDADVDNNVTYAIKSADEN 758 >gi|193208855|ref|NP_505147.3| C-type LECtin family member (clec-218) [Caenorhabditis elegans] gi|163644494|gb|AAY43994.2| C-type lectin protein 218 [Caenorhabditis elegans] Length = 389 Score = 39.5 bits (90), Expect = 0.88, Method: Composition-based stats. Identities = 19/154 (12%), Positives = 44/154 (28%), Gaps = 12/154 (7%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 I+ R+ + Y+ + + +E+ S + L P T+ Sbjct: 65 FGFTQIRVGSNYPDKRGARVSVLTYSDSPTVHANLSDFKSTDELTSMIYALKPS--TSYD 122 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 + + + + N + +I + Q + ++ Sbjct: 123 SNLQSSLKLVKNMMNYKDINAPRNNTQTVIIIYAGDYV-----DYDEPTIAQFGDQLKAD 177 Query: 339 GMKIYSVA-VSAPPEGQ----DLLRKCTDSSGQF 367 G+KI +VA +S L++ F Sbjct: 178 GVKIITVADISNTDHQHVSKLKWLKELASEGNGF 211 >gi|170741047|ref|YP_001769702.1| hypothetical protein M446_2843 [Methylobacterium sp. 4-46] gi|168195321|gb|ACA17268.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 439 Score = 39.5 bits (90), Expect = 0.88, Method: Composition-based stats. Identities = 7/34 (20%), Positives = 16/34 (47%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALD 34 + + + + A++ A I R ++Q A+D Sbjct: 18 ILGLCLPMLVAGSGAALEYARIHKRRTELQKAVD 51 >gi|115374116|ref|ZP_01461404.1| hypothetical protein STIAU_3108 [Stigmatella aurantiaca DW4/3-1] gi|115368892|gb|EAU67839.1| hypothetical protein STIAU_3108 [Stigmatella aurantiaca DW4/3-1] Length = 1408 Score = 39.5 bits (90), Expect = 0.88, Method: Composition-based stats. Identities = 10/54 (18%), Positives = 22/54 (40%), Gaps = 7/54 (12%) Query: 341 KIYSVAVSAPPEGQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQSVR 393 +++++ LL+ D G + +D+ L + I + I Q +R Sbjct: 421 RVHTIGFGIDSN---LLKNTADVGKGLYATADDAASLTNA---ILNAIAAQDIR 468 >gi|72009654|ref|XP_784730.1| PREDICTED: similar to Clca1 protein [Strongylocentrotus purpuratus] Length = 580 Score = 39.5 bits (90), Expect = 0.88, Method: Composition-based stats. Identities = 12/126 (9%), Positives = 42/126 (33%), Gaps = 21/126 (16%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKESSHN 297 + + + ++ +N L P +T + + L + + Sbjct: 12 IVVFWSESQIRADLTDITDTALRQTLVNALPPSPTGSTCIGCGIESGLKVLGSYAQG--- 68 Query: 298 TIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 +++ ++DG + + + ++N+G+ I ++ +S + Q + Sbjct: 69 --------GYILLLSDGVENNKPYIRYMYD------DIKNSGVIIDTITISNSADQQ--M 112 Query: 358 RKCTDS 363 + + Sbjct: 113 EDLSTN 118 >gi|320161534|ref|YP_004174758.1| hypothetical protein ANT_21320 [Anaerolinea thermophila UNI-1] gi|320161798|ref|YP_004175023.1| hypothetical protein ANT_23970 [Anaerolinea thermophila UNI-1] gi|319995387|dbj|BAJ64158.1| hypothetical protein ANT_21320 [Anaerolinea thermophila UNI-1] gi|319995652|dbj|BAJ64423.1| hypothetical protein ANT_23970 [Anaerolinea thermophila UNI-1] Length = 150 Score = 39.5 bits (90), Expect = 0.89, Method: Composition-based stats. Identities = 8/34 (23%), Positives = 18/34 (52%), Gaps = 3/34 (8%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAA 36 +++ + + D A ++Y+R ++ A DAA Sbjct: 23 GLVLIPLLMLVG---DGARLLYVRGRLAQATDAA 53 >gi|198437300|ref|XP_002125380.1| PREDICTED: similar to Collagen alpha-1(XII) chain [Ciona intestinalis] Length = 955 Score = 39.5 bits (90), Expect = 0.89, Method: Composition-based stats. Identities = 13/215 (6%), Positives = 44/215 (20%), Gaps = 28/215 (13%) Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSK---------NTTKSKYAPAPAPANRKIDVLIE- 215 + S + + A +D Sbjct: 137 TSVVPAAINEQATWWAFNYRTSSTSNVFEISAAELGVNDFNNCGSLGADIIFLVDGASSI 196 Query: 216 ---SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + +L++ + + + R G + + + + + P Sbjct: 197 GATNFQHLLDFVLATGEIFNQTTNRYGLTVFATTSTTYGELFNVTSRDFLNVVAGAMQPP 256 Query: 273 E--NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 +T + + + + + ++ + + L Sbjct: 257 GEPSTWIGSGIQTVLDSGFQTSNGARPSSMAIPKILVLVV-----------ANPSNDQLS 305 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG 365 E AG+ + V + + + Sbjct: 306 SAELAHAAGITTFVVG--TGNALTSEMNQIAEDPD 338 >gi|218780334|ref|YP_002431652.1| OmpA/MotB domain protein [Desulfatibacillum alkenivorans AK-01] gi|218761718|gb|ACL04184.1| OmpA/MotB domain protein [Desulfatibacillum alkenivorans AK-01] Length = 387 Score = 39.5 bits (90), Expect = 0.89, Method: Composition-based stats. Identities = 19/119 (15%), Positives = 39/119 (32%), Gaps = 22/119 (18%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK---------FVIFITDGENSG 318 L T + +A L+ S + T K +I ++DG S Sbjct: 111 DLQYGFATYSSTEFANAVMPLWGPNGPSPMGLAITEADKDLCSTSGRTALIILSDGHPSD 170 Query: 319 ASAYQNTLNTLQICEYMRNA---GMKIYSVAVSAPPEGQDLLRKCTDSSGQ--FFAVND 372 + ++ +++ + I ++AV A G+++L G F + Sbjct: 171 --------DAVKAARKLKDRLGGKLCIATIAVGADASGKEILSGIAKIGGCDAAFDAKN 221 >gi|170091410|ref|XP_001876927.1| predicted protein [Laccaria bicolor S238N-H82] gi|164648420|gb|EDR12663.1| predicted protein [Laccaria bicolor S238N-H82] Length = 493 Score = 39.5 bits (90), Expect = 0.90, Method: Composition-based stats. Identities = 35/294 (11%), Positives = 72/294 (24%), Gaps = 18/294 (6%) Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIP--TENLFLKGL 123 K + + + IT +N + Y + Sbjct: 39 KLTTLDTSSAEQKWKISAVSGKTDVFTITSVVDNAGLNYKRTSQTYWGYGYPQPKANSTS 98 Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDV-SRSMEDLYLQKHNDNNNMTSNKYL-- 180 + + + + + + D S + +N++ + + KY Sbjct: 99 LNWNIQEKTSDGKKFSKIHLDGDDQTYFDSNDNKSTKNAINFYYNNNNSTSGANQKYEFE 158 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 P + K I++ N + + K K ++V I Sbjct: 159 QVPDEPSGAALDIVFIQDITGSQQAFIDKARDEIQNTINGIVNSGKIASGKLRVAVVIFR 218 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + +++EVK LN + A + Sbjct: 219 DHTPEDTFLTGKLDFTTDVDEVKKYLNNQIATGGGDGPEGQCCALNDALE-----LLLTS 273 Query: 301 STRLKKFVIFITDGENSGASAYQNTL--------NTLQICEYMRNAGMKIYSVA 346 K I TD G + + + + E M G+ +Y VA Sbjct: 274 DDDTTKIAILTTDSPPHGIGEPGDKIPDGCPLQNDPCETAESMARNGITLYVVA 327 >gi|85859136|ref|YP_461338.1| hypothetical protein SYN_01495 [Syntrophus aciditrophicus SB] gi|85722227|gb|ABC77170.1| hypothetical exported protein [Syntrophus aciditrophicus SB] Length = 364 Score = 39.5 bits (90), Expect = 0.90, Method: Composition-based stats. Identities = 26/269 (9%), Positives = 64/269 (23%), Gaps = 26/269 (9%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAV-----LSGCASIVSDRTIKDPTT 55 + A+ ++V A+D+ HIM ++N++ +A DA L + + PT Sbjct: 22 IVALAMTVFLGIAAMAVDVGHIMVVKNELHNAADADALARANLLYAHTPSGFTSATPPTP 81 Query: 56 KKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKA-----------QINITKDKNNPLQYI 104 + + G + + I ++ Sbjct: 82 DWAAAESAASTIDPANKSDGVTLTSYEVETGYWNLDQNPAGLQPKSITPGTRDVAAVKVT 141 Query: 105 AESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLA------ISICMVLDVSR 158 + + + + +N S + I + + Sbjct: 142 VRRVDGTNGGSIRHWFGAFVGNLTSNASATAIAICSSPGTAKPGTLVPMAIPLWIAKRAS 201 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + P P ++ + P N + Sbjct: 202 HYNSPSNLLTIGSAY--HYDAYAPVNPDGDPYTNTVAGQWTSLTSDPINNANHLKDIIVN 259 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGI 247 N + +I + + + Y+ Sbjct: 260 GNSNPL--SIGDPLYIEPGTMDVGYHDNY 286 >gi|170693230|ref|ZP_02884390.1| type VI secretion protein IcmF [Burkholderia graminis C4D1M] gi|170141760|gb|EDT09928.1| type VI secretion protein IcmF [Burkholderia graminis C4D1M] Length = 1189 Score = 39.5 bits (90), Expect = 0.91, Method: Composition-based stats. Identities = 14/76 (18%), Positives = 25/76 (32%), Gaps = 4/76 (5%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSAL----DAAVLSGCASIVSDRTIKDPTTKK 57 + + L +A A ++Q AL DAA L+G + S + Sbjct: 449 AGLAVVTLALVAGFATSYARNRAFVAEVQGALKDLPDAATLAGAPDLKSYFARALARLEV 508 Query: 58 DQTSTIFKKQIKKHLK 73 + Q + H+ Sbjct: 509 ISATQDAAGQYRAHVP 524 >gi|15900378|ref|NP_344982.1| cell wall surface anchor family protein [Streptococcus pneumoniae TIGR4] gi|284055428|pdb|2WW8|A Chain A, Structure Of The Pilus Adhesin (Rrga) From Streptococcus Pneumoniae gi|14971932|gb|AAK74622.1| cell wall surface anchor family protein [Streptococcus pneumoniae TIGR4] Length = 893 Score = 39.5 bits (90), Expect = 0.91, Method: Composition-based stats. Identities = 12/131 (9%), Positives = 32/131 (24%), Gaps = 25/131 (19%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +++ ++ Y L S G+ + N Sbjct: 473 SEMKAAGYAVIGDPINGGYIWLN--WRESILAY--PFNSNTAKITNHGDPTRWYYNGN-- 526 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQFFAVNDS----RE 375 + G +++V + + ++ + + V D+ + Sbjct: 527 --------IAPDGYDVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQ 578 Query: 376 LLESFDKITDK 386 L F I + Sbjct: 579 LNRYFHTIVTE 589 >gi|108758893|ref|YP_632824.1| pilus biogenesis protein [Myxococcus xanthus DK 1622] gi|108462773|gb|ABF87958.1| pilus biogenesis protein, TadE family [Myxococcus xanthus DK 1622] Length = 397 Score = 39.5 bits (90), Expect = 0.92, Method: Composition-based stats. Identities = 21/176 (11%), Positives = 52/176 (29%), Gaps = 16/176 (9%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+ + + + + +++ L ++ + ++Q A AV + +SD + Sbjct: 19 FALSVPLLVMILMFSMYLTELVRAKLKLQEAARYAVWEMTSYALSDFANGKHDDAFEDAR 78 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 K+ + K + N N + + A L Sbjct: 79 REAHKEFVERYKDMDSVEPN-------------GTGGNFIARYTDVTATISNKEIALLES 125 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSN 177 G++ T+ G+ L + VL+ +++ + N Sbjct: 126 GMLSRPSTS---EGGGLAGAILSPLNNGMGWVLNQWGFNNKGWVESEVEMNYENVI 178 >gi|154432890|gb|ABS82078.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432914|gb|ABS82099.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|154432922|gb|ABS82106.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|332077748|gb|EGI88209.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA41301] Length = 886 Score = 39.5 bits (90), Expect = 0.93, Method: Composition-based stats. Identities = 12/131 (9%), Positives = 32/131 (24%), Gaps = 25/131 (19%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +++ ++ Y L S G+ + N Sbjct: 466 SEMKAAGYAVIGDPINGGYIWLN--WRESILAY--PFNSNTAKITNHGDPTRWYYNGN-- 519 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQFFAVNDS----RE 375 + G +++V + + ++ + + V D+ + Sbjct: 520 --------IAPDGYDVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQ 571 Query: 376 LLESFDKITDK 386 L F I + Sbjct: 572 LNRYFHTIVTE 582 >gi|332254890|ref|XP_003276566.1| PREDICTED: integrin alpha-2 [Nomascus leucogenys] Length = 1181 Score = 39.5 bits (90), Expect = 0.94, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 63/187 (33%), Gaps = 24/187 (12%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 ++ N + + + ++G I Y E+ ++ + Y Sbjct: 189 DAVKNFLEKFVQ-GLDVGPTKTQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQYGG 247 Query: 275 --TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TNT+ A+ +A + Y+ + K ++ +TDGE+ S + Sbjct: 248 DLTNTFGAIQYARKYAYSAASGGRRSAT-----KVMVVVTDGESHDGSM------LKAVI 296 Query: 333 EYMRNAGMKIYSVAV----SAPPEGQDLL----RKCTD--SSGQFFAVNDSRELLESFDK 382 + + + + +AV + L + + FF V+D LLE Sbjct: 297 DQCNHDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGT 356 Query: 383 ITDKIQE 389 + ++I Sbjct: 357 LGEQIFS 363 >gi|154432962|gb|ABS82141.1| ancillary pilus subunit [Streptococcus pneumoniae] Length = 886 Score = 39.5 bits (90), Expect = 0.94, Method: Composition-based stats. Identities = 12/131 (9%), Positives = 32/131 (24%), Gaps = 25/131 (19%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +++ ++ Y L S G+ + N Sbjct: 466 SEMKAAGYAVIGDPINGGYIWLN--WRESILAY--PFNSNTAKITNHGDPTRWYYNGN-- 519 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQFFAVNDS----RE 375 + G +++V + + ++ + + V D+ + Sbjct: 520 --------IAPDGYDVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQ 571 Query: 376 LLESFDKITDK 386 L F I + Sbjct: 572 LNRYFHTIVTE 582 >gi|148993547|ref|ZP_01823038.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP9-BS68] gi|147927916|gb|EDK78937.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP9-BS68] Length = 886 Score = 39.5 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 12/131 (9%), Positives = 33/131 (25%), Gaps = 25/131 (19%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +++ ++ Y L S G+ + N Sbjct: 466 SEMKAVGYAVIGDPINGGYIWLN--WRESILAY--PFNSNTAKITNHGDPTRWYYNGN-- 519 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQFFAVNDS----RE 375 + G +++V + + + ++ + + V D+ + Sbjct: 520 --------IAPDGYDVFTVGIGINGDPGTDEATATNFMQSISSKPENYTNVTDTTKILEQ 571 Query: 376 LLESFDKITDK 386 L F I + Sbjct: 572 LNRYFHTIVTE 582 >gi|291301210|ref|YP_003512488.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] gi|290570430|gb|ADD43395.1| von Willebrand factor type A [Stackebrandtia nassauensis DSM 44728] Length = 609 Score = 39.5 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 21/181 (11%), Positives = 64/181 (35%), Gaps = 10/181 (5%) Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL- 259 P ++D++ +S ++ + +A + + Sbjct: 425 PYDGEKLSRMDIIRKSLERSLDLFGEQANVGLWRYPYDDPVAGTAYQKLVEIGEFDKSRQ 484 Query: 260 NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 ++++S+L+ + P + + AY+ + + + I V+ I+DG + Sbjct: 485 DDIESQLSAVEPAADGGLNDTVVEAYKNVLDNYNKTTGAIN------LVVVISDGGSESD 538 Query: 320 SAYQNTLNTLQICE-YMRNAG--MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 ++ N T ++ + ++ I ++ + L T + G+++ + E+ Sbjct: 539 ASLSNEDVTEELKDLSAKDRDKEASIMTIGYGKDADKDHLDAIATATQGRYYPAKWNDEI 598 Query: 377 L 377 Sbjct: 599 N 599 >gi|220923693|ref|YP_002498995.1| TadE family protein [Methylobacterium nodulans ORS 2060] gi|219948300|gb|ACL58692.1| TadE family protein [Methylobacterium nodulans ORS 2060] Length = 130 Score = 39.5 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 15/115 (13%), Positives = 36/115 (31%), Gaps = 7/115 (6%) Query: 2 TAII-ISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 A++ + + + ++L + +RNQ+ A D G +++ ++TI D + Sbjct: 19 FAMVGMIMLVTMLGI-VELGRGLNVRNQLSQAAD----FGARAVLMNKTISDSGLEA-VI 72 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT 115 F+ L+ G + ++ T Sbjct: 73 RAAFQAASPDQLQVTVGAEVVNGLQFRTVSVSYPFVPLTWGFSTGTINLSVSRRT 127 >gi|152984932|ref|YP_001350526.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa PA7] gi|150960090|gb|ABR82115.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa PA7] Length = 1155 Score = 39.5 bits (90), Expect = 0.95, Method: Composition-based stats. Identities = 12/43 (27%), Positives = 22/43 (51%) Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 AV + DL +S G+FF+ + +L+ +F I ++I Sbjct: 448 AVDSANNVYDLWHAAVNSRGEFFSADSPDQLVAAFQAIVNRIS 490 >gi|167919151|ref|ZP_02506242.1| hypothetical protein BpseBC_11405 [Burkholderia pseudomallei BCC215] Length = 594 Score = 39.5 bits (90), Expect = 0.96, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 23 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 81 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 82 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 129 >gi|145489795|ref|XP_001430899.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124398000|emb|CAK63501.1| unnamed protein product [Paramecium tetraurelia] Length = 567 Score = 39.5 bits (90), Expect = 0.96, Method: Composition-based stats. Identities = 28/126 (22%), Positives = 47/126 (37%), Gaps = 22/126 (17%) Query: 257 NNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 NNL E +N + E TN Y + ++YN S V +TDGE Sbjct: 48 NNLKEAILHVNSMKADMEGTNIYKPLQ---NKIYNSSYGR-----SKDTTLNVFLLTDGE 99 Query: 316 NSGASAYQNTLNTLQICEYMRNAG---MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 + I E ++N +IY++ + + R +G+F V D Sbjct: 100 DYADP----------IIELVKNNNRAETRIYTLGIGEGCSIYLIKRVAEVGNGKFHIVGD 149 Query: 373 SRELLE 378 + ++ E Sbjct: 150 NEDINE 155 >gi|119575268|gb|EAW54873.1| integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor), isoform CRA_b [Homo sapiens] Length = 1179 Score = 39.5 bits (90), Expect = 0.99, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 63/187 (33%), Gaps = 24/187 (12%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 ++ N + + + ++G I Y E+ ++ + Y Sbjct: 189 DAVKNFLEKFVQ-GLDIGPTKTQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQYGG 247 Query: 275 --TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TNT+ A+ +A + Y+ + K ++ +TDGE+ S + Sbjct: 248 DLTNTFGAIQYARKYAYSAASGGRRSAT-----KVMVVVTDGESHDGSM------LKAVI 296 Query: 333 EYMRNAGMKIYSVAV----SAPPEGQDLL----RKCTD--SSGQFFAVNDSRELLESFDK 382 + + + + +AV + L + + FF V+D LLE Sbjct: 297 DQCNHDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGT 356 Query: 383 ITDKIQE 389 + ++I Sbjct: 357 LGEQIFS 363 >gi|296086006|emb|CBI31447.3| unnamed protein product [Vitis vinifera] Length = 478 Score = 39.5 bits (90), Expect = 1.00, Method: Composition-based stats. Identities = 7/61 (11%), Positives = 19/61 (31%), Gaps = 8/61 (13%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 +N L TN + + + + +I ++DG+++ + Sbjct: 221 ALQAVNSLVANGGTNIAEGLRKGAKVMED--------RKERNPVSSIILLSDGQDTYTTE 272 Query: 322 Y 322 Sbjct: 273 S 273 >gi|256390528|ref|YP_003112092.1| ricin B lectin [Catenulispora acidiphila DSM 44928] gi|256356754|gb|ACU70251.1| Ricin B lectin [Catenulispora acidiphila DSM 44928] Length = 732 Score = 39.5 bits (90), Expect = 1.00, Method: Composition-based stats. Identities = 27/357 (7%), Positives = 67/357 (18%), Gaps = 15/357 (4%) Query: 43 SIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQ 102 + S + + + DI + +P Sbjct: 191 DPSMSTSTIQGDINSVYNSQVNNEFGTQRNALLFAPGSYNVDIPVGYNTEVAGLGLSPDA 250 Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMED 162 + S +DV SM Sbjct: 251 VNITGGTVHVAGHTADGNATQNFWRDAENMEVSPPSGSTMWAVSQADPFRRMDVKGSMLL 310 Query: 163 LYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN 222 + N + + A + + G+ Sbjct: 311 YDNIHGSGGNWSSGGYVGDSRISGQINSGTQQQFLTQNTAMNGGWTGSNWNMVFVGDTNA 370 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMH 282 + I + + +++ + T ++ Sbjct: 371 PANTFPNPPDTTVAQTPVNREKPFIYVDASGTWQVFVPALRTNAQGPDWTNGTPAGTSIP 430 Query: 283 HAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG---------------ASAYQNTLN 327 + + ++S + K +I + A N Sbjct: 431 VSQFYIVKPGDTSATINAALSAGKSLIVTPGVYHLSSALNITSPNTVVLGLGLATLVPDN 490 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKIT 384 G+ I + +SA ++ + + + EL + F +I Sbjct: 491 GNAAITTADVDGIDIAGLLISANSTNSAVMMQIGPNGSTASHAANPTELQDVFFRIG 547 >gi|167902891|ref|ZP_02490096.1| hypothetical protein BpseN_11585 [Burkholderia pseudomallei NCTC 13177] Length = 593 Score = 39.5 bits (90), Expect = 1.00, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 22 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 80 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 81 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 128 >gi|239908149|ref|YP_002954890.1| hypothetical protein DMR_35130 [Desulfovibrio magneticus RS-1] gi|239798015|dbj|BAH77004.1| hypothetical protein [Desulfovibrio magneticus RS-1] Length = 328 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 17/141 (12%), Positives = 36/141 (25%), Gaps = 39/141 (27%) Query: 252 CTPLSNNLNEVKSRLNKLNPYEN---TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P + + + L L+ T A+ A + L + K V Sbjct: 142 VVPPTTDRAALVQALASLDTGAAGRKTAMGDAVGLAAKRLDESPGQA----------KAV 191 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 + DG ++ + + + G+ +++V V Sbjct: 192 VVFGDGRSNAG-----ETDPVPAAQAAVRHGVAVFAVGVGGDGPAPFLVAHPILGKQIIT 246 Query: 353 -----GQDLLRKCTDSSGQFF 368 + L + G F Sbjct: 247 EAAPVDEAALSAMAQAGGGAF 267 >gi|108762174|ref|YP_633287.1| hypothetical protein MXAN_5133 [Myxococcus xanthus DK 1622] gi|108466054|gb|ABF91239.1| conserved domain protein [Myxococcus xanthus DK 1622] Length = 383 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 18/165 (10%), Positives = 50/165 (30%), Gaps = 7/165 (4%) Query: 203 PAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEV 262 + + + V+ + K I + + G + L+++ V Sbjct: 38 TGSMYPCLAQVRKKLRGTVSRLMKEIPGIRIGIIAHGDYCDAGSTYVTKMLDLTDDEGAV 97 Query: 263 KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAY 322 ++++ + R+ + + + ++ I D G + Sbjct: 98 VRFVDRVEQTGGGDAPECYEFVLRQAQSLSWTV-------GYTRALVLIGDDVPHGPAQN 150 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF 367 + L+ + + G+ +Y V A ++ + SG F Sbjct: 151 PHKLDWRKEVAALGKMGIPVYGVQALARRHATAFYKELAEKSGGF 195 >gi|284989271|ref|YP_003407825.1| hypothetical protein Gobs_0677 [Geodermatophilus obscurus DSM 43160] gi|284062516|gb|ADB73454.1| conserved hypothetical protein [Geodermatophilus obscurus DSM 43160] Length = 146 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 17/145 (11%), Positives = 36/145 (24%), Gaps = 22/145 (15%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 A+++ L + A+D + + + D A +G ++ + + Sbjct: 22 AVLVPGLLLIVGLAVDGGAKVAATQRANAIADEAARAGGQALDLSAALTGQVRVDPAAAV 81 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 + + Q +T + LQ E G Sbjct: 82 AAVQNYLE---------------RSGVQGAVTVVDGDTLQVSTTIS-------EPTTFLG 119 Query: 123 LIPSALTNLSLRSTGIIERSSENLA 147 LI + T + A Sbjct: 120 LIGITTLTVEGTGTADLVTDQNGGA 144 >gi|116295258|ref|NP_002194.2| integrin alpha-2 precursor [Homo sapiens] gi|21105795|gb|AAM34795.1|AF512556_1 integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [Homo sapiens] gi|119575267|gb|EAW54872.1| integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor), isoform CRA_a [Homo sapiens] gi|151556518|gb|AAI48597.1| Integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [synthetic construct] gi|162319056|gb|AAI56716.1| Integrin, alpha 2 (CD49B, alpha 2 subunit of VLA-2 receptor) [synthetic construct] gi|168278403|dbj|BAG11081.1| integrin alpha-2 precursor [synthetic construct] Length = 1181 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 63/187 (33%), Gaps = 24/187 (12%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 ++ N + + + ++G I Y E+ ++ + Y Sbjct: 189 DAVKNFLEKFVQ-GLDIGPTKTQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQYGG 247 Query: 275 --TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TNT+ A+ +A + Y+ + K ++ +TDGE+ S + Sbjct: 248 DLTNTFGAIQYARKYAYSAASGGRRSAT-----KVMVVVTDGESHDGSM------LKAVI 296 Query: 333 EYMRNAGMKIYSVAV----SAPPEGQDLL----RKCTD--SSGQFFAVNDSRELLESFDK 382 + + + + +AV + L + + FF V+D LLE Sbjct: 297 DQCNHDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGT 356 Query: 383 ITDKIQE 389 + ++I Sbjct: 357 LGEQIFS 363 >gi|66802592|ref|XP_635168.1| hypothetical protein DDB_G0291658 [Dictyostelium discoideum AX4] gi|60463628|gb|EAL61813.1| hypothetical protein DDB_G0291658 [Dictyostelium discoideum AX4] Length = 409 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 26/265 (9%), Positives = 76/265 (28%), Gaps = 15/265 (5%) Query: 74 QGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSL 133 QG + D++ I+ I + + + + T +L Sbjct: 139 QGDKFTPDFIDLSSLVDISFRTFDKPISNSIVKGTIEMQNNGSKVSFTWTTDDNGTYSTL 198 Query: 134 RSTGIIERSSENL----AISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 + E + + + V+D+ S L++ + Sbjct: 199 LPECKFSINVEGIVDGRPVKMNEVIDILTSDVINNLEELKYIKPRSFEISQKCTNYNIKS 258 Query: 190 WSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 S+ + ++D+ ++ + K + S++ + ++ Sbjct: 259 KSEKSVLLIDVSRSMTGA-QLDIAKNNSKKFIAESDKFAIGAWSYSIKFFSDSWGT---- 313 Query: 250 NQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 S+ ++ S ++ L +T A+ A + + E G T + Sbjct: 314 ------SSKISAANSWIDGLCSDGHTEIKQAIEEAITKFKDADEFWILCDGDTDAFPDLQ 367 Query: 310 FITDGENSGASAYQNTLNTLQICEY 334 ++ ++ + N + + + Sbjct: 368 SWSNFYSNNSKYIINFVGIGGLSDE 392 >gi|317125804|ref|YP_004099916.1| von Willebrand factor A [Intrasporangium calvum DSM 43043] gi|315589892|gb|ADU49189.1| von Willebrand factor type A [Intrasporangium calvum DSM 43043] Length = 568 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 13/107 (12%), Positives = 36/107 (33%), Gaps = 4/107 (3%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T + Y + + + T V+ +TDG + + L Sbjct: 454 GGLTRSVGGGRGLYDTIVATYQRARATYT-KGQLNSVVIVTDGL-NDDDYGASLSVALSR 511 Query: 332 CEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSREL 376 + + + ++I V + P+ + + + G++ + ++L Sbjct: 512 VKKLVDPRNPIRITIVGFGSEPDAKAMTPFAQLTGGRYVNAAEPKDL 558 >gi|312136388|ref|YP_004003725.1| magnesium chelatase chli subunit [Methanothermus fervidus DSM 2088] gi|311224107|gb|ADP76963.1| magnesium chelatase ChlI subunit [Methanothermus fervidus DSM 2088] Length = 636 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 12/96 (12%), Positives = 28/96 (29%), Gaps = 8/96 (8%) Query: 249 GNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P + N ++ + + T A+ S K Sbjct: 518 SEIVVPPTKNYWQIVDEIERAPTGGRTPLSSALQQLISL-------SKREKMKEESLKVK 570 Query: 309 IFI-TDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 F+ TDG + + + + ++ G+++Y Sbjct: 571 AFMITDGRANVPLTDDIEKEMVNLAKNIKKEGIELY 606 >gi|124942|sp|P17301|ITA2_HUMAN RecName: Full=Integrin alpha-2; AltName: Full=CD49 antigen-like family member B; AltName: Full=Collagen receptor; AltName: Full=Platelet membrane glycoprotein Ia; Short=GPIa; AltName: Full=VLA-2 subunit alpha; AltName: CD_antigen=CD49b; Flags: Precursor gi|33907|emb|CAA34894.1| unnamed protein product [Homo sapiens] Length = 1181 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 63/187 (33%), Gaps = 24/187 (12%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 ++ N + + + ++G I Y E+ ++ + Y Sbjct: 189 DAVKNFLEKFVQ-GLDIGPTKTQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQYGG 247 Query: 275 --TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TNT+ A+ +A + Y+ + K ++ +TDGE+ S + Sbjct: 248 DLTNTFGAIQYARKYAYSAASGGRRSAT-----KVMVVVTDGESHDGSM------LKAVI 296 Query: 333 EYMRNAGMKIYSVAV----SAPPEGQDLL----RKCTD--SSGQFFAVNDSRELLESFDK 382 + + + + +AV + L + + FF V+D LLE Sbjct: 297 DQCNHDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGT 356 Query: 383 ITDKIQE 389 + ++I Sbjct: 357 LGEQIFS 363 >gi|88812999|ref|ZP_01128242.1| type 4 fimbrial biogenesis protein PilY1 [Nitrococcus mobilis Nb-231] gi|88789777|gb|EAR20901.1| type 4 fimbrial biogenesis protein PilY1 [Nitrococcus mobilis Nb-231] Length = 1179 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 31/298 (10%), Positives = 74/298 (24%), Gaps = 28/298 (9%) Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 + L + N S + +S + + D+ S Sbjct: 201 DRNLTTITGASRAFYPLGNDIRVGWQRLNTSTTIKPVRFFTSTHRTAFYNWLFDLPHSG- 259 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 + Y P + + S + N Sbjct: 260 ----GTPLRRALQRAGVYFKTNQPYYNNAVDSGDGSASCRQNYQVLMTDGLWNGGLSNGP 315 Query: 222 NSIQKAIQE-KKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 + + Q S + ++ + N + P Sbjct: 316 GNYDNSEQTLPDGTSYNPQAPFQDDNSNVLADLAFKYWAEDLSTLANDVPP--------- 366 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE-------NSGASAYQNTLNTLQICE 333 + ++ + + N + ++ T G + T + + + Sbjct: 367 --YMLQQGASASANYWNAQNDPANWQHLVNFTVGLGLGASLTDPVWGGSTFTGDYPNLAD 424 Query: 334 YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 +N + P DL +S GQFF+ +D + L +F I +++ +++ Sbjct: 425 GSKNWPAITS----NTPSTVSDLWHAAINSRGQFFSADDPKALAAAFKTILNRVADRT 478 >gi|300704937|ref|YP_003746540.1| hypothetical protein RCFBP_20766 [Ralstonia solanacearum CFBP2957] gi|299072601|emb|CBJ43951.1| conserved exported protein of unknown function [Ralstonia solanacearum CFBP2957] Length = 340 Score = 39.5 bits (90), Expect = 1.0, Method: Composition-based stats. Identities = 31/260 (11%), Positives = 72/260 (27%), Gaps = 10/260 (3%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLS-------GCASIVSDRTIKDPTTK 56 I++ + AID+AH+ +RN++Q+A DAA L+ + T+ Sbjct: 18 ILLIIFLAIGAMAIDIAHLFVVRNELQNAADAAALAGAAGLYPANPKPNWSNGVAQGTSA 77 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 ++ K ++ G + + Q I + +Q Sbjct: 78 IKLNASDNTKLTGGTVQAGYWNLTGSPAGMQSQSITPGSNDVAGVQVTVTRSPGNNGGPV 137 Query: 117 NLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 + +L + ++ + +I S+ V + + N Sbjct: 138 SGWLTWVFNGGAASIQATAVAVIAAPGSANPGSLFPVALNKCLFDLYWNYTTGQPLN--- 194 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV 236 + P S + A + + I +N+++ Sbjct: 195 DPSTGQPYVIDINTSYPPSGGTCASGEWTGFDGPTDASTEKNLVSSGNPTNISIGENINI 254 Query: 237 RIGTIAYNIGIVGNQCTPLS 256 G + ++ Sbjct: 255 STGVKTSVYNAIPPLPLTVT 274 >gi|300727144|ref|ZP_07060563.1| BatA protein [Prevotella bryantii B14] gi|299775688|gb|EFI72279.1| BatA protein [Prevotella bryantii B14] Length = 134 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 6/37 (16%), Positives = 13/37 (35%) Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + Q L + G F+ ++ +L + I Sbjct: 54 VDIDIQTLKEIAATTEGNFYRATNTAQLKSIYKDIDK 90 >gi|290960313|ref|YP_003491495.1| hypothetical protein SCAB_59371 [Streptomyces scabiei 87.22] gi|260649839|emb|CBG72955.1| conserved hypothetical protein [Streptomyces scabiei 87.22] Length = 449 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 31/261 (11%), Positives = 60/261 (22%), Gaps = 31/261 (11%) Query: 132 SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS 191 +N + + + Y P + Sbjct: 6 KSNVPQFSVEVYQNEYLPEGGREVNAIVTVSATGGGTIGTAVAAPHLYSPGQGPSAAVAL 65 Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 P D + L + + A+ + G Sbjct: 66 MVDCSGSMDYPPTKMRNARDATAAAIDTLRDGVHFAVIDGT------HVAREVYPGGGRL 119 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 S + K L +L+ T + A R L +E + I + Sbjct: 120 AVADSATRAQAKQALRRLSAGGGTAIGTWLRLADRLLASE----------DVAIRHGILL 169 Query: 312 TDGENSGASAYQNTLNTLQICE-----YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 TDG + + ++ L C R V E +++ + + G Sbjct: 170 TDGR-NEHESPEDLKAALDACAGRFTCDAR---------GVGTDWEVKEVTQIASALLGT 219 Query: 367 FFAVNDSRELLESFDKITDKI 387 V D L F ++ + Sbjct: 220 ADIVADPAALSADFTQMMEAA 240 >gi|253576852|ref|ZP_04854177.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786 str. D14] gi|251843719|gb|EES71742.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786 str. D14] Length = 247 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 15/97 (15%), Positives = 37/97 (38%), Gaps = 10/97 (10%) Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV---SA-PPEGQDLLRKCT 361 K ++ ITDG ++ + + R G+ + V + G + + Sbjct: 2 KQIMLITDGCSNVGES------PVMAAALARQEGITVNVVGIVDYGTIGELGSLEIAEIA 55 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + G + + L ++ +T K Q+++ A ++ Sbjct: 56 KAGGGMSRIVGTENLAQTMQMMTRKTVVQTIQQAVHK 92 >gi|218886590|ref|YP_002435911.1| von Willebrand factor A [Desulfovibrio vulgaris str. 'Miyazaki F'] gi|218757544|gb|ACL08443.1| von Willebrand factor type A [Desulfovibrio vulgaris str. 'Miyazaki F'] Length = 2478 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 33/363 (9%), Positives = 83/363 (22%), Gaps = 26/363 (7%) Query: 47 DRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAE 106 D T L AG ++ T + + Sbjct: 1558 DADAAGSGLTDVDTDHANLDFTAGSLPMTFSFDIAAGHEPHVTGLSTTTPISWAVDGGTG 1617 Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 + + L S + +E + + + DV S + Sbjct: 1618 NLIGSIDGHPAIILTLTPDSGSHAVGETVHVNVEVTLVDPLLHADGSTDVVVSGIHVSGT 1677 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG-------- 218 + + + + + + + A Sbjct: 1678 DGVSPAVSGTVSVNVDDASPTAVNDVHDYQGGITSNVVVMLDVSGSMNDDANGNAPGTDS 1737 Query: 219 ------NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + +N + A + ++V++ + + + LN Sbjct: 1738 RLTMAIDAINQLLHAYDDLGAVNVKLVWFDDSAQTHTGWLMGTTAVQQALTILEGNLNGG 1797 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS--GASAYQNTLNTLQ 330 TN A++ +L + F++DGE + S + Sbjct: 1798 GATNYDAAINLVMSQLGTTGTPPADKT-------VAYFLSDGEPNRPDGSEGISGSEQTT 1850 Query: 331 ICEYMRNAGMK-IYSVAVSAPPEGQD-LLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 ++ +Y++ + G L SG + + + ++ D + Sbjct: 1851 WETFLAANNFDMVYALGIGTGISGNTGELNPIGWESGTTSNADVVTIITD-MSQLADYLV 1909 Query: 389 EQS 391 Sbjct: 1910 STV 1912 >gi|167911128|ref|ZP_02498219.1| hypothetical protein Bpse112_11583 [Burkholderia pseudomallei 112] Length = 579 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 8 AAIWMLVAIAVLG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 66 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 67 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 114 >gi|114600323|ref|XP_526928.2| PREDICTED: integrin alpha-2 [Pan troglodytes] Length = 1181 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 63/187 (33%), Gaps = 24/187 (12%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 ++ N + + + ++G I Y E+ ++ + Y Sbjct: 189 DAVKNFLEKFVQ-GLDIGPTKTQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQYGG 247 Query: 275 --TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TNT+ A+ +A + Y+ + K ++ +TDGE+ S + Sbjct: 248 DLTNTFGAIQYARKYAYSAASGGRRSAT-----KVMVVVTDGESHDGSM------LKAVI 296 Query: 333 EYMRNAGMKIYSVAV----SAPPEGQDLL----RKCTD--SSGQFFAVNDSRELLESFDK 382 + + + + +AV + L + + FF V+D LLE Sbjct: 297 DQCNHDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGT 356 Query: 383 ITDKIQE 389 + ++I Sbjct: 357 LGEQIFS 363 >gi|254560515|ref|YP_003067610.1| hypothetical protein METDI2058 [Methylobacterium extorquens DM4] gi|254267793|emb|CAX23640.1| conserved hypothetical protein [Methylobacterium extorquens DM4] Length = 733 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 23/132 (17%), Positives = 44/132 (33%), Gaps = 16/132 (12%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 KS + L T + A R+ E+ + V+F+TDG A Sbjct: 408 AKSFVAGLQASGGTEMLAPLQAALRDATPEETGR---------LRQVVFLTDG------A 452 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 N R +++ V + + P G + G F ++ ++ E Sbjct: 453 IGNEAQIFSAIATERGRS-RLFMVGIGSAPNGYLMRHTAELGRGSFTQIDTPDQVTERMR 511 Query: 382 KITDKIQEQSVR 393 + K++ +V Sbjct: 512 ALLVKLESPAVT 523 >gi|134282290|ref|ZP_01768995.1| conserved hypothetical protein [Burkholderia pseudomallei 305] gi|134246328|gb|EBA46417.1| conserved hypothetical protein [Burkholderia pseudomallei 305] Length = 602 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 31 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 89 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 90 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 137 >gi|29828406|ref|NP_823040.1| hypothetical protein SAV_1864 [Streptomyces avermitilis MA-4680] gi|29605509|dbj|BAC69575.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 462 Score = 39.5 bits (90), Expect = 1.1, Method: Composition-based stats. Identities = 11/172 (6%), Positives = 34/172 (19%), Gaps = 16/172 (9%) Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPAN-RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 P + K+D + + + ++ A Sbjct: 31 ARPEDAPATEPLPVNFVFVVDTSGSMTGTKLDTVKSALQTIYRELRPADCLGIITFDHNV 90 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + ++ L T+ + + E+ Sbjct: 91 RTVLPAVAKQDLPP------ERFAEVVSALTTQGGTDIDLGVQYGIDEIS-------RHS 137 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 S R + +DG+ + + + + + + Sbjct: 138 VSGRTVNCLYLFSDGDPTSGERDWIKVRANVAAKL--RGDLTLSCFGFGSDA 187 >gi|167894484|ref|ZP_02481886.1| hypothetical protein Bpse7_12104 [Burkholderia pseudomallei 7894] Length = 587 Score = 39.1 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 16 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 74 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 75 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 122 >gi|168488150|ref|ZP_02712349.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP195] gi|154432978|gb|ABS82155.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|183572925|gb|EDT93453.1| cell wall surface anchor family protein [Streptococcus pneumoniae SP195] gi|332075307|gb|EGI85777.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA17570] Length = 886 Score = 39.1 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 12/131 (9%), Positives = 32/131 (24%), Gaps = 25/131 (19%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +++ ++ Y L S G+ + N Sbjct: 466 SEMKAVGYAVIGDPINGGYIWLN--WRESILAY--PFNSNTAKITNHGDPTRWYYNGN-- 519 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQFFAVNDS----RE 375 + G +++V + + ++ + + V D+ + Sbjct: 520 --------IAPDGYDVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQ 571 Query: 376 LLESFDKITDK 386 L F I + Sbjct: 572 LNRYFHTIVTE 582 >gi|32477946|ref|NP_870940.1| hypothetical protein RB13238 [Rhodopirellula baltica SH 1] gi|32448503|emb|CAD78018.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 164 Score = 39.1 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 13/136 (9%), Positives = 33/136 (24%), Gaps = 7/136 (5%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+++ V LF+T ++++ I+ +Q D A + + Sbjct: 24 FAVVLPVIMLFLTAMVEISRILM----LQHTADTAAYEAARCAMVPGATVTEAEWEAYAL 79 Query: 62 TIFKKQIKKHL---KQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 + + + N + ++ + TE Sbjct: 80 IEAAGLTNTAVTVTPAEITEETAFITVRVEVPANDNSWMLSSQFTDVVVASEVTLLTERS 139 Query: 119 FLKGLIPSALTNLSLR 134 + L Sbjct: 140 PIVRLTGIPGLKAKKS 155 >gi|254179953|ref|ZP_04886552.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] gi|184210493|gb|EDU07536.1| conserved hypothetical protein [Burkholderia pseudomallei 1655] Length = 602 Score = 39.1 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 31 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 89 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 90 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 137 >gi|149202861|ref|ZP_01879832.1| hypothetical protein RTM1035_19001 [Roseovarius sp. TM1035] gi|149143407|gb|EDM31443.1| hypothetical protein RTM1035_19001 [Roseovarius sp. TM1035] Length = 212 Score = 39.1 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 18/128 (14%), Positives = 36/128 (28%), Gaps = 29/128 (22%) Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L P T ++ A L + V+ +TDG + Sbjct: 80 LMPGGLTPIAASVAAAAEVLEYRTQP-----------GIVVLVTDGNETCGGTPC----A 124 Query: 329 LQICEYMRNAGMKIYSVAVSA-------------PPEGQDLLRKCTD-SSGQFFAVNDSR 374 L + ++ + +GQ + + D + G F + Sbjct: 125 LGAALTAEARDLTVHVIGFRVVHDPFSWNSPEAQGYDGQTVAKCLADATGGVFVSTETVD 184 Query: 375 ELLESFDK 382 EL+E+ + Sbjct: 185 ELVEALRE 192 >gi|126440828|ref|YP_001058814.1| hypothetical protein BURPS668_1775 [Burkholderia pseudomallei 668] gi|126220321|gb|ABN83827.1| conserved hypothetical protein [Burkholderia pseudomallei 668] Length = 602 Score = 39.1 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 31 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 89 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 90 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 137 >gi|300312347|ref|YP_003776439.1| hemagglutinin/hemolysin-related protein [Herbaspirillum seropedicae SmR1] gi|300075132|gb|ADJ64531.1| hemagglutinin/hemolysin-related protein [Herbaspirillum seropedicae SmR1] Length = 3215 Score = 39.1 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 24/199 (12%), Positives = 46/199 (23%) Query: 9 CFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQI 68 A D A + + + LD + AS+ + + P K+Q S + Sbjct: 1393 LITVTAMASDTAGNVRAAAETKFTLDTTAPAVSASVTATGGGQPPAVTKEQISFVVTFNE 1452 Query: 69 KKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSAL 128 K + K + N Y ++P N L + Sbjct: 1453 KLRVPVTIDNFTATNGTVNKVEAVTGAGNTNANSYTVLVTPNPDMPAGNKVELKLKANGA 1512 Query: 129 TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 T + + E + + +I S+ N + Sbjct: 1513 TPVLDVAGNPAELGAVIASQAIDTRGPTVISVPADNAPPAPPTMNKDIVFNVRFDEMGSV 1572 Query: 189 FWSKNTTKSKYAPAPAPAN 207 N + Sbjct: 1573 SQMGNIGVGNFKAINGKVT 1591 >gi|167816001|ref|ZP_02447681.1| hypothetical protein Bpse9_12727 [Burkholderia pseudomallei 91] Length = 588 Score = 39.1 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 17 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 75 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 76 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 123 >gi|257886572|ref|ZP_05666225.1| predicted protein [Enterococcus faecium 1,141,733] gi|257822626|gb|EEV49558.1| predicted protein [Enterococcus faecium 1,141,733] Length = 689 Score = 39.1 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 19/82 (23%), Positives = 35/82 (42%), Gaps = 1/82 (1%) Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV- 370 TD S+ TL ++NAG+K YSVA + G+ + R S + ++ Sbjct: 271 TDPNAVSVSSSLINDATLGTIISIKNAGIKCYSVATAPSSRGEYIGRNLASSPNNYLSID 330 Query: 371 NDSRELLESFDKITDKIQEQSV 392 + L + +I + + + V Sbjct: 331 ENLTGLGNALKEIANGMDKTIV 352 >gi|206575540|ref|YP_002235854.1| von Willebrand factor type A domain protein [Klebsiella pneumoniae 342] gi|206570384|gb|ACI12030.1| von Willebrand factor type A domain protein [Klebsiella pneumoniae 342] Length = 346 Score = 39.1 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 9/85 (10%), Positives = 27/85 (31%), Gaps = 6/85 (7%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 +L TN A+ ++ ++ + V +TDG + + + Sbjct: 72 RLPVGGGTNLATALRELTTQIDSQVRKTTQEEKGDWKP-VVYLLTDGRPTDDISAEVKRW 130 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPE 352 + + + +V + + + Sbjct: 131 NEFYAKK-----VNMIAVGIGSSVD 150 >gi|218463216|ref|ZP_03503307.1| hypothetical protein RetlK5_28967 [Rhizobium etli Kim 5] Length = 227 Score = 39.1 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 15/121 (12%), Positives = 36/121 (29%), Gaps = 18/121 (14%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + +++ TN Y A +++ S ++ +TDG + S Sbjct: 118 LLNEISRQKANGGTNMYACAERALQQIAGTDRLS-------TYLPAIVIMTDGRSDDQSR 170 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESF 380 + + I+ + + L +S + F +L +F Sbjct: 171 AFMSEW------NAIEPRVPIFGITFG--DADKTQLDSLAKQTSARVFD--GGSDLATAF 220 Query: 381 D 381 Sbjct: 221 R 221 >gi|149919601|ref|ZP_01908080.1| hypothetical protein PPSIR1_07008 [Plesiocystis pacifica SIR-1] gi|149819544|gb|EDM78972.1| hypothetical protein PPSIR1_07008 [Plesiocystis pacifica SIR-1] Length = 349 Score = 39.1 bits (89), Expect = 1.1, Method: Composition-based stats. Identities = 28/197 (14%), Positives = 68/197 (34%), Gaps = 26/197 (13%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYN---IGIVGNQCTPLSNN-LNEVKSRLNKLN 270 ++ G+ + + +R G Y+ + TPL+ N L ++ + Sbjct: 100 QAVGDTLLDPDVGVVAPLQGDIRFGLSLYDNPGDMCPRVESTPLALNALADMTALYQSAA 159 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG---ASAYQNTLN 327 P +T T A+ + + + +K ++ TDGE + + Sbjct: 160 PEGDTPTGSALTSVADVVAQDPDPG---------EKVIVLATDGEPDTCAQPNPDEGQPE 210 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT---------DSSGQFFAVNDSRELLE 378 + + G++ V+V + L+ S ++ D L++ Sbjct: 211 AVAAAQAAYAQGVRTVIVSVGSGISADH-LQDMANAGAGVQPGGSDAVYYQALDQASLID 269 Query: 379 SFDKITDKIQEQSVRIA 395 +F +I ++E ++ + Sbjct: 270 AFSEIIAGVRECTIDLD 286 >gi|121599267|ref|YP_993091.1| hypothetical protein BMASAVP1_A1771 [Burkholderia mallei SAVP1] gi|124385206|ref|YP_001026132.1| hypothetical protein BMA10229_A0124 [Burkholderia mallei NCTC 10229] gi|126450090|ref|YP_001080596.1| hypothetical protein BMA10247_1041 [Burkholderia mallei NCTC 10247] gi|167002573|ref|ZP_02268363.1| conserved hypothetical protein [Burkholderia mallei PRL-20] gi|217423806|ref|ZP_03455307.1| conserved hypothetical protein [Burkholderia pseudomallei 576] gi|238563902|ref|ZP_00438064.2| membrane protein [Burkholderia mallei GB8 horse 4] gi|254177770|ref|ZP_04884425.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399] gi|254206222|ref|ZP_04912574.1| conserved hypothetical protein [Burkholderia mallei JHU] gi|121228077|gb|ABM50595.1| conserved hypothetical protein [Burkholderia mallei SAVP1] gi|124293226|gb|ABN02495.1| conserved hypothetical protein [Burkholderia mallei NCTC 10229] gi|126242960|gb|ABO06053.1| conserved hypothetical protein [Burkholderia mallei NCTC 10247] gi|147753665|gb|EDK60730.1| conserved hypothetical protein [Burkholderia mallei JHU] gi|160698809|gb|EDP88779.1| conserved hypothetical protein [Burkholderia mallei ATCC 10399] gi|217393664|gb|EEC33685.1| conserved hypothetical protein [Burkholderia pseudomallei 576] gi|238519717|gb|EEP83185.1| membrane protein [Burkholderia mallei GB8 horse 4] gi|243061787|gb|EES43973.1| conserved hypothetical protein [Burkholderia mallei PRL-20] Length = 602 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 31 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 89 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 90 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 137 >gi|126451635|ref|YP_001066065.1| hypothetical protein BURPS1106A_1796 [Burkholderia pseudomallei 1106a] gi|242315320|ref|ZP_04814336.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] gi|126225277|gb|ABN88817.1| conserved hypothetical protein [Burkholderia pseudomallei 1106a] gi|242138559|gb|EES24961.1| conserved hypothetical protein [Burkholderia pseudomallei 1106b] Length = 602 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 31 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 89 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 90 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 137 >gi|76811718|ref|YP_333349.1| hypothetical protein BURPS1710b_1950 [Burkholderia pseudomallei 1710b] gi|237812074|ref|YP_002896525.1| membrane protein [Burkholderia pseudomallei MSHR346] gi|254188638|ref|ZP_04895149.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|254261132|ref|ZP_04952186.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] gi|76581171|gb|ABA50646.1| putative membrane protein [Burkholderia pseudomallei 1710b] gi|157936317|gb|EDO91987.1| conserved hypothetical protein [Burkholderia pseudomallei Pasteur 52237] gi|237503192|gb|ACQ95510.1| membrane protein [Burkholderia pseudomallei MSHR346] gi|254219821|gb|EET09205.1| conserved hypothetical protein [Burkholderia pseudomallei 1710a] Length = 602 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 31 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 89 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 90 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 137 >gi|118581112|ref|YP_902362.1| TadE family protein [Pelobacter propionicus DSM 2379] gi|118503822|gb|ABL00305.1| TadE family protein [Pelobacter propionicus DSM 2379] Length = 387 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 24/280 (8%), Positives = 71/280 (25%), Gaps = 25/280 (8%) Query: 13 ITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHL 72 A+D +++ + Q+Q+A DAA L+G A ++ D ++ + + + Sbjct: 32 AAMAMDFSYMYVAKGQLQNAADAAALAGAAMLIRDDGLEPTADDEAAARDKAAQFALANK 91 Query: 73 KQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT----ENLFLKGLIPSAL 128 G I + + + +A + + + L Sbjct: 92 SAGQNIIIDKSSDITFGYWDNHYTSGGSPINAIQVRAARDTASAGGQVPIIFGRLFGWDK 151 Query: 129 TNLSLRS-----------------TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 + + S + +++D S + + Sbjct: 152 MGAAAIATAAIPARATSSIALCPDFCQGGIPSGLKEFTPPVLMDTGPSTPSVNIYAWTSY 211 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK--IDVLIESAGNLVNSIQKAIQ 229 ++ ++ + T + + A ++ + +S K Sbjct: 212 DSNVTSTSDVRDLICSQNNISKTACGETIYSSMGAQANSLCNLESQMYNPYFDSGNKDTI 271 Query: 230 EKKN--LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 N + + + + + + Sbjct: 272 TIANGVNTPGWWMVVPITQSCPPGAQGNAWDPKPIIGYAS 311 >gi|258627123|ref|ZP_05721919.1| hypothetical protein VMB_32200 [Vibrio mimicus VM603] gi|258580641|gb|EEW05594.1| hypothetical protein VMB_32200 [Vibrio mimicus VM603] Length = 335 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 25/150 (16%), Positives = 55/150 (36%), Gaps = 23/150 (15%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + N + L++ ++TN A+ + ST Sbjct: 149 GDAAFIQTPFTADQNVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQSP--------ST 200 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDL 356 + ++ +TDG ++G+ + + +R IY +A+ P D+ Sbjct: 201 SQDQIMLVLTDGNDTGSFVSPVDAAKIAAAKGIR-----IYVIAMGDPENVGEQPLDMDV 255 Query: 357 LRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + + + + + F D +L E++ I Sbjct: 256 VNRVSSLTQARSFVAIDQPQLNEAYQVIDQ 285 >gi|38233798|ref|NP_939565.1| hypothetical protein DIP1208 [Corynebacterium diphtheriae NCTC 13129] gi|38200059|emb|CAE49735.1| Putative secreted protein [Corynebacterium diphtheriae] Length = 470 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 31/357 (8%), Positives = 89/357 (24%), Gaps = 19/357 (5%) Query: 31 SALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQ 90 +A D ++ D + P +S + Sbjct: 125 AATDQVGIARREDATEDAPVTYPVKSDAVSSALVAAFTHN----NDASAATKMLRDFSET 180 Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 + + + + + + P L+ + E Sbjct: 181 TIDSAVAQSAPKIVVSKSETPQGYSFEEIQNLSKPLRAVALNPTDSVNEEVVRAGADFGT 240 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTK-----SKYAPAPAP 205 + + + + L + P S +++ Sbjct: 241 AIAANTPDAQKPQKLSQEVATAAEILASLTDSSPAVDSQSNQSPRDILMLLDTSEYINGV 300 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + I ++ L+N QK + K + + + GI + L+ + + + Sbjct: 301 STDGIRWFDAASHGLINLAQKQEERGKAIGLWNYSSPLTPGITQGWRSNLNFDTLDAARQ 360 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 + + T P A N G+T + D + Sbjct: 361 VKDVLGNFGTGGQPQTLAATSAALNYAVEHAQHAGNTSVVIVFTGTND----------SF 410 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 + + + + ++++ + + ++ + D++G + + EL + Sbjct: 411 TGLAESVRKAQESHVSLHAIQIGDGQDDTNIAQAAKDTNGSYVHATSAAELDRAIAS 467 >gi|304320962|ref|YP_003854605.1| hypothetical protein PB2503_06987 [Parvularcula bermudensis HTCC2503] gi|303299864|gb|ADM09463.1| hypothetical protein PB2503_06987 [Parvularcula bermudensis HTCC2503] Length = 439 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 11/232 (4%), Positives = 56/232 (24%), Gaps = 14/232 (6%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 I++ +D + + +++ +++ A++ + + + D Sbjct: 39 ILLVPSLGLSALIVDGSRMRNAYLELE------IVAESAALAAAQQLPDTDAATSAAVDY 92 Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEI-PTENLFLKG 122 + + + + + + ++ + E + Sbjct: 93 AETNLDPTVYGEVVKASDIEFGQYDEASSTFTSGGDATAVKVTARREAERGNSLVTLFGA 152 Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 +I NLS + +++ + + + + + + + Sbjct: 153 VIGRKEINLSATAIALLDYTEMDPVCILAL-----GHHLYGVDMDILID--VDIPDCGIQ 205 Query: 183 PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNL 234 K+ ++ A ++ + + Sbjct: 206 VNSDSHTALKSRYNAEVNAAYIHVVGQVHGNTNLLNPTPVEGVDPVDDPYAD 257 >gi|325272563|ref|ZP_08138930.1| hypothetical protein G1E_06477 [Pseudomonas sp. TJI-51] gi|324102296|gb|EGB99775.1| hypothetical protein G1E_06477 [Pseudomonas sp. TJI-51] Length = 1125 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 28/275 (10%), Positives = 72/275 (26%), Gaps = 14/275 (5%) Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ 110 + + K + I + T + + K Sbjct: 458 PEGGSVKLVAISTALTDIDGSETLSVSLSGIPKGTVLSDGAGHTVTVGSTPVDVTGWKLS 517 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 T + KG + + + S G ++ N+ +++ + Sbjct: 518 SLTLTPPTYYKGSFDVTVISTATESVGGSAITTGNIPVTVY---PATYKASVGTSGNDAM 574 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK-IDVLIESAGNLVNSIQKAIQ 229 + ++ + KN + + I ++ ++++ ++ Sbjct: 575 TGSEGNDIIVADVSGLNVVAGKNYNIAFIVDSSGSMTDDSIAAAKSQLASVFSTLKASLG 634 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSN-----NLNEVKSRLNKLNPYENTNTYPAMHHA 284 + +V I + ++ + N L++ L V + TN A Sbjct: 635 SDTSGTVNIFLVDFDSQVNMNVAVNLADPNALSQLQAVLKSMQGGTEGGGTNYEDAFKTT 694 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGA 319 + G+ FITDG+ + Sbjct: 695 ANFFKSTM-----ATGNAGADNLTYFITDGKPTFY 724 >gi|167824377|ref|ZP_02455848.1| hypothetical protein Bpseu9_11953 [Burkholderia pseudomallei 9] Length = 589 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 18 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 76 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 77 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 124 >gi|153008592|ref|YP_001369807.1| hypothetical protein Oant_1261 [Ochrobactrum anthropi ATCC 49188] gi|151560480|gb|ABS13978.1| conserved hypothetical protein [Ochrobactrum anthropi ATCC 49188] Length = 576 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 20/196 (10%), Positives = 51/196 (26%), Gaps = 16/196 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQS-----------ALDAAVLSGCASIVSDRT 49 M A++ + +++D + + R Q+QS +L A + + ++ Sbjct: 19 MAALVSPIFLAVAAFSVDTSSLFLERRQLQSMADFAAVAGAASLSQANDAVLRQLRANG- 77 Query: 50 IKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 + ++ + + D ++ + +P Sbjct: 78 LDPVLMTGAYDPSVVNGKTDNKTRVWVEKGNYFPDKSRAVEKRFVAGGTSPDAVRVR--- 134 Query: 110 QYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHN 169 P F + I + + E + + + + D S L Sbjct: 135 -LARPGNLYFGQAFINRPALGATGMAATKAEAAFWIGSRLLSLNTDQSVLNGLLGGLLGT 193 Query: 170 DNNNMTSNKYLLPPPP 185 N + L Sbjct: 194 TLNLKLVDYNALAATD 209 >gi|149922555|ref|ZP_01910985.1| putative lipoprotein [Plesiocystis pacifica SIR-1] gi|149816582|gb|EDM76077.1| putative lipoprotein [Plesiocystis pacifica SIR-1] Length = 486 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 10/86 (11%), Positives = 28/86 (32%), Gaps = 10/86 (11%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 +N + ++ + P T+ + + Y + V+ ++DG Sbjct: 214 SNDATLLDTIDAIEPGGGTDLHAGLEQGYALAQANFSADRINR--------VVLVSDGGA 265 Query: 317 SGASAYQNTLNTLQICEYMRNAGMKI 342 + + + Q+ E G+ + Sbjct: 266 NLG--FTDAELIAQMAELEDGEGIYM 289 >gi|226196351|ref|ZP_03791933.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] gi|254297788|ref|ZP_04965241.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|157807412|gb|EDO84582.1| conserved hypothetical protein [Burkholderia pseudomallei 406e] gi|225931568|gb|EEH27573.1| conserved hypothetical protein [Burkholderia pseudomallei Pakistan 9] Length = 602 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 31 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 89 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 90 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 137 >gi|310825520|ref|YP_003957878.1| hypothetical protein STAUR_8297 [Stigmatella aurantiaca DW4/3-1] gi|309398592|gb|ADO76051.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 1414 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 27/320 (8%), Positives = 81/320 (25%), Gaps = 37/320 (11%) Query: 81 NAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE 140 N + AQ A+ + + S + + + + + + Sbjct: 171 NTTNAAQYARCLTCLNTRGYFRVYDSSTSDNRVNPNFILWGRFLNFNPPKYVTVRAALKQ 230 Query: 141 R------SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNT 194 + ++ + M+ + + S++ ++ T Sbjct: 231 VLKNLKGARAGISTFTQTSSANTLKMQPGCQGILANADAFDSSRTGFIATINSLTFNTAT 290 Query: 195 TKSKYAPAPAPANR-KIDVLIESAGNLVNSIQ---KAIQEKKNLSVRIGTIAYNIGIVGN 250 ++ V + G + + KN + + G + Sbjct: 291 PLARALLNTGYYFTSDQTVYKDVFGFGATNPTVGYAYPTDFKNEPLSSENRSVCWGDQAS 350 Query: 251 QCTPLSNNL-------NEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTR 303 L++ + V ++ + P+ A + N + Sbjct: 351 AVIILTDGEPNTDTLGSAVVQKIRS-RNGGPVSCPPSAPCA---------DTPNDANAML 400 Query: 304 LKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD- 362 + T+ +T + +++Y+V + LL+ Sbjct: 401 DDVAKLLFTNDLQ------YSTPPIVGALNTSGQQSLRVYTVGFAIDSN---LLKNAAAV 451 Query: 363 SSGQFFAVNDSRELLESFDK 382 G+ + +D+ L ++ Sbjct: 452 GGGRSYTAHDAAGLRQALQD 471 >gi|283807019|pdb|3IJ2|A Chain A, Ligand-Receptor Structure gi|283807021|pdb|3IJ2|B Chain B, Ligand-Receptor Structure Length = 230 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 15/173 (8%), Positives = 49/173 (28%), Gaps = 2/173 (1%) Query: 27 NQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIA 86 ++Q +LD A L+ S + T + + + L + Sbjct: 20 TKLQHSLDTA-LAAARSAPTAPIAARVTGQTRNITVDPRLFKAARLHSPRVLFSTQPPPT 78 Query: 87 QKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE-RSSEN 145 +++ + + + ++ + ++ + + Sbjct: 79 SSDTLDLDFQAHGTIPFNRTHRSAASSTHPVFHMGEFSVCDSVSVWVGDKTTATDIKGKE 138 Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK 198 + + + ++ S + + K +N + S + S+ + T K Sbjct: 139 VTVLAEVNINNSVFRQYFFETKCRASNPVESGCRGIDSKHWNSYCTTTHTFVK 191 >gi|268532316|ref|XP_002631286.1| Hypothetical protein CBG03099 [Caenorhabditis briggsae] gi|187036874|emb|CAP23539.1| hypothetical protein CBG_03099 [Caenorhabditis briggsae AF16] Length = 406 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 19/156 (12%), Positives = 51/156 (32%), Gaps = 9/156 (5%) Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTI---AYNIGIVGNQCTPLSNNLNEVKSRL-NK 268 + + ++ SI K R + Y+ S + +++ + + + Sbjct: 61 VRNTLTQVLGSISTIGPVKYPADPRSTCVGVVTYDDNATTQSQLDASTSFSDLYNVIQSS 120 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 + ++TN + + + +K VI +A L+ Sbjct: 121 MVAVDSTN-NSYLSLGLLAAEKAFKDGRSRSYRYNYRKVVIAFVADYQGQGTA----LDA 175 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 + I +++ + I +VA ++ P Q + + Sbjct: 176 MPIANRLKDNTVSIITVACTSDPVKQAAIAAIASTG 211 >gi|294139943|ref|YP_003555921.1| type IV pilin biogenesis protein [Shewanella violacea DSS12] gi|293326412|dbj|BAJ01143.1| type IV pilin biogenesis protein, putative [Shewanella violacea DSS12] Length = 1227 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 9/55 (16%), Positives = 21/55 (38%), Gaps = 2/55 (3%) Query: 338 AGMKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + +++ S L + G +F +++ L S + +KI E+ Sbjct: 484 QHVSTFTIGFSEGAASAASLLKKTAAVGGGSYFDATNAKLLQGSLQQAVNKILEK 538 >gi|68072835|ref|XP_678331.1| von willebrand factor a-domain-related protein, [Plasmodium berghei strain ANKA] gi|15077688|gb|AAK83296.1| micronemal protein WARP [Plasmodium berghei] gi|29838451|gb|AAO92597.1| von Willerbrand factor A domain-related ookinete/early oocyst protein [Plasmodium berghei] gi|56498766|emb|CAH99142.1| von willebrand factor a-domain-related protein, putative [Plasmodium berghei] Length = 303 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 22/182 (12%), Positives = 52/182 (28%), Gaps = 8/182 (4%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 P + N + +K + + L + + A K + I A Sbjct: 87 PTPGNYCDNYYDITLVVENSSFVQKDYWMKGTIPFLESMARNARVSKDKAHMSIILFAGT 146 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 ++ +S + ++ ++ L+ T++ +A + + T Sbjct: 147 QNLIVPFTDEISQDKEKLIEKIRTLD-DAGTDSNTLYVYALEYAFEKVIFGEGTRS--DA 203 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD-LLRKCTDS 363 K + G + N + E + + +K+ V + LL C Sbjct: 204 PKIAVLFYYGFD----YGANKSLIPDVVEDYKQSNIKLIIVGIGLTLRENALLLADCKSE 259 Query: 364 SG 365 Sbjct: 260 GD 261 >gi|313764974|gb|EFS36338.1| conserved hypothetical protein [Propionibacterium acnes HL013PA1] gi|314916261|gb|EFS80092.1| conserved hypothetical protein [Propionibacterium acnes HL005PA4] gi|314917531|gb|EFS81362.1| conserved hypothetical protein [Propionibacterium acnes HL050PA1] gi|314921865|gb|EFS85696.1| conserved hypothetical protein [Propionibacterium acnes HL050PA3] gi|314955334|gb|EFS99739.1| conserved hypothetical protein [Propionibacterium acnes HL027PA1] gi|315102367|gb|EFT74343.1| conserved hypothetical protein [Propionibacterium acnes HL046PA1] gi|315109816|gb|EFT81792.1| conserved hypothetical protein [Propionibacterium acnes HL030PA2] gi|327334607|gb|EGE76318.1| putative membrane protein [Propionibacterium acnes HL097PA1] gi|327454298|gb|EGF00953.1| hypothetical protein HMPREF9581_00471 [Propionibacterium acnes HL087PA3] gi|327456363|gb|EGF03018.1| hypothetical protein HMPREF9586_00740 [Propionibacterium acnes HL083PA2] gi|328756057|gb|EGF69673.1| hypothetical protein HMPREF9579_00544 [Propionibacterium acnes HL087PA1] gi|328758902|gb|EGF72518.1| hypothetical protein HMPREF9588_00650 [Propionibacterium acnes HL025PA2] Length = 169 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 16/134 (11%), Positives = 34/134 (25%), Gaps = 17/134 (12%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+I+ + A I ++ QSA A +G + I Sbjct: 43 AALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVGEGIA---------- 92 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + ++ L + + + ++ I +L + Sbjct: 93 -VGQRVGLAELAGTRCSNPAIAVDSSDLTLPV------GSAGTTSARVSCTIKLSDLLVP 145 Query: 122 GLIPSALTNLSLRS 135 G+ S S Sbjct: 146 GMPGSFHIESVAHS 159 >gi|198415896|ref|XP_002125135.1| PREDICTED: similar to putative calcium activated chloride channel-like protein 1; eCLCA1 [Ciona intestinalis] Length = 1580 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 25/262 (9%), Positives = 68/262 (25%), Gaps = 27/262 (10%) Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPK 186 ++ +T + + + S S N + + + Sbjct: 274 PSDPVNQHNTEADNEQNAKCNLRSTWDVITSTSDFSGGSNPPNPTLTNLAPTFRVVRVAA 333 Query: 187 KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIG 246 + S + + + S+ + + I Sbjct: 334 SRRFVLVLDVSGSMSGNRLLMMR----QSAGDFISTSLPDGDKVGIVQFHSSANLMMEIR 389 Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 + +Q + + + + + +T ++ A E+ + Sbjct: 390 QISSQL-----DRVAIAAGIPGI-AGGSTCIGCGIYAAMNEMERHDANETC--------G 435 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 +I +TDG+ + + + + + ++ + L+ T + GQ Sbjct: 436 NIIVLTDGKENQPPYVND------VSQLAIQKNCVVNAILFTTTENS-ALVDLVTATGGQ 488 Query: 367 FFAVNDSR--ELLESFDKITDK 386 +F D L+ SF I Sbjct: 489 WFFAQDRDLKRLMGSFAVIAAN 510 >gi|149176866|ref|ZP_01855476.1| BatB [Planctomyces maris DSM 8797] gi|148844303|gb|EDL58656.1| BatB [Planctomyces maris DSM 8797] Length = 798 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 22/140 (15%), Positives = 48/140 (34%), Gaps = 20/140 (14%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP----YEN 274 + ++ ++ I++ + Q PL+++ + K L+ + P Sbjct: 111 SRLDRAKQQIKDMVDEMSGDRVGLVVFAGETRQSVPLTSHYEDFKQSLDAVGPHSVRRGG 170 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 + A+ A + I T K ++ TDGE Q + E Sbjct: 171 SLLGDAIRSA----------TAGFIDKTNDHKAIVVFTDGE------DQESKPVEAAKEA 214 Query: 335 MRNAGMKIYSVAVSAPPEGQ 354 G++I++V + +G Sbjct: 215 FTKNGIRIFTVGLGDMDQGA 234 >gi|153214688|ref|ZP_01949548.1| RTX protein [Vibrio cholerae 1587] gi|124115210|gb|EAY34030.1| RTX protein [Vibrio cholerae 1587] Length = 2093 Score = 39.1 bits (89), Expect = 1.2, Method: Composition-based stats. Identities = 28/320 (8%), Positives = 88/320 (27%), Gaps = 16/320 (5%) Query: 47 DRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAE 106 I D ++ + T L + + + G + Sbjct: 1346 QGGIADGSSMPNSQDTTNNTSADGQLNLSNVSQLSMGIPTDNYTSDGVVISWTLSADKQT 1405 Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM-VLDVSRSMEDLYL 165 F + + + L + ++LAI+I + + + ++ + Sbjct: 1406 LTGSAGANKVVEFTLDNLGNVNSTLHAPIDHPNKSGEDSLAINIPLEAKNATGAIGTGKV 1465 Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKN----TTKSKYAPAPAPANRKIDVLIESAGNLV 221 +++ + + + + T ++ V+ +A L+ Sbjct: 1466 TLVIEDDAPVAKEVFHVAESELKQGANVQLILDTSGSMGEPAGNGQTRLKVMQTAALQLL 1525 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNPYENTNTYPA 280 + + V++ + ++ ++E +++L T+ A Sbjct: 1526 SEYSALGET----RVQLIEFYSDSRYYVSEINGSKWMTVDEASEHIDRLYAGGGTDYDDA 1581 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 A + + F++DG+ + N L +++R+ + Sbjct: 1582 TKMAADIWDDNDGDMIAGGSNISY-----FLSDGQPNQGEELSNNDR-LDWEKHLRDHNV 1635 Query: 341 KIYSVAVSAPPEGQDLLRKC 360 + + +L + Sbjct: 1636 TALAYGMGNDVPQGELNKVA 1655 >gi|13476804|ref|NP_108373.1| hypothetical protein mll8236 [Mesorhizobium loti MAFF303099] gi|14027565|dbj|BAB53834.1| mll8236 [Mesorhizobium loti MAFF303099] Length = 379 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/249 (11%), Positives = 63/249 (25%), Gaps = 20/249 (8%) Query: 12 FITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKH 71 + AIDL Q+Q++LDAA L+ + P+ + + + Sbjct: 1 MVAGAIDLTGTSDDAAQLQNSLDAAGLAIGTKYL-------PSMAASDVAALGLTFFAAN 53 Query: 72 LKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNL 131 L A A + +P Y + P+ + Sbjct: 54 LSLADQQENADSVSAFSATAS-----GDPSAYYISLSSSINRPSFINGAAPWPAHRSATV 108 Query: 132 SLRSTGIIER----SSENLAISICMVLDVSRSMEDLYLQKHNDNNNM---TSNKYLLPPP 184 + + A+S+ DV+ S + + ++ Sbjct: 109 KMNPGAQACVLALDPHASAAVSLQGSTDVAMSSCVIAANSDASDAVRRGGSAQISAGCVS 168 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 + + + A AP + A + + + + + Y Sbjct: 169 TVGGTYGLSPPSANLA-CGAPLEHQYASFDPLADVVPPAYTLCLPVPNGKNYTLSPGTYC 227 Query: 245 IGIVGNQCT 253 + T Sbjct: 228 DKTLSGNIT 236 >gi|111657697|ref|ZP_01408424.1| hypothetical protein SpneT_02001110 [Streptococcus pneumoniae TIGR4] Length = 866 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 12/131 (9%), Positives = 32/131 (24%), Gaps = 25/131 (19%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +++ ++ Y L S G+ + N Sbjct: 446 SEMKAAGYAVIGDPINGGYIWLN--WRESILAY--PFNSNTAKITNHGDPTRWYYNGN-- 499 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQFFAVNDS----RE 375 + G +++V + + ++ + + V D+ + Sbjct: 500 --------IAPDGYDVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQ 551 Query: 376 LLESFDKITDK 386 L F I + Sbjct: 552 LNRYFHTIVTE 562 >gi|330448513|ref|ZP_08312161.1| hemolysin-type calcium-binding repeat family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328492704|dbj|GAA06658.1| hemolysin-type calcium-binding repeat family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 899 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 29/192 (15%), Positives = 60/192 (31%), Gaps = 38/192 (19%) Query: 210 IDVLIESAGNLVNSIQKAIQEKK----NLSVRIGTIAYNIGIVGNQCTPLSNNLN--EVK 263 +D++ +S L I I +K N+ + N + T N ++ Sbjct: 472 MDIVKQSYQTLTQEILHNIDDKSKLEFNIVTFSNAVRGNTTFHYDDSTHQFVNKQNVTIE 531 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + ++ L T + A + + + V F++DG+ + Sbjct: 532 NYIHDLTAGGGTQFEWPLKDASAHITDSSKR-----------NVVYFLSDGK------DE 574 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQF-----------FAVND 372 + L+T I G +I S+ V P + + + + + Sbjct: 575 DKLDTTGIHFL---KGTEIVSIGVG-PSADAKQMGEIAQMGTGYDKDNPNAPSYSKVITN 630 Query: 373 SRELLESFDKIT 384 EL + F I Sbjct: 631 GNELNDIFHNIG 642 >gi|303242523|ref|ZP_07329002.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] gi|302589941|gb|EFL59710.1| von Willebrand factor type A [Acetivibrio cellulolyticus CD2] Length = 1855 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 45/326 (13%), Positives = 103/326 (31%), Gaps = 39/326 (11%) Query: 71 HLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTN 130 ++ + S I G+ + N ++ A +E+ Sbjct: 653 YMPELSNIEGIIGNPVEITCEANIISANLTFSISKDTLASHELENLK-VFTIYEGKLTIL 711 Query: 131 LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW 190 + S+ SIC ++D+ + Sbjct: 712 DTAYDITSGTISASTPYFSICGIVDIC----------------KLIESLMCDDESGGILD 755 Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + KI+ +I + + K N+ +R G I Y Sbjct: 756 KGKADIIFAIDSTGSMGDKIENVITNVNEFAEELSK------NVEIRFGLIDYKDIYEVG 809 Query: 251 QCT---PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + T ++NE+K R++++ Y + + A E + KKF Sbjct: 810 ETTINCGWFTDVNELKKRVDEILVYGGGDVPESAVDALEE-------ARTMGFRPNAKKF 862 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA-VSAPPEGQDLLRKCTDSSGQ 366 ++ +TD + + + + Q + ++N G+ V+ + L T+++G Sbjct: 863 IVLLTDADYKDGTHFTDVTTMAQEIDLLKNDGIITSVVSDIGYEYVYNSLY---TETNGL 919 Query: 367 FFAVNDSRELLESFDKITDKIQEQSV 392 F +N + E +++ KI++ ++ Sbjct: 920 FADINS--DFAEVLRELSSKIKDVTL 943 >gi|262165253|ref|ZP_06032990.1| protein BatA [Vibrio mimicus VM223] gi|262024969|gb|EEY43637.1| protein BatA [Vibrio mimicus VM223] Length = 335 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 25/177 (14%), Positives = 63/177 (35%), Gaps = 23/177 (12%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENT 275 + + + +K +++ G TP + + + + L++ ++T Sbjct: 122 SRLTAAKKVLRDFVTQRQGDRFGLILFGDAAFIQTPFTADQDVWLNLLDEAETGMAGQST 181 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N A+ + ST + ++ +TDG ++G+ + + + Sbjct: 182 NLGDAIGLGIKVFEQSP--------STSQDQIMLVLTDGNDTGSFVSPVDAAKIAAAKGI 233 Query: 336 RNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 R IY +A+ P D++ + + + + F D +L E++ I Sbjct: 234 R-----IYVIAMGDPENVGEQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQVIDQ 285 >gi|220925364|ref|YP_002500666.1| LPXTG-motif cell wall anchor domain-containing protein [Methylobacterium nodulans ORS 2060] gi|219949971|gb|ACL60363.1| LPXTG-motif cell wall anchor domain protein [Methylobacterium nodulans ORS 2060] Length = 725 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 18/132 (13%), Positives = 42/132 (31%), Gaps = 16/132 (12%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 +S + L T + A + E+ + ++F+TDG A Sbjct: 400 ARSFVAALEARGGTEMLAPLTAALADPTPERTDR---------VRQIVFLTDG------A 444 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 N R +++ + + + P + G + A+ ++ E Sbjct: 445 IGNEEQIFSAIAAGRGRS-RLFMIGIGSAPNAHLMTYAAELGRGSYTAIGTIDQVAERMR 503 Query: 382 KITDKIQEQSVR 393 ++ K++ V Sbjct: 504 ELLTKLESPVVT 515 >gi|297685993|ref|XP_002820555.1| PREDICTED: inter-alpha-trypsin inhibitor heavy chain H5-like [Pongo abelii] Length = 150 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 13/81 (16%), Positives = 30/81 (37%), Gaps = 5/81 (6%) Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++ ++P T+ A+ A R L S R ++F+TDG+ + + Sbjct: 70 YIHHMSPTGGTDINGALQRAIRLLNKYVAHSGI---GDRSVSLIVFLTDGKPTVGETHTL 126 Query: 325 TLNTLQICEYMRNAGMKIYSV 345 + + I+++ Sbjct: 127 KILNNT--REAARGQVCIFTL 145 >gi|262272104|gb|ACY40027.1| MIP14152p [Drosophila melanogaster] Length = 603 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 53/144 (36%), Gaps = 16/144 (11%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + V+ + +V +N+ E+KS + + + N + +A+ L+ Sbjct: 96 SEVVKTPVPCFKDRMVRATP----DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYN 151 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 +S + + + ++ IT+ + N + ++I++ + + Sbjct: 152 QSGAGSQCN----QAIMLITESTSESHKDVIKQYNWPHM-------PVRIFTYLIGSDSG 200 Query: 353 GQDLLRKCTDSSGQFF-AVNDSRE 375 + L S+ FF +ND E Sbjct: 201 SRSNLHDMACSNKGFFVQINDYDE 224 >gi|195579432|ref|XP_002079566.1| GD21946 [Drosophila simulans] gi|194191575|gb|EDX05151.1| GD21946 [Drosophila simulans] Length = 1100 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 53/144 (36%), Gaps = 16/144 (11%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + V+ + +V +N+ E+KS + + + N + +A+ L+ Sbjct: 197 SEVVKTPVPCFKDRMVRATP----DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYN 252 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 +S + + + ++ IT+ + N + ++I++ + + Sbjct: 253 QSGAGSQCN----QAIMLITESTSESHKDVIKQYNWPHM-------PVRIFTYLIGSDSG 301 Query: 353 GQDLLRKCTDSSGQFF-AVNDSRE 375 + L S+ FF +ND E Sbjct: 302 SRSNLHDMACSNKGFFVQINDYDE 325 >gi|195475490|ref|XP_002090017.1| GE19394 [Drosophila yakuba] gi|194176118|gb|EDW89729.1| GE19394 [Drosophila yakuba] Length = 1136 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 53/144 (36%), Gaps = 16/144 (11%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + V+ + +V +N+ E+KS + + + N + +A+ L+ Sbjct: 197 SEVVKTPVPCFKDRMVRATP----DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYN 252 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 +S + + + ++ IT+ + N + ++I++ + + Sbjct: 253 QSGAGSQCN----QAIMLITESTSESHKDVIKQYNWPHM-------PVRIFTYLIGSDSG 301 Query: 353 GQDLLRKCTDSSGQFF-AVNDSRE 375 + L S+ FF +ND E Sbjct: 302 SRSNLHDMACSNKGFFVQINDYDE 325 >gi|195338633|ref|XP_002035929.1| GM14254 [Drosophila sechellia] gi|194129809|gb|EDW51852.1| GM14254 [Drosophila sechellia] Length = 1119 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 53/144 (36%), Gaps = 16/144 (11%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + V+ + +V +N+ E+KS + + + N + +A+ L+ Sbjct: 197 SEVVKTPVPCFKDRMVRATP----DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYN 252 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 +S + + + ++ IT+ + N + ++I++ + + Sbjct: 253 QSGAGSQCN----QAIMLITESTSESHKDVIKQYNWPHM-------PVRIFTYLIGSDSG 301 Query: 353 GQDLLRKCTDSSGQFF-AVNDSRE 375 + L S+ FF +ND E Sbjct: 302 SRSNLHDMACSNKGFFVQINDYDE 325 >gi|194857574|ref|XP_001968984.1| GG24200 [Drosophila erecta] gi|190660851|gb|EDV58043.1| GG24200 [Drosophila erecta] Length = 1136 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 53/144 (36%), Gaps = 16/144 (11%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + V+ + +V +N+ E+KS + + + N + +A+ L+ Sbjct: 197 SEVVKTPVPCFKDRMVRATP----DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYN 252 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 +S + + + ++ IT+ + N + ++I++ + + Sbjct: 253 QSGAGSQCN----QAIMLITESTSESHKDVIKQYNWPHM-------PVRIFTYLIGSDSG 301 Query: 353 GQDLLRKCTDSSGQFF-AVNDSRE 375 + L S+ FF +ND E Sbjct: 302 SRSNLHDMACSNKGFFVQINDYDE 325 >gi|161076922|ref|NP_001097164.1| CG4587, isoform C [Drosophila melanogaster] gi|320545115|ref|NP_001188817.1| CG4587, isoform D [Drosophila melanogaster] gi|157400167|gb|ABV53684.1| CG4587, isoform C [Drosophila melanogaster] gi|318068460|gb|AAF53476.3| CG4587, isoform D [Drosophila melanogaster] Length = 1243 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 53/144 (36%), Gaps = 16/144 (11%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + V+ + +V +N+ E+KS + + + N + +A+ L+ Sbjct: 301 SEVVKTPVPCFKDRMVRATP----DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYN 356 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 +S + + + ++ IT+ + N + ++I++ + + Sbjct: 357 QSGAGSQCN----QAIMLITESTSESHKDVIKQYNWPHM-------PVRIFTYLIGSDSG 405 Query: 353 GQDLLRKCTDSSGQFF-AVNDSRE 375 + L S+ FF +ND E Sbjct: 406 SRSNLHDMACSNKGFFVQINDYDE 429 >gi|161076920|ref|NP_001097163.1| CG4587, isoform B [Drosophila melanogaster] gi|157400166|gb|ABV53683.1| CG4587, isoform B [Drosophila melanogaster] Length = 1209 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 21/144 (14%), Positives = 53/144 (36%), Gaps = 16/144 (11%) Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + V+ + +V +N+ E+KS + + + N + +A+ L+ Sbjct: 250 SEVVKTPVPCFKDRMVRATP----DNIQEIKSAVKAIKLQDTANFTAGLEYAFSLLHKYN 305 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 +S + + + ++ IT+ + N + ++I++ + + Sbjct: 306 QSGAGSQCN----QAIMLITESTSESHKDVIKQYNWPHM-------PVRIFTYLIGSDSG 354 Query: 353 GQDLLRKCTDSSGQFF-AVNDSRE 375 + L S+ FF +ND E Sbjct: 355 SRSNLHDMACSNKGFFVQINDYDE 378 >gi|314969033|gb|EFT13131.1| conserved hypothetical protein [Propionibacterium acnes HL037PA1] Length = 169 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 16/134 (11%), Positives = 34/134 (25%), Gaps = 17/134 (12%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+I+ + A I ++ QSA A +G + I Sbjct: 43 AALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVGEGIA---------- 92 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + ++ L + + + ++ I +L + Sbjct: 93 -VGQRVGLAELAGTRCSNPAIAVDSSDLTLPV------GSAGTTSARVSCTIKLSDLLVP 145 Query: 122 GLIPSALTNLSLRS 135 G+ S S Sbjct: 146 GMPGSFRIESVAHS 159 >gi|121583393|ref|YP_973824.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] gi|120596647|gb|ABM40082.1| von Willebrand factor, type A [Polaromonas naphthalenivorans CJ2] Length = 350 Score = 39.1 bits (89), Expect = 1.3, Method: Composition-based stats. Identities = 11/91 (12%), Positives = 24/91 (26%), Gaps = 6/91 (6%) Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 KL TN A+ E+ + + +TDG + + Sbjct: 72 KLPLGGGTNLGSALDALMGEIDRSVIKTTAERKGDWRP-IIYLVTDGRPTDNPSRAIERW 130 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 + + ++ + + L R Sbjct: 131 NSHYAKKA-----TLIAIGLGRSVDFTALRR 156 >gi|324996174|gb|EGC28084.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK678] Length = 458 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 31/287 (10%), Positives = 79/287 (27%), Gaps = 29/287 (10%) Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 A + + + I++ V D+S SM+ + +NN + L Sbjct: 173 FSKVAKGKKGIAIAYRTDPIQGQMNIAVSFVFDISGSMKGALNGANPTSNNPSRMDILRD 232 Query: 183 P--------PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK--AIQEKK 232 + TT S A ++D + + +++ + Sbjct: 233 KAEIMINELQSVGNVSVNLTTFSTTGSYKQAAFSQLDREAGTIKESIKNLKSDGGVTNPG 292 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + L++ + + + H L+ Sbjct: 293 DGLRYGMVSLQKQHAQLKYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQHFNNVLFELS 352 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM-RNAGM---KIYSVAV- 347 + + RL G + + ++ + + G+ ++ + Sbjct: 353 SNVPASFNYGRL---------GYDYTSKIPDIRDASIDYAGAVSKAYGVGVKRVNVIGFS 403 Query: 348 SAPP---EGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 GQ+L K + ++ + + L ++F I +IQ+ Sbjct: 404 GLEHEIAYGQNLTDKIGEGGMETKYVSATNKEALQKTFSDIKKQIQQ 450 >gi|167574006|ref|ZP_02366880.1| hypothetical protein BoklC_29490 [Burkholderia oklahomensis C6786] Length = 395 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 29/318 (9%), Positives = 85/318 (26%), Gaps = 29/318 (9%) Query: 5 IISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTI---KDPTTKKDQTS 61 +++V F+ A+DL + R+++Q++ DA L+ + + + + Sbjct: 1 MLAVLIGFVGLALDLGKLYVTRSELQNSADACALAAARDLTGAINLSVPEAAGITSGHLN 60 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + ++ + + + ++ ++ + L Sbjct: 61 YALFQGFPVQMQTDLSVTFSDSVSGPFQPKSAISSPSSIKYVKCKTSMTGIVNWFIQALN 120 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVL------DVSRSMEDLYLQKHNDNNNMT 175 + ++ N S+ +T + + +I + + Y + Sbjct: 121 MVPGVSVANASVSATAVATIGAAQTTCAIPVFICKAGTQTSPPVAGATYNVGDWLSAKTG 180 Query: 176 SNKYLLPPPPKKSFWSKNTTKSKYAP-------------APAPANRKIDVLIESAGNLVN 222 S S + + S A + + Sbjct: 181 SPPSFGAGNFGWSALDGSNSASSIANELTGNYCALPATGSQVGTPGSKAATTNAYNTRFG 240 Query: 223 SIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR-------LNKLNPYENT 275 ++ + AY+ Q S+ +++ ++ + +N + Sbjct: 241 IYNNPYKDPSYGTPDFTGYAYDATTWPAQSNAYSDFVSKRRTFTSYQGDLITGINTGGSY 300 Query: 276 NTYPAMHHAYRELYNEKE 293 + A R L E Sbjct: 301 SASYYQAGADRRLALAPE 318 >gi|121709994|ref|XP_001272613.1| von Willebrand and RING finger domain protein [Aspergillus clavatus NRRL 1] gi|119400763|gb|EAW11187.1| von Willebrand and RING finger domain protein [Aspergillus clavatus NRRL 1] Length = 1007 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 29/242 (11%), Positives = 63/242 (26%), Gaps = 20/242 (8%) Query: 133 LRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSK 192 + V + R +NN Y S Sbjct: 428 TSRHTDYDVDHSGAEEEDYRVSQIKRQASINSSYGAGRSNNTAITDYTAVDNLSISTIHI 487 Query: 193 NTTKSKYAPAPAPAN-RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 P + KI +L ++ LV ++ R+G + + G Sbjct: 488 PLDVVVVIPVSSSMQGLKITLLRDALKFLVQNLG--------PRDRMGLVTFGSSGGGVP 539 Query: 252 CTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 ++ + LN + P + + + S ++ Sbjct: 540 LVGMTTKSWGGWNKILNSIRPVGQKSLRADVVEGANVAMDLLMQ----RKSNNPVSTILL 595 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV 370 I+D S+ + + + A + I+S + + ++ T + G + V Sbjct: 596 ISD------SSTSDPESVDFVVSRAEAAKISIHSFGLGLTHKPDTMIELSTRTKGSYLYV 649 Query: 371 ND 372 D Sbjct: 650 KD 651 >gi|114564047|ref|YP_751561.1| type IV pilin biogenesis protein, putative [Shewanella frigidimarina NCIMB 400] gi|114335340|gb|ABI72722.1| type IV pilin biogenesis protein, putative [Shewanella frigidimarina NCIMB 400] Length = 1204 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 31/338 (9%), Positives = 80/338 (23%), Gaps = 10/338 (2%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP 100 + + + + D + G I Q + Sbjct: 152 ASGVGAGYPVDDLSDAYKNDGNALNNAKLTDFGLGEPITLFTQQYVQWYYNKPAATNTDR 211 Query: 101 LQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSM 160 + + + T + L ++ + + S RS+ + I+ + + Sbjct: 212 ITVAKRVLNEVIVNTPGVDLGLMLFNYNASGSNRSSENKDGGRVVSKINTGNINNKKALA 271 Query: 161 EDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNL 220 + +NN + L Y A K ++ Sbjct: 272 SIVKNTIAIKSNNTPLCETLYEAALYLHGKPLEYGHDTYDSVAATPLNKTWKWVKKWNGW 331 Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 + + N + + I +N + + + P +T+ Sbjct: 332 IEESTPVNPAQYNPGYDATAESVDGYISPFIGKKCGSNASII--YITDGAPTSDTDAQGK 389 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + ++ S + + + Q+ + Sbjct: 390 VKSLLGLASTTTDADVPGSESLGNDNYSLLPPLAHWLATNDVNPNSEGDQV--------V 441 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 Y++ SA L+ G++++ D EL Sbjct: 442 NTYTIGFSADAPVDILISTAALGGGKYYSALDPAELQA 479 >gi|116622485|ref|YP_824641.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] gi|116225647|gb|ABJ84356.1| von Willebrand factor, type A [Candidatus Solibacter usitatus Ellin6076] Length = 313 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 19/154 (12%), Positives = 52/154 (33%), Gaps = 20/154 (12%) Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 Y + PL+ + + L+ ++ +++ + A+ R+ Sbjct: 129 YTFSRNLYRAAPLTRDHVRAGAGLDNISAGDDSALFNALLLTLRDAAEVPGR-------- 180 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV----AVSAPPEGQDLLR 358 K ++ ++ ++ + + L + N G+ IY + V P L Sbjct: 181 ---KAIVVFSNSSDNASVLSPYDVGRLAV-----NEGVPIYVISTRDGVQDPVTNSALHY 232 Query: 359 KCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + G+ ++ ++ +F I + I+ Sbjct: 233 LTAATGGKLYSARTWQKQAGAFQAIREDIRNSYT 266 >gi|167845909|ref|ZP_02471417.1| hypothetical protein BpseB_11520 [Burkholderia pseudomallei B7210] Length = 579 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 8 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 66 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 67 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 114 >gi|167738778|ref|ZP_02411552.1| hypothetical protein Bpse14_11973 [Burkholderia pseudomallei 14] Length = 578 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 7 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 65 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 66 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 113 >gi|162451829|ref|YP_001614196.1| cell wall anchor domain-containing protein [Sorangium cellulosum 'So ce 56'] gi|161162411|emb|CAN93716.1| LPXTG-motif cell wall anchor domain protein [Sorangium cellulosum 'So ce 56'] Length = 406 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 19/126 (15%), Positives = 40/126 (31%), Gaps = 17/126 (13%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG-ASAYQNTLNTLQI 331 T T M + ++ ++ ++ +TDG +G + Sbjct: 195 GGTPTLFVMQGTISYVEEQRRATPGRY-------AIVLVTDGYPAGCGTESNRIQRVSDA 247 Query: 332 CEYMRNAGMKIYSVAVSAP-----PEGQDLLRKCTDSSG--QFFAVN--DSRELLESFDK 382 G+ Y + V P P D L + D+ G + + ++ + +F Sbjct: 248 AAGALEDGVPTYVIGVDNPPIEGAPHDLDNLHQIADAGGTERAYLIDTGNPGATSAAFKA 307 Query: 383 ITDKIQ 388 D I+ Sbjct: 308 AIDAIR 313 >gi|268561232|ref|XP_002646396.1| Hypothetical protein CBG15365 [Caenorhabditis briggsae] gi|187027192|emb|CAP33692.1| hypothetical protein CBG_15365 [Caenorhabditis briggsae AF16] Length = 381 Score = 39.1 bits (89), Expect = 1.4, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 60/176 (34%), Gaps = 17/176 (9%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN----QCTP 254 + + + + ++ + Q I + RIG + YN + P Sbjct: 40 VDNSKGMTKDGLTAISANLASIFSDAQIGINPSNPKTTRIGMVTYNSNATVDAHLNYDWP 99 Query: 255 LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 +N++ S +++++ + + A LY+E S + KK +I Sbjct: 100 KNNDVLNFYSTMSEISEDSTSYVAHGLQAAQNLLYSESFGS----NRSHYKKVIIVCA-- 153 Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFF 368 + + + ++ G+KI ++A + ++ + S G F Sbjct: 154 ---STFKGTGKNDPIPVANRLKGRGVKILTIAY--DQGDEKVVEELAKISSPGLSF 204 >gi|114046659|ref|YP_737209.1| type IV pilin biogenesis protein [Shewanella sp. MR-7] gi|113888101|gb|ABI42152.1| type IV pilin biogenesis protein, putative [Shewanella sp. MR-7] Length = 1223 Score = 38.7 bits (88), Expect = 1.4, Method: Composition-based stats. Identities = 9/53 (16%), Positives = 23/53 (43%), Gaps = 2/53 (3%) Query: 340 MKIYSVAVSAPPEGQDLL--RKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 ++ +++ S + L + + G++F D+ +L S + I E+ Sbjct: 478 VRTFTIGFSEGAANAEHLLKQTAENGGGKYFDATDASKLRSSLQTALNNILEK 530 >gi|313200801|ref|YP_004039459.1| outer membrane adhesin-like protein [Methylovorus sp. MP688] gi|312440117|gb|ADQ84223.1| outer membrane adhesin like proteiin [Methylovorus sp. MP688] Length = 1543 Score = 38.7 bits (88), Expect = 1.4, Method: Composition-based stats. Identities = 36/368 (9%), Positives = 89/368 (24%), Gaps = 38/368 (10%) Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ 110 T S + G+ I + + Sbjct: 894 DSSGTVDVTNSASAHGTLSISDPDGNAITSVVLTAPTGDYYSGGQLITWSGSGTGTLTGS 953 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 NL ++ + L +++ + + L + + Sbjct: 954 AGGQPVLTITINNGGEYDVNLLRPLDHPDTKTEDVLTLNVGVSATANGVTSVGNLTVNVE 1013 Query: 171 NNN--MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 +++ L + + + S ++ I+S L++ Sbjct: 1014 DDSPVANPITANLSTTNTNLLITLDISGSMRTHDGVGDTTRLASAIQSIKVLLDKYDALG 1073 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK-SRLNKLNPYENTNTYPAMHHAYRE 287 RI + ++ ++ + + + ++ P NTN A+ +A Sbjct: 1074 DT------RISLVVFSTTAAQVGTDWMTIDQAKAQLDQILVKGPGGNTNYDSALANAMDA 1127 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY------------- 334 + T + FI+DGE + S +L Sbjct: 1128 FDDPG-------KLTNAQNVAYFISDGEPNTGSGSNTSLTGGTNTNSSDAGIQTQEELAW 1180 Query: 335 ---MRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA-----VNDSRELLESFD-KITD 385 + +K Y++ V + + L V +L I + Sbjct: 1181 KTFLEANQIKSYAIGVGSDINSVNALNPVAYDGQTNTNMDGILVTSFTQLDAVLAGTIQN 1240 Query: 386 KIQEQSVR 393 + + + Sbjct: 1241 QAAGELIT 1248 >gi|167719780|ref|ZP_02403016.1| hypothetical protein BpseD_12240 [Burkholderia pseudomallei DM98] Length = 577 Score = 38.7 bits (88), Expect = 1.4, Method: Composition-based stats. Identities = 19/108 (17%), Positives = 37/108 (34%), Gaps = 1/108 (0%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI + V + A+D+ ++ ++R +Q D A L+G + + + S Sbjct: 6 AAIWMLVAIAALG-AVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARS 64 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 F + + R + A + N N +Q A Sbjct: 65 NGFDPAAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 112 >gi|163858495|ref|YP_001632793.1| hypothetical protein Bpet4177 [Bordetella petrii DSM 12804] gi|163262223|emb|CAP44525.1| hypothetical protein Bpet4177 [Bordetella petrii] Length = 183 Score = 38.7 bits (88), Expect = 1.4, Method: Composition-based stats. Identities = 17/119 (14%), Positives = 42/119 (35%), Gaps = 17/119 (14%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS----AY 322 L+ T A+ A + ++ + + S + V+ ++DG + A Sbjct: 25 QDLSAGGMTPLGTALQMAKAMIEDK-----DVVPSRAYRPTVVLVSDGGPNDAWEKPLNA 79 Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFD 381 + C+ + A+ A + L + +S + F ++++L + F Sbjct: 80 FISDGRSAKCDRLAM--------AIGADADEAVLGKFIEGTSNRLFYAENAKQLRDFFK 130 >gi|323350757|ref|ZP_08086417.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis VMC66] gi|322123037|gb|EFX94736.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis VMC66] Length = 458 Score = 38.7 bits (88), Expect = 1.4, Method: Composition-based stats. Identities = 31/287 (10%), Positives = 79/287 (27%), Gaps = 29/287 (10%) Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 A + + + I++ V D+S SM+ + +NN + L Sbjct: 173 FSKVAKGKKGIAIAYRTDPIQGQMNIAVSFVFDISGSMKGALNGANPTSNNPSRMDILRD 232 Query: 183 P--------PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK--AIQEKK 232 + TT S A ++D + + +++ + Sbjct: 233 KAEIMINELQSVGNVSVNLTTFSTTGSYKQAAFSQLDREAGTIKESIKNLKSDGGVTNPG 292 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + L++ + + + H L+ Sbjct: 293 DGLRYGMVSLQKQHAQLKYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQHFNNVLFELS 352 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM-RNAGM---KIYSVAV- 347 + + RL G + + ++ + + G+ ++ + Sbjct: 353 SNVPASFNYGRL---------GYDYTSKIPDIRDASIDYAGAVSKAYGVGVKRVNVIGFS 403 Query: 348 SAPP---EGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 GQ+L K + ++ + + L ++F I +IQ+ Sbjct: 404 GLEHEIAYGQNLTDKIGEGGMETKYVSATNKEALQKTFSDIKKQIQQ 450 >gi|229815486|ref|ZP_04445817.1| hypothetical protein COLINT_02533 [Collinsella intestinalis DSM 13280] gi|229808923|gb|EEP44694.1| hypothetical protein COLINT_02533 [Collinsella intestinalis DSM 13280] Length = 278 Score = 38.7 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 20/243 (8%), Positives = 60/243 (24%), Gaps = 7/243 (2%) Query: 12 FITYAIDLAHIMYIRNQMQSALDAAVL----SGCASIVSDRTIKDPTTKKDQTSTIFKKQ 67 A D+A N Q A+++A + + + + + + Sbjct: 9 MAGPAGDVAGAQQAVNAAQVAVESAQASYNDAAAKAALRTVKAPAAGSIVAMNAQVGADL 68 Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK--GLIP 125 + G+ K ++ I ++ + + A P + Sbjct: 69 GESLGGAGTNGPLMQIADLSKMKVTIQVEEEDIASVAVDQTATISCPAFPDLSMTGRVTG 128 Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP 185 A + + + ++ +++D + ++ P Sbjct: 129 IASIASGDGAQMSYDGMGGSPTFAVDIMIDAPDPRLKPGMTAEVSLTTQKLENVVMVPMT 188 Query: 186 KKSFWSKNTTKSKYAPAPAPANR-KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 T P ++DV + + + + K + + + + Sbjct: 189 ALLTDDGETYYVNLETDPETHATERLDVRVVAKNDDYAVVGKPADAPADQNSDMPESPLS 248 Query: 245 IGI 247 Sbjct: 249 DSD 251 >gi|251794095|ref|YP_003008826.1| hypothetical protein Pjdr2_0059 [Paenibacillus sp. JDR-2] gi|247541721|gb|ACS98739.1| conserved hypothetical protein [Paenibacillus sp. JDR-2] Length = 242 Score = 38.7 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 14/97 (14%), Positives = 40/97 (41%), Gaps = 10/97 (10%) Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV----SAPPEGQDLLRKCT 361 K ++ +TDG ++ + + + + G+ + V + G + + Sbjct: 2 KQILLVTDGCSNVGMS------PVVAAAHAKAEGIAVNVVGILDYGDVGELGASEIEEIA 55 Query: 362 DSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 + G + + R+L ++ +T K Q++++A + Sbjct: 56 RAGGGLSRLVNMRQLSQTVQMMTRKTAVQTIQLAVQK 92 >gi|284989062|ref|YP_003407616.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] gi|284062307|gb|ADB73245.1| von Willebrand factor type A [Geodermatophilus obscurus DSM 43160] Length = 547 Score = 38.7 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 22/122 (18%), Positives = 42/122 (34%), Gaps = 11/122 (9%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG-ENSGASAYQNT 325 ++L+P T Y A R ++ + V+ +TDG + Sbjct: 435 DRLSP-GGTGLYDTTLDAVRAARSDFDPRAV--------NSVLVVTDGTNEDSGGVDLDE 485 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES-FDKIT 384 L E + +K+ VA+ + L R + G ++ D +L FD + Sbjct: 486 LLATLRSEADPDRPIKVIGVALGPDADLGALERIADVTGGAAYSAVDPTDLQTVLFDALR 545 Query: 385 DK 386 + Sbjct: 546 QR 547 >gi|221052180|ref|XP_002257666.1| von willebrand factor a-domain-related protein [Plasmodium knowlesi strain H] gi|193807496|emb|CAQ38002.1| von willebrand factor a-domain-related protein [Plasmodium knowlesi strain H] Length = 294 Score = 38.7 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 30/264 (11%), Positives = 70/264 (26%), Gaps = 21/264 (7%) Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQ 166 + Q + + + S + I + +D Sbjct: 15 AYVQSSVSAKMNVASNNSAVPRIDGEDGSDEKVITCVTKYVIKGDLEIDDGGFCSSNIGS 74 Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 ++ + N P + T + + +K + L + ++ Sbjct: 75 ---ESPPGSCNDTGANPGNYCDNYYDITLLVEESS----FIQKDYWKKGTIPFLESMVRN 127 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK---SRLNKLNPYENTNTYPAMHH 283 A K + I A + ++ LS + +++ +N + +T A+ H Sbjct: 128 ARVSKDKAHMSIVLFARDSRVIVPFTDELSQDKDKLIEKVRAINDVATSPDTLYAYALEH 187 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 A+ + K V+ G + N + E + +K+ Sbjct: 188 AFEHVI------FGEGTRKDAPKVVVLFYYGFD----YGANKSLIPDVVEDYKKKNIKLV 237 Query: 344 SVAVSAPPEGQD-LLRKCTDSSGQ 366 V ++ LL C Sbjct: 238 IVGIALGNRDNAYLLGGCAIGDNN 261 >gi|226943318|ref|YP_002798391.1| type IV pilus assembly protein, PilY1-like protein [Azotobacter vinelandii DJ] gi|226718245|gb|ACO77416.1| type IV pilus assembly protein, PilY1-like protein [Azotobacter vinelandii DJ] Length = 1036 Score = 38.7 bits (88), Expect = 1.5, Method: Composition-based stats. Identities = 13/94 (13%), Positives = 29/94 (30%), Gaps = 4/94 (4%) Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV---SAPPEGQD 355 + T + L + + Y++ V A + Sbjct: 236 KNVGGDAGQLYQTTSSKVRYGDEMARFLYQTDVSAAPGKQNVLTYAIDVYNRQASEDHSR 295 Query: 356 LLRKCT-DSSGQFFAVNDSRELLESFDKITDKIQ 388 LL+ G++F+ ++ E+ + I +IQ Sbjct: 296 LLKSMAHVGGGKYFSATNADEIETALSTIFSEIQ 329 >gi|302754792|ref|XP_002960820.1| hypothetical protein SELMODRAFT_402207 [Selaginella moellendorffii] gi|300171759|gb|EFJ38359.1| hypothetical protein SELMODRAFT_402207 [Selaginella moellendorffii] Length = 563 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 56/177 (31%), Gaps = 10/177 (5%) Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + + +V L+N+ Q + A ++G Q T L Sbjct: 213 SNPTGVLSSQTRFNVANNIIKRLLNTFTNGDQVAVSTIGGEKIGAPVSVVLGVQETSL-- 270 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 +L + S + ++ TN+ + + + + ++ +I TDG+ Sbjct: 271 DLAGISSLKDSISNTSVTNSASNIKNGLQAALD-------FFNTSSNLNVIILFTDGQFV 323 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 + T + + + + G+ ++ + + + S + V + Sbjct: 324 TPGNFNFTQLSPVLAQ-LNARGVVVFVYRIGSFTTNDATFQHMQSSLNMSYEVINDD 379 >gi|91788739|ref|YP_549691.1| putative type 4 fimbrial biogenesis PilY1-like protein signal peptide [Polaromonas sp. JS666] gi|91697964|gb|ABE44793.1| putative type 4 fimbrial biogenesis PilY1-related protein signal peptide [Polaromonas sp. JS666] Length = 1025 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 35/283 (12%), Positives = 79/283 (27%), Gaps = 42/283 (14%) Query: 145 NLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 VL V + + N+ + S +P + Sbjct: 35 PTTAETPNVLIVLDNTANWNTAFVNEKAALVSVFQGMPVDKFRVGLMMFNETGGGNSGND 94 Query: 205 PANRKIDVL------IESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 A + + NL++S+ +K N + + N Sbjct: 95 GAYVRAAARTITSSNKTAYANLIDSLD-ITGDKSNGGKVAKAMMEAYYYLSGGTPHAGNQ 153 Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 N+ LN ++ + +K ++I+I++G Sbjct: 154 KNKTDYALN--TAGNASDDAMHALAGNALASKNSTTYVKPGSGDCVKNYIIYISNGPAQD 211 Query: 319 ASAY------------------------QNTLNTLQICEYMRNAGMK--IYSVAVSAPPE 352 S+ + +++ + +K +Y++ V+ Sbjct: 212 NSSDIETATTALAAAGGKTTAISLNPSGSQDNPADEWARFLKKSAIKGVVYTLDVNKGTS 271 Query: 353 GQ-----DLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQE 389 GQ LL+ S G++F V + + ++ + I +IQ Sbjct: 272 GQGPGWTALLKSMASQSGGRYFDV-STSGIADAINSILSEIQS 313 >gi|254292588|ref|YP_003058611.1| hypothetical protein Hbal_0212 [Hirschia baltica ATCC 49814] gi|254041119|gb|ACT57914.1| conserved hypothetical protein [Hirschia baltica ATCC 49814] Length = 447 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 16/106 (15%), Positives = 40/106 (37%), Gaps = 7/106 (6%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M A+ I + I ID++ + ++QS +D A++ + R + + + + Sbjct: 17 MAALFIGLLIAVIAGTIDISQKSSLNRELQSVVD------AAALAAAREMAVSSADQTRV 70 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAE 106 ++ + + G +A K I ++ + I + Sbjct: 71 QSVASSYVDANW-TGEQATTHAVLDVSKGIITVSSTAPKTIASILK 115 >gi|330828501|ref|YP_004391453.1| RTX protein [Aeromonas veronii B565] gi|328803637|gb|AEB48836.1| RTX protein [Aeromonas veronii B565] Length = 2390 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 16/123 (13%), Positives = 44/123 (35%), Gaps = 7/123 (5%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL--S 256 + + ++ + + S++ + + +V I + ++ + L S Sbjct: 1851 VDSSGSIGTSAMNTIKNQLAQIFASLKASAGSEGAGTVNIMLVDFDALSNASISVNLKDS 1910 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N LN++++ ++ ++ TN + ++ K FITDG+ Sbjct: 1911 NALNQLQAVIDSMSSGGGTNYEDVFKTVANWFSSS-----TAQSNSNAKNLTYFITDGQP 1965 Query: 317 SGA 319 + Sbjct: 1966 TYY 1968 >gi|308472851|ref|XP_003098652.1| hypothetical protein CRE_04170 [Caenorhabditis remanei] gi|308268252|gb|EFP12205.1| hypothetical protein CRE_04170 [Caenorhabditis remanei] Length = 396 Score = 38.7 bits (88), Expect = 1.6, Method: Composition-based stats. Identities = 21/169 (12%), Positives = 53/169 (31%), Gaps = 11/169 (6%) Query: 210 IDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NK 268 I+V + + + + + R+G + YN ++++V + + + Sbjct: 57 INVASDIYSVFSSGTRIGSNSSEPRTTRVGLVTYNSNATQKADLNKYQSIDDVLNEIYDD 116 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 ++ NT + + K+ VI L+ Sbjct: 117 ISTVVNTA-DSYLATGLQLAEKMLIDQSENTNRAHYKRVVIVYASEYKGEG-----ELDP 170 Query: 329 LQICEYMRNAGMKIYSVAVSAPPEGQDL--LRKCTDSSGQFFAVNDSRE 375 L + ++ + + I +VA + L + F VN++ + Sbjct: 171 LNVANRLKLSDINIITVAYEQKGDDGLFHDLSQIASPG--FSFVNNASD 217 >gi|326526787|dbj|BAK00782.1| predicted protein [Hordeum vulgare subsp. vulgare] Length = 585 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 20/182 (10%), Positives = 50/182 (27%), Gaps = 21/182 (11%) Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT---NT 277 + K + + N + ++ + +++ + T T Sbjct: 219 FDCTTSTPAWNKVNREVYWLVQKKLTQFKNSYLGYTYVMSTPNTYTSEMKLVDRTETEAT 278 Query: 278 YPAMHHAYRELYNEKESSHNT-------IGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 +R + L ++ +DG + S + + Sbjct: 279 GYPRSSTWRREACTNNMAAGLVEAHRLIKDHGHLNGMILLFSDGSINKGSFFDGVEGFI- 337 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA---VNDSRELLESFDKITDKI 387 + + +++ + Q LL +S G F V + L +F ++ D I Sbjct: 338 -------SKVPVHTFTLGGDAYNQALLTISINSPGGTFHTLPVPEKPSLSVTFSRLVDNI 390 Query: 388 QE 389 Sbjct: 391 LS 392 >gi|313814173|gb|EFS51887.1| conserved hypothetical protein [Propionibacterium acnes HL025PA1] Length = 169 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 36/134 (26%), Gaps = 17/134 (12%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+I+ + A I ++ QSA A +G + I Sbjct: 43 AALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVGEGIAVG-------Q 95 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + ++ I ++ D+ T ++ I +L + Sbjct: 96 RVGLAELAGTRCSNPAIAVDSSDLTLPVGFAGTA----------SARVSCTIKLSDLLVP 145 Query: 122 GLIPSALTNLSLRS 135 G+ S S Sbjct: 146 GMPGSFHIESVAHS 159 >gi|117620923|ref|YP_857205.1| structural toxin protein RtxA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] gi|117562330|gb|ABK39278.1| structural toxin protein RtxA [Aeromonas hydrophila subsp. hydrophila ATCC 7966] Length = 4260 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 30/316 (9%), Positives = 84/316 (26%), Gaps = 26/316 (8%) Query: 75 GSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLR 134 + + DI + E+ + + L +L+ Sbjct: 3089 AITLTKVGSDIVGSTTAGEVFRISVASNGTVTLTQSAELDHLPEDVDNSNDNNLISLANG 3148 Query: 135 STGIIERSSENLAISICMVLDVSRSMEDLYLQ-KHNDNNNMTSNKYLLPPPPKKSFWSKN 193 + + + VS + S P + Sbjct: 3149 KVLLSATVTVVDGDNDTATGTVSADLGGNIRFEDDVPTAKDNSVVITEAGLPPFNLVMVI 3208 Query: 194 TTK----SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVG 249 T + + + ++++ ++ ++++S G Sbjct: 3209 DTSGSMLWQIGTSTNGSPNRLELAKDALNHMIDSYVALGV--------PLVFTVIDFASG 3260 Query: 250 NQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 P +++ + K+ ++ L TN + A +L + + + V Sbjct: 3261 AVLIPQTSDPDVAKASISGLPTDGGGTNYNAPLVLAQNQLTADLANPALA----GYETKV 3316 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA--PPEGQDLLRKCTDSSGQ 366 F++DG + + ++ + +++Y+V ++ L D Sbjct: 3317 YFLSDGAPNEGN------VPAGWTSFVNSNNVEVYAVGLNVSGNATAIAQLGLVEDHGDA 3370 Query: 367 FFAVNDSRELLESFDK 382 VN+ +L + Sbjct: 3371 VTLVNNIYDLDATLQA 3386 >gi|50841536|ref|YP_054763.1| hypothetical protein PPA0046 [Propionibacterium acnes KPA171202] gi|289424429|ref|ZP_06426212.1| conserved hypothetical protein [Propionibacterium acnes SK187] gi|289427366|ref|ZP_06429079.1| conserved hypothetical protein [Propionibacterium acnes J165] gi|295129575|ref|YP_003580238.1| hypothetical protein HMPREF0675_3046 [Propionibacterium acnes SK137] gi|50839138|gb|AAT81805.1| putative membrane protein [Propionibacterium acnes KPA171202] gi|289155126|gb|EFD03808.1| conserved hypothetical protein [Propionibacterium acnes SK187] gi|289159296|gb|EFD07487.1| conserved hypothetical protein [Propionibacterium acnes J165] gi|291376464|gb|ADE00319.1| conserved hypothetical protein [Propionibacterium acnes SK137] gi|313771156|gb|EFS37122.1| conserved hypothetical protein [Propionibacterium acnes HL074PA1] gi|313792519|gb|EFS40605.1| conserved hypothetical protein [Propionibacterium acnes HL110PA1] gi|313803520|gb|EFS44702.1| conserved hypothetical protein [Propionibacterium acnes HL110PA2] gi|313806906|gb|EFS45404.1| conserved hypothetical protein [Propionibacterium acnes HL087PA2] gi|313811818|gb|EFS49532.1| conserved hypothetical protein [Propionibacterium acnes HL083PA1] gi|313817689|gb|EFS55403.1| conserved hypothetical protein [Propionibacterium acnes HL046PA2] gi|313821484|gb|EFS59198.1| conserved hypothetical protein [Propionibacterium acnes HL036PA1] gi|313824571|gb|EFS62285.1| conserved hypothetical protein [Propionibacterium acnes HL036PA2] gi|313826242|gb|EFS63956.1| conserved hypothetical protein [Propionibacterium acnes HL063PA1] gi|313832354|gb|EFS70068.1| conserved hypothetical protein [Propionibacterium acnes HL007PA1] gi|313832813|gb|EFS70527.1| conserved hypothetical protein [Propionibacterium acnes HL056PA1] gi|313839673|gb|EFS77387.1| conserved hypothetical protein [Propionibacterium acnes HL086PA1] gi|314926286|gb|EFS90117.1| conserved hypothetical protein [Propionibacterium acnes HL036PA3] gi|314961709|gb|EFT05810.1| conserved hypothetical protein [Propionibacterium acnes HL002PA2] gi|314964231|gb|EFT08331.1| conserved hypothetical protein [Propionibacterium acnes HL082PA1] gi|314975246|gb|EFT19341.1| conserved hypothetical protein [Propionibacterium acnes HL053PA1] gi|314977661|gb|EFT21756.1| conserved hypothetical protein [Propionibacterium acnes HL045PA1] gi|314980207|gb|EFT24301.1| conserved hypothetical protein [Propionibacterium acnes HL072PA2] gi|314985154|gb|EFT29246.1| conserved hypothetical protein [Propionibacterium acnes HL005PA1] gi|314987063|gb|EFT31155.1| conserved hypothetical protein [Propionibacterium acnes HL005PA2] gi|314990444|gb|EFT34535.1| conserved hypothetical protein [Propionibacterium acnes HL005PA3] gi|315078862|gb|EFT50880.1| conserved hypothetical protein [Propionibacterium acnes HL053PA2] gi|315083131|gb|EFT55107.1| conserved hypothetical protein [Propionibacterium acnes HL027PA2] gi|315086659|gb|EFT58635.1| conserved hypothetical protein [Propionibacterium acnes HL002PA3] gi|315088063|gb|EFT60039.1| conserved hypothetical protein [Propionibacterium acnes HL072PA1] gi|315097114|gb|EFT69090.1| conserved hypothetical protein [Propionibacterium acnes HL038PA1] gi|315107450|gb|EFT79426.1| conserved hypothetical protein [Propionibacterium acnes HL030PA1] gi|327332549|gb|EGE74284.1| putative membrane protein [Propionibacterium acnes HL096PA2] gi|327333722|gb|EGE75439.1| putative membrane protein [Propionibacterium acnes HL096PA3] gi|327444419|gb|EGE91073.1| hypothetical protein HMPREF9568_01683 [Propionibacterium acnes HL013PA2] gi|327446672|gb|EGE93326.1| hypothetical protein HMPREF9571_01238 [Propionibacterium acnes HL043PA2] gi|327448885|gb|EGE95539.1| hypothetical protein HMPREF9570_00450 [Propionibacterium acnes HL043PA1] gi|327457365|gb|EGF04020.1| hypothetical protein HMPREF9584_00563 [Propionibacterium acnes HL092PA1] gi|328757926|gb|EGF71542.1| hypothetical protein HMPREF9563_00560 [Propionibacterium acnes HL020PA1] gi|328759748|gb|EGF73344.1| putative membrane protein [Propionibacterium acnes HL099PA1] gi|332674444|gb|AEE71260.1| hypothetical protein PAZ_c00470 [Propionibacterium acnes 266] Length = 169 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 36/134 (26%), Gaps = 17/134 (12%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+I+ + A I ++ QSA A +G + I Sbjct: 43 AALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVGEGIAVG-------Q 95 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + ++ I ++ D+ T ++ I +L + Sbjct: 96 RVGLAELAGTRCSNPAIAVDSSDLTLPVGFAGTA----------SARVSCTIKLSDLLVP 145 Query: 122 GLIPSALTNLSLRS 135 G+ S S Sbjct: 146 GMPGSFHIESVAHS 159 >gi|226941651|ref|YP_002796725.1| outer membrane adhesin like proteiin [Laribacter hongkongensis HLHK9] gi|226716578|gb|ACO75716.1| putative outer membrane adhesin like proteiin [Laribacter hongkongensis HLHK9] Length = 2392 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 27/336 (8%), Positives = 73/336 (21%), Gaps = 17/336 (5%) Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + K I+ L + + + Sbjct: 1703 DVAKNAIQTLLDNMADFDGTINLSIVDFDSGSKMALQGVTLRDLCVQDAQGNWHLDTSAG 1762 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 A+ N++ E + + ++ ++ Sbjct: 1763 SAFMQAMGNITATGGTNYESAFMQAEQWFSQQPTDGYTNHTYFVTDGKPTARYENSFADK 1822 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 ++ + + + +V S + E + + Sbjct: 1823 ISHTVSFDSVTVPNSDNWSGKGRISWTAEVDGTDVSYRVVRSDSGSSYELQYNNGSKWVT 1882 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY--ENTNTYPAMHHAYRELYNEKESSHNTI 299 N ++ + + + + A H E Sbjct: 1883 LLNQKNDPTDGRTITLDKEVSFTTTDNVTVQIPTGQALGQAWHDQATGQTYRLEQGERNS 1942 Query: 300 GSTRLKKFVIFITDGENS-------------GASAYQNTLNTLQICEYMRNAGMKIYSVA 346 T K V DG + + Q T ++L + +++ I+ + Sbjct: 1943 YGTYTAKVVRVNDDGTTTVVIKDLMTTDGSGTDGSAQETADSLAAADSLKDVS-SIFVIG 2001 Query: 347 VSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 V L + S+G + + +L + + Sbjct: 2002 VGNAGSLPGNLGQY-SSNGSYILATNKDQLDAALEA 2036 >gi|170720024|ref|YP_001747712.1| hypothetical protein PputW619_0838 [Pseudomonas putida W619] gi|169758027|gb|ACA71343.1| conserved hypothetical protein [Pseudomonas putida W619] Length = 651 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 6/43 (13%), Positives = 14/43 (32%), Gaps = 4/43 (9%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIV 45 A+ + + +D + + ++Q D A L Sbjct: 22 ALALLFMLVV----VDSGRLYLEQRKLQRIADMAALEAAGQSA 60 >gi|38347866|ref|NP_941115.1| hypothetical protein SMR0040 [Serratia marcescens] gi|190410209|ref|YP_001965710.1| hypothetical protein pK29_p040 [Klebsiella pneumoniae] gi|226807603|ref|YP_002791297.1| hypothetical protein pEC-IMP_036 [Enterobacter cloacae] gi|226809913|ref|YP_002791607.1| hypothetical protein pEC-IMPQ_035 [Enterobacter cloacae] gi|38259343|emb|CAE51568.1| hypothetical protein SMR0040 [Serratia marcescens] gi|146151002|gb|ABQ02768.1| conserved hypothetical protein [Klebsiella pneumoniae] gi|226425828|gb|ACO53921.1| hypothetical protein [Enterobacter cloacae] gi|226426139|gb|ACO54231.1| hypothetical protein [Enterobacter cloacae] Length = 668 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 20/118 (16%), Positives = 38/118 (32%), Gaps = 22/118 (18%) Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 T T A+ A L + + K + ITDG + + +++ Sbjct: 573 GYYTPTGSALMAAVDLLLDSQFDR----------KIIFLITDGYPNKSEFT-----IVEV 617 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 E + G++I V + + F V+D+ L K+ +I Sbjct: 618 MEKAKCNGIEIVGVGIKTD-------EIIGFETDTFVTVDDTSLLSIEVSKLVHQILS 668 >gi|34495937|ref|NP_900152.1| hypothetical protein CV_0482 [Chromobacterium violaceum ATCC 12472] gi|34330276|gb|AAQ58159.2| conserved hypothetical protein [Chromobacterium violaceum ATCC 12472] Length = 1040 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 36/310 (11%), Positives = 81/310 (26%), Gaps = 21/310 (6%) Query: 95 KDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 N + + + L+ GL P +T ++ + S N + + + Sbjct: 68 TTVNGSQYMTVAATSDDASINDVLYSSGLNPVFMTYGTVSPSNPYSYYSLNSYKTGSVTV 127 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 S + N + + W ++S A A A + Sbjct: 128 SYSLMAPGGFSGWGTSPTNAGYVPFTPQVLYAQRGWGYGGSQSYSAAATNVAMTNLGSPS 187 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 S + + + S I G + + + + Sbjct: 188 ASQLTSAYNSFSPFLQPETNSTSTSEIKAAAGQSPIYALLKTAQTSFTSTS----SSSGC 243 Query: 275 TN------TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS-GASAYQNTLN 327 T + ++ + +T F DG + + Q L+ Sbjct: 244 TPQKYVVLITDGLPT--QDKSGYYWPPAGSAAATGYGVTASFNADGSLNIAGTNDQALLD 301 Query: 328 TLQICEYMRNAGMKIYSVAVS------APPEGQDLLRKCTDSS--GQFFAVNDSRELLES 379 + + + +AG+K Y + + P L + G ++ + Sbjct: 302 AISQIQTLASAGIKTYVIGLGAGVAPATNPTAAQTLTAMAVAGNTGSYYPAVSPAAFSSA 361 Query: 380 FDKITDKIQE 389 I +IQ+ Sbjct: 362 LGSILVQIQK 371 >gi|310818521|ref|YP_003950879.1| hypothetical protein STAUR_1248 [Stigmatella aurantiaca DW4/3-1] gi|309391593|gb|ADO69052.1| conserved uncharacterized protein [Stigmatella aurantiaca DW4/3-1] Length = 1486 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 22/210 (10%), Positives = 49/210 (23%), Gaps = 15/210 (7%) Query: 167 KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 + + + L+ S + D + Sbjct: 280 ADDRADYIAKINSLVFNTSTPLARSLLNAGYYFTSHQGVYRDTTDGGFGFGNANPLTGYS 339 Query: 227 AIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYR 286 + KN ++ T G + +++ + L TN + Sbjct: 340 YPADFKNDALTSETRTVCWGCQSSSIIMITDGEPT----NDSL----GTNVVTRIRAING 391 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGEN---SGASAYQNTLNTLQICEYMRNAGMKIY 343 N S + ++ D + +NT + + +Y Sbjct: 392 GPVNCPPSRPCNDNTGNTNNDANYLLDDVAKLLYTSDLQRNTPPVVGELNTSNQQTVNVY 451 Query: 344 SVAVSAPPEGQDLLRKCTD-SSGQFFAVND 372 +V LL+ D G ++ D Sbjct: 452 TVGFGINSN---LLKNTADVGGGLYYTAED 478 >gi|27381752|ref|NP_773281.1| hypothetical protein blr6641 [Bradyrhizobium japonicum USDA 110] gi|27354921|dbj|BAC51906.1| blr6641 [Bradyrhizobium japonicum USDA 110] Length = 406 Score = 38.7 bits (88), Expect = 1.7, Method: Composition-based stats. Identities = 9/57 (15%), Positives = 19/57 (33%) Query: 10 FLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKK 66 + AI R + A +AA L+G ++ ++ + I + Sbjct: 150 LVVAGIAIRRLRTETARTALSRAAEAARLAGIPALSAEVDSANRILDTPAARLIARG 206 >gi|315081597|gb|EFT53573.1| TadE-like protein [Propionibacterium acnes HL078PA1] Length = 169 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 18/134 (13%), Positives = 36/134 (26%), Gaps = 17/134 (12%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+I+ + A I ++ QSA A +G + I Sbjct: 43 AALILPALLMIAAAATGSWRISEVKADAQSAAQVAARAGSVASSVGEGIAVG-------Q 95 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + ++ I ++ D+ T ++ I +L + Sbjct: 96 RVGLAELAGTRCSNPAIAVDSSDLTLPVGFAGTA----------SARVSCTIKLSDLLVP 145 Query: 122 GLIPSALTNLSLRS 135 G+ S S Sbjct: 146 GMPGSFHIESVAHS 159 >gi|167382730|ref|XP_001736239.1| hypothetical protein [Entamoeba dispar SAW760] gi|165901405|gb|EDR27488.1| hypothetical protein EDI_092790 [Entamoeba dispar SAW760] Length = 473 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 28/268 (10%), Positives = 79/268 (29%), Gaps = 23/268 (8%) Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP 185 L+ + + +I V+ ++ + + SN L Sbjct: 43 YGQYPLNTNYSQSSTDYCYSCKTNIFDVVTLNYDSNTGLSSQVRADAVWLSNVQLPKTKD 102 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN--SIQKAIQEKKNLSVRIGTIAY 243 F S + +Y P ++ + L + S + R ++ Y Sbjct: 103 TNYFVSTPSAFLQYPTIEKPFIDPPYLVPLNVVVLYDISSSNSRPSTLFSFFKRFTSMQY 162 Query: 244 NIGIVGNQCTP--------LSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESS 295 + N ++++N++ L K++ T A + ++ + Sbjct: 163 LYFLTFNHNPQNVLHRGNCTTDSINQLMYSLCKISYGGGTRYRDAFNKMIDDIKQDIFDK 222 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQD 355 TI + ++DG QN + + + + ++++ Sbjct: 223 TQTID-------IFLVSDGNE-----IQNRKLLRKSLDRLNEYKISVHTIDTGDGTWKDK 270 Query: 356 LLRKCTDSSGQFFAVNDSRELLESFDKI 383 L + ++ ++ + + F+ + Sbjct: 271 LFE-ISCEQNGYYINSNQQGIDTLFNAL 297 >gi|109077204|ref|XP_001095246.1| PREDICTED: integrin alpha-2 [Macaca mulatta] Length = 1180 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 28/187 (14%), Positives = 63/187 (33%), Gaps = 24/187 (12%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 ++ N + + + ++G I Y E+ ++ + + Sbjct: 189 DAVKNFLEKFVQ-GLDIGPTKTQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQHGG 247 Query: 275 --TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TNT+ A+ +A + Y+ + K ++ +TDGE+ S + Sbjct: 248 DLTNTFGAIQYARKYAYSAASGGRRSAT-----KVMVVVTDGESHDGSM------LKAVI 296 Query: 333 EYMRNAGMKIYSVAV----SAPPEGQDLL----RKCTD--SSGQFFAVNDSRELLESFDK 382 + + + + +AV + L + + FF V+D LLE Sbjct: 297 DQCNHDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGT 356 Query: 383 ITDKIQE 389 + ++I Sbjct: 357 LGEQIFS 363 >gi|325686522|gb|EGD28550.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK72] Length = 458 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 31/287 (10%), Positives = 79/287 (27%), Gaps = 29/287 (10%) Query: 123 LIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLP 182 A + + + I++ V D+S SM+ + +NN + L Sbjct: 173 FSKVAKGKKGIAIAYRTDPIQGQMNIAVSFVFDISGSMKGALNGANPTSNNPSRMDILRD 232 Query: 183 P--------PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK--AIQEKK 232 + TT S A ++D + + +++ + Sbjct: 233 KAEIMINELQSVGNVSVNLTTFSTTGSYKQAAFSQLDREAGTIKESIKNLKSDGGVTNPG 292 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + L++ + + + H L+ Sbjct: 293 DGLRYGMVSLQKQHAQLKYVVLLTDGVPNAYLVNQQGQAGGLEMKREGIQHFNNVLFELS 352 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM-RNAGM---KIYSVAV- 347 + + RL G + + ++ + + G+ ++ + Sbjct: 353 SNVPASFNYGRL---------GYDYTSKIPDIRDASIDYAGAVSKAYGVGVKRVNVIGFS 403 Query: 348 SAPP---EGQDLLRKCTDSS--GQFFAVNDSRELLESFDKITDKIQE 389 GQ+L K + ++ + + L ++F I +IQ+ Sbjct: 404 GLEHEIAYGQNLTDKIGEGGMETKYVSATNKEALQKTFSDIKKQIQQ 450 >gi|302804190|ref|XP_002983847.1| hypothetical protein SELMODRAFT_423092 [Selaginella moellendorffii] gi|300148199|gb|EFJ14859.1| hypothetical protein SELMODRAFT_423092 [Selaginella moellendorffii] Length = 557 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 21/177 (11%), Positives = 58/177 (32%), Gaps = 10/177 (5%) Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 + + +V L+N++ Q + A ++G Q T L Sbjct: 207 SNPTGVLSSQTRFNVANNIIKKLLNTLTNGDQVAVSTIGGEKIGAPVSVVLGVQETSL-- 264 Query: 258 NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 +L + S + ++ TN+ + + + + ++ +I TDG+ Sbjct: 265 DLAGISSLKDSISNTSVTNSASNIKNGLQAALD-------FFNTSSNLNVIILFTDGQFV 317 Query: 318 GASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSR 374 + T + + + + G+ ++ + + ++ S + V + Sbjct: 318 TPGNFNFTQLSPVLAQ-LNARGVVVFVYRIGSFTTNDATFQQMQSSLNMSYEVINDD 373 >gi|282853088|ref|ZP_06262425.1| conserved hypothetical protein [Propionibacterium acnes J139] gi|282582541|gb|EFB87921.1| conserved hypothetical protein [Propionibacterium acnes J139] gi|314922731|gb|EFS86562.1| conserved hypothetical protein [Propionibacterium acnes HL001PA1] gi|314965815|gb|EFT09914.1| conserved hypothetical protein [Propionibacterium acnes HL082PA2] gi|314982958|gb|EFT27050.1| conserved hypothetical protein [Propionibacterium acnes HL110PA3] gi|315091262|gb|EFT63238.1| conserved hypothetical protein [Propionibacterium acnes HL110PA4] gi|315094498|gb|EFT66474.1| conserved hypothetical protein [Propionibacterium acnes HL060PA1] gi|315105219|gb|EFT77195.1| conserved hypothetical protein [Propionibacterium acnes HL050PA2] gi|327328992|gb|EGE70752.1| putative membrane protein [Propionibacterium acnes HL103PA1] Length = 169 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 16/134 (11%), Positives = 35/134 (26%), Gaps = 17/134 (12%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+I+ + A I ++ QSA A +G + +I Sbjct: 43 AALILPALLMIAAVATGSWRISEVKADAQSAAQVAARAGSVASSVGESIA---------- 92 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + ++ L + + + ++ I +L + Sbjct: 93 -VGQRVGLAELAGTRCSNPAIAVDSSDLTLPV------GSAGTTSARVSCIIKLSDLLVP 145 Query: 122 GLIPSALTNLSLRS 135 G+ S S Sbjct: 146 GMPGSFHIESVAHS 159 >gi|332535276|ref|ZP_08411078.1| type IV fimbrial biogenesis protein PilY1 [Pseudoalteromonas haloplanktis ANT/505] gi|332035307|gb|EGI71811.1| type IV fimbrial biogenesis protein PilY1 [Pseudoalteromonas haloplanktis ANT/505] Length = 858 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 15/135 (11%), Positives = 43/135 (31%), Gaps = 16/135 (11%) Query: 270 NPYENTNTYPAMHHAYR---ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +P ++ + + + + + + G+ Sbjct: 20 DPSSGSDNDSDIDISVEHNKYFGRYLTTGDRYYYNMLHQLAKVIHGTGDTIVDLYPSTP- 78 Query: 327 NTLQICEYMRNAGMKIYSVAVSAP--PEGQDLLRKCTD-SSGQFFAVNDSRELLESFDKI 383 + G ++Y++ + +GQ+LL+ G+ +L E+ + Sbjct: 79 -------EINETG-RLYTIGFGSGMTEKGQNLLKLAATLGGGKNLPATTPEKLSEALNNA 130 Query: 384 TDKIQEQ-SVRIAPN 397 +I+++ S AP+ Sbjct: 131 VAEIRQENSTFTAPS 145 >gi|148555258|ref|YP_001262840.1| hypothetical protein Swit_2343 [Sphingomonas wittichii RW1] gi|148500448|gb|ABQ68702.1| hypothetical protein Swit_2343 [Sphingomonas wittichii RW1] Length = 166 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 17/144 (11%), Positives = 37/144 (25%), Gaps = 9/144 (6%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSG--CASIVSDRTIKDPTTKKDQ 59 A++ + F +T +D+A R ++ SA++ + + V I + Sbjct: 21 FALLSVLFFFVMTAGLDIAMWYQQRLRLDSAVEQGAMIAFNSRASVDQSAIGTYVAAAAK 80 Query: 60 TSTIFKKQIKKHLKQGSYIRENA-------GDIAQKAQINITKDKNNPLQYIAESKAQYE 112 S+ I + + T + Y A Sbjct: 81 LSSAPTVSISCNGNATCANSGRTCACISGGAPTYSALGCDKTCSDGSLPGYYMRISATAT 140 Query: 113 IPTENLFLKGLIPSALTNLSLRST 136 T + L + + Sbjct: 141 ASTLLVPAAMLGGTMTQTRTAMVR 164 >gi|254492197|ref|ZP_05105371.1| type I secretion target GGXGXDXXX repeat protein domain protein [Methylophaga thiooxidans DMS010] gi|224462522|gb|EEF78797.1| type I secretion target GGXGXDXXX repeat protein domain protein [Methylophaga thiooxydans DMS010] Length = 2740 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 35/294 (11%), Positives = 78/294 (26%), Gaps = 22/294 (7%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQK--AQINITKDKNN 99 ++ D I T + F + + + GD + T Sbjct: 1929 SATGDDNPITSGTISYTVGADGFDSVSLSTTADTTGLMTHDGDAVDTIWDSGSNTLIGYV 1988 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 +++ Q T +T L + + VLD S Sbjct: 1989 AGTDSSDAANQVFTVTLTNINASGADYDVTLLQAVKHPDADNENNVDFTVNVSVLDNDGS 2048 Query: 160 MEDLYLQKHNDNNNMTSNKYLLPPP-----PKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 D++ ++ + + S A ++ V+I Sbjct: 2049 EGITSFNVSIDDDVPVASPEENSGEATLEVNTNLMMILDVSGSMNDSANFQGMTRLQVMI 2108 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 +S+ L++ N+ + + G +++ K+ + L N Sbjct: 2109 KSSLELLDQYDAYGDVMVNIITFATSASNPSGGWVT--------VDQAKAIILGLTAGGN 2160 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 TN A++ A + + F++DGE + + + Sbjct: 2161 TNYDDALNDAINAFALGGKLGDG-------QNISYFMSDGEPNSNNVSNSATVP 2207 >gi|94971019|ref|YP_593067.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] gi|94553069|gb|ABF42993.1| von Willebrand factor, type A [Candidatus Koribacter versatilis Ellin345] Length = 391 Score = 38.7 bits (88), Expect = 1.8, Method: Composition-based stats. Identities = 7/42 (16%), Positives = 21/42 (50%), Gaps = 1/42 (2%) Query: 352 EGQDLLRKCTD-SSGQFFAVNDSRELLESFDKITDKIQEQSV 392 +G+ +L + + G+ F + + + +I ++++ Q V Sbjct: 306 DGKKILTEIASKTGGRMFEASKKENVEAIYAQIAEELRSQYV 347 >gi|219848640|ref|YP_002463073.1| hypothetical protein Cagg_1736 [Chloroflexus aggregans DSM 9485] gi|219542899|gb|ACL24637.1| conserved hypothetical protein [Chloroflexus aggregans DSM 9485] Length = 170 Score = 38.3 bits (87), Expect = 1.8, Method: Composition-based stats. Identities = 11/55 (20%), Positives = 25/55 (45%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTT 55 + A+++ + LF+ I+LA++ +++ AL A S I ++ Sbjct: 12 LVALMLPILTLFVLVVIELANLWLAVAKLEDALQQATRSAVQQIDYAVLARNGQA 66 >gi|172062966|ref|YP_001810617.1| TadE family protein [Burkholderia ambifaria MC40-6] gi|171995483|gb|ACB66401.1| TadE family protein [Burkholderia ambifaria MC40-6] Length = 142 Score = 38.3 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 15/138 (10%), Positives = 38/138 (27%), Gaps = 15/138 (10%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+++ + + ID++ ++ + + +A A +G V T + Sbjct: 19 FALMLPFLLMVLIGIIDVSLLLCDKAVITNASREAARAGVVLRVPMLTPTQ-------IA 71 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + L G + +T+ Y L Sbjct: 72 NVALSYTQNSLVSGGTGTV--------PTVAVTQANGTTSGTALTVTVTYTYSGLVLGTA 123 Query: 122 GLIPSALTNLSLRSTGII 139 + + +S S + Sbjct: 124 LSVLTGPITISASSVMLY 141 >gi|32473706|ref|NP_866700.1| hypothetical protein RB5442 [Rhodopirellula baltica SH 1] gi|32444242|emb|CAD74239.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 384 Score = 38.3 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 34/339 (10%), Positives = 82/339 (24%), Gaps = 41/339 (12%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 +G + + + + + + + + ++ + + Sbjct: 68 AGTIEQPTTSVGMAMAYRLPDRTRYVTEDSSEESDAATEDASDRQVDERSKSVDQSDAEQ 127 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + A S + L GL L D Sbjct: 128 SQSTASAASAPPAGFVP-PVDLDGLFAEMTRRGVAAGESQGTGVEGVLQFGDGKTADQLG 186 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + E + + + + S + S+ A + + Sbjct: 187 TGELVPGTSRAGDGAGQTTTSVFGVSGSGSTFVYVFDHSESMSASGGKPLRA-----AKQ 241 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN--------LNEVKSRLNKLN 270 L+ S++ + + I YN + ++ +N+ Sbjct: 242 ELIRSLRTL-----SERQQFQVIFYNDRPKAFSPDGQTTGLVFGEDGIRRRAEAFVNRTV 296 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T A+ A R + + F+TD SA Q Sbjct: 297 AVGGTEHQLALRMALRLAPDA----------------IFFLTDASIQTMSADQ----MSD 336 Query: 331 ICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCT-DSSGQF 367 I +G I+++ + PE +++ + G + Sbjct: 337 IRRRAEQSGTVIHAIQFGSGPEPANSFMKEIARQNRGGY 375 >gi|260813596|ref|XP_002601503.1| hypothetical protein BRAFLDRAFT_248604 [Branchiostoma floridae] gi|229286800|gb|EEN57515.1| hypothetical protein BRAFLDRAFT_248604 [Branchiostoma floridae] Length = 103 Score = 38.3 bits (87), Expect = 1.9, Method: Composition-based stats. Identities = 16/113 (14%), Positives = 41/113 (36%), Gaps = 15/113 (13%) Query: 277 TYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR 336 T A+ + R + E++ + + K +I +TDG ++ Q + Sbjct: 2 TGKAIRYVSRYGFAERDGAR-----PGVPKVLIVVTDGV--------SSDAVKQSALEAQ 48 Query: 337 NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 G+ ++++ V L + S+ V++ L + + + + Sbjct: 49 QKGIFVFAIGV--AGYDMAQLEQIASSNRSLAVVDNFNLLDSLRNSLLTGVCD 99 >gi|83749631|ref|ZP_00946614.1| hypothetical transmembrane protein [Ralstonia solanacearum UW551] gi|207728099|ref|YP_002256493.1| hypothetical protein RSMK04469 [Ralstonia solanacearum MolK2] gi|207744165|ref|YP_002260557.1| hypothetical protein RSIPO_02350 [Ralstonia solanacearum IPO1609] gi|83723692|gb|EAP70887.1| hypothetical transmembrane protein [Ralstonia solanacearum UW551] gi|206591344|emb|CAQ56956.1| conserved hypothetical protein [Ralstonia solanacearum MolK2] gi|206595569|emb|CAQ62496.1| conserved hypothetical protein [Ralstonia solanacearum IPO1609] Length = 345 Score = 38.3 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 42/321 (13%), Positives = 90/321 (28%), Gaps = 17/321 (5%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTI 63 I++ + A+D+AH+ +RN++Q+A DAA L+G A + + + Q ++ Sbjct: 23 ILLIIFLAIGAMAVDIAHLFVVRNELQNAADAAALAGAAGLYPANPKPNWSNGVAQGTSA 82 Query: 64 FKKQIKKHLKQGSYIREN-----AGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 K + K + G A +IT N+ + Sbjct: 83 VKLNASDNTKLTGGTVQAGYWNLTGSPAGMQSQSITPGSNDVPGVQVTVTRSPGNNGGPV 142 Query: 119 --FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTS 176 +L + ++ + +I S+ V + + N S Sbjct: 143 KGWLTWVFNGGAASIQATAVAVIAAPGSANPGSLFPVALNKCLFDLYWNYTTGQPLNDPS 202 Query: 177 NKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK---- 232 S+ + T + + + Q Sbjct: 203 TGQPYVIDINTSYPPSSMTCASGEWTGFNGPTDASTEKSLVSSGNPTNISIGQNINISTG 262 Query: 233 -NLSVRIGTIAYNIGIVGNQCTPLSNNLNE-----VKSRLNKLNPYENTNTYPAMHHAYR 286 SV A + + +PL+ + K+ + + A + Sbjct: 263 VKTSVYNAIPALPLTVTMPVVSPLTPGATSPVYAFAGFTITKIVTNGSHSYIEGHFTANQ 322 Query: 287 ELYNEKESSHNTIGSTRLKKF 307 ++ N S G+ + Sbjct: 323 KVVNSGGGSGTYYGAYVPPRL 343 >gi|290243156|ref|YP_003494826.1| von Willebrand factor type A [Thioalkalivibrio sp. K90mix] gi|288945661|gb|ADC73359.1| von Willebrand factor type A [Thioalkalivibrio sp. K90mix] Length = 615 Score = 38.3 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 14/124 (11%), Positives = 39/124 (31%), Gaps = 30/124 (24%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T A+ ++ ES K ++ ITDG + ++ + ++ Sbjct: 516 STGATPMSNAILGVLPSMFARSESR----------KVMLVITDGAPN------DSESAME 559 Query: 331 ICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 R+ +++Y++ + F V ++ + ++ + I Sbjct: 560 AIRMARDVNVEMYAIGIETD-------------PSHLFGVENTTVIQSV-GELAENIFGL 605 Query: 391 SVRI 394 + Sbjct: 606 LTPV 609 >gi|260768633|ref|ZP_05877567.1| membrane protein-like protein [Vibrio furnissii CIP 102972] gi|260616663|gb|EEX41848.1| membrane protein-like protein [Vibrio furnissii CIP 102972] Length = 546 Score = 38.3 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 19/126 (15%), Positives = 36/126 (28%), Gaps = 6/126 (4%) Query: 10 FLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIK 69 +F + A+D A ++Y R Q+QS D A + D+ + F ++ Sbjct: 23 VIFASVALDTARLVYQRQQLQSVADLAATEVGLQNPYYLDDRQKEAILDRLTARFSGKVD 82 Query: 70 KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALT 129 + + N + ++L GL S Sbjct: 83 TLTIDYGTAKIINKMWVVDTG---STPDNGYPAAKVTVT---KTVPQSLVAGGLFNSNSM 136 Query: 130 NLSLRS 135 L + Sbjct: 137 TLYAEA 142 >gi|294140776|ref|YP_003556754.1| hypothetical protein SVI_2005 [Shewanella violacea DSS12] gi|293327245|dbj|BAJ01976.1| conserved hypothetical protein [Shewanella violacea DSS12] Length = 438 Score = 38.3 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 11/64 (17%), Positives = 27/64 (42%), Gaps = 8/64 (12%) Query: 5 IISVCFLFITYAI--------DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTK 56 I V F +++ D H++ + ++Q+A+D++VL + +D ++ Sbjct: 22 AILVMFTIGLFSLLAVAALALDGGHLLLNKGRLQNAVDSSVLYAAKILQNDGSLFQAREA 81 Query: 57 KDQT 60 Sbjct: 82 ATLI 85 >gi|242048544|ref|XP_002462018.1| hypothetical protein SORBIDRAFT_02g012640 [Sorghum bicolor] gi|241925395|gb|EER98539.1| hypothetical protein SORBIDRAFT_02g012640 [Sorghum bicolor] Length = 333 Score = 38.3 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 9/53 (16%), Positives = 18/53 (33%), Gaps = 3/53 (5%) Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCT--DSSGQFFAVNDSRELLESFDK 382 ++ +Y+ A +L + F VND +L +F + Sbjct: 14 AADVKIGNAPVYTFGFGAD-YDPTVLNAVARNSTGETFSVVNDVDKLSMAFSQ 65 >gi|220923783|ref|YP_002499085.1| Fibronectin type III domain-containing protein [Methylobacterium nodulans ORS 2060] gi|219948390|gb|ACL58782.1| Fibronectin type III domain protein [Methylobacterium nodulans ORS 2060] Length = 3238 Score = 38.3 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 22/208 (10%), Positives = 57/208 (27%), Gaps = 6/208 (2%) Query: 27 NQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIA 86 ++ +L AAV++ +S T +S L A + Sbjct: 1384 AKLSGSLPAAVVA----NISGLTGAQLAANAQISSDQIAGLAAAKLTTQITKTNIADNAV 1439 Query: 87 QKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENL 146 Q+N + A + ++ + ++N+ L L S ++ Sbjct: 1440 DTPQLNAGAVSTAKIAAGAVTTSRLAVASQNILFGSEFGQGLGMQGLTSGQSGGVVGLSV 1499 Query: 147 AISI--CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 I+ ++ S ++ N + P + ++ A Sbjct: 1500 TINPASAWTINRSNALMIHASGAQPANAVVDVYFQQPLPNGTSQRYPVLAGQTYEWSAYL 1559 Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKK 232 +R + + + + ++ Sbjct: 1560 STHRCAGAVHLAFYDAAGNWISSVASAS 1587 >gi|158314291|ref|YP_001506799.1| von Willebrand factor type A [Frankia sp. EAN1pec] gi|158109696|gb|ABW11893.1| von Willebrand factor type A [Frankia sp. EAN1pec] Length = 432 Score = 38.3 bits (87), Expect = 2.0, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 41/134 (30%), Gaps = 21/134 (15%) Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 + ++ + +N+++ T + A G + I +TDG Sbjct: 106 DTRDQARRTINRMSAAGGTAIGSWLLAARDLF----------AGHPDAVRHAILLTDGR- 154 Query: 317 SGASAYQNTLNTLQICE---YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDS 373 + + + + L C + G+ VA LL G + D Sbjct: 155 NEHESAADFTDALAACRGHFQCDSRGVGRGWVAAELTAVSDTLL-------GTARDIADP 207 Query: 374 RELLESFDKITDKI 387 +L+ F +T Sbjct: 208 ADLVADFQAMTKAA 221 >gi|221128911|ref|XP_002157051.1| PREDICTED: similar to tenascin C, partial [Hydra magnipapillata] Length = 2678 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 18/220 (8%), Positives = 48/220 (21%), Gaps = 19/220 (8%) Query: 129 TNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKS 188 L+ + + + S S + + + Sbjct: 1960 IELTNSIFCDEQACPQIDVTNWSSWTGCSSSCVNTASPSQWRWRSCSCGNCSQCDNSVTV 2019 Query: 189 FWSKNTTKSKYAPAPAP-------ANRKIDVLIESAGNLVNSIQK--AIQEKKNLSVRIG 239 S+ + + + L + K A + G Sbjct: 2020 VDSRTCNNVPCDSSCRSSMISYIIVIDSSSSVTDYYWGLERNFVKRFAASVGFTSNTTFG 2079 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNP-YENTNTYPAMHHAYRELYNEKESSHNT 298 + + + + ++ ++ T + A + + Sbjct: 2080 LVNFASTARTEFGCRSFTDATSFNAAIDSVSMITGGTAINAGLTQAINLMQSPACRG--- 2136 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 KK ++F TDG+ + T + + N Sbjct: 2137 ------KKVLLFTTDGQENIEKDPSKISATYETVRSLANY 2170 >gi|332298718|ref|YP_004440640.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] gi|332181821|gb|AEE17509.1| von Willebrand factor type A [Treponema brennaborense DSM 12168] Length = 566 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 20/105 (19%), Positives = 40/105 (38%), Gaps = 13/105 (12%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 PL+++LN V+S L L+P T+T ++ N + ++ Sbjct: 149 AVPLTSDLNAVRSLLPSLSPALLTSTGSSIGSGIEAAVNSFPPLSAA------ARTIVVF 202 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 TDGE + + + ++ G+ + V + E + L Sbjct: 203 TDGEETDGRMSG------AVADALK-FGIPVVFVGFGSDTESEIL 240 >gi|297171266|gb|ADI22273.1| hypothetical protein [uncultured Gemmatimonadales bacterium HF0200_36I24] Length = 359 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 18/93 (19%), Positives = 36/93 (38%), Gaps = 10/93 (10%) Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD-SS 364 + VI ++DG+ + M + + SVAV + LL + + Sbjct: 14 RHVILLSDGQTYPDKY-------ESLVTRMAKDDISVSSVAVGQES-DRALLADIAEWGN 65 Query: 365 GQFFAVNDSRELLESFDKITDKIQEQSVRIAPN 397 G+ + + D+ + + F + +I Q I N Sbjct: 66 GRSYFILDAARVPQVFIQ-ETQIASQQTLIEEN 97 >gi|253998722|ref|YP_003050785.1| outer membrane adhesin-like protein [Methylovorus sp. SIP3-4] gi|253985401|gb|ACT50258.1| outer membrane adhesin like proteiin [Methylovorus sp. SIP3-4] Length = 1536 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 37/368 (10%), Positives = 92/368 (25%), Gaps = 38/368 (10%) Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ 110 T S + G+ I + + Sbjct: 887 DSSGTVDVTNSASAHGTLSISDPDGNAITSVVLTAPTGDYYSGGQLITWSGSGTGTLTGS 946 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 NL ++ + L +++ + + L + + Sbjct: 947 AGGQPVLTITINNGGEYDVNLLRPLDHPDTKTEDVLTLNVGVSATANGVTSVGNLTVNVE 1006 Query: 171 NNNMTSNKYLLP--PPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 +++ +N + + + S ++ I+S L++ Sbjct: 1007 DDSPVANPISANLSTTDTNLLITLDISGSMRTQDGVGGTTRLASAIQSIKTLLDKYDALG 1066 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK-SRLNKLNPYENTNTYPAMHHAYRE 287 RI + ++ ++ + + + ++ P NTN A+ +A Sbjct: 1067 DT------RISLVVFSTTAAQVGTDWMTIDQAKAQLDQILVNGPKGNTNYDSALANAMDA 1120 Query: 288 LYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL----------------NTLQI 331 + T + FI+DGE + S +L L Sbjct: 1121 FDDAG-------KLTNAQNVAYFISDGEPNTGSGSNTSLTGSTNTNGSDAGIQTQEELAW 1173 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA-----VNDSRELLESFD-KITD 385 ++ +K Y++ V + + L V +L I + Sbjct: 1174 KTFLEANQIKSYAIGVGSDINSVNALNPVAYDGQTNTNMDGILVTSFTQLDAVLAGTIQN 1233 Query: 386 KIQEQSVR 393 + + + Sbjct: 1234 QAAGELIT 1241 >gi|120602704|ref|YP_967104.1| diguanylate cyclase/phosphodiesterase with PAS/PAC and GAF sensor(s) [Desulfovibrio vulgaris DP4] gi|120562933|gb|ABM28677.1| diguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) [Desulfovibrio vulgaris DP4] Length = 997 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 14/115 (12%), Positives = 35/115 (30%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 T + + V F F A+ ++ R Q+ +A + SI Sbjct: 10 TLLGMLVLFGFYALAVSAGSDIFTRRQLMRESEATARAVAQSIAGSSMDSLYGLDPITIQ 69 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 + + + + + + ++GDI + + L + + + Sbjct: 70 SRIDQYLSVPGVRYALVTASSGDIVAHTFAPVVPEDVRALARSLSDARKADSTFQ 124 >gi|46579825|ref|YP_010633.1| sensory box protein [Desulfovibrio vulgaris str. Hildenborough] gi|46449240|gb|AAS95892.1| sensory box/GGDEF domain/EAL domain protein [Desulfovibrio vulgaris str. Hildenborough] gi|311234168|gb|ADP87022.1| diguanylate cyclase/phosphodiesterase with PAS/PAC sensor(s) [Desulfovibrio vulgaris RCH1] Length = 997 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 14/115 (12%), Positives = 35/115 (30%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 T + + V F F A+ ++ R Q+ +A + SI Sbjct: 10 TLLGMLVLFGFYALAVSAGSDIFTRRQLMRESEATARAVAQSIAGSSMDSLYGLDPITIQ 69 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTE 116 + + + + + + ++GDI + + L + + + Sbjct: 70 SRIDQYLSVPGVRYALVTASSGDIVAHTFAPVVPEDVRALARSLSDARKADSTFQ 124 >gi|326385751|ref|ZP_08207380.1| TadE-like protein [Novosphingobium nitrogenifigens DSM 19370] gi|326209730|gb|EGD60518.1| TadE-like protein [Novosphingobium nitrogenifigens DSM 19370] Length = 201 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 12/127 (9%), Positives = 36/127 (28%), Gaps = 5/127 (3%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAV---LSGCASIVSDRTIKDPTTKKD 58 AI + ++ + + +++ + A L+G A Sbjct: 29 FAIAAPFLIGILMATFEIIILFLAQAALETTAEGAARYVLTGQAQTNFTGVKDSNGKVIT 88 Query: 59 QTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 F + + ++ N + + + T + Q+ ++ Y++ Sbjct: 89 TPQQQFAAYVCTQMSS--FMSCNNLYVDVNSGSDYTTVDLSVPQFTFDATNNYKVTNTFN 146 Query: 119 FLKGLIP 125 + G Sbjct: 147 YNPGTQG 153 >gi|302037436|ref|YP_003797758.1| hypothetical protein NIDE2113 [Candidatus Nitrospira defluvii] gi|300605500|emb|CBK41833.1| conserved protein of unknown function [Candidatus Nitrospira defluvii] Length = 359 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 22/207 (10%), Positives = 59/207 (28%), Gaps = 20/207 (9%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLS-------------GCASIVSDR 48 A+ +S +D + ++Q+Q+ DAA L+ S Sbjct: 32 VAVSLSALVSLAALVLDGGAALVSKHQLQNLADAAALAGARQLGKLYESQPAAISPPQPL 91 Query: 49 TIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESK 108 ++ + + + + ++ + + A + +N + ++ Sbjct: 92 SLANRARVQAVVAEVAERNRDATKLAAVTLSDVRVGRWNAATRTVVSSQNGLDAVLVRAE 151 Query: 109 AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKH 168 FL G++ ++S + + +E + L V S + L Sbjct: 152 GS-----MPTFLAGVVGIRRLSVSATAVSALSALAEVQP--GALTLPVGLSSAWMALGPI 204 Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTT 195 + P + ++ Sbjct: 205 DGRRLQVYAPGGGDPCAGWTTFTDTPP 231 >gi|168216759|ref|ZP_02642384.1| von Willebrand factor type A domain protein [Clostridium perfringens NCTC 8239] gi|182381207|gb|EDT78686.1| von Willebrand factor type A domain protein [Clostridium perfringens NCTC 8239] Length = 1341 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 27/185 (14%), Positives = 56/185 (30%), Gaps = 29/185 (15%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN--TNTYPAM 281 I K K+ Y + L N + KS+ N L E TN + Sbjct: 244 INKIDNYKEYFYNPESLDEYRKQDFQKR--GLRTNTDNDKSKFNNLILNEYKITNADSYI 301 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 + Y Y N G + + G +L++ + M++ G+ Sbjct: 302 NKIYGNPYCINVCPPNNQGGKNAICYYMGSEIGY----------AYSLKMAKEMKDLGIS 351 Query: 342 IYSVAVSAPPEG----QDLLRKCT-----------DSSGQFFAVNDSRELLESFDKITDK 386 Y + + ++ ++ + S ++ + + L + F + Sbjct: 352 TYPIYFDTKSKDSKQRENTMKLISKFAGNHEDDDVTSVNTVYSNDITESLKKVFKDLDKN 411 Query: 387 IQEQS 391 I+E Sbjct: 412 IKESY 416 >gi|108761783|ref|YP_632825.1| hypothetical protein MXAN_4659 [Myxococcus xanthus DK 1622] gi|108465663|gb|ABF90848.1| hypothetical protein MXAN_4659 [Myxococcus xanthus DK 1622] Length = 694 Score = 38.3 bits (87), Expect = 2.1, Method: Composition-based stats. Identities = 7/87 (8%), Positives = 34/87 (39%), Gaps = 4/87 (4%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAA----VLSGCASIVSDRTIKDPTTK 56 + A+++ + + + +++ H ++ R ++Q+ D+A + Sbjct: 20 LAALMVLIMSIAVLTTVNIGHTVHERIRLQNTADSASYSMAAMEARAFNFYAYANRTQVS 79 Query: 57 KDQTSTIFKKQIKKHLKQGSYIRENAG 83 ++ +++ + +++ + G Sbjct: 80 HYVSAMMWQSLLSLIYSAEAFLTDIYG 106 >gi|170742065|ref|YP_001770720.1| hypothetical protein M446_3920 [Methylobacterium sp. 4-46] gi|168196339|gb|ACA18286.1| conserved hypothetical protein [Methylobacterium sp. 4-46] Length = 418 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 21/216 (9%), Positives = 61/216 (28%), Gaps = 12/216 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + +++ V AIDL+ + ++ +++ V + AS+ + + D T + Sbjct: 15 LASLLSPVGLGVAALAIDLSTLQMVKQRLK------VTADAASLAAVAVLPDTGTALSRA 68 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ-YEIPTENLF 119 I + + A T + Sbjct: 69 LAIAADNAGAGAGSVTTAADVQFGSYDSATRTFTAGATPANAVQVTASRSEARGNPVITG 128 Query: 120 LKGLIPSALTNLSLRSTGIIERSSE-----NLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 + A +LS + + + + + S + + + + ++ + Sbjct: 129 FARALGWATPDLSASAVAVRFSPAYCFLVLDPSASGALSVSGTGRLSVPNCGVQVNSTSA 188 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKI 210 ++ + + S + +P P R + Sbjct: 189 SAATTGNNSTAQARSFCITGGYSGSSFSPVPITRCL 224 >gi|327189219|gb|EGE56398.1| hypothetical protein RHECNPAF_7008 [Rhizobium etli CNPAF512] Length = 524 Score = 38.3 bits (87), Expect = 2.2, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 52/177 (29%), Gaps = 20/177 (11%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSR 265 D L ++ L+ + + + + V N T N L + + + Sbjct: 359 NGEDQLQKAMRFLLTPDEASRVLVQWSPSDQIIVIPFDSSVRNMFTASGNPLEQEGLLNE 418 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +++ TN Y A +++ S ++ +TDG + S + Sbjct: 419 VSRQKADGGTNMYACAERALQQIARTGRLS-------TYLPAIVIMTDGRSDDQSQAFMS 471 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFD 381 + I+ + + L +S + F +L +F Sbjct: 472 EWNTI------EPRVPIFGITFG--DADKTQLDTLAKQTSARVFD--GGSDLATAFR 518 >gi|221068121|ref|ZP_03544226.1| outer membrane adhesin like proteiin [Comamonas testosteroni KF-1] gi|220713144|gb|EED68512.1| outer membrane adhesin like proteiin [Comamonas testosteroni KF-1] Length = 1268 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 35/331 (10%), Positives = 83/331 (25%), Gaps = 24/331 (7%) Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 +I + + G K + + + E+ + Sbjct: 738 SITSGGVAVTWSWDAGSSTLTGMAGAKEVMTVKVGALTEVGGRYEASYTVTLKGPVDHAV 797 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY-L 180 G + +L ++ + S + + + DV +D L + Sbjct: 798 GR-GANTLDLDFKAIVHDGKQSSEIGFVVEVKDDVPALADDAELVINLAKLQTNVMIVLD 856 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 L K ++ + ++ L+N ++ V + Sbjct: 857 LSGSMAWDSNGKVLPGGGSNAN-----SRLSLAKKALEALINKYEEY------GDVAVKL 905 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + +N T +S + +N L T A++ A N + + Sbjct: 906 VTFNGSTANAHATWMS--AATAIAIINGLTATGGTPYKAALNAAMG--TNGFADNVGKLT 961 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV-SAPPEGQDLLRK 359 ++ FITDG + L +++ + V + Sbjct: 962 GEGVQNVSYFITDGVPTLGQGVNPRLQAQ-WEDFLTTHHINSVGVGFGGIKTGDIPNIDP 1020 Query: 360 CTDSS-----GQFFAVNDSRELLESFDKITD 385 + N + EL + + Sbjct: 1021 IAYNGPAGVENAVVLANSAAELNSTLQDLIQ 1051 >gi|298502261|ref|YP_003724201.1| cell wall surface anchor family protein [Streptococcus pneumoniae TCH8431/19A] gi|298237856|gb|ADI68987.1| cell wall surface anchor family protein [Streptococcus pneumoniae TCH8431/19A] Length = 893 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 12/131 (9%), Positives = 31/131 (23%), Gaps = 25/131 (19%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +++ ++ Y L S G + N Sbjct: 473 SEMKAVGYAVIGDPINGGYIWLN--WRESILAY--PFNSNTAKITNHGAPTRWYYNGN-- 526 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQFFAVNDS----RE 375 + G +++V + + ++ + + V D+ + Sbjct: 527 --------IAPDGYDVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQ 578 Query: 376 LLESFDKITDK 386 L F I + Sbjct: 579 LNRYFHTIVTE 589 >gi|23008710|ref|ZP_00050041.1| hypothetical protein Magn03003732 [Magnetospirillum magnetotacticum MS-1] Length = 167 Score = 38.3 bits (87), Expect = 2.3, Method: Composition-based stats. Identities = 11/141 (7%), Positives = 30/141 (21%), Gaps = 7/141 (4%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 + + FI +D + ++ +A A+ ++ + + Sbjct: 31 GLTLIPIVGFIGLGVDYGMATAGKTRLDNAA-------DAAALAAVVSAKAYIVANAKQS 83 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 + + N S + T Sbjct: 84 NVTTLALTEGQNQAVKAFNINAGKLLYGTVSLATPQVTRNRQTLSSSITYTATIQTLFGR 143 Query: 123 LIPSALTNLSLRSTGIIERSS 143 + S T + T + + Sbjct: 144 IFGSQATTFTNTVTASADIAR 164 >gi|330982726|gb|EGH80829.1| hypothetical protein PSYAP_30018 [Pseudomonas syringae pv. aptata str. DSM 50252] Length = 162 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 14/114 (12%), Positives = 29/114 (25%), Gaps = 2/114 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M A+ + + L ID + + +Q D A L + Sbjct: 21 MAALTMGLALLCTLTVIDSGRLYLEKRSLQRVADIAALEAAGRRGTCSGTAASAPDFANQ 80 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIP 114 S + G + G + ++ + A + +P Sbjct: 81 SATRNGFVPNT--DGRTLVTRCGTLTVDVAGPRVFVADSTQALAIQVVAAHPVP 132 >gi|327390873|gb|EGE89213.1| cell wall surface anchor family protein [Streptococcus pneumoniae GA04375] Length = 886 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 12/131 (9%), Positives = 31/131 (23%), Gaps = 25/131 (19%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +++ ++ Y L S G + N Sbjct: 466 SEMKAVGYAVIGDPINGGYIWLN--WRESILAY--PFNSNTAKITNHGAPTRWYYNGN-- 519 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQFFAVNDS----RE 375 + G +++V + + ++ + + V D+ + Sbjct: 520 --------IAPDGYDVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQ 571 Query: 376 LLESFDKITDK 386 L F I + Sbjct: 572 LNRYFHTIVTE 582 >gi|258623679|ref|ZP_05718665.1| conserved hypothetical protein [Vibrio mimicus VM573] gi|258584045|gb|EEW08808.1| conserved hypothetical protein [Vibrio mimicus VM573] Length = 371 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 24/150 (16%), Positives = 55/150 (36%), Gaps = 23/150 (15%) Query: 246 GIVGNQCTPLSNNLNEVKSRLNKL---NPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 G TP + + + + L++ ++TN A+ + ST Sbjct: 185 GDAAFIQTPFTADQDVWLNLLDEAETGMAGQSTNLGDAIGLGIKVFEQSP--------ST 236 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE------GQDL 356 + ++ +TDG ++G+ + + +R IY +A+ P D+ Sbjct: 237 SQDQIMLVLTDGNDTGSFVSPVDAAKIAAAKGIR-----IYVIAMGDPENVGEQPLDMDV 291 Query: 357 LRKCTD-SSGQFFAVNDSRELLESFDKITD 385 + + + + + F D +L E++ I Sbjct: 292 VNRVSSLTQARSFVAIDQPQLNEAYQVIDQ 321 >gi|171320625|ref|ZP_02909645.1| TadE family protein [Burkholderia ambifaria MEX-5] gi|171094138|gb|EDT39225.1| TadE family protein [Burkholderia ambifaria MEX-5] Length = 142 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 15/138 (10%), Positives = 38/138 (27%), Gaps = 15/138 (10%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+++ + + ID++ ++ + + +A A +G V T + Sbjct: 19 FALMLPFLLMVLIGIIDVSLLLCDKAVITNASREAARAGVVLRVPMLTTTQ-------IA 71 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + L G + +T+ Y L Sbjct: 72 NVALSYTQNSLVSGGSAT--------APTVAVTQANGTTAGTALTVTVTYTYSGLVLGTA 123 Query: 122 GLIPSALTNLSLRSTGII 139 + + +S S + Sbjct: 124 LSVLTGPITISASSVMLY 141 >gi|209549179|ref|YP_002281096.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] gi|209534935|gb|ACI54870.1| von Willebrand factor type A [Rhizobium leguminosarum bv. trifolii WSM2304] Length = 522 Score = 38.3 bits (87), Expect = 2.4, Method: Composition-based stats. Identities = 13/121 (10%), Positives = 35/121 (28%), Gaps = 18/121 (14%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 + + +++ T+ Y A +++ S ++ +TDG + S Sbjct: 413 LLNEISRQKAGGGTDMYTCAAQALQQIARSDRLS-------TYLPAIVIMTDGRSDDQSQ 465 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESF 380 + + ++ + + L +S + F L +F Sbjct: 466 AFMSEW------NATEPHVPVFGITFG--DADKTQLDSLAKQTSARVFD--GGSNLATAF 515 Query: 381 D 381 Sbjct: 516 R 516 >gi|171057436|ref|YP_001789785.1| outer membrane adhesin-like protein [Leptothrix cholodnii SP-6] gi|170774881|gb|ACB33020.1| outer membrane adhesin like proteiin [Leptothrix cholodnii SP-6] Length = 1598 Score = 38.0 bits (86), Expect = 2.4, Method: Composition-based stats. Identities = 30/348 (8%), Positives = 80/348 (22%), Gaps = 24/348 (6%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 + ++ + Q S D I + Sbjct: 672 AGGQADDVGAHPGDDSTDARSFSGQLSFSDADSAQLSVTLSAPPDALSSGGQAIVWTGDG 731 Query: 100 PLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS 159 + + + LS + D Sbjct: 732 STTLVGHV-GTADGAEALRVSIDAGGAYEVTLSRPLDHGAAGEDRLSLGVGVHLSDGVHG 790 Query: 160 MEDLYLQKHNDNNNMTSNKYL-LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + + +D+ + N + S + + S ++ IE+ G Sbjct: 791 RDATIELQIDDDAPLARNDSVSATAARTNLLISLDVSGSMDTADGVAGATRLASAIEAIG 850 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 L++ + V + + + + ++ + L L T+ Sbjct: 851 QLLDRHEAM------GDVAVRLVTF--SSTAQAIGEVWTDVATARQLLASLQADGGTHYD 902 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ A + + + F++DGE + ++ Q ++ + Sbjct: 903 SALAAAQSAFLSAGRLAGA-------QNVSYFLSDGEPNRGHGI-DSAEQRQWESFVTSQ 954 Query: 339 GMKIYSVAVSAPPEGQDLLRKCTDSSGQ-----FFAVNDSRELLESFD 381 + ++ + Q + V+D +L Sbjct: 955 AIDSRAIGLG-DSTTQASMDGIAYDGSSGHDTDALRVSDFNQLDAVLS 1001 >gi|308472999|ref|XP_003098726.1| hypothetical protein CRE_04168 [Caenorhabditis remanei] gi|308268326|gb|EFP12279.1| hypothetical protein CRE_04168 [Caenorhabditis remanei] Length = 380 Score = 38.0 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 24/214 (11%), Positives = 69/214 (32%), Gaps = 21/214 (9%) Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN-SIQKAIQ 229 + + Y+ P + ++++ + + ++ + Sbjct: 14 ADTYSPLSYVDRPCGTDLSNLWLDVVLVVDNSQEMGSQRLHDVTSNILSVFGADTRIGSN 73 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK-LNPYENT---NTYPAMHHAY 285 + + R+G + YN N + +++++ + L NT + A Sbjct: 74 SVEPRTTRVGLVTYNSAATLNADLNQFQSFSDLRNGVISFLKVAANTKDSYLATGLAMAA 133 Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + L + H +K +I + L+ I + ++ +G+KI +V Sbjct: 134 QVLNVQGLRDH-------YQKVIIVYASKYSGYG-----DLDPQPIADRLKGSGVKIITV 181 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLES 379 A + +L + F + + ++ + Sbjct: 182 AYG----DETVLESLSSPRFGFNSASGYPQIQNA 211 >gi|262402640|ref|ZP_06079201.1| protein BatA [Vibrio sp. RC586] gi|262351422|gb|EEZ00555.1| protein BatA [Vibrio sp. RC586] Length = 335 Score = 38.0 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 24/177 (13%), Positives = 62/177 (35%), Gaps = 23/177 (12%) Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKL---NPYENT 275 + + + +K +++ G TP + + + L++ ++T Sbjct: 122 SRLTAAKKVLRDFVTQRQGDRFGLILFGDAAFIQTPFTADQEVWLNLLDEAETGMAGQST 181 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 N A+ + +S + ++ +TDG ++G+ + + + Sbjct: 182 NLGDAIGLGIKVFEQSPATSQ--------DQIMLVLTDGNDTGSFVSPVDAAKIAAAKGI 233 Query: 336 RNAGMKIYSVAVSAPPE------GQDLLRKCTD-SSGQFFAVNDSRELLESFDKITD 385 R IY +A+ P D++ + + + + F D +L E++ I Sbjct: 234 R-----IYVIAMGDPENVGEQPLDMDVVNRVSSLTQARSFVAIDQPQLNEAYQVIDQ 285 >gi|258647262|ref|ZP_05734731.1| BatB protein [Prevotella tannerae ATCC 51259] gi|260852911|gb|EEX72780.1| BatB protein [Prevotella tannerae ATCC 51259] Length = 339 Score = 38.0 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 26/160 (16%), Positives = 47/160 (29%), Gaps = 40/160 (25%) Query: 253 TPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 P++ + K LN + P T + A R+ K +I IT Sbjct: 144 LPITGDYGAAKLFLNSITPGMVTLQGTNLAAAINLADKSF------TDKKRVGKAIIIIT 197 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-----GQDLLRK-------- 359 DGE+ A + + K+Y + + P G D L+ Sbjct: 198 DGEDHQGGAEEAAAAAAKEGR-------KVYILGIGNPGGAQIPLGNDYLKDNNGQVVVT 250 Query: 360 ----------CTDSSGQFFAVNDS----RELLESFDKITD 385 G + V++S +L + ++ Sbjct: 251 KLNEQMCREVAKAGDGLYLHVDNSNAAQEQLQAALGQLQQ 290 >gi|148976670|ref|ZP_01813357.1| hypothetical protein VSWAT3_19761 [Vibrionales bacterium SWAT-3] gi|145964021|gb|EDK29279.1| hypothetical protein VSWAT3_19761 [Vibrionales bacterium SWAT-3] Length = 166 Score = 38.0 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 23/167 (13%), Positives = 50/167 (29%), Gaps = 37/167 (22%) Query: 255 LSNNLN--EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 LS+ ++ ++++ Y T+ A + L Sbjct: 8 LSDGYEGRDIDDYISEMPEYYTTHDSNA--TEFNILNPTHSVRPVAQYDPDSHSA----N 61 Query: 313 DGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSA---------PPEGQDLLRKCTDS 363 D +N + I E R+ + ++++ + + G+DLL + +S Sbjct: 62 DLYRRVNRVARNLV--EDIAEAARSEDIYVFTLGLGSSLTSATGPDSEFGEDLLLRMANS 119 Query: 364 ------------------SGQFFAVNDSRELLESFDKITDKIQEQSV 392 G + D L FD++ D I ++ Sbjct: 120 SALLDDPDLSSDYDPNQLEGVYCHAVDEEALGPCFDEMLDVIIRLTL 166 >gi|225860485|ref|YP_002741994.1| cell wall surface anchor family protein [Streptococcus pneumoniae Taiwan19F-14] gi|298229065|ref|ZP_06962746.1| cell wall surface anchor family protein [Streptococcus pneumoniae str. Canada MDR_19F] gi|298254848|ref|ZP_06978434.1| cell wall surface anchor family protein [Streptococcus pneumoniae str. Canada MDR_19A] gi|154432898|gb|ABS82085.1| ancillary pilus subunit [Streptococcus pneumoniae] gi|225726608|gb|ACO22459.1| cell wall surface anchor family protein [Streptococcus pneumoniae Taiwan19F-14] Length = 886 Score = 38.0 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 12/131 (9%), Positives = 31/131 (23%), Gaps = 25/131 (19%) Query: 267 NKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTL 326 +++ ++ Y L S G + N Sbjct: 466 SEMKAVGYAVIGDPINGGYIWLN--WRESILAY--PFNSNTAKITNHGAPTRWYYNGN-- 519 Query: 327 NTLQICEYMRNAGMKIYSVAVSAPPE-------GQDLLRKCTDSSGQFFAVNDS----RE 375 + G +++V + + ++ + + V D+ + Sbjct: 520 --------IAPDGYDVFTVGIGINGDPGTDEATATSFMQSISSKPENYTNVTDTTKILEQ 571 Query: 376 LLESFDKITDK 386 L F I + Sbjct: 572 LNRYFHTIVTE 582 >gi|302343418|ref|YP_003807947.1| hypothetical protein Deba_1988 [Desulfarculus baarsii DSM 2075] gi|301640031|gb|ADK85353.1| hypothetical protein Deba_1988 [Desulfarculus baarsii DSM 2075] Length = 342 Score = 38.0 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 25/222 (11%), Positives = 58/222 (26%), Gaps = 9/222 (4%) Query: 14 TYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQ-TSTIFKKQIK-KH 71 A+DL + + ++Q+A DAA +G A ++SD ++ Sbjct: 34 GAAVDLGVVYAGKAELQNAADAAATAGAAELLSDPDGDGVAQTDYDGARQSAIDFVESNQ 93 Query: 72 LKQGSYIRENAGDIA-----QKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPS 126 L + GD+ + + + ++ F + Sbjct: 94 LLTTPLVWNEEGDLVEAGQWSFDSNDFAQTGPSADPADLDAVRVAISRPVQTFFARAVGL 153 Query: 127 ALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQK--HNDNNNMTSNKYLLPPP 184 + S G + + + + + ++ + N N Sbjct: 154 GQVMVGAVSVGYLGCAGDGGQADLPLAINAAVLDGLGPDSDIVLNSENAENGQWTSFDVW 213 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 P + N + I + NL ++ Sbjct: 214 PTNTNSIGNFLDNPEQIPRLNIGDSIYMNNGEIANLFGRLET 255 >gi|306830098|ref|ZP_07463282.1| LPXTG cell wall surface protein [Streptococcus mitis ATCC 6249] gi|304427624|gb|EFM30720.1| LPXTG cell wall surface protein [Streptococcus mitis ATCC 6249] Length = 3183 Score = 38.0 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 22/186 (11%), Positives = 53/186 (28%), Gaps = 16/186 (8%) Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 I + ++ D S S N +T+ K Sbjct: 69 KPIDLVILQDASGSFRKTIPSVKNALKRLTTYVSPEQYNENDPHLVKTGDSRTTDRVFVA 128 Query: 206 ANRKIDVLIE-SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 + + +D + + + + + Y + L+++ NE+ Sbjct: 129 SYQGLDQVRYFNNNDFSGNPTDTFTDPNT---TGKHYTYGN-------SGLTSDQNEIHK 178 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++ + T T PA+ + + K + N + + +TDG +G + Sbjct: 179 FIDNIAVDGGTPTVPAIQDTIAQYNSVKGNMENGRKT-----VFLLVTDGVANGYRLPGS 233 Query: 325 TLNTLQ 330 + Sbjct: 234 NTVVMD 239 >gi|225574493|ref|ZP_03783103.1| hypothetical protein RUMHYD_02570 [Blautia hydrogenotrophica DSM 10507] gi|225038280|gb|EEG48526.1| hypothetical protein RUMHYD_02570 [Blautia hydrogenotrophica DSM 10507] Length = 881 Score = 38.0 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 11/86 (12%), Positives = 32/86 (37%), Gaps = 7/86 (8%) Query: 315 ENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP-PEGQDLLRKC-TDSSGQFFAVND 372 E + Q + L+ ++ + IY + + D SSG+ + ++ Sbjct: 366 EEDSSWYSQFSQWALEDATVLKEQ-VPIYGIGYGSDMGNDTDSQEFIEALSSGEEYYIDT 424 Query: 373 ----SRELLESFDKITDKIQEQSVRI 394 +++ F + + ++V++ Sbjct: 425 RQEEMQDIGSIFKAVYSDLCWKAVKV 450 >gi|153003544|ref|YP_001377869.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Anaeromyxobacter sp. Fw109-5] gi|152027117|gb|ABS24885.1| Tfp pilus assembly protein tip-associated adhesin PilY1-like protein [Anaeromyxobacter sp. Fw109-5] Length = 1666 Score = 38.0 bits (86), Expect = 2.5, Method: Composition-based stats. Identities = 15/129 (11%), Positives = 36/129 (27%), Gaps = 18/129 (13%) Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 + + + ++ EK+ + + + + D + + Sbjct: 495 TGIRASATGGGSGIPQSLCRECTRFPAEKD----WLNNPIRVSWYLHNFDLRQNSENTED 550 Query: 324 NTLNTLQICEYMRNAGM-KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND-----SRELL 377 C M + IY++ + A + +L G F D +L Sbjct: 551 --------CAAMGGKQLLDIYTIGLGATGDAATMLESMAKMGGGLFKAADDAPTLRGKLD 602 Query: 378 ESFDKITDK 386 E+ I + Sbjct: 603 EALVAINQR 611 >gi|300313226|ref|YP_003777318.1| protein exporter of the Resistance-Nodulation-Cell Division (RND) superfamily protein [Herbaspirillum seropedicae SmR1] gi|300076011|gb|ADJ65410.1| protein exporter of the Resistance-Nodulation-Cell Division (RND) superfamily protein [Herbaspirillum seropedicae SmR1] Length = 755 Score = 38.0 bits (86), Expect = 2.6, Method: Composition-based stats. Identities = 11/41 (26%), Positives = 18/41 (43%), Gaps = 1/41 (2%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGC 41 +T + V L + +D A +Y R QM A A ++ Sbjct: 635 LTVATLPVMVLAVGIGVDYAFYIYSRLQMHLAAGEA-IAVA 674 >gi|156358479|ref|XP_001624546.1| predicted protein [Nematostella vectensis] gi|156211333|gb|EDO32446.1| predicted protein [Nematostella vectensis] Length = 688 Score = 38.0 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 15/150 (10%), Positives = 48/150 (32%), Gaps = 19/150 (12%) Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYREL 288 + + R+G + + ++ E++S L+++ T + L Sbjct: 49 TISSSAYRVGVVIFGSSAKVAFDFSKFSSSAEIESGLSEIKLIGGATAAGQGLTTCNTAL 108 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 +++ SS KK ++ + + + + + M+ +G+ I+ + + Sbjct: 109 FSKARSSA--------KKMLLVL---------IAGKSSDDVGVASSMKTSGISIFVLGMG 151 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + L + ++ Sbjct: 152 -KAIDKTQLNMMASQESYIQIAAEFSQVAS 180 >gi|260892685|ref|YP_003238782.1| hypothetical protein Adeg_0789 [Ammonifex degensii KC4] gi|260864826|gb|ACX51932.1| hypothetical protein Adeg_0789 [Ammonifex degensii KC4] Length = 209 Score = 38.0 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 6/28 (21%), Positives = 15/28 (53%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQM 29 A ++ V + +A+D I +++Q+ Sbjct: 15 FAFLLPVLVIVAAWALDFTRIRLVKDQL 42 >gi|92020123|dbj|BAE93252.1| type VI collagen alpha-3 chain [Sus scrofa] Length = 313 Score = 38.0 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 16/90 (17%), Positives = 34/90 (37%), Gaps = 10/90 (11%) Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 N+ + R+ + IT G+ + + + + G+K+++V V Sbjct: 16 LNQFVPEAGSRLEQRVPQIAFVITGGK--------SVEDAQEASLALTQRGVKVFAVGV- 66 Query: 349 APPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + K +S F V + +EL E Sbjct: 67 -RNIDSEEVGKIASNSATAFRVGNVQELSE 95 >gi|268532310|ref|XP_002631283.1| C. briggsae CBR-CLEC-60 protein [Caenorhabditis briggsae] gi|187036877|emb|CAP23542.1| CBR-CLEC-60 protein [Caenorhabditis briggsae AF16] Length = 408 Score = 38.0 bits (86), Expect = 2.7, Method: Composition-based stats. Identities = 19/145 (13%), Positives = 52/145 (35%), Gaps = 14/145 (9%) Query: 247 IVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 + L + + + + L++++ + + + A L + K + T ++ Sbjct: 101 ANLDTYQSLDDVYDGIFTALSQVSSSDESYIVHGLAQAEDILEDGKSN----KNRTHYQR 156 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL--LRKCTDSS 364 VI L+ + + + ++ AG+ I +VA +G L L+ Sbjct: 157 VVIVYASTYKGTG-----PLDPIPVADRLKTAGVTIVTVAYDQDGDGALLADLQLIATPP 211 Query: 365 GQFFAVNDSRELLESFDKITDKIQE 389 F +++ + +I + + Sbjct: 212 YNF---SNTDQAGNEIGEIQGALLQ 233 >gi|115358178|ref|YP_775316.1| TadE family protein [Burkholderia ambifaria AMMD] gi|170703648|ref|ZP_02894385.1| TadE family protein [Burkholderia ambifaria IOP40-10] gi|115283466|gb|ABI88982.1| TadE family protein [Burkholderia ambifaria AMMD] gi|170131447|gb|EDT00038.1| TadE family protein [Burkholderia ambifaria IOP40-10] Length = 142 Score = 38.0 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 15/138 (10%), Positives = 38/138 (27%), Gaps = 15/138 (10%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+++ + + ID++ ++ + + +A A +G V T + Sbjct: 19 FALMLPFLLMVLIGIIDVSLLLCDKAVITNASREAARAGVVLRVPMLTPTQ-------IA 71 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + L G + +T+ Y L Sbjct: 72 NVALSYTQNSLVSGGTGTV--------PTVAVTQANGTTAGTALTVTVTYTYSGLVLGTA 123 Query: 122 GLIPSALTNLSLRSTGII 139 + + +S S + Sbjct: 124 LSVLTGPITISASSVMLY 141 >gi|313680435|ref|YP_004058174.1| von willebrand factor type a [Oceanithermus profundus DSM 14977] gi|313153150|gb|ADR37001.1| von Willebrand factor type A [Oceanithermus profundus DSM 14977] Length = 747 Score = 38.0 bits (86), Expect = 2.8, Method: Composition-based stats. Identities = 19/132 (14%), Positives = 46/132 (34%), Gaps = 15/132 (11%) Query: 259 LNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 E ++RL+ L T + ++ ++++ ++DG+ Sbjct: 393 KREAETRLDALEARGGT----------QLATAYAAAAEALEPLDARTRWILVLSDGQLED 442 Query: 319 ASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + TL G+K ++A+ A + L R + G+F + D L + Sbjct: 443 -----DPQRTLARARQAAARGVKTLTLALGADADRPFLARLAREGGGRFLDLADPAALPQ 497 Query: 379 SFDKITDKIQEQ 390 + ++ + Sbjct: 498 VLALLGEEAFKP 509 >gi|302343421|ref|YP_003807950.1| TadE family protein [Desulfarculus baarsii DSM 2075] gi|301640034|gb|ADK85356.1| TadE family protein [Desulfarculus baarsii DSM 2075] Length = 138 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 17/133 (12%), Positives = 39/133 (29%), Gaps = 17/133 (12%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+ + V L I I+L Y + + +A G S Sbjct: 19 FALFLPVFLLVIFSIIELGAAWYQKQMLVNASREGARLGALFSTSGGLTAQE-------- 70 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 ++++ ++L + + + D + A YE P + F+ Sbjct: 71 --VQERVNQYLSDSGFPSQAVVQ-------AVGVDGASGDPVTVNVSADYEFPVLSAFIG 121 Query: 122 GLIPSALTNLSLR 134 + + + + Sbjct: 122 AVPGTISLSATTV 134 >gi|17537921|ref|NP_496259.1| C-type LECtin family member (clec-60) [Caenorhabditis elegans] gi|3881710|emb|CAA88985.1| C. elegans protein ZK666.6, confirmed by transcript evidence [Caenorhabditis elegans] Length = 406 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 26/211 (12%), Positives = 67/211 (31%), Gaps = 13/211 (6%) Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSV-RIGT 240 + W + + + + ++V+S + R+G Sbjct: 33 CGEDLGNLWLDVVAVVDNSIGMTNG--GLTSIAANIASVVSSGTRIGTNPSEPRTTRLGL 90 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 + YN +L++V + + +T+ + + + E+ Sbjct: 91 VTYNKAAAIQADLNQYQSLDDVYDNVFRALSSVSTSEESYLANGLARAEDVLEAGKQGYN 150 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL--LR 358 T ++ VI L+ + + E ++ +G+ + +VA +G L L Sbjct: 151 RTHYQRVVIVYASAYKGSG-----ALDPVPVAERLKTSGVTVITVAYDQDGDGALLADLA 205 Query: 359 KCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 K F ++ + + +I D + + Sbjct: 206 KIASPPYNF---TNTEDNGQVIGEIQDALLQ 233 >gi|53719502|ref|YP_108488.1| hypothetical protein BPSL1888 [Burkholderia pseudomallei K96243] gi|52209916|emb|CAH35888.1| putative membrane protein [Burkholderia pseudomallei K96243] Length = 568 Score = 38.0 bits (86), Expect = 2.9, Method: Composition-based stats. Identities = 16/103 (15%), Positives = 33/103 (32%), Gaps = 1/103 (0%) Query: 8 VCFLFITY-AIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKK 66 + A+D+ ++ ++R +Q D A L+G + + + S F Sbjct: 1 MLVAIAALGAVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARSNGFDP 60 Query: 67 QIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 + + R + A + N N +Q A Sbjct: 61 AAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 103 >gi|307825969|ref|ZP_07656183.1| YD repeat protein [Methylobacter tundripaludum SV96] gi|307732944|gb|EFO03807.1| YD repeat protein [Methylobacter tundripaludum SV96] Length = 651 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 24/194 (12%), Positives = 51/194 (26%), Gaps = 14/194 (7%) Query: 158 RSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESA 217 + +D + + SK A N Sbjct: 13 GGQDTGVTAVDHDPAGNLKTRTDARGKTANYSYDSLNRISKIAYDDQTVNYTWYNCTNGI 72 Query: 218 GNLVNSIQ--KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 +L + +I + RI + G +S++ N R N L+P Sbjct: 73 SHLCSLANNNSSINYSYDSHGRITQKSQKSQNTGATPLTVSHSYNAAGQRTNSLSPGG-- 130 Query: 276 NTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYM 335 + + + + T + + + DG+ +G + N N + Sbjct: 131 ---QNIEYQWT----GNRITAITSNGQPVISQISYEPDGQVNGWNWGNNQQNERFY--DL 181 Query: 336 RNAGMKIYSVAVSA 349 + + S+ A Sbjct: 182 SGRNIIV-SMGFDA 194 >gi|332027690|gb|EGI67758.1| 26S proteasome non-ATPase regulatory subunit 4 [Acromyrmex echinatior] Length = 389 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 16/164 (9%), Positives = 51/164 (31%), Gaps = 21/164 (12%) Query: 194 TTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCT 253 S Y ++ ++ + +S ++ E + Sbjct: 10 VDNSDYMRNGDFVPTRLQAQQDAVNLVCHSKTRSNPENN--------VGLITLANVEVLA 61 Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 L++++ + S+L+++ P + A+ L + + +H K I Sbjct: 62 TLTSDVGRILSKLHQVQPNGKLALITGIRIAHLALKHRQGKNH---------KMRIVAFI 112 Query: 314 GENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLL 357 G + +++ + ++ + + ++ D+L Sbjct: 113 GSP----IDIDEKELVKLAKRLKKEKVNVDVISFGEESINNDVL 152 >gi|163857684|ref|YP_001631982.1| hypothetical protein Bpet3372 [Bordetella petrii DSM 12804] gi|163261412|emb|CAP43714.1| putative membrane protein [Bordetella petrii] Length = 706 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 18/161 (11%), Positives = 43/161 (26%), Gaps = 9/161 (5%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 TA + + + + + + ++ Y++ ++Q A D A LS + D Sbjct: 22 TAFAVLIGLVML-LSAQIGYLFYMKRELQKAADLAALSAVQVLAPTGAASDCAAGSPVAV 80 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN--------PLQYIAESKAQYEI 113 + I + K D + Sbjct: 81 AAQTSAVANVPAFVDSIAAANVTVDCKFWDPARADSTGMHLFEPDAASGGRVNAVRVRID 140 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 T + +I + + + + ++ A S+ L Sbjct: 141 KTLTALIPSVIGNWMGGTEVSVVAVASNTAPTAAFSVESRL 181 >gi|325963892|ref|YP_004241798.1| hypothetical protein Asphe3_25360 [Arthrobacter phenanthrenivorans Sphe3] gi|323469979|gb|ADX73664.1| hypothetical protein Asphe3_25360 [Arthrobacter phenanthrenivorans Sphe3] Length = 148 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 16/126 (12%), Positives = 35/126 (27%), Gaps = 10/126 (7%) Query: 5 IISVCFLFITYA-------IDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK 57 ++ + ++T A I ++ + ++ S D A L+ S T Sbjct: 12 MMVMILGYVTLALLVATVVIGISSVYLEHKRLLSLADGASLAAADSYTLGEVDTQGGTPS 71 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + + + S G + T + + S A + Sbjct: 72 ATLNPARVRNVAADFIARSPASHRFGGLEVTG---ATGTPDGTTAVVVLSAAVHPPVVNF 128 Query: 118 LFLKGL 123 L G+ Sbjct: 129 LVPDGI 134 >gi|218460899|ref|ZP_03500990.1| von Willebrand factor type A [Rhizobium etli Kim 5] Length = 373 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 22/237 (9%), Positives = 57/237 (24%), Gaps = 14/237 (5%) Query: 76 SYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRS 135 RE + A ++ D + + A Y +L + + Sbjct: 149 DSSRERFANAAANPIKSVATDPVSTFSADVD-SASYSFVRRSLTGGAMPDPQSVRVEEMI 207 Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 V+ + + P + Sbjct: 208 NYFPYDWPRPENAEQPFKATVTVMPTPWNHDTELMHVAIKGYDIAPATAPHANLV----- 262 Query: 196 KSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPL 255 K+ +L S LVN ++ + + Y Sbjct: 263 FLIDVSGSMDEPDKLPLLKSSFRLLVNRLKADDT--------VAIVTYAGNAGTVLEPTR 314 Query: 256 SNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFIT 312 + +++ S +++L +T + AY + ++ + ++F + + Sbjct: 315 VSEKSKILSAIDRLEAGGSTGGAEGIEAAYDLAKRLSSRTASSGDARDGRRFYVCPS 371 >gi|168209268|ref|ZP_02634893.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens B str. ATCC 3626] gi|170712663|gb|EDT24845.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens B str. ATCC 3626] Length = 928 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 31/287 (10%), Positives = 67/287 (23%), Gaps = 59/287 (20%) Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNK----------------YLLPPPPKKSFWSK 192 + +LD S SM +L Sbjct: 141 DVVFLLDNSNSMLTNNRALKIKEEIKNVMDKLKAKNTRYALVTYASTILDGRHYHLMDGS 200 Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA---------- 242 +N+ + ++ N I + ++N S GT Sbjct: 201 IGNNQYTVYDLYTSNQCYLNFTSNIQDIYNKIPSTVPNQRNNSYAGGTFTQQGLLKAIEL 260 Query: 243 -YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL------YNEKESS 295 N L++ L L + N + Y + + ++ Sbjct: 261 LKNSNADEKIIIHLTDGLPTFSFLLKEF--GGNEKAIFDYNTQYNGIGVRGFGTSYFFNT 318 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI-CEYMRNAG--MKIYSVAVS---- 348 ++ V + + A N + E ++ + IY++ + Sbjct: 319 KTQKPYIYSREEVYSALNRSKNKYEAIWNNGFPTTLEAENIKKENPDINIYTIGIELKKR 378 Query: 349 ----------APPEG-------QDLLRKCTDSSGQFFAVNDSRELLE 378 EG + L + S + F + ++ E Sbjct: 379 VYKWTDNRQYYDAEGIVELPEIRKFLESISSSPAEAFLNENVDDIDE 425 >gi|219559725|ref|ZP_03538801.1| hypothetical protein MtubT1_21367 [Mycobacterium tuberculosis T17] Length = 75 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 11/47 (23%), Positives = 18/47 (38%), Gaps = 1/47 (2%) Query: 2 TAIIIS-VCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSD 47 A+ + V L + R++ Q+A D A L A + S Sbjct: 9 FAVAMIAVLLCVTGAGAYLGSAVVARHRAQAAADLASLVAAARLPSG 55 >gi|189465624|ref|ZP_03014409.1| hypothetical protein BACINT_01982 [Bacteroides intestinalis DSM 17393] gi|224539998|ref|ZP_03680537.1| hypothetical protein BACCELL_04910 [Bacteroides cellulosilyticus DSM 14838] gi|189437898|gb|EDV06883.1| hypothetical protein BACINT_01982 [Bacteroides intestinalis DSM 17393] gi|224518388|gb|EEF87493.1| hypothetical protein BACCELL_04910 [Bacteroides cellulosilyticus DSM 14838] Length = 348 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 24/168 (14%), Positives = 52/168 (30%), Gaps = 49/168 (29%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L +NP + T A++ A R + + + Sbjct: 142 TQLPITSDYISAKMFLESINPSLISKQGTAIGAAINLATRSFTPQ----------EGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 VI ITDGEN A + + +++ + V P Sbjct: 192 AVIVITDGENHEGGAVEAAKAAAEKG-------IQVSVLGVGMPDGAPIPVEGTNDFRRD 244 Query: 353 ----------GQDLLRKCTDSSGQFF-AVNDSRELLESFDKITDKIQE 389 + + ++ + + V++S + I +I + Sbjct: 245 RDGNVVVTRLNEQMCQEIAQAGDGIYVRVDNSNAAQKV---IAQEINK 289 >gi|168214300|ref|ZP_02639925.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens CPE str. F4969] gi|170714242|gb|EDT26424.1| von Willebrand factor type A/Cna B-type domain protein [Clostridium perfringens CPE str. F4969] Length = 928 Score = 38.0 bits (86), Expect = 3.0, Method: Composition-based stats. Identities = 31/287 (10%), Positives = 67/287 (23%), Gaps = 59/287 (20%) Query: 149 SICMVLDVSRSMEDLYLQKHNDNNNMTSNK----------------YLLPPPPKKSFWSK 192 + +LD S SM +L Sbjct: 141 DVVFLLDNSNSMLTNNRALKIKEEIKNVMDKLKAKNTRYALVTYASTILDGRHYHLMDGS 200 Query: 193 NTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIA---------- 242 +N+ + ++ N I + ++N S GT Sbjct: 201 IGNNQYTVYDLYTSNQCYLNFTSNIQDIYNKIPSTVPNQRNNSYAGGTFTQQGLLKAIEL 260 Query: 243 -YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL------YNEKESS 295 N L++ L L + N + Y + + ++ Sbjct: 261 LKNSNADEKIIIHLTDGLPTFSFLLKEF--GGNEKAIFDYNTQYNGIGVRGFGTSYFFNT 318 Query: 296 HNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI-CEYMRNAG--MKIYSVAVS---- 348 ++ V + + A N + E ++ + IY++ + Sbjct: 319 KTQKPYIYSREEVYSALNRSKNKYEAIWNNGFPTTLEAENIKKENPDINIYTIGIELKKR 378 Query: 349 ----------APPEG-------QDLLRKCTDSSGQFFAVNDSRELLE 378 EG + L + S + F + ++ E Sbjct: 379 VYKWTDNRQYYDAEGIVKLPEIRKFLESISSSPAEAFLNENVDDIDE 425 >gi|53723508|ref|YP_102950.1| hypothetical protein BMA1283 [Burkholderia mallei ATCC 23344] gi|52426931|gb|AAU47524.1| conserved hypothetical protein [Burkholderia mallei ATCC 23344] Length = 568 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 16/103 (15%), Positives = 33/103 (32%), Gaps = 1/103 (0%) Query: 8 VCFLFITY-AIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKK 66 + A+D+ ++ ++R +Q D A L+G + + + S F Sbjct: 1 MLVAIAALGAVDIGNVFFVRRDLQRVADMAALAGAQRMDDQCAQPNAAAAANARSNGFDP 60 Query: 67 QIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 + + R + A + N N +Q A Sbjct: 61 AAGGNTLALACGRWDTQSNAGPSYFNAAATPLNAVQVTATQSV 103 >gi|294012237|ref|YP_003545697.1| hypothetical protein SJA_C1-22510 [Sphingobium japonicum UT26S] gi|292675567|dbj|BAI97085.1| conserved hypothetical protein [Sphingobium japonicum UT26S] Length = 543 Score = 38.0 bits (86), Expect = 3.1, Method: Composition-based stats. Identities = 25/256 (9%), Positives = 72/256 (28%), Gaps = 7/256 (2%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+ + A D A + + ++Q+A D A++ + + +T + + Sbjct: 20 VALSLFGLIAAGGIAFDYARMAGLDTELQNAAD------QAALAAASQLDGTSTAITRAT 73 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + I + + + + + + + A + Sbjct: 74 SAAQTLIVNQTRFANDGNASGLSVTVPTLTFYSSYDQDADTGTVTTSATAAKFVQVSVGT 133 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 ALT + + + + V M + + + ++ L Sbjct: 134 REAVYALTPVVQLFRSGNLNAFATAGLGSAIC-KVPPLMLCIPSETFPSSADIGDGLRLQ 192 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 P P ++ + + + + ++A + NS + SV Sbjct: 193 PGPQTGAWVPGDYGYLDFGNGASGLAINLGANNQAANCMDNSGGLQTEPGNKASVTEALN 252 Query: 242 AYNIGIVGNQCTPLSN 257 G + + ++ Sbjct: 253 TRFDLDAGKKYSCSTS 268 >gi|225016930|ref|ZP_03706122.1| hypothetical protein CLOSTMETH_00843 [Clostridium methylpentosum DSM 5476] gi|224950324|gb|EEG31533.1| hypothetical protein CLOSTMETH_00843 [Clostridium methylpentosum DSM 5476] Length = 1895 Score = 37.6 bits (85), Expect = 3.1, Method: Composition-based stats. Identities = 24/247 (9%), Positives = 54/247 (21%), Gaps = 3/247 (1%) Query: 23 MYIRNQMQSALDAAVLSGCASIVSDRTIKD-PTTKKDQTSTIFKKQIKKHLKQGSYIREN 81 + +N Q+A+D A+ + +I I T + K + Sbjct: 1144 VQDKNAAQAAVDEAIAAIDTAIEDLSRIAQSGTEQDPYLINSVADLQKLARSVNAGASFK 1203 Query: 82 AGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIER 141 + A ++++ P+ A ++ + + S Sbjct: 1204 DEFVKLTADLDLSGIDWTPIGTTGAITAGVGFAGTFDGASHVVSNLSIADTASSATFGLF 1263 Query: 142 SSENLAISICMVLDVSRSMEDLY--LQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + + V DV + N S+ Sbjct: 1264 GMVSGTVKNLGVKDVQIKVGSGDCRAGGLVGTVNGGLIDNCYVVNANVDAGSRVAGGLAG 1323 Query: 200 APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNL 259 I G Q+ + Y G + + + N Sbjct: 1324 QNFNGTIKNSYVKNITVKGGRSAIFVSDNQDDNRNNKGTIDNCYADGSITSGNNGIVTNS 1383 Query: 260 NEVKSRL 266 + + Sbjct: 1384 KTIAAAA 1390 >gi|221197788|ref|ZP_03570834.1| membrane protein [Burkholderia multivorans CGD2M] gi|221181720|gb|EEE14121.1| membrane protein [Burkholderia multivorans CGD2M] Length = 634 Score = 37.6 bits (85), Expect = 3.1, Method: Composition-based stats. Identities = 19/173 (10%), Positives = 47/173 (27%), Gaps = 3/173 (1%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A I L + +ID+ ++ + R +Q D L+ S+ D T ++ Sbjct: 29 VAAIWIAVALIVLGSIDVGNLYFQRRDLQRVADMTALAAVQSVNDLCPQTDTTVTASGSN 88 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ---YEIPTENL 118 + + G + + ++ D Y + Q + Sbjct: 89 AVVTAAYRGAALNGFDAQASGNSMSIACGRWDVSDYGAAAGYFGTATNQLNAVRVVATKT 148 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 I T + + + ++ ++ M + + Sbjct: 149 VPLFFIGPPRTISAASTAKASNIDTFSIGTTLAMFGSNQDCAGNSVSADQRNT 201 >gi|467168|gb|AAA17326.1| L308_C1_175 [Mycobacterium leprae] Length = 141 Score = 37.6 bits (85), Expect = 3.1, Method: Composition-based stats. Identities = 9/35 (25%), Positives = 15/35 (42%) Query: 353 GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKI 387 Q L + C + GQ F D L + + + +I Sbjct: 30 NQTLQKICEITDGQVFHAYDLDSLKQVYSTLQRQI 64 >gi|327490425|gb|EGF22209.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1058] Length = 462 Score = 37.6 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 25/209 (11%), Positives = 61/209 (29%), Gaps = 51/209 (24%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN---KLNPYENTNTYPAMHHAY--- 285 N+SV + + + + + L N N + + +N LNP TN + + Sbjct: 246 GNISVNLVGFSTSAKYIQQNFSNLDNGTNTIIATINKRENLNPDGVTNPGDGLRYGMISL 305 Query: 286 -------------------------------RELYNEKESSHNTIGSTRLK-------KF 307 + + + T + ++ Sbjct: 306 QSQPAQLKYIVLLTDGIPNAYLVDSRALYAGNRVDLSQGAGRVTFNNPIYDLSPTLGYEY 365 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMK-IYSVAV-SAPP---EGQDLLRKCTD 362 + D + + +N++ G+K + + GQ L + + Sbjct: 366 IRLGYDLYSRDSITRENSIAYAGEVSKKFGLGVKRVNVIGFSGVNHEIAYGQSLTDRIGE 425 Query: 363 SS--GQFFAVNDSRELLESFDKITDKIQE 389 ++ + + L ++F I +IQ+ Sbjct: 426 GGMETKYVSATNEEALQKTFSDIKKQIQQ 454 >gi|223933146|ref|ZP_03625138.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis 89/1591] gi|330833239|ref|YP_004402064.1| LPXTG-motif cell wall anchor domain-containing protein [Streptococcus suis ST3] gi|223898207|gb|EEF64576.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis 89/1591] gi|329307462|gb|AEB81878.1| LPXTG-motif cell wall anchor domain protein [Streptococcus suis ST3] Length = 997 Score = 37.6 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 45/311 (14%), Positives = 93/311 (29%), Gaps = 31/311 (9%) Query: 113 IPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNN 172 + T L + + S ++++ + I + D + D + + Sbjct: 393 LRTRMLGYRSFSGFRSVTNATVSNSSVDQAHITQSKRIDYLGD-GGNNPDTSVDDRIGTD 451 Query: 173 NMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR-----KIDVLIESAGNLVNSI--- 224 + L K + + P N + + ++ Sbjct: 452 THELYRMYLDMTGTKEPFDTLIVVDRSTSMTDPMNSVDTQARYLAVYKALNGTAGRQGLL 511 Query: 225 -QKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS-RLNKLNP--YENTNTYPA 280 + +N +G Y G+Q + + N S L+ + P TN Sbjct: 512 SKLVGFHPENQVAIVGFQGYPGYPSGDQDSTVIANWGRSTSVALSNIQPPYNNGTNYTAG 571 Query: 281 MHHAYRELYNEKESSHNTIG---STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 + A + ++ +SS + S + F +G + N T N Sbjct: 572 LRTA-GVVLDQNQSSRKKVMIFISDGVPTFAFVNGVRYGNGTISGNNPYYTRDWTLNYFN 630 Query: 338 AGM------KIYSVAVSAPPEGQDLLRK--------CTDSSGQFFAVNDSRELLESFDKI 383 + + IY++ +S+ D L + + G + V DS+ L + KI Sbjct: 631 SWIGKYPKLPIYTLGISSEFGNSDNLSANPYVLNHMSSQTGGFYSHVADSQALERTLQKI 690 Query: 384 TDKIQEQSVRI 394 D + V I Sbjct: 691 VDDTKLSLVSI 701 >gi|260810653|ref|XP_002600071.1| hypothetical protein BRAFLDRAFT_79673 [Branchiostoma floridae] gi|229285356|gb|EEN56083.1| hypothetical protein BRAFLDRAFT_79673 [Branchiostoma floridae] Length = 1096 Score = 37.6 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 27/138 (19%), Positives = 51/138 (36%), Gaps = 22/138 (15%) Query: 258 NLNEVKSRLNKLNPYE--------NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 N + V+ + + TN A+ A + L + +G + + +I Sbjct: 223 NGDSVREEILNMTLDGLTNRTGKVGTNITRAVTLAVQILGPAVQDR--KLGDSTGPRQMI 280 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP-EGQDLLRKCTDSSGQFF 368 ITDG + + + + + G+ I ++A+ EG LL + + GQFF Sbjct: 281 LITDGRDRRLNNS-----VIFMLQNDTAKGVVIDTIALGDGAEEGLPLLSEV--TGGQFF 333 Query: 369 AVNDSR----ELLESFDK 382 DS L ++ Sbjct: 334 FSPDSDAGGSALDDALTA 351 >gi|156379442|ref|XP_001631466.1| predicted protein [Nematostella vectensis] gi|156218507|gb|EDO39403.1| predicted protein [Nematostella vectensis] Length = 223 Score = 37.6 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 13/139 (9%), Positives = 38/139 (27%), Gaps = 19/139 (13%) Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + Y+ + ++ ++ + +L+ Sbjct: 12 LVEYSSSASQVLSFRFTQKAADINREIDAIKFTGG--------ETRTDLFTNSAGGR--- 60 Query: 300 GSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 + +I +T+G S S + + ++ + + +V + +LL Sbjct: 61 --ENVPDVLIVMTNGRTSQGSLPY-----KDVMKPLKKKKVNVLAVGIGPDVNEAELLEI 113 Query: 360 CTDSSGQFFAVNDSRELLE 378 + V+D L Sbjct: 114 -AEERDHVIRVHDYEALST 131 >gi|238923487|ref|YP_002937003.1| hypothetical protein EUBREC_1107 [Eubacterium rectale ATCC 33656] gi|238875162|gb|ACR74869.1| Hypothetical protein EUBREC_1107 [Eubacterium rectale ATCC 33656] Length = 1082 Score = 37.6 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 30/229 (13%), Positives = 65/229 (28%), Gaps = 6/229 (2%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 II+ C L + +D+ + ++ +SA D A+ S +D Q Sbjct: 19 VIILVPCMLVASIFVDVGRVYLSKSMAESAADMALNSLMTHYDADLNDWYGMVASCQNID 78 Query: 63 IFKKQ----IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 F K LK + ++ + + I D + + Sbjct: 79 EFYDASIKCYKNALKSQNLSKDEMNTLVGEFSAMIGADSKASDYLRVTDDGDDSTTIKAV 138 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS-RSMEDLYLQKHNDNNNMTSN 177 L + + + +I + + S ++D+ N Sbjct: 139 DGANLANATMLKSQIVDFMKYRAPIAITQTAIDKIKNKSIPGIDDVLKSDENKPLVEKKQ 198 Query: 178 KYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQK 226 Y K S NT K + + + L+ + + + ++ Sbjct: 199 DYCKAD-EKLMRDSYNTYKYLFDNYSYGNPQPSNSLLTGTRDAMQTARE 246 >gi|32472885|ref|NP_865879.1| hypothetical protein RB4059 [Rhodopirellula baltica SH 1] gi|32444122|emb|CAD73564.1| hypothetical protein-transmembrane prediction [Rhodopirellula baltica SH 1] Length = 156 Score = 37.6 bits (85), Expect = 3.2, Method: Composition-based stats. Identities = 21/121 (17%), Positives = 35/121 (28%), Gaps = 5/121 (4%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI+ +V L I ++LA + RN Q DAA + +IV T + + + Sbjct: 35 FAIVCNVLLLTIFMCMELARMNMARNLAQ---DAAYYAARTAIVPGATADEAIAEAETIM 91 Query: 62 T--IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLF 119 + E ++ S A + N F Sbjct: 92 ESLFASGYDVECTPINDDTEEVTVTVSLSLDDVALFAPMFLGNLELTSSATMQTERYNGF 151 Query: 120 L 120 Sbjct: 152 F 152 >gi|254226123|ref|ZP_04919720.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae V51] gi|125621358|gb|EAZ49695.1| type I secretion target ggxgxdxxx repeat (2 copies) domain protein [Vibrio cholerae V51] Length = 1637 Score = 37.6 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 34/286 (11%), Positives = 79/286 (27%), Gaps = 14/286 (4%) Query: 47 DRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAE 106 I D ++ + T L + + + G + Sbjct: 881 QGGIADGSSISNSQDTTNNTSADGQLSLSNVSQLSMGIPTDSYTSDGVAISWTLSADKQT 940 Query: 107 SKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM-VLDVSRSMEDLYL 165 F + + + L + ++L I+I + + + ++ + Sbjct: 941 LTGSAGANKVVEFTLDNLGNVNSTLHAPIDHPNKAGEDSLVINIPLEAKNAAGAIGTGKV 1000 Query: 166 QKHNDNNNMTSNKYLLPPPPKKSFWSKN----TTKSKYAPAPAPANRKIDVLIESAGNLV 221 +++ + + P+ + + ++DV+ ESA L+ Sbjct: 1001 TVVIEDDAPVAKEVFHVAEPELRQGANVQLILDISGSMDTSAGNGKSRLDVMKESAKQLL 1060 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAM 281 Q Q K L V T + + + +N L+ T+ A+ Sbjct: 1061 EQYQAMGQTKVQLVVFHSEAEVKSQGNSVWMT-----VEQAINYINGLSTKGTTDYDHAI 1115 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 A + + T F++DGE ++ N Sbjct: 1116 ELA----EDNWSGLNTGGLLTGATNVSYFLSDGEPYDGDYVRSNGN 1157 >gi|299139640|ref|ZP_07032813.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] gi|298598264|gb|EFI54429.1| von Willebrand factor type A [Acidobacterium sp. MP5ACTX8] Length = 488 Score = 37.6 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 31/186 (16%), Positives = 63/186 (33%), Gaps = 18/186 (9%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVK 263 A + + L + + +Q+ + V I + + + + + + Sbjct: 121 AGGSTRFQSLKSAIAQFLAGMQE-----GSDRVAIVPFESHNVVSTIRSAVFTTHRADAL 175 Query: 264 SRLNKLNPYE---NTNTYPAMHHAYRELYNE-KESSHNTIGSTRLKKFVIFITDGENSGA 319 ++LN L NT Y A+ + E H L+ +I +TDG+N A Sbjct: 176 AQLNALPAPGPKNNTALYQAVFSGVDSMKGELASLQHEGATLAELQPHLIVMTDGKNEVA 235 Query: 320 SAYQN-----TLNTLQICEYMRNAGMKIYSVAVSA-PPEGQDLLRKCTDSSGQFFAVNDS 373 L Q ++ + + + L+K T +FF +D+ Sbjct: 236 PGDDPQLLNGDLGLQQAVAQVQTSNLDTIGIGFGDKNDIDAGALQKLTK---RFFYASDA 292 Query: 374 RELLES 379 +LL + Sbjct: 293 NQLLAA 298 >gi|168213186|ref|ZP_02638811.1| von Willebrand factor type A domain protein [Clostridium perfringens CPE str. F4969] gi|170715195|gb|EDT27377.1| von Willebrand factor type A domain protein [Clostridium perfringens CPE str. F4969] Length = 1349 Score = 37.6 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 34/363 (9%), Positives = 98/363 (26%), Gaps = 31/363 (8%) Query: 44 IVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY 103 + + +T+ + ++ + F + ++ + + N ++ Sbjct: 79 LDTSQTVNEKQGDIEKAANDFVNKFLDSDNYKNFKIGIISYNEKAEVVQELTMDKNKIKS 138 Query: 104 IAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 + I + + L + L + + + + + S Sbjct: 139 SIADSYKDAIKDKRSYNNWLNGNHLGTNVGDAFRLAISMLGKDSNPNKEKIVIFMSNGKP 198 Query: 164 YLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS 223 S K K K + E N +++ Sbjct: 199 NAYTGKFYKGHESTLTENQINSLKDIACKRLYDITSLNGNENELGKSSLKNEKIINKIDN 258 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 ++ ++L T N+ ++ + + LN Y+ TN ++ Sbjct: 259 YKEYFYNPESLDE----YRKQDFQKRGLRTNTDNDKSKFNNLI--LNEYKITNADSYINK 312 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 Y Y N G + + G +L++ + M++ G+ Y Sbjct: 313 IYGNPYCINVCPPNNQGGKNAICYYMGSEIGY----------AYSLKMAKEMKDLGISTY 362 Query: 344 SVAVSAPPEG----QDLLRKCT-----------DSSGQFFAVNDSRELLESFDKITDKIQ 388 + + ++ ++ + S ++ + + L + F + I+ Sbjct: 363 PIYFDTKSKDSKQRENTMKLISKFAGNNENDGVTSVNTVYSNDITASLNDVFTNLDKNIK 422 Query: 389 EQS 391 + Sbjct: 423 DSY 425 >gi|170725659|ref|YP_001759685.1| type IV pilin biogenesis protein [Shewanella woodyi ATCC 51908] gi|169811006|gb|ACA85590.1| type IV pilin biogenesis protein, putative [Shewanella woodyi ATCC 51908] Length = 1186 Score = 37.6 bits (85), Expect = 3.3, Method: Composition-based stats. Identities = 35/349 (10%), Positives = 85/349 (24%), Gaps = 30/349 (8%) Query: 56 KKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPT 115 +++ K H G+ + + + + A Y Sbjct: 166 QENDPINAKDKDGNAHGDGFPVDGMRQGNTPKPYSTTGGDTQLGLGEAVTLYTANYLRWH 225 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMT 175 L S + L + T I + N +I + L + Y + + Sbjct: 226 AAAVADALPTSPQSRLDIAKTAIETVINTNTSIDFGLALFNMNYPSEGYRDGGRIVSKIQ 285 Query: 176 ----SNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRK---IDVLIESAGNLVNSIQKAI 228 +NK L NT + + +S + K Sbjct: 286 KMSPANKTALLSTIDGIPADTNTPLCETLFEAYKYFSGKPVLYGKKDSDYSSWYDGNKPP 345 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE----VKSRLNKLNPYENTNTYPAMHHA 284 ++ S + + + +++ + + + L T A Sbjct: 346 RDTSAESGLNYSSPFRVCPDIAYVIYITDGVPTQDTFADTDITTLTASGVTEKADDKVVA 405 Query: 285 YRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYS 344 Y+ + S + + + + +K ++ Sbjct: 406 PD--YSVFSDPGLSSPSYLPALASYLYHNDLVTSPETLSDGSTREHM------QTVKTFT 457 Query: 345 VAVSAPPEG-QDLLRKCTDSSGQ----------FFAVNDSRELLESFDK 382 + S+ E + LL++ G +F L+ + + Sbjct: 458 IGFSSGAEDAEALLKETARRGGNKVDDNGVNTGYFQATGGLGLVAAMND 506 >gi|315181668|gb|ADT88581.1| Large exoprotein [Vibrio furnissii NCTC 11218] Length = 3149 Score = 37.6 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 19/169 (11%), Positives = 57/169 (33%), Gaps = 9/169 (5%) Query: 191 SKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 + N ++Y ++ ++I++ NL + I I + Sbjct: 2556 NDNGFSNRYQSQSQYNASRMKLVIDALTNLATDLVNHDGVIN-----INLIGFESSAHSA 2610 Query: 251 QCTPLSN-NLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 L++ NL ++ + + ++ TN A A ++ + + Sbjct: 2611 LTLQLTSDNLQQLLTEIQDMDAEGGTNYEAAFDLASNWFSHQPTEGYENLTYFLTDGDPT 2670 Query: 310 FITDGENSGAS--AYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDL 356 F G+N + Y++ + ++ + + + ++++ + L Sbjct: 2671 FSNSGDNGAGNQTDYRDMYDAIEAFKDLSSQSA-VHAIGIGKGITDTRL 2718 >gi|197692716|gb|ACH71068.1| envelope glycoprotein [Human immunodeficiency virus 1] Length = 516 Score = 37.6 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 51/173 (29%), Gaps = 3/173 (1%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 +N L K++ ++ +V++E+ N + + E+ + Sbjct: 43 REANTTLFCASDAKAYETEVHNVWATHACVPTDPSPQEVVLENVTENFNMWKNNMVEQMH 102 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 + V ++ N ++ S N T+ +N Sbjct: 103 EDIISLWDESLKPCVKLTPLCVTLNCTDITSAATNATANSNATSWTDGREEMRNCSFNIT 162 Query: 293 ESSHNTIGSTRL--KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + + K+ I TD + S + NT Q C + + I+ Sbjct: 163 TNVSDKKQKEYAIFYKYDIVSTDNGTTSYSTHCNTSIIRQACPKVSFQPIPIH 215 >gi|261193487|ref|XP_002623149.1| von Willebrand domain-containing protein [Ajellomyces dermatitidis SLH14081] gi|239588754|gb|EEQ71397.1| von Willebrand domain-containing protein [Ajellomyces dermatitidis SLH14081] Length = 940 Score = 37.6 bits (85), Expect = 3.4, Method: Composition-based stats. Identities = 32/325 (9%), Positives = 83/325 (25%), Gaps = 48/325 (14%) Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA 127 + ++ +K INIT D + P + + + Sbjct: 165 YANSSVDTGELSQSLASFVKKGAINITVDVSVDRGSTIRGLHSPTHPVAITLGRTSVAAQ 224 Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL---------QKHNDNNNMTSNK 178 ++ N VL V+ +D+ K Sbjct: 225 DLFEPNLASAAHTMQQGNAFFDTDFVLIVNAKDQDVPSAFVEKHPTIPNQRAVMATLVPK 284 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 + +P + + + + S I L + + S+ + V+ Sbjct: 285 FNIPNNNPEIVFIIDRSGSMTG--------NIKTLQSALRVFLKSLP--------VGVKF 328 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPY-ENTNTYPAMHHAYRELYNEKES 294 ++ + + + +K+ ++ + T L K + Sbjct: 329 NICSFGSRHSFMWNKSKTYDASSLKAALQYVDSIAADFGGT----------EMLEPVKAT 378 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 N + L ++ + + ++++S+ + + Q Sbjct: 379 VKNRLKDLDLDVLLLSDGEIWDQKTLFAYLNEVVS-------EQPIRLFSLGIGSGA-SQ 430 Query: 355 DLLRKCT-DSSGQFFAVNDSRELLE 378 L+ G V D+ +L + Sbjct: 431 SLIEGIARAGDGFAQFVGDNEQLDK 455 >gi|156404067|ref|XP_001640229.1| predicted protein [Nematostella vectensis] gi|156227362|gb|EDO48166.1| predicted protein [Nematostella vectensis] Length = 218 Score = 37.6 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 22/183 (12%), Positives = 57/183 (31%), Gaps = 13/183 (7%) Query: 211 DVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLN 270 D + I + V + + + IV S +++ S +++ Sbjct: 39 DNYKLAKTLTKEIISRFTISPDKTRVSLNFFSAHHVIVSKLSDNFS--KSKLMSLTDQMM 96 Query: 271 PYEN-TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 ++ + + + EL +K + +K + +TDG + T Sbjct: 97 YEKSFSKLATTLEAVHYELLVKKGGARPK--QKGVKMATVLVTDGYGTAG-----FEETS 149 Query: 330 QICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ---FFAVNDSRELLESFDKITDK 386 + M+ +++++V ++ + F D + D+I + Sbjct: 150 DEAKSMKKYNVEMFTVYPEKGRTYNKIMMHLASKPTKSHLFKLTKDGGARRKVVDQIVKQ 209 Query: 387 IQE 389 I Sbjct: 210 ICS 212 >gi|308472975|ref|XP_003098714.1| hypothetical protein CRE_04178 [Caenorhabditis remanei] gi|308268314|gb|EFP12267.1| hypothetical protein CRE_04178 [Caenorhabditis remanei] Length = 412 Score = 37.6 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 20/134 (14%), Positives = 38/134 (28%), Gaps = 6/134 (4%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES 294 + R+G + YN +L++ +R+ T + Sbjct: 98 TTRLGLVTYNSVATQMADLNQYQSLHDAFNRIFDDLSNTVDTTESYLSTGLTLAEKMFND 157 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 +K VI + + N I + ++ +G+KI +VA Sbjct: 158 QSVNSTRAHYQKVVIVYASKYQTNGES-----NPESIADRLKLSGVKIITVAYGNAYGLM 212 Query: 355 DLLRKCTDSSGQFF 368 L G F Sbjct: 213 KSLSIIAS-PGFAF 225 >gi|145551564|ref|XP_001461459.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124429293|emb|CAK94086.1| unnamed protein product [Paramecium tetraurelia] Length = 610 Score = 37.6 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 15/111 (13%), Positives = 33/111 (29%), Gaps = 20/111 (18%) Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEY 334 TN + R + + K + T ++DG + A C+ Sbjct: 204 TNIASGVALGLRMIRDRKFKNPVTSM--------FVLSDGVDDDRGADLR-------CQQ 248 Query: 335 M-RNAGM----KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + I + + + + + GQF ++ + + E F Sbjct: 249 ALHQYNIQDTLTINTFGYGSDHDAKVMNNIANLKGGQFVYIDQIQRVSEHF 299 >gi|227497171|ref|ZP_03927419.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434] gi|226833344|gb|EEH65727.1| conserved hypothetical protein [Actinomyces urogenitalis DSM 15434] Length = 1006 Score = 37.6 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 16/79 (20%), Positives = 32/79 (40%), Gaps = 17/79 (21%) Query: 323 QNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ---------------- 366 + + + R AG+ I + ++ L+R+ T+++G+ Sbjct: 219 SDLCRPTGVVDQTRAAGITILGIGLNDGSADFSLMRRITEAAGKDVDGLERCGDQTQPVG 278 Query: 367 -FFAVNDSRELLESFDKIT 384 F+ V+D LL +FD I Sbjct: 279 EFYPVSDIDSLLMAFDSIA 297 >gi|163751746|ref|ZP_02158964.1| hypothetical protein KT99_12254 [Shewanella benthica KT99] gi|161328398|gb|EDP99557.1| hypothetical protein KT99_12254 [Shewanella benthica KT99] Length = 447 Score = 37.6 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 10/64 (15%), Positives = 26/64 (40%), Gaps = 8/64 (12%) Query: 5 IISVCFLFITYAI--------DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTK 56 I V F +++ D H++ + ++Q+A+D++ L + + ++ D Sbjct: 27 AILVMFTIGLFSLLAVAALALDGGHLLLNKGRLQNAVDSSALYAAKILQNGGSLFDAREA 86 Query: 57 KDQT 60 Sbjct: 87 ATLI 90 >gi|1246302|gb|AAA93502.1| pilY1 [Pseudomonas aeruginosa PAO1] Length = 1161 Score = 37.6 bits (85), Expect = 3.5, Method: Composition-based stats. Identities = 10/41 (24%), Positives = 20/41 (48%) Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + DL +S G+FF+ + +L+ +F I ++I Sbjct: 490 NDSNNVYDLWHAAVNSRGEFFSADSPDQLVAAFQDILNRIS 530 >gi|327458562|gb|EGF04912.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1] Length = 463 Score = 37.6 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 30/343 (8%), Positives = 83/343 (24%), Gaps = 39/343 (11%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 + + + + D K I+ N Q ++ + + Sbjct: 134 EGAYQDNRLISYNLSGKYPDTNNKLSIDTAISALNTKQVFSKVAKGKKGIAIAYRTDPIQ 193 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 +S + ++ + + + + Sbjct: 194 GQMNIAVSFVFDTSGSMDWDLQGRNV---------NPNSGTESRMTILRKKAEIMIKDLK 244 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 + S +D ++ + +K + + + Sbjct: 245 GIGNISVNLVGFSSSGKYIQKEFSNLDNGADTIIGTIKDPKKLVPDG---------VTNP 295 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYE--NTNT-YPAMHHAYRELYNEKESSHNTIGS 301 + L + ++K + L NT P+ +A + + T + Sbjct: 296 GDGLRYGLISLQSQPAQLKYVV--LLTDGIPNTYIVDPSALYAGNRVDHGNAVGRVTFNN 353 Query: 302 TRLKKFVIFITD----GENSGASAYQNTLNTLQIC-EYMRNAGM---KIYSVAV-SAP-- 350 + G + + N + E + G+ ++ + Sbjct: 354 PIYDVSTNLGYEYRRLGYDLYSRDSVTRENAITYAGEVSKKFGVGVKRVNVIGFSGVDKE 413 Query: 351 -PEGQDLLRKCTDSSG---QFFAVNDSRELLESFDKITDKIQE 389 G+DL G + + + L ++F I +IQ+ Sbjct: 414 IAYGKDLTNSI-GKGGMETSYESATNEAALQKTFSDIKKQIQQ 455 >gi|56697084|ref|YP_167447.1| von Willebrand factor type A domain-containing protein [Ruegeria pomeroyi DSS-3] gi|56678821|gb|AAV95487.1| von Willebrand factor type A domain protein [Ruegeria pomeroyi DSS-3] Length = 565 Score = 37.6 bits (85), Expect = 3.6, Method: Composition-based stats. Identities = 29/195 (14%), Positives = 54/195 (27%), Gaps = 23/195 (11%) Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 + S KI++ E L+ + + + Sbjct: 21 DQPSAILVLDGSGSMWGQIEGKAKIEIAREVVTGLLADLPQDQPLGLTVYGHRRKGDCTD 80 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK 305 P N + S + L P T A+ A L +E + Sbjct: 81 IETLVAPAP--GNRAAIASAVAGLKPKGKTPMLEAVRQAAEALRYTEEKAT--------- 129 Query: 306 KFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM--KIYSVAVSAPPEGQ-DLLRKCTD 362 VI ++DG + + T L+ AG+ + V ++ D Sbjct: 130 --VILVSDGVETCDADPCATAAALEA------AGVDFTAHVVGFDIDDPQALAQMQCLAD 181 Query: 363 -SSGQFFAVNDSREL 376 + G F + ++ EL Sbjct: 182 QTGGTFRSAANADEL 196 >gi|317124468|ref|YP_004098580.1| hypothetical protein Intca_1337 [Intrasporangium calvum DSM 43043] gi|315588556|gb|ADU47853.1| hypothetical protein Intca_1337 [Intrasporangium calvum DSM 43043] Length = 571 Score = 37.6 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 21/213 (9%), Positives = 58/213 (27%), Gaps = 2/213 (0%) Query: 19 LAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYI 78 A ++ +++ A+DAA L + V D + + + + + Sbjct: 11 GARLLAAVTRLRDAVDAASLPVDVARVGDARTERERLLDQLDDYVIPRLTSLDAPLLAVV 70 Query: 79 RENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGI 138 + G ++ + + + + + + R TG Sbjct: 71 GGSTGAGKSTLVNSVVGREVSLPGVLRPTTRSPVLIHHPEDAGWFSDQRVLPRLARITGK 130 Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSK 198 S A+ + ++ + D+ + W TT ++ Sbjct: 131 HSSSDGPGAVRLVSTDAITPGLAL-LDAPDIDSVVAANRDLAAQLLAAADLWIFVTTAAR 189 Query: 199 YAPAPAPANRKIDVLI-ESAGNLVNSIQKAIQE 230 YA A + + +++ + + Sbjct: 190 YADAVPWDLLRQASERGTAVAVVLDRVPAESLD 222 >gi|148253063|ref|YP_001237648.1| hypothetical protein BBta_1525 [Bradyrhizobium sp. BTAi1] gi|146405236|gb|ABQ33742.1| putative membrane protein of unknown function [Bradyrhizobium sp. BTAi1] Length = 174 Score = 37.6 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 15/108 (13%), Positives = 32/108 (29%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+ I F+ + +DL MY Q+ +A++A +L + I + Sbjct: 39 FALFIPFLFILLVGTVDLGFAMYEAMQVSNAVEAGMLYAAKNGWDSAGITNSVLNASSVY 98 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA 109 + + A + T + + + A Sbjct: 99 PGGTPALTATPAPSQFCGCPQATGIAVATCSSTCPDGSAVSQYVQVNA 146 >gi|332364052|gb|EGJ41829.1| hypothetical protein HMPREF9380_0001 [Streptococcus sanguinis SK49] Length = 280 Score = 37.6 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 23/185 (12%), Positives = 53/185 (28%), Gaps = 14/185 (7%) Query: 146 LAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAP 205 I + ++ D S S +T+ + K Sbjct: 74 KPIDLVILQDASGSFRTTIPSVKRALKRLTTYVSPEQYDETNPYLVKTDDPRTTDRVFVA 133 Query: 206 ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSR 265 + + +D + N + + + Y + L+++ N+V S Sbjct: 134 SYQGLDQVRYFNNN--DFSGNPADTFTDPNTTGKHYTYGN-------SGLTSDQNKVHSF 184 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 ++ + T T PA+ + K + N + + +TDG +G + Sbjct: 185 IDNIAVDGGTPTVPAIQDTIAQYNRVKGNMENGRKT-----VFLLVTDGVANGYRLPGSN 239 Query: 326 LNTLQ 330 + Sbjct: 240 TVVMD 244 >gi|190891604|ref|YP_001978146.1| hypothetical protein RHECIAT_CH0002007 [Rhizobium etli CIAT 652] gi|190696883|gb|ACE90968.1| hypothetical conserved protein [Rhizobium etli CIAT 652] Length = 524 Score = 37.6 bits (85), Expect = 3.7, Method: Composition-based stats. Identities = 23/177 (12%), Positives = 52/177 (29%), Gaps = 20/177 (11%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE--VKSR 265 D L ++ L+ + + + + V N T N L + + + Sbjct: 359 NGEDQLQKAMRFLLTPDEASRVLVQWSPSDQIIVIPFDSSVRNMFTASGNPLEQEGLLNE 418 Query: 266 LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNT 325 +++ TN Y A +++ S ++ +TDG + S + Sbjct: 419 VSRQKADGGTNMYACAERALQQIARTDRLS-------TYLPAIVIMTDGRSDDQSQAFMS 471 Query: 326 LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFD 381 + I+ + + L +S + F +L +F Sbjct: 472 EWNTI------EPRVPIFGITFG--DADKTQLDTLAKQTSARVFD--GGSDLATAFR 518 >gi|322703565|gb|EFY95172.1| flotillin domain-containing protein [Metarhizium anisopliae ARSEF 23] Length = 512 Score = 37.6 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 21/172 (12%), Positives = 45/172 (26%), Gaps = 12/172 (6%) Query: 18 DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSY 77 D+ R Q A DAA A+ +++ D T + + + Q S Sbjct: 319 DVVQATIARESKQQAADAAAYEVTANARANQEANQRLADADAYKTRVGAEAENYAAQQSA 378 Query: 78 IRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTG 137 + + I+ D A ++ GL+ + Sbjct: 379 DASAFRQVKEAEGISAMAD------------AYTKLAGAFGGPAGLLQYMMIEKGTYVEL 426 Query: 138 IIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSF 189 + + + + + + + M + +LPP Sbjct: 427 AKANAEAIRGLQPKISVWNTGASGAGEAGSSDPTAAMRNVYQMLPPLMTTIN 478 >gi|150024242|ref|YP_001295068.1| BatB protein [Flavobacterium psychrophilum JIP02/86] gi|149770783|emb|CAL42248.1| BatB protein [Flavobacterium psychrophilum JIP02/86] Length = 346 Score = 37.6 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 20/103 (19%), Positives = 35/103 (33%), Gaps = 13/103 (12%) Query: 251 QCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 P++ + N K L + ++ + A + E S K +I Sbjct: 141 PVLPITTDYNVAKMFLQSMTSDMVSSQGSNLDEAIKLSAKYFEGSP------NTSKLMIM 194 Query: 311 ITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +TDGE+ A +I GMKI ++ + G Sbjct: 195 LTDGEDHSEGAESAAEEAKKI-------GMKIITIGIGTTAGG 230 >gi|118361109|ref|XP_001013785.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila] gi|89295552|gb|EAR93540.1| von Willebrand factor type A domain containing protein [Tetrahymena thermophila SB210] Length = 368 Score = 37.6 bits (85), Expect = 3.8, Method: Composition-based stats. Identities = 26/241 (10%), Positives = 63/241 (26%), Gaps = 29/241 (12%) Query: 139 IERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS- 197 S+ S ++D D + N N K + + + + S Sbjct: 141 ANTLSDVSDASKKTIVDQFTFSSDKSDAQAILNAISARNCIYGTITEKVAAFVIDISGSM 200 Query: 198 -KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLS 256 A ++ + + K Q+ + + + Sbjct: 201 DYTFKANGETISRLAFVKSQLTKTLAEQLKPYQKFNVIIFGNSASQWKTDYIDATP---- 256 Query: 257 NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGEN 316 N+ + +NKL TN + A+ + + + ++DG Sbjct: 257 ENIQAAIAYINKLTTNGATNISSGLDLAF--------------NTKQALNGIYLLSDGVP 302 Query: 317 SGASAYQNTLNTLQICEYM-RNAGMKIYSVAV---SAPPEGQ-----DLLRKCTDSSGQF 367 + + + + RN + I +++ + L D++ Sbjct: 303 NSGVQTVDGIKKYLADKNASRNEKVHINTISFIMGGTETQNDRNLSFQFLNAIADATNGS 362 Query: 368 F 368 F Sbjct: 363 F 363 >gi|156535029|gb|ABU79802.1| envelope glycoprotein [Human immunodeficiency virus 1] Length = 714 Score = 37.6 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 21/170 (12%), Positives = 48/170 (28%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 + L K++ ++ +V +++ N + + E+ + Sbjct: 9 KEATTTLFCASDAKAYDTEVHNVWATHACVPTDPNPQEVELKNVTENFNMWRNNMVEQMH 68 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 + V ++ N E+K+ N + T M + + Sbjct: 69 EDIISLWDQSLKPCVKLTPLCVTLNCTELKNATNTTSSSGGTMEGGEMKNCSFNITTNIR 128 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + K V+ I + S NT Q C + + I+ Sbjct: 129 DRMQKEYALFYKLDVVPIDNDNTSYRLISCNTSEITQACPKVTFEPIPIH 178 >gi|262171234|ref|ZP_06038912.1| hypothetical protein VII_002050 [Vibrio mimicus MB-451] gi|261892310|gb|EEY38296.1| hypothetical protein VII_002050 [Vibrio mimicus MB-451] Length = 406 Score = 37.6 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 20/148 (13%), Positives = 37/148 (25%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + AID+ H + R ++Q+A+DAA L+ + T T Sbjct: 20 LVTAAMVSLVILAAIAIDVTHQVVNRTKLQNAVDAAALAAAMVADATHDTATATDVAKST 79 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 T Y+ S A + + Sbjct: 80 LTSMHNASGNSELDVDSANFGIAYSNDPLLFPDGSFDATGDIYVRVSVADLSLTEFFMQA 139 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAI 148 G+ + I + I Sbjct: 140 LGMAKQVSAAAVAGPSSSINTIGNVVPI 167 >gi|290978641|ref|XP_002672044.1| predicted protein [Naegleria gruberi] gi|284085617|gb|EFC39300.1| predicted protein [Naegleria gruberi] Length = 530 Score = 37.6 bits (85), Expect = 3.9, Method: Composition-based stats. Identities = 18/220 (8%), Positives = 53/220 (24%), Gaps = 9/220 (4%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQ---------GSYIRENAG 83 LD L+ S + + ++ + + G+ + N Sbjct: 275 LDRHSLAAALSQNNINIQPNQQLQEVGNKKFSLSYSETDMSVENQKFLMGIGNALYSNQP 334 Query: 84 DIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSS 143 + + N + Q I ++ S+ + Sbjct: 335 VNMNTSGHTVLNPDANQQLLYNRTAHQNGIMAQSTPELQEFHQQALFNSIFAQNSEPGVF 394 Query: 144 ENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 + ++ S + + + + ++ + S + N AP+ Sbjct: 395 QQMSTSYPNNTPQQSANNNNMNGQSDFDDMLDSLLNEDVGDLSFDNVNNNPPPQSNAPSS 454 Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAY 243 + + E +L+ + + R + Sbjct: 455 PLSFFYKEEDKEQDQSLITDMDNFFLSGNDDHERNDDDQF 494 >gi|258621449|ref|ZP_05716483.1| hypothetical protein VMD_15290 [Vibrio mimicus VM573] gi|258586837|gb|EEW11552.1| hypothetical protein VMD_15290 [Vibrio mimicus VM573] Length = 406 Score = 37.6 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 19/148 (12%), Positives = 36/148 (24%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + AID+ H + R ++Q+A+DAA L+ + T T Sbjct: 20 LVTAAMVSLVILAAIAIDVTHQVVNRTKLQNAVDAAALAAAMVADATHDTATATDVAKST 79 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 Y+ S A + + Sbjct: 80 LASMHNASGNSELDVDSANFGIAYSNDPLLFPDGSFDATGDIYVRVSVADLSLTEFFMQA 139 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAI 148 G+ + I + I Sbjct: 140 LGMAKQVSAAAVAGPSSSINTIGNVVPI 167 >gi|160882769|ref|ZP_02063772.1| hypothetical protein BACOVA_00730 [Bacteroides ovatus ATCC 8483] gi|237720675|ref|ZP_04551156.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|260170238|ref|ZP_05756650.1| hypothetical protein BacD2_00060 [Bacteroides sp. D2] gi|293373991|ref|ZP_06620332.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|299145609|ref|ZP_07038677.1| BatB protein [Bacteroides sp. 3_1_23] gi|315918601|ref|ZP_07914841.1| conserved hypothetical protein [Bacteroides sp. D2] gi|156111793|gb|EDO13538.1| hypothetical protein BACOVA_00730 [Bacteroides ovatus ATCC 8483] gi|229449510|gb|EEO55301.1| conserved hypothetical protein [Bacteroides sp. 2_2_4] gi|292631067|gb|EFF49704.1| von Willebrand factor type A domain protein [Bacteroides ovatus SD CMC 3f] gi|298516100|gb|EFI39981.1| BatB protein [Bacteroides sp. 3_1_23] gi|313692476|gb|EFS29311.1| conserved hypothetical protein [Bacteroides sp. D2] Length = 342 Score = 37.6 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 22/169 (13%), Positives = 55/169 (32%), Gaps = 46/169 (27%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L ++P + T A++ A R + + + Sbjct: 142 TQLPITSDYISAKMFLESISPSLISKQGTAIGEAINLAARSFTPQ----------EGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 +I ITDGEN A + + +++ + V P Sbjct: 192 AIIVITDGENHEGGAVEAAKAAAEKG-------IQVSVLGVGMPDGAPIPVEGTNDYRRD 244 Query: 353 ----------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ G + V++S ++ ++ +K+ + Sbjct: 245 REGNVIVTRLNEAMCQEIAKEGKGIYVRVDNSNSAQKAINQEVNKMAKS 293 >gi|258627502|ref|ZP_05722283.1| hypothetical protein VMB_35840 [Vibrio mimicus VM603] gi|258580308|gb|EEW05276.1| hypothetical protein VMB_35840 [Vibrio mimicus VM603] Length = 406 Score = 37.6 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 20/148 (13%), Positives = 37/148 (25%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + AID+ H + R ++Q+A+DAA L+ + T T Sbjct: 20 LVTAAMVSLVILAAIAIDVTHQVVNRTKLQNAVDAAALAAAMVADATHDTATATDVAKST 79 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 T Y+ S A + + Sbjct: 80 LTSMHNASGNSELDVDSANFGIAYSNDPLLFPDGSFDATGDIYVRVSVADLSLTEFFMQA 139 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAI 148 G+ + I + I Sbjct: 140 LGMAKQVSAAAVAGPSSSINTIGNVVPI 167 >gi|221204654|ref|ZP_03577671.1| membrane protein [Burkholderia multivorans CGD2] gi|221175511|gb|EEE07941.1| membrane protein [Burkholderia multivorans CGD2] Length = 609 Score = 37.6 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 19/173 (10%), Positives = 47/173 (27%), Gaps = 3/173 (1%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A I L + +ID+ ++ + R +Q D L+ S+ D T ++ Sbjct: 4 VAAIWIAVALIVLGSIDVGNLYFQRRDLQRVADMTALAAVQSVNDLCPQTDTTVTASGSN 63 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ---YEIPTENL 118 + + G + + ++ D Y + Q + Sbjct: 64 AVVTAAYRGAALNGFDAQASGNSMSIACGRWDVSDYGAAAGYFGTATNQLNAVRVVATKT 123 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 I T + + + ++ ++ M + + Sbjct: 124 VPLFFIGPPRTISAASTAKASNIDTFSIGTTLAMFGSNQDCAGNSVSADQRNT 176 >gi|17533681|ref|NP_496741.1| C-type LECtin family member (clec-62) [Caenorhabditis elegans] gi|3876681|emb|CAB03055.1| C. elegans protein F35C5.5a, confirmed by transcript evidence [Caenorhabditis elegans] Length = 389 Score = 37.6 bits (85), Expect = 4.0, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 51/161 (31%), Gaps = 9/161 (5%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYR 286 + R+G + YN + +++ S + ++ P + +T+ + Sbjct: 70 TNYIDKRTTRVGLVTYNTEATIQADLNRFQSPDDLFSTVFQILPNDLSTSEDVFLAKGIG 129 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + K+ VI N + I E ++ +G+ I +VA Sbjct: 130 AAEQLLAAGRKNNTRKNYKQMVIVYASAYNDEG-----EEDPRPIAERLKASGVSIATVA 184 Query: 347 VSAPPEGQ--DLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + + L+ + + G F D + E + Sbjct: 185 FDQTGDEEMIKLIGEIA-TPGFNFTNEDENLVKEIQTAMIQ 224 >gi|66812216|ref|XP_640287.1| type A von Willebrand factor domain-containing protein [Dictyostelium discoideum AX4] gi|60468302|gb|EAL66310.1| type A von Willebrand factor domain-containing protein [Dictyostelium discoideum AX4] Length = 698 Score = 37.6 bits (85), Expect = 4.1, Method: Composition-based stats. Identities = 20/137 (14%), Positives = 45/137 (32%), Gaps = 17/137 (12%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 P+S N + ++ L +T A+ + I +I TD Sbjct: 352 PVSKTKNSLLEKVKTLRGSGSTALGSAVA------------ACVGITGGSRGSQIIICTD 399 Query: 314 GENSGASAYQNTLNTLQI-----CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFF 368 G+++ Q+ + +G I + + +++ K +++G Sbjct: 400 GQSNMGLGATEAPEKQQMVFEQIANMAQRSGTSISVLTIKGTDTKLEVIGKLANNTGGEV 459 Query: 369 AVNDSRELLESFDKITD 385 + D L + F +I Sbjct: 460 NIMDPINLNDGFKEILS 476 >gi|107099783|ref|ZP_01363701.1| hypothetical protein PaerPA_01000801 [Pseudomonas aeruginosa PACS2] gi|218893649|ref|YP_002442518.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa LESB58] gi|218773877|emb|CAW29691.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa LESB58] Length = 1163 Score = 37.2 bits (84), Expect = 4.1, Method: Composition-based stats. Identities = 10/38 (26%), Positives = 19/38 (50%) Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 DL +S G+FF+ + +L+ +F I ++I Sbjct: 455 NNVYDLWHAAVNSRGEFFSADSPDQLVAAFQDILNRIS 492 >gi|324508820|gb|ADY43721.1| C-type lectin protein 160 [Ascaris suum] Length = 534 Score = 37.2 bits (84), Expect = 4.1, Method: Composition-based stats. Identities = 23/160 (14%), Positives = 49/160 (30%), Gaps = 23/160 (14%) Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT--NTYPAMHHAYRELYNE 291 VR+G + ++ N + N + RL ++ + N A+ A L + Sbjct: 228 QFVRVGLVTFSNQAFVNGNLDDFTSYNSLVKRLFQMPYLGGSELNIESALQSASDILQS- 286 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV--SA 349 S + I + + A + I ++ +G KI +VA Sbjct: 287 ---------SRYYARTAILLY----TSAYGEGGFTDPKAIANQIKESGTKIITVAFRQQP 333 Query: 350 PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQE 389 + L F + + + +I + + Sbjct: 334 EGSLVEKLSHLASPG--FSFASRQSIITD---EILRALCQ 368 >gi|288925757|ref|ZP_06419688.1| BatB protein [Prevotella buccae D17] gi|288337412|gb|EFC75767.1| BatB protein [Prevotella buccae D17] Length = 342 Score = 37.2 bits (84), Expect = 4.1, Method: Composition-based stats. Identities = 31/222 (13%), Positives = 71/222 (31%), Gaps = 60/222 (27%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++D +LV+ + +IG I + P++++ K L Sbjct: 110 SRLDKSKLLVESLVDRFT---------NDKIGLIVFAGDAYVQ--LPITSDYVSAKMFLQ 158 Query: 268 KLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++P + T+ A++ ++ + +I ITDGE+ A + Sbjct: 159 NIDPSLIQTQGTDIAQAINLGLHSFTQA----------DKIGRAIIVITDGEDHEGGAVE 208 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSA---------------PPEGQDLL--------RKC 360 R G+ ++ + V GQ ++ R+ Sbjct: 209 AAAEA-------RKKGVNVFILGVGDTKGAPIPTGDGGYMKDRSGQTVMTALNEQMCREV 261 Query: 361 TDSS-GQFFAVNDS----RELLESFDKITDKIQEQSVRIAPN 397 + G++ V+++ EL ++ E + A + Sbjct: 262 AQAGSGKYIHVDNTGDAQTELNNDLARLQRGESESVIYNAYD 303 >gi|262165918|ref|ZP_06033655.1| hypothetical protein VMA_002367 [Vibrio mimicus VM223] gi|262025634|gb|EEY44302.1| hypothetical protein VMA_002367 [Vibrio mimicus VM223] Length = 406 Score = 37.2 bits (84), Expect = 4.1, Method: Composition-based stats. Identities = 19/148 (12%), Positives = 36/148 (24%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + + + AID+ H + R ++Q+A+DAA L+ + T T Sbjct: 20 LVTAAMVSLVILAAIAIDVTHQVVNRTKLQNAVDAAALAAAMVADATHDTATATDVAKST 79 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 Y+ S A + + Sbjct: 80 LASMHNASGNSELDVDSANFGIAYSNDPLLFPDGSFDATGDIYVRVSVADLSLTEFFMQA 139 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAI 148 G+ + I + I Sbjct: 140 LGMAKQVSAAAVAGPSSSINTIGNVVPI 167 >gi|17533679|ref|NP_496742.1| C-type LECtin family member (clec-62) [Caenorhabditis elegans] gi|3876680|emb|CAB03054.1| C. elegans protein F35C5.5b, confirmed by transcript evidence [Caenorhabditis elegans] Length = 387 Score = 37.2 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 22/161 (13%), Positives = 51/161 (31%), Gaps = 9/161 (5%) Query: 228 IQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYE-NTNTYPAMHHAYR 286 + R+G + YN + +++ S + ++ P + +T+ + Sbjct: 70 TNYIDKRTTRVGLVTYNTEATIQADLNRFQSPDDLFSTVFQILPNDLSTSEDVFLAKGIG 129 Query: 287 ELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVA 346 + K+ VI N + I E ++ +G+ I +VA Sbjct: 130 AAEQLLAAGRKNNTRKNYKQMVIVYASAYNDEG-----EEDPRPIAERLKASGVSIATVA 184 Query: 347 VSAPPEGQ--DLLRKCTDSSGQFFAVNDSRELLESFDKITD 385 + + L+ + + G F D + E + Sbjct: 185 FDQTGDEEMIKLIGEIA-TPGFNFTNEDENLVKEIQTAMIQ 224 >gi|70733640|ref|YP_257280.1| calcium-binding outer membrane-like protein [Pseudomonas fluorescens Pf-5] gi|68347939|gb|AAY95545.1| calcium-binding outer membrane-like protein [Pseudomonas fluorescens Pf-5] Length = 4920 Score = 37.2 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 37/273 (13%), Positives = 84/273 (30%), Gaps = 14/273 (5%) Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIP 125 +I L + I+ ++ D + + + NL + P Sbjct: 4277 SKITTGLTDTDGSETLSVKISGIPAGSVLSDGAGHTFTATATSGEANVTGWNLGNLTVTP 4336 Query: 126 ----SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + NL++ ST + V + +D T ++ Sbjct: 4337 PPYYNGTFNLNVTSTSTESLGGSASTSAQIPVKVYPAVYNSVVATSGDDTVTGTDGNDIV 4396 Query: 182 PPPPKKSFWSKNTTKS---KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 T + + + ++ I S ++ N+++ +I + +V I Sbjct: 4397 VADIGGLTVVPGTNYNIAFMVDSSGSMSSASITAAKNSLTSVFNTLKNSIGGSSSGTVNI 4456 Query: 239 GTIAYNIGIVGNQCTPLS--NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 ++ + + L+ N L ++K+ L+ + TN + Sbjct: 4457 FLADFDTQVNKSVSVNLNDPNALAQLKAVLDSMTSGGGTNYEDVFKATANFFQSS----- 4511 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 G++ FITDG+ + + + T TL Sbjct: 4512 QATGNSNATNLTYFITDGKPTYYQSGEQTNPTL 4544 >gi|327463764|gb|EGF10080.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1057] Length = 462 Score = 37.2 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 28/271 (10%), Positives = 78/271 (28%), Gaps = 55/271 (20%) Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 + + + + S N ++D+L + + ++ + + Sbjct: 188 RTDPIQGQMNIAVSFVFDTSGSMNWDLQGRNVEKTGNESRMDILRKKSVIMIKDLAEI-- 245 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN---KLNPYENTNTYPAMHHAYR 286 N+SV + + + + + L N N + + + LNP TN + + Sbjct: 246 --GNVSVNLVGFSTSAKYIQQNFSNLDNGTNTIIATITKPENLNPDGVTNPGDGLRYGMI 303 Query: 287 ELYNEKES-------SHNTIGSTRLKKFVIFITD-------------------------- 313 L ++ + + + ++ + Sbjct: 304 SLQSQPAQLKYIVLLTDGIPNAYLVDSRALYAGNRVDLSQGAGRVTFNNPIYDLSPTLGY 363 Query: 314 -----GENSGASAYQNTLNTLQICEYMRNA---GMK-IYSVAV-SAPP---EGQDLLRKC 360 G + + N++ + G+K + + GQ L + Sbjct: 364 EYSRLGYDLYSRDSITRENSIAYAGEVSKKFGLGIKRVNVIGFSGVDHEIAYGQSLTDRI 423 Query: 361 TDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + ++ + + L ++F I +IQ+ Sbjct: 424 GEGGMETKYVSATNEEALQKTFSDIKKQIQQ 454 >gi|296328983|ref|ZP_06871490.1| iron/zinc/copper-binding protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] gi|296153876|gb|EFG94687.1| iron/zinc/copper-binding protein [Fusobacterium nucleatum subsp. nucleatum ATCC 23726] Length = 310 Score = 37.2 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 19/175 (10%), Positives = 54/175 (30%), Gaps = 23/175 (13%) Query: 220 LVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP 279 + + + I + N +++ + K Sbjct: 136 FFSDYSNGKVNPYIWTGSKNLVRMVNIIGRDLIRLYPQNKAKIEKNITKFTAD------- 188 Query: 280 AMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG 339 + E + + + + + F+ D +++N I + +++ G Sbjct: 189 -LLKIENEANEKLLAVGDAEVISLSENLQYFLNDMNIYTEYVDYDSVNAQNIAKLIKDKG 247 Query: 340 MKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN------------DSRELLESFDK 382 +K+ VS +D ++ ++ G+F +N D +L++F + Sbjct: 248 IKVI---VSDRWLKKDAIKALKEAGGEFVVINTLDIPMDKDGKMDPEAILKAFKE 299 >gi|189189388|ref|XP_001931033.1| alpha-L-arabinofuranosidase precursor [Pyrenophora tritici-repentis Pt-1C-BFP] gi|187972639|gb|EDU40138.1| alpha-L-arabinofuranosidase precursor [Pyrenophora tritici-repentis Pt-1C-BFP] Length = 441 Score = 37.2 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 32/258 (12%), Positives = 77/258 (29%), Gaps = 9/258 (3%) Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYL 180 PS +L + I+ ++ + + S S + + Sbjct: 1 MRFTPSD-LSLPAAAVAILASTASAQSCPLPSTYKWSSSQALAQPKSGWASL-KDFTTQQ 58 Query: 181 LPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGT 240 + + + T SKY + L + N N A + Sbjct: 59 YNGKHL-VYATYHDTGSKYGSMNFGTVSSLSQLSSAPQNAQNFNAVAPTLLFFAPKNVWV 117 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKS--RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNT 298 + Y G + + + V S L + T++ + + + ++ + Sbjct: 118 LCYQWGPTTFTYRTST-DPSNVNSWGAAQPLFSGKITDSSTGAID--QTVIGDSQNMYLF 174 Query: 299 IGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLR 358 K + + G G +T+ + + A +++Y+V + + ++ Sbjct: 175 FAGDNGKIYRASMPLGNFPGNFGTTSTVVMSDTAQNLFEA-VQVYTVKGGSGNQKYLMIV 233 Query: 359 KCTDSSGQFFAVNDSREL 376 + S G+FF + L Sbjct: 234 EAQGSGGRFFRSFTASSL 251 >gi|72008858|ref|XP_787086.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] gi|115969503|ref|XP_001184155.1| PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] Length = 429 Score = 37.2 bits (84), Expect = 4.2, Method: Composition-based stats. Identities = 23/180 (12%), Positives = 58/180 (32%), Gaps = 14/180 (7%) Query: 206 ANRKIDVLIESAGNLVNSIQKAIQ-EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 K+ E+ +++++ S + N + Q + ++ + Sbjct: 176 YGTKLSQTKEALKTMLDNLNPTDYFNIITFSDGVQYWRENNRLAPAQRRYM----DDAMA 231 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 ++ L TN A+ A L +E + + +I +TDG S + Q Sbjct: 232 YVDSLRDDSETNLNEAIVKAGELLDSE---ARYNRPGDSVYSMMILLTDGRPSVGTTDQQ 288 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAV----NDSRELLESF 380 + + I + + L++ +++G + + +L E + Sbjct: 289 EILDNAREVIAGKHSLNI--LGFGRLVDFDLLVKLAYENNGTAKMIYEGTTAAEQLREFY 346 >gi|15965603|ref|NP_385956.1| hypothetical protein SMc00158 [Sinorhizobium meliloti 1021] gi|15074784|emb|CAC46429.1| Hypothetical/unknown protein [Sinorhizobium meliloti 1021] Length = 577 Score = 37.2 bits (84), Expect = 4.3, Method: Composition-based stats. Identities = 6/30 (20%), Positives = 15/30 (50%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQS 31 A+++ + + ID ++ R ++QS Sbjct: 23 AALVMPLVVASMGLGIDYGYLTLQRRELQS 52 >gi|257868524|ref|ZP_05648177.1| predicted protein [Enterococcus gallinarum EG2] gi|257802688|gb|EEV31510.1| predicted protein [Enterococcus gallinarum EG2] Length = 831 Score = 37.2 bits (84), Expect = 4.3, Method: Composition-based stats. Identities = 36/290 (12%), Positives = 81/290 (27%), Gaps = 24/290 (8%) Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 ++ + ++ L S + +S AI D S ++ N Sbjct: 312 LPSMATEFIVVKETVVLPANSGQLFMGASNLRAIEGANKFDTSSVTHMNWMFG----NAS 367 Query: 175 TSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI-ESAGNLVNSIQKAIQEKKN 233 + L S + + A D + L N +Q + Sbjct: 368 SLESLDLNQWDTSSVIDMTAMFNGTSSLKELAIENFDTSKVTNIDYLQNFLQHTAESHSG 427 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKE 293 ++ G + Q L + E + + +Y +A K Sbjct: 428 TFMQKGLM-------EGQRLLLEQSRPEAEKMFIHIGDDSANRSYLPAENATVYPNTGKI 480 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQN-----------TLNTLQICEYMRNAGMKI 342 + +N ++ K ++ + + T TL ++ + + Sbjct: 481 NDYNGYHTSSYTKAFQTANPQYHTTSKTSTDSLAVPVSSQVVTDETLGTIVDIKESNFTV 540 Query: 343 YSVAVSAPPEGQDLLRKCTDSSGQFFAV-NDSRELLESFDKITDKIQEQS 391 YSVA + G+ + R + + + L + +I + I + Sbjct: 541 YSVATAPSARGEYIARNLATAPANYLTTDENLSGLGSALKEIANHIDKTI 590 >gi|116625580|ref|YP_827736.1| hypothetical protein Acid_6529 [Candidatus Solibacter usitatus Ellin6076] gi|116228742|gb|ABJ87451.1| hypothetical protein Acid_6529 [Candidatus Solibacter usitatus Ellin6076] Length = 376 Score = 37.2 bits (84), Expect = 4.3, Method: Composition-based stats. Identities = 7/34 (20%), Positives = 15/34 (44%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDA 35 A+ + A+D+ I +N++Q D+ Sbjct: 16 MALSTVAVLGVVGLAVDVGRIFIAKNEVQVYCDS 49 >gi|291450184|ref|ZP_06589574.1| von Willebrand factor [Streptomyces albus J1074] gi|291353133|gb|EFE80035.1| von Willebrand factor [Streptomyces albus J1074] Length = 600 Score = 37.2 bits (84), Expect = 4.3, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 41/118 (34%), Gaps = 12/118 (10%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T Y AYR + S ++ +TDG + + + + Sbjct: 487 PDGATGLYDTTLAAYRAAQDSYVSGKF--------NALVILTDG-VNEDPGSISRSSLVA 537 Query: 331 ICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE-SFDKITD 385 E +R+ + + ++AV + +++ + G V+D ++ I + Sbjct: 538 ELEGLRDPDRPVPVIAIAVGPAADKEEVKEIAEATGGSGHQVSDPAQIHAVILKAIME 595 >gi|17570773|ref|NP_508098.1| hypothetical protein ZK1193.2 [Caenorhabditis elegans] gi|1118062|gb|AAA83291.1| Hypothetical protein ZK1193.2 [Caenorhabditis elegans] Length = 1250 Score = 37.2 bits (84), Expect = 4.3, Method: Composition-based stats. Identities = 20/190 (10%), Positives = 54/190 (28%), Gaps = 11/190 (5%) Query: 202 APAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 + + ++ A N + + + RI Y+ + + Sbjct: 1063 SSGSISDEMYYATVGAVNTIGNAISIGHDHS----RILLGTYDAISHFSGDLNTLDTFEA 1118 Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 +++L L T + + + +S + K ++ + G + G + Sbjct: 1119 YQNKLADLFSLGYTGINGNNIQSVMDYIVLQNNSAPFRPAPVRKFLMLLSSQGWDKGNVS 1178 Query: 322 YQNT---LNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE 378 + + G++ +++ + L VND EL Sbjct: 1179 EGKENGFSDPAPAARNLHKLGLETFAIGLGTSAN-MTQLNAIAK---CSTQVNDQNELTS 1234 Query: 379 SFDKITDKIQ 388 + +I + Sbjct: 1235 TISQIISLLC 1244 >gi|322437225|ref|YP_004219437.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] gi|321164952|gb|ADW70657.1| Protein of unknown function DUF2134, membrane [Acidobacterium sp. MP5ACTX9] Length = 528 Score = 37.2 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 24/206 (11%), Positives = 55/206 (26%), Gaps = 7/206 (3%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVS-DRTIKDPTTKKDQ 59 + A + F+ A+D+ + Y + +Q A DAA L+ + D T+ + Sbjct: 86 IAAFGMVAILGFLALAVDVGQLRYQKRGLQKAADAAALASVLEMSYCDGTLACGVMQTAA 145 Query: 60 TSTIFKKQIK-KHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENL 118 S + + + ++ + G K ++P S + + Sbjct: 146 KSAMVENGLTPDNIVTACGTVPSTGLTLMINHGPCAKGASDPNYGKTSSVEVLVMQAQPT 205 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 ++ + + R+ + LD + S + + Sbjct: 206 IFAKVLGLSTGTVGARAEAS-----TTGGTNCIFALDPTGSGALTVQGLASITSPCGIMV 260 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPA 204 T S Sbjct: 261 ESSSNSALSCAVLGVITASAINVVGG 286 >gi|317500911|ref|ZP_07959122.1| hypothetical protein HMPREF1026_01065 [Lachnospiraceae bacterium 8_1_57FAA] gi|316897709|gb|EFV19769.1| hypothetical protein HMPREF1026_01065 [Lachnospiraceae bacterium 8_1_57FAA] Length = 1536 Score = 37.2 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 29/336 (8%), Positives = 88/336 (26%), Gaps = 21/336 (6%) Query: 2 TAIIISVCFLFITY---AIDLAHIMYIRNQMQSALDA-----AVLSGCASIVSDRTIKDP 53 A++++ + I + +M + A G + S Sbjct: 7 AALVMAAILGVTGFSVPGIGVVQAEEAHTEMSGEAEQQEINLAADRGSTARASKFLAASG 66 Query: 54 TTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEI 113 + F + + + ++ D Q + + ++E K + Sbjct: 67 NLPAREPGLAFDGISDNNGEADNSRWQSGEDAEFSEQ--WLEVDLGGICVVSEIKVDFFA 124 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND--- 170 F + S + + E +++ +DV + + + ++ Sbjct: 125 RLYGDFRVEVSDSNAEDAVWTTIA-TADMPEGTDLNLKKTVDVKENGKAREIPRYIRLYF 183 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN--SIQKAI 228 + + + ++S Y + + A+ Sbjct: 184 TSGNSQAANRSIGVREFQVIGTKKSESGYETITGNIALNKTASASGVEAAMPNLTANLAV 243 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 +K+ + R G NQ ++++ + + T+ + + Sbjct: 244 DGQKSDTSRWSAPTMKNGTSPNQQQTPQWLEIDLRNEVTNI-----TSIDLYFYKLVYSI 298 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 E ++ + + K V E + + + Sbjct: 299 DYEIQTRADKKSEWKTVKHVTCQPGNEQNKHDSITD 334 >gi|159139286|gb|ABW89597.1| putative type IV pili-associated adhesin precursor [Pseudomonas aeruginosa] Length = 1163 Score = 37.2 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 10/38 (26%), Positives = 19/38 (50%) Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 DL +S G+FF+ + +L+ +F I ++I Sbjct: 455 NNVYDLWHAAVNSRGEFFSADSPDQLVAAFQDILNRIS 492 >gi|15599750|ref|NP_253244.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa PAO1] gi|9950799|gb|AAG07942.1|AE004869_6 type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa PAO1] Length = 1161 Score = 37.2 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 10/41 (24%), Positives = 20/41 (48%) Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 + DL +S G+FF+ + +L+ +F I ++I Sbjct: 450 NDSNNVYDLWHAAVNSRGEFFSADSPDQLVAAFQDILNRIS 490 >gi|307302722|ref|ZP_07582478.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] gi|307318570|ref|ZP_07598004.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306895910|gb|EFN26662.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti AK83] gi|306903086|gb|EFN33677.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] Length = 577 Score = 37.2 bits (84), Expect = 4.4, Method: Composition-based stats. Identities = 6/30 (20%), Positives = 15/30 (50%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQS 31 A+++ + + ID ++ R ++QS Sbjct: 23 AALVMPLVVASMGLGIDYGYLTLQRRELQS 52 >gi|170743328|ref|YP_001771983.1| TadE family protein [Methylobacterium sp. 4-46] gi|168197602|gb|ACA19549.1| TadE family protein [Methylobacterium sp. 4-46] Length = 240 Score = 37.2 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 21/214 (9%), Positives = 56/214 (26%), Gaps = 12/214 (5%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQ-----------MQSALDAAVLSGCASIVSDRTI 50 A + V + + + +A + + + A+ A + A++ + Sbjct: 27 FAAALPVLLVVMAVGLQVALYVNAKRSVERLARTISQMISQAVPPAG-AATATVNAADIR 85 Query: 51 KDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ 110 + ++ + S I NA IA + D + + Sbjct: 86 FGFDAAIVLFPYVLADAARQGIPWQSNIAINAAGIAFTKVASGCSDPTDQSACYVANVVW 145 Query: 111 YEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND 170 T + + + + + +V+DV + + + Sbjct: 146 TSSGTGGASYRPCLVAQQPAGNAAPPSPTTLPRSVFGPASLVVVDVVFTFRPTFGATYVP 205 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA 204 + + + Y+ P + T P Sbjct: 206 SARIAHSVYVQPRYAALVSYDPTNTDGIAETCPG 239 >gi|121595491|ref|YP_987387.1| type 4 fimbrial biogenesis protein PilY1 [Acidovorax sp. JS42] gi|120607571|gb|ABM43311.1| type 4 fimbrial biogenesis protein PilY1 [Acidovorax sp. JS42] Length = 1154 Score = 37.2 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 42/417 (10%), Positives = 97/417 (23%), Gaps = 65/417 (15%) Query: 36 AVL--SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINI 93 AVL + +S + + + + + + + Sbjct: 17 AVLIGAALSSQSALAALDFAQAPPGTVEPYVAPNVIISIDDSGSMNWRLDRENDRGATDN 76 Query: 94 TKDKNNPLQYIAES--KAQYEIPTENLFLKGLIP------SALTNLSLRSTGIIERSSEN 145 N +Y + L + + + + S+ Sbjct: 77 VTPTNGSWLSTDRRMNVLKYALKQVFNDTSLLPDGKIRLAWQVMHNNGDAPDAKNVDSDT 136 Query: 146 LAISICMVL-----DVSRSMEDL---YLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKS 197 + + VL D S + ++ Y+ S W+ Sbjct: 137 MKTNSMRVLKGSHRDNFISFINSLRPNNGTPTHKMFKQADDYMRRELGINSPWASEPGVK 196 Query: 198 KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN 257 N I + + ++ Q + + + Y T L Sbjct: 197 DKPYLGCRRNYHIVMTDGRWNSYSDTYQTGNLDGASSTELPDGTKYGSATDQKDYTKLYR 256 Query: 258 NLN-------EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIF 310 + KS + L T +E + + T + + Sbjct: 257 DETENTLADWAFKSWSDPLQSSGLT--------GQIFPPSEYDRAPATELIGGVNLPKYW 308 Query: 311 ITDGEN-SGASAYQNTLNTLQICEYMR------NAGMKIYS--VAVSAPPE--------- 352 + + T+ + + + + V + Sbjct: 309 NPAFDPANWPHMVTYTIGFSAMAYTWKTDFSDAKYNITRPTTMVPFGYDGDFPNLVNGTK 368 Query: 353 -----------GQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSVRIAPNR 398 DL + G+F+AV +L ++F +I +I Q+ P+R Sbjct: 369 TWPNMSNEDRRSLDLWHAALNGRGRFYAVERGEDLEKAFREIVKQINAQT---EPDR 422 >gi|115537688|ref|NP_872008.2| hypothetical protein T19D12.4 [Caenorhabditis elegans] gi|82654523|gb|ABB88212.1| Hypothetical protein T19D12.4b [Caenorhabditis elegans] Length = 1015 Score = 37.2 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 20/123 (16%), Positives = 48/123 (39%), Gaps = 13/123 (10%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 N + R+G I ++ +V + N L+ V + ++ ++ TN A A Sbjct: 409 GNDNTRVGIITFSSDVVEVRKLTDGNTLDAVNAAIDTVHYTGGLTNVTKAQLTAKNLFDT 468 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 E ++ N K + +TDG + + + + + +++ + + V S+ Sbjct: 469 ESNANRN--------KVLFILTDGVPTVDTY----TDEVAAGDKLKSISVISFFVGYSSY 516 Query: 351 PEG 353 + Sbjct: 517 SDE 519 >gi|330805959|ref|XP_003290943.1| hypothetical protein DICPUDRAFT_155479 [Dictyostelium purpureum] gi|325078904|gb|EGC32531.1| hypothetical protein DICPUDRAFT_155479 [Dictyostelium purpureum] Length = 388 Score = 37.2 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 36/319 (11%), Positives = 93/319 (29%), Gaps = 35/319 (10%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 I K+ K ++ G Y + A ++ Q E +E L L Sbjct: 95 KAISLKKTKIIIENGEYKTDFIDLSASIVSLSFNAYDKPVQNSKLTGTVQIEGESEKLNL 154 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM----TS 176 + S ++ +E + + + ++ S ++E+ + + + Sbjct: 155 NSITDSKGCYDTVLPKCKVEVEVKGVIDGKPVCINESYNIEESKPASNIEEAKTIRPKSF 214 Query: 177 NKYLLPPPPKKSFWSKNTTK----SKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 NK + S K S +I L ++ ++ + + Sbjct: 215 NKSVKCENYNISTAGKKVVYLCDISGSMDTSDNGVSRIVTLKKNIKKIIEENKNSFSVAA 274 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEK 292 + +I N+ N + + ++ L T+ A+ + + Sbjct: 275 WNTSTSFSIGKEWLNKQNEA----KNKTLLINWIDSLKAAGGTDMKQAIVAGISQFRDAD 330 Query: 293 ESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE 352 E + + DG+ + + + R+ ++S V Sbjct: 331 E--------------FVVLCDGDIT-------PFDMASWSTFYRDNSKYMFSF-VGIGNS 368 Query: 353 GQDLLRKCTD-SSGQFFAV 370 + +++ + +GQ+ Sbjct: 369 SDEQMKEMSQIGNGQYTNA 387 >gi|239978296|ref|ZP_04700820.1| hypothetical protein SalbJ_02585 [Streptomyces albus J1074] Length = 589 Score = 37.2 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 18/118 (15%), Positives = 41/118 (34%), Gaps = 12/118 (10%) Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 P T Y AYR + S ++ +TDG + + + + Sbjct: 476 PDGATGLYDTTLAAYRAAQDSYVSGKF--------NALVILTDG-VNEDPGSISRSSLVA 526 Query: 331 ICEYMRN--AGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLE-SFDKITD 385 E +R+ + + ++AV + +++ + G V+D ++ I + Sbjct: 527 ELEGLRDPDRPVPVIAIAVGPAADKEEVKEIAEATGGSGHQVSDPAQIHAVILKAIME 584 >gi|17536325|ref|NP_495348.1| hypothetical protein T19D12.4 [Caenorhabditis elegans] gi|3258584|gb|AAC24429.1| Hypothetical protein T19D12.4a [Caenorhabditis elegans] Length = 1028 Score = 37.2 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 20/123 (16%), Positives = 48/123 (39%), Gaps = 13/123 (10%) Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYN 290 N + R+G I ++ +V + N L+ V + ++ ++ TN A A Sbjct: 422 GNDNTRVGIITFSSDVVEVRKLTDGNTLDAVNAAIDTVHYTGGLTNVTKAQLTAKNLFDT 481 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 E ++ N K + +TDG + + + + + +++ + + V S+ Sbjct: 482 ESNANRN--------KVLFILTDGVPTVDTY----TDEVAAGDKLKSISVISFFVGYSSY 529 Query: 351 PEG 353 + Sbjct: 530 SDE 532 >gi|313107199|ref|ZP_07793398.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa 39016] gi|310879900|gb|EFQ38494.1| type 4 fimbrial biogenesis protein PilY1 [Pseudomonas aeruginosa 39016] Length = 1163 Score = 37.2 bits (84), Expect = 4.5, Method: Composition-based stats. Identities = 10/38 (26%), Positives = 19/38 (50%) Query: 351 PEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQ 388 DL +S G+FF+ + +L+ +F I ++I Sbjct: 455 NNVYDLWHAAVNSRGEFFSADSPDQLVAAFQDILNRIS 492 >gi|118381238|ref|XP_001023780.1| Phage tail fiber repeat family protein [Tetrahymena thermophila] gi|89305547|gb|EAS03535.1| Phage tail fiber repeat family protein [Tetrahymena thermophila SB210] Length = 626 Score = 37.2 bits (84), Expect = 4.6, Method: Composition-based stats. Identities = 18/208 (8%), Positives = 48/208 (23%), Gaps = 7/208 (3%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 IF+ + + + + + P + + + Sbjct: 77 QPIFEDIFMNFTDSTNSTTNSTIPSNSTSNSTVPTNSTVPTNSTVPTNSTVPTNSTVPTN 136 Query: 121 KGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVS--RSMEDLYLQKHNDNNNMTSNK 178 + ++ + + + + V S + + N+ Sbjct: 137 STVPTNSTVPTNSTVPTNSTVPTNSTVPTNSTVPTNSTVPTNSTVPTNSTVPTNSTVPTN 196 Query: 179 YLLPPPPKKSFWSKNTTKSKY-----APAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 +P S T S P + V S +++ N Sbjct: 197 STVPTNSTVPTNSTVPTNSTVPTNSTVPTNSTVPTNSTVPTNSTVPTNSTVPTNSTIPTN 256 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNE 261 ++ + V T L+N+ + Sbjct: 257 STIPTNSTVPTNSTVPVNFTILTNSTSN 284 >gi|327540870|gb|EGF27430.1| hypothetical protein RBWH47_02042 [Rhodopirellula baltica WH47] Length = 384 Score = 37.2 bits (84), Expect = 4.6, Method: Composition-based stats. Identities = 34/339 (10%), Positives = 81/339 (23%), Gaps = 41/339 (12%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 +G + + + + + + + + ++ + + Sbjct: 68 AGTIEQPTTSVGVAMAYRLPDRTRYVTEDSSEESDAATEDASDRQVDERSKSVDQSDAER 127 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + A S + L GL L D Sbjct: 128 SQSTASAASAPPAGFVP-PVDLDGLFAEMTRRGVAAGESQGTGVEGVLQFGDGKTADQLG 186 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + E + + + S + S+ A + + Sbjct: 187 TGELVPGTSRAGEGAGQTTTSVFGVSGSGSTFVYVFDHSESMSASGGKPLRA-----AKQ 241 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN--------LNEVKSRLNKLN 270 L+ S++ + + I YN + ++ +N+ Sbjct: 242 ELIRSLRTL-----SERQQFQVIFYNDRPKAFSPDGQTTGLVFGEDGIRRRAEAFVNRTV 296 Query: 271 PYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 T A+ A R + + F+TD SA Q Sbjct: 297 AVGGTEHQLALRMALRLAPDA----------------IFFLTDASIQTMSADQ----MAD 336 Query: 331 ICEYMRNAGMKIYSVAVSAPPE-GQDLLRKCT-DSSGQF 367 I +G I+++ + PE +++ + G + Sbjct: 337 IRRRAEQSGTVIHAIQFGSGPEPANSFMKEIARQNRGGY 375 >gi|323525753|ref|YP_004227906.1| hypothetical protein BC1001_1410 [Burkholderia sp. CCGE1001] gi|323382755|gb|ADX54846.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. CCGE1001] Length = 613 Score = 37.2 bits (84), Expect = 4.7, Method: Composition-based stats. Identities = 16/107 (14%), Positives = 33/107 (30%), Gaps = 1/107 (0%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A + + A+D+ H Y R +Q D A + ++ Q Sbjct: 49 LLAAVWVSLAVIALGAVDIGHFYYARRDLQRTADLAA-AAGVQLIGSAGGCAAAASSAQL 107 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES 107 + Q + R + A ++ +T N +Q + Sbjct: 108 NATANGLPGDGAVQTTCGRWDPSANAGQSYFAVTGTPLNAVQVVVSR 154 >gi|241113141|ref|YP_002972976.1| hypothetical protein Rleg_4786 [Rhizobium leguminosarum bv. trifolii WSM1325] gi|240861349|gb|ACS59015.1| conserved hypothetical protein [Rhizobium leguminosarum bv. trifolii WSM1325] Length = 194 Score = 37.2 bits (84), Expect = 4.7, Method: Composition-based stats. Identities = 16/142 (11%), Positives = 37/142 (26%), Gaps = 16/142 (11%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSAL----------------DAAVLSGCASIV 45 A+++ + + + +DL H + + ++ D A L AS + Sbjct: 32 FALVLPILVMLLFGTVDLGHALTVSRKIDEIASSTGDMISQQGSWTKSDVAKLLSGASFI 91 Query: 46 SDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA 105 T I K + A A ++ +++ +Q + Sbjct: 92 LQPYETTGLTITVAVDDIAKSGSATVNWSAALNTSALNSGAASAIEVPSEIQDDGVQVVL 151 Query: 106 ESKAQYEIPTENLFLKGLIPSA 127 + F Sbjct: 152 TRVQYTLTTPVSAFFSNFTGQN 173 >gi|103487753|ref|YP_617314.1| hypothetical protein Sala_2272 [Sphingopyxis alaskensis RB2256] gi|98977830|gb|ABF53981.1| conserved hypothetical protein [Sphingopyxis alaskensis RB2256] Length = 219 Score = 37.2 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 15/147 (10%), Positives = 40/147 (27%), Gaps = 5/147 (3%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQS----ALDAAVLSGCASIVSDRTIKDPTTKK 57 A I L +++A++ ++ D A S ++ I++ Sbjct: 30 MAFAIPFLILVGFGGLEIANLTLAHTRVSQLGLNTADNAARIAAGSNLTQPEIREV-DIN 88 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 + + ++ + I ++ I + +A + Sbjct: 89 EVFAGAARQVAGMGFENNGRIILSSLQRNNDGGQTIKWQRCFGNLEVASAYGVEGTGATG 148 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSE 144 G+ P+ + T I+ Sbjct: 149 TDFPGMGPAGREVTAAAGTAIMFVEVT 175 >gi|309777227|ref|ZP_07672190.1| hypothetical protein HMPREF0983_02829 [Erysipelotrichaceae bacterium 3_1_53] gi|308915097|gb|EFP60874.1| hypothetical protein HMPREF0983_02829 [Erysipelotrichaceae bacterium 3_1_53] Length = 5049 Score = 37.2 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 35/350 (10%), Positives = 84/350 (24%), Gaps = 17/350 (4%) Query: 42 ASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 S + D + + + + + +T Sbjct: 1997 TDYTSGKWTNDSSVRLEVEERNAGAGSPALTYEYQEDGDKDWIQFAANSNEMTVTGTGKH 2056 Query: 102 QYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSME 161 +K +E + I N ++ I V +S Sbjct: 2057 AINIRAKNASGGVSEIYYYNVWIDDTWDNSFNIMQEGNYTAANPADIPWYNSTQVIQSEF 2116 Query: 162 DLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLV 221 ++ +Y S+NT + A RK D L + + Sbjct: 2117 NVLSNGC-----KEWIEYSEDGGVNWKHNSRNTYEVSTTGAHEIMVRKNDELNSAVTSGR 2171 Query: 222 NSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN----NLNEVKSRLNKLNPYENTNT 277 + + + + + + + ++ N + S NP Sbjct: 2172 VNTKTIHVNIDKEMITDFKVKIDKDSYSSFLSTITFGIYHNEAKKASITGNFNPSGAGKI 2231 Query: 278 YPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRN 337 Y + + + N + + + D + G + I +R Sbjct: 2232 YYQIVNDPTDYVNRYADAADPNDKGWREYTAELQLDNDFKGFIYAKAKDKAGNISPIIRT 2291 Query: 338 AGMKIYSVA----VSAPPE----GQDLLRKCTDSSGQFFAVNDSRELLES 379 G+ I ++ + + G L +D + V ++ + Sbjct: 2292 DGIVIDTIGPAITIGDDSDNWITGNALQINVSDFADGVSTVGNASGITSI 2341 >gi|308181181|ref|YP_003925309.1| hypothetical protein LPST_C1999 [Lactobacillus plantarum subsp. plantarum ST-III] gi|308046672|gb|ADN99215.1| hypothetical protein LPST_C1999 [Lactobacillus plantarum subsp. plantarum ST-III] Length = 1721 Score = 37.2 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 19/239 (7%), Positives = 60/239 (25%), Gaps = 3/239 (1%) Query: 31 SALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQ 90 ALD AV + + + + I + + ++ Sbjct: 466 QALDKAVSAQSDASAAVKQASSAAADSKDAKQIAGAVSQSYKTLTDGSTMTIAELQNGLA 525 Query: 91 INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 +TK N ++ Q ++ + + T + ++ ++ + S Sbjct: 526 AKLTKTDLNGYATQTWTQNQIKMTADG-INGTMSSIKSTVDAQTTSINDLKADSSSFKSQ 584 Query: 151 CMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANR-- 208 ++ + + + + + + + + + + A Sbjct: 585 FTTVNNTLGKQTTDIGTLQATSKELTTGFNTLTTDNTTNKNNISQLQQTATELNSTMTTV 644 Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 + V + G + + K + G + + N V + Sbjct: 645 QTQVQNSAVGTNLYTDTKNFDNPASWYAASLWTKITDTYNGLAVMQTTEDWNGVSQYIQ 703 >gi|290982741|ref|XP_002674088.1| predicted protein [Naegleria gruberi] gi|284087676|gb|EFC41344.1| predicted protein [Naegleria gruberi] Length = 4968 Score = 37.2 bits (84), Expect = 4.8, Method: Composition-based stats. Identities = 31/229 (13%), Positives = 71/229 (31%), Gaps = 12/229 (5%) Query: 55 TKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIP 114 + Q+ S + N + + + +N + + Sbjct: 3640 AVSAVNNQSNSTQLNNQTIDSSNVTMNNNSSVVFNNTDNSTNTDNSSVVVTNHTISEDNT 3699 Query: 115 TENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNM 174 T + + + T+LS + S N + + D S S + + D++N Sbjct: 3700 TRSNSNGTVYNNNSTDLSNETISNNGTVSANANV---TIPDNSTSSTNQTVSPSTDSSNG 3756 Query: 175 TSNKYLLPPP--PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK 232 T + + + + N T S +P N+ I ++S+ ++ Sbjct: 3757 TVSNNTVNATASDNTTTDNSNITISDNTTSPGSNNQTISPSVDSSNGTIS--NNTNPSVV 3814 Query: 233 NLSVRIGTIAYNIGIVGNQCTP-----LSNNLNEVKSRLNKLNPYENTN 276 + S + ++ G V N ++ N + +N NT+ Sbjct: 3815 SNSSTVSNSSFTNGTVSNSTVTSNSSTITYNTTISSNETISVNVTSNTS 3863 >gi|116671468|ref|YP_832401.1| hypothetical protein Arth_2922 [Arthrobacter sp. FB24] gi|116611577|gb|ABK04301.1| hypothetical protein Arth_2922 [Arthrobacter sp. FB24] Length = 341 Score = 37.2 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 18/146 (12%), Positives = 37/146 (25%), Gaps = 17/146 (11%) Query: 9 CFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQI 68 F A+D+ + ++ Q+ DAA L+ + T + T+ Sbjct: 26 LLSFGAIAVDVGAMYAEKSVTQNGADAAALAVAQKCAKNTADPTCITGSTLSGTLANANA 85 Query: 69 KKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN-LFLKGLIPSA 127 K +L + T + + AQ F + + Sbjct: 86 KDNL----------------TNVASTIVDKTAGKVTVTTNAQDSTGVHFSTFFARIFGTD 129 Query: 128 LTNLSLRSTGIIERSSENLAISICMV 153 T + + + I Sbjct: 130 TTTIGAVAEAKWGGAQSGNVFPIAFS 155 >gi|332664650|ref|YP_004447438.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] gi|332333464|gb|AEE50565.1| von Willebrand factor type A [Haliscomenobacter hydrossis DSM 1100] Length = 345 Score = 37.2 bits (84), Expect = 4.9, Method: Composition-based stats. Identities = 20/104 (19%), Positives = 37/104 (35%), Gaps = 22/104 (21%) Query: 254 PLSNNLNEV----KSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVI 309 PL+ + V +S + + T+ A+ A + + V+ Sbjct: 145 PLTTDYEAVSLLLQSANPDMISSQGTSIGEALAIAQTNTSKSNGN-----------RVVL 193 Query: 310 FITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 ITDGE+ A A + GMKI+++ + + G Sbjct: 194 VITDGEDHEARAEAQARQAARA-------GMKIFTIGIGSEEGG 230 >gi|221213132|ref|ZP_03586108.1| membrane protein [Burkholderia multivorans CGD1] gi|221167345|gb|EED99815.1| membrane protein [Burkholderia multivorans CGD1] Length = 609 Score = 37.2 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 19/173 (10%), Positives = 47/173 (27%), Gaps = 3/173 (1%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A I L + +ID+ ++ + R +Q D L+ S+ D T ++ Sbjct: 4 VAAIWIAVALIVLGSIDVGNLYFQRRDLQRVADMTALAAVQSVNDLCPQTDTTVTASGSN 63 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQ---YEIPTENL 118 + + G + + ++ D Y + Q + Sbjct: 64 AVVTAAYRGAALNGFDAQASGNSMSIACGRWDVSDYGAAAGYFGTATNQLNAVRVVAAKT 123 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 I T + + + ++ ++ M + + Sbjct: 124 VPLFFIGPPRTISAASTAKASNIDTFSIGTTLAMFGSNQDCAGNSVSADQRNT 176 >gi|237728320|ref|ZP_04558801.1| conserved hypothetical protein [Citrobacter sp. 30_2] gi|226909798|gb|EEH95716.1| conserved hypothetical protein [Citrobacter sp. 30_2] Length = 637 Score = 37.2 bits (84), Expect = 5.0, Method: Composition-based stats. Identities = 8/105 (7%), Positives = 32/105 (30%), Gaps = 22/105 (20%) Query: 260 NEVKSRLN----KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGE 315 +++ + + T ++ A +L + K H ++ TDG+ Sbjct: 523 QTLRTAIGTHQFGIGTQGCTPLCESLWPALADLTSAKADRH----------VLVIATDGQ 572 Query: 316 NSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKC 360 + + + + ++ + + + +++ Sbjct: 573 PD------DMASARAMIQSAKDDDIIVIGIGFG--DANDSMMKSL 609 >gi|260574617|ref|ZP_05842620.1| Flp pilus assembly protein CpaB [Rhodobacter sp. SW2] gi|259023034|gb|EEW26327.1| Flp pilus assembly protein CpaB [Rhodobacter sp. SW2] Length = 284 Score = 37.2 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 25/207 (12%), Positives = 56/207 (27%), Gaps = 5/207 (2%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 +++ V +A+ +A + Q + A + A + +V + P D + Sbjct: 5 FGLVLVVGLALAGFAVYMAQGFISQTQAELAQERAARAKAGPLVEVYVVNKPLNYGDPLT 64 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + K + Q + D A N + Q TE Sbjct: 65 K--EDVQKIYWPQNALPETIFTDEALLFPANAVGQRFVLRQMEKFEPLLAVKVTEPGQDA 122 Query: 122 GLIPSALTNLSLRSTGIIERSSENLAISICMVLDV--SRSMEDLYLQKHNDNNNMTSNKY 179 GL + + + S + + +DV + + D + ++ K Sbjct: 123 GLTARLAKGMRAFAIKVDATSGVSGFLHPGDNVDVYWTGTANDAGVGGEMTRLIESTIKI 182 Query: 180 LLPPP-PKKSFWSKNTTKSKYAPAPAP 205 + + +P Sbjct: 183 IAVDQMANDDRSGSAVVAATVTVEASP 209 >gi|239613922|gb|EEQ90909.1| von Willebrand domain-containing protein [Ajellomyces dermatitidis ER-3] Length = 1108 Score = 37.2 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 32/325 (9%), Positives = 83/325 (25%), Gaps = 48/325 (14%) Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA 127 + ++ +K INIT D + P + + + Sbjct: 165 YANSSVDTGELSQSLASFVKKGAINITVDVSVDRGSTIRGLHSPTHPVAITLGRTSVAAQ 224 Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL---------QKHNDNNNMTSNK 178 ++ N VL V+ +D+ K Sbjct: 225 DLFEPNLASAAHTMQQGNAFFDTDFVLIVNAKDQDVPSAFVEKHPTIPNQRAVMATLVPK 284 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 + +P + + + + S I L + + S+ + V+ Sbjct: 285 FNIPNNNPEIVFIIDRSGSMTG--------NIKTLQSALRVFLKSLP--------VGVKF 328 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPY-ENTNTYPAMHHAYRELYNEKES 294 ++ + + + +K+ ++ + T L K + Sbjct: 329 NICSFGSRHSFMWNKSKTYDASSLKAALQYVDSIAADFGGT----------EMLEPVKAT 378 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 N + L ++ + + ++++S+ + + Q Sbjct: 379 VKNRLKDLDLDVLLLSDGEIWDQKTLFAYLNEVVS-------EQPIRLFSLGIGSGA-SQ 430 Query: 355 DLLRKCT-DSSGQFFAVNDSRELLE 378 L+ G V D+ +L + Sbjct: 431 SLIEGIARAGDGFAQFVGDNEQLDK 455 >gi|77456361|ref|YP_345866.1| von Willebrand factor, type A [Pseudomonas fluorescens Pf0-1] gi|77380364|gb|ABA71877.1| Large adhesive protein [Pseudomonas fluorescens Pf0-1] Length = 5218 Score = 37.2 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 38/273 (13%), Positives = 89/273 (32%), Gaps = 14/273 (5%) Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIP 125 I L + I+ ++ D + + + NL + P Sbjct: 4575 SSISTALTDTDGSETLSIKISGAPVGSVLSDGAGHSFTVTATSGDANVTGWNLGTLTVTP 4634 Query: 126 ----SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + NL++ ST + + V V + +D T ++ Sbjct: 4635 PTYYNGQFNLTVTSTSTEQVGGSASTTATIPVTVVPAVYNSIVATSADDTVTGTDGNDIM 4694 Query: 182 PPPPKKSFWSKNTTKS---KYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 T + + + + I+ +S ++ N++++++ + +V I Sbjct: 4695 VADIGGLTVVPGTNYNIAFMVDSSGSMSASSINAAKDSLTSVFNTLKQSLGGSNSGTVNI 4754 Query: 239 GTIAYNIGIVGNQCTPLS--NNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSH 296 + ++ + + L+ N L +K+ L+ + TN + + + Sbjct: 4755 FLVDFDTQVNKSVSVNLNDPNALTLLKAVLDSMASGGGTNYEDVFKATANFFQSAEAVA- 4813 Query: 297 NTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 +T K FITDG+ + + + T TL Sbjct: 4814 ----NTGAKNLTYFITDGQPTYYQSNEQTNPTL 4842 >gi|327349892|gb|EGE78749.1| von Willebrand domain-containing protein [Ajellomyces dermatitidis ATCC 18188] Length = 1108 Score = 37.2 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 32/325 (9%), Positives = 83/325 (25%), Gaps = 48/325 (14%) Query: 68 IKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSA 127 + ++ +K INIT D + P + + + Sbjct: 165 YANSSVDTGELSQSLASFVKKGAINITVDVSVDRGSTIRGLHSPTHPVAITLGRTSVAAQ 224 Query: 128 LTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL---------QKHNDNNNMTSNK 178 ++ N VL V+ +D+ K Sbjct: 225 DLFEPNLASAAHTMQQGNAFFDTDFVLIVNAKDQDVPSAFVEKHPTIPNQRAVMATLVPK 284 Query: 179 YLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRI 238 + +P + + + + S I L + + S+ + V+ Sbjct: 285 FNIPNNNPEIVFIIDRSGSMTG--------NIKTLQSALRVFLKSLP--------VGVKF 328 Query: 239 GTIAYNIGIVGNQCTPLSNNLNEVKS---RLNKLNPY-ENTNTYPAMHHAYRELYNEKES 294 ++ + + + +K+ ++ + T L K + Sbjct: 329 NICSFGSRHSFMWNKSKTYDASSLKAALQYVDSIAADFGGT----------EMLEPVKAT 378 Query: 295 SHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQ 354 N + L ++ + + ++++S+ + + Q Sbjct: 379 VKNRLKDLDLDVLLLSDGEIWDQKTLFAYLNEVVS-------EQPIRLFSLGIGSGA-SQ 430 Query: 355 DLLRKCT-DSSGQFFAVNDSRELLE 378 L+ G V D+ +L + Sbjct: 431 SLIEGIARAGDGFAQFVGDNEQLDK 455 >gi|299116081|emb|CBN74497.1| conserved unknown protein [Ectocarpus siliculosus] Length = 860 Score = 37.2 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 28/160 (17%), Positives = 48/160 (30%), Gaps = 26/160 (16%) Query: 254 PLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITD 313 P+S + + RL L T PA+ L ++ TD Sbjct: 422 PISEAKSRLADRLFALEEGGPTALGPAVVAGLSMLKERGGRG----------SRLVLCTD 471 Query: 314 GENSGASAYQNTLNT-----------LQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTD 362 G + + L + G+ I V+V + + L D Sbjct: 472 GLANVGLGALDDLQNDEQRQVAESFYEDLGRESSGNGVTIDVVSVDSDACDLENLGAMAD 531 Query: 363 -SSGQFFAVNDSRELLESFDKITDK---IQEQSVRIAPNR 398 S G V + +L +F I + SVR+ ++ Sbjct: 532 VSGGTVTRVK-ASDLTSNFAGILANPILASQVSVRVTLHK 570 >gi|258625590|ref|ZP_05720474.1| putative RTX protein [Vibrio mimicus VM603] gi|258582126|gb|EEW06991.1| putative RTX protein [Vibrio mimicus VM603] Length = 1274 Score = 37.2 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 25/176 (14%), Positives = 56/176 (31%), Gaps = 15/176 (8%) Query: 156 VSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIE 215 V + +H+ ++P + +++ S A + ++ + Sbjct: 734 VPGEQNITFGAEHDVAVGDLQGTVVVPGQNYNIAFMVDSSGSLDANSVGTIKTQLSTVFS 793 Query: 216 SAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN--LNEVKSRLNKLNPYE 273 S N V + V I + ++ + LS+ L +++S LN + Sbjct: 794 SLKNSVGEY--------SGKVNIFLVDFDNPSRQSISVNLSDRDALTKLESVLNSMQSGG 845 Query: 274 NTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTL 329 TN ++ + I + FITDG+ + + N + Sbjct: 846 GTNYEDVFKTTANWFLSQ-----DAINNVGASNIAYFITDGKATAYNESVNISDWK 896 >gi|197692704|gb|ACH71062.1| envelope glycoprotein [Human immunodeficiency virus 1] Length = 516 Score = 37.2 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 49/173 (28%), Gaps = 3/173 (1%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 +N L K++ ++ +V++E+ N + + E+ + Sbjct: 43 REANTTLFCASDAKAYETEVHNVWATHACVPTDPSPQEVVLENVTENFNMWKDNMVEQMH 102 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 V ++ N ++ S N T+ +N Sbjct: 103 EDTISLWDESLKPCVKLTPLCVTLNCTDITSAATNATANSNATSWTDGREEMRNCSFNIT 162 Query: 293 ESSHNTIGSTRL--KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + + K+ I TD + + NT Q C + + I+ Sbjct: 163 TNVSDKKQKEYAIFYKYDIVSTDNGTTSYLTHCNTSIIRQACPKVSFQPIPIH 215 >gi|85374479|ref|YP_458541.1| hypothetical protein ELI_08260 [Erythrobacter litoralis HTCC2594] gi|84787562|gb|ABC63744.1| hypothetical protein ELI_08260 [Erythrobacter litoralis HTCC2594] Length = 202 Score = 37.2 bits (84), Expect = 5.2, Method: Composition-based stats. Identities = 13/95 (13%), Positives = 31/95 (32%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A + V + + DL +Y ++ +Q A+ A + + T + Sbjct: 22 FAFAMPVFAVILMALFDLGFQIYAQSIVQGAVQEAARASTLESGGSNSAALDDTVRKNVQ 81 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 T+ ++ E+ G + ++D Sbjct: 82 TVIPGATLTFTRKNYANFEDVGIPEDFTDTSGSED 116 >gi|225174961|ref|ZP_03728958.1| hypothetical protein DealDRAFT_0813 [Dethiobacter alkaliphilus AHT 1] gi|225169601|gb|EEG78398.1| hypothetical protein DealDRAFT_0813 [Dethiobacter alkaliphilus AHT 1] Length = 357 Score = 37.2 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 24/324 (7%), Positives = 66/324 (20%), Gaps = 41/324 (12%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ + F ID+ + R M Sbjct: 24 IFAVALIALLGFAAIVIDVGGMYVERRSM-------------------VTAADAGALAGA 64 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDK-NNPLQYIAESKAQYEIPTENLF 119 + + + +Q + + I + + N + F Sbjct: 65 RELAESGDEALAEQAAALFAQTNGAEITDDIEVLDVEYNGESFKAVRASVGVNREH---F 121 Query: 120 LKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 + +++ R+ + Q D + N Sbjct: 122 FAKALGFNDSDVFARAVATWGYPKALSNLLPIFFEIEDGQSLPEGEQLLLDESLEPGNWG 181 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIG 239 L P +I + N + RI Sbjct: 182 FLALNPSGQNAINAVLAGGVNDYEYEVGDEI---------------TQDTKPGNANSRIN 226 Query: 240 TIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 I + + + + +S + + + ++ T + A + + + Sbjct: 227 PIEERMRLAADPDSGVS--MVGILPIIREITSGGRTEVVI-VGFAPFLIKDVITAKTKEE 283 Query: 300 GSTRLKKFVIFITDGENSGASAYQ 323 + + ++ Sbjct: 284 DNLWYGRGSVYAHLENPPNYYGGY 307 >gi|302687344|ref|XP_003033352.1| expressed protein [Schizophyllum commune H4-8] gi|300107046|gb|EFI98449.1| expressed protein [Schizophyllum commune H4-8] Length = 184 Score = 37.2 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 48/168 (28%), Gaps = 32/168 (19%) Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 +N + + N +++ + TN A+ L + Sbjct: 25 FNSTVSTVIANDFARNPDQLLQGIVGQTAGGGTNYTAALSATETLLRQHWSTERTP---- 80 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAG--MKIYSVAVSAPPEGQDLLRKC 360 VIF++DGE S +C+ G + ++V+ E + + Sbjct: 81 ----VVIFLSDGECSVGDHS-----VHALCQAAVQLGKHLSFHAVSFGPRNEVLRRMAQI 131 Query: 361 TDS-----------------SGQFFAVNDSRELLESFDKITDKIQEQS 391 + DS L E+F + D +++ Sbjct: 132 ARDVQARAPRDPALPTTAYVESSYAEALDSVRLAETFLGLADSLRKPR 179 >gi|134300239|ref|YP_001113735.1| hypothetical protein Dred_2400 [Desulfotomaculum reducens MI-1] gi|134052939|gb|ABO50910.1| protein of unknown function DUF1078 domain protein [Desulfotomaculum reducens MI-1] Length = 856 Score = 37.2 bits (84), Expect = 5.3, Method: Composition-based stats. Identities = 25/280 (8%), Positives = 67/280 (23%), Gaps = 5/280 (1%) Query: 40 GCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNN 99 G A + + + + + A + IT ++ Sbjct: 385 GAAGSTGLDAVTSGEDLAQKLQALLDARTTIPGTNITVPENMTATNATAGNVTITGKYSD 444 Query: 100 PLQYIAESK-AQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 ++ T L S N S + + + +LD Sbjct: 445 TSDGTVSVVYSEGSATTTPDPANSLAGSPTVNASKYYNTPATWNLKYDNATSDWLLDKGD 504 Query: 159 SMEDLYLQ----KHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + + D + + +K L P ++ T+ S + A + Sbjct: 505 GTGYNSITFTGDIYTDPSGVEIDKTGLTPTSDNNWDISLTSGSISISGVSGAVTLSNGDT 564 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 + N + + + + ++ N++ ++ + Sbjct: 565 SATYNGLTIDVSDVISTGTVQNGDTWTYNVKADKPSTNVAVTWNIDHLEFSTVGTPKDGS 624 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDG 314 T + A + + + +K +G Sbjct: 625 TIPQIRIKDAVDLIGYVDRTEKGVNSADWSEKDFFIEYNG 664 >gi|296158802|ref|ZP_06841631.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. Ch1-1] gi|295891007|gb|EFG70796.1| Protein of unknown function DUF2134, membrane [Burkholderia sp. Ch1-1] Length = 597 Score = 36.8 bits (83), Expect = 5.4, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 50/170 (29%), Gaps = 10/170 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI +SV + A+D+ ++ + R Q+Q D A ++ I S TT Q Sbjct: 35 LAAIWLSVAIAALG-ALDVGNVYFARRQLQRTADLAAMAAVQMIGSTGGCATATTAAQQN 93 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA-------QYEI 113 + D + + + N +Q + Sbjct: 94 AAANG--FTAGSTTTISTTCGRWDTSSSTYFGTSGNPLNAVQVKTTQVVPYFFVGPSRNV 151 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 I + + S + + N + + VS S+ D Sbjct: 152 SATATAFASNIDTFSLGTGIASINTQQSALLNAILGGLLNTSVSLSVGDT 201 >gi|237653353|ref|YP_002889667.1| hypothetical protein Tmz1t_2688 [Thauera sp. MZ1T] gi|237624600|gb|ACR01290.1| conserved hypothetical protein [Thauera sp. MZ1T] Length = 474 Score = 36.8 bits (83), Expect = 5.4, Method: Composition-based stats. Identities = 12/98 (12%), Positives = 32/98 (32%), Gaps = 5/98 (5%) Query: 3 AIIISVCFLFITYAI-----DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKK 57 A+++ + +F T A+ + ++ + ++ +A DA S T + Sbjct: 23 ALVLGLFLVFATLAVLLLMFNTGRVVDEKMRLTNAADATAWSVATLEARALNYDAYTNRA 82 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITK 95 + + Q + Y + Q+ + Sbjct: 83 IVANEVAIAQAISLISWMHYFETAVENAPNLNQVAASW 120 >gi|84499825|ref|ZP_00998113.1| hypothetical protein OB2597_07840 [Oceanicola batsensis HTCC2597] gi|84392969|gb|EAQ05180.1| hypothetical protein OB2597_07840 [Oceanicola batsensis HTCC2597] Length = 244 Score = 36.8 bits (83), Expect = 5.4, Method: Composition-based stats. Identities = 20/136 (14%), Positives = 39/136 (28%), Gaps = 30/136 (22%) Query: 262 VKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASA 321 V ++ L+P T ++ A R L ++ +TDG + Sbjct: 106 VIRAVDALSPGGLTPLARSVGVAARVL-----------DHREKAGIIVVVTDGNETCGGR 154 Query: 322 YQNTLNTLQICEYMRNAGMKIYSVAV-------------SAPPEGQDLLRKC--TDSSGQ 366 T L + I+ + G+D + +C + G Sbjct: 155 PCATGAALAAEA----RDLTIHVIGFRALVDYWTWDNPEQEAHVGEDTVARCLAEKTGGM 210 Query: 367 FFAVNDSRELLESFDK 382 + EL+E+ Sbjct: 211 YVRTETVGELVEALQA 226 >gi|167836681|ref|ZP_02463564.1| hypothetical protein Bpse38_09356 [Burkholderia thailandensis MSMB43] Length = 323 Score = 36.8 bits (83), Expect = 5.4, Method: Composition-based stats. Identities = 29/210 (13%), Positives = 65/210 (30%), Gaps = 10/210 (4%) Query: 4 IIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDP--TTKKDQTS 61 + ++V +F +AIDL ++ +RN++Q+ DAA L+G ++ + ++ Sbjct: 2 LFVTVLLIFGAFAIDLPRVITVRNELQNDADAAALAGAGALTTPGVSGPAWTQAASATSA 61 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQ------INITKDKNNPLQYIAESKAQYEIPT 115 I ++ ++ + I +Q A Sbjct: 62 AILLNASDDQTLTSGIVQTGYWNLTGQPSTLQPTTITPGPYDMPAVQVTVTRAANQNGGA 121 Query: 116 ENLFLKGLIPSALTNLSLRSTGIIERSSENLA--ISICMVLDVSRSMEDLYLQKHNDNNN 173 L + + + N S + II S A + ++ S + Sbjct: 122 IPLLMGNFLGVSTANGSATAVAIIASPSTVGAGGVFPMVIDQCVLSQYWNAQTNQPKIDP 181 Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAP 203 T Y + +++ T + Sbjct: 182 STGQPYDIQIGNGQTYGGSCTGGQWTSFLT 211 >gi|153815450|ref|ZP_01968118.1| hypothetical protein RUMTOR_01685 [Ruminococcus torques ATCC 27756] gi|145847309|gb|EDK24227.1| hypothetical protein RUMTOR_01685 [Ruminococcus torques ATCC 27756] Length = 1620 Score = 36.8 bits (83), Expect = 5.6, Method: Composition-based stats. Identities = 29/336 (8%), Positives = 88/336 (26%), Gaps = 21/336 (6%) Query: 2 TAIIISVCFLFITY---AIDLAHIMYIRNQMQSALDA-----AVLSGCASIVSDRTIKDP 53 A++++ + I + +M + A G + S Sbjct: 7 AALVMAAILGVTGFSVPGIGVVQAEEAHTEMSGEAEQQEINLAADRGSTARASKFLAASG 66 Query: 54 TTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEI 113 + F + + + ++ D Q + + ++E K + Sbjct: 67 NLPAREPGLAFDGISDNNGEADNSRWQSGEDAEFSEQ--WLEVDLGGICVVSEIKVDFFA 124 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND--- 170 F + S + + E +++ +DV + + + ++ Sbjct: 125 RLYGDFRVEVSDSNAEDAVWTTIA-TADMPEGTDLNLKKTVDVKENGKAREIPRYIRLYF 183 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN--SIQKAI 228 + + + ++S Y + + A+ Sbjct: 184 TSGNSQAANRSIGVREFQVIGTKKSESGYETITGNIALNKTASASGVEAAMPNLTANLAV 243 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 +K+ + R G NQ ++++ + + T+ + + Sbjct: 244 DGQKSDTSRWSAPTMKNGTSPNQQQTPQWLEIDLRNEVTNI-----TSIDLYFYKLVYSI 298 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 E ++ + + K V E + + + Sbjct: 299 DYEIQTRADKKSEWKTVKHVTCQPGNEQNKHDSITD 334 >gi|324991934|gb|EGC23857.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK405] gi|332363536|gb|EGJ41317.1| fused nitric oxide reductase NorD/von Willebrand factor type A domain protein [Streptococcus sanguinis SK1059] Length = 462 Score = 36.8 bits (83), Expect = 5.6, Method: Composition-based stats. Identities = 28/271 (10%), Positives = 78/271 (28%), Gaps = 55/271 (20%) Query: 170 DNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQ 229 + + + + S N ++D+L + + ++ + + Sbjct: 188 RTDPIQGQMNIAVSFVFDTSGSMNWDLQGRNVEKTGNESRMDILRKKSVIMIKDLAEI-- 245 Query: 230 EKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN---KLNPYENTNTYPAMHHAYR 286 N+SV + + + + + L N N + + + LNP TN + + Sbjct: 246 --GNVSVNLVGFSTSAKYIQQNFSNLDNGTNTIIATITKRENLNPDGVTNPGDGLRYGMI 303 Query: 287 ELYNEKES-------SHNTIGSTRLKKFVIFITD-------------------------- 313 L ++ + + + ++ + Sbjct: 304 SLQSQPAQLKYIVLLTDGIPNAYLVDSRALYAGNRVDLSQGAGRVTFNNPIYDLSPTLGY 363 Query: 314 -----GENSGASAYQNTLNTLQICEYMRNA---GMK-IYSVAV-SAPP---EGQDLLRKC 360 G + + N++ + G+K + + GQ L + Sbjct: 364 EYSRLGYDLYSRDSITRENSIAYAGEVSKKFGLGIKRVNVIGFSGVDHEIAYGQSLTDRI 423 Query: 361 TDSS--GQFFAVNDSRELLESFDKITDKIQE 389 + ++ + + L ++F I +IQ+ Sbjct: 424 GEGGMETKYVSATNEEALQKTFSDIKKQIQQ 454 >gi|197692696|gb|ACH71058.1| envelope glycoprotein [Human immunodeficiency virus 1] Length = 516 Score = 36.8 bits (83), Expect = 5.6, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 50/173 (28%), Gaps = 3/173 (1%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 +N L K++ ++ +V++E+ N + + E+ + Sbjct: 43 REANTTLFCASDAKAYGTEVHNVWATHACVPTDPSPQEVVLENVTENFNMWKNNMVEQMH 102 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 + V ++ N ++ S N T+ +N Sbjct: 103 EDIISLWDESLKPCVKLTPLCVTLNCTDITSAATNATANSNATSWTDGREEMRNCSFNIT 162 Query: 293 ESSHNTIGSTRL--KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + + K+ I TD + + NT Q C + + I+ Sbjct: 163 TNVSDKKQKEYAIFYKYDIVSTDNGTTSYLTHCNTSIIRQACPKVSFQPIPIH 215 >gi|303248311|ref|ZP_07334573.1| TadE family protein [Desulfovibrio fructosovorans JJ] gi|302490336|gb|EFL50248.1| TadE family protein [Desulfovibrio fructosovorans JJ] Length = 165 Score = 36.8 bits (83), Expect = 5.7, Method: Composition-based stats. Identities = 17/133 (12%), Positives = 42/133 (31%), Gaps = 5/133 (3%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+++ V + I++A+I+ ++Q L++A + I + I + + Sbjct: 24 FALVLPVLVFMLLGIIEVANIL----RIQFTLESAATTVAHDISQNPNITN-QSAAQNLF 78 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + ++QG + + A N+ Y ++ Sbjct: 79 DGKQDSYAPLVQQGRDTSDPSDPPALAMSPTTRPTCNSSSCTPFLITITYTYKAMTAPMQ 138 Query: 122 GLIPSALTNLSLR 134 + S R Sbjct: 139 PFFDGLTLSASAR 151 >gi|256840371|ref|ZP_05545879.1| conserved hypothetical protein [Parabacteroides sp. D13] gi|256737643|gb|EEU50969.1| conserved hypothetical protein [Parabacteroides sp. D13] Length = 843 Score = 36.8 bits (83), Expect = 5.7, Method: Composition-based stats. Identities = 26/227 (11%), Positives = 61/227 (26%), Gaps = 6/227 (2%) Query: 124 IPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPP 183 I S L ++ + + + + D + Y + L Sbjct: 264 IGSLLIDMVGGQDSVAALFNFSNYDLTSWMTDYMDNAVGNYYTQRWYIARRDQGSISLCD 323 Query: 184 PPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQE---KKNLSVRIGT 240 + + + + L NS + A + L+ Sbjct: 324 YYPPTDDNSILNGGAWTRFNTSDPGFYPNASQREQALANSERYAGWSRSRVQQLNNSNDG 383 Query: 241 IAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIG 300 Y I N K+ +++ T ++ M Y ++++ NT Sbjct: 384 YTYTINTRQQAYIISKGNKQTKKAYAYEIH---VTQSWNRMEVVYEDVFDSYSMDLNTFK 440 Query: 301 STRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAV 347 + + F + E + N Q + + G + +++V Sbjct: 441 AQLNARLSEFNDNEEGYVYYIASDARNYYQATDAAKLQGCESVTISV 487 >gi|239817564|ref|YP_002946474.1| outer membrane adhesin like proteiin [Variovorax paradoxus S110] gi|239804141|gb|ACS21208.1| outer membrane adhesin like proteiin [Variovorax paradoxus S110] Length = 1867 Score = 36.8 bits (83), Expect = 5.7, Method: Composition-based stats. Identities = 33/303 (10%), Positives = 67/303 (22%), Gaps = 20/303 (6%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 +G + S TI + G D Sbjct: 1274 AGEPPLQSTGTISIADPDSAAGDLSVSLSGPNGVTSGGQPVSWTWDAGTHTLTGSVTVGG 1333 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + + + F G + + S+ +V D Sbjct: 1334 VTTEVMTVAVGNVSATGAGQFEAGYTVTLKAPIDHLPGNGEGVSNL---HFEAVVSDGQA 1390 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 S + + ++ + ++ E+ Sbjct: 1391 SGAPVGFEVPVKDDAPVLVNGEQAVDVAPIDTNLMVILDLSGSMGQETPTRLSRAKEAIQ 1450 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 NL++ VR+ + ++ Q E K+ + L +TN Sbjct: 1451 NLIDGYDLY------GDVRVQLVTFSTTGASQQAWM---TAAEAKALVQNLQAAGSTNYD 1501 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A+ A T + F+TDGE + L + Sbjct: 1502 AALAAAMNGFS-------ATGKLDGAQNVSYFLTDGEPTLGDGNTAQLANSSNSSTA-DR 1553 Query: 339 GMK 341 G++ Sbjct: 1554 GIQ 1556 >gi|258573439|ref|XP_002540901.1| predicted protein [Uncinocarpus reesii 1704] gi|237901167|gb|EEP75568.1| predicted protein [Uncinocarpus reesii 1704] Length = 1032 Score = 36.8 bits (83), Expect = 5.7, Method: Composition-based stats. Identities = 23/239 (9%), Positives = 55/239 (23%), Gaps = 3/239 (1%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + + + S ++ ++A + + Sbjct: 481 ATVPTGSVSTDTTSNGSATQTLSNTASNSATASIETLPTQSQSASASDSSTIVPTVPSDS 540 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + + PS +S + S V S Sbjct: 541 ATVP-TGSASTDTTSSEMPTQTLTNTPSGSATVSTDTATNSVPSQTLSNTPSTTVPTGSA 599 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFW--SKNTTKSKYAPAPAPANRKIDVLIES 216 S + + + T + P S S + P+ + + Sbjct: 600 STDTTTNGLPSQTLSNTPSDSATVPTGSASTDTTSNGVPSQTLSNTPSSSVTVPTGSAST 659 Query: 217 AGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENT 275 N + + + +V G+ + + G LSN+ + + P +T Sbjct: 660 DTTSNNLPSQTLSNTPSATVPTGSASTDTATNGLPTQTLSNSESSGSLTVPTGMPSGST 718 >gi|255693879|ref|ZP_05417554.1| BatB protein [Bacteroides finegoldii DSM 17565] gi|260620308|gb|EEX43179.1| BatB protein [Bacteroides finegoldii DSM 17565] Length = 342 Score = 36.8 bits (83), Expect = 5.7, Method: Composition-based stats. Identities = 24/169 (14%), Positives = 56/169 (33%), Gaps = 46/169 (27%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L +NP + T A++ A R + + + Sbjct: 142 TQLPITSDYISAKMFLESINPSLISKQGTAIGEAINLAARSFTPQ----------EGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 +I ITDGEN A + + G+++ + V P Sbjct: 192 AIIVITDGENHEGGAVEAAKAAAE-------KGIQVNVLGVGMPDGAPIPAEGTNDYRRD 244 Query: 353 ----------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 + + ++ G + V++S ++ ++ +K+ + Sbjct: 245 REGNVIVTRLNETMCQEIAKEGKGIYVRVDNSNSAQKAINQEVNKMAKS 293 >gi|167624203|ref|YP_001674497.1| hypothetical protein Shal_2279 [Shewanella halifaxensis HAW-EB4] gi|167354225|gb|ABZ76838.1| conserved hypothetical protein [Shewanella halifaxensis HAW-EB4] Length = 446 Score = 36.8 bits (83), Expect = 5.7, Method: Composition-based stats. Identities = 14/96 (14%), Positives = 30/96 (31%), Gaps = 7/96 (7%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 M I I F A+D H++ + ++Q+A+ A+ ++ + Sbjct: 25 MFTIGIFAVIAFAALALDGGHMLLSKGRLQNAV-------DAAALNAAKELQEGATLLEA 77 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKD 96 + ++L N N T+ Sbjct: 78 REAAYTILLQNLSFTENGELNTSVSLSSPDFNNTQV 113 >gi|296444402|ref|ZP_06886367.1| TadE family protein [Methylosinus trichosporium OB3b] gi|296258049|gb|EFH05111.1| TadE family protein [Methylosinus trichosporium OB3b] Length = 207 Score = 36.8 bits (83), Expect = 5.8, Method: Composition-based stats. Identities = 15/165 (9%), Positives = 41/165 (24%), Gaps = 11/165 (6%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 AI+ I ++ + Q+Q A A + + + + T Sbjct: 31 FAIVSVPLLGLIGAIFEVGLVYMRGEQLQIATQNASRAMLTNSIGNMTY----------Q 80 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + G+ +++ + + A S + Y P Sbjct: 81 NFINNYVCSWQTTGTVAPGTLDRSFDCSRLLVDVSSPSSWTGAATSNSFYTAPNALGSTI 140 Query: 122 GLIPSALTN-LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYL 165 + + + + + + + M L ++ Sbjct: 141 TMPAAGSIAVVRIVYPMPMMTAILTGGVLTGMTLGNGKTAGGWLT 185 >gi|115390819|ref|XP_001212914.1| conserved hypothetical protein [Aspergillus terreus NIH2624] gi|114193838|gb|EAU35538.1| conserved hypothetical protein [Aspergillus terreus NIH2624] Length = 1013 Score = 36.8 bits (83), Expect = 5.8, Method: Composition-based stats. Identities = 33/281 (11%), Positives = 75/281 (26%), Gaps = 35/281 (12%) Query: 108 KAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRS-------- 159 + F + R+ + + + + LD S Sbjct: 395 TLNLSVSELPCFFLRFQSRNQLEIWRRAL-LDLHQLDTPSRNPDFDLDNSGEEEDYRASK 453 Query: 160 ----MEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN---RKIDV 212 +NN Y S S + P ++ KI + Sbjct: 454 IRRQASLNSSYGAARSNNTAITDYTNAGGDTLSTTSLHIPLDIVVVIPVSSSMQGLKITL 513 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-NLNEVKSRLNKLNP 271 L ++ LVN++ R+G + + G ++ + LN + P Sbjct: 514 LRDALKFLVNNLG--------PRDRMGLVTFGSSGGGVPLVGMTTKSWGGWPKILNSIRP 565 Query: 272 YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQI 331 + + + + S+ ++ I S+ + + + Sbjct: 566 VGHKSLRADVVEGANVAMDLLMQ----RKSSNPLSTILLI------SDSSTSDPDSVDFV 615 Query: 332 CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVND 372 A + I+S + + ++ T + G + V D Sbjct: 616 VSRAEAAKVSIHSFGLGLTHKPDTMIELSTRTKGSYLYVKD 656 >gi|29346316|ref|NP_809819.1| hypothetical protein BT_0906 [Bacteroides thetaiotaomicron VPI-5482] gi|253568263|ref|ZP_04845674.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298385670|ref|ZP_06995228.1| BatB protein [Bacteroides sp. 1_1_14] gi|29338211|gb|AAO76013.1| BatB, conserved hypothetical protein [Bacteroides thetaiotaomicron VPI-5482] gi|251842336|gb|EES70416.1| conserved hypothetical protein [Bacteroides sp. 1_1_6] gi|298261811|gb|EFI04677.1| BatB protein [Bacteroides sp. 1_1_14] Length = 342 Score = 36.8 bits (83), Expect = 5.8, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 53/168 (31%), Gaps = 49/168 (29%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L ++P + T A++ A R + + + Sbjct: 142 TQLPITSDYISAKMFLESISPSLISKQGTAIGEAINLATRSFTPQ----------EGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 +I ITDGEN A + + +++ + V P Sbjct: 192 AIIVITDGENHEGGAVEAAKAAAEKG-------IQVSVLGVGMPEGAPIPVEGTNDYRRD 244 Query: 353 ----------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQE 389 + + ++ G + V++S + I+ +I + Sbjct: 245 REGNVIVTRLNEGMCQEIAKDGKGIYVRVDNS---NSAQKAISQEISK 289 >gi|159112091|ref|XP_001706275.1| Kinase, NEK [Giardia lamblia ATCC 50803] gi|157434370|gb|EDO78601.1| Kinase, NEK [Giardia lamblia ATCC 50803] Length = 1006 Score = 36.8 bits (83), Expect = 5.8, Method: Composition-based stats. Identities = 20/131 (15%), Positives = 39/131 (29%), Gaps = 6/131 (4%) Query: 204 APANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNE-- 261 A + + S +L++S + R P + Sbjct: 358 AYMTDNVGMYSLSVDDLISSHSNVPGVRNEPVTRSSLTPGAAVATTLYAAPTYSREATRI 417 Query: 262 VKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENS 317 + SR++ + NT+ + ++R L +SHN S + DG Sbjct: 418 ISSRVDSMRTGAELPGNTSINNGLSPSFRNLLEMNSASHNYDNSIDDTSPIYNQNDGSVR 477 Query: 318 GASAYQNTLNT 328 + Q +T Sbjct: 478 ATNDAQGAADT 488 >gi|4139906|pdb|1AOX|A Chain A, I Domain From Integrin Alpha2-Beta1 gi|4139907|pdb|1AOX|B Chain B, I Domain From Integrin Alpha2-Beta1 Length = 203 Score = 36.8 bits (83), Expect = 5.9, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 63/187 (33%), Gaps = 24/187 (12%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 ++ N + + + ++G I Y E+ ++ + Y Sbjct: 24 DAVKNFLEKFVQ-GLDIGPTKTQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQYGG 82 Query: 275 --TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TNT+ A+ +A + Y+ + K ++ +TDGE+ S + Sbjct: 83 DLTNTFGAIQYARKYAYSAASGGRRSAT-----KVMVVVTDGESHDGSM------LKAVI 131 Query: 333 EYMRNAGMKIYSVAV----SAPPEGQDLL----RKCTD--SSGQFFAVNDSRELLESFDK 382 + + + + +AV + L + + FF V+D LLE Sbjct: 132 DQCNHDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGT 191 Query: 383 ITDKIQE 389 + ++I Sbjct: 192 LGEQIFS 198 >gi|197692693|gb|ACH71057.1| envelope glycoprotein [Human immunodeficiency virus 1] Length = 516 Score = 36.8 bits (83), Expect = 6.0, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 50/173 (28%), Gaps = 3/173 (1%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 +N L K++ ++ +V++E+ N + + E+ + Sbjct: 43 REANTTLFCASDAKAYETEVHNVWATHACVPTDPSPQEVVLENVTENFNMWKNNMVEQMH 102 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 + V ++ N ++ S N T+ +N Sbjct: 103 EDIISLWDESLKPCVKLTPLCVTLNCTDITSAATNATANSNATSWTDGREEMRNCSFNIT 162 Query: 293 ESSHNTIGSTRL--KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + + K+ I TD + + NT Q C + + I+ Sbjct: 163 TNVSDKKQKEYAIFYKYDIVSTDNGTTSYLTHCNTSIIRQACPKVSFQPIPIH 215 >gi|197692708|gb|ACH71064.1| envelope glycoprotein [Human immunodeficiency virus 1] Length = 516 Score = 36.8 bits (83), Expect = 6.0, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 50/173 (28%), Gaps = 3/173 (1%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 +N L K++ ++ +V++E+ N + + E+ + Sbjct: 43 REANTTLFCASDAKAYETEVHNVWATHACVPTDPSPQEVVLENVTENFNMWKNNMVEQMH 102 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 + V ++ N ++ S N T+ +N Sbjct: 103 EDIISLWDESLKPCVKLTPLCVTLNCTDITSAATNATANSNATSWTDGREEMRNCSFNIT 162 Query: 293 ESSHNTIGSTRL--KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + + K+ I TD + + NT Q C + + I+ Sbjct: 163 TNVSDKKQKEYAIFYKYDIVSTDNGTTSYLTHCNTSIIRQACPKVSFQPIPIH 215 >gi|296112122|ref|YP_003622504.1| glucosyltransferase-S [Leuconostoc kimchii IMSNU 11154] gi|295833654|gb|ADG41535.1| glucosyltransferase-S [Leuconostoc kimchii IMSNU 11154] Length = 1549 Score = 36.8 bits (83), Expect = 6.1, Method: Composition-based stats. Identities = 29/296 (9%), Positives = 73/296 (24%), Gaps = 21/296 (7%) Query: 35 AAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINIT 94 AA + V + D T + T + + + + + A + N+ Sbjct: 21 AAGILSFGLAVQQQVYADTTHGESTTQVVAATNVDTSVSDTPTVTDAGAKDATTSHDNVI 80 Query: 95 KDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 + + E + ST + + + + + Sbjct: 81 VHDDQAANSTQTNNQNQETNHADAVTA-----PTDVAQTDSTAQVAPIASTNSDDTQVDV 135 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 + Q D + ++ +K+ + S N +I Sbjct: 136 TTGTVNDVNNNQPDADKGASDQKSSDVNHADTQTNVNKDASVSNTNNTDKDVNTQIASDK 195 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 ++ + + T + +NN + + + Sbjct: 196 DAGNQ---------TDPTQDTNEQRTTDKKNDDNQDITDTTTNNPSAKNDKRD------- 239 Query: 275 TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQ 330 TNT + ++ + + + + ++KK I DG L Sbjct: 240 TNTPEPIQYSSKNIQTVNGQTVYVDDNGQIKKNFTAIVDGHVLYFDKDNGFLVPTD 295 >gi|322369013|ref|ZP_08043580.1| GLUG domain protein [Haladaptatus paucihalophilus DX253] gi|320551744|gb|EFW93391.1| GLUG domain protein [Haladaptatus paucihalophilus DX253] Length = 2092 Score = 36.8 bits (83), Expect = 6.1, Method: Composition-based stats. Identities = 36/315 (11%), Positives = 89/315 (28%), Gaps = 5/315 (1%) Query: 29 MQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQK 88 +Q +LD ++ ++ ++ + + + G+ + G+ Sbjct: 347 VQLSLDGGAVTNSTTVAAESDADSSGETVTFGNLDASGLAEGSISIGATFTDYGGNSNAA 406 Query: 89 AQINITKDKNNPLQYIAESKAQYEIPTENL-FLKGLIPSALTNLSLRSTGIIERSSENLA 147 + + S + I T + + + + L+ + +E ++ N Sbjct: 407 PGLASVTKDTGLPGVVDASISNAPIGTYDTGTQQTVTVAFDEALNQSISPTVEITNLNRT 466 Query: 148 ISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPAN 207 ++ D + + +D + + + + + + + PA Sbjct: 467 YAVSGGFDDATTWTGTVTIADDDEQRTATIAVSNATDTVGNVMTPDGSNTFQVDSTGPAK 526 Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIV----GNQCTPLSNNLNEVK 263 + + N ++ G TP + + Sbjct: 527 PDSTAAGNVTRSNRTDYNVTVTFVTNSQADEVSVKVTDGTTAVVSNRSITPGTETVTVTG 586 Query: 264 SRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++ L+ T T A+ Y + T + R I I DGE + A A Sbjct: 587 IDVSSLDDGAITATALALDGGYANTEGYAAPTTVTKDTKRPSASSITIGDGEINDADAGT 646 Query: 324 NTLNTLQICEYMRNA 338 T+ E M + Sbjct: 647 ARTVTVTFDEAMNRS 661 >gi|197692720|gb|ACH71070.1| envelope glycoprotein [Human immunodeficiency virus 1] Length = 516 Score = 36.8 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 50/173 (28%), Gaps = 3/173 (1%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 +N L K++ ++ +V++E+ N + + E+ + Sbjct: 43 REANTTLFCASDAKAYETEVHNVWATHACVPTDPSPQEVVLENVTENFNMWKNNMVEQMH 102 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 + V ++ N ++ S N T+ +N Sbjct: 103 EDIISLWDESLKPCVKLTPLCVTLNCTDITSAATNATANSNATSGTDGREEMRNCSFNIT 162 Query: 293 ESSHNTIGSTRL--KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + + K+ I TD + + NT Q C + + I+ Sbjct: 163 TNVSDKKQKEYAIFYKYDIVSTDNGTTSYLTHCNTSIIRQACPKVSFQPIPIH 215 >gi|223888758|ref|ZP_03623349.1| tetratricopeptide repeat domain protein [Borrelia burgdorferi 64b] gi|223885574|gb|EEF56673.1| tetratricopeptide repeat domain protein [Borrelia burgdorferi 64b] Length = 668 Score = 36.8 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 23/202 (11%), Positives = 68/202 (33%), Gaps = 12/202 (5%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 ++++ N + A N + S ++ Q + ++ Sbjct: 436 SNENYYIVNPKVAYNVNASKDINIAVVFDKSSYMKKYDTDQIVGLNALMELSKNKNFSFI 495 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 P+ +N+ + + + + +T +L + + + S Sbjct: 496 N----ATSVPIIDNIESLTNSIRNTSSLGPYSTDAVKTDVSLKL------AGSGLMSKSS 545 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 ++ V++ + G + + + +L+T I Y +N ++ Y + P L ++ Sbjct: 546 RRAVVYFSGGILNRKAFEKYSLDT--IVSYYKNNDIRFYLILFGNDPINSKLQYLVNETG 603 Query: 365 GQFFAVNDSRELLESFDKITDK 386 G + + + +D I ++ Sbjct: 604 GAVIPFSSYEGVSKVYDLILEQ 625 >gi|218249288|ref|YP_002374759.1| tetratricopeptide repeat domain protein [Borrelia burgdorferi ZS7] gi|226321554|ref|ZP_03797080.1| tetratricopeptide repeat domain protein [Borrelia burgdorferi Bol26] gi|218164476|gb|ACK74537.1| tetratricopeptide repeat domain protein [Borrelia burgdorferi ZS7] gi|226232743|gb|EEH31496.1| tetratricopeptide repeat domain protein [Borrelia burgdorferi Bol26] Length = 668 Score = 36.8 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 23/202 (11%), Positives = 68/202 (33%), Gaps = 12/202 (5%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 ++++ N + A N + S ++ Q + ++ Sbjct: 436 SNENYYIVNPKVAYNVNASKDINIAVVFDKSSYMKKYDTDQIVGLNALMELSKNKNFSFI 495 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 P+ +N+ + + + + +T +L + + + S Sbjct: 496 N----ATSVPIIDNIESLTNSIRNTSSLGPYSTDAVKTDVSLKL------AGSGLMSKSS 545 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 ++ V++ + G + + + +L+T I Y +N ++ Y + P L ++ Sbjct: 546 RRAVVYFSGGILNRKAFEKYSLDT--IVSYYKNNDIRFYLILFGNDPINSKLQYLVNETG 603 Query: 365 GQFFAVNDSRELLESFDKITDK 386 G + + + +D I ++ Sbjct: 604 GAVIPFSSYEGVSKVYDLILEQ 625 >gi|51245321|ref|YP_065205.1| fimbrial biogenesis protein PilY1 [Desulfotalea psychrophila LSv54] gi|50876358|emb|CAG36198.1| related to fimbrial biogenesis protein PilY1 [Desulfotalea psychrophila LSv54] Length = 1640 Score = 36.8 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 29/334 (8%), Positives = 81/334 (24%), Gaps = 19/334 (5%) Query: 64 FKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGL 123 + + I + + +I D L Y + I + Sbjct: 572 AVGYLALSPRTNYNIALLTDQVPKGLLHDIKDDVRQGLSYYKYNLTTNNIYNSWWHGGTM 631 Query: 124 -IPSALTN-LSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLL 181 + + S + I +++D ++ N +Y Sbjct: 632 RLRIPTNPFVKKSSDTNFRTIDTPIDSDIEILVDAVEHYPLIWGTTPLAENFYEVIRYFQ 691 Query: 182 PPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTI 241 K +++ + + + + ++ ++ + + Sbjct: 692 Q---KGPYYNSSVNSAGDVASFLTEKAWDPYYFKELDAMIRCANSSVIIFTDGGSYTDSY 748 Query: 242 AYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGS 301 + S + N TN Y + + Sbjct: 749 VPPFNDEEFNESSASQAASAYSDGSN------YTNDYDGNIVTGEGINTRNTDNSYKNNL 802 Query: 302 TRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM-KIYSVAVS--APPEGQDLLR 358 L + G+ + N+ + + ++ Y+V A + LL+ Sbjct: 803 DDLALWANLNRKGQALT----EALSNSRDLRDDLKGEQYLTTYTVGFGISANSAEERLLQ 858 Query: 359 KCTDSS-GQFFAVNDSRELLESFDKITDKIQEQS 391 G + D ++L + + I +++ Sbjct: 859 DTAAHGQGTYSLAEDGQQLKDVLKGTINSILDKT 892 >gi|284991843|ref|YP_003410397.1| hypothetical protein Gobs_3425 [Geodermatophilus obscurus DSM 43160] gi|284065088|gb|ADB76026.1| conserved hypothetical protein [Geodermatophilus obscurus DSM 43160] Length = 298 Score = 36.8 bits (83), Expect = 6.2, Method: Composition-based stats. Identities = 6/113 (5%), Positives = 21/113 (18%), Gaps = 15/113 (13%) Query: 7 SVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKK 66 F A+D+ + R ++Q + + + Sbjct: 8 VPILGFTAIAVDVGALYAERARLQ--------TAADAAALAVARDCARGACGDMRATAQD 59 Query: 67 QIKKHLKQGSYIRENAGDIAQKAQINITKDKN-------NPLQYIAESKAQYE 112 + + + + + + + A Sbjct: 60 LVDANTGNATAAPPVLASDPVRVTVTGSTPTEHWFAPVIGHESTQVTATATVA 112 >gi|197692712|gb|ACH71066.1| envelope glycoprotein [Human immunodeficiency virus 1] Length = 516 Score = 36.8 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 23/173 (13%), Positives = 51/173 (29%), Gaps = 3/173 (1%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 +N L K++ ++ +V++E+ N + + E+ + Sbjct: 43 REANTTLFCASDAKAYETEVHNVWATHACVPTDPSPQEVVLENVTENFNMWKNNMVEQMH 102 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 + V ++ N ++ S N T+ +N Sbjct: 103 EDIISLWDESLKPCVKLTPLCVTLNCTDITSAATNATANSNATSWTDGREEMRNCSFNIT 162 Query: 293 ESSHNTIGSTRL--KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + + K+ I TD +G + NT Q C + + I+ Sbjct: 163 TNVSDKKQKEYAIFYKYDIVSTDNGTTGYLTHCNTSIIRQACPKVSFQPIPIH 215 >gi|197692698|gb|ACH71059.1| envelope glycoprotein [Human immunodeficiency virus 1] Length = 516 Score = 36.8 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 22/173 (12%), Positives = 50/173 (28%), Gaps = 3/173 (1%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKN 233 +N L K++ ++ +V++E+ N + + E+ + Sbjct: 43 REANTTLFCASDAKAYETEVHNVWATHACVPTDPSPQEVVLENVTENFNMWKNNMVEQMH 102 Query: 234 LSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN-TNTYPAMHHAYRELYNEK 292 + V ++ N ++ S N T+ +N Sbjct: 103 EDIISLWDESLKPCVKLTPLCVTLNCTDITSAATNATANSNATSWTDGREEMRNCSFNIT 162 Query: 293 ESSHNTIGSTRL--KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY 343 + + K+ I TD + + NT Q C + + I+ Sbjct: 163 TNVSDKKQKEYAIFYKYDIVSTDNGTTSYLTHCNTSIIRQACPKVSFQPIPIH 215 >gi|330823813|ref|YP_004387116.1| PilC domain-containing protein [Alicycliphilus denitrificans K601] gi|329309185|gb|AEB83600.1| Neisseria PilC domain protein [Alicycliphilus denitrificans K601] Length = 1245 Score = 36.8 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 12/44 (27%), Positives = 19/44 (43%) Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 P DL + G+F+AV +L ++F I +I Q Sbjct: 424 GEPVRSLDLWHAALNGRGRFYAVEKGEDLAKAFRDIFQQINTQV 467 >gi|308510204|ref|XP_003117285.1| hypothetical protein CRE_01843 [Caenorhabditis remanei] gi|308242199|gb|EFO86151.1| hypothetical protein CRE_01843 [Caenorhabditis remanei] Length = 409 Score = 36.8 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 26/131 (19%), Positives = 50/131 (38%), Gaps = 9/131 (6%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHHAYRELYNEKE 293 + R+G ++YN + +++ + + + LN T+ + E Sbjct: 96 TTRVGLVSYNADAKILAGLDTYQSYDDLANGVFDSLNSVSATDESY-LAKGLSAAEKVFE 154 Query: 294 SSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEG 353 +T T+ KK VI + LN + + + M+ AG+KI +VA S + Sbjct: 155 EGKSTANRTQYKKVVIVYA-----SSYKGTGELNPVPVADRMKTAGVKIITVAFSQNNDD 209 Query: 354 QDL--LRKCTD 362 L L + Sbjct: 210 GLLKDLSEIAS 220 >gi|134299971|ref|YP_001113467.1| Ig domain-containing protein [Desulfotomaculum reducens MI-1] gi|134052671|gb|ABO50642.1| Ig domain protein, group 2 domain protein [Desulfotomaculum reducens MI-1] Length = 472 Score = 36.8 bits (83), Expect = 6.3, Method: Composition-based stats. Identities = 27/255 (10%), Positives = 64/255 (25%), Gaps = 5/255 (1%) Query: 65 KKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLI 124 +K Q S + G I N + + + Sbjct: 204 ANILKDDTLQLSVKLTDNGVAVDSPSITYVSSDANICNINNNGLITGIAEGKAIITAQMT 263 Query: 125 PSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPP 184 ++ + + + L++ +++ Y N N Sbjct: 264 GKPDVKDTISINVKVNHNYVLTITNESATLNIGDNLQLKYTLTDNGVIVDNPNIIFTSSD 323 Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 + + + + + ++ E L +Q+ V G Sbjct: 324 NNIATVNSSGLITGISIGTVTITAQMADRPEINDALQVIVQEVPISHNYSIVITGGATIK 383 Query: 245 IGIVGNQCTPLSNNLNEVKSR-----LNKLNPYENTNTYPAMHHAYRELYNEKESSHNTI 299 + + +N EV + + NP TN Y + + K +S ++ Sbjct: 384 LNQTQSYTATFYDNGVEVADQSGTWTIKSPNPDGTTNIYATIQSQTGNSVSIKATSTSSY 443 Query: 300 GSTRLKKFVIFITDG 314 + L+ +D Sbjct: 444 VNKYLELVCTLNSDN 458 >gi|319763923|ref|YP_004127860.1| fg-gap repeat-containing protein [Alicycliphilus denitrificans BC] gi|317118484|gb|ADV00973.1| FG-GAP repeat-containing protein [Alicycliphilus denitrificans BC] Length = 1245 Score = 36.8 bits (83), Expect = 6.4, Method: Composition-based stats. Identities = 12/44 (27%), Positives = 19/44 (43%) Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQS 391 P DL + G+F+AV +L ++F I +I Q Sbjct: 424 GEPVRSLDLWHAALNGRGRFYAVEKGEDLAKAFRDIFQQINTQV 467 >gi|296111730|ref|YP_003622112.1| hypothetical protein LKI_08020 [Leuconostoc kimchii IMSNU 11154] gi|295833262|gb|ADG41143.1| hypothetical protein LKI_08020 [Leuconostoc kimchii IMSNU 11154] Length = 894 Score = 36.8 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 25/277 (9%), Positives = 66/277 (23%), Gaps = 23/277 (8%) Query: 136 TGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTT 195 I I +V+D+S SM ++ + K Sbjct: 130 NAKGNEVKNIKPIDIVLVVDMSGSMNSSVNGGNDRVGAARQGVKNFLKTINDAGIGKYVN 189 Query: 196 KSKYAPAPAPANRKIDVLIESAGN-------------LVNSIQKAIQEKKNLSVRIGTIA 242 + L E+ L N + + + +A Sbjct: 190 VGVVGFSSPGYISSSGTLSENIDASDNQAHITRINNLLANDFKGGTFTQLGIRTGQSMLA 249 Query: 243 YNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGST 302 + L++ + ++ T+ A E + + S Sbjct: 250 GDSNDHKKMMILLTDGVPTFSYKVTGATTINGTDYGTAFGTNRDEPRFTSQLWKASGNSR 309 Query: 303 RLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY---------SVAVSAPPEG 353 + + +++ + +L + G+++ S + Sbjct: 310 TPSSYTVSGKTIKDTWPATLGESLIAKSQSTELHALGIQLSKDVGYTNNNSYTYLTDSQV 369 Query: 354 QDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 +D ++ G + N + ++ + Q Sbjct: 370 RDRMKLLAS-PGLYQDANSASDVETYLKNQAKDVLSQ 405 >gi|145355751|ref|XP_001422114.1| predicted protein [Ostreococcus lucimarinus CCE9901] gi|144582354|gb|ABP00431.1| predicted protein [Ostreococcus lucimarinus CCE9901] Length = 1394 Score = 36.8 bits (83), Expect = 6.5, Method: Composition-based stats. Identities = 9/90 (10%), Positives = 24/90 (26%), Gaps = 1/90 (1%) Query: 32 ALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQI 91 A+DAA L+ + T + ++ S + K+ + + + I Sbjct: 888 AVDAATLAAVDDATAAITASAVESAQEVVSAAATQAAMKYAESAKRNGTSFV-SHASSVI 946 Query: 92 NITKDKNNPLQYIAESKAQYEIPTENLFLK 121 + + ++ Sbjct: 947 EEAWTPEFERAVERSAMSVTRDVISSVAST 976 >gi|154244778|ref|YP_001415736.1| acriflavin resistance protein [Xanthobacter autotrophicus Py2] gi|154158863|gb|ABS66079.1| acriflavin resistance protein [Xanthobacter autotrophicus Py2] Length = 1031 Score = 36.8 bits (83), Expect = 6.6, Method: Composition-based stats. Identities = 15/131 (11%), Positives = 39/131 (29%), Gaps = 4/131 (3%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + A+ I + + + L HI R + + + A L ++ + D Sbjct: 364 IVALAIPLTLAIVFALMQLVHIDMQRISLGALIIALALMVDDAMTTTDATLTRLAAGDDK 423 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 T K + + + + ++ +Y A I ++ Sbjct: 424 VTAASYAFKAYAAAMLAGT----LVTVAGFVPVGFAASSAGEYTFTLFAVVAIALLVSWI 479 Query: 121 KGLIPSALTNL 131 ++ + L + Sbjct: 480 VAVLFAPLMQV 490 >gi|154489100|ref|ZP_02029949.1| hypothetical protein BIFADO_02412 [Bifidobacterium adolescentis L2-32] gi|154083237|gb|EDN82282.1| hypothetical protein BIFADO_02412 [Bifidobacterium adolescentis L2-32] Length = 560 Score = 36.8 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 21/162 (12%), Positives = 48/162 (29%), Gaps = 18/162 (11%) Query: 221 VNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA 280 + + + E + V + + + N + Y A Sbjct: 412 TDQARASHIEPGDDDVNVFI--PFNSSAKVAQVAQGKQTATLLAASENQVANGNADIYNA 469 Query: 281 MHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGM 340 + A + L ++++ + +TDG++ A + G+ Sbjct: 470 LEVALKNLPSDRDDYTVA---------IALLTDGQSDTAKLDEFKQQYAS-----DGKGV 515 Query: 341 KIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 I+S+ + Q L S+G+ F + L F + Sbjct: 516 PIFSIMFG-DADSQQLDDLAKLSNGKVFDGRN-GNLSGIFRE 555 >gi|15594581|ref|NP_212370.1| hypothetical protein BB0236 [Borrelia burgdorferi B31] gi|2688131|gb|AAC66624.1| predicted coding region BB0236 [Borrelia burgdorferi B31] Length = 668 Score = 36.8 bits (83), Expect = 6.7, Method: Composition-based stats. Identities = 23/202 (11%), Positives = 68/202 (33%), Gaps = 12/202 (5%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 ++++ N + A N + S ++ Q + ++ Sbjct: 436 SNENYYIVNPKVAYNVNASKDINIAVVFDKSSYMKKYDTDQIVGLNALMELSKNKNFSFI 495 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 P+ +N+ + + + + +T +L + + + S Sbjct: 496 N----ATSVPIIDNIESLTNSIRNTSSLGPYSTDAVKTDVSLKL------AGSGLMSKSS 545 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 ++ V++ + G + + + +L+T I Y +N ++ Y + P L ++ Sbjct: 546 RRAVVYFSGGILNRKAFEKYSLDT--IVSYYKNNDIRFYLILFGNDPINSKLQYLVNETG 603 Query: 365 GQFFAVNDSRELLESFDKITDK 386 G + + + +D I ++ Sbjct: 604 GAVIPFSSYEGVSKVYDLILEQ 625 >gi|307720604|ref|YP_003891744.1| von Willebrand factor A [Sulfurimonas autotrophica DSM 16294] gi|306978697|gb|ADN08732.1| von Willebrand factor type A [Sulfurimonas autotrophica DSM 16294] Length = 631 Score = 36.8 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 16/108 (14%), Positives = 35/108 (32%), Gaps = 16/108 (14%) Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 +PLS + V L KL+ T + + + KK+++ + Sbjct: 139 VSPLSFDHEAVAFLLKKLDTNSITEQGTDLMSMLQVVD--------KSIKKDSKKYLLIL 190 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRK 359 +DG + N + + + + ++ + V P L Sbjct: 191 SDG--------GDKKNFSKEITFAKEKDIAVFVLGVGTPQGAPIKLED 230 >gi|328722084|ref|XP_001948722.2| PREDICTED: hypothetical protein LOC100159668 [Acyrthosiphon pisum] Length = 1500 Score = 36.8 bits (83), Expect = 6.8, Method: Composition-based stats. Identities = 18/253 (7%), Positives = 45/253 (17%), Gaps = 19/253 (7%) Query: 30 QSALDAAV-----LSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGD 84 Q A AA + + TT + + ++ Sbjct: 515 QQATSAAANQQTTSAAANQQTTSAAANQQTTSASANQQTTSAAANQQTTSAAANQQTTSA 574 Query: 85 IAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLK-----GLIPSALTNLSLRSTGII 139 A + + ++ + + S Sbjct: 575 AANQQTTSAAANQQTTSAAANQQTTSAAANQQTTSSGANQQTTSAGFNQQTTGAGSNQQT 634 Query: 140 ERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKY 199 + N + + S + +N T+ + Sbjct: 635 TGAGSNQQTTGFGSNQQTTSEGSNQQTTNVGSNQQTTGFGSNQQTTGGGSNQQINNSGNK 694 Query: 200 APAPAP---------ANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGN 250 +P K + + N + + T + Sbjct: 695 PTSPMSSTLKPTTPIYTTKKPPVSTTPKPTTNKPTTSKPTTLKPTTPKPTTSKPTTQQPT 754 Query: 251 QCTPLSNNLNEVK 263 P ++ K Sbjct: 755 TPKPTTSQPTTQK 767 >gi|315608293|ref|ZP_07883283.1| aerotolerance protein BatB [Prevotella buccae ATCC 33574] gi|315250074|gb|EFU30073.1| aerotolerance protein BatB [Prevotella buccae ATCC 33574] Length = 342 Score = 36.8 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 27/199 (13%), Positives = 64/199 (32%), Gaps = 56/199 (28%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 ++D +LV+ + +IG I + P++++ K L Sbjct: 110 SRLDKSKLLVESLVDRFT---------NDKIGLIVFAGDAYVQ--LPITSDYVSAKMFLQ 158 Query: 268 KLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQ 323 ++P + T+ A++ ++ + +I ITDGE+ A + Sbjct: 159 NIDPSLIQTQGTDIAQAINLGLHSFTQA----------DKIGRAIIVITDGEDHEGGAVE 208 Query: 324 NTLNTLQICEYMRNAGMKIYSVAVSA---------------PPEGQDLL--------RKC 360 R G+ ++ + V GQ ++ R+ Sbjct: 209 AAAEA-------RKKGVNVFILGVGDTKGAPIPTGDGGYMKDRSGQTVMTALNEQMCREV 261 Query: 361 TDSS-GQFFAVNDSRELLE 378 + G++ V+++ + Sbjct: 262 AQAGSGKYIHVDNTGDAQT 280 >gi|198421587|ref|XP_002123462.1| PREDICTED: similar to integrin alpha Hr1 [Ciona intestinalis] Length = 417 Score = 36.8 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 44/370 (11%), Positives = 109/370 (29%), Gaps = 36/370 (9%) Query: 41 CASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNP 100 ++ ++ I + L+ + + + A ++ N+ Sbjct: 32 SRINLTGYAQTSSASEAYAAYNIIVGEENSTLRLIAAAPKFKNGTSSGAILDCVSVLNST 91 Query: 101 LQYIAESKAQYEIPTEN------LFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVL 154 ++ E+ Q + +LT+ + + + C ++ Sbjct: 92 TDWVTETNCQSLSSRHFPLTANQNSFNDALGLSLTHNQISTKPNLYTVCAPGRQKQCGLV 151 Query: 155 DVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLI 214 D+ + +++ + + S + + + +VL Sbjct: 152 DLFNPGACY----DVNASSVMLRNWGEFQCFQNFLDVVFVIDSSNSISDSDFTIVKNVLN 207 Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSN-----NLNEVKSRLNKL 269 + A + ++ ++Q L + + + K+ +N + Sbjct: 208 DIANSFAATLGDSVQIGILLYGNADNVNIAYDASTTIYYTPTKLGECMTVACFKTAINSI 267 Query: 270 NP--YENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 NT T A+ A + + E + K ++ ITDG S + Q + Sbjct: 268 THLKAANTYTALAIERAVKVEFAES------KNKDKAVKILVLITDGSASDSYLLQRSFG 321 Query: 328 TLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSG-----QFFAVNDSRELLESFDK 382 + +YSV V A +L + + G + + ++ EL Sbjct: 322 ISST------NNVTVYSVGVGANASVAEL--SISANGGVDTTTRVLSTSNYVELPAVILN 373 Query: 383 ITDKIQEQSV 392 +TD I S+ Sbjct: 374 LTDAIFSSSL 383 >gi|115345667|ref|YP_771848.1| von Willebrand factor type A domain-containing protein [Roseobacter denitrificans OCh 114] gi|115292988|gb|ABI93440.1| von Willebrand factor type A domain protein, putative [Roseobacter denitrificans OCh 114] Length = 349 Score = 36.8 bits (83), Expect = 6.9, Method: Composition-based stats. Identities = 29/176 (16%), Positives = 59/176 (33%), Gaps = 20/176 (11%) Query: 205 PANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKS 264 + ++A LVNS+ A + + + G ++N+ + + Sbjct: 90 NGKSRQQWQRDAAIALVNSLPAATTSVS----IVEFDSNANVVTGLTSLTPASNIPAIIA 145 Query: 265 RLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 +N +N TN + A EL ++ K ++ I+DG+ + Sbjct: 146 AINGVNASGGTNIASGIAAAAGELTGANATTGR-------SKQMVVISDGDPTAGDQNAA 198 Query: 325 TLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFA-VNDSRELLES 379 L + I+SVA+ P + D+ F+ +S +L Sbjct: 199 ALAAVAAGVN------NIHSVAI--PGADVAKMEDIADNGNGVFSNFTNSADLANI 246 >gi|86359026|ref|YP_470918.1| two-component sensor histidine kinase protein [Rhizobium etli CFN 42] gi|86283128|gb|ABC92191.1| probable two-component sensor histidine kinase protein [Rhizobium etli CFN 42] Length = 655 Score = 36.8 bits (83), Expect = 7.0, Method: Composition-based stats. Identities = 10/108 (9%), Positives = 28/108 (25%), Gaps = 9/108 (8%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDA-------AVLSGCASIVSDRTIKDP 53 + A+++ +F + + R +Q A AV + + + Sbjct: 54 IAALLLVPLVIFAFFT--YGSAIATRAYLQEASAQAGTALRLAVSALSGHLNRYEALPAL 111 Query: 54 TTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPL 101 D + L+ + + + K+ + Sbjct: 112 IADHDDIKELVSAPDDMALRDAANLYLKEINGLLKSSDIYVVKPDGET 159 >gi|167383356|ref|XP_001736503.1| copine [Entamoeba dispar SAW760] gi|165901091|gb|EDR27254.1| copine, putative [Entamoeba dispar SAW760] Length = 244 Score = 36.4 bits (82), Expect = 7.0, Method: Composition-based stats. Identities = 16/95 (16%), Positives = 37/95 (38%), Gaps = 8/95 (8%) Query: 286 RELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSV 345 + E + T+ K VI + DG+ + + T++ + +G+++ + Sbjct: 137 TNISPLIEKAIEKKNQTQNKTVVILLCDGQVT------DNKTTIKTIKKASGSGIEVICI 190 Query: 346 AVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + P + + T++S F + L E F Sbjct: 191 GIGDGPF-ETFRNEITENSQNFHFIE-YSTLSEMF 223 >gi|91783018|ref|YP_558224.1| hypothetical protein Bxe_A2791 [Burkholderia xenovorans LB400] gi|91686972|gb|ABE30172.1| Predicted membrane protein [Burkholderia xenovorans LB400] Length = 595 Score = 36.4 bits (82), Expect = 7.1, Method: Composition-based stats. Identities = 26/170 (15%), Positives = 50/170 (29%), Gaps = 10/170 (5%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 + AI +SV + A+D+ ++ + R Q+Q D A ++ I S TT Q Sbjct: 35 LAAIWLSVAIAALG-ALDVGNVYFARRQLQRTADLAAMAAVQLIGSTGGCATATTAAQQN 93 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKA-------QYEI 113 + D + + + N +Q + Sbjct: 94 AAANG--FTAGSTTTISTTCGRWDTSSNTYFGTSGNPLNAVQVKTTQVVPYFFVGPSRNV 151 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDL 163 I + + S + + N + + VS S+ D Sbjct: 152 SATATAFASNIDAFSLGTGIASINTQQSALLNAILGGLLNTSVSLSVGDT 201 >gi|116249089|ref|YP_764930.1| hypothetical protein pRL120423 [Rhizobium leguminosarum bv. viciae 3841] gi|115253739|emb|CAK12132.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae 3841] Length = 194 Score = 36.4 bits (82), Expect = 7.2, Method: Composition-based stats. Identities = 14/142 (9%), Positives = 36/142 (25%), Gaps = 16/142 (11%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSAL----------------DAAVLSGCASIV 45 A+++ + + + +DL H + + ++ D A L AS + Sbjct: 32 FALVLPILVMLLFGTVDLGHALTVSRKIDEIASSTGDMISQQGSWTKSDVAKLLSGASFI 91 Query: 46 SDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIA 105 T I K + + ++ +++ +Q + Sbjct: 92 LQPYDTTGLTITLAVDDIAKSGSATVNWSAALNTSALTSGSASTIEVPSEIQDDGVQVVL 151 Query: 106 ESKAQYEIPTENLFLKGLIPSA 127 + F Sbjct: 152 TRVQYTLTTPVSAFFSNFTGQN 173 >gi|312139253|ref|YP_004006589.1| tade-like protein [Rhodococcus equi 103S] gi|311888592|emb|CBH47904.1| putative TadE-like protein [Rhodococcus equi 103S] Length = 133 Score = 36.4 bits (82), Expect = 7.3, Method: Composition-based stats. Identities = 10/102 (9%), Positives = 31/102 (30%), Gaps = 4/102 (3%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+++ + + ++ + Q+A+ AA G ++ + Sbjct: 15 FALVVPILITLVLGIVEFGRGYNV----QNAVSAAAREGARTMAIKKDPAAARAAVKGAG 70 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY 103 + + + + + +T + PL+Y Sbjct: 71 VFSPAITDAEICISTSGTQGCSATSCPSGSTVTLTVSYPLEY 112 >gi|103486591|ref|YP_616152.1| hypothetical protein Sala_1102 [Sphingopyxis alaskensis RB2256] gi|98976668|gb|ABF52819.1| conserved hypothetical protein [Sphingopyxis alaskensis RB2256] Length = 542 Score = 36.4 bits (82), Expect = 7.3, Method: Composition-based stats. Identities = 7/33 (21%), Positives = 14/33 (42%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALD 34 AI + + A+D+ + R ++Q D Sbjct: 22 AAIGMPMLIGAAALAVDVGSLYLDRRKLQGIAD 54 >gi|304310105|ref|YP_003809703.1| Putative type IV fimbrial biogenesis protein PilY1 [gamma proteobacterium HdN1] gi|301795838|emb|CBL44037.1| Putative type IV fimbrial biogenesis protein PilY1 [gamma proteobacterium HdN1] Length = 1368 Score = 36.4 bits (82), Expect = 7.4, Method: Composition-based stats. Identities = 9/45 (20%), Positives = 21/45 (46%) Query: 348 SAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQSV 392 + DL +S G+FF+ ++++F +I +I ++ Sbjct: 624 DSDNNVYDLWHAAINSRGEFFSAESPDAIVQAFREIVSRISNRTT 668 >gi|293364194|ref|ZP_06610920.1| conserved hypothetical protein [Streptococcus oralis ATCC 35037] gi|291317040|gb|EFE57467.1| conserved hypothetical protein [Streptococcus oralis ATCC 35037] Length = 2881 Score = 36.4 bits (82), Expect = 7.4, Method: Composition-based stats. Identities = 17/155 (10%), Positives = 43/155 (27%), Gaps = 10/155 (6%) Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA-----PANRKIDVLIESAGNLVNS 223 + + + + + + +Y D ++ + + Sbjct: 77 LQDASGSFKNTIPNVQNALKTLTTFVKEEEYDENNPRLVKTDNPDTSDRVMMATFQGADG 136 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + + LS++ N++ +N + T T PA+ Sbjct: 137 YNYYYNNDFTGRKDVYGSWGPDYDYKYKASNLSSDQNDIHQFINNIKVAGGTPTVPAIDD 196 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + K + N + + ITDG +G Sbjct: 197 VIAQYNERKGNMANGRKT-----IFLLITDGVANG 226 >gi|253996156|ref|YP_003048220.1| von Willebrand factor type A [Methylotenera mobilis JLW8] gi|253982835|gb|ACT47693.1| von Willebrand factor type A [Methylotenera mobilis JLW8] Length = 2114 Score = 36.4 bits (82), Expect = 7.6, Method: Composition-based stats. Identities = 30/349 (8%), Positives = 78/349 (22%), Gaps = 33/349 (9%) Query: 39 SGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKN 98 + S + T+ +T + + S + + Sbjct: 1223 AVADSATVYESGLSTGTQAGVLATTASGNLLVNDAGVSTTTTISSINGVTPTSGTITVTS 1282 Query: 99 NPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSR 158 + + T N A ++D + Sbjct: 1283 ATGTLVVNANTGAYTYTLNAATTE--GVNDKPTFNYVLTDSVTGQSTNANLTVNIVDDAP 1340 Query: 159 SMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAG 218 + Q + + ++ S + ++++ E+ Sbjct: 1341 -VGGNITQTLQAASAALTYNVVIVLDRSGSMAQDANGLWSNQSGYDASTNRMEIAKEAIA 1399 Query: 219 NLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTY 278 L+ +V + + ++ V ++ +N+ ++ + T Sbjct: 1400 QLIARYD------GLGNVNVKFVTFSSDAVESEWYI--DNVTGAVRYVDNVQAGGGTQYS 1451 Query: 279 PAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNA 338 A++ FITDG+ + TL T N Sbjct: 1452 TALNETMSGFTQPVADKT----------LFYFITDGQPNSGYEVDATLQTQWQNFVAANG 1501 Query: 339 GMKIYSVAVSAPPEGQDLLRKCT---------DSSGQFFAVNDSRELLE 378 + S + L + V++ +L + Sbjct: 1502 NI---SFGIGIGTASLSSLTPIAYPNVDADGNGTEDYAIRVDNPADLAD 1547 >gi|315611749|ref|ZP_07886671.1| LPXTG cell wall surface protein [Streptococcus sanguinis ATCC 49296] gi|315316164|gb|EFU64194.1| LPXTG cell wall surface protein [Streptococcus sanguinis ATCC 49296] Length = 2882 Score = 36.4 bits (82), Expect = 7.6, Method: Composition-based stats. Identities = 17/155 (10%), Positives = 43/155 (27%), Gaps = 10/155 (6%) Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA-----PANRKIDVLIESAGNLVNS 223 + + + + + + +Y D ++ + + Sbjct: 77 LQDASGSFKNTIPNVQNALKTLTTFVKEEEYDENNPRLVKTDNPDTSDRVMMATFQGADG 136 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + + LS++ N++ +N + T T PA+ Sbjct: 137 YNYYYNNDFTGRKDVYGSWGPDYDYKYKASNLSSDQNDIHQFINNIKVAGGTPTVPAIDD 196 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + K + N + + ITDG +G Sbjct: 197 VIAQYNERKGNMANGRKT-----IFLLITDGVANG 226 >gi|308509926|ref|XP_003117146.1| hypothetical protein CRE_01642 [Caenorhabditis remanei] gi|308242060|gb|EFO86012.1| hypothetical protein CRE_01642 [Caenorhabditis remanei] Length = 713 Score = 36.4 bits (82), Expect = 7.6, Method: Composition-based stats. Identities = 24/187 (12%), Positives = 55/187 (29%), Gaps = 8/187 (4%) Query: 174 MTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNS-IQKAIQEKK 232 K N ++D + ++ + ++ Sbjct: 13 AFVACTSSTAYRKCGTDINNLWLDIVLVIDNTRIMQMDGVYDTIQAMFGQSVRIGAGHPD 72 Query: 233 NLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPA-MHHAYRELYNE 291 S R+ + YN +L + L KLN + ++++ A M N Sbjct: 73 PRSTRVAIVTYNEVAKVEAGFDEFKSLTALNQELEKLNATQKSDSFDAFMDLGLSAANNL 132 Query: 292 KESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPP 351 +++ + KK ++ T ++ + +R +G+ I +V A Sbjct: 133 ITAANRANDRKQYKKLIVLFT------SNYSFKNQRPDVLALSIRQSGIDISTVYTGAGT 186 Query: 352 EGQDLLR 358 G + Sbjct: 187 SGNTAFQ 193 >gi|256376655|ref|YP_003100315.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] gi|255920958|gb|ACU36469.1| von Willebrand factor type A [Actinosynnema mirum DSM 43827] Length = 464 Score = 36.4 bits (82), Expect = 7.7, Method: Composition-based stats. Identities = 26/141 (18%), Positives = 41/141 (29%), Gaps = 33/141 (23%) Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L T + A + ++ + I +TDG N A + + Sbjct: 115 LVAGGGTAMGAWLELASDLFDDHPDAVRHA----------ILLTDGMNGEPRARLDAVLA 164 Query: 329 LQI----CEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKI 383 C+ G+ D LR + G AV D EL F ++ Sbjct: 165 RCAGRFTCDA---RGI--------GDGWAPDELRHVAEVLHGDVDAVRDPGELPADFRRL 213 Query: 384 TDKI-------QEQSVRIAPN 397 D VR+AP+ Sbjct: 214 MDAAMGKVVADLGLRVRLAPH 234 >gi|308472959|ref|XP_003098706.1| hypothetical protein CRE_04221 [Caenorhabditis remanei] gi|308268306|gb|EFP12259.1| hypothetical protein CRE_04221 [Caenorhabditis remanei] Length = 399 Score = 36.4 bits (82), Expect = 8.0, Method: Composition-based stats. Identities = 25/141 (17%), Positives = 53/141 (37%), Gaps = 16/141 (11%) Query: 235 SVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYP----AMHHAYRELYN 290 + R+G I YN + + + + +N + +T+ + AY L Sbjct: 88 TTRVGLITYNAEATQIADLNVLQSFFNLTNHVNSSLAEVSNSTWSFDKVGLKAAYDLL-- 145 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAP 350 + +S S K ++F +D + + L+ + +++AG+KI +V Sbjct: 146 QNQSFPPNSRSHYQKVVILFASDSQAQNSE----ELDPYPMDYQLKDAGVKIVTVGYG-- 199 Query: 351 PEGQDLLRKCT--DSSGQFFA 369 + LL + + S F Sbjct: 200 --NETLLERLSNISSPEYAFD 218 >gi|307702450|ref|ZP_07639406.1| fmtB protiein [Streptococcus oralis ATCC 35037] gi|307624032|gb|EFO03013.1| fmtB protiein [Streptococcus oralis ATCC 35037] Length = 2858 Score = 36.4 bits (82), Expect = 8.0, Method: Composition-based stats. Identities = 17/155 (10%), Positives = 43/155 (27%), Gaps = 10/155 (6%) Query: 169 NDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPA-----PANRKIDVLIESAGNLVNS 223 + + + + + + +Y D ++ + + Sbjct: 54 LQDASGSFKNTIPNVQNALKTLTTFVKEEEYDENNPRLVKTDNPDTSDRVMMATFQGADG 113 Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHH 283 + + + LS++ N++ +N + T T PA+ Sbjct: 114 YNYYYNNDFTGRKDVYGSWGPDYDYKYKASNLSSDQNDIHQFINNIKVAGGTPTVPAIDD 173 Query: 284 AYRELYNEKESSHNTIGSTRLKKFVIFITDGENSG 318 + K + N + + ITDG +G Sbjct: 174 VIAQYNERKGNMANGRKT-----IFLLITDGVANG 203 >gi|224533770|ref|ZP_03674358.1| tetratricopeptide repeat domain protein [Borrelia burgdorferi CA-11.2a] gi|224513063|gb|EEF83426.1| tetratricopeptide repeat domain protein [Borrelia burgdorferi CA-11.2a] Length = 668 Score = 36.4 bits (82), Expect = 8.0, Method: Composition-based stats. Identities = 23/202 (11%), Positives = 68/202 (33%), Gaps = 12/202 (5%) Query: 185 PKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYN 244 ++++ N + A N + S ++ Q + ++ Sbjct: 436 SNENYYIVNPKVAYNVNASKDINIAVVFDKSSYMKKYDTDQIVGLNSLIELSKNKNFSFI 495 Query: 245 IGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 P+ +N+ + + + + +T +L + + + S Sbjct: 496 N----ATSVPIIDNIESLTNSIRNTSSLGPYSTDAVKTDVSLKL------AGSGLMSKSS 545 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSS 364 ++ V++ + G + + + +L+T I Y +N ++ Y + P L ++ Sbjct: 546 RRAVVYFSGGILNRKAFEKYSLDT--IVSYYKNNDIRFYLILFGNDPINSKLQYLVNETG 603 Query: 365 GQFFAVNDSRELLESFDKITDK 386 G + + + +D I ++ Sbjct: 604 GAVIPFSSYEGVSKVYDLILEQ 625 >gi|159044607|ref|YP_001533401.1| hypothetical protein Dshi_2063 [Dinoroseobacter shibae DFL 12] gi|157912367|gb|ABV93800.1| conserved hypothetical protein [Dinoroseobacter shibae DFL 12] Length = 251 Score = 36.4 bits (82), Expect = 8.0, Method: Composition-based stats. Identities = 23/188 (12%), Positives = 51/188 (27%), Gaps = 32/188 (17%) Query: 209 KIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNK 268 +I + ++ + + A + + P + + + +++ Sbjct: 62 RIFAARRAIRQVMPQVAPVRNLGLLVYGPGPREACDNIDLRFSPIP--DAAPRMIAEIDR 119 Query: 269 LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNT 328 L P NT ++ A L V+ +TDG+ + A Sbjct: 120 LMPSGNTPLTASVARAAEALDYRTRP-----------GVVVLVTDGKETCGGAPCQLAAE 168 Query: 329 LQICEYMRNAGMKIYSVAVSAPPE--------------GQDLLRKCTD-SSGQFFAVNDS 373 L ++ + E GQ + R D + G + Sbjct: 169 LAADAPAL----TVHVIGFKLRGEHFSWDSENQHDYRQGQTVARCLADQTGGLYLTTETV 224 Query: 374 RELLESFD 381 EL+ + Sbjct: 225 DELVAALR 232 >gi|86139958|ref|ZP_01058523.1| secreted hemolysin-type calcium-binding bacteriocin, putative [Roseobacter sp. MED193] gi|85823376|gb|EAQ43586.1| secreted hemolysin-type calcium-binding bacteriocin, putative [Roseobacter sp. MED193] Length = 3377 Score = 36.4 bits (82), Expect = 8.0, Method: Composition-based stats. Identities = 22/157 (14%), Positives = 49/157 (31%), Gaps = 1/157 (0%) Query: 23 MYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENA 82 R Q+QSALD A A+ + + DQT T + + S ++ + Sbjct: 171 ELQRQQLQSALDTASAELAAAQNNVGAMTSDKADADQTVTDAQTTLDAAAATMSLLQSSG 230 Query: 83 GDIAQKAQINITKDKNNPLQYIAESK-AQYEIPTENLFLKGLIPSALTNLSLRSTGIIER 141 +A +++ + Q + + + T + + A Sbjct: 231 QVGDAQAALSLAQTALIQSQNALNAAQGELDTATTSAASMLSMRDAKQTEVTNLQNQKTT 290 Query: 142 SSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 + + LA + ++ D ++ T+ Sbjct: 291 ADQELATTEALLSDAQDALLLETNGADAVTVAQTAVD 327 >gi|302551500|ref|ZP_07303842.1| von Willebrand factor type A domain-containing protein [Streptomyces viridochromogenes DSM 40736] gi|302469118|gb|EFL32211.1| von Willebrand factor type A domain-containing protein [Streptomyces viridochromogenes DSM 40736] Length = 448 Score = 36.4 bits (82), Expect = 8.1, Method: Composition-based stats. Identities = 25/185 (13%), Positives = 52/185 (28%), Gaps = 27/185 (14%) Query: 208 RKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLN 267 K+ ++ +++++ + + Y G + + K L Sbjct: 78 TKMRNARDATAAAIDTLRDGVHFAVIGGTHVAKEVYPGGGSLAVADATT--RAQAKQALR 135 Query: 268 KLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLN 327 L+ T + A R L + + I +TDG + + Q+ Sbjct: 136 SLSAGGGTAIGTWLKLADRLLSSA----------DVAIRHGILLTDGR-NEHESPQDLKA 184 Query: 328 TLQICE-----YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDK 382 L C R V E +++ + G V D L F + Sbjct: 185 ALDACAGRFTCDAR---------GVGTDWEVKEVTGIASALLGTADIVADPAGLAADFTQ 235 Query: 383 ITDKI 387 + + Sbjct: 236 MMETA 240 >gi|193734230|gb|ACF19883.1| PilC1 [Kingella kingae] Length = 1362 Score = 36.4 bits (82), Expect = 8.1, Method: Composition-based stats. Identities = 10/63 (15%), Positives = 24/63 (38%), Gaps = 10/63 (15%) Query: 338 AGMKIYSVAVSAP--PEGQDLLR----KCTDSSGQ----FFAVNDSRELLESFDKITDKI 387 + +++ G LLR G +++ +D L ++FD I ++ Sbjct: 407 QNITTFTIGFGNGLSESGLALLRGGASNIKLKDGTVQKAYYSASDQTALQKAFDAILAQV 466 Query: 388 QEQ 390 + + Sbjct: 467 EAE 469 >gi|332828717|gb|EGK01409.1| hypothetical protein HMPREF9455_02242 [Dysgonomonas gadei ATCC BAA-286] Length = 340 Score = 36.4 bits (82), Expect = 8.2, Method: Composition-based stats. Identities = 20/168 (11%), Positives = 47/168 (27%), Gaps = 50/168 (29%) Query: 253 TPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFV 308 PL+ + K L ++P + T A+ + T + K + Sbjct: 144 LPLTPDNQSAKLFLETIDPSLVPVQGTAIGSAIDISM----------SCFSNDTDIDKAI 193 Query: 309 IFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE---------------- 352 + ITDGE A G+ + V + Sbjct: 194 VLITDGEGHEGDAEGAAARAAD-------QGVHVNVVGIGTTEGAMIPVSPGSNNMKRDT 246 Query: 353 ---------GQDLLRKCTDSS-GQFFAVNDSRELLESFDKITDKIQEQ 390 +++ R+ + G + ++S + + ++++ Sbjct: 247 QGQPVVTKLNEEMCRQIAKAGKGLYAHADNS---NSALKSLQSELEKL 291 >gi|269103394|ref|ZP_06156091.1| putative RTX toxin [Photobacterium damselae subsp. damselae CIP 102761] gi|268163292|gb|EEZ41788.1| putative RTX toxin [Photobacterium damselae subsp. damselae CIP 102761] Length = 3098 Score = 36.4 bits (82), Expect = 8.2, Method: Composition-based stats. Identities = 29/288 (10%), Positives = 68/288 (23%), Gaps = 12/288 (4%) Query: 33 LDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQIN 92 D ++ + +D + + T + Q T Q+ G + G + Sbjct: 1340 TDTITVTVNVTPANDAPVGEDVTAETQEETAVTGQLTATDVDGDNLTFKPGSDP--TNGS 1397 Query: 93 ITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICM 152 +T + + +Y+ E + G + +++ T + + + Sbjct: 1398 VTVNPDGSWEYVPNPDFNGEDSFTVVVDDGNGGTDTITVTVNVTPVNDAPVGEDVSAETQ 1457 Query: 153 VLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDV 212 E + + N P + + P P D Sbjct: 1458 -------EETAVTGQLTATDVDGDNLTFKPGSDPTNGSVTVNPDGSWEYVPNPNFNGEDS 1510 Query: 213 LIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPY 272 + + V + N+ + T ++ L + L Sbjct: 1511 FTVVVDDGNGGTDTITVTVNVIPVNDAPVGENVTTETQEETAVTGQLTATDVDGDNLTFK 1570 Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 TN + F + + DG + Sbjct: 1571 PGTNPENGQVTVNA---DGSWEYVPNTDFNGEDSFTVVVDDGNGGTDT 1615 >gi|218131125|ref|ZP_03459929.1| hypothetical protein BACEGG_02730 [Bacteroides eggerthii DSM 20697] gi|317476997|ref|ZP_07936239.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] gi|217986645|gb|EEC52979.1| hypothetical protein BACEGG_02730 [Bacteroides eggerthii DSM 20697] gi|316906790|gb|EFV28502.1| von Willebrand factor type A domain-containing protein [Bacteroides eggerthii 1_2_48FAA] Length = 342 Score = 36.4 bits (82), Expect = 8.4, Method: Composition-based stats. Identities = 23/168 (13%), Positives = 54/168 (32%), Gaps = 49/168 (29%) Query: 251 QCTPLSNNLNEVKSRLNKLNPY----ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKK 306 P++++ K L ++P + T A++ A R + + + Sbjct: 142 TQLPITSDYISAKMFLESIDPSLISKQGTAIGAAINLASRSFTPQ----------EGVGR 191 Query: 307 FVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPE-------------- 352 +I ITDGEN A + + G+++ + V P Sbjct: 192 AIIVITDGENHEGGAVEAAKAAAE-------KGIQVNVLGVGMPEGAPIPAEGTNDYRRD 244 Query: 353 ----------GQDLLRKCTDSSGQFF-AVNDSRELLESFDKITDKIQE 389 + + ++ S + V+++ + I+ +I + Sbjct: 245 RDGNVIVTRLNEQMCQEIAKSGNGIYVRVDNT---NGAQKAISQEINK 289 >gi|223934102|ref|ZP_03626049.1| von Willebrand factor type A [Streptococcus suis 89/1591] gi|164454849|dbj|BAF96969.1| serum opacity factor [Streptococcus suis] gi|223897228|gb|EEF63642.1| von Willebrand factor type A [Streptococcus suis 89/1591] Length = 564 Score = 36.4 bits (82), Expect = 8.4, Method: Composition-based stats. Identities = 44/330 (13%), Positives = 86/330 (26%), Gaps = 25/330 (7%) Query: 54 TTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEI 113 + K+ T K I D+ ++ + Sbjct: 246 SESKEVNKTNVKDTIDNLFSNAEQNYNWGVDMQGAIHEARRILESEKSGKRKHIVLLSQ- 304 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNN 173 E+ F L +A T ++ + N + DV+ + L + Sbjct: 305 -GESTFSYDLTDAAKTETKYKTINEDKVVHTNPLLPWPFTFDVT--VRRANLLDDAQSLI 361 Query: 174 MTSNKYLLPPPPKKSFWSKNTTK-----SKYAPAPAPANRKIDVLIESAGNLVNSIQKAI 228 +K + K + P DV G + K I Sbjct: 362 NFLDKIGITRFNKALDGVSTVAGLTKILGAFVKNPLDYIDLADVDSSKLGEKNFNYDKQI 421 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 E + + + +N +K+ L+K NT +Y L Sbjct: 422 GEGYHFRSFYERKLEPLTFRDQIVQKIKHN---LKTELDKAKLPTNTLSYIGKKLG---L 475 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVS 348 N E + + ++ + Y + L+ + G+ Y+V V+ Sbjct: 476 TNTSEKAIDLFANSLVDYIF------YGRKNVFYNHNLSAQAEAKIATEKGIHFYAVDVT 529 Query: 349 APPEG----QDLLRKCTDSSGQFFAVNDSR 374 E D L+K + G + + Sbjct: 530 KNDEQKNKFDDYLKKMSGDKGNVIKLTNLS 559 >gi|319918961|gb|ADV78143.1| PilY1 [Pseudomonas aeruginosa] Length = 1163 Score = 36.4 bits (82), Expect = 8.5, Method: Composition-based stats. Identities = 41/346 (11%), Positives = 87/346 (25%), Gaps = 31/346 (8%) Query: 52 DPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQY 111 + S+ L + +I TK N Y + Sbjct: 165 SASCPAQPVSSSNSCYTYNALPTSQESNFAIWYSYYRNRILATKTAANLAFYSLPENVRL 224 Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 N I + + + + + + S Sbjct: 225 TWGALNT---CSIGANSRSCQNNALLQFNKQHKINFFNWLANSPASGGTPLHAALDR--- 278 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + + + + S + +V + N + Sbjct: 279 -AGRFLQTNGTAYTTEDGKTYSCRASYHIMMTDGIWNGRNVTPGNLDNQNQTF------P 331 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHHAYRELYN 290 + R + +++ + N L P + + Sbjct: 332 DSTLYRPQPPYADSNASSLADLAFKYWTTDLRPSIDNDLKP-----------FMAYKSGD 380 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMR----NAGMKIY-SV 345 + + + + + ++ T G S N+ T + AG K++ SV Sbjct: 381 DSKDYWDPRNNPATWQHMVNFTVGLGLSYSLTLNSAPTWTGSTFGNYEELMAGSKVWPSV 440 Query: 346 AVSA-PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 A P DL +S G FF+ L+++F+KI +I E+ Sbjct: 441 GNDAAPGNVYDLWHAAINSRGDFFSAESPDSLVQAFNKILTRISER 486 >gi|55670411|pdb|1V7P|C Chain C, Structure Of Ems16-Alpha2-I Domain Complex Length = 200 Score = 36.4 bits (82), Expect = 8.6, Method: Composition-based stats. Identities = 29/187 (15%), Positives = 63/187 (33%), Gaps = 24/187 (12%) Query: 215 ESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYEN 274 ++ N + + + ++G I Y E+ ++ + Y Sbjct: 23 DAVKNFLEKFVQ-GLDIGPTKTQVGLIQYANNPRVVFNLNTYKTKEEMIVATSQTSQYGG 81 Query: 275 --TNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 TNT+ A+ +A + Y+ + K ++ +TDGE+ S + Sbjct: 82 DLTNTFGAIQYARKYAYSAASGGRRSAT-----KVMVVVTDGESHDGSM------LKAVI 130 Query: 333 EYMRNAGMKIYSVAV----SAPPEGQDLL----RKCTD--SSGQFFAVNDSRELLESFDK 382 + + + + +AV + L + + FF V+D LLE Sbjct: 131 DQCNHDNILRFGIAVLGYLNRNALDTKNLIKEIKAIASIPTERYFFNVSDEAALLEKAGT 190 Query: 383 ITDKIQE 389 + ++I Sbjct: 191 LGEQIFS 197 >gi|325673442|ref|ZP_08153133.1| TadE family protein [Rhodococcus equi ATCC 33707] gi|325555463|gb|EGD25134.1| TadE family protein [Rhodococcus equi ATCC 33707] Length = 133 Score = 36.4 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 10/102 (9%), Positives = 31/102 (30%), Gaps = 4/102 (3%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTS 61 A+++ + + ++ + Q+A+ AA G ++ + Sbjct: 15 FALVVPILITLVLGIVEFGRGYNV----QNAVSAAAREGARTMAIKKDPAAARAAVKGAG 70 Query: 62 TIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQY 103 + + + + + +T + PL+Y Sbjct: 71 VFSPAITDAEICISTSGSQGCSATSCPSGSTVTLTVSYPLEY 112 >gi|17536323|ref|NP_495355.1| hypothetical protein T19D12.3 [Caenorhabditis elegans] gi|1086826|gb|AAC24424.1| Hypothetical protein T19D12.3 [Caenorhabditis elegans] Length = 179 Score = 36.4 bits (82), Expect = 8.7, Method: Composition-based stats. Identities = 25/169 (14%), Positives = 51/169 (30%), Gaps = 19/169 (11%) Query: 224 IQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN--EVKSRLNKLNPYENTNTYPAM 281 + ++ I + G+ + ++S ++ L + + Sbjct: 28 YNSTNYDFSTVTETINIPYAHTDYYGSYLLQYGGAKSLMALQSNIDTLFSNSIFSIDTTV 87 Query: 282 HHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICEYMRNAGMK 341 L + NT VI + + + T ++R+ G K Sbjct: 88 SDGLIWLSDNYPDGANT--------IVIVVG------YHSDDDNGRTYTTLTHLRSRGYK 133 Query: 342 IYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 + SVA L D S +F V++S D+I+ I + Sbjct: 134 VISVAAGVGHGD---LSYIADESEWYFQVDNSTGGQNVADQISSIICKL 179 >gi|226307664|ref|YP_002767624.1| hypothetical protein RER_41770 [Rhodococcus erythropolis PR4] gi|226186781|dbj|BAH34885.1| conserved hypothetical protein [Rhodococcus erythropolis PR4] Length = 569 Score = 36.4 bits (82), Expect = 8.8, Method: Composition-based stats. Identities = 17/109 (15%), Positives = 33/109 (30%), Gaps = 9/109 (8%) Query: 273 ENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQIC 332 T Y A+R++ + VI +TDG N + + Sbjct: 460 GATGLYDTTLAAFRKVQESYDPRAV--------NSVIILTDGANEDPDSISREQLLDVLA 511 Query: 333 EYMR-NAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVNDSRELLESF 380 + I ++ ++ + L + G + D E+ E F Sbjct: 512 REQDPARPVIIVTIGITDDADAAALADISRVTGGSTYIARDPSEISEVF 560 >gi|331089189|ref|ZP_08338091.1| hypothetical protein HMPREF1025_01674 [Lachnospiraceae bacterium 3_1_46FAA] gi|330405741|gb|EGG85270.1| hypothetical protein HMPREF1025_01674 [Lachnospiraceae bacterium 3_1_46FAA] Length = 1561 Score = 36.4 bits (82), Expect = 8.9, Method: Composition-based stats. Identities = 29/336 (8%), Positives = 89/336 (26%), Gaps = 21/336 (6%) Query: 2 TAIIISVCFLFITY---AIDLAHIMYIRNQMQSALDA-----AVLSGCASIVSDRTIKDP 53 A++++ + I + +M + A G + S Sbjct: 7 AALVMAAILGVTGFSVPGIGVVQAEEAHTEMSGEAEQQEINLAADRGSTARASKFLAASG 66 Query: 54 TTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEI 113 + F + + + ++ D Q + + ++E K + Sbjct: 67 NLPAREPGLAFDGISDNNGEADNSRWQSGEDAEFSEQ--WLEVDLGGICVVSEIKVDFFA 124 Query: 114 PTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHND--- 170 F + S + + + E +++ +DV + + + ++ Sbjct: 125 RLYGDFRVEVSDSNAEDAVWTTIATED-MPEGTDLNLKKTVDVKENGKAREIPRYIRLYF 183 Query: 171 NNNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVN--SIQKAI 228 + + + ++S Y + + A+ Sbjct: 184 TSGNSQAANRSIGVREFQVIGTKKSESGYETITGNIALNKTASASGVEAAMPNLTANLAV 243 Query: 229 QEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYREL 288 +K+ + R G NQ ++++ + + T+ + + Sbjct: 244 DGQKSDTSRWSAPTMKNGTSPNQQQTPQWLEIDLRNEVTNI-----TSIDLYFYKLVYSI 298 Query: 289 YNEKESSHNTIGSTRLKKFVIFITDGENSGASAYQN 324 E ++ + + K V E + + + Sbjct: 299 DYEIQTRADKKSEWKTVKHVTCQLGNEQNKHDSITD 334 >gi|323137420|ref|ZP_08072498.1| TadE family protein [Methylocystis sp. ATCC 49242] gi|322397407|gb|EFX99930.1| TadE family protein [Methylocystis sp. ATCC 49242] Length = 208 Score = 36.4 bits (82), Expect = 8.9, Method: Composition-based stats. Identities = 8/47 (17%), Positives = 15/47 (31%) Query: 2 TAIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDR 48 A+I I + I + Q+Q A + A + + Sbjct: 36 FAMIAVPFLGLIGAIFETGTIYFRTAQLQMATETASRAVLTHSTAAG 82 >gi|307304370|ref|ZP_07584121.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] gi|306902572|gb|EFN33166.1| Protein of unknown function DUF2134, membrane [Sinorhizobium meliloti BL225C] Length = 431 Score = 36.4 bits (82), Expect = 8.9, Method: Composition-based stats. Identities = 16/162 (9%), Positives = 42/162 (25%), Gaps = 23/162 (14%) Query: 6 ISVCFLFITYAIDLAHIMYIRNQM--------------------QSALDAAV--LSGCAS 43 V + ++ + + ++ QSAL+AA ++G + Sbjct: 40 FPVLVGAMGLGVETGYWYLEKRKLQHAADVSAYAAAVRHRAGDQQSALEAAARRVAGGSG 99 Query: 44 IVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKA-QINITKDKNNPLQ 102 S ++ + + G I KA + + Sbjct: 100 FSPGGLTVSTAPGSAGGSNKVTVELTETHPRMFSSVFGTGTITMKARAVAQVTGGSKACV 159 Query: 103 YIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIERSSE 144 + A + L ++ + S + + ++ Sbjct: 160 LALSNSASGAVTVTGSTEVLLSGCSVVSNSNAADAFLMKNGS 201 >gi|76800731|ref|YP_325739.1| hypothetical protein NP0152A [Natronomonas pharaonis DSM 2160] gi|76556596|emb|CAI48167.1| conserved hypothetical protein [Natronomonas pharaonis DSM 2160] Length = 213 Score = 36.4 bits (82), Expect = 8.9, Method: Composition-based stats. Identities = 18/145 (12%), Positives = 39/145 (26%), Gaps = 1/145 (0%) Query: 3 AIIISVCFLFITYAIDLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTST 62 AI + V A+ +A + ++A D AV + A + + Sbjct: 12 AIALLVLTTVAVLAVTIADASVRGAE-RNAADRAVATATADRLVAADSAATVRGNVLNGS 70 Query: 63 IFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKG 122 + + L R A + + T + + + E+ F Sbjct: 71 LDADSVAASLPPDVDARVALDGEAVYERGDPTGGETARRIVLVADQQAVEVEPPFEFRSV 130 Query: 123 LIPSALTNLSLRSTGIIERSSENLA 147 +P +L + + Sbjct: 131 TLPRRSPRATLDIEPSADIETVRAN 155 >gi|330446188|ref|ZP_08309840.1| subtilase family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] gi|328490379|dbj|GAA04337.1| subtilase family protein [Photobacterium leiognathi subsp. mandapamensis svers.1.1.] Length = 1031 Score = 36.0 bits (81), Expect = 9.4, Method: Composition-based stats. Identities = 27/188 (14%), Positives = 59/188 (31%), Gaps = 15/188 (7%) Query: 199 YAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKK---NLSVRIGTIAYNIGIVGNQCT-- 253 +D + A L+ + + + Y Sbjct: 756 VVDVSGSYYDDLDTFKDQATELIEAFSSSGTNVNIGISSFSDFPLSPYGSSDDYAFKLDQ 815 Query: 254 PLSNNLNEVKSRLNK-LNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLK-KFVIFI 311 PL+NN + VKS L+ L N + Y+ + ++ + L Sbjct: 816 PLTNNYDLVKSALDNLLILSGNDYPESQLEAIYQTVQPATGWRSGSLPTLFLATDARFHN 875 Query: 312 TDGENSGASAYQNTLNTLQICEYMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQFFAVN 371 +D ++S + + T+ + ++ + + D+ + S G F ++ Sbjct: 876 SDMDDSYPGSGYDK--TISELKTLKAR-----VFGLQSGGVIDDVKKIAEQSGGAAFKLS 928 Query: 372 -DSRELLE 378 DS E++E Sbjct: 929 KDSAEIVE 936 >gi|226941831|ref|YP_002796905.1| VCBS [Laribacter hongkongensis HLHK9] gi|226716758|gb|ACO75896.1| VCBS [Laribacter hongkongensis HLHK9] Length = 1087 Score = 36.0 bits (81), Expect = 9.4, Method: Composition-based stats. Identities = 33/316 (10%), Positives = 87/316 (27%), Gaps = 43/316 (13%) Query: 66 KQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFLKGLIP 125 L + + A Y E KA + +KG Sbjct: 157 DGASNTLTGYTEANGVKASVMVVALTPPDTGSKGDWSYRVELKAPMDHSAPVAPVKGDEN 216 Query: 126 SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPP 185 + ++S++ + ++ L I++ + V + D N + + Sbjct: 217 TRAFDVSIKVSDGQTTTTAPLNITVEDDMPVVANTSQTVSLPSQDTNLLLTLDV------ 270 Query: 186 KKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNI 245 S A ++ + ++ L++ V++ + ++ Sbjct: 271 ----SGSMNDPSGVKDANGKDLSRLALAKQAISQLLDQYDAL------GDVKVQLVKFSE 320 Query: 246 GIVGNQCTPLSNNLNEVKSRLNKLNPY-ENTNTYPAMHHAYRELYNEKESSHNTIGSTRL 304 G ++ + E K+ L + TN A+ A + + Sbjct: 321 GGSVQSNNWMT--VAEAKAALAGITKGDGGTNYDEALDLARQAFAKPGQ-------LDGA 371 Query: 305 KKFVIFITDGENSGASAYQNTLNTLQI---------CEYMR-------NAGMKIYSVAVS 348 K F +DG+ + +++ Q + + + N + +++ + Sbjct: 372 KNVSYFFSDGDPTLSNSGQKNNSGATVDPDKGDGIDATEQKSWEAFLTNNNILSHAIGLG 431 Query: 349 APPEGQDLLRKCTDSS 364 + L + Sbjct: 432 QDVKS-TYLEPIAYNG 446 >gi|237667682|ref|ZP_04527666.1| von Willebrand factor type A domain protein [Clostridium butyricum E4 str. BoNT E BL5262] gi|237656030|gb|EEP53586.1| von Willebrand factor type A domain protein [Clostridium butyricum E4 str. BoNT E BL5262] Length = 1336 Score = 36.0 bits (81), Expect = 9.4, Method: Composition-based stats. Identities = 29/281 (10%), Positives = 70/281 (24%), Gaps = 32/281 (11%) Query: 141 RSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA 200 + +++ D E + + + S YA Sbjct: 511 ILDTSGSMNFNFYNDSIPYNEKDKRIYSLKQSAKQFINKFNNKDNIRIGIIPYSYYSGYA 570 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQ--KAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNN 258 D +S N +++I+ A + + + G +++ Sbjct: 571 NNIKQLTEINDNNKKSYENYIDNIKVEGATNQGDGIREAGKMLLNTDGNSKKYVILITDG 630 Query: 259 LNEVK-----SRLNKLNPYENTNT------YPAMHHAYRELYNEKESSHNTIGSTRLKKF 307 + + + Y N + Y + +N I Sbjct: 631 EATAITIEKPNLIINDSAYYWANNVLSFRNGDLVSRQYWYGARYTDDYYNDIVYFNNGSS 690 Query: 308 VIFITDGENSGASAYQNTLNTLQICEYMRNAGMKIY------------------SVAVSA 349 + + + ++ Y ++ + Sbjct: 691 FYVNPIYDFNSVKNTRYVDFGDNKLNEIKIKKATDYATNIGRRLKQSIPELKAMTIGCAL 750 Query: 350 PPEGQDLLRKCTDS-SGQFFAVNDSRELLESFDKITDKIQE 389 + DL ++ +S G +F+ NDS + FD++ D I Sbjct: 751 NQDTVDLAKEVNNSMDGDYFSANDSNSMESIFDELADSILS 791 >gi|29831916|ref|NP_826550.1| hypothetical protein SAV_5373 [Streptomyces avermitilis MA-4680] gi|29609033|dbj|BAC73085.1| hypothetical protein [Streptomyces avermitilis MA-4680] Length = 458 Score = 36.0 bits (81), Expect = 9.4, Method: Composition-based stats. Identities = 33/261 (12%), Positives = 62/261 (23%), Gaps = 31/261 (11%) Query: 132 SLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWS 191 + +N + + + Y P + Sbjct: 16 KSNAPQFSVDVYQNEYLPEGGREVNAIVTVTATGGGTTKSAVAAPRPYASGAAPDAAVAV 75 Query: 192 KNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQ 251 P D + L + + A+ + Y G Sbjct: 76 MVDCSGSMDYPPTKMRNARDATAAAIDTLRDGVHFAVIGGT----HVAKEVYPGGGRLAV 131 Query: 252 CTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFI 311 + + + K L KL+ T + A R L + + I + Sbjct: 132 ADARTRD--QAKQALRKLSAGGGTAIGTWLRLADRLLSSA----------DVSIRHGILL 179 Query: 312 TDGENSGASAYQNTLNTLQICE-----YMRNAGMKIYSVAVSAPPEGQDLLRKCTDSSGQ 366 TDG + + Q+ L C R V E +++ + G Sbjct: 180 TDGR-NEHESPQDLRAALDACAGRFTCDAR---------GVGTDWEVKEVTGIASALLGT 229 Query: 367 FFAVNDSRELLESFDKITDKI 387 V D L E F ++ + Sbjct: 230 ADIVADPAALSEDFTRMMETA 250 >gi|326428371|gb|EGD73941.1| hypothetical protein PTSG_05636 [Salpingoeca sp. ATCC 50818] Length = 1446 Score = 36.0 bits (81), Expect = 9.6, Method: Composition-based stats. Identities = 38/370 (10%), Positives = 104/370 (28%), Gaps = 49/370 (13%) Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAES--KAQYEIPTENL 118 + + L ++ +++ +S + + + Sbjct: 369 EQYANVDLPESLPDLTFSPFYKTNVSFAQSGTKVAIDEPQPSQAVDSDIAWTAGLDGDGV 428 Query: 119 FLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNK 178 F + + + T + + + + + + + Sbjct: 429 FTRNMQNDSDIRWQYVGTERGVFRQFPARLWDTNFIGFPLDFDPRFRPWYLATLSGPKDI 488 Query: 179 YLLPPPPK---KSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLS 235 ++ + W+ +K+ + + +V+ + ++ + Sbjct: 489 VIVLDCSRSMRGDKWNDAVAMTKFLVNSLSRDDRYNVV---------CFSSSHKDYNDNF 539 Query: 236 VRIGTIAYNIGIVGNQCTPLSNNLNEVKSRLNKLNPYENTNTYPAMHHAYRELYNEKES- 294 V S+N + +RL+ P T+ + +R L E Sbjct: 540 V-YRRTEVLSCRKHELLRGTSSNKEDTFTRLDGYTPAGGTDPLTGIQVGFRLLRGECNGL 598 Query: 295 ----SHNTIGSTRLKKFVIFITDGEN-----------SGASAYQNTLNTLQIC------- 332 T ++ ++F++DG++ + + +C Sbjct: 599 DTDCPMRDPPRTDCQRLMVFLSDGKDRDNEVRCGRGRYYYTRNGRQYDPPPLCQFKWDDT 658 Query: 333 -EYMRNA--GMKIYSVA-------VSAPPEGQDLLRKCT-DSSGQFFAVNDSRELLESFD 381 +Y+RN ++I+S VS+P +G +L + G + V R L ++ Sbjct: 659 FDYVRNNAGDIRIFSFGLSTSRNPVSSPRDGDELPGTVACSAKGTYTYVYSDRGLYQTMG 718 Query: 382 KITDKIQEQS 391 D I + Sbjct: 719 DYFDYITRST 728 >gi|257892717|ref|ZP_05672370.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] gi|257829096|gb|EEV55703.1| von Willebrand factor domain-containing protein [Enterococcus faecium 1,231,408] Length = 1347 Score = 36.0 bits (81), Expect = 9.6, Method: Composition-based stats. Identities = 7/65 (10%), Positives = 27/65 (41%), Gaps = 15/65 (23%) Query: 342 IYSVAVSAPPE----------GQDLLRKCTDSSG-----QFFAVNDSRELLESFDKITDK 386 I+S+ + G+++L+ D + +++ N+ +++ + + I+ Sbjct: 489 IFSIGLGIDGSVAGRQRLDAIGRNVLKNIADLNDDGTTPRYYDANNKNDIITALEDISST 548 Query: 387 IQEQS 391 ++ Sbjct: 549 FKKTI 553 >gi|159898560|ref|YP_001544807.1| YD repeat-containing protein [Herpetosiphon aurantiacus ATCC 23779] gi|159891599|gb|ABX04679.1| YD repeat protein [Herpetosiphon aurantiacus ATCC 23779] Length = 3073 Score = 36.0 bits (81), Expect = 9.7, Method: Composition-based stats. Identities = 26/315 (8%), Positives = 74/315 (23%), Gaps = 10/315 (3%) Query: 22 IMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQTSTIFKKQIKKHLKQGSYIREN 81 ++ + +D A + D + D + ++ Sbjct: 2230 QYRADGRLAAQIDMANRTTSYGYSVDGFLTDIDYPSGTADVSYTHDANGNVTTMQDGLGT 2289 Query: 82 AGDIAQKAQ-INITKDKNNPLQYIAESKAQYEIPTENLFLKGLIPSALTNLSLRSTGIIE 140 + + Y ++ A + T+ + + + S Sbjct: 2290 TTYRYDGLNRLRERTRDGRTVGYTYDA-ASFRTNTDYWGTGSVTATPDAAGRVASLQPWG 2348 Query: 141 RSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKYLLPPPPKKSFWSKNTTKSKYA 200 S+ A + + S L + D + + + Sbjct: 2349 GSTTTYAYGARGQMATATSGSGLSVTPTYDAAGRVLATHYAQNGTTLANFGSTVDAVGNR 2408 Query: 201 PAPAPANRKIDVLIESAGNLVNSIQKAIQEKKNLSVRIGTIAYNIGIVGNQCTPLSNNLN 260 + + + + + +A + + + + + + TP S + Sbjct: 2409 TS----LTDANGTTTLTYDATDRMVQATEPQASTTYTHDAVGNRTNVATTGQTPFSFSY- 2463 Query: 261 EVKSRLNKLNPYENTNTYPAMHHAYRELYNEKESSHNTIGSTRLKKFVIFITDGENSGAS 320 N+L T+ E ++++ ST + DG + Sbjct: 2464 ---DAANRLTTSGYTHDANGNLTTTPEATYTYDAANRVTSSTTSAGTTTYGYDGWGNLVR 2520 Query: 321 AYQNTLNTLQICEYM 335 N + + Sbjct: 2521 VTTNGQVQDLVLDEA 2535 >gi|332188164|ref|ZP_08389893.1| ammonium transporter family protein [Sphingomonas sp. S17] gi|332011771|gb|EGI53847.1| ammonium transporter family protein [Sphingomonas sp. S17] Length = 455 Score = 36.0 bits (81), Expect = 9.8, Method: Composition-based stats. Identities = 16/153 (10%), Positives = 40/153 (26%), Gaps = 3/153 (1%) Query: 1 MTAIIISVCFLFITYAIDLAHIMYIRNQMQSALD---AAVLSGCASIVSDRTIKDPTTKK 57 MTA ++ + + A+ + +N + + AA L+ ++ ++ Sbjct: 53 MTATVLVMMMILPGLALFYGGLTRAKNMLATMTQIGAAACLAMLIWVMYGYSLAFGPDVS 112 Query: 58 DQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTEN 117 S K LK + + A A + E Sbjct: 113 GGASNFIAGLGKAFLKGVTPASQAATFTAGVEIPEYVFICFQMTFAAITAALVLGSVVER 172 Query: 118 LFLKGLIPSALTNLSLRSTGIIERSSENLAISI 150 + ++ L++ + + Sbjct: 173 MKFSAVMAFVAVWLTIVYFPVAHMVWAASGLFF 205 >gi|319918954|gb|ADV78137.1| PilY1 [Pseudomonas aeruginosa] Length = 1158 Score = 36.0 bits (81), Expect = 9.8, Method: Composition-based stats. Identities = 41/346 (11%), Positives = 86/346 (24%), Gaps = 31/346 (8%) Query: 52 DPTTKKDQTSTIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQY 111 + S+ L + +I TK N Y + Sbjct: 165 SASCPAQPVSSSNSCYTYNALPTSQESNFAIWYSYYRNRILATKTAANLAFYSLPENVRL 224 Query: 112 EIPTENLFLKGLIPSALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDN 171 N I + + + + + + S Sbjct: 225 TWGALNT---CSIGANSRSCQNNALLQFNKQHKINFFNWLANSPASGGTPLHAALDR--- 278 Query: 172 NNMTSNKYLLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIESAGNLVNSIQKAIQEK 231 + + + + S + +V + N + Sbjct: 279 -AGRFLQTNGTAYTTEDGKTYSCRASYHIMMTDGIWNGRNVTPGNLDNQNQTF------P 331 Query: 232 KNLSVRIGTIAYNIGIVGNQCTPLSNNLNEVKSRL-NKLNPYENTNTYPAMHHAYRELYN 290 + R + +++ + N L P + + Sbjct: 332 DSTLYRPQPPYADSNASSLADLAFKYWTTDLRPSIDNDLKP-----------FMAYKSGD 380 Query: 291 EKESSHNTIGSTRLKKFVIFITDGENSGASAYQNTLNTLQICE----YMRNAGMKIY-SV 345 + + + + + ++ T G S N+ T AG K++ SV Sbjct: 381 DSKDYWDPRNNPATWQHMVNFTVGLGLSYSLTLNSAPTWTGSTFGNYEELMAGSKVWPSV 440 Query: 346 AVSA-PPEGQDLLRKCTDSSGQFFAVNDSRELLESFDKITDKIQEQ 390 A P DL +S G FF+ L+++F+KI +I E+ Sbjct: 441 GNDAAPGNVYDLWHAAINSRGDFFSAESPDSLVQAFNKILTRISER 486 >gi|317403484|gb|EFV83991.1| hypothetical protein HMPREF0005_03145 [Achromobacter xylosoxidans C54] Length = 683 Score = 36.0 bits (81), Expect = 9.9, Method: Composition-based stats. Identities = 21/217 (9%), Positives = 54/217 (24%), Gaps = 7/217 (3%) Query: 2 TAIIISVCFLFITYAI-DLAHIMYIRNQMQSALDAAVLSGCASIVSDRTIKDPTTKKDQT 60 AI+ +V + + L + Y++ ++Q A D A LS + + Sbjct: 3 VAIVFAVIVGLVLLGVAQLGYAYYMKREIQKASDLAALSAVQVLGLGAPADCA-----RA 57 Query: 61 STIFKKQIKKHLKQGSYIRENAGDIAQKAQINITKDKNNPLQYIAESKAQYEIPTENLFL 120 + + ++ + + T+ + + + Q Sbjct: 58 QQAGRTAVLANVPAILDTFSADDVTVECKVWDSTRADASGMYVFDPAAGQTPNALRITVN 117 Query: 121 KGLIP-SALTNLSLRSTGIIERSSENLAISICMVLDVSRSMEDLYLQKHNDNNNMTSNKY 179 K L + ++ + +++ V ++ L + Sbjct: 118 KTLSSLLPDIVVDGAPVRVVSVAVSTRPVAVFTVGSRLVRLQKGGLLSQLLATVGATPAQ 177 Query: 180 LLPPPPKKSFWSKNTTKSKYAPAPAPANRKIDVLIES 216 L T P + V + Sbjct: 178 LDILDAAGLASVNITPSGLLEALGLPVSVATGVGTPA 214 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.307 0.114 0.254 Lambda K H 0.267 0.0351 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 1,863,132,849 Number of Sequences: 14124377 Number of extensions: 79978048 Number of successful extensions: 504737 Number of sequences better than 10.0: 10000 Number of HSP's better than 10.0 without gapping: 2580 Number of HSP's successfully gapped in prelim test: 7792 Number of HSP's that attempted gapping in prelim test: 483478 Number of HSP's gapped (non-prelim): 20575 length of query: 398 length of database: 4,842,793,630 effective HSP length: 141 effective length of query: 257 effective length of database: 2,851,256,473 effective search space: 732772913561 effective search space used: 732772913561 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.7 bits) S2: 82 (36.4 bits)